RSS

How to search FASTA in website: ncbi.nlm.nih.gov

11 Mei

Hallo Readers.

kali ini kita akan belajar bagaimana mencari FASTA dalam situs berbasis science seperti ncbi.nlm.nih.gov

FASTA

A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line (defline) is distinguished from the sequence data by a greater-than (“>”) symbol at the beginning. It is recommended that all lines of text be shorter than 80 characters in length. An example sequence in FASTA format is:

		>gi|129295|sp|P01013|OVAX_CHICK GENE X PROTEIN (OVALBUMIN-RELATED)
		QIKDLLVSSSTDLDTTLVLVNAIYFKGMWKTAFNAEDTREMPFHVTKQESKPVQMMCMNNSFNVATLPAE
		KMKILELPFASGDLSMLVLLPDEVSDLERIEKTINFEKLTEWTNPNTMEKRRVKVYLPQMKIEEKYNLTS
		VLMALGMTDLFIPSANLTGISSAESLKISQAVHGAFMELSEDGIEMAGSTGVIEDIKHSPESEQFRADHP
		FLFLIKHNPTNTIVYFGRYWSP

Blank lines are not allowed in the middle of FASTA input.

Sequences are expected to be represented in the standard IUB/IUPAC amino acid and nucleic acid codes, with these exceptions: lower-case letters are accepted and are mapped into upper-case; a single hyphen or dash can be used to represent a gap of indeterminate length; and in amino acid sequences, U and * are acceptable letters (see below). Before submitting a request, any numerical digits in the query sequence should either be removed or replaced by appropriate letter codes (e.g., N for unknown nucleic acid residue or X for unknown amino acid residue). The nucleic acid codes supported are:

		A  adenosine          C  cytidine             G  guanine
		T  thymidine          N  A/G/C/T (any)        U  uridine 
		K  G/T (keto)         S  G/C (strong)         Y  T/C (pyrimidine) 
		M  A/C (amino)        W  A/T (weak)           R  G/A (purine)        
		B  G/T/C              D  G/A/T                H  A/C/T      
		V  G/C/A              -  gap of indeterminate length

For those programs that use amino acid query sequences (BLASTP and TBLASTN), the accepted amino acid codes are:

		A  alanine               P  proline       
		B  aspartate/asparagine  Q  glutamine      
		C  cystine               R  arginine      
		D  aspartate             S  serine      
		E  glutamate             T  threonine      
		F  phenylalanine         U  selenocysteine      
		G  glycine               V  valine        
		H  histidine             W  tryptophan        
		I  isoleucine            Y  tyrosine
		K  lysine                Z  glutamate/glutamine
		L  leucine               X  any
		M  methionine            *  translation stop
		N  asparagine            -  gap of indeterminate length

NOTE:
¹ The degenerate nucleotide codes in red are treated as mismatches in nucleotide alignment. Too many such degenerate codes within an input nucleotide query will cause blast.cgi to reject the input. For protein queries, too many nucleotide-like code (A,C,G,T,N) may also cause similar rejection.
² For protein code, U is replaced by X first before the search since it is not specified in any scoring matrices.
³ blast.cgi will not take “-” in the query. To represent gaps, use a string of N or X instead.

Langkah pertama buka situs ncbi.nlm.nih.gov

Halaman awal yang akan tampil adalah seperti gambar dibawah ini, selanjutnya All Database diubah menjadi nucleotide

halaman awal1

Gambar halaman awal

Langkah selanjutnya ketik hormon yang akan dicari dan spesiesnya

halaman ke21

Gambar tampilan searching hormon

Kemudian akan terlihat

halaman 31

Tampilan FASTA pada “Human insulin gen” pada gambar terlihat rangkaian basa nitrogen yang spesifik sesuai dengan spesies.

insulin human

Gambar “Human insulin gen”

Tampilan FASTA pada “Octodon Degus Insulin mRNA) insulin degusTampilan FASTA pada “giberelin sunflower”

sunfloer

Tampilan FASTA pada Black Goat

insulin goat2

Tampilan FASTA berupa rangkaian basa nitrogen yang berbeda-beda sesuai spesies masing-masing. sekian terimakasih

 
Tinggalkan komentar

Ditulis oleh pada 11 Mei 2015 inci Tak Berkategori

 

Tinggalkan komentar