CSC334 Lab1

Revision as of 16:29, 21 July 2008

Retrieving a DNA sequence

In this lab we retrieve a DNA sequence in FASTA format that we can use for various experiments.

  • Point your browser to www.ncbi.nlm.nih.gov (http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed).
  • Select Nucleotide from the drop-down menu and enter "escherichia coli" in the search box.
Picture 1
  • Click on the first link (AB426820 in our case) and select FASTA in the display box. You should get something like this:
>gi|194306025|dbj|AB426820.1| Escherichia coli ompT mRNA for outer membrane protease T, partial cds, strain: JCM 5491
TGGGAATAGTCCTGACAACCCCTATTGCGATCAGCTCTTTTGCTTCTACCGAGACTTTATCGTTTACTCC
TGACAACATAAATGCGGACATTAGTCTTGGAACTCTGAGCGGAAAAACAAAAGAGCGTGTTTATCTAGCC 
GAAGAAGGAGGCCGAAAGGTCAGTCAACTTGACTGGAAATTCAATAACGCTGCAATTATTAAAGGTGCAA
TTAATTGGGATTTGATGCCCCAGATATCTATCGGGGCTGCTGGCTGGACAACTCTCGGTAGCCGAGGTGG  
CAATATGGTCGATCGGGACTGGATGGATTCCAGTAACCCCGGAACCTGGACGGATGAAAGTAGACACCCT 
GATACACAACTCAATTATGCCAACGAATTTGATCTGAATATCAGAGGCTGGCTCCCCAACGAACCCAATT
ACCGCCTGGGACTCATGGCCGGATATCAGGAAAGCCGTTATAGCTTTACAGCCAGAGGGGGTTCCTATAT
CTACAGTTCTGAGGAGGGATTCAGAGATGATATCGGCTCCTTCCCGAATGGAGAAAGAGCAATCGGCTAC
AAACAACGTTTTAAAATGCCCTACATTGGCTTGACTGGAAGTTATCGTTATGAAGATTTTGAGCTAGGTG
GTACATTTAAATACAGCGGCTGGGTGGAAGCATTTGATAACGATGAACACTATGACCCAGGAAAAAGAAT
CACTTATCGCAGTAAAGTCAAAGACCAAAATTACTATTCTGTTGCAGTCAATGCAGGTTATTACGTAACG
CCTAATGCAAAAGTTTATATTGAAGGCGCATGGAATCGGGTTACGAATAAAAAAGGTGATACTTCACTTT
ATGATCACAATGATAACACTTCTGACTACAGCAAAAATGGTGCAGGCATAGAAAACTATAACTTCATCAC
TACTGCTGGTC
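
For later experiments it can be convenient to fetch and read the same record from a program instead of the browser. The following is a minimal Python sketch, not part of the original lab. It assumes the NCBI E-utilities efetch endpoint with db=nuccore and rettype=fasta, which the lab itself does not mention. The function names efetch_url, fetch_fasta, and parse_fasta are hypothetical helpers chosen for illustration.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: NCBI E-utilities "efetch" endpoint (the lab only uses the web page).
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_url(accession):
    """Build an efetch URL requesting one nucleotide record as FASTA text."""
    params = {
        "db": "nuccore",      # nucleotide database
        "id": accession,      # e.g. "AB426820"
        "rettype": "fasta",   # FASTA format
        "retmode": "text",    # plain text rather than XML
    }
    return EFETCH + "?" + urlencode(params)


def fetch_fasta(accession):
    """Download the FASTA text for an accession (requires network access)."""
    with urlopen(efetch_url(accession)) as response:
        return response.read().decode("utf-8")


def parse_fasta(text):
    """Return a list of (header, sequence) pairs from FASTA-formatted text."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()          # drops trailing blanks left by copy/paste
        if not line:
            continue                 # skip empty lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records
```

With network access, parse_fasta(fetch_fasta("AB426820")) should return a one-element list holding the header line and the sequence joined into a single string. The parser also works on text pasted from the browser, since it strips stray whitespace at the ends of lines.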