CSC334 Lab2

From dftwiki3
Jump to: navigation, search

<meta name="keywords" content="computer science, bioinformatics, DNA, CSC334, Lab" /> <meta name="description" content="DNA Sequence Logo Lab" /> <meta name="title" content="Bioinformatics Lab" /> <meta name="abstract" content="DNA Sequence Logo" /> <meta name="author" content="thiebaut at cs.smith.edu" />

Back to CSC334 Lab Page


Retrieving Protein sequences from ExPASy

ExPASy is a database of proteins created by a pioneer of bioinformatics: Amos Bairoch

Methodology


Figure 1











  • The Accession Number of the protein found is P06968. Its name in the Swiss-Prot database is DUT_ECOLI, but it is also known as EC 3.6.1.23, or dUTPase, or dUTP pyrophospatase.
  • Find the Sequence Information at the bottom of the page, and select P06968 in FASTA format for the display. You should get something like this:
>sp|P06968|DUT_ECOLI Deoxyuridine 5'-triphosphate nucleotidohydrolase OS=Escherichia coli (strain K12) GN=dut PE=1 SV=1
MKKIDVKILDPRVGKEFPLPTYATSGSAGLDLRACLNDAVELAPGDTTLVPTGLAIHIAD
PSLAAMMLPRSGLGHKHGIVLGNLVGLIDSDYQGQLMISVWNRGQDSFTIQPGERIAQMI
FVPVVQAEFNLVEDFDATDRGEGGFGHSGRQ



Back to CSC334 Lab Page
© D. Thiebaut 2008