| You are going to use wEMBOSS @ the Swiss EMBnet node to answer some application examples. wEMBOSS @ EMBnet can be accessed anonymously (username: "guestwemboss" and password: "guest&wemboss"). Before you start working, please create a new Project (e.g. by giving your family name as title of the project) [screenshot]. You can then continue working within you personal project. |
| 1 Retrieve Sequences and Files |
|---|
|
Retrieve the sequence in fasta format of the Swiss-Prot entry P57727 with the seqret program . (Hint: Don't forget to first runshowdb to check which databases are accessible and their names). |
| 2 Find Protein domains/motifs |
|
How many different domains are present in the Swiss-Prot entry P57727 ? (Hint: Use the EMBOSS application patmatmotifs to search your sequence against the PROSITE motif database. Please select the output option of the program to provide full documentation for the matching patterns.) Is the information about the protein domains also annotated in the Features FT lines of the SwissProt entry file? |
| 3 Pairwise sequence alignment |
|
Please run
a blastp (using the web interface at the ExPASy
server) against the Swiss-Prot database (tick the checkbox) to search for homologous of the human protein corresponding to the Swiss-Prot entry P57727. |
| 4 Producing a restriction map |
| Use the DNA sequence (see example 1) of the gene that codes for protein P57727 and search for enzymes that cut a minimum of once and a maximum of twice, and have a recognition site length of at least six bases. (Hint: use the program wossname with the keyword 'restriction' as input or check the list of programs grouped by type to find a program to perform restriction maps). |
| 5 Translation |
| Retrieve the complete sequence entry for the DNA sequence of the gene that codes for protein P57727.
Which part of the DNA sequence correspond to the coding sequence (CDS)? (Hint:
use the program entret
and check the FT lines). Translate
the coding sequence to the corresponding protein product. (Hint:
use the program wossname
with the keyword 'translation' as input or check the list of programs grouped
by type to find a suitable program to translate DNA into protein). Compare this result with the one obtained by using the EMBOSS application getorf. Report only ORF with a minimum nucleotide length of 300 nucleotides. Do you see a difference between the annotated CDS and the predicted one? How can you explain this? |
| 6 Designing primers |
Design
the 6 best primers for the DNA sequence of the gene that codes for protein P57727.
How many primer
pairs are considered OK by the program? |
Questions can be sent to: L. Bordoli (Lorenza.Bordoli@unibas.ch) or L. Falquet (Laurent.Falquet@isb-sib.ch)