|
A first pass using GCG
Given the following sequence fragment:
krsranmnnstttgpanntssnktfldnfeetrtnkllde
do the following:
- Discover what sequence in the public databases is most similar to it.
- Download this sequence to your account
- Using this starting point, find sequence in the public databases which may be homologous to it for the following model organisms: Saccharomyces cerevisiae, Caenorhabditis elegans, Arabidopsis thaliana, Mus musculus, Homo sapiens
- Report on the quantity of identity between the initial search transcript and that found in these organisms. Do so in a table with columns for: organism, accession/id #, number of identities, number of similarities, # gaps, and any available statistical information.
- Using gcg, perform a dotplot between a) the human and yeast versions and b) the mouse and human versions of the protein. (I suggest setting your plotter to a png or gif output and inserting the output into your document).
- Examine the plot to look for insertions/deletions, tandem repeats, inversions, or other interesting features and report them.
|