Introductory Instructions

For those with no experience I have provided four sequences: (a) a DNA sequence, (b) a protein sequence, (c) a RNA sequence and (d) four protein sequences all in FASTA format. Prior to trying out a Web Site select the sequence and the FASTA line and copy to clipboard. Each of these Sites recommended in Online Analysis Tools will have a box into which you can "Paste" your sequence. Then click on the button labeled "Search," "Run" or "Submit." If in doubt use the default setting that the sites provide, but for the more adventuresome, some of the sites offer the chance of modifying the search strategy.

(a) DNA sequence

>DNA sequence
      TCTTGAAATCCATTTTTAGCCCAACCAGATCATCCCGCGCGAAGTGCTCGAACGAGGCTT
      CGCACCCGTCCGGATTGGTCATCCGGTCCGGGATGGGGAACAAGATCATCAATGTTTGTC
      CATGGAGGAAAAACATGGCGTTTCGACCGCTTCATGATCGTATTCTCGTCCGCCGCGTCG
      AGTCCGAAGAGAAGACCAAAGGCGGCATCATCATCCCCGACACTGCCAAGGAGAAGCCCC
      AGGAAGGCGAAGTCCTCGCTGTAGGTCCCGGCGCGCGCGGCGAACAGGGTCAGATCCAGC
      CGCTCGACGTCAAGGTGGGCGACCGCATCCTGTTCGGCAAGTGGTCCGGCACCGAGTCAA
      GATCGACGGAGAAGATCTCCTGATCATGAAGGAAAGCGATGTCATGGGAATCATCGAGGC
      CCGGGCGCCGAGAAGATAGCCGCCTGATAACGCGAAGATACAGTCAACAAGCTGCCTATC
    

(b) Protein sequence

>Protein sequence
      MAQLSLQHIQKIYDNQVHVVKDFNLEIADKEFIVFVAASGCGKSTTLRMIAGLEEISGGD
      LLIDGKRMNDVPAKARNIAMVFQNYALYPHMTVYDNMAFGLKMQKIAKEVIDERVNWAAQ
      ILGLREYLKRKPGALSGGQRQRVALGRAIVREAGVFLMDEPLSNLDAKLRVQMRAEISKL
      HQKLNTTMIYVTHDQTEAMTMATRIVIMKDGIVQQVGAPKTVYNQPANMFVSGFIGSPAM
      NFIRGTIDGDKFVTETLKLTIPEEKLAVLKTQESLHKPIVMGIRPEDIHPDAQEENNISA
      KISVAELTGAEFMLYTTVGGTS
    

(c) RNA sequence

>RNA sequence
      CGCAGGGTGGAGAAGTGGTCATCTCGCCGGGCCCATAACCCGGAGATCGCTGGTTCGAAT
      CCAGCCCTTGCTACCA
    

(d) Four protein sequences – for alignments and phylogenetic analyses

      >A protein  
        MNIRPLHDRVIIKREEVETRSAGGIVLTGSAATKSTRAKVLAVGKGRILENGTVQPLDVK
        VGDVIFNDGYGVKAEKIDGEEVLIISENDILAIVE  

      >B protein  
        MADIKFRPLHDRVVVRRVESEAKTAGGIIIPDTAKEKPQEGEVVAAGAGARDEAGKLVPL
        DVKAGDRVLFGKWSGTEVKIGGEDLLIMKESDILGIVG 

      >C protein  
        MNIRPLHDRVIVKRKEVETKSAGGIVLTGSAAAKSTRGEVLAVGNGRILENGEVKPLDVKV
        GDIVIFNDGYGVKSEKIDNEEVLIMSENDILAIVEA  

      >D protein  
        MKLRPLHDRVVIRRSEEETKTAGGIVLPGSAAEKPNRGEVVAVGTGRVLDNGEVRALAVKG
        DKVVFGPYSGSNAIKVDGEELLVMGESEILAVLED