DNA, RNA, and Proteins

Advantages
1. homology is (relatively) easy to determine
2. evolution is well characterized (ML models)
3. Give lots of characters although one gene alone is still not generally enough
  1. ~1-10 kb of reasonably variable sequence for a big phylogeny
4. Methods are easy
Disadvantages
1. Do not sample entire genome
  1. consequently get single gene phylogeny; this may be an advantage or disadvantage
  2. always try to remember the constraints of the system you are working with
2. Relatively expensive
Organellar genes
1. Remember that inheritance is typically uniparental
2. Usually single copy -- this is nice, no confusion with paralogy
3. However, in some cases the cell may have a population of different organellar genomes
  - "Calibrating the Mitochondrial Clock"; Science 279:28 (1998)
Nuclear genes
1. Plants and animals are diploid or polyploid
  - Protists may be haploid, diploid, or polyploid
2. Nuclear genes are often in gene families
3. With sexual reproduction, things get complicated

Draw nucleic acid sequences with 5' end on left, 3' end on right
Draw amino acid sequences with amino terminus to left, carboxyl terminus to right
First codon in coding sequence is (usually) Methionine (= start codon)
Stop codon is somewhat variable between taxa, but is often TAA
Several genes are often encoded in an operon Ð transcribed together

Put in a test tube:
1. Primers that match a known region of template DNA
  1. Degenerate primers have broader specificity
  2. Use a molar excess of primers (Michaelis-Menton kinetics)
  3. one primer at 5' end, one at 3' end
  4. most critical bases are those at 3' end of primer
  5. DNA polymerases won't work on single stranded DNA, so a primer is needed to initiate polymerization
2. A thermostable DNA polymerase
  1. Several enzymes, inc. Taq polymerase( From Thermus aquaticus)
  2. Taq is tolerant, but error prone
  3. Enzymes with higher fidelity are also available, e.g., Pfu polymerae (Pyrococcus furiosus)
3. A reaction buffer suitable for the enzyme
4. Magnesium - influences stringency of reaction
5. the four deoxynucleotides (ACGT), in approximately equal abundance
6. Template DNA
Temperature cycle
1. Melt - denature the DNA (ca 94°C)
2. Anneal - high annealing temperature for high stringency, low annealing temperature for low stringency (ca. 55°C)
3. Extend - at optimal temperature for polymerase activity (72°C for Taq polymerase)
Can greatly amplify a chosen DNA sequence.
Things to know about
1. Exonuclease activity of polymerases influences effective primer sensitivity
2. Risk of contaminants! PCR may amplify gene from DNA other than the intended target.
3. May produce a mix of different products!
Environmental DNA

An alternative way to generate large quanties of a sequence of interest
1. Use engineered bacterial plasmid (or other vector)
2. Cut template DNA with restriction enzyme(s)
3. Cut plasmid with the same enzyme
4. Mix the two, allow annealing
5. Ligate with DNA Ligase
6. Transform a bacterial cell with the new (chimaeric) plasmid
7. Grow up bacterium
  1. Plasmid is engineered to make it easy to work with, e.g.,
    1. vector confers antibiotic resistance
    2. color change if insert is present
8. (Perhaps) use phage properties to generate single-stranded DNA
  1. Single stranded DNA is easy to sequence
Get lots of DNA, even by PCR standards
Clones are easy to store & relatively stable
Each clone is unique, i.e., derived from a single DNA molecule even if a population of sequences were present in the original template.
1. Screen clones for desired properties
2. Cloning of PCR product can be used to study complex PCR product
More complex than PCR, slower, more specialized facilities

OK, so youÕve got lots of DNA...

Melt template
Anneal a sequencing primer
Nondegenerate if possible
Only one direction
Label with radioisotopically or fluorescently labeled nucleotides
Synthesize DNA in four vials, each with a small fraction of one dideoxynucleotide (ddA, ddC, ddG, ddT)
dideoxynucleotide will terminate DNA polymerization upon incorporation
run out on acrylamide gel.
read like climbing a ladder

Draw with amino terminus on L, carboxy terminus on R
Edman degradation Ð classical method, often still used
Label amino terminus with PITC
Release terminal AA by cyclizing at different pH
Repeat Lots of related methods
Commercially available Ð just Ôsend it outÕ
Requires a fair bit of the peptide
Can only sequence relatively short piece
Requires fairly pure protein
Mass Spectrometry -- can also do DNA sequencing
Not widely used
Allows sequence from small quantity of polypeptide
Moderate mixtures are no problem
Requires expertise and expensive instrumentation
It is often easiest to determine DNA sequence first, then translate (electronically)
But nothing can actually substitute for a genuine peptide sequence when needed.
Not all compartments use the same genetic code
Protistologists are advised to verify use of the ÔuniversalÕ genetic code.