CBMG 688N/O Home Syllabus Links Hints

Overview of Phylogenetic Methods

Common elements of most methods

Trees as a representation of evolutionary history

Dichotomously branching trees, perhaps with zero-length branches leading to polytomies

In mathematical terms, an acyclic graph with interior vertices of degree three (or higher)

Characters, character states, character-state changes

Branch length

Taxa (also called OTUs), in our case often represented by biological sequences such as DNA

Parsimony

Parsimony methods favor the tree that requires the smallest number of character-state changes

Parsimony as an optimality criterion

In standard parsimony, branch length is not important per se; it is the overall length of the tree that is optimized.

Tree searching (branch swapping) to find the best tree

Can be prone to long-branch attraction if there are superimposed substitutions

Distance (Clustering)

Distance methods cluster taxa according to a distance measure

Distances can be calculated (or measured) in many ways

Some distance measures take into account superimposed substitutions

Branch lengths are intrinsically meaningful (as is the "diameter" of the tree

There are many different distance methods depending upon how distances are calculated and on how tree is found

Likelihood

Uses a model of character-state substitution to relate data to hypothesis

Particularly valuable for DNA, RNA, and Amino-Acid data

Models of point mutation have been described

There are also well-established models of site-to-site rate variation

Codon-specific models

Tree topology includes both the branching order and branch lengths

Likelihood methods can also be used to test other hypotheses

Maximum Likelihood

Uses tree-search methods similar to those used in Parsimony analysis

If hypothesis is tree topology, best tree is the tree with the highest likelihood

This is the tree that would be most likely to give rise to the data given the model

Bayesian

Uses a Metropolis-Hastings Coupled Markov Chain Monte Carlo (MCMCMC) algorithm to estimate the probability of different trees given the data

Confidence assessment


Baldauf SL (2003) The deep roots of eukaryotes. Science 300:1703-1706

Stewart CB (1993) The Powers and Pitfalls of Parsimony. Nature 361:603-607