PHYLIP
- Phylip is a phylogenetic analysis package written by Joe Felsenstein (you
may recognize him from such phylogenetic concepts as "The Felsenstein Zone"
and "The F84 Model").
- Phylip (like GCG) is a collection of small programs that each perform
a relatively simple task.
- These programs can be used together to accomplish complex analyses,
and the package includes analytical methods that are not available elsewhere.
- The programs are installed on the University of Maryland AITS UNIX cluster
at ~delwiche/bin/phylip.
- Use ls to view the components of the package.
ls ~delwiche/bin/phylip
- Unfortunately, the user interface is virtually non-existant, although once
you learn phylip's peculiarities, it is easy to use.
- Phylip is available free, although you do need to register as a user.
- Be sure to do this -- it helps Joe justify his support.
- Documentation is available locally in ~delwiche/bin/phylip/docs
ls ~delwiche/bin/phylip/docs
- Read the general documentation for phylip:
more ~delwiche/bin/phylip/docs/main.doc
- There is (approximately) one documentation file per program.
more ~delwiche/bin/phylip/docs/dnadist.doc
- This will show you the documentation for dnadist, the program used to
calculate distances from dna sequence data.
- Notice that there is also a program to calculate distances from amino acid
data:
more ~delwiche/bin/phylip/docs/protdist.doc
- To use these programs you will have to have an input file in phylip format.
- The basic phylip format data file consists of a file where the first
line contains two numbers, the number of taxa, and the number of characters.
- There may also be some additional characters on this first line, which
are used to control the behavior of the program (for example, if the data
are interleaved, this must be indicated on the first line).
- Following the first line is the data matrix.
- The first ten characters (exactly!) of each line are the taxon name
-- if the name is less than ten characters just add spaces.
- The taxon names are followed by data, i.e., nucleotide of amino
acid sequences.
7 50
thermotogaATGGCGAAGGAAAAATTTGTGAGAACAAAACCGCATGTTAACGTTGGAAC
TthermophiATGGCGAAGGGCGAGTTTGTTCGGACGAAGCCTCACGTGAACGTGGGGAC
TaquaticusATGGCGAAGGGCGAGTTTATCCGGACGAAGCCCCACGTGAACGTGGGGAC
deinonema-ATGGCTAAGGGAACGTTTGAACGCACCAAACCCCACGTGAACGTGGGCAC
ChlamydiaBATGTCAAAAGAAACTTTTCAACGTAATAAGCCTCATATCAACATAGGGGC
flexistipsATGTCCAAGCAAAAGTACGAAAGGAAGAAACCTCACGTAAACGTAGGCAC
borrelia-bATGGCAAAAGAAGTTTTTCAAAGAACAAAGCCGCACATGAATGTTGGAAC