Main >
DART >
PhylogeneticAlignmentReader
Selected highlights of the Phylogenetic Alignment literature
Features of phylogenetic alignment
- PhylogeneticAlignment, aka StatisticalAlignment
- Explicit probabilistic representations for evolutionary models
- Stochastic grammars, sampled sequence trajectories on phylogenetic trees
- Combined Bayesian inference of phylogeny and alignment
Roots
- Likelihood phylogeny
- Maximum likelihood alignment
- Minimum message length alignment & finite state machines
- MCMC multiple alignment by Gibbs sampling
- Paired-sequence hidden Markov models and stochastic grammars
Deriving phyloalignment algorithms from indel models
- Thorne JL, Kishino H, Felsenstein J. An evolutionary model for maximum likelihood alignment of DNA sequences. J Mol Evol. 1991 Aug;33(2):114-24. (PDF)
- Arguably the central paper of the statistical phylo-alignment literature. Develops the most basic "TKF91 model" for pairwise alignment
- Hein J. An algorithm for statistical alignment of sequences related by a binary tree. Pac Symp Biocomput. 2001:179-90.
- applies TKF91 to a tree
- also M. Steel and J. Hein, Applying the Thorne–Kishino–Felsenstein model to sequence evolution on a star-shaped tree, Appl. Math. Lett. 14 (2001), p. 679.
- Holmes I, Bruno WJ. Evolutionary HMMs: a Bayesian approach to multiple alignment. Bioinformatics. 2001 Sep;17(9):803-20. (PDF)
- develops MCMC for TKF91 using HMM theory
- Holmes I. Using guide trees to construct multiple-sequence evolutionary HMMs. Bioinformatics. 2003;19 Suppl 1:i147-57. (PDF) (errata)
- constructs multi-sequence HMMs systematically by transducer composition
- Holmes I. Phylocomposer and phylodirector: analysis and visualization of transducer indel models. Bioinformatics. 2007 Dec 1;23(23):3263-4. Epub 2007 Sep 5.
(pdf)
- first implementation of a general-purpose phylogenetic transducer algorithm (more info here: PhyloComposer)
- Knudsen B, Miyamoto MM. Sequence alignments and pair hidden Markov models using evolutionary history. J Mol Biol. 2003 Oct 17;333(2):453-60.
- Lunter G, Miklos I, Drummond A, Jensen JL, Hein J. Bayesian coestimation of phylogeny and sequence alignment. BMC Bioinformatics. 2005 Apr 1;6:83.
Beyond point substitution models
RNA models
Protein models
Other papers following from TKF91
- Thorne JL, Kishino H, Felsenstein J. Inching toward reality: an improved likelihood model of sequence evolution. J Mol Evol. 1992 Jan;34(1):3-16.
- Thorne JL, Churchill GA. Estimation and reliability of molecular sequence alignments. Biometrics. 1995 Mar;51(1):100-13.
- Miklos I. An improved algorithm for statistical alignment of sequences related by a star tree. Bull Math Biol. 2002 Jul;64(4):771-9.
- Hein J, Wiuf C, Knudsen B, Moller MB, Wibling G. Statistical alignment: computational properties, homology testing and goodness-of-fit. J Mol Biol. 2000 Sep 8;302(1):265-79.
- Metzler D. Statistical alignment based on fragment insertion and deletion models. Bioinformatics. 2003 Mar 1;19(4):490-9.
- Holmes I. Using evolutionary Expectation Maximization to estimate indel rates. Bioinformatics. 2005 May 15;21(10):2294-300. Epub 2005 Feb 24. (PDF)
- Lunter GA, Miklos I, Song YS, Hein J. An efficient algorithm for statistical multiple alignment on arbitrary phylogenetic trees. J Comput Biol. 2003;10(6):869-89.
- Jojic V, Jojic N, Meek C, Geiger D, Siepel A, Haussler D, Heckerman D. Efficient approximations for learning phylogenetic HMM models from data. Bioinformatics. 2004 Aug 4;20 Suppl 1:i161-8.
- Siepel A, Haussler D. Combining phylogenetic and hidden Markov models in biosequence analysis. J Comput Biol. 2004;11(2-3):413-28.
- Metzler D. Statistical alignment based on fragment insertion and deletion models. Bioinformatics. 2003 Mar 1;19(4):490-9.
Reading the indel-rate signal
Reconstruction of indel history
In light of the interest in ancestral genome reconstruction,
several authors have approached the problem of indel modeling from this slightly different direction.
Typically these approaches require the input alignment to be specified and this leads the authors to a slightly different formalism,
although the methods can still be reformulated within the framework of
string transducers.
- Blanchette et al
- Sinha et al
- Ortheus: Paten, Birney et al
- MichaelJordan and Alexandre Bouchard-Côté
- Alexandre Bouchard-Côté, Michael I. Jordan and Dan Klein. (2009) Efficient Inference in Phylogenetic InDel Trees. Advances in Neural Information Processing Systems 21 (NIPS). Vancouver, Canada. (paper)
- Poster & slides here: http://www.stat.ubc.ca/~bouchard/

Copyright © 2008-2013 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback