BMC Evolutionary Biology
|
Viewing options:Associated material:Related literature:- Articles citing this article
- Other articles by authors
- Related articles/pages
Tools:Post to:
|
Methodology articleIncorporating indel information into phylogeny estimation for rapidly emerging pathogensBenjamin D Redelings1 and Marc A Suchard2,3,4  1
Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27606, USA 2
Department of Biomathematics, David Geffen School of Medicine at UCLA, Los Angeles, CA 90095, USA 3
Department of Human Genetics, David Geffen School of Medicine at UCLA, Los Angeles, CA 90095, USA 4
Department of Biostatistics, UCLA Schoold of Public Health, Los Angeles, CA 90095, USA author email corresponding author email
BMC Evolutionary Biology 2007,
7:40doi:10.1186/1471-2148-7-40 Abstract
Background
Phylogenies of rapidly evolving pathogens can be difficult to resolve because of the small number of substitutions that accumulate in the short times since divergence. To improve resolution of such phylogenies we propose using insertion and deletion (indel) information in addition to substitution information. We accomplish this through joint estimation of alignment and phylogeny in a Bayesian framework, drawing inference using Markov chain Monte Carlo. Joint estimation of alignment and phylogeny sidesteps biases that stem from conditioning on a single alignment by taking into account the ensemble of near-optimal alignments.
Results
We introduce a novel Markov chain transition kernel that improves computational efficiency by proposing non-local topology rearrangements and by block sampling alignment and topology parameters. In addition, we extend our previous indel model to increase biological realism by placing indels preferentially on longer branches. We demonstrate the ability of indel information to increase phylogenetic resolution in examples drawn from within-host viral sequence samples. We also demonstrate the importance of taking alignment uncertainty into account when using such information. Finally, we show that codon-based substitution models can significantly affect alignment quality and phylogenetic inference by unrealistically forcing indels to begin and end between codons.
Conclusion
These results indicate that indel information can improve phylogenetic resolution of recently diverged pathogens and that alignment uncertainty should be considered in such analyses. |