Identifying recombinants in human and primate immunodeficiency virus sequence alignments using quartet scanning
-
* Corresponding author: Philippe Lemey philippe.lemey@gmail.com
1 Rega Institute, Katholieke Universiteit Leuven, Minderbroedersstraat 10, 3000 Leuven, Belgium
2 School of Computing Sciences, University of East Anglia, NR4 7TJ, Norwich, UK
3 Institute of Infectious Disease and Molecular Medicine, Faculty Of Health Sciences, University of Cape Town, Observatory 7925, South Africa
BMC Bioinformatics 2009, 10:126 doi:10.1186/1471-2105-10-126
Published: 27 April 2009Abstract
Background
Recombination has a profound impact on the evolution of viruses, but characterizing recombination patterns in molecular sequences remains a challenging endeavor. Despite its importance in molecular evolutionary studies, identifying the sequences that exhibit such patterns has received comparatively less attention in the recombination detection framework. Here, we extend a quartet-mapping based recombination detection method to enable identification of recombinant sequences without prior specifications of either query and reference sequences. Through simulations we evaluate different recombinant identification statistics and significance tests. We compare the quartet approach with triplet-based methods that employ additional heuristic tests to identify parental and recombinant sequences.
Results
Analysis of phylogenetic simulations reveal that identifying the descendents of relatively old recombination events is a challenging task for all methods available, and that quartet scanning performs relatively well compared to the triplet based methods. The use of quartet scanning is further demonstrated by analyzing both well-established and putative HIV-1 recombinant strains. In agreement with recent findings, we provide evidence that the presumed circulating recombinant CRF02_AG is a 'pure' lineage, whereas the presumed parental lineage subtype G has a recombinant origin. We also demonstrate HIV-1 intrasubtype recombination, confirm the hybrid origin of SIV in chimpanzees and further disentangle the recombinant history of SIV lineages in a primate immunodeficiency virus data set.
Conclusion
Quartet scanning makes a valuable addition to triplet-based methods for identifying recombinant sequences without prior specifications of either query and reference sequences. The new method is available in the VisRD v.3.0 package http://www.cmp.uea.ac.uk/~vlm/visrd webcite.