This article is part of the supplement: The 2007 International Conference on Bioinformatics & Computational Biology (BIOCOMP'07)
Gene rearrangement analysis and ancestral order inference from chloroplast genomes with inverted repeat
1 Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA
2 Department of Biology, Pennsylvania State University, University Park, PA 16802, USA
3 School of Computer and Communication Sciences, Swiss Federal Institute of Technology (EPFL), EPFL IC LCBB, INJ 230, Station 14, CH-1015 Lausanne, Switzerland
4 Swiss Institute of Bioinformatics
Citation and License
BMC Genomics 2008, 9(Suppl 1):S25 doi:10.1186/1471-2164-9-S1-S25Published: 20 March 2008
Genome evolution is shaped not only by nucleotide substitutions, but also by structural changes including gene and genome duplications, insertions, deletions and gene order rearrangements. The most popular methods for reconstructing phylogeny from genome rearrangements include GRAPPA and MGR. However these methods are limited to cases where equal gene content or few deletions can be assumed. Since conserved duplicated regions are present in many chloroplast genomes, the inference of inverted repeats is needed in chloroplast phylogeny analysis and ancestral genome reconstruction.
We extend GRAPPA and develop a new method GRAPPA-IR to handle chloroplast genomes. A test of GRAPPA-IR using divergent chloroplast genomes from land plants and green algae recovers the phylogeny congruent with prior studies, while analysis that do not consider IR structure fail to obtain the accepted topology. Our extensive simulation study also confirms that GRAPPA has better accuracy then the existing methods.
Tests on a biological and simulated dataset show GRAPPA-IR can accurately recover the genome phylogeny as well as ancestral gene orders. Close analysis of the ancestral genome structure suggests that genome rearrangement in chloroplasts is probably limited by inverted repeats with a conserved core region. In addition, the boundaries of inverted repeats are hot spots for gene duplications or deletions. The new GRAPPA-IR is available from http://phylo.cse.sc.edu webcite.