Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Research article

Evolution of the mitochondrial genome in snakes: Gene rearrangements and phylogenetic relationships

Jie Yan, Hongdan Li and Kaiya Zhou*

Author Affiliations

Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing 210046, PR China

For all author emails, please log on.

BMC Genomics 2008, 9:569  doi:10.1186/1471-2164-9-569

The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2164/9/569


Received:8 July 2008
Accepted:28 November 2008
Published:28 November 2008

© 2008 Yan et al; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Snakes as a major reptile group display a variety of morphological characteristics pertaining to their diverse behaviours. Despite abundant analyses of morphological characters, molecular studies using mitochondrial and nuclear genes are limited. As a result, the phylogeny of snakes remains controversial. Previous studies on mitochondrial genomes of snakes have demonstrated duplication of the control region and translocation of trnL to be two notable features of the alethinophidian (all serpents except blindsnakes and threadsnakes) mtDNAs. Our purpose is to further investigate the gene organizations, evolution of the snake mitochondrial genome, and phylogenetic relationships among several major snake families.

Results

The mitochondrial genomes were sequenced for four taxa representing four different families, and each had a different gene arrangement. Comparative analyses with other snake mitochondrial genomes allowed us to summarize six types of mitochondrial gene arrangement in snakes. Phylogenetic reconstruction with commonly used methods of phylogenetic inference (BI, ML, MP, NJ) arrived at a similar topology, which was used to reconstruct the evolution of mitochondrial gene arrangements in snakes.

Conclusion

The phylogenetic relationships among the major families of snakes are in accordance with the mitochondrial genomes in terms of gene arrangements. The gene arrangement in Ramphotyphlops braminus mtDNA is inferred to be ancestral for snakes. After the divergence of the early Ramphotyphlops lineage, three types of rearrangements occurred. These changes involve translocations within the I

    Q
M tRNA gene cluster and the duplication of the CR. All phylogenetic methods support the placement of Enhydris plumbea outside of the (Colubridae + Elapidae) cluster, providing mitochondrial genomic evidence for the familial rank of Homalopsidae.

Background

Snakes are a large group of reptiles with a broad range of morphological features, of which many are evolutionarily selected by their habitats. Snakes have conventionally been divided into two groups. The fossorial scolecophidians (blindsnakes and threadsnakes) are small snakes with a small gape size that feed on small prey on a frequent basis. The second major group, the alethinophidians (or "true snakes") are more ecologically diverse and most species feed on relatively large prey on an infrequent basis. True snakes are further split into the Henophidia and the Caenophidia. The caenophidians, which are also called advanced snakes, include the aquatic genus Acrochordus and the Colubroidea. The Colubroidea is subdivided into the families Atractaspididae, Elapidae, Viperidae, and Colubridae. A small colubrid subfamily, Homalopsinae, was first attributed familial rank by Günther in 1864[1], and was recognized as subfamily in the 20th century by most researchers until it was reassigned familial status in recent years [2-4].

Recent phylogenetic analyses, based primarily on molecular analyses of a few mitochondrial or nuclear genes failed to reach a consensus in several aspects [2,3,5,6]. For instance, the composition of the family Colubridae, the putative paraphyly and the hierarchical structuring into subfamilies remain contentious issues. The mitochondrial genome has several advantages for phylogenetic studies [7,8], and has been widely used in constructing animal phylogeny including snakes [9].

Previous studies of snake mitochondrial genomes have demonstrated that duplication of the control region and translocation of trnL are two visible features of the alethinophidian mtDNAs [9,10]. Moreover, translocation and pseudogenization of trnP have been found in some caenophidian snakes [10]. The Texas threadsnake (Leptotyphlops dulcis) possesses a different gene arrangement and loses its origin of light strand replication (OL) [11]. In the present study, we determined complete mitochondrial DNA sequences from four snake families. The sequence information allowed the gene organizations, mitochondrial genome evolution, and phylogenetic relationships among these major snake families to be identified.

Results

Characteristics of the snake mitochondrial genomes

The general characteristics of four snake (Table 1) mitochondrial genomes are summarized in Table 2. These complete mt genomes range from 16,397 to 17,548 bps in size. Length differences are largely due to the variation in lengths and/or numbers of the control region. In three of the four genomes (Deinagkistrodon acutus, Naja naja, and Enhydris plumbea), two control regions are found in the positions identical to those in other alethinophidian taxa mtDNAs. MtDNA sequence for Ramphotyphlops braminus is considerably smaller due to the absence of the control region duplication. All the genomes contain 13 protein-coding genes, 2 rRNAs genes, and 22 tRNAs genes. The base compositions in these mtDNAs are skewed similarly to other vertebrate mtDNAs [12], with more A-T base pairs than G-C base pairs and greater A+C content in the gene-rich strand than in the gene-poor strand.

Table 1. List of taxa used in this study

Table 2. General characteristics of four snake mitochondrial genomes

By comparing the gene arrangements in 14 known snake mitochondrial genomes, six types of gene organization were summarized and shown in Figure 1. Type I and II represent gene organizations of two scoleophidian snakes. They both have similar organization to that of typical vertebrate except the absence of an identifiable origin of light strand replication (OL) in the scolecophidian mtDNAs. Moreover, in the mitochondrial genomeof Leptotyphlops dulcis (Type II), trnQ was translocated from the I

    Q
M tRNA gene cluster to the W
    ANCY
cluster (the tRNA genes are abbreviated by single letters representing the amino acids to be decoded, and the sense strand of the underlined tRNA genes is the heavy strand), giving rise to the IM plus W
    QANCY
organizations [11]. The common features of the remaining four arrangements (type III to VI) are the duplication of the control region and translocation of trnL, which have been noted in previous studies [9,10]. Six alethinophidian snakes from six families shared type III, and type IV represented an arrangement shared between Colubridae and Homalopsidae. In these two types, the functional trnP is located next to CRI (location in typical vertebrate mtDNA), with or without a pseudogene (P*) close to CR II (there are two control regions in most snake mitogenomes except scolecophidian snakes shown as type I and II). In several previous studies, the viperid snakes appear to have another gene rearrangement in which a functional trnP moved from the end of CR I to that of CR II [9,10,13]. In this study, the positional switch of trnP is found for another viperid taxon Deinagkistrodon acutus, and is not found for the nonviperid snakes (Fig. 1). Gene arrangements in viperids can be summarize as type V and VI, with only a difference in the pseudogene (P*).

thumbnailFigure 1. Comparison of mitochondrial gene organizations of snakes. Gene arrangements are presented for the following six types of snakes. I: Ramphotyphlops braminus; II:Leptotyphlops dulcis; III: Naja naja, Acrochordus granulatus, Boa constrictor, Cylindrophis ruffus, Python regius, Xenopeltis unicolor; IV: Dinodon semicarinatus, Pantherophis slowinskii, Enhydris plumbea; V: Ovophis okinavensis; VI: Deinagkistrodon acutus, Agkistrodon piscivorus. L1, L2, S1, S2. P* represent genes for trnL(UUR), trnL(CUN), trnS(AGY), trnS(UCN), and a pseudogene for trnP. tRNAs transcribed from the gene-rich and gene-poor strand were specified by noting their names above and below the gene map respectively. The tree topology on the right side was simplified from Fig. 2.

Phylogenetic analyses

The results of the phylogenetic reconstructions are displayed in Figure 2. Our phylogenetic estimates strongly support Ramphotyphlops braminus being the sister lineage to all remaining species sampled, and that the scolecophidian snakes do not form a monophyletic grouping. All of the analyses were congruent in receiving the monophyly of Alethinophidia (c in Figure 2). Within this monophyletic clade, Henophidia and Caenophidia were clustered as monophyletic sister groups (k and d, respectively). Our estimates of relationships among the henophidians are similar to that of Dong and Kumazawa [9] in the rejection of the traditional expectations of a sister-group relationship between boids and pythonids (m in Fig. 2) and the deep divergence of non-macrostomatan cylindrophiids (l in Fig 2). Consistent with recent molecular phylogenies of caenophidian snakes [2,3,5], we find support for the Viperidae as the deepest diverging lineage within the Colubroidea, sister to a clade containing colubrids, elapids, and homalopsids. The position of Enhydris plumbea outside of the (Colubridae + Elapidae) cluster seems strongly supported by all phylogenetic methods (f in Fig. 2). Traditional placement of Enhydris plumbea within Colubridae based on morphology was not retrieved by our analyses. Node d, corresponding to the position of Acrochordus granulatus, is the only one not reconstructed in all tree-building methods. In Bayesian and ML tree based on nucleotide data set and Bayesian tree on amino acid data set, the placement of Acrochordidae to the sister group to Colubroidea was well supported.

thumbnailFigure 2. Phylogenetic analyses of the amino acid and nucleotide sequence data sets. The phylogram shown is the best maximum likelihood tree (-lnL = 34008.56) obtained from the nucleotide data set. Bar represents 0.1 mutations per site. Nodes receiving support by one or more of the applied phylogenetic methods, i. e., Bayesian inference (BI), Maximum likelihood (ML), maximum parsimony (MP), Neighbor-joining (NJ) are labelled with lowercase letters. nt = values obtained from the nucleotide data set; and pr = values obtained from the amino acid data set. Bootstrap and Bayesian inference values are listed in the table. NR means that the corresponding nodal relationship was not reconstructed in the tree topology. Types I to VI are corresponding to those in Fig. 1.

To further substantiate the above premises, we performed the SH statistical test [14] on both amino acid and nucleotide data sets that allows comparison of alternative phylogenetic hypotheses. Results of the SH test strongly rejected the placement of Enhydris plumbea within the colubrids cluster (P < 0.001). The monophyly of scolecophidian snakes was not rejected using both amino acid and nucleotide acid data sets (P > 0.05).

Discussion

Evolution of snake mitochondrial genomes

Based on the phylogenetic relationships among the tested snakes and the comparisons of their gene organizations (Fig. 1, 2), we estimated the processes of evolutionary events occurred in snake mitochondrial genomes. In early snake lineages (type I and II), gene arrangements are similar to that of typical vertebrate, but OL was lost within the W

    ANCY
tRNA gene cluster. Incompatible with the commonly accepted view on monophyly of scolecophidian snakes [15-18], our phylogenetic estimates strongly supports Ramphotyphlops braminus being the sister lineage to all remaining species sampled. Given that the monophyly of scolecophidian snakes was not rejected in SH statistical test, loss of OL may occur in two different scenarios, independently (if nonmonophyly) or descend from a common ancestry (if monophyly). After the divergence of the Ramphotyphlops lineage, changes involving the I
    Q
M
cluster took place. First in type II, trnQ underwent a long distance translocation (~1.2 kb) from one gene cluster to another (Fig. 1, 2). Subsequently, in the early alethinophidian lineage, the control region was duplicated and trnL relocated to the I
    Q
M
cluster, giving rise to type III which is present in most alethinophidian snakes (including henophidians, Acrochordus and Naja). New types emerged during the split in Caenophidia. Type IV is found in two branches, Dinodon semicarinatus, Pantherophis slowinskii (Colubridae), and Enhydris plumbea (Homalopsidae), and characteristic changes (P*) likely appeared ahead of node f, which then disappeared in Elapidae. It is also conceivable that the present of P* was resulted from independent evolution in Colubridae and Homalopsidae. Distinct arrangements (type V and VI) were found in viperids, suggesting that trnP was translocated in early stage of the viperid radiations [9]. Type VI, with no pseudogenes close to CRI, was found in two paraphyletic taxa, suggesting that P* could have been independently eliminated.

Familial rank of Homalopsidae

The Homalopsinae have been generally recognized as a valid monophyletic clade within the Colubridae [19,20] and assigned a subfamilial rank, despite they being assigned familial [1] or tribal [21] status historically. Recent molecular studies placed the Homalopsinae as the sister group to most other members of the Colubroidea [2,3,5], and a familial status has been reassigned accordingly [2-4].

In this study, the placement of Enhydris plumbea, a representative of the Homalopsidae, as the sister lineage to the Colubridae + Elapidae clade was strongly supported by all phylogenetic methods. Moreover, SH test strongly rejected the hypothesis that Enhydris plumbea falls within the colubrids cluster (P < 0.001). The familial rank of Homalopsidae is therefore considered well-supported. Our work for the first time establishes the monophyly and distinctiveness of this family with phylogenetic evidence derived from complete mitochondrial genome sequences.

Conclusion

In this study, six types of mitochondrial gene arrangement in snakes are summarized. Two notable features of the alethinophidian mtDNA, duplication of the CR and translocation of trnL, are presented. The gene arrangement in Ramphotyphlops braminus mtDNA is indentical to that found in typical vertebrates, suggesting an ancestral arrangement. The well supported phylogenetic topology helps to reconstruct the evolution of mitochondrial gene arrangements in snakes. We propose that, after the divergence of the early Ramphotyphlops lineage, three types of changes involving the I

    Q
M gene cluster occurred. These include the translocation of trnQ in the early Leptotyphlops lineage, the duplication of CR and translocation of trnL in the early alethinophidian lineage, and the translocation of trnP in the early viperid lineage. All phylogenetic methods support the placement of Enhydris plumbea outside of the (Colubridae + Elapidae) cluster, providing mitochondrial genomic evidence for the familial rank of Homalopsidae. The monophyly of Scolecophidia is not rejected in our study. However, a more comprehensive sampling of snake mitochondrial genomes is necessary to further refine the phylogenetic relationships among major groups of snakes.

Methods

Samples, DNA amplification, and sequencing

Snakes from three alethinophidian families and one scoleophidian family were sampled (Table 1). Total DNA was extracted from a small quantity (20 mg) of tissues by DNeasy Tissue Kit (Qiagen). Several short mtDNA fragments were amplified using Ex-Taq DNA polymerase (Takara) and sequenced in order to design taxon-specific primers. PCRs were performed in a MJ PTC-200 thermal cycler under the profile: 5 min at 95°C followed by 35 cycles of 95°C for 30 s, 50–55°C for 30 s, and 72°C for 90 s. PCR products of 1~2.5 kb were purified and then sequenced employing an ABI 310 or 3700 system with bi-directional and several internal primers. Short fragments were assembled into a continuous sequence. In the mtDNA sequences thus obtained, 37 individual genes were identified based on corresponding homologues from other vertebrates. Identification of tRNA genes was based on their secondary structures using software DNASIS 2.5 (Hitachi Engineering, Tokyo, Japan), whereas boundaries of rRNA genes and control regions were tentatively defined by the boundaries of adjacent coding genes. The mtDNA sequences, with annotations, have been deposited at GenBank (DQ343647DQ343650).

Taxa, alignment and phylogenetic analyses

We assembled 14 serpent ingroups with complete mitochondrial genomes available, and chose 4 taxa from 4 saurian families (Amphisbaena schmidti [22]; Eumeces egregius [23];Iguana iguana [24]; and Varanus komodoensis [25]) as outgroups (Table 1). Two data sets were prepared for concatenated amino acid sequences and for concatenated light-strand nucleotide sequences of the 12 protein genes. Nad6, the only protein gene encoded by the light strand, has been excluded for increased proportion of T and G in all codon positions due to the strand-specific base composition bias of mtDNAs. Multiple alignments were analyzed with the Gblocks program [26] to select conserved amino acid residues, which was later used as a backbone to align the corresponding nucleotide sequences.

The level of saturation in the whole codons, and at the first, second, and the third codon positions was independently analyzed using scatter plot graphics, by comparing the uncorrected p-distance with the distance calculated by applying the best-fit evolutionary model (GTR + I + G) selected by the Modeltest 3.7 [27]. The third positions of the protein genes were removed from the nucleotide data set because of high substitutional rates and consequent saturation as a source of noise in phylogenetic analyses. Thus a final alignment of 6566 bases was obtained.

Phylogenetic analyses were carried out using maximum likelihood (ML), Bayesian (BI), maximum parsimony (MP) and neighbor-joining (NJ) methods. The ML analyses with the nucleotide data set were conducted with PAUP*4.0b10 by a heuristic search with TBR branch swapping with 10 random taxon additions. The general reversible model (GTR + I + G) and parameters optimized by Modeltest 3.7 were used. Bayesian phylogenetic analyses of the nucleotide sequences were performed with MrBayes 3.1 [28] using a GTR + I + G model. The Markov chain Monte Carlo process was set to run four chains simultaneously. Posterior probabilities were calculated from the majority-rule consensus trees constructed after excluding the burn-in.

ML analyses with the amino acid data set were conducted using PUZZLE 5.2 [29] with the mtREV24 substitution matrix and amino acid frequency estimated from the data set. The Bayesian analyses of the amino acid data were conducted with MrBayes 3.0 using the mtREV24 + I + G model and an empirical amino acid frequency. The Bayesian tree and posterior probability values were obtained using the same procedures described above.

Authors' contributions

JY conducted the amplification and sequence assembly of the mitochondrial genomes of three snakes. She collected, analyzed and summarized the data, and drafted the manuscript. HDL conducted the amplification and sequence assembly of the mitochondrial genomes of Enhydris plumbea. KYZ conceived the study and participated in its design and data interpretation, and preparation of the manuscript. All authors read and approved the final manuscript.

Acknowledgements

We would like to thank Xiang Ji, Hongying Sun and Jianhua Dai for help in sample collection. Support for this research was provided by the National Natural Science Foundation of China (NSFC) grant (No. 30570249) to KYZ.

References

  1. Günther ACLG, ed.: Reptiles of British India. London: The Ray Society; 1864.

  2. Lawson R, Slowinski JB, Crother BI, Burbrink FT: Phylogeny of the Colubroidea (Serpentes): new evidence from mitochondrial and nuclear genes.

    Mol Phylogenet Evol 2005, 37(2):581-601. PubMed Abstract | Publisher Full Text OpenURL

  3. Vidal N, Delmas AS, David P, Cruaud C, Couloux A, Hedges SB: The phylogeny and classification of caenophidian snakes inferred from seven nuclear protein-coding genes.

    C R Biol 2007, 330(2):182-187. PubMed Abstract | Publisher Full Text OpenURL

  4. Wiens JJ, Kuczynski CA, Smith SA, Mulcahy DG, Sites JW Jr, Townsend TM, Reeder TW: Branch lengths, support, and congruence: testing the phylogenomic approach with 20 nuclear Loci in snakes.

    Syst Biol 2008, 57(3):420-431. PubMed Abstract | Publisher Full Text OpenURL

  5. Kelly CM, Barker NP, Villet MH: Phylogenetics of advanced snakes (Caenophidia) based on four mitochondrial genes.

    Syst Biol 2003, 52(4):439-459. PubMed Abstract | Publisher Full Text OpenURL

  6. Vidal N, Hedges SB: Higher-level relationships of snakes inferred from four nuclear and mitochondrial genes.

    C R Biol 2002, 325(9):977-985. PubMed Abstract | Publisher Full Text OpenURL

  7. Saccone C, Gissi C, Reyes A, Larizza A, Sbisa E, Pesole G: Mitochondrial DNA in metazoa: degree of freedom in a frozen event.

    Gene 2002, 286(1):3-12. PubMed Abstract | Publisher Full Text OpenURL

  8. Boore JL: Animal mitochondrial genomes.

    Nucleic Acids Res 1999, 27(8):1767-1780. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  9. Dong S, Kumazawa Y: Complete mitochondrial DNA sequences of six snakes: phylogenetic relationships and molecular evolution of genomic features.

    J Mol Evol 2005, 61(1):12-22. PubMed Abstract | Publisher Full Text OpenURL

  10. Kumazawa Y, Ota H, Nishida M, Ozawa T: Gene rearrangements in snake mitochondrial genomes: highly concerted evolution of control-region-like sequences duplicated and inserted into a tRNA gene cluster.

    Mol Biol Evol 1996, 13(9):1242-1254. PubMed Abstract | Publisher Full Text OpenURL

  11. Kumazawa Y: Mitochondrial DNA sequences of five squamates: phylogenetic affiliation of snakes.

    DNA Res 2004, 11(2):137-144. PubMed Abstract | Publisher Full Text OpenURL

  12. Asakawa S, Kumazawa Y, Araki T, Himeno H, Miura K, Watanabe K: Strand-specific nucleotide composition bias in echinoderm and vertebrate mitochondrial genomes.

    J Mol Evol 1991, 32(6):511-520. PubMed Abstract | Publisher Full Text OpenURL

  13. Jiang ZJ, Castoe TA, Austin CC, Burbrink FT, Herron MD, McGuire JA, Parkinson CL, Pollock DD: Comparative mitochondrial genomics of snakes: extraordinary substitution rate dynamics and functionality of the duplicate control region.

    BMC Evol Biol 2007, 7:123. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  14. Shimodaira H, Hasegawa M: Multiple comparisons of log-likelihoods with applications to phylogenetic inference.

    Mol Biol Evol 1999, 16(8):1114-1116. OpenURL

  15. Lee MSY, Scanlon JD: Snake phylogeny based on osteology, soft anatomy and ecology.

    Biol Rev 2002, 77(3):333-401. PubMed Abstract | Publisher Full Text OpenURL

  16. Heise PJ, Maxson LR, Dowling HG, Hedges SB: Higher-level snake phylogeny inferred from mitochondrial DNA sequences of 12S rRNA and 16S rRNA genes.

    Mol Biol Evol 1995, 12(2):259-265. PubMed Abstract | Publisher Full Text OpenURL

  17. Vidal N, David P: New insights into the early history of snakes inferred from two nuclear genes.

    Mol Phylogenet Evol 2004, 31(2):783-787. PubMed Abstract | Publisher Full Text OpenURL

  18. Underwood G, British M: A contribution to the classification of snakes. British Museum (Natural History); 1967.

  19. Zug GR, Vitt LJ, Caldwell JP, eds.: Herpetology. San Diego: Academic Press; 2001.

  20. Greene HW, ed.: Snakes: the evolution of mystery in nature. Berkeley: Univ. of California Press; 1997.

  21. Dowling HG, Duellman WE, eds.: Systematic herpetology: a synopsis of families and higher categories. New York: HISS Publications; 1978.

  22. Macey JR, Papenfuss TJ, Kuehl JV, Fourcade HM, Boore JL: Phylogenetic relationships among amphisbaenian reptiles based on complete mitochondrial genomic sequences.

    Mol Phylogenet Evol 2004, 33(1):22-31. PubMed Abstract | Publisher Full Text OpenURL

  23. Kumazawa Y, Nishida M: Complete mitochondrial DNA sequences of the green turtle and blue-tailed mole skink: statistical evidence for archosaurian affinity of turtles.

    Mol Biol Evol 1999, 16(6):784-792. PubMed Abstract | Publisher Full Text OpenURL

  24. Janke A, Erpenbeck D, Nilsson M, Arnason U: The mitochondrial genomes of the iguana (Iguana iguana) and the caiman (Caiman crocodylus): implications for amniote phylogeny.

    Proc Biol Sci 2001, 268(1467):623-631. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  25. Kumazawa Y, Endo H: Mitochondrial genome of the Komodo dragon: efficient sequencing method with reptile-oriented primers and novel gene rearrangements.

    DNA Res 2004, 11(2):115-125. PubMed Abstract | Publisher Full Text OpenURL

  26. Castresana J: Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

    Mol Biol Evol 2000, 17(4):540-552. PubMed Abstract | Publisher Full Text OpenURL

  27. Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution.

    Bioinformatics 1998, 14(9):817-818. PubMed Abstract | Publisher Full Text OpenURL

  28. Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models.

    Bioinformatics 2003, 19(12):1572-1574. PubMed Abstract | Publisher Full Text OpenURL

  29. Schmidt HA, Strimmer K, Vingron M, von Haeseler A: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing.

    Bioinformatics 2002, 18(3):502-504. PubMed Abstract | Publisher Full Text OpenURL