Email updates

Keep up to date with the latest news and content from BMC Evolutionary Biology and BioMed Central.

Open Access Research article

Comparative mitochondrial genomics within and among species of killifish

Andrew Whitehead

Author Affiliations

Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA

BMC Evolutionary Biology 2009, 9:11  doi:10.1186/1471-2148-9-11


The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2148/9/11


Received:20 August 2008
Accepted:13 January 2009
Published:13 January 2009

© 2009 Whitehead; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

This study was motivated by the observation of unusual mitochondrial haplotype distributions and associated physiological differences between populations of the killifish Fundulus heteroclitus distributed along the Atlantic coast of North America. A distinct "northern" haplotype is fixed in all populations north of New Jersey, and does not appear south of New Jersey except in extreme upper-estuary fresh water habitats, and northern individuals are known to be more tolerant of hyposmotic conditions than southern individuals. Complete mitochondrial genomes were sequenced from individuals from northern coastal, southern coastal, and fresh water populations (and from out-groups). Comparative genomics approaches were used to test multiple evolutionary hypotheses proposed to explain among-population genome variation including directional selection and hybridization.

Results

Structure and organization of the Fundulus mitochondrial genome is typical of animals, yet subtle differences in substitution patterns exist among populations. No signals of directional selection or hybridization were detected. Mitochondrial genes evolve at variable rates, but all genes exhibit very low dN/dS ratios across all lineages, and the southern population harbors more synonymous polymorphism than other populations.

Conclusion

Evolution of mitochondrial genomes within Fundulus is primarily governed by interaction between strong purifying selection and demographic influences, including larger historical population size in the south. Though directional selection and hybridization hypotheses were not supported, adaptive processes may indirectly contribute to partitioning of variation between populations.

Background

Fundulus heteroclitus is a euryhaline killifish that occupies diverse habitats along steep osmotic and thermal gradients; large populations are found in extreme upper-estuary fresh water sites through brackish estuarine habitats to marine environments, and across a steep latitudinal temperature gradient along the Atlantic coast of North America from Nova Scotia (Canada) to Florida (USA). This species has long been of interest to science in part because of their extraordinary physiological resilience in the face of osmotic stress [1], and as a model for studying adaptive evolutionary variation along ecological clines [2]. To date, the bulk of evolutionary studies on this species have focused on adaptive variation related to temperature and pollution (see review in Burnett et al. [3]). However, other studies reveal unusual patterns of mitochondrial haplotype distributions that covary with differences in osmotic habitat [4] and physiological tolerance to osmotic stress [5,6]. Together, results from these studies converge on the hypothesis that natural selection has helped shape mitochondrial genomic variation as an adaptive response to physiological challenges posed by life in fresh water habitats. The purpose of this study was to compare mitochondrial genome variation within and among Fundulus species to test this and related hypotheses.

For coastal populations of F. heteroclitus, a phylogeographic break that centers on the Hudson River separates "northern" from "southern" types, with a narrow zone of introgression found in northern New Jersey. This phylogeographic break is supported by data from mitochondrial haplotype variation [4,7,8], but also from studies of variation in nuclear proteins [2,7,9], microsatellites [10,11], and gene expression [12]. Few studies in F. heteroclitus have examined adaptive variation associated with potential selective agents other than temperature and pollution, such as strong osmotic gradients in estuarine environments.

Hyposmotic tolerance varies widely among Fundulus species [13]. Though F. heteroclitus is widely tolerant of osmotic extremes [13], populations are distributed along steep salinity gradients in nature and within-species variation in hyposmotic stress tolerance exists. For example, adult fish from northern populations are more tolerant of hyposmotic transfer than southern populations [6], and individuals from northern populations have higher reproductive success (fertilization success and larval survival) in low salinities than those from southern populations [5]. Intriguingly, mitochondrial haplotype data show that the "northern" haplotype is never found south of the phylogeographic break, except in extreme upper estuary habitats that are exclusively fresh water [4]. In these southern fresh water habitats the northern haplotype is fixed, but only dozens of miles downstream, after transition to slightly brackish water habitats, the southern haplotype is fixed (unpublished data).

I hypothesize (as have others [4,6]) that different mitochondrial genotypes are favored by natural selection in different osmotic environments, and that the "northern" haplotype provides an adaptive advantage in hyposmotic habitats. One model that could account for the unusual contemporary distribution of mitochondrial haplotypes is as follows: As Pleistocene glaciers receded, successful invaders of newly emergent northern habitats were likely those that could tolerate low salinities leading to fixation of those genotypes during northward expansion, and those same genotypes remain fixed in southern habitats that are fresh water. It is physiologically plausible that mitochondrial genes may be involved in adaptation to fresh water habitats since hyposmotic acclimation is very energetically demanding [14,15] requiring the recruitment of mitochondria-rich cells in gill epithelia [16]. An alternate hypothesis is that mitochondrial haplotypes from the exclusively fresh water species F. diaphanus have introgressed into F. heteroclitus populations accounting for the unique haplotype distribution and hyposmotic tolerance in "northern" F. heteroclitus. This possibility has been recognized by others [6,17] and is plausible since F. diaphanus live exclusively in fresh water, co-occur with F. heteroclitus in fresh water and northern habitats, these two species are known to hybridize where they co-occur, and hybrids are only successful from pairings of female F. diaphanus with male F. heteroclitus [17,18]. Another alternative is that southern fresh water populations are remnants from a northward expansion of northern haplotypes following Pleistocene glaciation.

To test these hypotheses I sequenced complete mitochondrial genomes (mitogenomes) from northern, southern, and fresh water populations of F. heteroclitus, from F. grandis (the sister taxon to F. heteroclitus) and from F. diaphanus. Mitogenome sequences were used to test for the influence of neutral drift, directional selection, and selective constraint on intraspecific and interspecific variation, and to define phylogenetic relationships among populations and species.

Methods

Populations and species used

Complete mitochondrial genomes were sequenced from two individuals from each of five taxa including three populations of Fundulus heteroclitus, one population F. grandis (Fgds; Cocodrie, Louisiana) which is the sister species to F. heteroclitus, and one population of F. diaphanus (Fdia; Piscataway Park, Maryland) which is a co-occurring congener known to sometimes hybridize with F. heteroclitus [18]. F. heteroclitus were sampled from Maine (Fhet-ME; north of phylogeographic break, northern haplotype), Georgia (Fhet-GA; south of phylogeographic break, southern haplotype), and from an extreme upper-estuary fresh water habitat (Fhet-MDPP; south of phylogeographic break, northern haplotype). This fresh water F. heteroclitus population is from the Potomac River at Piscataway Park in Maryland. This is the same site as site "26" in Duvernell et al. [11] and site "PS" in Smith et al. [4], and is the same site from which F. diaphanus were collected for this study.

DNA preparation, PCR, and sequencing

Genomic DNA was extracted from fresh livers using Qiagen DNeasy extraction kits, and PCR primers were from Lee et al. [19] or Miya & Nishida [20] or specifically designed for this study (see Additional file 1). Long-distance PCR (TaKaRa LA taq) was first used to amplify three overlapping regions covering the complete mitogenome sequence. Long PCR amplifications were performed in 50 μl reactions including 10 μM final concentration of each primer under cycling conditions including 40 cycles of 94°C denaturation for 30-s followed by 15-min annealing/extension at 68°C, with a final 72°C hold for 10-min. Long amplification products were used as the template for a series of 39 overlapping PCR reactions. PCR was performed in 20 μl reactions including 0.25 μM final concentration of each primer under cycling conditions including 5 cycles of 20-s denaturation at 94°C, 15-s annealing at 37°C, and 45-s extension at 72°C, followed by 25 cycles of 20-s denaturation at 94°C, 15-s annealing at 45°C, and 45-s extension at 72°C. Amplification products from both long-distance and nested PCR reactions were electrophoresed on 1% agarose to verify size and cleaned using Ampure magnetic beads (Agencourt Bioscience).

Additional file 1. Table 1: Primers used for PCR and sequencing of Fundulus mitochondrial genomes. A list of primer names and sequences used for PCR and sequencing.

Format: XLS Size: 29KB Download file

This file can be viewed with: Microsoft Excel ViewerOpen Data

Both strands of PCR products were directly sequenced using Big-Dye Terminator cycle sequencing (Applied Biosystems) in 10 μl reactions including 0.5 μl Big Dye and 0.32 μM final concentration of sequencing primers (forward and reverse PCR primers). The cycling profile consisted of 40 cycles of 15-s denaturation at 94°C, 20-s annealing at 37°C, and 4-min extension at 60°C. Sequencing reaction products were cleaned using CleanSeq magnetic beads (Agencourt Bioscience) before electrophoresis on an Applied Biosystems 3130XL Genetic Analyzer.

Genome investigations

A consensus sequence was determined for each individual genome following assembly of individual reads in Gap4 [21]. Complete mitogenome consensus sequences were aligned using ClustalW in MEGA 4.0 [22] and alignments annotated in SEQUIN http://www.ncbi.nlm.nih.gov/Sequin/sequin.hlp.html webcite. Transfer RNAs were identified using tRNAscan [23] and protein coding genes by alignment with Kryptolebias marmoratus and Gambusia affinis mitogenome sequences (GenBank accession numbers AF283503 and AP004422, respectively) coupled with determination of in-frame start and stop codon positions.

Genome-wide characteristics were enumerated and compared among lineages. These characteristics included genome-wide, gene-specific, and taxon-specific differences in G/C content, synonymous and amino acid replacement substitution rates, transition/transversion substitution ratios, and codon usage bias. Topological predictions of transmembrane segments of electron transport proteins were determined from the consensus prediction from multiple models [24-28] using the server-based software TOPCONS http://topcons.cbr.su.se/ webcite with the sequence of individual Fhet-ME2 as the query sequence. Radical amino acid polymorphisms were identified according to the criteria outlined in Woolley et al. [29].

A suite of models was used to test for the influence of different evolutionary forces shaping mitogenome variation within and among species. First, dN/dS ratios were calculated for the entire mitogenome and for each gene separately to test for lineage-specific departures from neutrality (ratios significantly greater than 1.0 are evidence for directional selection). Second, the branch-sites model A test 2 [30] in CODEML (PAML 4.0, [31]) was used to test for departures from neutrality at specific substitution sites, since dN/dS analyses of whole mitogenomes and whole genes may be insensitive for detecting selection acting on a small number of important substitutions. Likelihood ratio tests were used to compare the null neutral model (model = 2, NSsites = 2, fix_omega = 1, omega = 1) against the alternate model of branch-specific positive selection (model = 2, NSsites = 2, fix_omega = 0, omega = 1.5) on the branch leading to F. heteroclitus northern clade. The well-supported tree indicated in Figure 1 was used as the input tree for this model. A Bayes empirical Bayes procedure [30,31] was used to test the likelihood of alternate evolutionary forces (neutral drift, purifying selection, directional selection) governing the evolution of specific substitutions along the northern branch. Third, McDonald-Kreitman tests [32] were used to test for departure from neutral evolution across Fundulus lineages by testing for significant excess of amino acid replacement fixations or excess of synonymous polymorphisms relative to divergence between pairs of taxa [33]. A neutrality index (NI) was calculated as the ratio of the number of synonymous fixed to synonymous polymorphic sites divided by the number of amino acid replacement fixed to replacement polymorphic sites [34], and statistical significance evaluated using the Fisher's exact test (two-tailed) performed in DnaSP [35]. Neutrality index values were calculated for pairings of F. grandis with each of the three F. heteroclitus populations. Index values significantly greater than 1.0 indicate excess amino acid polymorphism relative to divergence and significantly less than 1.0 indicate a deficiency of amino acid polymorphism [33,34,36].

thumbnailFigure 1. Bayesian phylogeny of concatenated mitochondrial genome protein sequences (excluding third positions of codons) with posterior probabilities indicated only if less than 100.

Phylogenetic analyses were used to test for possible hybridization between F. diaphanus and co-occurring F. heteroclitus populations. These analyses examined the relationship of Fundulus populations and species (Acanthopterygii; Atherinomorpha; Cyprinodontiformes; Fundulidae) to each other and to other closely related fishes for which complete mitogenomes exist including Cyprinodon rubrofluviatilis (Acanthopterygii; Atherinomorpha; Cyprinodontiformes; Cyprinodontidae), Gambusia affinis (Acanthopterygii; Atherinomorpha; Cyprinodontiformes; Poeciliidae), Kryptolebias marmoratus (Acanthopterygii; Atherinomorpha; Cyprinodontiformes; Rivulidae), Oryzias latipes (Acanthopterygii; Atherinomorpha; Beloniformes), Cololabis saira (Acanthopterygii; Atherinomorpha; Beloniformes), Cypselurus hiraii (Acanthopterygii; Atherinomorpha; Beloniformes), Exocoetus volitans (Acanthopterygii; Atherinomorpha; Beloniformes), and Mugil cephalus (Acanthopterygii; Mugilomorpha) (GenBank accession numbers EF442803, AP004422, AF283503, AP004421, AP002932, AB182653, AP002933, and AP002930, respectively). Protein-encoding gene sequences were concatenated and aligned using ClustalW and Bayesian methods were used to reconstruct phylogenetic relationships using MrBayes 3 [37] with third codon positions excluded (Nst = 6, Rrates = Invgamma, Ngen = 10,000,000, sampling every 10 gen, nchains = 8, nruns = 2, burnin = 0.25, outgroup = M. cephalus, all other parameters set at default values). These analyses were run on the parallel version of MrBayes 3.1.2 compiled on the LONI high performance computing system at LSU http://www.hpc.lsu.edu/loni/ webcite.

Results and discussion

Genome-wide characteristics

Fundulus mitochondrial genomes (16,524–16,531 bp long; GenBank:FJ445394FJ445403) are typical in that they contain the identical gene complement (22 transfer RNA genes, 2 ribosomal RNA genes, and 13 protein-coding genes) and gene order as found in most vertebrate mitochondrial genomes. The control region is long (865–870 bp) with many conserved blocks and found between tRNA-Pro and tRNA-Phe. The origin of light strand replication is a short highly conserved block (36 bp) found between tRNA-Asn and tRNA-Cys. Gene start sites are typically immediately downstream of the end of the previous gene, and several genes start before the end of the preceding gene. For example, the end of ATPase8 and beginning of ATPase6 overlap 10 bp, ND4L and ND4 overlap 7 bp, and the ends of ND5 and ND6 (coded on the opposite strand) overlap 4 bp. Indeed, the genomes are highly compact as only 73 bp of non-coding intergenic spacer DNA is found. All genes are encoded on the heavy chain except for ND6 and eight tRNA genes (tRNA-Gln, tRNA-Ala, tRNA-Asn, tRNA-Cys, tRNA-Tyr, tRNA-Ser (UCN), tRNA-Glu, and tRNA-Pro).

All four nucleotides are equally represented in most mitogenome regions including tRNA, rRNA, and 1st and 2nd positions of codons in protein coding regions (Figure 2a), though a significant bias against G-ending four-fold degenerate codon sites was detected in protein coding regions (p < 0.001, Chi-square goodness of fit test). This bias toward A-T enrichment at third position codon sites is consistent across genes except ND6 (Figure 2b), and consistent across all 10 mitogenome sequences (Figure 2c), though the bias is not nearly as dramatic as in Drosophila where nearly 94% of all codons end in A or T [38]. Strand-specific differences in substitution patterns are also observed. ND6 is the only gene encoded on the light chain and A→G substitutions predominate, whereas T→C substitutions predominate in the remaining genes that are encoded on the heavy chain. The same strand-biased substitution pattern is also observed in fruit flies [39] and these genome-scale phenomena may be the result of interaction between mutational tendencies and selection for translation efficiency [33].

thumbnailFigure 2. Base composition for A) each region of the mitochondrial genome summed across all genomes sequenced, B) each protein coding region summed across all genomes sequenced, and C) for each of the 10 individual genome sequenced, including two individuals from each of F. diaphanus (Fdia), F. grandis (Fgds), F. heteroclitus Georgia population (Fhet-GA), F. heteroclitus Maine population (Fhet-ME) and F. heteroclitus fresh water Maryland population (Fhet-MDPP).

Mutation rates vary across genomic regions, but only subtle differences appear among lineages. Non-coding intergenic spacer regions have the highest mutation rate (33% of sites were variable among all 10 Fundulus genomes) followed by protein coding regions (21% of sites variable) (Figure 3). The lowest mutation rates are in tRNA (6% of sites variable) followed by rRNA (8% of sites variable) and the D-loop (13% of sites variable). This pattern appears inconsistent across lineages. In branches separating sister taxa F. heteroclitus and F. grandis, rates of substitution in non-coding intergenic spacer regions appear to decrease to match or fall behind substitution rates in protein coding regions (Figure 3). However, only 73 bases constitute intergenic non-coding DNA, so such a small proportion of the overall genome may be subject to spurious differences in mutation rate across lineages given the small number of genomes sequenced. Similarly, transition/transversion rate differences in intergenic regions are likely to be spurious; that is, apparent intergenic transition/transversion differences between F. heteroclitus lineages are due to just one mutation. Excluding intergenic DNA, few differences remain among lineages in relative mutation rates of tRNA, rRNA, D-loop, and protein coding regions, except that in the F. heteroclitus Georgia lineage tRNA and rRNA mutation rates are disproportionately lower and D-loop mutations disproportionately higher than the overall pattern (p < 0.001, Chi-square goodness of fit test).

thumbnailFigure 3. Overall and branch-specific relative transition (red) and transversion (blue) mutation rates (substitutions per 100 bases) for different genomic regions, and branch-specific dN/dS ratios (number below branch). Table includes number of pairwise (with F. grandis) synonymous (S) or amino-acid replacing (R) and fixed (F) or polymorphic (P) mutations, including neutrality index (NI) from McDonald Kreitman (MK) test performed in dnaSP. Significance of MK is tested is indicated as follows; * 0.01 < P < 0.05; ** 0.001 < P < 0.01; *** P < 0.001 for Fisher's exact test with Bonferroni correction for multiple tests. Colored lines indicate lineages referred to in Figure 5.

Evolutionary mechanisms

Amino acid polymorphisms tend to be unevenly distributed across proteins (Figure 4); regions of sparse amino acid polymorphism, especially of sparse radical polymorphism, could be the result of stronger purifying selection due to the presence of functional domains. Indeed, transmembrane regions of ND2, ND4, and ND5 are located in the membrane-embedded arm of the NADH dehydrogenase complex and likely play an active role in proton pumping [40], and in Fundulus these regions harbor significantly fewer radical amino acid polymorphisms (p > 0.05, Chi-square test of independence) relative to the loop regions of these proteins (Figure 4) which is consistent with data from mammals [41].

thumbnailFigure 4. Topological predictions of transmembrane domains for mitochondrially-encoded electron transport proteins and location of amino acid polymorphisms in Fundulus. The X-axis represent amino acid site for each protein, and predicted transmembrane regions are highlighted by gray bars, where white bars represent loop regions. Positions of amino acid sites that are polymorphic among the 10 mitochondrial genomes sequenced are indicated by circles. Radical amino acid polymorphisms are highlighted as closed circles, and polymorphisms that are fixed exclusively in the northern F. heteroclitus lineage are highlighted in red.

Only five amino-acid replacement mutations are fixed exclusively in the northern lineage (including Maryland fresh water and Maine individuals: green lineage in Figure 3). Two occur in ND1 (Pro→Ser at amino acid site 73, Thr→Ala at site 264) and one in each of ND2 (Ala→Thr at site 344), ATPase8 (Glu→Asp at site 42), and ND5 (Ile→Val at site 555). All ND subunits are located in the membrane-bound portion of the NADH dehydrogenase complex; ND1 is thought to be located at the junction between the matrix arm and membrane-embedded arm, and ND2 and ND5 are located in the membrane-embedded arm where proton pumping occurs [40]. ATPase subunit 8 is a core component of the F0 complex of ATP synthase and may serve an important role in subunit assembly [42]. Though these substitutions could represent candidate loci for directional selection, statistical support is not strong. Branch-site tests indicate low posterior probabilities (less than 50%) for a model of positive selection in the northern lineage for each of these five sites. However, given the relatively limited phylogenetic sampling included in this study (only five taxa) the branch-sites tests may have insufficient power to detect positive selection [43], and more thorough phylogenetic sampling within Fundulus may increase statistical support for positive selection at these candidate loci. Only one amino acid replacement is shared between F. diaphanus and the northern populations of F. heteroclitus (Ile→Val at site 592 at edge of inner membrane region in ND5), offering little support of convergent mutation conferring hyposmotic tolerance between these fresh water tolerant and co-occurring groups. Over 160 synonymous mutations distinguish northern from southern populations of F. heteroclitus, though no lineage-specific codon usage biases were evident.

In protein coding regions, mutation rates indicate that mitogenome proteins evolve at variable rates, and patterns of substitution differ between populations of F. heteroclitus. Ratios of synonymous to non-synonymous mutations indicate that NADH dehydrogenase (ND) genes are evolving relatively quickly whereas cytochrome oxidase (CO) genes are evolving relatively slowly (Figures 5 and 6), consistent with findings in mammals [44]. Indeed, branch-site models indicate that more sites are evolving according to a model of neutral evolution (in contrast to a model of strong selective constraint) in the ND genes than in the CO genes (on average 1.79 sites per 100 for ND genes compared to 0.14 sites per 100 for CO genes). Very low dN/dS ratios are characteristic for all genes across all lineages (Figures 3 and 5), consistent with strong selective constraint governing mitogenome evolution. However, subtle differences appear in patterns of amino acid variation within and between Georgia and Maine populations, as reflected by differences in the neutrality index.

thumbnailFigure 5. Number of non-synonymous (A) and synonymous (B) mutations per site, and dN/dS ratio (C), for each gene and for each lineage. Colors indicate lineage, and match those in Figure 3. Note that Y-axis scales are different for each sub-figure.

thumbnailFigure 6. Mitochondrial protein sequence variation within and among northern (Maine and Maryland fresh-water) and Georgia populations. Blue hatch bars represent synonymous mutations that are variable within population (Variable-S), blue solid bars represent synonymous mutations that are fixed between populations (Fixed-S), red hatch bars represent amino acid replacement mutations that are variable within population (Variable-R), and red solid bars represent amino acid replacement mutations that are fixed between populations (Fixed-R).

Comparisons of fixed and variable synonymous and replacement mutations between each F. heteroclitus population and F. grandis allow calculation of the neutrality index (NI) which is the ratio of synonymous fixed sites to synonymous polymorphic sites divided by the ratio of replacement fixed sites to replacement polymorphic sites (Figure 3) [34]. Note that only two genomes were sequenced per taxon, so rates of within-group polymorphism are certainly underestimated but should not be biased among taxa being compared. NI values greater than 1.0 are common for animal mitochondrial genes [33,45], where amino acid variation is higher within taxa than expected relative to variation between taxa. This observation rejects neutral expectations, and excess amino acid polymorphism may reflect either the influence of balancing selection or selection against slightly deleterious mutation. All F. heteroclitus populations show a general pattern of NI values significantly greater than 1.0, though this pattern is most striking in the Maine population which is the only population in which the NI remains significantly greater than 1.0 after Bonferroni adjustment of p-values. This observation is consistent with the slightly deleterious model since historical population sizes are considered much larger in Georgia compared to northern populations [9,11], and excess deleterious mutation should be harder to detect in larger populations because of the proportionately greater efficiency of purifying selection. These results are consistent with NI values from Drosophila taxa of different historical population sizes [33], but do not reject the alternative explanation that balancing selection may be maintaining functional polymorphism in the Maine population.

Georgia individuals harbor twice the number of synonymous mutations compared to northern populations (Figures 3 and 6), again likely reflecting much larger historical population size in which synonymous mutations are more likely to be maintained but mildly deleterious replacement mutations are efficiently purged. Indeed, synonymous polymorphic sites are consistently fewer across genes within the Maine population compared to the Georgia population (Figure 6), and chi-squared tests of 2 × 4 contingency tables indicate that the Georgia mutation ratios are significantly different from those of both northern populations (p > 0.01 for both comparisons), but northern ratios are not different from each other (p = 0.77).

In agreement with very low dN/dS ratios, branch-site tests indicate that over 95% of substitutions appear to be evolving according to a model of strong selective constraint (substitutions with posterior probabilities > 90% for site class 0 using BEB in CODEML) across lineages. In the lineage leading to F. heteroclitus northern clade only 33 sites (out of 3,813 possible sites) reject the evolutionary model of strong selective constraint, and all 33 of these sites best fit a model of neutral evolution. Similarly, the vast majority of substitutions along all other branches in the phylogeny best fit a model of strong selective constraint.

Polymorphism in transcriptional control regions may affect protein expression levels, and therefore be phenotypically important. Presumed functional domains within the teleost control region, such as conserved sequence blocks (CSB) in the central conserved region, CSB-II, and CSB-III (as identified in Lee et al. [46]), are all highly conserved across Fundulus species. The only mutation unique to northern populations within these regions is at the beginning of the central conserved region (C→T at mitogenome position 15958 in F. heteroclitus ME-2), but the functional importance of this polymorphism has yet to be determined.

Together, these lineage-specific patterns of substitution, dN/dS ratios, and branch-site tests offer little support for the influence of direct selection on mitogenome evolution within Fundulus heteroclitus. Though five amino acid substitutions are unique to the northern clade, lineage-specific mitogenome differences more likely reflect differences in historical population size. Low dN/dS ratios coupled with results from branch-site tests indicate the influence of strong selective constraint, and an excess of synonymous polymorphisms within the Georgia population (relative to northern populations and relative to replacement polymorphisms within Georgia) is consistent with larger historical population size. Nuclear data also indicate greater genetic diversity in southern versus northern populations of F. heteroclitus [9,11]. Larger populations can harbor more purely neutral mutations and are more efficient at purging slightly deleterious mutations [47,48]. These results support the model that mitogenome evolution in killifish is primarily governed by interaction between strong selective constraint and demographic parameters, which is consistent with broader patterns of mitogenome evolution in other organisms including flies [49], copepods [50], rats [44], and humans [51].

Phylogenetic analyses

Phylogenetic analyses of concatenated mitogenome protein sequences indicate that within Fundulus the Maine population and the Maryland fresh water population mitogenome types are very similar, all three F. heteroclitus populations are monophyletic and form a clade with sister species F. grandis, and F. diaphanus is the outgroup (Figure 1). Posterior probabilities for branching nodes were high, and sequence from two nuclear genes supports the same topology (data not shown). Broader analysis of all complete mitochondrial genomes (concatenated proteins) that are currently available for the clade indicate that cyprinodontiformes and beloniformes fishes are reciprocally monophyletic (Figure 1), in contrast to the findings reported in Miya et al. [52]. Furthermore, Fundulus taxa appear to be relatively recently derived among the cyprinodontiformes lineages. These data provide no evidence for hybridization between fresh water F. heteroclitus and F. diaphanus in the upper Chesapeake estuary, and thus does not contribute to explaining the unusual distribution of mitochondrial genotypes in extreme upper-estuary habitats.

Conclusion

These comparative data illuminate the relative influence of evolutionary forces that shape mitochondrial genome variation in killifish, but do not offer strong support for hypotheses proposed to explain unusual distributions of mitochondrial types in nature. Patterns of synonymous and amino acid replacement substitutions and branch-site models of codon substitution provide little evidence that directional selection has directly influenced mitochondrial genome sequence variation between southern, northern, and fresh water populations of F. heteroclitus. However, five amino acid replacements and one mutation in a conserved and presumably functional block of the control region are unique to the northern clade and may be candidates for further study. Also, since all F. heteroclitus mitogenomes form a monophyletic group with F. diaphanus as the out-group, there is no evidence for introgression with F. diaphanus in northern and fresh water sites. Substitution patterns indicate the important role of purifying selection governing mitochondrial genome evolution, and the most apparent lineage-specific patterns of variation within F. heteroclitus is an excess of synonymous relative to replacement polymorphism in the Georgia population (Figure 3). This could reflect a larger historical population size, in which synonymous (purely neutral) polymorphisms are retained but slightly deleterious replacement variants are purged.

Since these data do not support two explicit hypotheses, the evolutionary mechanisms responsible for maintaining "northern" mitochondrial types in southern fresh water habitats remains unresolved. One possible explanation is that fresh water types are relicts from an historical northward expansion following Pleistocene glaciation. An alternate explanation is that southern types that reside in brackish habitats are competitively excluded from contributing to the gene pool in nearby fresh water habitats where northern types reside. We know that transition from fresh water to only slightly brackish demands considerable physiological adjustment [53]. We also know that northern types have higher fitness [5] and better osmotic compensatory ability [6] than southern types in fresh water. One hypothetical mechanism keeping gene pools distinct may be through breakdown of intergenomic coadaptation where unmatched nuclear-mitochondrial hybrids exhibit lower fitness (reviewed in Rand et al. [54]). One way to test the competitive exclusion and relict population hypotheses would be to densely sample populations along the salinity cline to test for sharp inflections (indicating competitive exclusion) or gradual transitions (indicating introgression) in allele frequencies across the ecological boundary between fresh water and slightly brackish habitats. This work is ongoing in the Whitehead laboratory.

Authors' contributions

AW designed experiments, carried out laboratory work and statistical analysis, and drafted the manuscript.

Acknowledgements

Special thanks to Dr. Douglas Crawford (University of Miami) and Dr. Mark Batzer (Louisiana State University) for access to sequencing facilities, and to Dr. Scott Herke (Louisiana State University) and Ms. Jen Roach (Louisiana State University) for assistance with sequencing. Earlier drafts of the manuscript benefitted from helpful comments from Dr. Michael Hellberg (Louisiana State University), and reviews from Dr. David Rand (Brown University) and a second anonymous reviewer. This research was supported by NSF grants BES-0652006 and EF-0723771 to AW.

References

  1. Wood CM, Marshall WS: Ion balance, acid-base regulation, and chloride cell function in the Common Killifish, Fundulus heteroclitus – a euryhaline estuarine teleost.

    Estuaries 1994, 17(1A):34-52. Publisher Full Text OpenURL

  2. Powers DA, Smith M, Gonzalez-Villasenor I, DiMichelle L, Crawford DL, Bernardi G, Lauerman T: A multidisciplinary approach to the selectionist/neutralist controversy using the model teleost, Fundulus heteroclitus. In Oxford Surveys in Evolutionary Biology. Volume 9. Edited by Futuyma D, Antonovics J. New York, NY: Oxford University Press; 1993::43-108. OpenURL

  3. Burnett KG, Bain LJ, Baldwin WS, Callard GV, Cohen S, Di Giulio RT, Evans DH, Gómez-Chiarri M, Hahn ME, Hoover CA, et al.: Fundulus as the premier teleost model in environmental biology: Opportunities for new insights using genomics.

    Comp Biochem Physiol Part D Genomics Proteomics 2007, 2(4):257-286. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  4. Smith MW, Chapman RW, Powers DA: Mitochondrial DNA analysis of Atlantic Coast, Chesapeake Bay, and Delaware Bay populations of the teleost Fundulus heteroclitus indicates temporally unstable distributions over geologic time.

    Mol Mar Biol Biotech 1998, 7(2):79-87. OpenURL

  5. Able KW, Palmer RE: Salinity effects on fertilization success and larval mortality of Fundulus heteroclitus.

    Copeia 1988, (2):345-350. Publisher Full Text OpenURL

  6. Scott GR, Rogers JT, Richards JG, Wood CA, Schulte PM: Intraspecific divergence of ionoregulatory physiology in the euryhaline teleost Fundulus heteroclitus: possible mechanisms of freshwater adaptation.

    J Exp Biol 2004, 207(19):3399-3410. PubMed Abstract | Publisher Full Text OpenURL

  7. Bernardi G, Sordino P, Powers DA: Concordant Mitochondrial and Nuclear-DNA Phylogenies for Populations of the Teleost Fish Fundulus-Heteroclitus.

    P Natl Acad Sci USA 1993, 90(20):9271-9274. Publisher Full Text OpenURL

  8. Gonzalez-Villasenor LI, Powers DA: Mitochondrial DNA restriction site polymorphisms in the teleost Fundulus heteroclitus support secondary intergradation.

    Evolution 1990, 44(1):27-37. Publisher Full Text OpenURL

  9. Ropson IJ, Brown DC, Powers DA: Biochemical genetics of Fundulus heteroclitus (L) .6. Geographical variation in the gene frequencies of 15 loci.

    Evolution 1990, 44(1):16-26. Publisher Full Text OpenURL

  10. Adams SM, Lindmeier JB, Duvernell DD: Microsatellite analysis of the phylogeography, Pleistocene history and secondary contact hypotheses for the killifish, Fundulus heteroclitus.

    Molecular Ecology 2006, 15(4):1109-1123. PubMed Abstract | Publisher Full Text OpenURL

  11. Duvernell DD, Lindmeier JB, Faust KE, Whitehead A: Relative influences of historical and contemporary forces shaping the distribution of genetic variation in the Atlantic killifish, Fundulus heteroclitus.

    Molecular Ecology 2008, 17(5):1344-1360. PubMed Abstract | Publisher Full Text OpenURL

  12. Whitehead A, Crawford DL: Neutral and adaptive variation in gene expression.

    P Natl Acad Sci USA 2006, 103(14):5425-5430. Publisher Full Text OpenURL

  13. Griffith RW: Environment and salinity tolerance in the genus Fundulus.

    Copeia 1974, 319-331. Publisher Full Text OpenURL

  14. Kidder GW, Petersen CW, Preston RL: Energetics of osmoregulation: I. Oxygen consumption by Fundulus heteroclitus.

    J Exp Zool Part A 2006, 305A(4):309-317. Publisher Full Text OpenURL

  15. Kidder GW, Petersen CW, Preston RL: Energetics of osmoregulation: II. Water flux and osmoregulatory work in the euryhaline fish, Fundulus heteroclitus.

    J Exp Zool Part A 2006, 305A(4):318-327. Publisher Full Text OpenURL

  16. Evans DH, Piermarini PM, Choe KP: The multifunctional fish gill: Dominant site of gas exchange, osmoregulation, acid-base regulation, and excretion of nitrogenous waste.

    Physiological Reviews 2005, 85(1):97-177. PubMed Abstract | Publisher Full Text OpenURL

  17. Chavez CH, Turgeon J: Asexual and sexual hybrids between Fundulus diaphanus and F. heteroclitus in the Canadian Atlantic region.

    Molecular Ecology 2007, 16(7):1467-1480. PubMed Abstract | Publisher Full Text OpenURL

  18. Dawley RM: Clonal hybrids of the common laboratory fish Fundulus heteroclitus.

    P Natl Acad Sci USA 1992, 89(6):2485-2488. Publisher Full Text OpenURL

  19. Lee JS, Miya M, Lee YS, Kim CG, Park EH, Aoki Y, Nishida M: The complete DNA sequence of the mitochondrial genome of the self fertilizing fish Rivulus marmoratus (Cyprinodontiformes, Rivulidae) and the first description of duplication of a control region in fish.

    Gene 2001, 280(1–2):1-7. PubMed Abstract | Publisher Full Text OpenURL

  20. Miya M, Nishida M: Organization of the mitochondrial genome of a deep-sea fish, Gonostoma gracile (Teleostei: Stomiiformes): First example of transfer RNA gene rearrangements in bony fishes.

    Mar Biotechnol 1999, 1(5):416-426. PubMed Abstract | Publisher Full Text OpenURL

  21. Bonfield JK, Smith KF, Staden R: A new DNA sequence assembly program.

    Nucleic Acids Res 1995, 23(24):4992-4999. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  22. Kumar S, Nei M, Dudley J, Tamura K: MEGA: A biologist-centric software for evolutionary analysis of DNA and protein sequences.

    Brief Bioinform 2008, 9(4):299-306. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  23. Lowe TM, Eddy SR: tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence.

    Nucleic Acids Res 1997, 25(5):955-964. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  24. Bernsel A, Viklund H, Falk J, Lindahl E, von Heijne G, Elofsson A: Prediction of membrane-protein topology from first principles.

    P Natl Acad Sci USA 2008, 105(20):7177-7181. Publisher Full Text OpenURL

  25. Granseth E, Viklund H, Elofsson A: ZPRED: Predicting the distance to the membrane center for residues in alpha-helical membrane proteins.

    Bioinformatics 2006, 22(14):E191-E196. PubMed Abstract | Publisher Full Text OpenURL

  26. Hessa T, Meindl-Beinker NM, Bernsel A, Kim H, Sato Y, Lerch-Bader M, Nilsson I, White SH, von Heijne G: Molecular code for transmembrane-helix recognition by the Sec61 translocon.

    Nature 2007, 450(7172):1026-U1022. PubMed Abstract | Publisher Full Text OpenURL

  27. Viklund H, Elofsson A: Best alpha-helical transmembrane protein topology predictions are achieved using hidden Markov models and evolutionary information.

    Protein Science 2004, 13(7):1908-1917. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  28. Viklund H, Elofsson A: OCTOPUS: improving topology prediction by two-track ANN-based preference scores and an extended topological grammar.

    Bioinformatics 2008, 24(15):1662-1668. PubMed Abstract | Publisher Full Text OpenURL

  29. Woolley S, Johnson J, Smith MJ, Crandall KA, McClellan DA: TreeSAAP: Selection on Amino Acid Properties using phylogenetic trees.

    Bioinformatics 2003, 19(5):671-672. PubMed Abstract | Publisher Full Text OpenURL

  30. Zhang JZ, Nielsen R, Yang ZH: Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level.

    Mol Biol Evol 2005, 22(12):2472-2479. PubMed Abstract | Publisher Full Text OpenURL

  31. Yang ZH: PAML 4: Phylogenetic analysis by maximum likelihood.

    Mol Biol Evol 2007, 24(8):1586-1591. PubMed Abstract | Publisher Full Text OpenURL

  32. Mcdonald JH, Kreitman M: Adaptive Protein Evolution at the Adh Locus in Drosophila.

    Nature 1991, 351(6328):652-654. PubMed Abstract | Publisher Full Text OpenURL

  33. Rand DM, Kann LM: Mutation and selection at silent and replacement sites in the evolution of animal mitochondrial DNA.

    Genetica 1998, 103:393-407. Publisher Full Text OpenURL

  34. Rand DM, Kann LM: Excess amino acid polymorphism in mitochondrial DNA: contrasts among genes from Drosophila, mice, and humans.

    Mol Biol Evol 1996, 13(6):735-748. PubMed Abstract | Publisher Full Text OpenURL

  35. Rozas J, Sanchez-DelBarrio JC, Messeguer X, Rozas R: DnaSP, DNA polymorphism analyses by the coalescent and other methods.

    Bioinformatics 2003, 19(18):2496-2497. PubMed Abstract | Publisher Full Text OpenURL

  36. Nachman MW: Deleterious mutations in animal mitochondrial DNA.

    Genetica 1998, 103:61-69. Publisher Full Text OpenURL

  37. Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models.

    Bioinformatics 2003, 19(12):1572-1574. PubMed Abstract | Publisher Full Text OpenURL

  38. Clary DO, Wolstenholme DR: The mitochondrial DNA molecule of Drosophila yakuba – nucleotide sequence, gene organization, and genetic code.

    J Mol Evol 1985, 22(3):252-271. PubMed Abstract | Publisher Full Text OpenURL

  39. Garesse R: Drosophila melanogaster mitochondrial DNA: gene organization and evolutionary considerations.

    Genetics 1988, 118(4):649-663. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  40. Brandt U: Energy converting NADH: Quinone oxidoreductase (Complex I).

    Annu Rev Biochem 2006, 75:69-92. PubMed Abstract | Publisher Full Text OpenURL

  41. da Fonseca RR, Johnson WE, O'Brien SJ, Ramos MJ, Antunes A: The adaptive evolution of the mammalian mitochondrial genome.

    Bmc Genomics 2008., 9 PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  42. Hadikusumo RG, Meltzer S, Choo WM, Jeanfrancois MJB, Linnane AW, Marzuki S: The Definition of Mitochondrial H+-Atpase Assembly Defects in Mit-Mutants of Saccharomyces-Cerevisiae with a Monoclonal-Antibody to the Enzyme Complex as an Assembly Probe.

    Biochim Biophys Acta 1988, 933(1):212-222. PubMed Abstract | Publisher Full Text OpenURL

  43. Anisimova M, Bielawski JP, Yang Z: Accuracy and power of Bayes prediction of amino acid sites under positive selection.

    Mol Biol Evol 2002, 19(6):950-958. PubMed Abstract | Publisher Full Text OpenURL

  44. Stewart JB, Freyer C, Elson JL, Wredenberg A, Cansu Z, Trifunovic A, Larsson N-G: Strong purifying selection in transmission of mammalian mitochondrial DNA.

    PLoS Biology 2008, 6(1):e10. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  45. Weinreich DM, Rand DM: Contrasting Patterns of Nonneutral Evolution in Proteins Encoded in Nuclear and Mitochondrial Genomes.

    Genetics 2000, 156(1):385-399. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  46. Lee WJ, Conroy J, Howell WH, Kocher TD: Structure and evolution of teleost mitochondrial control regions.

    J Mol Evol 1995, 41(1):54-66. PubMed Abstract | Publisher Full Text OpenURL

  47. Ohta T: Population size and rate of evolution.

    J Mol Evol 1972, 1(4):305-314. PubMed Abstract | Publisher Full Text OpenURL

  48. Ohta T: Slightly deleterious mutant substitutions in evolution.

    Nature 1973, 246(5428):96-98. PubMed Abstract | Publisher Full Text OpenURL

  49. Ballard JWO: Comparative genomics of mitochondrial DNA in members of the Drosophila melanogaster subgroup.

    J Mol Evol 2000, 51(1):48-63. PubMed Abstract | Publisher Full Text OpenURL

  50. Willett CS, Burton RS: Evolution of interacting proteins in the mitochondrial electron transport system in a marine copepod.

    Mol Biol Evol 2004, 21(3):443-453. PubMed Abstract | Publisher Full Text OpenURL

  51. Elson JL, Turnbull DM, Howell N: Comparative genomics and the evolution of human mitochondrial DNA: Assessing the effects of selection.

    Am J Hum Genet 2004, 74(2):229-238. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  52. Miya M, Takeshima H, Endo H, Ishiguro NB, Inoue JG, Mukai T, Satoh TP, Yamaguchi M, Kawaguchi A, Mabuchi K, et al.: Major patterns of higher teleostean phylogenies: a new perspective based on 100 complete mitochondrial DNA sequences.

    Mol Phylogenet Evol 2003, 26(1):121-138. PubMed Abstract | Publisher Full Text OpenURL

  53. Philpott CW, Copeland DE: Fine structure of chloride cells from three species of Fundulus.

    J Cell Biol 1963, 18(2):389-404. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  54. Rand DM, Haney RA, Fry AJ: Cytonuclear coevolution: the genomics of cooperation.

    Trends Ecol Evol 2004, 19(12):645-653. PubMed Abstract | Publisher Full Text OpenURL