Email updates

Keep up to date with the latest news and content from BMC Evolutionary Biology and BioMed Central.

Open Access Research article

RUNX2 tandem repeats and the evolution of facial length in placental mammals

Marie A Pointer123, Jason M Kamilar245, Vera Warmuth1, Stephen G B Chester2, Frédéric Delsuc6, Nicholas I Mundy1, Robert J Asher1* and Brenda J Bradley12*

Author Affiliations

1 Department of Zoology, University of Cambridge, Cambridge, CB2 3EJ, UK

2 Department of Anthropology, Yale University, New Haven, Connecticut 06511, USA

3 Department of Zoology, University of Oxford, Oxford, OX1 3PS, UK

4 Department of Anatomy, Midwestern University, Glendale, Arizona 85308, USA

5 School of Human Evolution and Social Change, Arizona State University, Tempe, Arizona 85287, USA

6 Institut des Sciences de l'Evolution, UMR5554-CNRS-IRD, Université Montpellier II, Montpellier, France

For all author emails, please log on.

BMC Evolutionary Biology 2012, 12:103  doi:10.1186/1471-2148-12-103


The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2148/12/103


Received:12 January 2012
Accepted:28 June 2012
Published:28 June 2012

© 2012 Pointer et al.; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

When simple sequence repeats are integrated into functional genes, they can potentially act as evolutionary ‘tuning knobs’, supplying abundant genetic variation with minimal risk of pleiotropic deleterious effects. The genetic basis of variation in facial shape and length represents a possible example of this phenomenon. Runt-related transcription factor 2 (RUNX2), which is involved in osteoblast differentiation, contains a functionally-important tandem repeat of glutamine and alanine amino acids. The ratio of glutamines to alanines (the QA ratio) in this protein seemingly influences the regulation of bone development. Notably, in domestic breeds of dog, and in carnivorans in general, the ratio of glutamines to alanines is strongly correlated with facial length.

Results

In this study we examine whether this correlation holds true across placental mammals, particularly those mammals for which facial length is highly variable and related to adaptive behavior and lifestyle (e.g., primates, afrotherians, xenarthrans). We obtained relative facial length measurements and RUNX2 sequences for 41 mammalian species representing 12 orders. Using both a phylogenetic generalized least squares model and a recently-developed Bayesian comparative method, we tested for a correlation between genetic and morphometric data while controlling for phylogeny, evolutionary rates, and divergence times. Non-carnivoran taxa generally had substantially lower glutamine-alanine ratios than carnivorans (primates and xenarthrans with means of 1.34 and 1.25, respectively, compared to a mean of 3.1 for carnivorans), and we found no correlation between RUNX2 sequence and face length across placental mammals.

Conclusions

Results of our diverse comparative phylogenetic analyses indicate that QA ratio does not consistently correlate with face length across the 41 mammalian taxa considered. Thus, although RUNX2 might function as a ‘tuning knob’ modifying face length in carnivorans, this relationship is not conserved across mammals in general.

Keywords:
Mammalian evolution; Prognathism; Molecular evolution; Primates; Afrotheria; Xenarthra; Morphology

Background

Work on the molecular basis of animal adaptation is providing ample evidence that small genetic changes can underlie striking phenotypic differences, both within and between species [1-4]. Of recent particular interest is the role of short tandemly repeating DNA elements (i.e. microsatellites) as evolutionary ‘tuning knobs’ [5]. Microsatellites are mutation-prone and often show substantial length variation in repeat number [6]. These repeats are common in mammalian genomes, often occurring within coding exons, potentially translating into expansions and contractions of poly-amino acid stretches [7,8]. Moreover, proteins containing such repeats are often involved in development [9,10]. Thus, the function of a protein can be easily modified via alterations in repeat tract length [8,10] and such modifications may have less drastic pleiotropic effects than other types of mutation [4].

An example of this is the correlation between specific tandem repeats and variation in midfacial length (i.e., the degree of prognathism, or the jutting of the face and jaw) in carnivorans [11-13]. Variation in this striking morphological trait appears to be causatively associated with variation in coding sequence repeats within the gene RUNX2 (Runt-related transcription factor 2).

RUNX2 (also known as Cbfa1; Entrez Gene ID 860) codes for a transcription factor that plays an important role in mediating osteoblast differentiation and activity [14]. Its function is critical for building and repairing bone. The activity of RUNX2 is mediated by a few functional domains, one of which consists of a stretch of glutamines (abbreviated Q) followed by a stretch of alanines (A) (reviewed in [15]). For RUNX2, the ratio of glutamines to alanines (QA ratio) appears to be positively correlated with the transcriptional activity of the protein [13,16].

RUNX2 mutations in humans cause the skeletal disease cleidocranial dysplasia (MIM 119600) and have been associated with a shortening or protruding of the face and other skeletodental pathologies [17,18], some of which correspond to changes in certain clades of nonhuman mammals [19]. In general, work on RUNX2 in humans and mice indicates that when RUNX2 is up-regulated, bone development is accelerated [20]. Interestingly, in comparisons of the modern human and Neanderthal genomes, RUNX2 is specifically highlighted by Green et al. [21] as one of few genes showing a genetic signature of a selective sweep in the modern human lineage. Assuming the Neanderthal sequence data are valid, this would suggest a potentially important role for RUNX2 in human-specific cranial/skeletal features.

The QA ratio of RUNX2 is correlated with facial length across breeds of dog (92 breeds of Canis lupus familiaris; see [11] and across carnivorans in general (30 species; [13]). Specifically, dog breeds and other carnivoran species with a high QA ratio tend to have a relatively long rostrum or "face". Furthermore, Sears et al. [13] noted that the correlation was stronger among dogs, bears, raccoons and their relatives (i.e., caniforms) than among cats, civets, and mongooses (i.e., feliforms), related to the fact that the rostrum in caniforms shows more positive allometry during growth than in feliforms. That is, compared to a cat, a dog shows more growth in the face relative to the braincase during its ontogeny.

The possibility that this correlation represents a general mechanism responsible for at least some aspects of skull growth across non-carnivoran mammals has not yet been thoroughly investigated. Facial shape and length are highly variable in many mammalian clades, including afrotherians (e.g., tenrecs), xenarthrans (e.g., anteaters) and euarchontoglires (e.g., primates). Morphometrically, the skull shows a tremendous amount of continuous variation across mammals, an observation which lends itself to the metaphor of the "tuning knob" as a genetic basis behind this variation [5,22]. RUNX2, specifically the QA ratio, is thus a good candidate for examining the genetic basis of variation in skull proportions across mammals.

Here we assess the extent to which the correlation between skull shape and RUNX2 QA (Figure 1, Table 1) ratio holds true across a variety of non-carnivoran placental mammals. In particular, we focus on xenarthrans, afrotherians, and primates as these orders show marked variations in face length, which are thought to be adaptively associated with diet and ecology [23], and for these orders we have access to morphometric and genetic data. In analyzing data for 41 placental mammal species representing 12 orders (Table 1), we find no correlation between RUNX2 sequence and face length (Figures 2, and 3; Tables 2, and 3) across our sample as a whole.

thumbnailFigure 1 . Cranial measurements taken in this study. Macaque cranium in (A) ventral and (B) rostral views. CBL = distance from lateral-most point of occipital condyle to anterior premaxilla; Oc-Pat = distance from lateral-most point of occipital condyle to caudal margin of palate; Nose = anteroventral margin of orbit at lacrimal foramen to anterolateral margin of nasal bone; Face = anteroventral margin of orbit at lacrimal foramen to anterior premaxilla (specimen: ZMB 70003). Scale bar = 10mm.

Table 1. Mammalian taxa examined in this study, RUNX2 QA ratios, sources of sequence data, and face-length measures

thumbnailFigure 2 . Phylogeny of xenarthrans and afrotherians illustrating lack of correlation between QA ratio and face length. The phylogeny of xenarthrans (A, after [24]; position of Chlamyphorus after [25]) and afrotherians (B, after [26,27]) is given with corresponding facial proportions and QA ratios in selected species using graph at right. Gray boxes represent ratio face-to-skull length (bottom scale; higher values indicate a longer face); black diamonds represent ratio of glutamine to alanine in RUNX2 binding site (top scale). Note relative lack of correlation between large (e.g., Myrmecophaga) vs. short- (e.g., Chlamyphorus) faced species. Specimens shown (not to scale) are Myrmecophaga tridactyla (AMNH 1873), Tamandua tetradactyla (ZMB 38396, image reversed), Priodontes sp. (AMNH 1871), Chaetophractus villosus (AMNH 240), Chlamyphorus truncatus (AMNH 5487), Microgale dobsoni (UMZC 5458-B), Tenrec ecaudatus ZMB 90377), and Hemicentetes semispinosus (ZMB 5007).

thumbnailFigure 3 . Joint Bayesian reconstructions of RUNX2 QA ratios and relative nose length in 41 placental mammals. Reconstructions for RUNX2 QA ratios are given on the left (A), while reconstructions for relative nose length (Nose/CBL ratio) are on the right (B). Dark and light shaded disks at nodes indicate 95% credibility intervals of inferred ancestral ratios.

Table 2. Results of phylogenetic generalized least squares models examining the potential relationship between RUNX2 QA ratios and relative face-length and nose-length in mammals

Table 3. Results of Bayesian comparative analyses as implemented in the CoEvol program [[28]]

Results

RUNX2 tandem repeat ratios

The translated amino acid sequences for the 41 species are provided in the Supplementary Material ( Additional File 1). The RUNX2 amino acid sequences flanking the tandem repeat region were generally conserved across taxa providing evidence that orthologous rather than paralogous sequences were successfully retrieved. In support of this, all publicly-available mammalian genomes contain a single annotated RUNX2 locus, suggesting that this gene has 1:1 orthology across mammals and has not been subject to duplication. QA ratios calculated from the translated sequences range from 0.82 (Choloepus didactylus, Xenarthra) to 5.33 (Equus caballus, Perissodactyla, but see below) (Table 1). With the exception of Equus, these QA ratios were generally lower than those of carnivorans [13]. In their analysis of Carnivora, Sears et al. [13] report a lowest value of 1.5 for the QA ratio in Potos flavus, which is close to the range of domesticated dog breeds (approximately 1.2 – 1.45; based on Figure 2B in [11]). Glutamine and alanine amino acid counts for non-carnivoran mammals (glutamines: 14-29, alanines: 13-18) were not markedly different than those of domestic dog breeds (glutamines: 18-20; alanines: 12-17), although carnivorans are on the low end of alanine counts (Table 1). Across the mammals included in our dataset, the majority of the QA ratios lie between 1 and 2 (34 out of 41 species analyzed). Xenarthrans had the lowest variation in QA ratio across sampled species (variance = 0.067) and laurasiatherians had the highest (variance = 1.60). We make such taxonomic comparisons tentatively, as species within clades are not equally distant to each other and the species sampled here represent only a subset of the extant diversity for each clade.

Additional file 1 . Amino acid alignment of RUNX2 showing the QA region with some flanking sequence. An asterisk indicates that this sequence was identical in more than one individual.

Format: PDF Size: 148KB Download file

This file can be viewed with: Adobe Acrobat ReaderOpen Data

Across the 41 mammals tested, both the polyQ and polyA are variable, but the polyQ has a greater range of values than the polyA repeat (polyQ ranges from 10-31 repeats, polyA ranges from 3-19 repeats) ( Additional File 1). Also, intraspecific variation was detected in two (Myrmecophaga tridactyla, the giant anteater, and Tamandua tetradactyla, the southern tamandua) of the four species where multiple individuals were sequenced (Table 1).

Although RUNX2 is generally conserved across mammals, three of the RUNX2 sequences obtained via genome browsers (three-lined squirrel, bottlenose dolphin and horse) contain distinctive amino acid changes ( Additional File 1). We cannot conclusively exclude the possibility of errors in the online genome sequence annotations, and this is of particular concern for the horse where there is extensive divergence in the 3’ flanking sequence. The unusually high QA ratio of 5.33 for Equus caballus is mostly due to the highly shortened poly A sequence (3 repeats compared to 7-17 repeats for other laurasiatherians), and, thus, this result should be viewed with caution. However, there is no a priori reason to suspect sequence or annotation errors for the three-lined squirrel or bottlenose dolphin; in these cases the sequences flanking the unusual amino acids are highly conserved, as are other mammalian sequences. We also note that the flanking sequences for some of the novel species (Dasypus, Hemicentetes) are short, but in these cases we can confirm the sequence from original trace files. The removal of flanking sequence was done when an ambiguous base pair was present in the sequence trace file. Sequences were only included in both the alignment and further analysis if the QA region of the sequence contained no ambiguous trace file calls; thus the QA ratio is likely to be correct in these cases, even though the flanking sequence was partially truncated.

The polyA repeat was often interrupted by a single valine ( Additional File 1). In fact, the presence of an interrupting amino acid has an interesting phylogenetic distribution (and intraspecific variation in Myrmecophaga), as most afrotherians and xenarthrans exhibit an interrupted polyA sequence but laurasiatherians or euarchontoglires do not. Regarding the polyQ repeat, only Microcebus murinus, the grey mouse lemur, has an interrupted glutamine sequence (by histidine; Table 1). In further analyses (see below), we considered both the total QA ratio (ignoring any interrupting amino acids) and the QA ratio for uninterrupted sequences only. For example, we scored the total QA ratio for the mouse lemur as 1.25 (20Q : 16A) in the initial analysis, but we also ran the analyses with a mouse lemur QA ratio of 0.38 (6Q : 16A) (Table 1). Whether we included total QA ratios or uninterrupted QA ratios, the direction and significance of the correlation results did not change (Table 2).

Since the goal of this study is to examine whether the pattern Sears et al. [13] describe for carnivorans holds for mammals in general, we took morphological measurements comparable to those described in Sears et al. [13] (Figure 1). After adjusting measurements relative to body size by dividing by proxy measures of overall size (following [13]; CBL, condylobasal length; and Oc-Pat, distance from lateral-most point of occipital condyle to caudal margin of palate; Figure 1), Dasypus (armadillo) and Myrmecophaga (giant anteater) had the longest ‘nose’ (measured from anteroventral margin of orbit at lacrimal foramen to anterolateral margin of nasal bone). These taxa, along with Sus (domestic pig) and Tursiops (dolphin) had the longest ‘face’ (measured from anteroventral margin of orbit at lacrimal foramen to the anterior premaxilla). At the other extreme, Callithrix (marmoset), Bradypus (sloth) and Mustela (weasel) showed the shortest face; CallithrixPongo (orangutan) and Nomascus (gibbon) had the shortest nose.

Those species with the longest and shortest rostra described above were not those with the largest and smallest RUNX2 QA ratios, especially in xenarthrans and afrotherians (Figure 2). Also, we did not recover a significant correlation between QA ratio and indices of rostrum size in non-carnivorans. None of our phylogenetic generalized linear models showed a significant relationship between QA ratios and relative face and nose lengths (Table 2). This was true regardless of our method of accounting for body size. In fact, closely related pairs of species that differ substantially in relative rostrum size (e.g., the anteaters Myrmecophaga vs. Tamandua; armadillos Priodontes vs. Chlamyphorus; and tenrecs Hemicentetes vs. Microgale) did not show correspondingly different QA ratios (Figure 2). For example, the extraordinarily long-faced Myrmecophaga had a slightly lower QA ratio (1.24) than the short-faced Chlamyphorus (1.53; see Figure 2).

Similar results were found for primates. The hamadryas baboon (Papio hamadryas) and the rhesus macaque (Macaca mulatta) have identical QA ratios (1.41) but the baboons have markedly longer faces (e.g. Face/Oc-Pat for baboon: 1.57, for macaque: 0.99). As is the case for the armadillo, anteater, and tenrec examples mentioned above (Figure 2), gibbons have a QA ratio similar to that of orangutans (1.35) and this also matches the ancestral reconstruction of the ratio at the great ape node. However, gibbons have much shorter faces than the great apes (e.g. Face/Oc-Pat for gibbon: 0.58, for orangutans: 0.89).

Thus, in contrast to the trend apparent among carnivorans in which long-faced species possess high QA ratios [13], this correlation does not seem to hold across placental mammals in general, and particularly not among xenarthrans or afrotherians (Table 2, Figure 2).

In order to characterize these observations further, we used the newly developed Bayesian comparative method of Lartillot and Poujol [28] (see Methods) to study the correlation between RUNX2 QA ratios and relative face/nose lengths (Table 3). As a proof of concept example, the method was first applied to the Sears et al. carnivoran dataset [13]. Using 30 MT-CYTB (mitochondrial cytochrome b) sequences and fossil calibrations to control for phylogeny and divergence times, the Bayesian method retrieved the positive correlation between RUNX2 QA ratio and relative facial length in Carnivora with a posterior probability of 0.95 (Table 3). However, similar analyses of our placental dataset using 41 VWF (von Willebrand factor) nuclear sequences and fossil calibrations to control for phylogeny and divergence times showed no significant correlation between RUNX2 QA ratio and the four relative face/nose size-controlled variables tested (Table 3). These results confirm the results of the PGLS (phylogenetic generalized least squares) analyses in demonstrating statistical independence between RUNX2 QA ratio and facial length at the global level of placentals.

Moreover, the Bayesian method also allowed the joint ancestral reconstruction of both QA and facial length ratios under the Brownian assumption (Figure 3). Figure 3a illustrates the relative homogeneity of the RUNX2 QA ratio in non-carnivoran placentals. The 95% credibility interval for the ancestral placental QA ratio is 1.14-1.80, whereas the ancestral carnivoran QA ratio is estimated at a larger value (1.57-2.63). Xenarthrans (1.01-1.65), afrotherians (1.00-1.80), and primates (1.05-1.71) showed remarkably comparable ancestral values. Also the relatively modest value (1.24-1.94) reconstructed for the Laurasiatheria ancestral node suggests that the RUNX2 QA ratio has increased in Carnivora possibly as a response to selection for increasing facial length in this clade. The Bayesian joint reconstruction of relative facial length shows a much greater variability among placentals than the QA ratio (Figure 3b). A larger heterogeneity is observed in both xenarthrans and afrotherians with relatively large ancestral facial ratios being inferred for each group. This contrasts with the homogeneity of their RUNX2 QA ratios (Figure 3a). Also, as previously noted, primates are characterized by relatively reduced facial length ratios compared to the other placental groups. This suggests that the relative facial length has been reduced in the ancestral branch leading to living primates. Overall, the comparison between the two panels of Figure 3 highlights the general lack of correlation between the RUNX2 QA ratio and relative facial length in placental mammals.

Discussion

Our study provides the first example of using a fully integrative Bayesian method for testing a correlation between genetic and morphometric data while controlling for phylogeny, evolutionary rates, and divergence times. Our results show that the RUNX2 QA ratio is not correlated with facial length across non-carnivoran placental mammals. The RUNX2 QA tandem repeat may yet play an important role in influencing variation in face length in these mammals. However, striking changes in prognathism have occurred along several mammalian lineages without associated changes in QA ratio. Variation in QA ratio has similarly occurred without corresponding changes to facial length phenotype (Figure 2).

Another interesting result is the unusually low QA ratio for both Choloepus species (Linnaeus’s and Hoffman’s two-toed sloth), the only placental mammals in our sample with a QA ratio below 1 (i.e. polyA is longer than polyQ). However, this cannot be the sole factor determining the short face length of sloths as Bradypus tridactylus (pale-throated three-toed sloth) had a QA ratio above 1 (1.23). It should be noted, however, that the two-toed and three-toed sloths are not particularly closely related. Genetic evidence suggests that Bradypus and Choloepus evolved an arboreal lifestyle independently [29] and separated more than 20 million years ago [24]. If face length also shortened independently, then it may be that selection has acted differently upon RUNX2 in the two-toed than in the three-toed sloth, explaining the higher QA ratio in Bradypus.

Some work suggests that the relationship between tandem repeats and anatomical form might not be linear [11,30]. However, with our preliminary data set of 1-10 data points per order, fitting a quadratic or exponential line seems arbitrary [31] and does not markedly improve fit. Although the nature of this study is a broad comparison across 12 orders of mammals, future work focusing on deep sampling within individual orders would be worthwhile.

In addition, Sears et al. [13] noted a weaker QA ratio in carnivorans without positively allometric growth of the face (e.g. felids). A larger sample within orders, particularly for primates [32], that accounts for variation in the degree of positive allometry across species (e.g. baboon vs. mouse lemur) may similarly reveal a stronger correlation within clades characterized by positively allometric growth of the face (e.g. papionines).

Another potential problem is the presence of intraspecific variation [32]. Although only likely to produce slight variation of the QA ratio for each species, across a phylogeny it will add to the noise within the independent contrasts analysis and reduce the signal of any potential association. Notably, examination of the RUNX2 sequence data from the 1000 genomes project [33] indicates virtually no intraspecific variation in QA ratio among humans. This may be due to the fact that humans have undergone a recent population bottleneck and generally show relatively low levels of intraspecific variation compared to other mammals [34]. Future studies that better sample the intraspecific diversity of RUNX2 will be able to test the extent to which the low diversity seen among humans applies across mammals.

This work represents a first step in examining the role of RUNX2 in the evolution of facial length across placental mammals. The obvious next step is to examine possible associations between RUNX2 tandem repeat ratios and prognathism within groups, especially those that show marked variation in facial length, such as the papionines [32,35,36]. Similarly, it would be interesting to examine variation at the other two functional domains of RUNX2 [15] or in the cis-regulatory regions of the RUNX2 gene across mammals. Moreover, exciting new linkage and heritability studies of craniofacial variation in populations for which we have genetic maps [37,38] will undoubtedly yield more detailed insights on the genetics of mammalian facial morphology, including new candidate loci for cross-taxa comparisons.

Conclusion

This study evaluates a simple molecular polymorphism (variation at the bone-growth gene RUNX2) that is thought to underlie functionally-important anatomical variation (relative face length). Although previous studies [11-13] demonstrated a clear, and likely causative, correlation between the QA-ratio of RUNX2 and face length in carnivorans, our analyses, controlling for phylogeny, evolutionary rates, and divergence times, show that the RUNX2 QA ratio is not generally correlated with facial length across placental mammals.

Methods

DNA sequence data

RUNX2 DNA sequence data were obtained using both online genomic resources (24 species), and DNA isolation, amplification and sequencing (17 species). We retrieved DNA sequences corresponding to the poly-glutamine (polyQ) and poly-alanine (polyA) tandem repeat and flanking regions of RUNX2 from Ensembl (releases 63 and 64; 17 species) and Genbank via BLASTN [39] (7 species) (Table 1). Notably, the QA ratio could not be reconstructed from the fragmented reads of the Neanderthal genome [40].

For the 17 species sequenced de novo in this study (Table 1), samples came from live animals and museum specimens, and sampling complied with institutional animal care and use protocols. Samples from live animals included feces, as well as hair collected during routine handling (e.g. for health checks). Museum specimens included skin, muscle, connective tissue, and bone. All xenarthran samples come from the Collection of Preserved Mammalian Tissues of the Institut des Sciences de l'Evolution, Montpellier [41].

We extracted DNA from approximately 0.02 to 0.05g of each sample tissue using the QIAamp DNA Mini kit (Qiagen) following the manufacturer’s instructions. PCR primer sequences are the same as those used by Sears et al. (2007). PCRs were performed using the Extensor Hi-Fidelity PCR enzyme (Thermo Scientific). An external touchdown PCR using primers Sears Ext F (5’-TTGTGATGCGTATTCCCGTA-3’) or Sears Int F (5’ATCCGAGCACCAGCCGGCGGCGCTTCAG-5’) with Sears Ext R (5’-ACSGAGCACAGGAAGTTGGG-3’) was performed on approximately 100ng of template DNA with the following cycling conditions: 95°C 2mins, 15x (95°C 30s, 61.3 (down 0.5°C per cycle) 30s, 72°C 50s), 20x (95°C 30s, 54.3°C 30s, 72°C 50s), 72°C 5 mins final extension. The product of the external PCR was then diluted to a 1/10 concentration and 0.5μl was added as template for the internal PCR. This nested PCR using primers Sears Int F and Sears Int R (5’-GTGGTCVGCGATGATCTCSAC-3’) was run with the following cycling condition: 94°C 3 mins, 40x (94°C 30s, 62°C 45s, 72°C 45s), 72°C 5 mins final extension. The product of this second reaction was run out on a 1.5% agarose gel containing ethidium bromide. When an amplicon of roughly 300bp in size was visible under UV light, the DNA was extracted, purified (QIAquick PCR purification kit, Qiagen) and sequenced (Big Dye, version 1.3).

The RUNX2 DNA sequences were analyzed and translated using sequence alignment and editing software (Lasergene, DNASTAR, BioEdit, ClustalW) and the QA ratios were calculated from the translated sequences. Sequences of sufficient length (> 100bp) have been deposited in GenBank (accession numbers: JQ405327 - JQ405335).

Morphological measurements

We quantified facial length for the 41 placental mammal species listed above by measuring adult crania from museum collections in Cambridge (UMZC, University Museum of Zoology, Cambridge), Berlin (ZMB, Zoologisches Museum Berlin), New Haven (YPM, Yale Peabody Museum), Washington DC (USNM, Smithsonian Institution, National Museum of Natural History) and New York (AMNH, American Museum of Natural History). Specimen numbers for each species are listed in the Supplementary Material ( Additional File 2). We took four measurements of each cranium with digital calipers (Face, Nose, CBL, Oc-Pat; Figure 2) based on the landmarks described by Sears et al. [13]. We measured both total facial length (Face), measured from the anteroventral margin of the orbit at the lacrimal foramen to the anterior margin of the premaxilla, and nasal length (Nose), measured from the anteroventral margin of the orbit at the lacrimal foramen to the anterolateral margin of the nasal bone. We also took two proxy measures of body size: Cranial base length (Oc-Pat), i.e., the distance from lateral-most point of the occipital condyle to the caudal margin of the palate, and condylobasal or total cranial length (CBL), i.e., the distance from the lateral-most point of the occipital condyle to the anterior-most premaxilla, as illustrated in Figure 2. These measures of skull length are positively and significantly correlated with body mass across a wide range of mammals [42,43]. For most species that exhibit sexual dimorphism and for which information on sex was available in museum collections, we measured at least two males and two females. Although the males and females of markedly dimorphic species (e.g. chimpanzee, gorilla, baboon) differed by as much as 20% for the absolute values of nose and face length, the relative measurements within species were similar, especially using total cranial length (CBL) as a proxy for body size (Table 1). Species averages for each measurement are given in the supplementary material ( Additional File 3).

Additional file 2. Source and specimens IDs for skulls measured for each species.

Format: XLS Size: 34KB Download file

This file can be viewed with: Microsoft Excel ViewerOpen Data

Additional file 3. Skull measurements for each species (in cm).

Format: XLSX Size: 47KB Download fileOpen Data

We standardized the two absolute measures of facial length (Face and Nose) by dividing each by body size proxies: cranial base (Oc-Pat) and condylobasal length (CBL). This gave us two relative values for each measurement.

Face length measures for humans seem longer than expected because on the human cranium the occipital condyles are closer to the palate, as the foramen magnum points ventrally. Therefore, ratios with CBL in the denominator may appear apelike due to the proximity of foramen magnum-palate, not due to a long face.

Comparative analyses

Interspecific datasets cannot be analyzed using traditional statistical methods because of sample non-independence due to shared evolutionary history [44]. Phylogenetically independent contrasts [45] have been commonly used in comparative studies, yet this approach assumes that trait variation perfectly follows a Brownian motion pattern across the phylogenetic tree (i.e. traits exhibit strong phylogenetic signal).

A more recent approach, phylogenetic generalized least squares (PGLS) [46], is more powerful [47]. This method includes an extra parameter, lambda, which varies continuously from zero to one and is optimized by maximum likelihood. A lambda value of zero indicates that phylogeny has no importance in the model, and is in fact identical to a model that does not account for the evolutionary history of species. A value of one indicates that the model’s error structure follows Brownian motion perfectly. In practice, many models exhibit a lambda value somewhere between zero and one. This and related methods are becoming increasingly popular in comparative biology with the greater usability of advanced statistical software [48-50].

We conducted PGLS analyses using the caper package for R [51,52]. Before analysis we natural log transformed the variables to better meet the assumptions of parametric statistics [53]. We constructed a phylogeny, including branch lengths as represented by estimated divergence times, from a number of sources [26,27,54-56]. These studies did not use the same data to calibrate their trees. Therefore, we used the published divergence times for the terminal taxa but had to adjust the divergence times for some of the deeper branches. This was necessary because the distance between each terminal taxon and the root of the tree needs to be consistent (i.e. the tree should be ultrametric). In addition, we examined how sensitive our results were to our choice of phylogeny by conducting a second set of analyses using the mammal supertree presented in Bininda-Emonds [57,58]. These models produced qualitatively similar results to our initial analyses. We examined our results for outliers by visually inspecting Q-Q plots and fitted vs. observed value plots. We also defined outliers as data points that exhibited standardized phylogenetic residuals greater than three or less than negative three [53].

We also used a recently developed Bayesian comparative method implemented in the CoEvol program [28]. This integrative approach uses a Bayesian framework that allows joint reconstruction of variations in molecular evolutionary rates, divergence times, and continuous variables. These parameters are modeled as a multivariate Brownian diffusion process along the branches of the phylogenetic tree while taking their covariance into account. We applied this method to evaluate the correlation between the RUNX2 QA ratio and four size-controlled face/nose length variables while controlling for phylogeny, evolutionary rates and divergence times.

We first applied this Bayesian method to the Sears et al. dataset [13]. Since this is the only marker for which the 30 species measured by Sears et al. [13] have been sequenced, we used the mitochondrial MT-CYTB gene to control for phylogeny along with the topology and the eight fossil calibrations proposed by Eizirik et al. [59]. We then applied the same methodology to our placental dataset. In this case however, we used VWF exon 28 nuclear sequences to control for phylogeny, because it is the only gene currently available for the 41 species of our sample. We also used the 20 fossil calibrations proposed by Benton et al. [60] that were compatible with our taxon sampling. In both cases, CoEvol was run for 30,000 MCMC cycles sampling parameters every 10 cycles. The first 500 samples were discarded as the burnin and posterior averages were estimated on the remaining 2500 points. The CoEvol datasets have been deposited in the Dryad Repository [61].

Abbreviations

RUNX2, Runt-related transcription factor 2; QA ratio, ratio of glutamines to alanines; CBL, distance from lateral-most point of occipital condyle to anterior premaxilla; Oc-Pat, distance from lateral-most point of occipital condyle to caudal margin of palate; NOSE, anteroventral margin of orbit at lacrimal foramen to anterolateral margin of nasal bone; FACE, anteroventral margin of orbit at lacrimal foramen to anterior premaxilla; MT-CYTB, mitochondrial cytochrome b; VWF, von Willebrand factor.

Competing interests

The authors have no competing interests.

Author contributions

Conceived and designed the project: MAP, BJB, RJA, NIM. Collected and analyzed data: MAP, BJB, JMK, VW, SGBC, FD, RJA. Wrote the paper: MAP, BJB, RJA, JMK, FD. Edited the manuscript: all authors. All authors read and approved the final manuscript.

Acknowledgments

We are grateful to Stephen Montgomery, Rich Lawler, Dieter Lukas and Mary Silcox for helpful discussions and comments on earlier versions of this manuscript. We also thank the University Museum of Zoology Cambridge, and Mercedes Okumura and the Leverhulme Centre for Human Evolutionary Studies for access to specimens. We are grateful to Linda Gordon at the US Natural History Museum, Ross MacPhee and Nicole Edminson at the American Museum of Natural History, and Kristof Zyskowski at the Yale Peabody Museum for collection access and assistance. We also thank François Catzeflis (Mammalian Tissue Collection of the Institut des Sciences de l'Evolution de Montpellier), Philippe Cerdan, Adrian Fowler, Eric Hansen, Guillermo Lemus, Alison Limentani, Jim Loughry, Jean-François Mauffrey, Jesus Mavarez, Colleen McDonough, Chris Raxworthy, Wolfram Rietschel, Mariella Superina, Richard Trueman, John Trupkiewicz, Jean-Christophe Vié, and Amanda Yeomans for access to tissue samples. We thank the AnthroTree Workshop and Natalie Cooper for assistance with the R-script. We are grateful to The Leverhulme Trust (NIM, RJA), Natural Environment Research Council, UK (NIM, BJB), the Scientific Council of the Université Montpellier 2 (FD), the Centre National de la Recherche Scientifique (FD), the Yale Department of Anthropology (BJB), and Yale University (BJB, SGBC, JMK) for funding. This is contribution ISEM 2012-030 of the Institut des Sciences de l’Evolution de Montpellier.

References

  1. Nadeau N, Jiggins C: Golden age for evolutionary genetics? Genomic studies of adaptation in natural populations.

    Trends Genet 2011, 26:484-492. OpenURL

  2. Grenier CS, Scott JW: From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design. 2nd edition. Wiley-Blackwell, Oxford; 2004. OpenURL

  3. Hoekstra HE: Genetics, development and evolution of adaptive pigmentation in vertebrates.

    Heredity 2006, 97:222-234. PubMed Abstract | Publisher Full Text OpenURL

  4. Wagner GP, Lynch VJ: The gene regulatory logic of transcription factor evolution.

    Trends Ecol Evol 2008, 23:377-385. PubMed Abstract | Publisher Full Text OpenURL

  5. Kashi Y, King DG: Simple sequence repeats as advantageous mutators in evolution.

    Trends Genet 2006, 22:253-259. PubMed Abstract | Publisher Full Text OpenURL

  6. Haerty W, Golding GB: Genome-wide evidence for selection acting on single amino acid repeats.

    Genome 2010, Research 20:755-760. OpenURL

  7. Wren JD, Forgacs E, Fondon JW, Pertsemlidis A, Cheng SY, et al.: Repeat polymorphisms within gene regions: Phenotypic and evolutionary implications.

    Am J Hum Genet 2000, 67:345-356. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  8. Caburet S, Cocquet J, Vaiman D, Veitia RA: Coding repeats and evolutionary "agility".

    Bioessays 2005, 27:581-587. PubMed Abstract | Publisher Full Text OpenURL

  9. Karlin S, Burge C: Trinucleotide repeats and long homopeptides in genes and roteins associated with nervous system disease and development.

    Proc Natl Acad Sci USA 1996, 93:1560-1565. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  10. Alba MM, Guigo R: Comparative analysis of amino acid repeats in rodents and humans.

    Genome Res 2004, 14:549-554. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  11. Fondon JW, Garner HR: Molecular origins of rapid and continuous morphological evolution.

    Proc Natl Acad Sci USA 2004, 101:18058-18063. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  12. Fondon JW, Garner HR: Detection of length-dependent effects of tandem repeat alleles by 3-D geometric decomposition of craniofacial variation.

    Dev Genes Evol 2007, 217:79-85. PubMed Abstract | Publisher Full Text OpenURL

  13. Sears KE, Goswami A, Flynn JJ, Niswander LA: The correlated evolution of Runx2 tandem repeats, transcriptional activity, and facial length in Carnivora.

    Evol Dev 2007, 9:555-565. PubMed Abstract | Publisher Full Text OpenURL

  14. Otto F, Thornell AP, Crompton T, Denzel A, Gilmour KC, et al.: Cbfa1, a candidate gene for cleidocranial dysplasia syndrome, is essential for osteoblast differentiation and bone development.

    Cell 1997, 89:765-771. PubMed Abstract | Publisher Full Text OpenURL

  15. Ziros PG, Basdra EK, Papavassiliou AG: Runx2: of bone and stretch.

    Int J Biochem Cell Biol 2008, 40:1659-1663. PubMed Abstract | Publisher Full Text OpenURL

  16. Thirunavukkarasu K, Mahajan M, McLarren KW, Stifani S, Karsenty G: Two domains unique to osteoblast-specific transcription factor Osf2/Cbfa1 contribute to its transactivation function and its inability to heterodimerize with Cbf beta.

    Mol Cell Biol 1998, 18:4197-4208. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  17. Zhou G, Chen Y, Zhou L, Thirunavukkarasu K, Hecht J, et al.: CBFA1 mutation analysis and functional correlation with phenotypic variability in cleidocranial dysplasia.

    Hum Mol Genet 1999, 8:2311-2316. PubMed Abstract | Publisher Full Text OpenURL

  18. Mundlos S: Cleidocranial dysplasia: clinical and molecular genetics.

    J Med Genet 1999, 36:177-182. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  19. Asher RJ, Lehmann T: Dental eruption in afrotherian mammals.

    BMC Biol 2008, 6:14. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  20. Komori T: Regulation of bone development and maintenance by Runx2.

    Front Biosci 2008, 13:898-903. PubMed Abstract | Publisher Full Text OpenURL

  21. Green R, Krause J, Briggs A, Maricic T, Stenzel U, et al.: A Draft Sequence of the Neandertal Genome.

    Science 2010, 710-722. OpenURL

  22. King DG, Soller M, Kashi Y: Evolutionary tuning knobs.

    Endeavour 1997, 21:36-40. Publisher Full Text OpenURL

  23. Nowak R: Walker's Mammals of the World. 6th edition. Johns Hopkins Univ Press, Baltimore; 1999. PubMed Abstract OpenURL

  24. Delsuc F, Vizcaino SF, Douzery EJP: Influence of Tertiary paleoenvironmental changes on the diversification of South American mammals: a relaxed molecular clock study within xenarthrans.

    BMC Evol Biol 2004, 4. OpenURL

  25. Möller-Krull M, Delsuc F, Churakov G, et al.: Retroposed elements and their flanking regions resolve the evolutionary history of xenarthran mammals (armadillos, anteaters, and sloths).

    Mol Biol Evol 2007, 24:2573-2582. PubMed Abstract | Publisher Full Text OpenURL

  26. Asher RJ, Maree S, Bronner G, Bennett NC, Bloomer P, et al.: A phylogenetic estimate for golden moles (Mammalia, Afrotheria, Chrysochloridae).

    BMC Evol Biol 2010, 10:69. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  27. Asher RJ, Bennett N, Lehmann T: The new framework for understanding placental mammal evolution.

    Bioessays 2009, 31:853-864. PubMed Abstract | Publisher Full Text OpenURL

  28. Lartillot N, Poujol R: A phylogenetic model for investigating correlated evolution of substitution rates and continuous phenotypic characters.

    Mol Biol Evol 2011, 28:729-744. PubMed Abstract | Publisher Full Text OpenURL

  29. Greenwood AD, Castresana J, Feldmaier-Fuchs G, Paabo S: A molecular phylogeny of two extinct sloths.

    Mol Phylogenet Evol 2001, 18:94-103. PubMed Abstract | Publisher Full Text OpenURL

  30. Galant R, Carroll SB: Evolution of a transcriptional repression domain in an insect Hox protein.

    Nature 2002, 415:910-913. PubMed Abstract | Publisher Full Text OpenURL

  31. Quader S, Isvaran K, Hale RE, Miner BG, Seavy NE: Nonlinear relationships and phylogenetically independent contrasts.

    J Evol Biol 2004, 17:709-715. PubMed Abstract | Publisher Full Text OpenURL

  32. Banovich NE, Ritzman TB, Stone AC: The Runx2 gene is an important determinant of facial morphology in primates.

    Am J Phys Anthropol 2011, 144:81-81. OpenURL

  33. 1000 Genomes Browserwww.1000genomes.org webcite

  34. Kaessmann H, Wiebe V, Weiss G, Pääbo S: Great ape DNA sequences reveal a reduced diversity and an expansion in humans.

    Nat Genet 2001, 27:155-156. PubMed Abstract | Publisher Full Text OpenURL

  35. Singleton M: Patterns of cranial shape variation in the Papionini (Primates : Cercopithecinae).

    J Hum Evol 2002, 42:547-578. PubMed Abstract | Publisher Full Text OpenURL

  36. Fleagle JG, Gilbert CC, Baden AL: Primate Cranial Diversity.

    Am J Phys Anthropol 2010, 142:565-578. PubMed Abstract | Publisher Full Text OpenURL

  37. Richtsmeier JT, Weiss KM, Buchanan A, Walker A, Jablonski N, et al.: Developmental genetic basis of primate craniofacial variation and human origins.

    Am J Phys Anthropol 2007, 140:198. OpenURL

  38. Willmore RCKE, Rogers J, Richtsmeier JT, Cheverud JM: Genetic variation in baboon craniofacial sexual dimorphism.

    Evolution 2009, 63:799-806. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  39. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool.

    J Molecular Biology 1990, 215:403-410. OpenURL

  40. Neanderthal Genome Browserhttp://genome.ucsc.edu/Neandertal/ webcite

  41. Catzeflis FM: Animal tissue collections for molecular genetics and Systematics.

    Trends Ecolology and Evolution 1991, 6:168. Publisher Full Text OpenURL

  42. Janis CM: Correlation of cranial and dentail variables with body size in ungulates and macropodoids. In Body size in mammalian paleobiology: estimation and biological implications. Edited by Damuthand J, MacFadden BJ. Cambridge University Press, New York; 1990:255-298. OpenURL

  43. Fitch WT: Skull dimensions in relation to body size in nonhuman primates: the causal bases for acoustic allometry.

    Zoology 2000, 103:40-58. OpenURL

  44. Freckleton RP: The seven deadly sins of comparative analysis.

    J Evol Biol 2009, 22:1367-1375. PubMed Abstract | Publisher Full Text OpenURL

  45. Felsenstein J: Phylogenies and the comparative method.

    Am Nat 1985, 125:1-15. Publisher Full Text OpenURL

  46. Pagel M: Inferring the historical patterns of biological evolution.

    Nature 1999, 401:877-884. PubMed Abstract | Publisher Full Text OpenURL

  47. Rohlf FJ: Comparative methods for the analysis of continuous variables: Geometric interpretations.

    Evolution 2001, 55:2143-2160. PubMed Abstract OpenURL

  48. Cooper N, Freckleton RP, Jetz W: Phylogenetic conservatism of environmental niches in mammals.

    Proceedings of the Royal Society B-Biological Sciences 2011, 278:2384-2391. Publisher Full Text OpenURL

  49. Kamilar JM, Bradley BJ: Interspecific variation in primate coat color supports Gloger’s rule.

    J Biogeogr 2011, 38:2270-2277. Publisher Full Text OpenURL

  50. Heesy CP, Kamilar JM, Willms J: Retinogeniculostriate pathway components scale with orbit convergence only in primates and not in other mammals.

    Brain Behav Evol 2011, 77:105-115. PubMed Abstract | Publisher Full Text OpenURL

  51. Orme CDL, Freckleton RP, Thomas GH, Petzoldt T, Fritz SA, Isaac NJB:

    Comparative Analyses of Phylogenetics and Evolution in R. 2011.

    R package version 0.4

    OpenURL

  52. R Development Core Team: R: A Language And Environment For Statistical Computing. R Foundation for Statistical Computing, Vienna; 2007. OpenURL

  53. Quinn G, Keough M: Experimental Design and Data Analysis for Biologists. Cambridge University Press, Cambridge; 2002. OpenURL

  54. Arnold C, Matthews LJ, Nunn CL: The 10k Trees Website: A New Online Resource for Primate Phylogeny.

    Evolutionary Anthropology 2010, 19:114-118. Publisher Full Text OpenURL

  55. Hallstrom BM, Janke A: Resolution among major placental mammal interordinal relationships with genome data imply that speciation influenced their earliest radiations.

    BMC Evol Biol 2008, 8:162. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  56. Poux C, Madsen O, Glos J, de Jong WW, Vences M: Molecular phylogeny and divergence times of Malagasy tenrecs: Influence of data partitioning and taxon sampling on dating analyses.

    BMC Evol Biol 2008, 8:102. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  57. Bininda-Emonds ORP, Cardillo M, Jones KE, MacPhee RDE, Back RMD, et al.: The delayed rise of present-day mammals.

    Nature 2007, 446:507-512. PubMed Abstract | Publisher Full Text OpenURL

  58. Bininda-Emonds ORP, Cardillo M, Jones KE, MacPhee RDE, Beck RMD, et al.: The delayed rise of present-day mammals (vol 446, pg 507, 2007).

    Nature 2008, 456:274-274. OpenURL

  59. Eizirik E, Murphy WJ, Koepfli KP, Johnson WE, Dragoo JW, et al.: Pattern and timing of diversification of the mammalian order Carnivora inferred from multiple nuclear gene sequences.

    Mol Phylogenet Evol 2010, 56:49-63. PubMed Abstract | Publisher Full Text OpenURL

  60. Benton MJ, Donoghue PCJ, Asher RJ: Calibrating and constraining molecular clocks. In Timetree of Life. Edited by Hedges KSSB. Oxford University Press; 2009:35-86. OpenURL

  61. Dryad Repositoryhttp://dx.doi.org/10.5061/dryad.fr84hd25 webcite