<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2148-11-148</ui><ji>1471-2148</ji><fm>
<dochead>Research article</dochead>
<bibl>
<title>
<p>The evolutionary history of the SAL1 gene family in eutherian mammals</p>
</title>
<aug>
<au id="A1"><snm>Meslin</snm><fnm>Camille</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I3"/><insr iid="I4"/><email>camille.meslin@tours.inra.fr</email></au>
<au id="A2"><snm>Brimau</snm><fnm>Fanny</fnm><insr iid="I5"/><email>fanny.brimau@univ-lille1.fr</email></au>
<au id="A3"><snm>Meillour</snm><mnm>Nagnan-Le</mnm><fnm>Patricia</fnm><insr iid="I5"/><email>patricia.le-meillour@univ-lille1.fr</email></au>
<au id="A4"><snm>Callebaut</snm><fnm>Isabelle</fnm><insr iid="I6"/><email>isabelle.callebaut@impmc.upmc.fr</email></au>
<au id="A5"><snm>Pascal</snm><fnm>G&#233;raldine</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I3"/><insr iid="I4"/><email>geraldine.pascal@tours.inra.fr</email></au>
<au ca="yes" id="A6"><snm>Monget</snm><fnm>Philippe</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I3"/><insr iid="I4"/><email>philippe.monget@tours.inra.fr</email></au>
</aug>
<insg>
<ins id="I1"><p>UMR85 Physiologie de la Reproduction et des Comportements, INRA, Nouzilly, F-37380, France</p></ins>
<ins id="I2"><p>UMR6175, CNRS, Nouzilly, F-37380, France</p></ins>
<ins id="I3"><p>Universit&#233; Fran&#231;ois Rabelais de Tours, Tours, F-37041, France</p></ins>
<ins id="I4"><p>Haras Nationaux, Nouzilly, F-37380, France</p></ins>
<ins id="I5"><p>Unit&#233; de Glycobiologie Structurale et Fonctionnelle, INRA, UMR 8576 CNRS/Universit&#233; Lille1, Villeneuve d'Ascq Cedex, F-59655, France</p></ins>
<ins id="I6"><p>IMPMC, UMR7590, CNRS, Universit&#233; Pierre et Marie Curie, Paris, 75005, France</p></ins>
</insg>
<source>BMC Evolutionary Biology</source>
<issn>1471-2148</issn>
<pubdate>2011</pubdate>
<volume>11</volume>
<issue>1</issue>
<fpage>148</fpage>
<url>http://www.biomedcentral.com/1471-2148/11/148</url>
<xrefbib><pubidlist><pubid idtype="pmpid">21619679</pubid><pubid idtype="doi">10.1186/1471-2148-11-148</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>4</day><month>3</month><year>2011</year></date></rec><acc><date><day>28</day><month>5</month><year>2011</year></date></acc><pub><date><day>28</day><month>5</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Meslin et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>SAL1 (salivary lipocalin) is a member of the OBP (Odorant Binding Protein) family and is involved in chemical sexual communication in pig. SAL1 and its relatives may be involved in pheromone and olfactory receptor binding and in pre-mating behaviour. The evolutionary history and the selective pressures acting on SAL1 and its orthologous genes have not yet been exhaustively described. The aim of the present work was to study the evolution of these genes, to elucidate the role of selective pressures in their evolution and the consequences for their functions.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Here, we present the evolutionary history of the SAL1 gene and its orthologous genes in mammals. We found that (1) SAL1 and its related genes arose in eutherian mammals, with lineage-specific duplications in rodents, horse and cow, and were lost in human, mouse lemur, bushbaby and orangutan, (2) the evolution of duplicated genes of horse, rat, mouse and guinea pig is driven by concerted evolution, with extensive gene conversion events in mouse and guinea pig, and by positive selection mainly acting on paralogous genes in horse and guinea pig, (3) positive selection was detected for amino acids involved in pheromone binding and amino acids putatively involved in olfactory receptor binding, (4) positive selection was also found along lineages, indicating a species-specific strategy for amino acid selection.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>This work provides new insights into the evolutionary history of SAL1 and its orthologs. On the one hand, some genes are subject to concerted evolution and to an increase in dosage, suggesting the need for homogeneity of sequence and function in certain species. On the other hand, positive selection plays a role in the diversification of the functions of the family and along lineages, suggesting adaptive evolution, with possible consequences for speciation and for the reinforcement of prezygotic barriers.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>The barriers that lead to divergence of species during the course of evolution were classified by Dobzhansky into two categories: prezygotic and postzygotic reproductive barriers <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. Postzygotic reproductive barriers concern all the events that occur after fertilization, such as reduced hybrid viability and fertility, while prezygotic reproductive barriers concern isolation of sexual partners via ecological, temporal or behavioral isolation. Pheromones play a key role in pre-mating recognition of sexual partners <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>. These compounds are defined as substances released by an animal that are able to induce specific behavioral and/or endocrinological reactions in a sexual partner of the same species <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. Through these reactions, they could be involved in mate choice and sexual selection.</p>
<p>Odorant binding proteins (OBP) are small soluble proteins that are present in the olfactory apparatus as well as in biological fluids such as saliva, urine or vaginal discharge, and are able to bind pheromones (for review see <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>). OBP are assumed to be directly involved in chemical communication and in the pre-mating recognition process. Three hypotheses are proposed concerning their mechanism of action. The first is that olfactory receptors can recognize the OBP/pheromone complex, not just the pheromone alone. The second hypothesis is that the pheromone can be transferred to olfactory receptors only if assisted by the OBP. The third hypothesis is that the ligand can spontaneously dissociate from the complex with OBP and bind to the receptor as a "free pheromone" <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>.</p>
<p>The role of saliva in chemical communication between males and females is well established in pig <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>, as is the role of urine in mouse <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>. In pig, saliva contains the pheromonal steroids 5&#945;-androst-16-en-3-one and 5&#945;-androst-16-en-3&#945;-ol, as well as abundant quantities of salivary lipocalin (SAL1), the most abundant OBP isolated from submaxillary glands of mature males. When extracted from its source, this protein is associated with both pheromonal steroids <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>, and appears to play a key role in the standing reflex in the sow <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp> and also in the boar's libido <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. SAL1 is also expressed in the nasal and vomeronasal area, but devoid of ligand <abbrgrp>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
</abbrgrp>. SAL1 exhibits the classical lipocalin structure, characterized by a fully conserved N-terminal -G-X-W- motif and the typical folding pattern of a nine-stranded antiparallel &#946;-barrel forming an internal ligand binding site for small hydrophobic molecules <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp>, despite relatively low sequence similarity <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. SAL1 also possesses a glycosylation site on Asn53. Two natural variants have been identified in which three residues differ (Val61, Ile64 and Ala89 of isoform A are respectively Ala, Val and Val in isoform B). Two residues (Val61 and Ala89) are located inside the &#946;-barrel while the third residue (Ile64) is located next to the &#946;-barrel, suggesting that these minor structural differences lead to ligand binding specificities <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>.</p>
<p>Olfactory receptors are located on the olfactory sensory neurons of the main olfactory system in mammals and on the vomeronasal organ in rodents and other non-primate species <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>. Several authors have examined the evolution of olfactory receptors, but few studies of lipocalins and OBP have been performed. Ganfornina et al. <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> undertook a phylogenetic analysis of prokaryotic and eukaryotic lipocalins and showed that this family appeared early and is composed of 13 monophyletic clades. These authors also showed that ancestral lipocalin clades in the phylogenetic tree are able to bind large ligands while more recent lipocalin clades, such as clades composed of OBP and MUP (Major Urinary Protein), bind smaller ligands. They also found that later clades had higher rates of amino acid substitution, more flexible protein structures and greater ligand-binding efficiency than more ancestral lipocalins.</p>
<p>Logan et al. <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp> undertook an extensive study of the <it>Mup </it>cluster in the mouse genome. They identified 21 <it>Mup </it>genes and 21 <it>Mup </it>pseudogenes on chromosome 4. They also identified <it>Mup </it>gene expansion in rat (9 genes and 13 pseudogenes), in horse (3 genes) and in mouse lemur (2 genes and 1 pseudogene) in the same syntenic region. Orangutan, chimpanzee, dog, pig (with SAL1), bushbaby and rhesus monkey have only one <it>Mup </it>gene in the syntenic region. The inferred phylogeny, the accumulation of synonymous substitutions, and the genomic organization of the <it>Mup </it>loci suggest that gene expansion occurred independently in several species <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>.</p>
<p>In the light of previous analyses, the aim of the present work was to study the evolution of SAL1, which is involved in pre-mating recognition in pig. We wanted to determine whether selective pressures act on these proteins and whether positive selection may play a role in binding specificity toward ligands or olfactory receptors.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<sec>
<st>
<p>Identification of SAL1 homologous genes, genomic localization and phylogenetic study</p>
</st>
<p>We found sequences similar to pig SAL1 in 12 other mammalian species: cow, horse, dog, guinea pig, rat, mouse, rabbit, macaque, chimpanzee, gorilla, marmoset and elephant (Figure <figr fid="F1">1</figr> and table S1 in additional file <supplr sid="S1">1</supplr>). The sequences identified were located at the same syntenic locus between the neighboring genes SLC46A2 and ZFP37 and form the SAL1 family. Putative pseudogenes were identified in mouse, rat, mouse lemur, bushbaby and orangutan (Figure <figr fid="F1">1</figr> and figure S1 in additional file <supplr sid="S1">1</supplr>). The exact number of genes we identified in mouse, rat, chimpanzee, guinea pig, cow, mouse lemur, orangutan and bushbaby differed from that found by Logan et al. <abbrgrp>
<abbr bid="B17">17</abbr>
</abbrgrp>, who did not identify several of these genes in a previous study, probably because of lower-quality genome annotation. Mouse and rat genes form two large clusters of duplicates on chromosomes 4 and 5, respectively, composed of 21 genes and 21 pseudogenes in mouse, and 10 genes and 11 pseudogenes in rat. Duplicates are also present in cow (2 genes), horse (5 genes) and guinea pig (5 genes). In each species, the genes form a cluster, suggesting cis-duplication events after speciation. Pig, dog, rhesus monkey and chimpanzee possess a single gene similar to SAL1 in the same syntenic region. In human, the gene is a known pseudogene <abbrgrp>
<abbr bid="B18">18</abbr>
</abbrgrp> due to a G-to-A nucleotide substitution at the donor site of the second intron, which disrupts the ORF of the coding sequence. This substitution was not found in chimpanzee or other primates. In the Neandertal Genome <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>, we found the same genomic organization as in human in ENSEMBL <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>, namely the two genes SLC46A2 and ZFP37 surrounding a predicted SAL1 pseudogene. After multiple sequence alignment of this genomic region between chimpanzee, gorilla, Neandertal and human, we found the same substitution in the Neandertal genome, suggesting that this mutation emerged in the common ancestor of Neandertal and human (Figure <figr fid="F2">2</figr>). The SLC46A2/ZFP37 locus is not present in frog (<it>Xenopus tropicalis</it>), birds (<it>Gallus gallus</it>, <it>Taeniopygia guttata</it>), bony fishes (<it>Takifugu rubripes, Tetraodon nigroviridis, Gasterosteus aculeatus, Danio rerio </it>and <it>Oryzias latipes</it>), monotremes (<it>Ornithorhynchus anatinus</it>) or marsupials (<it>Monodelphis domestica</it>), suggesting that this family emerged in eutherian mammals. Orthology and paralogy relationships between the identified genes were inferred from the phylogenetic tree (Figure <figr fid="F3">3</figr>). Monophyletic clades formed by genes belonging to the same species were supported by very high bootstrap values (94.5 to 100%), suggesting that gene duplications occurred independently in mouse, rat, guinea pig, horse, and cow. The relationships between some species were not clear because of low bootstrap values for some nodes (34.3 to 68.1%), even though the rodent and primate clades were supported by high bootstrap values (99.7 and 100%, respectively). The percentage of identity between sequences of this family is highly variable, not only between species but also between paralogs. In mouse, for example, the Mup11 and Mup18 amino acid sequences are strictly identical. In rat, paralogs are more divergent, with pairwise identity ranging from 84 to 97%. We therefore tested the paralog datasets for gene conversion events.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Lineage specific expansion of the SAL1 family</p></caption><text>
   <p><b>Lineage specific expansion of the SAL1 family</b>. Genomic localization of SAL1 and its orthologous genes in 13 species: pig (<it>Sus scrofa</it>), cow (<it>Bos taurus</it>), horse (<it>Equus caballus</it>), dog (<it>Canis familiaris</it>), guinea pig (<it>Cavia porcellus</it>), rat (<it>Rattus norvegicus</it>), mouse (<it>Mus musculus</it>), rabbit (<it>Oryctolagus cuniculus</it>), macaque (<it>Macaca mulatta</it>), chimpanzee (<it>Pan troglodytes</it>), gorilla (<it>Gorilla gorilla</it>), marmoset (<it>Callithrix jacchus</it>) and elephant (<it>Loxodonta africana</it>). Genes that belong to the SAL1 family are in black. Pseudogenes are in gray. Orthologous flanking genes SNX30, SLC46A2, ZFP37, SLC31A2 and FKBP15 are in pink, red, green, blue and purple, respectively. Chromosome numbers and the localization of genes on chromosomes are indicated near each species.</p>
</text><graphic file="1471-2148-11-148-1" hint_layout="double"/></fig>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Table S1 - Identification of SAL orthologs and co-orthologs</b>. This table summarizes accession numbers of SAL orthologs and co-orthologs, and gene locations in genomes. Genes involved in gene conversion events are also indicated. <b>Figure S1 - Putative pseudogenes</b>. Evidence for putative pseudogenes in mouse lemur, bushbaby and orangutan is indicated.</p>
</text>
<file name="1471-2148-11-148-S1.DOC">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Loss of SAL1 orthologous gene in human and Neandertal genomes</p></caption><text>
   <p><b>Loss of SAL1 orthologous gene in human and Neandertal genomes</b>. The underlined A represents the G-to-A substitution in the donor site of the second intron in human and Neandertal genomic sequences, resulting in a shift in the ORF and in the pseudogenization of the gene in the two species.</p>
</text><graphic file="1471-2148-11-148-2" hint_layout="single"/></fig>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Phylogenetic tree of SAL1 family</p></caption><text>
   <p><b>Phylogenetic tree of SAL1 family</b>. The phylogenetic tree was reconstructed using maximum likelihood (ML) and rooted by midpoint rooting. Bootstrap values are given for main branches, in bold when nodes are strongly supported (>80%). The 50 mammalian SAL1 amino-acid sequences related to the 50 identified functional genes were used to reconstruct the phylogenetic tree. After removal of gaps, the dataset comprised 119 sites. Blue circles represent speciation and red squares duplication. Branches on the tree that were tested for positive selection on species clades are indicated by black arrows.</p>
</text><graphic file="1471-2148-11-148-3" hint_layout="double"/></fig>
</sec>
<sec>
<st>
<p>Evolution of paralogs in the SAL1 family</p>
</st>
<p>Gene conversion can occur between paralogous regions if they have sufficient sequence identity. To determine whether the identified clusters of paralogs underwent gene conversion events, we searched for statistical evidence of this phenomenon using the GENECONV program <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>. The first control analysis, "Randomize sites", which randomizes the order of polymorphic sites before analysis, detected no gene conversion event in the horse, rat, mouse and guinea pig datasets, implying that the results of subsequent GENECONV analyses are reliable. As shown in Table <tblr tid="T1">1</tblr>, most of the paralogs of guinea pig and mouse are involved in gene conversion events, whereas in horse and rat, 3 of 5 and 4 of 10 genes, respectively, are involved. The length of the converted tract varied greatly among species, from 11 bp for the shortest tract in guinea pig to 529 bp for the longest tract in mouse. To determine which type of selective pressure (positive, neutral or purifying selection) shaped the evolution of these genes after gene duplication, we assessed selective pressure using the nonsynonymous/synonymous substitution rate ratio (&#969;) with codon-substitution models, where &#969; &lt; 1 indicates purifying selection, &#969; = 1 indicates neutral evolution and &#969; &gt; 1 is consistent with positive Darwinian selection <abbrgrp>
<abbr bid="B22">22</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>. We performed a branch-site-based analysis by defining each branch supporting a paralogous gene as a foreground branch for PAML. In each species where the SAL1 gene has been duplicated, only one gene underwent positive selection (Table <tblr tid="T2">2</tblr>). Likelihood Ratio Tests (LRTs) were significant for the five genes, indicating that the positive selection model fits the data better than the null model. For cow, mouse and rat genes, only a few (one or two) positively selected sites were detected, whereas in horse and particularly in guinea pig, more positively selected sites (6 and 15 sites, respectively) were detected.</p>
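<p>As an illustration of the LRT described above, the following minimal sketch recomputes the statistic 2&#916;<it>l</it> as twice the difference between the alternative and null log-likelihoods, using the values listed in Table <tblr tid="T2">2</tblr>. The function names are hypothetical. The p-value helper assumes a chi-square reference distribution with one degree of freedom, a common convention for branch-site tests that is not stated in the text, so the p-values it returns need not match the significance levels reported in the table.</p>

```python
import math


def lrt_statistic(lnl_null, lnl_alt):
    """Return 2 * (lnL_alt - lnL_null), the likelihood ratio test statistic."""
    return 2.0 * (lnl_alt - lnl_null)


def chi2_sf_1df(stat):
    """Upper-tail probability of a chi-square distribution with 1 degree of freedom.

    Assumption: 1 df is used as the reference distribution (not stated in the source).
    For 1 df, P(X > x) = erfc(sqrt(x / 2)); negative statistics are clamped to 0.
    """
    return math.erfc(math.sqrt(max(stat, 0.0) / 2.0))


# Log-likelihoods (null, alternative) copied from Table 2.
TABLE2 = {
    "Cow LOC783399": (-5660.941163, -5647.392083),
    "Guinea pig ENSCPOG00000023399": (-5660.050892, -5631.851037),
    "Horse LOC100053653": (-5665.171131, -5657.239232),
    "Mouse Mup19": (-5660.941163, -5634.004112),
    "Rat LOC298116": (-5660.941163, -5656.597218),
}

for gene, (lnl_null, lnl_alt) in TABLE2.items():
    stat = lrt_statistic(lnl_null, lnl_alt)
    print(f"{gene}: 2*delta_l = {stat:.2f}, p (1 df) = {chi2_sf_1df(stat):.2e}")
```

<p>The statistics obtained this way can be compared directly with the 2&#916;<it>l</it> column of Table <tblr tid="T2">2</tblr>.</p>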
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Interlocus gene conversion events</p></caption><tblbdy cols="5">
      <r>
         <c ca="center">
            <p>
               <b>Species</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Number of sequences in the dataset </b>
               <sup>
                  <b>(1)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Number of sequences involved in a gene conversion event </b>
               <sup>
                  <b>(2)</b>
               </sup>
            </p>
         </c>
         <c ca="center" cspan="2">
            <p>
               <b>Converted tract length (bp)</b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="2">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>min</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>max</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Horse</p>
         </c>
         <c ca="center">
            <p>5</p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>57</p>
         </c>
         <c ca="center">
            <p>74</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Rat</p>
         </c>
         <c ca="center">
            <p>10</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>161</p>
         </c>
         <c ca="center">
            <p>273</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Mouse</p>
         </c>
         <c ca="center">
            <p>21</p>
         </c>
         <c ca="center">
            <p>20</p>
         </c>
         <c ca="center">
            <p>107</p>
         </c>
         <c ca="center">
            <p>529</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Guinea pig</p>
         </c>
         <c ca="center">
            <p>5</p>
         </c>
         <c ca="center">
            <p>5</p>
         </c>
         <c ca="center">
            <p>11</p>
         </c>
         <c ca="center">
            <p>135</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>(1) </sup>All sequences are indicated in the table S1 of the additional file <supplr sid="S1">1</supplr>.</p>
      <p><sup>(2) </sup>Sequences involved in gene conversion events are indicated in bold in the table S1 of the additional file <supplr sid="S1">1</supplr>.</p>
   </tblfn></tbl>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Parameter estimates and likelihood scores for branch-site models for paralogs</p></caption><tblbdy cols="6">
      <r>
         <c ca="center">
            <p>
               <b>Genes</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Model</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>l </it>
               </b>
               <sup>
                  <b>(1)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Estimates of parameters </b>
               <sup>
                  <b>(2)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>2&#916;<it>l </it></b>
               <sup>
                  <b>(3)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Positively selected sites (BEB)</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Cow LOC783399</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5660.941163</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.32, (&#961;1 = 0.68), &#969;0 = 0.24, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>27.10 ***</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5647.392083</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.30, &#961;1 = 0.66, (&#961;2 = 0.05), &#969;0 = 0.23, (&#969;1 = 1), &#969;2 = &#8734;</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1 site p > 99%: 36Y</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Guinea pig ENSCPOG00000023399</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5660.050892</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.25, (&#961;1 = 0.50), &#969;0 = 0.24, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>56.40 ***</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5631.851037</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.27, &#961;1 = 0.55, (&#961;2 = 0.18), &#969;0 = 0.23, (&#969;1 = 1), &#969;2 = 336.70</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>7 sites p > 99%: 24R, 27A, 113L, 118Q, 123T, 125T, 128T, 8 sites p > 95%: 31L, 25E, 26T, 85V, 114T, 117T, 122V, 126L</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Horse LOC100053653</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5665.171131</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.22, (&#961;1 = 0.47), &#969;0 = 0.23, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>15.86 ***</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5657.239232</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.09, &#961;1 = 0.19, (&#961;2 = 0.72), &#969;0 = 0.23, (&#969;1 = 1), &#969;2 = 6.44</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>6 sites p > 95%: 81E, 97A, 123Q, 135K, 166K, 168F</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Mouse Mup19</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5660.941163</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.32, (&#961;1 = 0.67), &#969;0 = 0.24, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>53.87 ***</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5634.004112</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.30, &#961;1 = 0.66, (&#961;2 = 0.04), &#969;0 = 0.23, (&#969;1 = 1), &#969;2 = &#8734;</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2 sites p > 99%: 174K, 177F</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Rat LOC298116</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5660.941163</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.32, (&#961;1 = 0.67), &#969;0 = 0.24, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>8.69 ***</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5656.597218</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.32, &#961;1 = 0.66, (&#961;2 = 0.02), &#969;0 = 0.24, (&#969;1 = 1), &#969;2 = &#8734;</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1 site p > 99%: 135A</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>(1) </sup>Log-likelihood values.</p>
      <p><sup>(2) </sup>&#961;0, &#961;1, and &#961;2 are the proportions of codons subject to purifying selection, neutral evolution, and positive selection, respectively. &#969;0, &#969;1 and &#969;2 represent dN/dS for each class (purifying, neutral and positive selection, respectively).</p>
      <p><sup>(3) </sup>*** significant at <it>p </it>&lt; 0.001.</p>
   </tblfn></tbl>
</sec>
<sec>
<st>
<p>Positively selected sites in the SAL1 family and putative biological significance</p>
</st>
<p>To identify the selective pressure on the SAL1 family in eutherian mammals, we performed a site-based analysis with PAML (Table <tblr tid="T3">3</tblr>). After removal of gaps, 119 sites were analyzed using the codeml <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp> and Selecton <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> programs. In both comparisons (M1a vs. M2a, M8a vs. M8), LRTs were significant (<it>p </it>&lt; 0.001) for the dataset. Moreover, the AIC<sub>c</sub> score of MEC was lower than that of M8a, indicating that MEC fits the data better. Together, the LRTs and the AIC<sub>c</sub> comparison imply that selective forces varied among sites. According to the M2a and M8 models, 23% and 29% of sites underwent positive selection, respectively. Four sites (9V, 72Y, 73A and 90E) were identified as positively selected at the 95% level or above by the three models (M2a, M8 and MEC), a strong indication of positive selection for these four amino acids. Six sites (10T, 62R, 75C, 86A, 159R and 162Q) were identified by the M2a and M8 models. Six sites (6Q, 11S, 63K, 71F, 113G and 119L) were identified by M8 only, and one site (163L) was identified by MEC only.</p>
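<p>The AIC<sub>c</sub> comparison mentioned above can be sketched with the standard second-order Akaike formula, AICc = -2 lnL + 2k + 2k(k + 1)/(n - k - 1), where the model with the lower score is preferred. This is an illustrative sketch only: the log-likelihoods and parameter counts below are hypothetical placeholders rather than values from this study, and taking n as the 119 analyzed sites is an assumption about how sample size is defined.</p>

```python
def aicc(lnl, k, n):
    """Second-order Akaike information criterion.

    lnl: maximized log-likelihood of the model
    k:   number of free parameters
    n:   sample size (assumed here to be the number of analyzed sites)
    """
    if n - k - 1 > 0:
        return -2.0 * lnl + 2.0 * k + (2.0 * k * (k + 1)) / (n - k - 1)
    raise ValueError("AICc requires n > k + 1")


# Hypothetical inputs for illustration only (not taken from the article).
N_SITES = 119  # number of sites analyzed after gap removal, as stated in the text
model_a = {"name": "model A", "lnl": -5650.0, "k": 5}
model_b = {"name": "model B", "lnl": -5655.0, "k": 4}

scores = {m["name"]: aicc(m["lnl"], m["k"], N_SITES) for m in (model_a, model_b)}
best = min(scores, key=scores.get)
for name, score in scores.items():
    print(f"{name}: AICc = {score:.2f}")
print(f"Preferred (lowest AICc): {best}")
```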
<tbl id="T3"><title><p>Table 3</p></title><caption><p>Parameter estimates and likelihood scores for site models</p></caption><tblbdy cols="5">
      <r>
         <c ca="center">
            <p>
               <b>Model</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>l </it>
               </b>
               <sup>
                  <b>(1)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Estimates of parameters </b>
               <sup>
                  <b>(2)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>2&#916;<it>l </it></b>
               <sup>
                  <b>(3)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Positively selected sites (BEB) </b>
               <sup>
                  <b>(4)</b>
               </sup>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>M0</p>
         </c>
         <c ca="center">
            <p>-5729.43479</p>
         </c>
         <c ca="center">
            <p>&#969; = 0.92</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>M1a</p>
         </c>
         <c ca="center">
            <p>-5660.94116</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.32</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>M2a</p>
         </c>
         <c ca="center">
            <p>-5643.14631</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.24, &#961;1 = 0.53, &#961;s = 0.23, &#969;s = 2.08</p>
         </c>
         <c ca="center">
            <p>35.59 *** (M2a vs M1a)</p>
         </c>
         <c ca="center">
            <p>1 site p > 99%: <b>72Y</b>, 9 sites p > 95%: <b>9V</b>, <it>10T</it>, <it>62R</it>, <b><ul>73A</ul></b>, <it><ul>75C</ul></it>, <it>86A</it>, <b>90E</b>, <it>159R</it>, <it>162Q</it></p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>M7</p>
         </c>
         <c ca="center">
            <p>-5661.09613</p>
         </c>
         <c ca="center">
            <p>p = 0.64, q = 0.24</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>M8a</p>
         </c>
         <c ca="center">
            <p>-5657.82124</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.41, &#961;1 = 0.59, p = 1.69, q = 3.62</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>M8</p>
         </c>
         <c ca="center">
            <p>-5641.30866</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.71, &#961;s = 0.29, p = 0.83, q = 0.44, &#969;s = 1.84</p>
         </c>
         <c ca="center">
            <p>33.02 *** (M8 vs M8a)</p>
         </c>
         <c ca="center">
            <p>9 sites p > 99%: <b>9V</b>, <it>10T</it>, <it>62R</it>, <b>72Y</b>, <b><ul>73A</ul></b>, <it><ul>75C</ul></it>, <it>86A</it>, <b>90E</b>, <it>162Q</it>; 7 sites p > 95%: 6Q, 11S, 63K, 71F, 113G, <ul>119L</ul>, <it>159R</it></p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <b>Model</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>AICc score</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Estimates of parameters </b>
               <sup>
                  <b>(2)</b>
               </sup>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>Positively selected sites (BEB) </b>
               <sup>
                  <b>(4)</b>
               </sup>
            </p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>M8a</p>
         </c>
         <c ca="center">
            <p>18637.95571</p>
         </c>
         <c ca="center">
            <p>p = 1.10, q = 1.73</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c cspan="5">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>MEC</p>
         </c>
         <c ca="center">
            <p>18220.66368</p>
         </c>
         <c ca="center">
            <p>p = 0.80, q = 2.56</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>5 sites p > 95%: <b>9V</b>, <b>72Y</b>, <b><ul>73A</ul></b>, <b>90E</b>, 163L</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>(1) </sup>Log-likelihood values.</p>
      <p><sup>(2) </sup>&#969;S: average dN/dS ratio for sites subject to positive selection (models M2a and M8), p and q: shape parameters for the beta distribution of &#969; (models M7, M8 and MEC). &#961;0, &#961;1, and &#961;S are the proportions of codons subject to purifying selection, neutral evolution, and positive selection, respectively.</p>
      <p><sup>(3) </sup>*** significant at <it>p </it>&lt; 0.001.</p>
      <p><sup>(4) </sup>Bold: <it>P </it>> 95% for the three comparisons (M2a vs. M1a, M8 vs. M8a, MEC vs. M8a).</p>
      <p>Italic: <it>P </it>> 95% for the two comparisons (M2a vs. M1a and M8 vs. M8a).</p>
      <p>Underlined: amino acids involved in androstenol and androstenone binding.</p>
      <p>Site numbers and amino acids refer to the pig SAL1 reference sequence PDB: <ext-link ext-link-id="1GM6" ext-link-type="pdb">1GM6</ext-link>.</p>
   </tblfn></tbl>
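<p>The agreement between models summarized in Table <tblr tid="T3">3</tblr> can be cross-checked with simple set operations. The following sketch is illustrative only; the site lists are transcribed from the table (all sites with posterior probability > 95%), and the variable and function names are hypothetical.</p>

```python
# Sites with BEB posterior probability > 95%, transcribed from Table 3
M2A = {"9V", "10T", "62R", "72Y", "73A", "75C", "86A", "90E", "159R", "162Q"}
M8 = {
    "9V", "10T", "62R", "72Y", "73A", "75C", "86A", "90E", "162Q",
    "6Q", "11S", "63K", "71F", "113G", "119L", "159R",
}
MEC = {"9V", "72Y", "73A", "90E", "163L"}


def site_number(site):
    """Sort key: numeric position without the trailing amino acid letter."""
    return int(site[:-1])


all_three = M2A.intersection(M8, MEC)
m2a_and_m8_only = M2A.intersection(M8).difference(MEC)
m8_only = M8.difference(M2A, MEC)
mec_only = MEC.difference(M2A, M8)

for label, sites in [
    ("M2a, M8 and MEC", all_three),
    ("M2a and M8 only", m2a_and_m8_only),
    ("M8 only", m8_only),
    ("MEC only", mec_only),
]:
    print(label, sorted(sites, key=site_number))
```

<p>The four groups correspond to the bold, italic and unformatted entries in the table.</p>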
<p>To determine if positively selected sites are located in regions of interest, these sites were mapped on the 3D structure of SAL1 (PDB:<ext-link ext-link-id="1GM6" ext-link-type="pdb">1GM6</ext-link>) (Figure <figr fid="F4">4</figr>). To assess the biological significance of these sites, ligand binding sites determined by Spinelli et al. <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp> were also mapped on the 3D structure. Interestingly, three sites under positive selection matched amino acids that are directly involved in androstenol and androstenone binding (73A, 75C and 119L). Side chains of amino acids involved in ligand binding projected into the ligand binding pocket, which is formed by a relatively small internal cavity poorly accessible to solvent. In contrast, with the exception of these three amino acids, side chains of the positively selected sites projected out of the binding pocket. This suggests that positive selection plays a role not only in pheromone binding specificity, but also in interactions with partners such as receptors. Relative solvent accessibility (RSA) of positively selected sites was determined by ASAView and is shown in Figure <figr fid="F5">5</figr>. We used the same classification as Rost et al. <abbrgrp>
<abbr bid="B26">26</abbr>
</abbrgrp>: a residue is classified as buried when the RSA is &lt;9%, as exposed when the RSA is &gt;35%, and as intermediate when the RSA is between 9 and 35%. We found three buried sites (75C, 119L and 159R), six intermediate sites (9V, 62R, 63K, 71F, 73A and 163L) and seven exposed sites (10T, 11S, 72Y, 86A, 90E, 113G and 162Q), indicating that most of the positively selected sites are located at the surface of the protein. These sites are perhaps involved in functions other than pheromone binding, although these functions remain to be identified. We observed no specific clustering of these sites at the solvent-exposed surface of the protein (Figure <figr fid="F4">4</figr>).</p>
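<p>The three-way RSA classification described above can be written as a short function. This is a minimal sketch under stated assumptions: the function name is hypothetical, and the RSA values in the example are invented for illustration, because the per-residue values are shown only graphically in Figure <figr fid="F5">5</figr>.</p>

```python
def classify_rsa(rsa_percent):
    """Classify a residue by relative solvent accessibility (in percent).

    Thresholds follow the text: buried below 9%, exposed above 35%,
    intermediate from 9% to 35% inclusive.
    """
    if rsa_percent > 35:
        return "exposed"
    if rsa_percent >= 9:
        return "intermediate"
    return "buried"


# Hypothetical RSA values, for illustration only (not from the article)
example_rsa = {"residue_a": 4.0, "residue_b": 20.5, "residue_c": 61.0}

for name, value in example_rsa.items():
    print(name, value, classify_rsa(value))
```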
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Positive selection acting on orthologs</p></caption><text>
   <p><b>Positive selection acting on orthologs</b>. Amino acids involved in ligand binding <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> are mapped in blue on the SAL1 3D structure, and positively selected sites in pink (PDB: <ext-link ext-link-id="1GM6" ext-link-type="pdb">1GM6</ext-link>). Amino acids shown in a van der Waals representation (orange) were both involved in ligand binding and positively selected. Positively selected amino acids were identified by PAML computations using site models.</p>
</text><graphic file="1471-2148-11-148-4" hint_layout="double"/></fig>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Positively selected sites and solvent accessibility</p></caption><text>
   <p><b>Positively selected sites and solvent accessibility</b>. Positively charged, negatively charged, polar and non-polar residues are in blue, red, green and gray, respectively, as in ASAView. Bold: amino acids with <it>P </it>> 95% for the three comparisons (M2a vs. M1a, M8 vs. M8a, MEC vs. M8a). Italics: amino acids with <it>P </it>> 95% for the two comparisons (M2a vs. M1a and M8 vs. M8a). Underlined: amino acids involved in androstenol and androstenone binding.</p>
</text><graphic file="1471-2148-11-148-5" hint_layout="double"/></fig>
</sec>
<sec>
<st>
<p>Positive selection events in marmoset, dog, guinea pig, horse and mouse clades</p>
</st>
<p>The comparison between site models of PAML detects positive selection only if the &#969; ratio averaged over all branches on the tree is greater than 1, but positive selection can also be expected to affect only a few amino acid residues in certain lineages. For this reason, we used branch-site models <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp> that are designed to detect signals of local episodic positive selection in order to determine whether individual species lineages underwent positive selection. We tested the 12 species clades as foreground branches with the branch-site models of PAML. The branches tested are shown in Figure <figr fid="F3">3</figr>. We were unable to draw any conclusions concerning the cow, gorilla, elephant, macaque, rat and pig clades, as the LRTs were not significant. We found significant LRTs in the marmoset, dog, guinea pig, horse, mouse and rabbit clades, suggesting that SAL1 orthologs underwent positive selection in these species, but we were unable to identify any selected sites in rabbit (Table <tblr tid="T4">4</tblr>). For the other species, positively selected sites were mapped on the SAL1 structure. Mapping of site 14H identified in dog was not possible because this amino acid is situated at the N-terminal end, which is not part of the crystal structure. Sites identified in marmoset, mouse and dog were not located in the binding pocket of the protein, unlike some sites identified in guinea pig and horse. For the two latter clades, one (75C) and two (75C and 119L) positively selected sites, respectively, matched pheromone binding sites (Figure <figr fid="F6">6</figr>).</p>
<tbl id="T4"><title><p>Table 4</p></title><caption><p>Parameter estimates and likelihood scores for branch-site models for 5 species</p></caption><tblbdy cols="6">
      <r>
         <c ca="center">
            <p>
               <b>Species</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Model</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>
                  <it>l </it>
               </b>
               <sup>
                  <b>(1)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Estimates of parameters </b>
               <sup>
                  <b>(2)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>2&#916;<it>l </it></b>
               <sup>
                  <b>(3)</b>
               </sup>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Positively selected sites (BEB) </b>
               <sup>
                  <b>(4)</b>
               </sup>
            </p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Marmoset</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5660.37202</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.18, (&#961;1 = 0.39), &#969;0 = 0.23, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>18.47 ***</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5651.1375</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.30, &#961;1 = 0.63, (&#961;2 = 0.07), &#969;0 = 0.23, (&#969;1 = 1), &#969;2 = 60.38</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2 sites p > 95%: 159R, 161F</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Dog</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5660.55998</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.25, (&#961;1 = 0.52), &#969;0 = 0.24, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>6.78 **</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5657.16845</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.28, &#961;1 = 0.58, (&#961;2 = 0.14), &#969;0 = 0.23, (&#969;1 = 1), &#969;2 = 11.15</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2 sites p > 95%: <b>14H</b>, 144Y</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Guinea pig</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5660.94116</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.32, (&#961;1 = 0.67), &#969;0 = 0.24, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>13.68 ***</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5654.10045</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.27, &#961;1 = 0.51, (&#961;2 = 0.22), &#969;0 = 0.25, (&#969;1 = 1), &#969;2 = 3.94</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>3 sites p > 95%: 11S, <ul>75C</ul>, 104V</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Horse</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5660.94121</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.32, (&#961;1 = 0.67), &#969;0 = 0.24, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>27.22 ***</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5647.32999</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.30, &#961;1 = 0.61, (&#961;2 = 0.09), &#969;0 = 0.24, (&#969;1 = 1), &#969;2 = 7.57</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2 sites p > 99%: <ul>75C</ul>, <ul>119L</ul>; 1 site p > 95%: 144Y</p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>Mouse</p>
         </c>
         <c ca="center">
            <p>Null</p>
         </c>
         <c ca="center">
            <p>-5658.60187</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.24, (&#961;1 = 0.55), &#969;0 = 0.19, (&#969;1 = 1)</p>
         </c>
         <c ca="center">
            <p>10.75 **</p>
         </c>
         <c ca="center">
            <p>Not allowed</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c cspan="3">
            <hr/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="1">
            <hr/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>Alternative</p>
         </c>
         <c ca="center">
            <p>-5653.22783</p>
         </c>
         <c ca="center">
            <p>&#961;0 = 0.25, &#961;1 = 0.58, (&#961;2 = 0.17), &#969;0 = 0.19, (&#969;1 = 1), &#969;2 = 3.25</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>1 site p > 95%: 162Q</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>(1) </sup>Log-likelihood values.</p>
      <p><sup>(2) </sup>&#961;0, &#961;1, and &#961;2 are the proportions of codons subject to purifying selection, neutral evolution, and positive selection, respectively. &#969;0, &#969;1 and &#969;2 represent dN/dS for each class (purifying, neutral and positive selection, respectively).</p>
      <p><sup>(3) </sup>** significant at <it>p </it>&lt; 0.01</p>
      <p>*** significant at <it>p </it>&lt; 0.001.</p>
      <p><sup>(4) </sup>Underlined: amino acids involved in androstenol and androstenone binding.</p>
      <p>Site numbers and amino acids refer to the pig SAL1 reference sequence PDB: <ext-link ext-link-id="1GM6" ext-link-type="pdb">1GM6</ext-link>, except for the amino acid in bold, which is not part of the structure but comes from the dog genome sequence XP_855342.1.</p>
   </tblfn></tbl>
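<p>The branch-site statistics in Table <tblr tid="T4">4</tblr> can be recomputed from the null and alternative log-likelihoods of each foreground clade. The sketch below is illustrative and not part of the original analysis: the names are hypothetical, and the use of chi-square critical values for one degree of freedom (6.635 at 0.01 and 10.828 at 0.001) is an assumption, because the text does not state which reference distribution was used.</p>

```python
# (null lnL, alternative lnL) per foreground clade, transcribed from Table 4
BRANCH_SITE_LNL = {
    "Marmoset": (-5660.37202, -5651.1375),
    "Dog": (-5660.55998, -5657.16845),
    "Guinea pig": (-5660.94116, -5654.10045),
    "Horse": (-5660.94121, -5647.32999),
    "Mouse": (-5658.60187, -5653.22783),
}

# Chi-square critical values for 1 degree of freedom (assumed here)
CRITICAL_P01 = 6.635
CRITICAL_P001 = 10.828


def significance_label(statistic):
    """Map a likelihood ratio statistic to the star notation of the table."""
    if statistic > CRITICAL_P001:
        return "***"
    if statistic > CRITICAL_P01:
        return "**"
    return "ns"


results = {
    clade: 2.0 * (lnl_alt - lnl_null)
    for clade, (lnl_null, lnl_alt) in BRANCH_SITE_LNL.items()
}

for clade, statistic in results.items():
    print(clade, round(statistic, 2), significance_label(statistic))
```

<p>With these assumed thresholds, the recomputed labels agree with the stars shown in the table for all five clades.</p>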
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Positive selection acting on species clades</p></caption><text>
   <p><b>Positive selection acting on species clades</b>. Positively selected sites identified on species clades are shown in a van der Waals representation on the SAL1 3D structure. Positively selected sites that matched amino acids involved in ligand binding are in yellow, the others in pink. Amino acids involved in ligand binding are in blue, as in Figure 4. Positively selected amino acids were identified by PAML computations using branch-site models. A: marmoset, B: guinea pig, C: horse, D: mouse and E: dog.</p>
</text><graphic file="1471-2148-11-148-6" hint_layout="double"/></fig>
</sec>
</sec>
<sec>
<st>
<p>Discussion</p>
</st>
<p>Phylogenomic analyses showed that the SAL1 family originated in eutherian mammals and that genes belonging to this family were duplicated after speciation events in five mammalian species. In certain living species, such as mouse lemur, bushbaby, orangutan and human, the gene has been lost, as it has in the Neanderthal genome. We can therefore date the loss of the gene in hominids to before the Neanderthal-modern human split, 400,000 to 350,000 years ago <abbrgrp>
<abbr bid="B28">28</abbr>
</abbrgrp>. The number of duplication events varies greatly among species. In mouse and rat, massive cis-duplication events have occurred, producing clusters of 42 and 22 genes, respectively, followed by gene loss that left 21 and 11 pseudogenes, respectively.</p>
<p>Gene duplication represents a source of new genetic material, and can lead to evolutionary novelties. The fate of duplicated genes can follow different models of evolution, with different selective pressures acting on the genes <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>. We checked for a change in selective pressure in all paralogs identified in the SAL1 family and found that only a few paralogs underwent positive selection: one gene each in cow, guinea pig, horse, mouse and rat. Moreover, few positively selected sites were identified in these genes. A large proportion (66%) of each gene evolved under neutrality and only a small proportion (2 to 5%) under positive selection. However, in the single genes identified as positively selected in guinea pig and horse, a larger proportion of sites evolved under positive selection (18 and 72%, respectively) and more sites were identified as being positively selected (15 sites in guinea pig and 6 sites in horse).</p>
<p>Because sequences of some paralogs share high similarity, we searched for gene conversion in our paralogous gene datasets and found extensive interlocus gene conversion events in mouse and guinea pig, and to a lesser extent in horse and rat. Karn and Laukaitis <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp> compared the mouse <it>Mup </it>cluster with a gene tree published by Mudge et al. <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp> and suggested that concerted evolution masked the common origin of the gene and neighboring pseudogenes <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>. Our results confirmed this hypothesis, indicating extensive gene conversion in the mouse <it>Mup </it>cluster. This extensive gene conversion led to sequence homogenization and is the cause of the concerted evolution of these genes. Such extensive concerted evolution suggests that, at least in mouse and guinea pig, both maintenance of sequence homogeneity and increased gene dosage are important for these species. The evolution of SAL1 paralogs resembles the evolution of the &#946;-globin gene family. In this family, paralogous copies evolved under a process of functional divergence and there is evidence for two gene conversion events in mouse and goat clusters composed of duplicated &#946;-globin genes. There is also evidence for variable selective pressure among sites for &#946;- and &#947;-globin genes, with 4 to 9% of sites evolving under positive selection <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>.</p>
<p>By combining the phylogenetic, gene conversion and selective pressure results on paralog evolution, we can try to describe the fate of the duplicated genes, in which duplication can be seen as advantageous for the species concerned, by combining two scenarios from Innan and Kondrashov <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>. In the first scenario, one could consider the massive duplication in rat and mouse as a gene amplification where the increase in dosage of these genes is beneficial. This scenario of evolution corresponds to category IIa described by Innan and Kondrashov <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>. In this model, if selection for the duplicated copy is weak, pseudogenization can occur if a null mutation is fixed, which is the case in both mouse and rat. The occurrence of gene conversions that maintain sequence similarity and promote conservation of gene copies could be consistent with that hypothesis, but the high frequency of gene conversion events is not restricted to mouse and rat. In fact, guinea pigs, which do not harbor large gene amplifications, have the highest frequency of conversion events per gene copy among the species tested. The beneficial increase in dosage has already been shown to apply to genes that mediate the interaction between the organism and the environment <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>, as is true of genes of the SAL1 family. However, we also showed that among the many duplicates in rat, mouse and guinea pig, one gene per species is under positive selection, so increased gene dosage and gene conversion events are not the only driving forces in the evolution of these genes in these species. These positively selected duplicates are better described by the category III scenario <abbrgrp>
<abbr bid="B29">29</abbr>
</abbrgrp>, in which a new copy can be fixed and preserved by positive selection, leading to the possible emergence of a new function for the positively selected gene.</p>
<p>To study selective pressure in the SAL1 family in more detail, we used PAML to test for positive selection on the amino acid changes occurring along the 12 branches leading to species clades. Our results showed that the marmoset, dog, guinea pig, horse and mouse branches underwent positive selection just after divergence. This evolutionary scenario likely reflects the ability of the SAL1 family to diverge and to adapt to new behaviour between sexual partners. A previous study on mouse and rat genes identified 32 sites as positively selected on rodent co-orthologs of SAL1 <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>. In that study, mouse and rat genes were considered together, whereas in our study they were analyzed separately, which partly explains the difference between the two results. Indeed, we identified only one site under positive selection in mouse and no positive selection in rat. The difference between the two results is also due to a difference in the probability threshold chosen to determine whether a site is subject to positive selection. In Emes et al. <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>, a site was said to be positively selected if its probability was &gt; 0.90 for one model and &gt; 0.50 for at least one other model. In our study, we chose to consider only sites whose probability was &gt; 0.95 in order to minimize false positives.</p>
<p>Finally, we compared tests of variable selective pressures for the family using several PAML codon models. We found evidence for positive selection in a small proportion of sites. Because positive selection is known to play a role in the diversification of protein functions, we mapped all positively selected sites on the 3D structure, in order to assess their biological significance for the gene family as a whole and for each species independently. Apart from the three amino acids that were both under positive selection and involved in ligand binding, the amino acids identified by the PAML site models projected out of the binding pocket. Moreover, the majority of these sites were exposed to solvent. If these sites were involved in the interaction with pheromones, they would be found preferentially in the hydrophobic core and would be buried. We thus propose that positive selection plays a role not only in binding specificity but also in the interaction between the protein and its environment. We were not able to draw conclusions about the selective pressure on every site involved in ligand binding, because gaps in the multiple sequence alignment made these calculations impossible. Nevertheless, among the 16 amino acids involved in pheromone binding, we identified three sites that probably evolved under purifying constraints (87Y, 91N and 93F) and four sites that probably evolved under relaxed constraints (60F, 85V, 121E and 123Y). The three sites that evolved under purifying constraints may be essential for protein function, because they were well conserved during the evolution of the family. In rodent populations, Emes et al. <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp> found that MUPs, which are co-orthologs of SAL1, exhibited amino acids under positive selection, and that these positively selected sites were located at the interface between MUPs and their receptors, probably V2R receptors on the vomeronasal organ. They also found evidence that olfactory receptors, such as V2Rs, underwent positive selection. The hypothesis they proposed is that this adaptation phenomenon is due to conspecific competition, resulting in well adapted pheromones, pheromone binding proteins such as MUP, and olfactory receptors <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp>. Our results allow us to extend this hypothesis, because positive selection also drives the evolution of pheromone binding proteins in other eutherian mammals. Thus, across the whole family, and not just in rodents, these proteins appear to have evolved adaptively with respect to their ligands and perhaps their receptors as well. It would be interesting to test whether V2R receptors are subject to positive selection, not only in rodents but also in other mammals. Several authors reported evidence for positive selection on other OR genes in mammals <abbrgrp>
<abbr bid="B35">35</abbr>
<abbr bid="B36">36</abbr>
<abbr bid="B37">37</abbr>
<abbr bid="B38">38</abbr>
<abbr bid="B39">39</abbr>
</abbrgrp>, with possible involvement of positively selected sites in the binding property of proteins. Moreno-Estrada <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp> suggested that positive selection could be at the origin of a new ligand binding capability or the modification of odorant perception and could improve the overall degenerated OR gene repertoire, at least in human. In insects, co-evolution of the two enzymes involved in the pheromone biosynthetic pathway and in the pheromone receptor has been suggested to play a role in the speciation process <abbrgrp>
<abbr bid="B40">40</abbr>
</abbrgrp>. It would be interesting to test co-evolution of enzyme/receptor, pheromone/receptor and OBP/receptor in mammals.</p>
<p>In mice, MUPs are important for the delivery, via urine, of chemical signals conveying information about the sex and hormonal status of the animal that releases the scent mark <abbrgrp>
<abbr bid="B41">41</abbr>
</abbrgrp>. In pig, SAL1 may be involved in pre-mating recognition by binding pig specific sex pheromones in saliva <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>. In both species, these proteins are involved in conspecific recognition in the context of reproduction. When the genomes of marine mammals are completed, it will be interesting to search for SAL1 orthologs. Indeed, in such a different environment, chemical communication between sexual partners is probably not mediated by the same olfactory cues as in terrestrial mammals. If a SAL1 ortholog is found in marine mammal genomes, it will be interesting to discover if it evolved under relaxed constraints or positive selection.</p>
<p>It is well established that reproduction is a very competitive process, and that selective pressures on genes involved in the process are not rare (for a review, see <abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp>). Positive Darwinian selection is not atypical, especially for genes involved in sensory perception and mate choice <abbrgrp>
<abbr bid="B43">43</abbr>
</abbrgrp>. Our results demonstrated that (i) positively selected sites differ between genes and (ii) positively selected sites are involved in ligand binding and are putatively involved in receptor binding. Such a selective pressure on these proteins could be at the origin of a divergence process between species and thus contribute to the speciation phenomenon by reinforcing prezygotic barriers. To test this hypothesis, we performed <it>in vitro </it>mutagenesis experiments on SAL1, but the poor folding of the resulting proteins prevented further experimentation.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>The SAL1 gene family originated in eutherian mammals and duplicated after speciation in cow, horse, guinea pig and rodents. Some duplicated genes underwent concerted evolution with extensive gene conversion. Others were subject to positive selection at different sites, and our knowledge of the 3D structure of this protein suggests that the selected sites are involved in pheromone binding and possibly in olfactory receptor binding. This result suggests a functional divergence between species because positively selected sites differ between species. All these data suggest that the evolution of the SAL1 family allows a species-specific strategy to transduce pheromonal signals in mammals, reinforcing species divergence through species-specific sexual behaviour.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Phylogenetic and syntenic analyses</p>
</st>
<p>The protein sequence of the pig salivary lipocalin (SAL1) was retrieved from GenBank (<url>http://www.ncbi.nlm.nih.gov/genbank/</url>) <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp> (<ext-link ext-link-id="NP_998979.1" ext-link-type="gen">NP_998979.1</ext-link>). Homologous proteins from other species were searched for using TBLASTN, with the porcine protein sequence as the query, against all mammalian genomes available in the NCBI (<url>http://www.ncbi.nlm.nih.gov/mapview/</url>) <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp> and ENSEMBL databases (<url>http://www.ensembl.org/index.html</url>) <abbrgrp>
<abbr bid="B46">46</abbr>
</abbrgrp>. Identified proteins were then located on genomes for syntenic analyses of the most recent genome sequence assemblies: pig (<it>Sus scrofa</it>: ENSEMBL Sscrofa9), cow (<it>Bos taurus</it>: NCBI Btau5.2), horse (<it>Equus caballus</it>: NCBI EquCab2.0), dog (<it>Canis familiaris</it>: ENSEMBL CanFam2.0), guinea pig (<it>Cavia porcellus</it>: ENSEMBL cavPor3), rat (<it>Rattus norvegicus</it>: NCBI RGSC 3.4), mouse (<it>Mus musculus</it>: NCBIM37), rabbit (<it>Oryctolagus cuniculus</it>: ENSEMBL OryCun2), rhesus monkey (<it>Macaca mulatta</it>: NCBI Build 1.2), chimpanzee (<it>Pan troglodytes</it>: NCBI Build 2.1), gorilla (<it>Gorilla gorilla</it>: ENSEMBL gorGor3), marmoset (<it>Callithrix jacchus</it>: ENSEMBL C_jacchus3.2.1) and elephant (<it>Loxodonta africana</it>: ENSEMBL loxAfr3). To improve homology assignment, we only included genes from the same syntenic region in the final dataset. Sequences with no syntenic information were discarded. No genes were identified in other available mammalian genomes, and existing genome assemblies did not allow us to identify the syntenic region. Multiple sequence alignments were performed using the Clustal W algorithm <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp>. The chimpanzee sequence was removed from the dataset to maximize the number of informative sites. All alignment gap sites were removed before phylogenetic analyses. Phylogenetic trees were reconstructed using maximum likelihood (ML) in PhyML 3.0 <abbrgrp>
<abbr bid="B48">48</abbr>
</abbrgrp> in order to establish orthologous and paralogous relationships among the gene datasets. Bootstrap values <abbrgrp>
<abbr bid="B49">49</abbr>
</abbrgrp> were estimated with 1000 replications and the tree was rooted using the midpoint rooting method. Orthology and paralogy relationships were inferred from the resulting phylogenetic tree.</p>
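<p>The gap-removal step described above can be illustrated with a short Python sketch. It is not the authors' pipeline: the function name and the dictionary representation of the alignment are assumptions made for illustration, and only the hyphen is treated as a gap character by default.</p>

```python
def strip_gap_columns(alignment, gap_chars="-"):
    """Drop every alignment column in which any sequence has a gap.

    alignment maps sequence names to aligned strings of equal length.
    """
    names = list(alignment)
    lengths = {len(alignment[name]) for name in names}
    if len(lengths) > 1:
        raise ValueError("aligned sequences must all have the same length")
    # Transpose the alignment so that each tuple is one column.
    columns = zip(*(alignment[name] for name in names))
    kept = [col for col in columns if not any(c in gap_chars for c in col)]
    # Rebuild each sequence from the columns that were kept.
    return {name: "".join(col[i] for col in kept) for i, name in enumerate(names)}
```

<p>Because a single short sequence introduces gaps in many columns, removing it before this step (as was done for the chimpanzee sequence) leaves more columns available for the analysis.</p>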
</sec>
<sec>
<st>
<p>Gene conversion</p>
</st>
<p>The four clusters of paralogs identified for the guinea pig, horse, rat and mouse were tested for interlocus gene conversion, i.e. nonreciprocal transfer of genetic information between genes of the same locus, using GENECONV version 1.81 <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>, which is a widely used method for detecting partial gene conversion <abbrgrp>
<abbr bid="B50">50</abbr>
</abbrgrp>. Each subset alignment was analyzed using the Clustal W algorithm <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp> to search for pairs of sequences sufficiently similar to suggest gene conversion events. Three <it>p</it>-values were calculated and compared to assess the significance of the results. Evidence for gene conversion was strong when a fragment had a <it>p</it>-value &lt; 0.05 for at least two different types of statistical tests. In each alignment, indels and missing data were treated as a single polymorphism. All polymorphic sites were tested for evidence of gene conversion using adjusted mismatch penalties of 0, 1 or 2, to enable detection of both ancient and recent gene conversion events.</p>
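<p>The decision rule stated above (a fragment is retained when <it>p</it> &lt; 0.05 for at least two types of test) can be written as a small Python sketch. The function name and the test labels are hypothetical; the sketch takes p-values that have already been computed and does not reproduce GENECONV itself.</p>

```python
def strong_conversion_evidence(p_values, alpha=0.05, min_tests=2):
    """Return True when at least min_tests p-values fall below alpha.

    p_values maps a test label (hypothetical names) to its p-value
    for one candidate converted fragment.
    """
    significant = [name for name, p in p_values.items() if alpha > p]
    return len(significant) >= min_tests
```

<p>For instance, a fragment with p-values of 0.01, 0.03 and 0.20 for three tests would be retained, whereas one with 0.01, 0.20 and 0.50 would not.</p>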
</sec>
<sec>
<st>
<p>Evolutionary analyses</p>
</st>
<p>To investigate selective pressure, we used the CODEML application in the PAML package version 4.4 <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>, which allows the dN/dS ratio to vary across codons and estimates the probability that each codon is under positive selection. The alignments were produced with Clustal W <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp> and PAL2NAL <abbrgrp>
<abbr bid="B51">51</abbr>
</abbrgrp>.</p>
<sec>
<st>
<p>Study of selective pressure in the SAL1 family</p>
</st>
<p>To determine if selective pressure varied among sites in the SAL1 family, we used site models implemented in PAML <abbrgrp>
<abbr bid="B52">52</abbr>
</abbrgrp>, which allow the &#969; ratio to vary among sites <abbrgrp>
<abbr bid="B52">52</abbr>
<abbr bid="B53">53</abbr>
</abbrgrp>. As for the reconstruction of the phylogenetic tree, the chimpanzee sequence (the shortest sequence) was removed to maximize the number of informative sites. We used three pairs of models including M1a (nearly neutral: 0 &lt; <it>&#969;</it>
<sub>
<it>0 </it>
</sub>&lt;1 and &#969;<sub>1 </sub>= 1) versus M2a (positive selection: 0 &lt; <it>&#969;</it>
<sub>
<it>0 </it>
</sub>&lt; 1, <it>&#969;</it>
<sub>
<it>1 </it>
</sub>= 1 and <it>&#969;</it>
<sub>
<it>2 </it>
</sub>&gt;1) <abbrgrp>
<abbr bid="B52">52</abbr>
</abbrgrp>, M8a (beta &amp; <it>&#969;</it>
<sub>
<it>s </it>
</sub>= 1: 0 &lt; <it>&#969; </it>&lt; 1 and <it>&#969;</it>
<sub>
<it>s </it>
</sub>= 1) versus M8 <abbrgrp>
<abbr bid="B54">54</abbr>
</abbrgrp> and MEC (a combined mechanistic-empirical model implemented in the Selecton server, <url>http://selecton.tau.ac.il/index.html</url>) <abbrgrp>
<abbr bid="B25">25</abbr>
<abbr bid="B55">55</abbr>
</abbrgrp> versus M8a, with the PhyML-generated tree used for the analysis. Likelihood ratio tests were used to compare log likelihood values for M1a vs. M2a and M8a vs. M8 <abbrgrp>
<abbr bid="B52">52</abbr>
</abbrgrp>. The Akaike information criterion (AIC<sub>c </sub>score) was used to compare M8a and MEC <abbrgrp>
<abbr bid="B55">55</abbr>
</abbrgrp>. The Bayes Empirical Bayes (BEB) method <abbrgrp>
<abbr bid="B56">56</abbr>
</abbrgrp> implemented in PAML was used to estimate posterior probabilities of selection on each codon; probabilities &gt; 0.95 were considered significant.</p>
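<p>The model comparisons described above reduce to simple arithmetic on the log-likelihoods reported by the programs. The following Python sketch is illustrative only and is not the authors' code. It assumes the conventional chi-square approximation, with 2 degrees of freedom for M1a versus M2a and 1 degree of freedom for M8a versus M8 (neither value is stated in the text), and it assumes that the sample size in the AICc correction is the number of aligned sites. All function names are hypothetical.</p>

```python
import math


def lrt_statistic(lnl_null, lnl_alt):
    """Return 2*(lnL_alt - lnL_null), floored at zero."""
    return max(0.0, 2.0 * (lnl_alt - lnl_null))


def chi2_sf(stat, df):
    """Upper-tail chi-square probability for df = 1 or 2 (closed forms)."""
    if df == 1:
        return math.erfc(math.sqrt(stat / 2.0))
    if df == 2:
        return math.exp(-stat / 2.0)
    raise ValueError("only df = 1 or df = 2 are handled in this sketch")


def aicc(lnl, n_params, n_sites):
    """Small-sample corrected Akaike information criterion."""
    k = n_params
    return -2.0 * lnl + 2.0 * k + (2.0 * k * (k + 1)) / (n_sites - k - 1)
```

<p>With hypothetical log-likelihoods of -2510.4 for M1a and -2498.1 for M2a, <it>chi2_sf(lrt_statistic(-2510.4, -2498.1), 2)</it> gives the p-value of the likelihood ratio test. For the non-nested M8a and MEC models, the model with the lower AICc score is preferred.</p>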
</sec>
<sec>
<st>
<p>Study of selective pressure on species and paralogs</p>
</st>
<p>To determine whether different species underwent selective pressure, we used the branch-site models of PAML <abbrgrp>
<abbr bid="B27">27</abbr>
<abbr bid="B57">57</abbr>
</abbrgrp>, which estimate different dN/dS values among branches and among sites. These models can detect a short episode of positive selection when it affects only a small fraction of amino acids. We tested 13 branches in turn as the foreground branch (i.e. the branch for which positive selection is allowed): eight branches leading to a species (pig, dog, rabbit, macaque, human, gorilla, marmoset and elephant) and five internal branches situated after speciation and before duplication events (in cow, horse, guinea pig, rat and mouse). Figure <figr fid="F3">3</figr> shows which branches on the phylogenetic tree were tested for positive selection. We also tested each individual branch leading to a paralog in order to detect selective pressures following duplication events. The PhyML-generated tree was again used for the analysis. Two models were used to test for positive selection: an 'alternative' model, in which the foreground branch may have some sites under positive selection, and a 'null' model, in which the foreground branch may have different proportions of sites under neutral evolution than the background branch. For the 'alternative' model, three classes were defined: &#969;0: dN/dS &lt; 1, &#969;1: dN/dS = 1 and &#969;2: dN/dS &#8805; 1, while in the 'null' model, &#969;2 was fixed to 1. As for the site models, LRT <abbrgrp>
<abbr bid="B52">52</abbr>
</abbrgrp> and BEB <abbrgrp>
<abbr bid="B56">56</abbr>
</abbrgrp> were used.</p>
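<p>In CODEML, the 'null' and 'alternative' branch-site models differ only in whether &#969;2 is fixed to 1 or estimated. The Python sketch below generates a minimal pair of control files to make this explicit. It is an assumption-laden illustration, not the authors' configuration: the file names are hypothetical, the remaining options are left at the program defaults, and the foreground branch is assumed to be labelled with #1 in the tree file.</p>

```python
def branch_site_ctl(seqfile, treefile, outfile, null):
    """Return codeml control-file text for branch-site model A.

    null=True fixes omega2 at 1; null=False lets it be estimated.
    """
    settings = [
        ("seqfile", seqfile),
        ("treefile", treefile),   # foreground branch marked with #1 in the tree
        ("outfile", outfile),
        ("seqtype", 1),           # codon sequences
        ("model", 2),             # two or more omega ratios across branches
        ("NSsites", 2),           # with model = 2 this selects branch-site model A
        ("fix_omega", 1 if null else 0),
        ("omega", 1),             # fixed value (null) or starting value (alternative)
    ]
    return "\n".join(f"{key} = {value}" for key, value in settings) + "\n"


# Hypothetical file names, one control file per model.
null_ctl = branch_site_ctl("sal1.phy", "sal1_fg.tre", "null.out", null=True)
alt_ctl = branch_site_ctl("sal1.phy", "sal1_fg.tre", "alt.out", null=False)
```

<p>Running CODEML once with each control file for a given foreground branch yields the two log-likelihoods compared by the LRT; repeating this for each tested branch covers the full set of branch-site tests.</p>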
</sec>
</sec>
<sec>
<st>
<p>Putative function of positively selected sites</p>
</st>
<p>To assess the functionality of positively selected sites, the sites were positioned on the SAL1 structure (PDB: <ext-link ext-link-id="1GM6" ext-link-type="pdb">1GM6</ext-link>
<abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>) and their positions evaluated against the accessible surface area (ASA) of amino acids in SAL1 as determined by ASAView <abbrgrp>
<abbr bid="B58">58</abbr>
</abbrgrp>. SAL1 androstenol and androstenone binding sites were previously determined by Spinelli et al. <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>. These amino acids were positioned on the SAL1 structure. Molecular graphics images were produced using the UCSF Chimera package <abbrgrp>
<abbr bid="B59">59</abbr>
</abbrgrp>.</p>
</sec>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>CM performed the main data collection and analyses. GP provided advice on bioinformatic analyses. IC performed protein structural analyses. CM, FB and PNLM performed mutagenesis experiments and protein production. PM designed the study and helped guide the general analyses. All authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>CM is funded by a MENRT PhD fellowship. This work was supported by INRA.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Genetics and the origin of species/by Theodosius Dobzhansky</p></title><aug><au><snm>Dobzhansky</snm><fnm>T</fnm></au></aug><publisher>New York: Columbia University Press</publisher><pubdate>1964</pubdate></bibl><bibl id="B2"><title><p>On the scent of speciation: the chemosensory system and its role in premating isolation</p></title><aug><au><snm>Smadja</snm><fnm>C</fnm></au><au><snm>Butlin</snm><fnm>RK</fnm></au></aug><source>Heredity</source><pubdate>2008</pubdate><volume>102</volume><issue>1</issue><fpage>77</fpage><lpage>97</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1601-5223.1985.tb00468.x</pubid><pubid idtype="pmpid" link="fulltext">18685572</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Pheromones': a new term for a class of biologically active substances</p></title><aug><au><snm>Karlson</snm><fnm>P</fnm></au><au><snm>Luscher</snm><fnm>M</fnm></au></aug><source>Nature</source><pubdate>1959</pubdate><volume>183</volume><issue>4653</issue><fpage>55</fpage><lpage>56</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/183055a0</pubid><pubid idtype="pmpid">13622694</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Mammalian odorant binding proteins. 
<it>Biochimica et Biophysica Acta (BBA)</it></p></title><aug><au><snm>Tegoni</snm><fnm>M</fnm></au><au><snm>Pelosi</snm><fnm>P</fnm></au><au><snm>Vincent</snm><fnm>F</fnm></au><au><snm>Spinelli</snm><fnm>S</fnm></au><au><snm>Campanacci</snm><fnm>V</fnm></au><au><snm>Grolli</snm><fnm>S</fnm></au><au><snm>Ramoni</snm><fnm>R</fnm></au><au><snm>Cambillau</snm><fnm>C</fnm></au></aug><source>Protein Structure and Molecular Enzymology</source><pubdate>2000</pubdate><volume>1482</volume><issue>1-2</issue><fpage>229</fpage><lpage>240</lpage><xrefbib><pubid idtype="doi">10.1016/S0167-4838(00)00167-9</pubid></xrefbib></bibl><bibl id="B5"><title><p>The role of perireceptor events in vertebrate olfaction</p></title><aug><au><snm>Pelosi</snm><fnm>P</fnm></au></aug><source>Cell Mol Life Sci</source><pubdate>2001</pubdate><volume>58</volume><issue>4</issue><fpage>503</fpage><lpage>509</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/PL00000875</pubid><pubid idtype="pmpid" link="fulltext">11361085</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Reproductive behaviour of pigs</p></title><aug><au><snm>Signoret</snm><fnm>JP</fnm></au></aug><source>J Reprod Fertil Suppl</source><pubdate>1970</pubdate><volume>11</volume><issue>11</issue><note>Suppl 11:105+.</note></bibl><bibl id="B7"><title><p>Multiple roles of major urinary proteins in the house mouse, Mus domesticus</p></title><aug><au><snm>Beynon</snm><fnm>RJ</fnm></au><au><snm>Hurst</snm><fnm>JL</fnm></au></aug><source>Biochem Soc Trans</source><pubdate>2003</pubdate><volume>31</volume><issue>Pt 1</issue><fpage>142</fpage><lpage>146</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">12546672</pubid></xrefbib></bibl><bibl id="B8"><title><p>Lipocalins of boar salivary glands binding odours and 
pheromones</p></title><aug><au><snm>Marchese</snm><fnm>S</fnm></au><au><snm>Pes</snm><fnm>D</fnm></au><au><snm>Scaloni</snm><fnm>A</fnm></au><au><snm>Carbone</snm><fnm>V</fnm></au><au><snm>Pelosi</snm><fnm>P</fnm></au></aug><source>Eur J Biochem</source><pubdate>1998</pubdate><volume>252</volume><issue>3</issue><fpage>563</fpage><lpage>568</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1432-1327.1998.2520563.x</pubid><pubid idtype="pmpid" link="fulltext">9546674</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>PIG COURTSHIP BEHAVIOR - PHEROMONAL PROPERTY OF ANDROSTENE STEROIDS IN MALE SUB-MAXILLARY SECRETION</p></title><aug><au><snm>Perry</snm><fnm>GC</fnm></au><au><snm>Patterson</snm><fnm>RLS</fnm></au><au><snm>Macfie</snm><fnm>HJH</fnm></au><au><snm>Stinson</snm><fnm>CG</fnm></au></aug><source>Animal Production</source><pubdate>1980</pubdate><volume>31</volume><issue>OCT</issue><fpage>191</fpage><lpage>199</lpage></bibl><bibl id="B10"><title><p>Functional Characterization of Olfactory Binding Proteins for Appeasing Compounds and Molecular Cloning in the Vomeronasal Organ of Pre-pubertal Pigs</p></title><aug><au><snm>Guiraudie</snm><fnm>G</fnm></au><au><snm>Pageat</snm><fnm>P</fnm></au><au><snm>Cain</snm><fnm>AH</fnm></au><au><snm>Madec</snm><fnm>I</fnm></au><au><snm>Meillour</snm><fnm>PN-L</fnm></au></aug><source>Chem Senses</source><pubdate>2003</pubdate><volume>28</volume><issue>7</issue><fpage>609</fpage><lpage>619</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/chemse/bjg052</pubid><pubid idtype="pmpid" link="fulltext">14578123</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Purification, cloning and characterisation of odorant- and pheromone-binding proteins from pig nasal 
epithelium</p></title><aug><au><snm>Scaloni</snm><fnm>A</fnm></au><au><snm>Paolini</snm><fnm>S</fnm></au><au><snm>Brandazza</snm><fnm>A</fnm></au><au><snm>Fantacci</snm><fnm>M</fnm></au><au><snm>Bottiglieri</snm><fnm>C</fnm></au><au><snm>Marchese</snm><fnm>S</fnm></au><au><snm>Navarrini</snm><fnm>A</fnm></au><au><snm>Fini</snm><fnm>C</fnm></au><au><snm>Ferrara</snm><fnm>L</fnm></au><au><snm>Pelosi</snm><fnm>P</fnm></au></aug><source>Cell Mol Life Sci</source><pubdate>2001</pubdate><volume>58</volume><issue>5-6</issue><fpage>823</fpage><lpage>834</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11437241</pubid></xrefbib></bibl><bibl id="B12"><title><p>The lipocalin protein family: structure and function</p></title><aug><au><snm>Flower</snm><fnm>DR</fnm></au></aug><source>Biochem J</source><pubdate>1996</pubdate><volume>318</volume><issue>Pt 1</issue><fpage>1</fpage><lpage>14</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1217580</pubid><pubid idtype="pmpid" link="fulltext">8761444</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Boar salivary lipocalin. 
Three-dimensional X-ray structure and androsterol/androstenone docking simulations</p></title><aug><au><snm>Spinelli</snm><fnm>S</fnm></au><au><snm>Vincent</snm><fnm>F</fnm></au><au><snm>Pelosi</snm><fnm>P</fnm></au><au><snm>Tegoni</snm><fnm>M</fnm></au><au><snm>Cambillau</snm><fnm>C</fnm></au></aug><source>Eur J Biochem</source><pubdate>2002</pubdate><volume>269</volume><issue>10</issue><fpage>2449</fpage><lpage>2456</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1432-1033.2002.02901.x</pubid><pubid idtype="pmpid" link="fulltext">12027882</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Cloning, post-translational modifications, heterologous expression and ligand-binding of boar salivary lipocalin</p></title><aug><au><snm>Loebel</snm><fnm>D</fnm></au><au><snm>Scaloni</snm><fnm>A</fnm></au><au><snm>Paolini</snm><fnm>S</fnm></au><au><snm>Fini</snm><fnm>C</fnm></au><au><snm>Ferrara</snm><fnm>L</fnm></au><au><snm>Breer</snm><fnm>H</fnm></au><au><snm>Pelosi</snm><fnm>P</fnm></au></aug><source>Biochem J</source><pubdate>2000</pubdate><volume>350</volume><issue>Pt 2</issue><fpage>369</fpage><lpage>379</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1221263</pubid><pubid idtype="pmpid" link="fulltext">10947950</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Sensing Odorants and Pheromones with Chemosensory Receptors</p></title><aug><au><snm>Touhara</snm><fnm>K</fnm></au><au><snm>Vosshall</snm><fnm>LB</fnm></au></aug><source>Annual Review of Physiology</source><pubdate>2009</pubdate><volume>71</volume><issue>1</issue><fpage>307</fpage><lpage>332</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1146/annurev.physiol.010908.163209</pubid><pubid idtype="pmpid" link="fulltext">19575682</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>A Phylogenetic Analysis of the Lipocalin Protein 
Family</p></title><aug><au><snm>Ganfornina</snm><fnm>MD</fnm></au><au><snm>Gutierrez</snm><fnm>G</fnm></au><au><snm>Bastiani</snm><fnm>MSD</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2000</pubdate><volume>17</volume><issue>1</issue><fpage>114</fpage><lpage>126</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10666711</pubid></xrefbib></bibl><bibl id="B17"><title><p>Species Specificity in Major Urinary Proteins by Parallel Evolution</p></title><aug><au><snm>Logan</snm><fnm>DW</fnm></au><au><snm>Marton</snm><fnm>TF</fnm></au><au><snm>Stowers</snm><fnm>L</fnm></au></aug><source>PLoS ONE</source><pubdate>2008</pubdate><volume>3</volume><issue>9</issue><fpage>e3280</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0003280</pubid><pubid idtype="pmcid">2533699</pubid><pubid idtype="pmpid" link="fulltext">18815613</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Identification and analysis of unitary pseudogenes: historic and contemporary gene losses in humans and other primates</p></title><aug><au><snm>Zhang</snm><fnm>ZD</fnm></au><au><snm>Frankish</snm><fnm>A</fnm></au><au><snm>Hunt</snm><fnm>T</fnm></au><au><snm>Harrow</snm><fnm>J</fnm></au><au><snm>Gerstein</snm><fnm>M</fnm></au></aug><source>Genome Biol</source><pubdate>2010</pubdate><volume>11</volume><issue>3</issue><fpage>R26</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2010-11-3-r26</pubid><pubid idtype="pmcid">2864566</pubid><pubid idtype="pmpid" link="fulltext">20210993</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>A draft sequence of the Neandertal 
genome</p></title><aug><au><snm>Green</snm><fnm>RE</fnm></au><au><snm>Krause</snm><fnm>J</fnm></au><au><snm>Briggs</snm><fnm>AW</fnm></au><au><snm>Maricic</snm><fnm>T</fnm></au><au><snm>Stenzel</snm><fnm>U</fnm></au><au><snm>Kircher</snm><fnm>M</fnm></au><au><snm>Patterson</snm><fnm>N</fnm></au><au><snm>Li</snm><fnm>H</fnm></au><au><snm>Zhai</snm><fnm>W</fnm></au><au><snm>Fritz</snm><fnm>MH</fnm></au><etal/></aug><source>Science</source><pubdate>2010</pubdate><volume>328</volume><issue>5979</issue><fpage>710</fpage><lpage>722</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1188021</pubid><pubid idtype="pmpid" link="fulltext">20448178</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Ensembl's 10th year</p></title><aug><au><snm>Flicek</snm><fnm>P</fnm></au><au><snm>Aken</snm><fnm>BL</fnm></au><au><snm>Ballester</snm><fnm>B</fnm></au><au><snm>Beal</snm><fnm>K</fnm></au><au><snm>Bragin</snm><fnm>E</fnm></au><au><snm>Brent</snm><fnm>S</fnm></au><au><snm>Chen</snm><fnm>Y</fnm></au><au><snm>Clapham</snm><fnm>P</fnm></au><au><snm>Coates</snm><fnm>G</fnm></au><au><snm>Fairley</snm><fnm>S</fnm></au><etal/></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>38</volume><issue>Database issue</issue><fpage>D557</fpage><lpage>562</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2808936</pubid><pubid idtype="pmpid" link="fulltext">19906699</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Statistical tests for detecting gene conversion</p></title><aug><au><snm>Sawyer</snm><fnm>S</fnm></au></aug><source>Mol Biol Evol</source><pubdate>1989</pubdate><volume>6</volume><issue>5</issue><fpage>526</fpage><lpage>538</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">2677599</pubid></xrefbib></bibl><bibl id="B22"><title><p>Statistical methods for detecting molecular adaptation</p></title><aug><au><snm>Yang</snm><fnm>Z</fnm></au><au><snm>Bielawski</snm><fnm>JP</fnm></au></aug><source>Trends Ecol 
Evol</source><pubdate>2000</pubdate><volume>15</volume><issue>12</issue><fpage>496</fpage><lpage>503</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0169-5347(00)01994-7</pubid><pubid idtype="pmpid" link="fulltext">11114436</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes</p></title><aug><au><snm>Yang</snm><fnm>Z</fnm></au><au><snm>Swanson</snm><fnm>WJ</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2002</pubdate><volume>19</volume><issue>1</issue><fpage>49</fpage><lpage>57</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11752189</pubid></xrefbib></bibl><bibl id="B24"><title><p>PAML 4: phylogenetic analysis by maximum likelihood</p></title><aug><au><snm>Yang</snm><fnm>Z</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><issue>8</issue><fpage>1586</fpage><lpage>1591</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm088</pubid><pubid idtype="pmpid" link="fulltext">17483113</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Selecton: a server for detecting evolutionary forces at a single amino-acid site</p></title><aug><au><snm>Doron-Faigenboim</snm><fnm>A</fnm></au><au><snm>Stern</snm><fnm>A</fnm></au><au><snm>Mayrose</snm><fnm>I</fnm></au><au><snm>Bacharach</snm><fnm>E</fnm></au><au><snm>Pupko</snm><fnm>T</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><issue>9</issue><fpage>2101</fpage><lpage>2103</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti259</pubid><pubid idtype="pmpid" link="fulltext">15647294</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Conservation and prediction of solvent accessibility in protein 
families</p></title><aug><au><snm>Rost</snm><fnm>B</fnm></au><au><snm>Sander</snm><fnm>C</fnm></au></aug><source>Proteins</source><pubdate>1994</pubdate><volume>20</volume><issue>3</issue><fpage>216</fpage><lpage>226</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/prot.340200303</pubid><pubid idtype="pmpid">7892171</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages</p></title><aug><au><snm>Yang</snm><fnm>Z</fnm></au><au><snm>Nielsen</snm><fnm>R</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2002</pubdate><volume>19</volume><issue>6</issue><fpage>908</fpage><lpage>917</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">12032247</pubid></xrefbib></bibl><bibl id="B28"><title><p>Close correspondence between quantitative- and molecular-genetic divergence times for Neandertals and modern humans</p></title><aug><au><snm>Weaver</snm><fnm>TD</fnm></au><au><snm>Roseman</snm><fnm>CC</fnm></au><au><snm>Stringer</snm><fnm>CB</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2008</pubdate><volume>105</volume><issue>12</issue><fpage>4645</fpage><lpage>4649</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0709079105</pubid><pubid idtype="pmcid">2290803</pubid><pubid idtype="pmpid" link="fulltext">18347337</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>The evolution of gene duplications: classifying and distinguishing between models</p></title><aug><au><snm>Innan</snm><fnm>H</fnm></au><au><snm>Kondrashov</snm><fnm>F</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2010</pubdate><volume>11</volume><issue>2</issue><fpage>97</fpage><lpage>108</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">20051986</pubid></xrefbib></bibl><bibl id="B30"><title><p>The mechanism of expansion and the volatility it created in three pheromone gene clusters in the mouse (Mus musculus) 
genome</p></title><aug><au><snm>Karn</snm><fnm>RC</fnm></au><au><snm>Laukaitis</snm><fnm>CM</fnm></au></aug><source>Genome Biol Evol</source><pubdate>2009</pubdate><volume>1</volume><fpage>494</fpage><lpage>503</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2839280</pubid><pubid idtype="pmpid" link="fulltext">20333217</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Dynamic instability of the major urinary protein gene family revealed by genomic and phenotypic comparisons between C57 and 129 strain mice</p></title><aug><au><snm>Mudge</snm><fnm>JM</fnm></au><au><snm>Armstrong</snm><fnm>SD</fnm></au><au><snm>McLaren</snm><fnm>K</fnm></au><au><snm>Beynon</snm><fnm>RJ</fnm></au><au><snm>Hurst</snm><fnm>JL</fnm></au><au><snm>Nicholson</snm><fnm>C</fnm></au><au><snm>Robertson</snm><fnm>DH</fnm></au><au><snm>Wilming</snm><fnm>LG</fnm></au><au><snm>Harrow</snm><fnm>JL</fnm></au></aug><source>Genome Biol</source><pubdate>2008</pubdate><volume>9</volume><issue>5</issue><fpage>R91</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2008-9-5-r91</pubid><pubid idtype="pmcid">2441477</pubid><pubid idtype="pmpid" link="fulltext">18507838</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Gene conversion and functional divergence in the beta-globin gene family</p></title><aug><au><snm>Aguileta</snm><fnm>G</fnm></au><au><snm>Bielawski</snm><fnm>JP</fnm></au><au><snm>Yang</snm><fnm>Z</fnm></au></aug><source>J Mol Evol</source><pubdate>2004</pubdate><volume>59</volume><issue>2</issue><fpage>177</fpage><lpage>189</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-004-2612-0</pubid><pubid idtype="pmpid" link="fulltext">15486692</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Selection in the evolution of gene duplications</p></title><aug><au><snm>Kondrashov</snm><fnm>FA</fnm></au><au><snm>Rogozin</snm><fnm>IB</fnm></au><au><snm>Wolf</snm><fnm>YI</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Genome 
Biol</source><pubdate>2002</pubdate><volume>3</volume><issue>2</issue><fpage>RESEARCH0008</fpage><xrefbib><pubidlist><pubid idtype="pmcid">65685</pubid><pubid idtype="pmpid" link="fulltext">11864370</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Evolution and comparative genomics of odorant- and pheromone-associated genes in rodents</p></title><aug><au><snm>Emes</snm><fnm>RD</fnm></au><au><snm>Beatson</snm><fnm>SA</fnm></au><au><snm>Ponting</snm><fnm>CP</fnm></au><au><snm>Goodstadt</snm><fnm>L</fnm></au></aug><source>Genome Res</source><pubdate>2004</pubdate><volume>14</volume><issue>4</issue><fpage>591</fpage><lpage>602</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.1940604</pubid><pubid idtype="pmcid">383303</pubid><pubid idtype="pmpid" link="fulltext">15060000</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>A comparison of the human and chimpanzee olfactory receptor gene repertoires</p></title><aug><au><snm>Gilad</snm><fnm>Y</fnm></au><au><snm>Man</snm><fnm>O</fnm></au><au><snm>Glusman</snm><fnm>G</fnm></au></aug><source>Genome Res</source><pubdate>2005</pubdate><volume>15</volume><issue>2</issue><fpage>224</fpage><lpage>230</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.2846405</pubid><pubid idtype="pmcid">546523</pubid><pubid idtype="pmpid" link="fulltext">15687286</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Signatures of selection in the human olfactory receptor OR5I1 gene</p></title><aug><au><snm>Moreno-Estrada</snm><fnm>A</fnm></au><au><snm>Casals</snm><fnm>F</fnm></au><au><snm>Ramirez-Soriano</snm><fnm>A</fnm></au><au><snm>Oliva</snm><fnm>B</fnm></au><au><snm>Calafell</snm><fnm>F</fnm></au><au><snm>Bertranpetit</snm><fnm>J</fnm></au><au><snm>Bosch</snm><fnm>E</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2008</pubdate><volume>25</volume><issue>1</issue><fpage>144</fpage><lpage>154</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">17981927</pubid></xrefbib></bibl><bibl 
id="B37"><title><p>A scan for positively selected genes in the genomes of humans and chimpanzees</p></title><aug><au><snm>Nielsen</snm><fnm>R</fnm></au><au><snm>Bustamante</snm><fnm>C</fnm></au><au><snm>Clark</snm><fnm>AG</fnm></au><au><snm>Glanowski</snm><fnm>S</fnm></au><au><snm>Sackton</snm><fnm>TB</fnm></au><au><snm>Hubisz</snm><fnm>MJ</fnm></au><au><snm>Fledel-Alon</snm><fnm>A</fnm></au><au><snm>Tanenbaum</snm><fnm>DM</fnm></au><au><snm>Civello</snm><fnm>D</fnm></au><au><snm>White</snm><fnm>TJ</fnm></au><etal/></aug><source>PLoS Biol</source><pubdate>2005</pubdate><volume>3</volume><issue>6</issue><fpage>e170</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.0030170</pubid><pubid idtype="pmcid">1088278</pubid><pubid idtype="pmpid" link="fulltext">15869325</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Dynamic functional evolution of an odorant receptor for sex-steroid-derived odors in primates</p></title><aug><au><snm>Zhuang</snm><fnm>H</fnm></au><au><snm>Chien</snm><fnm>MS</fnm></au><au><snm>Matsunami</snm><fnm>H</fnm></au></aug><source>Proceedings of the National Academy of Sciences</source><pubdate>2009</pubdate><volume>106</volume><issue>50</issue><fpage>21247</fpage><lpage>21251</lpage><xrefbib><pubid idtype="doi">10.1073/pnas.0808378106</pubid></xrefbib></bibl><bibl id="B39"><title><p>Adaptive diversification of vomeronasal receptor 1 genes in rodents</p></title><aug><au><snm>Shi</snm><fnm>P</fnm></au><au><snm>Bielawski</snm><fnm>JP</fnm></au><au><snm>Yang</snm><fnm>H</fnm></au><au><snm>Zhang</snm><fnm>YP</fnm></au></aug><source>J Mol Evol</source><pubdate>2005</pubdate><volume>60</volume><issue>5</issue><fpage>566</fpage><lpage>576</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-004-0172-y</pubid><pubid idtype="pmpid" link="fulltext">15983866</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Molecular genetics and evolution of pheromone biosynthesis in 
Lepidoptera</p></title><aug><au><snm>Roelofs</snm><fnm>WL</fnm></au><au><snm>Rooney</snm><fnm>AP</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2003</pubdate><volume>100</volume><issue>16</issue><fpage>9179</fpage><lpage>9184</lpage><xrefbib><pubidlist><pubid idtype="pmcid">170892</pubid><pubid idtype="pmpid" link="fulltext">12876197</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Major urinary proteins, alpha(2U)-globulins and aphrodisin</p></title><aug><au><snm>Cavaggioni</snm><fnm>A</fnm></au><au><snm>Mucignat-Caretta</snm><fnm>C</fnm></au></aug><source>Biochim Biophys Acta</source><pubdate>2000</pubdate><volume>1482</volume><issue>1-2</issue><fpage>218</fpage><lpage>228</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0167-4838(00)00149-7</pubid><pubid idtype="pmpid" link="fulltext">11058763</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Evolution of reproductive proteins from animals and plants</p></title><aug><au><snm>Clark</snm><fnm>NL</fnm></au><au><snm>Aagaard</snm><fnm>JE</fnm></au><au><snm>Swanson</snm><fnm>WJ</fnm></au></aug><source>Reproduction</source><pubdate>2006</pubdate><volume>131</volume><issue>1</issue><fpage>11</fpage><lpage>22</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1530/rep.1.00357</pubid><pubid idtype="pmpid" link="fulltext">16388004</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Sensory genes and mate choice: evidence that duplications, mutations, and adaptive evolution alter variation in mating cue genes and their receptors</p></title><aug><au><snm>Horth</snm><fnm>L</fnm></au></aug><source>Genomics</source><pubdate>2007</pubdate><volume>90</volume><issue>2</issue><fpage>159</fpage><lpage>175</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ygeno.2007.03.021</pubid><pubid idtype="pmpid" link="fulltext">17544617</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Rate of molecular evolution of the seminal protein gene SEMG2 correlates with levels of 
female promiscuity</p></title><aug><au><snm>Dorus</snm><fnm>S</fnm></au><au><snm>Evans</snm><fnm>PD</fnm></au><au><snm>Wyckoff</snm><fnm>GJ</fnm></au><au><snm>Choi</snm><fnm>SS</fnm></au><au><snm>Lahn</snm><fnm>BT</fnm></au></aug><source>Nat Genet</source><pubdate>2004</pubdate><volume>36</volume><issue>12</issue><fpage>1326</fpage><lpage>1329</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1471</pubid><pubid idtype="pmpid" link="fulltext">15531881</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>Positive selection in tick saliva proteins of the Salp15 family</p></title><aug><au><snm>Schwalie</snm><fnm>PC</fnm></au><au><snm>Schultz</snm><fnm>J</fnm></au></aug><source>J Mol Evol</source><pubdate>2009</pubdate><volume>68</volume><issue>2</issue><fpage>186</fpage><lpage>191</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-008-9194-1</pubid><pubid idtype="pmpid" link="fulltext">19159966</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Identifying concerted evolution and gene conversion in mammalian gene pairs lasting over 100 million years</p></title><aug><au><snm>Carson</snm><fnm>AR</fnm></au><au><snm>Scherer</snm><fnm>SW</fnm></au></aug><source>BMC Evol Biol</source><pubdate>2009</pubdate><volume>9</volume><issue>156</issue><fpage>156</fpage><xrefbib><pubidlist><pubid idtype="pmcid">2720389</pubid><pubid idtype="pmpid" link="fulltext">19583854</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p></title><aug><au><snm>Thompson</snm><fnm>JD</fnm></au><au><snm>Higgins</snm><fnm>DG</fnm></au><au><snm>Gibson</snm><fnm>TJ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1994</pubdate><volume>22</volume><issue>22</issue><fpage>4673</fpage><lpage>4680</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/22.22.4673</pubid><pubid 
idtype="pmcid">308517</pubid><pubid idtype="pmpid" link="fulltext">7984417</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</p></title><aug><au><snm>Guindon</snm><fnm>S</fnm></au><au><snm>Gascuel</snm><fnm>O</fnm></au></aug><source>Syst Biol</source><pubdate>2003</pubdate><volume>52</volume><issue>5</issue><fpage>696</fpage><lpage>704</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150390235520</pubid><pubid idtype="pmpid" link="fulltext">14530136</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Confidence limits on phylogenies: an approach using the bootstrap</p></title><aug><au><snm>Felsenstein</snm><fnm>J</fnm></au></aug><source>Evolution</source><pubdate>1985</pubdate><volume>39</volume><issue>4</issue><fpage>783</fpage><lpage>791</lpage><xrefbib><pubid idtype="doi">10.2307/2408678</pubid></xrefbib></bibl><bibl id="B50"><title><p>Evaluation of methods for detecting recombination from DNA sequences: empirical data</p></title><aug><au><snm>Posada</snm><fnm>D</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2002</pubdate><volume>19</volume><issue>5</issue><fpage>708</fpage><lpage>717</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11961104</pubid></xrefbib></bibl><bibl id="B51"><title><p>PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments</p></title><aug><au><snm>Suyama</snm><fnm>M</fnm></au><au><snm>Torrents</snm><fnm>D</fnm></au><au><snm>Bork</snm><fnm>P</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2006</pubdate><volume>34</volume><issue>Web Server issue</issue><fpage>W609</fpage><lpage>W612</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1538804</pubid><pubid idtype="pmpid" link="fulltext">16845082</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Likelihood models for detecting positively selected amino acid sites and applications to 
the HIV-1 envelope gene</p></title><aug><au><snm>Nielsen</snm><fnm>R</fnm></au><au><snm>Yang</snm><fnm>Z</fnm></au></aug><source>Genetics</source><pubdate>1998</pubdate><volume>148</volume><issue>3</issue><fpage>929</fpage><lpage>936</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1460041</pubid><pubid idtype="pmpid" link="fulltext">9539414</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Maximum likelihood estimation on large phylogenies and analysis of adaptive evolution in human influenza virus A</p></title><aug><au><snm>Yang</snm><fnm>Z</fnm></au></aug><source>J Mol Evol</source><pubdate>2000</pubdate><volume>51</volume><issue>5</issue><fpage>423</fpage><lpage>432</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11080365</pubid></xrefbib></bibl><bibl id="B54"><title><p>Pervasive adaptive evolution in mammalian fertilization proteins</p></title><aug><au><snm>Swanson</snm><fnm>WJ</fnm></au><au><snm>Nielsen</snm><fnm>R</fnm></au><au><snm>Yang</snm><fnm>Q</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2003</pubdate><volume>20</volume><issue>1</issue><fpage>18</fpage><lpage>20</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">12519901</pubid></xrefbib></bibl><bibl id="B55"><title><p>A combined empirical and mechanistic codon model</p></title><aug><au><snm>Doron-Faigenboim</snm><fnm>A</fnm></au><au><snm>Pupko</snm><fnm>T</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><issue>2</issue><fpage>388</fpage><lpage>397</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">17110464</pubid></xrefbib></bibl><bibl id="B56"><title><p>Bayes empirical Bayes inference of amino acid sites under positive selection</p></title><aug><au><snm>Yang</snm><fnm>Z</fnm></au><au><snm>Wong</snm><fnm>WS</fnm></au><au><snm>Nielsen</snm><fnm>R</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2005</pubdate><volume>22</volume><issue>4</issue><fpage>1107</fpage><lpage>1118</lpage><xrefbib><pubidlist><pubid 
idtype="doi">10.1093/molbev/msi097</pubid><pubid idtype="pmpid" link="fulltext">15689528</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level</p></title><aug><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Nielsen</snm><fnm>R</fnm></au><au><snm>Yang</snm><fnm>Z</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2005</pubdate><volume>22</volume><issue>12</issue><fpage>2472</fpage><lpage>2479</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msi237</pubid><pubid idtype="pmpid" link="fulltext">16107592</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>ASAView: database and tool for solvent accessibility representation in proteins</p></title><aug><au><snm>Ahmad</snm><fnm>S</fnm></au><au><snm>Gromiha</snm><fnm>M</fnm></au><au><snm>Fawareh</snm><fnm>H</fnm></au><au><snm>Sarai</snm><fnm>A</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2004</pubdate><volume>5</volume><issue>51</issue><fpage>51</fpage><xrefbib><pubidlist><pubid idtype="pmcid">420234</pubid><pubid idtype="pmpid" link="fulltext">15119964</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>UCSF Chimera--a visualization system for exploratory research and analysis</p></title><aug><au><snm>Pettersen</snm><fnm>EF</fnm></au><au><snm>Goddard</snm><fnm>TD</fnm></au><au><snm>Huang</snm><fnm>CC</fnm></au><au><snm>Couch</snm><fnm>GS</fnm></au><au><snm>Greenblatt</snm><fnm>DM</fnm></au><au><snm>Meng</snm><fnm>EC</fnm></au><au><snm>Ferrin</snm><fnm>TE</fnm></au></aug><source>J Comput Chem</source><pubdate>2004</pubdate><volume>25</volume><issue>13</issue><fpage>1605</fpage><lpage>1612</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/jcc.20084</pubid><pubid idtype="pmpid" link="fulltext">15264254</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>