<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2148-11-166</ui><ji>1471-2148</ji><fm>
<dochead>Research article</dochead>
<bibl>
<title>
<p>Evolution of the mammalian lysozyme gene family</p>
</title>
<aug>
<au ca="yes" id="A1"><snm>Irwin</snm><mi>M</mi><fnm>David</fnm><insr iid="I1"/><insr iid="I2"/><email>david.irwin@utoronto.ca</email></au>
<au id="A2"><snm>Biegel</snm><mi>M</mi><fnm>Jason</fnm><insr iid="I3"/><email>jb376641@albany.edu</email></au>
<au id="A3"><snm>Stewart</snm><fnm>Caro-Beth</fnm><insr iid="I3"/><email>cstewart@albany.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada</p></ins>
<ins id="I2"><p>Banting and Best Diabetes Centre, University of Toronto, Toronto, Canada</p></ins>
<ins id="I3"><p>Department of Biological Sciences, University at Albany, State University of New York, Albany, New York 12222, USA</p></ins>
</insg>
<source>BMC Evolutionary Biology</source>
<issn>1471-2148</issn>
<pubdate>2011</pubdate>
<volume>11</volume>
<issue>1</issue>
<fpage>166</fpage>
<url>http://www.biomedcentral.com/1471-2148/11/166</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-11-166</pubid><pubid idtype="pmpid">21676251</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>17</day><month>10</month><year>2010</year></date></rec><acc><date><day>15</day><month>6</month><year>2011</year></date></acc><pub><date><day>15</day><month>6</month><year>2011</year></date></pub></history>
<cpyrt><year>2011</year><collab>Irwin et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Lysozyme <it>c </it>(chicken-type lysozyme) has an important role in host defense, and has been extensively studied as a model in molecular biology, enzymology, protein chemistry, and crystallography. Traditionally, lysozyme <it>c </it>has been considered to be part of a small family that includes genes for two other proteins, lactalbumin, which is found only in mammals, and calcium-binding lysozyme, which is found in only a few species of birds and mammals. More recently, additional testes-expressed members of this family have been identified in human and mouse, suggesting that the mammalian lysozyme gene family is larger than previously known.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Here we characterize the extent and diversity of the lysozyme gene family in the genomes of phylogenetically diverse mammals, and show that this family contains at least eight different genes that likely duplicated prior to the diversification of extant mammals. These duplicated genes have largely been maintained, both in intron-exon structure and in genomic context, throughout mammalian evolution.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>The mammalian lysozyme gene family is much larger than previously appreciated and consists of at least eight distinct genes scattered around the genome. Since the lysozyme <it>c </it>and lactalbumin proteins have acquired very different functions during evolution, it is likely that many of the other members of the lysozyme-like family will also have diverse and unexpected biological properties.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>The vertebrate lysozyme gene family has traditionally been considered to be composed of three genes: lysozyme <it>c</it>, lactalbumin, and calcium-binding lysozyme <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
</abbrgrp>. Lysozyme <it>c</it>, chicken-type (or conventional) lysozyme, is a bacteriolytic enzyme that is secreted into many body fluids of mammals (<it>e.g., </it>blood, tears, and milk) and is found at a high concentration in the eggs of many bird species <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B5">5</abbr>
</abbrgrp>. Lysozyme <it>c </it>is widespread in nature; its protein and gene sequences have been characterized from numerous diverse vertebrate and non-vertebrate species <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. Lactalbumin is related to lysozyme, with around 40% amino acid identity and nearly identical three-dimensional structure, but lacks its bacteriolytic activity <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B7">7</abbr>
</abbrgrp>. Lactalbumin is expressed in lactating mammary glands, where it binds a calcium ion and modifies the activity of &#946;-galactosyltransferase-1, such that the complex catalyzes the synthesis of lactose <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B7">7</abbr>
</abbrgrp>. Lactalbumin has recently been shown to have a second activity in the gut, where it loses the calcium ion and binds a fatty acid; this new form of lactalbumin appears to promote apoptosis of tumor cells, and thus has been renamed HAMLET (human lactalbumin made lethal to tumors) <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>. Lactalbumin appears to be found only in mammals, and is widely distributed in this group. Calcium-binding lysozyme has bacteriolytic activity like lysozyme <it>c</it>, but also shares with lactalbumin the ability to bind a calcium ion. Calcium-binding lysozymes appear to be relatively rare; they have been found in the milk of only a few mammalian species (<it>e.g</it>., horse, dog, cat, seal, and echidna), as well as in the eggs (<it>e.g., </it>pigeon) and stomachs (<it>e.g</it>., hoatzin) of some bird species <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B9">9</abbr>
</abbrgrp>. Indeed, calcium-binding lysozyme genes have not been reported for the human or rodent genomes.</p>
<p>Previous phylogenetic analyses of lysozyme <it>c</it>, lactalbumin, and calcium-binding lysozyme sequences had suggested that the earliest divergences within this gene family occurred between lysozyme <it>c </it>and the ancestor of the genes for lactalbumin and calcium-binding lysozyme, and that this initial gene duplication may have preceded the divergence of the lineages leading to fish and mammals <abbrgrp>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
</abbrgrp>. The separation of the lactalbumin and calcium-binding lysozyme genes was proposed to be more recent, with some studies <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B12">12</abbr>
</abbrgrp> suggesting a divergence on the early mammalian lineage, which would be consistent with the restriction of the lactalbumin gene to mammals. In contrast, another study <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp> suggested that the duplication generating the lactalbumin and calcium-binding lysozyme genes predated the bird-mammal divergence. Moreover, the orthology of the mammalian and avian calcium-binding lysozymes has even been questioned <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B11">11</abbr>
</abbrgrp>. Thus, the origin of these mammalian lysozyme-like genes remains an open question.</p>
<p>Recently, cDNAs for several additional lysozyme-like sequences have been identified from human testis cDNA libraries <abbrgrp>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
<abbr bid="B15">15</abbr>
</abbrgrp>. These cDNAs were found to be encoded by genes that are now annotated by <it>Ensembl </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> as <it>LYZL </it>(lysozyme-like): <it>LYZL2, LYZL4, LYZL6</it>, and <it>LYZL3 </it>(synonym <it>SPACA3</it>, where <it>SPACA </it>stands for sperm acrosome associated <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>; <it>SPACA3 </it>is also known as <it>SPRSA </it>
<abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp> and <it>SLLP1 </it>
<abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp>). The predicted protein sequences of some of these lysozyme-like sequences have amino acid substitutions at sites important for the catalytic activity of lysozyme, suggesting that these proteins would not be able to hydrolyze the glycosidic bonds of bacterial peptidoglycan <abbrgrp>
<abbr bid="B13">13</abbr>
<abbr bid="B15">15</abbr>
</abbrgrp>. Since these four new lysozyme-like genes (<it>LYZL2, LYZL4, LYZL6</it>, and <it>SPACA3</it>) are expressed predominantly in the testes, it has been suggested that they might have a role in reproduction <abbrgrp>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
<abbr bid="B15">15</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>. Such a role has been shown for <it>Lyzl4 </it>and <it>Spaca3 </it>in mice <abbrgrp>
<abbr bid="B18">18</abbr>
<abbr bid="B19">19</abbr>
</abbrgrp>.</p>
<p>The identification of these <it>LYZL </it>genes in the human genome suggests that the mammalian lysozyme-like gene family is larger than previously appreciated, and raises the possibility that the lysozyme-like proteins encoded by these genes may have novel biological functions. Here we performed extensive similarity searches of the human and other vertebrate genomes, and thereby identified three additional intact lysozyme-like genes in the human genome; these have been annotated in the databases, but not reported in the literature. We also identified multiple lysozyme-like genes in the genomes of diverse vertebrates. Using a combination of phylogenetic and genomic neighborhood (or synteny) analyses, in which we examined the relationships of the genes that flank the lysozyme-like genes in diverse species, we demonstrate that orthologs of the human lysozyme-like genes are found in the genomes of diverse mammalian species. Our analyses suggest that there were at least six, and perhaps as many as nine, diverse types (or subfamilies) of lysozyme-like genes in the genome of the common ancestor of all extant mammals, and that these diverse genes have been maintained on most mammalian lineages. This suggests that their protein products probably have essential biological functions that are yet to be identified.</p>
</sec>
<sec>
<st>
<p>Results and Discussion</p>
</st>
<sec>
<st>
<p>Number of Lysozyme Genes in the Human Genome</p>
</st>
<p>To determine the size of the lysozyme-like gene family, we performed <it>BLAST </it>
<abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp> similarity searches of the human genome for sequences predicted to encode proteins similar to lysozyme <it>c</it>, and thereby identified a total of nine annotated genes (Table <tblr tid="T1">1</tblr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). Of these nine annotated genes, six had previously been characterized: lysozyme <it>c </it>(<it>LYZ</it>) <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>, lactalbumin (<it>LALBA</it>, Synonym: <it>LYZL7</it>) <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, <it>LYZL2, LYZL4, LYZL6 </it>
<abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>, and <it>SPACA3 </it>(Synonyms: <it>LYZL3, SPRSA, SLLP1</it>) <abbrgrp>
<abbr bid="B13">13</abbr>
<abbr bid="B14">14</abbr>
</abbrgrp>. The three remaining genes identified in our <it>BLAST </it>searches -- the <it>LYZL1, SPACA5 </it>(Synonym: <it>LYZL5</it>), and <it>SPACA5B </it>genes -- had been annotated as lysozyme-like in <it>Ensembl </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>, but have not been discussed in the literature. A tenth lysozyme-like sequence was later identified using our genomic neighborhood analysis (see below), but appears to be a pseudogene (<it>&#968;LYSC1</it>). These ten lysozyme-like sequences are distributed over five chromosomes in humans, with two genes each on chromosomes 10, 17, and X, three genes on chromosome 12, and one gene on chromosome 3 (Table <tblr tid="T1">1</tblr>). Each of the potentially functional genes predicts a protein sequence about 140-150 amino acids long, similar to lysozyme <it>c </it>and lactalbumin. The mature regions of these proteins are readily aligned due to the presence of many highly conserved residues, including the eight cysteines known to be involved in disulfide bonds in lysozyme and lactalbumin (Figure <figr fid="F1">1</figr>). Each of the lysozyme-like genes has been annotated as being composed of four or five exons. The 140-150 amino acid coding regions are spread over four exons in each of the genes with the introns in exactly the same locations, including phases, as found in the lysozyme <it>c </it>and lactalbumin genes <abbrgrp>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp> (Figure <figr fid="F1">1</figr>). Moreover, the <it>LYZL1, LYZL2, LYZL4, LYZL6</it>, and <it>SPACA3 </it>genes are annotated as having an additional 5' exon, which in some cases might be translated to produce proteins that have longer N-terminal regions (especially in the case of <it>SPACA3</it>; not shown in Figure <figr fid="F1">1</figr>).</p>
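<p>The notion of introns sharing the same phase can be made concrete with a short sketch. The phase of an intron is the number of coding nucleotides that precede it, taken modulo 3. The Python fragment below is only an illustration: the function name <it>intron_phases </it>and the exon lengths are hypothetical and are not taken from the annotations discussed above.</p>

```python
from itertools import accumulate


def intron_phases(coding_exon_lengths):
    """Return the phase (0, 1, or 2) of each intron in a gene.

    Phase 0: the intron falls between two codons.
    Phase 1: the intron follows the first base of a codon.
    Phase 2: the intron follows the second base of a codon.
    """
    # Cumulative coding length at the end of every exon except the last,
    # because the last exon is not followed by an intron.
    boundaries = list(accumulate(coding_exon_lengths))[:-1]
    return [position % 3 for position in boundaries]


# Hypothetical coding-exon lengths (nucleotides) for two four-exon genes.
# These values are invented for illustration, not read from Ensembl.
gene_a = [136, 165, 79, 67]
gene_b = [139, 159, 79, 70]

phases_a = intron_phases(gene_a)
phases_b = intron_phases(gene_b)

# Two genes share intron phases when the lists are identical.
print(phases_a, phases_b, phases_a == phases_b)
```

<p>In this toy case both genes yield the phases 1, 1, and 2 even though their exon lengths differ. Matching phases alone do not establish that two introns are homologous; their positions in the protein alignment must also coincide, as indicated by the triangles in Figure <figr fid="F1">1</figr>.</p>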
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Chromosomal location of human lysozyme-like genes</p></caption><tblbdy cols="6">
      <r>
         <c ca="center">
            <p>
               <b>Gene</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Chromosome<sup>a</sup></b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Strand<sup>a</sup></b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Position<sup>a</sup></b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Intact<sup>b</sup></b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Protein ID<sup>c</sup></b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>LYZ</it>
            </p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>69,742,134 - 69,748,013</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000261267</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>LALBA</it>
            </p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>48,961,468 - 48,963,829</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000301046</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>&#968;LYSC1</it>
            </p>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>49,024,938 - 49,026,186</p>
         </c>
         <c ca="center">
            <p>N</p>
         </c>
         <c ca="center">
            <p>None (pseudogene)</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>LYZL1</it>
            </p>
         </c>
         <c ca="center">
            <p>10</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>29,577,990 - 29,607,257</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000364650</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>LYZL2</it>
            </p>
         </c>
         <c ca="center">
            <p>10</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>30,895,152 - 30,918,691</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000364467</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>LYZL4</it>
            </p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>42,438,570 - 42,452,092</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000287748</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>LYZL6</it>
            </p>
         </c>
         <c ca="center">
            <p>17</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>34,261,548 - 34,270,674</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000293274</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>SPACA3</it>
            </p>
         </c>
         <c ca="center">
            <p>17</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>31,318,887 - 31,324,895</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000269053</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>SPACA5</it>
            </p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>47,863,734 - 47,869,126</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000366139</p>
         </c>
      </r>
      <r>
         <c ca="center">
            <p>
               <it>SPACA5B</it>
            </p>
         </c>
         <c ca="center">
            <p>X</p>
         </c>
         <c ca="center">
            <p>+</p>
         </c>
         <c ca="center">
            <p>47,986,603 - 47,991,995</p>
         </c>
         <c ca="center">
            <p>Y</p>
         </c>
         <c ca="center">
            <p>ENSP00000304762</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>a </sup>- Chromosomal localization, strand, and coordinates from release 57 of the human genome available in the <it>Ensembl </it>database (<url>http://www.ensembl.org</url><abbrgrp><abbr bid="B16">16</abbr></abbrgrp>)</p>
      <p><sup>b </sup>- Y, full-length protein sequence; N, no intact open reading frame</p>
      <p><sup>c </sup>- <it>Ensembl </it>release 57 protein ID, None means no <it>Ensembl </it>Protein ID is available</p>
   </tblfn></tbl>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Supplementary Table 1</b>. This file is in PDF format. Location of lysozyme genes in vertebrate genomes.</p>
</text>
<file name="1471-2148-11-166-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Alignment of human lysozyme-like proteins</p></caption><text>
   <p><b>Alignment of human lysozyme-like proteins</b>. Amino acid sequences of predicted human lysozyme-like proteins are aligned. The number +1 identifies the N-terminal amino acid of the mature protein and the signal peptides are shown in italics, based upon homology with lysozyme and lactalbumin. The number symbols (#) identify the active site residues (positions Glu-35 and Asp-52 of the mature protein sequence) of lysozyme <it>c</it>. The solid black triangles above the sequences indicate the positions in the coding sequence that are interrupted by introns in the genes. Asterisks below the sequences identify residues that are perfectly conserved among the sequences, while the symbols ":" and "." indicate conserved and semi-conserved residues, respectively.</p>
</text><graphic file="1471-2148-11-166-1" hint_layout="single"/></fig>
<p>Most of the predicted human proteins (Figure <figr fid="F1">1</figr>) show between 30% and 53% amino acid sequence identity in pairwise comparisons (Table <tblr tid="T2">2</tblr>), suggesting that the gene duplications that gave rise to them are fairly ancient (or, alternatively, that these proteins have evolved at extremely rapid rates). If these gene duplications were indeed ancient, then we would expect to find these genes in diverse mammalian species (as we did; see below). In contrast, the LYZL1/LYZL2 and SPACA5/SPACA5B protein pairs are 97% and 100% identical, respectively (Table <tblr tid="T2">2</tblr>); this suggests that their genes likely duplicated fairly recently, and thus these duplicates are predicted to be more limited phylogenetically. The <it>LYZL1 </it>and <it>LYZL2 </it>genes are both located on human chromosome 10, but are separated by about 1 Mb; moreover, these genes are embedded within 60 kb long repeated sequences that are greater than 95% identical (not shown). This suggests that the <it>LYZL1 </it>and <it>LYZL2 </it>gene duplicates were generated as part of a recent segmental duplication on chromosome 10. Likewise, the <it>SPACA5 </it>and <it>SPACA5B </it>genes are both on the X chromosome, separated by about 120 kb, and are within long (~100 kb) repeated DNA sequences that have high sequence identity (not shown). Thus, the <it>LYZL1/LYZL2 </it>and <it>SPACA5/SPACA5B </it>gene pairs both appear to have originated from relatively recent and large genomic segmental duplications, a common form of gene duplication in mammals <abbrgrp>
<abbr bid="B23">23</abbr>
</abbrgrp>. The high sequence identity of these gene pairs also could be due, at least in part, to concerted evolution <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>, as discussed below.</p>
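<p>For readers who wish to reproduce a matrix of this kind, the following Python sketch computes pairwise percent identity from an existing alignment. It rests on assumptions: the text does not state how gapped columns were treated, so the sketch simply ignores any column in which either sequence has a gap, and the three short sequences are invented fragments rather than the human proteins analyzed here.</p>

```python
from itertools import combinations


def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only the columns where both sequences carry a residue.
    compared = [(a, b) for a, b in zip(seq_a, seq_b) if a != gap and b != gap]
    if not compared:
        return 0.0
    matches = sum(1 for a, b in compared if a == b)
    return 100.0 * matches / len(compared)


# Invented aligned fragments, used only to exercise the function.
alignment = {
    "toyA": "KVFERCELAR",
    "toyB": "KVFSRCELAK",
    "toyC": "KIY-RCDLAR",
}

# Upper-triangular matrix of identities, one entry per unordered pair.
identity = {
    (x, y): percent_identity(alignment[x], alignment[y])
    for x, y in combinations(alignment, 2)
}

for pair, value in identity.items():
    print(pair, round(value))
```

<p>Other conventions, such as dividing by the length of the shorter sequence or by the full alignment length, give somewhat different percentages, so the denominator should be stated whenever such values are compared across studies.</p>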
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Pairwise identity, in percent, of human lysozyme-like protein sequences</p></caption><tblbdy cols="9">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>LALBA</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>LYZL1</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>LYZL2</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>LYZL4</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>LYZL6</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>SPACA3</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>SPACA5</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>SPACA5B</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LYZ</p>
         </c>
         <c ca="center">
            <p>37</p>
         </c>
         <c ca="center">
            <p>49</p>
         </c>
         <c ca="center">
            <p>49</p>
         </c>
         <c ca="center">
            <p>41</p>
         </c>
         <c ca="center">
            <p>45</p>
         </c>
         <c ca="center">
            <p>53</p>
         </c>
         <c ca="center">
            <p>44</p>
         </c>
         <c ca="center">
            <p>44</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LALBA</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="center">
            <p>30</p>
         </c>
         <c ca="center">
            <p>32</p>
         </c>
         <c ca="center">
            <p>33</p>
         </c>
         <c ca="center">
            <p>32</p>
         </c>
         <c ca="center">
            <p>32</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LYZL1</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>97</p>
         </c>
         <c ca="center">
            <p>43</p>
         </c>
         <c ca="center">
            <p>43</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LYZL2</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>43</p>
         </c>
         <c ca="center">
            <p>43</p>
         </c>
         <c ca="center">
            <p>46</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LYZL4</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>46</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
         <c ca="center">
            <p>40</p>
         </c>
         <c ca="center">
            <p>40</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>LYZL6</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>45</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
         <c ca="center">
            <p>47</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>SPACA3</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>49</p>
         </c>
         <c ca="center">
            <p>49</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>SPACA5</p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>100</p>
         </c>
      </r>
   </tblbdy></tbl>
</sec>
<sec>
<st>
<p>Lysozyme Genes in Other Vertebrate Genomes</p>
</st>
<p>To determine whether an expanded lysozyme-like gene family is a general feature of mammalian (and other vertebrate) genomes, we conducted intensive homology searches for lysozyme-like genes in all vertebrate genomes available in the <it>Ensembl </it>and <it>Pre!Ensembl </it>databases (<url>http://www.ensembl.org</url>, <url>http://pre.ensembl.org/index.html</url>) <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B25">25</abbr>
</abbrgrp>. These results are listed in Supplementary Table <tblr tid="T1">1</tblr> (Additional file <supplr sid="S1">1</supplr>: Table S1), and summarized for mammals in Figure <figr fid="F2">2</figr>. At least one lysozyme-like gene was found in every vertebrate genome except that of the lamprey. Multiple lysozyme-like genes were identified in every mammalian species, ranging from 5 in opossum to 18 in cow. The fewest genes were found in bony fish, where only one gene was identified per genome, and in birds, where at most two genes were identified per genome. The anole lizard and <it>Xenopus tropicalis</it>, the lone representatives of reptiles and amphibians with sequenced genomes, had 8 and 16 lysozyme-like genes, respectively.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Diversity and evolution of mammalian lysozyme-like genes</p></caption><text>
   <p><b>Diversity and evolution of mammalian lysozyme-like genes</b>. A phylogeny, adapted from recent phylogenomic analyses <abbrgrp><abbr bid="B71">71</abbr><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr></abbrgrp>, of the mammalian species examined in this study is shown; branch lengths are not proportional to evolutionary time. The numbers of each type of lysozyme-like gene identified in the mammalian genomes are shown by symbols on the right of the figure. Species with genomes of higher quality (better coverage and assembly) are labeled in bold and with asterisks. The status of the genes in each genome is indicated by symbols, listed in the key box in the figure: solid boxes indicate genes that are predicted to encode full-length sequences; open boxes indicate incomplete genes, which may reflect either incomplete genome sequences or pseudogenes; the Greek letter psi identifies well-characterized pseudogenes; zero indicates that no gene was identified in our searches; and the 'not' sign indicates genes that appear to have been deleted from the genomes. The question mark beside the sloth <it>Lyzl8 </it>gene indicates that this is a tentative assignment. Inferred evolutionary events are mapped onto their most likely lineage, based upon the data presented in this figure, coupled with our various evolutionary analyses. Duplication of genes is indicated by a gene name with an arc over it. Gene deletion is indicated by an X preceding the name of the deleted gene. X? means that the deletion is uncertain, as the missing gene may actually exist in a gap in the descendant genome(s). Pseudogene generation is indicated by the psi (&#968;) that precedes the gene name; if the pseudogene is associated with a gene duplication event it also has an arc. If multiple events are shown on a lineage, the order shown does not imply historical order of occurrence.</p>
</text><graphic file="1471-2148-11-166-2" hint_layout="double"/></fig>
<p>Importantly, the <it>BLAST </it>searches identified potential orthologs in most mammalian genomes (Figure <figr fid="F2">2</figr>) of all of the divergent lysozyme-like genes found in the human genome (<it>LYZ, LALBA, LYZL1</it>/<it>2, LYZL4, LYZL6, SPACA3, </it>and <it>SPACA5</it>). Initial assignments of orthology of these mammalian genes were based upon sequence similarity, but were subsequently confirmed by performing genomic neighborhood and phylogenetic analyses (see below). Our combined findings about gene number and orthologous relationships of the mammalian lysozyme-like gene family members are outlined in Figure <figr fid="F2">2</figr>. In contrast to the mammalian genes, the lysozyme-like genes found in the other vertebrate genomes could not be readily classified into the above subfamilies based upon sequence similarities; this is because these genes and their encoded proteins displayed similar levels of sequence identity to all of the different mammalian paralogs. Furthermore, no evidence of synteny with the mammalian genes was found for any of the non-mammalian vertebrate genes, except for the lysozyme <it>c </it>gene. Thus, none of the non-mammalian vertebrate lysozyme-like genes could be definitively classified as orthologs of any of the lysozyme-like mammalian genes, other than <it>Lyz </it>itself.</p>
<p>Many of the genes that were identified in our searches were only partial sequences, most likely due to the incomplete nature of the genomes in question. However, all but one of these genes were consistent with a structure similar to that of the mammalian lysozyme and lactalbumin genes -- that is, their coding regions appeared to be composed of four exons having similar intron-exon structures <abbrgrp>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. The lone exception was a <it>Lyzl1/2</it>-like gene found in the treeshrew (from Genescaffold_6044), which had a nearly full-length coding sequence that contained stop codons and frameshifts, but no introns; thus, this gene appears to be a processed pseudogene. Taken together, these observations suggest that essentially all of the vertebrate lysozyme-like genes have been generated by duplications of genomic DNA, rather than by reverse transcription and insertion into the genome.</p>
</sec>
<sec>
<st>
<p>Phylogeny of Vertebrate Lysozymes</p>
</st>
<p>The presence of multiple lysozyme-like genes in all mammalian genomes, as well as in the genomes of several other vertebrate species, raises the possibility that the lysozyme-like gene family may have amplified early in vertebrate evolution. To examine this issue, and to further establish the orthology-paralogy relationships of lysozyme-like genes, we conducted a series of phylogenetic analyses (Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S2">2</supplr>-<supplr sid="S4">4</supplr>: Figures S1-S3).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Phylogeny of vertebrate lysozyme-like genes</p></caption><text>
   <p><b>Phylogeny of vertebrate lysozyme-like genes</b>. Bayesian phylogenetic tree of vertebrate lysozyme genes. The tree shown was generated with <it>MrBayes </it><abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp> using DNA sequences of diverse vertebrate lysozyme-like sequences. (For sequences see Additional files 1 and 18: Tables S1 and S2.) The alignment used to generate the tree is shown in Additional file 19: Figure S17. The tree was built after 2,000,000 generations with nst = 6 and rates = gamma (the TrNef+I+G model was selected as the best model by ModelTest <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr></abbrgrp>). Insect and amphioxus sequences were used to root the tree. Solid diamonds indicate nodes that represent the gene duplication events that generated the different types of lysozyme-like genes, or subfamilies, found in mammals. The support for each clade containing each subfamily of mammalian lysozyme-like gene is boxed, with posterior probabilities above the lineage and bootstrap support from maximum likelihood analysis below (from Additional file 3: Figure S2). The remaining lineages only display the posterior probabilities from the Bayesian analysis.</p>
</text><graphic file="1471-2148-11-166-3" hint_layout="double"/></fig>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>Supplementary Figure 1</b>. This file is in PDF format. Phylogeny of vertebrate lysozyme-like sequences generated by <it>PhyloBayes</it>.</p>
</text>
<file name="1471-2148-11-166-S2.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>
<b>Supplementary Figure 2</b>. This file is in PDF format. Phylogeny of vertebrate lysozyme-like sequences generated by <it>PhyML</it>.</p>
</text>
<file name="1471-2148-11-166-S3.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S4">
<title>
<p>Additional file 4</p>
</title>
<text>
<p>
<b>Supplementary Figure 3</b>. This file is in PDF format. Phylogeny of only mammalian lysozyme-like sequences generated by <it>MrBayes </it>with support for the orthologous genes by different phylogenetic methods.</p>
</text>
<file name="1471-2148-11-166-S4.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>Importantly, as illustrated by the Bayesian analysis shown in Figure <figr fid="F3">3</figr>, all of these phylogenetic analyses suggested that most lineages of mammals have eight different types of lysozyme-like genes (or pseudogenes): <it>Lyz, Lalba, Lysc1, Lyzl1/2, Lyzl4, Lyzl6, Spaca3</it>, and <it>Spaca5</it>. Regardless of type of phylogenetic analysis, these mammalian genes always clustered together as monophyletic groups, or clades, supporting their orthologous relationships. These mammalian gene clades routinely had high statistical support, again regardless of method used. These results are consistent with the orthologous relationships suggested by the original <it>BLAST </it>searches, as well as with our genomic neighborhood analyses (described in more detail in the sections below). However, the genes from the other vertebrates did not consistently group with any of these mammalian orthologs, with the exception of some of the lysozyme <it>c </it>sequences.</p>
<p>In addition, it is clear that most, if not all, of the eight mammalian lysozyme-like genes duplicated and diverged from each other prior to the divergence of the earliest mammalian lineages. This conclusion is supported by at least two observations. First, both the platypus and the eutherian genomes contain copies of most of these gene duplicates; therefore, these genes must have diverged earlier than did the species lineages. Second, some of the non-mammalian vertebrate genes appear to have phylogenetic affinity for some of the mammalian gene lineages, although few have much statistical support. This is particularly evident for many of the lizard genes which, as illustrated in Figure <figr fid="F3">3</figr>, tend to branch with various mammalian orthologs. If this result is not an artifact, then many of the lysozyme-like genes must have duplicated prior to the mammal-reptile (or even mammal-amphibian) divergence. If this were the case, however, then these gene duplicates must have been deleted from the genomes of birds.</p>
<p>Whereas our phylogenetic analyses supported the monophyly of each of the mammalian lysozyme-like gene duplicates, the relationships between the paralogs were not resolved well (see Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S2">2</supplr>-<supplr sid="S4">4</supplr>: Figures S1-S3). While many of our phylogenetic analyses, including the one shown in Figure <figr fid="F3">3</figr> (and Additional file <supplr sid="S3">3</supplr>: Figure S2), suggested that the <it>Lalba </it>clade was the earliest diverging lineage and that most of the <it>Lyzl </it>and <it>Spaca </it>genes (<it>Lyzl1/2, Lyzl4, Lyzl6, Spaca3</it>, and <it>Spaca5</it>, but not <it>Lyzl8</it>) were most closely related to each other, these relationships were not consistently found (<it>e.g</it>., see Additional file <supplr sid="S2">2</supplr>: Figure S1). Therefore, the phylogenetic analyses are inconclusive concerning the relationships of the different subfamilies of mammalian lysozyme-like genes.</p>
<p>The phylogenetic trees also suggested the possibility that at least some of the gene divergences occurred very early in vertebrate evolution, <it>i.e</it>., prior to the mammal-fish divergence. For example, the mammalian <it>Lyz </it>gene sequences were found to branch with <it>Lyz </it>genes from fish, rather than with the other mammalian lysozyme-like genes (Figure <figr fid="F3">3</figr>); if this branching order reflects the actual evolutionary history of the genes (rather than phylogenetic affinity based upon conserved lysozyme protein structure and function), then the <it>Lyz </it>gene lineage must have diverged from the other lysozyme-like genes prior to the mammal-fish divergence. Again, if this were true, then many species lineages must have deleted the duplicates from their genomes. Below, we discuss each subfamily of lysozyme-like gene in the mammals, and consider their potential non-mammalian orthologs.</p>
</sec>
<sec>
<st>
<p>Lysozyme <it>c </it>(<it>Lyz</it>) Genes</p>
</st>
<p>The <it>Lyz </it>gene is the best-studied lysozyme-like gene, and has been extensively characterized in many species <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. Our genome searches and phylogenetic analyses identified many genes that appear to be orthologous to <it>Lyz </it>in diverse vertebrates (Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S1">1</supplr>-<supplr sid="S3">3</supplr>: Table S1, and Figures S1 and S2). To confirm the orthology of the <it>Lyz </it>genes, we used a genomic neighborhood analysis, wherein we examined the orthology of the genes that flank the <it>Lyz </it>gene in diverse species. The avian and mammalian <it>Lyz </it>genes are flanked by the <it>Cpsf6 </it>and <it>Yeats4 </it>genes (Figure <figr fid="F4">4</figr>). This organization is maintained in species with tandemly duplicated <it>Lyz </it>genes, such as rodents and cow, where the <it>Cpsf6 </it>and <it>Yeats4 </it>genes are found flanking a cluster of <it>Lyz </it>genes (Figure <figr fid="F4">4</figr>). A slightly different organization is seen in the opossum, which has four <it>Lyz </it>genes; in this case the <it>Cpsf6 </it>gene is upstream of two of the <it>Lyz </it>genes and the <it>Yeats4 </it>gene is downstream of the other two, but these two clusters are separated by about 12 Mb of DNA that potentially was inserted into the opossum genome (Figure <figr fid="F4">4</figr>). Of the numerous lysozyme genes found in <it>Xenopus tropicalis</it>, the <it>LyzA </it>gene was found to be most closely related to the avian and mammalian <it>Lyz </it>genes in the phylogenetic trees (Figure <figr fid="F3">3</figr>); consistent with this, the <it>LyzA </it>gene is located adjacent to a <it>Yeats4 </it>ortholog (Figure <figr fid="F4">4</figr>), suggesting it is a true ortholog of the mammalian <it>Lyz </it>gene. Most fish <it>Lyz </it>sequences branch in a clade with mammalian <it>Lyz </it>genes (Figure <figr fid="F3">3</figr>). 
However, although the <it>Cpsf6 </it>and <it>Yeats4 </it>genes are neighbors in the fish genomes, the fish lysozyme genes are not adjacent to either of these genes; thus the genomic neighborhood analysis does not provide support for the orthology of fish and mammalian <it>Lyz </it>genes. Intriguingly, the zebrafish <it>Lyz </it>gene was found to be in a different genomic context from other fish <it>Lyz </it>genes (results not shown), which would agree with the hypothesis generated by the phylogenetic analyses (see Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>: Figures S1 and S2) that the zebrafish <it>Lyz </it>gene is not truly orthologous to the other fish <it>Lyz </it>genes.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Genomic neighborhood surrounding the lysozyme (<it>Lyz</it>) gene</p></caption><text>
   <p><b>Genomic neighborhood surrounding the lysozyme (<it>Lyz</it>) gene</b>. The relative organization and orientation (with arrowheads indicating the direction of transcription) of genes near the <it>Lyz </it>genes in representative diverse vertebrate genomes are shown. Species and chromosomes (or contigs or scaffolds) are from <it>Ensembl </it><abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, and are shown under each gene array. Gene sizes and distances between genes are not to scale. The distance between the human <it>CPSF6 </it>and <it>YEATS4 </it>genes is about 90 kb. Gene symbols are: <it>CPM</it>, Carboxypeptidase M; <it>CPSF6</it>, Cleavage and polyadenylation specificity factor subunit 6; <it>YEATS4</it>, YEATS domain-containing protein 4 (Synonym: <it>Gas41</it>, Glioma-amplified sequence 41); <it>FRS2</it>, Fibroblast growth factor receptor substrate 2 (FGFR substrate 2); <it>C12orf71</it>, Chromosome 12 open reading frame 71; <it>V1r</it>, a member of the vomeronasal receptor gene family; <it>Nup107</it>, nucleoporin 107 kDa. By necessity, the cow <it>Lyz </it>genes are shown on two lines, but are actually contiguous in the genome. The opossum <it>Lyz </it>genes are at two non-adjacent locations on chromosome 8.</p>
</text><graphic file="1471-2148-11-166-4" hint_layout="single"/></fig>
<p>The number of <it>Lyz </it>genes found in mammalian genomes varied from 1, in most species, to 12 (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). Multiple <it>Lyz </it>genes had previously been identified in the genomes of mice and rats <abbrgrp>
<abbr bid="B26">26</abbr>
<abbr bid="B27">27</abbr>
<abbr bid="B28">28</abbr>
<abbr bid="B29">29</abbr>
</abbrgrp>, rabbits <abbrgrp>
<abbr bid="B30">30</abbr>
<abbr bid="B31">31</abbr>
</abbrgrp>, and artiodactyls <abbrgrp>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
<abbr bid="B35">35</abbr>
<abbr bid="B36">36</abbr>
</abbrgrp>. In addition to these species, multiple <it>Lyz </it>genes were found in many other species, including guinea pig (9 genes), elephant (8 genes), armadillo (5 genes), sloth (6 genes), opossum (4 genes), and wallaby (8 genes) (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). Phylogenetic analysis of the <it>Lyz </it>sequences suggested that the multiple genes in diverse species are due to independent gene duplications or amplification events (Figure <figr fid="F5">5</figr>). However, previous work has shown that <it>Lyz </it>genes have been subjected to concerted evolution on the ruminant and rodent lineages <abbrgrp>
<abbr bid="B28">28</abbr>
<abbr bid="B29">29</abbr>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
<abbr bid="B35">35</abbr>
<abbr bid="B36">36</abbr>
<abbr bid="B37">37</abbr>
</abbrgrp>. Since the inference of independent gene duplication on sister lineages, instead of on their common ancestral lineage, is a pattern generated by concerted evolution, the distributions of duplicated <it>Lyz </it>genes (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1) combined with the phylogenetic analysis (Figure <figr fid="F5">5</figr>) suggest that concerted evolution might also have occurred on lineages such as Afrotheria (<it>e.g</it>., elephant and hyrax) and marsupials (opossum and wallaby).</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Phylogeny of mammalian lysozyme <it>c </it>(<it>Lyz</it>) genes</p></caption><text>
   <p><b>Phylogeny of mammalian lysozyme <it>c </it>(<it>Lyz</it>) genes</b>. A Bayesian phylogenetic tree of mammalian lysozyme <it>c </it>genes was generated by <it>MrBayes </it><abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp> using the DNA coding sequences of mammalian <it>Lyz </it>sequences. This tree was built with nst = 2 and rates = gamma as selected by <it>ModelTest </it><abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr></abbrgrp>. The tree was rooted with the platypus <it>Lyz </it>sequence. The posterior probability support for each node is shown.</p>
</text><graphic file="1471-2148-11-166-5" hint_layout="single"/></fig>
</sec>
<sec>
<st>
<p>Lysozyme-like 1/2 (<it>Lyzl1/2</it>) Genes</p>
</st>
<p>As mentioned above, the <it>Lyzl1 </it>and <it>Lyzl2 </it>genes in the human genome appear to have been generated recently via a genomic segmental duplication. Indeed, with the exception of those primate species that are close relatives of human, most other mammals have only a single gene sequence similar to <it>Lyzl1 </it>or <it>Lyzl2 </it>(Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). Comparison of the genomic neighborhoods surrounding the <it>Lyzl1/2 </it>genes in diverse mammals demonstrates orthology of these genes, as the genes adjacent to them are either <it>Bambi </it>or <it>Dnm1p17</it>, or both (Additional file <supplr sid="S5">5</supplr>: Figure S4). Interestingly, although an ortholog of <it>Lyzl1/2 </it>was found in the platypus genome, one could not be found in the opossum, even though the <it>Bambi </it>and <it>Dnm1p17 </it>genes are adjacent in this species; this suggests that the <it>Lyzl1/2 </it>gene was deleted from the opossum genome (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S5">5</supplr>: Figure S4). Both human and macaque have duplicated <it>Lyzl1/2 </it>genes that reside in similar genomic neighborhoods (Additional file <supplr sid="S5">5</supplr>: Figure S4), which should imply that the <it>Lyzl1/2 </it>gene duplication occurred prior to the human-macaque divergence. Phylogenetic analysis of the <it>Lyzl1/2 </it>sequences, however, implies that independent gene duplication events occurred on the macaque, human, and marmoset lineages (Figure <figr fid="F6">6</figr>). A more likely scenario than multiple independent gene duplications in these closely-related primate species is that the original <it>Lyzl1/2 </it>gene duplication event occurred in their common ancestor (as diagrammed in Figure <figr fid="F2">2</figr>), and concerted evolution between the <it>Lyzl1 </it>and <it>Lyzl2 </it>genes has obscured this original event. 
Although a <it>Lyzl1/2 </it>pseudogene was found in the treeshrew (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1), it was generated independently, as discussed above. Potential <it>Lyzl1/2 </it>orthologs in zebrafish and lizard were suggested by phylogenetic analysis (Figure <figr fid="F3">3</figr>); however, these genes were not in genomic neighborhoods similar to those of the mammalian genes (results not shown), so their evolutionary relationships remain ambiguous.</p>
<suppl id="S5">
<title>
<p>Additional file 5</p>
</title>
<text>
<p>
<b>Supplementary Figure 4</b>. This file is in PDF format. Conservation of genomic organization near <it>Lyzl1/2 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S5.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Phylogeny of mammalian lysozyme-like 1/2 (<it>Lyzl1/2</it>) genes</p></caption><text>
   <p><b>Phylogeny of mammalian lysozyme-like 1/2 (<it>Lyzl1/2</it>) genes</b>. A Bayesian phylogenetic tree of mammalian lysozyme-like 1 and 2 genes was generated by <it>MrBayes </it><abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp> using the DNA coding sequences of mammalian <it>Lyzl1/2 </it>sequences. This tree was built with nst = 6 and rates = gamma as selected by <it>ModelTest </it><abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr></abbrgrp>. The tree was rooted with the platypus <it>Lyzl2 </it>sequence. The posterior probability support for each node is shown.</p>
</text><graphic file="1471-2148-11-166-6" hint_layout="single"/></fig>
</sec>
<sec>
<st>
<p>Lysozyme-like 4 (<it>Lyzl4</it>) Genes</p>
</st>
<p>The <it>Lyzl4 </it>gene was either found as a single copy, or was missing, in all of the mammalian genomes examined (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). Many of the missing genes likely reflect incomplete genomes rather than deletions. Genomic neighborhood analysis confirmed the orthology of the <it>Lyzl4 </it>genes across placental mammals. However, the flanking genes are on different chromosomes in opossum, suggesting that chromosomal recombination had occurred (Additional file <supplr sid="S6">6</supplr>: Figure S5). Phylogenetic analyses of <it>Lyzl4 </it>sequences were consistent with it being a single copy gene (Additional file <supplr sid="S7">7</supplr>: Figure S6). The only potential non-mammalian orthologs of <it>Lyzl4 </it>identified by the phylogenetic analyses were from the lizard (<it>LyzM </it>and <it>LyzN </it>in Figure <figr fid="F3">3</figr>, but no evidence for orthology in Additional files <supplr sid="S2">2</supplr>-<supplr sid="S4">4</supplr>: Figures S1-S3); however, the lizard and mammalian genes were not in similar genomic neighborhoods (data not shown).</p>
<suppl id="S6">
<title>
<p>Additional file 6</p>
</title>
<text>
<p>
<b>Supplementary Figure 5</b>. This file is in PDF format. Conservation of genomic organization near <it>Lyzl4 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S6.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S7">
<title>
<p>Additional file 7</p>
</title>
<text>
<p>
<b>Supplementary Figure 6</b>. This file is in PDF format. Phylogeny of <it>Lyzl4 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S7.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Lysozyme-like 6 (<it>Lyzl6</it>) Genes</p>
</st>
<p>Most mammals exhibited only one <it>Lyzl6 </it>gene, although the opossum had none. Yet, in contrast to most of the other paralogs, great variation in the number of <it>Lyzl6 </it>genes was observed across mammals, with five genes identified in the dog and four genes identified in both the alpaca and the hyrax (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). Given the distant relationships of these three species, these gene duplication events must have occurred independently. The placental and wallaby <it>Lyzl6 </it>genes reside in a conserved genomic neighborhood, which again suggests that this gene was deleted on the opossum lineage (Additional file <supplr sid="S8">8</supplr>: Figure S7). The presence of multiple <it>Lyzl6 </it>genes in a genome raises the possibility of concerted evolution; however, sufficient data were not available to allow examination of this possibility for the <it>Lyzl6 </it>genes (Figure <figr fid="F2">2</figr>, and Additional files <supplr sid="S1">1</supplr> and <supplr sid="S9">9</supplr>: Table S1 and Figure S8). The phylogenetic analyses did not suggest any candidates for <it>Lyzl6 </it>orthologs in non-mammalian species (Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S2">2</supplr>-<supplr sid="S4">4</supplr>: Figures S1-S3).</p>
<suppl id="S8">
<title>
<p>Additional file 8</p>
</title>
<text>
<p>
<b>Supplementary Figure 7</b>. This file is in PDF format. Conservation of genomic organization near <it>Lyzl6 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S8.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S9">
<title>
<p>Additional file 9</p>
</title>
<text>
<p>
<b>Supplementary Figure 8</b>. This file is in PDF format. Phylogeny of <it>Lyzl6 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S9.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Sperm acrosomal protein 3 (<it>Spaca3</it>) Genes</p>
</st>
<p>
<it>Spaca3</it>, like <it>Lyzl4</it>, was not found to be duplicated in any of the mammalian genomes examined (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). <it>Spaca3 </it>resides in a conserved genomic neighborhood in placental mammals; however, a <it>Spaca3 </it>gene is absent from this genomic neighborhood in the opossum (Additional file <supplr sid="S10">10</supplr>: Figure S9). While the wallaby and platypus <it>Spaca3 </it>genes could not be placed in a genomic context due to the short lengths of their genomic contigs (Additional file <supplr sid="S10">10</supplr>: Figure S9), phylogenetic analysis of these sequences (Additional file <supplr sid="S11">11</supplr>: Figure S10) was consistent with them being orthologs. These results suggest that the <it>Spaca3 </it>gene was deleted on the opossum lineage. Again, no non-mammalian orthologs were suggested by phylogenetic analysis (Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S2">2</supplr>-<supplr sid="S4">4</supplr>: Figures S1-S3).</p>
<suppl id="S10">
<title>
<p>Additional file 10</p>
</title>
<text>
<p>
<b>Supplementary Figure 9</b>. This file is in PDF format. Conservation of genomic organization near <it>Spaca3 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S10.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S11">
<title>
<p>Additional file 11</p>
</title>
<text>
<p>
<b>Supplementary Figure 10</b>. This file is in PDF format. Phylogeny of <it>Spaca3 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S11.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Sperm acrosomal protein 5 (<it>Spaca5</it>) Genes</p>
</st>
<p>The <it>Spaca5 </it>gene was found only within placental mammals, with no orthologs suggested by phylogenetic analysis or similarity searches in marsupials, platypus, or other vertebrates (Figures 2 and 3, and Additional files <supplr sid="S1">1</supplr>-<supplr sid="S4">4</supplr>: Table S1 and Figures S1-S3). Thus, it is possible that this gene duplication happened in the ancestor of placental mammals. Genomic neighborhood analysis showed that the <it>Spaca5 </it>gene was in a similar neighborhood on the human, macaque, mouse, and dog X chromosomes (Additional file <supplr sid="S12">12</supplr>: Figure S11); this genomic region was not found in marsupials, platypus, or other vertebrates (results not shown). The <it>SPACA5 </it>gene was found to be uniquely duplicated in the human genome (Figure <figr fid="F2">2</figr>, Additional files <supplr sid="S1">1</supplr> and <supplr sid="S13">13</supplr>: Table S1 and Figure S12). A very recent duplication of <it>SPACA5</it>, since human-chimpanzee divergence, could account for the perfect identity of the protein sequences (Table <tblr tid="T2">2</tblr>) without requiring concerted evolution; however, concerted evolution between the human <it>SPACA5 </it>and <it>SPACA5B </it>genes cannot be excluded.</p>
<suppl id="S12">
<title>
<p>Additional file 12</p>
</title>
<text>
<p>
<b>Supplementary Figure 11</b>. This file is in PDF format. Conservation of genomic organization near <it>Spaca5 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S12.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S13">
<title>
<p>Additional file 13</p>
</title>
<text>
<p>
<b>Supplementary Figure 12</b>. This file is in PDF format. Phylogeny of <it>Spaca5 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S13.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Lysozyme-like 8 (<it>Lyzl8</it>) Gene</p>
</st>
<p>The platypus genome contained one lysozyme-like gene, named <it>Lyzl8</it>, which did not group with any of the other mammalian genes (Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S2">2</supplr>-<supplr sid="S4">4</supplr>: Figures S1-S3). All of our phylogenetic analyses supported the designation of <it>Lyzl8 </it>as a unique lysozyme-like gene duplicate, as the platypus gene did not fall within any of the other monophyletic gene groups. The relationship of the platypus <it>Lyzl8 </it>gene to the other lysozyme-like genes was highly labile in the phylogenetic analyses (Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>: Figures S1 and S2). This result is in accord with the fact that the platypus <it>Lyzl8 </it>gene (or protein) showed little similarity to any of the other lysozyme-like genes (or proteins) in our <it>BLAST </it>searches. When the platypus <it>Lyzl8 </it>gene was used as a query to search mammalian genomes, only one genomic sequence -- from the sloth (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1) -- was found to have greater similarity to <it>Lyzl8 </it>than to any other lysozyme-like gene. When the short sloth sequence was used as a query against the platypus genome, its best match was the <it>Lyzl8 </it>gene. However, the sloth genomic contig was short, containing only a single exon, and therefore could not be used for phylogenetic or genomic neighborhood analysis; thus, the evidence supporting orthology of the sloth sequence to the platypus <it>Lyzl8 </it>gene is very weak. At present, therefore, it is not clear whether this gene duplication happened on the ancestral mammal lineage, with subsequent losses on most descendant lineages, or on the monotreme lineage.</p>
</sec>
<sec>
<st>
<p>Lactalbumin (<it>Lalba</it>) and Calcium-binding Lysozyme (<it>Lysc1</it>) Genes</p>
</st>
<p>Mammalian <it>Lalba </it>genes have been well characterized, and are typically single copy in mammals <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B6">6</abbr>
<abbr bid="B7">7</abbr>
</abbrgrp> (Additional file <supplr sid="S14">14</supplr>: Figure S13). Curiously, it was previously reported that multiple <it>Lalba </it>genes exist in the bovine and ovine genomes <abbrgrp>
<abbr bid="B38">38</abbr>
<abbr bid="B39">39</abbr>
</abbrgrp>, but here we found only a single copy of the <it>Lalba </it>gene in the cow genome (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). Whether this reflects differences in the sources of DNA in the different studies or is due to incomplete genome assembly is unknown. The only genome that revealed a duplicate <it>Lalba </it>gene was the pika (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). Despite hypotheses about an early origin of the <it>Lalba </it>gene <abbrgrp>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
</abbrgrp>, no good candidates for non-mammalian orthologs were identified by our phylogenetic (Figure <figr fid="F3">3</figr>, and Additional files <supplr sid="S2">2</supplr>-<supplr sid="S4">4</supplr>: Figures S1-S3) or genomic neighborhood (Figure <figr fid="F7">7</figr>, and results not shown) analyses.</p>
<suppl id="S14">
<title>
<p>Additional file 14</p>
</title>
<text>
<p>
<b>Supplementary Figure 13</b>. This file is in PDF format. Phylogeny of <it>Lalba </it>genes.</p>
</text>
<file name="1471-2148-11-166-S14.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Genomic neighborhood surrounding the lactalbumin (<it>Lalba</it>) and calcium-binding lysozyme (<it>Lysc1</it>) genes</p></caption><text>
   <p><b>Genomic neighborhood surrounding the lactalbumin (<it>Lalba</it>) and calcium-binding lysozyme (<it>Lysc1</it>) genes</b>. The relative organization and orientation (arrowheads indicate the direction of transcription) of genes near the <it>Lalba </it>and <it>Lysc1 </it>genes in representative genomes from <it>Ensembl </it><abbrgrp><abbr bid="B16">16</abbr></abbrgrp> are shown. Species and chromosome (or SuperContig for platypus) are indicated below each gene array. Gene sizes and distances are not to scale. The distance between the human <it>LALBA </it>and <it>CCNT1 </it>genes is about 150 kb. Gene symbols are: <it>OLFR</it>, a member of the Olfactory receptor gene family; <it>C12orf41</it>, chromosome 12 open reading frame 41; <it>CCNT1</it>, Cyclin-T1 (CycT1, Cyclin-T); <it>Mip</it>, major intrinsic protein of lens fiber; <it>Spryd4</it>, SPRY domain containing 4; <it>Gls2</it>, glutaminase 2. A large sequence gap exists in the cow genome, indicated by the parentheses, near the expected location of the <it>Lysc1 </it>gene. The platypus <it>Lalba </it>gene is on a small contig, as indicated by the shorter line flanked by dotted lines, that has not been annotated to contain (nor do we find) any other genes. Genes in the horse genome (not shown) are organized similarly to those shown for the dog. Genes in the chimpanzee, gorilla, orangutan, baboon, and elephant genomes (not shown) are organized similarly to those of the human and macaque. Genes in the rat and guinea pig genomes (not shown) are organized similarly to those of the mouse.</p>
</text><graphic file="1471-2148-11-166-7" hint_layout="single"/></fig>
<p>An intriguing observation from our genomic neighborhood analysis was that the mammalian calcium-binding lysozyme gene (<it>Lysc1</it>) is located adjacent to the <it>Lalba </it>gene in the dog (Figure <figr fid="F7">7</figr>) and horse (not shown) genomes. Both previous phylogenetic analyses <abbrgrp>
<abbr bid="B9">9</abbr>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
<abbr bid="B12">12</abbr>
</abbrgrp> and our new phylogenetic analyses (Figure <figr fid="F2">2</figr>, and Additional files <supplr sid="S2">2</supplr>-<supplr sid="S4">4</supplr>: Figures S1-S3) suggested that the <it>Lysc1 </it>gene originated prior to the radiation of mammals. However, our <it>tBLASTn </it>searches using either dog or horse <it>Lysc1 </it>identified similar sequences in the genomes of only a few diverse mammals -- dog, cat, horse, shrew, sloth, and mouse lemur (Figure <figr fid="F2">2</figr>, and Additional file <supplr sid="S1">1</supplr>: Table S1). It is also noteworthy that the mammalian (<it>Lysc1</it>) and avian calcium-binding lysozyme genes are not closely related in our phylogenies, a finding in agreement with some earlier analyses <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B11">11</abbr>
</abbrgrp>. Thus, it is reasonable to speculate that calcium binding evolved independently in these bird and mammal lysozymes. The newly identified <it>Lysc1</it>-like genomic sequences all were found on short genomic contigs (Additional file <supplr sid="S1">1</supplr>: Table S1); nonetheless, both the cat and mouse lemur genomic contigs also encode part of the <it>c12orf41 </it>gene (Additional file <supplr sid="S15">15</supplr>: Figure S14A), which is adjacent to the <it>Lysc1 </it>gene in both the dog and horse genomes (Figure <figr fid="F7">7</figr>). This suggests that the <it>Lysc1 </it>gene may be near the <it>c12orf41 </it>gene in other mammalian genomes. Using a strategy that has previously worked to identify genes that could not be found through typical <it>BLAST </it>searches <abbrgrp>
<abbr bid="B40">40</abbr>
<abbr bid="B41">41</abbr>
</abbrgrp>, we focused carefully on the sequences between the <it>Lalba </it>and <it>c12orf41 </it>genes. In 17 of the 37 mammalian genomes available from <it>Ensembl </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B25">25</abbr>
</abbrgrp>, the <it>Lalba </it>and <it>c12orf41 </it>genes were contained in contiguous genomic sequences. In 18 of the remaining 20 species, this genomic region was fragmented into several small genomic contigs; thus, we cannot exclude the possibility that in these genomes the two genes are contiguous. In the other two species, the pig and the little brown bat, this genomic region was not fragmented. In the pig, the current genome assembly does not encode the <it>Lalba </it>gene and the <it>c12orf41 </it>gene is embedded within a very large genomic fragment, suggesting that the <it>Lalba </it>- <it>c12orf41 </it>genomic region has been reorganized in the pig genome (or that this region has been incorrectly assembled). In the little brown bat, the <it>Lalba </it>gene is embedded in a large genomic fragment that was not annotated to include <it>c12orf41 </it>(although our <it>BLAST </it>searches did identify a very small fragment with strong similarity). A more careful examination of the little brown bat genomic contig revealed that most of the genomic region is composed of unsequenced gaps.</p>
<suppl id="S15">
<title>
<p>Additional file 15</p>
</title>
<text>
<p>
<b>Supplementary Figure 14</b>. This file is in PDF format. Conservation of genomic sequences between <it>Lalba </it>and <it>c12orf41 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S15.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>For the 17 genomes that did have linked <it>Lalba </it>and <it>c12orf41 </it>genes, the distance between these two genes ranged from ~50 kb (mouse, rat, and rabbit) to ~250 kb (cow and opossum). For all of these genomes, except the opossum (see below), the only genes (or pseudogenes) annotated between <it>Lalba </it>and <it>c12orf41 </it>were olfactory receptor-like genes, which are not very useful for identifying orthologous and conserved genomic neighborhoods due to their abundance. In the opossum, in addition to the olfactory receptor-like genes, three further genes were annotated between <it>Lalba </it>and <it>c12orf41</it>: <it>Mip, Spryd4</it>, and <it>Gls2</it>. Unfortunately, the wallaby genome is poorly assembled near the <it>Lalba </it>and <it>c12orf41 </it>genes, and thus the neighboring genes could not be identified. Although the <it>Mip, Spryd4</it>, and <it>Gls2 </it>genes reside on the same chromosome as <it>Lalba </it>and <it>c12orf41 </it>in many mammals (<it>e.g., </it>human, rat, guinea pig, cow, horse, and elephant), they are found more than 8 Mb away; furthermore, in some species (<it>e.g</it>., mouse and dog) they are on different chromosomes. These observations suggest that the organization of the <it>Lalba, c12orf41, Mip, Spryd4</it>, and <it>Gls2 </it>genes, and potentially a <it>Lysc1 </it>gene, has changed between the opossum (and possibly other marsupials) and placental mammals.</p>
<p>The genomic sequence between the <it>Lalba </it>and <it>c12orf41 </it>genes for the 17 genomes where these two genes were linked was aligned with <it>MultiPipMaker </it>
<abbrgrp>
<abbr bid="B42">42</abbr>
<abbr bid="B43">43</abbr>
</abbrgrp>. Sequences with similarity to the <it>Lysc1 </it>gene were not observed in 9 of the genomic sequences -- those from marmoset, mouse, rat, guinea pig, rabbit, treeshrew, cow, little brown bat, and opossum (Additional file <supplr sid="S15">15</supplr>: Figure S14B). It should be noted, however, that for 3 of these species (cow, little brown bat, and marmoset) these genomic sequences contain large amounts of unknown sequence (<it>i.e</it>., sequence gaps). Thus, there are only 6 species with nearly complete genomic sequences spanning the <it>Lalba </it>and <it>c12orf41 </it>genes for which we have good evidence for the actual absence of a <it>Lysc1 </it>gene or pseudogene -- mouse, rat, guinea pig, rabbit, treeshrew, and opossum. Pairwise sequence alignments between the mouse, rat, or guinea pig genomic sequences with those from dog or horse (or primates) using <it>PipMaker </it>
<abbrgrp>
<abbr bid="B42">42</abbr>
</abbrgrp> revealed that a large genomic region, which could potentially encode a <it>Lysc1 </it>gene, is missing from these rodent genomes (results not shown). This suggests that this genomic region, including the <it>Lysc1 </it>gene, was deleted either early on the rodent lineage or in the common ancestor of rodents and close relatives (<it>e.g., </it>rabbit), but after the divergence of the rodent lineage from the primate lineage (see Figure <figr fid="F2">2</figr>).</p>
<p>Interestingly, some of the genomes -- including those from certain haplorrhine primate species (human, chimpanzee, gorilla, orangutan, macaque, baboon, and tarsier) and the elephant -- do possess sequences between the <it>Lalba </it>and <it>c12orf41 </it>genes that aligned with three of the four exons (exons 2 through 4) of the horse and dog <it>Lysc1 </it>genes (Figures 7 and 8, and Additional files <supplr sid="S15">15</supplr> and <supplr sid="S16">16</supplr>: Figures S14C and S15). All of these genomic sequences, except for that of the tarsier, are on large genomic segments that have only a few short unsequenced gaps. It is unlikely that all of these genomic sequences have been similarly misassembled; thus, we conclude that exon 1 was deleted from all of these genes, and therefore the <it>Lysc1 </it>gene is a pseudogene in all of these species (Figure <figr fid="F2">2</figr>). In addition to missing exon 1, all of these <it>Lysc1</it>-like gene sequences have frameshift insertions and/or deletions as well as in-frame stop codons, strengthening the conclusion that they are pseudogenes (Figure <figr fid="F8">8</figr>, and Additional file <supplr sid="S16">16</supplr>: Figure S15). The losses of exon 1 from the <it>Lysc1 </it>gene of haplorrhine primates and of the elephant must have been independent events (Figure <figr fid="F2">2</figr>), as the mouse lemur, a strepsirrhine primate, has a <it>Lysc1 </it>gene with an intact exon 1 (Figure <figr fid="F8">8</figr>, and Additional file <supplr sid="S16">16</supplr>: Figure S15), and primates and elephants are only distantly related.</p>
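<p>The two sequence-level criteria for a pseudogene mentioned above (in-frame stop codons and frameshifting insertions or deletions) can be illustrated with a minimal sketch. It is not the procedure used in this study. The function names and the toy sequences are hypothetical, and the input is assumed to be an ungapped predicted coding sequence that starts at the first codon position.</p>

```python
# Minimal sketch; function names and toy sequences are hypothetical.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def inframe_stops(cds):
    """Return 0-based codon indices of stop codons before the final codon."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    # The last complete codon may be the normal terminator, so skip it.
    return [i for i, codon in enumerate(codons[:-1]) if codon in STOP_CODONS]


def has_frameshift_length(cds):
    """True when the sequence length is not a multiple of three."""
    return len(cds) % 3 != 0


def looks_like_pseudogene(cds):
    """Flag a sequence that shows either disruption."""
    return bool(inframe_stops(cds)) or has_frameshift_length(cds)


intact = "ATGAAAGCTCTGTAA"     # ATG AAA GCT CTG TAA
disrupted = "ATGAAATAGCTGTAA"  # ATG AAA TAG CTG TAA (premature TAG)
print(looks_like_pseudogene(intact), looks_like_pseudogene(disrupted))
```

<p>A length test of this kind only detects a net frameshift; locating individual insertions or deletions requires comparison with an intact ortholog, as in the alignment of Figure 8.</p>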
<suppl id="S16">
<title>
<p>Additional file 16</p>
</title>
<text>
<p>
<b>Supplementary Figure 15</b>. This file is in PDF format. DNA sequences of <it>Lysc1 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S16.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>Alignment of predicted calcium-binding lysozymes (<it>Lysc1</it>)</p></caption><text>
   <p><b>Alignment of predicted calcium-binding lysozymes (<it>Lysc1</it>)</b>. Inferred amino acid sequences of predicted <it>Lysc1 </it>genes for diverse mammals are shown in the single-letter amino acid code. The DNA sequences are shown in Additional file <supplr sid="S16">16</supplr>: Figure S15. The number +1 identifies the N-terminal residue of the mature protein, and the signal peptides are shown in italics. The solid black triangles above the sequence indicate locations of the introns in the gene, with the exon number shown above the protein sequence. Dashes (-) identify gaps introduced to maximize alignment and refer to the absence of homologous sequence. Question marks (?) indicate gaps introduced to maximize alignment that may also correspond to sequence lying in gaps in the genome assemblies (= missing data). Codons that have one or two base deletions, and thus would have a frameshift, are marked by an <b>X</b>. Asterisks identify in-frame stop codons in the sequences.</p>
</text><graphic file="1471-2148-11-166-8" hint_layout="single"/></fig>
<p>Intact <it>Lysc1 </it>genes that predict potentially functional calcium-binding lysozymes were found in only a few species (dog, horse, and shrew), whereas pseudogenes were found on several lineages (primates, elephant, and sloth). Phylogenetic and genomic analyses suggested that the pair of <it>Lysc1 </it>genes found in the shrew resulted from a tandem gene duplication event on the lineage leading to this species (Additional files <supplr sid="S1">1</supplr> and <supplr sid="S17">17</supplr>: Table S1 and Figure S16); the divergence of the predicted protein sequences of the two genes suggests that they are not undergoing concerted evolution, however. The <it>Lysc1 </it>gene was deleted from the genome on the lineages leading to rodents (mouse, rat, guinea pig, and squirrel) and treeshrews. Taken together, the above observations suggest that the <it>Lysc1 </it>gene likely arose from a duplication of the lactalbumin gene early in mammalian evolution, and was inactivated several times independently, as summarized in Figure <figr fid="F2">2</figr>.</p>
<suppl id="S17">
<title>
<p>Additional file 17</p>
</title>
<text>
<p>
<b>Supplementary Figure 16</b>. This file is in PDF format. Phylogeny of <it>Lysc1 </it>genes.</p>
</text>
<file name="1471-2148-11-166-S17.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>Here we have shown that the mammalian lysozyme gene family is much larger than previously anticipated, and is composed of at least eight distantly related members (<it>Lyz, Lalba, Lysc1, Lyzl1/2, Lyzl4, Lyzl6, Spaca3</it>, and <it>Spaca5</it>) in most mammalian species. These observations suggest that this family experienced several duplication events prior to the origin of mammals. Several other gene families also experienced such amplifications near the origin of mammals, such as those generating the gene families for keratin-associated proteins <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp>, kallikreins <abbrgrp>
<abbr bid="B45">45</abbr>
<abbr bid="B46">46</abbr>
</abbrgrp>, and bitter taste receptors <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp>. Amplification of these latter genes has been suggested to be associated with development of new mammal-specific features -- <it>e.g</it>., hair (keratin-associated proteins), skin (kallikreins), and diet (bitter taste receptors) <abbrgrp>
<abbr bid="B44">44</abbr>
<abbr bid="B45">45</abbr>
<abbr bid="B46">46</abbr>
<abbr bid="B47">47</abbr>
</abbrgrp>. Intriguingly, lactalbumin is essential for lactose synthesis in mammary glands, a mammal-specific trait <abbrgrp>
<abbr bid="B2">2</abbr>
<abbr bid="B4">4</abbr>
<abbr bid="B7">7</abbr>
</abbrgrp>. These observations raise the possibility that other members of the lysozyme-like family have also evolved mammal-specific roles. The new lysozyme-like genes have been largely conserved within mammals, suggesting that they provide important biological functions. The products of the <it>Spaca3 </it>and <it>Lyzl4 </it>genes have recently been shown to be involved in fertilization in mice <abbrgrp>
<abbr bid="B18">18</abbr>
<abbr bid="B19">19</abbr>
</abbrgrp>. Much further study is needed to identify the enzymatic activities (if any) and biological functions of these newly identified lysozyme-like proteins.</p>
<p>Similar to the keratin-associated protein <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp> and bitter taste receptor <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp> gene families, genes for the lysozyme-like proteins are dispersed over several chromosomes (Table <tblr tid="T1">1</tblr>). The mechanisms by which these original gene duplications occurred are unclear, as the genes that flank the dispersed lysozyme-like genes show no homology to each other, implying that they were not generated by large segmental duplication events (as we observed for the duplications of <it>LYZL1/LYZL2 </it>and <it>SPACA5/SPACA5B </it>in the human genome). The lysozyme-like gene family also shares with the keratin-associated protein <abbrgrp>
<abbr bid="B44">44</abbr>
</abbrgrp>, kallikrein <abbrgrp>
<abbr bid="B45">45</abbr>
</abbrgrp>, and bitter taste receptor <abbrgrp>
<abbr bid="B47">47</abbr>
</abbrgrp> gene families the propensity for lineage-specific gene duplications (see Figures 2 and 3). The lineage-specific expansions, in contrast to the initial duplications, have frequently been tandem in nature. Such tandem organization increases the likelihood that the duplicated genes could be involved in concerted evolution <abbrgrp>
<abbr bid="B22">22</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>, which our phylogenetic analyses suggest has occurred in the <it>Lyz </it>and <it>Lyzl1/2 </it>subfamilies. The <it>Lyz </it>subfamily showed the greatest tendency to tandemly duplicate and evolve in concert, whereas the other lysozyme-like genes typically showed conservation in copy number. Tandem duplication or amplification of the <it>Lyz </it>gene has previously been observed in certain mammals, including the ruminants and rodents, where lysozyme appears to function as a digestive enzyme in the gut <abbrgrp>
<abbr bid="B3">3</abbr>
<abbr bid="B25">25</abbr>
<abbr bid="B26">26</abbr>
<abbr bid="B27">27</abbr>
<abbr bid="B28">28</abbr>
<abbr bid="B29">29</abbr>
<abbr bid="B30">30</abbr>
<abbr bid="B31">31</abbr>
<abbr bid="B32">32</abbr>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
</abbrgrp>. It is of interest to note that many of the species that we found to possess multiple <it>Lyz </it>genes -- <it>e.g., </it>elephant and wallaby -- are also herbivorous species, and thus may use lysozyme as a digestive enzyme upon gut bacteria. The need for higher levels of digestive lysozymes in the guts of fermenting herbivores could have driven the fixation of the tandem duplications in these lineages. Gene conversion between the tandem duplicates might then provide a mechanism whereby favorable mutations in one gene copy could spread to the other copies in the cluster <abbrgrp>
<abbr bid="B33">33</abbr>
<abbr bid="B36">36</abbr>
</abbrgrp>, as well as a mechanism for retention of sequence similarity <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp> in well-adapted proteins.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Database Searches</p>
</st>
<p>All vertebrate genomes maintained in the <it>Ensembl </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> and <it>Pre!Ensembl </it>
<abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> databases (release 57, see Additional file <supplr sid="S1">1</supplr>: Table S1 for a full list) were searched in April 2010 for lysozyme-like sequences. We initially searched the genomes with the <it>tBLASTn </it>algorithm <abbrgrp>
<abbr bid="B20">20</abbr>
<abbr bid="B48">48</abbr>
</abbrgrp> using previously characterized human and rodent lysozyme <it>c </it>and lactalbumin sequences as queries. Subsequent <it>tBLASTn </it>searches used all of the identified putative lysozyme-like protein sequences. Similar searches were conducted using additional databases (<it>e.g</it>., genome assemblies and ESTs) available at the NCBI website <abbrgrp>
<abbr bid="B49">49</abbr>
</abbrgrp>. After identification of the dog and horse calcium-binding lysozyme genes, the other mammalian genome assemblies in the <it>Ensembl </it>database were searched for similar sequences with <it>tBLASTn</it>, using these sequences as queries. All sequences that had E-values below 0.01 were examined. Sequences identified by <it>BLAST </it>searches were used in reciprocal <it>BLASTx </it>searches of the human, mouse and dog proteomes to ensure that their best matches were lysozyme-like sequences. Sequences that were not annotated as encoding lysozyme-like sequences (see Additional file <supplr sid="S1">1</supplr>: Table S1) were examined to identify potential coding sequences using published methods <abbrgrp>
<abbr bid="B50">50</abbr>
<abbr bid="B51">51</abbr>
<abbr bid="B52">52</abbr>
</abbrgrp>. Insect and amphioxus lysozyme sequences, used as outgroups for the phylogenetic analysis (see below), were identified by searches of the NCBI ENTREZ protein database <abbrgrp>
<abbr bid="B49">49</abbr>
</abbrgrp> for <it>Drosophila </it>
<abbrgrp>
<abbr bid="B53">53</abbr>
</abbrgrp> and amphioxus <abbrgrp>
<abbr bid="B54">54</abbr>
</abbrgrp> lysozymes; these protein sequences were then used in <it>tBLASTn </it>
<abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp> searches of the <it>Ensembl </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> and NCBI databases <abbrgrp>
<abbr bid="B49">49</abbr>
</abbrgrp> for related sequences. Several insect sequences were downloaded to represent the diversity of insect lysozyme sequences.</p>
<p>Genomic comparisons of DNA sequences near the lysozyme-like genes were conducted using <it>PipMaker </it>and <it>MultiPipMaker </it>
<abbrgrp>
<abbr bid="B42">42</abbr>
<abbr bid="B43">43</abbr>
<abbr bid="B55">55</abbr>
</abbrgrp>. Genes neighboring the lysozyme-like genes were identified from the genome assemblies at <it>Ensembl </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> and <it>Pre!Ensembl </it>
<abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>. The organization of genes adjacent to the lysozyme-like genes was used to determine whether the genes of interest reside in conserved genomic neighborhoods.</p>
</sec>
<sec>
<st>
<p>Phylogenetic Analysis</p>
</st>
<p>Phylogenies of vertebrate lysozyme-like gene coding sequences were generated with sequences from human, mouse, dog, horse, opossum, wallaby, and platypus, representing the diversity of mammals, as well as those from other vertebrate species (see Additional file <supplr sid="S1">1</supplr>: Table S1) and outgroups (Additional file <supplr sid="S18">18</supplr>: Table S2). Lysozyme-like coding sequences were aligned using <it>MAFFT </it>
<abbrgrp>
<abbr bid="B56">56</abbr>
</abbrgrp> and <it>Clustal </it>
<abbrgrp>
<abbr bid="B57">57</abbr>
</abbrgrp>, as implemented at the <it>Guidance </it>web site <abbrgrp>
<abbr bid="B58">58</abbr>
<abbr bid="B59">59</abbr>
</abbrgrp>, using default parameters. (A <it>MAFFT </it>alignment of all the full-length sequences is provided in Additional file <supplr sid="S19">19</supplr>: Figure S17). Protein sequences were used as guides to generate the DNA sequence alignments. The reliability of the alignments was examined using <it>Guidance </it>
<abbrgrp>
<abbr bid="B58">58</abbr>
<abbr bid="B59">59</abbr>
</abbrgrp>, and trimmed alignments were generated from the sites that had scores above the default cut-off of 0.93. Insect and/or amphioxus lysozyme sequences were used to root the trees of vertebrate lysozyme-like sequences.</p>
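<p>Trimming an alignment by per-column confidence can be sketched as below. This is a simplified illustration, not the <it>Guidance </it>implementation: the function name, the dictionary layout, and the toy scores are assumptions, and only the 0.93 cut-off is taken from the text.</p>

```python
# Simplified sketch; names, data layout, and toy scores are assumptions.
def trim_alignment(alignment, column_scores, cutoff=0.93):
    """Keep only alignment columns whose score is strictly above the cutoff."""
    lengths = {len(seq) for seq in alignment.values()}
    if lengths != {len(column_scores)}:
        raise ValueError("sequences and scores must have the same length")
    keep = [i for i, score in enumerate(column_scores) if score > cutoff]
    return {
        name: "".join(seq[i] for i in keep)
        for name, seq in alignment.items()
    }


toy_alignment = {"seq1": "ATGC", "seq2": "A-GC"}
toy_scores = [0.99, 0.50, 0.95, 0.93]
print(trim_alignment(toy_alignment, toy_scores))
```

<p>Here the second column (score 0.50) and the fourth column (score exactly 0.93, which is not above the cut-off) are dropped, leaving two columns per sequence.</p>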
<suppl id="S18">
<title>
<p>Additional file 18</p>
</title>
<text>
<p>
<b>Supplementary Table 2</b>. This file is in PDF format. Outgroup lysozyme sequences used for phylogenetic analysis.</p>
</text>
<file name="1471-2148-11-166-S18.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<suppl id="S19">
<title>
<p>Additional file 19</p>
</title>
<text>
<p>
<b>Supplementary Figure 17</b>. This file is in Word format. FASTA-formatted <it>MAFFT </it>alignment of lysozyme DNA sequences.</p>
</text>
<file name="1471-2148-11-166-S19.DOC">
   <p>Click here for file</p>
</file>
</suppl>
<p>Phylogenetic trees of the sequences were generated by a variety of methods including <it>MrBayes </it>3.1.2 <abbrgrp>
<abbr bid="B60">60</abbr>
<abbr bid="B61">61</abbr>
</abbrgrp>, <it>PhyloBayes </it>3.2f <abbrgrp>
<abbr bid="B62">62</abbr>
</abbrgrp>, <it>PhyML </it>
<abbrgrp>
<abbr bid="B63">63</abbr>
</abbrgrp>, <it>MEGA </it>4.0.2 <abbrgrp>
<abbr bid="B64">64</abbr>
</abbrgrp>, and <it>PAUP* </it>4beta10 <abbrgrp>
<abbr bid="B65">65</abbr>
</abbrgrp>. Bayesian trees were generated from coding sequences with <it>MrBayes </it>3.1.2 using parameters selected by hierarchical likelihood ratio tests with <it>ModelTest </it>version 3.8, as implemented on the ModelTest server <abbrgrp>
<abbr bid="B66">66</abbr>
<abbr bid="B67">67</abbr>
<abbr bid="B68">68</abbr>
</abbrgrp>. <it>MrBayes </it>was run for 2,000,000 generations with four simultaneous Metropolis-coupled Monte Carlo Markov chains sampled every 100 generations. The average standard deviation of split frequencies dropped to less than 0.02 for all analyses. The first 25% of the trees were discarded as burn-in with the remaining samples used to generate the consensus trees. Trace files generated by <it>MrBayes </it>were examined by <it>Tracer </it>
<abbrgrp>
<abbr bid="B69">69</abbr>
</abbrgrp> to verify that they had converged. Bayesian phylogenies were also generated from protein sequences using <it>PhyloBayes</it>, with two chains and the automatic stopping rule set to terminate the analysis when <it>bpcomp </it>and <it>tracecomp </it>indicated that the discrepancies between the chains were equal to or below 0.2 and all effective sizes were greater than 100. The first 10% of the trees were discarded as burn-in. <it>PAUP* </it>was used to construct parsimony trees. Bootstrapped maximum likelihood trees (100 replications) were generated by <it>PhyML </it>
<abbrgrp>
<abbr bid="B63">63</abbr>
</abbrgrp> on the <it>PhyML </it>webserver <abbrgrp>
<abbr bid="B70">70</abbr>
</abbrgrp> using parameters for the substitution model suggested by <it>ModelTest</it>. The maximum likelihood search was initiated from a tree generated by <it>BIONJ </it>and the best tree was identified after heuristic searches using the nearest neighbor interchange (NNI) algorithm. <it>MEGA4 </it>
<abbrgrp>
<abbr bid="B64">64</abbr>
</abbrgrp> was used to construct bootstrapped (1000 replications) neighbor-joining distance trees, using either Maximum Composite Likelihood distances for the DNA sequences or JTT distances for the protein sequences. Bootstrapped parsimony trees were also generated by <it>PAUP* </it>
<abbrgrp>
<abbr bid="B65">65</abbr>
</abbrgrp>, with 1000 replications and the same search method used for maximum likelihood.</p>
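<p>The sampling settings above determine how many trees contribute to each consensus. The arithmetic can be sketched as follows; the function name is hypothetical, the counts are per chain, and whether the starting state is counted as a sample is left as an explicit parameter because it is an assumption about the software rather than something stated here.</p>

```python
# Sketch of the burn-in arithmetic; the function name is hypothetical.
def retained_samples(generations, sample_every, burnin_fraction,
                     count_start=False):
    """Return (total, discarded, retained) sample counts for one chain."""
    total = generations // sample_every
    if count_start:
        total += 1  # assumption: the state at generation 0 is also recorded
    discarded = int(total * burnin_fraction)
    return total, discarded, total - discarded


# Settings reported for the MrBayes runs: 2,000,000 generations,
# sampled every 100 generations, first 25% discarded as burn-in.
print(retained_samples(2_000_000, 100, 0.25))
```

<p>With these settings each chain yields 20,000 samples, of which 5,000 are discarded and 15,000 are retained (20,001, 5,000, and 15,001 if the starting state is counted).</p>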
<p>With respect to orthology-paralogy issues, the choice of outgroup, the alignment method (<it>MAFFT </it>
<abbrgrp>
<abbr bid="B56">56</abbr>
</abbrgrp> or <it>Clustal </it>
<abbrgrp>
<abbr bid="B57">57</abbr>
</abbrgrp>), and the use of full-length or trimmed (based on <it>Guidance </it>scores <abbrgrp>
<abbr bid="B58">58</abbr>
<abbr bid="B59">59</abbr>
</abbrgrp>) alignments had little influence on the key findings of these analyses. Methods that relied on shorter sequences (<it>i.e</it>., trimmed alignments or protein sequences) or simpler models of sequence evolution (<it>i.e</it>., neighbor-joining or parsimony) tended to yield weaker support for the earlier diverging lineages, but none of our analyses were in significant conflict with the key inferences of the phylogeny presented in Figure <figr fid="F3">3</figr>.</p>
<p>For phylogenies that contained just mammalian lysozyme-like sequences, <it>Lalba </it>sequences were arbitrarily used to root the trees. When only mammalian lysozyme-like gene sequences were used for the phylogenetic analyses, stronger support for each of the orthologous groups was found with all of the phylogenetic methods used, including Bayesian inference, maximum likelihood, distance, and parsimony (see Additional file <supplr sid="S4">4</supplr>: Figure S3). To generate gene-specific phylogenies, the platypus sequence was used as a root, except for <it>Lysc1 </it>and <it>Spaca5</it>, for which the platypus lacks sequences. For <it>Lysc1</it>, the sloth sequence was used to root the tree, whereas for <it>Spaca5 </it>the elephant and tenrec sequences provided the root.</p>
</sec>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>DMI and CBS together designed the research and outlined the manuscript. DMI, JMB, and CBS obtained and analyzed the data. DMI drafted the manuscript. All of the authors have read, edited, and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>This work has been supported by grants from the Natural Sciences and Engineering Research Council (to DMI) and from the National Institutes of Health and SUNY-Albany (to CBS). We thank the Associate Editor and two anonymous reviewers for their comments that have helped improve this manuscript.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Lysozyme and alpha-lactalbumin: structure, function, and interrelationships</p></title><aug><au><snm>McKenzie</snm><fnm>HA</fnm></au><au><snm>White FH</snm><fnm>Jr</fnm></au></aug><source>Adv Protein Chem</source><pubdate>1991</pubdate><volume>41</volume><fpage>173</fpage><lpage>315</lpage><xrefbib><pubid idtype="pmpid">2069076</pubid></xrefbib></bibl><bibl id="B2"><title><p>alpha-lactalbumins and lysozymes</p></title><aug><au><snm>McKenzie</snm><fnm>HA</fnm></au></aug><source>Lysozymes: model enzymes in biochemistry and molecular biology</source><publisher>Basel, Birkh&#228;user Verlag</publisher><editor>Joll&#232;s, P.</editor><pubdate>1996</pubdate><fpage>365</fpage><lpage>409</lpage></bibl><bibl id="B3"><title><p>Animal lysozymes <it>c </it>and <it>g</it>: an overview</p></title><aug><au><snm>Prager</snm><fnm>EM</fnm></au><au><snm>Joll&#232;s</snm><fnm>P</fnm></au></aug><source>Lysozymes: model enzymes in biochemistry and molecular biology</source><publisher>Basel, Birkh&#228;user Verlag</publisher><editor>Joll&#232;s P.</editor><pubdate>1996</pubdate><fpage>9</fpage><lpage>31</lpage></bibl><bibl id="B4"><title><p>Molecular divergence of lysozymes and alpha-lactalbumin</p></title><aug><au><snm>Qasba</snm><fnm>PK</fnm></au><au><snm>Kumar</snm><fnm>S</fnm></au></aug><source>Crit Rev Biochem Mol Biol</source><pubdate>1997</pubdate><volume>32</volume><fpage>255</fpage><lpage>306</lpage><xrefbib><pubidlist><pubid idtype="doi">10.3109/10409239709082574</pubid><pubid idtype="pmpid">9307874</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Lysozymes in the animal kingdom</p></title><aug><au><snm>Callewaert</snm><fnm>L</fnm></au><au><snm>Michiels</snm><fnm>CW</fnm></au></aug><source>J Biosci</source><pubdate>2010</pubdate><volume>35</volume><fpage>127</fpage><lpage>160</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s12038-010-0015-5</pubid><pubid idtype="pmpid" 
link="fulltext">20413917</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Isolation and characterization of vertebrate lysozyme genes</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au><au><snm>Yu</snm><fnm>M</fnm></au><au><snm>Wen</snm><fnm>Y</fnm></au></aug><source>Lysozymes: model enzymes in biochemistry and molecular biology</source><publisher>Basel, Birkh&#228;user Verlag</publisher><editor>Joll&#232;s P.</editor><pubdate>1996</pubdate><fpage>225</fpage><lpage>241</lpage></bibl><bibl id="B7"><title><p>alpha-Lactalbumin: structure and function</p></title><aug><au><snm>Permyakov</snm><fnm>EA</fnm></au><au><snm>Berliner</snm><fnm>LJ</fnm></au></aug><source>FEBS Lett</source><pubdate>2000</pubdate><volume>473</volume><fpage>269</fpage><lpage>274</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0014-5793(00)01546-5</pubid><pubid idtype="pmpid" link="fulltext">10818224</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Can misfolded proteins be beneficial? The HAMLET case</p></title><aug><au><snm>Pettersson-Kastberg</snm><fnm>J</fnm></au><au><snm>Aits</snm><fnm>S</fnm></au><au><snm>Gustafsson</snm><fnm>L</fnm></au><au><snm>Mossberg</snm><fnm>A</fnm></au><au><snm>Storm</snm><fnm>P</fnm></au><au><snm>Trulsson</snm><fnm>M</fnm></au><au><snm>Persson</snm><fnm>F</fnm></au><au><snm>Mok</snm><fnm>KH</fnm></au><au><snm>Svanborg</snm><fnm>C</fnm></au></aug><source>Ann Med</source><pubdate>2009</pubdate><volume>41</volume><fpage>162</fpage><lpage>176</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/07853890802502614</pubid><pubid idtype="pmpid" link="fulltext">18985467</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Calcium-binding lysozymes</p></title><aug><au><snm>Nitta</snm><fnm>K</fnm></au><au><snm>Tsuge</snm><fnm>H</fnm></au><au><snm>Shimazaki</snm><fnm>K</fnm></au><au><snm>Sugai</snm><fnm>S</fnm></au></aug><source>Biol Chem Hoppe 
Seyler</source><pubdate>1988</pubdate><volume>369</volume><fpage>671</fpage><lpage>675</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1515/bchm3.1988.369.2.671</pubid><pubid idtype="pmpid">3214551</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>cDNA and amino acid sequences of rainbow trout (<it>Oncorhynchus mykiss</it>) lysozymes and their implications for the evolution of lysozyme and lactalbumin</p></title><aug><au><snm>Dautigny</snm><fnm>A</fnm></au><au><snm>Prager</snm><fnm>EM</fnm></au><au><snm>Pham-Dinh</snm><fnm>D</fnm></au><au><snm>Joll&#232;s</snm><fnm>J</fnm></au><au><snm>Pakdel</snm><fnm>F</fnm></au><au><snm>Grinde</snm><fnm>B</fnm></au><au><snm>Joll&#232;s</snm><fnm>P</fnm></au></aug><source>J Mol Evol</source><pubdate>1991</pubdate><volume>32</volume><fpage>187</fpage><lpage>98</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/BF02515392</pubid><pubid idtype="pmpid">1901095</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Sequences of two highly divergent canine type <it>c </it>lysozymes: implications for the evolutionary origins of the lysozyme/alpha-lactalbumin superfamily</p></title><aug><au><snm>Grobler</snm><fnm>JA</fnm></au><au><snm>Rao</snm><fnm>KR</fnm></au><au><snm>Pervaiz</snm><fnm>S</fnm></au><au><snm>Brew</snm><fnm>K</fnm></au></aug><source>Arch Biochem Biophys</source><pubdate>1994</pubdate><volume>313</volume><fpage>360</fpage><lpage>366</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/abbi.1994.1399</pubid><pubid idtype="pmpid" link="fulltext">8080284</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>The evolution of lysozyme and alpha-lactalbumin</p></title><aug><au><snm>Nitta</snm><fnm>K</fnm></au><au><snm>Sugai</snm><fnm>S</fnm></au></aug><source>Eur J Biochem</source><pubdate>1989</pubdate><volume>182</volume><fpage>111</fpage><lpage>118</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1432-1033.1989.tb14806.x</pubid><pubid idtype="pmpid" 
link="fulltext">2731545</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>SLLP1, a unique, intra-acrosomal, non-bacteriolytic, <it>c </it>lysozyme-like protein of human spermatozoa</p></title><aug><au><snm>Mandal</snm><fnm>A</fnm></au><au><snm>Klotz</snm><fnm>KL</fnm></au><au><snm>Shetty</snm><fnm>J</fnm></au><au><snm>Jayes</snm><fnm>FL</fnm></au><au><snm>Wolkowicz</snm><fnm>MJ</fnm></au><au><snm>Bolling</snm><fnm>LC</fnm></au><au><snm>Coonrod</snm><fnm>SA</fnm></au><au><snm>Black</snm><fnm>MB</fnm></au><au><snm>Diekman</snm><fnm>AB</fnm></au><au><snm>Haystead</snm><fnm>TA</fnm></au><au><snm>Flickinger</snm><fnm>CJ</fnm></au><au><snm>Herr</snm><fnm>JC</fnm></au></aug><source>Biol Reprod</source><pubdate>2003</pubdate><volume>68</volume><fpage>1525</fpage><lpage>1537</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">12606493</pubid></xrefbib></bibl><bibl id="B14"><title><p>SPRASA, a novel sperm protein involved in immune-mediated infertility</p></title><aug><au><snm>Chiu</snm><fnm>WW</fnm></au><au><snm>Erikson</snm><fnm>EK</fnm></au><au><snm>Sole</snm><fnm>CA</fnm></au><au><snm>Shelling</snm><fnm>AN</fnm></au><au><snm>Chamley</snm><fnm>LW</fnm></au></aug><source>Hum Reprod</source><pubdate>2004</pubdate><volume>19</volume><fpage>243</fpage><lpage>249</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/humrep/deh050</pubid><pubid idtype="pmpid" link="fulltext">14747161</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Molecular cloning and characterization of three novel lysozyme-like genes, predominantly expressed in the male reproductive system of humans, belonging to the <it>c</it>-type lysozyme/alpha-lactalbumin family</p></title><aug><au><snm>Zhang</snm><fnm>K</fnm></au><au><snm>Gao</snm><fnm>R</fnm></au><au><snm>Zhang</snm><fnm>H</fnm></au><au><snm>Cai</snm><fnm>X</fnm></au><au><snm>Shen</snm><fnm>C</fnm></au><au><snm>Wu</snm><fnm>C</fnm></au><au><snm>Zhao</snm><fnm>S</fnm></au><au><snm>Yu</snm><fnm>L</fnm></au></aug><source>Biol 
Reprod</source><pubdate>2005</pubdate><volume>73</volume><fpage>1064</fpage><lpage>1071</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1095/biolreprod.105.041889</pubid><pubid idtype="pmpid" link="fulltext">16014814</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Ensembl Genome Browser [<url>http://www.ensembl.org/index.html</url>]</p></title><aug><au><snm></snm><fnm></fnm></au></aug></bibl><bibl id="B17"><title><p>Immunogenicity of a multi-component recombinant human acrosomal protein vaccine in female <it>Macaca fascicularis</it></p></title><aug><au><snm>Kurth</snm><fnm>BE</fnm></au><au><snm>Digilio</snm><fnm>L</fnm></au><au><snm>Snow</snm><fnm>P</fnm></au><au><snm>Bush</snm><fnm>LA</fnm></au><au><snm>Wolkowicz</snm><fnm>M</fnm></au><au><snm>Shetty</snm><fnm>J</fnm></au><au><snm>Mandal</snm><fnm>A</fnm></au><au><snm>Hao</snm><fnm>Z</fnm></au><au><snm>Reddi</snm><fnm>PP</fnm></au><au><snm>Flickinger</snm><fnm>CJ</fnm></au><au><snm>Herr</snm><fnm>JC</fnm></au></aug><source>J Reprod Immunol</source><pubdate>2008</pubdate><volume>77</volume><fpage>126</fpage><lpage>141</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jri.2007.06.001</pubid><pubid idtype="pmcid">2481230</pubid><pubid idtype="pmpid" link="fulltext">17643494</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Lyzl4, a novel mouse sperm-related protein, is involved in fertilization</p></title><aug><au><snm>Sun</snm><fnm>R</fnm></au><au><snm>Shen</snm><fnm>R</fnm></au><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Xu</snm><fnm>G</fnm></au><au><snm>Chi</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>L</fnm></au><au><snm>Ren</snm><fnm>J</fnm></au><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Fei</snm><fnm>J</fnm></au></aug><source>Acta Biochem Biophys Sinica</source><pubdate>2011</pubdate><volume>43</volume><fpage>346</fpage><lpage>353</lpage><xrefbib><pubid idtype="doi">10.1093/abbs/gmr017</pubid></xrefbib></bibl><bibl id="B19"><title><p>Mouse SLLP1, a sperm lysozyme-like protein 
involved in sperm-egg binding and fertilization</p></title><aug><au><snm>Herrero</snm><fnm>MB</fnm></au><au><snm>Mandal</snm><fnm>A</fnm></au><au><snm>Digilio</snm><fnm>LC</fnm></au><au><snm>Coonrod</snm><fnm>SA</fnm></au><au><snm>Maier</snm><fnm>B</fnm></au><au><snm>Herr</snm><fnm>JC</fnm></au></aug><source>Develop Biol</source><pubdate>2005</pubdate><volume>284</volume><fpage>126</fpage><lpage>142</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ydbio.2005.05.008</pubid><pubid idtype="pmpid" link="fulltext">15982649</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p></title><aug><au><snm>Altschul</snm><fnm>SF</fnm></au><au><snm>Madden</snm><fnm>TL</fnm></au><au><snm>Sch&#228;ffer</snm><fnm>AA</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1997</pubdate><volume>25</volume><fpage>3389</fpage><lpage>3402</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/25.17.3389</pubid><pubid idtype="pmcid">146917</pubid><pubid idtype="pmpid" link="fulltext">9254694</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>The human lysozyme gene. 
Sequence organization and chromosomal localization</p></title><aug><au><snm>Peters</snm><fnm>CW</fnm></au><au><snm>Kruse</snm><fnm>U</fnm></au><au><snm>Pollwein</snm><fnm>R</fnm></au><au><snm>Grzeschik</snm><fnm>KH</fnm></au><au><snm>Sippel</snm><fnm>AE</fnm></au></aug><source>Eur J Biochem</source><pubdate>1989</pubdate><volume>182</volume><fpage>507</fpage><lpage>516</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1432-1033.1989.tb14857.x</pubid><pubid idtype="pmpid" link="fulltext">2546758</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Organization and sequence of the human alpha-lactalbumin gene</p></title><aug><au><snm>Hall</snm><fnm>L</fnm></au><au><snm>Emery</snm><fnm>DC</fnm></au><au><snm>Davies</snm><fnm>MS</fnm></au><au><snm>Parker</snm><fnm>D</fnm></au><au><snm>Craig</snm><fnm>RK</fnm></au></aug><source>Biochem J</source><pubdate>1987</pubdate><volume>242</volume><fpage>735</fpage><lpage>742</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1147772</pubid><pubid idtype="pmpid">2954544</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Primate segmental duplications: crucibles of evolution, diversity and disease</p></title><aug><au><snm>Bailey</snm><fnm>JA</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2006</pubdate><volume>7</volume><fpage>552</fpage><lpage>564</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">16770338</pubid></xrefbib></bibl><bibl id="B24"><title><p>Neutral and non-neutral evolution of duplicated genes with gene conversion</p></title><aug><au><snm>Fawcett</snm><fnm>JA</fnm></au><au><snm>Innan</snm><fnm>H</fnm></au></aug><source>Genes</source><pubdate>2011</pubdate><volume>2</volume><fpage>191</fpage><lpage>209</lpage><xrefbib><pubid idtype="doi">10.3390/genes2010191</pubid></xrefbib></bibl><bibl id="B25"><title><p>Ensembl Pre-release Genome Browser 
[<url>http://pre.ensembl.org/index.html</url>]</p></title><aug><au><snm></snm><fnm></fnm></au></aug></bibl><bibl id="B26"><title><p>Recruitment of lysozyme as a major enzyme in the mouse gut: duplication, divergence, and regulatory evolution</p></title><aug><au><snm>Hammer</snm><fnm>MF</fnm></au><au><snm>Schilling</snm><fnm>JW</fnm></au><au><snm>Prager</snm><fnm>EM</fnm></au><au><snm>Wilson</snm><fnm>AC</fnm></au></aug><source>J Mol Evol</source><pubdate>1987</pubdate><volume>24</volume><fpage>272</fpage><lpage>279</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/BF02111240</pubid><pubid idtype="pmpid">3106642</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Repetitive sequence involvement in the duplication and divergence of mouse lysozyme genes</p></title><aug><au><snm>Cross</snm><fnm>M</fnm></au><au><snm>Renkawitz</snm><fnm>R</fnm></au></aug><source>EMBO J</source><pubdate>1990</pubdate><volume>9</volume><fpage>1283</fpage><lpage>1288</lpage><xrefbib><pubidlist><pubid idtype="pmcid">551806</pubid><pubid idtype="pmpid">2323338</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Mouse lysozyme M gene: isolation, characterization, and expression studies</p></title><aug><au><snm>Cross</snm><fnm>M</fnm></au><au><snm>Mangelsdorf</snm><fnm>I</fnm></au><au><snm>Wedel</snm><fnm>A</fnm></au><au><snm>Renkawitz</snm><fnm>R</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1988</pubdate><volume>85</volume><fpage>6232</fpage><lpage>6236</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.85.17.6232</pubid><pubid idtype="pmcid">281943</pubid><pubid idtype="pmpid" link="fulltext">3413093</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Evolution of rodent lysozymes: isolation and sequence of the rat lysozyme genes</p></title><aug><au><snm>Yeh</snm><fnm>TC</fnm></au><au><snm>Wilson</snm><fnm>AC</fnm></au><au><snm>Irwin</snm><fnm>DM</fnm></au></aug><source>Mol Phylogenet 
Evol</source><pubdate>1993</pubdate><volume>2</volume><fpage>65</fpage><lpage>75</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/mpev.1993.1007</pubid><pubid idtype="pmpid" link="fulltext">8081549</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Secretion of colonic isozyme of lysozyme in association with cecotrophy of rabbits</p></title><aug><au><snm>C&#225;mara</snm><fnm>VM</fnm></au><au><snm>Prieur</snm><fnm>DJ</fnm></au></aug><source>Am J Physiol</source><pubdate>1984</pubdate><volume>247</volume><fpage>G19</fpage><lpage>G23</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">6540054</pubid></xrefbib></bibl><bibl id="B31"><title><p>Colonic lysozymes of rabbit (Japanese white): recent divergence and functional conversion</p></title><aug><au><snm>Ito</snm><fnm>Y</fnm></au><au><snm>Hirashima</snm><fnm>M</fnm></au><au><snm>Yamada</snm><fnm>H</fnm></au><au><snm>Imoto</snm><fnm>T</fnm></au></aug><source>J Biochem</source><pubdate>1994</pubdate><volume>116</volume><fpage>1346</fpage><lpage>1353</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">7706228</pubid></xrefbib></bibl><bibl id="B32"><title><p>Multiple cDNA sequences and the evolution of bovine stomach lysozyme</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au><au><snm>Wilson</snm><fnm>AC</fnm></au></aug><source>J Biol Chem</source><pubdate>1989</pubdate><volume>264</volume><fpage>11387</fpage><lpage>11393</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">2738070</pubid></xrefbib></bibl><bibl id="B33"><title><p>Evolutionary genetics of ruminant lysozymes</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au><au><snm>Prager</snm><fnm>EM</fnm></au><au><snm>Wilson</snm><fnm>AC</fnm></au></aug><source>Anim Genet</source><pubdate>1992</pubdate><volume>23</volume><fpage>193</fpage><lpage>202</lpage><xrefbib><pubid idtype="pmpid">1503255</pubid></xrefbib></bibl><bibl id="B34"><title><p>Evolution of the bovine lysozyme gene family: changes in expression and reversion of 
function</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au></aug><source>J Mol Evol</source><pubdate>1995</pubdate><volume>41</volume><fpage>299</fpage><lpage>312</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/BF01215177</pubid><pubid idtype="pmpid">7563116</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Evolution of cow nonstomach lysozyme genes</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au></aug><source>Genome</source><pubdate>2004</pubdate><volume>47</volume><fpage>1082</fpage><lpage>1090</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1139/g04-075</pubid><pubid idtype="pmpid" link="fulltext">15644966</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Concerted evolution of ruminant stomach lysozymes. Characterization of lysozyme cDNA clones from sheep and deer</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au><au><snm>Wilson</snm><fnm>AC</fnm></au></aug><source>J Biol Chem</source><pubdate>1990</pubdate><volume>265</volume><fpage>4944</fpage><lpage>4952</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">2318875</pubid></xrefbib></bibl><bibl id="B37"><title><p>Mosaic evolution of ruminant stomach lysozyme genes</p></title><aug><au><snm>Wen</snm><fnm>Y</fnm></au><au><snm>Irwin</snm><fnm>DM</fnm></au></aug><source>Mol Phylogenet Evol</source><pubdate>1999</pubdate><volume>13</volume><fpage>474</fpage><lpage>482</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/mpev.1999.0651</pubid><pubid idtype="pmpid" link="fulltext">10620405</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>The bovine and ovine genomes contain multiple sequences homologous to the alpha-lactalbumin-encoding 
gene</p></title><aug><au><snm>Soulier</snm><fnm>S</fnm></au><au><snm>Mercier</snm><fnm>JC</fnm></au><au><snm>Vilotte</snm><fnm>JL</fnm></au><au><snm>Anderson</snm><fnm>J</fnm></au><au><snm>Clark</snm><fnm>AJ</fnm></au><au><snm>Provot</snm><fnm>C</fnm></au></aug><source>Gene</source><pubdate>1989</pubdate><volume>83</volume><fpage>331</fpage><lpage>338</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0378-1119(89)90119-4</pubid><pubid idtype="pmpid">2583529</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Complete sequence of a bovine alpha-lactalbumin pseudogene: the region homologous to the gene is flanked by two directly repeated LINE sequences</p></title><aug><au><snm>Vilotte</snm><fnm>JL</fnm></au><au><snm>Soulier</snm><fnm>S</fnm></au><au><snm>Mercier</snm><fnm>JC</fnm></au></aug><source>Genomics</source><pubdate>1993</pubdate><volume>16</volume><fpage>529</fpage><lpage>532</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/geno.1993.1223</pubid><pubid idtype="pmpid" link="fulltext">8390967</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Identification of cDNA coding for a homologue to mammalian leptin from pufferfish, <it>Takifugu rubripes</it></p></title><aug><au><snm>Kurokawa</snm><fnm>T</fnm></au><au><snm>Uji</snm><fnm>S</fnm></au><au><snm>Suzuki</snm><fnm>T</fnm></au></aug><source>Peptides</source><pubdate>2005</pubdate><volume>26</volume><fpage>745</fpage><lpage>750</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.peptides.2004.12.017</pubid><pubid idtype="pmpid" link="fulltext">15808904</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Evolution of the vertebrate glucose-dependent insulinotropic polypeptide (GIP) gene</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au><au><snm>Zhang</snm><fnm>T</fnm></au></aug><source>Comp Biochem Physiol Part D</source><pubdate>2006</pubdate><volume>1</volume><fpage>385</fpage><lpage>395</lpage></bibl><bibl id="B42"><title><p>PipMaker--a web server for aligning 
two genomic DNA sequences</p></title><aug><au><snm>Schwartz</snm><fnm>S</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Frazer</snm><fnm>KA</fnm></au><au><snm>Smit</snm><fnm>A</fnm></au><au><snm>Riemer</snm><fnm>C</fnm></au><au><snm>Bouck</snm><fnm>J</fnm></au><au><snm>Gibbs</snm><fnm>R</fnm></au><au><snm>Hardison</snm><fnm>R</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au></aug><source>Genome Res</source><pubdate>2000</pubdate><volume>10</volume><fpage>577</fpage><lpage>586</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.10.4.577</pubid><pubid idtype="pmcid">310868</pubid><pubid idtype="pmpid" link="fulltext">10779500</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences</p></title><aug><au><snm>Schwartz</snm><fnm>S</fnm></au><au><snm>Elnitski</snm><fnm>L</fnm></au><au><snm>Li</snm><fnm>M</fnm></au><au><snm>Weirauch</snm><fnm>M</fnm></au><au><snm>Riemer</snm><fnm>C</fnm></au><au><snm>Smit</snm><fnm>A</fnm></au><au><cnm>NISC Comparative Sequencing Program</cnm></au><au><snm>Green</snm><fnm>ED</fnm></au><au><snm>Hardison</snm><fnm>RC</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2003</pubdate><volume>31</volume><fpage>3518</fpage><lpage>3524</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg579</pubid><pubid idtype="pmcid">168985</pubid><pubid idtype="pmpid" link="fulltext">12824357</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Molecular evolution of the keratin associated protein gene family in mammals, role in the evolution of mammalian hair</p></title><aug><au><snm>Wu</snm><fnm>DD</fnm></au><au><snm>Irwin</snm><fnm>DM</fnm></au><au><snm>Zhang</snm><fnm>YP</fnm></au></aug><source>BMC Evol Biol</source><pubdate>2008</pubdate><volume>8</volume><fpage>241</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-8-241</pubid><pubid 
idtype="pmcid">2528016</pubid><pubid idtype="pmpid" link="fulltext">18721477</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>Evolutionary history of tissue kallikreins</p></title><aug><au><snm>Pavlopoulou</snm><fnm>A</fnm></au><au><snm>Pampalakis</snm><fnm>G</fnm></au><au><snm>Michalopoulos</snm><fnm>I</fnm></au><au><snm>Sotiropoulou</snm><fnm>G</fnm></au></aug><source>PLoS One</source><pubdate>2010</pubdate><volume>5</volume><fpage>e13781</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0013781</pubid><pubid idtype="pmcid">2967472</pubid><pubid idtype="pmpid" link="fulltext">21072173</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Functional roles of human kallikrein-related peptidases</p></title><aug><au><snm>Sotiropoulou</snm><fnm>G</fnm></au><au><snm>Pampalakis</snm><fnm>G</fnm></au><au><snm>Diamandis</snm><fnm>EP</fnm></au></aug><source>J Biol Chem</source><pubdate>2009</pubdate><volume>284</volume><fpage>32989</fpage><lpage>32994</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.R109.027946</pubid><pubid idtype="pmcid">2785139</pubid><pubid idtype="pmpid" link="fulltext">19819870</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Dynamic evolution of bitter taste receptor genes in vertebrates</p></title><aug><au><snm>Dong</snm><fnm>D</fnm></au><au><snm>Jones</snm><fnm>G</fnm></au><au><snm>Zhang</snm><fnm>S</fnm></au></aug><source>BMC Evol Biol</source><pubdate>2009</pubdate><volume>9</volume><fpage>12</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-9-12</pubid><pubid idtype="pmcid">2646699</pubid><pubid idtype="pmpid" link="fulltext">19144204</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>BLAST: Basic Local Alignment Search Tool [<url>http://blast.ncbi.nlm.nih.gov/Blast.cgi</url>]</p></title><aug><au><snm></snm><fnm></fnm></au></aug></bibl><bibl id="B49"><title><p>National Center for Biotechnology Information 
[<url>http://www.ncbi.nlm.nih.gov/</url>]</p></title><aug><au><snm></snm><fnm></fnm></au></aug></bibl><bibl id="B50"><title><p>Ancient duplications of the human proglucagon gene</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au></aug><source>Genomics</source><pubdate>2002</pubdate><volume>79</volume><fpage>741</fpage><lpage>746</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/geno.2002.6762</pubid><pubid idtype="pmpid" link="fulltext">11991725</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Molecular evolution of the vertebrate goose-type lysozyme genes</p></title><aug><au><snm>Irwin</snm><fnm>DM</fnm></au><au><snm>Gong</snm><fnm>Z</fnm></au></aug><source>J Mol Evol</source><pubdate>2003</pubdate><volume>56</volume><fpage>234</fpage><lpage>242</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00239-002-2396-z</pubid><pubid idtype="pmpid" link="fulltext">12574869</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Fish proglucagon genes have differing coding potential</p></title><aug><au><snm>Zhou</snm><fnm>L</fnm></au><au><snm>Irwin</snm><fnm>DM</fnm></au></aug><source>Comp Biochem Physiol</source><pubdate>2004</pubdate><volume>137B</volume><fpage>255</fpage><lpage>264</lpage></bibl><bibl id="B53"><title><p>The lysozyme locus in <it>Drosophila melanogaster</it>: different genes are expressed in midgut and salivary glands</p></title><aug><au><snm>Kylsten</snm><fnm>P</fnm></au><au><snm>Kimbrell</snm><fnm>DA</fnm></au><au><snm>Daffre</snm><fnm>S</fnm></au><au><snm>Samakovlis</snm><fnm>C</fnm></au><au><snm>Hultmark</snm><fnm>D</fnm></au></aug><source>Mol Gen Genet</source><pubdate>1992</pubdate><volume>232</volume><fpage>335</fpage><lpage>343</lpage><xrefbib><pubid idtype="pmpid">1588905</pubid></xrefbib></bibl><bibl id="B54"><title><p>Characterization, organization and expression of AmphiLysC, an acidic c-type lysozyme gene in amphioxus <it>Branchiostoma belcheri 
tsingtauense</it></p></title><aug><au><snm>Liu</snm><fnm>M</fnm></au><au><snm>Zhang</snm><fnm>S</fnm></au><au><snm>Liu</snm><fnm>Z</fnm></au><au><snm>Li</snm><fnm>H</fnm></au><au><snm>Xu</snm><fnm>A</fnm></au></aug><source>Gene</source><pubdate>2006</pubdate><volume>367</volume><fpage>110</fpage><lpage>117</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">16360291</pubid></xrefbib></bibl><bibl id="B55"><title><p>PipMaker and MultiPipMaker [<url>http://pipmaker.bx.psu.edu/pipmaker/</url>]</p></title><aug><au><snm></snm><fnm></fnm></au></aug></bibl><bibl id="B56"><title><p>MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform</p></title><aug><au><snm>Katoh</snm><fnm>K</fnm></au><au><snm>Misawa</snm><fnm>K</fnm></au><au><snm>Kuma</snm><fnm>K</fnm></au><au><snm>Miyata</snm><fnm>T</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2002</pubdate><volume>30</volume><fpage>3059</fpage><lpage>3066</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkf436</pubid><pubid idtype="pmcid">135756</pubid><pubid idtype="pmpid" link="fulltext">12136088</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p></title><aug><au><snm>Thompson</snm><fnm>JD</fnm></au><au><snm>Higgins</snm><fnm>DG</fnm></au><au><snm>Gibson</snm><fnm>TJ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1994</pubdate><volume>22</volume><fpage>4673</fpage><lpage>4680</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/22.22.4673</pubid><pubid idtype="pmcid">308517</pubid><pubid idtype="pmpid" link="fulltext">7984417</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>The Guidance Server [<url>http://guidance.tau.ac.il/</url>]</p></title><aug><au><snm></snm><fnm></fnm></au></aug></bibl><bibl id="B59"><title><p>GUIDANCE: a web server for assessing alignment 
confidence scores</p></title><aug><au><snm>Penn</snm><fnm>O</fnm></au><au><snm>Privman</snm><fnm>E</fnm></au><au><snm>Ashkenazy</snm><fnm>H</fnm></au><au><snm>Landan</snm><fnm>G</fnm></au><au><snm>Graur</snm><fnm>D</fnm></au><au><snm>Pupko</snm><fnm>T</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><fpage>W23</fpage><lpage>W28</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq443</pubid><pubid idtype="pmcid">2896199</pubid><pubid idtype="pmpid" link="fulltext">20497997</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>MRBAYES: Bayesian inference of phylogeny</p></title><aug><au><snm>Huelsenbeck</snm><fnm>JP</fnm></au><au><snm>Ronquist</snm><fnm>F</fnm></au></aug><source>Bioinformatics</source><pubdate>2001</pubdate><volume>17</volume><fpage>754</fpage><lpage>755</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/17.8.754</pubid><pubid idtype="pmpid" link="fulltext">11524383</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>MRBAYES 3: Bayesian phylogenetic inference under mixed models</p></title><aug><au><snm>Ronquist</snm><fnm>F</fnm></au><au><snm>Huelsenbeck</snm><fnm>JP</fnm></au></aug><source>Bioinformatics</source><pubdate>2003</pubdate><volume>19</volume><fpage>1572</fpage><lpage>1574</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btg180</pubid><pubid idtype="pmpid" link="fulltext">12912839</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process</p></title><aug><au><snm>Lartillot</snm><fnm>N</fnm></au><au><snm>Philippe</snm><fnm>H</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2004</pubdate><volume>21</volume><fpage>1095</fpage><lpage>1109</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msh112</pubid><pubid idtype="pmpid" link="fulltext">15014145</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>A simple, fast, 
and accurate algorithm to estimate large phylogenies by maximum likelihood</p></title><aug><au><snm>Guindon</snm><fnm>S</fnm></au><au><snm>Gascuel</snm><fnm>O</fnm></au></aug><source>Systematic Biology</source><pubdate>2003</pubdate><volume>52</volume><fpage>696</fpage><lpage>704</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1080/10635150390235520</pubid><pubid idtype="pmpid" link="fulltext">14530136</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) Software Version 4.0</p></title><aug><au><snm>Tamura</snm><fnm>K</fnm></au><au><snm>Dudley</snm><fnm>J</fnm></au><au><snm>Nei</snm><fnm>M</fnm></au><au><snm>Kumar</snm><fnm>S</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2007</pubdate><volume>24</volume><fpage>1596</fpage><lpage>1599</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msm092</pubid><pubid idtype="pmpid" link="fulltext">17488738</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>PAUP* Phylogenetic analysis using parsimony and other methods, version 4.0b10</p></title><aug><au><snm>Swofford</snm><fnm>DL</fnm></au></aug><publisher>Sunderland, Sinauer Associates</publisher><pubdate>2002</pubdate></bibl><bibl id="B66"><title><p>ModelTest: testing the model of DNA substitution</p></title><aug><au><snm>Posada</snm><fnm>D</fnm></au><au><snm>Crandall</snm><fnm>KA</fnm></au></aug><source>Bioinformatics</source><pubdate>1998</pubdate><volume>14</volume><fpage>817</fpage><lpage>818</lpage></bibl><bibl id="B67"><title><p>ModelTest Server: a web-based tool for the statistical selection of models of nucleotide substitution online</p></title><aug><au><snm>Posada</snm><fnm>D</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2006</pubdate><volume>34</volume><fpage>W700</fpage><lpage>W703</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkl042</pubid><pubid idtype="pmcid">1538795</pubid><pubid idtype="pmpid" 
link="fulltext">16845102</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>ModelTest Server 1.0 [<url>http://darwin.uvigo.es/software/modeltest_server.html</url>]</p></title><aug><au><snm></snm><fnm></fnm></au></aug></bibl><bibl id="B69"><title><p>MCMC Trace Analysis Package, version 1.5 [<url>http://tree.bio.ed.ac.uk/software/tracer/</url>]</p></title><aug><au><snm>Rambaut</snm><fnm>A</fnm></au><au><snm>Drummond</snm><fnm>AJ</fnm></au></aug></bibl><bibl id="B70"><title><p>PhyML 3.0: new algorithms, methods and utilities [<url>http://www.atgc-montpellier.fr/phyml/</url>]</p></title><aug><au><snm></snm><fnm></fnm></au></aug></bibl><bibl id="B71"><title><p>Using genomic data to unravel the root of the placental mammal phylogeny</p></title><aug><au><snm>Murphy</snm><fnm>WJ</fnm></au><au><snm>Pringle</snm><fnm>TH</fnm></au><au><snm>Crider</snm><fnm>TA</fnm></au><au><snm>Springer</snm><fnm>MS</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au></aug><source>Genome Res</source><pubdate>2007</pubdate><volume>17</volume><fpage>413</fpage><lpage>421</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.5918807</pubid><pubid idtype="pmcid">1832088</pubid><pubid idtype="pmpid" link="fulltext">17322288</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Resolution among major placental mammal interordinal relationships with genome data imply that speciation influenced their earliest radiations</p></title><aug><au><snm>Hallstr&#246;m</snm><fnm>BM</fnm></au><au><snm>Janke</snm><fnm>A</fnm></au></aug><source>BMC Evol Biol</source><pubdate>2008</pubdate><volume>8</volume><fpage>162</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2148-8-162</pubid><pubid idtype="pmcid">2435553</pubid><pubid idtype="pmpid" link="fulltext">18505555</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Confirming the phylogeny of mammals by use of large comparative sequence data 
sets</p></title><aug><au><snm>Prasad</snm><fnm>AB</fnm></au><au><snm>Allard</snm><fnm>MW</fnm></au><au><cnm>NISC Comparative Sequencing Program</cnm></au><au><snm>Green</snm><fnm>ED</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2008</pubdate><volume>25</volume><fpage>1795</fpage><lpage>1808</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msn104</pubid><pubid idtype="pmcid">2515873</pubid><pubid idtype="pmpid" link="fulltext">18453548</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>