<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2003-4-5-r31</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Duplication and selection in the evolution of primate &#946;-defensin genes</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Semple</snm>
               <mi>AM</mi>
               <fnm>Colin</fnm>
               <insr iid="I1"/>
               <email>Colin.Semple@hgu.mrc.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Rolfe</snm>
               <fnm>Mark</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A3">
               <snm>Dorin</snm>
               <mi>R</mi>
               <fnm>Julia</fnm>
               <insr iid="I1"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>MRC Human Genetics Unit, Western General Hospital, Crewe Road, Edinburgh, EH4 2XU, UK</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2003</pubdate>
         <volume>4</volume>
         <issue>5</issue>
         <fpage>R31</fpage>
         <url>http://genomebiology.com/2003/4/5/R31</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2003-4-5-r31</pubid>
               <pubid idtype="pmpid">12734011</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>27</day>
               <month>2</month>
               <year>2003</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>18</day>
               <month>3</month>
               <year>2003</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>3</day>
               <month>4</month>
               <year>2003</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>17</day>
               <month>4</month>
               <year>2003</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2003</year>
         <collab>Semple et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all
media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <shorttitle>
         <p>Duplication and selection in the evolution of primate &#946;-defensin genes</p>
      </shorttitle>
      <shortabs>
         <p>Peptides such as &#946;-defensins are an important component in innate immunity. A combination of laboratory and computational techniques have been used to characterize the main human &#946;-defensin locus on chromosome 8p22-p23. The genomic structures and expression patterns of four novel human &#946;-defensin genes and a related pseudogene reveal prior duplication events and selection.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Innate immunity is the first line of defense against microorganisms in vertebrates and acts by providing an initial barrier to microorganisms and triggering adaptive immune responses. Peptides such as &#946;-defensins are an important component of this defense, providing a broad spectrum of antimicrobial activity against bacteria, fungi, mycobacteria and several enveloped viruses. &#946;-defensins are small cationic peptides that vary in their expression patterns and spectrum of pathogen specificity. Disruptions in &#946;-defensin function have been implicated in human diseases, including cystic fibrosis, and a fuller understanding of the variety, function and evolution of human &#946;-defensins might form the basis for novel therapies. Here we use a combination of laboratory and computational techniques to characterize the main human &#946;-defensin locus on chromosome 8p22-p23.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In addition to known genes in the region we report the genomic structures and expression patterns of four novel human &#946;-defensin genes and a related pseudogene. These genes show an unusual pattern of evolution, with rapid divergence between second exon sequences that encode the mature &#946;-defensin peptides matched by relative stasis in first exons that encode signal peptides.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>We conclude that the 8p22-p23 locus has evolved by successive rounds of duplication followed by substantial divergence involving positive selection, to produce a diverse cluster of paralogous genes established before the human-baboon divergence more than 23 million years ago. Positive selection, disproportionately favoring alterations in the charge of amino-acid residues, is implicated as driving second exon divergence in these genes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010011">Immunology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The vertebrate innate immune system provides protection against a wide range of pathogenic microorganisms, and defensins are an important component of this response as well as having a role in adaptive immunity. In mammals, the defensins can be divided into the &#945;- and &#946;-defensin subfamilies on the basis of differences in the spacing of six, conserved cysteine residues. The &#945;-defensins are produced by neutrophils and intestinal Paneth cells, whereas the &#946;-defensins are mainly produced by epithelial cells in contact with the environment. The functions of human &#946;-defensins seem to be disrupted in cystic fibrosis and inflammatory skin lesions such as psoriasis <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. A fuller knowledge of the human complement of &#946;-defensins may therefore be useful in understanding human disease as well as in the design of novel, synthetic antimicrobial peptides.</p>
         <p>The known human &#946;-defensin genes show a conserved two-exon structure: the first exon encodes a signal peptide whereas the second exon encodes a short propiece and the mature defensin peptide with a characteristic six-cysteine motif and many basic amino-acid residues <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. The &#946;-defensin genes are present at five syntenic loci in the human and mouse genomes, with the main locus on human chromosome 8p22-23 and mouse chromosome 8A3 <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. All four, full-length, human &#946;-defensins that are present in the public databases are from 8p22-23 (GenBank sequence accession numbers are human &#946;-defensin 1 or <it>DEFB1</it>, Q09753; <it>DEFB4 </it>(formerly <it>DEFB2</it>), O15263; <it>DEFB103 </it>(formerly <it>DEFB3</it>), NP_061131; <it>DEFB104 </it>(formerly <it>DEFB4</it>), CAC85520), but there are substantial differences in their coding sequences, expression patterns and antimicrobial activities. <it>DEFB1 </it>is constitutively expressed in many tissues (respiratory tract, kidney, urogenital and oral cavity epithelia) whereas <it>DEFB4 </it>is expressed in response to bacterial infection or proinflammatory agonists in respiratory tract epithelial cells, and epidermal and gingival keratinocytes. Both DEFB1 and DEFB4 proteins have salt-sensitive, bactericidal activity against a spectrum of Gram-positive and Gram-negative enteric, urinary tract, and respiratory bacteria <it>in vitro </it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. DEFB103 is expressed in epithelial cells, adult heart, skeletal muscle, placenta and fetal thymus, it has broad-spectrum antimicrobial activity under conditions of low salt and (unusually among &#946;-defensins) it retains activity against <it>Staphylococcus aureus </it>even in physiological saline. <it>DEFB104 </it>achieves highest expression in the testis (with lower levels in gastric antrum, neutrophils, uterus, thyroid, lung and kidney) and was found to be inducible in the respiratory epithelium upon exposure to <it>Pseudomonas aeruginosa </it>or <it>Streptococcus pneumoniae </it><abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
         <p>The evolution of various genes involved in the vertebrate immune system has involved duplication followed by selection to provide responses to a wide range of pathogens, with well documented examples in immunoglobulin <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> and major histocompatibility complex genes <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Hughes and Yeager <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> studied the evolution of &#945;-defensins and found evidence for duplication followed by diversification driven by positive selection. Similar phenomena were also implicated in the evolution of bovine &#946;-defensins <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> and amphibian antimicrobial peptides <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In contrast, <it>DEFB1 </it>was found not to vary significantly across primates <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>Here we describe a combined strategy to identify further &#946;-defensin genes in the draft human genome sequence using computational techniques and verification using reverse transcription-PCR. Four full-length, novel genes (<it>DEFB105</it>, <it>DEFB106</it>, <it>DEFB107</it>, <it>DEFB108</it>) and a related pseudogene <it>DEFB109p </it>are reported, as well as their expression patterns and evidence for their evolution by duplication and positive selection.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>All TBLASTX <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> matches to human bacterial artificial chromosome (BAC) clone sequences were in the 8p22-p23 region in a subsection of FPC contig ctg45 (1 April freeze WashU Accession Map Layout Files <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>) bounded by the BAC clones RP11-161B1 (AC079018) and SCb-177K12 (AF252831) and consisting of 53 BACs in total. These 53 BACs were deemed to represent the human &#946;-defensin gene family locus and were masked for repetitive sequences using RepeatMasker <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. This locus was the subject of a more sensitive search for the presence of novel human &#946;-defensins, using a hidden Markov model constructed from an alignment of the GenBank &#946;-defensin sequences mentioned above. As well as the known, full-length human &#946;-defensins and the related epididymis-specific <it>SPAG11 </it>(formerly <it>EP2</it>) gene <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, two novel &#946;-defensin genes, <it>DEFB105 </it>and <it>DEFB106</it>, were identified in this search and were then incorporated into the previous hidden Markov model. Further searches with this revised model identified a further three genes: <it>DEFB107</it>, <it>DEFB108 </it>and <it>DEFB109p</it>. None of these five genes was found in the EMBL sequence database (24 June 2002 release) or in the Ensembl genomic annotation database (version 6.28.1 <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>). The novel gene <it>DEFB109p </it>appears to be a pseudogene as it contains a premature stop codon within its first exon, as observed in three independently sequenced, overlapping 8p22-p23 BAC sequences (accession numbers AC068974, AC087203 and AF252830). Given the absence of premature stop codons in the other four genes, despite considerable divergence among them (they encode only 18-28% identical amino-acid residues), it is unlikely that they too are pseudogenes.</p>
         <p>Many putative final exon fragments from novel mouse &#946;-defensins were recently reported by Schutte <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. However, their data were incomplete and were not supported by experimental verification. They used computational techniques to identify sequences matching a central portion of the mature defensin peptide including the six-cysteine motif characteristic of &#946;-defensins. At best, this method could only identify incomplete final exons encoding this region of the peptide. No attempt was made to delineate precisely the boundaries of final exons or to identify first exons by these authors. In the present study we restrict our attention to complete genes, present in a BAC-clone-based map of the region and verified as encoding real transcripts by RT-PCR. Our full-length novel genes correspond to five final exon fragments (<it>DEFB5</it>, <it>DEFB6</it>, <it>DEFB7</it>, <it>DEFB8 </it>and <it>DEFB9</it>) reported by Schutte <it>et al. </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and we have adopted the official HUGO Human Gene Nomenclature Committee <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> names for these fragments: <it>DEFB105</it>, <it>DEFB106</it>, <it>DEFB107</it>, <it>DEFB108 </it>and <it>DEFB109p </it>(amended from <it>DEFB109</it>) respectively.</p>
         <p>RT-PCR amplification confirmed the presence of the computationally predicted, functional genes, but sequencing of these products and subsequent alignment to genomic sequence also uncovered several instances where the actual gene structures differed from those that were predicted. In two cases, <it>DEFB106 </it>and <it>DEFB107</it>, splice sites other than those predicted were used and it was found that the predicted coding sequences had been respectively 36 base-pairs (bp) longer and 9 bp shorter than the actual coding sequences. In the case of <it>DEFB105 </it>RT-PCR verified the two predicted exons but also uncovered an additional, 42-bp intervening exon. The RT-PCR analysis also produced interesting results with respect to the coding sequence of <it>DEFB108</it>. In four independent, sequenced RT-PCR products from one mRNA sample, the amplified <it>DEFB108 </it>sequence was consistently found to differ from the computationally predicted sequence from the human public draft sequence at three nucleotide sites, but was otherwise identical. The three differences observed between predicted and amplified sequences respectively were an A to G at nucleotide 62 (which causes a conservative amino-acid change from lysine to arginine), a T to C at nucleotide 111 (which is a synonymous change) and a C to T at nucleotide 120 (also synonymous). This indicates novel human polymorphisms in <it>DEFB108 </it>and is consistent with observations of high degrees of polymorphism for other human &#946;-defensins <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> (see also Additional data files 1-6 for the novel gene sequences confirmed by RT-PCR and the computationally predicted sequence of <it>DEFB109p</it>).</p>
         <p>Figure <figr fid="F1">1a</figr> depicts the relative positions and orientations of the four novel genes, the novel pseudogene <it>DEFB109p </it>and four known genes in the vicinity: <it>DEFB2-4 </it>and the related epididymis-specific <it>SPAG11</it>. This cluster is about 350 kilobases (kb) centromeric of the <it>DEFB1 </it>gene. A phylogenetic tree relating functional human &#946;-defensins (Figure <figr fid="F1">1b</figr>) reflects the spatial distribution of these genes, with the cluster of genes in Figure <figr fid="F1">1a</figr> appearing to form a clade separate from <it>DEFB1</it>. The simplest explanation for the origin of this cluster is a series of local duplication events followed by substantial divergence. In addition it seems that the four novel functional genes are more closely related to <it>DEFB104 </it>than to <it>DEFB4 </it>or <it>DEFB103</it>. Three of these genes, <it>DEFB106</it>, <it>DEFB105 </it>and <it>DEFB108</it>, encode mature peptides exhibiting the same spacing of conserved cysteine residues as seen in <it>DEFB104</it>, although in <it>DEFB105 </it>there is an extra cysteine residue (amino acid 43) towards the amino-terminal end of the mature peptide. <it>DEFB105 </it>also encodes an unusually long propiece peptide region in the second of its three exons. Strikingly, the DEFB107 protein lacks the first canonical &#946;-defensin cysteine altogether and instead has a serine residue at the same position (Figure <figr fid="F2">2</figr>). The changes in the number of cysteine residues seen in DEFB105 and DEFB107 are likely to have important functional consequences. The predicted mature peptides for the four novel, functional &#946;-defensins presented here have a similar proportion of cationic residues to human DEFB104, with higher proportions of anionic residues (10-13%) than are seen in human DEFB1, DEFB4 and DEFB103 (less than 4%). Indeed the pI of DEFB107 and DEFB108 are 6.74 and 6.89 respectively, whereas all other &#946;-defensins described to date are cationic. This relative increase in anionic residues is expected to affect function, as the action of defensins initially involves interactions between the cationic mature defensin peptides and anionic membrane lipids <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Expression analysis for the novel, functional human genes was carried out by RT-PCR on a panel of human RNA samples and the novel gene PCR products were confirmed by hybridization to an internal probe (Figure <figr fid="F3">3</figr>). Expression of all four novel genes was readily detected in testis. A longer exposure period revealed low levels of expression of <it>DEFB108 </it>in the liver (Figure <figr fid="F3">3</figr>, left-hand panel of <it>DEFB108</it>). Expression was not detected in any of the other tissues analyzed.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Genomic organization of novel human &#946;-defensin genes</p>
            </caption>
            <text>
               <p>Genomic organization of novel human &#946;-defensin genes. <b>(a) </b>The genomic organization of novel human &#946;-defensin genes <it>DEFB105</it>, <it>DEFB106</it>, <it>DEFB107</it>, <it>DEFB108 </it>and <it>DEFB109p </it>on 8p22-p23. The horizontal lines represent the three BAC clones in which all novel genes were found. Exons are represented as triangles with the vertical side representing the position of the exon. Exons above the horizontal lines are transcribed from the strand represented by the original BAC clone sequence entries whereas those below are transcribed in the opposite direction from the complementary strand. The region depicted is about 350 kb centromeric of DEFB1. Those genes marked with an asterisk were found to have orthologs in baboon genomic sequences (AC116558 and AC116559). <b>(b) </b>A phylogenetic tree of functional human &#946;-defensins using the prepropeptide sequences encoded by the genes shown in (a). The phylogenetic tree was rooted with chicken gallinacin 1 (GGGAL1; P46156) and the reliability of each branch was assessed using 1,000 bootstrap replications.</p>
            </text>
            <graphic file="gb-2003-4-5-r31-1"/>
         </fig>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Alignment and phylogenetic tree of human and baboon &#946;-defensin protein sequences</p>
            </caption>
            <text>
               <p><b>(a) </b>Alignment and <b>(b) </b>phylogenetic tree of human and baboon &#946;-defensin protein sequences. The tree was rooted with chicken gallinacin 1 (GGGAL1; P46156) and the reliability of each branch was assessed using 1,000 bootstrap replications. The alignment shows the same sequences with the estimated locations of the signal peptide and mature peptide regions; the intervening region is the propiece. The long arrow indicates the position of first introns: in each case except DEFB105 the intron splits the codon that encodes the residue immediately before the arrow. The short arrow indicates the second intron, found only in DEFB105. The shading represents the degree of conservation at each position in the alignment, taking into account similar physicochemical properties of residues. The six canonical cysteines are indicated under the appropriate alignment positions. X at residue 14 denotes the location of the premature stop codon in <it>DEFB109p</it>.</p>
            </text>
            <graphic file="gb-2003-4-5-r31-2"/>
         </fig>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Expression patterns of novel human &#946;-defensin genes</p>
            </caption>
            <text>
               <p>Expression patterns of novel human &#946;-defensin genes. RT-PCR analysis of novel human gene expression carried out on a panel of human RNA samples. The tissues are indicated with a plus (+) and a minus (-) reverse transcriptase reaction shown for each sample. GAPDH RT-PCR was carried out as a control.</p>
            </text>
            <graphic file="gb-2003-4-5-r31-3"/>
         </fig>
         <p>Six genes highly similar (85-98% identical at the amino-acid level) to three of the novel human &#946;-defensins (<it>DEFB105</it>, <it>DEFB106 </it>and <it>DEFB107</it>) as well as to <it>DEFB4</it>, <it>DEFB103 </it>and <it>DEFB104</it>, were found within two olive baboon (<it>Papio cynocephalus anubis</it>) draft genomic sequences (GenBank accession numbers AC116558 and AC116559) using BLAST. Full-length sequences were obtained for each baboon gene except the putative <it>DEFB4 </it>ortholog (which lacks a first exon because of gaps in the draft genomic sequences); in all other cases the exonic structure of the putative baboon ortholog was identical to that of the human gene (Figure <figr fid="F2">2</figr>). It is very likely that these sequences originate from the baboon locus orthologous to the human region under study, but without more complete sequence or mapping data for the baboon genome it is impossible to be certain. These novel baboon gene sequences (see Additional data files for the accession numbers), together with the published sequence for olive baboon &#946;-defensin 1 (AAK61474) and the full-length human &#946;-defensins formed the basis for our evolutionary analyses.</p>
         <p>Figure <figr fid="F4">4</figr> shows d<sub>N </sub>(number of nonsynonymous substitutions per nonsynonymous site) plotted against d<sub>S </sub>(number of synonymous substitutions per synonymous site) for comparisons between the first exon (which encodes the signal peptide) and second exon (which encodes the mature defensin) from all human and baboon genes (the full d<sub>N</sub>, d<sub>S </sub>and d<sub>N </sub>- d<sub>S </sub>estimates for all first and second exon comparisons are available as additional data file 7). Two major trends are observable. In the vast majority of first exon comparisons d<sub>S </sub>exceeds d<sub>N </sub>(Figure <figr fid="F4">4a</figr>) and this excess is statistically significant in almost every case according to two-tailed Z-test results for all human and baboon genes, but there were no significant excesses of d<sub>S </sub>according to the more rigorous Fisher's exact test (data not shown). Thus the rates of substitution in first exons indicate that they are evolving approximately neutrally, perhaps under weak purifying selection. The pattern seen in the second exon comparisons is quite different (Figure <figr fid="F4">4b</figr>). In the second exons d<sub>N </sub>often exceeds d<sub>S</sub>, and even in these short sequences, for certain comparisons this excess reaches statistical significance. Significant excesses of d<sub>N </sub>over d<sub>S </sub>are seen between <it>DEFB1 </it>and <it>DEFB104 </it>and between <it>DEFB103 </it>and <it>DEFB107</it>, with similar effects seen among the second exons of orthologous baboon genes (Table <tblr tid="T1">1</tblr>) using the method of Nei and Gojobori <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>The rates of synonymous and nonsynonymous substitutions within first and second exons</p>
            </caption>
            <text>
               <p>The rates of synonymous and nonsynonymous substitutions within first and second exons. Graphs of the number of synonymous substitutions per synonymous site (d<sub>S</sub>) against the number of nonsynonymous substitutions per nonsynonymous site (d<sub>N</sub>) for comparisons between <b>(a) </b>the first exons and <b>(b) </b>second exons of all genes in the dataset. In each case the diagonal line represents d<sub>N </sub>= d<sub>S</sub>.</p>
            </text>
            <graphic file="gb-2003-4-5-r31-4"/>
         </fig>
         <tbl id="T1" hint_layout="double">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Results of comparisons between the second exons of human and baboon genes demonstrating positive selection</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="2" ca="center">
                     <p><it>DEFB1</it> vs <it>DEFB104</it></p>
                  </c>
                  <c cspan="2" ca="center">
                     <p><it>DEFB103</it> vs <it>DEFB107</it></p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="4">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Homo sapiens</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>P. cynocephalus anubis</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>H. sapiens</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>P. cynocephalus anubis</it>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="5">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>S*</p>
                  </c>
                  <c ca="left">
                     <p>17.917&#177;1.180</p>
                  </c>
                  <c ca="left">
                     <p>17.167&#177;1.169</p>
                  </c>
                  <c ca="left">
                     <p>16.917&#177;1.165</p>
                  </c>
                  <c ca="left">
                     <p>16.833&#177;1.124</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>N</p>
                  </c>
                  <c ca="left">
                     <p>48.083&#177;1.144</p>
                  </c>
                  <c ca="left">
                     <p>48.833&#177;1.181</p>
                  </c>
                  <c ca="left">
                     <p>49.083&#177;1.156</p>
                  </c>
                  <c ca="left">
                     <p>49.167&#177;1.124</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>s</p>
                  </c>
                  <c ca="left">
                     <p>6.25&#177;1.954</p>
                  </c>
                  <c ca="left">
                     <p>6.25&#177;1.954</p>
                  </c>
                  <c ca="left">
                     <p>4.5&#177;1.796</p>
                  </c>
                  <c ca="left">
                     <p>4&#177;1.767</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>n</p>
                  </c>
                  <c ca="left">
                     <p>35.75&#177;3.482</p>
                  </c>
                  <c ca="left">
                     <p>34.75&#177;3.559</p>
                  </c>
                  <c ca="left">
                     <p>27.5&#177;3.382</p>
                  </c>
                  <c ca="left">
                     <p>27&#177;3.373</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>d<sub>S</sub></p>
                  </c>
                  <c ca="left">
                     <p>0.349&#177;0.103</p>
                  </c>
                  <c ca="left">
                     <p>0.364&#177;0.106</p>
                  </c>
                  <c ca="left">
                     <p>0.266&#177;0.104</p>
                  </c>
                  <c ca="left">
                     <p>0.238&#177;0.105</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>d<sub>N</sub></p>
                  </c>
                  <c ca="left">
                     <p>0.744&#177;0.062</p>
                  </c>
                  <c ca="left">
                     <p>0.712&#177;0.063</p>
                  </c>
                  <c ca="left">
                     <p>0.560&#177;0.064</p>
                  </c>
                  <c ca="left">
                     <p>0.549&#177;0.067</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Z-test<sup>&#8224;</sup></p>
                  </c>
                  <c ca="left">
                     <p>0.001</p>
                  </c>
                  <c ca="left">
                     <p>0.005</p>
                  </c>
                  <c ca="left">
                     <p>0.020</p>
                  </c>
                  <c ca="left">
                     <p>0.008</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Fisher's<sup>&#8225;</sup></p>
                  </c>
                  <c ca="left">
                     <p>0.012</p>
                  </c>
                  <c ca="left">
                     <p>0.019</p>
                  </c>
                  <c ca="left">
                     <p>0.060</p>
                  </c>
                  <c ca="left">
                     <p>0.023</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Charge<sup>&#167;</sup></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>p</it>
                        <sub>C</sub>
                     </p>
                  </c>
                  <c ca="left">
                     <p>0.498&#177;0.088</p>
                  </c>
                  <c ca="left">
                     <p>0.535&#177;0.102</p>
                  </c>
                  <c ca="left">
                     <p>0.447&#177;0.094</p>
                  </c>
                  <c ca="left">
                     <p>0.409&#177;0.094</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>p</it>
                        <sub>R</sub>
                     </p>
                  </c>
                  <c ca="left">
                     <p>0.784&#177;0.077</p>
                  </c>
                  <c ca="left">
                     <p>0.830&#177;0.086</p>
                  </c>
                  <c ca="left">
                     <p>0.571&#177;0.087</p>
                  </c>
                  <c ca="left">
                     <p>0.575&#177;0.085</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>p</it><sub>R</sub>/<it>p</it><sub>C</sub></p>
                  </c>
                  <c ca="left">
                     <p>1.57<sup>&#182;</sup></p>
                  </c>
                  <c ca="left">
                     <p>1.55<sup>&#182;</sup></p>
                  </c>
                  <c ca="left">
                     <p>1.28</p>
                  </c>
                  <c ca="left">
                     <p>1.41</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>M-Y<sup>#</sup></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>p</it>
                        <sub>C</sub>
                     </p>
                  </c>
                  <c ca="left">
                     <p>0.577&#177;0.116</p>
                  </c>
                  <c ca="left">
                     <p>0.708&#177;0.108</p>
                  </c>
                  <c ca="left">
                     <p>0.598&#177;0.124</p>
                  </c>
                  <c ca="left">
                     <p>0.581&#177;0.124</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>
                        <it>p</it>
                        <sub>R</sub>
                     </p>
                  </c>
                  <c ca="left">
                     <p>0.618&#177;0.092</p>
                  </c>
                  <c ca="left">
                     <p>0.634&#177;0.100</p>
                  </c>
                  <c ca="left">
                     <p>0.467&#177;0.080</p>
                  </c>
                  <c ca="left">
                     <p>0.446&#177;0.082</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>p</it><sub>R</sub>/<it>p</it><sub>C</sub></p>
                  </c>
                  <c ca="left">
                     <p>1.0</p>
                  </c>
                  <c ca="left">
                     <p>0.90</p>
                  </c>
                  <c ca="left">
                     <p>0.78</p>
                  </c>
                  <c ca="left">
                     <p>0.77</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>*Estimates (6SE) of the number of synonymous sites (S), number of nonsynonymous sites, numbers of synonymous substitutions (s), numbers of nonsynonymous substitutions (n), the number of synonymous substitutions per synonymous site (d<sub>S</sub>) and the number of nonsynonymous substitutions per nonsynonymous site (d<sub>N</sub>). <sup>&#8224;</sup>The result of a two-tailed Z-test of d<sub>N </sub>- d<sub>S </sub>= 0. <sup>&#8225;</sup>The result of a Fisher's exact test. <sup>&#167;</sup>Rates of radical (<it>p</it><sub>R</sub>) and conservative (<it>p</it><sub>C</sub>) changes in amino-acid properties, with the ratio of radical to conservative changes (<it>p</it><sub>R</sub>/<it>p</it><sub>C</sub>) for residues categorized in terms of their charges. <sup>&#182;</sup><it>p</it><sub>R </sub>is significantly greater than <it>p</it><sub>C</sub>. <sup>#</sup>Rates of radical (<it>p</it><sub>R</sub>) and conservative (<it>p</it><sub>C</sub>) changes in amino-acid properties, with the ratio of radical to conservative changes (<it>p</it><sub>R</sub>/<it>p</it><sub>C</sub>) for residues categorized in terms of the Miyata-Yasunaga classification (M-Y; a combination of polarity and volume).</p>
            </tblfn>
         </tbl>
         <p>Moreover, in these comparisons d<sub>S </sub>tends to be rather low relative to the rest of the data set (mean d<sub>S </sub>= 0.464). Similar results are obtained using this method either modified to take account of the transition-to-transversion ratio R <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> or using the Jukes-Cantor correction, although the unmodified method is thought to be a more reliable basis for the detection of positive selection <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. If we assume that synonymous substitutions (which are selectively neutral or nearly so) have accumulated regularly with time, such a pattern of substitution suggests that duplication was followed by rapid nonsynonymous change that subsequently decelerated. Most comparisons involving the second exons of <it>DEFB103</it>, <it>DEFB104 </it>and <it>DEFB107 </it>with the second exons of other genes in the dataset show excesses of d<sub>N </sub>over d<sub>S</sub>, but these excesses fail to reach significance by either of the tests used (data not shown). Although some of the comparisons shown in Figure <figr fid="F4">4b</figr> indicate an excess of d<sub>S </sub>over d<sub>N</sub>, this is never significant by either of the two statistical tests used.</p>
         <p>There is no detectable similarity between the introns of the genes under study except between the five putatively orthologous pairs of human and baboon genes where intronic sequence is available (<it>DEFB103</it>, <it>DEFB104</it>, <it>DEFB105</it>, <it>DEFB106 </it>and <it>DEFB107</it>): each pair shares 74-91% identity over 80-99% of their lengths in spite of many small indel events. In every case these pairs of orthologous introns show a substitution rate that is not significantly different from the value of d<sub>S </sub>obtained for the coding sequences. In particular, the orthologous intron sequence comparisons for <it>DEFB103</it>, <it>DEFB104 </it>and <it>DEFB107 </it>give substitution-rate estimates of 0.1066 &#177; 0.010, 0.0816 &#177; 0.005 and 0.0666 &#177; 0.004 respectively, which are consistent with the d<sub>S </sub>estimates for the coding sequences of these pairs: 0.1666 &#177; 0.077, 0.0856 &#177; 0.058 and 0.0536 &#177; 0.049. Thus, the excess of nonsynonymous substitutions observed for the second exons of these genes is not attributable to artificially low d<sub>S </sub>estimates as a result of sampling error, but is caused by a real increase in d<sub>N </sub>relative to d<sub>S</sub>, which is the pattern expected to be generated by positive selection.</p>
         <p>Averages for the ratio of radical to conservative amino-acid changes, <it>p</it><sub>R</sub>/<it>p</it><sub>C </sub>calculated over 47 mammalian genes were reported as 0.81 and 0.49 for charge and the Miyata-Yasunaga (polarity and volume) classification respectively <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The equivalent values from Table <tblr tid="T1">1</tblr>, for comparisons between the second exons of genes showing evidence of positive selection, are all greater than these averages. Furthermore Table <tblr tid="T1">1</tblr> shows that there has been a higher rate of change with respect to charge than with respect to polarity and volume (Miyata-Yasunaga amino-acid classification). That is, where there is evidence of positive selection, most nonsynonymous changes have tended to change the charges of the residues encoded but have tended to conserve the polarities and volumes of those residues.</p>
         <p>Likelihood ratio tests (LRTs), as implemented by the PAML package <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> also indicate the operation of positive selection at sites within second exons. These tests indicate whether data (the substitutions inferred from an alignment) are best explained by one of two models of &#969; = d<sub>N</sub>/d<sub>S</sub>. Since &#969; is a measure of selective pressure on proteins, these models can be used to assess the evidence for variable selective pressures among sites. In a test for positive selection (the presence of sites at which &#969; > 1) two statistical distributions are compared: a null model that uses a distribution that does not allow for sites with &#969; > 1 and another model which does allow such sites. Three pairs of site-specific likelihood models were compared that assume variable selective pressure (as determined by the value of &#969;) among sites but no variation among sequences in the dataset: M0 (one-ratio) and M3 (discrete), M1 (neutral) and M2 (selection), and M7 (beta) and M8 (beta + &#969;) <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The second exon of the pseudogene <it>DEFB109p </it>was omitted from the analysis. The discrete model (M3) with two site classes suggested that 41% of second-exon sites are under positive selection with &#969;= 1.67 and identified nine amino-acid sites under positive selection at the 95% cutoff. M3 was a significantly better fit to the data than the one-ratio model M0; the LRT statistic is 2&#916;l = -1061.79-(-1004.12) = 115.32, and <it>p </it>&lt; 0.001 with two degrees of freedom. However the M0-M3 comparison is essentially a test of variability in the &#969; ratio among sites and does not constitute a rigorous test of positive selection. Model M1 (neutral) assumes two site classes with &#969; = 0 and &#969;<sub>1 </sub>= 1 fixed and with the proportions p and p<sub>1 </sub>estimated. Model M2 (selection) adds a third site class with the ratio &#969;<sub>2 </sub>estimated, it suggests that about 47% of sites are under positive selection with &#969;<sub>2 </sub>= 53.17 and identified 12 amino-acid sites under positive selection at the 95% cutoff. The two models can be compared using an LRT as follows, 2&#916;l = -1030.51 -(-1022.03) = 16.97; <it>p </it>&lt; 0.001 with 2 df. So model M2 is significantly better than M1. Model M7 (beta) assumes a beta distribution for &#969; over sites. The beta distribution is limited to values between 0 and 1, providing the most flexible null hypothesis for testing positive selection. Model M8 (beta + &#969;) adds another site class to M7 (beta), with the &#969; ratio estimated from the data. However, the difference between M7 and M8 is not statistically significant, as indicated by the LRT: 2&#916;l = -1006.44 -(-1006.44) = 0. Nine particular sites were implicated (in both M2 and M3 models) as being under positive selection with greater than 95% confidence: positions 43, 44, 48, 52, 56, 57, 69, 70 and 73 in Figure <figr fid="F2">2a</figr>. All these positions are close or adjacent to a conserved cysteine residue and so it is possible they are important in determining &#946;-defensin structure. In summary, the PAML analysis indicates that &#969; varies significantly between sites, and in two separate LRTs the parameters estimated suggest a substantial proportion of sites are under positive selection. In spite of this, the most stringent test (M7 versus M8) does not indicate a significant difference from neutrality. This result may be attributable to the extremely short lengths (33 codons aligned omitting positions with gaps) of the sequences aligned, such an effect was seen when analyzing short (130 codons) lysozyme sequences in the same way <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>This study represents the most detailed study of a human &#946;-defensin cluster to date, including the full-length sequences of four novel genes (<it>DEFB105</it>, <it>DEFB106</it>, <it>DEFB107</it>, <it>DEFB108</it>), and a novel pseudogene (<it>DEFB109p</it>), their expression patterns and sequences for the baboon orthologs of six genes from this cluster. The 8p22-23 defensin locus appears to have evolved by successive rounds of duplication followed by substantial divergence, to produce a diverse cluster of paralogous genes defined by these four novel genes and four known &#946;-defensin genes (<it>DEFB4</it>, <it>DEFB103 </it>and <it>DEFB104</it>). Divergence has been most rapid within the second exons of these genes, which encode the mature &#946;-defensin peptide, with many comparisons between paralogous genes showing an excess of nonsynonymous over synonymous substitutions. Statistically significant evidence of elevated nonsynonymous change is seen by two methods in the second exons, indicating the action of positive selection. By contrast, comparisons between the first exons of genes from this cluster, which encode a signal peptide, show an excess of synonymous substitutions consistent with neutral evolution or weak purifying selection. The duplication and subsequent positive selection of these genes predates human-baboon divergence more than 23 million years ago and is consistent with observations that <it>DEFB1 </it>has undergone very little change during the evolution of primates <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. The positive selection observed has tended to change the charges of residues encoded more than other qualities such as residue polarity or volume. As seems to be the case with other antimicrobial peptides, such as MHC receptors, immunoglobulins and &#945;-defensins <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>, this selection may be a response to the rapid evolution of pathogens.</p>
         <p>In this study the computationally predicted gene structures were found by laboratory work to deviate from the actual structures in three out of five novel genes. These errors in the predictions arose in spite of the fact that the predictions were based on all the &#946;-defensin protein-sequence data available and involved three completed BAC sequences rather than unfinished, gapped sequence. This has implications for purely computational approaches to novel gene discovery such as that of Schutte <it>et al</it>. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p><it>DEFB104 </it>and <it>DEFB108 </it>have no detectable orthologs in the mouse genome and therefore appear to have arisen by duplication since the divergence of rodents and primates; alternatively, there could have been a loss of these genes in the rodent lineage. They are also the best candidates for primate-specific &#946;-defensins as they lack orthologs within all other known mammalian defensins, although our knowledge of mammalian defensins is currently incomplete. As we have shown, the evolution of the <it>DEFB104 </it>mature peptide has been driven by positive selection since its emergence, which is consistent with its novel antimicrobial properties <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. All defensins were thought to exist as monomers stabilized by three disulfide bridges between their three pairs of conserved cysteines <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. However, when compared with other known human &#946;-defensins (<it>DEFB1</it>, <it>DEFB4 </it>and <it>DEFB103</it>), <it>DEFB104 </it>was found to have a different number of residues between its second and third, and between its fourth and fifth cysteine residues. Furthermore, <it>DEFB104 </it>was found to have bactericidal activity against <it>Pseudomonas aeruginosa </it>that was more than sixfold stronger than for any other &#946;-defensin <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Three of the novel genes described here (<it>DEFB105</it>, <it>DEFB106 </it>and <it>DEFB108</it>) encode mature peptides exhibiting the same spacing of conserved cysteine residues as seen in <it>DEFB104 </it>as well as sharing similar expression patterns, with highest expression in the testes. The functional divergence of &#946;-defensin genes appears to have continued following human-baboon divergence, as exemplified by <it>DEFB107 </it>which displays a serine residue instead of the first canonical cysteine seen in the baboon ortholog. It is notable that a novel mouse &#946;-defensin gene (<it>Defr1</it>), which also lacks the first canonical cysteine, has potent antimicrobial activity against a spectrum of pathogens <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. In addition a polymorphism in the <it>DEFB1 </it>gene which alters the first canonical cysteine to a serine residue has been shown to produce a peptide which is as active against the microorganisms tested as the usual form <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The unusual amino-acid composition of the proteins encoded by the novel genes presented here suggests that they may possess novel functions, indeed the cationic nature of &#946;-defensins is lost in DEFB107 and DEFB108. As shown recently, there may be more subtle consequences of variation in &#946;-defensin protein sequences, affecting dimerization as well as net charge and disulfide bridges <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. It is worth noting that recent research has identified additional functions for &#946;-defensins that link the innate and adaptive immune response. Both human and mouse &#946;-defensins have been shown to be chemotactic for immature dendritic cells and memory T cells via the CCR6 chemokine receptor <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The mouse &#946;-defensin <it>Defb2 </it>has been shown to act directly on immature dendritic cells as an endogenous ligand for Toll-like receptor 4 (TLR-4), inducing upregulation of co-stimulatory molecules and dendritic cell maturation. These events, in turn, trigger robust, type-1 polarized adaptive immune responses <it>in vivo</it>, which suggests that &#946;-defensins may have an important role in immunosurveillance against pathogens <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
         <p>The expression of the novel antimicrobial peptides reported here in the human male reproductive tract is also of interest. Recent work has shown that the male urogenital tracts in mammals express a broad range of &#945;- and &#946;-defensins, with human <it>DEFB1 </it>expressed in testicular biopsies, seminal plasma and ejaculated spermatozoa <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Together with our results demonstrating that <it>DEFB105</it>, <it>DEFB106</it>, <it>DEFB107 </it>and <it>DEFB108 </it>are predominantly expressed in the testes, these data suggest that the male reproductive tract has a complex innate defense mechanism.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>The 8p22-23 &#946;-defensin locus has evolved by duplication and subsequent divergence, to produce a diverse cluster of paralogous genes defined by four novel genes (<it>DEFB105</it>, <it>DEFB106</it>, <it>DEFB107</it>, <it>DEFB108</it>), a novel pseudogene (<it>DEFB109p</it>), and three known genes (<it>DEFB4</it>, <it>DEFB103 </it>and <it>DEFB104</it>). We present full-length sequences for the four novel genes, their expression patterns and the predicted sequences for the baboon orthologs of six genes from this cluster. Although comparisons among first-exon sequences of these human genes show little variation, the second-exon sequences that encode the mature &#946;-defensin peptides show substantial divergence. Evolutionary analyses suggest that the divergence seen in second exons has involved positive selection disproportionately favoring alterations in the charge of amino-acid residues.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Identification of novel genes</p>
            </st>
            <p>The following mammalian &#946;-defensin sequences were retrieved from GenBank. Mouse &#946;-defensins: 1-11 (NP_031869, NP_034160, NP_038784, NP_062702, NP_109659, NP_473415, NP_631966, CAC44635, NP_631965, CAD26894, CAD26895), &#946;-defensin 13 (NP_631969), &#946;-defensin 15 (NP_631970), &#946;-defensin 35 (NP_631970), defensin-related peptide (AJ344114). Human: <it>DEFB1 </it>(Q09753), <it>DEFB4 </it>(O15263), <it>DEFB103 </it>(NP_061131), <it>DEFB104 </it>(CAC85520). Rat: <it>Rattus norvegicus </it>&#946;-defensin 1 (NP_113998), &#946;-defensin 2 (O88514). Cow: <it>Bos taurus </it>&#946;-defensin 1 (P46159), &#946;-defensin 2 (P46160), &#946;-defensin 3 (P46161), &#946;-defensin 4 (P46162), &#946;-defensin 5 (P46163), &#946;-defensin 6 (P46164), &#946;-defensin 7 (P46165), &#946;-defensin 8 (P46166), &#946;-defensin 9 (P46167), &#946;-defensin 10 (P46168), &#946;-defensin 11 (P46169), &#946;-defensin 12 (P46170), &#946;-defensin 13 (P46171), tracheal antimicrobial peptide (P25068), lingual antimicrobial peptide (Q28880), enteric &#946;-defensin (O02775), &#946;-defensin C7 (O18815). Pig: <it>Sus scrofa </it>&#946;-defensin 1 (O62697). Goat: <it>Capra hircus </it>&#946;-defensin 1 (O97946), &#946;-defensin 2 (CAA08905). Sheep: <it>Ovis aries </it>&#946;-defensin 1 (O19038), &#946;-defensin 2 (O19039). Rhesus monkey: <it>Macaca mulatta </it>&#946;-defensin 1 (O18794), &#946;-defensin 2 (AAK26259). Olive baboon: <it>Pabio cynocephalus anubis </it>&#946;-defensin 1 (AAK61474). Chimpanzee: <it>Pan troglodytes </it>&#946;-defensin 1 (AAF04110), &#946;-defensin 2 (AAF20154), &#946;-defensin 3 (AAK61549). These sequences were used as TBLASTX (version 2.1.1 with default settings <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>) queries against the HTG (high-throughput genomic) section of the EMBL database (15 July 2001 release). Hidden Markov models were constructed using HMMER (version 2.1.1 <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>) to process CLUSTALW (version 1.82 with default settings <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>) multiple sequence alignments. These models were searched against genomic sequence using WISE2 (version 2-1-20c with the human gene model option <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>). CLUSTALW alignments of diverse second-exon sequences were corrected using the patterns of gaps seen in the corresponding protein-sequence alignments that were more highly conserved.</p>
         </sec>
         <sec>
            <st>
               <p>Evolutionary analyses</p>
            </st>
            <p>All phylogenetic trees were constructed by the neighbour-joining method <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> based on the proportion of amino-acid sites at which sequences compared were different and omitting alignment gaps. The trees constructed were rooted with chicken gallinacin 1 (<it>GGGAL1</it>; P46156) and the reliability of each branch was assessed using 1,000 bootstrap replications. In pairwise comparisons between nucleotide sequences, the number of synonymous substitutions per synonymous site (d<sub>S</sub>) and the number of nonsynonymous substitutions per nonsynonymous site (d<sub>N</sub>) were estimated using the method of Nei and Gojobori <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, modified to take account of the transition-to-transversion ratio R <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. R was estimated using the method of Kumar and Nei <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. In addition, the Jukes-Cantor correction <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> was applied to account for multiple substitutions at the same site. Two codon-based tests of selection were used. Both tests are based on estimates of d<sub>S </sub>and d<sub>N</sub>. Standard errors for d<sub>S </sub>and d<sub>N </sub>were calculated using 1,000 bootstrap replicates. In the first test d<sub>S </sub>and d<sub>N </sub>and their respective variances are used in a two-tailed Z-test to test the null hypothesis that d<sub>N </sub>- d<sub>S </sub>= 0 <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. In the second test, Fisher's exact test is used to test the null hypothesis that the proportions of synonymous and nonsynonymous differences are the same <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Additional tests for the presence of sites under positive selection were carried out using the PAML package <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, which uses likelihood ratio tests (LRT) to compare models of the variation in d<sub>N</sub>/d<sub>S </sub>ratio between sites. The six models recommended by Anisimova <it>et al</it>. <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> were tested: M0 (one-ratio), M1 (neutral), M2 (selection), M3 (discrete), M7 (beta), and M8 (beta+&#969;). Intron sequences were aligned using DIALIGN (version 2.1 <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>) and the numbers of substitutions between them were estimated using Kimura's two-parameter method <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. All phylogenetic trees, distance calculations and codon-based tests of selection were carried out using MEGA2 <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Estimates of the proportions of radical and conservative nonsynonymous substitutions, along with their standard errors, were made using the HON-NEW program <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> in an extension of earlier methods for the measurement of conservative and radical substitution rates <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. The radical or conservative nature of nonsynonymous substitutions was assessed with respect to charge and to the polarity and volume of the amino acids (the Miyata-Yasunaga amino-acid classification <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>).</p>
         </sec>
         <sec>
            <st>
               <p>RT-PCR analysis of gene expression</p>
            </st>
            <p>A range of human RNA samples was purchased from Stratagene (Amsterdam, The Netherlands) (stomach, vulva, ovary, kidney, placenta, thyroid, lung, skeletal, uterus, breast, liver, skin, colon, heart and cervix) and human testis RNA (BD Biosciences Clontech, Oxford). cDNA synthesis was carried out using a first-strand cDNA synthesis kit (Roche, Lewes, UK) according to the instructions, using random hexameric oligonucleotides. PCRs were carried out, using 5 ml of the resultant cDNA according to the following procedures: 94&#176;C for 1 min followed by 35 cycles of 94&#176;C for 30 sec, 55&#176;C for 30 sec and 72&#176;C for 1 min, and a final round of extension for 5 min. Products were analyzed on a 4% NuSIEVE agarose gel (FMC BioProducts, Rockland, ME, USA) by electrophoresis and also cloned using pGEM-T Easy Vector System I (Promega, Southampton). Amplification of glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was carried out in parallel with conditions as for the other amplifications, but with an annealing temperature of 56&#176;C. Reactions were verified for RNA amplification by including controls without reverse transcriptase. RT-PCR products were hybridized with a radiolabeled, internal oligonucleotide probe designed to each novel sequence to confirm the presence of the correctly amplified product. Purified plasmid DNA was sequenced from both strands with ABI Prism dRhodamine Terminator Cycle Sequencing Ready Reaction Kit (PE Applied Biosystems, Warrington). Primers and internal probes were designed using the Primer3 primer design program from the Whitehead Institute, Center for Genome Research <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The sequences for the primers and internal probes used were as follows: <it>DEFB105</it>: 5' primer: TCTATTTGCTATGTTCTTCATTTTGG, internal oligo: TTCAACTGCCATCAGGTGAG, 3' primer: GCAGCAGAGAAAGTTCAGCC; <it>DEFB106</it>: 5' primer: CGTGCTCTTCTTTCTGACCC, internal oligo: TACAGGGAAGGTGATCGGAG, 3' primer: GTTCTTCATTTTTCCCGCAA; <it>DEFB107</it>: 5' primer: TTTTGGCTGCTCTCATTCTTC, internal oligo: TCACTGTGAAGCCGAATGTC, 3' primer: TGCAGCAAAATGGTGCTAAT; <it>DEFB108</it>: 5' primer: TGCTGTCCTCTTCTTCACCA, internal oligo: GCCAAGTTCTACCAGCCAAG, 3' primer: CGGCTATTTAAACATCTCCCA.</p>
            <p>The novel human gene sequences for <it>DEFB105</it>, <it>DEFB106</it>, <it>DEFB107</it>, <it>DEFB108 </it>and <it>DEFB109p</it>, as confirmed by RT-PCR, have been deposited in GenBank under sequence accession numbers AF540977, AF540978, AF540979, AF540980 and AF540981 respectively. The putative baboon orthologs of the human <it>DEFB4 </it>(second exon only), <it>DEFB103</it>, <it>DEFB104</it>, <it>DEFB105</it>, <it>DEFB106 </it>and <it>DEFB107 </it>genes have been also been deposited in GenBank under sequence accession numbers BK000556, BK000557, BK000558, BK000559, BK000560 and BK000561 respectively.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following files are available with this article: the DNA sequences for olive baboon &#946;-defensins (Additional data file <supplr sid="s1">1</supplr>), the DNA sequences for human &#946;-defensins (Additional data file <supplr sid="s2">2</supplr>), the aligned baboon and human DNA sequences (Additional data file <supplr sid="s3">3</supplr>), the aligned first-exon baboon and human DNA sequences (Additional data file <supplr sid="s4">4</supplr>), the aligned second-exon baboon and human DNA sequences (Additional data file <supplr sid="s5">5</supplr>) and the DNA sequences for baboon and human introns (Additional data file <supplr sid="s6">6</supplr>) and an Excel file listing the full dN, dS and dN - dS estimates for all first and second exon comparisons shown in Figures 4a and b (Additional data file <supplr sid="s7">7</supplr>). The alignments are in MSF format. The sequence and alignment files are also available from <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>.</p>
         <suppl id="s1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>The DNA sequences for olive baboon beta-defensins</p>
            </caption>
            <text>
               <p>The DNA sequences for olive baboon beta-defensins</p>
            </text>
            <file name="gb-2003-4-5-r31-s1.doc">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>The DNA sequences for human beta-defensins</p>
            </caption>
            <text>
               <p>The DNA sequences for human beta-defensins</p>
            </text>
            <file name="gb-2003-4-5-r31-s2.doc">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>The aligned baboon and human DNA sequences in MSF format</p>
            </caption>
            <text>
               <p>The aligned baboon and human DNA sequences in MSF format</p>
            </text>
            <file name="gb-2003-4-5-r31-s3.msf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>The aligned first-exon baboon and human DNA sequences in MSF format
</p>
            </caption>
            <text>
               <p>The aligned first-exon baboon and human DNA sequences in MSF format
</p>
            </text>
            <file name="gb-2003-4-5-r31-s4.msf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>The aligned second-exon baboon and human DNA sequences in MSF format</p>
            </caption>
            <text>
               <p>The aligned second-exon baboon and human DNA sequences in MSF format</p>
            </text>
            <file name="gb-2003-4-5-r31-s5.msf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>The DNA sequences for baboon and human introns</p>
            </caption>
            <text>
               <p>The DNA sequences for baboon and human introns</p>
            </text>
            <file name="gb-2003-4-5-r31-s6.doc">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>The full d<sub>N</sub>, d<sub>S</sub> and d<sub>N</sub> - d<sub>S</sub> estimates for all first and second exon comparisons shown in Figures 4a and b</p>
            </caption>
            <text>
               <p>The full d<sub>N</sub>, d<sub>S</sub> and d<sub>N</sub> - d<sub>S</sub> estimates for all first and second exon comparisons shown in Figures 4a and b</p>
            </text>
            <file name="gb-2003-4-5-r31-s7.xls">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work benefited from the financial support of the UK Medical Research Council. We would also like to thank Gillian Morrison for her initial involvement in this project.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Human beta-defensin-1 is a salt-sensitive antibiotic in lung that is inactivated in cystic fibrosis.</p>
            </title>
            <aug>
               <au>
                  <snm>Goldman</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Stolzenberg</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Kari</snm>
                  <fnm>UP</fnm>
               </au>
               <au>
                  <snm>Zasloff</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1997</pubdate>
            <volume>88</volume>
            <fpage>553</fpage>
            <lpage>560</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">122330</pubid>
                  <pubid idtype="pmpid" link="fulltext">9038346</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Human &#946;-defensin-2.</p>
            </title>
            <aug>
               <au>
                  <snm>Schroder</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Harder</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Int J Biochem Cell Biol</source>
            <pubdate>1999</pubdate>
            <volume>31</volume>
            <fpage>645</fpage>
            <lpage>651</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">10404637</pubid>
                  <pubid idtype="doi">10.1016/S1357-2725(99)00013-8</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Defensins of vertebrate animals.</p>
            </title>
            <aug>
               <au>
                  <snm>Lehrer</snm>
                  <fnm>RI</fnm>
               </au>
               <au>
                  <snm>Ganz</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Curr Opin Immunol</source>
            <pubdate>2002</pubdate>
            <volume>14</volume>
            <fpage>96</fpage>
            <lpage>102</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99161</pubid>
                  <pubid idtype="pmpid" link="fulltext">11790538</pubid>
                  <pubid idtype="doi">10.1016/S0952-7915(01)00303-X</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Discovery of five conserved &#946;-defensin gene clusters using a computational search strategy.</p>
            </title>
            <aug>
               <au>
                  <snm>Schutte</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Mitros</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Bartlett</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Walters</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jia</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Welsh</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Casavant</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>McCray</snm>
                  <fnm>PB</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>2129</fpage>
            <lpage>2133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">19901</pubid>
                  <pubid idtype="pmpid" link="fulltext">11854508</pubid>
                  <pubid idtype="doi">10.1073/pnas.042692699</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Expression and regulation of the human beta-defensins <it>DEFB1 </it>and <it>DEFB4 </it>in intestinal epithelium.</p>
            </title>
            <aug>
               <au>
                  <snm>O'Neil</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Porter</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Elewaut</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Eckmann</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ganz</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kagnoff</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>J Immunol</source>
            <pubdate>1999</pubdate>
            <volume>163</volume>
            <fpage>6718</fpage>
            <lpage>6724</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10586069</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Evolution of vertebrate immunoglobulin variable gene segments.</p>
            </title>
            <aug>
               <au>
                  <snm>Ota</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sitnikova</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Curr Top Microbiol Immunol</source>
            <pubdate>2000</pubdate>
            <volume>248</volume>
            <fpage>221</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10793480</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Natural selection at major histocompatibility complex loci of vertebrates.</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Yeager</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>1998</pubdate>
            <volume>32</volume>
            <fpage>415</fpage>
            <lpage>435</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.32.1.415</pubid>
                  <pubid idtype="pmpid" link="fulltext">9928486</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Coordinated amino acid changes in the evolution of mammalian defensins.</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Yeager</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1997</pubdate>
            <volume>44</volume>
            <fpage>675</fpage>
            <lpage>682</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9169560</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Evolutionary diversification of the mammalian defensins.</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>AL</fnm>
               </au>
            </aug>
            <source>Cell Mol Life Sci</source>
            <pubdate>1999</pubdate>
            <volume>56</volume>
            <fpage>94</fpage>
            <lpage>103</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s000180050010</pubid>
                  <pubid idtype="pmpid">11213266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Roles of diversifying selection and coordinated evolution in the evolution of amphibian antimicrobial peptides.</p>
            </title>
            <aug>
               <au>
                  <snm>Duda</snm>
                  <fnm>TF</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Vanhoye</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nicolas</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>858</fpage>
            <lpage>864</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12032242</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Beta-defensin 1 gene variability among non-human primates.</p>
            </title>
            <aug>
               <au>
                  <snm>Del Pero</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boniotto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zuccon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cervella</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Spano</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Amoroso</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Crovella</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Immunogenetics</source>
            <pubdate>2002</pubdate>
            <volume>53</volume>
            <fpage>907</fpage>
            <lpage>913</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00251-001-0412-x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11862391</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genome Sequencing Center, Washington University in St. Louis.</p>
            </title>
            <url>http://genome.wustl.edu</url>
         </bibl>
         <bibl id="B14">
            <title>
               <p>RepeatMasker</p>
            </title>
            <url>http://ftp.genome.washington.edu/RM/RepeatMasker.html</url>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Multiple promoter and splicing mRNA variants of the epididymis-specific gene EP2.</p>
            </title>
            <aug>
               <au>
                  <snm>Frohlich</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Po</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>LG</fnm>
               </au>
            </aug>
            <source>J Androl</source>
            <pubdate>2000</pubdate>
            <volume>21</volume>
            <fpage>421</fpage>
            <lpage>430</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10819450</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>The Ensembl genome database project.</p>
            </title>
            <aug>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Barker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cameron</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>38</fpage>
            <lpage>41</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99161</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752248</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.38</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>HUGO Human Gene Nomenclature Committee</p>
            </title>
            <url>http://www.gene.ucl.ac.uk/nomenclature</url>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Genotype-phenotype correlation in cystic fibrosis: The role of modifier genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Salvatore</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Scudiero</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Castaldo</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Am J Med Genet</source>
            <pubdate>2002</pubdate>
            <volume>111</volume>
            <fpage>88</fpage>
            <lpage>95</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/ajmg.10461</pubid>
                  <pubid idtype="pmpid" link="fulltext">12124743</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Structure, function, and membrane integration of defensins.</p>
            </title>
            <aug>
               <au>
                  <snm>White</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Wimley</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Selsted</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>1995</pubdate>
            <volume>5</volume>
            <fpage>521</fpage>
            <lpage>527</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0959-440X(95)80038-7</pubid>
                  <pubid idtype="pmpid">8528769</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions.</p>
            </title>
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1986</pubdate>
            <volume>3</volume>
            <fpage>418</fpage>
            <lpage>426</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3444411</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Positive Darwinian selection after gene duplication in primate ribonuclease genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>HF</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>3708</fpage>
            <lpage>3713</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">19901</pubid>
                  <pubid idtype="pmpid" link="fulltext">9520431</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.7.3708</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Molecular Evolution and Phylogenetics</source>
            <publisher>New York: Oxford University Press</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Rates of conservative and radical nonsynonymous nucleotide substitutions in mammalian nuclear genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2000</pubdate>
            <volume>50</volume>
            <fpage>56</fpage>
            <lpage>68</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10654260</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>PAML: a program package for phylogenetic analysis by maximum likelihood.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <fpage>555</fpage>
            <lpage>556</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9367129</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>908</fpage>
            <lpage>917</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12032247</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Human &#946;-defensin 4: a novel inducible peptide with a specific salt-sensitive spectrum of antimicrobial activity.</p>
            </title>
            <aug>
               <au>
                  <snm>Garcia</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Krause</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schulz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Jimenez</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Kluver</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Adermann</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Forssmann</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Frimpong-Boateng</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bals</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Forssmann</snm>
                  <fnm>WG</fnm>
               </au>
            </aug>
            <source>FASEB J</source>
            <pubdate>2001</pubdate>
            <volume>15</volume>
            <fpage>1819</fpage>
            <lpage>1821</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11481241</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Identification and characterisation of a novel murine &#946;-defensin related gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Morrison</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rofle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kilanowski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cross</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dorin</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Mamm Genome</source>
            <pubdate>2002</pubdate>
            <volume>13</volume>
            <fpage>445</fpage>
            <lpage>451</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00335-002-3014-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">12226710</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Structural and functional characterization of hBD-1(Ser35), a peptide deduced from a <it>DEFB1 </it>polymorphism.</p>
            </title>
            <aug>
               <au>
                  <snm>Circo</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Skerlavaj</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gennaro</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Amoroso</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zanetti</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2002</pubdate>
            <volume>293</volume>
            <fpage>586</fpage>
            <lpage>592</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0006-291X(02)00267-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">12054642</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>The solution structures of the human beta-defensins lead to a better understanding of the potent bactericidal activity of HBD3 against <it>Staphylococcus aureus</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Schibli</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Hunter</snm>
                  <fnm>HN</fnm>
               </au>
               <au>
                  <snm>Aseyev</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Starner</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Wiencek</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>McCray</snm>
                  <fnm>PB</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Tack</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>HJ</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <fpage>8279</fpage>
            <lpage>8289</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M108830200</pubid>
                  <pubid idtype="pmpid" link="fulltext">11741980</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Beta-defensins: linking innate and adaptive immunity through dendritic and T cell CCR6.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chertov</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Bykovskaia</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Buffo</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Shogan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schroder</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Howard</snm>
                  <fnm>OM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <fpage>525</fpage>
            <lpage>528</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.286.5439.525</pubid>
                  <pubid idtype="pmpid" link="fulltext">10521347</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Toll-like receptor 4-dependent activation of dendritic cells by beta-defensin 2.</p>
            </title>
            <aug>
               <au>
                  <snm>Biragyn</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ruffini</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Leifer</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Klyushnenkova</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shakhov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chertov</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Shirakawa</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Farber</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Segal</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Oppenheim</snm>
                  <fnm>JJ</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>1025</fpage>
            <lpage>1029</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075565</pubid>
                  <pubid idtype="pmpid" link="fulltext">12411706</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Expression of antimicrobial defensins in the male reproductive tract of rats, mice, and humans.</p>
            </title>
            <aug>
               <au>
                  <snm>Com</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bourgeon</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Evrard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ganz</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Colleu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jegou</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pineau</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Biol Reprod</source>
            <pubdate>2003</pubdate>
            <volume>68</volume>
            <fpage>95</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12493700</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Profile hidden Markov models.</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>755</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.755</pubid>
                  <pubid idtype="pmpid" link="fulltext">9918945</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Using CLUSTALW for multiple sequence alignments.</p>
            </title>
            <aug>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>1996</pubdate>
            <volume>266</volume>
            <fpage>383</fpage>
            <lpage>402</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8743695</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>WISE2</p>
            </title>
            <url>http://www.sanger.ac.uk/Software/Wise2</url>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The neighbor-joining method: a new method for reconstructing phylogenetic trees.</p>
            </title>
            <aug>
               <au>
                  <snm>Saitou</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1987</pubdate>
            <volume>4</volume>
            <fpage>406</fpage>
            <lpage>425</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3447015</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Evolution of protein molecules.</p>
            </title>
            <aug>
               <au>
                  <snm>Jukes</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Cantor</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Mammalian Protein Metabolism</source>
            <publisher>New York: Academic Press</publisher>
            <editor>Munro HNM</editor>
            <pubdate>1969</pubdate>
            <fpage>21</fpage>
            <lpage>132</lpage>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Small-sample tests of episodic adaptive evolution: a case study of primate lysozymes.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1997</pubdate>
            <volume>14</volume>
            <fpage>1335</fpage>
            <lpage>1338</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9402743</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Accuracy and power of bayes prediction of amino acid sites under positive selection.</p>
            </title>
            <aug>
               <au>
                  <snm>Anisimova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bielawski</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>950</fpage>
            <lpage>958</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12032251</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment.</p>
            </title>
            <aug>
               <au>
                  <snm>Morgenstern</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>211</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.3.211</pubid>
                  <pubid idtype="pmpid" link="fulltext">10222408</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1980</pubdate>
            <volume>16</volume>
            <fpage>111</fpage>
            <lpage>120</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7463489</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>MEGA2: molecular evolutionary genetics analysis software.</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jakobsen</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>1244</fpage>
            <lpage>1245</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/17.12.1244</pubid>
                  <pubid idtype="pmpid" link="fulltext">11751241</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Positive Darwinian selection promotes charge profile diversity in the antigen-binding cleft of class I major-histocompatibility-complex molecules.</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Ota</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1990</pubdate>
            <volume>7</volume>
            <fpage>515</fpage>
            <lpage>524</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2283951</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Two types of amino acid substitutions in protein evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Miyata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Miyazawa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yasunaga</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1979</pubdate>
            <volume>12</volume>
            <fpage>219</fpage>
            <lpage>236</lpage>
            <xrefbib>
               <pubid idtype="pmpid">439147</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Primer3</p>
            </title>
            <url>http://www-genome.wi.mit.edu/genome_software/other/primer3.html</url>
         </bibl>
         <bibl id="B46">
            <title>
               <p>MRC HGU Semple lab: data/software</p>
            </title>
            <url>http://www.hgu.mrc.ac.uk/Users/Colin.Semple/lab_data.html</url>
         </bibl>
      </refgrp>
   </bm>
</art>
