<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2005-6-7-r60</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Method</dochead>
      <bibl>
         <title>
            <p>Analysis of the <it>Macaca mulatta </it>transcriptome and the sequence divergence between <it>Macaca </it>and human</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Magness</snm>
               <mi>L</mi>
               <fnm>Charles</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A2">
               <snm>Fellin</snm>
               <fnm>P Campion</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A3">
               <snm>Thomas</snm>
               <mi>J</mi>
               <fnm>Matthew</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A4">
               <snm>Korth</snm>
               <mi>J</mi>
               <fnm>Marcus</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A5">
               <snm>Agy</snm>
               <mi>B</mi>
               <fnm>Michael</fnm>
               <insr iid="I3"/>
            </au>
            <au id="A6">
               <snm>Proll</snm>
               <mi>C</mi>
               <fnm>Sean</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A7">
               <snm>Fitzgibbon</snm>
               <fnm>Matthew</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A8">
               <snm>Scherer</snm>
               <mi>A</mi>
               <fnm>Christina</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A9">
               <snm>Miner</snm>
               <mi>G</mi>
               <fnm>Douglas</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A10">
               <snm>Katze</snm>
               <mi>G</mi>
               <fnm>Michael</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
            </au>
            <au id="A11" ca="yes">
               <snm>Iadonato</snm>
               <mi>P</mi>
               <fnm>Shawn</fnm>
               <insr iid="I1"/>
               <email>siadonato@illumigen.com</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Illumigen Biosciences Inc., Suite 450, 2203 Airport Way South, Seattle, WA 98134, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Microbiology, University of Washington, Seattle, WA 98195-8070, USA</p>
            </ins>
            <ins id="I3">
               <p>Washington National Primate Research Center, University of Washington, Seattle, WA 98195-8070, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2005</pubdate>
         <volume>6</volume>
         <issue>7</issue>
         <fpage>R60</fpage>
         <url>http://genomebiology.com/2005/6/7/R60</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15998449</pubid>
               <pubid idtype="doi">10.1186/gb-2005-6-7-r60</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>18</day>
               <month>1</month>
               <year>2005</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>4</day>
               <month>4</month>
               <year>2005</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>23</day>
               <month>5</month>
               <year>2005</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>30</day>
               <month>6</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Magness et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Macaque transcriptome and sequence diversity between macaque and human</p>
      </shorttitle>
      <shortabs>
         <p>Putative <it>Macaca mulatta </it>orthologs for over 6,000 human genes have been sequenced from eleven tissues and three species of macaque. Macaque inter- and intraspecific nucleotide diversity is also reported.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>We report the initial sequencing and comparative analysis of the <it>Macaca mulatta </it>transcriptome. Cloned sequences from 11 tissues, nine animals, and three species (<it>M. mulatta</it>, <it>M. fascicularis</it>, and <it>M. nemestrina</it>) were sampled, resulting in the generation of 48,642 sequence reads. These data represent an initial sampling of the putative rhesus orthologs for 6,216 human genes. Mean nucleotide diversity within <it>M. mulatta </it>and sequence divergence among <it>M. fascicularis</it>, <it>M. nemestrina</it>, and <it>M. mulatta </it>are also reported.</p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id=" 30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id=" 30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010015">Model organisms</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The sequencing of genes and genomes has become a hallmark of modern molecular biology. The resulting wealth of nucleotide sequence information has fostered advances in gene discovery, the development of genome-based technologies to study gene expression and function, and a growing interest in comparative genomics. The comparison of the human genome with the genomes of closely related species has particular appeal, and there is considerable interest in identifying genomic traits that set humans apart from other primate species <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. The recent growth in sequence information for the chimpanzee has fueled this interest <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. However, beyond that generated for chimpanzee, there has been remarkably little sequence information developed for other nonhuman primate species.</p>
         <p>The rhesus macaque (<it>Macaca mulatta</it>) is a widely used small primate model of human disease, development, and behavior. Throughout the United States, National Institutes of Health (NIH)-supported facilities house more than 25,000 nonhuman primates, including more than 15,000 rhesus macaques <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Each year, approximately 13,000 nonhuman primates are used for NIH-funded research, 65% of which are rhesus <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. These animals are used principally for infectious disease, pharmacology, and neuroscience research <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. In particular, the rhesus model is an essential tool for acquired immunodeficiency syndrome (AIDS) research and for the development of new drugs and vaccines against human immunodeficiency virus (HIV) <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>We report here on our initial efforts to sequence the rhesus macaque transcriptome. The close evolutionary relationship between rhesus and human, together with the widespread use of rhesus as a model for human reproduction, development, and disease, makes it an ideal candidate for cDNA and genome sequencing. We have constructed cDNA libraries from a selection of diverse macaque tissues and multiple animals, and we have performed single-pass sequencing on 48,642 independent clones. This sequence information has been used to generate a rhesus macaque oligonucleotide microarray and to perform comparative analyses with human.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Sequence data collection and preliminary analysis</p>
            </st>
            <p>We prepared cloned cDNA libraries from 11 <it>M. mulatta </it>tissues derived from nine separate animals. In addition, the liver was independently sampled from one animal each of the <it>M. mulatta</it>, <it>M. nemestrina</it>, and <it>M. fascicularis </it>species. cDNA libraries were prepared by directional lambda-based cloning into <it>Escherichia coli </it>and sequenced using standard fluorescent dye-terminator chemistry. Sequencing was performed from the vector-insert junction distal to the polyadenylate sequence.</p>
            <p>A preliminary dataset of 48,642 independent clone sequences was collected, as summarized in Table <tblr tid="T1">1</tblr>. We screened and analyzed these data as described in Materials and methods. Sequence data quality was assessed using the phred algorithm <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, with a mean of 539 high-quality base-pairs per read over the entire dataset. High-quality sequence bases are defined as those with a computed phred quality value of 20 or greater (Q &#8805; 20), corresponding to an expected error rate of 1% or less. Of the cloned sequences, 9,219 contain a mammalian polyadenylation consensus sequence followed by a polyadenosine tail <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Data meeting minimum quality criteria (<it>n </it>= 36,921) have been submitted to GenBank and contribute to all subsequent analyses. Project data and associated information are also publicly available on the project website <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
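            <p>As an illustration of the quality threshold, the standard phred definition relates a quality value Q to the estimated base-call error probability P by Q = -10 log10(P), so a value of 20 corresponds to an error probability of 0.01. The following is a minimal sketch of that conversion and of counting high-quality bases in a read; the function names and the example quality values are hypothetical and are not part of the pipeline described in this article.</p>

```python
def phred_to_error_probability(q):
    """Convert a phred quality value Q to an estimated error probability.

    Uses the standard phred relation Q = -10 * log10(P), i.e. P = 10 ** (-Q / 10).
    """
    return 10 ** (-q / 10)


def count_high_quality_bases(qualities, threshold=20):
    """Count bases whose phred quality is at or above the threshold.

    `qualities` is a hypothetical list of per-base phred values for one read.
    """
    return sum(1 for q in qualities if q >= threshold)


if __name__ == "__main__":
    # Illustrative values only; these are not data from the study.
    example_read = [10, 20, 35, 19]
    print(phred_to_error_probability(20))
    print(count_high_quality_bases(example_read))
```

            <p>Averaging such per-read counts over a dataset would give a figure comparable in kind to the mean number of high-quality base-pairs per read, although the exact trimming and screening rules applied by the authors may differ from this sketch.</p>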
            <tbl id="T1" hint_layout="single">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Data-collection summary</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Tissue</p>
                     </c>
                     <c ca="center">
                        <p>Sequence reads</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Placenta</p>
                     </c>
                     <c ca="center">
                        <p>12,033</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Brain</p>
                     </c>
                     <c ca="center">
                        <p>10,511</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PBMC</p>
                     </c>
                     <c ca="center">
                        <p>7,056</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Spleen</p>
                     </c>
                     <c ca="center">
                        <p>6,658</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Jejunum</p>
                     </c>
                     <c ca="center">
                        <p>3,840</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Liver</p>
                     </c>
                     <c ca="center">
                        <p>3,744</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ileum</p>
                     </c>
                     <c ca="center">
                        <p>2,112</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lung</p>
                     </c>
                     <c ca="center">
                        <p>1,152</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ovary</p>
                     </c>
                     <c ca="center">
                        <p>672</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Testis</p>
                     </c>
                     <c ca="center">
                        <p>480</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Duodenum</p>
                     </c>
                     <c ca="center">
                        <p>384</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>M. mulatta </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>46,626</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>M. nemestrina </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,152</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>M. fascicularis </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>864</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>48,642</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>We compared each macaque sequence to the mRNA RefSeq <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> component of GenBank using the MEGABLAST algorithm <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The most similar human sequence was identified as the reference sequence with the most significant match by bit score. In some cases, this method will pair macaque and human sequences that are not orthologs, so the resulting assignments should be interpreted with caution. For all subsequent analyses, macaque sequences with equally probable matches to more than one distinct human UniGene cluster <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> have been excluded. Taken together, the entire dataset provides a sampling of the putative macaque orthologs for 6,216 human genes (unique human LocusLink IDs), representing approximately 25% of the human gene content by a recent estimate <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
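            <p>The best-match rule described above, with ambiguous sequences excluded, can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the authors' implementation: each hit is assumed to be available as a (query, UniGene cluster, bit score) tuple, ties are assumed to mean exactly equal top bit scores, and all identifiers in the example are hypothetical.</p>

```python
def assign_best_matches(hits):
    """Pick the top-scoring human cluster for each query sequence.

    `hits` is an iterable of (query_id, cluster_id, bit_score) tuples
    (a hypothetical input layout). Queries whose top bit score is shared
    by more than one distinct cluster are treated as ambiguous and dropped.
    """
    best = {}  # query_id -> (top bit score, set of clusters at that score)
    for query_id, cluster_id, bit_score in hits:
        if query_id not in best or bit_score > best[query_id][0]:
            # New top score: start a fresh set of clusters for this query.
            best[query_id] = (bit_score, {cluster_id})
        elif bit_score == best[query_id][0]:
            # Same top score: record the cluster (duplicates collapse in the set).
            best[query_id][1].add(cluster_id)
    # Keep only queries with exactly one cluster at the top score.
    return {
        query_id: next(iter(clusters))
        for query_id, (_, clusters) in best.items()
        if len(clusters) == 1
    }


if __name__ == "__main__":
    # Hypothetical hits; identifiers and scores are invented for illustration.
    example_hits = [
        ("mac_001", "Hs.1", 900.0),
        ("mac_001", "Hs.2", 850.0),
        ("mac_002", "Hs.3", 700.0),
        ("mac_002", "Hs.4", 700.0),  # tie across two clusters: excluded
    ]
    print(assign_best_matches(example_hits))
```

            <p>Collapsing hits by cluster before testing for ties means that several reference sequences from the same UniGene cluster do not cause a macaque sequence to be discarded; only ties between distinct clusters do.</p>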
            <p>Although libraries were constructed from poly(dT)-primed cDNAs, the dataset includes a significant amount of coding sequence. Of the 6,216 unique human LocusLink IDs that were sampled in macaque, 69.3% include coding sequence (mean aligned coding length = 602 bp), whereas 30.7% include only 5' or 3' untranslated region (UTR) sequence (mean aligned UTR length = 485 bp). Of those 69.3% of genes with sampled coding sequence, the average extent of coding sequence coverage in the macaque database is 49.9% (data not shown).</p>
         </sec>
         <sec>
            <st>
               <p>Similarity of <it>Macaca </it>transcripts with human</p>
            </st>
            <p>We used the initial alignment information from the above data to define a subset of sequences whose alignment with their best human match extended 150 bp in each direction around a well-defined stop codon. This dataset was used to compute the distribution of sequence similarity between macaque and human, shown as histograms in Figure <figr fid="F1">1</figr>. The use of this constrained dataset permitted a direct comparison between the distributions for coding and noncoding sequence in the vicinity of the stop codon. Data for 1,180 macaque-human alignments are included in this analysis. Sequence-similarity distributions are not normal, with a modest tail toward lower values. The average degree of similarity is 97.79 &#177; 1.78% for coding sequence and 95.10 &#177; 4.15% for the 3' UTR. This analysis excludes data where the macaque stop codon was either mutated or in a different location relative to the human reference sequence. It also uses the 3' UTR proximal to the stop codon as a surrogate for all untranslated sequences. However, human-chimp comparative analysis suggests that the 5' UTR may be more divergent between species than other gene regions <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. We did not have a sufficiently large dataset to locate and independently test conservation of the 5' UTR.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Distribution of coding and noncoding sequence similarity between macaque and human</p>
               </caption>
               <text>
                  <p>Distribution of coding and noncoding sequence similarity between macaque and human. A histogram showing the degree of nucleotide sequence similarity between macaque and human for coding (blue) and noncoding (3' UTR, yellow) transcribed sequence. Sequences (<it>n </it>= 1,180) were selected that cross a well defined stop codon and that provide concurrent sampling of 150 bp of sequence both proximal and distal to the stop. The best human match for each macaque sequence was identified using MEGABLAST. The high-quality subset of these data (composed only of contiguous stretches of phred Q &#8805; 20 bp, <it>n </it>= 633) is plotted for both coding (squares) and noncoding (diamonds) sequence.</p>
               </text>
               <graphic file="gb-2005-6-7-r60-1" hint_layout="single"/>
            </fig>
            <p>In order to determine if local regions of poor data quality contribute to biases in the computed degree of sequence similarity, we recomputed the histogram using alignments composed of only high-quality (Q &#8805; 20) sequence. Constraining the dataset to include only high-quality bases (<it>n </it>= 633 sequences) did not result in significant differences in either the shape or the mean of the distributions (Figure <figr fid="F1">1</figr>).</p>
            <p>To provide a reference dataset with which to evaluate the current results, we computed the degree of sequence similarity between human and <it>Pan troglodytes </it>(chimpanzee) using the same method as above. This analysis was performed using chimpanzee expressed sequence tag (EST) and cDNA sequences, as most currently available chimpanzee reference sequences are computationally predicted and therefore lack data from the 3' UTR. However, our chimpanzee-human analysis was hampered by the relative paucity of chimpanzee full-length cDNA and EST sequence in the public databases. There are currently only 209 full-length chimpanzee cDNA sequences and 6,930 EST sequences of varying quality in GenBank.</p>
            <p>Together, these data provide a sampling of the 150 bp proximal and distal to the stop codon for only 134 human genes. On the basis of this small dataset, the degree of nucleotide identity between human and chimpanzee for coding and 3' UTR sequences is 98.3 &#177; 3.0% and 97.65 &#177; 3.2%, respectively (Additional data file 1). As expected, the distribution of sequence similarity is strongly biased toward larger values, with 59.0% of sampled chimpanzee coding sequences and 46.3% of 3' UTR sequences identical to their best human match over the 150-bp window. The distribution of sequence identity between human and chimpanzee is presented in Additional data file 2.</p>
            <p>We expect that most observed nucleotide substitutions between macaque and human within coding sequence will be conservative. To evaluate the degree of similarity between human and macaque at the amino-acid level, we analyzed macaque sequences that overlapped with their best-matching human reference sequence by at least the terminal 450 bp proximal to the stop codon. Data from the terminal 450 bases were favored for this analysis in order to include more of the overall dataset and to be directly comparable to our previous nucleotide-based analysis. We also constrained the dataset to again include only high-quality bases. The distribution of amino-acid similarity was as expected, given the distribution of nucleotide similarity, with a bias toward higher values (Figure <figr fid="F2">2</figr>). The mean similarity between macaque and human protein sequences over the aligned window is 96.83 &#177; 4.95%. A relaxation of data quality constraints resulted in a broadening of the distribution toward lower values (data not shown).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Distribution of amino-acid sequence similarity between human and macaque</p>
               </caption>
               <text>
                  <p>Distribution of amino-acid sequence similarity between human and macaque. Sequencing reads containing the terminal 150 amino acids of each macaque gene were compared to their best human match using MEGABLAST. Only sequences composed of contiguous high-quality bases (phred Q &#8805; 20 bp, <it>n </it>= 320) throughout the terminal 150 amino acids are included. Of these sequences, 5% show less than 88% nucleotide similarity to their best-matching human homolog.</p>
               </text>
               <graphic file="gb-2005-6-7-r60-2" hint_layout="single"/>
            </fig>
            <p>We identified 21 high-quality macaque sequences with weak amino-acid similarity (&lt; 90%) to their best-matching human reference sequence (Table <tblr tid="T2">2</tblr>). Of these, 15 are highly expressed in placenta or immune tissue (peripheral blood mononuclear cells (PBMCs) or spleen mononuclear lymphocytes), are associated with pregnancy or the immune response, or both. The observation of poor sequence identity for immune genes is not surprising, as increased divergence and evidence for positive selection have previously been reported for members of this group <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. The most interesting example of divergence from our study is APOBEC3C, a member of the cytidine deaminase family. Rhesus macaque APOBEC3C is only approximately 85% identical to its putative human ortholog. Members of the APOBEC family are important mediators of lentivirus infection <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, and accelerated evolution has been reported for several members of this gene family <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
            <tbl id="T2" hint_layout="double">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Macaque sequences showing weak identity with best human match</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Gene</p>
                     </c>
                     <c ca="left">
                        <p>Name</p>
                     </c>
                     <c ca="center">
                        <p>RefSeq ID*</p>
                     </c>
                     <c ca="center">
                        <p>Amino-acid identity (%)<sup>&#8224;</sup></p>
                     </c>
                     <c ca="center">
                        <p>Unigene ID*</p>
                     </c>
                     <c ca="center">
                        <p>LocusLink/ Gene ID*</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PSG11 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Pregnancy specific beta-1-glycoprotein 11</p>
                     </c>
                     <c ca="center">
                        <p>NM_203287.1</p>
                     </c>
                     <c ca="center">
                        <p>68.04</p>
                     </c>
                     <c ca="center">
                        <p>Hs.502097</p>
                     </c>
                     <c ca="center">
                        <p>5680</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PSG5 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Pregnancy specific beta-1-glycoprotein 5</p>
                     </c>
                     <c ca="center">
                        <p>NM_002781.2</p>
                     </c>
                     <c ca="center">
                        <p>73.71</p>
                     </c>
                     <c ca="center">
                        <p>Hs.534030</p>
                     </c>
                     <c ca="center">
                        <p>5673</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ANG </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Angiogenin, ribonuclease, RNase A family, 5</p>
                     </c>
                     <c ca="center">
                        <p>NM_001145.2</p>
                     </c>
                     <c ca="center">
                        <p>75.17</p>
                     </c>
                     <c ca="center">
                        <p>Hs.283749</p>
                     </c>
                     <c ca="center">
                        <p>283</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PIP </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Prolactin-induced protein</p>
                     </c>
                     <c ca="center">
                        <p>NM_002652.2</p>
                     </c>
                     <c ca="center">
                        <p>75.86</p>
                     </c>
                     <c ca="center">
                        <p>Hs.99949</p>
                     </c>
                     <c ca="center">
                        <p>5304</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GNLY </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Granulysin</p>
                     </c>
                     <c ca="center">
                        <p>NM_006433.2</p>
                     </c>
                     <c ca="center">
                        <p>76.55</p>
                     </c>
                     <c ca="center">
                        <p>Hs.105806</p>
                     </c>
                     <c ca="center">
                        <p>10578</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>LAIR2 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Leukocyte-associated Ig-like receptor 2</p>
                     </c>
                     <c ca="center">
                        <p>NM_002288.3</p>
                     </c>
                     <c ca="center">
                        <p>80.13</p>
                     </c>
                     <c ca="center">
                        <p>Hs.43803</p>
                     </c>
                     <c ca="center">
                        <p>3904</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CRYL1 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Crystallin, lambda 1</p>
                     </c>
                     <c ca="center">
                        <p>NM_015974.1</p>
                     </c>
                     <c ca="center">
                        <p>80.31</p>
                     </c>
                     <c ca="center">
                        <p>Hs.370703</p>
                     </c>
                     <c ca="center">
                        <p>51084</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ARP10 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ARP10 protein</p>
                     </c>
                     <c ca="center">
                        <p>NM_181773.2</p>
                     </c>
                     <c ca="center">
                        <p>82.58</p>
                     </c>
                     <c ca="center">
                        <p>Hs.440515</p>
                     </c>
                     <c ca="center">
                        <p>164668</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>LOC151174 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical protein LOC151174</p>
                     </c>
                     <c ca="center">
                        <p>XM_371605.1</p>
                     </c>
                     <c ca="center">
                        <p>83.04</p>
                     </c>
                     <c ca="center">
                        <p>Hs.424165</p>
                     </c>
                     <c ca="center">
                        <p>151174</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GH2 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Growth hormone 2</p>
                     </c>
                     <c ca="center">
                        <p>NM_022558.2</p>
                     </c>
                     <c ca="center">
                        <p>84.56</p>
                     </c>
                     <c ca="center">
                        <p>Hs.406754</p>
                     </c>
                     <c ca="center">
                        <p>2689</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>APOBEC3C </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3C</p>
                     </c>
                     <c ca="center">
                        <p>NM_014508.2</p>
                     </c>
                     <c ca="center">
                        <p>85.26</p>
                     </c>
                     <c ca="center">
                        <p>Hs.441124</p>
                     </c>
                     <c ca="center">
                        <p>27350</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>NDUFC2 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>NADH dehydrogenase (ubiquinone) 1, subcomplex unknown, 2</p>
                     </c>
                     <c ca="center">
                        <p>NM_004549.3</p>
                     </c>
                     <c ca="center">
                        <p>85.71</p>
                     </c>
                     <c ca="center">
                        <p>Hs.407860</p>
                     </c>
                     <c ca="center">
                        <p>4718</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SAA4 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Serum amyloid A4</p>
                     </c>
                     <c ca="center">
                        <p>NM_006512.1</p>
                     </c>
                     <c ca="center">
                        <p>85.94</p>
                     </c>
                     <c ca="center">
                        <p>Hs.512677</p>
                     </c>
                     <c ca="center">
                        <p>6291</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SEPP1 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Selenoprotein P, plasma, 1</p>
                     </c>
                     <c ca="center">
                        <p>NM_005410.1</p>
                     </c>
                     <c ca="center">
                        <p>86.07</p>
                     </c>
                     <c ca="center">
                        <p>Hs.275775</p>
                     </c>
                     <c ca="center">
                        <p>6414</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GZMB </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Granzyme B (cytotoxic T-lymphocyte-associated serine esterase 1)</p>
                     </c>
                     <c ca="center">
                        <p>NM_004131.3</p>
                     </c>
                     <c ca="center">
                        <p>86.64</p>
                     </c>
                     <c ca="center">
                        <p>Hs.1051</p>
                     </c>
                     <c ca="center">
                        <p>3002</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>IFITM1 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Interferon induced transmembrane protein 1</p>
                     </c>
                     <c ca="center">
                        <p>NM_003641.2</p>
                     </c>
                     <c ca="center">
                        <p>87.2</p>
                     </c>
                     <c ca="center">
                        <p>Hs.458414</p>
                     </c>
                     <c ca="center">
                        <p>8519</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GH1 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Growth hormone 1</p>
                     </c>
                     <c ca="center">
                        <p>NM_000515.3</p>
                     </c>
                     <c ca="center">
                        <p>87.56</p>
                     </c>
                     <c ca="center">
                        <p>Hs.500468</p>
                     </c>
                     <c ca="center">
                        <p>2688</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TMEM14B </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Transmembrane protein 14B</p>
                     </c>
                     <c ca="center">
                        <p>NM_030969.2</p>
                     </c>
                     <c ca="center">
                        <p>87.72</p>
                     </c>
                     <c ca="center">
                        <p>Hs.273077</p>
                     </c>
                     <c ca="center">
                        <p>81853</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PRG2 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Proteoglycan 2</p>
                     </c>
                     <c ca="center">
                        <p>NM_002728.4</p>
                     </c>
                     <c ca="center">
                        <p>88.35</p>
                     </c>
                     <c ca="center">
                        <p>Hs.512633</p>
                     </c>
                     <c ca="center">
                        <p>5553</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MRPL40 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Mitochondrial ribosomal protein L40</p>
                     </c>
                     <c ca="center">
                        <p>NM_003776.2</p>
                     </c>
                     <c ca="center">
                        <p>88.94</p>
                     </c>
                     <c ca="center">
                        <p>Hs.431307</p>
                     </c>
                     <c ca="center">
                        <p>64976</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GKN1 </it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Gastrokine 1</p>
                     </c>
                     <c ca="center">
                        <p>NM_019617.2</p>
                     </c>
                     <c ca="center">
                        <p>89.07</p>
                     </c>
                     <c ca="center">
                        <p>Hs.69319</p>
                     </c>
                     <c ca="center">
                        <p>56287</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*GenBank identifiers for best matching human homolog. <sup>&#8224;</sup>Amino-acid sequence identity between macaque and human.</p>
               </tblfn>
            </tbl>
            <p>We also identified ten placentally expressed pregnancy-related transcripts with very weak similarity to their putative human orthologs. Prominent among these are the pregnancy-specific glycoproteins (PSG5 and PSG11). For example, the best macaque match to human PSG11 shows only 68% identity and is not better matched to any other member of the human PSG family. Other placentally expressed weak orthologs include the growth mediators angiogenin (ANG) and growth hormones 1 and 2 (GH1 and GH2). Episodic accelerated evolution has previously been reported for both angiogenin and the growth hormones, although its biological and developmental implications are not well understood <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
            <p>We compiled amino-acid similarity data into gene functional groupings using the 'biological process' classifications from the Gene Ontology (GO) Consortium <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> (Table <tblr tid="T3">3</tblr>). Data are shown only for classes containing three or more entries. The data reveal wide variation in class-specific sequence similarity between human and macaque. Highly conserved classes include those involved in intracellular signaling, small GTPase-mediated signal transduction, translation initiation, and protein biosynthesis and folding. Poorly conserved biological process groups include pregnancy, immune response, and inflammatory response. We note that the small size of the dataset is reflected in large standard deviations for several classes of genes.</p>
            <tbl id="T3" hint_layout="double">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Mean amino-acid identity by GO biological process</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Biological process group</p>
                     </c>
                     <c ca="center">
                        <p>Mean identity (%)*</p>
                     </c>
                     <c ca="center">
                        <p>Standard deviation</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pregnancy</p>
                     </c>
                     <c ca="center">
                        <p>80.8</p>
                     </c>
                     <c ca="center">
                        <p>11.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell proliferation</p>
                     </c>
                     <c ca="center">
                        <p>92.7</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Immune response</p>
                     </c>
                     <c ca="center">
                        <p>93.9</p>
                     </c>
                     <c ca="center">
                        <p>4.1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Negative regulation of cell proliferation</p>
                     </c>
                     <c ca="center">
                        <p>94</p>
                     </c>
                     <c ca="center">
                        <p>6.2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Regulation of cell cycle</p>
                     </c>
                     <c ca="center">
                        <p>94.3</p>
                     </c>
                     <c ca="center">
                        <p>5.9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response to oxidative stress</p>
                     </c>
                     <c ca="center">
                        <p>94.3</p>
                     </c>
                     <c ca="center">
                        <p>6.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Inflammatory response</p>
                     </c>
                     <c ca="center">
                        <p>94.4</p>
                     </c>
                     <c ca="center">
                        <p>4.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transport</p>
                     </c>
                     <c ca="center">
                        <p>95</p>
                     </c>
                     <c ca="center">
                        <p>3.1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell-cell signaling</p>
                     </c>
                     <c ca="center">
                        <p>95.5</p>
                     </c>
                     <c ca="center">
                        <p>3.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Apoptosis</p>
                     </c>
                     <c ca="center">
                        <p>95.6</p>
                     </c>
                     <c ca="center">
                        <p>5.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Proteolysis and peptidolysis</p>
                     </c>
                     <c ca="center">
                        <p>96.1</p>
                     </c>
                     <c ca="center">
                        <p>4.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Positive regulation of cell proliferation</p>
                     </c>
                     <c ca="center">
                        <p>96.2</p>
                     </c>
                     <c ca="center">
                        <p>2.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>G-protein coupled receptor protein signaling pathway</p>
                     </c>
                     <c ca="center">
                        <p>96.3</p>
                     </c>
                     <c ca="center">
                        <p>5.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Electron transport</p>
                     </c>
                     <c ca="center">
                        <p>96.3</p>
                     </c>
                     <c ca="center">
                        <p>2.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Development</p>
                     </c>
                     <c ca="center">
                        <p>96.4</p>
                     </c>
                     <c ca="center">
                        <p>4.1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Carbohydrate metabolism</p>
                     </c>
                     <c ca="center">
                        <p>96.7</p>
                     </c>
                     <c ca="center">
                        <p>3.2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Metabolism</p>
                     </c>
                     <c ca="center">
                        <p>96.9</p>
                     </c>
                     <c ca="center">
                        <p>2.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Signal transduction</p>
                     </c>
                     <c ca="center">
                        <p>97</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell growth and/or maintenance</p>
                     </c>
                     <c ca="center">
                        <p>97.2</p>
                     </c>
                     <c ca="center">
                        <p>3.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Angiogenesis</p>
                     </c>
                     <c ca="center">
                        <p>97.3</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Regulation of transcription from Pol II promoter</p>
                     </c>
                     <c ca="center">
                        <p>97.7</p>
                     </c>
                     <c ca="center">
                        <p>2.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mitosis</p>
                     </c>
                     <c ca="center">
                        <p>97.7</p>
                     </c>
                     <c ca="center">
                        <p>1.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ubiquitin cycle</p>
                     </c>
                     <c ca="center">
                        <p>97.7</p>
                     </c>
                     <c ca="center">
                        <p>3.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Antimicrobial humoral response (sensu Vertebrata)</p>
                     </c>
                     <c ca="center">
                        <p>97.8</p>
                     </c>
                     <c ca="center">
                        <p>1.9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ribosome biogenesis</p>
                     </c>
                     <c ca="center">
                        <p>98</p>
                     </c>
                     <c ca="center">
                        <p>1.9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ion transport</p>
                     </c>
                     <c ca="center">
                        <p>98.1</p>
                     </c>
                     <c ca="center">
                        <p>0.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell adhesion</p>
                     </c>
                     <c ca="center">
                        <p>98.3</p>
                     </c>
                     <c ca="center">
                        <p>2.9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Anti-apoptosis</p>
                     </c>
                     <c ca="center">
                        <p>98.5</p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ubiquitin-dependent protein catabolism</p>
                     </c>
                     <c ca="center">
                        <p>98.7</p>
                     </c>
                     <c ca="center">
                        <p>1.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Regulation of transcription, DNA-dependent</p>
                     </c>
                     <c ca="center">
                        <p>98.7</p>
                     </c>
                     <c ca="center">
                        <p>1.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein folding</p>
                     </c>
                     <c ca="center">
                        <p>98.8</p>
                     </c>
                     <c ca="center">
                        <p>1.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Translational initiation</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein biosynthesis</p>
                     </c>
                     <c ca="center">
                        <p>99.1</p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response to stress</p>
                     </c>
                     <c ca="center">
                        <p>99.4</p>
                     </c>
                     <c ca="center">
                        <p>0.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Intracellular protein transport</p>
                     </c>
                     <c ca="center">
                        <p>99.4</p>
                     </c>
                     <c ca="center">
                        <p>0.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Glycolysis</p>
                     </c>
                     <c ca="center">
                        <p>99.6</p>
                     </c>
                     <c ca="center">
                        <p>0.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nuclear mRNA splicing, via spliceosome</p>
                     </c>
                     <c ca="center">
                        <p>99.6</p>
                     </c>
                     <c ca="center">
                        <p>0.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Small GTPase mediated signal transduction</p>
                     </c>
                     <c ca="center">
                        <p>99.7</p>
                     </c>
                     <c ca="center">
                        <p>0.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein transport</p>
                     </c>
                     <c ca="center">
                        <p>99.9</p>
                     </c>
                     <c ca="center">
                        <p>0.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Intracellular signaling cascade</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Mean identity between group members and their best matching human homologs.</p>
               </tblfn>
            </tbl>
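            <p>The class-level summary in Table 3 can be illustrated with a minimal sketch. It assumes each transcript is available as a (gene, GO biological process, percent identity) record, keeps only classes with three or more entries, and reports the mean and the sample standard deviation for each class. The function name, the record layout, the example values, and the choice of sample rather than population standard deviation are assumptions made for illustration; they are not taken from the analysis itself.</p>
            <p>
```python
from collections import defaultdict
from statistics import mean, stdev


def summarize_by_go_class(records, min_entries=3):
    """Group percent identities by GO class and summarize each class.

    records: iterable of (gene, go_class, percent_identity) tuples.
    Returns a dict mapping go_class to (mean, sample standard deviation),
    both rounded to one decimal place and ordered by ascending mean.
    """
    by_class = defaultdict(list)
    for _gene, go_class, identity in records:
        by_class[go_class].append(identity)

    summary = {}
    for go_class, values in by_class.items():
        # Classes with fewer than min_entries members are not reported.
        if len(values) >= min_entries:
            summary[go_class] = (round(mean(values), 1), round(stdev(values), 1))

    # Order classes from least to most conserved.
    return dict(sorted(summary.items(), key=lambda item: item[1][0]))


# Hypothetical records: gene names and identity values are invented.
example_records = [
    ("geneD", "glycolysis", 99.0),
    ("geneE", "glycolysis", 99.5),
    ("geneF", "glycolysis", 100.0),
    ("geneA", "pregnancy", 70.0),
    ("geneB", "pregnancy", 80.0),
    ("geneC", "pregnancy", 90.0),
    ("geneG", "mitosis", 98.0),
    ("geneH", "mitosis", 97.0),
]

summary = summarize_by_go_class(example_records)
for go_class, (class_mean, class_sd) in summary.items():
    print(go_class, class_mean, class_sd)
```
            </p>
            <p>In this toy input the two-member class is dropped by the three-entry threshold, which shows why small classes are absent from the table and why the remaining small classes can carry large standard deviations.</p>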
            <p>These data resemble the results of recent comparative analyses between human and chimpanzee <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B24">24</abbr></abbrgrp>. For example, in chimpanzee, a high degree of sequence conservation and low rates of nonsynonymous substitution were found for several biological classes, including protein transport, small GTPase-mediated signal transduction, regulation of DNA-dependent transcription, intracellular signaling, and glycolysis. However, not all biological functional groups demonstrate consistent conservation among the three species. For example, the signal transduction biological class is highly conserved between chimpanzee and human, whereas its conservation between macaque and human does not deviate significantly from the mean over all classes.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence divergence within and among macaque species</p>
            </st>
            <p>Our dataset includes sequence data from nine <it>M. mulatta</it>, one <it>M. fascicularis</it>, and one <it>M. nemestrina</it>. The breadth of the dataset provides an opportunity to conduct a preliminary analysis of the polymorphism frequency within <it>M. mulatta </it>and the degree of nucleotide divergence between macaque species. We estimated the polymorphism frequency within <it>M. mulatta </it>by assembling sequencing reads from multiple animals for the same gene using phrap <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Polymorphisms were identified by a modified version of phred that calls two alleles at each base in the assembly and assigns each allele a quality score based on combined phred quality values (C.M., unpublished work). High-scoring polymorphisms were manually verified and are presented in Table <tblr tid="T4">4</tblr> for a sample of 24 genes. This analysis includes both coding and noncoding transcribed sequences. The average nucleotide diversity (&#960;) for this gene set in <it>M. mulatta </it>is 15.8 &#177; 12.5 &#215; 10<sup>-4 </sup><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. A large standard deviation in nucleotide diversity across genes is consistent with reports from other primate species <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. The animals included in this analysis were primarily bred from wild-caught parents of Indian origin. A more comprehensive determination of nucleotide diversity will require sequence data from a greater number of genes and animals from multiple geographic locations.</p>
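            <p>As an illustration of the quantity reported in Table 4, nucleotide diversity can be sketched as the average number of pairwise differences per site among aligned allele sequences. The sketch below assumes that the alleles called for each animal have already been aligned to equal length; the function name and the example sequences are hypothetical, and the modified phred allele-calling step is not reproduced here.</p>
            <p>
```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site among aligned sequences.

    sequences: list of equal-length strings, one per sampled allele.
    """
    if not len(sequences) >= 2:
        raise ValueError("at least two sequences are required")
    length = len(sequences[0])
    if length == 0 or any(len(seq) != length for seq in sequences):
        raise ValueError("sequences must be aligned, non-empty, and of equal length")

    pairs = list(combinations(sequences, 2))
    # Count mismatched positions for every pair of sequences.
    total_differences = sum(
        sum(a != b for a, b in zip(first, second)) for first, second in pairs
    )
    return total_differences / (len(pairs) * length)


# Hypothetical alleles: four sequences of length 10, one carrying a variant.
example_alleles = [
    "ACGTACGTAC",
    "ACGTACGTAC",
    "ACGTACGTAT",
    "ACGTACGTAC",
]

example_pi = nucleotide_diversity(example_alleles)
print(example_pi)
```
            </p>
            <p>Because each animal contributes two alleles, a gene sampled in only two animals yields few pairwise comparisons, which is one reason per-gene estimates can vary widely.</p>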
            <tbl id="T4" hint_layout="double">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Estimate of <it>Macaca mulatta </it>nucleotide diversity</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Gene</p>
                     </c>
                     <c ca="center">
                        <p>Comparative length</p>
                     </c>
                     <c ca="center">
                        <p>Number of animals</p>
                     </c>
                     <c ca="center">
                        <p>Nucleotide diversity (&#960;)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ACTB </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,067</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.00110</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ACTG1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>708</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.00290</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>APOA1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>746</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.00200</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>APOA2 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>431</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.00350</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ATF4 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>469</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.00160</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>B2M </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>439</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0.00000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>C15orf15 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>860</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.00117</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CAP1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>667</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.00127</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CCNI </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>547</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.00000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CDC10 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>693</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.00190</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>CTSB </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>967</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.00078</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>EEF1A1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>865</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0.00235</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>EEF1G </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>771</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.00000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ENO1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>793</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.00100</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FTH1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>775</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.00100</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PPID </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>891</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.00148</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>RPL14 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>657</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.00457</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>RPL15 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>631</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.00264</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>RPL3 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>796</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.00339</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>RPS20 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>483</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.00155</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SLC25A5 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>749</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.00088</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TPT1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>740</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0.00000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TXNIP </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>614</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.00000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>UBC </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>824</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.00281</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Mean</p>
                     </c>
                     <c ca="center">
                        <p>0.00158</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>SD</p>
                     </c>
                     <c ca="center">
                        <p>0.00125</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
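            <p>The summary rows of Table 4 can be checked directly from the per-gene values. The following Python sketch, using only the standard library, recomputes the mean and standard deviation of &#960; across the 24 genes. The dictionary name PI_BY_GENE is introduced here for illustration; the values are transcribed from the table.</p>

```python
from statistics import mean, stdev

# Per-gene nucleotide diversity values transcribed from Table 4.
PI_BY_GENE = {
    "ACTB": 0.00110, "ACTG1": 0.00290, "APOA1": 0.00200, "APOA2": 0.00350,
    "ATF4": 0.00160, "B2M": 0.00000, "C15orf15": 0.00117, "CAP1": 0.00127,
    "CCNI": 0.00000, "CDC10": 0.00190, "CTSB": 0.00078, "EEF1A1": 0.00235,
    "EEF1G": 0.00000, "ENO1": 0.00100, "FTH1": 0.00100, "PPID": 0.00148,
    "RPL14": 0.00457, "RPL15": 0.00264, "RPL3": 0.00339, "RPS20": 0.00155,
    "SLC25A5": 0.00088, "TPT1": 0.00000, "TXNIP": 0.00000, "UBC": 0.00281,
}

values = list(PI_BY_GENE.values())
gene_count = len(values)
pi_mean = mean(values)
pi_sd = stdev(values)  # sample standard deviation (n - 1 denominator)

print(f"genes: {gene_count}")
print(f"mean pi: {pi_mean:.5f}")
print(f"sample SD: {pi_sd:.5f}")
```

            <p>Run as written, the sketch reports a mean of 0.00158 and a sample standard deviation of 0.00125, matching the Mean and SD rows of Table 4. The agreement indicates that the tabulated SD uses the n - 1 denominator; the population formula would give 0.00122.</p>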
            <p>We also evaluated the degree of nucleotide sequence divergence among the three macaque species for a sample of 21 genes in this dataset (Table <tblr tid="T5">5</tblr>). Phred and phrap were again used to assemble overlapping sequences from multiple species and to identify species-specific variants, which were then manually confirmed. Given the high degree of nucleotide similarity among the species and the small sample size, the pairwise divergences do not differ from one another by more than their standard deviations. Nevertheless, <it>M. mulatta </it>and <it>M. fascicularis </it>appear more closely related to each other than either is to <it>M. nemestrina</it>: the average sequence divergence between <it>M. mulatta </it>and <it>M. fascicularis </it>is 0.380 &#177; 0.380%, compared with 0.588 &#177; 0.438% between <it>M. mulatta </it>and <it>M. nemestrina </it>and 0.522 &#177; 0.419% between <it>M. fascicularis </it>and <it>M. nemestrina</it>. The dataset is not large enough for any of these pairwise differences to reach statistical significance.</p>
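            <p>The divergences in the text are given in percent, whereas Table 5 lists substitution frequencies per kilobase; one substitution per kilobase corresponds to 0.1%. The Python sketch below shows the conversion for a single table entry. The helper names are hypothetical, and the whole-number substitution count is back-calculated here as an inference; it is not a figure reported in the table.</p>

```python
def per_kb_to_percent(per_kb):
    """Convert substitutions per kilobase to percent divergence."""
    return per_kb / 10.0


def implied_substitutions(per_kb, alignment_length):
    """Back-calculate the nearest whole number of substitutions (an inference)."""
    return round(per_kb * alignment_length / 1000.0)


# AFP row of Table 5, m vs n column: 11.17 per kilobase over 537 positions.
afp_percent = per_kb_to_percent(11.17)
afp_count = implied_substitutions(11.17, 537)
print(f"{afp_percent:.3f}% divergence, about {afp_count} substitutions")
```

            <p>For this entry the sketch gives 1.117% divergence, which corresponds to about six substitutions across the 537 aligned positions.</p>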
            <tbl id="T5" hint_layout="double">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Interspecies substitution rates</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>Gene</p>
                     </c>
                     <c ca="center">
                        <p>Alignment length (bp)</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Number of reads</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Frequency per kilobase</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <it>M.f. </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>M.m. </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>M.n. </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>m </it>vs <it>n* </it></p>
                     </c>
                     <c ca="center">
                        <p><it>m </it>vs <it>f* </it></p>
                     </c>
                     <c ca="center">
                        <p><it>n </it>vs <it>f* </it></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ADH1B </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>819</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>2.44</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>AFP </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>537</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>11.17</p>
                     </c>
                     <c ca="center">
                        <p>7.45</p>
                     </c>
                     <c ca="center">
                        <p>3.72</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ALB </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2,047</p>
                     </c>
                     <c ca="center">
                        <p>> 20</p>
                     </c>
                     <c ca="center">
                        <p>> 20</p>
                     </c>
                     <c ca="center">
                        <p>> 20</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>0.49</p>
                     </c>
                     <c ca="center">
                        <p>0.49</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>AMBP </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>731</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>4.10</p>
                     </c>
                     <c ca="center">
                        <p>1.37</p>
                     </c>
                     <c ca="center">
                        <p>5.47</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ANGPTL3 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>371</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2.70</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>2.70</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>APOA1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>746</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>76</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>10.72</p>
                     </c>
                     <c ca="center">
                        <p>4.02</p>
                     </c>
                     <c ca="center">
                        <p>5.36</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>APOA2 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>431</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>6.96</p>
                     </c>
                     <c ca="center">
                        <p>4.64</p>
                     </c>
                     <c ca="center">
                        <p>2.32</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>APOC4 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>312</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>16.03</p>
                     </c>
                     <c ca="center">
                        <p>12.82</p>
                     </c>
                     <c ca="center">
                        <p>16.03</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>APOE </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>217</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>4.61</p>
                     </c>
                     <c ca="center">
                        <p>4.61</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>APOH </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,007</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>2.98</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>2.98</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>B2M </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>587</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>90</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>11.93</p>
                     </c>
                     <c ca="center">
                        <p>11.93</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>EEF1A1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>920</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>> 20</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FGA </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>379</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>7.92</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>7.92</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FGB </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>407</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>43</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>2.46</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>2.46</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>FGG </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>694</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>1.44</p>
                     </c>
                     <c ca="center">
                        <p>1.44</p>
                     </c>
                     <c ca="center">
                        <p>2.88</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>HPR </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>567</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>3.53</p>
                     </c>
                     <c ca="center">
                        <p>1.76</p>
                     </c>
                     <c ca="center">
                        <p>1.76</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>RPL9 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>680</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>4.41</p>
                     </c>
                     <c ca="center">
                        <p>4.41</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>SERPINC1 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>787</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1.27</p>
                     </c>
                     <c ca="center">
                        <p>2.54</p>
                     </c>
                     <c ca="center">
                        <p>1.27</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>TTR </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>599</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>5.01</p>
                     </c>
                     <c ca="center">
                        <p>3.34</p>
                     </c>
                     <c ca="center">
                        <p>5.01</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>UBC </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>460</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>UGT2B7 </it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>228</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>8.77</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>8.77</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Mean</p>
                     </c>
                     <c ca="center">
                        <p>5.88</p>
                     </c>
                     <c ca="center">
                        <p>3.80</p>
                     </c>
                     <c ca="center">
                        <p>5.22</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Median</p>
                     </c>
                     <c ca="center">
                        <p>3.53</p>
                     </c>
                     <c ca="center">
                        <p>1.44</p>
                     </c>
                     <c ca="center">
                        <p>2.70</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>SD</p>
                     </c>
                     <c ca="center">
                        <p>4.38</p>
                     </c>
                     <c ca="center">
                        <p>3.80</p>
                     </c>
                     <c ca="center">
                        <p>4.19</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Pairwise interspecies substitution frequencies computed on a gene-by-gene basis. <it>M.f.</it>, <it>Macaca fascicularis</it>; <it>M.m.</it>, <it>M. mulatta</it>; <it>M.n.</it>, <it>M. nemestrina</it>.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Putative rhesus sequences without human orthologs</p>
            </st>
            <p>Analysis of the entire dataset revealed a small number of transcribed macaque sequences that had little or no sequence similarity to any human cDNA or genomic sequence (Table <tblr tid="T6">6</tblr>). We speculate that some of these macaque sequences are without orthologs in the human genome. The observation of species-specific transcribed sequences among the primates is consistent with recent comparative analyses of human and chimpanzee <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B29">29</abbr></abbrgrp>. Although an absolute determination of species specificity will require a complete macaque genome sequence, we conducted preliminary computational and PCR-based analyses to test for the presence or absence of these sequences in the human and other primate genomes.</p>
            <tbl id="T6" hint_layout="double">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Macaque sequences without apparent human ortholog</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Class</p>
                     </c>
                     <c ca="left">
                        <p>GenBank Accession</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Ortholog by MEGABLAST*</p>
                     </c>
                     <c ca="center">
                        <p>PCR product length<sup>&#8224;</sup></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>PCR<sup>&#8225;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Human genome</p>
                     </c>
                     <c ca="center">
                        <p>Human EST</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Macaque genome</p>
                     </c>
                     <c ca="center">
                        <p>Human genome</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>I</p>
                     </c>
                     <c ca="left">
                        <p>CX078602</p>
                     </c>
                     <c ca="center">
                        <p>Yes/93%<sup>&#167;</sup></p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>98</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>I</p>
                     </c>
                     <c ca="left">
                        <p>CX078592</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>111</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>I</p>
                     </c>
                     <c ca="left">
                        <p>CX078596</p>
                     </c>
                     <c ca="center">
                        <p>Yes/93%<sup>&#167;</sup></p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>123</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>I</p>
                     </c>
                     <c ca="left">
                        <p>CB552301</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>107</p>
                     </c>
                     <c ca="center">
                        <p>Indeterminate</p>
                     </c>
                     <c ca="center">
                        <p>Indeterminate</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>II</p>
                     </c>
                     <c ca="left">
                        <p>CX078598</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>103</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>II</p>
                     </c>
                     <c ca="left">
                        <p>CX078591</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>111</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>III</p>
                     </c>
                     <c ca="left">
                        <p>CB555845</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>127</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>Indeterminate</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>III</p>
                     </c>
                     <c ca="left">
                        <p>CB552531</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>90</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Defined as a MEGABLAST hit with identity above the cutoff of three standard deviations below the mean. <sup>&#8224;</sup>Primer sequences are available in Materials and methods. <sup>&#8225;</sup>Tested under a variety of thermal cycling conditions and annealing temperatures. <sup>&#167;</sup>Borderline identity values are displayed.</p>
               </tblfn>
            </tbl>
            <p>As above, we used MEGABLAST to test each macaque nucleotide sequence for one or more significant hits to the human EST or genome databases. The absence of an orthologous human sequence was defined as either no significant MEGABLAST hit in the human subset of GenBank or hits with sequence identity more than three standard deviations below the mean as measured over the entire dataset (Figure <figr fid="F1">1</figr>). Because the data were not normally distributed, the identity cutoff (approximately 92.2%) was computed using the geometric mean, which relies on a logarithmic transformation of the data. All sequences meeting this cutoff definition were also outliers based on Tukey's test <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
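            <p>As a minimal sketch of how such a cutoff could be derived, the Python example below log-transforms a set of percent identities, places the threshold three standard deviations below the log-scale mean, and compares the result with Tukey's lower fence. The identity values, the function names, and the 1.5 interquartile-range multiplier are illustrative assumptions; the exact procedure used to obtain the 92.2% cutoff is not reproduced here.</p>

```python
import math
import statistics


def log_scale_cutoff(identities, n_sd=3.0):
    """Return exp(mean(log x) - n_sd * sd(log x)) for percent identities.

    With n_sd = 0 this is the geometric mean of the identities.
    """
    logs = [math.log(x) for x in identities]
    return math.exp(statistics.mean(logs) - n_sd * statistics.stdev(logs))


def tukey_lower_fence(identities, k=1.5):
    """Return Q1 - k * IQR, the lower Tukey fence (k = 1.5 is an assumption)."""
    q1, _, q3 = statistics.quantiles(identities, n=4)
    return q1 - k * (q3 - q1)


# Hypothetical percent identities (NOT data from this study):
# thirty well-conserved hits plus one divergent hit at 85.0%.
identities = [97.0 + 0.1 * i for i in range(-15, 15)] + [85.0]

cutoff = log_scale_cutoff(identities)
fence = tukey_lower_fence(identities)

# Hits falling below each threshold are candidate sequences without orthologs.
low_by_cutoff = [x for x in identities if cutoff > x]
low_by_tukey = [x for x in identities if fence > x]

print(f"log-scale cutoff: {cutoff:.1f}%  flagged: {low_by_cutoff}")
print(f"Tukey lower fence: {fence:.1f}%  flagged: {low_by_tukey}")
```

            <p>With these hypothetical values, both criteria flag only the 85.0% hit, mirroring the agreement between the two outlier definitions noted above.</p>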
            <p>We selected eight of the resulting macaque sequences for PCR-based analysis using genomic DNA from a panel of primates, including human (Table <tblr tid="T6">6</tblr>, Figure <figr fid="F3">3</figr>). The purpose of this analysis was simply to verify the presence or absence of the observed sequences in a panel of primate genomes. Selected primers had an average computed annealing temperature of 59.6 &#177; 0.9&#176;C with an average amplified length of 108 &#177; 12 bp (Materials and methods). For each primer pair, PCR analysis was conducted at several annealing temperatures between 55 and 60&#176;C. Genomic DNA was selected from independent <it>M. nemestrina </it>and <it>M. mulatta </it>animals in order to confirm the presence of these sequences in multiple independent genomes. Of the eight tested primer pairs, two resulted in amplification of consistent bands in both human and macaque genomic DNA, one was indeterminate in human but present in the macaques, one was indeterminate in both human and macaque, and four, while clearly present in the macaque genomes, resulted in no consistent human-specific product under any cycling conditions.</p>
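            <p>The reported amplicon length can be checked against the product lengths listed in Table 6. The short Python sketch below assumes that the &#177; value is a sample standard deviation, which is not stated explicitly; the variable names are illustrative.</p>

```python
import statistics

# Expected PCR product lengths (bp) for the eight primer pairs, as listed in Table 6.
product_lengths = [98, 111, 123, 107, 103, 111, 127, 90]

mean_length = statistics.mean(product_lengths)
# Sample standard deviation (n - 1 denominator); this choice is an assumption.
sd_length = statistics.stdev(product_lengths)

print(f"{mean_length:.2f} +/- {sd_length:.2f} bp")
```

            <p>The result, approximately 108.75 &#177; 12.24 bp, is in line with the 108 &#177; 12 bp given above.</p>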
            <p>The eight tested sequences fall generally into three categories: those with weak sequence similarity to the human genome or human-derived ESTs (class I), those with weak sequence similarity only to genes and proteins from nonhuman species (class II), and those with no significant amino-acid or nucleotide sequence similarity to any GenBank nucleic acid or protein sequence (class III).</p>
            <p>Those with weak similarity to human sequences (class I) include CX078602, a 657-bp cDNA sequence derived from macaque liver with 79-87% nucleotide sequence identity to CYP2C18 from several mammalian species. Its closest matches to human are two regions of 86-93% identity to human chromosome 10, one of which contains four cytochrome P450 2C genes. PCR-based analysis failed to amplify a consistent band from any primate species other than <it>M. nemestrina</it>, <it>M. mulatta</it>, and <it>Lagothrix lagotricha </it>(woolly monkey) (Figure <figr fid="F3">3a</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>PCR analysis of putative macaque-specific sequences</p>
               </caption>
               <text>
                  <p>PCR analysis of putative macaque-specific sequences. PCR primers were developed from high-quality macaque cDNA sequences - <b>(a) </b>CX078602, <b>(b) </b>CX078598, and <b>(c) </b>CB555845 - and used to test for the presence or absence of the resulting amplicons in genomic DNA from 12 primate genomes, including two separate humans. Amplification conditions were the same as in Materials and methods, except that annealing was performed at 55&#176;C. Expected product sizes are as in Table 6. <b>(d) </b>Amplification primers from exon 4 of the human oligoadenylate synthetase 1 gene (<it>OAS1</it>) are included as a positive control, resulting in the expected 648-bp product from most primate species.</p>
               </text>
               <graphic file="gb-2005-6-7-r60-3" hint_layout="double"/>
            </fig>
            <p>Likewise, CX078592 from brain demonstrated 88-90% nucleotide similarity to the <it>IL15RA </it>gene and other immune-derived transcripts, as well as to a region of human chromosome 10 containing <it>IL15RA</it>. PCR primers derived from this sequence amplified multiple specific products from macaque, human, and other primates (data not shown). Similarly, CX078596 from placenta, although having no significant match to any human EST, demonstrated significant similarity to a region of human chromosome 22. CX078596 contained a clear mammalian polyadenylation signal and poly(A) tail, and primers derived from this sequence amplified an appropriately sized product from macaque. Alignment of this sequence with human chromosome 22 revealed a 284-bp insertion in human relative to macaque, which was reflected by amplification of a proportionately larger product in two human genomic DNA samples (data not shown). Finally, although CB552301 from spleen demonstrated significant sequence identity to regions of human chromosomes 4 and 15 and multiple ESTs from UniGene cluster Hs.459311, we failed to amplify a specific product from any primate species using primers derived from this sequence (data not shown).</p>
            <p>The second class of sequences (class II) in Table <tblr tid="T6">6</tblr> had no identified human match, while demonstrating weak sequence identity to nucleic acid or protein sequences from other species. For example, CX078598, a 670-bp transcript from PBMCs, demonstrated weak amino-acid identity (67%) to the endogenous retrovirus (ERV)-BabFc<sup>env </sup>envelope polyprotein, a member of the ERV-F/H family of primate retroviruses <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. PCR with primers derived from CX078598 under a variety of thermal cycling conditions resulted in the consistent amplification of a product of expected size from only <it>M. mulatta </it>and <it>M. nemestrina </it>(Figure <figr fid="F3">3b</figr>). Similarly, CX078591 from macaque brain demonstrated weak amino-acid identity (20-45%) to ariadne homolog 2 (ARIH2/TRIAD1) from rodents and to two unnamed proteins from the puffer fish <it>Tetraodon nigroviridis</it>. Primers derived from this sequence amplified the appropriately sized product only from macaque genomic DNA (data not shown).</p>
            <p>The last class of sequences (class III) in Table <tblr tid="T6">6</tblr> demonstrated no significant similarity to any protein or nucleotide sequence in GenBank (represented by CB555845 and CB552531). Both showed evidence of a mammalian polyadenylation consensus sequence near their 3' termini, with CB552531 additionally demonstrating a clear poly(A) tail. CB555845, a 485-bp sequence from spleen, amplified expected products from both <it>M. nemestrina </it>and <it>M. mulatta</it>. However, this clone was ultimately scored as indeterminate because of its consistently weak amplification of a discrete product from all hominids including human (Figure <figr fid="F3">3c</figr>). CB552531 amplified products of the expected size from macaque species and from <it>Ateles geoffroyi </it>and <it>Lemur catta</it>, but not from human (data not shown).</p>
            <p>It is important to note that PCR-based analysis of divergent sequences is subject to a variety of influences and may result in different conclusions under different conditions. Furthermore, we cannot rule out the possibility that one or more of the sequences in Table <tblr tid="T6">6</tblr> are alternatively spliced relative to human, represent pseudogenes, or derive from genomic DNA contamination. However, upon complete sequencing, each clone in Table <tblr tid="T6">6</tblr> demonstrated either similarity to known expressed sequences or a polyadenylation consensus sequence and poly(A) tail at its 3' terminus.</p>
         </sec>
         <sec>
            <st>
               <p>Development of a macaque-specific expression microarray resource</p>
            </st>
            <p>Genome-based technologies such as DNA microarrays are now commonplace in human biomedical research. Similarly, species-specific arrays exist for model organisms such as the mouse and rat, for which a considerable amount of genome information is available. In contrast, researchers wishing to carry out gene-expression analyses on nonhuman primate cells or tissues are currently forced to use human DNA microarrays. As part of our effort to bring genome-based technologies to researchers using nonhuman primates, we have used ESTs generated by this project to construct a rhesus macaque-specific oligonucleotide microarray.</p>
            <p>Oligonucleotides were designed as described in Materials and methods and arrayed onto glass slides by Agilent Technologies. Briefly, macaque cDNA sequences were assembled into 9,344 distinct clusters using The Institute for Genomic Research (TIGR) clustering tools <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. From these, 7,973 macaque-specific oligonucleotide probes were identified for inclusion on the array. These probes represent the putative macaque equivalent of 3,519 unique human UniGene clusters <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and 3,045 unique human RefSeqs <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. As a quality-control step, we measured tissue-specific differences in gene expression as a means of evaluating whether the oligonucleotides were successfully binding target sequences. For these experiments, we hybridized the microarray with probes derived from RNA isolated from various rhesus macaque tissues. Probes were paired in different combinations and two dye-flipped technical replicates were performed for each pair of samples. Of the 7,973 rhesus macaque oligonucleotides present on the microarray, 6,215 showed differential expression (equal to or greater than twofold; <it>P </it>&#8804; 0.01) in at least one of the three experiments.</p>
            <p>Plots of the log-transformed ratios for genes in each experiment that showed an equal to or greater than twofold difference in expression between two tissues are shown in Figure <figr fid="F4">4</figr>. In each plot, points are colored according to the source library of the sequence used to derive the corresponding oligonucleotide. From this analysis, it is apparent that the majority of genes that were more highly expressed in the spleen correspond to sequences derived from the spleen cDNA library. Similarly, the majority of genes that were more highly expressed in the brain correspond to sequences derived from the brain cDNA library. These results show that a majority of the oligonucleotides were successfully binding target sequences. In addition, it is likely that many of the oligonucleotides that did not measure differential gene expression in these experiments are also successfully binding target sequences, as not all genes would be expected to be expressed in all tissues or to show differential levels of expression between the tissues analyzed.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Tissue-specific gene-expression patterns measured using the rhesus macaque oligonucleotide microarray</p>
               </caption>
               <text>
                  <p>Tissue-specific gene-expression patterns measured using the rhesus macaque oligonucleotide microarray. To evaluate whether arrayed oligonucleotides were binding target sequences, microarrays were hybridized with probes derived from RNA isolated from spleen, brain, or placenta. Probes were paired in different combinations as indicated: <b>(a) </b>spleen vs brain; <b>(b) </b>brain vs placenta; <b>(c) </b>spleen vs placenta. Oligonucleotides that detected a difference in gene expression of twofold or more (<it>P </it>&#8804; 0.01) between two tissues are indicated as colored points and are displayed across the <it>x</it>-axis to facilitate visualization. The <it>y</it>-axis represents the log ratio for differentially expressed genes in each tissue comparison. Points corresponding to oligonucleotides derived from spleen ESTs are colored blue, those from brain are magenta, and those from placenta are orange. Thus, in the comparison depicted in <b>(a)</b>, genes more highly expressed in the spleen are indicated by points in the upper portion of the panel (and are predominantly sequences derived from the spleen cDNA library) and genes more highly expressed in the brain are indicated by points in the lower portion of the panel (and are predominantly sequences derived from the brain cDNA library). Plots were prepared using Spotfire.</p>
               </text>
               <graphic file="gb-2005-6-7-r60-4" hint_layout="double"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Primate models are essential to the study of human biology and disease and to the development of new pharmaceutical products, many of which require primate testing before approval for use in humans. The closest living primate relatives to human are the chimpanzee and other great apes <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Human and chimp lineages diverged from a common ancestor 5-7 million years ago (Mya) and the genomes of the two species are highly conserved <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B24">24</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. Experimental research using chimpanzees and other great apes is, however, significantly hampered by their size, maintenance costs, and endangered species status. The human-like qualities of the chimpanzee also make research using this animal generally unacceptable for ethical reasons. Chimpanzees are therefore rarely used for invasive studies, except when investigating diseases for which there is no other animal model (for example, hepatitis C infection) <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
         <p>Old World monkeys, a group that includes macaque, baboon, and African green monkey, are our closest non-ape relatives. Old World monkeys and humans shared a common ancestor around 25 Mya, and the genomes of these organisms are highly conserved with human <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B35">35</abbr><abbr bid="B38">38</abbr></abbrgrp>. Furthermore, the biology of these organisms is such that they are an appropriate primate model for human physiology and disease. For this and other reasons, Old World monkeys are widely used in biomedical research, with members of the <it>Macaca </it>genus most frequently used <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <p>We report here on the first phase of a study to sequence the rhesus macaque transcriptome. Our group has collected sequence data from 48,642 cDNA clones from nine animals and 11 tissues. For the current study, standard cDNA sequencing methods were used, with an emphasis on large clone-inserts and long sequence read lengths. Alternative methods could have been used for data collection that would have resulted in less 3'-end bias (for example, ORESTES <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>) or reduced redundancy in the collected data (for example, library normalization <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>).</p>
         <p>We determined the average sequence divergence between human and macaque to be 2.21% for coding and 4.90% for noncoding sequence. An identical analysis of transcribed chimpanzee sequences demonstrated divergences of 1.70% and 2.35% for coding and noncoding sequence respectively. This is in comparison to a recently reported mean 1.44% divergence between human chromosome 21 and chimpanzee chromosome 22 over their entire length <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The continued analysis of sequence divergence between the macaque and human species will be important for translating data collected in this primate model to human biology. Recent evidence suggests that even minor inter-species sequence variation can result in large phenotypic differences between macaque models and human disease <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>.</p>
         <p>In addition, we have identified gene functional groups with higher than average sequence divergence at the amino-acid level. In one example, we observe 15% amino-acid sequence divergence between putative human and macaque orthologs of the cytidine deaminase APOBEC3C. Consistent with this observation, Sawyer <it>et al. </it>have reported evidence for accelerated evolution of the primate APOBEC gene family, probably under the selective pressure of viruses <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Members of this family (for example, APOBEC3G) have antiviral activity against lentiviruses and specifically against HIV <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. APOBEC3G is packaged into nascent virions and delivered together with the viral genome into newly infected host cells. The cytidine deaminase cargo results in hypermutation of the replicating virus in target cells, thereby inhibiting virus infection. The Vif proteins of HIV and other lentiviruses bind APOBEC3G and inhibit its antiviral activity. However, the interaction between Vif and APOBEC3G is highly species- and virus-specific. HIV Vif can inhibit the function of human but not simian APOBEC3G <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Likewise, Yu and colleagues have recently reported that human APOBEC3B and APOBEC3C can inhibit SIV but not HIV-1 infection of human cells <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Our observation of poor sequence conservation between macaque and human APOBEC3C is consistent with a model of accelerated evolution under selective pressure for this gene family.</p>
         <p>This dataset has further enabled us to conduct a preliminary analysis of nucleotide diversity within the <it>M. mulatta </it>species and the degree of divergence among <it>M. nemestrina</it>, <it>M. fascicularis</it>, and <it>M. mulatta</it>. Mean nucleotide divergence computed over 24 genes is 15.8 &#177; 12.5 &#215; 10<sup>-4</sup>, approximately twofold greater than that computed for human transcribed sequences by several recent comprehensive studies <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B44">44</abbr></abbrgrp>. Excess nucleotide diversity in macaque versus human is consistent with observations from other primate species. In general, numerous groups have observed increased nucleotide diversity in mitochondrial <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>, sex chromosome <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>, and autosomal DNA <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp> sequences from chimpanzee, bonobo, and gorilla. Consistent with other primate species, this observation is likely to reflect a larger effective population size for macaque throughout evolution relative to human. Our analysis also confirms a high degree of sequence similarity among macaque species, with pairwis