<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-5-r84</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Regulatory conservation of protein coding and microRNA genes in vertebrates: lessons from the opossum genome</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Mahony</snm>
               <fnm>Shaun</fnm>
               <insr iid="I1"/>
               <email>shaun.mahony@ccbb.pitt.edu</email>
            </au>
            <au id="A2">
               <snm>Corcoran</snm>
               <mi>L</mi>
               <fnm>David</fnm>
               <insr iid="I2"/>
               <email>david.corcoran@hgen.pitt.edu</email>
            </au>
            <au id="A3">
               <snm>Feingold</snm>
               <fnm>Eleanor</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>eleanor.feingold@hgen.pitt.edu</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Benos</snm>
               <mi>V</mi>
               <fnm>Panayiotis</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I4"/>
               <email>benos@pitt.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Computational Biology, School of Medicine, University of Pittsburgh, Fifth Avenue, Pittsburgh, PA 15260, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, DeSoto Street, Pittsburgh, PA 15261, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, DeSoto Street, Pittsburgh, PA 15261, USA</p>
            </ins>
            <ins id="I4">
               <p>University of Pittsburgh Cancer Institute, School of Medicine, University of Pittsburgh, Centre Avenue, Pittsburgh, PA 15232, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>5</issue>
         <fpage>R84</fpage>
         <url>http://genomebiology.com/2007/8/5/R84</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17506886</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-5-r84</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>6</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>29</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>16</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>16</day>
               <month>05</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Mahony et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Regulatory conservation</p>
      </shorttitle>
      <shortabs>
         <p>A study of conservation of non-coding sequences, <it>cis</it>-regulatory elements and biological functions of regulated genes in opossum and other vertebrates enables better estimation of promoter conservation and transcription factor binding site turnover among mammals</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Being the first noneutherian mammal sequenced, <it>Monodelphis domestica </it>(opossum) offers great potential for enhancing our understanding of the evolutionary processes that take place in mammals. This study focuses on the evolutionary relationships between conservation of noncoding sequences, <it>cis</it>-regulatory elements, and biologic functions of regulated genes in opossum and eight vertebrate species.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Analysis of 145 intergenic microRNA and all protein coding genes revealed that the upstream sequences of the former are up to twice as conserved as the latter among mammals, except in the first 500 base pairs, where the conservation is similar. Comparison of promoter conservation in 513 protein coding genes and related transcription factor binding sites (TFBSs) showed that 41% of the known human TFBSs are located in the 6.7% of promoter regions that are conserved between human and opossum. Some core biologic processes exhibited significantly fewer conserved TFBSs in human-opossum comparisons, suggesting greater functional divergence. A new measure of efficiency in multigenome phylogenetic footprinting (base regulatory potential rate [BRPR]) shows that including human-opossum conservation increases specificity in finding human TFBSs.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Opossum facilitates better estimation of promoter conservation and TFBS turnover among mammals. The fact that substantial TFBS numbers are located in a small proportion of the human-opossum conserved sequences emphasizes the importance of marsupial genomes for phylogenetic footprinting-based motif discovery strategies. The BRPR measure is expected to help select genome combinations for optimal performance of these algorithms. Finally, although the etiology of the microRNA upstream increased conservation remains unknown, it is expected to have strong implications for our understanding of regulation of their expression.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>One of the prime motivating factors driving the sequencing of vertebrate genomes is the expectation that the role played by the functional regions of the human genome may be discerned by finding molecular level commonalities with and differences from other animals. This is especially true of the newly sequenced opossum (<it>Monodelphis domestica</it>), which is the first completed marsupial genome. Being the first noneutherian mammal sequenced, the opossum helps to clarify which sequence changes occurred before and after the divergence of mammalian ancestors from other vertebrates <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, and has already provided new insight into the evolution of mammalian major histocompatibility complex genes <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. It is also hoped that the opossum genome may yield insights into how gene regulation has evolved in vertebrates.</p>
         <p>In protein coding genes, gene regulation is primarily controlled by short DNA sequences in the vicinity of the gene's transcription start sites (TSSs), which are targets for transcription factor proteins. A high degree of evolutionary conservation of these promoter regions can be attributed to functional <it>cis</it>-regulatory elements. The increased conservation in the biologically more important parts of the promoter region has been explored by various phylogenetic footprinting algorithms, such as PhyloGibbs <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, ConSite <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, rVista <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, and FOOTER <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, to improve the prediction of transcription factor binding sites (TFBSs) in vertebrate genomes. Phylogenetic footprinting is a comparative genomics approach that exploits cross-species sequence conservation in order to predict regulatory genomic elements. In the absence of evolutionary information, TFBSs can be evaluated in terms of sequence similarity scans against frequency matrices derived from alignments of known binding sites for a given transcription factor <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. However, the typical short length of TFBSs (5 to 20 base pairs [bp]) and their inherent level of sequence degeneracy makes them notoriously difficult to predict with any degree of specificity using similarity searches alone <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Phylogenetic footprinting provides a way to reduce the sequence search space to regions that are conserved (and therefore more likely to contain functional elements), thereby improving the specificity of TFBS prediction.</p>
         <p>In order to improve the performance of phylogenetic footprinting algorithms, the evolutionary aspects of the promoter regions and the TFBSs residing in them must be investigated. Evolutionary distance is an important factor in the effectiveness of phylogenetic footprinting techniques. For example, the divergence between chimpanzee and human is generally insufficient to reduce the sequence search space in any meaningful way; conversely, the divergence between <it>Drosophila </it>and human can be too large for any regulatory sequence conservation to be detected. Recently, the maximum sensitivity of phylogenetic footprinting techniques has been measured via estimations of the rate of TFBS 'turnover' between human and rodent genomes <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. We consider that a TFBS has undergone turnover if the sequence in which it resides is not conserved between the species compared. High or low TFBS turnover rates do not necessarily coincide with the rate of changes in the regulatory mechanism (for instance, replacement TFBSs can arise by chance elsewhere in the promoter region or functional TFBSs may still be present in nonconserved regions). Turnover, however, corresponds to the minimum false-negative rate for detection of TFBSs via phylogenetic footprinting, and thus it serves as a critical bound on the success of such algorithms. Human-rodent TFBS turnover has been estimated at between 28% and 40% <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>, suggesting that TFBSs are among the most malleable functional elements in the genomic landscape. However, although rodents and primates diverged relatively recently (approximately 90 million years ago <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>), the shorter generational time of rodents has placed a large degree of dissimilarity between the two clades, as is evident in the human-dog comparisons <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Therefore, TFBS turnover rates will have to be estimated in other mammals before a clearer picture of the selective pressure on mammalian TFBSs can emerge.</p>
         <p>Another major mechanism for control of gene expression is provided by microRNA (miRNA) genes. miRNAs are small (22 to 61 bp long), noncoding RNAs that downregulate their target genes via base complementarity to their mRNA molecules <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Each miRNA can target multiple genes and each gene can be targeted by multiple miRNAs <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. In vertebrates, their expression is tissue specific <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and has been shown to play an important role during development <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. Although some miRNAs are found in the introns of coding genes and therefore are probably regulated by the promoters of the genes in which they reside <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, others are located in the intergenic parts of the genome. Little is known about the transcriptional regulation of these intergenic miRNAs, although RNA polymerase II appears to be involved in the process <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. This suggests that they may have active promoter regions that contain <it>cis</it>-regulatory elements, similar to coding genes. The following question then arises; how does the conservation in the upstream regions of the intergenic miRNA genes compare with that of the protein coding genes? In this respect, opossum and the other vertebrate species provide a broad range of evolutionary distances in which this issue may be addressed.</p>
         <p>In this report we present our findings regarding promoter conservation of all protein coding genes and upstream sequence conservation of intergenic miRNA genes in eight vertebrate genomes as compared with human. To our knowledge, this is the first time that such a comprehensive study has been conducted on potential regulatory regions of both protein coding and miRNA genes in vertebrates. Also, because the opossum genome is placed at an evolutionary midpoint relative to eutherian mammals and nonmammalian vertebrates, using it as an outgroup to the existing eutherian genomes allows for the estimation of the mammalian TFBS turnover rate. Furthermore, the opossum genome provides an opportunity to assess which transcriptional signals and regulatory mechanisms are shared between all mammals. For these reasons, the conservation rates of the promoters of 513 human genes are also analyzed in relation to the turnover of the 1,162 TFBSs they contain. Relationships between conservation of sites and identity of the corresponding transcription factors and their Gene Ontology (GO) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> categories are also investigated. Finally, we computationally re-evaluate the potential of phylogenetic footprinting in the light of the opossum genome and other recently sequenced vertebrates. A new statistical measure, the base regulatory potential rate (BRPR), is introduced to assess the efficiency of both pair-wise and multiple species comparisons in phylogenetic footprinting strategies.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Distribution of conserved blocks in the upstream regions of protein coding and intergenic miRNA genes</p>
            </st>
            <p>Conservation of the 5 kilobases (kb) upstream regions of all RefSeq protein coding genes as well as the known intergenic miRNA genes was calculated using the sliding window approach, as we describe in Materials and methods (below). We chose to focus solely on intergenic miRNAs because intronic miRNAs have been shown to be co-transcribed with their corresponding protein coding genes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Because little is known about the transcriptional regulation of non-intronic miRNA genes, we cannot assess the possible TFBS turnover. We can, however, assess whether the miRNA upstream regions evolve at the same, slower, or faster rate than those of the protein coding genes, and whether their conservation pattern across the upstream region indicates parts of potential biologic importance. The phylogenetic tree of the species examined in this paper is plotted in Figure <figr fid="F1">1</figr>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of the species examined in this study</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of the species examined in this study. This phylogenetic tree is based on the University of California, Santa Cruz (UCSC) multiple alignments. The tree was generated using phyloGif [72].</p>
               </text>
               <graphic file="gb-2007-8-5-r84-1"/>
            </fig>
            <p>Table <tblr tid="T1">1</tblr> presents the number of orthologous genes in each species (derived from the MULTIZ University of California, Santa Cruz [UCSC] synteny-based alignments), the average block coverage of their upstream regions, and the average percentage identity within these conserved blocks. For the calculation of the average percentage identity, the conservation percentage of each block is multiplied by the total length of the block. In other words, the average block conservation corresponds to the number of bases that are identical in all conserved blocks of one promoter over the total length of the blocks in this promoter. The human genes were used as reference for all pair-wise comparisons. Surprisingly, we found that, with the exception of teleosts and chimp, the conservation in the upstream regions of the miRNA genes is 34% to 60% higher on average than that in the protein coding genes. This is independent of the average block identity, which remains practically the same between the two types of genes in these comparisons (Table <tblr tid="T1">1</tblr>). In all nonprimate mammals the average block coverage in the miRNA upstream sequences is significantly higher than that in the promoters of the protein coding genes (Wilcoxon rank-sum test: <it>P </it>= 6 &#215; 10<sup>-4 </sup>for opossum and <it>P </it>= 10<sup>-14 </sup>to 10<sup>-16 </sup>for rodents and dog).</p>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Conservation in the 5 kilobases upstream sequences in all protein coding and intergenic miRNA genes</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>Human versus</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Protein coding genes</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Intergenic miRNA genes</p>
                     </c>
                     <c ca="left">
                        <p>Relative conservation</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Number of orthologous</p>
                     </c>
                     <c ca="left">
                        <p>Block coverage</p>
                     </c>
                     <c ca="left">
                        <p>Average block identity</p>
                     </c>
                     <c ca="left">
                        <p>Number of orthologous</p>
                     </c>
                     <c ca="left">
                        <p>Block coverage</p>
                     </c>
                     <c ca="left">
                        <p>Average block identity</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chimp</p>
                     </c>
                     <c ca="left">
                        <p>23,643</p>
                     </c>
                     <c ca="left">
                        <p>93.03%</p>
                     </c>
                     <c ca="left">
                        <p>98.15%</p>
                     </c>
                     <c ca="left">
                        <p>144</p>
                     </c>
                     <c ca="left">
                        <p>93.46%</p>
                     </c>
                     <c ca="left">
                        <p>98.51%</p>
                     </c>
                     <c ca="left">
                        <p>0.46%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mouse*</p>
                     </c>
                     <c ca="left">
                        <p>22,790</p>
                     </c>
                     <c ca="left">
                        <p>23.30%*</p>
                     </c>
                     <c ca="left">
                        <p>73.53%</p>
                     </c>
                     <c ca="left">
                        <p>142</p>
                     </c>
                     <c ca="left">
                        <p>36.17%*</p>
                     </c>
                     <c ca="left">
                        <p>74.72%</p>
                     </c>
                     <c ca="left">
                        <p>55.24%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Rat*</p>
                     </c>
                     <c ca="left">
                        <p>22,161</p>
                     </c>
                     <c ca="left">
                        <p>22.46%*</p>
                     </c>
                     <c ca="left">
                        <p>73.49%</p>
                     </c>
                     <c ca="left">
                        <p>140</p>
                     </c>
                     <c ca="left">
                        <p>34.95%*</p>
                     </c>
                     <c ca="left">
                        <p>74.68%</p>
                     </c>
                     <c ca="left">
                        <p>55.61%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dog*</p>
                     </c>
                     <c ca="left">
                        <p>23,276</p>
                     </c>
                     <c ca="left">
                        <p>44.36%*</p>
                     </c>
                     <c ca="left">
                        <p>75.58%</p>
                     </c>
                     <c ca="left">
                        <p>145</p>
                     </c>
                     <c ca="left">
                        <p>61.72%*</p>
                     </c>
                     <c ca="left">
                        <p>76.96%</p>
                     </c>
                     <c ca="left">
                        <p>39.13%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Opossum*</p>
                     </c>
                     <c ca="left">
                        <p>17,334</p>
                     </c>
                     <c ca="left">
                        <p>7.28%*</p>
                     </c>
                     <c ca="left">
                        <p>74.90%</p>
                     </c>
                     <c ca="left">
                        <p>104</p>
                     </c>
                     <c ca="left">
                        <p>11.65%*</p>
                     </c>
                     <c ca="left">
                        <p>76.08%</p>
                     </c>
                     <c ca="left">
                        <p>60.03%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chicken</p>
                     </c>
                     <c ca="left">
                        <p>8,087</p>
                     </c>
                     <c ca="left">
                        <p>4.55%</p>
                     </c>
                     <c ca="left">
                        <p>74.87%</p>
                     </c>
                     <c ca="left">
                        <p>54</p>
                     </c>
                     <c ca="left">
                        <p>6.08%</p>
                     </c>
                     <c ca="left">
                        <p>76.80%</p>
                     </c>
                     <c ca="left">
                        <p>33.63%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fugu</p>
                     </c>
                     <c ca="left">
                        <p>6,257</p>
                     </c>
                     <c ca="left">
                        <p>4.13%</p>
                     </c>
                     <c ca="left">
                        <p>72.17%</p>
                     </c>
                     <c ca="left">
                        <p>47</p>
                     </c>
                     <c ca="left">
                        <p>2.73%</p>
                     </c>
                     <c ca="left">
                        <p>73.65%</p>
                     </c>
                     <c ca="left">
                        <p>-33.90%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tetraodon</p>
                     </c>
                     <c ca="left">
                        <p>7,821</p>
                     </c>
                     <c ca="left">
                        <p>3.43%</p>
                     </c>
                     <c ca="left">
                        <p>72.10%</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                     <c ca="left">
                        <p>2.31%</p>
                     </c>
                     <c ca="left">
                        <p>73.40%</p>
                     </c>
                     <c ca="left">
                        <p>-32.65%</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>This table lists the number of genes orthologous to human genes in each of the genomes tested, the percentage of upstream sequence conservation (in >65% block identity), and the weighted average within block identity. Relative conservation (in terms of block coverage) is also listed for the microRNA (miRNA) versus protein coding genes. *Species for which the block coverage of miRNA gene upstream regions is statistically significantly higher than that of the promoters of the protein coding genes.</p>
               </tblfn>
            </tbl>
            <p>In order to investigate this surprising finding further, we plotted the sequence conservation as a function of the distance from the start of the corresponding genes (Figure <figr fid="F2">2</figr>). We found that in the first 500 bp the sequence conservation of the miRNA genes is almost identical to that of the promoters of the protein coding genes (<it>R </it>values > 0.9 and usually much higher; regression <it>t</it>-test: <it>P </it>&lt; 10<sup>-19</sup>). In protein coding genes this is typically the region with the highest concentration of the known <it>cis</it>-regulatory elements. From all known human and mouse TFBSs in TRANSFAC <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, 69.1% and 65.1%, respectively, are annotated as being located in the proximal 500 bp region (data not shown). Interestingly, Lee and coworkers <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> showed that this region is sufficient to drive expression of the miR 23a~27a~24-2 intergenic miRNA gene cluster by RNA polymerase II. Could this be a coincidence? We tested this by analyzing the upstream sequence conservation of the tRNA genes in the human genome (see Materials and methods, below). It has been long established that the <it>cis</it>-regulatory elements of the tRNA genes are located downstream of their transcription start <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. We found that the sequence conservation for the tRNA genes was constant throughout their 5 kb upstream regions (Figure <figr fid="F2">2</figr>; green dashed line).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Upstream sequence conservation of protein coding versus miRNA genes</p>
               </caption>
               <text>
                  <p>Upstream sequence conservation of protein coding versus miRNA genes. Comparison of 5-kilobase upstream sequence conservation between human and various organisms, relative to the transcription start site (TSS; protein-coding, solid blue line) and gene start (intergenic microRNA [miRNA] genes, orange line). The conservation of developmental genes (light blue dotted line) and tRNA genes (green dotted line) are also plotted for comparison purposes. For the plot 100 base pair (bp) intervals were used for the first 500 bp and 500 bp intervals thereafter.</p>
               </text>
               <graphic file="gb-2007-8-5-r84-2"/>
            </fig>
            <p>The conservation rates in both protein coding and miRNA genes decline after the first 500 bp and become almost constant. The difference between these two types of genes is that, in the case of miRNAs, the constant conservation rate is up to twofold higher than that in the protein coding genes for rodents, dog, opossum, and chicken. We found this difference to be statistically significant (Additional data file 1 [Supplementary Figure 2]). Similarly high conservation rates are observed in chimp for both types of genes, probably reflecting the generally high conservation rate throughout the genome. By contrast, similarly low conservation rates are observed for the fugu fish and tetraodon. We note, however, that the higher conservation rates are statistically significant only in the (nonprimate) mammals, including opossum (Additional data file 1).</p>
            <p>It is not clear whether this increased upstream sequence conservation is a general biologic feature of the miRNA upstream regions or is an artifact of the methods used to discover miRNA genes. It is possible, for example, that the known intergenic miRNAs happen to fall in more conserved regions of the genome. This may be related to the way in which the miRNAs were originally identified (through high similarity to known miRNAs). However, it is also possible that because miRNAs are involved in highly regulated vital cell or organismal processes such as development <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, there is a much greater selective pressure on their regulatory regions. We investigate this further by comparing the upstream sequence conservation in the miRNA genes with that of genes identified as developmental according to GO classification (Figure <figr fid="F2">2</figr>; light blue dashed line). We find that the upstream conservation of the developmental genes in all mammals is uniformly higher than the overall average and similar to the conservation of the miRNA genes, especially in the first 2,000 bp. This is true for all species examined, although in the nonmammalian vertebrates the overall upstream sequence conservation for all types of genes is similarly low (10% or lower after the first 500 bp; Figure <figr fid="F2">2</figr>). The fact that miRNA genes have been implicated in the regulation of various developmental processes <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> may partly explain the similar conservation rates in their upstream regions and the promoters of the developmental genes, also indicating that analogous mechanisms and <it>cis</it>-elements may regulate the expression of the corresponding genes. The fact that opossum sequences also exhibit similar conservation patterns, as do the sequences of eutherian species, indicates that mammalian specific evolutionary constraints are in place.</p>
            <p>In summary, the above observations are consistent with the idea that miRNAs are regulated by similar mechanisms as protein coding genes, which was also shown to be true in the few cases studied thus far <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B32">32</abbr></abbrgrp>. As more miRNA genes are identified, the issue of their transcriptional mechanism will warrant further investigation.</p>
            <p>In all of the above pair-wise comparisons, except human-chimp, the average block identity is about the same (72% to 77%; Table <tblr tid="T1">1</tblr>), regardless of the evolutionary distance or the type of gene (protein coding or miRNA). Because the block conservation threshold was 65%, this equivalency indicates that a reduction in the number of conserved blocks rather than a uniform decrease in similarity is responsible for the observed conservation rates. Such a pattern of evolution is expected if the <it>cis</it>-regulatory sites are organized in clusters located in these upstream regions. Such clusters might contain regulatory elements specific to, for instance, primates only, eutherians only, and so on.</p>
         </sec>
         <sec>
            <st>
               <p>Evolutionary turnover of transcription factor binding sites in vertebrates</p>
            </st>
            <p>We now turn to the relationship between promoter conservation of the protein coding genes and the turnover of the <it>cis</it>-regulatory elements located in them. Table <tblr tid="T2">2</tblr> presents the percentage of known human TFBSs that reside in conserved blocks for each pair of genomes tested. The number of such detectable TFBSs in each species differs depending on the number of orthologous genes identified in that species. We note that our analysis focuses on the TFBSs that are located immediately upstream of the protein coding genes (up to 5 kb). This bias is imposed by the available data. It will be interesting to see how our results compare with the evolution of DNA regulatory regions in other parts of the genome.</p>
            <tbl id="T2" hint_layout="double">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Promoter and site conservation between human and eight vertebrate species</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>Human versus</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Promoters</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Sites</p>
                     </c>
                     <c ca="left">
                        <p>BRPR</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Number of orthologous genes</p>
                     </c>
                     <c ca="left">
                        <p>Block coverage</p>
                     </c>
                     <c ca="left">
                        <p>Block nucleotide identity</p>
                     </c>
                     <c ca="left">
                        <p>Number of detectable sites</p>
                     </c>
                     <c ca="left">
                        <p>% detected</p>
                     </c>
                     <c ca="left">
                        <p>Site nucleotide identity</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chimp</p>
                     </c>
                     <c ca="left">
                        <p>512</p>
                     </c>
                     <c ca="left">
                        <p>94.06%</p>
                     </c>
                     <c ca="left">
                        <p>98.27%</p>
                     </c>
                     <c ca="left">
                        <p>1,157</p>
                     </c>
                     <c ca="left">
                        <p>94.81%</p>
                     </c>
                     <c ca="left">
                        <p>98.74%</p>
                     </c>
                     <c ca="left">
                        <p>1.009</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mouse</p>
                     </c>
                     <c ca="left">
                        <p>506</p>
                     </c>
                     <c ca="left">
                        <p>24.20%</p>
                     </c>
                     <c ca="left">
                        <p>73.39%</p>
                     </c>
                     <c ca="left">
                        <p>1,146</p>
                     </c>
                     <c ca="left">
                        <p>72.34%</p>
                     </c>
                     <c ca="left">
                        <p>82.91%</p>
                     </c>
                     <c ca="left">
                        <p>2.887</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Rat</p>
                     </c>
                     <c ca="left">
                        <p>496</p>
                     </c>
                     <c ca="left">
                        <p>23.09%</p>
                     </c>
                     <c ca="left">
                        <p>73.21%</p>
                     </c>
                     <c ca="left">
                        <p>1,129</p>
                     </c>
                     <c ca="left">
                        <p>67.14%</p>
                     </c>
                     <c ca="left">
                        <p>83.00%</p>
                     </c>
                     <c ca="left">
                        <p>2.757</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dog</p>
                     </c>
                     <c ca="left">
                        <p>507</p>
                     </c>
                     <c ca="left">
                        <p>46.05%</p>
                     </c>
                     <c ca="left">
                        <p>75.37%</p>
                     </c>
                     <c ca="left">
                        <p>1,151</p>
                     </c>
                     <c ca="left">
                        <p>73.59%</p>
                     </c>
                     <c ca="left">
                        <p>84.77%</p>
                     </c>
                     <c ca="left">
                        <p>1.535</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Opossum</p>
                     </c>
                     <c ca="left">
                        <p>389</p>
                     </c>
                     <c ca="left">
                        <p>6.72%</p>
                     </c>
                     <c ca="left">
                        <p>74.63%</p>
                     </c>
                     <c ca="left">
                        <p>912</p>
                     </c>
                     <c ca="left">
                        <p>41.23%</p>
                     </c>
                     <c ca="left">
                        <p>83.93%</p>
                     </c>
                     <c ca="left">
                        <p>5.647</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chicken</p>
                     </c>
                     <c ca="left">
                        <p>189</p>
                     </c>
                     <c ca="left">
                        <p>3.21%</p>
                     </c>
                     <c ca="left">
                        <p>74.43%</p>
                     </c>
                     <c ca="left">
                        <p>451</p>
                     </c>
                     <c ca="left">
                        <p>21.73%</p>
                     </c>
                     <c ca="left">
                        <p>85.06%</p>
                     </c>
                     <c ca="left">
                        <p>6.184</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fugu</p>
                     </c>
                     <c ca="left">
                        <p>127</p>
                     </c>
                     <c ca="left">
                        <p>3.25%</p>
                     </c>
                     <c ca="left">
                        <p>72.87%</p>
                     </c>
                     <c ca="left">
                        <p>286</p>
                     </c>
                     <c ca="left">
                        <p>11.89%</p>
                     </c>
                     <c ca="left">
                        <p>83.98%</p>
                     </c>
                     <c ca="left">
                        <p>3.331</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tetraodon</p>
                     </c>
                     <c ca="left">
                        <p>166</p>
                     </c>
                     <c ca="left">
                        <p>2.50%</p>
                     </c>
                     <c ca="left">
                        <p>73.09%</p>
                     </c>
                     <c ca="left">
                        <p>363</p>
                     </c>
                     <c ca="left">
                        <p>12.12%</p>
                     </c>
                     <c ca="left">
                        <p>80.95%</p>
                     </c>
                     <c ca="left">
                        <p>4.227</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Analysis of 1,162 known human transcription factor binding sites (TFBSs) associated with the promoters of 513 human genes between human and eight vertebrate species. The number of genes orthologous to human genes in each species, their conservation block coverage, and their average block identity are presented; also, the number of TFBSs associated with these orthologous genes in each species, the percentage of sites located in conserved regions between species, and the average nucleotide identity within TFBSs are reported. The base regulatory potential rate (BRPR) statistic is calculated from these data for each pair of genomes (see text). Block coverage is the percentage of the upstream region that is covered by conserved blocks (>50 base pairs with >65% identity); the block nucleotide identity is the percentage of nucleotides in all conserved blocks that are identical to the human sequence; and site nucleotide identity the percentage nucleotides in all detected TFBSs that are identical to the human sequence.</p>
               </tblfn>
            </tbl>
            <p>Although we confirm previously estimated rate of human-mouse TFBS turnover <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>, it is particularly interesting that 27% or more of the known human TFBSs are not located in blocks conserved in mammals more distant than rodents (Table <tblr tid="T2">2</tblr>). This does not necessarily mean that the mechanisms of gene regulation have changed accordingly. Functionally equivalent TFBSs are not always located in conserved blocks, as demonstrated in a recent comparison of gene regulation in human and zebrafish RET genes <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Similarly, individual TFBSs that are not conserved between two species may have been functionally replaced by other sites for the same transcription factor in one of the species <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The finding that only about 41% of TFBSs are located in conserved human-opossum blocks is nevertheless surprising, because it points to the relative ease with which individual mammalian TFBSs may be deleted, replaced, or added.</p>
            <p>As expected, TFBS turnover increases with decreasing percentage conservation coverage of the upstream regions. Figure <figr fid="F3">3</figr> shows that opossum has low block conservation similar to that in the nonmammal vertebrate species, but it retains almost twice as many sites as chicken, which is the evolutionarily closest nonmammal. This gives a first qualitative assessment for the potential importance of the opossum genome for identification of TFBSs in phylogenetic footprinting approaches. In general, outside mammalian genomes, the percentage of the detected TFBSs is reduced with increasing evolutionary distance, although the percentage 5 kb upstream coverage remains constant.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Conserved block coverage of the 5 kilobases upstream regions versus TFBS turnover rates</p>
               </caption>
               <text>
                  <p>Conserved block coverage of the 5 kilobases upstream regions versus TFBS turnover rates. A third-order polynomial trendline is fitted for illustration. TFBS, transcription factor binding site.</p>
               </text>
               <graphic file="gb-2007-8-5-r84-3"/>
            </fig>
            <p>Table <tblr tid="T2">2</tblr> also presents the average identity within the conserved TFBSs. With the exception of human-chimp comparisons, the average identity within sites is substantially higher than the average identity in the conserved blocks and relatively constant in all genome comparisons. We found no linear correlation between the block coverage rate and the average block identity in these comparisons (<it>R </it>= 0.48). This finding supports the idea that individual TFBSs are under greater selective pressure than are the wider conserved blocks in mammalian genomes (Wilcoxon test: <it>P </it>= 0.01).</p>
            <p>Finally, Table <tblr tid="T2">2</tblr> presents the BRPR values for each pair of genomes (see Materials and methods, below). BRPR is the likelihood ratio of the posterior probability of a base being regulatory (part of a regulatory site), given that it is in a conserved region, over the <it>a priori </it>probability of being regulatory. In other words, BRPR shows how much we can improve our belief that a base (or a conserved region) is regulatory if we only focus on the conserved blocks between two or more species. One of the most surprising aspects of this study is that, on average, a relatively large percentage of TFBSs (41%) is located in only the 6.72% of the 5 kb promoter regions that are conserved between human and opossum. This gives human-opossum comparisons the second highest BRPR value among the tested pair-wise comparisons, and makes the use of opossum almost twice as effective for finding regulatory elements as the more typically used human-mouse alignments (BRPR 5.647 versus 2.887, respectively). Another interesting finding is that, because of the extensive conservation between human and dog genomes, the human-dog comparisons are not as effective as human-mouse for phylogeny-based motif discovery (Table <tblr tid="T2">2</tblr>). The maximum BRPR value occurs for human-chicken comparisons (BRPR 6.184). However, this value is very close to the opossum BRPR value and, given that only 22% of known TFBSs can be detected as conserved between human and chicken (as opposed to 41% in human-opossum), we suggest that human-opossum comparisons are more effective overall than human-chicken comparisons.</p>
            <p>Phylogenetic footprinting becomes less effective in human-fugu and human-tetraodon comparisons (Table <tblr tid="T2">2</tblr>). The Afrotherian (elephant and tenrec) or Xenarthran (armadillo) genomes that are currently undergoing low-coverage sequencing, as well as the genomes of more distant vertebrates, do not appear to offer any improvement in pair-wise phylogenetic footprinting effectiveness (all are less effective than using the mouse genome; unpublished data). However, they may offer improvement in specificity in multispecies regulatory conservation scans.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic footprinting with multispecies alignments</p>
            </st>
            <p>Thus far, the TFBS turnover rates and BRPR values were used in pair-wise comparisons in order to assess the relative effectiveness of discovering TFBSs via evolutionary conservation. Given the availability of multiple vertebrate genomes, it is naturally expected that combining conservation information from multiple sources will increase the accuracy of phylogenetic footprinting. The following question then arises; which genome combinations offer greater specificity? To address this, we evaluate all possible combinations of tested genomes (256 combinations). In the following, <it>P</it>(<it>C</it>) and <it>P</it>(<it>C|R</it>) are the prior and posterior probability, respectively, that a base is conserved, given that the base is part of a regulatory site. For consistency, both <it>P</it>(<it>C</it>) and <it>P</it>(<it>C|R</it>) are calculated over all known human sites in our dataset (1,162 sites) in all examined human upstream bases (513 genes &#215; 5,000 bp = 2.565 megabases), regardless of the species we compare.</p>
            <p>Table <tblr tid="T3">3</tblr> shows the BRPR values for all comparisons between human and two other species. Interestingly, the highest <it>BRPR </it>value in three species comparisons is achieved when human sequences are compared with both opossum and chicken (BRPR 7.26). However, only 92 of the 1,162 known human TFBSs (7.9%) may be found via this strategy. Table <tblr tid="T3">3</tblr> also shows that requiring a base to be conserved with both mouse and opossum is more effective than using either genome alone, and 31.7% of known human TFBSs may be detected in this way. The results of all tests (256 combinations) are provided in Additional data file 1. The combination with the overall highest BRPR value was human with chimp, mouse, opossum, and chicken (BRPR 7.628). We note that this maximum BRPR score places a cap on the possible value of <it>P</it>(<it>R</it>). In the unlikely event that all human-chimp-mouse-opossum-chicken conserved bases are part of TFBSs (that is, assuming <it>P</it>(<it>R|C</it>) = 1), then the maximum value of <it>P</it>(<it>R</it>) from Equation 1 (see materials and methods, below) is (7.628)<sup>-1</sup>. If we extrapolate, then we find that a maximum of 655 bp may be regulatory in the average human 5 kb upstream region. Taking the average size of a TFBS in the JASPAR database <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> of high-quality binding sites (10.658 bp) suggests that no more than 61.5 nonoverlapping TFBSs are present in the average 5 kb upstream region. This maximum value is in agreement with previous reports that estimate this number to be between 10 and 50 sites, depending on the promoter <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. The addition of six more (as yet unpublished) vertebrate species in this analysis did not yield a combination of genomes with a higher BRPR than the human-chimp-mouse-opossum-chicken combination (data not shown).</p>
            <tbl id="T3" hint_layout="double">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Three-way comparisons between human and two other vertebrate species</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c ca="left">
                        <p>Human versus</p>
                     </c>
                     <c ca="left">
                        <p>Chimp</p>
                     </c>
                     <c ca="left">
                        <p>Mouse</p>
                     </c>
                     <c ca="left">
                        <p>Rat</p>
                     </c>
                     <c ca="left">
                        <p>Dog</p>
                     </c>
                     <c ca="left">
                        <p>Opossum</p>
                     </c>
                     <c ca="left">
                        <p>Chicken</p>
                     </c>
                     <c ca="left">
                        <p>Fugu</p>
                     </c>
                     <c ca="left">
                        <p>Tetraodon</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chimp</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>67.90%</p>
                     </c>
                     <c ca="left">
                        <p>62.48%</p>
                     </c>
                     <c ca="left">
                        <p>70.65%</p>
                     </c>
                     <c ca="left">
                        <p>31.67%</p>
                     </c>
                     <c ca="left">
                        <p>8.26%</p>
                     </c>
                     <c ca="left">
                        <p>2.75%</p>
                     </c>
                     <c ca="left">
                        <p>3.53%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mouse</p>
                     </c>
                     <c ca="left">
                        <p>2.896</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>61.10%</p>
                     </c>
                     <c ca="left">
                        <p>59.29%</p>
                     </c>
                     <c ca="left">
                        <p>31.67%</p>
                     </c>
                     <c ca="left">
                        <p>8.35%</p>
                     </c>
                     <c ca="left">
                        <p>2.93%</p>
                     </c>
                     <c ca="left">
                        <p>3.79%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Rat</p>
                     </c>
                     <c ca="left">
                        <p>2.794</p>
                     </c>
                     <c ca="left">
                        <p>3.277</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>54.22%</p>
                     </c>
                     <c ca="left">
                        <p>29.43%</p>
                     </c>
                     <c ca="left">
                        <p>8.00%</p>
                     </c>
                     <c ca="left">
                        <p>2.58%</p>
                     </c>
                     <c ca="left">
                        <p>3.44%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dog</p>
                     </c>
                     <c ca="left">
                        <p>1.561</p>
                     </c>
                     <c ca="left">
                        <p>3.070</p>
                     </c>
                     <c ca="left">
                        <p>2.940</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>27.54%</p>
                     </c>
                     <c ca="left">
                        <p>6.88%</p>
                     </c>
                     <c ca="left">
                        <p>2.93%</p>
                     </c>
                     <c ca="left">
                        <p>3.79%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Opossum</p>
                     </c>
                     <c ca="left">
                        <p>5.845</p>
                     </c>
                     <c ca="left">
                        <p>6.430</p>
                     </c>
                     <c ca="left">
                        <p>6.247</p>
                     </c>
                     <c ca="left">
                        <p>5.565</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>7.92%</p>
                     </c>
                     <c ca="left">
                        <p>2.75%</p>
                     </c>
                     <c ca="left">
                        <p>3.70%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chicken</p>
                     </c>
                     <c ca="left">
                        <p>5.864</p>
                     </c>
                     <c ca="left">
                        <p>6.939</p>
                     </c>
                     <c ca="left">
                        <p>6.875</p>
                     </c>
                     <c ca="left">
                        <p>5.891</p>
                     </c>
                     <c ca="left">
                        <p>7.262*</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>1.29%</p>
                     </c>
                     <c ca="left">
                        <p>1.20%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fugu</p>
                     </c>
                     <c ca="left">
                        <p>2.625</p>
                     </c>
                     <c ca="left">
                        <p>3.409</p>
                     </c>
                     <c ca="left">
                        <p>3.207</p>
                     </c>
                     <c ca="left">
                        <p>3.457</p>
                     </c>
                     <c ca="left">
                        <p>3.604</p>
                     </c>
                     <c ca="left">
                        <p>2.891</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>2.67%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tetraodon</p>
                     </c>
                     <c ca="left">
                        <p>3.195</p>
                     </c>
                     <c ca="left">
                        <p>4.103</p>
                     </c>
                     <c ca="left">
                        <p>3.951</p>
                     </c>
                     <c ca="left">
                        <p>4.165</p>
                     </c>
                     <c ca="left">
                        <p>4.620</p>
                     </c>
                     <c ca="left">
                        <p>2.775</p>
                     </c>
                     <c ca="left">
                        <p>3.468</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Base regulatory potential rate (BRPR) for bases conserved between human and two other species is shown below the diagonal. The rates of transcription factor binding sites detected in blocks conserved between human and two other species are shown above the diagonal. *Highest BRPR value for these 3-species comparisons.</p>
               </tblfn>
            </tbl>
            <p>Most phylogenetic footprinting approaches use evolutionary conservation in order to reduce the search space to the parts of the promoters that are more likely to contain functional <it>cis</it>-regulatory elements (for example, see the reports by Sandelin and coworkers <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and Loots and Ovcharenko <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>). As combinations of more than two genomes are considered, the search space (the jointly conserved region) is reduced. At the same time, the number of sites located within these conserved regions is reduced as well, although at a slower rate. One might then ask, for a given percentage of detectable sites (maximum site sensitivity), which is the combination that minimizes the search space (thereby maximizing specificity)? We found that BRPR scores can be used to address this question. BRPR scores are reversely proportional to <it>P</it>(<it>C</it>), which is the <it>a priori </it>conservation probability (Equation 1; see Materials and methods, below). Thus, the lower the BRPR score, the larger the conserved region and the greater the chance that false-positive TFBS predictions will be made. Therefore, for a given percentage of detectable sites, one wishes to choose the combination of genomes with high BRPR values.</p>
            <p>We ranked each of the 1,162 tested human TFBSs according to the highest BRPR value from the combinations of genomes that could detect the given site. From this ranking of sites, it may be seen that some subsets of highly conserved TFBSs may be detected at much higher BRPR thresholds than those sites that are conserved only with closely related species. The proportion of TFBSs that may be detected for a given BRPR threshold is plotted in Figure <figr fid="F4">4</figr> (blue line). This figure shows, for example, that in order to guarantee detection of 75% or more of the known TFBSs, one should choose a combination of genomes with BRPR value of 1.7 or less. Naturally, these will be closely related species. By contrast, the combination of genomes with the overall maximum BRPR score (human-chimp-mouse-opossum-chicken, BRPR 7.628) includes only about 7.7% of the known TFBSs in its conserved regions, whereas the lowest possible BRPR score (human-chimp, BRPR 1.009) includes about 98%. BRPR values may be more appropriate than evolutionary distance for the purposes of weighting contributions when aiming to discover constrained regulatory sequences in multispecies alignments. We therefore suggest that when it comes to regulatory regions, the BRPR score may be more useful that the 'conservation scores' currently employed in phastCons <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> or MCS <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> approaches.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Association between BRPR scores and detectable sites</p>
               </caption>
               <text>
                  <p>Association between BRPR scores and detectable sites. For each given percent of detectable transcription factor binding sites (TFBSs), the combination of aligned genomes with the highest base regulatory potential rate (BRPR) value will yield the smaller conserved region (for phylogenetic footprinting algorithm searches). The full list of genome combinations and their BRPR values are given in Additional data file 1. The blue line presents the association between percentage of human TFBSs located in conserved regions in a combination of genomes with this BRPR value among all possible genome combinations in this study (see text for detailed description). The grey line plot is similar after the opossum genome is omitted (see text). BRPR, base regulatory potential rate.</p>
               </text>
               <graphic file="gb-2007-8-5-r84-4"/>
            </fig>
            <p>Figure <figr fid="F4">4</figr> also shows the importance of including the opossum genome in the comparisons. The grey line displays the same graph, but excluding the opossum genome from the plotted combinations. Without including the opossum genome, the BRPR threshold must be reduced to 3.5 before 20% of the known TFBSs may be found in the conserved regions. However, with the opossum included, the BRPR threshold for the same search may be increased to 6.5, indicating analogous reduction in the search space. Figure <figr fid="F4">4</figr> shows that opossum's greatest contribution in terms of phylogenetic footprinting efficiency is for the sensitivity values in the range of 10% to 33%, although smaller improvements are observed in the 55% to 65% range. The 'blocky' nature of the plot is attributable to the subsets of known TFBSs that are detectable in each of the eight species. As more distant mammalian genomes are sequenced, this plot may smooth out to give higher <it>P</it>(<it>R|C</it>) scores to more of the known TFBSs.</p>
            <p>Our preliminary results including unpublished genomes show that more sites may be predicted with increased BRPR thresholds. Only 20 human sites (1.72% of known TFBSs) are not detected by any combinatorial approach, suggesting that only a small minority of human TFBSs may not be conserved in any other species. It should also be noted that without the chimp genome, a maximum of 86.5% of the sites can be identified as conserved, suggesting that only 13.5% of known human TFBSs may be conserved only among primates. This is an interesting finding, because it establishes 86.5% as an upper limit to the proportion of TFBSs that may be found using traditional phylogenetic footprinting techniques with mouse or more distantly related species. If complete detection of all functional human TFBSs is required, then the phylogenetic shadowing technique for comparing closely related species, proposed by Boffelli and colleagues <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>, may be more effective than traditional phylogenetic footprinting for primate-specific TFBSs. However, as suggested by those authors, at least six primate genome sequences other than human will be required before phylogenetic shadowing will become effective <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Another interesting approach is presented in the recent report by Donaldson and G&#246;ttgens <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, which used the mouse genome as an outgroup compared with human and chimpanzee promoters in order to discover regulatory motifs that are conserved in one but not the other <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Exploring dependencies between transcription factor binding site nucleotide conservation and the associated transcription factors</p>
            </st>
            <p>As noted above, the nucleotide conservation within the human TFBSs (as compared with other vertebrates) is higher than the percentage identity in the conserved blocks where they reside (Table <tblr tid="T2">2</tblr>). This is expected because the regulatory nucleotides may be under stronger evolutionary pressure. Similarly, one would expect that high information content positions (the most conserved positions of the motif) are critical for the binding and thus would also be most conserved across species. This assumption does not take into consideration possible differences in the binding protein residues between species, but it has been shown to be correct for individual yeast and fruit fly transcription factors <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. However, this dependence appears to become weaker when average conservation data are calculated over positions from different vertebrate transcription factors.</p>
            <p>From the transcription factors included in our dataset, 80 have a position-specific scoring matrix (PSSM) binding model in JASPAR <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> or our manually curated set of mammalian motifs <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B46">46</abbr></abbrgrp>. These transcription factors are associated with 544 sites in our dataset. The PSSM model of the corresponding transcription factor was used to scan each of its sites from our dataset (see Materials and methods, below). Sometimes the recorded sites extend beyond the length of the PSSM model, reflecting the biochemical method used to discover these sites (for example, DNA footprinting). The highest scoring (sub)sequence was considered to be the correct target site (TFBS), and conservation of each of its nucleotides was calculated for the species in which the site was conserved. The results are plotted in Figure <figr fid="F5">5</figr>, sorted by information content of the corresponding PSSM columns. A weak but definite trend is present in the nonprimate genomes, although even transcription factor motif positions with zero information content (typically assumed to be under no selective pressure) are conserved at a higher rate than the wider conserved blocks. This finding suggests that natural selection operates almost equally strongly across the TFBS positions, regardless of the perceived role of the nucleotide in protein-DNA interactions. One possible explanation for the observed trends is that some motif positions with lower information content may play an indirect role in DNA binding, perhaps by facilitating DNA conformation or by some other mechanism (for instance, Burden and Weng <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> demonstrated conserved DNA structural features at degenerate TFBS locations).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Cross-species conservation of individual TFBS positions versus their information content</p>
               </caption>
               <text>
                  <p>Cross-species conservation of individual TFBS positions versus their information content. Conservation is measured between the human and each of the other species. Information content is measured according to the human position-specific score matrix (PSSM) model.</p>
               </text>
               <graphic file="gb-2007-8-5-r84-5"/>
            </fig>
            <p>As noted by Sauer and coworkers <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, for human-rodent comparisons certain transcription factors are more likely to have their TFBSs conserved across species than others. We test this finding outside eutherians by examining conservation rates of TFBSs for those factors for which at least seven instances are detectable in the corresponding comparisons. The findings for human-mouse and human-opossum comparisons are presented in Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr>, and similar comparisons between human and other species are available in Additional data file 1.</p>
            <tbl id="T4" hint_layout="double">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Human-mouse TFBS conservation dependency on transcription factor identity</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Factor</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Motif</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Human versus mouse</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>IC</p>
                     </c>
                     <c ca="left">
                        <p>Length</p>
                     </c>
                     <c ca="left">
                        <p>Detectable</p>
                     </c>
                     <c ca="left">
                        <p>% conserved</p>
                     </c>
                     <c ca="left">
                        <p><it>p </it>value</p>
                     </c>
                     <c ca="left">
                        <p>Over/under</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HMG</p>
                     </c>
                     <c ca="left">
                        <p>8.43</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>100.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.1029</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CREB</p>
                     </c>
                     <c ca="left">
                        <p>11.52</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>94.12%</p>
                     </c>
                     <c ca="left">
                        <p>0.0257</p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>c-Myb</p>
                     </c>
                     <c ca="left">
                        <p>14.15</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>90.91%</p>
                     </c>
                     <c ca="left">
                        <p>0.1186</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NF-AT1</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>90.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.1494</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IPF1</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>88.89%</p>
                     </c>
                     <c ca="left">
                        <p>0.1862</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>p50</p>
                     </c>
                     <c ca="left">
                        <p>15.63</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>87.50%</p>
                     </c>
                     <c ca="left">
                        <p>0.2292</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NF-&#954;B</p>
                     </c>
                     <c ca="left">
                        <p>13.34</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>85.71%</p>
                     </c>
                     <c ca="left">
                        <p>0.1425</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AhR</p>
                     </c>
                     <c ca="left">
                        <p>8.62</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>85.71%</p>
                     </c>
                     <c ca="left">
                        <p>0.2775</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GR</p>
                     </c>
                     <c ca="left">
                        <p>7.06</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>85.71%</p>
                     </c>
                     <c ca="left">
                        <p>0.2775</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>E2F-1</p>
                     </c>
                     <c ca="left">
                        <p>10.17</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>83.33%</p>
                     </c>
                     <c ca="left">
                        <p>0.1982</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AP-1</p>
                     </c>
                     <c ca="left">
                        <p>9.44</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>82.35%</p>
                     </c>
                     <c ca="left">
                        <p>0.0686</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HIF-1</p>
                     </c>
                     <c ca="left">
                        <p>11.00</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>81.82%</p>
                     </c>
                     <c ca="left">
                        <p>0.2286</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MITF</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>81.82%</p>
                     </c>
                     <c ca="left">
                        <p>0.2286</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ATF-2</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>77.78%</p>
                     </c>
                     <c ca="left">
                        <p>0.2864</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>USF1</p>
                     </c>
                     <c ca="left">
                        <p>10.37</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>77.78%</p>
                     </c>
                     <c ca="left">
                        <p>0.2864</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>C/EBP&#945;</p>
                     </c>
                     <c ca="left">
                        <p>11.12</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>77.27%</p>
                     </c>
                     <c ca="left">
                        <p>0.1745</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>p53</p>
                     </c>
                     <c ca="left">
                        <p>25.74</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>72.73%</p>
                     </c>
                     <c ca="left">
                        <p>0.1897</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>E2F</p>
                     </c>
                     <c ca="left">
                        <p>13.84</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>72.73%</p>
                     </c>
                     <c ca="left">
                        <p>0.2631</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>c-Ets-1</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>71.43%</p>
                     </c>
                     <c ca="left">
                        <p>0.3193</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HNF-1&#945;</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>71.43%</p>
                     </c>
                     <c ca="left">
                        <p>0.3193</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Egr-1</p>
                     </c>
                     <c ca="left">
                        <p>13.12</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>66.67%</p>
                     </c>
                     <c ca="left">
                        <p>0.2184</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>POU1F1a</p>
                     </c>
                     <c ca="left">
                        <p>7.57</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>66.67%</p>
                     </c>
                     <c ca="left">
                        <p>0.2184</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sp1</p>
                     </c>
                     <c ca="left">
                        <p>9.22</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>115</p>
                     </c>
                     <c ca="left">
                        <p>66.09%</p>
                     </c>
                     <c ca="left">
                        <p>0.0250</p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HNF-1&#945;-A</p>
                     </c>
                     <c ca="left">
                        <p>13.66</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>63.64%</p>
                     </c>
                     <c ca="left">
                        <p>0.2010</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GATA-1</p>
                     </c>
                     <c ca="left">
                        <p>5.57</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>57.14%</p>
                     </c>
                     <c ca="left">
                        <p>0.1007</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TCF-4</p>
                     </c>
                     <c ca="left">
                        <p>12.54</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>57.14%</p>
                     </c>
                     <c ca="left">
                        <p>0.2032</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>EBF</p>
                     </c>
                     <c ca="left">
                        <p>21.10</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>50.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.1120</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AP-2&#945;A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>47.83%</p>
                     </c>
                     <c ca="left">
                        <p>0.0073</p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ER-&#945;</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>45.45%</p>
                     </c>
                     <c ca="left">
                        <p>0.0405</p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Crx</p>
                     </c>
                     <c ca="left">
                        <p>11.60</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>42.86%</p>
                     </c>
                     <c ca="left">
                        <p>0.0772</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Gfi1</p>
                     </c>
                     <c ca="left">
                        <p>7.60</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>35.29%</p>
                     </c>
                     <c ca="left">
                        <p>0.0012</p>
                     </c>
                     <c ca="left">
                        <p>Under*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AR</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>14.29%</p>
                     </c>
                     <c ca="left">
                        <p>0.0022</p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Factors with more than seven sites detectable between the two species are shown. The <it>p </it>values given pertain to the observed percentage of conserved sites, and were determined using the Fisher's exact test. Over/under, specifies over-conservation or under-conservation of the sites of the corresponding transcription factor (by Fisher's exact test) at the 5% significance level; *Significant under-representation after <it>p </it>value correction (using Bonferroni). Detectable, total number of human transcription factor binding sites located in promoters of mouse orthologous genes; % conserved, percentage of detectable sites that are in conserved regions; IC, information content (total); Length, length of the motif; N/A, there is no available position-specific score matrix model for this transcription factor; TFBS, transcription factor binding site.</p>
               </tblfn>
            </tbl>
            <tbl id="T5" hint_layout="double">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Human-opossum TFBS conservation dependency on transcription factor identity</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Factor</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Motif</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Human versus opossum</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>IC</p>
                     </c>
                     <c ca="left">
                        <p>Length</p>
                     </c>
                     <c ca="left">
                        <p>Detectable</p>
                     </c>
                     <c ca="left">
                        <p>% conserved</p>
                     </c>
                     <c ca="left">
                        <p><it>p </it>value</p>
                     </c>
                     <c ca="left">
                        <p>Over/under</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HMG</p>
                     </c>
                     <c ca="left">
                        <p>8.43</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>100.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.0020</p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>p50</p>
                     </c>
                     <c ca="left">
                        <p>15.63</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>75.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.0470</p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MITF</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>70.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.0487</p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CREB</p>
                     </c>
                     <c ca="left">
                        <p>11.52</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>69.23%</p>
                     </c>
                     <c ca="left">
                        <p>0.0287</p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>E2F-1</p>
                     </c>
                     <c ca="left">
                        <p>10.17</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>60.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.1228</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GR</p>
                     </c>
                     <c ca="left">
                        <p>7.06</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>57.14%</p>
                     </c>
                     <c ca="left">
                        <p>0.2056</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HNF-1&#945;</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>57.14%</p>
                     </c>
                     <c ca="left">
                        <p>0.2056</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>POU1F1a</p>
                     </c>
                     <c ca="left">
                        <p>7.57</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>55.56%</p>
                     </c>
                     <c ca="left">
                        <p>0.1794</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>E2F</p>
                     </c>
                     <c ca="left">
                        <p>13.84</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>54.55%</p>
                     </c>
                     <c ca="left">
                        <p>0.1594</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AP-1</p>
                     </c>
                     <c ca="left">
                        <p>9.44</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>50.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.1112</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ATF-2</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>50.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.2422</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>USF1</p>
                     </c>
                     <c ca="left">
                        <p>10.37</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>50.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.2422</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IPF1</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>44.44%</p>
                     </c>
                     <c ca="left">
                        <p>0.2565</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HIF-1</p>
                     </c>
                     <c ca="left">
                        <p>11.00</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>42.86%</p>
                     </c>
                     <c ca="left">
                        <p>0.2938</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>p53</p>
                     </c>
                     <c ca="left">
                        <p>25.74</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>37.50%</p>
                     </c>
                     <c ca="left">
                        <p>0.1949</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HNF-1&#945;-A</p>
                     </c>
                     <c ca="left">
                        <p>13.66</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>37.50%</p>
                     </c>
                     <c ca="left">
                        <p>0.2763</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NF-&#954;B</p>
                     </c>
                     <c ca="left">
                        <p>13.34</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>36.36%</p>
                     </c>
                     <c ca="left">
                        <p>0.2321</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sp1</p>
                     </c>
                     <c ca="left">
                        <p>9.22</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>86</p>
                     </c>
                     <c ca="left">
                        <p>29.07%</p>
                     </c>
                     <c ca="left">
                        <p>0.0049</p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AP-2&#945;A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>26.09%</p>
                     </c>
                     <c ca="left">
                        <p>0.0581</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>C/EBP&#945;</p>
                     </c>
                     <c ca="left">
                        <p>11.12</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>25.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.0886</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Egr-1</p>
                     </c>
                     <c ca="left">
                        <p>13.12</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>25.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.1961</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>c-Myb</p>
                     </c>
                     <c ca="left">
                        <p>14.15</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>18.18%</p>
                     </c>
                     <c ca="left">
                        <p>0.0775</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ER-&#945;</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>N/A</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>11.11%</p>
                     </c>
                     <c ca="left">
                        <p>0.0521</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GATA-1</p>
                     </c>
                     <c ca="left">
                        <p>5.57</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>11.11%</p>
                     </c>
                     <c ca="left">
                        <p>0.0521</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Gfi1</p>
                     </c>
                     <c ca="left">
                        <p>7.60</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>0.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.0028</p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AhR</p>
                     </c>
                     <c ca="left">
                        <p>8.62</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>0.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.0238</p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TCF-4</p>
                     </c>
                     <c ca="left">
                        <p>12.54</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>0.00%</p>
                     </c>
                     <c ca="left">
                        <p>0.0238</p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>See Table 5 footnote for details.</p>
               </tblfn>
            </tbl>
            <p>Although some factors' TFBSs are conserved at higher than expected (for example, CREB) or lower than expected (for example, Gfi1, AR and Sp1) rates in human-mouse comparisons, only the sites of Gfi1 are (under)conserved after the Bonferroni correction (see Materials and methods, below). Similarly, the sites of various factors are over-conserved (for example, HMG and CREB, among others) and under-conserved (for example, Gfi1 and Sp1, and so on) in human-opossum comparisons, but only the HMG sites remain (over)conserved after the correction (Table <tblr tid="T5">5</tblr>). We found that all detectable HMG sites are conserved in both mouse and opossum, but their small number (seven) made them appear significant only in the human-opossum comparisons. Interestingly, human Sp1 TFBSs are under-conserved in all genomes except rodents (Additional data file 1). This may be explained by the fact that the Sp1 target site (consensus: 'GGcGGG') and related patterns are expected to occur frequently in GC-rich mammalian promoters. As such, random mutations in mammalian promoters have a high probability of producing additional copies of functional sites. With such a potential proliferation of 'backup' Sp1 target sites, an increased Sp1 TFBS turnover rate should not be surprising. Therefore, evolutionary conservation of TFBSs has some dependency on the identity of the bound transcription factor, but no strong conclusions can be drawn at this point because of the limited amount of available data. AP-2&#945; is represented by 23 human sites in our dataset. All genes regulated by these sites have orthologs in both mouse and opossum, and yet its TFBSs are under-conserved in mouse. This is an example in which TFBS conservation does not coincide with the conservation of the downstream genes, which has been observed for developmental genes as well <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
            <p>We found no association between the information content (IC) of the transcription factor motif and the percentage conservation. For example, TCF-4 motif has a relatively high IC value (12.5) and its sites are generally under-conserved in both mouse and opossum, but they are significantly under-conserved only in opossum (Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr>). In contrast, the sites of HMG are all in conserved regions in human-mouse and human-opossum comparisons, yet the HMG motif has an IC value of 8.4.</p>
         </sec>
         <sec>
            <st>
               <p>Exploring transcription factor binding site conservation dependencies on Gene Ontology categories between human and opossum</p>
            </st>
            <p>We also test the possible association between TFBS turnover rates and the functional category of the corresponding regulated genes. Previous studies suggest that the genes with the highest upstream sequence conservation coverage are those involved in transcription and development <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. Table <tblr tid="T6">6</tblr> presents the top 30 most populated GO-slim categories <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> in terms of human-mouse orthologous genes from our 513 protein coding gene dataset. Significance was assessed using the Fisher's exact test, as described in the Materials and methods (below). We found that GO categories 'physiologic process' and 'transporter activity' to be over-represented and under-represented, respectively, in both mouse and opossum, even after the Bonferroni correction. Many other GO categories have over-conserved TFBSs in the promoters of their member genes between human and mouse. Examples include 'transcription', 'development', 'cell-cell signaling', response to various stimuli, among others (Table <tblr tid="T6">6</tblr>). Sauer and coworkers <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> also showed that TFBS conservation in human-rodent comparisons is correlated with the functional category of the downstream regulated gene. Their findings agree with ours in many categories. In particular, there are 34 categories in common for which one (or both) of the studies has found them to be statistically over-represented or under-represented. In 29 of them (85%) the two studies agree with respect to the 'sign' of conservation. The differences observed between the two studies can be attributed to the different set of TFBSs upon which their measurements are based (Sauer and coworkers used sites from mouse and rat in addition to human) and the methods used to assign significance.</p>
            <tbl id="T6" hint_layout="double">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Human-mouse TFBS conservation dependency on the GO category of the downstream regulated gene</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>GO category</p>
                     </c>
                     <c ca="left">
                        <p>Number of genes</p>
                     </c>
                     <c ca="left">
                        <p>Upstream coverage</p>
                     </c>
                     <c ca="left">
                        <p>Detectable TFBSs</p>
                     </c>
                     <c ca="left">
                        <p>% TFBS detected</p>
                     </c>
                     <c ca="left">
                        <p><it>p </it>value</p>
                     </c>
                     <c ca="left">
                        <p>Over/under</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transcription regulator activity</p>
                     </c>
                     <c ca="left">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>37.65%</p>
                     </c>
                     <c ca="left">
                        <p>128</p>
                     </c>
                     <c ca="left">
                        <p>83.59%</p>
                     </c>
                     <c ca="left">
                        <p>6.63 &#215; 10<sup>-4</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell-cell signaling</p>
                     </c>
                     <c ca="left">
                        <p>44</p>
                     </c>
                     <c ca="left">
                        <p>26.00%</p>
                     </c>
                     <c ca="left">
                        <p>141</p>
                     </c>
                     <c ca="left">
                        <p>82.27%</p>
                     </c>
                     <c ca="left">
                        <p>1.27 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Development</p>
                     </c>
                     <c ca="left">
                        <p>55</p>
                     </c>
                     <c ca="left">
                        <p>35.19%</p>
                     </c>
                     <c ca="left">
                        <p>157</p>
                     </c>
                     <c ca="left">
                        <p>81.53%</p>
                     </c>
                     <c ca="left">
                        <p>1.41 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nucleotide binding</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                     <c ca="left">
                        <p>23.31%</p>
                     </c>
                     <c ca="left">
                        <p>137</p>
                     </c>
                     <c ca="left">
                        <p>79.56%</p>
                     </c>
                     <c ca="left">
                        <p>1.04 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response to biotic stimulus</p>
                     </c>
                     <c ca="left">
                        <p>81</p>
                     </c>
                     <c ca="left">
                        <p>22.67%</p>
                     </c>
                     <c ca="left">
                        <p>273</p>
                     </c>
                     <c ca="left">
                        <p>79.49%</p>
                     </c>
                     <c ca="left">
                        <p>5.62 &#215; 10<sup>-4</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response to external stimulus</p>
                     </c>
                     <c ca="left">
                        <p>65</p>
                     </c>
                     <c ca="left">
                        <p>23.49%</p>
                     </c>
                     <c ca="left">
                        <p>209</p>
                     </c>
                     <c ca="left">
                        <p>79.43%</p>
                     </c>
                     <c ca="left">
                        <p>2.56 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response to stress</p>
                     </c>
                     <c ca="left">
                        <p>91</p>
                     </c>
                     <c ca="left">
                        <p>23.78%</p>
                     </c>
                     <c ca="left">
                        <p>316</p>
                     </c>
                     <c ca="left">
                        <p>79.11%</p>
                     </c>
                     <c ca="left">
                        <p>3.50 &#215; 10<sup>-4</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Physiologic process</p>
                     </c>
                     <c ca="left">
                        <p>154</p>
                     </c>
                     <c ca="left">
                        <p>23.59%</p>
                     </c>
                     <c ca="left">
                        <p>526</p>
                     </c>
                     <c ca="left">
                        <p>78.90%</p>
                     </c>
                     <c ca="left">
                        <p>1.37 &#215; 10<sup>-6</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell proliferation</p>
                     </c>
                     <c ca="left">
                        <p>53</p>
                     </c>
                     <c ca="left">
                        <p>29.13%</p>
                     </c>
                     <c ca="left">
                        <p>209</p>
                     </c>
                     <c ca="left">
                        <p>78.47%</p>
                     </c>
                     <c ca="left">
                        <p>6.00 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Receptor binding</p>
                     </c>
                     <c ca="left">
                        <p>65</p>
                     </c>
                     <c ca="left">
                        <p>24.36%</p>
                     </c>
                     <c ca="left">
                        <p>246</p>
                     </c>
                     <c ca="left">
                        <p>77.24%</p>
                     </c>
                     <c ca="left">
                        <p>9.74 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Receptor activity</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                     <c ca="left">
                        <p>24.55%</p>
                     </c>
                     <c ca="left">
                        <p>114</p>
                     </c>
                     <c ca="left">
                        <p>77.19%</p>
                     </c>
                     <c ca="left">
                        <p>4.29 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mitochondrion organization and biogenesis</p>
                     </c>
                     <c ca="left">
                        <p>100</p>
                     </c>
                     <c ca="left">
                        <p>25.26%</p>
                     </c>
                     <c ca="left">
                        <p>266</p>
                     </c>
                     <c ca="left">
                        <p>77.07%</p>
                     </c>
                     <c ca="left">
                        <p>8.93 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transcription</p>
                     </c>
                     <c ca="left">
                        <p>67</p>
                     </c>
                     <c ca="left">
                        <p>35.72%</p>
                     </c>
                     <c ca="left">
                        <p>223</p>
                     </c>
                     <c ca="left">
                        <p>76.68%</p>
                     </c>
                     <c ca="left">
                        <p>1.82 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Extracellular region</p>
                     </c>
                     <c ca="left">
                        <p>56</p>
                     </c>
                     <c ca="left">
                        <p>21.66%</p>
                     </c>
                     <c ca="left">
                        <p>217</p>
                     </c>
                     <c ca="left">
                        <p>76.04%</p>
                     </c>
                     <c ca="left">
                        <p>2.73 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein binding</p>
                     </c>
                     <c ca="left">
                        <p>142</p>
                     </c>
                     <c ca="left">
                        <p>26.43%</p>
                     </c>
                     <c ca="left">
                        <p>464</p>
                     </c>
                     <c ca="left">
                        <p>75.86%</p>
                     </c>
                     <c ca="left">
                        <p>4.75 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Extracellular space</p>
                     </c>
                     <c ca="left">
                        <p>54</p>
                     </c>
                     <c ca="left">
                        <p>23.08%</p>
                     </c>
                     <c ca="left">
                        <p>232</p>
                     </c>
                     <c ca="left">
                        <p>75.86%</p>
                     </c>
                     <c ca="left">
                        <p>2.70 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Regulation of biologic process</p>
                     </c>
                     <c ca="left">
                        <p>155</p>
                     </c>
                     <c ca="left">
                        <p>29.96%</p>
                     </c>
                     <c ca="left">
                        <p>562</p>
                     </c>
                     <c ca="left">
                        <p>75.27%</p>
                     </c>
                     <c ca="left">
                        <p>4.97 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cytoplasm</p>
                     </c>
                     <c ca="left">
                        <p>45</p>
                     </c>
                     <c ca="left">
                        <p>22.87%</p>
                     </c>
                     <c ca="left">
                        <p>136</p>
                     </c>
                     <c ca="left">
                        <p>74.26%</p>
                     </c>
                     <c ca="left">
                        <p>7.17 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Plasma membrane</p>
                     </c>
                     <c ca="left">
                        <p>57</p>
                     </c>
                     <c ca="left">
                        <p>20.12%</p>
                     </c>
                     <c ca="left">
                        <p>143</p>
                     </c>
                     <c ca="left">
                        <p>74.13%</p>
                     </c>
                     <c ca="left">
                        <p>7.10 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transcription factor activity</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                     <c ca="left">
                        <p>36.92%</p>
                     </c>
                     <c ca="left">
                        <p>137</p>
                     </c>
                     <c ca="left">
                        <p>73.72%</p>
                     </c>
                     <c ca="left">
                        <p>7.62 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nucleus</p>
                     </c>
                     <c ca="left">
                        <p>92</p>
                     </c>
                     <c ca="left">
                        <p>31.28%</p>
                     </c>
                     <c ca="left">
                        <p>332</p>
                     </c>
                     <c ca="left">
                        <p>73.49%</p>
                     </c>
                     <c ca="left">
                        <p>5.00 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell death</p>
                     </c>
                     <c ca="left">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>21.97%</p>
                     </c>
                     <c ca="left">
                        <p>189</p>
                     </c>
                     <c ca="left">
                        <p>73.02%</p>
                     </c>
                     <c ca="left">
                        <p>6.95 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein metabolism</p>
                     </c>
                     <c ca="left">
                        <p>49</p>
                     </c>
                     <c ca="left">
                        <p>19.65%</p>
                     </c>
                     <c ca="left">
                        <p>147</p>
                     </c>
                     <c ca="left">
                        <p>72.79%</p>
                     </c>
                     <c ca="left">
                        <p>7.83 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Biologic process</p>
                     </c>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>21.69%</p>
                     </c>
                     <c ca="left">
                        <p>100</p>
                     </c>
                     <c ca="left">
                        <p>72.00%</p>
                     </c>
                     <c ca="left">
                        <p>9.24 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>116</p>
                     </c>
                     <c ca="left">
                        <p>23.96%</p>
                     </c>
                     <c ca="left">
                        <p>398</p>
                     </c>
                     <c ca="left">
                        <p>71.86%</p>
                     </c>
                     <c ca="left">
                        <p>5.33 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell cycle</p>
                     </c>
                     <c ca="left">
                        <p>41</p>
                     </c>
                     <c ca="left">
                        <p>28.45%</p>
                     </c>
                     <c ca="left">
                        <p>182</p>
                     </c>
                     <c ca="left">
                        <p>70.88%</p>
                     </c>
                     <c ca="left">
                        <p>6.34 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell</p>
                     </c>
                     <c ca="left">
                        <p>118</p>
                     </c>
                     <c ca="left">
                        <p>21.23%</p>
                     </c>
                     <c ca="left">
                        <p>351</p>
                     </c>
                     <c ca="left">
                        <p>69.23%</p>
                     </c>
                     <c ca="left">
                        <p>1.68 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>90</p>
                     </c>
                     <c ca="left">
                        <p>24.17%</p>
                     </c>
                     <c ca="left">
                        <p>297</p>
                     </c>
                     <c ca="left">
                        <p>68.69%</p>
                     </c>
                     <c ca="left">
                        <p>1.58 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transport</p>
                     </c>
                     <c ca="left">
                        <p>39</p>
                     </c>
                     <c ca="left">
                        <p>24.11%</p>
                     </c>
                     <c ca="left">
                        <p>146</p>
                     </c>
                     <c ca="left">
                        <p>67.81%</p>
                     </c>
                     <c ca="left">
                        <p>3.30 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Catalytic activity</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>19.68%</p>
                     </c>
                     <c ca="left">
                        <p>99</p>
                     </c>
                     <c ca="left">
                        <p>61.62%</p>
                     </c>
                     <c ca="left">
                        <p>4.63 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transporter activity</p>
                     </c>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>25.00%</p>
                     </c>
                     <c ca="left">
                        <p>123</p>
                     </c>
                     <c ca="left">
                        <p>60.98%</p>
                     </c>
                     <c ca="left">
                        <p>1.20 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under*</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The top 31 Gene Ontology (GO) categories in terms of gene numbers in the dataset are shown. The <it>p </it>values given represent the significance (uncorrected) of the observed percentage of conserved (detected) sites, as determined using the Fisher's exact test. Over/under, specifies over-conservation or under-conservation of the sites of the corresponding GO category (by Fisher's exact test) at the 5% significance level. *Statistical over-representation or under-representation after <it>p </it>value correction (using Bonferroni). TFBS, transcription factor binding site.</p>
               </tblfn>
            </tbl>
            <p>We extend this study in opossum (Table <tblr tid="T7">7</tblr>) and the other vertebrate genomes (Additional data file 1). Most of the over-conserved categories between human and mouse are also over-conserved in human-opossum comparisons (Fisher's exact test; see Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr>). These include 'cell-cell signaling' and response to stress and biotic stimuli. On the other hand, the TFBS conservation rate for the 'protein binding' went from being over-conserved in human-mouse comparisons (76% TFBS conservation) to under-conserved in human-opossum comparisons (36% TFBS conservation). This is one of the highly populated categories, and its members are involved in almost every cellular process, for instance signal transduction, chromatin structure, transcription, translation, cell cytoskeleton, and so on. It is therefore difficult to assess the significance of this change in TFBS conservation related to this category. One thing is for sure; the observed differences are not an artifact caused by the low number of TFBSs. This category is represented by 142 genes associated with 464 TFBSs in mouse and 122 genes associated with 419 TFBSs in opossum, making it one of the best represented categories in our dataset.</p>
            <tbl id="T7" hint_layout="double">
               <title>
                  <p>Table 7</p>
               </title>
               <caption>
                  <p>Human-opossum TFBS conservation dependency on the GO category of the downstream regulated gene</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>GO category</p>
                     </c>
                     <c ca="left">
                        <p>Number of genes</p>
                     </c>
                     <c ca="left">
                        <p>Upstream Coverage</p>
                     </c>
                     <c ca="left">
                        <p>Detectable TFBSs</p>
                     </c>
                     <c ca="left">
                        <p>% TFBS Detected</p>
                     </c>
                     <c ca="left">
                        <p><it>p </it>value</p>
                     </c>
                     <c ca="left">
                        <p>Over/under</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Receptor binding</p>
                     </c>
                     <c ca="left">
                        <p>51</p>
                     </c>
                     <c ca="left">
                        <p>6.49%</p>
                     </c>
                     <c ca="left">
                        <p>180</p>
                     </c>
                     <c ca="left">
                        <p>55.56%</p>
                     </c>
                     <c ca="left">
                        <p>5.80 &#215; 10<sup>-6</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell-cell signaling</p>
                     </c>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>6.37%</p>
                     </c>
                     <c ca="left">
                        <p>120</p>
                     </c>
                     <c ca="left">
                        <p>51.67%</p>
                     </c>
                     <c ca="left">
                        <p>3.67 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Physiologic process</p>
                     </c>
                     <c ca="left">
                        <p>122</p>
                     </c>
                     <c ca="left">
                        <p>5.63%</p>
                     </c>
                     <c ca="left">
                        <p>415</p>
                     </c>
                     <c ca="left">
                        <p>49.40%</p>
                     </c>
                     <c ca="left">
                        <p>1.51 &#215; 10<sup>-6</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response to external stimulus</p>
                     </c>
                     <c ca="left">
                        <p>54</p>
                     </c>
                     <c ca="left">
                        <p>5.60%</p>
                     </c>
                     <c ca="left">
                        <p>168</p>
                     </c>
                     <c ca="left">
                        <p>48.81%</p>
                     </c>
                     <c ca="left">
                        <p>6.12 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transcription regulator activity</p>
                     </c>
                     <c ca="left">
                        <p>32</p>
                     </c>
                     <c ca="left">
                        <p>10.15%</p>
                     </c>
                     <c ca="left">
                        <p>122</p>
                     </c>
                     <c ca="left">
                        <p>47.54%</p>
                     </c>
                     <c ca="left">
                        <p>2.47 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Extracellular space</p>
                     </c>
                     <c ca="left">
                        <p>43</p>
                     </c>
                     <c ca="left">
                        <p>4.08%</p>
                     </c>
                     <c ca="left">
                        <p>175</p>
                     </c>
                     <c ca="left">
                        <p>47.43%</p>
                     </c>
                     <c ca="left">
                        <p>1.23 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response to biotic stimulus</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                     <c ca="left">
                        <p>5.29%</p>
                     </c>
                     <c ca="left">
                        <p>209</p>
                     </c>
                     <c ca="left">
                        <p>47.37%</p>
                     </c>
                     <c ca="left">
                        <p>7.82 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transcription</p>
                     </c>
                     <c ca="left">
                        <p>61</p>
                     </c>
                     <c ca="left">
                        <p>10.52%</p>
                     </c>
                     <c ca="left">
                        <p>208</p>
                     </c>
                     <c ca="left">
                        <p>45.67%</p>
                     </c>
                     <c ca="left">
                        <p>2.13 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transcription factor activity</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>9.80%</p>
                     </c>
                     <c ca="left">
                        <p>133</p>
                     </c>
                     <c ca="left">
                        <p>45.11%</p>
                     </c>
                     <c ca="left">
                        <p>4.65 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Development</p>
                     </c>
                     <c ca="left">
                        <p>47</p>
                     </c>
                     <c ca="left">
                        <p>9.48%</p>
                     </c>
                     <c ca="left">
                        <p>120</p>
                     </c>
                     <c ca="left">
                        <p>45.00%</p>
                     </c>
                     <c ca="left">
                        <p>5.25 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>86</p>
                     </c>
                     <c ca="left">
                        <p>5.72%</p>
                     </c>
                     <c ca="left">
                        <p>293</p>
                     </c>
                     <c ca="left">
                        <p>44.71%</p>
                     </c>
                     <c ca="left">
                        <p>1.95 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response to stress</p>
                     </c>
                     <c ca="left">
                        <p>74</p>
                     </c>
                     <c ca="left">
                        <p>6.23%</p>
                     </c>
                     <c ca="left">
                        <p>268</p>
                     </c>
                     <c ca="left">
                        <p>44.03%</p>
                     </c>
                     <c ca="left">
                        <p>3.18 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Regulation of biologic process</p>
                     </c>
                     <c ca="left">
                        <p>134</p>
                     </c>
                     <c ca="left">
                        <p>8.49%</p>
                     </c>
                     <c ca="left">
                        <p>490</p>
                     </c>
                     <c ca="left">
                        <p>43.06%</p>
                     </c>
                     <c ca="left">
                        <p>2.59 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Over</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell</p>
                     </c>
                     <c ca="left">
                        <p>82</p>
                     </c>
                     <c ca="left">
                        <p>6.11%</p>
                     </c>
                     <c ca="left">
                        <p>241</p>
                     </c>
                     <c ca="left">
                        <p>40.66%</p>
                     </c>
                     <c ca="left">
                        <p>5.96 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nucleus</p>
                     </c>
                     <c ca="left">
                        <p>81</p>
                     </c>
                     <c ca="left">
                        <p>10.05%</p>
                     </c>
                     <c ca="left">
                        <p>305</p>
                     </c>
                     <c ca="left">
                        <p>40.66%</p>
                     </c>
                     <c ca="left">
                        <p>5.52 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Extracellular region</p>
                     </c>
                     <c ca="left">
                        <p>44</p>
                     </c>
                     <c ca="left">
                        <p>6.17%</p>
                     </c>
                     <c ca="left">
                        <p>160</p>
                     </c>
                     <c ca="left">
                        <p>40.63%</p>
                     </c>
                     <c ca="left">
                        <p>6.95 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell proliferation</p>
                     </c>
                     <c ca="left">
                        <p>49</p>
                     </c>
                     <c ca="left">
                        <p>7.63%</p>
                     </c>
                     <c ca="left">
                        <p>196</p>
                     </c>
                     <c ca="left">
                        <p>40.31%</p>
                     </c>
                     <c ca="left">
                        <p>6.26 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mitochondrion organization and biogenesis</p>
                     </c>
                     <c ca="left">
                        <p>77</p>
                     </c>
                     <c ca="left">
                        <p>6.90%</p>
                     </c>
                     <c ca="left">
                        <p>213</p>
                     </c>
                     <c ca="left">
                        <p>39.44%</p>
                     </c>
                     <c ca="left">
                        <p>5.29 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cytoplasm</p>
                     </c>
                     <c ca="left">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>6.07%</p>
                     </c>
                     <c ca="left">
                        <p>97</p>
                     </c>
                     <c ca="left">
                        <p>39.18%</p>
                     </c>
                     <c ca="left">
                        <p>7.95 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell death</p>
                     </c>
                     <c ca="left">
                        <p>41</p>
                     </c>
                     <c ca="left">
                        <p>6.77%</p>
                     </c>
                     <c ca="left">
                        <p>164</p>
                     </c>
                     <c ca="left">
                        <p>37.80%</p>
                     </c>
                     <c ca="left">
                        <p>4.34 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein binding</p>
                     </c>
                     <c ca="left">
                        <p>122</p>
                     </c>
                     <c ca="left">
                        <p>7.01%</p>
                     </c>
                     <c ca="left">
                        <p>419</p>
                     </c>
                     <c ca="left">
                        <p>35.80%</p>
                     </c>
                     <c ca="left">
                        <p>4.81 &#215; 10<sup>-4</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell cycle</p>
                     </c>
                     <c ca="left">
                        <p>39</p>
                     </c>
                     <c ca="left">
                        <p>7.67%</p>
                     </c>
                     <c ca="left">
                        <p>176</p>
                     </c>
                     <c ca="left">
                        <p>35.23%</p>
                     </c>
                     <c ca="left">
                        <p>1.35 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nucleotide binding</p>
                     </c>
                     <c ca="left">
                        <p>32</p>
                     </c>
                     <c ca="left">
                        <p>5.82%</p>
                     </c>
                     <c ca="left">
                        <p>112</p>
                     </c>
                     <c ca="left">
                        <p>31.25%</p>
                     </c>
                     <c ca="left">
                        <p>5.81 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein complex</p>
                     </c>
                     <c ca="left">
                        <p>28</p>
                     </c>
                     <c ca="left">
                        <p>5.90%</p>
                     </c>
                     <c ca="left">
                        <p>84</p>
                     </c>
                     <c ca="left">
                        <p>29.76%</p>
                     </c>
                     <c ca="left">
                        <p>7.37 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA binding</p>
                     </c>
                     <c ca="left">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>10.05%</p>
                     </c>
                     <c ca="left">
                        <p>74</p>
                     </c>
                     <c ca="left">
                        <p>29.73%</p>
                     </c>
                     <c ca="left">
                        <p>1.08 &#215; 10<sup>-2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Binding</p>
                     </c>
                     <c ca="left">
                        <p>71</p>
                     </c>
                     <c ca="left">
                        <p>6.77%</p>
                     </c>
                     <c ca="left">
                        <p>240</p>
                     </c>
                     <c ca="left">
                        <p>29.58%</p>
                     </c>
                     <c ca="left">
                        <p>5.58 &#215; 10<sup>-6</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Receptor activity</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>6.53%</p>
                     </c>
                     <c ca="left">
                        <p>88</p>
                     </c>
                     <c ca="left">
                        <p>29.55%</p>
                     </c>
                     <c ca="left">
                        <p>5.67 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Plasma membrane</p>
                     </c>
                     <c ca="left">
                        <p>37</p>
                     </c>
                     <c ca="left">
                        <p>4.78%</p>
                     </c>
                     <c ca="left">
                        <p>91</p>
                     </c>
                     <c ca="left">
                        <p>28.57%</p>
                     </c>
                     <c ca="left">
                        <p>3.00 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein metabolism</p>
                     </c>
                     <c ca="left">
                        <p>41</p>
                     </c>
                     <c ca="left">
                        <p>6.27%</p>
                     </c>
                     <c ca="left">
                        <p>131</p>
                     </c>
                     <c ca="left">
                        <p>25.95%</p>
                     </c>
                     <c ca="left">
                        <p>3.75 &#215; 10<sup>-5</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transporter activity</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>6.28%</p>
                     </c>
                     <c ca="left">
                        <p>91</p>
                     </c>
                     <c ca="left">
                        <p>23.08%</p>
                     </c>
                     <c ca="left">
                        <p>6.74 &#215; 10<sup>-5</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transport</p>
                     </c>
                     <c ca="left">
                        <p>32</p>
                     </c>
                     <c ca="left">
                        <p>5.53%</p>
                     </c>
                     <c ca="left">
                        <p>102</p>
                     </c>
                     <c ca="left">
                        <p>20.59%</p>
                     </c>
                     <c ca="left">
                        <p>1.85 &#215; 10<sup>-6</sup></p>
                     </c>
                     <c ca="left">
                        <p>Under*</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>See Table 6 footnote for details.</p>
               </tblfn>
            </tbl>
            <p>'Development' is another category in which TFBSs are significantly over-conserved in human-mouse comparisons. In human-opossum comparisons TFBSs are still over-conserved, but not at a significant level. This can also be attributed to the sharp decrease in the percentage of detected TFBSs (from 81.5% in mouse to 45% in opossum) in relation to the high number of potentially detectable TFBSs (157 versus 120 in mouse and opossum, respectively). The developmental genes themselves are ultra-conserved in opossum <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, resulting in the detection of many orthologs and hence many potentially detectable TFBSs associated with them. Conservation tables, similar to Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr>, for comparisons between human and other species are available in Additional data file 1.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison with other studies</p>
            </st>
            <p>A number of existing studies have attempted to quantify regulatory conservation in mammals, albeit using different approaches and typically restricting their interest to human-rodent comparisons. Our results on human-rodent comparisons generally agree with these studies. For example, we find approximately 72% of detectable human TFBSs conserved in mouse 5 kb upstream regions. Similarly, Sauer and coworkers <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> reported detection of TRANSFAC <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> TFBSs in human-rodent conserved sequences at a rate of 71.7% when using the same conservation threshold (65% identity).</p>
            <p>For conservation cutoffs of 70% identity, Liu and coworkers <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, Levy and Hannenhalli <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, and Lenhard and colleagues <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> independently found human-mouse conservation rates for known TFBSs of about 60%, 65%, and 68%, respectively. The latter three studies were also based on finding conserved blocks via sliding windows on aligned sequences. Dermitzakis and Clark <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> also reported detection of TRANSFAC TFBSs in human-rodent conserved sequences at rates of 60% to 68%. All of the aforementioned human-rodent TFBS turnover rates are consistent with our findings, given the slightly higher conservation cut-offs and the lower number of known TFBSs tested (40 sites by Lenhard and colleagues <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, 64 sites by Dermitzakis and Clark <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, 467 sites by Liu and coworkers <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, and 481 sites by Levy and Hannenhalli <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>).</p>
            <p>In relation to our human-mouse 5 kb upstream conservation coverage figure (24%), a number of other studies have found human-rodent upstream conservation rates in the range 17% to 25% <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. In a comparison of 77 well defined human-mouse gene pairs, Jareborg and coworkers <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> found 36% conservation coverage of upstream sequence using the software program DBA and a 60% cutoff. However, their upstream sequences ranged from 500 bp to 1,000 bp upstream of the TSS. Our conservation coverage in the same range of distance is 38.7% to 49.2%. Sauer and coworkers <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> found a background conservation rate of 35% in human-rodent comparisons, although their study was based on 800 bp windows of sequence centered on a known TFBSs, and was therefore also biased toward including sequence from the proximal 500 bp region.</p>
            <p>A recent study of the mouse transcriptome showed that a large part of this mammalian genome may be transcribed <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. The authors found many more transcripts than the number of genes currently estimated for the mammalian genomes. For about one-third of these transcripts no association with protein coding genes was found, and therefore they were considered to be noncoding RNAs (ncRNAs). Similar to our study, the authors analyzed the upstream sequences of these potential ncRNAs, which they found to be more conserved than the promoters of the protein coding genes. However, their study has some differences compared with ours. First, it does not focus specifically on the intergenic miRNA genes, but analyzes all transcripts for which no protein coding gene association was found. Also, their study does not depict the similarity we found in the conservation rates of coding and noncoding upstream regions in the first 500 bp, which is an important finding of our study, especially when compared with the conservation of the upstream sequences of the tRNA genes (Figure <figr fid="F2">2</figr>). Cooper and coworkers <abbrgrp><abbr bid="B56">56</abbr></abbrgrp> recently analyzed the conservation rates of core promoter sequences of protein coding genes. Their findings agree with ours in that they find that the first 300 bp upstream of the TSS are important for the core promoter activity. This is the region where we find the highest conservation (Figure <figr fid="F2">2</figr>). In another study, Taylor and coworkers <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> reported that the nucleotide substitution rate increases with the distance from TSS in various types of protein coding genes in a way similar to our observations.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>This study is the first to analyze conservation of the upstream regions of protein coding genes in relation to the upstream regions of intergenic miRNA genes. We found the latter to be about twice as conserved as the former beyond the first 500 bp. The reason for this conservation is currently unknown. The first 500 bp appear to be equally conserved in both types of genes, a feature that is missing from the upstream sequences of the tRNA genes. This indicates that similar mechanisms of gene regulation may be in place, which is in agreement with other studies <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B32">32</abbr></abbrgrp>. The difference in conservation rates is more apparent in the mammalian lineages, including opossum, and may reflect similarities in mammalian gene regulation.</p>
         <p>Another important finding is that the opossum genome offers great potential in terms of improving the performance of the phylogenetic footprinting algorithms. We found that 41% of the known human TFBSs are located in the 6.7% of promoter regions that are conserved between human and opossum, illustrating that the opossum genome sequence can be used to reduce the search space for a large proportion of human TFBSs. A new statistical measure, BRPR, is introduced that quantifies the trade-off between sequence conservation (or reduction of the search space for comparative genomics strategies) and regulatory site conservation. We show that for a given site sensitivity threshold, an appropriate combination of genomes can be selected to minimize the search space. Finally, we find that basic cellular functions, such as cell-cell signaling and receptor binding, have significantly over-conserved sites between human and opossum (the corresponding genes have more TFBSs located in the conserved parts of their promoter regions). By contrast, TFBSs related to functions such as transporter activity and protein metabolism are significantly under-conserved.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>MicroRNA gene dataset</p>
            </st>
            <p>Human miRNA genes were retrieved from the miRBase <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> and the UCSC Genome Browser (version hg18, March 2006) <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. Cross-referencing them with the miRNAMap dataset <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> identified 169 putatively intergenic miRNA genes. The sequences of these miRNAs were used in BLAST-like Alignment Tool (BLAT) <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> alignments against the latest UCSC human genome and their exact genomic locations were identified. Following observations in previous studies <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B62">62</abbr></abbrgrp>, we consider two miRNA genes to be co-transcribed if their starting points are less than 250 bp apart. In this way, we identified 12 clusters containing 31 genes. Only the 5'-most gene in a cluster was considered in our analysis. Five miRNA genes were found to reside within large introns of protein coding genes, and although they may have their own regulatory regions, we excluded them from further analysis. This resulted in a dataset of 145 human intergenic miRNA genes (Additional data file 1). The coordinates of the BLAT outputs were used to retrieve up to 5 kb regions upstream of the gene start site as described below.</p>
            <p>We note that in a recent study, Devor and Samollow (personal communication) tested 71 predicted miRNA genes using quantitative polymerase chain reaction on pooled RNA from brain, heart, lung, liver, tongue, and esophagus from an adult opossum. They found evidence of expression in 80% of the cases they tested, including 36 genes in our opossum dataset.</p>
         </sec>
         <sec>
            <st>
               <p>Pair-wise and multiple species comparisons</p>
            </st>
            <p>Pair-wise and multiple species alignments for both protein coding and miRNA genes were retrieved from the 17-species MULTIZ multiple alignments <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, which are available from the UCSC web server (version hg18, March 2006) <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. The MULTIZ algorithm builds a multiple alignment from local pair-wise BLASTZ alignments of the reference genome with each other genome of interest <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B64">64</abbr></abbrgrp>. Each base in the reference genome is aligned to at most one base in the other genomes, and the alignment is guided by synteny. In this study, we present the results from pair-wise and multiple species comparisons of human <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> with four eutherian mammals (chimpanzee <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>, mouse <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, rat <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>, and dog <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>), the newly sequenced opossum <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, chicken <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>, fugu <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>, and tetraodon <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. A phylogenetic tree for those species and with branch lengths derived from the ENCODE project Multi-Species Sequence Analysis group (September 2005) is shown in Figure <figr fid="F1">1</figr>. This tree was generated using the phyloGif program <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> from Threaded Blockset Aligner (TBA) alignments over 23 vertebrate species and is based on 4D sites (similar to the tree presented by Margulies and coworkers <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>).</p>
            <p>For each pair-wise or multiple species comparisons, the corresponding (aligned) 5 kb upstream sequences were retrieved directly from the MULTIZ alignments for greater accuracy, using the human genes as reference. If other genes were found within this 5 kb range, then the upstream sequences were shortened accordingly to exclude the additional genes. We used the 65% as our conserved block threshold, which is similar to that in previous studies <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> and similar to the default threshold used by many phylogenetic footprinting algorithms <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B13">13</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>tRNA dataset</p>
            </st>
            <p>Human tRNA genes and pair-wise alignments were extracted from the UCSC Genome Browser database (version hg18, March 2006) using the genomic MULTIZ alignments as we describe above. Genes that were found to be facing opposite directions in the genome ('head-to-head') and their starts were closer than 2.5 kb apart were excluded from the analysis. This rule excluded 156 genes. The final human tRNA dataset included 1,795 upstream sequences.</p>
         </sec>
         <sec>
            <st>
               <p>Dataset of known transcription factor binding sites</p>
            </st>
            <p>TRANSFAC database (release 9.3) <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> contains 1,162 human confirmed TFBSs that satisfy the following criteria: the site is experimentally confirmed and associated with a promoter of a human gene from the database (confirmed sites); the TFBS sequence can be found within 5 kb upstream of the TSS; if multiple site occurrences are present in the corresponding promoter, then positional information (relative to TSS) is listed in the database; and the regulated human gene corresponds to an entry in the RefSeq gene collection. The above TFBSs are located in the promoters of 513 human genes, which serves as our primary dataset for the transcription factor-TFBS association study. We focus on the sites located in the 5 kb upstream region, because this includes 83.4% of all known human TFBSs in TRANSFAC (data not shown). The majority of the sites (a total of 774) have a TRANSFAC assigned quality score of 1, 2, 3, or 4, which shows confirmed binding activity to a known transcription factor. For an additional 325 sites, no TRANSFAC quality score was assigned. The remaining 63 sites (about 5%) belong to TRANSFAC category 5, for which an unknown protein has been shown to bind to a DNA element.</p>
         </sec>
         <sec>
            <st>
               <p>Dataset of position-specific scoring matrix models</p>
            </st>
            <p>JASPAR database <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> contains 20 PSSM models for transcription factors whose sites are present in our dataset. In addition, we previously generated manually 60 more PSSM models from high-quality human and mouse sites in TRANSFAC <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, which we make publicly available through our web server <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>. These models were used to analyze the position information content with the nucleotide conservation in the subset of 572 corresponding known TFBSs (Figure <figr fid="F5">5</figr>).</p>
         </sec>
         <sec>
            <st>
               <p>Conserved blocks and transcription factor binding site detection: some definitions</p>
            </st>
            <p>In this study, sequence conservation is expressed as conserved block coverage. A sliding window of width 50 bp and step size 10 bp was used to find conserved regions (or blocks) of at least 65% identity between human and each other species. Each pair-wise alignment was extracted from the MULTIZ multiple alignments. Sauer and coworkers <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> have shown that the 65% identity threshold most effectively separates TFBSs from background sequence in human-rodent comparisons. The percentage of human 5 kb upstream sequence that is located within conserved blocks is denoted the 'conserved block coverage'. The 'average block conservation' is the percentage of identical bases in conserved blocks over all bases in conserved blocks. A 'conserved site' is a known human TFBS that overlaps a conserved block between human and another species. Because we explore the effect of sequence and pattern of conservation in the discovery of <it>cis</it>-regulatory elements, this study does not make any assumptions about the biologic functionality of the human-equivalent TFBSs in the other organisms. In other words, we cannot address the issue of actual site turnover, but simply whether a known human TFBS is located in a conserved block between human and one or more other species (regardless of whether it is functional in these other species). 'Detectable TFBSs' are those sites that are in the promoters of genes that have orthologs in the other species (in terms of UCSC multispecies alignments). A detectable site is considered to be 'conserved' between two species if it is located in a conserved block in their corresponding pair-wise alignment. When multiple species are considered, a TFBS is considered to be conserved if it is conserved in each of the species. The 'TFBS conservation rate' between human and other species is defined as the percentage of detectable TFBSs found to be conserved. The conservation rate can be thought of as the upper limit of sensitivity (at the site level) of a phylogenetic footprinting algorithm if only the conserved regions are analyzed. Such algorithms include ConSite and rVista <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B75">75</abbr></abbrgrp>. In general, the methods and thresholds used to define conserved blocks were chosen to reflect those typically used by phylogenetic footprinting algorithms <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B13">13</abbr><abbr bid="B46">46</abbr><abbr bid="B75">75</abbr></abbrgrp> and by other researchers <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Base regulatory potential rate</p>
            </st>
            <p>A base position is called 'regulatory' if it is part of a TFBS. For this report, bases in nonhuman species that are aligned to human regulatory bases are also called regulatory. We understand that this definition is only made for the purposes of this analysis and does not imply any functional role. However, it is expected that the majority of known human sites that are conserved in various species would also be functional in these species. Given a promoter alignment between two species, we define the base regulatory potential rate (BRPR) as the conditional probability of a base being regulatory given it is located in a conserved region over the prior probability of being regulatory. Formally, BRPR is defined in the first part of the following equation:</p>
            <p>
               <display-formula id="M1">
                  <m:math name="gb-2007-8-5-r84-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtext>BRPR</m:mtext>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>R</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:mi>C</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>R</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>C</m:mi>
                                 <m:mo>|</m:mo>
                                 <m:mi>R</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>P</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>C</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8tuQ8FMI8Gi=hEeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciGacaGaaeqabaqadeqadaaakeaacaqGcbGaaeOuaiaabcfacaqGsbGaeyypa0ZaaSaaaeaacaWGqbGaaiikaiaadkfacaGG8bGaam4qaiaacMcaaeaacaWGqbGaaiikaiaadkfacaGGPaaaaiabg2da9maalaaabaGaamiuaiaacIcacaWGdbGaaiiFaiaadkfacaGGPaaabaGaamiuaiaacIcacaWGdbGaaiykaaaaaaa@4830@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>R </it>denotes the base as regulatory (part of a known human TFBS) and <it>C </it>indicates that it is located in a conserved region. The last part of the equation derives from the Bayesian rule and is the one we use for the calculation of BRPR because <it>P</it>(<it>R|C</it>) cannot be reliably estimated, given our limited knowledge of mammalian TFBSs. In other words, BRPR shows how much we improve our regulatory potential prediction if we restrict our search space to conserved regions only. <it>P</it>(<it>C</it>) and <it>P</it>(<it>C|R</it>) are directly estimated from the data. <it>P</it>(<it>R</it>) is the <it>a priori </it>probability of a base being regulatory in a given promoter, and it depends on the size of the promoter as well as the number and size of <it>cis</it>-regulatory elements found within. According to our current knowledge of transcriptional control, <it>P</it>(<it>R</it>) decreases as one examines windows of sequence more distal to the transcription start site. In this way, calculated BRPR values are dependant on the length of upstream sequence examined from the transcription start. BRPR values decrease as the examined regions become smaller (5 kb to 1 kb or 500 bp from the TSS; Additional data file 1 [Supplementary Figure 1]) because, from Equation 1 above, <it>P</it>(<it>R</it>) increases in these shorter regions while <it>P</it>(<it>R|C</it>) remains relatively constant. The important point to note, however, is that the relative BRPR rankings of different genome combinations remain constant (Additional data file 1 [Supplementary Figure 1]).</p>
         </sec>
         <sec>
            <st>
               <p>Assessing significance of over-conservation or under-conservation for sets of transcription factor binding sites</p>
            </st>
            <p>The Fisher's exact test on 2 &#215; 2 contingency tables is used to estimate the significance of under-conservation or over-conservation of sites bound by particular transcription factors or associated with certain GO categories (Tables <tblr tid="T4">4</tblr> to <tblr tid="T7">7</tblr>). To account for multiple testing we applied the Bonferroni correction, although the data dependencies among the tests make that correction slightly conservative. Statistically over-represented and under-represented categories are presented in Tables <tblr tid="T4">4</tblr> to <tblr tid="T7">7</tblr> in the corresponding column, and those values that remain significant after the Bonferroni correction are marked with asterisks.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> provides various descriptions, generalized analyses, and supplementary data that complement and extend those given in the main text.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>various descriptions and supplementary data that complement those given in the main text</p>
            </caption>
            <text>
               <p>Supplementary Text 1 describes the dependence of conservation rates on the methods employed. Supplementary Text 2 provides a note on some further properties of the BRPR score. Supplementary Figure 1 illustrates the behavior of BRPR scores in mammalian comparisons as the window of examined upstream sequence is reduced. Supplementary Figure 2 reproduces some of the information in Figure <figr fid="F2">2</figr> (main text), but includes error bars in order that statistical significance of our analysis may be judged. Supplementary Table 1. A shows conservation rates of 5 kb upstream regions and TFBSs as found by the DNA Block Aligner (DBA)-based analysis. Supplementary Table 1. B shows conservation rates of 5 kb upstream regions and TFBSs, as found by the UCSC multiple alignment-based analysis. Supplementary Tables 2 to 9 show TFBS conservation dependency on transcription factor identity for human sites conserved in other species (based on UCSC multiple alignment analysis). Supplementary Tables 10 to 17 show TFBS conservation in relation to the GO category of the regulated gene for human sites conserved in eight other species (based on UCSC multiple alignment analysis). Supplementary Table 18 provides conservation rates of 5 kb upstream regions and TFBSs for human compared with 218 combinations (<sub>8</sub>C<sub>5</sub>) of the eight other tested genomes (based on UCSC multiple alignment analysis). Supplementary Table 19 provides a re-analysis of 5 kb upstream coverage rates and regulatory site conservation using only those sites/regulated genes stored in TRANSFAC public (v. 7.0).</p>
            </text>
            <file name="gb-2007-8-5-r84-S1.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Paul Samollow for being the driving force behind the idea of sequencing <it>Monodelphis</it>. We also thank the people from the Broad Institute for their efforts in sequencing and annotating the <it>Monodelphis </it>genome. Special thanks go to Kerstin Lindblad-Toh, Michael Zody, Tarjei Mikkelsen, and Candace Kammerer for helpful discussions. We also thank two anonymous reviewers for their comments that helped us to improve the manuscript. This work was supported by NIH grants RR014214 and NO1 AI-50018, NSF grant MCB0316255 and a grant with the Pennsylvania Department of Health. The PA Department of Health specifically disclaims responsibility for any analyses, interpretations or conclusions. PVB was also supported by NIH grant 1R01LM007994-01 and TATRC/DoD USAMRAA Prime Award W81XWH-05-2-0066.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Genome of the marsupial <it>Monodelphis domestica </it>reveals innovation in non-coding sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Wakefield</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Aken</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Amemiya</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Garber</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gentles</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Goodstadt</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Heger</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>447</volume>
            <fpage>167</fpage>
            <lpage>178</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05805</pubid>
                  <pubid idtype="pmpid" link="fulltext">17495919</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Reconstructing an ancestral mammalian immune supercomplex from a marsupial major histocompatibility complex.</p>
            </title>
            <aug>
               <au>
                  <snm>Belov</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Deakin</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Papenfuss</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Melman</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Siddle</snm>
                  <fnm>HV</fnm>
               </au>
               <au>
                  <snm>Gouin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Goode</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Sargeant</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>MD</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>e46</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1351924</pubid>
                  <pubid idtype="pmpid" link="fulltext">16435885</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0040046</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.</p>
            </title>
            <aug>
               <au>
                  <snm>Siddharthan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Siggia</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>van Nimwegen</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <fpage>e67</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1309704</pubid>
                  <pubid idtype="pmpid" link="fulltext">16477324</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0010067</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>ConSite: web-based prediction of regulatory elements using cross-species comparison.</p>
            </title>
            <aug>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Web Server</issue>
            <fpage>W249</fpage>
            <lpage>W252</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441510</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215389</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh372</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>rVISTA 2.0: evolutionary analysis of transcription factor binding sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Loots</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Web Server</issue>
            <fpage>W217</fpage>
            <lpage>221</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441521</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215384</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh383</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Footer: a quantitative comparative genomics method for efficient recognition of cis-regulatory elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Corcoran</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Feingold</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dominick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Harnaha</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Trucco</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Giannoukakis</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Benos</snm>
                  <fnm>PV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>840</fpage>
            <lpage>847</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142474</pubid>
                  <pubid idtype="pmpid" link="fulltext">15930494</pubid>
                  <pubid idtype="doi">10.1101/gr.2952005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>DNA binding sites: representation and discovery.</p>
            </title>
            <aug>
               <au>
                  <snm>Stormo</snm>
                  <fnm>GD</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>16</fpage>
            <lpage>23</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.1.16</pubid>
                  <pubid idtype="pmpid" link="fulltext">10812473</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Applied bioinformatics for the identification of regulatory elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>276</fpage>
            <lpage>287</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1315</pubid>
                  <pubid idtype="pmpid" link="fulltext">15131651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Eukaryotic regulatory element conservation analysis and identification using comparative genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Altman</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>451</fpage>
            <lpage>458</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">353232</pubid>
                  <pubid idtype="pmpid" link="fulltext">14993210</pubid>
                  <pubid idtype="doi">10.1101/gr.1327604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Evolution of transcription factor binding sites in Mammalian gene regulatory regions: conservation and turnover.</p>
            </title>
            <aug>
               <au>
                  <snm>Dermitzakis</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>1114</fpage>
            <lpage>1121</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12082130</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Evaluating phylogenetic footprinting for human-rodent comparisons.</p>
            </title>
            <aug>
               <au>
                  <snm>Sauer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shelest</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wingender</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>430</fpage>
            <lpage>437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti819</pubid>
                  <pubid idtype="pmpid" link="fulltext">16332706</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Identification of transcription factor binding sites in the human genome sequence.</p>
            </title>
            <aug>
               <au>
                  <snm>Levy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hannenhalli</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mamm Genome</source>
            <pubdate>2002</pubdate>
            <volume>13</volume>
            <fpage>510</fpage>
            <lpage>514</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00335-002-2175-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">12370781</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Identification of conserved regulatory elements by comparative genome analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mendoza</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Engstrom</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jareborg</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
            </aug>
            <source>J Biol</source>
            <pubdate>2003</pubdate>
            <volume>2</volume>
            <fpage>13</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193685</pubid>
                  <pubid idtype="pmpid" link="fulltext">12760745</pubid>
                  <pubid idtype="doi">10.1186/1475-4924-2-13</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Mammalian phylogenomics comes of age.</p>
            </title>
            <aug>
               <au>
                  <snm>Murphy</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Pevzner</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>O'Brien</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>631</fpage>
            <lpage>639</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.09.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">15522459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Genome sequence, comparative analysis and haplotype structure of the domestic dog.</p>
            </title>
            <aug>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wade</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Karlsson</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Jaffe</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Kamal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Kulbokas</snm>
                  <fnm>EJ</fnm>
                  <suf>III</suf>
               </au>
               <au>
                  <snm>Zody</snm>
                  <fnm>MC</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>438</volume>
            <fpage>803</fpage>
            <lpage>819</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04338</pubid>
                  <pubid idtype="pmpid" link="fulltext">16341006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>The <it>C. elegans </it>heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Feinbaum</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Ambros</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1993</pubdate>
            <volume>75</volume>
            <fpage>843</fpage>
            <lpage>854</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(93)90529-Y</pubid>
                  <pubid idtype="pmpid" link="fulltext">8252621</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The functions of animal microRNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Ambros</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>350</fpage>
            <lpage>355</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02871</pubid>
                  <pubid idtype="pmpid" link="fulltext">15372042</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Human MicroRNA targets.</p>
            </title>
            <aug>
               <au>
                  <snm>John</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Aravin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marks</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <fpage>e363</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">521178</pubid>
                  <pubid idtype="pmpid" link="fulltext">15502875</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0020363</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Prediction of mammalian microRNA targets.</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Shih</snm>
                  <fnm>IH</fnm>
               </au>
               <au>
                  <snm>Jones-Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2003</pubdate>
            <volume>115</volume>
            <fpage>787</fpage>
            <lpage>798</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(03)01018-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">14697198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A combined computational-experimental approach predicts human microRNA targets.</p>
            </title>
            <aug>
               <au>
                  <snm>Kiriakidou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Kouranov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fitziev</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bouyioukos</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mourelatos</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hatzigeorgiou</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <fpage>1165</fpage>
            <lpage>1178</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">415641</pubid>
                  <pubid idtype="pmpid" link="fulltext">15131085</pubid>
                  <pubid idtype="doi">10.1101/gad.1184704</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Micromanagers of gene expression: the potentially widespread influence of metazoan microRNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>CZ</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>396</fpage>
            <lpage>400</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1328</pubid>
                  <pubid idtype="pmpid" link="fulltext">15143321</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>MicroRNAs modulate hematopoietic lineage differentiation.</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>CZ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lodish</snm>
                  <fnm>HF</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <fpage>83</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1091903</pubid>
                  <pubid idtype="pmpid" link="fulltext">14657504</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The widespread impact of mammalian MicroRNAs on mRNA repression and evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Farh</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Grimson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>310</volume>
            <fpage>1817</fpage>
            <lpage>1821</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1121158</pubid>
                  <pubid idtype="pmpid" link="fulltext">16308420</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>MicroRNAs in mammalian development.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Risom</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Strauss</snm>
                  <fnm>WM</fnm>
               </au>
            </aug>
            <source>Birth Defects Res C Embryo Today</source>
            <pubdate>2006</pubdate>
            <volume>78</volume>
            <fpage>129</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bdrc.20072</pubid>
                  <pubid idtype="pmpid" link="fulltext">16847889</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A microRNA array reveals extensive regulation of microRNAs during brain development.</p>
            </title>
            <aug>
               <au>
                  <snm>Krichevsky</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Donahue</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Khrapko</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kosik</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2003</pubdate>
            <volume>9</volume>
            <fpage>1274</fpage>
            <lpage>1281</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1370491</pubid>
                  <pubid idtype="pmpid" link="fulltext">13130141</pubid>
                  <pubid idtype="doi">10.1261/rna.5980303</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>NF-kappaB-dependent induction of microRNA miR-146, an inhibitor targeted to signaling proteins of innate immune responses.</p>
            </title>
            <aug>
               <au>
                  <snm>Taganov</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Boldin</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Baltimore</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>12481</fpage>
            <lpage>12486</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1567904</pubid>
                  <pubid idtype="pmpid" link="fulltext">16885212</pubid>
                  <pubid idtype="doi">10.1073/pnas.0605298103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>MicroRNA genes are transcribed by RNA polymerase II.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yeom</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Baek</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2004</pubdate>
            <volume>23</volume>
            <fpage>4051</fpage>
            <lpage>4060</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">524334</pubid>
                  <pubid idtype="pmpid" link="fulltext">15372072</pubid>
                  <pubid idtype="doi">10.1038/sj.emboj.7600385</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The Gene Ontology (GO) database and informatics resource.</p>
            </title>
            <aug>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ireland</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lomax</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Foulger</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eilbeck</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D258</fpage>
            <lpage>D261</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308770</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681407</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>TRANSFAC: transcriptional regulation, from patterns to profiles.</p>
            </title>
            <aug>
               <au>
                  <snm>Matys</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Fricke</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Geffers</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gossling</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Haubrock</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hehl</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hornischer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Karas</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kel</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Kel-Margoulis</snm>
                  <fnm>OV</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>374</fpage>
            <lpage>378</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165555</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520026</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg108</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Multiple mutations of the first gene of a dimeric tRNA gene abolish in vitro tRNA gene transcription.</p>
            </title>
            <aug>
               <au>
                  <snm>Nichols</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Klekamp</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Weil</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Soll</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1989</pubdate>
            <volume>264</volume>
            <fpage>17084</fpage>
            <lpage>17090</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2676999</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Antisense-mediated depletion reveals essential and specific functions of microRNAs in <it>Drosophila </it>development.</p>
            </title>
            <aug>
               <au>
                  <snm>Leaman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>PY</fnm>
               </au>
               <au>
                  <snm>Fak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yalcin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pearce</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Unnerstall</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Marks</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gaul</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>121</volume>
            <fpage>1097</fpage>
            <lpage>1108</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.04.016</pubid>
                  <pubid idtype="pmpid" link="fulltext">15989958</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>A novel <it>C. elegans </it>zinc finger transcription factor, lsy-2, required for the cell type-specific expression of the lsy-6 microRNA.</p>
            </title>
            <aug>
               <au>
                  <snm>Johnston</snm>
                  <fnm>RJ</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Hobert</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2005</pubdate>
            <volume>132</volume>
            <fpage>5451</fpage>
            <lpage>5460</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1242/dev.02163</pubid>
                  <pubid idtype="pmpid" link="fulltext">16291785</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Conservation of RET regulatory function from human to zebrafish without sequence similarity.</p>
            </title>
            <aug>
               <au>
                  <snm>Fisher</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grice</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Vinton</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Bessling</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>McCallion</snm>
                  <fnm>AS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>312</volume>
            <fpage>276</fpage>
            <lpage>279</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1124070</pubid>
                  <pubid idtype="pmpid" link="fulltext">16556802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Evidence for stabilizing selection in a eukaryotic enhancer element.</p>
            </title>
            <aug>
               <au>
                  <snm>Ludwig</snm>
                  <fnm>MZ</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Kreitman</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>564</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35000615</pubid>
                  <pubid idtype="pmpid" link="fulltext">10676967</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>A new generation of JASPAR, the open-access repository for transcription factor binding site profiles.</p>
            </title>
            <aug>
               <au>
                  <snm>Vlieghe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>De Bleser</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Vleminckx</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>van Roy</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D95</fpage>
            <lpage>D97</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347477</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381983</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj115</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The hardwiring of development: organization and function of genomic regulatory systems.</p>
            </title>
            <aug>
               <au>
                  <snm>Arnone</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Davidson</snm>
                  <fnm>EH</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>1997</pubdate>
            <volume>124</volume>
            <fpage>1851</fpage>
            <lpage>1864</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9169833</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Regulation of phosphoenolpyruvate carboxykinase (GTP) gene expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Hanson</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Reshef</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>1997</pubdate>
            <volume>66</volume>
            <fpage>581</fpage>
            <lpage>611</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.66.1.581</pubid>
                  <pubid idtype="pmpid" link="fulltext">9242918</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1034</fpage>
            <lpage>1050</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182216</pubid>
                  <pubid idtype="pmpid" link="fulltext">16024819</pubid>
                  <pubid idtype="doi">10.1101/gr.3715005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Identification and characterization of multi-species conserved sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Margulies</snm>
                  <fnm>EH</fnm>
               </au>
               <au>
                  <snm>Blanchette</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ED</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>2507</fpage>
            <lpage>2518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403793</pubid>
                  <pubid idtype="pmpid" link="fulltext">14656959</pubid>
                  <pubid idtype="doi">10.1101/gr.1602203</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Phylogenetic shadowing of primate sequences to find functional regions of the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Boffelli</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McAuliffe</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pachter</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>299</volume>
            <fpage>1391</fpage>
            <lpage>1394</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1081331</pubid>
                  <pubid idtype="pmpid" link="fulltext">12610304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>eShadow: a tool for comparing closely related sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Boffelli</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Loots</snm>
                  <fnm>GG</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1191</fpage>
            <lpage>1198</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419798</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173121</pubid>
                  <pubid idtype="doi">10.1101/gr.1773104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Evolution of candidate transcriptional regulatory motifs since the human-chimpanzee divergence.</p>
            </title>
            <aug>
               <au>
                  <snm>Donaldson</snm>
                  <fnm>IJ</fnm>
               </au>
               <au>
                  <snm>Gottgens</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>R52</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1779530</pubid>
                  <pubid idtype="pmpid" link="fulltext">16808854</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Position specific variation in the rate of evolution in transcription factor binding sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Moses</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Chiang</snm>
                  <fnm>DY</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>19</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">212491</pubid>
                  <pubid idtype="pmpid" link="fulltext">12946282</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-3-19</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Large-scale turnover of functional transcription factor binding sites in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Moses</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Pollard</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Nix</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>XY</fnm>
               </au>
               <au>
                  <snm>Biggin</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e130</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1599766</pubid>
                  <pubid idtype="pmpid" link="fulltext">17040121</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0020130</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>JASPAR: an open-access database for eukaryotic transcription factor binding profiles.</p>
            </title>
            <aug>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Alkema</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Engstrom</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D91</fpage>
            <lpage>D94</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308747</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681366</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh012</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>FOOTER: a web tool for finding mammalian DNA regulatory regions using phylogenetic footprinting.</p>
            </title>
            <aug>
               <au>
                  <snm>Corcoran</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Feingold</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Benos</snm>
                  <fnm>PV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Web Server</issue>
            <fpage>W442</fpage>
            <lpage>W446</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1160181</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980508</pubid>
                  <pubid idtype="doi">10.1093/nar/gki420</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Identification of conserved structural features at sequentially degenerate locations in transcription factor binding sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Burden</snm>
                  <fnm>HE</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Genome Inform</source>
            <pubdate>2005</pubdate>
            <volume>16</volume>
            <fpage>49</fpage>
            <lpage>58</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16362906</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Ultraconserved elements in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pheasant</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Makunin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Stephen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Mattick</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <fpage>1321</fpage>
            <lpage>1325</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1098119</pubid>
                  <pubid idtype="pmpid" link="fulltext">15131266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Highly conserved upstream sequences for transcription factor genes and implications for the regulatory network.</p>
            </title>
            <aug>
               <au>
                  <snm>Iwama</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>17156</fpage>
            <lpage>17161</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">534610</pubid>
                  <pubid idtype="pmpid" link="fulltext">15572454</pubid>
                  <pubid idtype="doi">10.1073/pnas.0407670101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bruce</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Engstrom</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Klos</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Ericson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>99</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">544600</pubid>
                  <pubid idtype="pmpid" link="fulltext">15613238</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-5-99</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Highly conserved non-coding sequences are associated with vertebrate development.</p>
            </title>
            <aug>
               <au>
                  <snm>Woolfe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goodson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goode</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Snell</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McEwen</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Vavouri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>North</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Callaway</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">526512</pubid>
                  <pubid idtype="pmpid" link="fulltext">15630479</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Selective constraint in intergenic regions of human and mouse genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Shabalina</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Ogurtsov</snm>
                  <fnm>AY</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>AS</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>373</fpage>
            <lpage>376</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(01)02344-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">11418197</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Human-mouse genome comparisons to locate regulatory sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Palumbo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Fickett</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>26</volume>
            <fpage>225</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/79965</pubid>
                  <pubid idtype="pmpid" link="fulltext">11017083</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Comparative analysis of noncoding regions of 77 orthologous mouse and human gene pairs.</p>
            </title>
            <aug>
               <au>
                  <snm>Jareborg</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>815</fpage>
            <lpage>824</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310816</pubid>
                  <pubid idtype="pmpid" link="fulltext">10508839</pubid>
                  <pubid idtype="doi">10.1101/gr.9.9.815</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>The transcriptional landscape of the mammalian genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kasukawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Katayama</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Frith</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Maeda</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Oyama</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ravasi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wells</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>309</volume>
            <fpage>1559</fpage>
            <lpage>1563</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1112014</pubid>
                  <pubid idtype="pmpid" link="fulltext">16141072</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Comprehensive analysis of transcriptional promoter structure and function in 1% of the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Cooper</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Trinklein</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Anton</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>1</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1356123</pubid>
                  <pubid idtype="pmpid" link="fulltext">16344566</pubid>
                  <pubid idtype="doi">10.1101/gr.4222606</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Heterotachy in mammalian promoter evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Taylor</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Kai</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kawai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Semple</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e30</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1449885</pubid>
                  <pubid idtype="pmpid" link="fulltext">16683025</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0020030</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>miRBase: microRNA sequences, targets and gene nomenclature.</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grocock</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>van Dongen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D140</fpage>
            <lpage>D144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347474</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381832</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>UCSC Genome Browser</p>
            </title>
            <url>http://genome.ucsc.edu/</url>
         </bibl>
         <bibl id="B60">
            <title>
               <p>miRNAMap: genomic maps of microRNA genes and their target genes in mammalian genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Hsu</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>LZ</fnm>
               </au>
               <au>
                  <snm>Tsou</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Tseng</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Washietl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D135</fpage>
            <lpage>D139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347497</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381831</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj135</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>BLAT: the BLAST-like alignment tool.</p>
            </title>
            <aug>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>656</fpage>
            <lpage>664</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187518</pubid>
                  <pubid idtype="pmpid" link="fulltext">11932250</pubid>
                  <pubid idtype="doi">10.1101/gr.229202. Article published online before March 2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>A polycistronic microRNA cluster, miR-17-92, is overexpressed in human lung cancers and enhances cell proliferation.</p>
            </title>
            <aug>
               <au>
                  <snm>Hayashita</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Osada</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tatematsu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yanagisawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tomida</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yatabe</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kawahara</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sekido</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Cancer Res</source>
            <pubdate>2005</pubdate>
            <volume>65</volume>
            <fpage>9628</fpage>
            <lpage>9632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1158/0008-5472.CAN-05-2352</pubid>
                  <pubid idtype="pmpid" link="fulltext">16266980</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>UCSC MULTIZ alignments</p>
            </title>
            <url>http://hgdownload.cse.ucsc.edu/goldenPath/hg18/multiz17way/</url>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Aligning multiple genomic sequences with the threaded blockset aligner.</p>
            </title>
            <aug>
               <au>
                  <snm>Blanchette</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Riemer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Elnitski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Smit</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Roskin</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ED</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>708</fpage>
            <lpage>715</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">383317</pubid>
                  <pubid idtype="pmpid" link="fulltext">15060014</pubid>
                  <pubid idtype="doi">10.1101/gr.1933104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Initial sequencing and analysis of the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Linton</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zody</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Devon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dewar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>FitzHugh</snm>
                  <fnm>W</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>860</fpage>
            <lpage>921</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35057062</pubid>
                  <pubid idtype="pmpid" link="fulltext">11237011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Initial sequence of the chimpanzee genome and comparison with the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Consortium</snm>
                  <fnm>CSaA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>69</fpage>
            <lpage>87</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04072</pubid>
                  <pubid idtype="pmpid" link="fulltext">16136131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Initial sequencing and comparative analysis of the mouse genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Agarwal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Agarwala</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ainscough</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alexandersson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <fpage>520</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01262</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466850</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Genome sequence of the Brown Norway rat yields insights into mammalian evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Metzker</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Muzny</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Sodergren</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Steffen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Worley</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Burch</snm>
                  <fnm>PE</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>428</volume>
            <fpage>493</fpage>
            <lpage>521</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02426</pubid>
                  <pubid idtype="pmpid" link="fulltext">15057822</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Warren</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hardison</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Burt</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Groenen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Delany</snm>
                  <fnm>ME</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>432</volume>
            <fpage>695</fpage>
            <lpage>716</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03154</pubid>
                  <pubid idtype="pmpid" link="fulltext">15592404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Whole-genome shotgun assembly and analysis of the genome of <it>Fugu rubripes</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Aparicio</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chapman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stupka</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chia</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Dehal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Christoffels</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rash</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hoon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Smit</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>297</volume>
            <fpage>1301</fpage>
            <lpage>1310</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1072104</pubid>
                  <pubid idtype="pmpid" link="fulltext">12142439</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Genome duplication in the teleost fish <it>Tetraodon nigroviridis </it>reveals the early vertebrate proto-karyotype.</p>
            </title>
            <aug>
               <au>
                  <snm>Jaillon</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Aury</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Brunet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Petit</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Stange-Thomann</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mauceli</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bouneau</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ozouf-Costaz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bernot</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>946</fpage>
            <lpage>957</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03025</pubid>
                  <pubid idtype="pmpid" link="fulltext">15496914</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>PhyloGif Program for Phylogenetic Trees</p>
            </title>
            <url>http://genome.ucsc.edu/cgi-bin/phyloGif</url>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Comparative sequencing provides insights about the structure and conservation of marsupial and monotreme genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Margulies</snm>
                  <fnm>EH</fnm>
               </au>
               <au>
                  <snm>Maduro</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Tomkins</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Amemiya</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ED</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>3354</fpage>
            <lpage>3359</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">549084</pubid>
                  <pubid idtype="pmpid" link="fulltext">15718282</pubid>
                  <pubid idtype="doi">10.1073/pnas.0408539102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Benos laboratory web server.</p>
            </title>
            <url>http://www.benoslab.pitt.edu</url>
         </bibl>
         <bibl id="B75">
            <title>
               <p>rVista for comparative sequence-based discovery of functional transcription factor binding sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Loots</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pachter</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>832</fpage>
            <lpage>839</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">186580</pubid>
                  <pubid idtype="pmpid" link="fulltext">11997350</pubid>
                  <pubid idtype="doi">10.1101/gr.225502. Article published online before print in April 2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
