<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-8-441</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Database</dochead>
      <bibl>
         <title>
            <p>TFCONES: A database of vertebrate transcription factor-encoding genes and their associated conserved noncoding elements</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Lee</snm>
               <mi>P</mi>
               <fnm>Alison</fnm>
               <insr iid="I1"/>
               <email>alee@imcb.a-star.edu.sg</email>
            </au>
            <au id="A2">
               <snm>Yang</snm>
               <fnm>Yuchen</fnm>
               <insr iid="I1"/>
               <email>ycyang@imcb.a-star.edu.sg</email>
            </au>
            <au id="A3">
               <snm>Brenner</snm>
               <fnm>Sydney</fnm>
               <insr iid="I1"/>
               <email>sydney_brenner@a-star.edu.sg</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Venkatesh</snm>
               <fnm>Byrappa</fnm>
               <insr iid="I1"/>
               <email>mcbbv@imcb.a-star.edu.sg</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute of Molecular and Cell Biology, 61 Biopolis Drive, Singapore 138673, Singapore</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>441</fpage>
         <url>http://www.biomedcentral.com/1471-2164/8/441</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18045502</pubid>
               <pubid idtype="doi">10.1186/1471-2164-8-441</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>06</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>29</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>29</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Lee et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Transcription factors (TFs) regulate gene transcription and play pivotal roles in various biological processes such as development, cell cycle progression, cell differentiation and tumor suppression. Identifying <it>cis</it>-regulatory elements associated with TF-encoding genes is a crucial step in understanding gene regulatory networks. To this end, we have used a comparative genomics approach to identify putative <it>cis</it>-regulatory elements associated with TF-encoding genes in vertebrates.</p>
            </sec>
            <sec>
               <st>
                  <p>Description</p>
               </st>
               <p>We have created a database named TFCONES (<ul>T</ul>ranscription <ul>F</ul>actor Genes &amp; Associated <ul>CO</ul>nserved <ul>N</ul>oncoding <ul>E</ul>lement<ul>S</ul>) (<url>http://tfcones.fugu-sg.org</url>) which contains all human, mouse and fugu TF-encoding genes and conserved noncoding elements (CNEs) associated with them. The CNEs were identified by gene-by-gene alignments of orthologous TF-encoding gene loci using MLAGAN. We also predicted putative transcription factor binding sites within the CNEs. A significant proportion of human-fugu CNEs contain experimentally defined binding sites for transcriptional activators and repressors, indicating that a majority of the CNEs may function as transcriptional regulatory elements. The TF-encoding genes that are involved in nervous system development are generally enriched for human-fugu CNEs. Users can retrieve TF-encoding genes and their associated CNEs by conducting a keyword search or by selecting a family of DNA-binding proteins.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The conserved noncoding elements identified in TFCONES represent a catalog of highly prioritized putative <it>cis</it>-regulatory elements of TF-encoding genes and are candidates for functional assay.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Transcription factors (TFs) are proteins that bind to <it>cis</it>-regulatory elements and activate or repress transcription of genes. Other proteins that are involved in transcriptional regulation and that could fall under the broad classification of TFs, include co-factors, chromatin remodeling enzymes involved in chromatin or histone modification affecting the transcriptional state of target genes, and general transcription factors that associate directly with RNA polymerase II in a transcription initiation complex. The human genome is estimated to contain about 2,000 TF-encoding genes including co-factors, chromatin remodeling enzymes and general transcription factors <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. TFs play crucial roles in development (e.g., HOX, SOX and PAX proteins), cell cycle progression (e.g., c-MYC, c-JUN), tumor suppression (e.g., p53, FOXO proteins) and cell differentiation (e.g., RUNX, DLX proteins). The vast majority of TFs are known to regulate the expression of a number of different genes, and TF-encoding genes themselves are the key targets of TFs, with many TFs regulating the expression levels of their own genes. Thus, TFs represent crucial nodes in the gene regulatory networks that determine the correct development of the body plan and regulate various physiological processes. To gain an understanding of these gene regulatory networks, it is important to identify <it>cis</it>-regulatory elements associated with TF-encoding genes and the TFs involved in the differential expression of TF-encoding genes. Mutations that disrupt the <it>cis</it>-regulatory elements of TF-encoding genes have been implicated in genetic diseases such as aniridia (disruption of <it>PAX6 </it>regulatory elements, <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>), Rieger syndrome (<it>PITX2 </it><abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>) and campomelic dysplasia (<it>SOX9 </it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp>).</p>
         <p>With the availability of whole genome sequences of several vertebrates, comparative genomics has proved to be a powerful approach for identifying <it>cis</it>-regulatory elements in vertebrate genomes. Since functional noncoding elements tend to evolve slowly due to selective pressure, potential <it>cis</it>-regulatory elements can be identified as noncoding sequences that are conserved in distantly related genomes. Whole-genome comparisons of human and other vertebrates have indeed proved to be effective in identifying functional <it>cis</it>-regulatory elements in the human genome <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Notably, a large number of conserved noncoding elements (CNEs) identified in these genome-wide comparisons were found to be associated with TF-encoding and developmental genes. For example, 83% (1,140 out of 1,373) of CNEs (>70% identity/>100 bp alignment length) identified in genome-wide comparison of human and fugu were located in the vicinity of about 120 human DNA-binding TF-encoding genes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, while 104 of the 290 human genes associated with human-zebrafish CNEs (>70% identity/>80 bp sequence length) were found to be TF-encoding genes <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. In spite of such a known association between TF-encoding genes and CNEs, no systematic gene-by-gene comparison of all the orthologous human and other vertebrate TF-encoding genes has been carried out to identify potential <it>cis</it>-regulatory elements associated with them. Whole-genome comparisons, particularly between distantly related genomes such as human and fish, fail to identify and align all the orthologous sequences due to the stringent criterion of local alignment algorithms. On the other hand, locus-by-locus comparison of orthologous TF-encoding genes is more effective in identifying all the associated CNEs. Furthermore, because global alignment algorithms have an additional assumption that input sequences occur in the same order and orientation, they have more power in detecting weakly conserved regions than local alignments <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. Nevertheless, it should be noted that global alignment algorithms tend to miss conserved functional elements that have undergone local inversions and rearrangements. Previously, orthologous TF-encoding genes of human, rodents, fugu and zebrafish have been compared <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. However, such studies were promoter-centric, with comparisons restricted to sequences flanking the transcription start sites. The scope of such studies is limited because <it>cis</it>-regulatory elements can be located at considerable distance from transcription start sites, even up to several hundred kilobases away <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. More recently, a selected set of developmentally regulated genes and their associated conserved noncoding elements has been reported <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. This database includes many but not all of the TF-encoding genes.</p>
         <p>We have constructed a database of DNA-binding transcription factor-encoding genes in vertebrate genomes and evolutionarily conserved noncoding elements associated with them. This database would be useful to researchers interested in studying the regulation of TF-encoding genes and understanding gene regulatory networks in vertebrates.</p>
      </sec>
      <sec>
         <st>
            <p>Construction and Content</p>
         </st>
         <sec>
            <st>
               <p>Human, mouse and fugu transcription factor genes</p>
            </st>
            <p>Sequences for 1,962 human TFs were obtained from <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and redundancies were removed by a homology search against human RefSeq proteins. Several known proteins missing from Messina et al.'s dataset (e.g., <it>JMJ2A, JMJ2C, JMJ2D, HES4 </it>and <it>DLX6</it>) were included with the search results and the resulting proteins were mapped to human genes in Ensembl Release 37 <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The TFs were classified by DNA-binding domains or if lacking a DNA-binding domain, the TFs were classified separately into one of the following categories: co-factors, general transcription factors, components of chromatin remodeling complexes and transcriptional regulators that are involved solely in protein-protein interactions (i.e., TFs with ZnF-PHD, ZnF-BTB/POZ, ZnF-MYND domains). Mouse orthologs were retrieved from Ensembl BioMart. Fugu orthologs were identified using a combination of data from Ensembl BioMart (fugu version 4 assembly) and INPARANOID analysis <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Fishes contain duplicate copies for many human genes due to a 'fish-specific' whole-genome duplication event in the fish lineage <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. INPARANOID was used to identify duplicate fugu orthologs for human TFs that may have been missed in Ensembl. Only proteins longer than 50 residues were used in the analysis. INPARANOID identified some many-to-many ortholog groups which were resolved into smaller families based on phylogenetic analysis using PHYLIP <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> with sequences from cartilaginous fishes, lamprey, amphioxus or <it>Ciona intestinalis </it>as the outgroup. For each family, multiple human and fugu proteins and a single outgroup sequence were aligned with ClustalW. Alignments with ungapped alignment length less than 50 residues were rejected and consensus trees were generated for the remaining alignments using 1,000 bootstraps and the PHYLIP programs PROTDIST and NEIGHBOR (Neighbor-Joining method). Following the generation of phylogenetic trees, families were resolved to one-to-one or one-to-two human-to-fugu gene relationships based on visual inspection of the consensus trees. TF-encoding genes present in clusters (linked to each other on a chromosome/scaffold, with no intervening non-TF-encoding gene) in the genome were identified using a pre-computed index of the relative locations of all protein-coding genes in Ensembl Release 37. To avoid redundancy in predicting conserved noncoding elements associated with TF-encoding genes that reside in clusters, we identified conserved clusters of human TF-encoding genes whose orthologs in mouse and fugu are conserved in the same order and orientation.</p>
         </sec>
         <sec>
            <st>
               <p>Aligning human-mouse-fugu TF-encoding gene loci and identifying CNEs</p>
            </st>
            <p>The protocol used for identifying CNEs associated with human TF-encoding genes is summarized in Fig <figr fid="F1">1</figr>. Genomic sequences of human, mouse and fugu TF-encoding gene loci (spanning entire conserved clusters or singleton genes) including the entire 5' and 3' flanking regions (up to the next gene upstream and downstream), were retrieved from Ensembl Release 37. The sequences were repeat-masked using RepeatMasker version open-3.1.5 <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Sequences of human, mouse and fugu orthologous gene loci were aligned using MLAGAN <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Multiple alignments between human, mouse and fugu sequences were preferred over pair-wise alignments between human and fugu sequences, because multiple alignments are generally more selective (fewer false positives) than pair-wise alignments in the detection of regulatory elements <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. A global alignment strategy is suitable for our goal of aligning orthologous TF-encoding gene loci which are assumed to occur in the same order and orientation. However, it should be noted that global alignment algorithms are unable to detect conserved functional elements that have undergone local inversions and rearrangements.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>A flowchart of protocol used for identifying human-mouse and human-fugu conserved noncoding elements</p>
               </caption>
               <text>
                  <p>A flowchart of protocol used for identifying human-mouse and human-fugu conserved noncoding elements.</p>
               </text>
               <graphic file="1471-2164-8-441-1"/>
            </fig>
            <p>Conserved noncoding sequences were visualized and detected with VISTA <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> using the fugu sequence as the reference sequence. Conserved noncoding sequences between human and mouse were defined as regions with minimum 70% identity over 100 bp of sequence <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Considering the longer evolutionary distance between human and fugu, and the fact that BLASTZ alignments of human and fugu genomes show that the alignable regions cover only 1.8% of the human genome with an average identity of 60% <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, we defined conserved noncoding sequences between mammals and fugu as sequences that display minimum 65% identity over 50 bp of sequence. Any conserved noncoding sequences that overlapped with protein-coding sequence (Ensembl Release 37 or BLASTX search against NCBI's non-redundant proteome, E-value &lt; 10<sup>-4</sup>) were eliminated. Pseudogenes and noncoding RNA genes annotated in Ensembl Release 37 were also eliminated. Conserved noncoding sequences that contained RNA genes were identified based on BLASTN (E-value &lt; 10<sup>-4</sup>) and INFERNAL searches against Rfam (Release 7.0) and miRBase (Release 8.0), and excluded from further analysis. Sequences containing 20% or more microsatellite repeats (2 to 5 nucleotides with minimum 4 occurrences and minimum 10 bp in length) were also filtered out. The remaining conserved noncoding sequences were reclassified as conserved noncoding elements (CNEs). CNEs were annotated as " 5' ", "intronic" or " 3' " based on their location with respect to the protein-coding sequence of human TF-encoding genes. CNEs that fall within intergenic regions of TF-encoding genes are annotated with respect to both upstream and downstream genes.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of CNE statistics for human TF-encoding genes</p>
            </st>
            <p>For every orthologous human TF-encoding gene, we determined the density, number and total length of human-fugu CNEs associated with that gene. CNE density is defined as the number of bases residing in CNEs per unit length (1 kilobase in human; 100 bp in fugu) of non-repetitive noncoding sequence in a human gene locus or the longest orthologous fugu gene locus. For genes that are part of conserved clusters, we averaged out the number or length of CNEs present in the whole cluster over the number of genes in that cluster. For CNE density, we averaged the total length of CNEs present in the whole cluster over the non-repetitive noncoding sequence length of the cluster.</p>
         </sec>
         <sec>
            <st>
               <p>Gene Ontology enrichment analysis</p>
            </st>
            <p>To test the null hypothesis that human TF-encoding genes containing human-fugu CNEs are independent of their associated Gene Ontology (GO) terms, a Fisher's exact test was applied in the GOstat package <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Gene Ontology terms of genes that contained CNEs were submitted to GOstat for analysis against all the human TF orthologs. Two-tailed <it>P</it>-values were reported. Only the "biological process" subset of GO hierarchy was considered. GO output was restricted to GO-terms with <it>P</it>-value &lt; 0.01, and the "false discovery rate" method <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> was used for multiple testing correction. The reference GO annotation was retrieved from Ensembl BioMart for all human protein-coding genes.</p>
         </sec>
         <sec>
            <st>
               <p>Enrichment analysis of CNEs within experimentally defined TF binding sites</p>
            </st>
            <p>The human-fugu CNEs that overlap with the binding sites of OCT4, SOX2, NANOG <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, c-MYC <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and SUZ12 <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> were identified, by comparing chromosomal coordinates of CNEs and TF bound regions and counting CNEs with at least half of its length overlapped by a bound region. Except the bound regions of c-MYC which were identified by the ChIP-PET technique, all other bound regions were identified by ChIP-on-chip.</p>
            <p>To test the null hypothesis that the number of CNEs that overlap experimentally validated TF binding sites is no greater than that expected from CNEs that overlap randomized TF binding sites, a binomial distribution was applied in the following way. For every bound region identified by ChIP-on-chip, we generated a random genomic region of the same length by randomly choosing a probe present on the microarray chip and centering the random genomic region on the chosen probe. For c-MYC, random regions were sampled from the entire human genome. After each randomization, the proportion of CNEs that overlapped the random regions was determined. The random process was repeated 1000 times and a binomial distribution was applied using the expectation derived from the overlap of CNEs with random regions. One-tailed <it>P</it>-values were calculated.</p>
         </sec>
         <sec>
            <st>
               <p>Prediction of TF binding sites</p>
            </st>
            <p>TF binding sites were detected in human-fugu CNEs using TESS (TRANSFAC 4.0 data) <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Only vertebrate sites and weight matrices were searched with background (A+T)-content of 60.4% (average (A+T)-content for all human CNE sequences) or 57.4% (average (A+T)-content for all fugu CNE sequences). Overlapping binding sites bound by the same TFs were removed, and only string matches with log likelihood score ratio <it>L</it><sub><it>q </it></sub>> 0.98 and matrix matches with core similarity <it>S</it><sub><it>c </it></sub>> 90% and matrix similarity <it>S</it><sub><it>m </it></sub>> 80% were retained.</p>
         </sec>
         <sec>
            <st>
               <p>Implementation of TFCONES database</p>
            </st>
            <p>The TFCONES database is housed in a MySQL server and made accessible via an Apache web server. Perl scripts are used to generate result pages in response to user queries. The Generic Genome Browser <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> is used for the display of CNEs on the human genome. A local <it>wwwblast </it>server <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> has been set up to allow users to BLAST-search their sequences against the CNEs in the TFCONES database. In order to provide a set of high-confidence TF binding sites located in CNEs that would be useful for experiments, only human TF binding sites that overlap an orthologous prediction in the fugu sequence are displayed in the database.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Utility and Discussion</p>
         </st>
         <sec>
            <st>
               <p>Transcription factor-encoding genes in human, mouse and fugu</p>
            </st>
            <p>In the TFCONES database, we have identified in total 1,738 human, 1,495 mouse and 1,762 fugu TF-encoding genes. For the purpose of identifying conserved <it>cis</it>-regulatory elements associated with a set of TF-encoding genes that regulate transcription in the same way (i.e., via DNA-binding), we excluded the co-factors, chromatin modifiers and general transcription factors from further analysis and focused on DNA-binding transcription factors. This resulted in the identification of 1,327 sequence-dependent or DNA-binding transcription factors. Similarly, 1,086 and 1,328 DNA-binding TFs were identified in mouse and fugu respectively. Almost all families of TFs, except C<sub>2</sub>H<sub>2</sub>-type (cysteine<sub>2</sub>-histidine<sub>2</sub>) zinc finger proteins, contain a higher number of genes in fugu than human and mouse (Table <tblr tid="T1">1</tblr>). These additional TFs in fugu are likely to be the result of the 'fish-specific' whole-genome duplication in the fish lineage <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. We identified the mouse and fugu orthologs for human TF-encoding genes using reciprocal BLAST and INPARANOID (see Construction and Content). Of the 1,327 human genes that encode DNA-binding transcription factors, 1,069 genes have orthologs in mouse while 816 genes have orthologs in both mouse and fugu. Fugu contains two or more orthologs for 36 human TF-encoding genes (Additional data file <supplr sid="S1">1</supplr>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>DNA-binding TF-encoding genes in human, mouse and fugu genomes. The TFs were classified based on their DNA-binding domains using a classification scheme adapted from Messina et al. [1]</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>
                           <b>TF family</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Human</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Mouse</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Fugu</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AP-2</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ARID</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Beta-scaffold &#8211; CCAAT</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Beta-scaffold &#8211; MADS</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Beta-scaffold &#8211; p53</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Beta-scaffold &#8211; RUNT</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Beta-scaffold &#8211; Others</p>
                     </c>
                     <c ca="right">
                        <p>28</p>
                     </c>
                     <c ca="right">
                        <p>28</p>
                     </c>
                     <c ca="right">
                        <p>31</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BHLH</p>
                     </c>
                     <c ca="right">
                        <p>96</p>
                     </c>
                     <c ca="right">
                        <p>94</p>
                     </c>
                     <c ca="right">
                        <p>134</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BZIP</p>
                     </c>
                     <c ca="right">
                        <p>69</p>
                     </c>
                     <c ca="right">
                        <p>65</p>
                     </c>
                     <c ca="right">
                        <p>83</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dwarfin</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>E2F</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Forkhead</p>
                     </c>
                     <c ca="right">
                        <p>40</p>
                     </c>
                     <c ca="right">
                        <p>35</p>
                     </c>
                     <c ca="right">
                        <p>60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GCM</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Heat shock factor</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>High mobility group box</p>
                     </c>
                     <c ca="right">
                        <p>35</p>
                     </c>
                     <c ca="right">
                        <p>39</p>
                     </c>
                     <c ca="right">
                        <p>56</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Homeobox</p>
                     </c>
                     <c ca="right">
                        <p>221</p>
                     </c>
                     <c ca="right">
                        <p>199</p>
                     </c>
                     <c ca="right">
                        <p>282</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Nuclear hormone receptor</p>
                     </c>
                     <c ca="right">
                        <p>49</p>
                     </c>
                     <c ca="right">
                        <p>49</p>
                     </c>
                     <c ca="right">
                        <p>69</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Paired box</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RFX</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T-box</p>
                     </c>
                     <c ca="right">
                        <p>16</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                     <c ca="right">
                        <p>19</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TEA</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trp cluster &#8211; Ets</p>
                     </c>
                     <c ca="right">
                        <p>29</p>
                     </c>
                     <c ca="right">
                        <p>29</p>
                     </c>
                     <c ca="right">
                        <p>31</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trp cluster &#8211; IRF</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trp cluster &#8211; Myb</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZnF-C2H2</p>
                     </c>
                     <c ca="right">
                        <p>472</p>
                     </c>
                     <c ca="right">
                        <p>290</p>
                     </c>
                     <c ca="right">
                        <p>254</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZnF-C3H</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZnF-DM</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZnF-GATA</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZnF-Others</p>
                     </c>
                     <c ca="right">
                        <p>61</p>
                     </c>
                     <c ca="right">
                        <p>55</p>
                     </c>
                     <c ca="right">
                        <p>73</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Others (e.g., SAND, RBPSUH)</p>
                     </c>
                     <c ca="right">
                        <p>84</p>
                     </c>
                     <c ca="right">
                        <p>72</p>
                     </c>
                     <c ca="right">
                        <p>74</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Total</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>1,327</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>1,086</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>1,328</b>
                        </p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <suppl id="S1">
               <title>
                  <p>Additional data file 1</p>
               </title>
               <text>
                  <p>Human TF-encoding genes with more than one ortholog in fugu genome.</p>
               </text>
               <file name="1471-2164-8-441-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Conserved clusters of TF-encoding genes in human, mouse and fugu</p>
            </st>
            <p>TF-encoding genes such as <it>Hox</it>, <it>Iroquois </it>and <it>Dlx </it>genes exist in clusters. The conservation of clustered organization of genes in distantly related genomes is thought to be related to the presence of evolutionarily constrained shared enhancers <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. We identified a total of 149 clusters of human TF-encoding genes containing 407 genes, 95 clusters of mouse TF-encoding genes containing 253 genes, and 115 clusters of fugu TF-encoding genes containing 280 genes (Additional data files <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr> and <supplr sid="S4">4</supplr>). Of these clusters, 18 orthologous clusters comprising 70 TF-encoding genes were found to be conserved in order and orientation in all three genomes (Additional data file <supplr sid="S5">5</supplr>). These evolutionarily conserved TF-encoding gene clusters are likely to be associated with evolutionarily constrained shared enhancers. This set includes instances such as HoxD cluster which has been shown to contain conserved enhancers that regulate several genes in the cluster <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Among these clusters, the identification of CNEs in the four human <it>Hox </it>gene clusters has been previously reported <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>.</p>
            <suppl id="S2">
               <title>
                  <p>Additional data file 2</p>
               </title>
               <text>
                  <p>Clusters of TF-encoding genes in the human genome.</p>
               </text>
               <file name="1471-2164-8-441-S2.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional data file 3</p>
               </title>
               <text>
                  <p>Clusters of TF-encoding genes in the mouse genome.</p>
               </text>
               <file name="1471-2164-8-441-S3.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S4">
               <title>
                  <p>Additional data file 4</p>
               </title>
               <text>
                  <p>Clusters of TF-encoding genes in the fugu genome.</p>
               </text>
               <file name="1471-2164-8-441-S4.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S5">
               <title>
                  <p>Additional data file 5</p>
               </title>
               <text>
                  <p>Conserved clusters of human, mouse and fugu TF-encoding genes. An asterisk indicates a human TF-encoding gene that has no ortholog in fugu but is located in a <it>Hox </it>conserved syntenic block.</p>
               </text>
               <file name="1471-2164-8-441-S5.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Identification of human-mouse and human-fugu CNEs</p>
            </st>
            <p>After alignment of the orthologous human, mouse and fugu TF-encoding gene loci using MLAGAN <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and detection of CNEs using VISTA <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and subsequent filtering steps, of the 816 human-mouse-fugu orthologous TF-encoding genes, 797 genes contain 58,954 human-mouse CNEs, whereas 389 genes contain 2,843 human-fugu CNEs. The human-mouse and human-fugu CNEs represent 5.7% and 0.15% respectively, of noncoding sequences extracted for alignments. The distribution of the lengths of human-mouse and human-fugu CNEs are presented in Additional data file <supplr sid="S6">6</supplr>. The human-mouse CNEs are on average 244 bp long (median is 183 bp; longest is 3.9 kb) and add up to a total length of 14,375 kb. The average length of human-fugu CNEs is 136 bp (median is 106 bp; longest is 1.0 kb) and the total length is 388 kb. Whole-genome comparison of human and mouse has indicated that about 5% of human and mouse sequences are under evolutionary constraint <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, of which 3.5% are noncoding sequences. The "Gumby" algorithm developed by Prabhakar et al. <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> which is based on human-rodent sequence comparisons and enforces significance thresholds, predicts that only 2.2% of noncoding sequences in the human and rodent genomes are likely to be under selection. It is, therefore, likely that a considerable number of the human-mouse CNEs are not under evolutionary constraint but rather sequences that did not have adequate time to diverge. On the other hand, most of the human-fugu CNEs which are conserved over a long evolutionary period (420 Myr) are likely to be under purifying selection. To verify this, we determined if the human-fugu CNEs significantly overlap with experimentally validated transcription factor binding sites. We also computed the density and distribution of human-fugu CNEs among TF-encoding genes. The results of these analyses are presented below.</p>
            <suppl id="S6">
               <title>
                  <p>Additional data file 6</p>
               </title>
               <text>
                  <p>Distribution of lengths of (A) human-mouse and (B) human-fugu CNEs.</p>
               </text>
               <file name="1471-2164-8-441-S6.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Density and number of human-fugu CNEs per gene</p>
            </st>
            <p>Since we used the entire 5' and 3' flanking regions in our alignments, we investigated whether genes with larger flanking regions were associated with more human-fugu CNEs than genes with short flanking regions. To this end, we calculated the density of human-fugu CNEs in human TF-encoding genes. The CNE densities of human genes vary widely from 0.03 to 38 bp per kb of human sequence for the 389 CNE-associated genes (Fig <figr fid="F2">2</figr>). The top 20 human TF-encoding genes with the highest CNE densities are listed in Table <tblr tid="T2">2</tblr>. These genes are characteristically homeodomain-encoding genes such as the genes XP_496843 (<it>Uncx4.1</it>), XP_291716 (<it>HMX3</it>), NP_005510 (<it>HMX2</it>), <it>LMO1 </it>and <it>PBX3</it>. The high CNE densities of these genes indicate that a large proportion of their noncoding regions are under evolutionary constraint. To determine if the compaction of intergenic regions in fugu has affected CNE density, we also measured CNE densities in fugu for every human gene, which we defined as the number of bases residing in CNEs per 100 bp of the longest orthologous fugu gene locus. The CNE density in fugu also ranges widely from 0.02 to 67 bp per 100 bp of fugu sequence (Fig <figr fid="F2">2</figr>). However, 16 of the top 20 genes (human and their fugu orthologs) with the highest human-fugu CNE densities in fugu (Additional data file <supplr sid="S7">7</supplr>) are different from the top 20 genes of highest CNE densities in human (Table <tblr tid="T2">2</tblr>). This indicates that the intergenic regions have not been compacted or expanded uniformly in fugu and human, respectively.</p>
            <suppl id="S7">
               <title>
                  <p>Additional data file 7</p>
               </title>
               <text>
                  <p>Top twenty TF-encoding genes associated with the highest density of human-fugu CNEs. For genes that are part of conserved clusters, we averaged out the number or length of CNEs present in the whole cluster over the number of genes in that cluster. CNE density is defined as the number of bases located in CNEs per 100 bp of non-repetitive noncoding sequence in the longest orthologous fugu gene locus.</p>
               </text>
               <file name="1471-2164-8-441-S7.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Top twenty TF-encoding genes associated with the highest density of human-fugu CNEs. For genes that are part of conserved clusters, we averaged out the number or length of CNEs present in the whole cluster over the number of genes in that cluster. CNE density is defined here as the number of bases located in CNEs per kilobase of non-repetitive noncoding sequence in a gene locus.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Gene ID</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Gene name</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Description</b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>
                           <b>CNE bases per kb of human sequence</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000164853</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>XP_496843.1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>PREDICTED: similar to Uncx4.1</p>
                     </c>
                     <c ca="right">
                        <p>37.61</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000188620</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>XP_291716.5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>PREDICTED: similar to homeodomain protein</p>
                     </c>
                     <c ca="right">
                        <p>31.82</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000188816</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>NP_005510.1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Homeobox (H6 family) 2</p>
                     </c>
                     <c ca="right">
                        <p>31.82</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000166407</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>LMO1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Rhombotin-1</p>
                     </c>
                     <c ca="right">
                        <p>29.17</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000167081</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>PBX3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Pre-B-cell leukemia transcription factor 3</p>
                     </c>
                     <c ca="right">
                        <p>29.16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000130940</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>CASZ1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Probable transcription factor CST</p>
                     </c>
                     <c ca="right">
                        <p>26.43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000139800</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ZIC5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Zinc finger protein ZIC 5</p>
                     </c>
                     <c ca="right">
                        <p>25.94</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000109132</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>PHOX2B</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Paired mesoderm homeobox protein 2B</p>
                     </c>
                     <c ca="right">
                        <p>24.93</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000075891</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>PAX2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Paired box protein Pax-2.</p>
                     </c>
                     <c ca="right">
                        <p>24.27</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000177508</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>IRX3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Iroquois-class homeodomain protein IRX-3</p>
                     </c>
                     <c ca="right">
                        <p>24.04</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000165655</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ZNF503</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>zinc finger protein 503</p>
                     </c>
                     <c ca="right">
                        <p>21.39</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000143032</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>BARHL2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>BarH-like 2 homeobox protein.</p>
                     </c>
                     <c ca="right">
                        <p>21.28</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000171540</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>OTP</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>orthopedia</p>
                     </c>
                     <c ca="right">
                        <p>21.25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000125285</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>SOX21</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Transcription factor SOX-21</p>
                     </c>
                     <c ca="right">
                        <p>19.99</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000159387</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>IRX6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Iroquois-class homeodomain protein IRX-6</p>
                     </c>
                     <c ca="right">
                        <p>19.51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000176842</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>IRX5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Iroquois-class homeodomain protein IRX-5</p>
                     </c>
                     <c ca="right">
                        <p>19.51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000134138</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>MEIS2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Homeobox protein Meis2</p>
                     </c>
                     <c ca="right">
                        <p>18.05</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000128652</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>HOXD3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Homeobox protein Hox-D3</p>
                     </c>
                     <c ca="right">
                        <p>17.34</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000128709</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>HOXD9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Homeobox protein Hox-D9</p>
                     </c>
                     <c ca="right">
                        <p>17.34</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000128710</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>HOXD10</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Homeobox protein Hox-D10</p>
                     </c>
                     <c ca="right">
                        <p>17.34</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Plot of CNE density against the total length of CNEs associated with DNA-binding TF-encoding genes</p>
               </caption>
               <text>
                  <p>Plot of CNE density against the total length of CNEs associated with DNA-binding TF-encoding genes. CNE density is defined as the number of bases located in CNEs per unit length (1 kb in human; 100 bp in fugu) of non-repetitive noncoding sequence in a gene locus.</p>
               </text>
               <graphic file="1471-2164-8-441-2"/>
            </fig>
            <p>The number of CNEs associated with each gene gives an indication of the number of conserved <it>cis</it>-regulatory elements in each of these genes. The top 20 human genes with the highest number and total length of human-fugu CNEs are shown in Table <tblr tid="T3">3</tblr>. The <it>MEIS2 </it>locus contains the highest number and total length of CNEs (79 CNEs spanning 12.0 kb) (Fig <figr fid="F3">3</figr>). In mammals, <it>Meis2 </it>is involved in the differentiation of the forebrain, branchial arches and somitic mesoderm as well as in regulating limb outgrowth <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. The next gene with the most CNEs is <it>IRX3 </it>(68 CNEs spanning 10.9 kb). <it>IRX3 </it>is expressed in certain regions of the midbrain, hindbrain, otic vesicle, spinal cord, first and second branchial arches and proximodorsal regions of the developing limb buds <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The association of an abundance of CNEs with TF-encoding genes such as <it>MEIS2 </it>and <it>IRX3 </it>that show spatially restricted expression patterns in a wide range of tissues is consistent with the notion that these CNEs represent regulatory modules that direct expression to different expression domains. At the other end of the spectrum are the TF-encoding genes that do not contain any human-fugu CNEs. TF-encoding genes that lack CNEs and exhibit ubiquitous expression (e.g., <it>SP1, USF1, E2F1</it>) may contain strong basal promoters and lack tissue-specific enhancers. In the case of CNE-lacking TF-encoding genes that exhibit tissue-specific expression, the divergent regulatory elements in human and fugu may still confer the same expression patterns in human and fugu as demonstrated in the case of human and zebrafish RET gene regulatory elements <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. It is also possible that some of these divergent regulatory elements may have conferred different patterns of expression in human and fugu.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Top twenty TF-encoding genes associated with the highest number and total length of human-fugu CNEs. For genes that are part of conserved clusters, we averaged out the number or length of CNEs present in the whole cluster over the number of genes in that cluster.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Gene ID</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Gene name</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Description</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Number of CNEs</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Length of CNEs (kb)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000134138</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>MEIS2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Homeobox protein Meis2</p>
                     </c>
                     <c ca="right">
                        <p>79</p>
                     </c>
                     <c ca="right">
                        <p>11.98</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000177508</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>IRX3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Iroquois-class homeodomain protein IRX-3</p>
                     </c>
                     <c ca="right">
                        <p>68</p>
                     </c>
                     <c ca="right">
                        <p>10.87</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000169554</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ZFHX1B</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Zinc finger homeobox protein 1b</p>
                     </c>
                     <c ca="right">
                        <p>76</p>
                     </c>
                     <c ca="right">
                        <p>10.25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000143032</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>BARHL2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>BarH-like 2 homeobox protein.</p>
                     </c>
                     <c ca="right">
                        <p>56</p>
                     </c>
                     <c ca="right">
                        <p>10.09</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000169946</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ZFPM2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Zinc finger protein ZFPM2</p>
                     </c>
                     <c ca="right">
                        <p>49</p>
                     </c>
                     <c ca="right">
                        <p>9.90</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000167081</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>PBX3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Pre-B-cell leukemia transcription factor 3</p>
                     </c>
                     <c ca="right">
                        <p>57</p>
                     </c>
                     <c ca="right">
                        <p>9.10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000091656</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ZFHX4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>zinc finger homeodomain 4</p>
                     </c>
                     <c ca="right">
                        <p>77</p>
                     </c>
                     <c ca="right">
                        <p>9.05</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000151514</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>SALL3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Sal-like protein 3</p>
                     </c>
                     <c ca="right">
                        <p>59</p>
                     </c>
                     <c ca="right">
                        <p>8.94</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000170549</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>IRX1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Iroquois-class homeodomain protein IRX-1</p>
                     </c>
                     <c ca="right">
                        <p>48</p>
                     </c>
                     <c ca="right">
                        <p>8.67</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000128573</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>FOXP2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Forkhead box protein P2</p>
                     </c>
                     <c ca="right">
                        <p>57</p>
                     </c>
                     <c ca="right">
                        <p>8.41</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000121297</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ZNF537</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Teashirt homolog 3</p>
                     </c>
                     <c ca="right">
                        <p>59</p>
                     </c>
                     <c ca="right">
                        <p>7.38</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000143995</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>MEIS1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Homeobox protein Meis1.</p>
                     </c>
                     <c ca="right">
                        <p>63</p>
                     </c>
                     <c ca="right">
                        <p>7.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000148737</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>TCF7L2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Transcription factor 7-like 2</p>
                     </c>
                     <c ca="right">
                        <p>42</p>
                     </c>
                     <c ca="right">
                        <p>6.91</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000153234</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>NR4A2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Orphan nuclear receptor NR4A2</p>
                     </c>
                     <c ca="right">
                        <p>38</p>
                     </c>
                     <c ca="right">
                        <p>6.73</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000159387</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>IRX6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Iroquois-class homeodomain protein IRX-6</p>
                     </c>
                     <c ca="right">
                        <p>40.5</p>
                     </c>
                     <c ca="right">
                        <p>6.54</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000176842</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>IRX5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Iroquois-class homeodomain protein IRX-5</p>
                     </c>
                     <c ca="right">
                        <p>40.5</p>
                     </c>
                     <c ca="right">
                        <p>6.54</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000110693</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>SOX6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Transcription factor SOX-6.</p>
                     </c>
                     <c ca="right">
                        <p>47</p>
                     </c>
                     <c ca="right">
                        <p>6.41</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000143013</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>LMO4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>LIM domain transcription factor LMO4</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>5.92</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000164853</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>XP_496843.1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>PREDICTED: similar to Uncx4.1</p>
                     </c>
                     <c ca="right">
                        <p>31</p>
                     </c>
                     <c ca="right">
                        <p>5.74</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ENSG00000075891</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>PAX2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Paired box protein Pax-2.</p>
                     </c>
                     <c ca="right">
                        <p>37</p>
                     </c>
                     <c ca="right">
                        <p>5.68</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Human-fugu and mouse-fugu VISTA alignments of <it>MEIS2 </it>gene locus</p>
               </caption>
               <text>
                  <p>Human-fugu and mouse-fugu VISTA alignments of <it>MEIS2 </it>gene locus. Pink peaks denote conserved noncoding elements (CNEs) and blue peaks denote conserved exonic sequences. <it>MEIS2 </it>locus contains the highest number of CNEs (79 CNEs with a total length of 12.0 kb) among TF-encoding genes in the human, mouse and fugu genomes.</p>
               </text>
               <graphic file="1471-2164-8-441-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Human-fugu CNEs overlap experimentally verified TF binding sites indicating they contain transcriptional regulatory elements</p>
            </st>
            <p>The recent advances in the high-throughput techniques for identifying <it>in vivo </it>binding sites of TFs such as ChIP-on-chip/ChIP-PET have enabled genome-wide mapping of binding sites for TFs such as OCT4, SOX2, NANOG, c-MYC and SUZ12 <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. Although genome-wide ChIP analysis has a certain level of noise and does not give conclusive evidence of transcriptional regulation, it provides a good estimate of putative regulatory elements. If the human-fugu CNEs identified by us represent transcriptional regulatory elements, a significant proportion of them should overlap with experimentally verified TF binding sites. To verify this we searched the human-fugu CNEs against the human genome sequences that were shown to be bound by various TFs ("bound regions").</p>
            <p>The TFs OCT4, SOX2 and NANOG are crucial for the maintenance of pluripotency and self-renewal in embryonic stem (ES) cells and are necessary for propagation of undifferentiated ES cells in culture <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. These factors activate the transcription of their own genes and genes of major signaling pathways, but repress the transcription of some developmental genes <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Sequences bound by these TFs within promoter regions of genes in human ES cells were taken from Boyer et al. <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. A significant proportion of human-fugu CNEs were found to overlap bound regions of OCT4 (0.63% of human-fugu CNEs; <it>P </it>= 1.82 &#215; 10<sup>-17</sup>), SOX2 (0.84% of human-fugu CNEs; <it>P </it>= 8.83 &#215; 10<sup>-18</sup>) and NANOG (1.27% of human-fugu CNEs; <it>P </it>= 3.44 &#215; 10<sup>-26</sup>). The transcription factor c-MYC plays important roles in regulating cell growth, cell proliferation, cell cycle and apoptosis <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. The genomic bound regions of c-MYC in human B cells were recently identified using ChIP-PET experiments <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. In total, about 4,300 high-confidence bound regions of c-MYC were identified to be associated with 668 target genes, of which 48 encode TFs <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. We found that 0.49% of the human-fugu CNEs overlap c-MYC bound regions, constituting a significant enrichment of c-MYC binding sites in CNEs (<it>P </it>= 4.44 &#215; 10<sup>-5</sup>). SUZ12 is a subunit of the polycomb repressive complex 2 which is involved in silencing of genes through the epigenetic modification of chromatin structure <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. A significant proportion of human-fugu CNEs (10.45%; <it>P </it>= 1.53 &#215; 10<sup>-316</sup>) were found to overlap SUZ12 bound regions in human ES cells <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. The significant association of human-fugu CNEs with experimentally verified TF binding sites of both transcriptional activators and repressors, indicates that a significant proportion of CNEs are functional transcriptional enhancers and silencers.</p>
         </sec>
         <sec>
            <st>
               <p>TF-encoding genes associated with CNEs tend to be involved in nervous system development</p>
            </st>
            <p>To determine the relationship between human-fugu CNEs and biological functions of TF-encoding genes, the Gene Ontology (GO) terms describing the biological processes of CNE-associated TF-encoding genes (385 genes with Gene Ontology annotation) were compared with that of all human TF orthologs (804 genes with Gene Ontology annotation) using GOstat <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. TF-encoding genes associated with CNEs were found to be significantly enriched for genes involved mainly in development, and in particular development of the nervous system (<it>P </it>&lt; 0.01) (Additional data file <supplr sid="S8">8</supplr>). In contrast, TF-encoding genes that are associated with CNEs are depleted of genes involved in protein metabolism and biopolymer modification (<it>P </it>&lt; 0.01) (Additional data file <supplr sid="S8">8</supplr>; <it>P</it>-values marked with a negative sign). The significant involvement of CNE-associated TF-encoding genes in nervous system development is consistent with previous studies that highlight the association of highly conserved noncoding regions with developmental genes <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B8">8</abbr><abbr bid="B10">10</abbr></abbrgrp>.</p>
            <suppl id="S8">
               <title>
                  <p>Additional data file 8</p>
               </title>
               <text>
                  <p>Significantly over-represented and under-represented Gene Ontology terms (<it>P </it>&lt; 0.01) of CNE-associated human TF-encoding genes. Group A denotes 385 of 389 CNE-associated TF-encoding genes with Gene Ontology annotation, while Group B denotes 804 of 816 orthologous TF-encoding genes with Gene Ontology annotation. <it>P</it>-values marked with a negative sign denote significant depletion. The analysis was carried out using GOstat (Beissbarth and Speed 2004).</p>
               </text>
               <file name="1471-2164-8-441-S8.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Potential utility of the TFCONES database</p>
            </st>
            <p>We have created a database called TFCONES (<ul>T</ul>ranscription <ul>F</ul>actor Genes &amp; Associated <ul>CO</ul>nserved <ul>N</ul>oncoding <ul>E</ul>lement<ul>S</ul>) that contains information about the list of TF-encoding genes in human, mouse and fugu genomes, their orthologous relationships, and details of the CNEs associated with them (Figs <figr fid="F4">4</figr>, <figr fid="F5">5</figr> and <figr fid="F6">6</figr>). This set of human-mouse-fugu orthologous genes and their associated CNEs should be useful to researchers investigating the regulation and function of genes in mammals as well as in fishes. For every CNE, the data presented include the VISTA alignment, length, percentage identity, genomic location and the genes flanking the CNE. The users may search for specific human, mouse or fugu TF-encoding genes and retrieve the sequences of the human-mouse and human-fugu CNEs. The CNEs can be visualized in relation to the nearest TF-encoding gene using the GBrowse feature (Fig <figr fid="F7">7</figr>; <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>). The CNE database can also be BLAST-searched. To assist those interested in testing the function of potential TF binding sites within the human-fugu CNEs, we have predicted TF binding sites in the human-fugu CNEs using TESS <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. While the human-fugu CNEs would be useful for scientists investigating evolutionarily conserved <it>cis</it>-regulatory elements that are likely to be shared by all vertebrates, the human-mouse CNEs should be of interest to researchers investigating <it>cis</it>-regulatory elements specific to mammals. In addition, TF-encoding genes that differentially express in certain tissues may be studied in greater detail to explore the gene regulatory networks active in those tissues. For instance, the CNEs that are associated with TF-encoding genes expressed in the central nervous system may be crucial in mediating expression precisely to distinct populations of neuronal and glial cells. These CNEs would be useful tools for marking cell-types through driving expression of reporter genes and will also act as useful reagents for targeting expression of therapeutic proteins to specific cell-types.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>A keyword search of genes in the TFCONES database</p>
               </caption>
               <text>
                  <p>A keyword search of genes in the TFCONES database.</p>
               </text>
               <graphic file="1471-2164-8-441-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Gene record for forkhead transcription factor gene <it>FOXA2</it></p>
               </caption>
               <text>
                  <p>Gene record for forkhead transcription factor gene <it>FOXA2</it>.</p>
               </text>
               <graphic file="1471-2164-8-441-5"/>
            </fig>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>List of human-fugu CNEs associated with <it>FOXA2</it></p>
               </caption>
               <text>
                  <p>List of human-fugu CNEs associated with <it>FOXA2</it>.</p>
               </text>
               <graphic file="1471-2164-8-441-6"/>
            </fig>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>An image of the location of CNEs relative to the associated TF-encoding gene <it>FOXA2</it></p>
               </caption>
               <text>
                  <p>An image of the location of CNEs relative to the associated TF-encoding gene <it>FOXA2</it>.</p>
               </text>
               <graphic file="1471-2164-8-441-7"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>TFs are the most crucial nodes in the gene regulatory networks that underlie the developmental plan and physiological systems of vertebrates. The evolutionarily conserved noncoding elements associated with TF-encoding genes in the TFCONES database represent a highly prioritized set of <it>cis</it>-regulatory sequences. These elements and the transcription factor binding sites identified in them should be useful for understanding the regulation of spatially restricted expression patterns of TF-encoding genes.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and Requirements</p>
         </st>
         <p>The TFCONES (<ul>T</ul>ranscription <ul>F</ul>actor Genes &amp; Associated <ul>CO</ul>nserved <ul>N</ul>oncoding <ul>E</ul>lement<ul>S</ul>) database is freely accessible to academic and non-academic users at <url>http://tfcones.fugu-sg.org</url>.</p>
      </sec>
      <sec>
         <st>
            <p>List of abbreviations</p>
         </st>
         <p>CNE, conserved noncoding element; GO, Gene Ontology; TF, transcription factor</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>BV and SB conceived and designed the project. APL and YY conducted the data analysis. APL and BV interpreted the data. APL built the TFCONES database. APL and BV wrote the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This project was supported by the Agency for Science, Technology, and Research (A*STAR), Singapore. We thank Mohamad Hirwan M S for the INPARANOID analysis. A.P.L. is supported by the A*STAR Graduate Scholarship. B.V. is an adjunct staff of the Department of Pediatrics, Yong Loo Lin School of Medicine, National University of Singapore.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>An ORFeome-based analysis of human transcription factor genes and the construction of a microarray to interrogate their expression</p>
            </title>
            <aug>
               <au>
                  <snm>Messina</snm>
                  <fnm>DN</fnm>
               </au>
               <au>
                  <snm>Glasscock</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lovett</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>2041</fpage>
            <lpage>2047</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">528918</pubid>
                  <pubid idtype="pmpid" link="fulltext">15489324</pubid>
                  <pubid idtype="doi">10.1101/gr.2584104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Long-range downstream enhancers are essential for Pax6 expression</p>
            </title>
            <aug>
               <au>
                  <snm>Kleinjan</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Seawright</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mella</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Carr</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Tyas</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>van</snm>
                  <fnm>HV</fnm>
               </au>
            </aug>
            <source>Dev Biol</source>
            <pubdate>2006</pubdate>
            <volume>299</volume>
            <fpage>563</fpage>
            <lpage>581</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ydbio.2006.08.060</pubid>
                  <pubid idtype="pmpid" link="fulltext">17014839</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Construction and analysis of a sequence-ready map in 4q25: Rieger syndrome can be caused by haploinsufficiency of RIEG, but also by chromosome breaks approximately 90 kb upstream of this gene</p>
            </title>
            <aug>
               <au>
                  <snm>Flomen</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Vatcheva</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gorman</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Baptista</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Groet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barisic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ligutic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Nizetic</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>1998</pubdate>
            <volume>47</volume>
            <fpage>409</fpage>
            <lpage>413</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/geno.1997.5127</pubid>
                  <pubid idtype="pmpid" link="fulltext">9480756</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Analysis of two translocation breakpoints and identification of a negative regulatory element in patients with Rieger's syndrome</p>
            </title>
            <aug>
               <au>
                  <snm>Trembath</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Semina</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Patil</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Qian</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Amendt</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Russo</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Birth Defects Res A Clin Mol Teratol</source>
            <pubdate>2004</pubdate>
            <volume>70</volume>
            <fpage>82</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bdra.10154</pubid>
                  <pubid idtype="pmpid" link="fulltext">14991915</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Long-range upstream and downstream enhancers control distinct subsets of the complex spatiotemporal Sox9 expression pattern</p>
            </title>
            <aug>
               <au>
                  <snm>Bagheri-Fam</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Barrionuevo</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Dohrmann</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Gunther</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schule</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kemler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mallo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kanzler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Dev Biol</source>
            <pubdate>2006</pubdate>
            <volume>291</volume>
            <fpage>382</fpage>
            <lpage>397</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ydbio.2005.11.013</pubid>
                  <pubid idtype="pmpid" link="fulltext">16458883</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Ultraconserved elements in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pheasant</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Makunin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Stephen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Mattick</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <fpage>1321</fpage>
            <lpage>1325</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1098119</pubid>
                  <pubid idtype="pmpid" link="fulltext">15131266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>In vivo enhancer analysis of human conserved non-coding sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Ahituv</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Moses</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Shoukry</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Minovitsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>KD</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>444</volume>
            <fpage>499</fpage>
            <lpage>502</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05295</pubid>
                  <pubid idtype="pmpid" link="fulltext">17086198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Arrays of ultraconserved non-coding regions span the loci of key developmental genes in vertebrate genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bruce</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Engstrom</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Klos</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Ericson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>99</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">544600</pubid>
                  <pubid idtype="pmpid" link="fulltext">15613238</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-5-99</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Human-zebrafish non-coding conserved elements act in vivo to regulate transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Shin</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Priest</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ronco</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Burns</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>MacRae</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>5437</fpage>
            <lpage>5445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1236720</pubid>
                  <pubid idtype="pmpid" link="fulltext">16179648</pubid>
                  <pubid idtype="doi">10.1093/nar/gki853</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Highly conserved non-coding sequences are associated with vertebrate development</p>
            </title>
            <aug>
               <au>
                  <snm>Woolfe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goodson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goode</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Snell</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McEwen</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Vavouri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>North</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Callaway</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">526512</pubid>
                  <pubid idtype="pmpid" link="fulltext">15630479</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Shuffling of cis-regulatory elements is a pervasive feature of the vertebrate lineage</p>
            </title>
            <aug>
               <au>
                  <snm>Sanges</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kalmar</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Claudiani</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>D'Amato</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Stupka</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>R56</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1779573</pubid>
                  <pubid idtype="pmpid" link="fulltext">16859531</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-7-r56</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>AVID: A global alignment program</p>
            </title>
            <aug>
               <au>
                  <snm>Bray</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pachter</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>97</fpage>
            <lpage>102</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">430967</pubid>
                  <pubid idtype="pmpid" link="fulltext">12529311</pubid>
                  <pubid idtype="doi">10.1101/gr.789803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Cross-species sequence comparisons: a review of methods and available resources</p>
            </title>
            <aug>
               <au>
                  <snm>Frazer</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Elnitski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hardison</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>1</fpage>
            <lpage>12</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">430969</pubid>
                  <pubid idtype="pmpid" link="fulltext">12529301</pubid>
                  <pubid idtype="doi">10.1101/gr.222003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>cisRED: a database system for genome-scale computational discovery of regulatory elements</p>
            </title>
            <aug>
               <au>
                  <snm>Robertson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bilenky</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yuen</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Dagpinar</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Varhol</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Teague</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Griffith</snm>
                  <fnm>OL</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D68</fpage>
            <lpage>D73</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347438</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381958</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj075</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>ConSite: web-based prediction of regulatory elements using cross-species comparison</p>
            </title>
            <aug>
               <au>
                  <snm>Sandelin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>W249</fpage>
            <lpage>W252</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441510</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215389</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh372</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Highly conserved upstream sequences for transcription factor genes and implications for the regulatory network</p>
            </title>
            <aug>
               <au>
                  <snm>Iwama</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>17156</fpage>
            <lpage>17161</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">534610</pubid>
                  <pubid idtype="pmpid" link="fulltext">15572454</pubid>
                  <pubid idtype="doi">10.1073/pnas.0407670101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Comparative promoter region analysis powered by CORG</p>
            </title>
            <aug>
               <au>
                  <snm>Dieterich</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Grossmann</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tanzer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ropcke</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Arndt</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>24</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">555765</pubid>
                  <pubid idtype="pmpid" link="fulltext">15723697</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-6-24</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Scanning human gene deserts for long-range enhancers</p>
            </title>
            <aug>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Afzal</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>413</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1088328</pubid>
                  <pubid idtype="pmpid" link="fulltext">14563999</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>CONDOR: a database resource of developmentally associated conserved non-coding elements</p>
            </title>
            <aug>
               <au>
                  <snm>Woolfe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goode</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Callaway</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Snell</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McEwen</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Elgar</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>BMC Dev Biol</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>100</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2020477</pubid>
                  <pubid idtype="pmpid" link="fulltext">17760977</pubid>
                  <pubid idtype="doi">10.1186/1471-213X-7-100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Ensembl Genome Browser</p>
            </title>
            <url>http://www.ensembl.org</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Automatic clustering of orthologs and in-paralogs from pairwise species comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Remm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Storm</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>314</volume>
            <fpage>1041</fpage>
            <lpage>1052</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.5197</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Fugu genome analysis provides evidence for a whole-genome duplication early during the evolution of ray-finned fishes</p>
            </title>
            <aug>
               <au>
                  <snm>Christoffels</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Koh</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Chia</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Aparicio</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Venkatesh</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>1146</fpage>
            <lpage>1151</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh114</pubid>
                  <pubid idtype="pmpid" link="fulltext">15014147</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>PHYLIP (Phylogeny Inference Package) version 3.6. (3.6)</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <publisher>Department of Genome Sciences, University of Washington, Seattle</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B24">
            <title>
               <p>RepeatMasker version open-3.1.5</p>
            </title>
            <url>http://www.repeatmasker.org</url>
         </bibl>
         <bibl id="B25">
            <title>
               <p>LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Brudno</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Do</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Davydov</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Sidow</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>721</fpage>
            <lpage>731</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">430158</pubid>
                  <pubid idtype="pmpid" link="fulltext">12654723</pubid>
                  <pubid idtype="doi">10.1101/gr.926603</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Analysis of multiple genomic sequence alignments: a web resource, online tools, and lessons learned from analysis of mammalian SCL loci</p>
            </title>
            <aug>
               <au>
                  <snm>Chapman</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Donaldson</snm>
                  <fnm>IJ</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grafham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Gottgens</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>313</fpage>
            <lpage>318</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">327107</pubid>
                  <pubid idtype="pmpid" link="fulltext">14718377</pubid>
                  <pubid idtype="doi">10.1101/gr.1759004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>VISTA: computational tools for comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Frazer</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Pachter</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Poliakov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>W273</fpage>
            <lpage>W279</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441596</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215394</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh458</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Loots</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Locksley</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Blankespoor</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>ZE</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Frazer</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>288</volume>
            <fpage>136</fpage>
            <lpage>140</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.288.5463.136</pubid>
                  <pubid idtype="pmpid" link="fulltext">10753117</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>GOstat: find statistically overrepresented Gene Ontologies within a group of genes</p>
            </title>
            <aug>
               <au>
                  <snm>Beissbarth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>1464</fpage>
            <lpage>1465</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth088</pubid>
                  <pubid idtype="pmpid" link="fulltext">14962934</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Controlling the false discovery rate: a practical and powerful approach to multiple testing</p>
            </title>
            <aug>
               <au>
                  <snm>Benjamini</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hochberg</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society Series B Methodological</source>
            <pubdate>1995</pubdate>
            <volume>57</volume>
            <fpage>289</fpage>
            <lpage>300</lpage>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Core transcriptional regulatory circuitry in human embryonic stem cells</p>
            </title>
            <aug>
               <au>
                  <snm>Boyer</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Johnstone</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Zucker</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Guenther</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Jenner</snm>
                  <fnm>RG</fnm>
               </au>
               <etal/>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>122</volume>
            <fpage>947</fpage>
            <lpage>956</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.08.020</pubid>
                  <pubid idtype="pmpid" link="fulltext">16153702</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Global mapping of c-Myc binding sites and target gene networks in human B cells</p>
            </title>
            <aug>
               <au>
                  <snm>Zeller</snm>
                  <fnm>KI</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yustein</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Ooi</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Orlov</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Shahab</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yong</snm>
                  <fnm>HC</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>17834</fpage>
            <lpage>17839</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1635161</pubid>
                  <pubid idtype="pmpid" link="fulltext">17093053</pubid>
                  <pubid idtype="doi">10.1073/pnas.0604129103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Control of developmental regulators by Polycomb in human embryonic stem cells</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Jenner</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Boyer</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Guenther</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Chevalier</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Johnstone</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Isono</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <fpage>301</fpage>
            <lpage>313</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.02.043</pubid>
                  <pubid idtype="pmpid" link="fulltext">16630818</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Transcription Element Search System (TESS)</p>
            </title>
            <url>http://www.cbil.upenn.edu/tess</url>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The Generic Genome Browser</p>
            </title>
            <url>http://www.gmod.org/ggb</url>
         </bibl>
         <bibl id="B36">
            <title>
               <p>NCBI WWW Blast web server</p>
            </title>
            <url>ftp://ftp.ncbi.nih.gov/blast/executables/</url>
         </bibl>
         <bibl id="B37">
            <title>
               <p>A global control region defines a chromosomal regulatory landscape containing the HoxD cluster</p>
            </title>
            <aug>
               <au>
                  <snm>Spitz</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gonzalez</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Duboule</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2003</pubdate>
            <volume>113</volume>
            <fpage>405</fpage>
            <lpage>417</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(03)00310-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">12732147</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>A functional survey of the enhancer activity of conserved non-coding sequences from vertebrate Iroquois cluster gene deserts</p>
            </title>
            <aug>
               <au>
                  <snm>de la Calle-Mustienes</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Feijoo</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Manzanares</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tena</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Seguel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Letizia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Allende</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Gomez-Skarmeta</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1061</fpage>
            <lpage>1072</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182218</pubid>
                  <pubid idtype="pmpid" link="fulltext">16024824</pubid>
                  <pubid idtype="doi">10.1101/gr.4004805</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Highly conserved regulatory elements around the SHH gene may contribute to the maintenance of conserved synteny across human chromosome 7q36.3</p>
            </title>
            <aug>
               <au>
                  <snm>Goode</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Snell</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Elgar</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2005</pubdate>
            <volume>86</volume>
            <fpage>172</fpage>
            <lpage>181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ygeno.2005.04.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">15939571</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Highly conserved syntenic blocks at the vertebrate Hox loci and conserved regulatory elements within and outside Hox gene clusters</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Koh</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Tay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Venkatesh</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>6994</fpage>
            <lpage>6999</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1459007</pubid>
                  <pubid idtype="pmpid" link="fulltext">16636282</pubid>
                  <pubid idtype="doi">10.1073/pnas.0601492103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Initial sequencing and comparative analysis of the mouse genome</p>
            </title>
            <aug>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Agarwal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Agarwala</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ainscough</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alexandersson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <fpage>520</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01262</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466850</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Close sequence comparisons are sufficient to identify human cis-regulatory elements</p>
            </title>
            <aug>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Poulin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Shoukry</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Afzal</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Couronne</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>855</fpage>
            <lpage>863</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1484452</pubid>
                  <pubid idtype="pmpid" link="fulltext">16769978</pubid>
                  <pubid idtype="doi">10.1101/gr.4717506</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Expression of Meis2, a Knotted-related murine homeobox gene, indicates a role in the differentiation of the forebrain and the somitic mesoderm</p>
            </title>
            <aug>
               <au>
                  <snm>Cecconi</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Proetzel</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Alvarez-Bolado</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Jay</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gruss</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Dev Dyn</source>
            <pubdate>1997</pubdate>
            <volume>210</volume>
            <fpage>184</fpage>
            <lpage>190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1097-0177(199710)210:2&lt;184::AID-AJA10>3.0.CO;2-E</pubid>
                  <pubid idtype="pmpid" link="fulltext">9337138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Control of vertebrate limb outgrowth by the proximal factor Meis2 and distal antagonism of BMPs by Gremlin</p>
            </title>
            <aug>
               <au>
                  <snm>Capdevila</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tsukui</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rodriquez</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Zappavigna</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Izpisua Belmonte</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>1999</pubdate>
            <volume>4</volume>
            <fpage>839</fpage>
            <lpage>849</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1097-2765(00)80393-7</pubid>
                  <pubid idtype="pmpid">10619030</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Identification of the vertebrate Iroquois homeobox gene family with overlapping expression during early development of the nervous system</p>
            </title>
            <aug>
               <au>
                  <snm>Bosse</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zulch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Torres</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gomez-Skarmeta</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Modolell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gruss</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Mech Dev</source>
            <pubdate>1997</pubdate>
            <volume>69</volume>
            <fpage>169</fpage>
            <lpage>181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0925-4773(97)00165-2</pubid>
                  <pubid idtype="pmpid">9486539</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Conservation of RET regulatory function from human to zebrafish without sequence similarity</p>
            </title>
            <aug>
               <au>
                  <snm>Fisher</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grice</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Vinton</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Bessling</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>McCallion</snm>
                  <fnm>AS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>312</volume>
            <fpage>276</fpage>
            <lpage>279</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1124070</pubid>
                  <pubid idtype="pmpid" link="fulltext">16556802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Transcriptional regulation and transformation by Myc proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Adhikary</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Eilers</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>635</fpage>
            <lpage>645</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1703</pubid>
                  <pubid idtype="pmpid" link="fulltext">16064138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
