<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-399</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Comparative analysis of function and interaction of transcription factors in nematodes: Extensive conservation of orthology coupled to rapid sequence evolution</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Haerty</snm>
               <fnm>Wilfried</fnm>
               <insr iid="I1"/>
               <email>haertyw@mcmaster.ca</email>
            </au>
            <au id="A2">
               <snm>Artieri</snm>
               <fnm>Carlo</fnm>
               <insr iid="I1"/>
               <email>artiercg@mcmaster.ca</email>
            </au>
            <au id="A3">
               <snm>Khezri</snm>
               <fnm>Navid</fnm>
               <insr iid="I1"/>
               <email>khezrn@mcmaster.ca</email>
            </au>
            <au id="A4">
               <snm>Singh</snm>
               <mi>S</mi>
               <fnm>Rama</fnm>
               <insr iid="I1"/>
               <email>singh@mcmaster.ca</email>
            </au>
            <au ca="yes" id="A5">
               <snm>Gupta</snm>
               <mi>P</mi>
               <fnm>Bhagwati</fnm>
               <insr iid="I1"/>
               <email>guptab@mcmaster.ca</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biology, McMaster University, Hamilton, ON L8S 4K1, Canada</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>399</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/399</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18752680</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-399</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>18</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>27</day>
               <month>8</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>27</day>
               <month>8</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Haerty et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Much of the morphological diversity in eukaryotes results from differential regulation of gene expression in which transcription factors (TFs) play a central role. The nematode <it>Caenorhabditis elegans </it>is an established model organism for the study of the roles of TFs in controlling the spatiotemporal pattern of gene expression. Using the fully sequenced genomes of three <it>Caenorhabditid </it>nematode species as well as genome information from additional more distantly related organisms (fruit fly, mouse, and human) we sought to identify orthologous TFs and characterized their patterns of evolution.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We identified 988 TF genes in <it>C. elegans</it>, and inferred corresponding sets in <it>C. briggsae </it>and <it>C. remanei</it>, containing 995 and 1093 TF genes, respectively. Analysis of the three gene sets revealed 652 3-way reciprocal 'best hit' orthologs (nematode TF set), approximately half of which are zinc finger (ZF-C2H2 and ZF-C4/NHR types) and HOX family members. Examination of the TF genes in <it>C. elegans </it>and <it>C. briggsae </it>identified the presence of significant tandem clustering on chromosome V, the majority of which belong to ZF-C4/NHR family. We also found evidence for lineage-specific duplications and rapid evolution of many of the TF genes in the two species. A search of the TFs conserved among nematodes in <it>Drosophila melanogaster</it>, <it>Mus musculus </it>and <it>Homo sapiens </it>revealed 150 reciprocal orthologs, many of which are associated with important biological processes and human diseases. Finally, a comparison of the sequence, gene interactions and function indicates that nematode TFs conserved across phyla exhibit significantly more interactions and are enriched in genes with annotated mutant phenotypes compared to those that lack orthologs in other species.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our study represents the first comprehensive genome-wide analysis of TFs across three nematode species and other organisms. The findings indicate substantial conservation of transcription factors even across distant evolutionary lineages and form the basis for future experiments to examine TF gene function in nematodes and other divergent phyla.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="endnote" subtype="user_supplied_xml" type="bmc"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The growing availability of the whole-genome sequences of eukaryotes has accelerated large-scale functional studies to understand the mechanisms of animal development and evolution <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Many of these studies have highlighted the importance of regulatory evolution and the fundamental role that transcription factors (TFs) play in this process. Alterations in TF function and regulation are linked to phenotypic variation <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp> as well as numerous pathologies, including cancers <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. Therefore, a detailed analysis of sequence and function of TFs across animal phyla will provide important information about their evolutionary patterns, thereby increasing our ability to understand the molecular basis of diseases and organismal complexity. The nematode <it>Caenorhabditis elegans </it>serves as a powerful model organism to unravel TF function due to the wealth of available resources and the ease with which it can be reared, maintained, and manipulated in the laboratory <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The completion of its genome sequence has aided in the design of large-scale experiments that are beginning to elucidate the complexity of transcriptional regulation and gene interaction networks in multicelllular eukaryotes <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. The recent releases of the genome sequence of two other <it>Caenorhabditid </it>species, <it>C. briggsae </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and <it>C. remanei </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, provide an excellent opportunity for genome-wide study of the conservation and evolution of transcription factors across nematodes. These three species are estimated to have shared a common ancestor between 20&#8211;120 million years ago <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp> and while they are morphologically similar, studies have shown differences in development and behavior <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
         <p>As a first step in facilitating the comparative study of TFs in nematodes, we have compiled an updated list of putative TF genes in <it>C. elegans </it>and used it to identify orthologs in <it>C. briggsae </it>and <it>C. remanei</it>. Our results show that two-thirds of all <it>C. elegans </it>TF genes have 3-way one-to-one best reciprocal orthologs in the other two species, whereas the remaining third are either species-specific paralogs or too divergent to assign proper orthologous relationships. We observed that among <it>Caenorhabditid </it>species, although TF genes have a greater sequence divergence than the non-TF genes, they exhibit significantly more detectable interspecific orthologs than non-TF genes. We also identified 150 best reciprocal orthologs of the TF genes conserved among nematodes in fruit fly (<it>Drosophila melanogaster</it>), mouse (<it>Mus musculus</it>), and human (<it>Homo sapiens</it>) many of which are associated with known disorders. We also examined the relationship between gene function and interactions, the results of which demonstrate that conserved TF genes exhibit a significantly greater number of interactions and are more likely to be associated with mutant phenotypes when compared to those that lack detectable orthologs. Our findings provide a framework for future studies of nematode TFs and facilitate the development of resources allowing us to study morphological and developmental diversity in metazoans.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>The <it>C. elegans </it>TF gene set</p>
            </st>
            <p>As a first step in the identification of TFs in <it>Caenorhabditid </it>species, we generated an updated list of putative <it>C. elegans </it>TF genes by searching its annotated genome sequence (Wormbase WS173 release) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> for gene ontology (GO) terms associated with transcription factors. This led to the identification of 1271 putative TF genes (Table <tblr tid="T1">1</tblr>). Since our criteria for selecting a TF was the presence of a well-defined DNA binding domain that selectively modulates gene transcription (for example, bHLH or homeobox), we manually inspected the above list of putative TFs. This allowed us to reject 564 genes as false positives since these encode factors that are associated with the basal transcriptional apparatus (for example, DNA polymerases), chromatin alterations, DNA packaging (histones), as well as entries that were incorrectly curated in Wormbase (Additional files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr>). To the remaining genes (707), we added 281 TF encoding genes found in published literature and other public database entries that were not identified in our initial search (See Materials and Methods). The final <it>C. elegans </it>TF set included a total of 988 genes (Table <tblr tid="T2">2</tblr> and additional file <supplr sid="S4">4</supplr>), of which 917 are shared with the previously annotated <it>C. elegans </it>TF set (wTF2.0, 934 genes) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. The 17 genes in the wTF2.0 set that are not shared in our updated set either lack a known DNA binding domain or are annotated as pseudogenes (Additional file <supplr sid="S5">5</supplr>). The increased number of genes in the present TF set likely results from the availability of annotations published since the compilation of wTF2.0.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>List of 314 incorrect entries (non-TFs) in <it>C. elegans</it>.</p>
               </text>
               <file name="1471-2164-9-399-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>List of 167 genes that encode chromatin remodeling, general transcription and DNA/RNA binding factors in <it>C. elegans</it>.</p>
               </text>
               <file name="1471-2164-9-399-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>List of 83 histone-encoding genes in <it>C. elegans</it>.</p>
               </text>
               <file name="1471-2164-9-399-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p>List of 988 TF genes in <it>C. elegans</it>.</p>
               </text>
               <file name="1471-2164-9-399-S4.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p>List of 17 false positives in wTF2.0.</p>
               </text>
               <file name="1471-2164-9-399-S5.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>GO term-based searches of TF genes in <it>C. elegans</it>.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>GO ID</p>
                     </c>
                     <c ca="left">
                        <p>Term</p>
                     </c>
                     <c ca="center">
                        <p>TF genes</p>
                     </c>
                     <c ca="center">
                        <p>Unique</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0003700</p>
                     </c>
                     <c ca="left">
                        <p>Transcription Factor activity</p>
                     </c>
                     <c ca="center">
                        <p>614</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0043565</p>
                     </c>
                     <c ca="left">
                        <p>DNA binding, sequence specific</p>
                     </c>
                     <c ca="center">
                        <p>515</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0003677</p>
                     </c>
                     <c ca="left">
                        <p>DNA binding</p>
                     </c>
                     <c ca="center">
                        <p>858</p>
                     </c>
                     <c ca="center">
                        <p>352</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0030528</p>
                     </c>
                     <c ca="left">
                        <p>Transcription regulator activity</p>
                     </c>
                     <c ca="center">
                        <p>75</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0006355</p>
                     </c>
                     <c ca="left">
                        <p>Regulation of transcription, DNA dependent</p>
                     </c>
                     <c ca="center">
                        <p>768</p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0045449</p>
                     </c>
                     <c ca="left">
                        <p>Regulation of transcription</p>
                     </c>
                     <c ca="center">
                        <p>199</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0000122</p>
                     </c>
                     <c ca="left">
                        <p>Negative regulation of transcription from RNA pol II promoter</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>004544</p>
                     </c>
                     <c ca="left">
                        <p>Positive regulation of transcription from RNA pol II promoter</p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The table lists numbers of all TF genes as well as those uniquely identified by each of the terms.</p>
               </tblfn>
            </tbl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>The breakdowns of TF genes in each of the nematode species genomes based on various search categories.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Search method</b>
                        </p>
                     </c>
                     <c ca="center" cspan="3">
                        <p>
                           <b>Number of TF genes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>C. elegans</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>C. briggsae</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>C. remanei</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GO term-based</p>
                     </c>
                     <c ca="center">
                        <p>707</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Orthologs (InParanoid and reciprocal BLAST)</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                     <c ca="center">
                        <p>713</p>
                     </c>
                     <c ca="center">
                        <p>703</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Manual curation</p>
                     </c>
                     <c ca="center">
                        <p>281</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HMM alignments</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                     <c ca="center">
                        <p>282</p>
                     </c>
                     <c ca="center">
                        <p>390</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>TOTAL:</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>988</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>995</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>1093</b>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>NA: not applicable, ND: not done.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Identification of transcription factors in nematodes and other phyla</p>
            </st>
            <p>We used the newly defined <it>C. elegans </it>TF set to search for homologs in the fully sequenced genomes of <it>C. briggsae </it>(CB3 release) and <it>C. remanei </it>(11/29/2005 release) <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. We used InParanoid <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> to identify 713 and 703 best reciprocal hit orthologs in <it>C. briggsae </it>and <it>C. remanei</it>, respectively (Table <tblr tid="T2">2</tblr>, see Material and Methods). To these lists, we added 282 <it>C. briggsae </it>and 390 <it>C. remanei </it>putative TF genes that were identified through Hidden Markov Model (HMM)-based searches <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Altogether, a total of 995 and 1093 potential TF encoding genes were identified in <it>C. briggsae </it>and <it>C. remanei</it>, respectively (Table <tblr tid="T2">2</tblr> and additional files <supplr sid="S6">6</supplr> and <supplr sid="S7">7</supplr>). Among the TF orthologs in the three nematode species, we identified 652 genes that exhibit a 3-way best reciprocal BLAST orthologous relationship (hereafter referred to as the nematode TF set) (Figure <figr fid="F1">1</figr>). The proportion of <it>C. elegans </it>TF genes with detectable orthologs in <it>C. briggsae </it>(713/995, 71.7%) is significantly higher compared to the proportion of all conserved genes between the two species (12858/20621, 62.4%) <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> (&#967;<sup>2 </sup>= 7.56, df = 1, <it>p </it>= 6.0 &#215; 10<sup>-3</sup>), which may indicate strong selective pressure to maintain these genes.</p>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p>List of 995 TF genes in <it>C. briggsae</it>.</p>
               </text>
               <file name="1471-2164-9-399-S6.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p>List of 1093 TF genes in <it>C. remanei</it>.</p>
               </text>
               <file name="1471-2164-9-399-S7.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>TF-encoding genes in <it>C. elegans</it>, <it>C. briggsae </it>and <it>C. remanei</it></p>
               </caption>
               <text>
                  <p><b>TF-encoding genes in <it>C. elegans</it>, <it>C. briggsae </it>and <it>C. remanei</it>.</b> The total number of TF genes in each of the species is given inside the brackets. The numbers of divergent TF genes and those conserved among the three nematode species are shown along the vertices and inside of the triangle, respectively.</p>
               </text>
               <graphic file="1471-2164-9-399-1"/>
            </fig>
            <p>To examine the evolutionary conservation of the nematode TF set of genes in other phyla, we searched for their orthologs in the genomes of fruit fly(<it>D. melanogaster</it>), mouse (<it>M. musculus</it>), and human (<it>H. sapiens</it>). Using the InParanoid database <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> we identified a total of 150 TFs that exhibit reciprocal orthologous relationships between three nematode species and are conserved in fly, mouse, and human (Additional files <supplr sid="S4">4</supplr> and <supplr sid="S8">8</supplr>).</p>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p>List of 150 TF orthologs in <it>Drosophila melanogaster</it>, <it>M. musculus</it>, and <it>H. sapiens</it>.</p>
               </text>
               <file name="1471-2164-9-399-S8.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Coding sequence divergence in nematode TF genes</p>
            </st>
            <p>Best-hit reciprocal orthologs could not be identified for 215 TF genes in <it>C. elegans</it>, 211 in <it>C. briggsae</it>, and 310 in <it>C. remanei </it>(Figure <figr fid="F1">1</figr> and additional files <supplr sid="S4">4</supplr>, <supplr sid="S6">6</supplr>, and <supplr sid="S7">7</supplr>). It should be pointed out that <it>C. briggsae </it>and <it>C. remanei </it>TF genes are based on computational predictions and that the <it>C. remanei </it>genome has yet to be assembled; hence while many of the TF genes without detectable orthologs may have arisen by lineage-specific gene duplication, others could result from incomplete annotation of the <it>C. briggsae </it>and <it>C. remanei </it>genomes. Therefore, the actual number of divergent TF genes in these species is likely to be smaller than the numbers we have estimated. To further study this set of genes in <it>C. briggsae </it>(211), we searched for their closest homologs in <it>C. elegans</it>. This revealed 30 genes with weak sequence similarity (BLASTP E-value > 10<sup>-10</sup>) suggesting that these most likely represent candidate <it>C. briggsae</it>-specific TF genes (Additional file <supplr sid="S9">9</supplr>). The remaining 181 appear to be species-specific paralogs, of which 69 are zinc finger-C4/nuclear hormone receptor (ZF-C4/NHR) family members (see below).</p>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p>BLASTP hits of <it>C. briggsae</it>-divergent TFs in <it>C. elegans </it>genome.</p>
               </text>
               <file name="1471-2164-9-399-S9.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Previous studies in humans and other organisms have shown that TF genes tend to evolve more rapidly than non-transcription factor (non-TF) genes <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>, therefore we performed a similar analysis in nematodes by analyzing their coding sequence divergence and comparing it to non-TF genes. Due to the large divergence times between the three species <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B15">15</abbr></abbrgrp>, the rate of synonymous substitution per synonymous site (<it>d</it><sub>S</sub>) for many genes is likely to be saturated (<it>d</it><sub>S </sub>> 3, Figure <figr fid="F2">2A</figr>). Therefore we restricted our analysis to the rate of non-synonymous substitution per non-synonymous site (<it>d</it><sub>N</sub>), which does not show such saturation <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. We found a significantly higher <it>d</it><sub>N </sub>for TF genes conserved among nematodes (652) when compared to 3-way conserved non-TF gene orthologs (10,827 genes; Kruskal-Wallis rank sum test, <it>p </it>&lt; 2.2 &#215; 10<sup>-16</sup>; Figure <figr fid="F2">2B</figr>), whereas no difference was detected between TF genes with orthologs in nematodes, fly, mouse, and human (150) and non-TF genes (Kruskal-Wallis test, <it>p </it>= 0.3498).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Sequence divergence of transcription factors in <it>C. elegans</it></p>
               </caption>
               <text>
                  <p><b>Sequence divergence of transcription factors in <it>C. elegans</it>.</b> Rates of synonymous substitutions per synonymous site (<it>d</it><sub>S</sub>) (A) and non-synonymous substitutions per non-synonymous site (<it>d</it><sub>N</sub>) (B) as calculated under model 0 in PAML (Yang) for non-TF (10,827), TF genes conserved in nematodes, fly, mouse, and human (150) and TF genes conserved among the three nematode species (652) are shown. The boxplot indicates the first and third quartiles and the dotted lines the 5th and 95th percentiles. The notches indicate the level of uncertainty associated with the median.</p>
               </text>
               <graphic file="1471-2164-9-399-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Distribution of TF families in <it>C. elegans</it></p>
            </st>
            <p>We studied the distribution of protein families among <it>C. elegans </it>TFs based on known DNA binding domains. This analysis revealed more than 50 distinct families of which 30 were found to contain 5 or more members (Figure <figr fid="F3">3</figr>). No significant difference was observed in the representation of various families between the <it>C. elegans </it>set and the set conserved among three nematodes (&#967;<sup>2 </sup>= 35.05, df = 30, <it>p </it>= 0.2408), indicating that the distributions of TF families in these species may be similar. The majority of genes in <it>C. elegans </it>and nematode TF sets (28.6% and 20.5%, respectively) were found to encode the nuclear hormone receptors (NHRs), a C4-type sub-family of zinc finger proteins that play key roles in development and homeostasis <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The NHR genes was previously shown to have undergone extensive lineage-specific expansion in <it>C. elegans </it><abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Besides NHR, HOX genes that regulate cell fate specification and embryogenesis <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> are also among the largest TF families in nematodes (11% of <it>C. elegans </it>TF genes, and 11.6% of nematode TF genes) (Figure <figr fid="F3">3A, C</figr>). In contrast, the distribution of TF families among the divergent <it>C. elegans </it>gene set (215 genes) differs significantly from that observed among the entire <it>C. elegans </it>TF set (&#967;<sup>2 </sup>= 83.91, df = 30, <it>p </it>= 5.33 &#215; 10<sup>-7</sup>) due to its high proportion of NHR genes (52.6%, Figure <figr fid="F3">3B</figr>). Likewise, the representation of different families among TF genes with orthologs in nematodes, fly, mouse, and human also differs from that of the <it>C. elegans </it>TF set (&#967;<sup>2 </sup>= 152.27, df = 30, <it>p </it>= 0, Figure <figr fid="F3">3D</figr>). Interestingly, the single largest conserved family represented among the orthologs in three different phyla is HOX (17.3%), supporting multiple previous studies indicating the importance of this family among all metazoans <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Distribution of TF gene families in <it>C. elegans</it></p>
               </caption>
               <text>
                  <p><b>Distribution of TF gene families in <it>C. elegans</it>.</b> The pie charts show distributions of all (A), divergent (B), nematode-conserved (C), and nematode-fly-mouse-human-conserved (D) TF genes in <it>C. elegans</it>. For details on various gene families please refer to Materials and Methods.</p>
               </text>
               <graphic file="1471-2164-9-399-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Chromosomal distribution of TF genes in <it>C. elegans </it>and <it>C. briggsae</it></p>
            </st>
            <p>Studies in <it>C. elegans </it>as well as other organisms have shown that genes that are co-expressed and/or functionally related are frequently clustered together on chromosomes <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. To investigate whether TF genes in nematodes exhibit a similar pattern, we plotted the physical locations of <it>C. elegans </it>and <it>C. briggsae </it>TF genes using non-overlapping windows of 200 kb (the genome of <it>C. remanei </it>has not yet been assembled and therefore was not used in this analysis). Figure <figr fid="F4">4</figr> shows that TF genes in <it>C. elegans </it>and <it>C. briggsae </it>as well as those that are conserved among nematodes are non-randomly distributed on chromosomes. A total of 183 <it>C. elegans </it>TF genes were found to be located in 25 distinct clusters (marked with stars in Figure <figr fid="F4">4A</figr>, Table <tblr tid="T3">3</tblr>). A similar pattern was observed in <it>C. briggsae </it>(184 genes in 27 clusters) (Figure <figr fid="F4">4B</figr> and Table <tblr tid="T3">3</tblr>). Chromosome V carries highest number of clusters (and genes) in both species (<it>C. elegans</it>: 97 genes in 10 clusters; <it>C. briggsae</it>: 64 genes in 7 clusters) (Table <tblr tid="T3">3</tblr>) that are primarily composed of NHR family members (92% in <it>C. elegans </it>and 84% in <it>C. briggsae</it>, red bars in Figure <figr fid="F4">4</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Chromosomal distribution of TF genes in <it>C. elegans </it>(A) and <it>C. briggsae </it>(B)</p>
               </caption>
               <text>
                  <p><b>Chromosomal distribution of TF genes in <it>C. elegans </it>(A) and <it>C. briggsae </it>(B).</b> The maps have been plotted by taking all TF genes in non-overlapping 200 kb windows. The color codes are as follows. Red: NHR genes, blue: non-NHR TF genes, green: TF genes conserved among the three nematode species. Gene clustering was analyzed by comparing the numbers of TF and non-TF genes located in each window using a &#967;<sup>2 </sup>test. Gene clusters that are significantly enriched have been marked with stars (*: P &lt; 0.05; ** P &lt; 0.01; ***: P &lt; 0.0001).</p>
               </text>
               <graphic file="1471-2164-9-399-4"/>
            </fig>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Chromosome-wise breakdown of TF gene clusters in <it>C. elegans </it>and <it>C. briggsae</it>.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Chromosome</b>
                        </p>
                     </c>
                     <c ca="center" cspan="2">
                        <p>
                           <b>
                              <it>C. elegans</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center" cspan="2">
                        <p>
                           <b>
                              <it>C. briggsae</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Number of clusters</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Number of genes</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Number of clusters</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Number of genes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>97</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>64</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>X</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>TOTAL</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>25</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>183</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>27</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>184</b>
                        </p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The analysis of the chromosomal distribution of TF genes also revealed that many members of the large TF families, such as ZF-C4/NHR, ZF-C2H2, T-box and HOX are arranged in perfect tandem arrays (defined as having a contiguous repetition of TF genes) (267 <it>C. elegans </it>genes in 103 arrays, 235 <it>C. briggsae </it>genes in 107 arrays), the largest of which consists of 8 NHR genes in <it>C. elegans </it>and 6 NHR genes in <it>C. briggsae </it>(Figure <figr fid="F5">5</figr> and additional files <supplr sid="S10">10</supplr> and <supplr sid="S11">11</supplr>). Although the majority of such arrays consists of genes of the same TF family (76.7% in <it>C. elegans </it>and 67.3% in <it>C. briggsae</it>), less than half of all genes found in such arrays have best-reciprocal hit orthologs between the two species (31.6% in <it>C. elegans</it>, 47.2% in <it>C. briggsae</it>) (Additional files <supplr sid="S10">10</supplr> and <supplr sid="S11">11</supplr>) suggesting significant lineage-specific duplication and expansion of the tandem arrays.</p>
            <suppl id="S10">
               <title>
                  <p>Additional file 10</p>
               </title>
               <text>
                  <p>Tandem arrays of TF genes in <it>C. elegans</it>.</p>
               </text>
               <file name="1471-2164-9-399-S10.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S11">
               <title>
                  <p>Additional file 11</p>
               </title>
               <text>
                  <p>Tandem arrays of TF genes in <it>C. briggsae</it>.</p>
               </text>
               <file name="1471-2164-9-399-S11.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Tandem arrays of NHR genes in <it>C. elegans </it>and <it>C. briggsae</it></p>
               </caption>
               <text>
                  <p><b>Tandem arrays of NHR genes in <it>C. elegans </it>and <it>C. briggsae</it>.</b> The snapshots of the genomic regions, visualized by Wormbase genome browser, show 8 genes in <it>C. elegans </it>and 6 in <it>C. briggsae</it>. The colors of the open reading frames indicate their orientation (blue: leftward, pink: rightward).</p>
               </text>
               <graphic file="1471-2164-9-399-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Evolution of the Nuclear Hormone Receptor family in nematodes</p>
            </st>
            <p>Our findings extend Robinson-Rechavi et al.'s analysis of the extensive lineage-specific expansion of NHR genes in <it>C. elegans </it><abbrgrp><abbr bid="B29">29</abbr></abbrgrp> to the other two <it>Caenorhabditid </it>species (283, 232, and 256 NHR genes in <it>C. elegans</it>, <it>C. briggsae</it>, and <it>C. remanei</it>, respectively). The sequence analyses revealed a total of 134 NHR genes having 3-way best-reciprocal orthologs among the nematode species (Additional files <supplr sid="S4">4</supplr>, <supplr sid="S6">6</supplr>, and <supplr sid="S7">7</supplr>). The remaining NHRs are composed of what appear to be lineage-specific paralogs and those that have diverged sufficiently in sequence such that orthologous relationships could no longer be assigned.</p>
            <p>We constructed a phylogenetic tree of the nematode NHR family members (437 genes, see Materials and Methods) to study their inter&#8211; as well as intra-specific relationships. The most striking feature of the phylogeny is the frequent presence of several closely related NHRs located tandemly on the same chromosome (Additional file <supplr sid="S12">12</supplr>). Such groupings suggest the presence of extensive tandem duplications, which could explain the mechanism behind the expansion of the NHR gene family, and perhaps the independent occurrence of some NHR genes in the lineages of each of these species. In the case of <it>C. elegans </it>NHRs, we found at least 15 distinct groups on chromosome V including 7 that are located in one large cluster of the phylogeny (Additional file <supplr sid="S12">12</supplr>).</p>
            <suppl id="S12">
               <title>
                  <p>Additional file 12</p>
               </title>
               <text>
                  <p>Phylogenetic tree of NHR genes in <it>Caenorhabditid </it>nematode species. Colors mark NHR genes in different species (blue: <it>C. elegans</it>, red: <it>C. briggsae </it>and light green: <it>C. remanei</it>). Tandemly along chromosomes and phylogenetically clustered genes are indicated by vertical bars. Chromosomes carrying NHR clusters are indicated by roman numerals. The sub-branch comprised of seven groups of NHR genes on chromosome V has been marked by a star (*). Scale bar represents 0.5 substitutions per site.</p>
               </text>
               <file name="1471-2164-9-399-S12.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The presence of NHRs in chromosomal clusters prompted us to study their distribution in further detail. We identified a total of 47 tandem arrays composed of contiguous repetitions of NHR genes in <it>C. elegans</it>, which are found on all chromosomes with the exception of chromosome III (Additional file <supplr sid="S10">10</supplr>). These include 10 arrays that are comprised of 5 or more genes, all of which are located on chromosome V. A similar analysis in <it>C. briggsae </it>identified 30 NHR arrays having 6 or fewer genes (Additional file <supplr sid="S11">11</supplr>). In total, 9 NHR arrays were partially or completely conserved between <it>C. elegans </it>and <it>C. briggsae</it>. One of these arrays, for instance, consists of 7 genes in <it>C. elegans </it>(<it>nhr-136, nhr-153, nhr-154, nhr-206, nhr-207, nhr-208</it>, and <it>nhr-209</it>) and the corresponding 4 in <it>C. briggsae </it>(<it>CBG23383/Cbr-nhr-136, CBG23380/Cbr-nhr-153, CBG23380/Cbr-nhr-154 </it>and <it>CBG23379/Cbr-nhr-209</it>). This suggests that either the array has expanded in <it>C. elegans </it>or perhaps lost 3 of the genes in <it>C. briggsae</it>. Examination of the <it>C. remanei </it>TFs revealed the presence of best reciprocal hit orthologs for all array members found in <it>C. elegans </it>with the exception of <it>nhr-206 </it>leading us to propose that <it>nhr-207 </it>and <it>nhr-208 </it>were most likely lost in the <it>C. briggsae </it>lineage. This analysis, however, carries a caveat in that the annotations of the <it>C. briggsae </it>and <it>C. remanei </it>genomes are based on computational predictions and lack experimental validation.</p>
            <p>Finally, we found that 7 tandem arrays in <it>C. briggsae </it>are composed of NHR genes that lack best reciprocal hit orthologs in <it>C. elegans </it>and <it>C. remanei </it>(Additional file <supplr sid="S11">11</supplr>). The largest of these is comprised of 6 NHR genes (<it>CBG01243, CBG01244, CBG01245, CBG01246, CBG01247, CBG01248</it>) (Figure <figr fid="F5">5</figr>). These <it>C. briggsae</it>-specific arrays may be caused by lineage-specific expansion although the possibility of a selective loss of their orthologs in other species cannot be ruled out.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison of TF gene sequence conservation and function in <it>C. elegans</it></p>
            </st>
            <p>We investigated the relationship between sequence conservation and function of TF genes in <it>C. elegans</it>. From a comprehensive list of 13,647 RNAi phenotypes associated with 4,351 genes <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, we identified 281 TFs that exhibit one or more mutant phenotypes (Additional file <supplr sid="S13">13</supplr>). These consist of more than half of all TF genes conserved among nematodes, fly, mouse, and human (52.7%, 79 of 150), over one-third of genes conserved among the three nematode species (36.5%, 238 of 652), and one-fifth of the TF genes in <it>C. elegans </it>that did not have identifiable orthologs in the other nematode species (20%, 43 of 215). We also determined the number of distinct mutant phenotypes associated with TF genes in each of the above three groups as well as with non-TF genes. This analysis revealed that TF genes conserved among nematodes, fly, mouse, and human are linked to a significantly greater number of mutant phenotypes in <it>C. elegans </it>when compared to the other sets (4.38 &#177; 2.31, 3.36 &#177; 2.09, 2.91 &#177; 1.82 and 3.19 &#177; 1.84 phenotypes per gene for TF genes conserved across phyla, conserved in nematodes, <it>C. elegans </it>TF genes without detectable orthologs in the other nematode species and non-TF genes, respectively; Kruskal-Wallis rank sum test, <it>p </it>= 8.58 &#215; 10<sup>-3</sup>, 1.8 &#215; 10<sup>-3</sup>, 1.32 &#215; 10<sup>-15</sup>, respectively, after Bonferroni correction). No difference was found in pairwise comparisons between the other gene sets (Kruskal-Wallis rank sum test <it>p </it>= 1, in all comparisons after Bonferroni correction).</p>
            <suppl id="S13">
               <title>
                  <p>Additional file 13</p>
               </title>
               <text>
                  <p><it>C. elegans </it>genes and their mutant phenotypes in RNAi assays.</p>
               </text>
               <file name="1471-2164-9-399-S13.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>To further analyze the roles of <it>C. elegans </it>TF genes in specific tissues and developmental processes, RNAi phenotypes were sorted into six broad categories: viability (embryonic and post-embryonic growth and survival), fertility (germline and germ cells), sex (sex determination and reproductive system), vulva (vulval cell proliferation and morphogenesis), body (cuticle, size, and morphology), and behavior (movement and feeding) (Additional files <supplr sid="S13">13</supplr> and <supplr sid="S14">14</supplr>). Among the six categories, "viability" ranks highest in terms of the proportion of TF and non-TF genes (Figure <figr fid="F6">6</figr>). However, it is important to keep in mind that this may be linked to a greater interest in identifying transcription factors that are involved in growth and survival of <it>C. elegans</it>. A further sub-classification of this category into "embryonic viability" and "post-embryonic viability" (based on the phenotype when lethality occurs in RNAi-treated animals) revealed that among the "embryonic viability" class TF genes conserved among nematode species are significantly under-represented when compared to the non-TFs (&#967;<sup>2 </sup>= 7.39, df = 1, <it>p </it>= 6.56 &#215; 10<sup>-3</sup>) (Figure <figr fid="F6">6</figr>), while no difference was observed among the datasets for genes affecting post-embryonic viability (&#967;<sup>2 </sup>= 0.77, df = 1, <it>p </it>= 0.38). By contrast, the TF genes conserved among nematodes, fly, mouse, and human showed no enrichment for any of these two categories (&#967;<sup>2 </sup>= 3.59 and 0.39, df = 1, <it>p </it>= 0.059 and 0.53, respectively). Among other categories, we observed an over-representation of mutant phenotypes for nematode-conserved as well as nematode-fly-mouse-human-conserved TF gene sets associated with vulval development (&#967;<sup>2 </sup>= 8.24 and 11.75, df = 1, <it>p </it>= 4.1 &#215; 10<sup>-3 </sup>and 6.08 &#215; 10<sup>-4</sup>, respectively) and sex determination and reproductive system-related processes (&#967;<sup>2 </sup>= 9.51 and 8.78, df = 1, <it>p </it>= 2.04 &#215; 10<sup>-3 </sup>and 3.59 &#215; 10<sup>-3</sup>, respectively) when compared to the non-TF gene set.</p>
            <suppl id="S14">
               <title>
                  <p>Additional file 14</p>
               </title>
               <text>
                  <p>RNAi phenotypes sorted into six broad categories.</p>
               </text>
               <file name="1471-2164-9-399-S14.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Functional classification of <it>C. elegans </it>TF genes</p>
               </caption>
               <text>
                  <p><b>Functional classification of <it>C. elegans </it>TF genes.</b> The six broad categories are based on the mutant phenotypes in RNAi studies. Non-TF genes have been plotted for comparison. Viability-E and viability-PE are based on the embryonic and post-embryonic stage lethality phenotypes in RNAi assays, respectively. Refer to text for the description of other categories.</p>
               </text>
               <graphic file="1471-2164-9-399-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phenotypes associated with nematode TF orthologs in fly, mouse, and human</p>
            </st>
            <p>The above findings that more than half of <it>C. elegans </it>TF genes conserved across phyla are associated with RNAi phenotypes prompted us to examine their mutant phenotypes in other organisms. We found that 69 (46%) of TF genes conserved among nematodes, fly, mouse, and human are associated with lethal phenotype in fly (Additional file <supplr sid="S15">15</supplr>). In the case of mouse, out of a total of 81 orthologs for which knock out and mutant phenotypes are described (see Materials and Methods), 75 (92.6%) exhibit defects ranging from mild to gross abnormalities, including lethality (Additional file <supplr sid="S15">15</supplr>). A similar analysis in human revealed 35 TF genes linked to various diseases and genetic disorders (Table <tblr tid="T4">4</tblr>). In total, 44 (29.3%) TF genes regulating <it>C. elegans </it>development and behavior are also essential either in mouse or human or both. These include 30 genes that control viability in the fruit fly. Overall, 121 (80.7%) TF genes conserved among nematodes, fly, mouse, and human play important roles in at least one of these organisms. This is likely an underestimate due to technical limitations of RNAi experiments (e.g., strains, redundancy of factors or pathways) and that comparisons between organisms involve different experimental approaches (e.g., RNAi in <it>C elegans </it>and chromosomal mutations in <it>D. melanogaster</it>). Thus, functional studies of conserved TFs in <it>C. elegans </it>promise to elucidate mechanisms involved in biological processes conserved across phyla.</p>
            <suppl id="S15">
               <title>
                  <p>Additional file 15</p>
               </title>
               <text>
                  <p>List of the worm (<it>C. elegans</it>) fly (<it>D. melanogaster</it>) and mouse (<it>M. musculus</it>) mutant phenotypes associated with conserved TF genes.</p>
               </text>
               <file name="1471-2164-9-399-S15.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Genetic disorders linked to human TF genes conserved among nematodes, fly, mouse, and human.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>
                           <b><it>C. elegans </it>gene</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Human ortholog</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Human disorder</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>vab-3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pax6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Aniridia type II, Peters anomaly with cataract, foveal hypoplasia</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Y38H8A.5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>FEZF1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Beckwith-Wiedemann syndrome</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ceh-33</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>SIX1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Branchiootic syndrome 3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>mab-9</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>TBX20</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Cardiomyopathy, atrial septal defect 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>tag-192</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>CHD7</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>CHARGE syndrome</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ceh-24</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>TITF1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Congenital hypothyroidism, neonatal respiratory insufficiency</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>dve-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>SATB2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Cleft palate isolated</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ceh-14</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>LHX3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Combined pituitary hormone deficiency 3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>unc-86</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pou4f3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>DFNA15 syndrome</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>elt-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>GATA1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Dyserythropoietic anemia with thrombocytopenia</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>K02H8.1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>MBNL2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Dystrophia myotonica 1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>fax-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Nr2e3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Enhanced s-cone syndrome</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ceh-17</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>PHOX2A</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Congenital fibrosis of the extraocular muscles 2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ceh-32</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>SIX3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Holoprosencephaly 2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>sbp-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Srebf1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Hypercholesterolemia, familial</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>lin-28</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>LIN28B</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Hypomyelination and cataract</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>alr-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ARX</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Lissencephaly, X-linked, with ambiguous genitalia</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>hmg-5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tfam</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Kearns-Sayre syndrome</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>cnd-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>NEUROD1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Maturity-onset diabetes of the young</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>lim-6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>LMX1B</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Nail patella syndrome NPS1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>grh-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>GRHL2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Neurosensory deafness 28</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>sma-4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Smad4</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Pancreatic cancer, Hemorrhagic Telangiectasia Syndrome (HTT)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>nhr-6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>NR4A2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>PARK14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ceh-6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>POU3F3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Perilymphatic gusher-deafness syndrome</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>zag-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ZEB1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Posterior polymorphous corneal dystrophy 3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>eor-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>MYNN</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Promyelocytic leukemia</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>R07E5.3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Smarcb1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Rhabdoid tumor</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>cbp-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>CREBBP</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Rubinstein-taybi syndrome, acute myeloid leukemia</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ceh-43</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>DLX5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Split-hand/foot malformation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ing-3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>ING3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Squamous cell carcinoma</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ast-1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>FLI1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Thrombocytopenia, Paris-Trousseau type</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>nhr-64</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>HNF4A</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Maturity-onset diabetes of the young</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>tbx-2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tbx2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Ulnar mammary syndrome</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>K02D7.2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>SNAI2</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Waardenburg syndrome, piebaldism</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>F53F8.1</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>KLF3</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Wilms tumor</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Analysis of TF interaction networks in <it>C. elegans</it></p>
            </st>
            <p>To further explore the mechanism of transcription factor function in metazoans, we generated an interaction map of <it>C. elegans </it>TF genes based on known physical and genetic interactions <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. The map consists of 1594 interactions involving 277 TF genes and their direct non-TF interactors (Figure <figr fid="F7">7A</figr> and additional file <supplr sid="S16">16</supplr>). The network appears to be scale free as seen by the presence of several nodes with high degree of connectivity (such as <it>lin-35</it>, which shows the highest number of interactions and is connected to more than one-third of all existing nodes; 521 of 1340) (Figure <figr fid="F7">7B</figr>). <it>lin-35 </it>is an ortholog of the human <it>Retinoblastoma </it>(<it>Rb</it>) gene which plays an important role in cell proliferation <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. Among the <it>lin-35 </it>interacting genes, 43 (8%) encode TFs, of which 18 have best reciprocal hit orthologs in mouse and human. Other prominent hubs include <it>pal-1 </it>(conserved among nematode species), as well as other TF genes with orthologs in nematodes, fly, mouse, and human: <it>tag-331, eya-1</it>, and <it>sma-4 </it>(Figure <figr fid="F7">7B</figr>). Each of these genes plays important role in <it>C. elegans </it>development, and RNAi-mediated knock-downs cause defects such as slow growth (<it>pal-1</it>), lethality (<it>pal-1, tag-331</it>), larval arrest (<it>eya-1, tag-331</it>), uncoordinated movement (<it>eya-1</it>), and small size (<it>sma-4</it>) <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Interestingly, the subnetwork comprising of the hub gene <it>tag-331 </it>(human ortholog RNF113A) and its 32 direct interactors appears to be largely isolated. A closer examination revealed that two-thirds of these 22 genes are conserved in nematodes yet lack best reciprocal hit orthologs in fly, mouse, or human genomes. The remaining third includes four genes conserved in nematodes, fly, mouse, and human (<it>zfp-1, R11F4.1, apl-1 </it>and <it>fcd-2</it>) and whose human homologs are linked to disorders (AF10/MLLT10: leukemia, Glycerol kinase: hyperglycerolemia, APP: Alzheimer's, and FANCD2: Fanconi anemia). It remains to be determined if the human genes interact with RNF113A as well as whether RNF113A is involved in any of these diseases. Among the remaining hub genes, the <it>eya-1 </it>mammalian orthologs promote development of tissues and organs <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp> whereas the <it>sma-4 </it>ortholog SMAD4/DCP4 acts as a tumor suppressor <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>.</p>
            <suppl id="S16">
               <title>
                  <p>Additional file 16</p>
               </title>
               <text>
                  <p>List of TF genes and their interactors. The columns A and B merely list gene pairs that interact with each other and do not indicate the direction of regulation.</p>
               </text>
               <file name="1471-2164-9-399-S16.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>The interaction network of <it>C. elegans </it>TF genes</p>
               </caption>
               <text>
                  <p><b>The interaction network of <it>C. elegans </it>TF genes.</b> (A) The breakdowns of TF nodes, non-TF nodes and gene interactions in the network for each of the TF categories. The <it>C. elegans</it>-divergent category refers to TF genes that lack unique reciprocal orthologs in other nematode species. (B) The network exhibits several high degree nodes, five of which &#8211; <it>tag-331, eya-1, lin-35, sma-4</it>, and <it>pal-1 </it>&#8211; are boxed and shown at high magnification on the right (marked by arrows). The node colors mark different TF genes (red: conserved among nematodes, fly, mouse, and human; yellow: conserved among the three nematode species; green: <it>C. elegans</it>-divergent). The network was visualized by using Cytoscape <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2164-9-399-7"/>
            </fig>
            <p>In addition to analyzing the prominent hubs in the interaction network, we also examined the relationship between connectivity of TFs, sequence conservation, and known function. The results revealed a significantly greater number of interactions among <it>C. elegans </it>TFs that are conserved in nematodes, fly, mouse, and human, as compared to those that are not (Kruskal-Wallis rank sum test, <it>p </it>= 0.0207, after Bonferroni correction). We also found that TF genes associated with mutant phenotypes in RNAi assays exhibit significantly more interactions when compared to those that lack a detectable phenotype (Kruskal-Wallis rank sum test, <it>p </it>= 0.0069). These results are consistent with previous studies showing that highly connected hubs tend to be enriched in essential genes <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>This paper presents the first genome-wide comparative study of TF genes in nematodes and their orthologs in fly (<it>D. melanogaster</it>), mouse (<it>M. musculus</it>), and human (<it>H. sapiens</it>). We took both computational and manual curation approaches to compile sets of TF genes in three <it>Caenorhabditid </it>species, leading to the identification of 988 genes in <it>C. elegans</it>, 995 in <it>C. briggsae </it>and 1093 in <it>C. remanei</it>. A comparison of these data sets has revealed 652 3-way best reciprocal orthologs among these species. Furthermore, using currently available genome annotations, we identified 150 TF gene orthologs shared among nematodes, fly, mouse, and human and shown that according to mutant phenotypes or associated disorders, many of these genes are functionally important. It should be noted that many of the TF genes identified in <it>C. elegans </it>as well as most of those identified as orthologs, paralogs, and divergent in the other two nematode species are based entirely on computational predictions, and thus await experimental validation. However, the results of our study suggest the most likely group of candidate genes from which further experimental tests of TF activity can be designed. In contrast, the majority of the orthologs identified in the two other phyla are annotated as TF genes themselves, owing to the extensive experimental validation performed in these organisms.</p>
         <p>The sequence comparison of orthologs among nematodes has revealed that TF genes conserved among the three nematodes species (652 genes) are evolving more rapidly than non-TF genes, which is in agreement with earlier reports from other species in which TF genes have been shown to be evolving more rapidly than the coding genome average, and that significantly more TF genes have been found to be evolving under positive selection when compared to the rest of the genome <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. While our observation of a greater number of conserved orthologs among all three nematode species, coupled to an accelerated rate of divergence may seem paradoxical, it may be suggestive of widespread positive selection, and thus divergence, acting on genes that are otherwise functionally important. Given the wide estimates of the divergence time between the three nematode species considered in this study, it is unsurprising that the rate of synonymous substitution (<it>d</it><sub>S</sub>) is saturated, and is therefore not amenable for use in analyses that could test the hypothesis of widespread positive selection among TF genes. Additional data, such as a large-scale polymorphism analysis among multiple <it>Caenorabditid </it>nematodes could provide the sensitivity to test for evidence of differential selective pressure affecting specific gene groups.</p>
         <p>The analysis of TF families in nematodes has revealed several interesting features, such as the high proportion of C2H2 and C4/NHR class of zinc-finger family members relative to the other TF families in all three species (see Figure <figr fid="F3">3</figr>). It was previously shown that the NHR family has undergone significant lineage-specific expansion in <it>C. elegans </it>and <it>C. briggsae </it><abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Considering, for example, that <it>Drosophila </it>and humans carry less than 50 identified NHR genes (21 and 48, respectively) <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, the presence of more than 200 genes in <it>Caenorhabditid </it>species is striking. Although it remains to be seen whether all of these have important roles to play, studies in <it>C. elegans </it>have shown that roughly 10% of NHRs mediate diverse processes including molting (<it>nhr-23, nhr-25, nhr-67</it>), neuronal differentiation (<it>unc-55, fax-1</it>), sex determination (<it>sex-1</it>), and dauer formation (<it>daf-12</it>) <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. We found that roughly half of all NHRs in each of the <it>Caenorhabditid </it>species are conserved as 3-way best reciprocal orthologs and another 10% exhibit 2-way orthologous relationships with at least one of the other nematode species. The remaining NHRs are likely to have arisen from lineage-specific gene duplications, suggesting that this class of TF may have a significant role in many of those differences that make individual nematode species unique. While the expansion of the NHR family in nematodes is certainly unusual, other TF families show interesting lineage-specific features as well. Previous studies as well as results presented here indicate that TF families such as ZF-C2H2, HOX and T-box have also diverged between the <it>C. elegans </it>and <it>C. briggsae </it>lineages (see Figure <figr fid="F3">3B</figr> and additional file <supplr sid="S9">9</supplr>) <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
         <p>Our work demonstrates that TF genes are non-randomly distributed in the genomes of both <it>C. elegans </it>and <it>C. briggsae</it>. We found that members of gene families such as NHR, HOX, and T-box are frequently clustered and present in tandem arrays. A subset of the rapidly evolving NHR family of TF genes in <it>C. elegans </it>was previously shown to be located on chromosome V <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. We have shown not only that <it>C. briggsae </it>exhibits a similar pattern, but also that the majority of the chromosome V NHRs in both species is tandemly arrayed. Our finding that many NHRs appear to be lineage-specific paralogs suggests that gene duplication has played a significant role in the expansion of this gene family in nematodes. The phenomenon of gene clustering has been observed not only in <it>C. elegans</it>, but also in other species such as <it>D. melanogaster </it>and mouse <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B55">55</abbr><abbr bid="B57">57</abbr></abbrgrp>, and in some cases these clusters are composed of genes that are co-expressed <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B34">34</abbr></abbrgrp>. While the precise mechanism of the origin of such clusters remains to be determined, these may be caused by small-scale regional translocations and illegitimate recombination events leading to tandem gene duplications <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>.</p>
         <p>Our study has revealed that <it>C. elegans </it>TF genes conserved across multiple phyla are more likely to be associated with mutant phenotypes when compared to the remaining TF and non-TF genes. Likewise, the fly, mouse, and human orthologs of <it>C. elegans </it>TF genes are enriched in essential genes when compared to <it>C. elegans </it>TF genes without detectable orthologs (46%, 50% and 23.3%, respectively). The analysis of the relationship between gene function and interactions revealed that TF genes conserved across phyla exhibit greater number of interactions and mutant phenotypes when compared to those that are divergent. Among the TFs with described interactions, <it>lin-35 </it>(human <it>Rb </it>ortholog) appears to have an exceptionally large number of interactions. <it>lin-35 </it>is known to interact with cell cycle-related and chromatin remodeling factors to regulate tissue growth and morphology <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. We found that among the <it>lin-35 </it>interacting genes, 43 (8%) encode TFs, of which 18 have best reciprocal hit orthologs in mouse and human. It is important to keep in mind that conservation in sequence does not indicate the roles of orthologous genes in regulating similar biological processes. Instead, it simply means that genes that are evolutionarily conserved are very likely to play important roles in the development and functioning of the organism. Our results are also consistent with studies in other organisms that have found a significant correlation between connectivity, rate of evolution and gene dispensability (according to lethal or sterile phenotype), even across multiple metazoan phyla. In general, hubs with high degree of connectivity tend to be enriched in essential genes and appear to evolve relatively slower than genes with lower connectivity <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B50">50</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>This study describes a genome-wide analysis of TF genes in three <it>Caenorhabditid </it>nematode species (<it>C. elegans</it>, <it>C. briggsae </it>and <it>C. remanei</it>) as well as their orthologs in fruit fly (<it>D. melanogaster</it>), mouse (<it>M. musculus</it>) and human (<it>H. sapiens</it>). We observed a significantly higher conservation of orthology for the TF genes among <it>Caenorhabditid </it>species, while also noting that the coding sequence of TF genes diverges more rapidly than the coding genome average. Finally, the analyses of sequence conservation, gene interactions, and function revealed that TF set conserved in nematodes, fly, mouse, and human is significantly more enriched in essential genes compared to those that lack orthologs in other phyla. Our findings will serve as a resource in aiding us to understand transcriptional networks and their conservation and divergence among metazoa. The compilation of the TF sets also serves as a stepping-stone in generating various resources such as knock-out mutants, cDNA and promoter clones, and reporter gene expressing lines, with the intent of systematically mapping and studying TF function in nematodes. In parallel with many of ongoing initiatives in <it>C. elegans </it>these resources will provide foundation for future studies of the conservation of TF function and interaction across the breadth of biodiversity.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p><it>C. elegans</it>, <it>C. briggsae </it>and <it>C. remanei </it>TF gene sets</p>
            </st>
            <p>The <it>C. elegans </it>TF-encoding genes were searched using 8 GO terms (Table <tblr tid="T1">1</tblr>) within WS173 release of Wormbase. The <it>C. briggsae </it>and <it>C. remanei </it>TFs were identified using the HMMER <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B65">65</abbr></abbrgrp> and InParanoid programs <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The complete genome sequences of each of the three <it>Caenorhabditid </it>species were downloaded from WormBase (<it>C. elegans </it>release WS173, <it>C. briggsae </it>release WS173 and <it>C. remanei </it>release 11/29/2005) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. As the <it>C. remanei </it>predicted peptide dataset is known to contain redundant copies of genes due to heterozygosity in the sequenced genome, (E. Schwartz, personal communication) we used the CD-HIT program (version 2007-0131) <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> in order to cluster and remove all additional transcripts that had greater than or equal to 98% sequence similarity to other transcripts at the protein level. The original dataset of 25,948 transcripts was truncated down to 24,267 non-redundant transcripts that were used in further analysis <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
            <p>InParanoid was run with default values, using blastall version 2.2.14 with &#8211;VT emulation, on all three complete genome predicted peptide datasets in pairwise comparisons. The results were collected and placed into species-specific paralogs, 2- and 3-way best-hit reciprocal ortholog categories using custom PERL scripts. Each category was searched for genes from the <it>C. elegans </it>TF set and the number of TFs in each category was identified (Additional files <supplr sid="S6">6</supplr> and <supplr sid="S7">7</supplr>). HMM alignment-based searches were carried out on the <it>C. briggsae </it>and <it>C. remanei </it>predicted peptides using previously established techniques <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B67">67</abbr></abbrgrp>. The HMMER signature files (profiles) of known DNA binding domains were retrieved from Pfam <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. In most cases, a cut-off score of 0.1 was used. If a HMMER predicted TF gene in non-<it>elegans </it>species lacked a homolog in <it>C. elegans</it>, it was considered false positive and therefore removed creating the final, conservative datasets that were used in the study.</p>
            <p>The <it>C. elegans </it>orthologs of <it>D. melanogaster</it>, <it>M. musculus </it>and <it>H. sapiens </it>TFs were retrieved using the data available on the InParanoid database <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B69">69</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of the TF gene families</p>
            </st>
            <p>Genes were grouped into different families based on the presence of known DNA-binding domains according to the WormBase <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, Pfam <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>, and InterPro <abbrgrp><abbr bid="B70">70</abbr></abbrgrp> databases. Only well defined and unambiguous domains that are known to be involved in transcriptional regulation were considered. Families with fewer than 5 members were placed together in a miscellaneous category. The TF families shown in Figure <figr fid="F3">3</figr> are as follows. AP2: Activator protein-2 family; AT hook: AT hook DNA binding motif (preference to A/T rich region) family; bHLH: basic helix-loop-helix family; bZIP: basic leucine zipper family; CBFB/NF-YA: CCAAT binding factor family; CSD: Cold shock DNA binding domain family; HMG box: High mobility group box family; HOX: Homeobox family; MADF: Myb DNA binding domain family; SAND: DNA binding domain family named after Sp100, AIRE-1, NucP41/75, DEAF-1; SANT: Myb-like DNA binding domain; SMAD: SMAD (Mothers against decapentaplegic (MAD) homolog) domain family; T-box: T-box family; WH: Winged-helix family; WH-FH: Winged-helix and Forkhead domain family; WH-ETS: Winged-helix and ETS domain family; ZF-C2H2: C2H2-type zinc finger protein family; ZF-C2H2-BED: C2H2 and BED-type zinc finger protein family; ZF-BED: BED-type zinc finger family; ZF-C2H2-RING: C2H2 and RING-type zinc finger protein family; ZF-C4/NHR: C4-type zinc finger/Nuclear hormone receptor family; ZF-CCCH: C-x8-C-x5-C-x3-H class of zinc finger family; ZF-DHHC: DHHC-type zinc finger family; ZF-FLYWCH: FLYWCH-type of zinc finger family; ZF-GATA: GATA class of zinc finger family; ZF-PHD: C4HC3 zinc-finger-like motif family; ZF-others: zinc finger family members not listed above; ZF-DM: DM (dsx and mab-3) zinc finger family; ZF, AT hook: AT hook and zinc finger domain family; ZF, SANT: SANT and zinc finger domain family; Misc: Miscellaneous TF family not listed above.</p>
         </sec>
         <sec>
            <st>
               <p>Generation of the chromosomal map</p>
            </st>
            <p>The physical locations of <it>C. elegans </it>and <it>C. briggsae </it>TF and non-TF genes were retrieved from Wormbase (WS173 release) and grouped into non-overlapping windows of 200 kb (similar to the 250 kb used by <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>). A 400 kb window analysis was also performed and the conclusions remain the same (data not shown). Since many genes are alternatively spliced, we eliminated transcript-specific bias by focusing on single open reading frame for each transcription factor. In the case of <it>C. briggsae</it>, a total of 1329 genes were not assigned to any of the chromosomes and hence were excluded from the analysis. For simplicity, we only used the average between the start and end positions as a proxy for the gene position. The significance of TF clustering on chromosomes was determined by comparing their frequency with the overall frequency of genes in a given window using a &#967;<sup>2 </sup>test <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Clusters with <it>p </it>value less than 0.05 were considered significant.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis of the nematode NHR genes</p>
            </st>
            <p>The predicted <it>C. elegans </it>NHR gene dataset (283 genes) was used to identify orthologs and paralogs in <it>C. briggsae </it>and <it>C. remanei </it>using the complete genome INPARANOID datasets (see above). 204 and 152 potential homologs were identified in <it>C. briggsae </it>and <it>C. remanei</it>, respectively. The peptide dataset was aligned using Dialign 2.2 <abbrgrp><abbr bid="B71">71</abbr></abbrgrp> and then manually inspected. We identified two large conserved blocks within most predicted peptides and removed all sequences that did not align within these blocks. The remaining sequences were then realigned with Dialign 2.2 and truncated only to retain the two conserved domains. As per Robinson-Rechavi et al. <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> we chose to use only ungapped sites and removed first sequences missing significant portions of the conserved domains and finally excluded all gapped sites. In the end, we retained 437 sequences (213 <it>C. elegans</it>, 106 <it>C. briggsae </it>and 118 <it>C. remanei</it>) for phylogenetic analysis.</p>
            <p>The phylogeny was constructed using a maximum likelihood based method as implemented in PhyML <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> using the JTT substitution model <abbrgrp><abbr bid="B73">73</abbr></abbrgrp> with the default proportion of invariable sites (0.0) and rate heterogeneity between sites corrected by a gamma law (using the default gamma parameter of 1.0 and eight rate categories). The phylogeny was then bootstrapped by generating 1000 randomized datasets using SEQBOOT and assessing the percentage of consensus trees using CONSENSE, both in the PHYLIP package <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Calculation of TF divergence</p>
            </st>
            <p>DNA sequences from <it>C. elegans</it>, <it>C. briggsae </it>and <it>C. remanei </it>were aligned according to their protein alignment using Dialign 2.2 <abbrgrp><abbr bid="B75">75</abbr></abbrgrp> and RevTrans 1.4 <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>. Rates of synonymous substitutions per synonymous site (<it>d</it><sub><it>S</it></sub>) and non-synonymous substitutions per non-synonymous site (<it>d</it><sub><it>N</it></sub>) were estimated using codeml from PAML <abbrgrp><abbr bid="B77">77</abbr></abbrgrp>. Evolutionary rates between TF and non-TF data sets were compared using a permuted Kruskal-Wallis rank sum test using 10,000 permutations.</p>
         </sec>
         <sec>
            <st>
               <p>Curation of the mutant phenotypes of TFs</p>
            </st>
            <p>The RNAi phenotypes of all known <it>C. elegans </it>genes were retrieved from Wormbase (WS170 release). A total of 13,648 phenotypes associated with 4,351 genes were analyzed and sorted into 82 different categories (Unc, Dpy, Vul etc.) (Additional files <supplr sid="S13">13</supplr> and <supplr sid="S14">14</supplr>).</p>
            <p>For phenotypes associated with <it>C. elegans </it>TF orthologs in fly, mouse, and human, we searched Flybase <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>, NCBI OMIM <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>, PubMed <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>, and other public databases (<url>http://www.informatics.jax.org</url>, <url>http://www.bioscience.org/knockout/alphabet.htm</url>, <url>http://www.dsi.univ-paris5.fr/genatlas</url>, <url>http://www.genetests.org</url>). Only those phenotypes that were unambiguous and did not show discrepancy between different published sources were included. In order to reduced any effect linked to a differential amount of genes annotated as involved in particular mutant phenotypes, all the analyses were performed within each phenotypic class by comparing the distribution of genes with mutant phenotypes among the different sets (non-TF genes, TF genes, <it>C. elegans </it>TF genes, TF genes conserved among the three nematode species, and TF genes with orthologs in nematodes, fly, mouse, and human).</p>
         </sec>
         <sec>
            <st>
               <p>Construction of TF interaction network</p>
            </st>
            <p>The <it>C. elegans </it>gene network was built using the genetic and protein-protein interaction data for transcription factors curated by BioGRID (version 2.0.27 release) <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. The network was visualized by using Cytoscape <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p><it>d</it><sub>N:</sub> non-synonymous substitutions per non-synonymous site; <it>d</it><sub>S:</sub> synonymous substitutions per synonymous site; NHR: Nuclear hormone receptor; TF: Transcription factor; ZF: Zinc finger.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>The laboratories of BPG and RSS contributed to this publication. BPG and NK identified the <it>C. elegans </it>TF set, protein families, chromosomal maps, and mutant phenotypes. WH identified nematode TF orthologs in fly, mouse, and human and carried out most of the sequence alignments, and interaction network analysis. CA performed the InParanoid orthology searches creating the <it>C. briggsae </it>and <it>C. remanei </it>TF gene sets and constructed NHR phylogenetic tree. BPG, WH and CA drafted the manuscript. BPG conceived and coordinated the study. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Phil Cumbo, Eric Schwartz, Jack Chen, and anonymous reviewers for constructive comments and advice. This work was supported by Natural Sciences and Engineering Research Council of Canada (NSERC) funds to BPG and RSS. CA is supported by an NSERC Post-Graduate Doctoral Scholarship and NK was an NSERC undergraduate summer trainee.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Evolution at two levels: on genes and form</p>
            </title>
            <aug>
               <au>
                  <snm>Carroll</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>7</issue>
            <fpage>e245</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1174822</pubid>
                  <pubid idtype="pmpid" link="fulltext">16000021</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030245</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The evolution of transcriptional regulation in eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Wray</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Abouheif</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Balhoff</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Pizer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rockman</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Romano</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2003</pubdate>
            <volume>20</volume>
            <issue>9</issue>
            <fpage>1377</fpage>
            <lpage>1419</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msg140</pubid>
                  <pubid idtype="pmpid" link="fulltext">12777501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Working in the post-genomic <it>C. elegans</it> world</p>
            </title>
            <aug>
               <au>
                  <snm>Sternberg</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2001</pubdate>
            <volume>105</volume>
            <issue>2</issue>
            <fpage>173</fpage>
            <lpage>176</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(01)00308-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">11336667</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Evolution of development in closely related species of flies and worms</p>
            </title>
            <aug>
               <au>
                  <snm>Simpson</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>12</issue>
            <fpage>907</fpage>
            <lpage>917</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg947</pubid>
                  <pubid idtype="pmpid" link="fulltext">12459721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genetic control and evolution of sexually dimorphic characters in Drosophila</p>
            </title>
            <aug>
               <au>
                  <snm>Kopp</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Duncan</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Godt</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Carroll</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <issue>6812</issue>
            <fpage>553</fpage>
            <lpage>559</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35046017</pubid>
                  <pubid idtype="pmpid" link="fulltext">11117736</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Morphological evolution through multiple cis-regulatory mutations at a single gene</p>
            </title>
            <aug>
               <au>
                  <snm>McGregor</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Orgogozo</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Delon</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zanet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Payre</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>448</volume>
            <issue>7153</issue>
            <fpage>587</fpage>
            <lpage>590</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05988</pubid>
                  <pubid idtype="pmpid" link="fulltext">17632547</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Evolutionary innovation of the excretory system in <it>Caenorhabditis elegans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chamberlin</snm>
                  <fnm>HM</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2004</pubdate>
            <volume>36</volume>
            <issue>3</issue>
            <fpage>231</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1301</pubid>
                  <pubid idtype="pmpid" link="fulltext">14758362</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Deciphering AP-1 function in tumorigenesis: fra-ternizing on target promoters</p>
            </title>
            <aug>
               <au>
                  <snm>Verde</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Casalino</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Talotta</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yaniv</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weitzman</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Cell Cycle</source>
            <pubdate>2007</pubdate>
            <volume>6</volume>
            <issue>21</issue>
            <fpage>2633</fpage>
            <lpage>2639</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17957143</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Defining ETS transcription regulatory networks and their contribution to breast cancer progression</p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Findlay</snm>
                  <fnm>VJ</fnm>
               </au>
               <au>
                  <snm>Moussa</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Watson</snm>
                  <fnm>DK</fnm>
               </au>
            </aug>
            <source>J Cell Biochem</source>
            <pubdate>2007</pubdate>
            <volume>102</volume>
            <issue>3</issue>
            <fpage>549</fpage>
            <lpage>559</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/jcb.21494</pubid>
                  <pubid idtype="pmpid" link="fulltext">17661355</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The versatile worm: genetic and genomic resources for <it>Caenorhabditis elegans</it> research</p>
            </title>
            <aug>
               <au>
                  <snm>Antoshechkin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Sternberg</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>7</issue>
            <fpage>518</fpage>
            <lpage>532</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg2105</pubid>
                  <pubid idtype="pmpid" link="fulltext">17549065</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Developmental genomic approaches in model organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Reinke</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
            </aug>
            <source>Annu Rev Genomics Hum Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>153</fpage>
            <lpage>178</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genom.3.031302.100922</pubid>
                  <pubid idtype="pmpid" link="fulltext">12142361</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A map of the interactome network of the metazoan <it>C. elegans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Armstrong</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Bertin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ge</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Milstein</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Boxem</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vidalain</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Chesneau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Goldberg</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Martinez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rual</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Lamesch</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tewari</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>LV</fnm>
               </au>
               <au>
                  <snm>Berriz</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Jacotot</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Vaglio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Reboul</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hirozane-Kishikawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Gabel</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Elewa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baumgartner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bosak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sequerra</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mango</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Saxton</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Strome</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Van Den Heuvel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Piano</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Vandenhaute</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sardet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Doucette-Stamm</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gunsalus</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Harper</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Cusick</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>FP</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <issue>5657</issue>
            <fpage>540</fpage>
            <lpage>543</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1698949</pubid>
                  <pubid idtype="pmpid" link="fulltext">14704431</pubid>
                  <pubid idtype="doi">10.1126/science.1091403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Stein</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Blasiar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Clee</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Coghlan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coulson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>D'Eustachio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Fitch</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Kamath</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kuwabara</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Mardis</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Miner</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Minx</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mullikin</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Plumb</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schein</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Sohrmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stajich</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Willey</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>E45</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">261899</pubid>
                  <pubid idtype="pmpid" link="fulltext">14624247</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0000045</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Wormbase</p>
            </title>
            <url>http://www.wormbase.org</url>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Rates of deleterious mutation and the evolution of sex in Caenorhabditis</p>
            </title>
            <aug>
               <au>
                  <snm>Cutter</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Payseur</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>J Evol Biol</source>
            <pubdate>2003</pubdate>
            <volume>16</volume>
            <issue>5</issue>
            <fpage>812</fpage>
            <lpage>822</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1420-9101.2003.00596.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14635896</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Genomics and biology of the nematode <it>Caenorhabditis briggsae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Johnsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>WormBook</source>
            <publisher> WormBook</publisher>
            <editor>Community TCR</editor>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">18050493</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Wormbase FTP site</p>
            </title>
            <url>ftp://ftp.wormbase.org/pub/wormbase/genomes/</url>
         </bibl>
         <bibl id="B18">
            <title>
               <p>A compendium of <it>Caenorhabditis elegans</it> regulatory transcription factors: a resource for mapping transcription regulatory networks</p>
            </title>
            <aug>
               <au>
                  <snm>Reece-Hoyes</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Deplancke</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Shingles</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grove</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Hope</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Walhout</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>13</issue>
            <fpage>R110</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1414109</pubid>
                  <pubid idtype="pmpid" link="fulltext">16420670</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-13-r110</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Comparison of <it>C. elegans</it> and <it>C. briggsae</it> Genome Sequences Reveals Extensive Conservation of Chromosome Organization and Synteny</p>
            </title>
            <aug>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Baird</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Koboldt</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2007</pubdate>
            <volume>5</volume>
            <issue>7</issue>
            <fpage>e167</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1914384</pubid>
                  <pubid idtype="pmpid" link="fulltext">17608563</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0050167</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p><it>C. remanei</it> genome sequencing project</p>
            </title>
            <url>http://genome.wustl.edu/genome.cgi?GENOME=Caenorhabditis%20remanei</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Automatic clustering of orthologs and in-paralogs from pairwise species comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Remm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Storm</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>314</volume>
            <issue>5</issue>
            <fpage>1041</fpage>
            <lpage>1052</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.5197</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The HMMER software package</p>
            </title>
            <url>http://hmmer.janelia.org</url>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Inparanoid: a comprehensive database of eukaryotic orthologs</p>
            </title>
            <aug>
               <au>
                  <snm>O'Brien</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Remm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>Database issue</issue>
            <fpage>D476</fpage>
            <lpage>80</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540061</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608241</pubid>
                  <pubid idtype="doi">10.1093/nar/gki107</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Expression profiling in primates reveals a rapid evolution of human transcription factors</p>
            </title>
            <aug>
               <au>
                  <snm>Gilad</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Oshlack</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Smyth</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>440</volume>
            <issue>7081</issue>
            <fpage>242</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04559</pubid>
                  <pubid idtype="pmpid" link="fulltext">16525476</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Natural selection on protein-coding genes in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Bustamante</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Fledel-Alon</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Williamson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hubisz</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Glanowski</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tanenbaum</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Sninsky</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Hernandez</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Civello</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Cargill</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <issue>7062</issue>
            <fpage>1153</fpage>
            <lpage>1157</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04240</pubid>
                  <pubid idtype="pmpid" link="fulltext">16237444</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Evolution of genes and genomes on the Drosophila phylogeny</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Markow</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Kaufman</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gelbart</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Pollard</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Sackton</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Larracuente</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Abad</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Abt</snm>
                  <fnm>DN</fnm>
               </au>
               <au>
                  <snm>Adryan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Aguade</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Akashi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Aquadro</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Ardell</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Arguello</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Artieri</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Barbash</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Barker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barsanti</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Batterham</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>et</snm>
                  <fnm/>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>450</volume>
            <issue>7167</issue>
            <fpage>203</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06341</pubid>
                  <pubid idtype="pmpid" link="fulltext">17994087</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Sexual selection and maintenance of sex: evidence from comparisons of rates of genomic accumulation of mutations and divergence of sex-related genes in sexual and hermaphroditic species of Caenorhabditis</p>
            </title>
            <aug>
               <au>
                  <snm>Artieri</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Haerty</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2008</pubdate>
            <volume>25</volume>
            <issue>5</issue>
            <fpage>972</fpage>
            <lpage>979</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msn046</pubid>
                  <pubid idtype="pmpid" link="fulltext">18281268</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Nuclear hormone receptors and gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Aranda</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pascual</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Physiol Rev</source>
            <pubdate>2001</pubdate>
            <volume>81</volume>
            <issue>3</issue>
            <fpage>1269</fpage>
            <lpage>1304</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11427696</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Explosive lineage-specific expansion of the orphan nuclear receptor HNF4 in nematodes</p>
            </title>
            <aug>
               <au>
                  <snm>Robinson-Rechavi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Maina</snm>
                  <fnm>CV</fnm>
               </au>
               <au>
                  <snm>Gissendanner</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Laudet</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Sluder</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2005</pubdate>
            <volume>60</volume>
            <issue>5</issue>
            <fpage>577</fpage>
            <lpage>586</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-004-0175-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">15983867</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Modulating Hox gene functions during animal body patterning</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Lemons</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McGinnis</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>12</issue>
            <fpage>893</fpage>
            <lpage>904</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16341070</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Insight into transcription factor gene duplication from <it>Caenorhabditis elegans</it> Promoterome-driven expression patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Reece-Hoyes</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Shingles</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dupuy</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Grove</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Walhout</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Vidal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hope</snm>
                  <fnm>IA</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>27</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1785375</pubid>
                  <pubid idtype="pmpid" link="fulltext">17244357</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-8-27</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Chromosomal clustering of muscle-expressed genes in <it>Caenorhabditis elegans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Roy</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>418</volume>
            <issue>6901</issue>
            <fpage>975</fpage>
            <lpage>979</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12214599</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Clustered organization of reproductive genes in the <it>C. elegans</it> genome</p>
            </title>
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Cutter</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Greenstein</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>14</issue>
            <fpage>1284</fpage>
            <lpage>1290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2004.07.025</pubid>
                  <pubid idtype="pmpid" link="fulltext">15268860</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Large clusters of co-expressed genes in the Drosophila genome</p>
            </title>
            <aug>
               <au>
                  <snm>Boutanaev</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Kalmykova</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Shevelyov</snm>
                  <fnm>YY</fnm>
               </au>
               <au>
                  <snm>Nurminsky</snm>
                  <fnm>DI</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <issue>6916</issue>
            <fpage>666</fpage>
            <lpage>669</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01216</pubid>
                  <pubid idtype="pmpid" link="fulltext">12478293</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Evidence for large domains of similarly expressed genes in the Drosophila genome</p>
            </title>
            <aug>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>J Biol</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <issue>1</issue>
            <fpage>5</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117248</pubid>
                  <pubid idtype="pmpid" link="fulltext">12144710</pubid>
                  <pubid idtype="doi">10.1186/1475-4924-1-5</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Clustering of housekeeping genes provides a unified model of gene order in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Urrutia</snm>
                  <fnm>AO</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <issue>2</issue>
            <fpage>180</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng887</pubid>
                  <pubid idtype="pmpid" link="fulltext">11992122</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The BioGRID</p>
            </title>
            <url>http://www.thebiogrid.org</url>
         </bibl>
         <bibl id="B38">
            <title>
               <p>BioGRID: a general repository for interaction datasets</p>
            </title>
            <aug>
               <au>
                  <snm>Stark</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Reguly</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Boucher</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tyers</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>Database issue</issue>
            <fpage>D535</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347471</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381927</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj109</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>lin-35 and lin-53, two genes that antagonize a <it>C. elegans</it> Ras pathway, encode proteins similar to Rb and its binding protein RbAp48</p>
            </title>
            <aug>
               <au>
                  <snm>Lu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Horvitz</snm>
                  <fnm>HR</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <issue>7</issue>
            <fpage>981</fpage>
            <lpage>991</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)81722-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">9875852</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>RB1 gene mutations in retinoblastoma</p>
            </title>
            <aug>
               <au>
                  <snm>Lohmann</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Hum Mutat</source>
            <pubdate>1999</pubdate>
            <volume>14</volume>
            <issue>4</issue>
            <fpage>283</fpage>
            <lpage>288</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1098-1004(199910)14:4&lt;283::AID-HUMU2>3.0.CO;2-J</pubid>
                  <pubid idtype="pmpid" link="fulltext">10502774</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>The <it>C. elegans</it> eyes absent ortholog EYA-1 is required for tissue differentiation and plays partially redundant roles with PAX-6</p>
            </title>
            <aug>
               <au>
                  <snm>Furuya</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Qadota</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Chisholm</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Sugimoto</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Dev Biol</source>
            <pubdate>2005</pubdate>
            <volume>286</volume>
            <issue>2</issue>
            <fpage>452</fpage>
            <lpage>463</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ydbio.2005.08.011</pubid>
                  <pubid idtype="pmpid" link="fulltext">16154558</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Genome-wide RNAi of <it>C. elegans</it> using the hypersensitive rrf-3 strain reveals novel gene functions</p>
            </title>
            <aug>
               <au>
                  <snm>Simmer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Moorman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>van der Linden</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Kuijk</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>van den Berghe</snm>
                  <fnm>PV</fnm>
               </au>
               <au>
                  <snm>Kamath</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Ahringer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Plasterk</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <issue>1</issue>
            <fpage>77</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0000012</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Full-genome RNAi profiling of early embryogenesis in <it>Caenorhabditis elegans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Sonnichsen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koski</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Walsh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marschall</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Neumann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brehm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Alleaume</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Artelt</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bettencourt</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cassin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hewitson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Holz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lazik</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nitzsche</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ruer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stamford</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Winzi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Heinkel</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Roder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Finell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hantsch</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Piano</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gunsalus</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Oegema</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gonczy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Coulson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hyman</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Echeverri</snm>
                  <fnm>CJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>434</volume>
            <issue>7032</issue>
            <fpage>462</fpage>
            <lpage>469</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03353</pubid>
                  <pubid idtype="pmpid" link="fulltext">15791247</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Systematic functional analysis of the <it>Caenorhabditis elegans</it> genome using RNAi</p>
            </title>
            <aug>
               <au>
                  <snm>Kamath</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Poulin</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gotta</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kanapin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Le Bot</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Moreno</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sohrmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Welchman</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Zipperlen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ahringer</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <issue>6920</issue>
            <fpage>231</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01278</pubid>
                  <pubid idtype="pmpid" link="fulltext">12529635</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Mutations of a human homologue of the Drosophila eyes absent gene (EYA1) detected in patients with congenital cataracts and ocular anterior segment anomalies</p>
            </title>
            <aug>
               <au>
                  <snm>Azuma</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hirakiyama</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Asaka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2000</pubdate>
            <volume>9</volume>
            <issue>3</issue>
            <fpage>363</fpage>
            <lpage>366</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/9.3.363</pubid>
                  <pubid idtype="pmpid" link="fulltext">10655545</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Eya1 and Eya2 proteins are required for hypaxial somitic myogenesis in the mouse embryo</p>
            </title>
            <aug>
               <au>
                  <snm>Grifone</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Demignon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Giordani</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Niro</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Souil</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bertin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Laclef</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>PX</fnm>
               </au>
               <au>
                  <snm>Maire</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Dev Biol</source>
            <pubdate>2007</pubdate>
            <volume>302</volume>
            <issue>2</issue>
            <fpage>602</fpage>
            <lpage>616</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ydbio.2006.08.059</pubid>
                  <pubid idtype="pmpid" link="fulltext">17098221</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>DPC4, a candidate tumor suppressor gene at human chromosome 18q21.1</p>
            </title>
            <aug>
               <au>
                  <snm>Hahn</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Schutte</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hoque</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Moskaluk</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>da Costa</snm>
                  <fnm>LT</fnm>
               </au>
               <au>
                  <snm>Rozenblum</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Weinstein</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yeo</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Hruban</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Kern</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1996</pubdate>
            <volume>271</volume>
            <issue>5247</issue>
            <fpage>350</fpage>
            <lpage>353</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.271.5247.350</pubid>
                  <pubid idtype="pmpid" link="fulltext">8553070</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Higher frequency of Smad4 gene mutation in human colorectal cancer with distant metastasis</p>
            </title>
            <aug>
               <au>
                  <snm>Miyaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Iijima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Konishi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ishii</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yasuno</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hishima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Koike</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shitara</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Iwama</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Utsunomiya</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuroki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Oncogene</source>
            <pubdate>1999</pubdate>
            <volume>18</volume>
            <issue>20</issue>
            <fpage>3098</fpage>
            <lpage>3103</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.onc.1202642</pubid>
                  <pubid idtype="pmpid" link="fulltext">10340381</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Genetics of adenocarcinomas of the small intestine: frequent deletions at chromosome 18q and mutations of the SMAD4 gene</p>
            </title>
            <aug>
               <au>
                  <snm>Blaker</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>von Herbay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Penzel</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Otto</snm>
                  <fnm>HF</fnm>
               </au>
            </aug>
            <source>Oncogene</source>
            <pubdate>2002</pubdate>
            <volume>21</volume>
            <issue>1</issue>
            <fpage>158</fpage>
            <lpage>164</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.onc.1205041</pubid>
                  <pubid idtype="pmpid" link="fulltext">11791187</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Why Do Hubs Tend to Be Essential in Protein Networks?</p>
            </title>
            <aug>
               <au>
                  <snm>He</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>PLoS Genetics</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <issue>6</issue>
            <fpage>e88</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1473040</pubid>
                  <pubid idtype="pmpid" link="fulltext">16751849</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0020088</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Comparative Genomics of Centrality and Essentiality in Three Eukaryotic Protein-Interaction Networks</p>
            </title>
            <aug>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Kern</snm>
                  <fnm>AD</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>4</issue>
            <fpage>803</fpage>
            <lpage>806</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi072</pubid>
                  <pubid idtype="pmpid" link="fulltext">15616139</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Comprehensive analysis of animal TALE homeobox genes: new conserved motifs and cases of accelerated evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Mukherjee</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Burglin</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2007</pubdate>
            <volume>65</volume>
            <issue>2</issue>
            <fpage>137</fpage>
            <lpage>153</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-006-0023-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">17665086</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>The nuclear receptor superfamily has undergone extensive proliferation and diversification in nematodes</p>
            </title>
            <aug>
               <au>
                  <snm>Sluder</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Mathews</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Hough</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>VP</fnm>
               </au>
               <au>
                  <snm>Maina</snm>
                  <fnm>CV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <issue>2</issue>
            <fpage>103</fpage>
            <lpage>120</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10022975</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Nuclear hormone receptors in <it>C. elegans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Antebi</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>WormBook</source>
            <pubdate>2006</pubdate>
            <fpage>1</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">18050471</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Nuclear receptors in nematodes: themes and variations</p>
            </title>
            <aug>
               <au>
                  <snm>Sluder</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Maina</snm>
                  <fnm>CV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <issue>4</issue>
            <fpage>206</fpage>
            <lpage>213</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(01)02242-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">11275326</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Analysis of homologous gene clusters in <it>Caenorhabditis elegans</it> reveals striking regional cluster domains</p>
            </title>
            <aug>
               <au>
                  <snm>Thomas</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>172</volume>
            <issue>1</issue>
            <fpage>127</fpage>
            <lpage>143</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456141</pubid>
                  <pubid idtype="pmpid" link="fulltext">16291650</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.040030</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>An abundance of X-linked genes expressed in spermatogonia</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>McCarrey</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>27</volume>
            <issue>4</issue>
            <fpage>422</fpage>
            <lpage>426</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/86927</pubid>
                  <pubid idtype="pmpid" link="fulltext">11279525</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Gene duplication and gene conversion in the <it>Caenorhabditis elegans</it> genome</p>
            </title>
            <aug>
               <au>
                  <snm>Semple</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1999</pubdate>
            <volume>48</volume>
            <issue>5</issue>
            <fpage>555</fpage>
            <lpage>564</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006498</pubid>
                  <pubid idtype="pmpid" link="fulltext">10198121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>The structure and early evolution of recently arisen gene duplicates in the <it>Caenorhabditis elegans</it> genome</p>
            </title>
            <aug>
               <au>
                  <snm>Katju</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>165</volume>
            <issue>4</issue>
            <fpage>1793</fpage>
            <lpage>1803</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462873</pubid>
                  <pubid idtype="pmpid" link="fulltext">14704166</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>synMuv verite--Myb comes into focus</p>
            </title>
            <aug>
               <au>
                  <snm>Lipsick</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <issue>23</issue>
            <fpage>2837</fpage>
            <lpage>2844</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.1274804</pubid>
                  <pubid idtype="pmpid" link="fulltext">15574590</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Some <it>C. elegans</it> class B synthetic multivulva proteins encode a conserved LIN-35 Rb-containing complex distinct from a NuRD-like complex</p>
            </title>
            <aug>
               <au>
                  <snm>Harrison</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Ceol</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Horvitz</snm>
                  <fnm>HR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>45</issue>
            <fpage>16782</fpage>
            <lpage>16787</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1636532</pubid>
                  <pubid idtype="pmpid" link="fulltext">17075059</pubid>
                  <pubid idtype="doi">10.1073/pnas.0608461103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>A simple dependence between protein evolution rate and the number of protein-protein interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Wall</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>11</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">166126</pubid>
                  <pubid idtype="pmpid" link="fulltext">12769820</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-3-11</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Evolutionary rate in the protein interaction network</p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Steinmetz</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Scharfe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <issue>5568</issue>
            <fpage>750</fpage>
            <lpage>752</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1068696</pubid>
                  <pubid idtype="pmpid" link="fulltext">11976460</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Evolution of proteins and gene expression levels are coupled in Drosophila and are independently associated with mRNA abundance, protein length, and number of protein-protein interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Lemos</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bettencourt</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Meiklejohn</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Hartl</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>5</issue>
            <fpage>1345</fpage>
            <lpage>1354</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi122</pubid>
                  <pubid idtype="pmpid" link="fulltext">15746013</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Profile hidden Markov models</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>9</issue>
            <fpage>755</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.755</pubid>
                  <pubid idtype="pmpid" link="fulltext">9918945</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Godzik</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>13</issue>
            <fpage>1658</fpage>
            <lpage>1659</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl158</pubid>
                  <pubid idtype="pmpid" link="fulltext">16731699</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure</p>
            </title>
            <aug>
               <au>
                  <snm>Gough</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Karplus</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hughey</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chothia</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>313</volume>
            <issue>4</issue>
            <fpage>903</fpage>
            <lpage>919</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2001.5080</pubid>
                  <pubid idtype="pmpid" link="fulltext">11697912</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Pfam</p>
            </title>
            <url>http://pfam.sanger.ac.uk</url>
         </bibl>
         <bibl id="B69">
            <title>
               <p>The InParanoid database</p>
            </title>
            <fpage>Eukaryotic ortholog groups</fpage>
            <url>http://inparanoid.sbc.su.se/cgi-bin/index.cgi</url>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Interpro</p>
            </title>
            <url>http://www.ebi.ac.uk/interpro</url>
         </bibl>
         <bibl id="B71">
            <title>
               <p>DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Morgenstern</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <issue>3</issue>
            <fpage>211</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.3.211</pubid>
                  <pubid idtype="pmpid" link="fulltext">10222408</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2003</pubdate>
            <volume>52</volume>
            <issue>5</issue>
            <fpage>696</fpage>
            <lpage>704</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150390235520</pubid>
                  <pubid idtype="pmpid">14530136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>The rapid generation of mutation data matrices from protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1992</pubdate>
            <volume>8</volume>
            <issue>3</issue>
            <fpage>275</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1633570</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>The PHYLIP package</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <url>http://evolution.genetics.washington.edu/phylip.html</url>
         </bibl>
         <bibl id="B75">
            <title>
               <p>DIALIGN: multiple DNA and protein sequence alignment at BiBiServ</p>
            </title>
            <aug>
               <au>
                  <snm>Morgenstern</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Web Server issue</issue>
            <fpage>W33</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441511</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215344</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh373</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>RevTrans: Multiple alignment of coding DNA from aligned amino acid sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Wernersson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3537</fpage>
            <lpage>3539</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169015</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824361</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg609</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <issue>6</issue>
            <fpage>908</fpage>
            <lpage>917</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12032247</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Flybase</p>
            </title>
            <url>http://www.flybase.org</url>
         </bibl>
         <bibl id="B79">
            <title>
               <p>OMIM - Online Mendelian Inheritance in Man</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/sites/entrez?db=omim</url>
         </bibl>
         <bibl id="B80">
            <title>
               <p>The NCBI PubMed</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/sites/entrez?db=PubMed</url>
         </bibl>
         <bibl id="B81">
            <title>
               <p>Cytoscape: a software environment for integrated models of biomolecular interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Shannon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Markiel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ozier</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Baliga</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Ramage</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Amin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Schwikowski</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>11</issue>
            <fpage>2498</fpage>
            <lpage>2504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403769</pubid>
                  <pubid idtype="pmpid" link="fulltext">14597658</pubid>
                  <pubid idtype="doi">10.1101/gr.1239303</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

