<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-9-r134</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Conserved chromosomal clustering of genes governed by chromatin regulators in <it>Drosophila</it></p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Blanco</snm>
               <fnm>Enrique</fnm>
               <insr iid="I1"/>
               <email>eblanco@ub.edu</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Pignatelli</snm>
               <fnm>Miguel</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>miguel.pignatelli@uv.es</email>
            </au>
            <au id="A3">
               <snm>Beltran</snm>
               <fnm>Sergi</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>beltran@ub.edu</email>
            </au>
            <au id="A4">
               <snm>Punset</snm>
               <fnm>Adri&#224;</fnm>
               <insr iid="I1"/>
               <email>apunset@ub.edu</email>
            </au>
            <au id="A5">
               <snm>P&#233;rez-Lluch</snm>
               <fnm>Silvia</fnm>
               <insr iid="I1"/>
               <email>silviaperez@ub.edu</email>
            </au>
            <au id="A6">
               <snm>Serras</snm>
               <fnm>Florenci</fnm>
               <insr iid="I1"/>
               <email>fserras@ub.edu</email>
            </au>
            <au id="A7">
               <snm>Guig&#243;</snm>
               <fnm>Roderic</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>roderic.guigo@crg.es</email>
            </au>
            <au id="A8" ca="yes">
               <snm>Corominas</snm>
               <fnm>Montserrat</fnm>
               <insr iid="I1"/>
               <email>mcorominas@ub.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Departament de Gen&#232;tica and Institut de Biomedicina de la Universitat de Barcelona (IBUB), Universitat de Barcelona, Diagonal 645, 08028 Barcelona, Catalonia, Spain</p>
            </ins>
            <ins id="I2">
               <p>Centre de Regulaci&#243; Gen&#242;mica, Parc de Recerca Biom&#232;dica de Barcelona, Dr. Aiguader 88, 08003 Barcelona, Catalonia, Spain</p>
            </ins>
            <ins id="I3">
               <p>Grup de Recerca en Inform&#224;tica Biom&#232;dica, Institut Municipal d'Investigaci&#243; M&#232;dica - Universitat Pompeu Fabra Barcelona, Catalonia, Spain</p>
            </ins>
            <ins id="I4">
               <p>Current address: Instituto Cavanilles of Biodiversity and Evolutionary Biology, University of Valencia, Apdo 22085, 46071 Valencia, Spain and CIBER of Epidemiology and Public Health (CIBERESP)</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>9</issue>
         <fpage>R134</fpage>
         <url>http://genomebiology.com/2008/9/9/R134</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18783608</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-9-r134</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>1</day>
               <month>8</month>
               <year>2008</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>4</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>10</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>10</day>
               <month>09</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Blanco et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Chromosomal clustering of genes in <it>Drosophila</it></p>
      </shorttitle>
      <shortabs>
         <p>Transcriptional analysis of chromatin regulator mutants in <it>Drosophila melanogaster</it> identified clusters of functionally related genes conserved in other insect species.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The trithorax group (trxG) and Polycomb group (PcG) proteins are responsible for the maintenance of stable transcriptional patterns of many developmental regulators. They bind to specific regions of DNA and direct the post-translational modifications of histones, playing a role in the dynamics of chromatin structure.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have performed genome-wide expression studies of <it>trx </it>and <it>ash2 </it>mutants in <it>Drosophila melanogaster</it>. Using computational analysis of our microarray data, we have identified 25 clusters of genes potentially regulated by TRX. Most of these clusters consist of genes that encode structural proteins involved in cuticle formation. This organization appears to be a distinctive feature of the regulatory networks of TRX and other chromatin regulators, since we have observed the same arrangement in clusters after experiments performed with ASH2, as well as in experiments performed by others with NURF, dMyc, and ASH1. We have also found many of these clusters to be significantly conserved in <it>D. simulans</it>, <it>D. yakuba</it>, <it>D. pseudoobscura </it>and partially in <it>Anopheles gambiae</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The analysis of genes governed by chromatin regulators has led to the identification of clusters of functionally related genes conserved in other insect species, suggesting this chromosomal organization is biologically important. Moreover, our results indicate that TRX and other chromatin regulators may act globally on chromatin domains that contain transcriptionally co-regulated genes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010005">Development</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010015">Model organisms</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Differential gene expression is essential to the cellular diversity required for adequate pattern formation and organogenesis during the first stages of development in multicellular organisms. Thereafter, epigenetic regulatory systems must ensure the maintenance of these gene expression patterns to preserve cell identity in adulthood <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Regulation of transcription is, therefore, crucial to proper temporal and spatial gene expression throughout development. The complex transcriptional regulatory code that governs the different gene expression programs of an organism involves many different actors, such as transcription factors, regulatory sequences in the genome, chromatin structure and modification states <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Chromatin packaging plays a central role during gene transcription by controlling the access of the RNA polymerase II transcriptional machinery and other gene regulatory elements (such as transcription factors) to the promoter region of the genes <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. The dynamics of chromatin structure is controlled through multiple mechanisms, such as nucleosome positioning, chromatin remodeling and histone post-translational modifications <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>Gene regulation can occur in the genome at distinct levels of organization: individual genes, chromosomal domains and entire chromosomes <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Thus, a set of transcriptionally active genes and the regulatory elements necessary for their correct expression are generally associated with open chromatin domains, while silent genes are embedded in more compact chromatin regions <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The main effect of such domains on genome organization is observed in the non-random distribution of genes in a genome, which can favor coordinated gene expression. In fact, the interplay of genome rearrangements, gene expression mechanisms and evolutionary forces could explain the complex landscape of gene regulation <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>Since the publication of the sequence of many eukaryotic genomes <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>, several whole-genome studies about genome organization have established the existence of clusters of co-expressed genes, in some cases functionally related (see <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> for a comprehensive review). Examples have been found in many species such as yeast <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>, worm <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> or human <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. In <it>D. melanogaster</it>, the presence of clusters has been studied by several groups. Ueda <it>et al</it>. <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> found that genes controlling circadian rhythms tend to be grouped in local clusters on chromosomes, suggesting this is due to higher order chromatin structures. Spellman and Rubin <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> analyzed the chromosomal position of gene expression profiles from 88 different experimental conditions and found that over 20% of all genes were clustered into co-regulated groups of 10-30 genes of unrelated function. Boutanaev <it>et al</it>. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> identified 1,661 testes-specific genes, one-third of which were clustered on chromosomes in groups of three or more genes. The effect of chromatin structure on a particular cluster of five genes in the previous screening <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> was successfully validated by Kalmykova <it>et al</it>. <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Belyakin <it>et al</it>. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> reported 1,036 genes that are arranged in clusters located in 52 underreplication regions of the larval salivary gland polytene chromosomes.</p>
         <p>Epigenetic regulation of gene expression is necessary for the correct deployment of developmental programs and for the maintenance of cell fates. The Polycomb and Trithorax epigenetic system, initially discovered in <it>D. melanogaster</it>, is responsible for the maintenance of gene expression throughout late development and adulthood. Polycomb group (PcG) proteins are required to prevent inappropriate expression of homeotic genes, while trithorax group (trxG) proteins seem to work antagonistically as anti-repressors. Recent studies have identified and characterized several multiprotein complexes containing these transcriptional regulators. They control transcription through multistep mechanisms that involve histone modification, chromatin remodeling, and interaction with general transcription factors. In flies, PcG and trxG complexes are recruited to certain regulatory sequence response elements of the genome denominated PRE/TREs (see <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp> for a review on trxG and PcG proteins).</p>
         <p>Systematic examination of gene expression patterns using microarrays can provide a global picture of the distinct regulatory networks of different genomes <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. In particular, several genome-wide expression experiments involving members of trxG have recently been published <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. Trithorax (<it>trx</it>), the first isolated member of the trxG, is required throughout embryonic and larval development for the correct differentiation in the adult <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. The <it>trx </it>gene encodes a histone methyltransferase that can modify lysine 4 of histone 3 (H3K4). This methylation is an epigenetic mark associated with transcriptionally active genes <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. In the work presented here we have combined the expression profiles obtained from microarray experiments with exhaustive bioinformatic analyses that include gene clustering, comparative genomics and functional annotation to gain insight into the role of trxG proteins. Our results show the existence of evolutionarily conserved chromosomal clusters with most of the genes being also regulated by other chromatin regulators, and functionally annotated as components of the cuticle.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Whole-genome expression analysis of <it>trx </it>mutants</p>
            </st>
            <p>In order to investigate the molecular signature of the <it>trx </it>mutants in <it>Drosophila melanogaster</it>, we have compared whole-genome expression profiles of <it>trx </it>mutant third instar larvae and wild-type larvae (see Materials and methods). We designed two-color cDNA microarrays containing 12,120 genes annotated in RefSeq from <it>D. melanogaster </it><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. The analysis of the microarray experiments identified 535 genes showing a statistically significant change (at least 2-fold change, <it>p</it>-value &lt;0.05) in expression between mutant and wild-type samples (see Materials and methods). Of these, 260 were over-expressed and 275 were under-expressed in mutant larvae (Additional data file 1).</p>
            <p>We mapped these deregulated genes to the fly genome (assembly dm2, April 2004) using the RefSeq <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> track of the UCSC genome browser <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, and the chromosomal distribution is shown in Table <tblr tid="T1">1</tblr>. The number of RefSeq genes annotated on each chromosome is also displayed. We mapped more co-expressed genes on chromosome 3L than on any other chromosome (30% of 535 deregulated genes; Table <tblr tid="T1">1</tblr>): 69 up-regulated genes (<it>p</it>-value &lt;10<sup>-2</sup>) and 94 down-regulated genes (<it>p</it>-value &lt;10<sup>-8</sup>). Chromosomes 2R and 3R are, however, richer in number of annotated genes (3,993 and 4,843 genes respectively, compared to 3,775 genes in chromosome 3L in Table <tblr tid="T1">1</tblr>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Genome distribution of genes and clusters deregulated in <it>trx </it>mutants</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c ca="left">
                        <p>Chromosome</p>
                     </c>
                     <c ca="right">
                        <p>Length</p>
                     </c>
                     <c ca="right">
                        <p>Genes</p>
                     </c>
                     <c ca="right">
                        <p>TRX &#8593;</p>
                     </c>
                     <c ca="right">
                        <p>TRX &#8595;</p>
                     </c>
                     <c ca="right">
                        <p>TRX &#8593;+&#8595;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters &#8593;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters &#8595;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters &#8593;+&#8595;</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2L</p>
                     </c>
                     <c ca="right">
                        <p>22,855,998</p>
                     </c>
                     <c ca="right">
                        <p>3,594</p>
                     </c>
                     <c ca="right">
                        <p>39</p>
                     </c>
                     <c ca="right">
                        <p>26</p>
                     </c>
                     <c ca="right">
                        <p>65</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2R</p>
                     </c>
                     <c ca="right">
                        <p>21,182,128</p>
                     </c>
                     <c ca="right">
                        <p>3,993</p>
                     </c>
                     <c ca="right">
                        <p>54</p>
                     </c>
                     <c ca="right">
                        <p>51</p>
                     </c>
                     <c ca="right">
                        <p>105</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3L</p>
                     </c>
                     <c ca="right">
                        <p>24,247,342</p>
                     </c>
                     <c ca="right">
                        <p>3,775</p>
                     </c>
                     <c ca="right">
                        <p>69</p>
                     </c>
                     <c ca="right">
                        <p>94</p>
                     </c>
                     <c ca="right">
                        <p>163</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3R</p>
                     </c>
                     <c ca="right">
                        <p>28,463,162</p>
                     </c>
                     <c ca="right">
                        <p>4,843</p>
                     </c>
                     <c ca="right">
                        <p>59</p>
                     </c>
                     <c ca="right">
                        <p>71</p>
                     </c>
                     <c ca="right">
                        <p>130</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>X</p>
                     </c>
                     <c ca="right">
                        <p>22,668,884</p>
                     </c>
                     <c ca="right">
                        <p>3,238</p>
                     </c>
                     <c ca="right">
                        <p>39</p>
                     </c>
                     <c ca="right">
                        <p>32</p>
                     </c>
                     <c ca="right">
                        <p>71</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="right">
                        <p>1,307,279</p>
                     </c>
                     <c ca="right">
                        <p>227</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="right">
                        <p>120,724,793</p>
                     </c>
                     <c ca="right">
                        <p>19,670</p>
                     </c>
                     <c ca="right">
                        <p>260</p>
                     </c>
                     <c ca="right">
                        <p>275</p>
                     </c>
                     <c ca="right">
                        <p>535</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                     <c ca="right">
                        <p>25</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The following information is displayed for each chromosome from <it>D. melanogaster</it>: length, number of genes, number of up-regulated genes, number of down-regulated genes, total number of deregulated genes in the microarray, number of up-regulated clusters, number of down-regulated clusters, and total number of clusters.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Chromosomal clustering of genes deregulated in <it>trx </it>mutants</p>
            </st>
            <p>Since chromatin modifications are typically associated with the coordinated expression of groups of nearby genes <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and the analysis of different transcriptome datasets has shown that genes with a similar expression pattern are frequently located next to one another in the linear genome <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B39">39</abbr></abbrgrp>, our next step was to determine whether deregulated genes in our <it>trx </it>mutants are located in close proximity in the fly genome (chromosomal clusters). There are many possible definitions of what a cluster of genes is (see <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> for a review). Here, we define a cluster as a group of genes located close to each other on the same chromosome in the genome, but not necessarily adjacent, that showed the same expression pattern (up-regulation or down-regulation) in the microarray experiment (see Materials and methods).</p>
            <p>Chromosomal clusters can be identified computationally <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B40">40</abbr></abbrgrp>. We detected 97 genes, organized in 25 genomic clusters, that are deregulated in <it>trx </it>deficient larvae (10 clusters of up-regulated genes and 15 clusters of down-regulated genes; Table <tblr tid="T1">1</tblr>), using the program REEF <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> with the following parameters: window length, 25,000 bp; window step, 1,000 bp; minimal number of co-expressed genes, 3; q-value &#8804;0.05. The chromosomal distribution of clusters and genes along the genome of <it>D. melanogaster </it>is shown in Figure <figr fid="F1">1</figr> (up-regulated genes are depicted in red, down-regulated genes in green; the genomic position of each cluster is represented with the corresponding red or green triangle and each cluster is labeled with the same identifier used in Table <tblr tid="T2">2</tblr>). Clusters of genes deregulated in <it>trx </it>mutant larvae are not uniformly distributed along the genome: 15 out of 25 clusters (60%) are located on chromosome 3L (Table <tblr tid="T1">1</tblr>). Remarkably, the proportion of genes in clusters increases dramatically in chromosome 3L: 62 genes out of 163 deregulated genes mapped to this chromosome are clustered (38%), as opposed to only 35 genes out of 372 deregulated genes mapped to the other chromosomes (9%) (Additional data file 2).</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Clusters of genes deregulated in <it>trx </it>mutants</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>ID</p>
                     </c>
                     <c ca="center">
                        <p>Chromosome</p>
                     </c>
                     <c ca="center">
                        <p>Start</p>
                     </c>
                     <c ca="center">
                        <p>End</p>
                     </c>
                     <c ca="center">
                        <p>Regulation</p>
                     </c>
                     <c ca="left">
                        <p>Deregulated genes</p>
                     </c>
                     <c ca="left">
                        <p>No deregulated genes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>2L</p>
                     </c>
                     <c ca="center">
                        <p>7,740,552</p>
                     </c>
                     <c ca="center">
                        <p>7,753,160</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p><it>Acp1</it>, CG7214, CG7203</p>
                     </c>
                     <c ca="left">
                        <p>CG7211</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>2R</p>
                     </c>
                     <c ca="center">
                        <p>6,757,647</p>
                     </c>
                     <c ca="center">
                        <p>6,771,890</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p>CG9080, CG30029, CG7738, CG13224</p>
                     </c>
                     <c ca="left">
                        <p>CG13226, <it>Or47a</it>, CG9079</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>2R</p>
                     </c>
                     <c ca="center">
                        <p>7,906,941</p>
                     </c>
                     <c ca="center">
                        <p>7,941,371</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG8836, CG8505, CG8510, CG8511, CG8520</p>
                     </c>
                     <c ca="left">
                        <p><it>Or49a</it>, CG30048, CG30050, CG33626, CG33627, CG8515, CG13157, CG8834</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>2R</p>
                     </c>
                     <c ca="center">
                        <p>12,673,194</p>
                     </c>
                     <c ca="center">
                        <p>12,681,364</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG30458, CG30457, CG10953</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>2R</p>
                     </c>
                     <c ca="center">
                        <p>13,899,427</p>
                     </c>
                     <c ca="center">
                        <p>13,905,472</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p>CG18107, CG16836, CG15068</p>
                     </c>
                     <c ca="left">
                        <p>CG15067, <it>IM2</it>, <it>IM3</it>, CG15065</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>1,190,902</p>
                     </c>
                     <c ca="center">
                        <p>1,211,770</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p><it>LysB</it>, <it>LysC</it>, <it>LysD</it>, <it>LysP</it>, <it>LysS</it></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>LysE</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>1,286,480</p>
                     </c>
                     <c ca="center">
                        <p>1,296,909</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG9149, CG2469, CG9186</p>
                     </c>
                     <c ca="left">
                        <p>CG2277</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>4,429,174</p>
                     </c>
                     <c ca="center">
                        <p>4,447,704</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG12607, CG11345, CG32241</p>
                     </c>
                     <c ca="left">
                        <p>CG15022, CG15023, CG15024</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>6,097,832</p>
                     </c>
                     <c ca="center">
                        <p>6,125,586</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>l(3)mbn, CG18779, CG18778, <it>Lcp65Ag2</it>, <it>Lcp65Ae</it>, CG32405, C32404, <it>Lcp65Ac</it>, <it>Lcp65Ab2</it>, <it>Lcp65Aa</it></p>
                     </c>
                     <c ca="left">
                        <p><it>Lcp65Ag1</it>, <it>Lcp65Af</it>, <it>Lcp65Ad</it>, <it>Lcp65Ab1</it>, CG18777</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>8,189,385</p>
                     </c>
                     <c ca="center">
                        <p>8,196,898</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG8012, CG13674, CG13678</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>9,350,364</p>
                     </c>
                     <c ca="center">
                        <p>9,359,229</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p><it>Hsp26</it>, <it>Hsp23</it>, <it>Hsp27</it></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Hsp67Ba</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>11,103,295</p>
                     </c>
                     <c ca="center">
                        <p>11,109,286</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG7628, CG32074, CG14143</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>nol</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>11,482,913</p>
                     </c>
                     <c ca="center">
                        <p>11,487,349</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p><it>Sgs8</it>, <it>Sgs7</it>, <it>Sgs3</it></p>
                     </c>
                     <c ca="left">
                        <p>CG33272</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>11,917,595</p>
                     </c>
                     <c ca="center">
                        <p>11,926,172</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p>CG5883, CG7252, CG17826</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>15,017,322</p>
                     </c>
                     <c ca="center">
                        <p>15,026,980</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p>CG13461, CG18649, CG13460</p>
                     </c>
                     <c ca="left">
                        <p>CG13463</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>16,230,688</p>
                     </c>
                     <c ca="center">
                        <p>16,260,799</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG13069, CG13068, CG13067, CG13047, CG4962</p>
                     </c>
                     <c ca="left">
                        <p>CG4950, CG13066, CG13065, CG13050, CG13064, CG13049, CG13048, CG13046, CG13045</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>16,266,728</p>
                     </c>
                     <c ca="center">
                        <p>16,289,521</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG4982, CG13063, CG13042, CG13041, CG13060, CG13059</p>
                     </c>
                     <c ca="left">
                        <p>CG13044, CG13043, CG32160, CG13062, <it>Nplp3</it></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>20,138,463</p>
                     </c>
                     <c ca="center">
                        <p>20,154,238</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p>CG7290, CG7017, CG6933</p>
                     </c>
                     <c ca="left">
                        <p>CG6996, CG32224</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>21,226,060</p>
                     </c>
                     <c ca="center">
                        <p>21,235,012</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG11310, <it>Edg78E</it>, CG7658</p>
                     </c>
                     <c ca="left">
                        <p>CG7663</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>3L</p>
                     </c>
                     <c ca="center">
                        <p>21,664,564</p>
                     </c>
                     <c ca="center">
                        <p>21,691,822</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG14569, CG14568, CG14566, CG14572, CG14565, CG14564</p>
                     </c>
                     <c ca="left">
                        <p>CG14573, CG14567, <it>Syn1</it></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="center">
                        <p>3R</p>
                     </c>
                     <c ca="center">
                        <p>2,512,449</p>
                     </c>
                     <c ca="center">
                        <p>2,530,625</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p><it>Ccp84Ag</it>, <it>Ccp84Ad</it>, <it>Ccp84Ab</it>, <it>Ccp84Aa</it></p>
                     </c>
                     <c ca="left">
                        <p><it>Ccp84Af</it>, <it>Ccp84Ae</it>, <it>Ccp84Ac</it></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>3R</p>
                     </c>
                     <c ca="center">
                        <p>10,386,703</p>
                     </c>
                     <c ca="center">
                        <p>10,392,190</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p>CG14850, CG8087, CG14852</p>
                     </c>
                     <c ca="left">
                        <p>CG14851</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>3R</p>
                     </c>
                     <c ca="center">
                        <p>14,551,120</p>
                     </c>
                     <c ca="center">
                        <p>14,558,756</p>
                     </c>
                     <c ca="center">
                        <p>&#8593;</p>
                     </c>
                     <c ca="left">
                        <p>CG7714, CG7715, CG14302</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>3R</p>
                     </c>
                     <c ca="center">
                        <p>22,444,844</p>
                     </c>
                     <c ca="center">
                        <p>22,458,678</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG5468, CG14240, CG6452, CG5476</p>
                     </c>
                     <c ca="left">
                        <p>CG6478, CG6447, CG6460, CG5471</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="center">
                        <p>X</p>
                     </c>
                     <c ca="center">
                        <p>17,040,356</p>
                     </c>
                     <c ca="center">
                        <p>17,065,159</p>
                     </c>
                     <c ca="center">
                        <p>&#8595;</p>
                     </c>
                     <c ca="left">
                        <p>CG32564, CG10598, CG10597</p>
                     </c>
                     <c ca="left">
                        <p>CG32563, CG12995, CG18258, CG5162, CG12998, CG5172, CG12997</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>For each cluster we display: identifier, chromosome, genomic coordinates (start and end), regulation (up or down), co-expressed genes, and no co-expressed genes in the microarray.</p>
               </tblfn>
            </tbl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Genomic map of clusters of genes deregulated in <it>trx </it>mutants</p>
               </caption>
               <text>
                  <p>Genomic map of clusters of genes deregulated in <it>trx </it>mutants. The location of each gene significantly deregulated in the microarray is indicated with a vertical line (up-regulated genes in red, down-regulated genes in green). Genes in the forward strand are displayed above the chromosome line; genes in the reverse strand are displayed below. Clusters of genes are indicated with a triangle in red or green according to their expression. The genome map was produced using the program GFF2PS <abbrgrp><abbr bid="B102">102</abbr></abbrgrp>.</p>
               </text>
               <graphic file="gb-2008-9-9-r134-1"/>
            </fig>
            <p>The clusters reported here contain a total of 162 genes (97 deregulated genes and 65 genes whose change in expression was not significant), comprising in total 372,967 nucleotides, with an average gene density of 4.3 genes per 10 Kb. In contrast, the average gene density in the fruit fly genome is 1.6 genes per 10 Kb. The average length of the genes in clusters is 946 bp, while the length of the deregulated genes that are not clustered is 3,416 bp (the overall average for <it>D. melanogaster </it>is 6,976 bp). Since the REEF program approach is based on genomic proximity measured in number of nucleotides, this could favor artifactual cluster definition in gene-rich regions of the genome. To rule out this possibility, we have designed an alternative clustering algorithm based on measuring the number of co-expressed genes within a window containing a fixed number of annotated genes, rather than a fixed number of nucleotides (see Materials and methods for further details). Results obtained with our clustering strategy are highly concordant with those produced by the REEF program (Additional data file 3): 27 clusters were detected (22 identical clusters, 2 clusters with additional genes, 3 new clusters and 1 missing cluster). Therefore, the high gene density observed in our clusters is not the consequence of any computational limitation in the clustering method. Given the high concordance of the two clustering approaches and since REEF is the more standard approach, we have based our subsequent analysis and experiments on the REEF results (the list of the clusters and the genes that constitute each cluster are shown in Table <tblr tid="T2">2</tblr>).</p>
            <p>As a control test to assess the statistical significance of the clustering, we repeated the analysis on 100 sets of genes that were randomly selected from the fly genome, preserving the gene distribution in the chromosomes that we observed in the set of genes deregulated in <it>trx </it>mutant larvae (see Materials and methods). The number of clusters identified on the random sets was very small (on average 1.7 clusters compared with the 25 clusters observed from the experimental data) despite containing the same proportion of genes on every chromosome (Figure <figr fid="F2">2a</figr>). In addition, we computed the Z-score of the number of clusters observed in our microarray, using the distribution of number of clusters found in the random sets as background distribution (see Materials and methods). This score is highly significant for <it>trx </it>clusters: 17.25 (Additional data file 4). Because of the small size of clustered genes, one could argue that the clustering described here is due to specific properties of short and active genes, and not related to a trxG characteristic. Therefore, we retrieved all small genes of the fly genome (that is, genes with the same range of sizes as the ones found in this work) and repeated the previous test (see Materials and methods). The number of clusters observed in the whole collection of fly small genes was significant: 107 clusters (including 21 of the 25 <it>trx </it>clusters; Z-score 9.75; Additional data file 4). The existence of clusters of small sized and active genes has already been established for many genomes and it is thought that this organization could favor coordinated and efficient gene expression <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. However, the clustering tendency of genes regulated by TRX is stronger as the Z-score for <it>trx </it>clusters (17.25) clearly contrasts with the one measured in the whole fly genome (9.75). As an additional control, we generated 100 random sets of genes preserving the same size distribution observed in up-regulated and down-regulated genes (see Materials and methods). The number of clusters detected in <it>trx </it>deregulated genes is highly significant (10 and 15 clusters, respectively) in comparison to the average number of clusters identified on these random gene sets (0.9 and 1.4 clusters). This is strongly indicative that the clustering tendency observed here is a specific characteristic of TRX regulated genes, and not a general feature of short genes (Additional data file 5).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Specificity controls in the clustering process</p>
               </caption>
               <text>
                  <p>Specificity controls in the clustering process. <b>(a) </b>Statistical significance of clusters. Bar plots representing the number of clusters observed in the set of genes regulated by TRX (up-regulated clusters in red, down-regulated clusters in green) and the number of clusters expected by chance (in white). The number of <it>trx </it>clusters observed in each chromosome was highly significant (Z-score >2). Error bars represent the standard deviation of the random samples. <b>(b) </b>Quantitative RT-PCR of target expression (clusters 4 and 20) in third instar wild-type (WT) and <it>trx </it>mutant larvae. Error bars represent variability between replicates.</p>
               </text>
               <graphic file="gb-2008-9-9-r134-2"/>
            </fig>
            <p>In the analysis presented here, we have used no information about homology between genes within clusters to control for overrepresentation of gene families. Many genomic clusters corresponding to gene families have indeed been previously identified <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. Such genomic clusters could cause spurious co-expression because of probe cross-hybridization between highly similar genes. In fact, some of the clusters that we have computationally identified do contain members of the same gene family (Table <tblr tid="T2">2</tblr>). We have searched for regions of similarity between the sequences of the genes within each cluster but no significant pairwise sequence alignments were found for any cluster (see Materials and methods). Furthermore, we confirmed the reported change in the expression of these genes by quantitative real-time RT-PCR in two clusters (Figure <figr fid="F2">2b</figr>).</p>
            <p>Finally, we used the specific set of 445 genes (302 RefSeq genes) that are basally expressed in larvae described by Arbeitman <it>et al</it>. <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> to measure the specificity of our results (see Materials and methods). We were not able to reproduce in this data set the organization in clusters found in genes regulated by TRX (only one potential cluster was found), indicating that this is not a general feature of the larval stage in <it>D. melanogaster </it>development.</p>
         </sec>
         <sec>
            <st>
               <p>Chromosomal clustering of genes controlled by other chromatin regulators</p>
            </st>
            <p>To determine whether the chromosomal organization in clusters is also common to genes regulated by other proteins involved in chromatin dynamics, we performed a second microarray expression experiment with mutant larvae for another such factor, ASH2, and compared the results obtained in this experiment, as well as previous published results on the transcriptomes of NURF <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, dMyc <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> and ASH1 <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, with the results obtained in the microarray analysis of the <it>trx </it>mutant. In all experiments, deregulated genes have been clustered on the <it>D. melanogaster </it>genome using the REEF program (Additional data file 6).</p>
            <p>The <it>ash2 </it>gene (<it>absent</it>, <it>small</it>, <it>or homeotic discs 2</it>) is another member of the trxG involved in chromatin-mediated maintenance of transcription <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. The microarray analysis identified 244 genes showing a statistically significant change (at least 2-fold change, <it>p</it>-value &lt;0.05) in their expression between mutant and wild-type samples (see Materials and methods). According to their pattern of regulation, we identified 123 over-expressed genes and 121 under-expressed genes in the mutant larvae (Additional data file 7). As in previous studies <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, we found the same proportion of up-regulated and down-regulated genes in the <it>ash2 </it>mutants. We also mapped these genes to the genome of <it>D. melanogaster </it>according to the RefSeq annotations in the UCSC genome browser, and identified eight clusters of co-expressed genes (six clusters of up-regulated genes and two clusters of down-regulated genes) using the program REEF (Table <tblr tid="T3">3</tblr>).</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Clusters of genes regulated by different chromatin regulators</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>Microarray</p>
                     </c>
                     <c ca="right">
                        <p>Genes &#8593;</p>
                     </c>
                     <c ca="right">
                        <p>Genes &#8595;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters &#8593;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters &#8595;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters &#8593;+&#8595;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters 3L</p>
                     </c>
                     <c ca="center">
                        <p>Reference</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trithorax</p>
                     </c>
                     <c ca="right">
                        <p>260</p>
                     </c>
                     <c ca="right">
                        <p>275</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                     <c ca="right">
                        <p>25</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ASH2</p>
                     </c>
                     <c ca="right">
                        <p>123</p>
                     </c>
                     <c ca="right">
                        <p>121</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NURF</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>265</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>
                           <abbrgrp>
                              <abbr bid="B46">46</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>dMyc</p>
                     </c>
                     <c ca="right">
                        <p>203</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>
                           <abbrgrp>
                              <abbr bid="B47">47</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ASH1</p>
                     </c>
                     <c ca="right">
                        <p>239</p>
                     </c>
                     <c ca="right">
                        <p>159</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>8</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>
                           <abbrgrp>
                              <abbr bid="B34">34</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Rovers</p>
                     </c>
                     <c ca="right">
                        <p>127</p>
                     </c>
                     <c ca="right">
                        <p>38</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>
                           <abbrgrp>
                              <abbr bid="B57">57</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sitters</p>
                     </c>
                     <c ca="right">
                        <p>131</p>
                     </c>
                     <c ca="right">
                        <p>112</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>
                           <abbrgrp>
                              <abbr bid="B57">57</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>For each microarray we display: number of up-regulated genes, number of down-regulated genes, number of up-regulated clusters, number of down-regulated clusters, total number of clusters, number of clusters located in chromosome 3L, bibliographical reference.</p>
               </tblfn>
            </tbl>
            <p>NURF is an ISWI-containing ATP-dependent chromatin remodeling complex <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Badenhorst <it>et al</it>. <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> performed a microarray analysis using larvae from <it>D. melanogaster </it>lacking the NURF specific subunit NURF301. We mapped the list of 274 genes (265 RefSeq genes) that require NURF301 according to this experiment (the list of up-regulated genes has not been published) to the genome. We then identified seven clusters of down-regulated genes using the program REEF (Table <tblr tid="T3">3</tblr>).</p>
            <p>Goodliffe <it>et al</it>. <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> reported that the Polycomb protein (Pc), a member of PcG, mediates Myc autorepression and its transcriptional control at many loci. In this study the authors used the Gal4 UAS system to express ectopic <it>dmyc </it>in embryos and performed microarray analysis to examine the effect on gene expression. We mapped the list of 272 genes (203 RefSeq genes) up-regulated in this experiment (the list of down-regulated genes is unavailable) and then identified 6 clusters of co-expressed genes using the program REEF (Table <tblr tid="T3">3</tblr>).</p>
            <p>More recently, Goodliffe <it>et al</it>. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> extended the studies on Myc function and reported a coordinated regulation of Myc trans-activation targets by Pc and ASH1. The <it>ash1 </it>gene (<it>absent</it>, <it>small</it>, <it>or homeotic discs 1</it>) is also a member of the trxG <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. In this work, the authors used RNAi to reduce the levels of <it>ash1 </it>and conducted microarray experiments <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The analysis of these microarrays identified 398 genes with a substantial change in their expression (239 over-expressed RefSeq genes and 159 under-expressed RefSeq genes). We mapped these genes to the fly genome and identified eight clusters of co-expressed genes (seven clusters of up-regulated genes and one cluster of down-regulated genes) using the program REEF (Table <tblr tid="T3">3</tblr>).</p>
            <p>Together, these results suggest that chromosomal organization in clusters is a distinctive feature of some genes controlled by chromatin regulators. To elaborate more on this hypothesis, we compared the clusters identified in the microarray experiments of <it>trx </it>with those identified in the experiments of the other factors at three different levels: common clusters, common genes in clusters and common genes in the transcriptome maps (see Materials and methods for further details). We consider that two clusters from two different microarrays are matching if and only if they are overlapping in at least one commonly deregulated gene. The results of the comparison are shown in Table <tblr tid="T4">4</tblr> and, as an example, the regulatory gene profiles of <it>trx</it>, <it>ash2</it>, <it>Nurf</it>, <it>dmyc </it>and <it>ash1 </it>along the chromosome 3L and the clusters containing these genes are shown in Figure <figr fid="F3">3</figr> (the regions of the chromosome harboring the same cluster at the same time in both the <it>trx </it>experiment and another microarray are indicated with gray).</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Comparison between the clusters identified in different microarrays</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Microarray 1</p>
                     </c>
                     <c ca="left">
                        <p>Microarray 2</p>
                     </c>
                     <c ca="center">
                        <p>Common genes</p>
                     </c>
                     <c ca="center">
                        <p>Common genes in clusters</p>
                     </c>
                     <c ca="center">
                        <p>Common genes in common clusters</p>
                     </c>
                     <c ca="center">
                        <p>Common clusters</p>
                     </c>
                     <c ca="center">
                        <p>Common clusters 3L</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trithorax</p>
                     </c>
                     <c ca="left">
                        <p>ASH2</p>
                     </c>
                     <c ca="center">
                        <p>76 (20%)</p>
                     </c>
                     <c ca="center">
                        <p>17 (27%)</p>
                     </c>
                     <c ca="center">
                        <p>17 (75%)</p>
                     </c>
                     <c ca="center">
                        <p>6 (75%)</p>
                     </c>
                     <c ca="center">
                        <p>4 (100%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trithorax</p>
                     </c>
                     <c ca="left">
                        <p>NURF</p>
                     </c>
                     <c ca="center">
                        <p>55 (14%)</p>
                     </c>
                     <c ca="center">
                        <p>10 (17%)</p>
                     </c>
                     <c ca="center">
                        <p>10 (45%)</p>
                     </c>
                     <c ca="center">
                        <p>4 (57%)</p>
                     </c>
                     <c ca="center">
                        <p>3 (75%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trithorax</p>
                     </c>
                     <c ca="left">
                        <p>dMyc</p>
                     </c>
                     <c ca="center">
                        <p>43 (12%)</p>
                     </c>
                     <c ca="center">
                        <p>13 (20%)</p>
                     </c>
                     <c ca="center">
                        <p>13 (41%)</p>
                     </c>
                     <c ca="center">
                        <p>6 (100%)</p>
                     </c>
                     <c ca="center">
                        <p>4 (100%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trithorax</p>
                     </c>
                     <c ca="left">
                        <p>ASH1</p>
                     </c>
                     <c ca="center">
                        <p>52 (11%)</p>
                     </c>
                     <c ca="center">
                        <p>9 (15%)</p>
                     </c>
                     <c ca="center">
                        <p>9 (38%)</p>
                     </c>
                     <c ca="center">
                        <p>4 (50%)</p>
                     </c>
                     <c ca="center">
                        <p>2 (100%)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Average</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>14%</p>
                     </c>
                     <c ca="center">
                        <p>20%</p>
                     </c>
                     <c ca="center">
                        <p>50%</p>
                     </c>
                     <c ca="center">
                        <p>71%</p>
                     </c>
                     <c ca="center">
                        <p>94%</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Each line contains the following information about the comparison between the <it>trx </it>microarray and a second microarray: number of up- and down-regulated genes reported in common, number of common genes in clusters, number of common genes in common clusters, number of common clusters, number of common clusters in chromosome 3L.</p>
               </tblfn>
            </tbl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Genomic map of clusters of genes on chromosome 3L that are regulated by several chromatin regulators</p>
               </caption>
               <text>
                  <p>Genomic map of clusters of genes on chromosome 3L that are regulated by several chromatin regulators. The location of each gene reported on every microarray is indicated with a vertical line (up-regulated genes in red, down-regulated genes in green). Genes in the forward strand are displayed above the chromosome line, genes in the reverse strand are displayed below. Clusters of genes in each experiment are indicated with a triangle in red or green according to their expression. Clusters present in two or more microarrays are highlighted by gray bands. Clusters of small genes identified along the fly genome are denoted with a triangle in gray.</p>
               </text>
               <graphic file="gb-2008-9-9-r134-3"/>
            </fig>
            <p>Overall, between 50% (ASH1) and 100% (dMyc) of the <it>trx </it>clusters are also detected in the other chromatin regulators (71% on average; Table <tblr tid="T4">4</tblr>). This strongly suggests that there is high concordance between the <it>trx </it>clusters and the clusters inferred for the other chromatin regulators. There is not, however, an exact equivalence: clusters from different regulators that overlap in genome space with <it>trx </it>clusters may contain different regulated genes. Thus, the intersection between the genes deregulated by TRX and the genes regulated by other factors in the common clusters ranges from 38% (ASH1) to 75% (ASH2) of the genes (50% on average; Table <tblr tid="T4">4</tblr>). Nevertheless, this value dramatically decreases when the whole transcriptomes of each experiment are taken into account. In this case, the intersection between the set of genes deregulated in <it>trx </it>mutant larvae and any other set of genes whose expression was significantly affected by other chromatin regulators is lower than 20% on average (Table <tblr tid="T4">4</tblr>). These results suggest that the clusters identified in common form a group of gene targets directly or indirectly regulated by these chromatin regulators. In addition, this clustering is a specific feature of short and active genes: the average length of deregulated genes in these clusters is 1,135 bp, while the size of deregulated genes in these microarrays that are not clustered is, on average, 4,204 bp (Additional data file 8). These clusters overlap with clusters of small genes identified along the fly genome in the previous section: 75% of them for ASH2, 57% for NURF, 83% for dMyc, 75% for ASH1 (see Figure <figr fid="F3">3</figr> for a graphical comparison on chromosome 3L).</p>
            <p>The clustering organization reported here might be general for transcription factor target genes, and not a feature of genes regulated by chromatin remodeling factors. To rule out this hypothesis, we have collected microarray data for six transcription factors to extend the clustering analysis: <it>fkh </it>(fork head) <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, <it>ey </it>(eyeless) <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, <it>spdk </it>(spotted-dick) <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, <it>gcm </it>(glial cells missing) <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, <it>Otd </it>(Orthodenticle) <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> and <it>lab </it>(labial) <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. We mapped each set of genes to the fly genome, using the program REEF to identify putative clusters. In most cases, however, no clusters were detected (Additional data file 9), indicating that clustering is not a general characteristic of transcription factor target genes. The lack of clustering in these microarrays does not merely reflect the larger gene size for the targets of these genes (Additional data file 10).</p>
            <p>Finally, we used the expression data published by Riedl <it>et al</it>. <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> as a negative control to qualitatively assess the significance of our results. The information has been obtained from two microarray experiments involving rover and sitter larvae to study foraging locomotion in the fruit fly <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. The intersection between these transcriptomes and the <it>trx </it>transcriptome is only slightly lower than that observed between TRX and the other chromatin regulators (6% and 9% for rover and sitter, respectively). However, only five clusters in total were detected among the genes regulated in the rover and sitter microarrays (2 and 3 clusters, respectively). Of these, only one mapped to chromosome 3L and none overlapped the <it>trx </it>clusters (Table <tblr tid="T3">3</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of co-expressed genes that constitute the clusters</p>
            </st>
            <p>The genomic structure of the gene clusters governed by chromatin regulators does not appear to be homogeneous. The average size of clusters in the <it>trx </it>mutants is 3.5 genes, while the genomic region that harbors such genes contains, on average, 6.7 genes (Additional data file 2). For instance, although the cluster shown in Figure <figr fid="F4">4a</figr> contains four genes down-regulated by TRX (depicted in green), there are five additional genes annotated in this genomic region (depicted in blue) for which no change in expression was detected in the microarray. In addition, the comparison of the clusters identified in the different microarrays indicated that, as already outlined, only about 50% of the genes in a cluster regulated by either TRX or another chromatin regulator are actually deregulated in both experiments at the same time (Table <tblr tid="T4">4</tblr>). In many cases, therefore, either genes in the equivalent clusters from different experiments do not show the same regulation pattern or the boundaries of the clusters are not exactly the same. For example, the same cluster containing eight genes shown in Figure <figr fid="F4">4a, b</figr> was identified by the program REEF in both the <it>trx </it>and the <it>ash1 </it>microarrays. However, there are three interesting differences: the gene boundaries of the clusters when considering only the regulated genes are not the same; the expression of the genes changes in the opposite sense (down-regulation versus up-regulation); and some of the clustered genes are not regulated by any of the factors.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Co-expression of genes in clusters</p>
               </caption>
               <text>
                  <p>Co-expression of genes in clusters. <b>(a,b) </b>Expression of genes in the same cluster in different microarrays. (a) Cluster of four down-regulated genes (in green) in <it>trx </it>microarrays. (b) Cluster of four up-regulated genes (in red) in <it>ash1 </it>microarrays. Notice the boundaries and the co-regulated genes of the cluster are not the same in both experiments. These images were produced using the program GFF2PS <abbrgrp><abbr bid="B102">102</abbr></abbrgrp>. <b>(c) </b>Graphical comparison between co-expression of genes in <it>trx </it>and artificial clusters, according to the expression data provided in <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. For each cluster, the co-expression level was computed as the mean of Pearson's correlation coefficient between all pairs of genes in the cluster. The distribution of co-expression values within the boundaries of the <it>trx </it>clusters (including all genes or only the deregulated ones) is clearly skewed to the right, indicating much stronger co-expression than expected at random.</p>
               </text>
               <graphic file="gb-2008-9-9-r134-4"/>
            </fig>
            <p>We used the whole-genome expression data generated by Hooper <it>et al</it>. <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> to investigate whether all genes within the genomic expanse of the <it>trx </it>clusters, and not only those defining the clusters themselves, are co-expressed (there are 162 genes within the region of the <it>trx </it>clusters, but only 97 in the clusters). For this dataset, Hooper <it>et al</it>. measured the expression of genes during the first 24 hours of embryonic development in <it>D. melanogaster </it>(1 hour time points). We used the data between 4 h and 24 h to minimize the possibility that the maternal effect could mask zygotic expression (see Materials and methods). Co-expression was evaluated both by using only those genes that define the <it>trx </it>clusters and using all genes located within the boundaries of each cluster. Based on the expression data provided in <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, we computed the Pearson's correlation coefficient between each pair of genes within the same chromosome across the 20 time points. For each cluster, the level of co-expression was then defined as the mean of Pearson's correlation coefficients between all pairs of genes in the cluster (see Materials and methods). As a reference set, we calculated the same values for each possible artificial cluster of N consecutive genes in the genome (2 &#8804; N &#8804; 15).</p>
            <p>The distribution of values obtained for the clusters containing only the genes deregulated in <it>trx </it>mutants, the clusters containing all genes mapped within the boundaries of the clusters and the artificial clusters of several sizes using the 4 h-24 h expression data set are shown in Figure <figr fid="F4">4c</figr>. Interestingly, the distribution of co-expression levels in randomly generated clusters of different sizes appears to be slightly positive (means ranging from minimum to maximum), probably suggesting an overall induction of transcription during the first stages of larval development. The distribution of co-expression levels computed within the boundaries of clusters, and, in particular, computed only from the regulated genes defining the clusters, is, however, clearly skewed to the right, indicating much stronger coexpression than expected at random. The bimodal shape of the distribution, more accentuated when considering only the genes defining the clusters, suggests the existence of a class of clusters with tight regulation of expression. The deviation from randomness in the <it>trx </it>clusters is perhaps more appreciable in the cumulative plots (Additional data file 11).</p>
            <p>Therefore, genes present within the genomic boundaries of the <it>trx </it>clusters, including those not in the defined clusters, are overall co-expressed. There are several causes that can explain the existence of additional genes within the boundaries of a <it>trx </it>cluster. These genes might not have been included in the clusters either because they were not in the array (4 cases out of 65 additional genes), because the gene showed a different pattern of regulation (up-regulated instead of down-regulated or vice versa, 1 case), or because the expression intensity from the microarray was below the selected thresholds (60 cases).</p>
         </sec>
         <sec>
            <st>
               <p>Clusters may contain both up- and down-regulated genes</p>
            </st>
            <p>The trxG members are known to be positive regulators of transcription <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. However, in our study, we found a similar number of up-regulated compared to down-regulated genes in the <it>trx </it>mutants. Similar results have recently been reported for <it>ash2</it>, <it>ash1 </it>and <it>Nurf301 </it><abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B46">46</abbr></abbrgrp>, suggesting that trxG proteins might also act directly or indirectly as repressors of certain genes. We once more applied the REEF clustering strategy, but this time considering all <it>trx </it>deregulated genes together, irrespective of the direction of their regulation. In addition to the 25 clusters previously detected, this method allowed us to identify six additional 'hybrid' clusters (with both up- and down-regulated genes). Moreover, we also enriched previously detected clusters with genes regulated in the opposite direction (Figure <figr fid="F5">5</figr>). In total, we identified 129 deregulated genes that were organized in 31 clusters.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Genomic map of 'hybrid' clusters of genes deregulated by TRX in <it>D. melanogaster</it></p>
               </caption>
               <text>
                  <p>Genomic map of 'hybrid' clusters of genes deregulated by TRX in <it>D. melanogaster</it>. Computational identification of clusters was performed on a set of up- and down-regulated genes in the microrray. The new hybrid clusters of genes are indicated with a blue triangle. The clusters detected before - using one of both sets - are indicated with a red triangle (up-regulated genes) or a green triangle (down-regulated genes). Some of them have been enriched using genes expressed in the opposite sense (displayed in light red or light green).</p>
               </text>
               <graphic file="gb-2008-9-9-r134-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The chromosomal clustering is conserved in other species</p>
            </st>
            <p>The clusters of genes detected here might be acting as transcriptional units with coordinated transcriptional regulation. One would therefore expect some level of conservation of cluster organization across species. The genomes of multiple species of <it>Drosophila </it>have been recently made available through the UCSC genome browser <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, allowing investigation of the conservation of <it>trx </it>clusters in other <it>Drosophila </it>species. Only three of these genomes have been completely assembled: <it>D. simulans</it>, <it>D. yakuba </it>and <it>D. pseudoobscura </it><abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. We have mapped all <it>D. melanogaster </it>genes to the genomes of each of these species using the BLAT alignments provided by the UCSC genome browser <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> (see Materials and methods). The number of genes annotated on each species using this method is shown in Table <tblr tid="T5">5</tblr>.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Clusters of genes deregulated in <it>trx </it>mutants conserved in other phylogenetically related species</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Genes (orthologs)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Species</p>
                     </c>
                     <c ca="right">
                        <p>Genome</p>
                     </c>
                     <c ca="right">
                        <p>&#8593;</p>
                     </c>
                     <c ca="right">
                        <p>&#8595;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters &#8593;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters &#8595;</p>
                     </c>
                     <c ca="right">
                        <p>Clusters</p>
                     </c>
                     <c ca="right">
                        <p>Deregulated genes in clusters</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>
                              <it>D. melanogaster</it>
                           </b>
                        </p>
                     </c>
                     <c ca="right">
                        <p>19,670</p>
                     </c>
                     <c ca="right">
                        <p>260</p>
                     </c>
                     <c ca="right">
                        <p>275</p>
                     </c>
                     <c ca="right">
                        <p>10</p>
                     </c>
                     <c ca="right">
                        <p>15</p>
                     </c>
                     <c ca="right">
                        <p>25</p>
                     </c>
                     <c ca="right">
                        <p>97</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. simulans</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>17,927</p>
                     </c>
                     <c ca="right">
                        <p>235</p>
                     </c>
                     <c ca="right">
                        <p>262</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                     <c ca="right">
                        <p>20</p>
                     </c>
                     <c ca="right">
                        <p>75</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. yakuba</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>19,929</p>
                     </c>
                     <c ca="right">
                        <p>259</p>
                     </c>
                     <c ca="right">
                        <p>275</p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="right">
                        <p>14</p>
                     </c>
                     <c ca="right">
                        <p>25</p>
                     </c>
                     <c ca="right">
                        <p>96</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. pseudoobscura</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>15,096</p>
                     </c>
                     <c ca="right">
                        <p>170</p>
                     </c>
                     <c ca="right">
                        <p>230</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                     <c ca="right">
                        <p>14</p>
                     </c>
                     <c ca="right">
                        <p>60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>A. gambiae</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>7,283</p>
                     </c>
                     <c ca="right">
                        <p>97</p>
                     </c>
                     <c ca="right">
                        <p>151</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="right">
                        <p>34</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>For each species, we show: number of genes for which an ortholog in <it>D. melanogaster </it>was found, number of orthologs for up-regulated genes, number of orthologs for down-regulated genes, number of up-regulated clusters, number of down-regulated clusters, total number of clusters, number of deregulated genes from <it>D. melanogaster </it>that constitute the clusters.</p>
               </tblfn>
            </tbl>
            <p>After mapping the up-regulated and down-regulated genes of the <it>trx </it>mutant from <it>D. melanogaster </it>to the other <it>Drosophila </it>genomes, we used the program REEF with the same set of parameters to identify putative clustering of these genes. The number of clusters detected in these species is shown in Table <tblr tid="T5">5</tblr>: 20 clusters in <it>D. simulans </it>(corresponding to 7 up-regulated clusters and 13 down-regulated clusters in the <it>trx </it>microarrays), 25 clusters in <it>D. yakuba </it>(11 up-regulated clusters, 14 down-regulated clusters) and 14 clusters in <it>D. pseudoobscura </it>(1 up-regulated cluster, 13 down-regulated clusters). We have compared the clusters obtained in <it>D. melanogaster </it>with the clusters identified in these three species: 24 out of 25 clusters (96%) identified in <it>D. melanogaster </it>were conserved in at least one other species (80% of the clusters were conserved in <it>D. melanogaster </it>and two more species, 36% of the clusters were conserved in all species). In contrast, the percentage of clusters identified in these species that was not detected in <it>D. melanogaster </it>was very low (0% in <it>D. simulans</it>, 16% in <it>D. yakuba</it>, 14% in <it>D. pseudoobscura</it>; Table <tblr tid="T6">6</tblr>), indicating that this set of deregulated genes is similarly organized in the genome of these species. The distribution of clusters on each genome is shown in Figure <figr fid="F6">6</figr> (the clusters of <it>D. melanogaster </it>that are conserved in other species have the same identifier as in Figure <figr fid="F1">1</figr>).</p>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Conservation of genes in the clusters and their vicinity</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>Genome</p>
                     </c>
                     <c ca="right">
                        <p>No. of clusters</p>
                     </c>
                     <c ca="center">
                        <p>No. of clusters conserved in <it>D. melanogaster</it></p>
                     </c>
                     <c ca="right">
                        <p>No. of genes within the clusters</p>
                     </c>
                     <c ca="right">
                        <p>% Genes conserved within the clusters</p>
                     </c>
                     <c ca="right">
                        <p>% Genes conserved in the flanking area</p>
                     </c>
                     <c ca="right">
                        <p>% Genes conserved in artificial clusters</p>
                     </c>
                     <c ca="center">
                        <p>Divergence time estimates (Mya)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. simulans</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>20 (100%)</p>
                     </c>
                     <c ca="right">
                        <p>144</p>
                     </c>
                     <c ca="right">
                        <p>96%</p>
                     </c>
                     <c ca="right">
                        <p>86%</p>
                     </c>
                     <c ca="right">
                        <p>90%</p>
                     </c>
                     <c ca="center">
                        <p>2-3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. yakuba</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>25</p>
                     </c>
                     <c ca="center">
                        <p>21 (84%)</p>
                     </c>
                     <c ca="right">
                        <p>146</p>
                     </c>
                     <c ca="right">
                        <p>88%</p>
                     </c>
                     <c ca="right">
                        <p>64%</p>
                     </c>
                     <c ca="right">
                        <p>79%</p>
                     </c>
                     <c ca="center">
                        <p>5-7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>D. pseudoobscura</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>12 (86%)</p>
                     </c>
                     <c ca="right">
                        <p>83</p>
                     </c>
                     <c ca="right">
                        <p>96%</p>
                     </c>
                     <c ca="right">
                        <p>58%</p>
                     </c>
                     <c ca="right">
                        <p>65%</p>
                     </c>
                     <c ca="center">
                        <p>25-55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>A. gambiae</it>
                        </p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>6 (86%)</p>
                     </c>
                     <c ca="right">
                        <p>48</p>
                     </c>
                     <c ca="right">
                        <p>66%</p>
                     </c>
                     <c ca="right">
                        <p>2%</p>
                     </c>
                     <c ca="right">
                        <p>20%</p>
                     </c>
                     <c ca="center">
                        <p>250-300</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The following information is shown for each genome: the species, the number of clusters predicted, the number and the percentage of clusters that are conserved in <it>D. melanogaster</it>, the number of genes in the clusters (the same amount of genes is used to measure the conservation in the flanking areas), the percentage of genes that are conserved between these clusters and the corresponding clusters in <it>D. melanogaster</it>, the percentage of genes that are conserved in the flanking areas of the clusters (average conservation in the left and right flanking areas), the percentage of the genes that are conserved in 10,000 artificial clusters sampled on each species, and the divergence time (million years ago (Mya)) estimates between <it>D. melanogaster </it>and each species, extracted from <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B101">101</abbr></abbrgrp>.</p>
               </tblfn>
            </tbl>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Genomic map in other species of clusters deregulated in <it>trx </it>mutants</p>
               </caption>
               <text>
                  <p>Genomic map in other species of clusters deregulated in <it>trx </it>mutants. The location in each species of the orthologous gene deregulated in <it>D. melanogaster </it>is indicated with a vertical line (up-regulated genes in red, down-regulated genes in green). Genes in the forward strand are displayed above the chromosome line, genes in the reverse strand are displayed below. Clusters of genes identified on each genome are indicated with a blue triangle.</p>
               </text>
               <graphic file="gb-2008-9-9-r134-6"/>
            </fig>
            <p>Another genome of interest for the identification of homologous clusters potentially regulated by the <it>trx </it>gene is that of <it>Anopheles gambiae </it><abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. We obtained the list of putative <it>Anopheles </it>orthologs to the <it>D. melanogaster </it>genes using the ENSEMBL annotations <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. Less than 50% of the fly genes could be mapped to the mosquito genome in this way (Table <tblr tid="T5">5</tblr>). Consequently, only 7 clusters were identified. Most of these clusters, however, were conserved in <it>D. melanogaster </it>(Figure <figr fid="F6">6</figr> and Table <tblr tid="T6">6</tblr>).</p>
            <p>In the work presented here, we identified a set of 25 gene clusters in <it>D. melanogaster </it>that are phylogenetically conserved in other flies. However, given the strong synteny between the <it>Drosophila </it>genomes (see divergence time estimates in Table <tblr tid="T6">6</tblr>), we can not claim that the conservation of clusters that we observed is not simply a consequence of such an overall synteny. To discard such a hypothesis, for each cluster identified in <it>D. melanogaster </it>we examined the number of genes in common found in the corresponding cluster in each of the other <it>Drosophila </it>species (allowing for gene rearrangements and chromosome inversions inside the region; see Materials and methods for further details). We also analyzed the number of genes in common between the corresponding flanking areas of these clusters in order to compare the number of genes that are conserved inside and outside them; the results are shown in Table <tblr tid="T6">6</tblr>. While the genes that constitute the clusters of <it>trx </it>in <it>D. melanogaster </it>are mostly the same in the clusters of the other species (96% in <it>D. simulans</it>, 88% in <it>D. yakuba</it>, 96% in <it>D. pseudoobscura</it>), the number of conserved genes in the vicinity of each cluster decreases in more distant species (86% in <it>D. simulans</it>, 64% in <it>D. yakuba</it>, 58% in <it>D. pseudoobscura</it>). Additional statistical tests confirmed these observations (see Materials and methods). According to these results, we conclude that the overall synteny between the <it>Drosophila </it>genomes is not enough to explain the high level of conservation observed in the clusters of genes deregulated by TRX in <it>D. melanogaster</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Clusters of deregulated genes are enriched in some functional categories</p>
            </st>
            <p>In order to characterize the clusters previously identified in <it>D. melanogaster</it>, we functionally annotated their constituent genes (Additional data file 12) using Gene Ontology (GO) <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. GO is a hierarchical dictionary of biological terms structured into three main categories: molecular function, biological process and cellular component. We also annotated the function of the full set of genes in our microarray and of the genes that were reported to be up-regulated or down-regulated to estimate the statistical significance of our results.</p>
            <p>We analyzed the information available for the genes of each respective set (12,120 genes in the microarray, 535 deregulated genes, 97 genes in clusters) at the third level of the molecular function ontology (see Materials and methods). A graphical representation of the more abundant categories for each of the three gene sets is shown in Figure <figr fid="F7">7a</figr>. The clusters of down-regulated genes are significantly enriched in structural proteins involved in cuticle formation (<it>p</it>-value &lt;10<sup>-37</sup>; see Materials and methods). The over-representation is less relevant in the set of down-regulated genes, while it is not observed in the full collection of genes in the microarray (Figure <figr fid="F7">7</figr>). The clusters of up-regulated genes are also enriched in proteins with carbohydrate and pattern binding functions, as well as structural components of the peritrophic membrane (<it>p</it>-value &lt;0.005; see Materials and methods). The set of up-regulated genes is also, albeit to a lesser extent, enriched in these categories, while no over-representation is observed in the complete set of genes in the microarray (Figure <figr fid="F7">7</figr>). A more detailed inspection of the functional annotations of the clusters reveals that this over-representation is due to the abundance of genes involved in the subcategory 'chitin binding' <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Functional annotation of genes deregulated in <it>trx </it>mutants</p>
               </caption>
               <text>
                  <p>Functional annotation of genes deregulated in <it>trx </it>mutants. <b>(a) </b>Classification of the microarray gene set, the deregulated genes and the genes that constitute the clusters according to the GO category 'molecular function', level 3. <b>(b) </b>Genomic map of clusters of functionally related genes. Clusters of genes annotated as structural constituents of cuticle (displayed as blue stars) and clusters of genes annotated as chitin binding (displayed as purple circles). Clusters of co-regulated genes in the <it>trx </it>mutant are indicated with a triangle in red or green according to their expression. Notice that most functional clusters match regulatory clusters despite the fact that both approaches are completely different.</p>
               </text>
               <graphic file="gb-2008-9-9-r134-7"/>
            </fig>
            <p>According to this, the clusters of genes deregulated in the <it>trx </it>mutant contain a significant number of genes involved in cuticle structure and other related functions. To confirm these results, we performed a whole-genome clustering of the genes from <it>D. melanogaster </it>according to their functional annotation in GO, in which no expression data were used (see Materials and methods). We focused, in particular, on the two functional categories over-represented in the clusters of genes deregulated in <it>trx </it>mutants: structural constituents of cuticle (GO:0042302) and chitin binding (GO:0008061).</p>
            <p>We found 98 genes annotated as components of the cuticle in the genome of <it>D. melanogaster</it>. Using the program REEF with the same set of parameters, we identified 12 clusters of genes involved in cuticle structure (shown as blue stars in Figure <figr fid="F7">7b</figr>), 8 of which are located on chromosome 3L. We found 67 genes annotated as chitin binding proteins and identified 6 clusters, 5 of which are also located on chromosome 3L (shown as purple circles in Figure <figr fid="F7">7b</figr>). Of the 18 functional clusters, 8 overlap the clusters of genes regulated by TRX (shown as red and green marks in Figure <figr fid="F7">7b</figr>). In addition, most genes in the functional clusters are annotated as being in the same functional categories (Additional data file 13).</p>
         </sec>
         <sec>
            <st>
               <p>Clusters of genes deregulated in <it>trx </it>mutants are enriched in tissue specific genes</p>
            </st>
            <p>We have also attempted to characterize the expression pattern of genes in the <it>trx </it>clusters. Using data from the Li <it>et al</it>. <abbrgrp><abbr bid="B63">63</abbr></abbrgrp> study, in which the tissue distribution of genes expressed 18 hours before the larval to pupal metamorphosis in <it>D. melanogaster </it>was determined, we characterized the expression of the members of three different gene sets (genes in the microarray, deregulated genes, genes in clusters; see Materials and methods for further details); results are shown in Figure <figr fid="F8">8a</figr>. As expected, the proportion of genes in our microarray expressed in different tissues is very similar to that reported by Li <it>et al</it>. <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>: 28% are expressed in the central nervous system, 24% in wing discs, 20% in the midgut, 16% in salivary glands and 12% in the epidermis. Most genes regulated by TRX, on the other hand, are expressed in the midgut (30%, <it>p</it>-value &lt;10<sup>-5</sup>) and in the epidermis (27%, <it>p</it>-value &lt;10<sup>-6</sup>), while the genes that constitute the clusters regulated by TRX are abundantly expressed in salivary glands (41% of genes, <it>p</it>-value &lt;10<sup>-5</sup>). We compared the tissue-specific expression pattern of the genes in the <it>trx </it>clusters with the functional annotation of these genes as inferred through the GO annotations (see previous section) and results are shown in Figure <figr fid="F8">8b</figr>: 16 of 25 clusters (64%) contain genes that are either expressed in salivary glands or code for structural cuticle proteins. In addition, the clusters appear to be divided into those containing genes expressed in salivary glands and those containing genes coding for structural proteins. Together, these results demonstrate that TRX plays a key role in the regulation of clusters containing genes involved in the structure of the cuticle during the larval stages of <it>D. melanogaster </it>development.</p>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>Clusters are enriched in genes expressed in particular tissues</p>
               </caption>
               <text>
                  <p>Clusters are enriched in genes expressed in particular tissues. <b>(a) </b>From left to right: tissue distribution of genes expressed 18 h before larval to pupal metamorphosis according to Li <it>et al</it>. <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>; expression pattern of genes included in our <it>trx </it>microarray; genes deregulated by TRX; and genes in clusters deregulated by TRX. <b>(b) </b>Tissue distribution of clustered genes (at least one gene must be expressed in that tissue). Clusters that contain genes annotated as structural proteins in GO are displayed for comparison.</p>
               </text>
               <graphic file="gb-2008-9-9-r134-8"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>PRE/TRE predictions and experimental approaches</p>
            </st>
            <p>The PcG and trxG complexes bind to sequences called PRE/TREs. However, not only is the genomic distance between well-characterized PRE/TREs and their target genes highly variable, ranging from a few nucleotides to more than 60 Kb in many cases, but they can be found both upstream or downstream of the gene <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr><abbr bid="B66">66</abbr></abbrgrp>. Several methods have been proposed to detect PRE/TRE elements in genomic sequences. For example, Ringrose <it>et al</it>. <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> developed a computational approach to detect potential PRE/TREs and identified 167 candidates in the fly genome, some of which were validated experimentally <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. More recently, several ChIP-on-chip experiments have been performed to search for PcG targets. Among others, Schwartz <it>et al</it>. <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> determined the distribution of the PcG proteins Pc, E(z) and Psc and of H3K27me3 in the whole genome, Tolhuis <it>et al</it>. <abbrgrp><abbr bid="B69">69</abbr></abbrgrp> constructed a map of binding patterns of the PcG proteins Pc, esc and Sce in chromosomes 2L and 4, and part of chromosomes 2R and X, and Negre <it>et al</it>. <abbrgrp><abbr bid="B70">70</abbr></abbrgrp> analyzed the binding profile of the PcG proteins Pc and ph, and the GAGA factor in certain regions of chromosomes 2L and X.</p>
            <p>We mapped the PcG target sites identified in each of these experiments (251 sites in Schwartz <it>et al</it>. <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>, 131 sites in Tolhuis <it>et al</it>. <abbrgrp><abbr bid="B69">69</abbr></abbrgrp> and 36 sites in Negre <it>et al</it>. <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>) and the 167 PRE/TREs predicted by Ringrose <it>et al</it>. <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> in the fly genome (assembly dm2, April 2004). We then compared the location of the 25 clusters of genes deregulated by TRX in the fruit fly genome with the location of the PcG target sites and the PRE/TRE predictions (Figure <figr fid="F9">9</figr>). Since the distance between the PRE/TRE and its target gene can be highly variable, we decided to confine the search to the PcG binding regions or PRE/TREs that were located at most 100 Kb distant from our clusters. According to this restriction, we found 14 out of 25 clusters (56%) near one experimental evidence or a PRE/TRE prediction, five of which were supported by both a PcG binding site and a PRE/TRE (Additional data file 14). We performed ChIP analysis of third instar larvae, using both anti-TRX and H3K4me3 specific antibodies, to test the possible binding of the TRX protein to some of the predicted PRE/TREs <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> that are in close proximity to the <it>trx </it>deregulated clusters. Preliminary results seem to indicate that TRX is capable of binding to at least two of the PRE/TREs tested (Additional data file 15). In keeping with its role as a histone methyltransferase, TRX binding correlates with the presence of H3K4me3.</p>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>Genomic map of clusters, PcG ChIP-on-chip data and predicted PRE/TREs</p>
               </caption>
               <text>
                  <p>Genomic map of clusters, PcG ChIP-on-chip data and predicted PRE/TREs. The location of each gene reported on the <it>trx </it>microarray is indicated with a vertical line (up-regulated genes in red, down-regulated genes in green). Genes in the forward strand are displayed above the chromosome line, genes in the reverse strand are displayed below. Clusters of genes on each experiment are indicated with a triangle in red or green according to their expression. PcG binding domains reported by Schwartz <it>et al</it>. <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>, Tolhuis <it>et al</it>. <abbrgrp><abbr bid="B69">69</abbr></abbrgrp> and Negre <it>et al</it>. <abbrgrp><abbr bid="B70">70</abbr></abbrgrp> are displayed in blue. PRE/TRE predictions obtained by Ringrose <it>et al</it>. <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> are displayed in black.</p>
               </text>
               <graphic file="gb-2008-9-9-r134-9"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In this study we have compared the whole-genome expression profiles of <it>trx </it>mutant and wild-type fly larvae and located the deregulated genes on the fly genome. Considering the stringent threshold used in this study, it is possible that the total number of regulated genes may be underestimated. In the absence of genome-wide ChIP data, we can not assess TRX binding and its putative relationship with H3K4 trimethylation, but taking into account that most active genes have this mark in most species, it is tempting to speculate that the TRX protein itself or another member of this group should be responsible for it. Nevertheless, the genome mapping revealed the tendency of some <it>trx </it>deregulated genes to cluster within a few genomic locations with 97 genes (about 20% of all deregulated genes) organized in 25 genomic clusters covering less than 400,000 bp (about 0.3% of the fly genome). This appears to be a distinctive feature of the regulatory networks of other chromatin regulators. Indeed, our microarray experiments on <it>ash2 </it>mutants, as well as experiments performed on NURF <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, dMyc <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> and ASH1 <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, indicate that the genes regulated by these chromatin factors are also clustered in the fly genome. Remarkably, these clusters are a subset of the <it>trx </it>deregulated clusters, despite that only about 20% of the <it>trx </it>deregulated genes are also deregulated in these other experiments. In fact, the number of genes in clusters is likely to be also underestimated, because of the thresholds used to establish deregulation. Moreover, when using the time course whole-genome expression data during the first 24 hours of embryonic development <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, we observe an overall positive correlation between the expression levels of all genes within the boundaries of the <it>trx </it>clusters (irrespective of whether they are included in the clusters themselves). Similar results are observed when using the transcription maps generated across the fly genome in the same developmental stage by Manak <it>et al</it>. <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. In fact, most genes in the <it>trx </it>clusters present similar patterns of overlapping transcribed fragments (transfrags; Additional data file 16). In addition, we found a significant under-representation of non-exonic transfrags within our clusters (the observed overlap between transfrags and clusters was 100 bp and the expected overlap was 2,610.8 bp, <it>&#967;</it><sup>2 </sup>homogeneity test, <it>p</it>-value = 0). Since the presence of transfrags in the vicinity of genes is often an indication of alternative transcription initiation and termination sites, their under-representation suggests that the boundaries of the genes in these clusters are under tight transcriptional control.</p>
         <p>Some clusters detected here have been previously described in the literature. For instance, clusters 9 (Lcp65A, larval cuticle proteins) and 21 (Ccp84, cuticular genes cluster), both down-regulated in the <it>trx </it>microarray, are two well-known groups of genes involved in the determination of the physical characteristics of the insect cuticle <abbrgrp><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr></abbrgrp>. Other interesting examples are clusters 11 (<it>Hsp23</it>, <it>Hsp26</it>, <it>Hsp27</it>) and 13 (<it>Sgs3</it>, <it>Sgs7</it>, <it>Sgs8</it>), which are up-regulated in <it>trx </it>deficient flies and down-regulated in flies lacking <it>ash2 </it>or <it>Nurf301</it>. <it>Hsp23</it>, <it>Hsp26 </it>and <it>Hsp27 </it>are heat shock genes also expressed in the absence of stress during embryogenesis and metamorphosis in <it>D. melanogaster </it>but their role in these processes is not well understood <abbrgrp><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr></abbrgrp>. <it>Sgs3</it>, <it>Sgs7 </it>and <it>Sgs8 </it>form a cluster of genes that code for proteins that are part of the salivary glue secreted by <it>Drosophila </it>larvae to fix themselves to an external substrate for the duration of the pre-pupal and pupal periods <abbrgrp><abbr bid="B76">76</abbr><abbr bid="B77">77</abbr></abbrgrp>. Both clusters were previously reported to be regulated by the 20E- inducible Broad-Complex (BR-C), an early ecdysone response gene complex that is active during larval to pupal metamorphosis and encodes a family of zinc-finger transcription factors <abbrgrp><abbr bid="B78">78</abbr><abbr bid="B79">79</abbr></abbrgrp>. Although <it>Sgs </it>expression is known to be indirectly controlled by BR-C <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>, up-regulation of <it>br </it>(a member of BR-C) in <it>trx </it>mutants could explain the up-regulation of both clusters. Consistent with this hypothesis, <it>br </it>has a reduced level of expression in <it>ash2 </it>and <it>Nurf301 </it>mutants, where such clusters are down-regulated, suggesting BR-C can be an intermediate step in the regulation of the expression of <it>Sgs </it>and <it>Hsp </it>clusters. Homeotic genes are also clustered in the genome and their expression state is maintained by PcG and trxG proteins after the initial transcriptional regulators disappear from the embryo <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. In our <it>trx </it>mutant microarrays most homeotic genes did present a decrease in expression; however, the difference was inferior to the fold change threshold selected to filter and normalize the expression data. In addition, homeotic complexes are substantially larger than the <it>trx </it>clusters reported in our work, which suggests that the computational tools used here should be reconfigured to detect them.</p>
         <p>We compared the results of several ChIP-on-chip experiments on different fragments of the fruit fly genome that have already been published <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr></abbrgrp> and the set of computational PRE/TRE predictions obtained by Ringrose <it>et al</it>. <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> with the location of our clusters. The analysis of this information is, however, difficult due to the different conditions in each experiment, the genome coverage of them and the limited knowledge of the PRE/TRE sequences <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. In contrast to vertebrates, it seems that only around 30% of PcG binding sites are within 2 Kb of a promoter in flies, complicating the assignment to specific genes <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>. Half of the gene clusters deregulated in <it>trx </it>mutants are close (less than 100 Kb) to functional PcG binding sites or PRE/TRE predictions, suggesting that some clusters may be directly regulated by TRX, while others can be under indirect regulation. A recent study presented evidence for the existence of arrays of highly conserved non-coding elements (HCNEs) and genomic regulatory blocks in five <it>Drosophila </it>species <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>, giving rise to some controversy about whether PRE/TREs might be found inside those regions or not <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. We mapped the HCNEs identified by Engstrom <it>et al</it>. <abbrgrp><abbr bid="B82">82</abbr></abbrgrp> to our clusters (Additional data file 17), detecting a significant under-representation of such elements in comparison to the rest of the genome (the observed overlap between HCNEs and clusters was 190 bp, and the expected overlap was 1,342.7 bp, <it>&#967;</it><sup>2 </sup>homogeneity test, <it>p</it>-value = 0).</p>
         <p>Multiple genome-wide approaches for the detection of clustering organization have been published (see Hurst <it>et al</it>. <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> for a review), and a variety of computational approaches towards that end have been developed <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B83">83</abbr></abbrgrp>. Comparison of the results, however, is complex because of several reasons: the lack of a standard definition of a cluster (from series of two/three adjacent genes to groups of 10-30 genes), the statistical engine is different, even in the same species, different sets of genes were used, or the biological conditions (tissues/developmental stage) of the study are not comparable. The clusters presented here are located in relatively small regions of the genome (average cluster size of 15,918 nucleotides) in contrast to larger clusters reported in previous studies (cluster length between 20 and 200 kb in <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>). In addition, we tolerate within the boundaries of the genomic clusters the presence of genes that are not shown to be regulated in the microarray data (only adjacent co-expressed genes were accepted in clusters in <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>).</p>
         <p>The growing body of evidence supporting the existence of non-random gene distribution in genomes <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> indicates that genome organization may be partially evolutionarily conserved across eukaryotes. Phylogenetically conserved clusters of co-expressed genes have recently been reported in human and mouse <abbrgrp><abbr bid="B84">84</abbr><abbr bid="B85">85</abbr></abbrgrp>, while adjacent pairs of essential genes that are evolutionarily conserved have been shown to cluster in regions of low recombination <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>. In fact, neighboring genes are thought to experience continuous concerted expression changes during evolution, which can lead to the formation of clusters of co-expressed genes. However, the pattern of expression might be evolving slowly within the clusters. On the other hand, some clusters may be maintained by natural selection because of their similar biological functions <abbrgrp><abbr bid="B87">87</abbr></abbrgrp>. Recently, a comparative analysis of the genomes of 12 <it>Drosophila </it>species has been published <abbrgrp><abbr bid="B88">88</abbr><abbr bid="B89">89</abbr></abbrgrp>. Here, we identified a set of 25 clusters in <it>D. melanogaster </it>that are phylogenetically conserved in <it>D. simulans</it>, <it>D. yakuba </it>and <it>D. pseudoobscura</it>. The conservation that we have observed in these clusters is stronger than the overall synteny among the <it>Drosophila </it>genomes. Our preliminary results on <it>A. gambiae</it>, which diverged from <it>Drosophila </it>about 250 million years ago, are in support of the selective constraints to maintain the cluster organization.</p>
         <p>Lee and Sonnhammer <abbrgrp><abbr bid="B90">90</abbr></abbrgrp> recently reported the existence of a link between functional pathways in the KEGG dabatase <abbrgrp><abbr bid="B91">91</abbr></abbrgrp> and the chromosomal distribution of genes involved in them. Here, we functionally annotated the genes constituting the clusters of genes deregulated in the <it>trx </it>mutant using GO <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>, and detected a significant enrichment in structural proteins involved in cuticle formation and chitin metabolism. To assess the importance of this overrepresentation, we extracted all genes of the fruit fly genome annotated with these functions and used the program REEF to identify potential clusters on this set of genes. Remarkably, there is a high overlap between the map of functional clusters and the map of regulatory clusters. The overrepresentation of structural proteins involved in cuticle formation suggests that the trxG proteins analyzed here may play a role in the hormonal response that takes place during metamorphosis. Indeed, a function for the histone methyltransferase protein TRR (Trithorax-related) as a coactivator for the ecdysone receptor <abbrgrp><abbr bid="B92">92</abbr></abbrgrp> and a direct link between NURF and ecdysteroid signaling in larval to pupal metamorphosis <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> have been reported. If genes involved in larval/pupal transition, such as the ones described here, are among the trxG targets, a putative explanation for the preferential localization in chromosome 3L may be just that 67% of the functionally annotated clusters of genes involved in cuticle structure are located in this chromosome. Moreover, it is known that many mutations affecting trxG genes (either null alleles or heteroallelic combinations) are lethal at the third instar larval stage, probably due to a large dowry of maternally loaded mRNA, and do not undergo larval to pupal metamorphosis. In spite of other phenotypic differences between these larvae, it is tempting to speculate that lethality could be caused by desiccation due to defective cuticle secretion and that the trxG/PcG regulation of the clusters could have a pivotal role in metamorphosis.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Further experiments correlating gene expression states and chromatin modifications in specific tissues during development as well as chromatin protein binding maps will be required to understand the complex role of trxG proteins.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p><it>Drosophila </it>strains</p>
            </st>
            <p>All <it>Drosophila </it>strains and crosses were kept on standard media with 0.025% bromophenol blue. The reference line used was the <it>w</it><sup><it>1118</it></sup><sub><it>iso</it></sub>; <it>2</it><sub><it>iso</it></sub>; <it>3</it><sub><it>iso </it></sub>isogenic line from the DrosDel Collection <abbrgrp><abbr bid="B93">93</abbr></abbrgrp>. To reduce the differences in the biological background between the alleles under study and the reference strain, we transferred chromosomes X, Y and 2 from the isogenic line to the TM6c-balanced <it>ash2</it><sup><it>I1</it></sup>, and to <it>trx</it><sup><it>B11</it></sup> and <it>trx</it><sup><it>E</it><it>3</it></sup> alleles, which were used in trans-heterozygosity. Their genotypes were, respectively, <it>w</it><sup><it>1118</it></sup><sub><it>iso</it></sub>; <it>2</it><sub><it>iso</it></sub>; <it>ash2</it><sup><it>I</it><it>1</it></sup>/<it>TM6c</it>, <it>w</it><sup><it>1118</it></sup><sub><it>iso</it></sub>; <it>2</it><sub><it>iso</it></sub>; <it>trx</it><sup><it>B</it><it>11</it></sup>/<it>TM6c </it>and <it>w</it><sup><it>1118</it></sup><sub><it>iso</it></sub>; <it>2</it><sub><it>iso</it></sub>; <it>trx</it><sup><it>E</it><it>3</it></sup>/<it>TM6c</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Microarray design</p>
            </st>
            <p>Microarrays were printed on Corning UltraGAPS slides (Corning, Corning, NY, USA) at the Plataforma de Transcript&#242;mica (SCT - PCB, Universitat de Barcelona, Spain) using the <it>Drosophila </it>Genome Oligo Set version 1.1 (Operon Biotechnologies Inc., Huntsville, AL, USA), a collection of 14,593 probes representing 13,577 <it>Drosophila </it>genes with Flybase ID (12,120 genes in RefSeq). The 70 mer <it>Arabidopsis </it>sequences from TIGR <abbrgrp><abbr bid="B94">94</abbr></abbrgrp> and spots with no material or with buffer were also printed to be used as spike-in and negative controls, respectively. The microarray annotation is deposited in the Gene Expression Omnibus database with accession number GPL3797. Wandering blue-gut staged Tb+ early third instar larvae were selected in all cases to extract total RNA using the RNeasy Protect Mini Kit (Qiagen Inc., Valencia, CA, USA). At least two independent total RNA extractions were carried out. Quality was assessed in all samples using the Eukaryote Total RNA Nano Assay on a 2100 Bioanalyzer (Agilent Technologies Inc., Santa Clara, CA, USA). Total RNA from <it>w</it><sup><it>1118</it></sup>;+;+ larvae was used as a common reference. Four microarrays were hybridized for each experiment in biological replicate pairs, such that the total RNA from the sample used as a starting material came from different extractions. Both arrays from each pair were hybridized with the same amplified RNA from sample and common reference (obtained using the Amino-Allyl Messageamp II aRNA Amplification Kit from Ambion Inc., Austin, TX, USA) using dyes Cy3 and Cy5 (GE Healthcare UK Ltd, Buckinghamshire, UK).</p>
         </sec>
         <sec>
            <st>
               <p>Microarray analysis</p>
            </st>
            <p>GenePix Results (GPR) data files were obtained for each microarray with an Axon 4000B scanner and GenePix Pro 6 (Molecular Devices Corp., Sunnyvale, CA, USA). All GPR files were analyzed with the Limma package from BioConductor <abbrgrp><abbr bid="B95">95</abbr><abbr bid="B96">96</abbr></abbrgrp> using the same criteria. The spots not fulfilling the quality thresholds (based on spot size, foreground versus background signals, saturation, coincidence between differently calculated ratio measures and R<sup>2 </sup>of regression ratio) were eliminated from the analysis, and the data were background corrected with the <it>normexp </it>method and normalized using OLIN <abbrgrp><abbr bid="B97">97</abbr></abbrgrp>. Between-array normalization was carried out independently for each set of four arrays using the <it>mad </it>method from OLIN, and a linear model was fitted and corrected with False Discovery Rate (FDR) <abbrgrp><abbr bid="B95">95</abbr></abbrgrp>. We obtained lists of genes that were differentially expressed two-fold in the mutants compared to the isogenic strain (FDR-corrected <it>p</it>-value lower than 0.05). Raw and normalized data are deposited in the Gene Expression Omnibus database (user name: sergiba_rev_1, password: 2139861083) with the accession number GSE8783, which includes the data for <it>trx </it>(GSE8748) and <it>ash2 </it>(GSE8750) mutants.</p>
         </sec>
         <sec>
            <st>
               <p>Microarray data collection</p>
            </st>
            <p>We obtained the set of genes that are differentially expressed in the larvae of <it>D. melanogaster </it>from Arbeitman <it>et al</it>. <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. We selected only the genes that were up-regulated at least two-fold in the larvae (302 RefSeq genes). We extracted the gene expression data for <it>Nurf301-/- </it>from Badenhorst <it>et al</it>. <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> (only the list of down-regulated genes was published). The set of up-regulated genes in response to <it>dmyc </it>over-expression was taken from Goodliffe <it>et al</it>. <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. We extracted from Goodliffe <it>et al</it>. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> the set of up and down-regulated genes when the regulatory activity of <it>ash1 </it>was repressed using RNA interference to study the interaction between the dMyc complex and ASH1. We used the expression data from two microarray analyses (rovers and sitters) about foraging locomotion from Riedl <it>et al</it>. <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> as a negative control in the clustering process. The set of genes expressed in five different tissues (midgut, salivary glands, epidermis, central nervous system and wing disc) studied in the larval stage of <it>D. melanogaster </it>was obtained from Li <it>et al</it>. <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Real-time RT-PCR analysis</p>
            </st>
            <p>Reverse transcription reactions with RNA independently isolated in Trizol from all mutant alleles and the reference were used to synthesize cDNA with M-MLV RT (Invitrogen Corp., Carlsbad, CA, USA) according to the manufacturer instructions. We used 1 <it>&#956;</it>l of a 1/10 dilution of cDNA on each quantitative real-time PCR (qRT-PCR). The qRT-PCR was performed on the ABI PRISM<sup>&#174; </sup>7700 following the recommended protocol (Applied Biosystems, Foster City, CA, USA). Each sample was replicated three times and average values were used for further analysis. Data were analyzed by the &#916;&#916;C<sub>T </sub>method, being normalized by subtracting the value of the control gene <it>dia</it>. TaqMan primers and probes designed and synthesized by Applied Biosystems for this analysis were: Dm02371023_s1 (CG30458); Dm02366349_s1 (CG30457); Dm02366353_s1 (CG10953); Dm01792445_s1 (CG14567); Dm01792458_s1 (CG14572); Dm01792469_s1 (CG14565); Dm01792478_s1 (CG14564); and Dm01811206_g1 (<it>dia</it>).</p>
         </sec>
         <sec>
            <st>
               <p>Chromatin immunoprecipitation</p>
            </st>
            <p>ChIP assays were essentially performed as previously described by Papp and Muller <abbrgrp><abbr bid="B98">98</abbr></abbrgrp> with the following changes: 40 wandering third instar larvae were fixed with 1.8% formaldehyde solution for 25 minutes at room temperature, collected in 700 <it>&#956;</it>l of lysis buffer (1% SDS, 50 mM Tris HCl pH 8.0 and 10 mM EDTA) and disrupted 6 times for 20 seconds at 30% of a Branson sonifier. Samples were centrifuged at top speed at 4&#176;C and aliquotes of 100 <it>&#956;</it>L of extract were used per immunoprecipitation. Ten percent of the immunoprecipitated extract was used as input, and a sample immunoprecipitated without antibody as a precipitation control. A 300 bp fragment corresponding to the transcription start of the <it>Ultrabithorax </it>gene, which has been previously described to present a TRX binding site <abbrgrp><abbr bid="B98">98</abbr><abbr bid="B99">99</abbr></abbrgrp>, was used as a positive control. TRX antibody was a gift from A Mazo and H3K4me3 antibody was obtained from Abcam Inc. The primers used for the PCR amplifications were: Ubx forward, 5' CATGCCCAGCGAGAGAGG 3'; Ubx reverse, 5' AACAGCACAGAAAGCGAGG 3'; cluster9 forward, 5' ACCCACTTTTGCGCCATCG 3'; cluster9 reverse, 5' ACAAAGCGGTTCCGTGTCG 3'; cluster25 forward, 5' ACGTCTGGCTATGGATCTGG 3'; cluster25 reverse, 5' GGACACCGATGTGACCACC 3'.</p>
         </sec>
         <sec>
            <st>
               <p>Gene mapping in the genome</p>
            </st>
            <p>All gene sets were mapped in the genome of <it>D. melanogaster </it>using the RefSeq track of the UCSC genome browser <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> annotations (genome assembly dm2, April 2004). We considered only one transcript per gene (the first one present in the annotations). The file refGene.txt was used to retrieve the coordinates of each gene in RefSeq. The official name and the description of each gene were retrieved from the file refLink.txt. The statistics of gene distribution on each chromosome were computed using the 19,670 unique RefSeq genes contained in the file refGene.txt as well. To measure the statistical significance of the gene distribution, we randomly generated 1,000 rounds of sets of genes with the same size to evaluate the <it>p</it>-value of each result using a hypergeometric distribution. We used the BLAT alignments between the RefSeq genes from <it>D. melanogaster </it>and the genomes of <it>D. simulans</it>, <it>D. yakuba </it>and <it>D. pseudoobscura </it>to produce a catalogue of genes for these species (Additional data file 18). On each genome distribution, the file xenoRefGene.txt contains the best BLAT hits of the genes of other species in that genome <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. We selected only the best hit for each RefSeq gene from <it>D. melanogaster </it>in the corresponding genome distributions of <it>D. simulans</it>, <it>D. yakuba </it>and <it>D. pseudoobscura</it>. The sets of up- and down-regulated genes in the <it>trx </it>microarray were then mapped in those genomes using the appropriate catalogue of genes. We used the list of orthologous genes between <it>D. melanogaster </it>and <it>A. gambiae </it>provided by the Biomart tool <abbrgrp><abbr bid="B100">100</abbr></abbrgrp> from ENSEMBL. The association between the ENSEMBL gene identifier and the official gene name in the fruit fly genome was used to cross this information with the set of RefSeq genes of fly annotated in the UCSC.</p>
         </sec>
         <sec>
            <st>
               <p>Cluster identification</p>
            </st>
            <p>We define a cluster as a group of neighboring genes located in a limited region of the genome, not necessarily consecutive, that show the same expression pattern (up-regulation or down-regulation) in the microarray experiment. Clusters are determined by the physical position in the genome, the number of co-expressed genes inside and the total number of genes in such a genomic fragment. The length of the clusters can be then defined in terms of nucleotides or number of genes annotated on that region. We follow two similar approaches to computationally detect the clusters in the sets of genes in our work: the program REEF and our own implementation of a clustering program. No significant difference was observed between the results obtained with both computational approaches.</p>
            <p>To evaluate the significance of the number of clusters identified in our microarray, we generated 100 randomly generated sets with the same size and chromosomal gene distribution of the real sets of genes regulated by TRX. We then performed the clustering analysis in such data sets to find out how many clusters could be obtained by chance (expected clusters). The Z-score of the results observed in our microarray was thus calculated using the distribution of clusters identified in the random sets:</p>
            <p>
               <display-formula>
                  <m:math name="gb-2008-9-9-r134-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>z</m:mi>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>x</m:mi>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mi>&#956;</m:mi>
                              </m:mrow>
                              <m:mi>&#963;</m:mi>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8GiVeY=Pipec8Eeeu0xXdbba9frFj0xb9Lqpepeea0xd9q8qiYRWxGi6xij=hbbc9s8aq0=yqpe0xbbG8A8frFve9Fve9Fj0dmeaabaqaciGacaGaaeqabaqabeGadaaakeaacaWG6bGaeyypa0tcfa4aaSaaaeaacaWG4bGaeyOeI0IaeqiVd0gabaGaeq4Wdmhaaaaa@384B@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>x </it>is the number of observed clusters in the <it>trx </it>microarray, and <it>&#956; </it>and <it>&#963; </it>are the mean and the standard deviation of the distribution of number of clusters found in the random sets, respectively.</p>
            <p>To evaluate whether regulation by TRX imposes a stronger clustering tendency compared to gene size, we extracted all genes from the fruit fly genome whose size is within the range delimited by the size observed in genes from <it>trx </it>clusters (average size 946 bp, standard deviation 788 bp). We thus mapped the 6,626 genes that match this condition on the genome to get their coordinates and identified 107 clusters, using the REEF program. In order to assess the statistical significance of the clustering (and to measure which parameter, <it>trx </it>regulation or gene size, is more discriminant in the clustering), we repeated the analysis on 100 sets of genes that were randomly selected, preserving the gene distribution observed in the dataset of short genes. As an additional control, we also counted how many clusters can be identified in 100 different random sets of 260 genes (number of up-regulated genes) and 275 genes (number of down-regulated genes), which preserve the same gene size distribution observed in both lists of deregulated genes in our <it>trx </it>microarray (average and standard deviation).</p>
            <p>The program REEF <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> identifies regions of a genome enriched in specific features, compared with a reference landscape of feature density. The two input files are: the list of reference features (genes annotated in the fly genome) mapped on a genome sequence; and the list of selected features among the reference set (genes regulated by TRX in our microarray) with their genomic positions and the number and the length of the chromosomes in the genome under consideration. REEF scans the reference genome using a sliding window approach, calculating the statistical significance of each window using the hypergeometric distribution and the FDR. The windows are defined as fragments of a fixed genomic length. Consecutive significant windows form a cluster of regional enriched features. Using an approach similar to that used by Spellman and Rubin <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, we defined the optimal set of parameters comparing the growth function in the number of clusters obtained in random sets and in the real reference set for different values (Additional data file 19). The optimal configuration of parameters to scan the genome of <it>D. melanogaster </it>was: window length, 25,000 bp; window step, 1,000 bp; minimal number of co-expressed genes, 3; q-value &#8804;0.05s.</p>
         </sec>
         <sec>
            <st>
               <p>Cluster identification based on gene density</p>
            </st>
            <p>We have also implemented our own approach in Perl to cluster the genes regulated by a certain gene. Basically, we scan the genome following a sliding window approach combined with the hypergeometric distribution to evaluate the statistical significance of the clusters. The length of the windows is determined, however, in terms of genes rather than nucleotides. Thus, the clustering process is not limited by a fixed window length. Some parameters in this algorithm must be adjusted in order to maximize the sensibility and specificity. Firstly, only clusters of at least three genes showing the same sense of deregulation were considered. Another important parameter is the size of the window to apply. Using the results in random sets, a window size of seven genes was selected as the optimal length (the number of clusters found in <it>trx </it>data seems to grow logarithmically with the size of the window used, while the number of clusters found in random sets grows exponentially). We used our algorithm to scan the chromosomes of <it>D. melanogaster </it>using window lengths of seven genes, advancing one gene between two instances of the window so that all possible seven-gene windows were tested. To avoid multiple testing problems, we took a conservative <it>p</it>-value threshold of 10<sup>-3</sup>. Consecutive statistically significant windows were merged up in only one cluster. Results obtained with our clustering strategy are highly concordant to those produced by the REEF program (Additional data file 3): 27 clusters were detected (22 identical clusters, 2 clusters with additional genes, 3 new clusters and 1 missing cluster).</p>
         </sec>
         <sec>
            <st>
               <p>Clustering characterization and comparison</p>
            </st>
            <p>We computed different values on the genes that constitute the clusters (using the file refGene.txt downloaded from the UCSC genome browser): the average gene length, the gene length distribution, the number of bidirectional/opposed genes on each cluster according to their strand, and the average intergenic distance. Such values were compared to those observed in the whole set of genes annotated in the fruit fly genome, to evaluate their significance. The comparison between the clusters of genes regulated by TRX and the clusters regulated by other chromatin regulators was performed at three different levels: common genes in the transcriptomic maps, common genes in clusters and genomic position of the clusters. The intersection between two transcriptomes is defined as the quotient between the number of common genes (twice) and the total number of genes in both experiments. Two clusters from two different microarrays are matching if and only if they are overlapping in at least one commonly deregulated gene. Thus, we calculated the clusters identified in a second experiment (<it>ash2</it>, <it>Nurf</it>, <it>dmyc</it>, <it>ash1</it>) that were supported by another cluster in the <it>trx </it>microarray results. Once a set of common clusters was identified in two microarrays, we computed the percentage of co-expressed genes that was present in each pair of equivalent clusters (the ratio between the number of common genes and the total number of genes on each cluster).</p>
            <p>Two clusters of genes identified in two different genomes are considered to be equivalent when the percentage of genes that are present in both clusters is 50% or higher. To calculate the percentage of genes that are common in two clusters, the genes that are located in a different relative order within the clusters are considered to be conserved. Clusters of genes that are conserved in different strands of the chromosomes are considered to be equivalent as well (allowing for chromosomal inversions in the flanking regions). The length of the left and right flanking areas of each cluster is equal to the number of genes of the corresponding cluster. To measure the statistical significance of these results, we randomly sampled 10,000 artificial clusters of seven genes in <it>D. melanogaster </it>(the average size of our clusters is 6.7 genes) and found the location of the equivalent cluster on the other genomes. We examined the number of genes shared between each artificial cluster of genes in <it>D. melanogaster </it>and its equivalent cluster conserved on each of the other <it>Drosophila </it>species. Each artificial cluster on each genome was constituted of the corresponding gene and the three genes annotated before and after in the same location (the average length of our clusters is 6.7 genes). The results are shown in Table <tblr tid="T6">6</tblr>. The conservation of the artificial clusters in terms of common genes is more similar to the conservation observed in the flanking area of our clusters (90% in <it>D. simulans</it>, 79% in <it>D. yakuba</it>, 65% in <it>D. pseudoobscura</it>). In fact, the difference between the number of conserved genes inside our clusters and inside the artificial clusters is significant in all the species (<it>D. simulans p</it>-value &lt;10<sup>-3</sup>, <it>D. yakuba p</it>-value &lt;10<sup>-3</sup>, <it>D. pseudoobscura p</it>-value &lt;10<sup>-6</sup>; Wilcoxon test). Similar results were obtained in the set of clusters detected in <it>A. gambiae </it>(Table <tblr tid="T6">6</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Gene co-expression in clusters</p>
            </st>
            <p>Hooper <it>et al</it>. <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> generated whole-genome expression data for the first 24 h of embryonic <it>D. melanogaster </it>development by extracting RNA from overlapping 1 h time points for the first 6.5 h of development and non-overlapping 1 h time points for the rest. We generated a table containing all pairwise Pearson's correlation coefficients between all genes expressed between 4 and 24 h in the study by Hooper <it>et al</it>. <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. For each cluster identified in the <it>trx </it>mutants, we computed the mean of all pairwise correlation coefficients between the genes constituting the cluster to assess if they are co-expressed along the embryonic development. We also calculated the mean, including all the genes included in the boundaries of each cluster. As a reference set, we computed the same value for each set of correlative N genes in the genome (2 &#8804; N &#8804; 15). We then evaluated if each particular cluster was located in any of these tails and the total percentage of clusters that were found in these tails.</p>
         </sec>
         <sec>
            <st>
               <p>Functional annotations</p>
            </st>
            <p>We downloaded the GO terms and relationships (file gene_ontology_edit.obo, release 1.2) and the associations between gene products from <it>D. melanogaster </it>and GO terms (file gene_association.fb, release 1.94) from the GO web site <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. We annotated different gene sets (full set of genes in the <it>trx </it>microarray, the up-regulated and down-regulated genes in the same experiment, the clusters identified in such co-expression sets) using the annotation available at level 3 of the 'molecular function' ontology tree. We climbed up in the ontology to obtain the corresponding parent term at level 3 for those genes that were annotated at deeper levels in the hierarchical tree. The statistical analysis was performed using hypergeometric distribution to test the probability of observing a given GO term significantly enriched in genes belonging to such clusters:</p>
            <p>
               <display-formula>
                  <m:math name="gb-2008-9-9-r134-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mrow>
                              <m:mi>p</m:mi>
                              <m:mo>=</m:mo>
                              <m:mstyle displaystyle="true">
                                 <m:munderover>
                                    <m:mo>&#8721;</m:mo>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mi>n</m:mi>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>m</m:mi>
                                       <m:mi>i</m:mi>
                                       <m:mi>n</m:mi>
                                       <m:mrow>
                                          <m:mo>[</m:mo>
                                          <m:mrow>
                                             <m:mi>k</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>a</m:mi>
                                          </m:mrow>
                                          <m:mo>]</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:munderover>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtable>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mi>A</m:mi>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mi>i</m:mi>
                                                      </m:mtd>
                                                   </m:mtr>
                                                </m:mtable>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtable>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mi>G</m:mi>
                                                         <m:mo>&#8722;</m:mo>
                                                         <m:mi>A</m:mi>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mi>k</m:mi>
                                                         <m:mo>&#8722;</m:mo>
                                                         <m:mi>i</m:mi>
                                                      </m:mtd>
                                                   </m:mtr>
                                                </m:mtable>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtable>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mi>G</m:mi>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr>
                                                      <m:mtd>
                                                         <m:mi>k</m:mi>
                                                      </m:mtd>
                                                   </m:mtr>
                                                </m:mtable>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:mstyle>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8GiVeY=Pipec8Eeeu0xXdbba9frFj0xb9Lqpepeea0xd9q8qiYRWxGi6xij=hbbc9s8aq0=yqpe0xbbG8A8frFve9Fve9Fj0dmeaabaqaciGacaGaaeqabaqabeGadaaakeaabaGaamiCaiaad2dadaaeWbqcfayaamaalaaabaWaaeWaaeaaeaGabeaacaWGbbaabaGaamyAaaaaaiaawIcacaGLPaaadaqadaqaaqaaceqaaiaadEeacqGHsislcaWGbbaabaGaam4AaiabgkHiTiaadMgaaaaacaGLOaGaayzkaaaabaWaaeWaaeaaeaGabeaacaWGhbaabaGaam4AaaaaaiaawIcacaGLPaaaaaaaleaacaWGPbGaamypaiaad6gaaeaacaWGTbGaamyAaiaad6gadaWadaqaaiaadUgacaWGSaGaamyyaaGaay5waiaaw2faaaqdcqGHris5aaaaaa@4C3C@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>In this case, <it>G </it>is the number of up- or down-regulated genes, <it>A </it>is the subset of <it>G </it>that are annotated in that GO term, <it>k </it>is the number of genes up- or down-regulated in clusters, and <it>n </it>is the subset of <it>k </it>annotated in the GO term.</p>
            <p>We performed a whole-genome clustering of the genes from <it>D. melanogaster </it>annotated with the hierarchies based on the term structural constituents of cuticle (GO:0042302). No data about gene expression in the microarrays were used here. We first extracted the fly genes annotated with the following GO terms: structural constituent of cuticle (GO:0042302); structural constituent of chitin-based cuticle (GO:0005214); structural constituent of chitin-based larval cuticle (GO:0008010); structural constituent of pupal chitin-based cuticle (GO:0008011); structural constituent of adult chitin-based cuticle (GO:0008012); and structural constituent of collagen and cuticulin-based cuticle (GO:0042329). Then, we used the program REEF to identify clustering organization events in this gene set (the same configuration of parameters set in the case of co-expressed genes). We also performed a whole-genome clustering of the genes from <it>D. melanogaster </it>annotated with the hierarchies based on the term chitin binding (GO:0008061). We first extracted the fly genes annotated with the following GO terms: chitin binding (GO:0008061); polysaccharide binding (GO:0030247); carbohydrate binding (GO:0030246); and pattern binding (GO:0001871). Then, we used the program REEF to identify clustering organization events in this gene set.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>BR-C, Broad-Complex; FDR, false discovery rate; GO, Gene Ontology; HCNE, highly conserved non-coding element; Pc, Polycomb protein; PcG, polycomb group; PRE, Polycomb response element; qRT-PCR, quantitative real-time PCR; TRE, Trithorax response element; trxG, trithorax group.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>EB, MP and RG conceived the bioinformatics experiments. EB, MP and SB performed the bionformatics analysis. SB, AP and MC designed the microarray experiments. SB performed the microarray analysis. AP and SP performed the qRT-PCR experiments. SP designed and performed the chromatin immunoprecipitation experiments. FS and MC contributed reagents/material/analysis tools. EB, MP, SB, RG and MC wrote the paper.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data files are available. Additional data file <supplr sid="S1">1</supplr> lists the up- and down-regulated genes in the <it>trx </it>microarray. Additional data file <supplr sid="S2">2</supplr> is a table of general features of the clusters of genes deregulated by TRX. Additional data file <supplr sid="S3">3</supplr> is a table of the clusters detected using our own clustering approach. Additional data file <supplr sid="S4">4</supplr> contains the results of clusters detected in random gene sets (gene distribution). Additional data file <supplr sid="S5">5</supplr> contains the results of clusters detected in random gene sets (gene size). Additional data file <supplr sid="S6">6</supplr> is a graphical representation of the different clusters detected in the genome that are deregulated by several chromatin remodelers. Additional data file <supplr sid="S7">7</supplr> lists up- and down-regulated genes in the <it>ash2 </it>microarray. Additional data file <supplr sid="S8">8</supplr> is a table of the average lengths of the deregulated genes on each microarray (clustered or not). Additional data file <supplr sid="S9">9</supplr> is a table that shows the results of the clustering analysis in other microarrays of transcription factors. Additional data file <supplr sid="S10">10</supplr> is a table that shows the average gene size of deregulated genes in the microarrays analyzed in this study. Additional data file <supplr sid="S11">11</supplr> shows the cumulative distribution Pearson correlation coefficient means in the real and the artificial clusters. Additional data file <supplr sid="S12">12</supplr> contains the GO functional annotation of the <it>trx </it>clusters. Additional data file <supplr sid="S13">13</supplr> is a graphical representation of the clusters of genes in the genome associated with similar categories (cuticle/chitin binding). Additional data file <supplr sid="S14">14</supplr> is a table that shows the intersection between the <it>trx </it>clusters, the ChIP-on-chip information and the PRE/TRE predictions. Additional data file <supplr sid="S15">15</supplr> shows the results of the chromatin immunoprecipitation using anti-TRX and H3K4me3 specific antibodies to test the binding of TRX to the predicted PRE/TREs in clusters 9 and 25. Additional data file <supplr sid="S16">16</supplr> shows several examples of clusters and transfrags that detect similar patterns of expression. Additional data file <supplr sid="S17">17</supplr> is a graphical genome-wide representation of the clusters of <it>trx </it>and the HCNEs mapped in several <it>Drosophila </it>species. Additional data file <supplr sid="S18">18</supplr> is a table that contains catalogs of orthologous genes between <it>D. melanogaster </it>and the other <it>Drosophila </it>species or mosquito. Additional data file <supplr sid="S19">19</supplr> is an image that represents the optimal window length to discriminate between real and artificial clusters. Additional data file <supplr sid="S20">20</supplr> is a table that shows the results of an alternative overlap analysis among the clusters of genes regulated by different chromatin remodelers (only genes affected in the same way are considered).</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Up- and down-regulated genes in the <it>trx </it>microarray</p>
            </caption>
            <text>
               <p>Up- and down-regulated genes in the <it>trx </it>microarray.</p>
            </text>
            <file name="gb-2008-9-9-r134-S1.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>General features of the clusters of genes deregulated by TRX</p>
            </caption>
            <text>
               <p>General features of the clusters of genes deregulated by TRX.</p>
            </text>
            <file name="gb-2008-9-9-r134-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Clusters detected using our own clustering approach</p>
            </caption>
            <text>
               <p>Clusters detected using our own clustering approach.</p>
            </text>
            <file name="gb-2008-9-9-r134-S3.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Clusters detected in random gene sets (gene distribution)</p>
            </caption>
            <text>
               <p>Clusters detected in random gene sets (gene distribution).</p>
            </text>
            <file name="gb-2008-9-9-r134-S4.jpeg">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Clusters detected in random gene sets (gene size)</p>
            </caption>
            <text>
               <p>Clusters detected in random gene sets (gene size).</p>
            </text>
            <file name="gb-2008-9-9-r134-S5.jpeg">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>Different clusters detected in the genome that are deregulated by several chromatin remodelers</p>
            </caption>
            <text>
               <p>Different clusters detected in the genome that are deregulated by several chromatin remodelers.</p>
            </text>
            <file name="gb-2008-9-9-r134-S6.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>Up- and down-regulated genes in the <it>ash2 </it>microarray</p>
            </caption>
            <text>
               <p>Up- and down-regulated genes in the <it>ash2 </it>microarray.</p>
            </text>
            <file name="gb-2008-9-9-r134-S7.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional data file 8</p>
            </title>
            <caption>
               <p>Average lengths of the deregulated genes on each microarray (clustered or not)</p>
            </caption>
            <text>
               <p>Average lengths of the deregulated genes on each microarray (clustered or not).</p>
            </text>
            <file name="gb-2008-9-9-r134-S8.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S9">
            <title>
               <p>Additional data file 9</p>
            </title>
            <caption>
               <p>Clustering analysis in other microarrays of transcription factors</p>
            </caption>
            <text>
               <p>Clustering analysis in other microarrays of transcription factors.</p>
            </text>
            <file name="gb-2008-9-9-r134-S9.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S10">
            <title>
               <p>Additional data file 10</p>
            </title>
            <caption>
               <p>Average gene size of deregulated genes in the microarrays analyzed in this study</p>
            </caption>
            <text>
               <p>Average gene size of deregulated genes in the microarrays analyzed in this study.</p>
            </text>
            <file name="gb-2008-9-9-r134-S10.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S11">
            <title>
               <p>Additional data file 11</p>
            </title>
            <caption>
               <p>Cumulative distribution Pearson correlation coefficient means in the real and the artificial clusters</p>
            </caption>
            <text>
               <p>Cumulative distribution Pearson correlation coefficient means in the real and the artificial clusters.</p>
            </text>
            <file name="gb-2008-9-9-r134-S11.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S12">
            <title>
               <p>Additional data file 12</p>
            </title>
            <caption>
               <p>GO functional annotation of the <it>trx </it>clusters</p>
            </caption>
            <text>
               <p>GO functional annotation of the <it>trx </it>clusters.</p>
            </text>
            <file name="gb-2008-9-9-r134-S12.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S13">
            <title>
               <p>Additional data file 13</p>
            </title>
            <caption>
               <p>Clusters of genes in the genome associated with similar categories (cuticle/chitin binding)</p>
            </caption>
            <text>
               <p>Clusters of genes in the genome associated with similar categories (cuticle/chitin binding).</p>
            </text>
            <file name="gb-2008-9-9-r134-S13.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S14">
            <title>
               <p>Additional data file 14</p>
            </title>
            <caption>
               <p>Intersection between the <it>trx </it>clusters, the ChIP-on-chip information and the PRE/TRE predictions</p>
            </caption>
            <text>
               <p>Intersection between the <it>trx </it>clusters, the ChIP-on-chip information and the PRE/TRE predictions.</p>
            </text>
            <file name="gb-2008-9-9-r134-S14.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S15">
            <title>
               <p>Additional data file 15</p>
            </title>
            <caption>
               <p>Chromatin immunoprecipitation using anti-TRX and H3K4me3 specific antibodies to test the binding of TRX to the predicted PRE/TREs in clusters 9 and 25</p>
            </caption>
            <text>
               <p>Chromatin immunoprecipitation using anti-TRX and H3K4me3 specific antibodies to test the binding of TRX to the predicted PRE/TREs in clusters 9 and 25.</p>
            </text>
            <file name="gb-2008-9-9-r134-S15.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S16">
            <title>
               <p>Additional data file 16</p>
            </title>
            <caption>
               <p>Examples of clusters and transfrags that detect similar patterns of expression</p>
            </caption>
            <text>
               <p>Examples of clusters and transfrags that detect similar patterns of expression.</p>
            </text>
            <file name="gb-2008-9-9-r134-S16.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S17">
            <title>
               <p>Additional data file 17</p>
            </title>
            <caption>
               <p>Graphical genome-wide representation of the clusters of <it>trx </it>and the HCNEs mapped in several <it>Drosophila </it>species</p>
            </caption>
            <text>
               <p>Graphical genome-wide representation of the clusters of <it>trx </it>and the HCNEs mapped in several <it>Drosophila </it>species.</p>
            </text>
            <file name="gb-2008-9-9-r134-S17.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S18">
            <title>
               <p>Additional data file 18</p>
            </title>
            <caption>
               <p>Orthologous genes between <it>D. melanogaster </it>and the other <it>Drosophila </it>species or mosquito</p>
            </caption>
            <text>
               <p>Orthologous genes between <it>D. melanogaster </it>and the other <it>Drosophila </it>species or mosquito.</p>
            </text>
            <file name="gb-2008-9-9-r134-S18.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S19">
            <title>
               <p>Additional data file 19</p>
            </title>
            <caption>
               <p>The optimal window length to discriminate between real and artificial clusters</p>
            </caption>
            <text>
               <p>The optimal window length to discriminate between real and artificial clusters.</p>
            </text>
            <file name="gb-2008-9-9-r134-S19.jpeg">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S20">
            <title>
               <p>Additional data file 20</p>
            </title>
            <caption>
               <p>Alternative overlap analysis among the clusters of genes regulated by different chromatin remodelers (only genes affected in the same way are considered)</p>
            </caption>
            <text>
               <p>Alternative overlap analysis among the clusters of genes regulated by different chromatin remodelers (only genes affected in the same way are considered).</p>
            </text>
            <file name="gb-2008-9-9-r134-S20.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank A Shearn and T Breen for <it>trx </it>alleles, and the Program for Genomic Applications at The Institute for Genomic Research for providing us with <it>A. thaliana </it>spike-in controls. We thank A Mazo for the TRX antibody. We thank F Casares and M Rehmsmeier for insightful discussions, and A Carbonell for interesting suggestions. We are grateful to JF Abril for helpful assistance with the program GFF2PS, A Coppe for introducing slight modifications in the REEF program to identify small clusters and B Lenhard for providing information about the genomic regulatory blocks in insects. We also thank the Plataforma de Transcript&#242;mica of the Universitat de Barcelona and the Instituto Nacional de Bioinform&#225;tica, Spain, for analysis support. E Blanco and M Pignatelli were supported by Juan de la Cierva postdoctoral contracts from the Ministerio de Educaci&#243;n y Ciencia (MEC), Spain. S Beltran was supported by a FPI fellowship, A Punset by grant BFU05-24129-E and S P&#233;rez-Lluch by grant GEN2006-28564-E from MEC. This project was funded by grants BMC2003-05018 and BMC2006-07334 from MEC, Spain.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Perceptions of epigenetics.</p>
            </title>
            <aug>
               <au>
                  <snm>Bird</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>447</volume>
            <fpage>396</fpage>
            <lpage>398</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17522671</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The transcriptional regulatory code of eukaryotic cells - insights from genome-wide analysis of chromatin organization and transcription factor binding.</p>
            </title>
            <aug>
               <au>
                  <snm>Barrera</snm>
                  <fnm>LO</fnm>
               </au>
               <au>
                  <snm>Ren</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Curr Opin Cell Biol</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <fpage>291</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16647254</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The complex language of chromatin regulation during transcription.</p>
            </title>
            <aug>
               <au>
                  <snm>Berger</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>447</volume>
            <fpage>407</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">17522673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The role of chromatin during transcription.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Carey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Workman</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>128</volume>
            <fpage>707</fpage>
            <lpage>719</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17320508</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Chromatin modifications and their function.</p>
            </title>
            <aug>
               <au>
                  <snm>Kouzarides</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>128</volume>
            <fpage>693</fpage>
            <lpage>705</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17320507</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Gene regulation by chromatin structure: paradigms established in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Schulze</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Wallrath</snm>
                  <fnm>LL</fnm>
               </au>
            </aug>
            <source>Annu Rev Entomol</source>
            <pubdate>2007</pubdate>
            <volume>52</volume>
            <fpage>171</fpage>
            <lpage>192</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16881818</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The chromatin domain as a unit of gene regulation.</p>
            </title>
            <aug>
               <au>
                  <snm>Goldman</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>1988</pubdate>
            <volume>9</volume>
            <fpage>50</fpage>
            <lpage>55</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3066357</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The evolutionary dynamics of eukaryotic gene order.</p>
            </title>
            <aug>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>P&#225;l</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>299</fpage>
            <lpage>310</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15131653</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Life with 6000 genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Goffeau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Bussey</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Feldmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Galibert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hoheisel</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacq</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Louis</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Murakami</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Philippsen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1996</pubdate>
            <volume>274</volume>
            <fpage>546, 563</fpage>
            <lpage>567</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Genome sequence of the nematode <it>C. elegans</it>: a platform for investigating biology.</p>
            </title>
            <aug>
               <au>
                  <cnm><it>C. elegans </it>Sequencing Consortium</cnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>282</volume>
            <fpage>2012</fpage>
            <lpage>2018</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">9851916</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The genome sequence of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Celniker</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Amanatides</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Hoskins</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Galle</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>George</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Yandell</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>LX</fnm>
               </au>
               <au>
                  <snm>Brandon</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Blazej</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Champe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pfeiffer</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Wan</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Baxter</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Helt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>CR</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <fpage>2185</fpage>
            <lpage>2195</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10731132</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Finishing the euchromatic sequence of the human genome.</p>
            </title>
            <aug>
               <au>
                  <cnm>International Human Genome Sequencing Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>931</fpage>
            <lpage>945</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15496913</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Non-random clustering of stress-related genes during evolution of the <it>S. cerevisiae </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Burhans</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Ramachandran</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Patterton</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Breitenbach</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Burhans</snm>
                  <fnm>WC</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>58</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1550265</pubid>
                  <pubid idtype="pmpid" link="fulltext">16859541</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Cohen</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Mitra</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>26</volume>
            <fpage>183</fpage>
            <lpage>186</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11017073</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Chromosomal clustering of muscle-expressed genes in <it>Caenorhabditis elegans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Roy</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>418</volume>
            <fpage>975</fpage>
            <lpage>979</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12214599</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A global analysis of <it>Caenorhabditis elegans </it>operons.</p>
            </title>
            <aug>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Link</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Guffanti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kiraly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>417</volume>
            <fpage>851</fpage>
            <lpage>854</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12075352</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The human transcriptome map: clustering of highly expressed genes in chromosomal domains.</p>
            </title>
            <aug>
               <au>
                  <snm>Caron</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>van Schaik</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mee</snm>
                  <mnm>van der</mnm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baas</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Riggins</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>van Sluis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hermus</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>van Asperen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Boon</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Voute</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Heisterkamp</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>van Kampen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Versteeg</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>291</volume>
            <fpage>1289</fpage>
            <lpage>1292</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11181992</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Chromosomal clustering of a human transcriptome reveals regulatory background.</p>
            </title>
            <aug>
               <au>
                  <snm>Vogel</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>von Heydebreck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Purmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sperling</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>230</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1261156</pubid>
                  <pubid idtype="pmpid" link="fulltext">16171528</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Genome-wide transcriptional orchestration of circadian rhythms in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ueda</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Matsumoto</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kawamura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Iino</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tanimura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hashimoto</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <fpage>14048</fpage>
            <lpage>14052</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11854264</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Evidence for large domains of similarly expressed genes in the <it>Drosophila </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>J Biol</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <fpage>5</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117248</pubid>
                  <pubid idtype="pmpid" link="fulltext">12144710</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Large clusters of co-expressed genes in the <it>Drosophila </it>genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Boutanaev</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Kalmykova</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Shevelyov</snm>
                  <fnm>YY</fnm>
               </au>
               <au>
                  <snm>Nurminsky</snm>
                  <fnm>DI</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <fpage>666</fpage>
            <lpage>669</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12478293</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Regulated chromatin domain comprising cluster of co-expressed genes in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Kalmykova</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Nurminsky</snm>
                  <fnm>DI</fnm>
               </au>
               <au>
                  <snm>Ryzhov</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Shevelyov</snm>
                  <fnm>YY</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>1435</fpage>
            <lpage>1444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1062873</pubid>
                  <pubid idtype="pmpid" link="fulltext">15755746</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Genomic analysis of <it>Drosophila </it>chromosome underreplication reveals a link between replication control and transcriptional territories.</p>
            </title>
            <aug>
               <au>
                  <snm>Belyakin</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Christophides</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Alekseyenko</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Kriventseva</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Belyaeva</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Nanayev</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Makunin</snm>
                  <fnm>IV</fnm>
               </au>
               <au>
                  <snm>Kafatos</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Zhimulev</snm>
                  <fnm>IF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>8269</fpage>
            <lpage>8274</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1149430</pubid>
                  <pubid idtype="pmpid" link="fulltext">15928082</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Epigenetic regulation of cellular memory by the Polycomb and Trithorax group proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Ringrose</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Paro</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>38</volume>
            <fpage>413</fpage>
            <lpage>443</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15568982</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Polycomb silencing mechanisms and the management of genomic programmes.</p>
            </title>
            <aug>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>YB</fnm>
               </au>
               <au>
                  <snm>Pirrotta</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>9</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17173055</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Polycomb/Trithorax response elements and epigenetic memory of cell identity.</p>
            </title>
            <aug>
               <au>
                  <snm>Ringrose</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Paro</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2007</pubdate>
            <volume>134</volume>
            <fpage>223</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17185323</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Genome regulation by polycomb and trithorax proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Schuettengruber</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Chourrout</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Vervoort</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Leblanc</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Cavalli</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>128</volume>
            <fpage>735</fpage>
            <lpage>745</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17320510</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Gene expression during the life cycle of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Arbeitman</snm>
                  <fnm>MN</fnm>
               </au>
               <au>
                  <snm>Furlong</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Imam</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Null</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Krasnow</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>297</volume>
            <fpage>2270</fpage>
            <lpage>2275</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12351791</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Patterns of gene expression during <it>Drosophila </it>mesoderm development.</p>
            </title>
            <aug>
               <au>
                  <snm>Furlong</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Null</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>MP</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>293</volume>
            <fpage>1629</fpage>
            <lpage>1633</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11486054</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Identification of tightly regulated groups of genes during <it>Drosophila melanogaster </it>embryogenesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Hooper</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Bou&#233;</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Krause</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Ghanim</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Furlong</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>72</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1800352</pubid>
                  <pubid idtype="pmpid" link="fulltext">17224916</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Microarray analysis of <it>Drosophila </it>development during metamorphosis.</p>
            </title>
            <aug>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Rifkin</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Hurban</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hogness</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <fpage>2179</fpage>
            <lpage>2184</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10591654</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Transcriptional network controlled by the trithorax-group gene ash2 in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Beltran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Blanco</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Serras</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Perez-Villamil</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Artavanis-Tsakonas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Corominas</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>3293</fpage>
            <lpage>3298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">152285</pubid>
                  <pubid idtype="pmpid" link="fulltext">12626737</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Functional dissection of the ash2 and ash1 transcriptomes provides insights into the transcriptional basis of wing phenotypes and reveals conserved protein interactions.</p>
            </title>
            <aug>
               <au>
                  <snm>Beltran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Angulo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pignatelli</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Serras</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Corominas</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R67</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1896016</pubid>
                  <pubid idtype="pmpid" link="fulltext">17466076</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Coordinated regulation of Myc trans-activation targets by Polycomb and the Trithorax group protein Ash1.</p>
            </title>
            <aug>
               <au>
                  <snm>Goodliffe</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Wieschaus</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>BMC Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>40</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1887537</pubid>
                  <pubid idtype="pmpid" link="fulltext">17519021</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>trithorax and the regulation of homeotic gene expression in <it>Drosophila </it>: a historical perspective.</p>
            </title>
            <aug>
               <au>
                  <snm>Ingham</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Int J Dev Biol</source>
            <pubdate>1998</pubdate>
            <volume>42</volume>
            <fpage>423</fpage>
            <lpage>429</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9654027</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The histone methyltransferases Trithorax and Ash1 prevent transcriptional silencing by Polycomb group proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Klymenko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>373</fpage>
            <lpage>377</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1299022</pubid>
                  <pubid idtype="pmpid" link="fulltext">15031712</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Database issue</issue>
            <fpage>D61</fpage>
            <lpage>D65</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1716718</pubid>
                  <pubid idtype="pmpid" link="fulltext">17130148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>The UCSC genome browser database: update 2007.</p>
            </title>
            <aug>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Karolchik</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zweig</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Trumbower</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Thakkapallayil</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sugnet</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Stanke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Rhead</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Raney</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Pohl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Diekhans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Database issue</issue>
            <fpage>D668</fpage>
            <lpage>D673</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669757</pubid>
                  <pubid idtype="pmpid" link="fulltext">17142222</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The role of chromatin structure in regulating the expression of clustered genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Sproul</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bickmore</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>775</fpage>
            <lpage>781</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16160692</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Calculating the statistical significance of physical clusters of co-regulated genes in the genome: the role of chromatin in domain-wide gene regulation.</p>
            </title>
            <aug>
               <au>
                  <snm>Chang</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Wai</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Patterton</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>1798</fpage>
            <lpage>1807</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">390345</pubid>
                  <pubid idtype="pmpid" link="fulltext">15034148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>REEF: searching REgionally Enriched Features in genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Coppe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Danieli</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Bortoluzzi</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>453</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1624853</pubid>
                  <pubid idtype="pmpid" link="fulltext">17042935</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Selection for short introns in highly expressed genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Castillo-Davis</snm>
                  <fnm>CI</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Hartl</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>FA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>415</fpage>
            <lpage>418</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12134150</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Identifying clusters of functionally related genes in genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Yi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sze</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Thon</snm>
                  <fnm>MR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>1053</fpage>
            <lpage>1060</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17237058</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Four heat shock proteins of <it>Drosophila melanogaster </it>coded within a 12-kilobase region in chromosome subdivision 67B.</p>
            </title>
            <aug>
               <au>
                  <snm>Corces</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Holmgren</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Freund</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Morimoto</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Meselson</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1980</pubdate>
            <volume>77</volume>
            <fpage>5390</fpage>
            <lpage>5393</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">350064</pubid>
                  <pubid idtype="pmpid">6254079</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>DNA sequences, gene regulation and modular protein evolution in the <it>Drosophila </it>68C glue gene cluster.</p>
            </title>
            <aug>
               <au>
                  <snm>Garfinkel</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Meyerowitz</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1983</pubdate>
            <volume>168</volume>
            <fpage>765</fpage>
            <lpage>789</lpage>
            <xrefbib>
               <pubid idtype="pmpid">6411930</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>The <it>Drosophila </it>nucleosome remodeling factor NURF is required for Ecdysteroid signaling and metamorphosis.</p>
            </title>
            <aug>
               <au>
                  <snm>Badenhorst</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Xiao</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherbas</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kwon</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Voas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rebay</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Cherbas</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2005</pubdate>
            <volume>19</volume>
            <fpage>2540</fpage>
            <lpage>2545</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1276728</pubid>
                  <pubid idtype="pmpid" link="fulltext">16264191</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Polycomb mediates Myc autorepression and its transcriptional control of many loci in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Goodliffe</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Wieschaus</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>MD</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2005</pubdate>
            <volume>19</volume>
            <fpage>2941</fpage>
            <lpage>2946</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1315398</pubid>
                  <pubid idtype="pmpid" link="fulltext">16357214</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Trans-regulation of thoracic homeotic selector genes of the Antennapedia and bithorax complexes by the trithorax group genes: absent, small, and homeotic discs 1 and 2.</p>
            </title>
            <aug>
               <au>
                  <snm>LaJeunesse</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Shearn</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mech Dev</source>
            <pubdate>1995</pubdate>
            <volume>53</volume>
            <fpage>123</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8555105</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>The <it>Drosophila </it>trithorax group proteins BRM, ASH1 and ASH2 are subunits of distinct protein complexes.</p>
            </title>
            <aug>
               <au>
                  <snm>Papoulas</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Beek</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Moseley</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>McCallum</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Sarte</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shearn</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tamkun</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>1998</pubdate>
            <volume>125</volume>
            <fpage>3955</fpage>
            <lpage>3966</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9735357</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>ISWI, a member of the SWI2/SNF2 ATPase family, encodes the 140 kDa subunit of the nucleosome remodeling factor.</p>
            </title>
            <aug>
               <au>
                  <snm>Tsukiyama</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Daniel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tamkun</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1995</pubdate>
            <volume>83</volume>
            <fpage>1021</fpage>
            <lpage>1026</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8521502</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Genes and biological processes controlled by the <it>Drosophila </it>FOXA orthologue Fork head.</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>2008</pubdate>
            <volume>17</volume>
            <fpage>91</fpage>
            <lpage>101</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18353099</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Genome-wide identification of direct targets of the <it>Drosophila </it>retinal determination protein Eyeless.</p>
            </title>
            <aug>
               <au>
                  <snm>Ostrin</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hoffman</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Mardon</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>466</fpage>
            <lpage>476</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1457028</pubid>
                  <pubid idtype="pmpid" link="fulltext">16533912</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Spotted-dick, a zinc-finger protein of <it>Drosophila </it>required for expression of Orc4 and S phase.</p>
            </title>
            <aug>
               <au>
                  <snm>Page</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kovacs</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Deak</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Torok</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kiss</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dario</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bastos</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Batista</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gomes</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ohkura</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Glover</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2005</pubdate>
            <volume>24</volume>
            <fpage>4304</fpage>
            <lpage>4315</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1356331</pubid>
                  <pubid idtype="pmpid" link="fulltext">16369566</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Expression profiling of glial genes during <it>Drosophila </it>embryogenesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Altenhein</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Busold</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Beckmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hoheisel</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Technau</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Dev Biol</source>
            <pubdate>2006</pubdate>
            <volume>296</volume>
            <fpage>545</fpage>
            <lpage>560</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16762338</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Evolutionary conservation of otd/Otx2 transcription factor action: a genome-wide microarray analysis in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Montalta-He</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Leemans</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Loop</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Strahm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Certa</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Primig</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Acampora</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Simeone</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Reichert</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>RESEARCH0015</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">115189</pubid>
                  <pubid idtype="pmpid" link="fulltext">11983056</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Identification of candidate downstream genes for the homeodomain transcription factor Labial in <it>Drosophila </it>through oligonucleotide-array transcript imaging.</p>
            </title>
            <aug>
               <au>
                  <snm>Leemans</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Loop</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Egger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kammermeier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hartmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Certa</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Reichert</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hirth</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>RESEARCH0015</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">32187</pubid>
                  <pubid idtype="pmpid" link="fulltext">11387036</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p><it>Drosophila </it>soluble guanylyl cyclase mutants exhibit increased foraging locomotion: behavioral and genomic investigations.</p>
            </title>
            <aug>
               <au>
                  <snm>Riedl</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Neal</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Robichon</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Westwood</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Sokolowski</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>Behav Genet</source>
            <pubdate>2005</pubdate>
            <volume>35</volume>
            <fpage>231</fpage>
            <lpage>244</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15864439</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Comparative genome sequencing of <it>Drosophila pseudoobscura</it>: chromosomal, gene, and cis-element evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bettencourt</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Hradecky</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Letovsky</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hubisz</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Meisel</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Couronne</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Hua</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bussemaker</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>van Batenburg</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Howells</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Sodergren</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>BB</fnm>
               </au>
               <au>
                  <snm>Crosby</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Schroeder</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Ortiz-Barrientos</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rives</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Metzker</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Muzny</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Steffen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1</fpage>
            <lpage>18</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540289</pubid>
                  <pubid idtype="pmpid" link="fulltext">15632085</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>BLAT&#8212;the BLAST-like alignment tool.</p>
            </title>
            <aug>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>656</fpage>
            <lpage>664</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187518</pubid>
                  <pubid idtype="pmpid" link="fulltext">11932250</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>The genome sequence of the malaria mosquito Anopheles gambiae.</p>
            </title>
            <aug>
               <au>
                  <snm>Holt</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Halpern</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Charlab</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nusskern</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Ribeiro</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Wides</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Loftus</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Yandell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Majoros</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Rusch</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Kraft</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Anthouard</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Arensburger</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Atkinson</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Baden</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>de Berardinis</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Benes</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Biedler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Blass</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bolanos</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Boscus</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barnstead</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>129</fpage>
            <lpage>149</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12364791</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Ensembl 2007.</p>
            </title>
            <aug>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Aken</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Beal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ballester</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cutts</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Fitzgerald</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fernandez-Banet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Graf</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Haider</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Herrero</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kahari</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kokocinski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kulesha</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Megy</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Database issue</issue>
            <fpage>D610</fpage>
            <lpage>D617</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1761443</pubid>
                  <pubid idtype="pmpid" link="fulltext">17148474</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>The Gene Ontology (GO) project in 2006.</p>
            </title>
            <aug>
               <au>
                  <cnm>Gene Ontology Consortium</cnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>Database issue</issue>
            <fpage>D322</fpage>
            <lpage>D326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347384</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381878</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Tissue-specific gene expression and ecdysone-regulated genomic networks in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
            </aug>
            <source>Dev Cell</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
            <fpage>59</fpage>
            <lpage>72</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12852852</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Unusual properties of regulatory DNA from the <it>Drosophila </it>engrailed gene: three "pairing-sensitive" sites within a 1.6-kb region.</p>
            </title>
            <aug>
               <au>
                  <snm>Kassis</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1994</pubdate>
            <volume>136</volume>
            <fpage>1025</fpage>
            <lpage>1038</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1205860</pubid>
                  <pubid idtype="pmpid" link="fulltext">8005412</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Dissecting the regulatory landscape of the Abd-B gene of the bithorax complex.</p>
            </title>
            <aug>
               <au>
                  <snm>Mihaly</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barges</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sipos</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Maeda</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cleard</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hogga</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bender</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Gyurkovics</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Karch</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2006</pubdate>
            <volume>133</volume>
            <fpage>2983</fpage>
            <lpage>2993</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16818450</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Polycomb response elements mediate the formation of chromosome higher-order structures in the bithorax complex.</p>
            </title>
            <aug>
               <au>
                  <snm>Lanzuolo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Roure</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Dekker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bantignies</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Orlando</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Nat Cell Biol</source>
            <pubdate>2007</pubdate>
            <volume>9</volume>
            <fpage>1167</fpage>
            <lpage>1174</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17828248</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Genome-wide prediction of Polycomb/Trithorax response elements in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ringrose</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rehmsmeier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dura</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Paro</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Dev Cell</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
            <fpage>759</fpage>
            <lpage>771</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14602076</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Genome-wide analysis of Polycomb targets in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>YB</fnm>
               </au>
               <au>
                  <snm>Kahn</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Nix</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>XY</fnm>
               </au>
               <au>
                  <snm>Bourgon</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Biggin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pirrotta</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>700</fpage>
            <lpage>705</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16732288</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Genome-wide profiling of PRC1 and PRC2 Polycomb chromatin binding in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Tolhuis</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>de Wit</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Muijrers</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Teunissen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Talhout</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>van Steensel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>van Lohuizen</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>694</fpage>
            <lpage>699</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16628213</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Chromosomal distribution of PcG proteins during <it>Drosophila </it>development.</p>
            </title>
            <aug>
               <au>
                  <snm>N&#232;gre</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hennetin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>LV</fnm>
               </au>
               <au>
                  <snm>Lavrov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Cavalli</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>e170</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1440717</pubid>
                  <pubid idtype="pmpid" link="fulltext">16613483</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Biological function of unannotated transcription during the early development of <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Manak</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Dike</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sementchenko</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Biemar</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Long</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Piccolboni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>1151</fpage>
            <lpage>1158</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16951679</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>A cluster of cuticle protein genes of <it>Drosophila melanogaster </it>at 65A: sequence, structure and evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Charles</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Chihara</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nejad</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Riddiford</snm>
                  <fnm>LM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1997</pubdate>
            <volume>147</volume>
            <fpage>1213</fpage>
            <lpage>1224</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1208245</pubid>
                  <pubid idtype="pmpid" link="fulltext">9383064</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>HOM-C evolution in <it>Drosophila</it>: is there a need for Hox gene clustering?</p>
            </title>
            <aug>
               <au>
                  <snm>Negre</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ruiz</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>55</fpage>
            <lpage>59</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">17188778</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Overexpression of the small mitochondrial Hsp22 extends <it>Drosophila </it>life span and increases resistance to oxidative stress.</p>
            </title>
            <aug>
               <au>
                  <snm>Morrow</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Samson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Michaud</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tanguay</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>FASEB J</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <fpage>598</fpage>
            <lpage>599</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">14734639</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>Expression of the Hsp23 chaperone during <it>Drosophila </it>embryogenesis: association to distinct neural and glial lineages.</p>
            </title>
            <aug>
               <au>
                  <snm>Michaud</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tanguay</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>BMC Dev Biol</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>9</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">293469</pubid>
                  <pubid idtype="pmpid" link="fulltext">14617383</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>A trans-acting regulatory product necessary for expression of the <it>Drosophila melanogaster </it>68C glue gene cluster.</p>
            </title>
            <aug>
               <au>
                  <snm>Crowley</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Mathers</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Meyerowitz</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1984</pubdate>
            <volume>39</volume>
            <fpage>149</fpage>
            <lpage>156</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">6207936</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Sequences sufficient for correct regulation of Sgs-3 lie close to or within the gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Raghavan</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Crosby</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Mathers</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Meyerowitz</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>1986</pubdate>
            <volume>5</volume>
            <fpage>3321</fpage>
            <lpage>3326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1167329</pubid>
                  <pubid idtype="pmpid">3028780</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>The ecdysone-induced puffing cascade in <it>Drosophila </it>salivary glands: a Broad-Complex early gene regulates intermolt and late gene transcription.</p>
            </title>
            <aug>
               <au>
                  <snm>Guay</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Guild</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1991</pubdate>
            <volume>129</volume>
            <fpage>169</fpage>
            <lpage>175</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1204563</pubid>
                  <pubid idtype="pmpid" link="fulltext">1936956</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>The <it>Drosophila </it>broad-complex regulates developmental changes in transcription and chromatin structure of the 67B heat-shock gene cluster.</p>
            </title>
            <aug>
               <au>
                  <snm>Dubrovsky</snm>
                  <fnm>EB</fnm>
               </au>
               <au>
                  <snm>Dretzen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bellard</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>241</volume>
            <fpage>353</fpage>
            <lpage>362</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8064852</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>Downregulation of the tissue-specific transcription factor Fork head by Broad-Complex mediates a stage-specific hormone response.</p>
            </title>
            <aug>
               <au>
                  <snm>Renault</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>King-Jones</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2001</pubdate>
            <volume>128</volume>
            <fpage>3729</fpage>
            <lpage>3737</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11585799</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>Polycomb comes of age: genome-wide profiling of target sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Ringrose</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Curr Opin Cell Biol</source>
            <pubdate>2007</pubdate>
            <volume>19</volume>
            <fpage>290</fpage>
            <lpage>297</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17481880</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>Genomic regulatory blocks underlie extensive microsynteny conservation in insects.</p>
            </title>
            <aug>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Ho Sui</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Drivenes</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Lenhard</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1898</fpage>
            <lpage>1908</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2099597</pubid>
                  <pubid idtype="pmpid" link="fulltext">17989259</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>ChroCoLoc: an application for calculating the probability of co-localization of microarray gene expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Blake</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schwager</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kapushesky</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brazma</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>765</fpage>
            <lpage>767</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16377611</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Genomic organization of transcriptomes in mammals: Coregulation and cofunctionality.</p>
            </title>
            <aug>
               <au>
                  <snm>Purmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Toedling</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schueler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lehrach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Huber</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Sperling</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2007</pubdate>
            <volume>89</volume>
            <fpage>580</fpage>
            <lpage>587</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17369017</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B85">
            <title>
               <p>Clusters of co-expressed genes in mammalian genomes are conserved by natural selection.</p>
            </title>
            <aug>
               <au>
                  <snm>Singer</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Lloyd</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Huminiecki</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>767</fpage>
            <lpage>775</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15574806</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B86">
            <title>
               <p>Evidence for co-evolution of gene order and recombination rate.</p>
            </title>
            <aug>
               <au>
                  <snm>P&#225;l</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>33</volume>
            <fpage>392</fpage>
            <lpage>395</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12577060</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B87">
            <title>
               <p>Evolutionary origin and maintenance of coexpressed gene clusters in mammals.</p>
            </title>
            <aug>
               <au>
                  <snm>S&#233;mon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Duret</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <fpage>1715</fpage>
            <lpage>1723</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">16757654</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B88">
            <title>
               <p>Evolution of genes and genomes on the <it>Drosophila </it>phylogeny.</p>
            </title>
            <aug>
               <au>
                  <cnm>Drosophila 12 Genomes Consortium</cnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Markow</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Kaufman</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gelbart</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Pollard</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Sackton</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Larracuente</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Abad</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Abt</snm>
                  <fnm>DN</fnm>
               </au>
               <au>
                  <snm>Adryan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Aguade</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Akashi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Aquadro</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Ardell</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Arguello</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Artieri</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Barbash</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Barker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barsanti</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Batterham</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Begun</snm>
                  <fnm>D</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>450</volume>
            <fpage>203</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">17994087</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B89">
            <title>
               <p>Discovery of functional elements in 12 <it>Drosophila </it>genomes using evolutionary signatures.</p>
            </title>
            <aug>
               <au>
                  <snm>Stark</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Kheradpour</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Parts</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Crosby</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rasmussen</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Deoras</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Ruby</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Brennecke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hodges</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Caspi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Paten</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Maeder</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Polansky</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Robson</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Aerts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>van Helden</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hassan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Eastman</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weir</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>Y</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>450</volume>
            <fpage>219</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17994088</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B90">
            <title>
               <p>Genomic gene clustering analysis of pathways in eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>875</fpage>
            <lpage>882</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">430880</pubid>
                  <pubid idtype="pmpid" link="fulltext">12695325</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B91">
            <title>
               <p>The KEGG database.</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Novartis Found Symp</source>
            <pubdate>2002</pubdate>
            <volume>247</volume>
            <fpage>91</fpage>
            <lpage>101</lpage>
            <note>discussion 101-103, 119-128, 244-252.</note>
            <xrefbib>
               <pubid idtype="pmpid">12539951</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B92">
            <title>
               <p>Methylation at lysine 4 of histone H3 in ecdysone-dependent development of <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Sedkov</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Petruk</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cherbas</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Cherbas</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Canaani</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Jaynes</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Mazo</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>426</volume>
            <fpage>78</fpage>
            <lpage>83</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14603321</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B93">
            <title>
               <p>The DrosDel collection: a set of P-element insertions for generating custom chromosomal aberrations in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ryder</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Blows</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bautista-Llacer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Coulson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Drummond</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Webster</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gubb</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gunton</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>O'Kane</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Huen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sharma</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Asztalos</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Baisch</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schulze</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kube</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kittlaus</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Reuter</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Maroy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Szidonya</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rasmuson-Lestander</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ekstrom</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dickson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hugentobler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stocker</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hafen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lepesant</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Pflugfelder</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Heisenberg</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genetics</source>
            <pubdate>2004</pubdate>
            <volume>167</volume>
            <fpage>797</fpage>
            <lpage>813</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1470913</pubid>
                  <pubid idtype="pmpid" link="fulltext">15238529</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B94">
            <title>
               <p>Assessing unmodified 70-mer oligonucleotide probe performance on glass-slide microarrays.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Malek</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Kwitek</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Greene</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Luu</snm>
                  <fnm>TV</fnm>
               </au>
               <au>
                  <snm>Behbahani</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Frank</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>NH</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>R5</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">151289</pubid>
                  <pubid idtype="pmpid" link="fulltext">12540297</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B95">
            <title>
               <p>Linear models and empirical bayes methods for assessing differential expression in microarray experiments.</p>
            </title>
            <aug>
               <au>
                  <snm>Smyth</snm>
                  <fnm>GK</fnm>
               </au>
            </aug>
            <source>Stat Appl Genet Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>3</volume>
            <fpage>Article 3</fpage>
         </bibl>
         <bibl id="B96">
            <title>
               <p>Bioconductor: open software development for computational biology and bioinformatics.</p>
            </title>
            <aug>
               <au>
                  <snm>Gentleman</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Carey</snm>
                  <fnm>VJ</fnm>
               </au>
               <au>
                  <snm>Bates</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Bolstad</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dettling</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ellis</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gautier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ge</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gentry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hornik</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hothorn</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Huber</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Iacus</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Leisch</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Maechler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rossini</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Sawitzki</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Smyth</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tierney</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R80</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545600</pubid>
                  <pubid idtype="pmpid" link="fulltext">15461798</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B97">
            <title>
               <p>OLIN: optimized normalization, visualization and quality testing of two-channel microarray data.</p>
            </title>
            <aug>
               <au>
                  <snm>Futschik</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Crompton</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>1724</fpage>
            <lpage>1726</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15585527</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B98">
            <title>
               <p>Histone trimethylation and the maintenance of transcriptional ON and OFF states by trxG and PcG proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Papp</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2006</pubdate>
            <volume>20</volume>
            <fpage>2041</fpage>
            <lpage>2054</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1536056</pubid>
                  <pubid idtype="pmpid" link="fulltext">16882982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B99">
            <title>
               <p>Transcription of bxd noncoding RNAs promoted by trithorax represses Ubx in cis by transcriptional interference.</p>
            </title>
            <aug>
               <au>
                  <snm>Petruk</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sedkov</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Hodgson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schweisguth</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hirose</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jaynes</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Brock</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Mazo</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>127</volume>
            <fpage>1209</fpage>
            <lpage>1221</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1866366</pubid>
                  <pubid idtype="pmpid" link="fulltext">17174895</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B100">
            <title>
               <p>EnsMart: a generic system for fast and flexible access to biological data.</p>
            </title>
            <aug>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Smedley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>London</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Spooner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rocca-Serra</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>160</fpage>
            <lpage>169</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">314293</pubid>
                  <pubid idtype="pmpid" link="fulltext">14707178</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B101">
            <title>
               <p>Molecular phylogeny and divergence times of drosophilid species.</p>
            </title>
            <aug>
               <au>
                  <snm>Russo</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Takezaki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1995</pubdate>
            <volume>12</volume>
            <fpage>391</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7739381</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B102">
            <title>
               <p>gff2ps: visualizing genomic annotations.</p>
            </title>
            <aug>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Guig&#243;</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>743</fpage>
            <lpage>744</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11099262</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
