<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-438</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>The rules of gene expression in plants: Organ identity and gene body methylation are key factors for regulation of gene expression in <it>Arabidopsis thaliana</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Aceituno</snm>
               <mi>F</mi>
               <fnm>Felipe</fnm>
               <insr iid="I1"/>
               <email>feflorea@uc.cl</email>
            </au>
            <au id="A2">
               <snm>Moseyko</snm>
               <fnm>Nick</fnm>
               <insr iid="I2"/>
               <email>nick@anm.f2s.com</email>
            </au>
            <au id="A3">
               <snm>Rhee</snm>
               <mi>Y</mi>
               <fnm>Seung</fnm>
               <insr iid="I2"/>
               <email>rhee@acoma.stanford.edu</email>
            </au>
            <au ca="yes" id="A4">
               <snm>Guti&#233;rrez</snm>
               <mi>A</mi>
               <fnm>Rodrigo</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>rgutierrez@uc.cl</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Departamento de Gen&#233;tica Molecular y Microbiolog&#237;a, Pontificia Universidad Cat&#243;lica de Chile, Avenue Libertador Bernardo O'Higgins 340, Santiago, Chile</p>
            </ins>
            <ins id="I2">
               <p>Department of Plant Biology, Carnegie Institution of Washington, 260 Panama St, Stanford, CA, 94305, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Biology, New York University, 100 Washington Square East, 1009 Main Building, New York, NY 10003, USA</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>438</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/438</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18811951</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-438</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>05</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>23</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>23</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Aceituno et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Microarray technology is a widely used approach for monitoring genome-wide gene expression. For Arabidopsis, there are over 1,800 microarray hybridizations representing many different experimental conditions on Affymetrix&#8482; ATH1 gene chips alone. This huge amount of data offers a unique opportunity to infer the principles that govern the regulation of gene expression in plants.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We used bioinformatics methods to analyze publicly available data obtained using the ATH1 chip from Affymetrix. A total of 1887 ATH1 hybridizations were normalized and filtered to eliminate low-quality hybridizations. We classified and compared control and treatment hybridizations and determined differential gene expression. The largest differences in gene expression were observed when comparing samples obtained from different organs. On average, ten-fold more genes were differentially expressed between organs as compared to any other experimental variable. We defined "gene responsiveness" as the number of comparisons in which a gene changed its expression significantly. We defined genes with the highest and lowest responsiveness levels as hypervariable and housekeeping genes, respectively. Remarkably, housekeeping genes were best distinguished from hypervariable genes by differences in methylation status in their transcribed regions. Moreover, methylation in the transcribed region was inversely correlated (R<sup>2 </sup>= 0.8) with gene responsiveness on a genome-wide scale. We provide an example of this negative relationship using genes encoding TCA cycle enzymes, by contrasting their regulatory responsiveness to nitrate and methylation status in their transcribed regions.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our results indicate that the Arabidopsis transcriptome is largely established during development and is comparatively stable when faced with external perturbations. We suggest a novel functional role for DNA methylation in the transcribed region as a key determinant capable of restraining the capacity of a gene to respond to internal/external cues. Our findings suggest a prominent role for epigenetic mechanisms in the regulation of gene expression in plants.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="endnote" subtype="user_supplied_xml" type="bmc"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Understanding the regulation of gene expression is essential to understand the form and function of living systems. Microarray technology has been widely used in many organisms to understand genome-wide changes in gene expression in response to treatments <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, in different organs <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, cell-types <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and along developmental time series <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Therefore, a large amount of microarray data representing many different biological conditions has accumulated over recent years. This data has been used successfully to hypothesize on gene function on a global scale in different organisms, such as yeast and <it>C. elegans </it><abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, and to suggest shared regulatory mechanisms. Promoters of genes with strongly correlated expression patterns in multiple experiments are likely to be bound by a common transcription factor <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, and conserved regulatory motifs have been identified based solely on expression data <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. From a systems view, however, we believe that this data has been underutilized as a resource to understand the basic rules of gene expression.</p>
         <p>To learn the general rules that govern gene expression in plants, we took advantage of a large microarray database available for Arabidopsis in the NASCarrays database <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Using this data, we defined the internal and external cues that regulate the expression of all of the Arabidopsis genes that are represented in the Affymetrix ATH1 gene chips. We quantified the effect of the different experimental conditions on gene expression, which revealed tissue type to be the most influential variable. We also analyzed different structural features and correlated it with the capacity of the genes to respond to the different stimuli. We found evidence for a mechanistic relationship between DNA methylation in the body of the gene (i.e., the transcript region) and the regulation of gene expression, thus assigning a novel and important role for the methylation of the body of the gene in eukaryotic genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>The Arabidopsis transcriptome is robust to most perturbations but strongly influenced by organ type</p>
            </st>
            <p>In an effort to discover new principles that govern gene expression in <it>Arabidopsis thaliana</it>, we integrated and analyzed publicly available whole-genome microarray data for this model plant. From this data, we defined 474 biologically relevant comparisons (i.e. control vs. treatment) as described in Materials and Methods (Additional File <supplr sid="S1">1</supplr>). These comparisons spanned a wide variety of experimental conditions and plant organs (Figure <figr fid="F1">1</figr>). We wished to evaluate the effect of the different experimental factors that defined each comparison on genome-wide gene expression patterns. To do so, we defined differential gene expression using the RankProducts method <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. This method outperformed other methods to determine regulation of gene expression in previous studies <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp> and in our own evaluation (see Materials and Methods), particularly in datasets with a small number of replicates.</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p><b>Control vs. tests comparisons</b>. List of the analyzed 474 comparisons in the NASCarrays database, annotated according to the experimental factor and plant structure categories. NASC experiment numbers are provided.</p>
               </text>
               <file name="1471-2164-9-438-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Classification of experiments from the NASCarrays database</p>
               </caption>
               <text>
                  <p><b>Classification of experiments from the NASCarrays database</b>. Pie charts with the classification of microarray experiments according to the experimental factor categories defined by TAIR (A) or the organ used to extract RNA to perform the microarray experiments (B).</p>
               </text>
               <graphic file="1471-2164-9-438-1"/>
            </fig>
            <p>We first examined the number of differentially regulated genes per comparison. We found their distribution to be far from normal. As shown in Figure <figr fid="F2">2A</figr>, some comparisons exhibit more than 4,000 differentially expressed genes. These outliers were exclusively comparisons between different organs. In fact, organ type was the strongest experimental factor contributing to the number of differentially expressed genes. Other experimental factors, regardless of their nature, showed an approximately 10-fold smaller impact on gene expression with an average of 337 genes regulated per comparison (Figure <figr fid="F2">2B</figr>). Moreover, approximately 10% of the Arabidopsis genes did not respond to any of the stimuli in the dataset and were only differentially expressed between organ samples. Thus, organ is by far the most important factor in determining genome-wide expression levels. Furthermore, the upper 5<sup>th </sup>percentile (ordered by the number of genes regulated) of the 77 mutant vs wt comparisons involved only genes whose mutations have well documented developmental phenotypes. These genes were AP2-6<abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, ARR21<abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, GLABROUS1<abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and LFY-12 mutations <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. They regulated 1475, 1420, 1379 and 1362 genes, respectively &#8211; a much more than the category average (471 genes). These results indicate that global gene expression patterns are established during plant development. The results also suggest that the Arabidopsis transcriptome is robust to most perturbations, with only an estimated 1.5% of the genome on average responding in a single experiment to experimental factors such as chemical or hormone treatments, pathogen challenges or environmental stress. A detail of the categories in which each of the Arabidopsis genes responds is presented in Additional File <supplr sid="S2">2</supplr>. Additional Files <supplr sid="S3">3</supplr> to <supplr sid="S10">10</supplr> contain the genes that respond in exclusively one category, including organ type.</p>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p><b>Gene responsiveness by categories</b>. Table detailing the number of experiments, within the eight experimental categories, in which each Arabidopsis gene is regulated. The number in parenthesis in the header of the Table indicates the total number of experiments in each category.</p>
               </text>
               <file name="1471-2164-9-438-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>Genes regulated specifically in one experimental category</b>. Each file provides the individual genes responding exclusively in abiotic, biotic, ecotype, chemical, hormone, mutant, nutrient or organ comparisons, respectively.</p>
               </text>
               <file name="1471-2164-9-438-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p><b>Genes regulated specifically in one experimental category</b>. Each file provides the individual genes responding exclusively in abiotic, biotic, ecotype, chemical, hormone, mutant, nutrient or organ comparisons, respectively.</p>
               </text>
               <file name="1471-2164-9-438-S4.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p><b>Genes regulated specifically in one experimental category</b>. Each file provides the individual genes responding exclusively in abiotic, biotic, ecotype, chemical, hormone, mutant, nutrient or organ comparisons, respectively.</p>
               </text>
               <file name="1471-2164-9-438-S5.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p><b>Genes regulated specifically in one experimental category</b>. Each file provides the individual genes responding exclusively in abiotic, biotic, ecotype, chemical, hormone, mutant, nutrient or organ comparisons, respectively.</p>
               </text>
               <file name="1471-2164-9-438-S6.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p><b>Genes regulated specifically in one experimental category</b>. Each file provides the individual genes responding exclusively in abiotic, biotic, ecotype, chemical, hormone, mutant, nutrient or organ comparisons, respectively.</p>
               </text>
               <file name="1471-2164-9-438-S7.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p><b>Genes regulated specifically in one experimental category</b>. Each file provides the individual genes responding exclusively in abiotic, biotic, ecotype, chemical, hormone, mutant, nutrient or organ comparisons, respectively.</p>
               </text>
               <file name="1471-2164-9-438-S8.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p><b>Genes regulated specifically in one experimental category</b>. Each file provides the individual genes responding exclusively in abiotic, biotic, ecotype, chemical, hormone, mutant, nutrient or organ comparisons, respectively.</p>
               </text>
               <file name="1471-2164-9-438-S9.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S10">
               <title>
                  <p>Additional file 10</p>
               </title>
               <text>
                  <p><b>Genes regulated specifically in one experimental category</b>. Each file provides the individual genes responding exclusively in abiotic, biotic, ecotype, chemical, hormone, mutant, nutrient or organ comparisons, respectively.</p>
               </text>
               <file name="1471-2164-9-438-S10.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Global characteristics of the Arabidopsis transcriptome</p>
               </caption>
               <text>
                  <p><b>Global characteristics of the Arabidopsis transcriptome</b>. A) Histogram of the number of genes (X-axis) regulated in a given number of comparisons (Y-axis). B) Average number of genes regulated by each experimental category as defined in Figure 1A. C) Histogram of the number of comparisons (X-axis) for which the specified number of genes (Y-axis) show significant regulation.</p>
               </text>
               <graphic file="1471-2164-9-438-2"/>
            </fig>
            <p>Given its impact on global gene expression levels, we next wished to evaluate the importance of organ type in the context of typical experimental factors that are tested in the laboratory. We compared the number of genes responding in shoots or roots for each of the nine treatments in the AtGenExpress abiotic stress series. On average, only 13% of the total genes that responded to a treatment responded in both organs. By contrast, a much higher proportion of genes (88%) were regulated by the treatment in an organ-specific manner (Additional File <supplr sid="S11">11</supplr>). This data indicate that plant responses to external stimuli are strongly organ-dependent and underscore the need for a more thorough survey of organ-specific and, by extension, cell-specific responses in Arabidopsis and other plants <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
            <suppl id="S11">
               <title>
                  <p>Additional File 11</p>
               </title>
               <text>
                  <p><b>Importance of organ type in the response to abiotic stress in Arabidopsis</b>. Percentage of genes responding to various stresses in either roots, shoots or both. Data corresponds to the AtGenExpress Abiotic Stress series present in the NASCarrays database. The black zone indicates the percentage of genes responding only in roots; the white zone indicates those responding only in shoots, and the black squares region indicates the genes responding in both tissues</p>
               </text>
               <file name="1471-2164-9-438-S11.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Housekeeping and hypervariable genes possess marked structural differences</p>
            </st>
            <p>To identify properties that explain the capacity of a gene to respond to stimuli, we ranked genes based on the number of comparisons in which they are differentially expressed. As shown in Figure <figr fid="F2">2C</figr>, the Arabidopsis genome contains genes that are regulated in a wide range of comparisons, with an average of 14 comparisons, or 3% of the total comparisons in our dataset. The underlying data is provided in Additional File <supplr sid="S12">12</supplr>. We expect structural differences to be maximized at the extremes of this distribution. We defined housekeeping genes based on three criteria: (1) genes that were not differentially expressed in any of the 474 comparisons, (2) genes with signal intensities higher than the median intensity across the entire dataset and (3) genes with the lowest signal variability (measured with the interquartile range, see Materials and Methods) across the entire dataset. In contrast, we defined hypervariable genes based on the following three criteria: (1) genes that were within the top 1% of the gene responsiveness distribution, (2) genes with the largest signal variability, and (3) genes that show differential expression by stimuli from six of the eight categories described in Figure <figr fid="F1">1A</figr>. These criteria defined 384 housekeeping genes and 123 hypervariable genes (Additional files <supplr sid="S13">13</supplr> and <supplr sid="S14">14</supplr>).</p>
            <suppl id="S12">
               <title>
                  <p>Additional file 12</p>
               </title>
               <text>
                  <p><b>Gene responsiveness</b>. Gene responsiveness as determined by the Rank Products and fold-change method.</p>
               </text>
               <file name="1471-2164-9-438-S12.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S13">
               <title>
                  <p>Additional file 13</p>
               </title>
               <text>
                  <p><b>Housekeeping and hypervariable genes and their methylation status (1)</b>. List of Housekeeping and hypervariable genes, classified according their methylation status as defined in:<it>Zhang X, et al: Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis. Cell 2006, 126(6): 1189&#8211;1201</it>. Gene annotation was provided by the VirtualPlant system <url>http://www.virtualplant.org</url>.</p>
               </text>
               <file name="1471-2164-9-438-S13.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S14">
               <title>
                  <p>Additional file 14</p>
               </title>
               <text>
                  <p><b>Housekeeping and hypervariable genes and their methylation status (2)</b>. List of Housekeeping and hypervariable genes, classified according their methylation status as defined in: <it>Zilberman et al: Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription. Nat Genet 2007, 39(1): 61&#8211;69</it>. Gene annotation was provided by the VirtualPlant system <url>http://www.virtualplant.org</url></p>
               </text>
               <file name="1471-2164-9-438-S14.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>A previous study positively correlated expression levels with gene size in plants <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. To understand how gene responses to stimuli relate to gene size and other structural features, we analyzed the structure of housekeeping and hypervariable genes. Housekeeping genes were significantly larger and had more introns than do hypervariable genes and were above genome averages for both criteria (Table <tblr tid="T1">1</tblr>). By contrast, hypervariable genes were significantly shorter and contained fewer introns than average (Table <tblr tid="T1">1</tblr>). Interestingly, a functional annotation of the hypervariable gene set indicates that it is enriched for genes involved in responses to internal and external stimuli (Additional File <supplr sid="S15">15</supplr>). Most hypervariable genes were plant specific as defined in a previous study <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, and the set was enriched for genes that code for unstable transcripts <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> (Table <tblr tid="T1">1</tblr>). These results suggest that plants favored the evolution of small, hypervariable genes to respond quickly and economically to multiple environmental signals.</p>
            <suppl id="S15">
               <title>
                  <p>Additional file 15</p>
               </title>
               <text>
                  <p><b>Function of housekeeping and hypervariable genes</b>. Analysis of over-representation of gene ontology functional terms in housekeeping and hypervariable genes (performed in VirtualPlant &#8211; <url>http://www.virtualplant.org</url></p>
               </text>
               <file name="1471-2164-9-438-S15.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Contrasting features of housekeeping and hypervariable genes.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Gene feature</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Housekeeping</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Hypervariable</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Genome</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>CDS length (bp)</b>
                           <sup>
                              <it>a</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2624 (s.e. = 89)</p>
                     </c>
                     <c ca="center">
                        <p>1178 (s.e. = 73)</p>
                     </c>
                     <c ca="center">
                        <p>1931 (s.e. = 8)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Gene length (bp)</b>
                           <sup>
                              <it>a</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>3117 (s.e. = 87)</p>
                     </c>
                     <c ca="center">
                        <p>1493 (s.e. = 78)</p>
                     </c>
                     <c ca="center">
                        <p>2229 (s.e. = 8)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Total exon length (bp)</b>
                           <sup>
                              <it>a</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1941 (s.e. = 52)</p>
                     </c>
                     <c ca="center">
                        <p>1169 (s.e. = 50)</p>
                     </c>
                     <c ca="center">
                        <p>1568 (s.e. 6)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Total intron length (bp)</b>
                           <sup>
                              <it>a</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1173 (s.e. = 52)</p>
                     </c>
                     <c ca="center">
                        <p>323 (s.e. = 44)</p>
                     </c>
                     <c ca="center">
                        <p>660 (s.e. = 4)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Number of exons (pb)</b>
                           <sup>
                              <it>a</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>8 (s.e. = 0.31)</p>
                     </c>
                     <c ca="center">
                        <p>3 (s.e. = 0.24)</p>
                     </c>
                     <c ca="center">
                        <p>5 (s.e = 0.03)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Genes without introns</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>6% (p = 5E-16)</p>
                     </c>
                     <c ca="center">
                        <p>33% (p = 0.0007)</p>
                     </c>
                     <c ca="center">
                        <p>28%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Average number of transcription factor binding sites</b>
                           <sup>
                              <it>b</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>27 &#177; 1.2 (p &lt; 0.01)</p>
                     </c>
                     <c ca="center">
                        <p>46 &#177; 1.8 (p &lt; 0.0001)</p>
                     </c>
                     <c ca="center">
                        <p>30 &#177; 0.1</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>TATA-containing genes</b>
                           <sup>
                              <it>c</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5% (p = 1.3E-6)</p>
                     </c>
                     <c ca="center">
                        <p>45% (p = 6.1E-15)</p>
                     </c>
                     <c ca="center">
                        <p>15%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Genes coding for unstable transcripts</b>
                           <sup>
                              <it>d</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0% (n.a.)</p>
                     </c>
                     <c ca="center">
                        <p>8% (p = 9E-11)</p>
                     </c>
                     <c ca="center">
                        <p>1%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Shared among eukaryotes</b>
                           <sup>
                              <it>e</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>18% (p = 0.002)</p>
                     </c>
                     <c ca="center">
                        <p>7%</p>
                     </c>
                     <c ca="center">
                        <p>14%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Plant-specific</b>
                           <sup>
                              <it>e</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>11%</p>
                     </c>
                     <c ca="center">
                        <p>34% (p = 2E-10)</p>
                     </c>
                     <c ca="center">
                        <p>14%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Body methylation</b>
                           <sup>
                              <it>f</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>63% (p = 1.5E-35)</p>
                     </c>
                     <c ca="center">
                        <p>8% (p = 2E-10)</p>
                     </c>
                     <c ca="center">
                        <p>34%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Promoter methylation</b>
                           <sup>
                              <it>f</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>3%</p>
                     </c>
                     <c ca="center">
                        <p>3%</p>
                     </c>
                     <c ca="center">
                        <p>5%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Body methylation</b>
                           <sup>
                              <it>g</it>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>36% (p = 9.1E-21)</p>
                     </c>
                     <c ca="center">
                        <p>2% (p = 3.8E-8)</p>
                     </c>
                     <c ca="center">
                        <p>20%</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The first column lists various features analyzed for housekeeping genes (second column), hypervariable genes (third column) and the whole genome (fourth column). Rows report average and standard error or percentage values. P values for significant (p &lt; 0.01) enrichment or depletion as compared to the genome occurrence are shown in parenthesis. <it>a</it>, differences between all groups are significant (p &lt; 0.01) as determined by ANOVA.<it>b</it>, average number of cis-acting regulatory elements as defined in the AGRIS database <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. p-value was determined by a t-test. <it>C</it>, presence of TATA-box as determined by the MotifSearch algorithm <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Similar results were obtained with an alternative TATA-box definition <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. <it>d</it>, unstable transcripts as defined in <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. <it>e</it>, phylogenetic profiles as defined previously <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Only significantly enriched profiles are listed. <it>f</it>, methylation patterns as determined in <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. <it>g</it>, methylation patterns as determined in <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. n.a., not applicable.</p>
               </tblfn>
            </tbl>
            <p>Eukaryotic genes are transcriptionally regulated by the coordinated interaction of multiple protein factors that interact with discrete binding sites and with each other <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. These binding sites are usually located upstream of the transcribed region they regulate <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The promoters of hypervariable genes often have a TATA-box sequence and contain a larger number of predicted transcription factor binding sites as compared to the housekeeping genes or the genome average (Table <tblr tid="T1">1</tblr> and Additional File <supplr sid="S16">16</supplr>). These data suggest that the presence of a TATA box and the number of transcription factor binding sites in the promoter region of some of the most responsive genes in Arabidopsis may explain their capacity to respond to stimuli, as was previously found in an analysis of a smaller expression dataset <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. However, it is clear that this simple rule does not always apply and that other factors are necessary to explain gene expression responses.</p>
            <suppl id="S16">
               <title>
                  <p>Additional file 16</p>
               </title>
               <text>
                  <p><b>Enrichment of cis-acting motifs in the promoter of hypervariable genes</b>. Frequency distribution of the number of predicted transcription binding sites in the promoter of housekeeping and hypervariable genes and the whole genome. The genes were ranked according to the number of cis-acting regulatory elements in their promoters according to the AGRIS database (X-axis). The points represent the fraction of genes in a bin of 10 motifs.</p>
               </text>
               <file name="1471-2164-9-438-S16.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>In addition to gene structure, epigenetic mechanisms such as DNA methylation are known to have an impact on gene expression in eukaryotes, particularly in heterochromatic regions <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. To evaluate the potential role of DNA methylation in the gene expression responses observed for housekeeping and hypervariable genes, we analyzed the methylation patterns of these two groups of genes. We used two recently published genome-wide methylation data sets <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp> to analyze methylation in the promoter and transcribed regions of each gene. Using the methylome data produced by Zhang et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, we found that a large proportion of housekeeping genes were methylated in their transcribed regions (a significant enrichment compared to the expected genome frequency; p = 1.5E-35, Table <tblr tid="T1">1</tblr>). By contrast, only 8% of the hypervariable genes were methylated in their transcribed regions (a significant depletion; p = 2E-10, Table <tblr tid="T1">1</tblr>). Similar results were obtained with an independently generated methylome data set <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. These results suggest that the capacity of Arabidopsis housekeeping and hypervariable genes to respond to stimuli not only depends on structural features in their promoter or transcribed regions, such as transcription factor binding sites, but may also have an important epigenetic component.</p>
         </sec>
         <sec>
            <st>
               <p>Transcript region methylation is the most important factor to explain genome-wide responses to internal/external stimuli</p>
            </st>
            <p>To evaluate the importance of these features for gene expression responses on a genomic scale, we performed a regression analysis of the gene responsiveness for all Arabidopsis genes as a function of each of the structural features described above. We used a linear model of the form: Y ~ &#945;X + &#946;, where Y was the observed gene responsiveness of all genes and X was the structural feature under evaluation (e.g. presence of TATA-box, cis-acting binding sites in the promoter or gene body methylation). Thus, the effects detected were free from any bias arising from gene selection, as could be the case when analyzing this relatively small group of housekeeping and hypervariable genes.</p>
            <p>Notably, using the two independently generated methylome datasets <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, gene responsiveness showed a remarkably high negative correlation with the presence of methylation in the transcribed region of the gene. Both datasets generated models with a coefficient of determination (R<sup>2</sup>) of 0.8 (share of explained variability, Figure <figr fid="F3">3A&#8211;B</figr>). A similar result was obtained using average fold-change &#8805; |2| (treatment versus control) as a criterion to determine gene responsiveness (Additional Files <supplr sid="S17">17</supplr> and <supplr sid="S18">18</supplr>). This correlation was independent of the type of experimental factor, as similar trends were observed when analyzing each experimental category individually for both methylome datasets (Figure <figr fid="F3">3C&#8211;F</figr> and Additional File <supplr sid="S19">19</supplr>). Next, to transcript region methylation, the presence of a TATA-box was the second best factor to explain gene responsiveness, and it had a positive effect. R<sup>2 </sup>for two definitions of TATA-box <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp> were 0.49 or 0.68. Two factor models that included transcript region methylation and the presence of a TATA-box slightly improved the R<sup>2 </sup>over those obtained with methylation alone (Table <tblr tid="T2">2</tblr>). Two factor ANOVA models (Additional File <supplr sid="S20">20</supplr>) confirmed the stronger effect of gene body methylation on responsiveness, as determined by the Tukey comparison procedure <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. However, goodness of fit estimation by the Bayesian Information Criteria <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> suggests that additive models, including TATA-box and methylation, are better than one-factor ANOVA models. (Additional File <supplr sid="S20">20</supplr>). Interestingly, this also suggests that the effect of TATA-box and methylation are independent, as interaction terms are not significant in these models (not shown). None of the other structural features (gene size, presence of introns, number of binding sites, etc) yielded models with such high R<sup>2 </sup>on a genomic scale. Thus, gene body methylation and, to a lesser extent, TATA-box presence explained gene responsiveness on a global scale. It is not possible, however, to infer from this data the mechanistic relationships between TATA-related factors, gene body methylation status and regulation of gene expression.</p>
            <suppl id="S17">
               <title>
                  <p>Additional file 17</p>
               </title>
               <text>
                  <p><b>Correlation between gene responsiveness as determined by the fold-change method and gene body methylation</b>. Table listing gene responsiveness as determined by the fold-change method (&#8805; |2|), and the corresponding frequencies of methylated genes.</p>
               </text>
               <file name="1471-2164-9-438-S17.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S18">
               <title>
                  <p>Additional file 18</p>
               </title>
               <text>
                  <p><b>Plot of the correlation between gene responsiveness determined by the fod-change method versus gene body methylation</b>. This graphs shows the linear correlation between gene responsiveness as determined by fold change ((&#8805; |2|) and gene body methylation.</p>
               </text>
               <file name="1471-2164-9-438-S18.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S19">
               <title>
                  <p>Additional file 19</p>
               </title>
               <text>
                  <p><b>Results of simple regression models, given by experimental category</b>. Description is as Table <tblr tid="T2">2</tblr>, see main text.</p>
               </text>
               <file name="1471-2164-9-438-S19.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S20">
               <title>
                  <p>Additional file 20</p>
               </title>
               <text>
                  <p><b>ANOVA models for the effect of methylation and TATA-box presence on gene responsiveness, by category of experimental treatment</b>. The models have the form Y ~ aX + b, where X was a factor encoding the presence or absence of those features as two different levels. We used the 'aov' function in R to fit the model. The F statistic estimates the significance of the contribution of the factors to the response. The differences between the levels of the factors were estimated by the Tukey procedure, using the 'glht' function from the 'multcomp' package in R. This is equivalent to comparing the coefficients of the factors. The Bayesian Information Criteria was calculated in R using the 'BIC' function in the package 'nlme'. This parameter represents the "a posteriori" probability of the model to be true, being maximized when the magnitude of the parameter is minimized.</p>
               </text>
               <file name="1471-2164-9-438-S20.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Results of the simple and multiple linear regression analyses</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Explanatory variable(s)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Data Source</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>r</b>
                           <sup>2</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>p</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Coefficient</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Methylation frequency</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <abbrgrp>
                              <abbr bid="B24">24</abbr>
                              <abbr bid="B25">25</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.8</p>
                     </c>
                     <c ca="center">
                        <p>&lt;2E-16</p>
                     </c>
                     <c ca="center">
                        <p>n.r.</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.8</p>
                     </c>
                     <c ca="center">
                        <p>&lt;2E-16</p>
                     </c>
                     <c ca="center">
                        <p>n.r.</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Frequency of genes target of H3k27me3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <abbrgrp>
                              <abbr bid="B30">30</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.12</p>
                     </c>
                     <c ca="center">
                        <p>0.000207</p>
                     </c>
                     <c ca="center">
                        <p>n.r.</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Gene size</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>TAIR Genome v6.0</p>
                     </c>
                     <c ca="center">
                        <p>0.02</p>
                     </c>
                     <c ca="center">
                        <p>>0.01</p>
                     </c>
                     <c ca="center">
                        <p>n.r.</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>
                              <it>Cis</it>
                           </b>
                           <b>-acting elements</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <abbrgrp>
                              <abbr bid="B48">48</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.05</p>
                     </c>
                     <c ca="center">
                        <p>>0.01</p>
                     </c>
                     <c ca="center">
                        <p>n.r.</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>TATA-box frequency</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>(MotifSearch, <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>)</p>
                        <p>(PlantProm, <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>)</p>
                     </c>
                     <c ca="center">
                        <p>0.49</p>
                        <p>0.68</p>
                     </c>
                     <c ca="center">
                        <p>&lt;2E-16</p>
                        <p>&lt;2E-16</p>
                     </c>
                     <c ca="center">
                        <p>n.r.</p>
                        <p>n.r.</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="" ca="center">
                        <p>
                           <b>Methylation + TATA-box</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><abbrgrp><abbr bid="B24">24</abbr></abbrgrp>+ (MotifSearch, <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>)</p>
                     </c>
                     <c ca="center">
                        <p>0.84</p>
                     </c>
                     <c ca="center">
                        <p>&lt;2E-16<sup><it>a</it></sup></p>
                        <p>0.0002<sup><it>b</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>-201.5<sup><it>a</it></sup></p>
                        <p>35<sup><it>b</it></sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <p/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="">
                        <p/>
                     </c>
                     <c ca="center">
                        <p><abbrgrp><abbr bid="B24">24</abbr></abbrgrp> + (PlantProm, <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>)</p>
                     </c>
                     <c ca="center">
                        <p>0.86</p>
                     </c>
                     <c ca="center">
                        <p>&lt;2E-16<sup><it>a</it></sup></p>
                        <p>1.00E-09<sup><it>b</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>-168<sup><it>a</it></sup></p>
                        <p>50.5<sup><it>b</it></sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <p/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="">
                        <p/>
                     </c>
                     <c ca="center">
                        <p><abbrgrp><abbr bid="B25">25</abbr></abbrgrp> + (MotifSearch, <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>)</p>
                     </c>
                     <c ca="center">
                        <p>0.87</p>
                     </c>
                     <c ca="center">
                        <p>2.00E-16<sup><it>a</it></sup></p>
                        <p>5.00E-09<sup><it>b</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>-158.6<sup><it>a</it></sup></p>
                        <p>54.8<sup><it>b</it></sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <p/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="">
                        <p/>
                     </c>
                     <c ca="center">
                        <p><abbrgrp><abbr bid="B25">25</abbr></abbrgrp> + (PlantProm, <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>)</p>
                     </c>
                     <c ca="center">
                        <p>0.84</p>
                     </c>
                     <c ca="center">
                        <p>&lt;2E-16<sup><it>a</it></sup></p>
                        <p> 0.0006<sup><it>b</it></sup></p>
                     </c>
                     <c ca="center">
                        <p>-194.3<sup><it>a</it></sup></p>
                        <p>39<sup><it>b</it></sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Column 1 reports the explanatory variables used to model gene responsiveness. Column 2 indicates the source of the data (reference). Columns 3 and 4 report the different statistics obtained with the linear regression. n.r., not reported; n.d., not determined. <it>a</it>, statistics for methylation variable. <it>b</it>, statistics for TATA-box variable. Column 5 shows the coefficients from the linear regression analysis.</p>
               </tblfn>
            </tbl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Correlation between methylation and gene responsiveness</p>
               </caption>
               <text>
                  <p><b>Correlation between methylation and gene responsiveness</b>. (A) Plot of the frequency of methylated genes (according to Zhang et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>; X-axis) within a group of genes against the number of comparisons in which that group of genes is regulated (Y-axis). The dotted line represents the regression line. B) Same as (A) except using data from Zilberman et al <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. C) to E). Same as (A) except with the different experimental categories defined in Figure 1A, using methylome data from Zhang et al <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. G) Same as (A) except the X-axis represents the frequency of genes that are the target of trimethylation on H3K27 <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2164-9-438-3"/>
            </fig>
            <p>The effect of DNA methylation on gene responsiveness could be explained by a simple transcriptional gene silencing effect <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. Silencing a gene would render it unable to be regulated. If so, transcript region methylation should correlate with expression levels. Comparing the frequency of methylation to the median expression level of the whole dataset revealed no such trend (Figure <figr fid="F4">4</figr>). The most and the least highly expressed genes are likely to lack methylation within their body, as previously reported <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Similarly, no correlation was found between the presence of a TATA-box and gene expression levels. (Figure <figr fid="F4">4</figr>). Moreover, no relationship was evident between expression level and gene responsiveness in our data set (Additional File <supplr sid="S21">21</supplr>).</p>
            <suppl id="S21">
               <title>
                  <p>Additional file 21</p>
               </title>
               <text>
                  <p><b>Lack of linear correlation between expression levels and gene responsiveness</b>. Box plot of the signal of a gene across the whole NASC arrays dataset (X-axis) versus gene responsiveness (the number of comparisons in which it is significantly regulated, Y-axis). A simple linear regression model cannot explain the variability in the data (R<sup>2 </sup>= 0.04).</p>
               </text>
               <file name="1471-2164-9-438-S21.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Lack of linear correlation between expression levels and gene body methylation or TATA-box presence</p>
               </caption>
               <text>
                  <p><b>Lack of linear correlation between expression levels and gene body methylation or TATA-box presence</b>. (A) Plot of the median expression level across the whole NASC arrays dataset in 10% bins (X-axis) versus the frequency of methylated genes in the bin (Y-axis), as determined by Zhang et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. (B) Same as (A), except using data from Zilberman et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. C) Same as (A), except the Y-axis represents the frequency of TATA-containing genes according to the MotifSearch definition <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. D) Same as (C), but using the PlantProm definition <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2164-9-438-4"/>
            </fig>
            <p>We also evaluated the relationship between the presence of modified histones and gene responsiveness. We used a recently published genomic survey of trimethylation in lysine 27 of histone H3 (H3K27me3) f<abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. We found a weak correlation between the frequency of H3K27me3 gene targets and gene responsiveness, with an R<sup>2 </sup>of 0.12 (Figure <figr fid="F3">3F</figr> and Additional File <supplr sid="S19">19</supplr>). This finding is consistent with the hypothesis that H3K27me3 mostly acts in a DNA methylation-independent manner, as previously suggested <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Other histone modifications, such as H3K4 or H3K9 methylation <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> or combinations thereof <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, may be related to gene body methylation in Arabidopsis, thus "marking" the corresponding chromatin region for or against the regulation of gene expression <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Gene body methylation and regulation of expression by nitrate in TCA cycle genes</p>
            </st>
            <p>As a case-study and to provide a concrete example of the influence of methylation patterns on the regulation of gene expression, we focused on a discrete biological process and experimental factor: nitrate. Nitrate has been shown to be a signal to regulate gene expression in plants <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. We chose four microarray experiments in which wild-type seedlings were treated with different nitrate concentrations. These nitrate experiments were not included in the microarray database used in the previous sections. We found that nitrate regulates many genes in central metabolic pathways such as the TCA cycle <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. We analyzed responsiveness and nitrate regulation for all genes coding for TCA cycle enzymes. Most of the genes (29 out of 36, data not shown) did not respond to the nitrate treatments, as expected due to the robustness of expression patterns in Arabidopsis (see Figure <figr fid="F2">2B</figr>). Among the genes regulated by nitrate, we found a malate dehydrogenase gene (MDH, At3g47520), two genes coding for NAD<sup>+ </sup>dependent isocitrate dehydrogenases (At5g03290 and At4g35260) and a putative NADP<sup>+ </sup>dependent isocitrate dehydrogenase (At1g65930) (Table <tblr tid="T3">3</tblr>). Remarkably, these four genes were classified as unmethylated in studies by both Zhang et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and Zilberman et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Moreover, body methylated genes were enriched among the analyzed genes that were not regulated by nitrate (Table <tblr tid="T3">3</tblr>). For instance, among eight genes coding for malate dehydrogenase that are not regulated by nitrate, five are methylated according to the two methylome datasets. This is a much higher frequency than is expected by chance (p &lt; 0.05), as only 20&#8211;34% of the genes were methylated according to the two methylome datasets. The same was true for the isocitrate dehydrogenases, with enrichment of methylated genes for those that did not respond to the nitrate treatment (p &lt; 0.05). These results agree with the proposed relationship between gene body methylation and the regulation of gene expression in response to regulatory signals (in this case, nitrate). Moreover, it suggests gene body methylation plays a role in the regulation of gene expression in physiological processes such as the reprogramming of carbon metabolism in response to nitrogen nutrient availability <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Relationship between the methylation status and nitrate regulation of TCA cycle genes.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>AGI number</p>
                     </c>
                     <c ca="center">
                        <p>Gene Annotation</p>
                     </c>
                     <c ca="center">
                        <p>Responsiveness to nitrate</p>
                     </c>
                     <c ca="center">
                        <p>Methylation status<sup>a</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At3g47520</p>
                     </c>
                     <c ca="center">
                        <p>MDH (malate dehydrogenase); malate dehydrogenase</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At1g04410</p>
                     </c>
                     <c ca="center">
                        <p>malate dehydrogenase, cytosolic, putative</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At1g53240</p>
                     </c>
                     <c ca="center">
                        <p>malate dehydrogenase (NAD), mitochondrial</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>M</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At2g22780</p>
                     </c>
                     <c ca="center">
                        <p>PMDH1 (PEROXISOMAL NAD-MALATE DEHYDROGENASE 1); malate dehydrogenase</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>M</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At3g15020</p>
                     </c>
                     <c ca="center">
                        <p>malate dehydrogenase (NAD), mitochondrial, putative</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At5g09660</p>
                     </c>
                     <c ca="center">
                        <p>PMDH2 (PEROXISOMAL NAD-MALATE DEHYDROGENASE 2), PMDH2 (PEROXISOMAL NAD-MALATE DEHYDROGENASE 2); malate dehydrogenase</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>M</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At5g56720</p>
                     </c>
                     <c ca="center">
                        <p>malate dehydrogenase, cytosolic, putative</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>M</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At5g58330</p>
                     </c>
                     <c ca="center">
                        <p>malate dehydrogenase (NADP), chloroplast, putative</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>M</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At5g43330</p>
                     </c>
                     <c ca="center">
                        <p>malate dehydrogenase, cytosolic, putative</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At5g03290</p>
                     </c>
                     <c ca="center">
                        <p>isocitrate dehydrogenase, putative/NAD+ isocitrate dehydrogenase, putative</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At4g35260</p>
                     </c>
                     <c ca="center">
                        <p>IDH1 (ISOCITRATE DEHYDROGENASE 1); isocitrate dehydrogenase (NAD+)</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At1g65930</p>
                     </c>
                     <c ca="center">
                        <p>isocitrate dehydrogenase, putative/NADP+ isocitrate dehydrogenase, putative</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At3g09810</p>
                     </c>
                     <c ca="center">
                        <p>isocitrate dehydrogenase, putative/NAD+ isocitrate dehydrogenase, putative</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>M</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At4g35650</p>
                     </c>
                     <c ca="center">
                        <p>isocitrate dehydrogenase, putative/NAD+ isocitrate dehydrogenase, putative</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>U</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At5g14590</p>
                     </c>
                     <c ca="center">
                        <p>isocitrate dehydrogenase, putative/NADP+ isocitrate dehydrogenase, putative</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>M</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At1g54340</p>
                     </c>
                     <c ca="center">
                        <p>ICDH (ICDH); isocitrate dehydrogenase (NADP+)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>M</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>This table provides the AGI number, the gene annotation, regulation by nitrate as determined from four independent experiments (see main text) and the methylation status according to the two methylome datasets used in this work. This table includes all the different malate dehydrogenase and isocitrate dehydrogenase isozyme-coding genes present in the Arabidopsis genome, according to VirtualPlant <url>http://www.virtualplant.org</url>. <sup>a</sup>Methylation code: U, unmethylated in both datasets; M, methylated in both datasets; A, ambiguous according to Zilberman et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> but unmethylated according to Zhang et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The analysis of the large and heterogeneous whole-genome microarray dataset available in the public domain proved useful to evaluate principles that govern regulation of gene expression in plants. Our global and systematic analysis of the quantitative effect of different experimental factors (e.g., mutations, stress and organ identity) on the plant transcriptome revealed the key role of developmental processes for establishing mRNA levels throughout the plant. This process in turn determines how cells, organs and tissues respond to exogenous cues. Our data indicate that plant responses to external stimuli are strongly organ-dependent and underscore the need for a more thorough survey of organ-specific and, by extension, cell-specific responses in Arabidopsis and other plants <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
         <p>The second part of our analysis provided a weighted insight into the role of different molecular mechanisms in the global regulation of gene expression in Arabidopsis. The data indicate that DNA methylation within the body of Arabidopsis genes is a key factor that may determine or negatively influence the capacity of genes to respond to internal or external cues. The presence of a TATA-box may favor gene responsiveness but to a lesser extent than the negative effect of DNA methylation. Surprisingly, our data indicate that other gene structural features (e.g., number of cis-acting elements, gene size, presence and number of introns) are less important than DNA methylation and the presence of a TATA-box. These results highlight the importance of epigenetic mechanisms for the global control of gene expression. As a concrete example, we found consistency between regulation by an external stimulus (nitrate) and gene body methylation for a discrete biological process, the TCA cycle, beyond what would be expected by chance. The results presented here suggest a model whereby gene body DNA methylation restrains the ability of a gene to be regulated, regardless of regulatory signals (e.g., binding sites for specific transcription factors in the promoter region). This effect would not be directly dependent on basal gene expression levels. Moreover, our results provide a plausible functional role for the DNA methylation that is found in the body of a large number of Arabidopsis genes. This new role differs from the proposed role for DNA methylation in suppressing spurious transcriptional initiation <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B39">39</abbr></abbrgrp> and reinforces the link between the regulation of gene expression and DNA methylation in eukaryotes.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data processing</p>
            </st>
            <p>The CEL data files comprising all ATH1 Affymetrix hybridizations through the end of 2005 were obtained from NASCArrays through the AffyWatch Subscription Service. This data comprised 1887 hybridizations corresponding to 108 different experiments. The entire hybridization set was normalized using the Robust Multiarray Analysis method <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> available from Bioconductor <url>http://www.bioconductor.org</url>. Once normalized, the hybridizations were quality-controlled using the method devised by Persson et al <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Briefly, this method uses a Kolmogorov-Smirnov goodness-of-fit test to evaluate whether the distribution of deleted residuals for an individual hybridization deviates from a "t" distribution. According to Persson et al <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, this occurs when the value of the <it>D </it>statistic from the goodness-of-fit test is more than 0.15. The CEL files with a <it>D </it>statistic over this cut-off value were excluded from the analysis. This step resulted in the exclusion of 186 CEL files.</p>
            <p>For the analysis of differential expression, the remaining 1701 hybridizations were mapped to their corresponding experiments. Controls and biologically meaningful tests were identified and grouped with their replicates. Comparisons in which the control or treatment hybridizations had less than 2 replicates were discarded. This process resulted in a list of 474 biologically meaningful comparisons (control versus test), including 1295 hybridizations. In the case of tissue comparisons, we used rosette leaves as a control, and all other tissues were considered tests. Rosette leaves were chosen as the reference because they are the prototypical organ system <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. We classified the comparisons according the experimental variable involved using the criteria defined by TAIR <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, and according to the RNA source organ (Figure <figr fid="F1">1</figr>)</p>
         </sec>
         <sec>
            <st>
               <p>Differential expression analysis</p>
            </st>
            <p>The comparisons were analyzed for differential gene expression using the RankProducts method <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, implemented as a Bioconductor package <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. This method outperformed other methods to define differential expression in a study comparing ten different methods <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, particularly in high-noise, low-replicate datasets. Our comparisons have a low number of replicates (average = 2.7) and a high variability (pooled variance of the whole dataset = 4.04). We also evaluated the performance of RankProducts as compared to other popular alternative methods based on biological criteria. We defined regulation using RankProducts, average fold change and t-test with different FDR corrections for multiple testing <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. To evaluate the methods, we randomly chose five test comparisons from different experimental categories (e.g. biotic, abiotic, tissue).</p>
            <p>We evaluated the functional coherence of the differentially expressed genes by the different methods by evaluating enriched gene ontology (GO) terms in the resulting lists. For most of the comparisons tested, visual inspection revealed enriched GO terms that were obviously related to the experimental factor. This was not the case for the other methods. As an example, 245 genes were found to be differentially expressed in the comparison DO.1.1 (Additional File <supplr sid="S1">1</supplr>). Out of these 245 genes, 217 were previously identified as regulated in these experiments using a different method in a prior study <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. In addition, the 140 down-regulated genes determined by RankProducts showed an overrepresentation of "transport" and other functional terms previously known to be related to the experimental factor <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Similarly, the abscisic acid response evaluated in comparison AQ.4.4 (Additional File <supplr sid="S1">1</supplr>) identified 241 differentially expressed genes. Among the up-regulated genes, we found that the 'abscisic acid response' functional term was overrepresented.</p>
            <p>With the results of the differential expression analysis, a "regulation matrix" was created. This matrix contained the p-value for the down- and up-regulation of all of the ATH1 Affymetrix chip probes across the 474 comparisons. The cut-off for defining a probe as differentially expressed was 0.05. The complete data file with ratios is available from <url>http://virtualplant.bio.puc.cl/cgi-bin/Lab/download.cgi</url>. Additional data files are available upon request.</p>
         </sec>
         <sec>
            <st>
               <p>Housekeeping and hypervariable gene definition</p>
            </st>
            <p>The least responsive genes (housekeeping genes) were defined as follows: first, we selected genes which did not show differential expression in any comparison (5652 genes). Second, these genes were filtered for expression above the median of the entire NASC dataset (1758 genes). Third, we choose only those having a signal difference between the 1<sup>st </sup>and 3<sup>rd </sup>quartile (interquartile range) that was in the bottom 5 percentile of the signal interquartile ranges from the whole dataset. This ensured the selection of 384 expressed Arabidopsis genes that exhibit the lowest expression variability.</p>
            <p>For the most responsive genes (hypervariable genes), we first choose genes that were regulated in 86 or more comparisons, corresponding to the top 1% most responsive genes from Figure <figr fid="F2">2C</figr>. Second, we selected genes that were regulated in at least six out of the eight categories defined in Figure <figr fid="F1">1A</figr> to avoid any bias due to large categories (e.g., abiotic stress experiments). We did not use an expression cutoff, since as expected hypervariable genes were sufficiently expressed, with a median signal of 8.4 across the NASC dataset (the global median is 7.4). From the 185 genes selected by these criteria, we choose those with a signal interquartile range in the upper 5% of the entire dataset. Thus, we defined a group of 123 "hypervariable genes".</p>
         </sec>
         <sec>
            <st>
               <p>Structural and phylogenetic analyses and correlation with gene responsiveness</p>
            </st>
            <p>Gene structural features (gene, CDS, exon, intron lengths and numbers) &#8211; were obtained from the TAIR 6.0 Arabidopsis genome <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Phylogenetic classifications of the genes were obtained from the Plant-Specific Database <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Methylation status of the different genes (body methylated, body unmethylated and promoter methylated) was obtained from Zhang et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> or Zilberman et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. TATA-box presence or absence in the promoter region of Arabidopsis genes was obtained from Molina and Grotewold<abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The number of transcription factor binding sites in gene promoters was calculated from the data in the AtCis Database from AGRIS <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. Unstable transcripts were extracted from the data generated by Gutierrez et al. <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. All data were processed using custom-made scripts in R <url>http://www.R-project.org</url> and Perl languages. Statistical analyses and graphs were done in R, GraphPad Prisma 4.0 software or Microsoft Excel.</p>
         </sec>
         <sec>
            <st>
               <p>Statistical and regression analysis</p>
            </st>
            <p>Calculation of significant enrichment or depletion was done in R using the hypergeometric distribution. t-tests were carried out with the GraphPad Prisma 4.0 software. Simple and multiple linear regression models used to predict gene responsiveness as a function of various structural parameters were done in R. We used simple models of the form: <it>Y </it>~ &#945;<it>X </it>+ &#946;, where <it>Y</it>, the response variable, is the gene responsiveness and <it>X </it>is the value of the structural feature under evaluation. In the case of categorical features, such as methylation or the presence of TATA-box, <it>X </it>represented the frequency of the feature in a group of genes sharing the same responsiveness. For multiple linear regressions, we used models of the form: <it>Y </it>~ &#945;<it>X </it>+ &#946;<it>Z </it>+ &#947;<it>W</it>... where <it>Y </it>was the gene responsiveness and <it>X</it>, <it>Z</it>, <it>W</it>, etc. corresponded to different features to evaluate. Models were fitted using the lm function from the R statistical software. We used the R<sup>2 </sup>parameter to evaluate the quality of the model, since R<sup>2 </sup>represents the extent of data variability explained by the model. As a complementary approach for categorical features, we used one factor ANOVA models. They have the form <it>Y </it>~ &#945;<it>X </it>+ &#946;, where <it>X </it>was a factor encoding the presence or absence of those features at two different levels. We used the 'aov' function in R to fit the model. We used the F statistic to estimate the significance of the contribution of the factors to the response. To estimate the differences between the levels of the factors, we followed the Tukey procedure, using the 'glht' function from the 'multcomp' package in R. The Bayesian Information Criteria was calculated in R using the 'BIC' function in the package 'nlme'. Graphs were done in R, GraphPad Prisma 4.0 software or Microsoft Excel.</p>
         </sec>
         <sec>
            <st>
               <p>Gene body methylation and regulation by nitrate for TCA cycle genes</p>
            </st>
            <p>We retrieved the genes corresponding to the TCA cycle from AraCyc <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. We then determined the gene responsiveness of these genes in four previously published microarray data sets <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp> that were not included in the NASCarrays database and were therefore not used to derive our genome-wide conclusions. We intersected the methylation status <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp> and regulation by nitrate of the genes encoding malate dehydogenases and isocitrate dehydrogenases using the VirtualPlant software platform <url>http://www.virtualplant.org</url>. Statistical analysis of enrichment was performed as described above.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>FFA carried out the bioinformatics and statistical analyses and wrote the manuscript. NM and SYR revised the manuscript critically for important intellectual content. RAG carried out some of the bioinformatics analyses, wrote the manuscript and was responsible for the conception of the study, the design of the data analysis and the interpretation of the results. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Xiaoyu Zhang, Dr. Steve Jacobsen and Dr. Joseph Ecker for kindly providing genome-wide DNA methylation data in a custom format, and Juanita Larra&#237;n-Linton for her proof-reading. This work was funded by grants from: ICGEB (CRPCHI0501), FONDECYT (1060457), MILLENNIUM NUCLEUS FOR PLANT FUNCTIONAL GENOMICS (P006-09-F), FUNDACION ANDES (C14060/62) and NSF (DBI0445666) to R.A.G. F.F.A. was funded by a Ph.D. fellowship from CONICYT.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses</p>
            </title>
            <aug>
               <au>
                  <snm>Kilian</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Horak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wanke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Weinl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Batistic</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>D'Angelo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bornberg-Bauer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kudla</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Harter</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2007</pubdate>
            <volume>50</volume>
            <issue>2</issue>
            <fpage>347</fpage>
            <lpage>363</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2007.03052.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17376166</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>A gene expression map of Arabidopsis thaliana development</p>
            </title>
            <aug>
               <au>
                  <snm>Schmid</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Davison</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Henz</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Pape</snm>
                  <fnm>UJ</fnm>
               </au>
               <au>
                  <snm>Demar</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Scholkopf</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Weigel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lohmann</snm>
                  <fnm>JU</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <issue>5</issue>
            <fpage>501</fpage>
            <lpage>506</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1543</pubid>
                  <pubid idtype="pmpid" link="fulltext">15806101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A Gene Expression Map of the Arabidopsis Root</p>
            </title>
            <aug>
               <au>
                  <snm>Birnbaum</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shasha</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Jung</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Lambert</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Galbraith</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Benfey</snm>
                  <fnm>PN</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <issue>5652</issue>
            <fpage>1956</fpage>
            <lpage>1960</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1090022</pubid>
                  <pubid idtype="pmpid" link="fulltext">14671301</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Transcriptional profiling of the Arabidopsis embryo</p>
            </title>
            <aug>
               <au>
                  <snm>Spencer</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Casson</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Lindsey</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2007</pubdate>
            <volume>143</volume>
            <issue>2</issue>
            <fpage>924</fpage>
            <lpage>940</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1803724</pubid>
                  <pubid idtype="pmpid" link="fulltext">17189330</pubid>
                  <pubid idtype="doi">10.1104/pp.106.087668</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Functional discovery via a compendium of expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Marton</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Stoughton</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Armour</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Coffey</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>YD</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Slade</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lum</snm>
                  <fnm>PY</fnm>
               </au>
               <au>
                  <snm>Stepaniants</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Gachotte</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chakraburtty</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bard</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Friend</snm>
                  <fnm>SH</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2000</pubdate>
            <volume>102</volume>
            <issue>1</issue>
            <fpage>109</fpage>
            <lpage>126</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)00015-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">10929718</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>A Gene Expression Map for Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kiraly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Eizinger</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wylie</snm>
                  <fnm>BN</fnm>
               </au>
               <au>
                  <snm>Davidson</snm>
                  <fnm>GS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>293</volume>
            <issue>5537</issue>
            <fpage>2087</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1061603</pubid>
                  <pubid idtype="pmpid" link="fulltext">11557892</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A gene-coexpression network for global discovery of conserved genetic modules</p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <issue>5643</issue>
            <fpage>249</fpage>
            <lpage>255</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1087447</pubid>
                  <pubid idtype="pmpid" link="fulltext">12934013</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Quantifying the relationship between co-expression, co-regulation and gene function</p>
            </title>
            <aug>
               <au>
                  <snm>Allocco</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kohane</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Butte</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>1</issue>
            <fpage>18</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">375525</pubid>
                  <pubid idtype="pmpid" link="fulltext">15053845</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-18</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Orchestrated transcription of a key pathway in Arabidopsis by the circadian clock</p>
            </title>
            <aug>
               <au>
                  <snm>Harmer</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Hogenesch</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Straume</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Kreps</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Kay</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>2110</fpage>
            <lpage>2113</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5499.2110</pubid>
                  <pubid idtype="pmpid" link="fulltext">11118138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>NASCArrays: a repository for microarray data generated by NASC's transcriptomics service</p>
            </title>
            <aug>
               <au>
                  <snm>Craigon</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Okyere</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jotham</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D575</fpage>
            <lpage>577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308867</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681484</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh133</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Breitling</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Armengaud</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Amtmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Herzyk</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2004</pubdate>
            <volume>573</volume>
            <issue>1&#8211;3</issue>
            <fpage>83</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.febslet.2004.07.055</pubid>
                  <pubid idtype="pmpid" link="fulltext">15327980</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Jeffery</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Culhane</snm>
                  <fnm>AC</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>359</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1544358</pubid>
                  <pubid idtype="pmpid" link="fulltext">16872483</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-359</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>AP2 Gene Determines the Identity of Perianth Organs in Flowers of Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Kunst</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Klenz</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Martinez-Zapater</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Haughn</snm>
                  <fnm>GW</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1989</pubdate>
            <volume>1</volume>
            <issue>12</issue>
            <fpage>1195</fpage>
            <lpage>1208</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">159855</pubid>
                  <pubid idtype="pmpid" link="fulltext">12359889</pubid>
                  <pubid idtype="doi">10.1105/tpc.1.12.1195</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Comparative studies on the type-B response regulators revealing their distinctive properties in the His-to-Asp phosphorelay signal transduction of Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Tajima</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Imamura</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Amano</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yamashino</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mizuno</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Plant Cell Physiol</source>
            <pubdate>2004</pubdate>
            <volume>45</volume>
            <issue>1</issue>
            <fpage>28</fpage>
            <lpage>39</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/pcp/pcg154</pubid>
                  <pubid idtype="pmpid" link="fulltext">14749483</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Cell-fate specification in the epidermis: a common patterning mechanism in the root and shoot</p>
            </title>
            <aug>
               <au>
                  <snm>Schiefelbein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2003</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>74</fpage>
            <lpage>78</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S136952660200002X</pubid>
                  <pubid idtype="pmpid" link="fulltext">12495754</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>LEAFY controls floral meristem identity in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Weigel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Alvarez</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Smyth</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Meyerowitz</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1992</pubdate>
            <volume>69</volume>
            <issue>5</issue>
            <fpage>843</fpage>
            <lpage>859</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(92)90295-N</pubid>
                  <pubid idtype="pmpid" link="fulltext">1350515</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>In plants, highly expressed genes are the least compact</p>
            </title>
            <aug>
               <au>
                  <snm>Ren</snm>
                  <fnm>X-Y</fnm>
               </au>
               <au>
                  <snm>Vorst</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Fiers</snm>
                  <fnm>MWEJ</fnm>
               </au>
               <au>
                  <snm>Stiekema</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Nap</snm>
                  <fnm>J-P</fnm>
               </au>
            </aug>
            <source>Trends in Genetics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>10</issue>
            <fpage>528</fpage>
            <lpage>532</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.08.008</pubid>
                  <pubid idtype="pmpid" link="fulltext">16934358</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Phylogenetic profiling of the Arabidopsis thaliana proteome: what proteins distinguish plants from other organisms?</p>
            </title>
            <aug>
               <au>
                  <snm>Gutierrez</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Keegstra</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohlrogge</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>8</issue>
            <fpage>R53</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">507878</pubid>
                  <pubid idtype="pmpid" link="fulltext">15287975</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-8-r53</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Identification of unstable transcripts in Arabidopsis by cDNA microarray analysis: rapid decay is associated with a group of touch- and specific clock-controlled genes</p>
            </title>
            <aug>
               <au>
                  <snm>Gutierrez</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Ewing</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>17</issue>
            <fpage>11513</fpage>
            <lpage>11518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">123287</pubid>
                  <pubid idtype="pmpid" link="fulltext">12167669</pubid>
                  <pubid idtype="doi">10.1073/pnas.152204099</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A Unified Theory of Gene Expression</p>
            </title>
            <aug>
               <au>
                  <snm>Orphanides</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Reinberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>108</volume>
            <issue>4</issue>
            <fpage>439</fpage>
            <lpage>451</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)00655-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11909516</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The regulatory code for transcriptional response diversity and its relation to genome structural properties in Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Walther</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brunnemann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>PLoS Genetics</source>
            <pubdate>2006</pubdate>
            <note><b>preprint(2006)</b>:e11.eor.</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1796623</pubid>
                  <pubid idtype="pmpid" link="fulltext">17291162</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Gardening the genome: DNA methylation in Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Chan</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>IR</fnm>
               </au>
               <au>
                  <snm>Jacobsen</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>5</issue>
            <fpage>351</fpage>
            <lpage>360</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1601</pubid>
                  <pubid idtype="pmpid" link="fulltext">15861207</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>DNA methylation dynamics in plant genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Gehring</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Henikoff</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Biochim Biophys Acta</source>
            <pubdate>2007</pubdate>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17341434</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Yazaki</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sundaresan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cokus</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Henderson</snm>
                  <fnm>IR</fnm>
               </au>
               <au>
                  <snm>Shinn</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pellegrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jacobsen</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Ecker</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>126</volume>
            <issue>6</issue>
            <fpage>1189</fpage>
            <lpage>1201</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.08.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">16949657</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Zilberman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gehring</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Ballinger</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Henikoff</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <issue>1</issue>
            <fpage>61</fpage>
            <lpage>69</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1929</pubid>
                  <pubid idtype="pmpid" link="fulltext">17128275</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Genome wide analysis of Arabidopsis core promoters</p>
            </title>
            <aug>
               <au>
                  <snm>Molina</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Grotewold</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>25</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">554773</pubid>
                  <pubid idtype="pmpid" link="fulltext">15733318</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-6-25</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>PlantProm: a database of plant promoter sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Shahmuradov</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Gammerman</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Hancock</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Bramley</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Solovyev</snm>
                  <fnm>VV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>114</fpage>
            <lpage>117</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165488</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519961</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Multiple comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Tukey</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Am Stat Assoc</source>
            <pubdate>1953</pubdate>
            <volume>48</volume>
            <fpage>624</fpage>
            <lpage>625</lpage>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Estimating the dimension of a model</p>
            </title>
            <aug>
               <au>
                  <snm>Schwarz</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Annls Statistics</source>
            <pubdate>1978</pubdate>
            <volume>6</volume>
            <fpage>461</fpage>
            <lpage>464</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1214/aos/1176344136</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Whole-genome analysis of histone H3 lysine 27 trimethylation in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Clarenz</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Cokus</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bernatavichute</snm>
                  <fnm>YV</fnm>
               </au>
               <au>
                  <snm>Pellegrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goodrich</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jacobsen</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2007</pubdate>
            <volume>5</volume>
            <issue>5</issue>
            <fpage>e129</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1852588</pubid>
                  <pubid idtype="pmpid" link="fulltext">17439305</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0050129</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Partitioning of the Maize Epigenome by the Number of Methyl Groups on Histone H3 Lysines 9 and 27</p>
            </title>
            <aug>
               <au>
                  <snm>Shi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dawe</snm>
                  <fnm>RK</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>173</volume>
            <issue>3</issue>
            <fpage>1571</fpage>
            <lpage>1583</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1526679</pubid>
                  <pubid idtype="pmpid" link="fulltext">16624902</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.056853</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>A Bivalent Chromatin Structure Marks Key Developmental Genes in Embryonic Stem Cells</p>
            </title>
            <aug>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Kamal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Huebert</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fry</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Meissner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wernig</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Plath</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jaenisch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wagschal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Feil</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>125</volume>
            <issue>2</issue>
            <fpage>315</fpage>
            <lpage>326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.02.041</pubid>
                  <pubid idtype="pmpid" link="fulltext">16630819</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Histone Methylation-Dependent Mechanisms Impose Ligand Dependency for Gene Activation by Nuclear Receptors</p>
            </title>
            <aug>
               <au>
                  <snm>Garcia-Bassets</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kwon</snm>
                  <fnm>Y-S</fnm>
               </au>
               <au>
                  <snm>Telese</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Prefontaine</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Hutt</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Ju</snm>
                  <fnm>B-G</fnm>
               </au>
               <au>
                  <snm>Ohgi</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Escoubet-Lozach</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Glass</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>X-D</fnm>
               </au>
               <au>
                  <snm>Rosenfeld</snm>
                  <fnm>MG</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>128</volume>
            <issue>3</issue>
            <fpage>505</fpage>
            <lpage>518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1994663</pubid>
                  <pubid idtype="pmpid" link="fulltext">17289570</pubid>
                  <pubid idtype="doi">10.1016/j.cell.2006.12.038</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Genomic analysis of the nitrate response using a nitrate reductase-null mutant of Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tischner</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gutierrez</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Hoffman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Xing</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Coruzzi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Crawford</snm>
                  <fnm>NM</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>136</volume>
            <issue>1</issue>
            <fpage>2512</fpage>
            <lpage>2522</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523318</pubid>
                  <pubid idtype="pmpid" link="fulltext">15333754</pubid>
                  <pubid idtype="doi">10.1104/pp.104.044610</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Genome-wide reprogramming of primary and secondary metabolism, protein synthesis, cellular growth processes, and the regulatory infrastructure of Arabidopsis in response to nitrogen</p>
            </title>
            <aug>
               <au>
                  <snm>Scheible</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Morcuende</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Czechowski</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fritz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Osuna</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Palacios-Rojas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Schindelasch</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thimm</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Udvardi</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Stitt</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>136</volume>
            <issue>1</issue>
            <fpage>2483</fpage>
            <lpage>2499</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523316</pubid>
                  <pubid idtype="pmpid" link="fulltext">15375205</pubid>
                  <pubid idtype="doi">10.1104/pp.104.047019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Genome-wide patterns of carbon and nitrogen regulation of gene expression validate the combined carbon and nitrogen (CN)-signaling hypothesis in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Palenchar</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kouranov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lejay</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coruzzi</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>11</issue>
            <fpage>R91</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545782</pubid>
                  <pubid idtype="pmpid" link="fulltext">15535867</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Microarray analysis of the nitrate response in Arabidopsis roots and shoots reveals over 1,000 rapidly responding genes and new linkages to glucose, trehalose-6-phosphate, iron, and sulfate metabolism</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Okamoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Xing</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Crawford</snm>
                  <fnm>NM</fnm>
               </au>
            </aug>
            <source>Plant Physiology</source>
            <pubdate>2003</pubdate>
            <volume>132</volume>
            <fpage>556</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">166997</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805587</pubid>
                  <pubid idtype="doi">10.1104/pp.103.021253</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Steps towards an integrated view of nitrogen metabolism</p>
            </title>
            <aug>
               <au>
                  <snm>Stitt</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Matt</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gibon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Carillo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Morcuende</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Scheible</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Krapp</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Exp Bot</source>
            <pubdate>2002</pubdate>
            <volume>53</volume>
            <issue>370</issue>
            <fpage>959</fpage>
            <lpage>970</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/jexbot/53.370.959</pubid>
                  <pubid idtype="pmpid" link="fulltext">11912238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>CpG methylation is targeted to transcription units in an invertebrate genome</p>
            </title>
            <aug>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Kerr</snm>
                  <fnm>ARW</fnm>
               </au>
               <au>
                  <snm>De Sousa</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bird</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <issue>5</issue>
            <fpage>625</fpage>
            <lpage>631</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1855171</pubid>
                  <pubid idtype="pmpid" link="fulltext">17420183</pubid>
                  <pubid idtype="doi">10.1101/gr.6163007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Exploration, normalization, and summaries of high density oligonucleotide array probe level data</p>
            </title>
            <aug>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Hobbs</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Collin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Beazer-Barclay</snm>
                  <fnm>YD</fnm>
               </au>
               <au>
                  <snm>Antonellis</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Scherf</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Biostat</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>2</issue>
            <fpage>249</fpage>
            <lpage>264</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/biostatistics/4.2.249</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets</p>
            </title>
            <aug>
               <au>
                  <snm>Persson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Milne</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Somerville</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>24</issue>
            <fpage>8633</fpage>
            <lpage>8638</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142401</pubid>
                  <pubid idtype="pmpid" link="fulltext">15932943</pubid>
                  <pubid idtype="doi">10.1073/pnas.0503392102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community</p>
            </title>
            <aug>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Beavis</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Berardini</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Dixon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Garcia-Hernandez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Huala</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Montoya</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Mundodi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reiser</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tacklind</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Weems</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Yoo</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yoon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>224</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165523</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519987</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg076</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>RankProd: a bioconductor package for detecting differentially expressed genes in meta-analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Hong</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Breitling</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>McEntee</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Wittner</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Nemhauser</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Chory</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>22</issue>
            <fpage>2825</fpage>
            <lpage>2827</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl476</pubid>
                  <pubid idtype="pmpid" link="fulltext">16982708</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Controlling the False Discovery Rate: a practical and powerful approach to multiple testing</p>
            </title>
            <aug>
               <au>
                  <snm>Benjamini</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hochberg</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society, Series B</source>
            <pubdate>1995</pubdate>
            <volume>57</volume>
            <fpage>289</fpage>
            <lpage>300</lpage>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Statistical significance for genomewide studies</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>16</issue>
            <fpage>9440</fpage>
            <lpage>9445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">170937</pubid>
                  <pubid idtype="pmpid" link="fulltext">12883005</pubid>
                  <pubid idtype="doi">10.1073/pnas.1530509100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>An integrated view of gene expression and solute profiles of Arabidopsis tumors: a genome-wide approach</p>
            </title>
            <aug>
               <au>
                  <snm>Deeken</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Engelmann</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Efetova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Czirjak</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kaiser</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Tietz</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Krischke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Palme</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dandekar</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hedrich</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <issue>12</issue>
            <fpage>3617</fpage>
            <lpage>3634</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1785400</pubid>
                  <pubid idtype="pmpid" link="fulltext">17172353</pubid>
                  <pubid idtype="doi">10.1105/tpc.106.044743</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>AGRIS: Arabidopsis Gene Regulatory Information Server, an information resource of Arabidopsis cis-regulatory elements and transcription factors</p>
            </title>
            <aug>
               <au>
                  <snm>Davuluri</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Palaniswamy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Molina</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Grotewold</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>1</issue>
            <fpage>25</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">166152</pubid>
                  <pubid idtype="pmpid" link="fulltext">12820902</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-25</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>AraCyc: A Biochemical Pathway Database for Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Mueller</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2003</pubdate>
            <volume>132</volume>
            <issue>2</issue>
            <fpage>453</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">166988</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805578</pubid>
                  <pubid idtype="doi">10.1104/pp.102.017236</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
