<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2229-8-99</ui>
   <ji>1471-2229</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Regulon organization of Arabidopsis</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Mentzen</snm>
               <mi>I</mi>
               <fnm>Wieslawa</fnm>
               <insr iid="I1"/>
               <email>wimentzen@gmail.com</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Wurtele</snm>
               <mnm>Syrkin</mnm>
               <fnm>Eve</fnm>
               <insr iid="I2"/>
               <email>mash@iastate.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>CRS4 Bioinformatics Laboratory, Parco Scientifico e Technologico POLARIS, 09010 Pula (CA), Italy</p>
            </ins>
            <ins id="I2">
               <p>Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50011, USA</p>
            </ins>
         </insg>
         <source>BMC Plant Biology</source>
         <issn>1471-2229</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>99</fpage>
         <url>http://www.biomedcentral.com/1471-2229/8/99</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18826618</pubid>
               <pubid idtype="doi">10.1186/1471-2229-8-99</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>06</day>
               <month>6</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>30</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>30</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Mentzen and Wurtele; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Despite mounting research on the Arabidopsis transcriptome and powerful tools for exploring the biology of this model plant, the organization of expression of the Arabidopsis genome is only partially understood. Here, we create a coexpression network from a dataset of 22,746 Affymetrix probes derived from 963 microarray chips that query the transcriptome in response to a wide variety of environmentally, genetically, and developmentally induced perturbations.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Markov chain graph clustering of the coexpression network delineates 998 regulons ranging from one to 1623 genes in size. To assess the significance of the clustering results, the statistical over-representation of GO terms is averaged over this set of regulons and compared to the analogous values for 100 randomly-generated sets of clusters. The set of regulons derived from the experimental data scores significantly better than any of the randomly-generated sets. Most regulons correspond to identifiable biological processes and include a combination of genes encoding related developmental, metabolic pathway, and regulatory functions. In addition, nearly 3000 genes of unknown molecular function or process are assigned to a regulon. Only five regulons contain plastomic genes; four of these are exclusively plastomic. In contrast, expression of the mitochondrial genome is highly integrated with that of nuclear genes; each of the seven regulons containing mitochondrial genes also incorporates nuclear genes. The network of regulons reveals a higher-level organization, with dense local neighborhoods articulated for photosynthetic function, genetic information processing, and stress response.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This analysis creates a framework for generation of experimentally testable hypotheses, gives insight into the concerted functions of Arabidopsis at the transcript level, and provides a test bed for comparative systems analysis.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Genes that share a similar expression profile across multiple spatial, temporal, environmental and genetic conditions are likely to be under common transcriptional regulation. Such sets of coexpressed genes could be considered eukaryotic regulons <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Meta-analysis of microarray data, sometimes combined with other types of data (proteomics, co-precipitation, literature, yeast two-hybrid), has proven valuable for model organisms including bacteria <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, nematode <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, human <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>, chimpanzee <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, mouse <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, rat <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and yeast <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. The use of transcriptome data alone has allowed the identification of functionally coherent modules corresponding to major cellular processes in yeast <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>, and some of these modules might be important enough to be conserved across eukaryotic organisms <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>Meta-analysis of the Arabidopsis transcriptome thus offers the potential to identify prevailing cellular processes, to associate genes with particular biological processes, and to assign otherwise unknown genes to the biological processes with which they are correlated. Although the genome of the model plant Arabidopsis has been fully sequenced since 2000, the function of many of its over 27,000 protein-coding genes is experimentally undetermined. Almost 9,000 of the genes cannot be ascribed any function. Many other genes contain a domain recognizable as representing a general molecular or biochemical function (phosphorylation, glycosylation), but have no clear physiological function; i.e., the nature of their involvement in cellular processes is not understood (TAIR8 Genome Release, April 28, 2008 <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>). An effort to assign function to otherwise unknown genes that are correlated with genes of known function was recently undertaken by Horan and coworkers <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. The authors used the clustering of expression data to propose functions for 1547 genes coding for proteins of unknown function (PUF) and set up a Plant Unknown-eome Database (POND) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
         <p>Arabidopsis expression data is available across a wide range of perturbations of nutrients, stress, and light, in the framework of defined organs, genetic backgrounds, and developmental stages. With this wealth of data it is tempting to identify genes that share common expression signatures across a variety of experiments. Thus, the Arabidopsis transcriptome is receiving growing attention, despite the challenges associated with a high volume of genes, distribution of data across multiple databases and publications, and incompleteness of the biological data and metadata. Several online repositories for microarray data and metadata storage and/or analysis have been created, including NASCArrays <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, Genevestigator <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>, PLEXdb <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, MetaOmGraph <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, ArrayExpress <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>, Vanted <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>, VirtualPlant <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, ATTED-II <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>, Arabidopsis Coexpression Data Mining Tools <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, Bio-Array Resource <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>, MapMan <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>, PageMan <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp> and CressExpress <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>. Based on the data from public datasets, coexpression of genes in the indole, flavonoid and phenyl-propanoid biosynthetic pathways has been reported <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. 
Similarly, coexpression has been shown for genes involved in the synthesis of cellulose and other cell wall components <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp> and in oxidative phosphorylation <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Morcuende et al. <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> used large-scale transcript data, along with metabolomic and enzymatic activity data, to investigate finer-tuned changes in Arabidopsis regulatory networks during phosphate starvation. Ma and Bohnert <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> and Weston et al. <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> classified characteristic transcriptome responses to stresses, using public microarray data. Biehl et al. <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> assigned 1590 Arabidopsis nuclear genes, mostly encoding plastid-localized proteins, to 23 regulons, based on RNA accumulation profiles across 101 different conditions. A study by Wei et al. <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> of the transcriptional coordination of 1,330 genes coding for enzymes in AraCyc pathways <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp> indicated a broad transcriptional basis for coexpression of metabolic pathways. Recently, Ma et al. <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> presented an Arabidopsis gene coexpression network based on partial correlation analysis. By randomly sampling genes, the authors circumvented the computational problem that arises when the number of samples is small relative to the number of genes, and approximated direct associations between the genes, obtaining a network with 6760 genes and 18,000 interactions.</p>
         <p>Here, we present a global analysis of the regulon organization of the Arabidopsis genome, derived from graph clustering of a coexpression network of 13,456 genes. The relationships among genes in a complex organism clearly entail shifts in alliances among the genes, resulting in "fuzzy" memberships <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> in different clusters according to perturbations in environmental and genetic conditions. Nonetheless, this analysis captures the prevailing transcriptional network of the organism. As such, it provides a strategy to evaluate functions of genes in a given gene family, and to develop experimentally testable hypotheses about the functions of genes with no known physiological or developmental role. The approach applied in this study delivers a perspective that is not constrained by existing assumptions about the organization of plant processes. Instead, the organization emerges directly from observations. The analysis reveals biologically coherent functional modules, representing a sometimes surprising combination of metabolic and developmental genes.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Graph-clustering of Arabidopsis transcriptome data</p>
            </st>
            <p>To identify the regulon organization of coexpressed Arabidopsis genes that reflects the prevailing processes in this plant, we performed a meta-analysis of multiple microarray experiments. We developed a transcriptome data set for 70 experiments from the public microarray repositories NASCArrays <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B18">18</abbr></abbrgrp> and PLEXdb <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B56">56</abbr></abbrgrp> (described in detail in Methods). The experiments from these databases incorporate a wide variety of mutants, environmental conditions and stages of development. To avoid artifactual signals, samples and experiments with poor replicate quality were removed. The resulting 963 chips were normalized to a common range and the replicates were averaged to yield 424 samples. To further minimize noise in the data, probe sets with low expression values (defined as probes whose expression in every microarray chip was lower than the mean value for that chip) were removed from analysis (Figure <figr fid="F1">1</figr>). In order to concentrate on the most prominent coexpressed sets, only genes whose Pearson correlation with at least two other genes exceeded a threshold of 0.7 were included in the analysis. The expression data for the resulting 13,456 probe sets were treated as a coexpression network in which genes are represented as nodes, and two nodes are connected by an edge if the Pearson correlation between their RNA accumulation profiles is higher than a threshold of 0.7.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Data processing for construction of the transcriptional network</p>
               </caption>
               <text>
                  <p><b>Data processing for construction of the transcriptional network</b>. Filters were applied to original probe sets on ATH1 chip to remove the genes with expression lower than the mean of 100 and with correlation to other genes lower than the threshold of 0.7. The network was then constructed and the largest connected component of this network was retained; smaller connected components as well as genes with only one neighbor in the giant connected component were filtered out. This resulting network, containing 13,456 genes, was then clustered. Enrichment of Gene Ontology terms in groups of filtered genes is indicated.</p>
               </text>
               <graphic file="1471-2229-8-99-1"/>
            </fig>
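            <p>The filtering steps described above and in Figure 1 can be sketched as follows. This is a minimal illustration on toy data, not the pipeline used in the study: the function names, the toy expression matrix, and the choice to apply the neighbor filter in a single pass after extracting the largest connected component are all assumptions made for the example.</p>

```python
import numpy as np

def coexpression_adjacency(expr, threshold=0.7):
    """Boolean adjacency matrix: True where Pearson r exceeds the threshold.

    expr has one row per gene and one column per sample.
    """
    corr = np.corrcoef(expr)
    adj = corr > threshold
    np.fill_diagonal(adj, False)  # no self-edges
    return adj

def largest_component(adj):
    """Sorted node indices of the largest connected component."""
    n = adj.shape[0]
    seen = np.zeros(n, dtype=bool)
    best = []
    for start in range(n):
        if seen[start]:
            continue
        seen[start] = True
        stack, component = [start], []
        while stack:
            node = stack.pop()
            component.append(node)
            for neighbor in np.flatnonzero(adj[node]):
                if not seen[neighbor]:
                    seen[neighbor] = True
                    stack.append(int(neighbor))
        if len(component) > len(best):
            best = component
    return sorted(best)

def filter_network(expr, threshold=0.7, min_neighbors=2):
    """Keep genes in the giant component that have enough neighbors there.

    Assumption: the neighbor filter is applied once, not iteratively.
    """
    adj = coexpression_adjacency(expr, threshold)
    giant = largest_component(adj)
    degree = adj[np.ix_(giant, giant)].sum(axis=1)
    return [g for g, d in zip(giant, degree) if d >= min_neighbors]

# Hypothetical toy matrix: genes 0-2 are mutually correlated, gene 3 is
# uncorrelated with everything, genes 4-5 form a separate small component.
base = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
expr = np.array([
    base,
    2.0 * base + 1.0,
    [1.0, 2.0, 3.0, 4.0, 5.0, 7.0],
    [1.0, -1.0, 1.0, -1.0, 1.0, -1.0],
    base[::-1],
    [12.0, 10.0, 8.0, 6.0, 4.0, 2.5],
])
kept = filter_network(expr)
print(kept)
```

            <p>In this toy example only the three mutually correlated genes survive the filters; the isolated gene and the two-gene component are discarded.</p>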
            <p>Pearson's R was chosen as the similarity measure between expression profiles in spite of its known shortcomings: it measures the strength of linear relationships only, and it is sensitive to outliers. We estimate that strong non-linear relationships between gene expression profiles, which would not be picked up by Pearson's R, are relatively rare in expression data. The results of Daub et al. <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, who found no increase in the discovery of high correlations in gene expression data when using a measure of non-linear relationships (mutual information), agree with this assertion. Pearson correlation gives a high score to expression profiles that consist mostly of very low values and one, or a few, very high values, which in our dataset often occurs for genes that are expressed only in a couple of underrepresented tissues or conditions. This sensitivity to outliers is usually seen as a drawback of Pearson's correlation measure. However, we decided that for the purpose of our analysis these outliers are valid, though extreme, data points and that clusters based on the presence of genes in only a couple of samples do represent valid clusters. For groups of genes with expression profiles that are variable and parallel across many diverse conditions, the hypothesis of a common regulatory program acting upon the participating genes is particularly plausible. For clusters that are active only under a small subset of conditions, there is less ground for a co-regulation assumption. However, such clusters may also reveal valuable information. For example, the subset of genes for increased growth in response to auxin and cytokinin, upregulated only in cell cultures and tumors, would be hard to pinpoint if not for meta-analysis. Correlations based on extreme values would not be found with Spearman rank correlation, Kendall's tau or in logged data.</p>
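            <p>The contrast between Pearson and rank-based correlation for profiles dominated by a single extreme sample can be illustrated with a small sketch. The two profiles below are invented for the example, and the rank computation assumes no tied values.</p>

```python
import numpy as np

def pearson(x, y):
    """Pearson correlation coefficient of two 1-D profiles."""
    return float(np.corrcoef(x, y)[0, 1])

def spearman(x, y):
    """Spearman correlation as Pearson on ranks (assumes no tied values)."""
    rank_x = np.argsort(np.argsort(x))
    rank_y = np.argsort(np.argsort(y))
    return pearson(rank_x, rank_y)

# Hypothetical profiles: low, unrelated values in five samples and one
# shared sample with very high expression (the last one).
gene_a = np.array([1.0, 3.0, 2.0, 5.0, 4.0, 100.0])
gene_b = np.array([4.0, 2.0, 5.0, 1.0, 3.0, 90.0])

r_pearson = pearson(gene_a, gene_b)
r_spearman = spearman(gene_a, gene_b)
print(f"Pearson r = {r_pearson:.3f}, Spearman rho = {r_spearman:.3f}")
```

            <p>Here the shared extreme sample alone drives Pearson's R above the 0.7 threshold, whereas the rank-based measure stays near zero, so the pair would form an edge only under Pearson correlation.</p>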
            <p>We aimed to identify sets of coexpressed genes, which are represented in our model as densely-connected regions of the network. A graph-clustering method based on flow simulation (Markov chain graph clustering, MCL) was used to identify clusters in this network that correspond to the sets of coexpressed genes. This method, developed by van Dongen <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, has been used previously for clustering protein sequence data <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> and for identifying modules in the yeast protein interaction network <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. One of the advantages of Markov clustering is that it is scalable to large graphs, unlike most other graph clustering algorithms, which are not applicable to graphs with more than 5000 nodes. Using MCL, we identified 998 clusters in the Arabidopsis network, ranging in size from 1 to 1623 genes.</p>
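            <p>For readers unfamiliar with flow-based clustering, the core loop of Markov clustering (alternating expansion and inflation of a column-stochastic matrix) can be sketched in a few lines. This is a simplified dense-matrix illustration, not the MCL program used in the study; the inflation value of 2.0, the added self-loops, the convergence tolerance, and the toy graph are assumptions made for the example.</p>

```python
import numpy as np

def mcl(adj, inflation=2.0, max_iter=100, tol=1e-6):
    """Simplified Markov clustering on a dense adjacency matrix.

    Returns a sorted list of clusters, each a tuple of node indices.
    """
    flow = adj.astype(float) + np.eye(adj.shape[0])   # add self-loops
    flow = flow / flow.sum(axis=0, keepdims=True)     # column-stochastic
    for _ in range(max_iter):
        previous = flow
        flow = flow @ flow                            # expansion: two-step flow
        flow = flow ** inflation                      # inflation: favor strong flow
        flow = flow / flow.sum(axis=0, keepdims=True)
        if tol > np.abs(flow - previous).max():
            break
    clusters = set()
    for row in flow:
        # A non-empty row belongs to an attractor; its non-zero columns
        # are the nodes whose flow ends up at that attractor.
        members = tuple(int(i) for i in np.flatnonzero(row > 1e-8))
        if members:
            clusters.add(members)
    return sorted(clusters)

# Hypothetical toy graph: two triangles joined by a single edge (2-3).
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adj = np.zeros((6, 6))
for a, b in edges:
    adj[a, b] = adj[b, a] = 1.0

clusters = mcl(adj)
print(clusters)
```

            <p>On this toy graph the single bridging edge carries too little flow to survive inflation, so each triangle ends up as its own cluster. A dense implementation such as this one would not scale to a network of the size analyzed here.</p>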
            <p>To evaluate the significance of the clustering results, we compared the overrepresentation of Gene Ontology (GO <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>) terms in the set of the 148 largest regulons (i.e., those containing at least 10 genes) derived from Markov clustering of the experimental data with the overrepresentation of GO terms in 100 randomly-obtained sets of clusters. For each randomly-obtained set, clusters were designated by permuting the gene locus IDs, such that the sizes of the 148 clusters were preserved relative to the experimental data, but the genes assigned to each cluster changed (see Methods section). The best p-value for overrepresentation of GO terms was recorded for each cluster in a set and averaged over all clusters. The distribution of these values for the randomly-obtained sets of clusters was then compared to the respective value for the regulons derived from the experimental data (Figure <figr fid="F2">2</figr>). For each GO category (Molecular Function, Biological Process, Cellular Component), the experimental dataset scores significantly better than any of the randomly-obtained sets (Wilcoxon test p-value &lt; 2.2 &#215; 10<sup>-16</sup>). This analysis indicates that the concentration of similar GO terms in the clusters derived from experimental data is not random, and thus these regulons might correspond to meaningful biological processes.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Statistical significance of Markov chain graph clustering results</p>
               </caption>
               <text>
                  <p><b>Statistical significance of Markov chain graph clustering results</b>. The best p-values for over-representations of Gene Ontology (GO) terms, averaged over all clusters (S score, denoted by color arrow) are compared to the analogous values for 100 randomly-obtained clusterings (histogram). GO categories: (<b>A</b>) Molecular Function, (<b>B</b>) Biological Process, (<b>C</b>) Cellular Component. In each case, the actual clustering scored significantly better than any of 100 randomly obtained ones (Wilcoxon test p-value &lt; 2.2 &#215; 10<sup>-16</sup>). In a comparison of the MCL (Markov clustering) and k-means clustering results (the latter denoted by black arrows), MCL had better S scores for GO terms overrepresentation than the k-means method (0.0016 versus 0.0020 for "Molecular Function" category; 0.0016 versus 0.0026 for "Biological Process"; and 0.0044 versus 0.0050 for "Cellular Component").</p>
               </text>
               <graphic file="1471-2229-8-99-2"/>
            </fig>
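            <p>The size-preserving permutation test summarized in Figure 2 can be sketched as follows. The enrichment statistic is assumed here to be a one-sided hypergeometric test, which this section does not specify; the function names, the toy annotation, and the random seed are likewise illustrative only.</p>

```python
import math
import random

def hypergeom_tail(k, n_genes, n_with_term, cluster_size):
    """P(X >= k) when cluster_size genes are drawn without replacement."""
    total = math.comb(n_genes, cluster_size)
    upper = min(n_with_term, cluster_size)
    hits = sum(
        math.comb(n_with_term, i) * math.comb(n_genes - n_with_term, cluster_size - i)
        for i in range(k, upper + 1)
    )
    return hits / total

def s_score(clusters, annotations, n_genes):
    """Best (smallest) enrichment p-value per cluster, averaged over clusters."""
    terms = set().union(*annotations.values())
    term_sizes = {t: sum(t in a for a in annotations.values()) for t in terms}
    best = []
    for cluster in clusters:
        pvals = []
        for term in terms:
            k = sum(term in annotations.get(g, ()) for g in cluster)
            pvals.append(hypergeom_tail(k, n_genes, term_sizes[term], len(cluster)))
        best.append(min(pvals))
    return sum(best) / len(best)

def permuted_scores(clusters, annotations, n_genes, n_perm=100, seed=0):
    """S scores for random clusterings that preserve the cluster sizes."""
    rng = random.Random(seed)
    genes = [g for cluster in clusters for g in cluster]
    scores = []
    for _ in range(n_perm):
        shuffled = genes[:]
        rng.shuffle(shuffled)
        it = iter(shuffled)
        random_clusters = [[next(it) for _ in cluster] for cluster in clusters]
        scores.append(s_score(random_clusters, annotations, n_genes))
    return scores

# Hypothetical toy data: 12 genes, two clusters, one term per cluster.
clusters = [list(range(0, 6)), list(range(6, 12))]
annotations = {g: {"termA"} for g in range(0, 6)}
annotations.update({g: {"termB"} for g in range(6, 12)})

observed = s_score(clusters, annotations, n_genes=12)
random_scores = permuted_scores(clusters, annotations, n_genes=12)
print(observed, min(random_scores))
```

            <p>A lower S score indicates stronger enrichment, so a clustering is supported when its observed score lies below the scores of the permuted clusterings.</p>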
            <p>The Markov clustering result was also compared with results obtained from another common method, k-means clustering, with the same number of clusters (998) as a parameter. The clusterings produced by MCL and k-means differed in the distribution of the cluster sizes, which might have influenced their scoring. The agreement between gene assignments to regulons by these two methods is 0.044 (based on the adjusted <it>Rand</it> index; by comparison, the agreement between MCL clustering and a random reassignment of genes to clusters is only 10<sup>-5</sup>). MCL clustering yielded somewhat better S scores for GO terms overrepresentation than the k-means method (0.0016 versus 0.0020 for the Molecular Function category, 0.0016 versus 0.0026 for the Biological Process category, and 0.0044 versus 0.0050 for the Cellular Component category). MCL clustering also had a higher Z-score for the total mutual information between the clustering and all the GO terms describing the genes within the clustering (80.1 versus 54.2 for k-means clustering).</p>
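            <p>The adjusted Rand index used to compare two clusterings can be computed directly from pair counts. The sketch below uses the standard pair-counting formula; the function name and the toy label vectors are illustrative and are not taken from the study.</p>

```python
from collections import Counter
from math import comb

def adjusted_rand_index(labels_a, labels_b):
    """Chance-corrected agreement between two clusterings of the same items."""
    n = len(labels_a)
    pairs_joint = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    pairs_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    pairs_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    expected = pairs_a * pairs_b / comb(n, 2)   # agreement expected by chance
    maximum = (pairs_a + pairs_b) / 2           # agreement if clusterings match
    if maximum == expected:
        return 1.0                              # degenerate case: trivial clusterings
    return (pairs_joint - expected) / (maximum - expected)

# Identical partitions with different label names agree perfectly.
same = adjusted_rand_index([0, 0, 1, 1, 2, 2], [5, 5, 7, 7, 9, 9])
# Partitions that split every pair differently score below zero.
crossed = adjusted_rand_index([0, 0, 1, 1], [0, 1, 0, 1])
print(same, crossed)
```

            <p>Values near zero, such as the 0.044 reported above, indicate agreement only slightly better than expected by chance, while a value of 1 indicates identical partitions.</p>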
            <p>The prevalent physiological or developmental functionality of each regulon containing over 20 genes (69 regulons collectively comprising 9,436 genes) was examined in more detail; the results are summarized in Table <tblr tid="T1">1</tblr>. To assign functionality to a regulon, we combined the results from two independent methods: automatic analysis of enrichment of GO terms, and manual inspection of annotation supplemented by literature searches for each gene's annotation and function, as well as examination of the conditions under which the genes of the regulon are maximally expressed. Most regulons are characterized by a mixture of molecular functions (enzymes, transporters, transcription factors and signaling molecules) that work together to achieve a common goal. This goal could be, for example, hormone-mediated development of floral organs accompanied by metabolic processes (Regulon 43), or a defense response leading to synthesis of protective compounds (Regulon 46). One cluster is almost exclusively devoted to proteolysis (proteasome complex in Regulon 45). Although genes with low expression have been filtered out, the genes that predominate in three of the larger clusters (Regulons 11, 17, and 58) are annotated as "hypothetical", "transposons", or "pseudogenes".</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Predominant functions of the regulons</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Regulon</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b># of genes</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Postulated physiological function</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>FC<sup>a</sup></b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1629</p>
                     </c>
                     <c ca="left">
                        <p>mixed (tricellular and mature pollen-specific) <sup>b</sup></p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1136</p>
                     </c>
                     <c ca="left">
                        <p>Photosynthesis</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>869</p>
                     </c>
                     <c ca="left">
                        <p>protein synthesis</p>
                     </c>
                     <c ca="center">
                        <p>65</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>583</p>
                     </c>
                     <c ca="left">
                        <p>Mitosis</p>
                     </c>
                     <c ca="center">
                        <p>49</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>507</p>
                     </c>
                     <c ca="left">
                        <p>membrane transporters: metal and toxin removal (root-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>77</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>417</p>
                     </c>
                     <c ca="left">
                        <p>embryo maturation (fruit and seed-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>330</p>
                     </c>
                     <c ca="left">
                        <p>developmental regulation (leaf apex-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>38</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>281</p>
                     </c>
                     <c ca="left">
                        <p>information (uninucleate microspore and bicellular pollen-specific)</p>
                     </c>
                     <c ca="center">
                        <p>64</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>234</p>
                     </c>
                     <c ca="left">
                        <p>response to environmental stimuli</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>223</p>
                     </c>
                     <c ca="left">
                        <p>protein modification, defense response</p>
                     </c>
                     <c ca="center">
                        <p>66</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>215</p>
                     </c>
                     <c ca="left">
                        <p>nuclear, others with very low expression</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>182</p>
                     </c>
                     <c ca="left">
                        <p>mixed (fruit-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>154</p>
                     </c>
                     <c ca="left">
                        <p>upregulated in 'response to CO<sub>2</sub> levels' experiment</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>140</p>
                     </c>
                     <c ca="left">
                        <p>regulation of organ development</p>
                     </c>
                     <c ca="center">
                        <p>61</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>138</p>
                     </c>
                     <c ca="left">
                        <p>plastid stress and circadian rhythm</p>
                     </c>
                     <c ca="center">
                        <p>56</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>121</p>
                     </c>
                     <c ca="left">
                        <p>Information</p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>115</p>
                     </c>
                     <c ca="left">
                        <p>Information</p>
                     </c>
                     <c ca="center">
                        <p>51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="left">
                        <p>cell wall, respiration/catabolism (pollen-specific; highest in tricellular pollen)</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>96</p>
                     </c>
                     <c ca="left">
                        <p>mixed (flower-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>94</p>
                     </c>
                     <c ca="left">
                        <p>Information</p>
                     </c>
                     <c ca="center">
                        <p>80</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>21</p>
                     </c>
                     <c ca="center">
                        <p>92</p>
                     </c>
                     <c ca="left">
                        <p>secondary products, secondary wall (flower-specific, mostly tapetum)</p>
                     </c>
                     <c ca="center">
                        <p>54</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>81</p>
                     </c>
                     <c ca="left">
                        <p>cell wall biosynthesis, carbohydrate metabolism</p>
                     </c>
                     <c ca="center">
                        <p>47</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>77</p>
                     </c>
                     <c ca="left">
                        <p>membrane proteins</p>
                     </c>
                     <c ca="center">
                        <p>69</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>71</p>
                     </c>
                     <c ca="left">
                        <p>defense response</p>
                     </c>
                     <c ca="center">
                        <p>70</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>25</p>
                     </c>
                     <c ca="center">
                        <p>70</p>
                     </c>
                     <c ca="left">
                        <p>defense response</p>
                     </c>
                     <c ca="center">
                        <p>77</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>68</p>
                     </c>
                     <c ca="left">
                        <p>Information</p>
                     </c>
                     <c ca="center">
                        <p>79</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>68</p>
                     </c>
                     <c ca="left">
                        <p>regulation, root (root- and hypocotyl-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>73</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>66</p>
                     </c>
                     <c ca="left">
                        <p>nucleic acid binding, regulation</p>
                     </c>
                     <c ca="center">
                        <p>60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="center">
                        <p>63</p>
                     </c>
                     <c ca="left">
                        <p>aerobic respiration in mitochondria</p>
                     </c>
                     <c ca="center">
                        <p>92</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="center">
                        <p>56</p>
                     </c>
                     <c ca="left">
                        <p>Signalling</p>
                     </c>
                     <c ca="center">
                        <p>89</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>31</p>
                     </c>
                     <c ca="center">
                        <p>52</p>
                     </c>
                     <c ca="left">
                        <p>defense response</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>32</p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>nuclear genes, RNA processing, DNA replication</p>
                     </c>
                     <c ca="center">
                        <p>70</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>chloroplast organization and biogenesis</p>
                     </c>
                     <c ca="center">
                        <p>62</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>34</p>
                     </c>
                     <c ca="center">
                        <p>47</p>
                     </c>
                     <c ca="left">
                        <p>mitochondrial genes</p>
                     </c>
                     <c ca="center">
                        <p>96</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>45</p>
                     </c>
                     <c ca="left">
                        <p>kinases, signaling, disease resistance</p>
                     </c>
                     <c ca="center">
                        <p>69</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c ca="center">
                        <p>43</p>
                     </c>
                     <c ca="left">
                        <p>lipid modification and cuticular wax synthesis (flowers and shoot apex-specific)</p>
                     </c>
                     <c ca="center">
                        <p>54</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>37</p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="left">
                        <p>heat shock response</p>
                     </c>
                     <c ca="center">
                        <p>60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>38</p>
                     </c>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>RNA processing, translation, transcription regulation</p>
                     </c>
                     <c ca="center">
                        <p>82</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>39</p>
                     </c>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>catabolic processes deriving energy</p>
                     </c>
                     <c ca="center">
                        <p>51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>transcription, translation, protein folding and transport</p>
                     </c>
                     <c ca="center">
                        <p>86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>41</p>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c ca="left">
                        <p>regulation, information</p>
                     </c>
                     <c ca="center">
                        <p>83</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>Regulation</p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>43</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="left">
                        <p>flower/fruit, cell wall depositions (flower/fruit-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>44</p>
                     </c>
                     <c ca="center">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>metabolic processes in flowers/fruit (flower/fruit-specific)</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>45</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>proteasome complex</p>
                     </c>
                     <c ca="center">
                        <p>87</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>46</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>defense response</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>47</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>nuclear, replication, chromosome organization, cell cycle</p>
                     </c>
                     <c ca="center">
                        <p>67</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>48</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="left">
                        <p>cell culture and tumor specific</p>
                     </c>
                     <c ca="center">
                        <p>ND</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>49</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>chloroplast-encoded</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>Signalling</p>
                     </c>
                     <c ca="center">
                        <p>90</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>51</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="left">
                        <p>organ specification in shoot (leaf apex- and hypocotyl-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>52</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="left">
                        <p>endoplasmic reticulum: protein folding and secretion/redox function</p>
                     </c>
                     <c ca="center">
                        <p>73</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>53</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>fatty acid biosynthesis</p>
                     </c>
                     <c ca="center">
                        <p>83</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>54</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>protein degradation and lipid modification</p>
                     </c>
                     <c ca="center">
                        <p>59</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>55</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>epidermal/cuticular deposits</p>
                     </c>
                     <c ca="center">
                        <p>43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>56</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>nectaries/carpel specific function (carpel-specific)</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>57</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>phloem specific (vasculature tissues-specific)</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>transposases, mostly CACTA-type</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>59</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>metabolism and transport of triterpenoids (root hairs-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>71</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>60</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>ubiquitin ligase</p>
                     </c>
                     <c ca="center">
                        <p>53</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>61</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>metabolism of glutathione and glutamate, redox</p>
                     </c>
                     <c ca="center">
                        <p>56</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>62</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>information, nuclear</p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>63</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>stress-induced catabolism, mediated by jasmonic acid</p>
                     </c>
                     <c ca="center">
                        <p>72</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>64</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>information, nuclear</p>
                     </c>
                     <c ca="center">
                        <p>87</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>65</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>secondary metabolism/pathogen infection</p>
                     </c>
                     <c ca="center">
                        <p>61</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>66</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>exocytosis</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>67</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>Ca<sup>2+</sup>-triggered exocytosis (pathogen response?)</p>
                     </c>
                     <c ca="center">
                        <p>60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>68</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>shoot meristem development and nucleic acid binding (leaf apex- and hypocotyl-preferential)</p>
                     </c>
                     <c ca="center">
                        <p>69</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>69</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>leucine/glucosinolates metabolism</p>
                     </c>
                     <c ca="center">
                        <p>65</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Regulons with 20 or more genes are shown. Annotations are postulated based on GO terms supplemented with information from the published literature.</p>
                  <p><sup><b>a </b></sup>Functional Coherence, calculated as percentage of annotated genes whose TAIR annotation is consistent with the cluster functional classification. (Genes designated "hypothetical" or "unknown" are not included in this calculation.)</p>
                  <p><sup><b>b </b></sup>Prevalent locations of expression are indicated in parentheses. "Specific" means that virtually all expression is in the given location; "preferential" means that most expression is in the given location.</p>
               </tblfn>
            </tbl>
            <p>A simplified view of the coexpression network formed by the 69 largest regulons is shown in Figure <figr fid="F3">3</figr>. A link between two regulons means that one or more genes in one regulon are correlated with one or more genes in the other. Regulons that predominantly contain genes with genetic information-related, photosynthetic/plastidic, and stress response functions form higher-order groupings.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Higher-order structure in the coexpression network</p>
               </caption>
               <text>
                  <p><b>Higher-order structure in the coexpression network</b>. All regulons containing at least 20 genes are depicted; these comprise a total of 9,436 genes. Regulons are represented by ovals numbered 1 through 69. A linkage between two clusters means that one or more genes in one of the clusters are correlated with one or more genes in the other cluster. The proximity of regulons with similar broad functional categories reveals three super-clusters of regulons: information-related functions (purple), plastidic functions (green) and defense response-related functions (yellow). The predominant functionality of each regulon is defined in Table 1. The network was visualized using the GraphExplore tool <abbrgrp><abbr bid="B118">118</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2229-8-99-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Six regulons are devoted to nuclear-encoded plastidic functions</p>
            </st>
            <p>Six of the regulons with 20 or more genes represent plastidic functions that are encoded predominantly by nuclear genes (Regulons: 2, photosynthesis/chloroplast biogenesis; 15, plastid stress and circadian rhythm; 33, plastid organization and biogenesis; 49, plastid-encoded genes; 53, fatty acid biosynthesis; 69, glucosinolate biosynthesis; Table <tblr tid="T1">1</tblr>).</p>
            <p>Regulon 2, the second-largest cluster, contains 971 mainly nuclear-encoded genes involved in chloroplast biogenesis and photosynthesis (overrepresented GO terms: chloroplast: p-value &lt; 10<sup>-85</sup>, thylakoid: p-value = 1.02 &#215; 10<sup>-28</sup>, photosynthesis: p-value = 1.68 &#215; 10<sup>-15</sup>) (Figure <figr fid="F4">4A</figr>). Nineteen genes are involved in the formation and development of the plastid: its biogenesis, organization, fission and relocation. An example of these genes is <it>thylakoid formation 1 </it>(PSB29), required for thylakoid membrane organization <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Transporters of sodium, calcium and other metals are represented. Two hundred and eighteen genes in Regulon 2 have a photosynthesis-related activity. Of these, 36 encode enzymes required for synthesis of the photosynthetic apparatus metabolome (porphyrin pigments, tetrahydrofolate, chlorophyll, carotenoids, and other plastidic isoprenoids). Seventy-five genes encode plastidic ribosome constituents and related functions. Twenty genes from the Calvin cycle, 16 genes from photorespiration, 14 genes representing a subset of plastidic glycolysis enzymes, and 11 genes involved in starch metabolism are also present, reflecting the coupling of these metabolic activities with the light reactions of photosynthesis. In addition, enzymes for plastidic metabolism of amino acids and nucleotides are represented.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Regulons with organelle-specific functions and organelle-encoded genes</p>
               </caption>
               <text>
                  <p><b>Regulons with organelle-specific functions and organelle-encoded genes</b>. Regulon 2, photosynthesis (for clarity, representative expression profiles of 200 randomly chosen genes from this regulon are shown) (<b>A</b>); Regulon 49, plastid-encoded genes (<b>B</b>); Regulon 29, mitochondrial respiration (<b>C</b>); Regulon 34, mitochondrion-encoded genes (<b>D</b>). The plots on the right side show expression profiles of the genes in the respective regulon (each gene depicted in a different color) across the 424 samples in the dataset. The samples have been arranged according to plant tissue. Pie charts are based on manual annotations from published data. RNA profiles were plotted using MetaOmGraph <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B124">124</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2229-8-99-4"/>
            </fig>
            <p>Regulon 2 contains a total of 38 plastid-encoded genes, 27 of which participate in the light reactions of photosynthesis. Such coupling of plastid-encoded and nuclear-encoded genes for the light reactions might be achieved by nuclear-encoded proteins with tetratricopeptide (TPR) or pentatricopeptide (PPR) motifs, which are thought to be transcript-specific regulators of plastome expression <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>, and sigma factors for plastidic RNA polymerase <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Consistent with this concept, 43 genes in Regulon 2 encode proteins with a TPR or PPR domain. One of these, HCF107, has been reported to process the polycistronic chloroplast <it>psbB-psbT-psbH-petB-petD </it>operon coding for proteins of the photosystem II and cytochrome b6/f complexes <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>; both <it>psbH </it>and <it>petB </it>are members of Regulon 2. FLU, another TPR-containing protein in Regulon 2, is a negative regulator of chlorophyll synthesis <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. Five of the six nuclear-encoded sigma factors that modulate the specificity of plastidic RNA polymerase are present in Regulon 2 (SIG1-SIG4 and SIG6 <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>). The exception is the phylogenetically and functionally distinct SIG5, which has been reported to be important for stress response <abbrgrp><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr></abbrgrp>; SIG5 is a member of Regulon 15, "plastid stress and circadian rhythm".</p>
            <p>Protein import is represented in Regulon 2 by: HCF106, a translocation pathway component that imports proteins into the thylakoid lumen <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>; Tic22 and TOC159, a transit sequence receptor required for import of proteins and essential for chloroplast biogenesis <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>; and CPFTSY and CAO, chloroplast signal-recognition particle receptor proteins <abbrgrp><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr></abbrgrp>. Two hundred and three genes in Regulon 2 have no known function. The expression of Regulon 2 is high in light, and in every organ except roots.</p>
         </sec>
         <sec>
            <st>
               <p>Expression of the plastidic genome is partitioned</p>
            </st>
            <p>The plastid-encoded genes are partitioned across five of the 998 regulons: Regulon 2 (described above), Regulon 176 (8 genes), Regulon 283 (5 genes), Regulon 656 (2 genes) and Regulon 49 (27 genes). In contrast to Regulon 2, with its mixture of nuclear and plastidic genes, the other four regulons contain exclusively plastid-encoded genes: Regulon 49 contains genes for 17 ribosomal proteins and RNA polymerases, seven photosystem-related proteins, and three "hypothetical" proteins (Figure <figr fid="F4">4B</figr>). Interestingly, the operon membership of genes does not necessarily conform to their regulon membership. For example, genes from the tri-cistronic operon, <it>psaA-psaB-rps14</it>, each belong to a different cluster (Regulons 49, 2 and 176, respectively); likewise, the genes from the <it>accD </it>operon are scattered among three clusters (Regulons 2, 49 and 283).</p>
         </sec>
         <sec>
            <st>
               <p>Aerobic respiration is a major mitochondrial-related regulon</p>
            </st>
            <p>Only a single regulon of over 20 genes (Regulon 29, 63 genes) contains exclusively nuclear-encoded genes with a predominantly mitochondrial function. Most genes of Regulon 29 are involved in mitochondrial aerobic respiration (Figure <figr fid="F4">4C</figr>). Thirty-nine of these genes encode structural components of the electron transport chain and ATP synthase, six encode TCA cycle enzymes, two code for pyruvate dehydrogenase (one for a cofactor) and twelve are of unknown function. The four remaining genes encode the putative cytosolic galactose kinase GAL1, adenylate kinase, sigma F inhibition-like factor and a mitochondrial dicarboxylate/tricarboxylate carrier. Fifty-one of the 63 proteins encoded by Regulon 29 genes are experimentally demonstrated or predicted to have a mitochondrial localization. The expression is well correlated and highest in pollen.</p>
            <p>A large subset of the genes involved in mitochondrial protein synthesis (191 of 869 genes) is contained in Regulon 3, together with genes for protein synthesis in other cell components.</p>
         </sec>
         <sec>
            <st>
               <p>Mitochondrial genes are integrated with nuclear genes</p>
            </st>
            <p>Seven of the 998 regulons contain an amalgamation of genes from the mitochondrial and nuclear genomes. Although mitochondrial genes predominate in three of these regulons, Regulon 73 (17 out of 18 genes), Regulon 205 (5 out of 7 genes), and Regulon 34 (45 out of 47 genes), no regulon contains exclusively mitochondrial genes. This synchronization of expression of mitochondrial and nuclear genomes is consistent with the recent report that mitochondrial functions require coexpression of genes from both genomes <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>.</p>
            <p>Of the genes in Regulon 34, 19 represent respiration-related functions (ATPases, cytochrome, NADPH dehydrogenase) (Figure <figr fid="F4">4D</figr>). Most of the other genes are annotated as hypothetical or unknown. Eight are adjacent genes already reported to be co-transcribed (nad3 and rpsL2; rpl5 and cob; nad4L and orf25; atp1 and orf294) <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>; however, other genes are from scattered regions of the mitochondrial chromosome. This observed coexpression of genes from different regions of the mitochondrial genome is consistent with experimental evidence that modulation of RNA stability plays a major role in regulation of gene expression in this organelle <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. The expression of Regulon 34 is generally high and well-correlated, and is upregulated in seeds, male gametophytes, and during starvation.</p>
         </sec>
         <sec>
            <st>
               <p>Control of nuclear function and information processing</p>
            </st>
            <p>Sixteen regulons are enriched in genes with a genetic information-related function (transcription, translation, replication, DNA metabolism and repair, RNA processing, chromatin assembly, chromatin rearrangement or cell cycle; Table <tblr tid="T1">1</tblr>).</p>
            <p>Regulon 4 contains 495 genes, many with experimentally determined or predicted functions in the cell cycle (Figure <figr fid="F5">5A</figr>). For example, twenty-one genes are directly involved in mitosis, including cell division control proteins (CDKB1, CDKB2;1, CDKB2;2, CDC2MsF) and cell division cycle protein HBT, cyclins and other cyclin-dependent proteins. Other nuclear functions represented include gene silencing, regulation of organ development, nuclear transport, RNA processing and histones. Regulatory and signaling genes include 55 transcription factors, 54 protein kinases, 53 signaling-related genes and 18 other regulatory proteins. The expression of Regulon 4 is highest in the leaf apex.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Regulons with developmental and metabolic functions</p>
               </caption>
               <text>
                  <p><b>Regulons with developmental and metabolic functions</b>. Regulon 4, cell division (for clarity, representative expression profiles of 200 randomly chosen genes are shown) (<b>A</b>); Regulon 20, nuclear regulation (<b>B</b>); Regulon 35, protein kinases, signaling and defense response (<b>C</b>); Regulon 69, glucosinolate biosynthesis (<b>D</b>); Regulon 25, defense response (<b>E</b>); and Regulon 1, pollen-specific (200 randomly chosen genes) (<b>F</b>). Pie charts are based on manual annotation. RNA profiles plotted using MetaOmGraph <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B124">124</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2229-8-99-5"/>
            </fig>
            <p>Regulon 20 provides an example of a set of genes involved in nuclear function, which is also associated with a specific developmental process (Figure <figr fid="F5">5B</figr>). Sixty-five out of 94 genes have some kind of nucleic acid-associated activity: transcription factors, splicing factors, chromatin remodeling, histone deacetylases, RNA helicases, DNA repair, RNA processing. However, five genes in this cluster have been implicated in the regulation of flower development: At3g12680, HUA1, is an RNA-binding protein which specifies stamen and carpel identities <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>; At5g04240 (ELF6, early flowering) acts as a repressor of the photoperiod pathway <abbrgrp><abbr bid="B77">77</abbr></abbrgrp>; At2g28290 (SYD) regulates floral homeotic gene expression <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>; At5g17690 (TFL2) controls flowering and floral organ identity by silencing nuclear genes <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>; and At4g32551 (LUG) is a negative regulator of the floral homeotic gene AGAMOUS <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>. No genes reported to be involved in other developmental processes are represented. Rather surprisingly, the expression pattern of this cluster is relatively low and uniform.</p>
         </sec>
         <sec>
            <st>
               <p>Defense responses</p>
            </st>
            <p>Six regulons contain primarily genes involved in resistance to disease or a pathogen; each is characterized by a set of genes that appears to have a specialized function.</p>
            <p>Regulon 35 (45 genes) includes genes involved mostly in a combination of signaling events and responses to pathogens (Figure <figr fid="F5">5C</figr>). Sixteen of the genes encode protein kinases, some experimentally linked to pathogen responses (e.g., CRK5); 10 other genes are involved in disease resistance, response and signaling (cellulase, expansin, six disease resistance proteins, calmodulin- and cyclic nucleotide-binding proteins). Three genes are involved in MAPKKK cascades: MPK3 (a MAP kinase), and MKK1 and MKK2 (MAP kinase kinases). Twenty-three of the encoded proteins have a predicted location in the endomembrane system. At3g56710 (SIB1) is a nuclear protein that modulates transcription in chloroplasts <abbrgrp><abbr bid="B81">81</abbr></abbrgrp> and might coordinate the response of the plastidic genome to the pathogen with the nuclear one. Expression is high in leaves, especially following perturbations by pathogens or during senescence.</p>
            <p>Glucosinolates provide a chemical defense against herbivores <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>. Most of the 20 genes in Regulon 69 (Figure <figr fid="F5">5D</figr>) may participate in glucosinolate biosynthesis in chloroplasts. Of the 16 genes annotated with biosynthetic functions, seven have demonstrated or putative involvement in glucosinolate biosynthesis <abbrgrp><abbr bid="B83">83</abbr></abbrgrp>, and six encode enzymes similar in sequence to those of leucine, homoserine, lysine or choline biosynthesis. Enzymes currently annotated in TAIR by sequence evidence as being involved in leucine biosynthesis might also be active in the glucosinolate pathway, since those pathways have analogous chemical reactions <abbrgrp><abbr bid="B84">84</abbr><abbr bid="B85">85</abbr></abbrgrp>. A flavin-containing monooxygenase, an antioxidant involved in glucosinolate production from phenylalanine in rapeseed <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>, is also present in the regulon.</p>
            <p>Regulon 25 has 70 genes, 19 of which are annotated as disease resistance proteins (Figure <figr fid="F5">5E</figr>). It also contains other genes involved in pathogen response, among them lectin and lectin kinases, genes related to apoptosis, 17 protein kinases, many of them receptors, and eight genes annotated as involved in signaling. Eighteen genes are predicted to encode integral membrane proteins. The spiky expression of this cluster, highest in leaves, could be considered symptomatic of genes responding to environmental stimuli.</p>
            <p>Other regulons predominantly devoted to stress responses include heat shock response (Regulon 37), stress-induced catabolism (Regulon 63), and synthesis of protective compounds derived from shikimate (Regulon 46).</p>
         </sec>
         <sec>
            <st>
               <p>Tissue-specific regulons</p>
            </st>
            <p>Only sixteen regulons are predominantly expressed in particular reproductive or vegetative structures (flowers &#8211; Regulons 21 and 36; flower/fruit &#8211; Regulons 12, 43, 44 and 56; pollen &#8211; Regulons 1, 8 and 18; leaf apex &#8211; Regulon 7; root &#8211; Regulons 27 and 59; phloem &#8211; Regulon 57; shoot meristem &#8211; Regulon 68).</p>
            <p>Genes expressed mainly in pollen are grouped in three clusters, Regulons 1, 8, and 18, each having a different expression profile among the samples containing pollen. Regulon 18 is almost exclusively expressed in pollen. Regulon 1, the biggest cluster, is composed of 1623 genes (Figure <figr fid="F5">5F</figr>). It is a very highly correlated cluster and also a very dense one, containing genes with the highest average number of neighbors (154) for each gene. Regulon 1 contains genes involved in the regulation of pollen development, spermatogenesis and pollen tube growth. Many genes in Regulon 8 have regulatory functions, and many in Regulon 18 are associated with lipid and carbohydrate metabolism and transport. Our clustering confirms the separation of the pollen transcriptome into early and late stages of pollen development, observed by Honys and Twell (2004) <abbrgrp><abbr bid="B87">87</abbr></abbrgrp>, whose experiment contributes most of the pollen samples in our dataset. Since the number of experiments using pollen tissue in our analysis was small, the temporal resolution of expression of pollen-specific genes may not be high.</p>
            <p>In addition to pollen having its own complement of unusually expressed genes, other clusters (Regulons 3, 29, 43, and 52) are highly upregulated in pollen but also in other tissues.</p>
         </sec>
         <sec>
            <st>
               <p>Coexpression of neighboring genes</p>
            </st>
            <p>We noticed that the genes that are neighbors on a chromosome are often coexpressed. This phenomenon has also been observed by others using different approaches <abbrgrp><abbr bid="B88">88</abbr><abbr bid="B89">89</abbr><abbr bid="B90">90</abbr></abbrgrp>. To quantify the extent of the coexpression, we calculated the number of groups of coexpressed neighbors. Coexpressed neighbors are defined as nuclear genes in the same regulon whose Locus IDs differ by at most 20. To eliminate the contribution of tandem gene duplications from this evaluation, arrays of tandem duplicates were removed.</p>
            <p>There are 539 groups of coexpressed neighboring genes, comprising 1161 genes in total. This number of groups is significantly larger than that obtained from data in which genes are randomly reassigned to regulons (the mean value from random data is 421; Wilcoxon test p-value &lt; 2.2 &#215; 10<sup>-16</sup>; Additional file <supplr sid="S1">1</supplr>). The groups of coexpressed adjacent genes range in size from two to six genes: 384 of the groups have only two genes and 54 have three genes (Additional file <supplr sid="S2">2</supplr>).</p>
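            <p>The counting of coexpressed neighbors and its comparison with randomized regulon assignments can be sketched as follows. This is a minimal illustration, not the code used in this study: the function names are hypothetical, Locus IDs are assumed to follow the AtNgNNNNN pattern, neighbors are assumed to be chained whenever consecutive Locus IDs within a regulon differ by at most 20, and the removal of tandem duplicates is not modeled.</p>

```python
import random
import re
from collections import defaultdict

# Nuclear locus IDs such as At1g01010: chromosome 1 to 5, then a 5-digit number.
LOCUS_RE = re.compile(r"^AT([1-5])G(\d{5})$", re.IGNORECASE)


def parse_locus(locus_id):
    """Return (chromosome, locus number) for a nuclear locus ID, else None."""
    m = LOCUS_RE.match(locus_id)
    if m is None:
        return None  # organelle-encoded (AtCg/AtMg) or malformed IDs are skipped
    return int(m.group(1)), int(m.group(2))


def neighbor_groups(assignment, max_gap=20):
    """Group same-regulon genes whose consecutive locus numbers differ by at most max_gap.

    assignment maps locus ID -> regulon label. Chaining consecutive genes
    is an assumption about how groups are formed.
    """
    buckets = defaultdict(list)
    for locus_id, regulon in assignment.items():
        parsed = parse_locus(locus_id)
        if parsed is not None:
            chromosome, number = parsed
            buckets[(regulon, chromosome)].append((number, locus_id))

    groups = []
    for members in buckets.values():
        members.sort()
        current = [members[0][1]]
        for (prev_num, _), (num, locus_id) in zip(members, members[1:]):
            if num - prev_num > max_gap:
                if len(current) > 1:
                    groups.append(current)
                current = [locus_id]
            else:
                current.append(locus_id)
        if len(current) > 1:
            groups.append(current)
    return groups


def shuffled_group_counts(assignment, n_shuffles=100, seed=0):
    """Count neighbor groups after randomly reassigning genes to regulons."""
    rng = random.Random(seed)
    genes = list(assignment)
    labels = [assignment[g] for g in genes]
    counts = []
    for _ in range(n_shuffles):
        rng.shuffle(labels)  # keeps regulon sizes, breaks gene-to-regulon links
        counts.append(len(neighbor_groups(dict(zip(genes, labels)))))
    return counts
```

            <p>Shuffling the labels rather than drawing them at random keeps the size of every regulon fixed, so that only the assignment of genes to regulons is randomized.</p>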
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Distribution of the numbers of groups of coexpressed neighboring genes in 100 randomized datasets</b>. The nuclear-encoded genes were randomly reassigned to the regulons. Groups of coexpressed neighbors were counted in the same way as in the real dataset. The mean number of coexpressed groups in 100 randomized datasets was 421.4, compared to 539 in the real dataset.</p>
               </text>
               <file name="1471-2229-8-99-S1.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Distribution of the sizes of groups of coexpressed neighboring genes in experimental and randomized data</p>
               </text>
               <file name="1471-2229-8-99-S2.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>We were able to take advantage of our assignments of genes to functionally coherent regulons to evaluate whether the phenomenon of coexpressed neighbors is associated with specific regulons or with regulons of particular functions or characteristics. The 539 groups of coexpressed neighbors are not members of regulons with any obvious common characteristic or function, nor are they associated with any of the three super-clusters of regulons in the network (photosynthetic functions, information processing, and stress responses). Also, the coexpressed neighboring genes are not enriched in any GO term. Thus, coexpressed neighbors do not seem to be associated with a particular function, either with respect to gene annotation or participation in particular regulons.</p>
            <p>Interestingly, the distribution of groups of coexpressed neighbors on the chromosomes is not uniform. Domains of coexpressed neighbors are absent from a large part of the long arm of chromosome 4, adjacent to the pericentromeric region, and very rare in the analogous area of chromosome 2 (Figure <figr fid="F6">6</figr>).</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Coexpressed neighboring genes are absent from a region of the long arm of chromosome 4</p>
               </caption>
               <text>
                  <p><b>Coexpressed neighboring genes are absent from a region of the long arm of chromosome 4</b>. Distribution of the coexpressed neighboring genes (marked in yellow) on the five Arabidopsis chromosomes (visualized in Chromosome Map Tool, <abbrgrp><abbr bid="B126">126</abbr><abbr bid="B115">115</abbr></abbrgrp>). Domains of coexpressed neighbors are absent from a large part of the long arm of chromosome 4, adjacent to the pericentromeric region, and very rare in the analogous area of chromosome 2.</p>
               </text>
               <graphic file="1471-2229-8-99-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Genes of unknown function</p>
            </st>
            <p>Genes of unknown function that co-cluster with genes of known function might be hypothesized to share that function. Recently, Horan et al. <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> used the clustering of public expression data to assign function to genes coding for proteins of unknown function (PUF), defined as genes with the GO term GO:0003674 (unknown molecular function). In this way, the authors proposed functions for 277&#8211;1541 PUFs, depending on the significance threshold. Of the 277 PUFs assigned to clusters with the highest confidence in Horan et al., 216 are present in our 998 regulons. Of those 216 PUFs, the largest number belong to Regulon 2 (photosynthesis, 94 PUFs); 19 are present in Regulon 3 (protein synthesis) and 12 in Regulon 4 (mitosis).</p>
            <p>In our analysis, a total of 2896 PUFs have been assigned to the 148 larger regulons with at least 10 members. The 69 largest regulons contain 2584 PUFs, and the 10 largest regulons alone contain 1768 PUFs. PUFs are approximately proportionally distributed and constitute about 30% of the genes in regulons. Thus, the functions most often assigned to PUFs are those of Regulon 1 (pollen-specific, 457 PUFs), Regulon 2 (photosynthesis, 327 PUFs) and Regulon 3 (protein synthesis, 213 PUFs). The exceptions are Regulons 49 (plastidic genes), 53 (fatty acid biosynthesis), and 69 (leucine/glucosinolate biosynthesis), from which PUFs are entirely absent.</p>
         </sec>
         <sec>
            <st>
               <p>Genes of extremes</p>
            </st>
            <p>We identified the genes with highly varied expression, those with little variation in expression, and those with the highest or lowest mean levels of expression, from the expression profiles of the 22,746 probes in the Arabidopsis ATH1 chip using the same 963-chip dataset. To evaluate whether the genes with extremes in expression patterns have any particular characteristics, the 100 genes with the most varied expression, the steadiest expression, and the highest and lowest expression were assigned to functional classes based on TAIR and GO annotations and manual curation (Figure <figr fid="F7">7</figr> and Additional file <supplr sid="S3">3</supplr>).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Functional assignments and expression profiles of genes with the most and the least variable expression across multiple conditions</p>
               </caption>
               <text>
                  <p><b>Functional assignments and expression profiles of genes with the most and the least variable expression across multiple conditions</b>. <b>(A) </b>100 genes with the most variable expression (highest standard deviation of logE). <b>(B) </b>100 genes with the steadiest expression (lowest standard deviation of logE). The scale along the Y axis (expression values) is the same for both plots to facilitate comparison of the expression profiles between them. The inset shows a version of plot B with an expanded scale of expression values.</p>
               </text>
               <graphic file="1471-2229-8-99-7"/>
            </fig>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>Functional assignments and expression profiles of the 100 genes with (A) the highest expression (maximum mean), and (B) the lowest expression (minimum mean).</p>
               </text>
               <file name="1471-2229-8-99-S3.tiff">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Genes with greatly shifting expression patterns across a wide variety of conditions might be considered candidates for responses of the plant to internal and/or external signals. We defined the genes with the most dramatically shifting expression profiles as the 100 genes with the highest standard deviation of logged expression value. Indeed, 39 of these 100 genes had annotations suggesting their involvement in signaling (reaction to stimuli, p-value 0.0765; response to oxidative stress, pathogens or hormones) (Figure <figr fid="F7">7A</figr> and Additional file <supplr sid="S4">4</supplr>). Only a single metabolic function is included: 12 genes had a function related to lipid metabolism, lipid transport, or lipid degradation, possibly reflecting fluctuating requirements for energy and membrane synthesis. The endomembrane system is highly overrepresented (p-value 9.2 &#215; 10<sup>-14</sup>).</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p>Genes with the most changing expression.</p>
               </text>
               <file name="1471-2229-8-99-S4.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Genes with the steadiest expression were defined as the 100 genes with the lowest standard deviation of logged expression value. Twenty-six of these genes are relatively highly expressed, having a mean level of expression greater than 100. The group of most evenly expressed genes includes a conglomerate of metabolic, regulatory, and transport functions (Figure <figr fid="F7">7B</figr> and Additional file <supplr sid="S5">5</supplr>). There is a high proportion of "unknown" genes in this group (21%); possibly the steady level of expression of these genes would make it more difficult to ascertain their function.</p>
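            <p>The selection of the most variable and the steadiest genes described above can be sketched as follows. This is a minimal illustration under stated assumptions: the function name and the input layout (a mapping from gene ID to a list of positive expression values) are hypothetical, and the base of the logarithm (base 2 here) and the use of the sample standard deviation are assumptions, since neither affects the ranking when all genes are measured on the same chips.</p>

```python
import math
import statistics


def rank_by_log_sd(expression, top_n=100):
    """Return (most_variable, steadiest) gene lists ranked by SD of log expression.

    expression maps gene ID -> list of positive expression values,
    one value per chip (hypothetical input layout).
    """
    log_sd = {
        gene: statistics.stdev([math.log2(v) for v in values])
        for gene, values in expression.items()
    }
    ordered = sorted(log_sd, key=log_sd.get)   # ascending SD
    steadiest = ordered[:top_n]                # lowest SD first
    most_variable = ordered[::-1][:top_n]      # highest SD first
    return most_variable, steadiest
```

            <p>Ranking on the logged values means that a gene is scored by its fold changes rather than by its absolute signal, so that highly expressed genes do not dominate the list of variable genes simply because their raw values are large.</p>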
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p>Genes with the most steady expression.</p>
               </text>
               <file name="1471-2229-8-99-S5.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>A related result was obtained with a different approach, focused on responses in stress-related microarray experiments, applied by Walther et al. <abbrgrp><abbr bid="B91">91</abbr></abbrgrp>. The authors found that genes annotated as responding to the various stimuli were differentially expressed in the highest number of experiments, while those with unknown or house-keeping functions had the smallest breadth of response.</p>
            <p>Genes with the highest expression (defined as genes with the highest mean of expression values across all samples) include photosynthesis-related genes (p-value 9.28 &#215; 10<sup>-26</sup>), and structural constituents of ribosomes (p-value 2.26 &#215; 10<sup>-09</sup>) and other genes for protein biosynthesis and modification. Together, these functions constitute 58% of annotations of genes expressed at the highest level (Additional file <supplr sid="S3">3</supplr> and Additional file <supplr sid="S6">6</supplr>).</p>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p>Genes with the highest expression.</p>
               </text>
               <file name="1471-2229-8-99-S6.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The genes expressed at the lowest levels (defined as the 100 genes with the lowest mean of expression values across all samples) are predominantly hypothetical genes, transposons and pseudogenes (Additional file <supplr sid="S3">3</supplr> and Additional file <supplr sid="S7">7</supplr>); these are classes of genes for which little or no expression might be expected. Twenty-three of the genes in the low-expression group have functional annotations, including nucleic acid-binding genes and disease resistance genes. Because the expression for genes in this group does not exceed 40 for any chip, a significant component of the signal may be an artifact (e.g., due to signal processing or normalization).</p>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p>Genes with the lowest expression.</p>
               </text>
               <file name="1471-2229-8-99-S7.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Negatively correlated pairs of genes</p>
            </st>
            <p>These analyses also yield information about which gene pairs are negatively correlated. Wei et al. <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> noted that within the subset of 1330 annotated metabolic genes, there are few negative correlations. Interestingly, this paucity of negative correlations also holds when non-metabolic genes (e.g., regulatory and unknown) are examined. In our dataset, negative correlations are far less abundant than positive ones; the most negative correlation in the dataset is -0.73. This value may underestimate the strength of the negative relationship, since the Pearson correlation coefficient measures the degree of linear relationship, while negatively correlated genes appear to have reciprocal relationships. Interestingly, in six out of the eight most negatively correlated pairs of genes, one or both genes are implicated in a regulatory function.</p>
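            <p>The point that the Pearson coefficient can understate a reciprocal relationship can be illustrated with a small numerical sketch. The two profiles below are hypothetical, and the form y = 1/x is assumed purely for illustration; the rank-based coefficient is included only for contrast and was not part of the analysis reported here.</p>

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def ranks(values):
    """Rank positions (1 = smallest); assumes no tied values."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


# Hypothetical profiles: gene_b is exactly the reciprocal of gene_a.
gene_a = [float(i) for i in range(1, 11)]
gene_b = [1.0 / a for a in gene_a]

linear_r = pearson(gene_a, gene_b)               # roughly -0.81
rank_r = pearson(ranks(gene_a), ranks(gene_b))   # -1 up to rounding
```

            <p>Although the two hypothetical profiles are perfectly inversely related, the Pearson coefficient is only about -0.81, whereas the coefficient computed on ranks reaches -1.</p>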
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The picture of Arabidopsis emerging from this study is that of a plant mainly occupied with gathering energy, reproduction and defense against a hostile environment. Genes involved in photosynthesis and photosynthesis-related metabolic processes are highly expressed, and form the second largest regulon. Several other regulons appear to mediate chloroplast development and aerobic respiration. Developmental programs associated with reproduction account for the function of ten clusters and include pollen, flower, fruit, root, and embryo. Maturing pollen-specific genes form the largest and the most highly correlated regulon, comprising 1623 genes. Response and signaling programs are diverse and abundant, reflecting the need for flexibility in executing the genetic program under changing conditions. Thirteen of the 69 regulons with at least 20 gene members appear to mediate plant responses to external or internal stimuli. Each of these response-related regulons contains a mixture of molecular functions: receptors, kinases, hormone signaling, and metabolic genes required for defense (for example, enzymes for degradation of a pathogen cell wall). Response-related genes are also among the most variable with respect to expression level.</p>
         <sec>
            <st>
               <p>Comparison with Graphical Gaussian Model</p>
            </st>
            <p>The graphical Gaussian model (GGM) network presented by Ma et al. <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> contains only links that signify direct dependencies between genes. Our purpose was somewhat different: to learn the organization of the plant cell transcriptome, we wanted to capture all genes whose expression responds similarly across conditions, regardless of whether the effect on them is direct. Similarity of expression between connected genes is precisely the meaning of a link in our coexpression network. Thus, with no requirement that associations be direct, our network is much larger, containing 13,456 genes connected by almost 1.5 million links. An exhaustive comparison between these two networks would be problematic, because only 4749 genes are shared between them, and because we analyze our network by partitioning it globally into densely interconnected subnetworks, while Ma and coworkers query the local neighborhoods of selected seed nodes.</p>
            <p>However, generally, we observe that genes from our genetic information-related regulons are largely missing from the GGM network (over 90% of genes from Regulons 3, 11, 20, 26, 32, 38, 40&#8211;42, 50, 60&#8211;61, 64 and 66 are absent). On the other hand, genes from metabolism-related and organelle-encoded regulons are very well represented in GGM (over 90% of genes from Regulons 21, 22, 23, 30, 34, 37, 44, 46, 49, 56, 59, 65, 70 are present).</p>
            <p>A network formed by genes for cellulose biosynthesis is one of the most characterized in Arabidopsis <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. Both in our network and in the GGM network, the cellulose biosynthesis genes were partitioned into those specific to primary and secondary cell wall biosynthesis. However, our regulon representing secondary cell wall (Regulon 22) is more comprehensive, containing 81 genes, compared to 41 in Persson et al. <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, and 64 in GGM; 34 of those 81 genes have an annotation consistent with cell wall biosynthesis, including laccases and microtubule-associated proteins linked with this process <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B92">92</abbr></abbrgrp>. The three primary cell wall biosynthesis genes (CESA1, CESA3 and CESA6) are part of the 12-gene regulon (Regulon 121) which also includes drought and cold-responsive genes. Drought and cell wall biosynthesis have been linked before, as the modulation of the rate of the cellulose biosynthesis provides a mechanism to cope with dehydration <abbrgrp><abbr bid="B93">93</abbr><abbr bid="B94">94</abbr></abbrgrp>. Likewise, the 11 genes designated as "proteasome complex" in GGM are part of a bigger module in our network: 9 out of these 11 genes are present in Regulon 45, along with 17 other genes encoding proteasome subunits. On the other hand, most of the genes that form the local subnetwork related to cytokinin-mediated signaling in GGM are not present in our network. Some of the subnetworks identified in GGM are further partitioned in our data into more specialized regulons. For example, 9 out of 20 chromatin-related genes belong to Regulon 4 (mitosis), while 3 belong to Regulon 84 (regulation of flower development). 
Similarly, flavonoid biosynthesis genes form Regulon 93 (lignin/lignan biosynthesis; 12 of the 14 genes have confirmed or putative functions in catalyzing reactions from the synthesis of shikimate through the generation of lignin/lignan monomers) and Regulon 146 (flavonoid/rhamnoflavanoid synthesis; 9 of the 10 genes have confirmed or putative functions in catalyzing reactions from chalcone synthase to rhamnose transferase).</p>
         </sec>
         <sec>
            <st>
               <p>Implications for metabolic pathways modelling</p>
            </st>
            <p>Our analysis reflects a clear dichotomy between the metabolic pathways of textbooks and the Arabidopsis transcriptional network. By identifying genes that are co-regulated with particular metabolic fluxes, and experimentally evaluating the effect of these genes on these fluxes, the fundamental mechanisms underlying regulation of metabolism can be better understood. One fundamental question is the extent of the integration of regulatory, metabolic and structural genes within a regulon. Genes for a number of metabolic pathways have been shown to be coexpressed (e.g., <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B51">51</abbr><abbr bid="B46">46</abbr></abbrgrp>), and pathway genes from the AraCyc metabolic pathway database also have coexpressed regulatory genes <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. In contrast, in a clustering of 3292 nuclear-encoded genes for plastid-localized proteins, Biehl et al. <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> identified regulons that, with the exception of photosynthesis and plastid protein biosynthesis, were composed of genes with diverse pathway associations. The global clustering described herein places these observations in the context of the near-whole Arabidopsis transcriptome. Although our analysis of correlations among the 22,746 probes on the ATH1 chip clearly reflects the co-occurrence of enzymes from metabolic pathways in regulons, the aggregation of single metabolic pathways into distinct regulons is surprisingly scarce. Exclusively (or nearly exclusively) metabolic regulons are: fatty acid synthesis (Regulon 53), aerobic respiration (Regulon 29), glucosinolate biosynthesis (Regulon 69), lignin/lignan biosynthesis (Regulon 93), flavonoid synthesis (Regulon 146), and protein biosynthesis (Regulon 3).</p>
            <p>One possible explanation of why a given metabolic pathway may not form a stand-alone regulon is that it is part of a biological program encompassing several concurrent and mutually dependent processes. Thus, the clusters of enzymes found to be coexpressed by Gachon et al. <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> in their analysis of secondary metabolite pathway genes are also identified in our analyses; however, most are contained within regulons that combine regulatory functions with structural genes and related metabolic steps. For example, shikimate biosynthesis genes are part of Regulon 46, which contains genes involved in defense responses, including biosynthesis of protective compounds. Similarly, genes from plastidic glycolysis and the Calvin cycle cluster with other genes active in the chloroplast during photosynthesis (Regulon 2).</p>
            <p>In a few cases, such as sucrose biosynthesis or glycolysis, metabolic pathways are dispersed across multiple regulons. A metabolic pathway might not form a discrete regulon because participating enzymes have multiple metabolic functions, e.g., enzymes of cytosolic glycolysis are utilized for respiration and for anaplerotic reactions <abbrgrp><abbr bid="B95">95</abbr></abbrgrp>. A second possibility is that enzymes from the pathway may be under translational or post-translational regulation. Finally, gene families may confound co-expression patterns, as multiple genes in a family may contribute under different conditions to a single enzyme.</p>
         </sec>
         <sec>
            <st>
               <p>Regulation of expression of organellar genomes</p>
            </st>
            <p>Our results agree with a body of experimental literature (reviewed in <abbrgrp><abbr bid="B96">96</abbr></abbrgrp>), which indicates that the expression of the 130 genes of the plastidic genome is not uniform and is apparently finely tuned by multiple levels of regulation. Genes from the plastidic genome fall into five regulons of varied functions: Regulon 2 (predominantly encoding proteins of photosynthesis and related metabolism); Regulon 176 (predominantly encoding proteins of PSI); Regulon 49 (mainly ribosomal proteins); and Regulons 283 and 656 (two small clusters of mixed function). In general, all genes from a given operon are in the same regulon, suggesting that transcriptional regulation of the plastid genome is a major determinant of transcript accumulation. However, in some cases, genes from a single operon are dispersed across multiple regulons. Thus, this pattern of regulon organization is likely a reflection of two processes: accumulation of subsets of transcripts driven by distinct PEP/NEP promoter combinations, and modifications of transcript levels due to alternative RNA processing, mediated by nuclear processing factors such as PPR proteins.</p>
            <p>Individual regulons may contain many levels of regulation, as exemplified by Regulon 2, which appears to participate in the building and function of the photosynthetic machinery and related metabolic processes. The two genomes that cooperate to meet this goal, nuclear, encoding the majority of the plastid-localized proteins, and plastidic, <abbrgrp><abbr bid="B97">97</abbr></abbrgrp> both are represented in Regulon 2. Coordination of these two genomes requires anterograde (nucleus to plastid) signaling mechanisms (e.g., regulation of transcription of nuclear-encoded plastid proteins, import of proteins into plastids, plastid genome transcription rate and specificity, photosynthetic complex assembly, or plastid development by nuclear-encoded factors) as well as retrograde (plastid to nucleus) signaling mechanisms (e.g., via redox state, chlorophyll synthesis intermediates, sugar or singlet oxygen signaling) <abbrgrp><abbr bid="B98">98</abbr><abbr bid="B99">99</abbr><abbr bid="B100">100</abbr><abbr bid="B101">101</abbr></abbrgrp>. Since genes experimentally identified as participating in many aspects of anterograde and retrograde signaling are coexpressed in Regulon 2, we infer that a complex network may modulate photosynthesis-related activities in this regulon.</p>
            <p>Interestingly, of the 998 regulons, none contains genes from both organellar genomes (mitochondrion and plastid). This suggests that coordination between these organelles may not be achieved by transcriptional co-regulation. Furthermore, the fact that all plastid-encoded genes, except those mobilized in Regulon 2, are grouped in regulons by themselves (not with any nuclear gene) underscores the independence this organelle has maintained hundreds of millions of years after endosymbiosis.</p>
         </sec>
         <sec>
            <st>
               <p>Relationships among the regulons</p>
            </st>
            <p>The grouping of regulons with information-related, stress response, and plastid-related functions in dense regions of the transcriptional network indicates that genes in these dense regions may participate in multiple related genetic programs that under some subsets of conditions are coexpressed. Other studies have shown that stress response genes are not usually specific and react to several types of stress <abbrgrp><abbr bid="B102">102</abbr></abbrgrp>. In contrast, some regulons are relatively isolated from the network, most notably, Regulon 53, fatty acid synthesis, and Regulon 34, containing mitochondrial genes. Isolated regulons might contain genes that are committed to a discrete process that is carried out in relative independence from other cellular functions.</p>
         </sec>
         <sec>
            <st>
               <p>Genes not included in the network</p>
            </st>
            <p>During data preprocessing, we filtered out genes with low expression and genes with no similarity to other genes (Fig. <figr fid="F1">1</figr>). The genes that code for proteins located in the mitochondrion or endomembrane system, or involved in apoptosis or regulation of transcription, were among those with the lowest expression. Many genes for membrane-located proteins, including transporters, had expression profiles unlike any other in the genome. Neither the low-expressed nor the unique-profile class of genes was used for network construction. Although many of the low-expressed genes are hypothetical and might not be transcribed (76 out of the 100 genes with lowest expression in our data were not expressed in the experimental evaluation of Arabidopsis expression activity by whole genome tiling array by Yamada et al. <abbrgrp><abbr bid="B103">103</abbr></abbrgrp>), some of the hypothetical genes might be active, but only in very specific cell types or temporal conditions, so that their expression or ESTs have never been detected. Several well-studied genes, for example the regulators of flowering CONSTANS and FRI, or the myb-type transcription factor CPC, responsible for differentiation of the epidermal cells, are expressed at a very low level and were filtered out of our analysis. Furthermore, genes with flat expression profiles might also have little representation in the network; only 14 out of the 100 genes with the most steady expression profiles were incorporated into a cluster, while 68 were filtered out due to low similarity to any other gene. The profiles of regulatory genes may be flat because the activity of their products is often modulated by translational or post-translational modification: addition of a phosphate group, induced conformational change, or binding of a cofactor or other subunit. 
Because regulatory genes are among the most comprehensively studied and often helped to guide the designation of cluster function, their absence in clusters hinders the identification of developmental programs in our clustered data.</p>
         </sec>
         <sec>
            <st>
               <p>Coexpression of neighboring genes</p>
            </st>
            <p>The coexpression of neighboring genes identified in this analysis is higher than expected by chance and cannot be explained by coexpression of tandem duplicates. This result is consistent with several reports, each applying different definitions of coexpressed neighbors and using various methodologies to identify such groups <abbrgrp><abbr bid="B88">88</abbr><abbr bid="B89">89</abbr><abbr bid="B90">90</abbr></abbrgrp>. The small sizes of domains of coexpressed neighbors in our data also agree with reports that coexpression is a short distance effect in Arabidopsis. Several hypotheses have been raised to account for observed local coexpression, such as that the coexpressed neighbors reside in chromatin domains with open conformation <abbrgrp><abbr bid="B104">104</abbr><abbr bid="B105">105</abbr><abbr bid="B90">90</abbr></abbrgrp>, have shared regulatory cis-elements <abbrgrp><abbr bid="B106">106</abbr></abbrgrp>, or are organized into eukaryotic operons <abbrgrp><abbr bid="B107">107</abbr><abbr bid="B108">108</abbr><abbr bid="B109">109</abbr></abbrgrp>. Our analysis indicated an unusual distribution of coexpressed neighbors on chromosomes along with an absence of overrepresentation of any biological function in co-expressed neighbors; both these observations are in accordance with the hypothesis that chromatin structure is the key player in the local coexpression effect.</p>
         </sec>
         <sec>
            <st>
               <p>Negative correlations</p>
            </st>
            <p>Pairs of negatively correlated genes might merely reflect disjoint sets of conditions in which the two genes are active; alternatively, they might indicate a regulatory relationship. The fact that most of the stronger negative correlations observed in this analysis include regulatory proteins is consistent with a possible biological importance of negative correlations. In agreement with this interpretation, our analysis identifies negative regulations that have already been established experimentally. For example, At2g23430 (ICK1), a cyclin-dependent kinase inhibitor protein, functions as a negative regulator of cell division and interacts with CYCD3;1 <abbrgrp><abbr bid="B110">110</abbr></abbrgrp>; in our analysis, ICK1 is negatively correlated with CYCD3;1 (Pearson's R = -0.43). ICK1 also negatively correlates with the cyclin-dependent protein kinases CYC2b and CYCA2-similar, cell division control protein CDKB2;1 and other mitosis-related genes (Pearson's R = -0.56 to -0.52). In a second example, At1g75950, SKP1, a negative regulator of DNA recombination <abbrgrp><abbr bid="B111">111</abbr></abbrgrp>, is most negatively correlated with DNA polymerase and tubulin-related genes (R = -0.5 to -0.4).</p>
            <p>Whether a given negative <it>correlation </it>translates to a negative <it>regulation </it>must be experimentally evaluated. An example of a pair of genes that are highly negatively correlated (R = -0.73) is At1g06650 (2-oxoglutarate-dependent dioxygenase similar to tomato ethylene synthesis regulatory protein E8) and At5g23430 (transducin family protein with nucleotide binding WD-40 repeat). Another example is At3g11910 (DNA binding ubiquitin-specific protease) and At4g12800 (photosystem I reaction center subunit XI) (R = -0.73). A testable hypothesis arising from the latter correlation is that this ubiquitin-specific protease may play a role in the turnover of the photosystem I reaction center protein associated with photodamage.</p>
         </sec>
         <sec>
            <st>
               <p>Availability of the data</p>
            </st>
            <p>The regulon data have been incorporated into MetaOmGraph, software for visualization and analysis of large datasets within the MetNet Platform <abbrgrp><abbr bid="B112">112</abbr></abbrgrp>. Regulons can be downloaded as gene sets. The user can view expression profiles of the regulons across all experiments or in a subset of experiments, examine the gene contents of the regulons, and calculate the absolute and signed versions of the Pearson and Spearman correlations between the genes.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>This analysis yields insight into the organization of the plant transcriptome into concerted processes. The network provides an initial glimpse of the interactions among regulons in a broad biological context. Moreover, this study has the potential to assign function to un-annotated and partially annotated genes; nearly 3000 genes of "unknown" molecular function have been assigned to a regulon. As such, it provides new, experimentally testable hypotheses about the functions of genes. Further analysis of functionally coherent regulons will enable refinement of the existing models of metabolic regulation, developmental and response programs, and intergenomic communication.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Transcriptome data</p>
            </st>
            <p>Arabidopsis expression data for 963 Affymetrix ATH1 chips with 22,746 probes were obtained from the Nottingham Arabidopsis Stock Centre (NASC) microarray database <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B18">18</abbr></abbrgrp> and PLEXdb <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B56">56</abbr></abbrgrp>. The data represent 70 experiments, including development, stress, mutant, and other studies. All chips obtained from the NASC database were already individually scaled with the MAS 5.0 algorithm (Affymetrix) to a common mean of 100, excluding the top and bottom 2% of signal intensities. The data in PLEXdb are MAS5-normalized with the mean expression of chips set to 500. To make the data from these two databases comparable, we scaled the data from the PLEXdb database to set the chip mean to 100. The reproducibility of experiments was assessed by visual inspection of scatter plots and by applying a threshold of R<sup>2 </sup>> 0.86. Chips with poor biological replicates were discarded. The remaining biological replicates were averaged to yield 424 samples. The data were subsequently normalized to the same range by a median absolute deviation (MAD)-based scale normalization method described by Yang et al. <abbrgrp><abbr bid="B113">113</abbr></abbrgrp>. MAD-based scale normalization was chosen instead of quantile normalization methods in order to minimize interference with the data. Expression values <it>x</it><sub><it>ij </it></sub>on microarray chip <it>j </it>were multiplied by the factor <inline-formula><m:math name="1471-2229-8-99-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mi>C</m:mi><m:mrow><m:mi>M</m:mi><m:mi>A</m:mi><m:msub><m:mi>D</m:mi><m:mi>j</m:mi></m:msub></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqWGdbWqaeaacqWGnbqtcqWGbbqqcqWGebardaWgaaqaaiabdQgaQbqabaaaaaaa@323F@</m:annotation></m:semantics></m:math></inline-formula>, where <it>MAD </it>is defined by</p>
            <p>
               <display-formula><it>MAD</it><sub><it>j </it></sub>= median<sub><it>i</it></sub>{|<it>x</it><sub><it>ij</it></sub>-median<sub><it>i</it></sub>(<it>x</it><sub><it>ij</it></sub>)|}</display-formula>
            </p>
            <p>and the constant <it>C</it> is the arithmetic mean of the <it>MAD</it><sub><it>j</it></sub> values over all <it>n</it> chips:</p>
            <p>
               <display-formula>
                  <m:math name="1471-2229-8-99-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>C</m:mi>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>j</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>n</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:mi>M</m:mi>
                                       <m:mi>A</m:mi>
                                       <m:msub>
                                          <m:mi>D</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mi>n</m:mi>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4qamKaeyypa0tcfa4aaSaaaeaadaaeWbqaaiabd2eanjabdgeabjabdseaenaaBaaabaGaemOAaOgabeaaaeaacqWGQbGAcqGH9aqpcqaIXaqmaeaacqWGUbGBaiabggHiLdaabaGaemOBa4gaaaaa@3BDA@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
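            <p>The scale normalization defined above can be illustrated with a minimal Python sketch. This is not the code used in this work (the computations were performed in R); the function names and the toy values are hypothetical and serve only to show that each chip is multiplied by C/MAD_j, with C the mean of the per-chip MAD values.</p>

```python
from statistics import median


def mad(values):
    """Median absolute deviation of one chip's expression values."""
    center = median(values)
    return median(abs(v - center) for v in values)


def mad_scale_normalize(chips):
    """Multiply every value on chip j by C / MAD_j.

    `chips` is a list of chips, each a list of expression values x_ij.
    C is the arithmetic mean of the per-chip MAD values.
    """
    mads = [mad(chip) for chip in chips]
    c = sum(mads) / len(mads)
    return [[x * c / m for x in chip] for chip, m in zip(chips, mads)]


# Toy data (hypothetical): two chips whose values differ only in scale.
chips = [[1.0, 2.0, 3.0, 4.0, 5.0], [2.0, 4.0, 6.0, 8.0, 10.0]]
normalized = mad_scale_normalize(chips)
normalized_mads = [mad(chip) for chip in normalized]
```

            <p>In this toy case the two chips have MAD values of 1 and 2, so C is 1.5, and after scaling both chips share the same MAD.</p>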
            <p>Because our dataset is rather large, consisting of 424 datapoints, and the distribution is generally assumed to be approximately normal when the number of datapoints exceeds 100, the data has not been log-transformed.</p>
            <p>The normalized data, together with its metadata, is available online <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The 70 experiments in the dataset are also listed in Additional file <supplr sid="S8">8</supplr>. ATH1 probe set-to-Locus ID mapping was obtained from TAIR <abbrgrp><abbr bid="B114">114</abbr><abbr bid="B115">115</abbr></abbrgrp>.</p>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p>Experimental metadata.</p>
               </text>
               <file name="1471-2229-8-99-S8.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>All computations in this work, except for graph clustering, were performed in R software <abbrgrp><abbr bid="B116">116</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Generating the network of coexpressed genes</p>
            </st>
            <p>Because low expression values are not reliable and might introduce noise in the dataset, 4551 of the 22,746 gene probes on the Arabidopsis ATH1 chip, whose expression was &lt; 100 (the mean of the gene expression values on the chip) in all samples, were filtered out. A Pearson correlation matrix was calculated for the remaining 18,195 genes. Of these, only the 14,564 genes that were correlated above the Pearson correlation threshold of 0.7 with at least one other gene were retained for further analysis. The matrix was transformed into a binary matrix by replacing the values of all correlations > 0.7 by 1, and assigning 0 to the others. The resulting binary matrix served as the adjacency matrix of the coexpression network, in which genes form the nodes and two genes are connected by an edge if they are correlated above 0.7.</p>
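            <p>The filtering and thresholding steps described above can be sketched in Python as follows. This is an illustrative reimplementation under stated assumptions, not the R code used in this work: the function names, the gene labels and the expression values are hypothetical, and only the two cut-offs (expression of at least 100 in at least one sample, Pearson correlation above 0.7) are taken from the text.</p>

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equally long expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx * vy == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / sqrt(vx * vy)


def coexpression_edges(profiles, min_expr=100.0, r_cut=0.7):
    """Return the edge set of the binary coexpression network."""
    # Step 1: drop probes whose expression stays below min_expr in all samples.
    kept = {g: p for g, p in profiles.items() if max(p) >= min_expr}
    # Step 2: join two genes by an edge if their correlation exceeds r_cut.
    genes = sorted(kept)
    edges = set()
    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            if pearson(kept[g], kept[h]) > r_cut:
                edges.add((g, h))
    return edges


# Toy profiles (hypothetical values) across four samples.
profiles = {
    "geneA": [120.0, 240.0, 360.0, 480.0],
    "geneB": [100.0, 210.0, 290.0, 410.0],
    "geneC": [400.0, 300.0, 200.0, 100.0],  # anti-correlated with geneA
    "geneD": [10.0, 20.0, 30.0, 40.0],      # never reaches 100: filtered out
}
edges = coexpression_edges(profiles)
```

            <p>Genes that end up without any edge, such as geneC here, correspond to the genes that were not retained for the network because they were not correlated above the threshold with any other gene.</p>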
            <p>This Pearson correlation criterion of 0.7 was developed on the basis of our previous coexpression analysis of three metabolic pathways (fatty acid biosynthesis, leucine catabolism and starch metabolism), which was performed on the same expression dataset <abbrgrp><abbr bid="B117">117</abbr></abbrgrp>. In this previous work we observed the emergence of specific (i.e. within-pathway) links from the background noise (inter-pathway links) as the Pearson correlation threshold was increased from 0.5 to 0.7.</p>
         </sec>
         <sec>
            <st>
               <p>Clustering the coexpression network</p>
            </st>
            <p>Connected components were identified in the network (<it>connectedComp </it>function in R software), yielding one giant connected component with 14,368 nodes and 77 smaller components, ranging from 2 to 8 nodes. Because we aimed to find strongly inter-connected clusters, genes connected only by a single edge were removed from the biggest connected component, and the resulting network, composed of 13,456 genes and nearly 1.5 million edges, was clustered by the Markov chain graph clustering algorithm (MCL) <abbrgrp><abbr bid="B114">114</abbr><abbr bid="B58">58</abbr></abbrgrp> with the <it>inflation </it>parameter set at 1.8. A range of inflation parameters was assessed based on the degree of integrity of three metabolic pathways (fatty acid biosynthesis, leucine catabolism and starch metabolism) in the corresponding clustering results. The inflation value of 1.8 was chosen because it resulted in the best correspondence between the clusters and the sets of genes from the same pathway. Nine hundred and ninety-eight clusters were produced (see Additional file <supplr sid="S9">9</supplr> online for the complete assignment of genes to the regulons); these were analyzed together with the smaller connected components from the previous step. The network was visualized (see Figure <figr fid="F3">3</figr>) using the GraphExplore tool <abbrgrp><abbr bid="B118">118</abbr></abbrgrp> in a simplified representation, in which the nodes represent clusters (regulons) and an edge joins two clusters if there exists an edge between any pair of genes belonging to these two clusters in the underlying network. This criterion was chosen because of the large differences in the number of between-cluster links.</p>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p>Assignment of genes to 998 regulons.</p>
               </text>
               <file name="1471-2229-8-99-S9.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
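            <p>The graph operations surrounding the clustering step can be sketched in Python as follows. The sketch covers only the identification of connected components, the removal of genes attached by a single edge, and the collapsed cluster-level graph used for visualization; the clustering itself is not reimplemented, and the cluster assignment is passed in as given. All function names and the toy graph are hypothetical, and the single-pass pruning is an assumption, since the text does not state whether the removal was repeated.</p>

```python
from collections import defaultdict


def build_adjacency(edges):
    """Undirected adjacency sets from a list of gene pairs."""
    adj = defaultdict(set)
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    return dict(adj)


def connected_components(adj):
    """Connected components, largest first."""
    seen, components = set(), []
    for start in adj:
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            node = stack.pop()
            if node in comp:
                continue
            comp.add(node)
            stack.extend(adj[node] - comp)
        seen.update(comp)
        components.append(comp)
    return sorted(components, key=len, reverse=True)


def prune_single_edge_nodes(adj, component):
    """Keep nodes with more than one edge (one pass only: an assumption)."""
    return {n for n in component if len(adj[n].intersection(component)) > 1}


def cluster_graph(edges, cluster_of):
    """Join two clusters if any pair of their genes shares an edge."""
    links = set()
    for a, b in edges:
        if a in cluster_of and b in cluster_of and cluster_of[a] != cluster_of[b]:
            links.add(tuple(sorted((cluster_of[a], cluster_of[b]))))
    return links


# Toy network (hypothetical): a triangle with one pendant gene, plus a pair.
edges = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d"), ("e", "f")]
adj = build_adjacency(edges)
components = connected_components(adj)
core = prune_single_edge_nodes(adj, components[0])
# Hypothetical cluster assignment, standing in for the clustering output.
links = cluster_graph(edges, {"a": 1, "b": 1, "c": 2, "d": 2})
```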
         </sec>
         <sec>
            <st>
               <p>Clustering significance</p>
            </st>
            <p>The significance of our clustering results was assessed by comparing the overrepresentation of GO terms in the 148 regulons with 10 or more genes, identified from the experimental microarray data, to the overrepresentation of GO terms in 100 sets of 148 randomly obtained clusters. Each of these random sets was obtained by permuting the gene IDs so that the permuted cluster sizes were the same as the real ones, but the assignment of genes to the clusters changed. Each clustering <it>i </it>was assigned a score <it>S</it><sub><it>i</it></sub>. To compute this score, the best (lowest) p-value <it>p</it><sub><it>min </it></sub>for overrepresentation of any GO term was recorded for each cluster and averaged over all clusters.</p>
            <p>
               <display-formula>
                  <m:math name="1471-2229-8-99-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>S</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>j</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>n</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>p</m:mi>
                                          <m:mrow>
                                             <m:mi>min</m:mi>
                                             <m:mo>&#8289;</m:mo>
                                             <m:mi>j</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mi>n</m:mi>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4uam1aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpjuaGdaWcaaqaamaaqahabaGaemiCaa3aaSbaaeaacyGGTbqBcqGGPbqAcqGGUbGBcqWGQbGAaeqaaaqaaiabdQgaQjabg2da9iabigdaXaqaaiabd6gaUbGaeyyeIuoaaeaacqWGUbGBaaaaaa@3FD7@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>n </it>denotes the number of clusters (n = 148).</p>
            <p>The distributions of <it>S</it> values for GO Molecular Function, Biological Process, and Cellular Component for randomly assigned groups were compared to the respective values for the real clustering. In each case, the real value scored significantly better than any of the random ones (Wilcoxon test p-value &lt; 2.2 &#215; 10<sup>-16</sup>).</p>
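            <p>The score and its size-preserving permutation control can be sketched in Python as follows. This is an illustration under stated assumptions rather than the procedure used in this work: the overrepresentation test is taken here to be a one-sided hypergeometric test with the clustered genes as background, which the text does not specify, and all function names, gene labels and GO annotations are hypothetical.</p>

```python
import random
from math import comb


def hypergeom_pvalue(k, cluster_size, annotated, total):
    """Probability of k or more annotated genes in a cluster (upper tail)."""
    upper = min(cluster_size, annotated)
    tail = sum(
        comb(annotated, x) * comb(total - annotated, cluster_size - x)
        for x in range(k, upper + 1)
    )
    return tail / comb(total, cluster_size)


def clustering_score(clusters, go_terms):
    """S: best p-value of any GO term per cluster, averaged over clusters."""
    all_genes = [g for c in clusters for g in c]
    term_genes = {}
    for g in all_genes:
        for t in go_terms.get(g, ()):
            term_genes.setdefault(t, set()).add(g)
    best = []
    for cluster in clusters:
        members = set(cluster)
        best.append(min(
            hypergeom_pvalue(len(members.intersection(genes)), len(members),
                             len(genes), len(all_genes))
            for genes in term_genes.values()
        ))
    return sum(best) / len(best)


def permuted_clusters(clusters, rng):
    """Shuffle gene labels while preserving every cluster size."""
    genes = [g for c in clusters for g in c]
    rng.shuffle(genes)
    out, start = [], 0
    for c in clusters:
        out.append(genes[start:start + len(c)])
        start += len(c)
    return out


# Toy data (hypothetical): two clusters, each matching one GO term exactly.
clusters = [["g1", "g2", "g3"], ["g4", "g5", "g6"]]
go_terms = {"g1": {"photosynthesis"}, "g2": {"photosynthesis"},
            "g3": {"photosynthesis"}, "g4": {"ribosome"},
            "g5": {"ribosome"}, "g6": {"ribosome"}}
rng = random.Random(0)
real_score = clustering_score(clusters, go_terms)
random_scores = [clustering_score(permuted_clusters(clusters, rng), go_terms)
                 for _ in range(100)]
```

            <p>A lower score indicates stronger overrepresentation, so a real clustering is considered better than a random one when its score is smaller.</p>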
            <p>To compare the overrepresentation of GO terms in clusters obtained by MCL clustering with that in clusters produced by the k-means algorithm, the <it>kmeans </it>function in the <it>stats </it>package in R was used on the original expression dataset, with the same number of clusters (998) as a parameter. Only the 471 k-means clusters with at least 10 members were used in the comparison. The adjusted Rand indices (<it>classAgreement </it>function in <it>e1071 </it>package in R <abbrgrp><abbr bid="B119">119</abbr></abbrgrp>) were calculated to quantify agreement between gene assignments to regulons by the MCL and k-means clustering algorithms.</p>
            <p>The Z-scores for mutual information between clusterings and GO terms were calculated according to Steuer et al. <abbrgrp><abbr bid="B120">120</abbr></abbrgrp>:</p>
            <p>
               <display-formula>
                  <m:math name="1471-2229-8-99-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>S</m:mi>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>M</m:mi>
                                 <m:mi>I</m:mi>
                                 <m:msub>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>C</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>A</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>r</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mi>l</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mi>m</m:mi>
                                 <m:mi>e</m:mi>
                                 <m:mi>a</m:mi>
                                 <m:mi>n</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>M</m:mi>
                                 <m:mi>I</m:mi>
                                 <m:msub>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>C</m:mi>
                                       <m:mo>,</m:mo>
                                       <m:mi>A</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>r</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mi>n</m:mi>
                                       <m:mi>d</m:mi>
                                       <m:mi>o</m:mi>
                                       <m:mi>m</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#963;</m:mi>
                                    <m:mrow>
                                       <m:mi>r</m:mi>
                                       <m:mi>a</m:mi>
                                       <m:mi>n</m:mi>
                                       <m:mi>d</m:mi>
                                       <m:mi>o</m:mi>
                                       <m:mi>m</m:mi>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where MI(C, A) denotes the mutual information between the clustering and the GO term attributes, and &#963;<sub><it>random</it></sub> denotes the standard deviation of MI(C, A) in the randomized data. MI(C, A) was calculated from the contingency table that contained the counts of 333 GO terms for genes in the 148 larger clusters with at least 10 genes (305 GO terms and 471 clusters, respectively, for k-means clustering). GO terms included all terms associated with any gene in the 148 (471) larger clusters, after removing rare terms (associated with fewer than 10 genes in the clustering) and one of each pair of redundant GO terms (terms that differ in the characterization of fewer than 10 genes). Random data were obtained as before, by randomizing the assignment of genes to clusters while preserving the sizes of the clusters.</p>
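            <p>As an illustration only, the z-score defined above can be sketched in Python. The function names, the dictionary inputs, the use of base-2 logarithms and the sample standard deviation are assumptions made for this sketch and are not taken from the analysis itself.</p>

```python
import math
import random
from collections import Counter


def contingency(gene_cluster, gene_terms):
    """Count genes for every (cluster, GO term) pair."""
    table = Counter()
    for gene, cluster in gene_cluster.items():
        for term in gene_terms.get(gene, ()):
            table[(cluster, term)] += 1
    return table


def mutual_information(table):
    """Mutual information, in bits, of a contingency table of counts."""
    total = sum(table.values())
    rows, cols = Counter(), Counter()
    for (cluster, term), k in table.items():
        rows[cluster] += k
        cols[term] += k
    # Sum of p(c,t) * log2( p(c,t) / (p(c) * p(t)) ) over non-empty cells.
    return sum(
        (k / total) * math.log2(k * total / (rows[cluster] * cols[term]))
        for (cluster, term), k in table.items()
    )


def mi_z_score(gene_cluster, gene_terms, n_random=100, seed=0):
    """Z-score of the real MI against MI values from shuffled clusterings."""
    rng = random.Random(seed)
    real = mutual_information(contingency(gene_cluster, gene_terms))
    genes = sorted(gene_cluster)
    labels = [gene_cluster[g] for g in genes]
    random_mi = []
    for _ in range(n_random):
        # Permuting the labels reassigns genes but keeps every cluster size.
        rng.shuffle(labels)
        shuffled = dict(zip(genes, labels))
        random_mi.append(mutual_information(contingency(shuffled, gene_terms)))
    mean = sum(random_mi) / n_random
    # Sample standard deviation (an assumption of this sketch).
    sd = math.sqrt(sum((x - mean) ** 2 for x in random_mi) / (n_random - 1))
    if sd == 0:
        raise ValueError("randomized MI values have zero variance")
    return (real - mean) / sd
```

            <p>In this sketch, <it>gene_cluster</it> maps each gene to its cluster and <it>gene_terms</it> maps each gene to its list of GO terms; filtering of rare and redundant terms is assumed to have been done beforehand.</p>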
            <p>The mapping of Arabidopsis genes to GO terms was obtained from TAIR <abbrgrp><abbr bid="B121">121</abbr></abbrgrp>. A modified version of the <it>GOHyperGAll</it> function, run in R with <it>Bioconductor</it>, was used to obtain batch results for the overrepresentation of GO terms.</p>
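            <p>The overrepresentation analysis itself was run in R. Purely as an illustration, and assuming that the test is an upper-tail hypergeometric test (an assumption of this sketch, not a statement about the modified function), the p-value for one GO term in one cluster could be computed as follows. All names are hypothetical.</p>

```python
from math import comb


def hypergeom_upper_tail(k, n, K, N):
    """P(X >= k) when n genes are drawn from N genes, K of which carry the term.

    k: annotated genes observed in the cluster
    n: cluster size
    K: annotated genes in the whole gene set
    N: size of the whole gene set
    """
    upper = min(n, K)
    # math.comb returns 0 for impossible draws, so no extra bounds check is needed.
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / comb(N, n)
```

            <p>A batch analysis would apply this to every cluster and term pair and would normally be followed by a multiple-testing correction, which is not shown here.</p>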
         </sec>
         <sec>
            <st>
               <p>Analysis of functional coherence</p>
            </st>
            <p>The functional coherence of the genes within each cluster of at least twenty genes was assessed by a combination of automatic analysis of overrepresentation of GO terms <abbrgrp><abbr bid="B122">122</abbr><abbr bid="B123">123</abbr></abbrgrp> and manual inspection of function and expression, using the published literature and tools (MetaOmGraph, AtGeneSearch) in the MetNet Platform <abbrgrp><abbr bid="B112">112</abbr><abbr bid="B124">124</abbr></abbrgrp>. The RNA profiles were visualized and plotted in MetaOmGraph.</p>
         </sec>
         <sec>
            <st>
               <p>Coexpression analysis of the neighboring genes</p>
            </st>
            <p>Coexpressed neighbors were defined as nuclear-encoded genes in the same regulon whose Locus IDs differ by at most 20. Arabidopsis Locus IDs have the form <b><it>At</it></b><it>x</it><b><it>g</it></b><it>yyyyy</it>, where <it>x</it> denotes the chromosome and the number <it>yyyyy</it> reflects the order of genes on the chromosome. Only one gene from each array of tandem replicates was used when counting the genes in groups of coexpressed neighbors. Tandem replicates were identified with AGI software <abbrgrp><abbr bid="B125">125</abbr></abbrgrp> using BLASTP with a threshold of <it>E</it> &lt; 10<sup>-20</sup> and allowing for one unrelated gene among cluster members.</p>
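            <p>The neighbor definition above can be sketched in Python as follows. The chaining rule (consecutive genes on the sorted list are joined while their ID numbers differ by at most 20) and all names are assumptions of this sketch. Removal of tandem replicates is not shown and is assumed to have been done on the input.</p>

```python
import re
from collections import defaultdict

# Nuclear loci only: chromosome digit 1-5, then a five-digit position number.
LOCUS_RE = re.compile(r"^At([1-5])g(\d{5})$", re.IGNORECASE)


def neighbor_groups(gene_regulon, max_diff=20):
    """Return groups of two or more same-regulon genes with nearby Locus IDs."""
    by_key = defaultdict(list)
    for locus, regulon in gene_regulon.items():
        match = LOCUS_RE.match(locus)
        if match is None:
            continue  # skips organellar loci such as AtCg... and AtMg...
        by_key[(regulon, match.group(1))].append((int(match.group(2)), locus))

    groups = []
    for key in sorted(by_key):
        members = sorted(by_key[key])
        current = [members[0][1]]
        for (prev_pos, _), (pos, locus) in zip(members, members[1:]):
            if pos - prev_pos > max_diff:
                # Gap too large: close the current run and start a new one.
                if len(current) > 1:
                    groups.append(current)
                current = [locus]
            else:
                current.append(locus)
        if len(current) > 1:
            groups.append(current)
    return groups
```

            <p>Here <it>gene_regulon</it> is a hypothetical dictionary that maps each Locus ID to its regulon label.</p>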
            <p>To identify groups of coexpressed neighbors in randomized datasets, nuclear-encoded genes were reassigned to regulons randomly, and the number of coexpressed neighbors was determined using the same criteria as for the experimental data. The mean number of groups of coexpressed neighbors in 100 reshuffled datasets was 421, compared to 539 in the real dataset. Overrepresentation of GO terms in the genes that form the groups of coexpressed neighbors was evaluated with the GOstat web tool <abbrgrp><abbr bid="B123">123</abbr></abbrgrp>.</p>
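            <p>A minimal sketch of the reshuffling comparison described above is given below. It assumes that regulon sizes are preserved by permuting the regulon labels among genes, and it takes the group-counting rule as a caller-supplied function; both choices, and all names, are assumptions of this sketch.</p>

```python
import random


def reshuffled_counts(gene_regulon, count_groups, n_shuffles=100, seed=0):
    """Apply count_groups to n_shuffles random reassignments of genes to regulons.

    gene_regulon: dict mapping Locus ID to regulon label
    count_groups: function that takes such a dict and returns a count
    """
    rng = random.Random(seed)
    genes = sorted(gene_regulon)
    labels = [gene_regulon[g] for g in genes]
    counts = []
    for _ in range(n_shuffles):
        # Permuting the labels keeps the size of every regulon unchanged.
        rng.shuffle(labels)
        counts.append(count_groups(dict(zip(genes, labels))))
    return counts
```

            <p>The mean of the returned counts can then be compared with the count obtained for the unshuffled assignment.</p>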
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>GGM: Graphical Gaussian Model; GO: Gene Ontology; MCL: Markov cluster algorithm; PEP: plastid-encoded RNA polymerase; NEP: nuclear-encoded RNA polymerase.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>WIM designed the study and conducted the analyses; WIM and ESW wrote the manuscript. Both authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank the Arabidopsis research community for making their microarray expression data available through the NASC and PLEXdb databases, D. Cook for assistance with data normalization, and A. de la Fuente, B. Nikolau, D. Oliver, S. Rodermel and D. Voytas for helpful comments.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Arabidopsis CBF1 overexpression induces COR genes and enhances freezing tolerance</p>
            </title>
            <aug>
               <au>
                  <snm>Jaglo-Ottosen</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Gilmour</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Zarka</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Schabenberger</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Thomashow</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>280</volume>
            <fpage>104</fpage>
            <lpage>106</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9525853</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Regulatory network of <it>Escherichia coli</it>: consistency between literature knowledge and microarray profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Gutierrez-Rios</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Rosenblueth</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Loza</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Huerta</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Collado-Vides</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>2435</fpage>
            <lpage>2443</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403762</pubid>
                  <pubid idtype="pmpid" link="fulltext">14597655</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A gene expression map for <it>Caenorhabditis elegans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kiraly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Eizinger</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wylie</snm>
                  <fnm>BN</fnm>
               </au>
               <au>
                  <snm>Davidson</snm>
                  <fnm>GS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>293</volume>
            <fpage>2087</fpage>
            <lpage>2092</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11557892</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Coexpression analysis of human genes across many microarray data sets</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Sajdak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Qin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pavlidis</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1085</fpage>
            <lpage>1094</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419787</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173114</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Systematic discovery of functional modules and context-specific functional annotation of human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>XJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>i222</fpage>
            <lpage>i229</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17646300</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Conservation and evolution of gene coexpression networks in human and chimpanzee brains</p>
            </title>
            <aug>
               <au>
                  <snm>Oldham</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Horvath</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Geschwind</snm>
                  <fnm>DH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>17973</fpage>
            <lpage>17978</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1693857</pubid>
                  <pubid idtype="pmpid" link="fulltext">17101986</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The functional landscape of mouse gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Morris</snm>
                  <fnm>QD</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shai</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Bakowski</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Mitsakakis</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mohammad</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Zirngibl</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Somogyi</snm>
                  <fnm>E</fnm>
               </au>
               <etal/>
            </aug>
            <source>J Biol</source>
            <pubdate>2004</pubdate>
            <volume>3</volume>
            <issue>5</issue>
            <fpage>21</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">549719</pubid>
                  <pubid idtype="pmpid" link="fulltext">15588312</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Applications of a rat multiple tissue gene expression data set</p>
            </title>
            <aug>
               <au>
                  <snm>Walker</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>AI</fnm>
               </au>
               <au>
                  <snm>Self</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Hogenesch</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Lapp</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sturchler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kinnunen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Maier</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hoyer</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bilbe</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>742</fpage>
            <lpage>749</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">383321</pubid>
                  <pubid idtype="pmpid" link="fulltext">15060018</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Cluster analysis and display of genome-wide expression patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Spellman</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>14863</fpage>
            <lpage>14868</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">24541</pubid>
                  <pubid idtype="pmpid" link="fulltext">9843981</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Gene connectivity, function, and sequence conservation: predictions from modular yeast co-expression networks</p>
            </title>
            <aug>
               <au>
                  <snm>Carlson</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Mischel</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Horvath</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>40</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1413526</pubid>
                  <pubid idtype="pmpid" link="fulltext">16515682</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>799</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12399584</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A gene-coexpression network for global discovery of conserved genetic modules</p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>249</fpage>
            <lpage>255</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12934013</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shapira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Regev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pe'er</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <fpage>166</fpage>
            <lpage>176</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12740579</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Estimating genomic coexpression networks using first-order conditional independence</p>
            </title>
            <aug>
               <au>
                  <snm>Magwene</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R100</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545795</pubid>
                  <pubid idtype="pmpid" link="fulltext">15575966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>TAIR</p>
            </title>
            <url>http://www.arabidopsis.org/</url>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Annotating genes of known and unknown function by large-scale coexpression analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Horan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bailey-Serres</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mittler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shelton</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Harper</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Cushman</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Gollery</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Girke</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2008</pubdate>
            <volume>147</volume>
            <fpage>41</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2330292</pubid>
                  <pubid idtype="pmpid" link="fulltext">18354039</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Plant Unknown-eome DB (POND)</p>
            </title>
            <url>http://bioweb.ucr.edu/scripts/unknownsDisplay.pl</url>
         </bibl>
         <bibl id="B18">
            <title>
               <p>NASCArrays: a repository for microarray data generated by NASC's transcriptomics service</p>
            </title>
            <aug>
               <au>
                  <snm>Craigon</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Okyere</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jotham</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D575</fpage>
            <lpage>577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308867</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681484</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>NASCArrays</p>
            </title>
            <url>http://affymetrix.arabidopsis.info/narrays/experimentbrowse.pl</url>
         </bibl>
         <bibl id="B20">
            <title>
               <p>GENEVESTIGATOR: Arabidopsis Microarray Database and Analysis Toolbox</p>
            </title>
            <aug>
               <au>
                  <snm>Zimmermann</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hirsch-Hoffmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hennig</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gruissem</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>136</volume>
            <issue>1</issue>
            <fpage>2621</fpage>
            <lpage>2632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523327</pubid>
                  <pubid idtype="pmpid" link="fulltext">15375207</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Genevestigator</p>
            </title>
            <url>https://www.genevestigator.ethz.ch/</url>
         </bibl>
         <bibl id="B22">
            <title>
               <p>BarleyBase/PLEXdb: A Unified Expression Profiling Database for Plants and Plant Pathogens</p>
            </title>
            <aug>
               <au>
                  <snm>Wise</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Caldo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cannon</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dickerson</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Methods Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>406</volume>
            <fpage>347</fpage>
            <lpage>364</lpage>
            <xrefbib>
               <pubid idtype="pmpid">18287702</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>PLEXdb</p>
            </title>
            <url>http://plexdb.org/</url>
         </bibl>
         <bibl id="B24">
            <title>
               <p>MetaOmGraph</p>
            </title>
            <url>http://www.metnetdb.org/MetNet_MetaOmGraph.htm</url>
         </bibl>
         <bibl id="B25">
            <title>
               <p>ArrayExpress &#8211; a public database of microarray experiments and gene expression profiles</p>
            </title>
            <aug>
               <au>
                  <snm>Parkinson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kapushesky</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shojatalab</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Abeygunawardena</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Coulson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Farne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Holloway</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kolesnykov</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lilja</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lukk</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D747</fpage>
            <lpage>750</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1716725</pubid>
                  <pubid idtype="pmpid" link="fulltext">17132828</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>ArrayExpress</p>
            </title>
            <url>http://www.ebi.ac.uk/arrayexpress/</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>VANTED: A system for advanced data analysis and visualization in the context of biological networks</p>
            </title>
            <aug>
               <au>
                  <snm>Junker</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Klukas</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1361790</pubid>
                  <pubid idtype="pmpid" link="fulltext">16401345</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Vanted</p>
            </title>
            <url>http://vanted.ipk-gatersleben.de/</url>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Virtual Plant</p>
            </title>
            <url>http://virtualplant-prod.bio.nyu.edu/cgi-bin/virtualplant.cgi</url>
         </bibl>
         <bibl id="B30">
            <title>
               <p>ATTED-II: a database of co-expressed genes and cis elements for identifying co-regulated gene groups in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Obayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kinoshita</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nakai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shibaoka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hayashi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Saeki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohta</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D863</fpage>
            <lpage>869</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1716726</pubid>
                  <pubid idtype="pmpid" link="fulltext">17130150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>ATTED-II</p>
            </title>
            <url>http://www.atted.bio.titech.ac.jp/</url>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Arabidopsis Co-expression Tool (ACT): web server tools for microarray-based gene expression analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Manfield</snm>
                  <fnm>IW</fnm>
               </au>
               <au>
                  <snm>Jen</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Pinney</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Michalopoulos</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bradford</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Gilmartin</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Westhead</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>W504</fpage>
            <lpage>W509</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1538833</pubid>
                  <pubid idtype="pmpid" link="fulltext">16845059</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Arabidopsis Coexpression Data Mining Tool</p>
            </title>
            <url>http://www.arabidopsis.leeds.ac.uk/act/</url>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The Botany Array Resource: e-Northerns, Expression Angling, and promoter analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Toufighi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Brady</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Austin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ly</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Provart</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>43</volume>
            <fpage>153</fpage>
            <lpage>163</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15960624</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Bio-Array Resource</p>
            </title>
            <url>http://bar.utoronto.ca/</url>
         </bibl>
         <bibl id="B36">
            <title>
               <p>MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes</p>
            </title>
            <aug>
               <au>
                  <snm>Thimm</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Bl&#228;sing</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gibon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nagel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kr&#252;ger</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Stitt</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2004</pubdate>
            <volume>37</volume>
            <fpage>914</fpage>
            <lpage>939</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14996223</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>MapMan</p>
            </title>
            <url>http://gabi.rzpd.de/projects/MapMan/</url>
         </bibl>
         <bibl id="B38">
            <title>
               <p>PageMan: an interactive ontology tool to generate, display, and annotate overview graphs for profiling experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Usadel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nagel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Steinhauser</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gibon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bl&#228;sing</snm>
                  <fnm>OE</fnm>
               </au>
               <au>
                  <snm>Redestig</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sreenivasulu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Krall</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hannah</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Poree</snm>
                  <fnm>F</fnm>
               </au>
               <etal/>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>535</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1766370</pubid>
                  <pubid idtype="pmpid" link="fulltext">17176458</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>PageMan</p>
            </title>
            <url>http://mapman.mpimp-golm.mpg.de/pageman/</url>
         </bibl>
         <bibl id="B40">
            <title>
               <p>CressExpress: A Tool For Large-Scale Mining of Expression Data from Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Srinivasasainagendra</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Mehta</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Coulibaly</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Loraine</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2008</pubdate>
            <volume>147</volume>
            <fpage>1004</fpage>
            <lpage>1016</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2442548</pubid>
                  <pubid idtype="pmpid" link="fulltext">18467456</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>CressExpress</p>
            </title>
            <url>http://www.cressexpress.org</url>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Transcriptional co-regulation of secondary metabolism enzymes in Arabidopsis: functional and evolutionary implications</p>
            </title>
            <aug>
               <au>
                  <snm>Gachon</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Langlois-Meurinne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Henry</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Saindrenan</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>58</volume>
            <fpage>229</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16027976</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Identification of novel genes in Arabidopsis involved in secondary cell wall formation using expression profiling and reverse genetics</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Zeef</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Ellis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Goodacre</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2005</pubdate>
            <volume>17</volume>
            <fpage>2281</fpage>
            <lpage>2295</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182489</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980264</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets</p>
            </title>
            <aug>
               <au>
                  <snm>Persson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Milne</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Somerville</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>8633</fpage>
            <lpage>8638</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142401</pubid>
                  <pubid idtype="pmpid" link="fulltext">15932943</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Transcriptional co-response analysis as a tool to identify new components of the wall biosynthetic machinery</p>
            </title>
            <aug>
               <au>
                  <snm>Usadel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kuschinsky</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Steinhauser</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pauly</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Biosystems</source>
            <pubdate>2005</pubdate>
            <volume>139</volume>
            <fpage>69</fpage>
            <lpage>73</lpage>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Transcriptional coordination of the biogenesis of the oxidative phosphorylation machinery in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Gonzalez</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Welchen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Attallah</snm>
                  <fnm>CV</fnm>
               </au>
               <au>
                  <snm>Comelli</snm>
                  <fnm>RN</fnm>
               </au>
               <au>
                  <snm>Mufarrege</snm>
                  <fnm>EF</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2007</pubdate>
            <volume>51</volume>
            <fpage>105</fpage>
            <lpage>116</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17561924</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Genome-wide reprogramming of metabolism and regulatory networks of Arabidopsis in response to phosphorus</p>
            </title>
            <aug>
               <au>
                  <snm>Morcuende</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bari</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gibon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Pant</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Bl&#228;sing</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Usadel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Czechowski</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Udvardi</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Stitt</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant Cell Environ</source>
            <pubdate>2007</pubdate>
            <volume>30</volume>
            <fpage>85</fpage>
            <lpage>112</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17177879</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Integration of <it>Arabidopsis thaliana </it>stress-related transcript profiles, promoter structures, and cell-specific expression</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bohnert</snm>
                  <fnm>HJ</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R49</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1896000</pubid>
                  <pubid idtype="pmpid" link="fulltext">17408486</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Connecting genes, coexpression modules, and molecular signatures to environmental stress phenotypes in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Weston</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Gunter</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wullschleger</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>BMC Syst Biol</source>
            <pubdate>2008</pubdate>
            <volume>2</volume>
            <fpage>16</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2277374</pubid>
                  <pubid idtype="pmpid" link="fulltext">18248680</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Analysis of 101 nuclear transcriptomes reveals 23 distinct regulons and their relationship to metabolism, chromosomal gene distribution and co-ordination of nuclear and plastid gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Biehl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Richly</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Noutsos</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Salamini</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Leister</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2005</pubdate>
            <volume>344</volume>
            <fpage>33</fpage>
            <lpage>41</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15656970</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Transcriptional coordination of the metabolic network in <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Persson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mehta</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Srinivasasainagendra</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Somerville</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Loraine</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2006</pubdate>
            <volume>142</volume>
            <fpage>762</fpage>
            <lpage>774</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1586052</pubid>
                  <pubid idtype="pmpid" link="fulltext">16920875</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>AraCyc</p>
            </title>
            <url>http://www.arabidopsis.org/biocyc/index.jsp</url>
         </bibl>
         <bibl id="B53">
            <title>
               <p>AraCyc: a biochemical pathway database for Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Mueller</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2003</pubdate>
            <volume>132</volume>
            <fpage>453</fpage>
            <lpage>460</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">166988</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805578</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>An Arabidopsis gene network based on the graphical Gaussian model</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gong</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Bohnert</snm>
                  <fnm>HJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1614</fpage>
            <lpage>1625</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2045144</pubid>
                  <pubid idtype="pmpid" link="fulltext">17921353</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Modeling gene expression networks using fuzzy logic</p>
            </title>
            <aug>
               <au>
                  <snm>Du</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wurtele</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Dickerson</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>IEEE Trans Syst Man Cybern B Cybern</source>
            <pubdate>2005</pubdate>
            <volume>35</volume>
            <issue>6</issue>
            <fpage>1351</fpage>
            <lpage>1359</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16366260</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>BarleyBase &#8211; an expression profiling database for plant genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Shen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Caldo</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Nettleton</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wise</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Dickerson</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D614</fpage>
            <lpage>D618</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540077</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608273</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Estimating mutual information using B-spline functions &#8211; an improved similarity measure for analysing gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Daub</snm>
                  <fnm>CO</fnm>
               </au>
               <au>
                  <snm>Steuer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kloska</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>118</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">516800</pubid>
                  <pubid idtype="pmpid" link="fulltext">15339346</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Graph clustering by flow simulation</p>
            </title>
            <aug>
               <au>
                  <snm>van Dongen</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>PhD thesis</source>
            <publisher>University of Utrecht</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B59">
            <title>
               <p>An efficient algorithm for large-scale detection of protein families</p>
            </title>
            <aug>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>van Dongen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>1575</fpage>
            <lpage>1584</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">101833</pubid>
                  <pubid idtype="pmpid" link="fulltext">11917018</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Detection of functional modules from protein interaction networks</p>
            </title>
            <aug>
               <au>
                  <snm>Pereira-Leal</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2004</pubdate>
            <volume>54</volume>
            <fpage>49</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14705023</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Gene Ontology</p>
            </title>
            <url>http://www.geneontology.org/</url>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Deletion of the chloroplast-localized Thylakoid formation1 gene product in Arabidopsis leads to deficient thylakoid formation and variegated leaves</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Kight</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Henry</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Korth</snm>
                  <fnm>KL</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>136</volume>
            <fpage>3594</fpage>
            <lpage>3604</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">527158</pubid>
                  <pubid idtype="pmpid" link="fulltext">15516501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Chloroplast RNA-binding and pentatricopeptide repeat proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sugiura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sugita</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Biochem Soc Trans</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>571</fpage>
            <lpage>574</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15270678</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Lurin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Andr&#233;s</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Aubourg</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bellaoui</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bitton</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Bruy&#232;re</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Caboche</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Debast</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gualberto</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>B</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2004</pubdate>
            <volume>16</volume>
            <fpage>2089</fpage>
            <lpage>2103</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">519200</pubid>
                  <pubid idtype="pmpid" link="fulltext">15269332</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Deletion of rpoB reveals a second distinct transcription system in plastids of higher plants</p>
            </title>
            <aug>
               <au>
                  <snm>Allison</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Maliga</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>1996</pubdate>
            <volume>15</volume>
            <fpage>2802</fpage>
            <lpage>2809</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">450217</pubid>
                  <pubid idtype="pmpid">8654377</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>The nuclear gene HCF107 encodes a membrane-associated R-TPR (RNA tetratricopeptide repeat)-containing protein involved in expression of the plastidial psbH gene in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Sane</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Westhoff</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>42</volume>
            <fpage>720</fpage>
            <lpage>730</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15918885</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>FLU: a negative regulator of chlorophyll biosynthesis in <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Meskauskiene</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nater</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goslings</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kessler</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>op den Camp</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Apel</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>12826</fpage>
            <lpage>12831</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60138</pubid>
                  <pubid idtype="pmpid" link="fulltext">11606728</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>The role of sigma factors in plastid transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Allison</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Biochimie</source>
            <pubdate>2000</pubdate>
            <volume>82</volume>
            <fpage>537</fpage>
            <lpage>548</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10946105</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>AtSig5 is an essential nucleus-encoded Arabidopsis sigma-like factor</p>
            </title>
            <aug>
               <au>
                  <snm>Yao</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Roy-Chowdhury</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Allison</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2003</pubdate>
            <volume>132</volume>
            <fpage>739</fpage>
            <lpage>747</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">167013</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805603</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>The multiple-stress responsive plastid sigma factor, SIG5, directs activation of the psbD blue light-responsive promoter (BLRP) in <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Nagashima</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hanaoka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shikanai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fujiwara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kanamaru</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant Cell Physiol</source>
            <pubdate>2004</pubdate>
            <volume>45</volume>
            <fpage>357</fpage>
            <lpage>368</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15111710</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Component specificity for the thylakoidal Sec and Delta pH-dependent protein transport pathways</p>
            </title>
            <aug>
               <au>
                  <snm>Mori</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Summer</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Cline</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>J Cell Biol</source>
            <pubdate>1999</pubdate>
            <volume>146</volume>
            <fpage>45</fpage>
            <lpage>56</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2199744</pubid>
                  <pubid idtype="pmpid" link="fulltext">10402459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>The major protein import receptor of plastids is essential for chloroplast biogenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Bauer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hiltbrunner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wehrli</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Eugster</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schnell</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kessler</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>203</fpage>
            <lpage>207</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10646606</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>A chromodomain protein encoded by the Arabidopsis CAO gene is a plant-specific component of the chloroplast signal recognition particle pathway that is involved in LHCP targeting</p>
            </title>
            <aug>
               <au>
                  <snm>Klimyuk</snm>
                  <fnm>VI</fnm>
               </au>
               <au>
                  <snm>Persello-Cartieaux</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Havaux</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Contard-David</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schuenemann</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Meierhoff</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gouet</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Hoffman</snm>
                  <fnm>NE</fnm>
               </au>
               <au>
                  <snm>Nussaume</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1999</pubdate>
            <volume>11</volume>
            <fpage>87</fpage>
            <lpage>99</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">144089</pubid>
                  <pubid idtype="pmpid" link="fulltext">9878634</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Chloroplast FtsY, chloroplast signal recognition particle, and GTP are required to reconstitute the soluble phase of light-harvesting chlorophyll protein transport into thylakoid membranes</p>
            </title>
            <aug>
               <au>
                  <snm>Tu</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Schuenemann</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hoffman</snm>
                  <fnm>NE</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1999</pubdate>
            <volume>274</volume>
            <fpage>27219</fpage>
            <lpage>27224</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10480939</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>RNA degradation buffers asymmetries of transcription in Arabidopsis mitochondria</p>
            </title>
            <aug>
               <au>
                  <snm>Gieg&#233;</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Binder</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brennicke</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2000</pubdate>
            <volume>1</volume>
            <fpage>164</fpage>
            <lpage>170</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1084256</pubid>
                  <pubid idtype="pmpid">11265757</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>HUA1, a regulator of stamen and carpel identities in Arabidopsis, codes for a nuclear RNA binding protein</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jia</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2001</pubdate>
            <volume>13</volume>
            <fpage>2269</fpage>
            <lpage>2281</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">139158</pubid>
                  <pubid idtype="pmpid" link="fulltext">11595801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Divergent roles of a pair of homologous jumonji/zinc-finger-class transcription factor proteins in the regulation of Arabidopsis flowering time</p>
            </title>
            <aug>
               <au>
                  <snm>Noh</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Yi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Shin</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jung</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Amasino</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Noh</snm>
                  <fnm>YS</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2004</pubdate>
            <volume>16</volume>
            <fpage>2601</fpage>
            <lpage>2613</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">520958</pubid>
                  <pubid idtype="pmpid" link="fulltext">15377760</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>SPLAYED, a novel SWI/SNF ATPase homolog, controls reproductive development in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Wagner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Meyerowitz</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>85</fpage>
            <lpage>94</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11818058</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>The Arabidopsis HETEROCHROMATIN PROTEIN1 homolog (TERMINAL FLOWER2) silences genes within euchromatic region but not genes positioned in heterochromatin</p>
            </title>
            <aug>
               <au>
                  <snm>Nakahigashi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jasencakova</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Schubert</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant Cell Physiol</source>
            <pubdate>2005</pubdate>
            <volume>46</volume>
            <fpage>1747</fpage>
            <lpage>1756</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16131496</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>LEUNIG regulates AGAMOUS expression in Arabidopsis flowers</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Meyerowitz</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>1995</pubdate>
            <volume>121</volume>
            <fpage>975</fpage>
            <lpage>991</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7743940</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>Novel nuclear-encoded proteins interacting with a plastid sigma factor, Sig1, in <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Morikawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shiina</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Murakami</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Toyoshima</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2002</pubdate>
            <volume>514</volume>
            <fpage>300</fpage>
            <lpage>304</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11943170</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>The chemical diversity and distribution of glucosinolates and isothiocyanates among plants</p>
            </title>
            <aug>
               <au>
                  <snm>Fahey</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Zalcmann</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Talalay</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Phytochemistry</source>
            <pubdate>2001</pubdate>
            <volume>56</volume>
            <fpage>5</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11198818</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>Glucosinolate metabolism and its control</p>
            </title>
            <aug>
               <au>
                  <snm>Grubb</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Abel</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>2006</pubdate>
            <volume>11</volume>
            <fpage>89</fpage>
            <lpage>100</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16406306</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Glucosinolate and amino acid biosynthesis in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Field</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Cardon</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Traka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Botterman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vancanneyt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mithen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>135</volume>
            <fpage>828</fpage>
            <lpage>839</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514118</pubid>
                  <pubid idtype="pmpid" link="fulltext">15155874</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B85">
            <title>
               <p>Arabidopsis branched-chain aminotransferase 3 functions in both amino acid and glucosinolate biosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Knill</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Reichelt</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gershenzon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Binder</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2008</pubdate>
            <volume>146</volume>
            <fpage>1028</fpage>
            <lpage>1039</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2259058</pubid>
                  <pubid idtype="pmpid" link="fulltext">18162591</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B86">
            <title>
               <p>Aldoxime-forming microsomal enzyme systems involved in the biosynthesis of glucosinolates in oilseed rape (<it>Brassica napus</it>) leaves</p>
            </title>
            <aug>
               <au>
                  <snm>Bennett</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Donald</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dawson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hick</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wallsgrove</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>1993</pubdate>
            <volume>102</volume>
            <fpage>1307</fpage>
            <lpage>1312</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">158920</pubid>
                  <pubid idtype="pmpid" link="fulltext">12231906</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B87">
            <title>
               <p>Transcriptome analysis of haploid male gametophyte development in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Honys</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Twell</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R85</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545776</pubid>
                  <pubid idtype="pmpid" link="fulltext">15535861</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B88">
            <title>
               <p>Coexpression of neighboring genes in the genome of <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Bowles</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1060</fpage>
            <lpage>1067</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419784</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B89">
            <title>
               <p>Local coexpression domains of two to four genes in the genome of Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Ren</snm>
                  <fnm>XY</fnm>
               </au>
               <au>
                  <snm>Fiers</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Stiekema</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Nap</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2005</pubdate>
            <volume>138</volume>
            <fpage>923</fpage>
            <lpage>934</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1150408</pubid>
                  <pubid idtype="pmpid" link="fulltext">15923337</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B90">
            <title>
               <p>Islands of co-expressed neighbouring genes in <it>Arabidopsis thaliana</it> suggest higher-order chromosome domains</p>
            </title>
            <aug>
               <au>
                  <snm>Zhan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Horrocks</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lukens</snm>
                  <fnm>LN</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2006</pubdate>
            <volume>45</volume>
            <fpage>347</fpage>
            <lpage>357</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16412082</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B91">
            <title>
               <p>The regulatory code for transcriptional response diversity and its relation to genome structural properties in <it>A. thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Walther</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brunnemann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>PLoS Genetics</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>e11</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1796623</pubid>
                  <pubid idtype="pmpid" link="fulltext">17291162</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B92">
            <title>
               <p>Microtubule cortical array organization and plant cell morphogenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Paredez</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ehrhardt</snm>
                  <fnm>DW</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2006</pubdate>
            <volume>9</volume>
            <fpage>571</fpage>
            <lpage>578</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17010658</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B93">
            <title>
               <p>Disruption of the cellulose synthase gene, AtCesA8/IRX1, enhances drought and osmotic stress tolerance</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Gong</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>43</volume>
            <fpage>273</fpage>
            <lpage>283</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15998313</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B94">
            <title>
               <p>Xylem wall collapse in water-stressed pine needles</p>
            </title>
            <aug>
               <au>
                  <snm>Cochard</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Froux</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Mayr</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Coutand</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>134</volume>
            <fpage>401</fpage>
            <lpage>408</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">316319</pubid>
                  <pubid idtype="pmpid" link="fulltext">14657404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B95">
            <title>
               <p>Biochemistry</p>
            </title>
            <aug>
               <au>
                  <snm>Stryer</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <publisher>New York: WH Freeman</publisher>
            <pubdate>1988</pubdate>
         </bibl>
         <bibl id="B96">
            <title>
               <p>Plastids unleashed: their development and their integration in plant development</p>
            </title>
            <aug>
               <au>
                  <snm>Lopez-Juez</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Pyke</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>Int J Dev Biol</source>
            <pubdate>2005</pubdate>
            <volume>49</volume>
            <fpage>557</fpage>
            <lpage>577</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16096965</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B97">
            <title>
               <p>Chloroplasts</p>
            </title>
            <aug>
               <au>
                  <snm>Bogorad</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Cell Biol</source>
            <pubdate>1981</pubdate>
            <volume>91</volume>
            <fpage>256s</fpage>
            <lpage>270s</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2112801</pubid>
                  <pubid idtype="pmpid" link="fulltext">6172430</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B98">
            <title>
               <p>Nuclear-chloroplast signalling</p>
            </title>
            <aug>
               <au>
                  <snm>Somanchi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mayfield</snm>
                  <fnm>SP</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>1999</pubdate>
            <volume>2</volume>
            <fpage>404</fpage>
            <lpage>409</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10508759</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B99">
            <title>
               <p>Knock-out of the plastid ribosomal protein L11 in Arabidopsis: effects on mRNA translation and photosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Pesaresi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Varotto</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Meurer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jahns</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Salamini</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Leister</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2001</pubdate>
            <volume>27</volume>
            <fpage>179</fpage>
            <lpage>189</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11532164</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B100">
            <title>
               <p>Pathways of intracellular communication: Tetrapyrroles and plastid-to-nucleus signaling</p>
            </title>
            <aug>
               <au>
                  <snm>Rodermel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2003</pubdate>
            <volume>25</volume>
            <fpage>631</fpage>
            <lpage>636</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12815718</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B101">
            <title>
               <p>Genomics based dissection of the cross-talk of chloroplasts with the nucleus and mitochondria in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Leister</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2005</pubdate>
            <volume>354</volume>
            <fpage>110</fpage>
            <lpage>116</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15908143</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B102">
            <title>
               <p>The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses</p>
            </title>
            <aug>
               <au>
                  <snm>Kilian</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Horak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wanke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Weinl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Batistic</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>D'Angelo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bornberg-Bauer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kudla</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Harter</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2007</pubdate>
            <volume>50</volume>
            <fpage>347</fpage>
            <lpage>363</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17376166</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B103">
            <title>
               <p>Empirical analysis of transcriptional activity in the Arabidopsis genome</p>
            </title>
            <aug>
               <au>
                  <snm>Yamada</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dale</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Shinn</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Palm</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Southwick</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>842</fpage>
            <lpage>846</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14593172</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B104">
            <title>
               <p>The role of chromatin structure in regulating the expression of clustered genes</p>
            </title>
            <aug>
               <au>
                  <snm>Sproul</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bickmore</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>10</issue>
            <fpage>775</fpage>
            <lpage>781</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16160692</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B105">
            <title>
               <p>The evolutionary dynamics of eukaryotic gene order</p>
            </title>
            <aug>
               <au>
                  <snm>Hurst</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>P&#225;l</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>299</fpage>
            <lpage>310</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15131653</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B106">
            <title>
               <p>An abundance of bidirectional promoters in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Trinklein</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Aldred</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Hartman</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Schroeder</snm>
                  <fnm>DI</fnm>
               </au>
               <au>
                  <snm>Otillar</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>62</fpage>
            <lpage>66</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">314279</pubid>
                  <pubid idtype="pmpid" link="fulltext">14707170</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B107">
            <title>
               <p>An imprinted, mammalian bicistronic transcript encodes two independent proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Gray</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Saitoh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nicholls</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>5616</fpage>
            <lpage>5621</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">21909</pubid>
                  <pubid idtype="pmpid" link="fulltext">10318933</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B108">
            <title>
               <p>A global analysis of <it>Caenorhabditis elegans</it> operons</p>
            </title>
            <aug>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Link</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Guffanti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thierry-Mieg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Duke</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kiraly</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>417</volume>
            <fpage>851</fpage>
            <lpage>854</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12075352</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B109">
            <title>
               <p>A gene expression map of the Arabidopsis root</p>
            </title>
            <aug>
               <au>
                  <snm>Birnbaum</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shasha</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Jung</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Lambert</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Galbraith</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Benfey</snm>
                  <fnm>PN</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>1956</fpage>
            <lpage>1960</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14671301</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B110">
            <title>
               <p>Effects of coexpressing the plant CDK inhibitor ICK1 and D-type cyclin genes on plant growth, cell size and ploidy in <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gilmer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Whitwill</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fowke</snm>
                  <fnm>LC</fnm>
               </au>
            </aug>
            <source>Planta</source>
            <pubdate>2003</pubdate>
            <volume>216</volume>
            <fpage>604</fpage>
            <lpage>613</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12569402</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B111">
            <title>
               <p>The ARABIDOPSIS SKP1-LIKE1 (ASK1) protein acts predominately from leptotene to pachytene and represses homologous recombination in male meiosis</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Planta</source>
            <pubdate>2006</pubdate>
            <volume>223</volume>
            <fpage>613</fpage>
            <lpage>617</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16283376</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B112">
            <title>
               <p>MetNet Platform</p>
            </title>
            <url>http://metnetdb.org</url>
         </bibl>
         <bibl id="B113">
            <title>
               <p>Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Luu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ngai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>e15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">100354</pubid>
                  <pubid idtype="pmpid" link="fulltext">11842121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B114">
            <title>
               <p>FTP directory</p>
            </title>
            <url>ftp://ftp.arabidopsis.org/home/tair/Microarrays/</url>
         </bibl>
         <bibl id="B115">
            <title>
               <p>The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community</p>
            </title>
            <aug>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Beavis</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Berardini</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dixon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Garcia-Hernandez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Huala</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Montoya</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>224</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165523</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519987</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B116">
            <title>
               <p>R: A language and environment for statistical computing</p>
            </title>
            <aug>
               <au>
                  <cnm>R Development Core Team</cnm>
               </au>
            </aug>
            <publisher>Vienna, Austria: R Foundation for Statistical Computing</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B117">
            <title>
               <p>Articulation of three core metabolic processes in Arabidopsis: fatty acid biosynthesis, leucine catabolism and starch metabolism</p>
            </title>
            <aug>
               <au>
                  <snm>Mentzen</snm>
                  <fnm>WI</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ransom</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nikolau</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Wurtele</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>BMC Plant Biology</source>
            <pubdate>2008</pubdate>
            <volume>8</volume>
            <fpage>76</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2483283</pubid>
                  <pubid idtype="pmpid" link="fulltext">18616834</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B118">
            <title>
               <p>GraphExplore</p>
            </title>
            <url>http://graphexplore.cgt.duke.edu</url>
         </bibl>
         <bibl id="B119">
            <title>
               <p>Validating clustering for gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Yeung</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Haynor</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Ruzzo</snm>
                  <fnm>WL</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>309</fpage>
            <lpage>318</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11301299</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B120">
            <title>
               <p>Validation and functional annotation of expression-based clusters based on gene ontology</p>
            </title>
            <aug>
               <au>
                  <snm>Steuer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Humburg</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>380</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1586215</pubid>
                  <pubid idtype="pmpid" link="fulltext">16911788</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B121">
            <title>
               <p>Functional annotation of the Arabidopsis genome using controlled vocabularies</p>
            </title>
            <aug>
               <au>
                  <snm>Berardini</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Mundodi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reiser</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Huala</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Garcia-Hernandez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Yoon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>G</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>135</volume>
            <fpage>1</fpage>
            <lpage>11</lpage>
         </bibl>
         <bibl id="B122">
            <title>
               <p>GOstat</p>
            </title>
            <url>http://gostat.wehi.edu.au/cgi-bin/goStat.pl</url>
         </bibl>
         <bibl id="B123">
            <title>
               <p>GOstat: Find statistically overrepresented Gene Ontologies within a group of genes</p>
            </title>
            <aug>
               <au>
                  <snm>Beissbarth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>1464</fpage>
            <lpage>1465</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14962934</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B124">
            <title>
               <p>MetNet: Systems Biology Software for Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Wurtele</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Berleant</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dickerson</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Ding</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <etal/>
            </aug>
            <source>Concepts in Plant Metabolomics</source>
            <publisher>Springer-Verlag</publisher>
            <pubdate>2007</pubdate>
            <fpage>145</fpage>
            <lpage>158</lpage>
         </bibl>
         <bibl id="B125">
            <title>
               <p>Analysis of the genome sequence of the flowering plant <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <cnm>Arabidopsis Genome Initiative</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <fpage>796</fpage>
            <lpage>815</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11130711</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B126">
            <title>
               <p>Chromosome Map Tool</p>
            </title>
            <url>http://www.arabidopsis.org/jsp/ChromosomeMap/tool.jsp</url>
         </bibl>
      </refgrp>
   </bm>
</art>
