<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-8-454</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Transcription factor target prediction using multiple short expression time series from Arabidopsis thaliana</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Redestig</snm>
               <fnm>Henning</fnm>
               <insr iid="I1"/>
               <email>redestig@mpimp-golm.mpg.de</email>
            </au>
            <au id="A2">
               <snm>Weicht</snm>
               <fnm>Daniel</fnm>
               <insr iid="I1"/>
               <email>weicht@mpimp-golm.mpg.de</email>
            </au>
            <au id="A3">
               <snm>Selbig</snm>
               <fnm>Joachim</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>selbig@mpimp-golm.mpg.de</email>
            </au>
            <au id="A4">
               <snm>Hannah</snm>
               <mi>A</mi>
               <fnm>Matthew</fnm>
               <insr iid="I1"/>
               <email>hannah@mpimp-golm.mpg.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Max Planck Institute for Molecular Plant Physiology, Am M&#252;hlenberg 1, D-14476 Potsdam-Golm, Germany</p>
            </ins>
            <ins id="I2">
               <p>University of Potsdam, Am Neuen Palais, D-14469, Potsdam, Germany</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>454</fpage>
         <url>http://www.biomedcentral.com/1471-2105/8/454</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18021423</pubid>
               <pubid idtype="doi">10.1186/1471-2105-8-454</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>11</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>18</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>18</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Redestig et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The central role of transcription factors (TFs) in higher eukaryotes has led to much interest in deciphering transcriptional regulatory interactions. Even in the best case, experimental identification of TF target genes is error-prone, and has been shown to improve when additional forms of evidence such as expression data are considered. Previous expression-based methods have not explicitly tried to associate TFs with their targets and have therefore largely ignored the treatment-specific and time-dependent nature of transcriptional regulation.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In this study we introduce CERMT, Covariance based Extraction of Regulatory targets using Multiple Time series. Using simulated and real data we show that a good strategy appears to be to combine multiple expression time series, select treatments in which the TF responds, allow time shifts between TFs and their targets, and use covariance to identify highly responding genes. We applied our method to published TF &#8211; target gene relationships determined using expression profiling on TF mutants and show that in most cases we obtain significant target gene enrichment, and that in half of the cases this is sufficient to deliver a usable list of high-confidence target genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>CERMT could be immediately useful in refining possible target genes of candidate TFs using publicly available data, particularly for organisms lacking comprehensive TF binding data. In the future, we believe its incorporation with other forms of evidence may improve integrative genome-wide predictions of transcriptional networks.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Transcriptional regulation is essential for all eukaryotes and is central to the complex development and environmental responses of higher organisms. The identification of transcription factors (TFs), TF-target genes and transcriptional regulatory networks is therefore of fundamental importance for biology. The ability of TFs to modify the expression of many physiologically important target genes has made them attractive targets for biotechnology <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Traditionally, experimental approaches have sought to identify TF-targets by measuring gene expression in loss- or gain-of-function mutants, whilst TF binding to their target promoters has been measured using gel-shift assays, co-transfection assays or chromatin-immunoprecipitation (ChIP). With the arrival of genome-scale technologies, approaches have been scaled up to allow for the unbiased identification of either genes with altered expression in TF mutants using expression profiling, or promoters and other genomic sequences that are bound by a TF <it>in vivo </it>by hybridizing ChIP samples to DNA microarrays (ChIP-chip) <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>However, phenotypes of TF mutants are the product of the combination of temporal, developmental and genetic interactions with the altered gene function. Target identification may therefore be confounded by factors such as redundancy, pleiotropic overlap, severe developmental phenotypes or lethality (e.g. <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>). The use of inducible expression or inducible nuclear targeting of the TF may overcome these limitations but such systems have been rarely used and can also lead to secondary effects <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Likewise, genome-wide location data for TF binding from ChIP-chip experiments does not provide definitive evidence of target regulation. Observed DNA binding is not always sufficient to accurately predict a regulatory interaction <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B9">9</abbr></abbrgrp> as it may be related to a process other than transcriptional control of gene expression, or simply be biologically irrelevant. In yeast, ChIP-chip has been comprehensively applied to all 203 predicted TFs. However, such data provides only a snapshot of the complete regulatory network as interactions are dependent on many variables such as the cell type, genetic background and developmental stage of the organism, and the timing and type of environmental or biological stimuli <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In the case of higher eukaryotes, which have an order of magnitude greater diversity of both TFs <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and potential targets, mapping the regulatory network would require a currently unfeasible amount of time and resources. The central role of TFs and the limitations of the available data have together generated considerable interest in the computational prediction of TF-targets and regulatory networks. 
Applied simply, genes with altered expression in a TF mutant may be filtered by the presence or absence of a binding motif for the TF or for those showing a similar treatment-response to the TF (e.g. <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>). More complex algorithms have been used to improve the target prediction accuracy by combining ChIP-chip data with other resources such as phylogeny, TF binding motifs, co-expression data, or a combination of these <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B10">10</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. The power of combining multiple forms of evidence was recently demonstrated by Beyer and coworkers, who, by using eight forms of evidence, were able to predict previously unknown TF-binding interactions that could subsequently be proven by new, condition-directed, ChIP-chip experiments <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In the absence of more comprehensive ChIP-chip data, the application of these methods to higher eukaryotes is not yet feasible. One form of evidence that is also widely available for higher eukaryotes is co-expression data, which has become commonly used in computational biology since the increase in public availability of microarray data. Several tools that support such analyses have been developed (e.g. <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>). These analyses have been used to identify additional components of enzymatic modules <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> and assign specific functions to generalist enzymes <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Co-expression relationships may also support the known regulation of target genes by a TF <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. However, such examples are limited and the utility of co-expression data to predict targets of a given TF in an unsupervised fashion has not been explored. 
Given the importance of translocation and post-translational modification (e.g. <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>) this is understandable. However, TFs and their targets do tend to be co-expressed and by applying methods that overcome problems such as time shifts <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and conditional responses <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, gene expression data can be used as a proxy to measure TF activity. Shi et al. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> recently demonstrated this by using expression data from multiple time series and considering the possibility of time shifts when predicting TF-targets. By modeling the known regulatory relationships, they estimated treatment specific time scales which they then used to estimate the correct time shifts for predicting novel interactions. Their method depends on comprehensive prior knowledge in the form of large scale ChIP-chip data <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, and is therefore not yet applicable to organisms for which such resources are still unavailable. Furthermore, given the demonstrated condition dependency of ChIP-chip data and that its utility for inferring general properties of TF &#8211; TF-target interactions is not fully assessed, it is unclear whether the ChIP-chip data now becoming available will be useful beyond the directly studied TFs. Hence, it is of interest to examine the performance of methods that do not require prior information about regulatory interactions.</p>
         <p>Lacking appropriate training data, one has to resort to fully unsupervised (clustering) approaches. Several clustering methods that include the possibility of time shifts for gene expression data (e.g. <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>) have been described, but all of them take an <it>ab initio </it>approach by not using the information of which genes are supposed to be TFs, and are therefore unsuitable for querying the data for targets for a particular TF. Recently, Heard and coworkers also proposed a clustering method that can use multiple gene expression time series <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, but without considering time shifts.</p>
         <p>A prerequisite for incorporating temporal information into TF-target gene prediction is the selection of an appropriate dataset and the concomitant selection of a model organism. As will be discussed, the AtGenExpress consortium's stress series dataset for the model plant <it>Arabidopsis thaliana </it>was selected as the most technically and biologically appropriate. In particular, TF-target prediction in higher eukaryotes is appealing due to the aforementioned unfeasible task of comprehensive mapping of all TF-target gene interactions, meaning that such a method could have immediate biological application.</p>
         <p>We developed a method to identify potential TF-target genes as those responding strongly to the same stimuli as their controlling TF(s) in a coordinated temporal response. Initially, simulated data with predefined TF-target gene expression relationships showed that, by selecting treatments and incorporating temporal information, our algorithm can improve performance compared to conventional co-expression based methods. We then applied the method to identify known TF &#8211; target gene relationships, as experimentally determined using expression profiling on TF mutants. These data revealed that the method was useful to enrich targets for a diverse set of experimentally determined TF &#8211; target gene relationships. Furthermore, for half of the studied TFs, the enrichment of true targets among extracted genes was sufficient to obtain usable numbers of high-confidence target genes. By looking at a large set of annotated TFs, we also observed that the targets predicted using our approach are more enriched with both functional annotations and putative cis-elements than those obtained by conventional methods, indicating a higher biological relevance.</p>
         <p>We envisage our method could be immediately useful in narrowing the search for target genes of candidate TFs using publicly available data, either through direct prediction or by filtering data obtained by expression profiling of TF mutants. This would be particularly applicable for organisms lacking comprehensive TF binding data. We show that considering other evidence has the potential to improve the method's performance and, in the future, we believe its incorporation into methods using multiple forms of evidence may improve integrative genome-wide predictions of transcriptional networks.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Covariance based extraction of regulatory targets using multiple time series</p>
            </st>
            <p>In a simple scenario, assuming full transcriptional control of gene expression, the target genes will have the same characteristic expression pattern as the regulating TF itself, although possibly shifted forward in time. However, other genes can have a similar expression pattern as a direct response to an applied treatment even though they are unrelated in a regulatory sense. In order to separate such co-expression from the more interesting co-regulation, one has to look at many different time series of the same system exposed to different perturbations.</p>
            <p>A direct approach to utilize such data for co-expression analysis is to concatenate the available time courses and compute correlations based on the constructed pseudo time series, as was done in the co-expression databases CSB.DB and ATTED-II <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B18">18</abbr></abbrgrp>. However, there are two main conceptual problems with this approach. First, biological interaction patterns are very dynamic, and genes that are co-regulated in one condition can share little resemblance in the next, particularly if they happen to be regulated by more than one TF <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Thus, the inclusion of an experiment in which the TF is not active could theoretically worsen predictions. The second problem is that transcriptionally controlled regulation is delayed, so if the resolution of the time series is high enough, the TF will only be correlated with its target in a time-shifted manner. Moreover, this delay might not be the same across the different treatments because even though the studied <it>physical </it>time frame is the same, the <it>biological </it>time frame might differ. For example, transcription (like all other chemical reactions) is affected by temperature and so the time shift between the TF and its target is likely to differ between high and low temperature treatments. The problem is therefore two-fold. In order to make good predictions of plausible targets we propose that it is necessary to, from the total set of considered treatments, <it>I</it>: pick the 'right' subset of treatments and <it>II</it>: introduce the 'right' time shift for these. Finally, the use of the correlation coefficient implies that the scale of changes in gene expression is irrelevant and only the shape matters. To overcome the background noise from the numerous untranscribed genes one usually applies some sort of variance threshold. Here, we instead experiment with using the covariance, which pays attention to <it>both </it>shape and magnitude, in place of correlation, and thus assume that big changes are more relevant than small changes. To summarize, the following assumptions lay the groundwork for our approach:</p>
            <p>&#8226; The expression of the true target genes is transcriptionally controlled by the investigated TF.</p>
            <p>&#8226; The TF-targets have a similar (covariant) response to the TF but may show a treatment dependent delay.</p>
            <p>&#8226; Genes can be part of more than one regulon so any treatments in which the TF does not respond are also not informative.</p>
            <p>&#8226; Because not all genes are transcribed at the same time, the sought TF-targets will have higher variance than the bulk of genes.</p>
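            <p>The difference between ranking by correlation and ranking by covariance can be illustrated with a minimal sketch. The three profiles below are hypothetical values chosen for illustration, not data from this study, and the helper functions are plain textbook definitions rather than the CERMT implementation.</p>
            <p>
```python
from statistics import mean


def covariance(a, b):
    """Population covariance of two equally long sequences."""
    ma, mb = mean(a), mean(b)
    return sum((x - ma) * (y - mb) for x, y in zip(a, b)) / len(a)


def correlation(a, b):
    """Pearson correlation; insensitive to the scale of either profile."""
    return covariance(a, b) / (covariance(a, a) * covariance(b, b)) ** 0.5


# Hypothetical expression profiles over five time points (illustration only).
tf = [0.0, 1.0, 2.0, 1.0, 0.0]
strong_responder = [0.0, 2.0, 4.0, 2.0, 0.0]   # same shape, large amplitude
weak_responder = [0.0, 0.01, 0.02, 0.01, 0.0]  # same shape, near-noise amplitude

corr_strong = correlation(tf, strong_responder)
corr_weak = correlation(tf, weak_responder)
cov_strong = covariance(tf, strong_responder)
cov_weak = covariance(tf, weak_responder)

print(corr_strong, corr_weak)  # correlation cannot separate the two genes
print(cov_strong, cov_weak)    # covariance favours the strong responder
```
            </p>
            <p>Both genes have the same shape as the TF, so correlation scores them equally, whereas covariance ranks the gene with the larger response first. This is the behaviour the last assumption above relies on.</p>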
            <p>Given a set of gene expression time series and a TF of interest, the output of the proposed method is a cluster of co-expressed genes that, given the assumptions above, look like they are controlled by the TF of interest. Because the cluster is directly associated with a known TF, we will instead refer to it as a predicted <it>regulon</it>. Figure <figr fid="F1">1</figr> shows a flow scheme of the proposed algorithm, and below we outline the main strategies.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>A flowchart of the CERMT algorithm</p>
               </caption>
               <text>
                  <p><b>A flowchart of the CERMT algorithm</b>. The input is a set of microarray time series for several treatments and a transcription factor (TF) of interest. First, the treatments in which the TF does not respond are removed. Then a pair of treatments is selected for which the same genes are highly covariant with the TF. The remaining treatments are then searched and added or discarded depending on a goodness-of-fit test. Finally, the genes are ordered by their covariance with the (possibly time-shifted) TF in the selected treatments, and a cut-off for this list is estimated via the Gap statistic.</p>
               </text>
               <graphic file="1471-2105-8-454-1"/>
            </fig>
            <sec>
               <st>
                  <p>Method outline</p>
               </st>
               <p>The method we suggest predicts regulons by first removing the time series in which the TF does not respond. We do this by only including treatments in which the TF exceeds two thresholds: one on the overall maximum response of the TF, and one on the maximum difference between the TF in the stressed plant and the same TF under control conditions. The latter is necessary to account for the extensive diurnal expression changes of <it>Arabidopsis </it><abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
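               <p>As a rough sketch of this filtering step, the snippet below keeps a treatment only if the TF passes both thresholds. The function name, the cut-off values, the example profiles and the exact definition of "overall maximum response" (taken here as the largest absolute deviation from the first time point) are all assumptions made for illustration and are not taken from the method itself.</p>
               <p>
```python
def tf_responds(stress, control, min_response, min_difference):
    """Return True if the TF passes both response thresholds in one treatment.

    stress, control: TF expression at matching time points (e.g. log scale).
    min_response, min_difference: hypothetical cut-offs, chosen by the user.
    """
    # Overall response (assumed definition): largest absolute deviation
    # of the stressed series from its first time point.
    overall = max(abs(v - stress[0]) for v in stress)
    # Treatment effect: largest absolute stress-versus-control difference,
    # which discounts diurnal changes shared by both series.
    versus_control = max(abs(s - c) for s, c in zip(stress, control))
    return overall >= min_response and versus_control >= min_difference


# Hypothetical TF profiles as (stressed, control) pairs per treatment.
treatments = {
    "cold": ([0.0, 0.5, 2.0, 3.0], [0.0, 0.4, 0.2, 0.1]),
    "diurnal_only": ([0.0, 1.5, 2.0, 0.5], [0.0, 1.4, 2.1, 0.6]),
    "unresponsive": ([0.0, 0.1, 0.0, 0.1], [0.0, 0.0, 0.1, 0.1]),
}

selected = [
    name
    for name, (stress, control) in treatments.items()
    if tf_responds(stress, control, min_response=1.0, min_difference=1.0)
]
print(selected)
```
               </p>
               <p>In this toy example the "diurnal_only" treatment changes strongly over time but tracks its control, so it is rejected by the second threshold even though it passes the first.</p>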
               <p>The remaining treatments are organized in a three dimensional expression matrix, <it>X </it>= <it>x</it><sub><it>i</it>, <it>j</it>, <it>k</it></sub>, with measurements from <it>n</it><sub><it>t </it></sub>different time points (<it>i</it>), <it>n</it><sub><it>g </it></sub>genes (<it>j</it>) and <it>n</it><sub><it>p </it></sub>different treatments (<it>k</it>). Following the stipulated assumptions, we rank the genes according to how strongly associated they are with the TF, by seeking a set of treatments, <it>m</it>, and corresponding lags, <it>l</it>, for which the covariance</p>
               <p>
                  <display-formula id="M1">
                     <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i1">
                        <m:semantics>
                           <m:mrow>
                              <m:msub>
                                 <m:mi>c</m:mi>
                                 <m:mrow>
                                    <m:mi>j</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>k</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:msub>
                                       <m:mi>l</m:mi>
                                       <m:mi>k</m:mi>
                                    </m:msub>
                                 </m:mrow>
                              </m:msub>
                              <m:mo>=</m:mo>
                              <m:mstyle displaystyle="true">
                                 <m:munderover>
                                    <m:mo>&#8721;</m:mo>
                                    <m:mrow>
                                       <m:mi>i</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>n</m:mi>
                                          <m:mi>t</m:mi>
                                       </m:msub>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msub>
                                          <m:mi>l</m:mi>
                                          <m:mi>k</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:munderover>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:msub>
                                             <m:mi>y</m:mi>
                                             <m:mrow>
                                                <m:mi>i</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mi>k</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mover accent="true">
                                                <m:mi>y</m:mi>
                                                <m:mo>&#175;</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>k</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:msub>
                                                   <m:mi>l</m:mi>
                                                   <m:mi>k</m:mi>
                                                </m:msub>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:msub>
                                             <m:mi>x</m:mi>
                                             <m:mrow>
                                                <m:mi>i</m:mi>
                                                <m:mo>+</m:mo>
                                                <m:msub>
                                                   <m:mi>l</m:mi>
                                                   <m:mi>k</m:mi>
                                                </m:msub>
                                                <m:mo>,</m:mo>
                                                <m:mi>j</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mi>k</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mover accent="true">
                                                <m:mi>x</m:mi>
                                                <m:mo>&#175;</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>j</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mi>k</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:msub>
                                                   <m:mi>l</m:mi>
                                                   <m:mi>k</m:mi>
                                                </m:msub>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>n</m:mi>
                                             <m:mi>t</m:mi>
                                          </m:msub>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mi>l</m:mi>
                                             <m:mi>k</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:mstyle>
                              <m:mo>,</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4yam2aaSbaaSqaaiabdQgaQjabcYcaSiabdUgaRjabcYcaSiabdYgaSnaaBaaameaacqWGRbWAaeqaaaWcbeaakiabg2da9maaqahabaqcfa4aaSaaaeaacqGGOaakcqWG5bqEdaWgaaqaaiabdMgaPjabcYcaSiabdUgaRbqabaGaeyOeI0IafmyEaKNbaebadaWgaaqaaiabdUgaRjabcYcaSiabdYgaSnaaBaaabaGaem4AaSgabeaaaeqaaiabcMcaPiabcIcaOiabdIha4naaBaaabaGaemyAaKMaey4kaSIaemiBaW2aaSbaaeaacqWGRbWAaeqaaiabcYcaSiabdQgaQjabcYcaSiabdUgaRbqabaGaeyOeI0IafmiEaGNbaebadaWgaaqaaiabdQgaQjabcYcaSiabdUgaRjabcYcaSiabdYgaSnaaBaaabaGaem4AaSgabeaaaeqaaiabcMcaPaqaaiabd6gaUnaaBaaabaGaemiDaqhabeaacqGHsislcqWGSbaBdaWgaaqaaiabdUgaRbqabaaaaaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaemOBa42aaSbaaWqaaiabdsha0bqabaWccqGHsislcqWGSbaBdaWgaaadbaGaem4AaSgabeaaa0GaeyyeIuoakiabcYcaSaaa@7073@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>is maximized for the genes in the sought regulon in all <it>k </it>&#8712; <it>m</it>. In (1), <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i2"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>x</m:mi><m:mo>&#175;</m:mo></m:mover><m:mrow><m:mi>j</m:mi><m:mo>,</m:mo><m:mi>k</m:mi><m:mo>,</m:mo><m:msub><m:mi>l</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmiEaGNbaebadaWgaaWcbaGaemOAaOMaeiilaWIaem4AaSMaeiilaWIaemiBaW2aaSbaaWqaaiabdUgaRbqabaaaleqaaaaa@3504@</m:annotation></m:semantics></m:math></inline-formula> is the average expression of gene <it>j </it>in treatment <it>k </it>shifted backward by <it>l</it><sub><it>k </it></sub>time points and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i3"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>y</m:mi><m:mo>&#175;</m:mo></m:mover><m:mrow><m:mi>k</m:mi><m:mo>,</m:mo><m:msub><m:mi>l</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmyEaKNbaebadaWgaaWcbaGaem4AaSMaeiilaWIaemiBaW2aaSbaaWqaaiabdUgaRbqabaaaleqaaaaa@32C9@</m:annotation></m:semantics></m:math></inline-formula> is the average expression of the TF in treatment <it>k </it>truncated by <it>l</it><sub><it>k </it></sub>time points.</p>
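               <p>Equation (1) for a single gene, treatment and lag can be sketched in a few lines. The function names and the example profiles are illustrative assumptions, and the helper that scans candidate lags is only a simple stand-in for demonstration, not the treatment and lag search used by the method.</p>
               <p>
```python
def lagged_covariance(tf, gene, lag):
    """Covariance between a TF profile and a gene profile delayed by `lag`.

    tf, gene: expression at the same n_t time points within one treatment.
    lag: number of time points by which the gene trails the TF.
    """
    if len(tf) != len(gene):
        raise ValueError("profiles must have the same number of time points")
    if lag not in range(len(tf)):
        raise ValueError("lag must be an integer between 0 and n_t - 1")
    n = len(tf) - lag
    y = tf[:n]      # TF truncated by `lag` time points at the end
    x = gene[lag:]  # gene shifted so that x[i] is paired with y[i]
    y_mean = sum(y) / n
    x_mean = sum(x) / n
    return sum((a - y_mean) * (b - x_mean) for a, b in zip(y, x)) / n


def best_lag(tf, gene, max_lag):
    """Illustrative helper: the lag in 0..max_lag with the highest covariance."""
    return max(range(max_lag + 1), key=lambda lag: lagged_covariance(tf, gene, lag))


# Hypothetical profiles: the target repeats the TF pattern one time point later.
tf_profile = [0.0, 1.0, 3.0, 1.0, 0.0, 0.0]
target_profile = [0.0, 0.0, 1.0, 3.0, 1.0, 0.0]

for lag in range(3):
    print(lag, lagged_covariance(tf_profile, target_profile, lag))
print(best_lag(tf_profile, target_profile, max_lag=2))
```
               </p>
               <p>Note that both means are computed over the overlapping window only, matching the truncated and shifted averages defined above, and that the sum is divided by the number of overlapping time points.</p>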
               <p>Finding the optimal solutions for <it>m </it>and <it>l </it>is difficult as it would require knowledge about the identity of at least some of the true targets. This information is unavailable in our setting and therefore we design the following greedy heuristic. Assuming that the regulon is large enough and under control of the TF in at least two treatments, <it>m</it><sub>1 </sub>and <it>m</it><sub>2</sub>, after time lags <it>l</it><sub>1 </sub>and <it>l</it><sub>2</sub>, then the product of the two corresponding covariance vectors will be high for a good pair of treatments and lags. The product of the covariance vectors from two lagged treatments is defined as:</p>
               <p>
                  <display-formula id="M2">
                     <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i4">
                        <m:semantics>
                           <m:mrow>
                              <m:mi>C</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>k</m:mi>
                                 <m:mn>1</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>k</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>l</m:mi>
                                 <m:mn>1</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>l</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:mstyle displaystyle="true">
                                 <m:munderover>
                                    <m:mo>&#8721;</m:mo>
                                    <m:mrow>
                                       <m:mi>j</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>n</m:mi>
                                          <m:mi>g</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:munderover>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>c</m:mi>
                                       <m:mrow>
                                          <m:mi>j</m:mi>
                                          <m:mo>,</m:mo>
                                          <m:msub>
                                             <m:mi>k</m:mi>
                                             <m:mn>1</m:mn>
                                          </m:msub>
                                          <m:mo>,</m:mo>
                                          <m:msub>
                                             <m:mi>l</m:mi>
                                             <m:mn>1</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo>&#215;</m:mo>
                                    <m:msub>
                                       <m:mi>c</m:mi>
                                       <m:mrow>
                                          <m:mi>j</m:mi>
                                          <m:mo>,</m:mo>
                                          <m:msub>
                                             <m:mi>k</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msub>
                                          <m:mo>,</m:mo>
                                          <m:msub>
                                             <m:mi>l</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msub>
                                       </m:mrow>
                                    </m:msub>
                                 </m:mrow>
                              </m:mstyle>
                              <m:mo>.</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4qamKaeiikaGIaem4AaS2aaSbaaSqaaiabigdaXaqabaGccqGGSaalcqWGRbWAdaWgaaWcbaGaeGymaedabeaakiabcYcaSiabdYgaSnaaBaaaleaacqaIXaqmaeqaaOGaeiilaWIaemiBaW2aaSbaaSqaaiabikdaYaqabaGccqGGPaqkcqGH9aqpdaaeWbqaaiabdogaJnaaBaaaleaacqWGQbGAcqGGSaalcqWGRbWAdaWgaaadbaGaeGymaedabeaaliabcYcaSiabdYgaSnaaBaaameaacqaIXaqmaeqaaaWcbeaakiabgEna0kabdogaJnaaBaaaleaacqWGQbGAcqGGSaalcqWGRbWAdaWgaaadbaGaeGOmaidabeaaliabcYcaSiabdYgaSnaaBaaameaacqaIYaGmaeqaaaWcbeaaaeaacqWGQbGAcqGH9aqpcqaIXaqmaeaacqWGUbGBdaWgaaadbaGaem4zaCgabeaaa0GaeyyeIuoakiabc6caUaaa@5B78@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>If we only consider a relatively small number of possible lags, we can set the seed pair of treatments to:</p>
               <p>
                  <display-formula id="M3">
                     <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i5">
                        <m:semantics>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>m</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mn>1</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>m</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mn>2</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>l</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mn>1</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mover accent="true">
                                    <m:mi>l</m:mi>
                                    <m:mo>^</m:mo>
                                 </m:mover>
                                 <m:mn>2</m:mn>
                              </m:msub>
                              <m:mo>}</m:mo>
                              <m:mo>=</m:mo>
                              <m:mi>arg</m:mi>
                              <m:mo>&#8289;</m:mo>
                              <m:munder>
                                 <m:mrow>
                                    <m:mi>max</m:mi>
                                    <m:mo>&#8289;</m:mo>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>m</m:mi>
                                       <m:mn>1</m:mn>
                                    </m:msub>
                                    <m:mo>,</m:mo>
                                    <m:msub>
                                       <m:mi>m</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msub>
                                    <m:mo>,</m:mo>
                                    <m:msub>
                                       <m:mi>l</m:mi>
                                       <m:mn>1</m:mn>
                                    </m:msub>
                                    <m:mo>,</m:mo>
                                    <m:msub>
                                       <m:mi>l</m:mi>
                                       <m:mn>2</m:mn>
                                    </m:msub>
                                 </m:mrow>
                              </m:munder>
                              <m:mi>C</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:msub>
                                 <m:mi>m</m:mi>
                                 <m:mn>1</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>m</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>l</m:mi>
                                 <m:mn>1</m:mn>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>l</m:mi>
                                 <m:mn>2</m:mn>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaei4EaSNafmyBa0MbaKaadaWgaaWcbaGaeGymaedabeaakiabcYcaSiqbd2gaTzaajaWaaSbaaSqaaiabikdaYaqabaGccqGGSaalcuWGSbaBgaqcamaaBaaaleaacqaIXaqmaeqaaOGaeiilaWIafmiBaWMbaKaadaWgaaWcbaGaeGOmaidabeaakiabc2ha9jabg2da9iGbcggaHjabckhaYjabcEgaNnaaxababaGagiyBa0MaeiyyaeMaeiiEaGhaleaacqWGTbqBdaWgaaadbaGaeGymaedabeaaliabcYcaSiabd2gaTnaaBaaameaacqaIYaGmaeqaaSGaeiilaWIaemiBaW2aaSbaaWqaaiabigdaXaqabaWccqGGSaalcqWGSbaBdaWgaaadbaGaeGOmaidabeaaaSqabaGccqWGdbWqcqGGOaakcqWGTbqBdaWgaaWcbaGaeGymaedabeaakiabcYcaSiabd2gaTnaaBaaaleaacqaIYaGmaeqaaOGaeiilaWIaemiBaW2aaSbaaSqaaiabigdaXaqabaGccqGGSaalcqWGSbaBdaWgaaWcbaGaeGOmaidabeaakiabcMcaPaaa@61F3@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>by exhaustively trying all pairs of treatments and lags. Figure <figr fid="F2">2</figr> exemplifies the idea. The sought regulon co-varies with its TF in two treatments at a certain time shift and causes (2) to reach its maximum for these.</p>
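               <p>The exhaustive search for the seed pair can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the function names <it>lagged_cov </it>and <it>seed_pair</it>, the dictionary-based data layout and the use of the unbiased sample covariance as a stand-in for (1) are all assumptions made for the example.</p>
               <p><![CDATA[
```python
import itertools
import numpy as np


def lagged_cov(genes, tf, lag):
    """Covariance of every gene with the TF for one treatment and one lag.

    genes: array of shape (n_genes, n_timepoints)
    tf:    array of shape (n_timepoints,)
    Assumption: genes are shifted backward by `lag` points, the TF is
    truncated by `lag` points, and the unbiased sample covariance is used.
    """
    n_t = tf.shape[0]
    x = genes[:, lag:]        # gene profiles, shifted backward by `lag`
    y = tf[: n_t - lag]       # TF profile, truncated by `lag`
    xc = x - x.mean(axis=1, keepdims=True)
    yc = y - y.mean()
    return xc @ yc / (y.shape[0] - 1)


def seed_pair(data, tf_by_treatment, max_lag):
    """Try all treatment pairs and lags; return the combination maximising (2).

    data:            dict treatment -> (n_genes, n_timepoints) array
    tf_by_treatment: dict treatment -> (n_timepoints,) array
    """
    lags = range(max_lag + 1)
    # Pre-compute one covariance vector per (treatment, lag).
    cov = {(k, l): lagged_cov(data[k], tf_by_treatment[k], l)
           for k in data for l in lags}
    best, best_score = None, -np.inf
    for k1, k2 in itertools.combinations(data, 2):
        for l1, l2 in itertools.product(lags, repeat=2):
            # Sum over genes of the product of the two covariances.
            score = float(cov[(k1, l1)] @ cov[(k2, l2)])
            if score > best_score:
                best, best_score = (k1, k2, l1, l2), score
    return best, best_score
```
]]></p>
               <p>Because the covariance vectors are computed once per treatment and lag, the cost of the search is dominated by the number of treatment pairs times the squared number of lags, which stays small when only a few lags are considered.</p>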
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>The covariance between a simulated transcription factor (TF) and all other genes in two different treatments</p>
                  </caption>
                  <text>
                     <p><b>The covariance between a simulated transcription factor (TF) and all other genes in two different treatments</b>. With no time shift (left panel), the true regulon (red points) has low covariance with the TF in both treatments. When the expression of the TF has been shifted forward by the correct number of time points (right panel), it becomes highly covariant with its regulon and (2) increases.</p>
                  </text>
                  <graphic file="1471-2105-8-454-2"/>
               </fig>
               <p>Once the best pair has been found we create a summary of the two treatments by concatenating the measurement vectors to obtain a pseudo treatment with 2<it>n</it><sub><it>t </it></sub>- <it>l</it><sub>1 </sub>- <it>l</it><sub>2 </sub>time points and again calculating the covariance between the TF and the rest of the genes according to (1).</p>
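               <p>A minimal sketch of this concatenation step is given below. The helper names <it>pseudo_treatment </it>and <it>covariance_with_tf </it>are hypothetical, and centering the concatenated series as a whole (rather than each treatment separately) is an assumption of the example, not something stated in the text.</p>
               <p><![CDATA[
```python
import numpy as np


def pseudo_treatment(genes1, tf1, l1, genes2, tf2, l2):
    """Concatenate two lag-aligned treatments into one pseudo treatment.

    Genes are shifted backward and the TF truncated by the chosen lag in
    each treatment, so the result has 2*n_t - l1 - l2 time points when both
    treatments have n_t time points.
    """
    n1, n2 = tf1.shape[0], tf2.shape[0]
    genes = np.hstack([genes1[:, l1:], genes2[:, l2:]])
    tf = np.concatenate([tf1[: n1 - l1], tf2[: n2 - l2]])
    return genes, tf


def covariance_with_tf(genes, tf):
    """Sample covariance of each gene (row) with the TF profile."""
    gc = genes - genes.mean(axis=1, keepdims=True)
    return gc @ (tf - tf.mean()) / (tf.shape[0] - 1)
```
]]></p>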
               <p>By ordering the genes after their covariance with the TF we assume that the proposed regulator is an inducer. If desired (see Section 'Target accuracy and input specificity'), repressed targets can be extracted by reversing the search order, i.e. by replacing <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i6"><m:semantics><m:mrow><m:msub><m:mi>c</m:mi><m:mrow><m:mi>j</m:mi><m:mo>,</m:mo><m:mi>k</m:mi><m:mo>,</m:mo><m:msub><m:mi>l</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4yam2aaSbaaSqaaiabdQgaQjabcYcaSiabdUgaRjabcYcaSiabdYgaSnaaBaaameaacqWGRbWAaeqaaaWcbeaaaaa@34C2@</m:annotation></m:semantics></m:math></inline-formula> with -<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i6"><m:semantics><m:mrow><m:msub><m:mi>c</m:mi><m:mrow><m:mi>j</m:mi><m:mo>,</m:mo><m:mi>k</m:mi><m:mo>,</m:mo><m:msub><m:mi>l</m:mi><m:mi>k</m:mi></m:msub></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaem4yam2aaSbaaSqaaiabdQgaQjabcYcaSiabdUgaRjabcYcaSiabdYgaSnaaBaaameaacqWGRbWAaeqaaaWcbeaaaaa@34C2@</m:annotation></m:semantics></m:math></inline-formula>. This does not affect the initial search for a good pair of treatments and lags.</p>
               <p>In order to investigate if there are more treatments for which (1) is high for the same genes, we order the remaining treatments according to (2) by setting <it>m</it><sub>1 </sub>to the artificial pseudo treatment. If a cross-validation based goodness-of-fit test suggests that the next treatment is useful, it is added and the procedure is reiterated.</p>
               <p>By selecting treatments and lags we construct a pseudo-time series in which the top-ranked genes have high covariance with the TF. This would be true even if all of the expression data were completely independent of the TF. Therefore, we must investigate how likely it is to observe regulons of the same quality from randomized data. We did this by adapting the Gap statistic described by Hastie et al. <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> to our problem. The Gap statistic is beneficial as it both provides an estimate of the statistical quality of the proposed regulon, and simultaneously recommends the best number of genes to extract.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Comparisons with other approaches</p>
            </st>
            <p>Conceptually, our approach differs from other co-expression based methods in that it aims to directly associate a known TF with target genes. As it does not require extensive prior knowledge (i.e. ChIP-chip data), it also differs from a recently described method <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> to identify target genes for known TFs. Methodologically, our method is characterized by two main aspects. Firstly, it incorporates the <it>a priori </it>assumption that true TF-targets have higher variance than the bulk of the represented genes by using the covariance instead of correlation. Secondly, it performs a selection of treatments and time shifts to increase overlap between the TF and putative targets. In order to investigate the importance of both of these components we assessed the performance of both the full CERMT approach and a reduced version, CERMT-0, which always uses all treatments without considering any time shifts.</p>
            <p>By choosing a different initial pair of treatments, for example by excluding treatments that are already suspected to be irrelevant, it is possible to obtain different regulons. For simplicity, we restrict ourselves in this study to considering only one regulon for each TF.</p>
            <p>The standard work-horse for detecting co-expression is the Pearson correlation and we therefore compare our results to just concatenating all time series and ranking genes by their correlation with the TF. Here, we refer to this approach as 'Cor'. However, considering the timescale, limited number of samples and relatively controlled conditions, the AtGenExpress dataset may not allow for the best comparison with a correlation based method. We therefore chose ACT <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> to provide a more stringent comparison; like other co-expression tools <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>, it uses a highly diverse dataset from hundreds of steady state conditions. Repressed targets were sought by ordering the genes by their negative correlation.</p>
            <p>To measure performance, we first looked at the threshold-free AUC statistic (area under the ROC curve), but this performed poorly with such grossly different sample sizes (false targets vs. true targets). Therefore, we simply counted the number of true targets among the top 100 predicted genes, thereby also allowing us to assess the ability of the method to return true targets at the very top of the gene list; an important aspect as experimental validation rapidly becomes infeasible with the number of predicted genes.</p>
         </sec>
         <sec>
            <st>
               <p>Simulated data</p>
            </st>
            <p>We compared CERMT with its reduced version, CERMT-0, on 100 simulated data sets that contained 10000 genes, six different treatments and seven time points. In three of the treatments, a regulon of varying size was added that followed the pattern of the TF directly or lagged by either 1 or 2 time points. Figure <figr fid="F3">3</figr> shows boxplots of the percentage of the true positives that were found in the top 2<it>n </it>genes, where <it>n </it>equals the size of the planted regulon. The performance of CERMT-0 is high if there is no time lag, but, not surprisingly, very poor if we plant a delay. The full CERMT approach performs poorly if the sought regulon is too small as the 'right' treatment pair becomes increasingly difficult to find with decreasing regulon size. On the other hand, for sufficiently large regulons, CERMT shows good performance regardless of whether the response is delayed or not. The size of the smallest detectable regulon decreases with the number of time points in the experiment (data not shown). Note that although the limit for the minimum regulon size in the simulation seems to be around 50 genes, this estimate is strongly data specific and is not transferable to performance on real data.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Boxplot of method performance using simulated data</p>
               </caption>
               <text>
                  <p><b>Boxplot of method performance using simulated data</b>. Performance was measured as the percentage of the recovered genes (100 &#215; True positives/<it>n</it>) in the top 2<it>n </it>predicted genes where <it>n </it>equals the size of the planted regulon. The simplistic method CERMT-0 does not consider any time lag, makes no selection of treatments and is therefore robust against the size of the planted regulon but for the same reason also fails if there exists a time lag between the TF and its targets. CERMT on the other hand performs poorly if the planted regulon is small, but it is robust against the presence of time lags.</p>
               </text>
               <graphic file="1471-2105-8-454-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Comparison with over-expression and knock-out experiments using the AtGenExpress dataset</p>
            </st>
            <p>The abiotic stress series from the AtGenExpress project <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> was selected for evaluation of the proposed method. This large data set uses the standardized Affymetrix platform and was all generated in parallel through the coordinated work of five different research groups. It consists of data from root and shoot of 16-day-old <it>Arabidopsis </it>seedlings exposed to nine different abiotic stresses as well as control conditions, giving a total of 20 different usable time series (loosely referred to as 'treatments'). The seven time points commonly measured in all treatments were selected from each experiment to arrive at an expression matrix with 140 samples. These time points were 0, 0.5, 1, 3, 6, 12 and 24 hours, which approximates a log timescale. For this data, log scaled sampling seems preferable: we noted that a log-linear model yields higher absolute <it>t</it>-values than a linear model, which indicates that a majority of the genes have a more linear response in log-time than in linear time. Non-linear sampling could otherwise have detrimental effects on any time shifting attempts. Considering the response of plant gene expression to various perturbations (e.g. <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>), it seems reasonable to assume that this timescale will reveal at least some of the biologically relevant TF-target relationships.</p>
            <p>CERMT depends on having expression estimates at the same time points in all treatments. Datasets that do not fulfil this requirement can still be used if common time points are first interpolated, preferably with a sophisticated interpolation strategy such as that proposed by Bar-Joseph et al. <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B35">35</abbr></abbrgrp>. Relying solely on gene expression data, we can only expect to extract TF-targets for TFs that actually respond to the applied treatments. Therefore, we conducted a literature study specifically to find experiments investigating the targets of abiotic stress related TFs either by over-expression, knock-out or ChIP-chip experiments, see Additional file <supplr sid="S1">1</supplr>. Also, in the cases where we could find known motifs for TFs <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>, we extracted all genes with those motifs in their 500-base upstream regions, regarding that gene list as a set of 'targets' to predict. We expect the experimentally obtained lists of TF &#8211; target relationships to contain many erroneous findings. For example, as many TFs regulate other TFs, it has been pointed out that the effects of constitutive TF mutants, as defined by expression profiling, will likely include those of the regulated TFs <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Despite this, we feel this TF target practice is a useful exercise. Firstly, only some of the experimentally predicted targets need to be <it>true </it>targets in order for these lists to be useful for the comparison of methods. Secondly, the targets predicted by the method could help in reducing the number of false-positives in experimental predictions. Thirdly, even if all genes represent indirect effects of regulated downstream TFs, these effects may relate to the overall biological function of the candidate TF and so their prediction could still be useful. 
Previously, a simplified version of the second of these arguments was used to restrict gene lists obtained by over-expressing cold-responsive TFs by also requiring the genes to be cold-responsive <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. To maximize the number of TF datasets against which we could assess our method, we somewhat loosened our selection from considering only fully transcriptionally regulated individual TFs. We included MBF1c, a transcriptional co-activator <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>; the functionally paired MYC2/MYB2 TFs <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and HSFA1a/HSFA1b <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>; the functionally redundant CBF1-3 TFs <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and the post-translationally regulated TFs DREB2A <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> and AREB1 <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. All TFs considered as pairs/functionally redundant showed very similar expression patterns (data not shown). The inclusion of DREB2A and AREB1 was motivated by their demonstrated parallel transcriptional and post-translational stress regulation <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>, which should allow their transcription to act as a proxy for their direct target regulation. Their inclusion could therefore validate wider use of the proposed method beyond strictly transcriptionally controlled TFs. Therefore, we do not expect these lists to wholly represent direct TF-target genes, but considering these down-stream effects, there is still utility in their prediction.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Literature study of transcription factors and their targets considered in this study. We focused on TFs that have previously been implicated in stress responses, but as not all respond under the conditions used to generate the AtGenExpress dataset, some were therefore excluded from our test set. Wherever multiple genes were knocked out or over-expressed, or where functional redundancy has been implicated, the average of those genes was used.</p>
               </text>
               <file name="1471-2105-8-454-S1.csv">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <sec>
               <st>
                  <p>Target accuracy and input specificity</p>
               </st>
               <p>An important property of a prediction method is that it shows specificity towards the input TF, i.e. the real TF should be a better input for enriching its own targets than a random TF. To measure this, we ran the algorithm on all 1484 genes in our data set annotated to bin 27.3, 'RNA.regulation of transcription', by the MapMan project and computed an empirical specificity <it>P</it>-value, <it>P</it><sub><it>spec</it></sub>, as the number of random genes that gave the same or better enrichments than the 'true' TF divided by the total number of tested TFs. Known TF families comprise approximately 90% of this annotation bin with the remaining 10% being putative TFs or other regulatory proteins. As our method is not specific to TFs but can also apply to other genes that regulate transcription (e.g. MBF1c), this bin is useful for our purposes; in the worst case, however, <it>P</it><sub><it>spec </it></sub>will have an error of up to 10%. Note that to calculate <it>P</it><sub><it>spec </it></sub>one has to know the true regulon and it is therefore only applicable for validation purposes.</p>
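               <p>The calculation of the empirical specificity value can be sketched as follows. The function names are hypothetical, and measuring 'enrichment' as the number of known targets among the top-ranked genes is an assumption made for the example; whether the true TF itself is counted among the tested regulators is left to the caller.</p>
               <p><![CDATA[
```python
def hits_in_top(ranked_genes, true_targets, top=100):
    """Count known targets among the first `top` genes of a ranked list."""
    return len(set(ranked_genes[:top]) & set(true_targets))


def empirical_specificity(true_tf_hits, candidate_hits):
    """Fraction of tested regulators that do at least as well as the true TF.

    true_tf_hits:   enrichment score obtained with the true TF as input
    candidate_hits: enrichment scores obtained with each tested regulator
                    (assumed here to be hit counts; any score where larger
                    means better would work the same way)
    """
    n_as_good = sum(1 for h in candidate_hits if h >= true_tf_hits)
    return n_as_good / len(candidate_hits)
```
]]></p>
               <p>A small value means that few other regulators recover the target pool as well as the true TF does, which is the behaviour expected of an input-specific method.</p>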
               <p>The upper part of Table <tblr tid="T1">1</tblr> shows the performance of the proposed method along with that of the simplistic methods and the conventional co-expression database ACT. In general, it seems that co-expression methods can be used to identify experimental targets of a given TF. The covariance based methods CERMT-0 and CERMT perform better than the others in most cases and CERMT has the overall best performance, although, with the exception of the CBF regulon, the performance increase compared to CERMT-0 is admittedly modest. In half of the cases, the ratio of true targets is sufficient (12&#8211;59%) to deliver a usable number of high-confidence target genes. ACT is the only method that ever substantially outperforms the proposed method. Interestingly, it does so for PAP1, for which the other methods perform poorly. This indicates that these targets are better found in a larger, mostly steady state correlation dataset and that the proposed method can be complementary to existing methods. <it>P</it><sub><it>spec </it></sub>follows the over-representation significance as expected, and is generally not different between the covariance and correlation based methods. Despite this, covariance appears to be inclined towards finding genes that are really regulated rather than noise genes that happen to have the same expression trajectory. Presumably, the benefit of using covariance over correlation will decrease rapidly with increasing number of studied time points. For MBF1c, ZAT12 and CBF, the original studies also reported target genes that were suspected to be repressed by the corresponding TF. The lower part of Table <tblr tid="T1">1</tblr> shows the predictions for these target pools, assuming that the TF is a repressor. CERMT finds a significant number of repressed targets in the top 100 extracted genes for CBF and ZAT12. 
Significant overlap with the repressed targets of MBF1c was only found in the top 200 predicted targets (4 hits, <it>P </it>= 0.006). However, as the Gap statistic and the corresponding recommended cluster size indicate, the statistical significance of the extracted regulons is not convincing. Furthermore, the <it>P</it><sub><it>spec </it></sub>indicates that the found targets are unspecific to their corresponding TF. It is difficult to assess the utility of the method for extracting repressed genes with only three examples; hence, CERMT should be used with caution when searching for repressed targets. Repression of transcription is only visible using microarrays after subsequent degradation of available mRNA, and the observed response could be less coordinated among a regulon than during induction as degradation rate varies between different mRNAs. The time scale is also likely to be quite different compared to that of induced responses. Thus, searching for repression using expression data could very well be more difficult than searching for induced targets.</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>Performance on real data</p>
                  </caption>
                  <tblbdy cols="15">
                     <r>
                        <c ca="left">
                           <p>
                              <b>TF</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>Target pool</b>
                           </p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>
                              <b>Cor</b>
                           </p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>
                              <b>ACT</b>
                           </p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>
                              <b>CERMT-0</b>
                           </p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>
                              <b>CERMT</b>
                           </p>
                        </c>
                        <c cspan="3" ca="center">
                           <p>
                              <b>CERMT Diagnostics</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>Source</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>E</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>Size</p>
                        </c>
                        <c ca="center">
                           <p>Hits<sup>100</sup></p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>P</it>
                              <sub>
                                 <it>Spec</it>
                              </sub>
                           </p>
                        </c>
                        <c ca="center">
                           <p>Hits<sup>100</sup></p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>P</it>
                              <sub>
                                 <it>Spec</it>
                              </sub>
                           </p>
                        </c>
                        <c ca="center">
                           <p>Hits<sup>100</sup></p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>P</it>
                              <sub>
                                 <it>Spec</it>
                              </sub>
                           </p>
                        </c>
                        <c ca="center">
                           <p>Hits<sup>100</sup></p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>P</it>
                              <sub>
                                 <it>Spec</it>
                              </sub>
                           </p>
                        </c>
                        <c ca="center">
                           <p>Size</p>
                        </c>
                        <c ca="center">
                           <p>Hits<sup>Gap</sup></p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>Gap</it>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="15">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c cspan="15" ca="center">
                           <p>
                              <it>Induced targets</it>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="15">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>AREB</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.16</p>
                        </c>
                        <c ca="center">
                           <p>28</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0.03</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>1</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.15</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>9</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.03</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>9</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.04</p>
                        </c>
                        <c ca="center">
                           <p>128</p>
                        </c>
                        <c ca="center">
                           <p>10</p>
                        </c>
                        <c ca="center">
                           <p>0.20</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>CBF</p>
                        </c>
                        <c ca="left">
                           <p>CRE</p>
                        </c>
                        <c ca="center">
                           <p>14.32</p>
                        </c>
                        <c ca="center">
                           <p>2508</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>18</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.17</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>34</p>
                        </c>
                        <c ca="center">
                           <p>0.10</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>58</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.01</p>
                        </c>
                        <c ca="center">
                           <p>86</p>
                        </c>
                        <c ca="center">
                           <p>53</p>
                        </c>
                        <c ca="center">
                           <p>0.27</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.82</p>
                        </c>
                        <c ca="center">
                           <p>143</p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>0.08</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>19</p>
                        </c>
                        <c ca="center">
                           <p>0.10</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>56</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>52</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>DREB2A</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.12</p>
                        </c>
                        <c ca="center">
                           <p>21</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>0.03</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>1</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.07</p>
                        </c>
                        <c ca="center">
                           <p>10</p>
                        </c>
                        <c ca="center">
                           <p>0.06</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>11</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.05</p>
                        </c>
                        <c ca="center">
                           <p>244</p>
                        </c>
                        <c ca="center">
                           <p>15</p>
                        </c>
                        <c ca="center">
                           <p>0.04</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>HSFA2</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.24</p>
                        </c>
                        <c ca="center">
                           <p>42</p>
                        </c>
                        <c ca="center">
                           <p>12</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>22</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>20</p>
                        </c>
                        <c ca="center">
                           <p>0.02</p>
                        </c>
                        <c ca="center">
                           <p>76</p>
                        </c>
                        <c ca="center">
                           <p>19</p>
                        </c>
                        <c ca="center">
                           <p>0.09</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>HY5</p>
                        </c>
                        <c ca="left">
                           <p>ChIP/KO</p>
                        </c>
                        <c ca="center">
                           <p>0.76</p>
                        </c>
                        <c ca="center">
                           <p>133</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>9</p>
                        </c>
                        <c ca="center">
                           <p>0.03</p>
                        </c>
                        <c ca="center">
                           <p>9</p>
                        </c>
                        <c ca="center">
                           <p>0.11</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>12</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.04</p>
                        </c>
                        <c ca="center">
                           <p>902</p>
                        </c>
                        <c ca="center">
                           <p>32</p>
                        </c>
                        <c ca="center">
                           <p>-0.19</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>KO</p>
                        </c>
                        <c ca="center">
                           <p>0.69</p>
                        </c>
                        <c ca="center">
                           <p>120</p>
                        </c>
                        <c ca="center">
                           <p>13</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>19</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>14</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>14</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>46</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>CRE</p>
                        </c>
                        <c ca="center">
                           <p>12.07</p>
                        </c>
                        <c ca="center">
                           <p>2113</p>
                        </c>
                        <c ca="center">
                           <p>22</p>
                        </c>
                        <c ca="center">
                           <p>0.11</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>24</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.05</p>
                        </c>
                        <c ca="center">
                           <p>19</p>
                        </c>
                        <c ca="center">
                           <p>0.31</p>
                        </c>
                        <c ca="center">
                           <p>19</p>
                        </c>
                        <c ca="center">
                           <p>0.30</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>153</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MBF1c</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.86</p>
                        </c>
                        <c ca="center">
                           <p>150</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>2</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.17</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>2</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.36</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>4</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.22</p>
                        </c>
                        <c ca="center">
                           <p>102</p>
                        </c>
                        <c ca="center">
                           <p>4</p>
                        </c>
                        <c ca="center">
                           <p>0.09</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MYB2/MYC2</p>
                        </c>
                        <c ca="left">
                           <p>CRE</p>
                        </c>
                        <c ca="center">
                           <p>6.75</p>
                        </c>
                        <c ca="center">
                           <p>1182</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>2</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.46</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>
                                 <it>6</it>
                              </b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.64</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>5</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.39</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>4</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.42</p>
                        </c>
                        <c ca="center">
                           <p>536</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>31</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.23</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>CRE</p>
                        </c>
                        <c ca="center">
                           <p>19.90</p>
                        </c>
                        <c ca="center">
                           <p>3485</p>
                        </c>
                        <c ca="center">
                           <p>31</p>
                        </c>
                        <c ca="center">
                           <p>0.02</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>23</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.32</p>
                        </c>
                        <c ca="center">
                           <p>30</p>
                        </c>
                        <c ca="center">
                           <p>0.07</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>32</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.05</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <it>121</it>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.15</p>
                        </c>
                        <c ca="center">
                           <p>26</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>
                                 <it>1</it>
                              </b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.26</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>
                                 <it>1</it>
                              </b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.30</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <it>2</it>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NAC019</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.08</p>
                        </c>
                        <c ca="center">
                           <p>14</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>
                                 <it>1</it>
                              </b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.13</p>
                        </c>
                        <c ca="center">
                           <p>33</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.01</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NAC055</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.05</p>
                        </c>
                        <c ca="center">
                           <p>9</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>2</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>121</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.09</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NAC072</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.13</p>
                        </c>
                        <c ca="center">
                           <p>23</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0.09</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>5</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.04</p>
                        </c>
                        <c ca="center">
                           <p>96</p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>0.25</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PAP1</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.24</p>
                        </c>
                        <c ca="center">
                           <p>42</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>1</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.08</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>8</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>27</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.28</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>ZAT12</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.79</p>
                        </c>
                        <c ca="center">
                           <p>139</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>0.08</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>2</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.17</p>
                        </c>
                        <c ca="center">
                           <p>8</p>
                        </c>
                        <c ca="center">
                           <p>0.19</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>10</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.06</p>
                        </c>
                        <c ca="center">
                           <p>536</p>
                        </c>
                        <c ca="center">
                           <p>19</p>
                        </c>
                        <c ca="center">
                           <p>0.08</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="15">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c cspan="15" ca="center">
                           <p>
                              <it>Repressed targets</it>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="15">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>CBF</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.24</p>
                        </c>
                        <c ca="center">
                           <p>43</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>2</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.17</p>
                        </c>
                        <c ca="center">
                           <p>128</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>0.09</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MBF1c</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.46</p>
                        </c>
                        <c ca="center">
                           <p>80</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>0</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.00</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>1</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.47</p>
                        </c>
                        <c ca="center">
                           <p>1000</p>
                        </c>
                        <c ca="center">
                           <p>10</p>
                        </c>
                        <c ca="center">
                           <p>-0.19</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>ZAT12</p>
                        </c>
                        <c ca="left">
                           <p>OX</p>
                        </c>
                        <c ca="center">
                           <p>0.90</p>
                        </c>
                        <c ca="center">
                           <p>158</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>1</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.18</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>0.03</p>
                        </c>
                        <c ca="center">
                           <p>7</p>
                        </c>
                        <c ca="center">
                           <p>0.16</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>11</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>0.12</p>
                        </c>
                        <c ca="center">
                           <p>1000</p>
                        </c>
                        <c ca="center">
                           <p>33</p>
                        </c>
                        <c ca="center">
                           <p>0.01</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Hit ratios based on real data for the top 100 genes associated with 12 different transcription factors (TFs). Bold entries are the highest values for that target pool; italic entries are insignificant over-representations according to Fisher's exact test (<it>p </it>&#8805; 0.05). Target pool size is the number of true targets in our expression set, <it>E </it>is the expected number of hits when picking 100 genes at random and <it>P</it><sub><it>spec </it></sub>is an empirical <it>P</it>-value indicating the probability that a random TF would give the same or better hit ratio. For CERMT, the best cluster size as indicated by the Gap statistic is shown along with the number of true targets for that regulon size. A large Gap statistic (greater than zero) indicates that the suggested regulon is significantly more related to the expression of the TF than could be expected from the expression of a shuffled TF. Target pools were defined from over-expression experiments (OX), knock-out experiments (KO), ChIP-chip experiments (ChIP) or by all genes carrying a known cis-regulatory element (CRE).</p>
                  </tblfn>
               </tbl>
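               <p>The expected number of hits <it>E </it>and the over-representation test described in the table footnote can be illustrated with a minimal Python sketch. It assumes a one-sided test based on the hypergeometric distribution and a hypothetical gene universe of 17500 genes (N_GENES); neither the universe size nor the function names are taken from the original analysis.</p>

```python
from math import comb


def expected_hits(pool_size, n_genes, n_picked=100):
    """Expected number of target-pool members among n_picked random genes."""
    return n_picked * pool_size / n_genes


def enrichment_p_value(hits, pool_size, n_genes, n_picked=100):
    """One-sided Fisher's exact test: P(X >= hits) under the hypergeometric model."""
    total = comb(n_genes, n_picked)
    upper = min(pool_size, n_picked)
    tail = sum(
        comb(pool_size, k) * comb(n_genes - pool_size, n_picked - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical universe size (an assumption, not stated in this section).
N_GENES = 17500

print(expected_hits(42, N_GENES))          # pool of 42 genes, 100 genes picked
print(enrichment_p_value(8, 42, N_GENES))  # 8 true targets among the top 100
print(enrichment_p_value(0, 42, N_GENES))  # zero hits can never be enriched
```

               <p>Under these assumed values, a target pool of 42 genes gives an expected 0.24 hits among 100 randomly picked genes, and a method that finds no true targets always receives a <it>P</it>-value of 1.</p>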
            </sec>
            <sec>
               <st>
                  <p>Statistical properties of the predicted regulons</p>
               </st>
               <p>Figure <figr fid="F4">4</figr> shows an example plot of the statistical quality of the predicted regulons for four of the examined TFs. The CBF, AREB and NAC072 regulons show convex Gap curves where the maximum indicates the best number of genes to include in the regulon. The observed Gap statistics are greater than zero, which indicates that obtaining an equally good or better regulon would be highly unlikely if the expression of the TF were independent of the rest of the genes. The HY5 regulon, on the other hand, exhibits no stronger connection to its regulon than can be expected from a randomized TF, despite the fact that it contains a significant number of true targets. This could be an effect of the low resolution of the AtGenExpress dataset and the sinusoidal expression pattern of HY5 in response to UV-B stress. Such complex patterns depend on more parameters and are consequently harder to approximate with only seven time points.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>The Gap curves for four of the examined transcription factors (TFs)</p>
                  </caption>
                  <text>
                     <p><b>The Gap curves for four of the examined transcription factors (TFs)</b>. Shown is the distance between the observed <it>R</it><sup>2 </sup>(goodness) of the predicted regulon and the 95<sup>th </sup>percentile of the null-distribution (which is not shown here). A positive Gap statistic means that the regulon is significant at the 5% significance level, and the maximum of the Gap curve indicates the best number of genes to include in the regulon. The Gap curves for CBF, NAC072 and AREB are plotted along with the curves obtained for two hundred shuffled TFs (thin lines). The shuffled TFs get mostly negative Gap statistics as they lie close to the expectation value of the null-distribution. CBF, NAC072 and AREB show very significant Gap curves; the HY5 regulon, on the other hand, does not.</p>
                  </text>
                  <graphic file="1471-2105-8-454-4"/>
               </fig>
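               <p>The Gap statistic described in the caption of Figure <figr fid="F4">4</figr>, i.e. the distance between the observed <it>R</it><sup>2 </sup>and the 95<sup>th </sup>percentile of the null-distribution, can be sketched as follows. This is a minimal illustration and not the CERMT implementation: the nearest-rank percentile, the dictionary layout and all numbers are assumptions made for the example.</p>

```python
def percentile(values, q):
    """Nearest-rank percentile (q given as an integer in 0..100)."""
    ordered = sorted(values)
    rank = max(1, -(-q * len(ordered) // 100))  # ceiling of q * n / 100
    return ordered[rank - 1]


def gap_statistic(observed_r2, null_r2):
    """Observed goodness minus the 95th percentile of the null goodness values."""
    return observed_r2 - percentile(null_r2, 95)


def best_regulon_size(observed_curve, null_curves):
    """Return the regulon size with the largest Gap statistic and that Gap.

    observed_curve: dict mapping regulon size to the observed R^2
    null_curves:    dict mapping regulon size to R^2 values from shuffled TFs
    """
    gaps = {
        size: gap_statistic(r2, null_curves[size])
        for size, r2 in observed_curve.items()
    }
    best = max(gaps, key=gaps.get)
    return best, gaps[best]


# Toy numbers (hypothetical): goodness for three candidate regulon sizes and
# twenty shuffled-TF goodness values per size.
observed = {50: 0.40, 100: 0.55, 200: 0.45}
null = {size: [i / 100 for i in range(20)] for size in observed}

print(best_regulon_size(observed, null))
```

               <p>In this toy example the Gap curve peaks at a regulon size of 100, and a positive maximum corresponds to a regulon that is significant at the 5% level.</p>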
            </sec>
            <sec>
               <st>
                  <p>CERMT can propose biologically interpretable target lists</p>
               </st>
               <p>One of the key benefits of the increasing public availability of expression data is the ability to quickly generate hypotheses on gene function. Standard co-expression analyses have yielded several insights that were experimentally validated <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. We therefore investigated the functional insight provided by the CERMT predicted CBF regulon. Remarkably, among the top seven genes there are four COR/LEA genes and one galactinol synthase. The cold-regulated (COR) genes are the defining members of the CBF-regulon as the CBF TFs were first identified through their binding to the C-repeat element present in the promoters of these genes <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Galactinol synthase catalyzes the first committed step of raffinose synthesis which is an important component of cold acclimation known to be under the control of the CBF TFs <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Overall, the predicted regulon reveals many more known CBF targets including further cold responsive COR genes, enzymes and TFs. These data clearly offer significant biological insight into the central function of the CBF TFs in controlling transcriptional and metabolic changes during cold acclimation. In addition to the predicted target lists, the information about the used treatments shown in Table <tblr tid="T2">2</tblr> can also provide useful biological insight into the function of the TF and of the predicted regulon. Several of the studied examples verify known biological information such as CBF's and ZAT12's importance for the response to cold, HY5's for UV, HSFA2's for heat and MBF1c's for heat and osmotic stress <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B39">39</abbr><abbr bid="B41">41</abbr><abbr bid="B46">46</abbr></abbrgrp>. 
Table <tblr tid="T2">2</tblr> also shows which time shifts were used for each TF along with the time shift for which the median covariance between the TF and all the genes in its regulon is maximized in the used treatments. This can be seen as a supervised 'answer' to what the algorithm is trying to predict. It is clear that a transcriptional time shift often exists for the studied regulons, which justifies one of our primary assumptions. However, the correct time lag is frequently missed by the algorithm. The reason for this becomes apparent when one compares the plots of the over-expression defined regulon for PAP1 and the first 50 genes in the predicted regulon for PAP1, see Figure <figr fid="F5">5</figr>. The difference is glaring, so it is not surprising that the true regulon is overlooked. To increase performance, it would be necessary to use additional resources rather than the gene expression data alone. One could, for example, use the information that the deep purple phenotype of the PAP1 over-expresser is due to anthocyanin accumulation and therefore consider only genes involved in flavonoid metabolism <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. When this information is combined with the <it>a priori </it>assumption that there exists a time shift, the algorithm picks out nine of the true targets in its top 100 (Fisher's exact test: <it>P </it>= 10<sup>-5</sup>). Including such additional data therefore adds one more TF to those whose hit ratio is sufficient to deliver a usable number of high-confidence target genes. This illustrates an unavoidable problem with gene expression data for TF-target prediction: there are no unique solutions. Given these data sets, however, we conclude that the true regulons can often, but far from always, be discovered with simple statistical functions, thus conceptually strengthening the approach by Beyer et al. 
<abbrgrp><abbr bid="B10">10</abbr></abbrgrp> which integrates many different techniques to boost target predictions.</p>
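               <p>The 'best' time shift, i.e. the shift that maximizes the median covariance between the TF and the genes of a known regulon, can be sketched as below. This is an illustration under stated assumptions rather than the procedure used to produce the table: the candidate shifts 0, 1 and 2, the sample covariance on the overlapping time points and the toy profiles of seven time points are all hypothetical choices.</p>

```python
from statistics import mean, median


def covariance(x, y):
    """Sample covariance of two equally long profiles."""
    mx, my = mean(x), mean(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / (len(x) - 1)


def shifted_covariance(tf, gene, shift):
    """Covariance when the gene responds `shift` time points after the TF."""
    if shift == 0:
        return covariance(tf, gene)
    return covariance(tf[:-shift], gene[shift:])


def best_shift(tf, regulon, shifts=(0, 1, 2)):
    """Return the shift with the largest median covariance and all scores."""
    scores = {
        s: median(shifted_covariance(tf, gene, s) for gene in regulon)
        for s in shifts
    }
    return max(scores, key=scores.get), scores


# Toy profiles (hypothetical): three targets that follow the TF with a delay
# of two time points, with different amplitude or baseline.
tf = [0, 1, 4, 2, 1, 0, 0]
regulon = [
    [0, 0, 0, 1, 4, 2, 1],
    [0, 0, 0, 2, 8, 4, 2],
    [1, 1, 1, 2, 5, 3, 2],
]

shift, scores = best_shift(tf, regulon)
print(shift, scores)
```

               <p>For these toy profiles the median covariance is largest at a shift of two time points. Note that larger shifts leave fewer overlapping time points, which matters for series as short as seven points.</p>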
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>The used treatments and time shifts</p>
                  </caption>
                  <tblbdy cols="3">
                     <r>
                        <c ca="left">
                           <p>Regulator</p>
                        </c>
                        <c ca="left">
                           <p>Used treatments</p>
                        </c>
                        <c ca="left">
                           <p>Used shift:Best shift</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="3">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>AREB</p>
                        </c>
                        <c ca="left">
                           <p>osmotic-S, salt-S</p>
                        </c>
                        <c ca="left">
                           <p>0:0 0:2</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>CBF (Induced)</p>
                        </c>
                        <c ca="left">
                           <p>cold-S, cold-R</p>
                        </c>
                        <c ca="left">
                           <p>2:2 2:2</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>CBF (Repressed)</p>
                        </c>
                        <c ca="left">
                           <p>cold-S, cold-R, drought-R</p>
                        </c>
                        <c ca="left">
                           <p>2:2 2:2 1:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>DREB2A</p>
                        </c>
                        <c ca="left">
                           <p>genotoxic-R, wounding-S, cold-R, cold-S, osmotic-S</p>
                        </c>
                        <c ca="left">
                           <p>1:0 0:0 2:1 2:1 0:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>HSFA2</p>
                        </c>
                        <c ca="left">
                           <p>drought-R, oxidative-R, oxidative-S, heat-S, heat-R</p>
                        </c>
                        <c ca="left">
                           <p>0:0 0:0 1:0 0:0 0:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>HY5</p>
                        </c>
                        <c ca="left">
                           <p>uvb-R, cold-S, uvb-S</p>
                        </c>
                        <c ca="left">
                           <p>0:0 1:0 0:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MBF1c (Induced)</p>
                        </c>
                        <c ca="left">
                           <p>osmotic-R, heat-S, heat-R</p>
                        </c>
                        <c ca="left">
                           <p>0:2 0:1 0:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MBF1c (Repressed)</p>
                        </c>
                        <c ca="left">
                           <p>oxidative-S, cold-R, heat-S, heat-R</p>
                        </c>
                        <c ca="left">
                           <p>1:1 0:0 0:0 0:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MYC2-MYB2</p>
                        </c>
                        <c ca="left">
                           <p>salt-R, drought-R</p>
                        </c>
                        <c ca="left">
                           <p>0:2 0:1</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NAC019</p>
                        </c>
                        <c ca="left">
                           <p>uvb-S, osmotic-R, salt-R, osmotic-S, salt-S</p>
                        </c>
                        <c ca="left">
                           <p>0:1 0:0 0:0 0:1 0:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NAC055</p>
                        </c>
                        <c ca="left">
                           <p>cold-R, osmotic-R, salt-R</p>
                        </c>
                        <c ca="left">
                           <p>0:0 0:1 0:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>NAC072</p>
                        </c>
                        <c ca="left">
                           <p>osmotic-S, salt-S</p>
                        </c>
                        <c ca="left">
                           <p>0:1 0:1</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PAP1</p>
                        </c>
                        <c ca="left">
                           <p>osmotic-S, salt-S</p>
                        </c>
                        <c ca="left">
                           <p>0:2 0:2</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>ZAT12 (Induced)</p>
                        </c>
                        <c ca="left">
                           <p>osmotic-R, cold-R, salt-R</p>
                        </c>
                        <c ca="left">
                           <p>0:2 0:0 0:0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>ZAT12 (Repressed)</p>
                        </c>
                        <c ca="left">
                           <p>drought-R, oxidative-R, cold-R, salt-R</p>
                        </c>
                        <c ca="left">
                           <p>0:0 0:0 0:2 0:0</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>The treatments and time shifts CERMT selected for the real data. Also shown are the 'best' time lags based on maximizing the covariance between the TF and the experimentally determined regulon. These data can be useful for interpreting the biological relevance of the predicted regulon. Treatments ending with 'S' and 'R' are from shoot and root respectively.</p>
                  </tblfn>
               </tbl>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>The PAP1 regulon</p>
                  </caption>
                  <text>
                     <p><b>The PAP regulon</b>. Comparison of the expression of the CERMT predicted regulon (upper panel) and the over-expression defined regulon [47] (lower panel) versus the expression of the PAP1 transcription factor in the shoot in response to salt and osmotic stress. No time lag was used for the prediction, so there is no overlap between predicted and true regulons. The difference in terms of coherency and variance is pronounced so it is not hard to see why the algorithm is seeded with no time lag instead of the more appropriate lag of two time points. This illustrates an unavoidable problem of TF target prediction based only on gene expression data &#8211; there are no unique solutions and the most obvious solution is not necessarily the correct one.</p>
                  </text>
                  <graphic file="1471-2105-8-454-5"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Biological significance of the predicted regulons for a large set of transcription factors</p>
               </st>
               <p>Having used existing experimentally validated TF-targets to assess the utility of the proposed algorithm for predicting plant regulons, we wished to extend the study to also include less characterized TFs. Although no known targets are available for validating the predictions for such TFs, it is still possible to estimate the statistical significance of the extracted regulons from a biological point of view, and this information can be used to compare the different prediction methods. By searching for significance, we can estimate how randomly chosen a selection of genes seems to be, and, lacking stronger guidelines, we prefer a method which is less random.</p>
               <p>A standard method for assessing the biological significance of gene expression clusters is to look at the overlap between the clusters and existing functional annotations <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. The predicted regulons are in a sense also clusters and because targets of a given TF often share biological functions (e.g. <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B47">47</abbr></abbrgrp>), we reasoned that the predicted regulons, just like standard gene expression clusters, also should share functional annotations, and that these should be detectable by searching for over-representation.</p>
               <p>TFs regulate the expression of their target genes by binding to sequence motifs, i.e. cis-regulatory elements (CREs), in their promoter regions. A group of genes that is more likely to share sequence elements amongst each other is therefore also more likely to be co-regulated. Measuring this likelihood could possibly be done by identifying the best motif and assessing its significance. There are many excellent algorithms available for identifying motifs given a set of promoters (e.g. <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>); however, the use of these methods is not feasible for our purpose as most of them are very time consuming and need data specific parameter tuning <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Fortunately, we are not directly interested in finding the <it>correct </it>motif, but merely in scoring how reasonable the existence of such a motif is. Therefore we chose a more simplistic method, which was inspired by the enumeration strategy used by the efficacious Weeder algorithm <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, and estimate the likelihood of motif-existence by the number of over-represented nucleotide hexamers. Although it might be naively assumed that a single, highly over-represented motif would be found for each regulon, there are at least two reasons why better regulon prediction is more likely accompanied by an increased number of over-represented motifs. Firstly, the hexamers searched for are redundant or overlapping in sequence, resulting in multiple hits from a single CRE. Secondly, genes may be regulated by more than one CRE, and these 'hitchhiking' elements may also be enriched.</p>
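               <p>The idea of scoring a regulon by its number of over-represented hexamers can be sketched as follows. This is a simplified illustration and not the procedure used in the study: the one-sided hypergeometric test, the significance level of 0.05 without multiple-testing correction, the function names and the toy promoter sequences are all assumptions.</p>

```python
from math import comb


def hexamers(seq, k=6):
    """Set of distinct k-mers occurring in a promoter sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def upper_tail(hits, marked, total, drawn):
    """P(X >= hits) when `drawn` of `total` genes are picked, `marked` carry the k-mer."""
    top = min(marked, drawn)
    tail = sum(
        comb(marked, j) * comb(total - marked, drawn - j)
        for j in range(hits, top + 1)
    )
    return tail / comb(total, drawn)


def overrepresented_hexamers(regulon, promoters, alpha=0.05):
    """Hexamers carried by more regulon genes than expected by chance.

    regulon:   set of gene identifiers
    promoters: dict mapping every gene identifier to its upstream sequence
    """
    carriers = {}
    for gene, seq in promoters.items():
        for hexamer in hexamers(seq):
            carriers.setdefault(hexamer, set()).add(gene)
    significant = []
    for hexamer, genes in carriers.items():
        hits = len(genes.intersection(regulon))
        if hits and alpha > upper_tail(hits, len(genes), len(promoters), len(regulon)):
            significant.append(hexamer)
    return sorted(significant)


# Toy data (hypothetical): four regulon genes share one 8-nucleotide element,
# sixteen background genes do not.
promoters = {f"g{i}": "TACGTGGC" for i in range(4)}
promoters.update({f"g{i}": "TTTTTTTT" for i in range(4, 20)})
regulon = {"g0", "g1", "g2", "g3"}

print(overrepresented_hexamers(regulon, promoters))
```

               <p>In this toy example a single shared element of eight nucleotides yields three over-represented hexamers, which illustrates why overlapping hexamers produce multiple hits from one CRE.</p>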
               <p>To assess the plausibility of our assumptions about regulon characteristics, we counted the over-represented hexamers and functional annotations in the experimentally defined regulons. Six of the seven target pools that contained more than 30 genes had between 5 and 37 over-represented hexamers and seven had between 3 and 8 over-represented functional annotations, see Additional file <supplr sid="S2">2</supplr>. In addition, by examining the gene lists annotated to a common known TF binding site by ATCISDB <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, we found that 80% had significantly more over-represented hexamers than could be expected by chance. This strongly indicates that biologically relevant regulons are more likely to contain over-represented annotations and hexamers than random selections of genes. Note that these numbers are dependent on the size of the regulon and are thus not readily transferable to regulons of different sizes.</p>
               <suppl id="S2">
                  <title>
                     <p>Additional file 2</p>
                  </title>
                  <text>
                     <p>The numbers of over-represented hexamers and annotations (MapMan bins) in the experimentally defined regulons. With randomly chosen genes we would not expect any over-representation. A clear majority of the larger regulons have several over-represented hexamers and annotations. 'OX' indicates that the targets were found using over-expression, 'KO' using knock-out and 'ChIP' using ChIP-chip experiments.</p>
                  </text>
                  <file name="1471-2105-8-454-S2.csv">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <p>From the simulated data, we could conclude that, if the proposed method is better than a simple correlation based measure using concatenated time series, then the improvement would be most apparent when there is a time shift between the TF and its regulon. Therefore, we ran the proposed algorithm, treating the TFs as both inducers and repressors, on each of the 1484 genes in our data set annotated in the MapMan software <abbrgrp><abbr bid="B56">56</abbr></abbrgrp> to bin 27.3, 'RNA.regulation of transcription'. For the 265 and 307 genes for which the algorithm chose to introduce a time shift in at least one treatment, in induction and repression mode respectively, we counted the over-represented annotations and upstream motifs.</p>
               <p>Figure <figr fid="F6">6</figr> shows boxplots of the number of significantly over-represented annotations and motifs as a function of the number of extracted genes using the four tested methods. There is a strong difference between the correlation and covariance based methods in all four cases. The covariance based methods enrich more functional annotations and more hexamers, and thus extract regulons with more appealing properties than the standard correlation based methods do. No prominent differences could be seen between CERMT and CERMT-0, suggesting that the qualitative differences between these methods were too small to be resolved by this type of enrichment analysis. As indicated by the non-overlapping notches in Figure <figr fid="F6">6</figr>, CERMT-0 enriched slightly more hexamers than CERMT in induction mode, a trend which was reversed and more prominent in repression mode. Having shown that the predicted regulons show significant promoter sequence properties, it is tempting to speculate that integrating CERMT with motif prediction algorithms could be a way to further separate overlapping regulons and refine the target lists.</p>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>Characteristics of predicted regulons</p>
                  </caption>
                  <text>
                     <p><b>Characteristics of predicted regulons</b>. Boxplots of characteristics of predicted regulons. The number of over-represented functional annotations and over-represented hexamers 500 nucleotides upstream in the top 100&#8211;500 predicted genes for a selection of transcription factors were counted for each method. The results are shown for (A) induction mode and (B) repression mode. Truly co-regulated genes share cis-regulatory elements in their promoters and are also likely to share biological function. Due to hexamer redundancy, motif interactions and parallel TF pathways, a higher number of enriched hexamers and functional annotations therefore indicates a higher probability that a group of genes actually is co-regulated. Compared to the previously described methods ACT (Arabidopsis Co-expression tool) and Cor (Pearson correlation), the covariance based methods CERMT-0 and CERMT extract genes with more over-represented hexamers and functional annotations.</p>
                  </text>
                  <graphic file="1471-2105-8-454-6"/>
               </fig>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We designed a method for extracting potential targets of known TFs using gene expression data in the form of multiple time series. The method provides a heuristic for solving the combinatorial problem of selecting informative treatments and appropriate time shifts between the TF and its targets. By maximizing the overlap in covariance between the TF and all other genes in two treatments and then systematically adding further treatments, we not only avoid the need for computationally expensive optimizations, but also increase the interpretability and quality of the predictions.</p>
         <p>Using existing experimental data on target associations for twelve TFs, the method showed higher performance than existing steady-state co-expression tools, although the results also indicated that the two approaches could be complementary. This not only highlighted the utility of the method but also showed that the targets identified by mutant profiling in normal conditions are indeed often highly covariant with the associated TF in a treatment and time dependent fashion in the wild-type plant.</p>
         <p>The predicted regulons for unknown TFs also showed appealing properties in terms of enriching both annotations and upstream motifs. These results indicate that the described approach could be used both as a method for exploratory analysis of regulatory relationships of a particular TF, and as a means of obtaining high-confidence subsets from putative target genes identified by mutant profiling or other experimental techniques.</p>
         <p>Gene expression based techniques are especially useful for extracting potential targets when no information about the regulatory relationships is available. Such methods can therefore just as well be used for aiding hypothesis generation regarding regulatory properties of e.g. metabolites.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Cross-validation based test for adding further treatments</p>
            </st>
            <p>In order to investigate whether there are more treatments for which (1) is high for the same genes, we order the remaining treatments according to (2) by setting the first treatment to the artificial pseudo treatment. For the remaining treatments we measure the goodness-of-fit by estimating the change in predictive performance between the one-component partial least squares (PLS) regression model <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> predicting the expression of the TF from all other genes including the new treatment, <it>y" </it>= <it>X"B </it>+ <it>E"</it>, versus the old one, <it>y' </it>= <it>X'B </it>+ <it>E'</it>, where <it>B </it>is the vector with regression coefficients and <it>E </it>the residual matrix. The predictive capability is measured by calculating the <it>Q</it><sup>2 </sup>statistic using repeated five-fold cross-validation. Using Student's <it>t</it>-test, we test the hypothesis <it>H</it><sub>0 </sub>: <it>Q</it><sup>2' </sup>> <it>Q</it><sup>2" </sup>(i.e. including the next treatment led to a decrease in predictive performance) and only include the treatments where we fail to reject. <it>Q</it><sup>2 </sup>is defined as follows:</p>
            <p>
               <display-formula id="M4">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i7">
                     <m:semantics>
                        <m:mrow>
                           <m:msup>
                              <m:mi>Q</m:mi>
                              <m:mn>2</m:mn>
                           </m:msup>
                           <m:mo>=</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>&#8722;</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msubsup>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>k</m:mi>
                                    </m:msubsup>
                                    <m:mrow>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:msub>
                                                <m:mover accent="true">
                                                   <m:mi>y</m:mi>
                                                   <m:mo>^</m:mo>
                                                </m:mover>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>&#8722;</m:mo>
                                             <m:msub>
                                                <m:mi>y</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:msubsup>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>k</m:mi>
                                    </m:msubsup>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mi>y</m:mi>
                                          <m:mi>i</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aaWbaaSqabeaacqaIYaGmaaGccqGH9aqpcqaIXaqmcqGHsisljuaGdaWcaaqaamaaqadabaGaeiikaGIafmyEaKNbaKaadaWgaaqaaiabdMgaPbqabaGaeyOeI0IaemyEaK3aaSbaaeaacqWGPbqAaeqaaiabcMcaPmaaCaaabeqaaiabikdaYaaaaeaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGRbWAaiabggHiLdaabaWaaabmaeaacqWG5bqEdaqhaaqaaiabdMgaPbqaaiabikdaYaaaaeaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGRbWAaiabggHiLdaaaiabc6caUaaa@4DA9@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>PLS is designed for developing models with strong predictive performance. Although prediction is not our direct interest, PLS is suitable here because we want to find a set of genes of undefined size that are strongly related to a given TF for <it>all </it>treatments used. <it>Q</it><sup>2 </sup>will not increase when a treatment is added that requires high regression coefficients for genes that are unrelated to the TF in the other treatments, and it therefore provides a valid, albeit indirect, tool for deciding whether to leave treatments out or not.</p>
            <p>Our method does not allow for more than one time shift per treatment. Therefore, we are only interested in looking for targets responding in the same (induced) or opposite (repressed) direction as the TF. Hence, we modified the PLS algorithm slightly to set all negative coefficients (induction mode) or all positive coefficients (repression mode) to zero. According to H&#246;skuldsson <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> this does not affect the central properties of the PLS regression.</p>
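            <p>The cross-validated <it>Q</it><sup>2 </sup>described above can be illustrated with a minimal Python sketch. It is not the implementation used in this study: the function names, the use of NumPy, the number of repeats and the global centring of <it>y </it>(so that the denominator of (4) is a sum of squares around the mean) are all assumptions made for illustration. The one-component PLS fit is written out directly, with the sign constraint applied to the weight vector.</p>

```python
import numpy as np

def pls1_fit(X, y, mode="induction"):
    """Fit a one-component PLS model of y (TF) on X (samples x genes).

    Returns (x_mean, y_mean, coef). Hypothetical helper, not the authors' code.
    """
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    w = Xc.T @ yc                          # weights proportional to covariance with the TF
    if mode == "induction":
        w = np.maximum(w, 0.0)             # set negative coefficients to zero
    else:
        w = np.minimum(w, 0.0)             # repression mode: set positive coefficients to zero
    norm = np.linalg.norm(w)
    if norm == 0.0:
        return x_mean, y_mean, np.zeros_like(w)
    w = w / norm
    t = Xc @ w                             # score vector of the single component
    b = (t @ yc) / (t @ t)                 # inner regression coefficient
    return x_mean, y_mean, w * b

def q2_repeated_cv(X, y, mode="induction", folds=5, repeats=10, seed=0):
    """Return one Q2 value per repeat of k-fold cross-validation."""
    rng = np.random.default_rng(seed)
    y = y - y.mean()                       # assumption: y is centred before computing Q2
    scores = []
    for _ in range(repeats):
        perm = rng.permutation(len(y))
        press = 0.0                        # predictive residual sum of squares
        for test_idx in np.array_split(perm, folds):
            train = np.setdiff1d(perm, test_idx)
            x_mean, y_mean, coef = pls1_fit(X[train], y[train], mode)
            y_hat = (X[test_idx] - x_mean) @ coef + y_mean
            press += np.sum((y_hat - y[test_idx]) ** 2)
        scores.append(1.0 - press / np.sum(y ** 2))
    return np.array(scores)
```

            <p>Under these assumptions, the vectors of <it>Q</it><sup>2 </sup>values obtained with and without a candidate treatment could then be compared with a <it>t</it>-test, as described in the text.</p>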
         </sec>
         <sec>
            <st>
               <p>Estimation of the regulon size and significance using the Gap statistic</p>
            </st>
            <p>The Gap statistic has previously been proposed as a method for simultaneously choosing a suitable cluster size and assessing its statistical quality <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. The method works by calculating a goodness-statistic for several different cluster sizes and choosing that which is farthest away from a pre-defined null-distribution. In our setting, the null-distribution is the goodness-statistics of the regulons we obtain when using a random gene whose expression has been shuffled within each treatment. We define statistical quality of a regulon as the amount of its variance that can be directly related to the TF, and measure this by reversing the previous PLS regression model and calculating <it>R</it><sup>2</sup>.</p>
            <p>
               <display-formula id="M5"><it>X</it><sub><it>j </it>&#8712; 1...<it>k </it></sub>= <it>y</it><it>B </it>+ <it>E</it></display-formula>
            </p>
            <p>
               <display-formula id="M6">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i8">
                     <m:semantics>
                        <m:mrow>
                           <m:msup>
                              <m:mi>R</m:mi>
                              <m:mn>2</m:mn>
                           </m:msup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>k</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:mo>&#8721;</m:mo>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mover accent="true">
                                             <m:mi>X</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>j</m:mi>
                                             <m:mo>&#8712;</m:mo>
                                             <m:mn>1...</m:mn>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:mo>&#8721;</m:mo>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mi>X</m:mi>
                                          <m:mrow>
                                             <m:mi>j</m:mi>
                                             <m:mo>&#8712;</m:mo>
                                             <m:mn>1...</m:mn>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOuai1aaWbaaSqabeaacqaIYaGmaaGccqGGOaakcqWGRbWAcqGGPaqkcqGH9aqpjuaGdaWcaaqaamaaqaeabaGafmiwaGLbaKaadaqhaaqaaiabdQgaQjabgIGiolabigdaXiabc6caUiabc6caUiabc6caUiabdUgaRbqaaiabikdaYaaaaeqabeGaeyyeIuoaaeaadaaeabqaaiabdIfaynaaDaaabaGaemOAaOMaeyicI4SaeGymaeJaeiOla4IaeiOla4IaeiOla4Iaem4AaSgabaGaeGOmaidaaaqabeqacqGHris5aaaaaaa@4B86@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>We then define the Gap statistic as the observed <it>R</it><sup>2 </sup>minus the 95<sup>th </sup>percentile of the null-distribution &#8211; <it>R</it><sup>2</sup>*.</p>
            <p>
               <display-formula id="M7">Gap(<it>k</it>) = <it>R</it><sup>2</sup>(<it>k</it>) - <it>Q</it><sub>0.95</sub>(<it>R</it><sup>2</sup>*(<it>k</it>))</display-formula>
            </p>
            <p>The recommended regulon size is given by:</p>
            <p>
               <display-formula id="M8">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i9">
                     <m:semantics>
                        <m:mrow>
                           <m:mover accent="true">
                              <m:mi>k</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mo>=</m:mo>
                           <m:mi>arg</m:mi>
                           <m:mo>&#8289;</m:mo>
                           <m:munder>
                              <m:mrow>
                                 <m:mi>max</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                              <m:mi>k</m:mi>
                           </m:munder>
                           <m:mtext>Gap</m:mtext>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>k</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafm4AaSMbaKaacqGH9aqpcyGGHbqycqGGYbGCcqGGNbWzdaWfqaqaaiGbc2gaTjabcggaHjabcIha4bWcbaGaem4AaSgabeaakiabbEeahjabbggaHjabbchaWjabcIcaOiabdUgaRjabcMcaPiabc6caUaaa@4026@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Because we use the 95<sup>th </sup>percentile, a positive Gap curve can directly be translated to a significant regulon at the 5% significance level.</p>
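            <p>As an illustration of equations (5) to (8), the following Python sketch computes a Gap curve for a simplified setting. Several parts are assumptions rather than the procedure used in this study: genes are ranked by their covariance with the TF profile without treatment selection or time shifts, the reversed one-component model is taken to reduce to a per-gene least squares fit on the centred TF profile, and the names <it>r2_curve </it>and <it>gap_statistic </it>as well as the number of null draws are hypothetical.</p>

```python
import numpy as np

def r2_curve(X, y, sizes):
    """R2(k) for the k genes most covariant with y; X is samples x genes."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    cov = Xc.T @ yc
    order = np.argsort(-cov)                    # most covariant genes first
    explained = cov[order] ** 2 / (yc @ yc)     # per-gene sum of squared fitted values
    total = np.sum(Xc[:, order] ** 2, axis=0)   # per-gene total sum of squares
    idx = np.asarray(sizes) - 1
    return np.cumsum(explained)[idx] / np.cumsum(total)[idx]

def gap_statistic(X, y, treatment, sizes, n_null=200, seed=0):
    """Return (Gap(k) for each k in sizes, recommended regulon size)."""
    rng = np.random.default_rng(seed)
    observed = r2_curve(X, y, sizes)
    null = np.empty((n_null, len(sizes)))
    for b in range(n_null):
        # Null TF: a random gene whose expression is shuffled within each treatment.
        fake = X[:, rng.integers(X.shape[1])].copy()
        for t in np.unique(treatment):
            mask = treatment == t
            fake[mask] = rng.permutation(fake[mask])
        null[b] = r2_curve(X, fake, sizes)
    gap = observed - np.quantile(null, 0.95, axis=0)
    return gap, sizes[int(np.argmax(gap))]
```

            <p>Here <it>treatment </it>is assumed to be an integer label per sample, so that shuffling preserves treatment membership. A positive entry in the returned Gap curve corresponds to a regulon size at which the observed <it>R</it><sup>2 </sup>exceeds the 95<sup>th </sup>percentile of the null distribution.</p>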
         </sec>
         <sec>
            <st>
               <p>Simulation of gene expression data</p>
            </st>
            <p>The gene expression, <it>x</it>, at time point <it>i </it>&#8712; {1, 2,..., 7} for gene <it>j </it>&#8712; {1, 2,..., 10000}, in treatment <it>k </it>&#8712; {1, 2,..., 6} was simulated in a naive way as</p>
            <p>
               <display-formula id="M9">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i10">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>x</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>j</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>k</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mrow>
                              <m:mo>{</m:mo>
                              <m:mrow>
                                 <m:mtable columnalign="left">
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mi>N</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:msub>
                                                <m:mi>&#963;</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>x</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>c</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                             <m:msub>
                                                <m:mi>y</m:mi>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msub>
                                                      <m:mi>l</m:mi>
                                                      <m:mi>k</m:mi>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo>+</m:mo>
                                             <m:msub>
                                                <m:mi>p</m:mi>
                                                <m:mrow>
                                                   <m:mi>j</m:mi>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                             <m:msub>
                                                <m:mi>y</m:mi>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:msub>
                                                      <m:mi>l</m:mi>
                                                      <m:mi>k</m:mi>
                                                   </m:msub>
                                                   <m:mo>,</m:mo>
                                                   <m:mi>k</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:mi>i</m:mi>
                                             <m:mo>></m:mo>
                                             <m:msub>
                                                <m:mi>l</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr columnalign="left">
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mi>N</m:mi>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mn>0</m:mn>
                                             <m:mo>,</m:mo>
                                             <m:msub>
                                                <m:mi>&#963;</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>x</m:mi>
                                                      <m:mi>j</m:mi>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:msub>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                       </m:mtd>
                                       <m:mtd columnalign="left">
                                          <m:mrow>
                                             <m:mtext>if&#160;</m:mtext>
                                             <m:mi>i</m:mi>
                                             <m:mo>&#8804;</m:mo>
                                             <m:msub>
                                                <m:mi>l</m:mi>
                                                <m:mi>k</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemiEaG3aaSbaaSqaaiabdMgaPjabcYcaSiabdQgaQjabcYcaSiabdUgaRbqabaGccqGH9aqpdaGabeqaauaabaqaciaaaeaacqWGobGtcqGGOaakcqaIWaamcqGGSaaliiGacqWFdpWCdaWgaaWcbaGaemiEaG3aaSbaaWqaaiabdQgaQbqabaaaleqaaOGaeiykaKIaey4kaSIaem4yam2aaSbaaSqaaiabdQgaQbqabaGccqWG5bqEdaWgaaWcbaGaemyAaKMaeyOeI0IaemiBaW2aaSbaaWqaaiabdUgaRbqabaWccqGGSaalcqWGRbWAaeqaaOGaey4kaSIaemiCaa3aaSbaaSqaaiabdQgaQjabcYcaSiabdUgaRbqabaGccqWG5bqEdaWgaaWcbaGaemyAaKMaeyOeI0IaemiBaW2aaSbaaWqaaiabdUgaRbqabaWccqGGSaalcqWGRbWAaeqaaaGcbaGaeeyAaKMaeeOzayMaeeiiaaIaemyAaKMaeyOpa4JaemiBaW2aaSbaaSqaaiabdUgaRbqabaaakeaacqWGobGtcqGGOaakcqaIWaamcqGGSaalcqWFdpWCdaWgaaWcbaGaemiEaG3aaSbaaWqaaiabdQgaQbqabaaaleqaaOGaeiykaKcabaGaeeyAaKMaeeOzayMaeeiiaaIaemyAaKMaeyizImQaemiBaW2aaSbaaSqaaiabdUgaRbqabaaaaaGccaGL7baaaaa@77B7@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where</p>
            <p>
               <display-formula>
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-8-454-i11">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#963;</m:mi>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>x</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>&#8712;</m:mo>
                                       <m:mi>&#915;</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>0.4</m:mn>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>+</m:mo>
                                       <m:mn>0.2</m:mn>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>y</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>&#8712;</m:mo>
                                       <m:mi>N</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>0</m:mn>
                                       <m:mo>,</m:mo>
                                       <m:mn>1.5</m:mn>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>,</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaGGaciab=n8aZnaaBaaaleaacqWG4baEdaWgaaadbaGaemOAaOgabeaaaSqabaGccqGHiiIZcqqHtoWrcqGGOaakcqaIXaqmcqGGSaalcqaIWaamcqGGUaGlcqaI0aancqGGPaqkcqGHRaWkcqaIWaamcqGGUaGlcqaIYaGmcqGGSaalaeaacqWG5bqEdaWgaaWcbaGaemyAaKMaeiilaWIaem4AaSgabeaakiabgIGiolabd6eaojabcIcaOiabicdaWiabcYcaSiabigdaXiabc6caUiabiwda1iabcMcaPiabcYcaSaaaaaa@4E18@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>and <it>c</it><sub><it>j </it></sub>was one or zero depending on whether gene <it>j </it>was part of the planted regulon or not. The constant <it>l</it><sub><it>k </it></sub>defined the planted lag for treatment <it>k</it>. The lag was either set to zero or allowed to vary between 1 and 2 time points. The parameters for the distribution of <it>&#963;</it><sub><it>y </it></sub>were picked to resemble a real world dataset. To make the data more illustrative, the term <it>p</it><sub><it>j</it>, <it>k</it></sub><it>y</it><sub><it>i</it>-<it>l</it>, <it>k </it></sub>in (9) was added, where <it>p</it><sub><it>j</it>, <it>k </it></sub>was one or zero depending on whether or not the gene <it>j </it>belonged to a 'masking' regulon in treatment <it>k</it>, a non-intersecting group of genes of the same size as the true regulon. Thus, in order to recover the hidden regulon it is necessary to combine information from different treatments.</p>
            <p>Simulating time series data using random normal deviates is naive in the sense that the different time points are independent of each other. For this particular application it is however acceptable as the simplification only becomes detrimental when comparing methods that utilize the time series aspect of the data, as for now CERMT does not do this.</p>
         </sec>
         <sec>
            <st>
               <p>The AtGenExpress data</p>
            </st>
            <p>The data from the abiotic stress series of the AtGenExpress project were downloaded from <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> and normalized using the RMA normalization algorithm <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> as provided by the Bioconductor project <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> for the statistical programming environment R <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Probesets matching multiple AGI codes or organellar-encoded genes were excluded. Where multiple probesets matched the same AGI code, the original chip design designations were used and superfluous probesets were dropped in order to obtain a bijective mapping for 20872 probesets. Only probesets that received a present call by the MAS5 algorithm for both replicates in at least one time point were kept, giving a final expression set of 17513 probesets.</p>
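            <p>The present-call filter described above can be sketched as follows. This is only an illustration in Python, not the code used in the study: the function name <it>keep_probeset</it>, the data layout and the probeset identifiers are hypothetical, and MAS5 detection calls are assumed to be encoded as "P" (present), "M" (marginal) and "A" (absent).</p>

```python
def keep_probeset(calls):
    """Return True if both replicates are called present at some time point.

    calls: list of (replicate_1, replicate_2) MAS5 detection calls,
    one pair per time point (hypothetical layout).
    """
    return any(rep1 == "P" and rep2 == "P" for rep1, rep2 in calls)


# Hypothetical detection calls for three probesets over three time points.
detection_calls = {
    "probeset_a": [("A", "A"), ("P", "P"), ("M", "P")],  # both present once
    "probeset_b": [("P", "A"), ("A", "P"), ("M", "M")],  # never both present
    "probeset_c": [("A", "A"), ("A", "A"), ("A", "A")],  # never present
}

kept = sorted(name for name, calls in detection_calls.items()
              if keep_probeset(calls))
print(kept)
```

            <p>In this sketch a marginal call does not count as present; whether that matches the original filtering is an assumption.</p>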
            <p>Throughout this study, we only considered the time shifts 0, 0.5 and 1 h, as larger time shifts would leave too few time points and imply unrealistically long transcriptional delays.</p>
            <p>The thresholds used for judging whether a TF responded to a treatment were obtained by applying the standard threshold for moderate outliers, <it>Q</it><sub>0.75 </sub>+ 1.5 &#215; <it>IQR</it> (the third quartile plus 1.5 times the inter-quartile range), to the distributions of the maximum responses and maximum deviations from the control among the probes on the arrays with only insignificant expression signals, as judged by the MAS5 algorithm.</p>
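            <p>As a minimal sketch of the outlier rule, the following Python snippet computes the threshold from a background distribution and compares a TF's maximum response against it. The function names and the background values are hypothetical, and the quartile convention (linear interpolation between order statistics) is an assumption, since quartile definitions differ slightly between software packages.</p>

```python
from statistics import quantiles


def moderate_outlier_threshold(background):
    """Return Q3 + 1.5 * IQR for a list of background values."""
    # method="inclusive" interpolates linearly between order statistics;
    # other quartile conventions give slightly different values on
    # small samples.
    q1, _median, q3 = quantiles(background, n=4, method="inclusive")
    return q3 + 1.5 * (q3 - q1)


def responds(tf_max_response, background):
    """True if the TF's maximum response exceeds the background threshold."""
    return tf_max_response > moderate_outlier_threshold(background)


# Hypothetical maximum responses of probes with insignificant signals.
absent_probe_responses = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
threshold = moderate_outlier_threshold(absent_probe_responses)
print(round(threshold, 3), responds(2.0, absent_probe_responses))
```

            <p>The same function would be applied separately to the distribution of maximum responses and to the distribution of maximum deviations from the control.</p>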
         </sec>
         <sec>
            <st>
               <p>Motif and Bin enrichment</p>
            </st>
            <p>The enrichment of hexamers in predicted regulons was calculated by first building a dictionary of all possible hexamers, excluding those that resembled the TATA-box, and counting their occurrences in the 500-base upstream regions of all considered genes. The resulting global distribution was then compared with that of the predicted regulon. <it>P</it>-values for over-representation were calculated using the hypergeometric distribution and FDR corrected <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>; over-representation was noted for FDR &lt; 0.05.</p>
            <p>The calculation of annotational enrichments was based on the method proposed by Hannah et al. <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>, which uses MapMan ontologies <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B65">65</abbr></abbrgrp> combined with Fisher's exact test. Bins (gene classes) were counted as significantly enriched if FDR &lt; 0.05. The MapMan annotations were preferred over alternatives such as mappings to Gene Ontology <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> because of their maturity and plant-specific scope.</p>
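            <p>The hexamer test can be illustrated with the following Python sketch, which is not the implementation used in the study. It assumes that a hexamer is scored per gene as present or absent in the upstream region, that the FDR correction is the Benjamini-Hochberg procedure, and it omits the TATA-box filter. All function names, gene identifiers and the shortened toy sequences are hypothetical.</p>

```python
from itertools import product
from math import comb


def hexamers_in(sequence):
    """Return the set of distinct hexamers occurring in a DNA sequence."""
    return {sequence[i:i + 6] for i in range(len(sequence) - 5)}


def upper_tail(k, n_total, n_marked, n_drawn):
    """P(X >= k) for a hypergeometric X (n_drawn draws, n_marked successes)."""
    top = min(n_marked, n_drawn)
    hits = sum(comb(n_marked, i) * comb(n_total - n_marked, n_drawn - i)
               for i in range(k, top + 1))
    return hits / comb(n_total, n_drawn)


def hexamer_p_values(upstream, regulon):
    """Map every possible hexamer to its over-representation p-value.

    upstream: dict gene id -> upstream sequence (all considered genes)
    regulon:  set of gene ids in the predicted regulon
    """
    present = {gene: hexamers_in(seq) for gene, seq in upstream.items()}
    p_values = {}
    for hexamer in ("".join(p) for p in product("ACGT", repeat=6)):
        n_marked = sum(hexamer in found for found in present.values())
        k = sum(hexamer in present[gene] for gene in regulon)
        p_values[hexamer] = upper_tail(k, len(present), n_marked, len(regulon))
    return p_values


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Toy upstream regions (much shorter than 500 bases) for eight genes.
upstream = {
    "g1": "AAAACACGTGAAAA", "g2": "CCCCCACGTGCCCC",
    "g3": "GGGGCACGTGTTTT", "g4": "AAAAAAAAAAAAAA",
    "g5": "CCCCCCCCCCCCCC", "g6": "GGGGGGGGGGGGGG",
    "g7": "TTTTTTTTTTTTTT", "g8": "ACACACACACACAC",
}
regulon = {"g1", "g2", "g3"}

raw = hexamer_p_values(upstream, regulon)
names = sorted(raw)
fdr = dict(zip(names, benjamini_hochberg([raw[name] for name in names])))
enriched = [name for name in names if 0.05 > fdr[name]]
print(raw["CACGTG"], enriched)
```

            <p>With only eight toy genes, no hexamer can survive correction over all 4096 candidates, so the list of enriched hexamers is empty here; the example only shows how the raw and corrected values are obtained.</p>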
         </sec>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>The R package contains all methods discussed in this paper and the part of the AtGenExpress data used here. It is not organism specific and makes it possible to apply CERMT to other species after collation of the appropriate gene expression time series. For <it>Arabidopsis thaliana</it>, we also provide the method as a web-service, which allows the user to select the TF of interest and to extract and plot the suggested regulon using a fast but simplified version of the proposed algorithm.</p>
         <p><b>Project name</b>: cermt</p>
         <p><b>Project home page</b>: <url>http://cermt.mpimp-golm.mpg.de/</url></p>
         <p><b>Operating systems</b>: Platform independent</p>
         <p><b>Programming language</b>: R package with Java based web-interface</p>
         <p><b>Licence</b>: GPL v2</p>
         <p><b>Any restrictions to use by non-academics</b>: No</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>HR designed and implemented the methods and wrote the manuscript. DW implemented the web-service. JS provided essential mentoring and supervision to HR. MH initiated and supervised the project, conducted the literature study and wrote the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors would like to thank Dr. Solve S&#230;b&#248; for helpful discussions on PLS, Dr. Dirk Walther, Dr. Dirk Repsilber and Dr. Patrick May for discussions and comments on the manuscript and the AtGenExpress consortium and TAIR for data availability.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Improving plant drought, salt, and freezing tolerance by gene transfer of a single stress-inducible transcription factor</p>
            </title>
            <aug>
               <au>
                  <snm>Kasuga</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Miura</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yamaguchi-Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>1999</pubdate>
            <volume>17</volume>
            <issue>3</issue>
            <fpage>287</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/7036</pubid>
                  <pubid idtype="pmpid" link="fulltext">10096298</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Intellectual property. Decision on NFkappaB patent could have broad implications for biotech</p>
            </title>
            <aug>
               <au>
                  <snm>Garber</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>312</volume>
            <issue>5775</issue>
            <fpage>827</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.312.5775.827a</pubid>
                  <pubid idtype="pmpid" link="fulltext">16690824</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Transcriptional regulatory networks in Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zeitlinger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jennings</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Murray</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Ren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wyrick</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Tagne</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Volkert</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Fraenkel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <issue>5594</issue>
            <fpage>799</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075090</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Transcriptional regulatory code of a eukaryotic genome</p>
            </title>
            <aug>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Macisaac</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Danford</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Tagne</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Reynolds</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Yoo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jennings</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Zeitlinger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pokholok</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rolfe</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Takusagawa</snm>
                  <fnm>KT</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Fraenkel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <issue>7004</issue>
            <fpage>99</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02800</pubid>
                  <pubid idtype="pmpid" link="fulltext">15343339</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The transcription factors c-rel and RelA control epidermal development and homeostasis in embryonic and adult skin via distinct mechanisms</p>
            </title>
            <aug>
               <au>
                  <snm>Gugasyan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Voss</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Varigos</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Grumont</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Kaur</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Grigoriadis</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gerondakis</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2004</pubdate>
            <volume>24</volume>
            <issue>13</issue>
            <fpage>5733</fpage>
            <lpage>45</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">480872</pubid>
                  <pubid idtype="pmpid" link="fulltext">15199130</pubid>
                  <pubid idtype="doi">10.1128/MCB.24.13.5733-5745.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>AUXIN RESPONSE FACTOR 2 (ARF2): a pleiotropic developmental regulator</p>
            </title>
            <aug>
               <au>
                  <snm>Okushima</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mitina</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Quach</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Theologis</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>43</volume>
            <fpage>29</fpage>
            <lpage>46</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2005.02426.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15960614</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A glucocorticoid-inducible transcription system causes severe growth defects in Arabidopsis and induces defense-related genes</p>
            </title>
            <aug>
               <au>
                  <snm>Kang</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>KB</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>1999</pubdate>
            <volume>20</volume>
            <fpage>127</fpage>
            <lpage>133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313X.1999.00575.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10571872</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Ethanol-inducible gene expression: non-transformed plants also respond to ethanol</p>
            </title>
            <aug>
               <au>
                  <snm>Vreugdenhil</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Claassens</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Verhees</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>van der Krol</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>van der Plas</snm>
                  <fnm>LH</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>2006</pubdate>
            <volume>11</volume>
            <fpage>9</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tplants.2005.11.008</pubid>
                  <pubid idtype="pmpid" link="fulltext">16356757</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Combined Global Localization Analysis and Transcriptome Data Identify Genes That Are Directly Coregulated by Adr1 and Cat8</p>
            </title>
            <aug>
               <au>
                  <snm>Tachibana</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yoo</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Tagne</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Kacherovsky</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>ET</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2005</pubdate>
            <volume>25</volume>
            <issue>6</issue>
            <fpage>2138</fpage>
            <lpage>2146</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1061606</pubid>
                  <pubid idtype="pmpid" link="fulltext">15743812</pubid>
                  <pubid idtype="doi">10.1128/MCB.25.6.2138-2146.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Integrated assessment and prediction of transcription factor binding</p>
            </title>
            <aug>
               <au>
                  <snm>Beyer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Workman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hollunder</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Radke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>M&#246;ller</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Wilhelm</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <issue>6</issue>
            <fpage>e70</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1479087</pubid>
                  <pubid idtype="pmpid" link="fulltext">16789814</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0020070</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>DBD: a transcription factor prediction database</p>
            </title>
            <aug>
               <au>
                  <snm>Kummerfeld</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D74</fpage>
            <lpage>81</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347493</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381970</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Roles of the CBF2 and ZAT12 transcription factors in configuring the low temperature transcriptome of Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Vogel</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Zarka</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Van Buskirk</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Fowler</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Thomashow</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>41</volume>
            <issue>2</issue>
            <fpage>195</fpage>
            <lpage>211</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2004.02288.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15634197</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Inferring transcriptional modules from ChIP-chip, motif and microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Lemmens</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dhollander</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>De Bie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Monsieurs</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Engelen</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Smets</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Winderickx</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>De Moor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Marchal</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>5</issue>
            <fpage>R37</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1779513</pubid>
                  <pubid idtype="pmpid" link="fulltext">16677396</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-5-r37</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Inferring pairwise regulatory relationships from multiple time series datasets</p>
            </title>
            <aug>
               <au>
                  <snm>Shi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mitchell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>6</issue>
            <fpage>755</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl676</pubid>
                  <pubid idtype="pmpid" link="fulltext">17237067</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The Arabidopsis co-expression tool (ACT): a WWW-based tool and database for microarray-based gene expression analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Jen</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Manfield</snm>
                  <fnm>IW</fnm>
               </au>
               <au>
                  <snm>Michalopoulos</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pinney</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Willats</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>Gilmartin</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Westhead</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2006</pubdate>
            <volume>46</volume>
            <issue>2</issue>
            <fpage>336</fpage>
            <lpage>48</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2006.02681.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16623895</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>CSB.DB: a comprehensive systems-biology database</p>
            </title>
            <aug>
               <au>
                  <snm>Steinhauser</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Usadel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Luedemann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thimm</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Kopka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>18</issue>
            <fpage>3647</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth398</pubid>
                  <pubid idtype="pmpid" link="fulltext">15247097</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The Botany Array Resource: e-Northerns, Expression Angling, and promoter analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Toufighi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Brady</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Austin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ly</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Provart</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>43</volume>
            <fpage>153</fpage>
            <lpage>63</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2005.02437.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15960624</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>ATTED-II: a database of co-expressed genes and cis elements for identifying co-regulated gene groups in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Obayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kinoshita</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nakai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shibaoka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hayashi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Saeki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohta</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <issue>Database issue</issue>
            <fpage>D863</fpage>
            <lpage>D869</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1716726</pubid>
                  <pubid idtype="pmpid" link="fulltext">17130150</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl783</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets</p>
            </title>
            <aug>
               <au>
                  <snm>Persson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Milne</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Somerville</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>24</issue>
            <fpage>8633</fpage>
            <lpage>8638</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142401</pubid>
                  <pubid idtype="pmpid" link="fulltext">15932943</pubid>
                  <pubid idtype="doi">10.1073/pnas.0503392102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Transcriptional co-regulation of secondary metabolism enzymes in Arabidopsis: functional and evolutionary implications</p>
            </title>
            <aug>
               <au>
                  <snm>Gachon</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Langlois-Meurinne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Henry</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Saindrenan</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>58</volume>
            <issue>2</issue>
            <fpage>229</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s11103-005-5346-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">16027976</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Intercellular movement of transcription factors</p>
            </title>
            <aug>
               <au>
                  <snm>Kurata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wada</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2005</pubdate>
            <volume>8</volume>
            <issue>6</issue>
            <fpage>600</fpage>
            <lpage>605</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.pbi.2005.09.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16182599</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Redox-dependent transcriptional regulation</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Colavitti</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rovira</snm>
                  <fnm>II</fnm>
               </au>
               <au>
                  <snm>Finkel</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Circ Res</source>
            <pubdate>2005</pubdate>
            <volume>97</volume>
            <issue>10</issue>
            <fpage>967</fpage>
            <lpage>974</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1161/01.RES.0000188210.72062.10</pubid>
                  <pubid idtype="pmpid" link="fulltext">16284189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Genomic analysis of gene expression relationships in transcriptional regulatory networks</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Luscombe</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Qian</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>8</issue>
            <fpage>422</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(03)00175-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">12902159</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Integrated biclustering of heterogeneous genome-wide datasets for the inference of global regulatory networks</p>
            </title>
            <aug>
               <au>
                  <snm>Reiss</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Baliga</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bonneau</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>280</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1502140</pubid>
                  <pubid idtype="pmpid" link="fulltext">16749936</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-280</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Identifying time-lagged gene clusters using gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Ji</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tan</snm>
                  <fnm>KL</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>4</issue>
            <fpage>509</fpage>
            <lpage>516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti026</pubid>
                  <pubid idtype="pmpid" link="fulltext">15374868</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Clustering of gene expression data using a local shape-based similarity measure</p>
            </title>
            <aug>
               <au>
                  <snm>Balasubramaniyan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>H&#252;llermeier</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Weskamp</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>K&#228;mper</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>7</issue>
            <fpage>1069</fpage>
            <lpage>1077</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti095</pubid>
                  <pubid idtype="pmpid" link="fulltext">15513997</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The Graphical Query Language: a tool for analysis of gene expression time-courses</p>
            </title>
            <aug>
               <au>
                  <snm>Costa</snm>
                  <fnm>IG</fnm>
               </au>
               <au>
                  <snm>Sch&#246;nhuth</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schliep</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>10</issue>
            <fpage>2544</fpage>
            <lpage>2545</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti311</pubid>
                  <pubid idtype="pmpid" link="fulltext">15701683</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Bayesian coclustering of Anopheles gene expression time series: study of immune defense response to multiple experimental challenges</p>
            </title>
            <aug>
               <au>
                  <snm>Heard</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Hand</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Dimopoulos</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>47</issue>
            <fpage>16939</fpage>
            <lpage>16944</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1287961</pubid>
                  <pubid idtype="pmpid" link="fulltext">16287981</pubid>
                  <pubid idtype="doi">10.1073/pnas.0408393102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering</p>
            </title>
            <aug>
               <au>
                  <snm>Gasch</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>11</issue>
            <fpage>RESEARCH0059</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">133443</pubid>
                  <pubid idtype="pmpid" link="fulltext">12429058</pubid>
                  <pubid idtype="doi">10.1186/gb-2002-3-11-research0059</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Sugars and circadian regulation make major contributions to the global regulation of diurnal gene expression in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Bl&#228;sing</snm>
                  <fnm>OE</fnm>
               </au>
               <au>
                  <snm>Gibon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>G&#252;nther</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>H&#246;hne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Morcuende</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Osuna</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Thimm</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Usadel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Scheible</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Stitt</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2005</pubdate>
            <volume>17</volume>
            <issue>12</issue>
            <fpage>3257</fpage>
            <lpage>3281</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1315368</pubid>
                  <pubid idtype="pmpid" link="fulltext">16299223</pubid>
                  <pubid idtype="doi">10.1105/tpc.105.035261</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>'Gene shaving' as a method for identifying distinct sets of genes with similar expression patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Hastie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Alizadeh</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Levy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Staudt</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2000</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>RESEARCH0003</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15015</pubid>
                  <pubid idtype="pmpid" link="fulltext">11178228</pubid>
                  <pubid idtype="doi">10.1186/gb-2000-1-2-research0003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The AtGenExpress global stress expression data set: protocols, evaluation and model data analysis of UV-B light, drought and cold stress responses</p>
            </title>
            <aug>
               <au>
                  <snm>Kilian</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Horak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wanke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Weinl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Batistic</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>D'Angelo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bornberg-Bauer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kudla</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Harter</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2007</pubdate>
            <volume>50</volume>
            <issue>2</issue>
            <fpage>347</fpage>
            <lpage>363</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2007.03052.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17376166</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Monitoring the expression profiles of 7000 Arabidopsis genes under drought, cold and high-salinity stresses using a full-length cDNA microarray</p>
            </title>
            <aug>
               <au>
                  <snm>Seki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Narusaka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ishida</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nanjo</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fujita</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oono</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kamiya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nakajima</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Enju</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sakurai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Satou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Akiyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Taji</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamaguchi-Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kawai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <issue>3</issue>
            <fpage>279</fpage>
            <lpage>292</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313X.2002.01359.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12164808</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Arabidopsis transcriptome profiling indicates that multiple regulatory pathways are activated during cold acclimation in addition to the CBF cold response pathway</p>
            </title>
            <aug>
               <au>
                  <snm>Fowler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Thomashow</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2002</pubdate>
            <volume>14</volume>
            <issue>8</issue>
            <fpage>1675</fpage>
            <lpage>1690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">151458</pubid>
                  <pubid idtype="pmpid" link="fulltext">12172015</pubid>
                  <pubid idtype="doi">10.1105/tpc.003483</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Continuous representations of time-series gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jaakkola</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2003</pubdate>
            <volume>10</volume>
            <fpage>341</fpage>
            <lpage>356</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/10665270360688057</pubid>
                  <pubid idtype="pmpid" link="fulltext">12935332</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Role of Arabidopsis MYC and MYB homologs in drought- and abscisic acid-regulated gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Abe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yamaguchi-Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Urao</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Iwasaki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hosokawa</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1997</pubdate>
            <volume>9</volume>
            <issue>10</issue>
            <fpage>1859</fpage>
            <lpage>1868</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">157027</pubid>
                  <pubid idtype="pmpid" link="fulltext">9368419</pubid>
                  <pubid idtype="doi">10.1105/tpc.9.10.1859</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Arabidopsis bZIP protein HY5 directly interacts with light-responsive promoters in mediating light control of gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Chattopadhyay</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ang</snm>
                  <fnm>LH</fnm>
               </au>
               <au>
                  <snm>Puente</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>XW</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1998</pubdate>
            <volume>10</volume>
            <issue>5</issue>
            <fpage>673</fpage>
            <lpage>683</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">144028</pubid>
                  <pubid idtype="pmpid" link="fulltext">9596629</pubid>
                  <pubid idtype="doi">10.1105/tpc.10.5.673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Identification of cold-inducible downstream genes of the Arabidopsis DREB1A/CBF3 transcriptional factor using two microarray systems</p>
            </title>
            <aug>
               <au>
                  <snm>Maruyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sakuma</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kasuga</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ito</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Seki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Shimada</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamaguchi-Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2004</pubdate>
            <volume>38</volume>
            <issue>6</issue>
            <fpage>982</fpage>
            <lpage>993</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2004.02100.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15165189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Enhanced tolerance to environmental stress in transgenic plants expressing the transcriptional coactivator multiprotein bridging factor 1c</p>
            </title>
            <aug>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rizhsky</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Shuman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shulaev</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mittler</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2005</pubdate>
            <volume>139</volume>
            <issue>3</issue>
            <fpage>1313</fpage>
            <lpage>1322</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1283768</pubid>
                  <pubid idtype="pmpid" link="fulltext">16244138</pubid>
                  <pubid idtype="doi">10.1104/pp.105.070110</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Arabidopsis AtMYC2 (bHLH) and AtMYB2 (MYB) function as transcriptional activators in abscisic acid signaling</p>
            </title>
            <aug>
               <au>
                  <snm>Abe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Urao</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ito</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Seki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamaguchi-Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <fpage>63</fpage>
            <lpage>78</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">143451</pubid>
                  <pubid idtype="pmpid" link="fulltext">12509522</pubid>
                  <pubid idtype="doi">10.1105/tpc.006130</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Identification of novel heat shock factor-dependent genes and biochemical pathways in Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Busch</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wunderlich</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sch&#246;ffl</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>41</volume>
            <fpage>1</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2004.02272.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15610345</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Arabidopsis transcriptional activators CBF1, CBF2, and CBF3 have matching functional activities</p>
            </title>
            <aug>
               <au>
                  <snm>Gilmour</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Fowler</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Thomashow</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>54</volume>
            <issue>5</issue>
            <fpage>767</fpage>
            <lpage>781</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/B:PLAN.0000040902.06881.d4</pubid>
                  <pubid idtype="pmpid" link="fulltext">15356394</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Functional analysis of an Arabidopsis transcription factor, DREB2A, involved in drought-responsive gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Sakuma</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Maruyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Osakabe</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Qin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Seki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamaguchi-Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <issue>5</issue>
            <fpage>1292</fpage>
            <lpage>1309</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1456870</pubid>
                  <pubid idtype="pmpid" link="fulltext">16617101</pubid>
                  <pubid idtype="doi">10.1105/tpc.105.035881</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>AREB1 is a transcription activator of novel ABRE-dependent ABA signaling that enhances drought stress tolerance in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Fujita</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Fujita</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Satoh</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Maruyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Parvez</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Seki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hiratsu</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohme-Takagi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yamaguchi-Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2005</pubdate>
            <volume>17</volume>
            <issue>12</issue>
            <fpage>3470</fpage>
            <lpage>3488</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1315382</pubid>
                  <pubid idtype="pmpid" link="fulltext">16284313</pubid>
                  <pubid idtype="doi">10.1105/tpc.105.035659</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Arabidopsis transcription factors regulating cold acclimation</p>
            </title>
            <aug>
               <au>
                  <snm>van Buskirk</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Thomashow</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Physiol Plantarum</source>
            <pubdate>2006</pubdate>
            <volume>126</volume>
            <fpage>72</fpage>
            <lpage>80</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1399-3054.2006.00625.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>CONSTITUTIVELY PHOTOMORPHOGENIC1 is required for the UV-B response in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Oravecz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Baumann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>M&#225;t&#233;</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Brzezinska</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Molinier</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Oakeley</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Ad&#225;m</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sch&#228;fer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Nagy</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ulm</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <issue>8</issue>
            <fpage>1975</fpage>
            <lpage>1990</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1533968</pubid>
                  <pubid idtype="pmpid" link="fulltext">16829591</pubid>
                  <pubid idtype="doi">10.1105/tpc.105.040097</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Functional genomics by integrated analysis of metabolome and transcriptome of Arabidopsis plants over-expressing a MYB transcription factor</p>
            </title>
            <aug>
               <au>
                  <snm>Tohge</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nishiyama</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hirai</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Yano</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nakajima</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Awazuhara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Goodenowe</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Kitayama</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Noji</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yamazaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2005</pubdate>
            <volume>42</volume>
            <issue>2</issue>
            <fpage>218</fpage>
            <lpage>235</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2005.02371.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15807784</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Judging the quality of gene expression-based clustering methods using gene annotation</p>
            </title>
            <aug>
               <au>
                  <snm>Gibbons</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>FP</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <issue>10</issue>
            <fpage>1574</fpage>
            <lpage>1581</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187526</pubid>
                  <pubid idtype="pmpid" link="fulltext">12368250</pubid>
                  <pubid idtype="doi">10.1101/gr.397002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Validation and functional annotation of expression-based clusters based on gene ontology</p>
            </title>
            <aug>
               <au>
                  <snm>Steuer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Humburg</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>380</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1586215</pubid>
                  <pubid idtype="pmpid" link="fulltext">16911788</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-380</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Regulatory sequence analysis tools</p>
            </title>
            <aug>
               <au>
                  <snm>van Helden</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3593</fpage>
            <lpage>3596</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">168973</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824373</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg567</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>A higher-order background model improves the detection of promoter regulatory elements by Gibbs sampling</p>
            </title>
            <aug>
               <au>
                  <snm>Thijs</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lescot</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Marchal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>De Moor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rouz&#233;</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Moreau</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <issue>12</issue>
            <fpage>1113</fpage>
            <lpage>1122</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/17.12.1113</pubid>
                  <pubid idtype="pmpid" link="fulltext">11751219</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>MEME: discovering and analyzing DNA and protein sequence motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Misleh</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>WW</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>Web Server issue</issue>
            <fpage>W369</fpage>
            <lpage>W373</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1538909</pubid>
                  <pubid idtype="pmpid" link="fulltext">16845028</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Assessing computational tools for the discovery of transcription factor binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Tompa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>De Moor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Eskin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Favorov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Frith</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Makeev</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Noble</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Pavesi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>R&#233;gnier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Simonis</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sinha</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Thijs</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>van Helden</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vandenbogaert</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Workman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <fpage>137</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1053</pubid>
                  <pubid idtype="pmpid" link="fulltext">15637633</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes</p>
            </title>
            <aug>
               <au>
                  <snm>Pavesi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mereghetti</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mauri</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Web Server issue</issue>
            <fpage>W199</fpage>
            <lpage>W203</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441603</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215380</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh465</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>AGRIS: Arabidopsis gene regulatory information server, an information resource of Arabidopsis cis-regulatory elements and transcription factors</p>
            </title>
            <aug>
               <au>
                  <snm>Davuluri</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Palaniswamy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Molina</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Grotewold</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>25</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165662</pubid>
                  <pubid idtype="pmpid" link="fulltext">12795817</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-25</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes</p>
            </title>
            <aug>
               <au>
                  <snm>Thimm</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Bl&#228;sing</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gibon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nagel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kruger</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Stitt</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2004</pubdate>
            <volume>37</volume>
            <issue>6</issue>
            <fpage>914</fpage>
            <lpage>939</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2004.02016.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14996223</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>PLS-regression: a basic tool of chemometrics</p>
            </title>
            <aug>
               <au>
                  <snm>Wold</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sj&#246;str&#246;m</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Eriksson</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Chemom Intell Lab Syst</source>
            <pubdate>2001</pubdate>
            <volume>58</volume>
            <fpage>109</fpage>
            <lpage>130</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0169-7439(01)00155-1</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>PLS regression methods</p>
            </title>
            <aug>
               <au>
                  <snm>H&#246;skuldsson</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Chemom</source>
            <pubdate>1988</pubdate>
            <volume>2</volume>
            <fpage>211</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/cem.1180020306</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>The Arabidopsis Information Resource</p>
            </title>
            <aug>
               <au>
                  <cnm>TAIR</cnm>
               </au>
            </aug>
            <pubdate>2000</pubdate>
            <url>http://www.arabidopsis.org</url>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Summaries of Affymetrix GeneChip probe level data</p>
            </title>
            <aug>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bolstad</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Collin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cope</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hobbs</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>4</issue>
            <fpage>e15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">150247</pubid>
                  <pubid idtype="pmpid" link="fulltext">12582260</pubid>
                  <pubid idtype="doi">10.1093/nar/gng015</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Bioconductor: Open software development for computational biology and bioinformatics</p>
            </title>
            <aug>
               <au>
                  <snm>Gentleman</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Carey</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Bates</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bolstad</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dettling</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ellis</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gautier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ge</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gentry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hornik</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hothorn</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Huber</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Iacus</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Leisch</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Maechler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rossini</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sawitzki</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Smyth</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tierney</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R80</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545600</pubid>
                  <pubid idtype="pmpid" link="fulltext">15461798</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-10-r80</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <aug>
               <au>
                  <cnm>R Development Core Team</cnm>
               </au>
            </aug>
            <source>R: A language and environment for statistical computing</source>
            <publisher>R Foundation for Statistical Computing, Vienna, Austria</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Controlling the false discovery rate: a practical and powerful approach to multiple testing</p>
            </title>
            <aug>
               <au>
                  <snm>Benjamini</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hochberg</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>J R Stat Soc Ser B</source>
            <pubdate>1995</pubdate>
            <volume>57</volume>
            <fpage>289</fpage>
            <lpage>300</lpage>
         </bibl>
         <bibl id="B64">
            <title>
               <p>A Global Survey of Gene Regulation during Cold Acclimation in Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Hannah</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Heyer</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Hincha</snm>
                  <fnm>DK</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>e26</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1189076</pubid>
                  <pubid idtype="pmpid" link="fulltext">16121258</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0010026</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Extension of the visualization tool MapMan to allow statistical analysis of arrays, display of corresponding genes, and comparison with known responses</p>
            </title>
            <aug>
               <au>
                  <snm>Usadel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nagel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thimm</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Redestig</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Blaesing</snm>
                  <fnm>OE</fnm>
               </au>
               <au>
                  <snm>Palacios-Rojas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Selbig</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hannemann</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Piques</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Steinhauser</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Scheible</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Gibon</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Morcuende</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Weicht</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stitt</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2005</pubdate>
            <volume>138</volume>
            <issue>3</issue>
            <fpage>1195</fpage>
            <lpage>1204</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1176394</pubid>
                  <pubid idtype="pmpid" link="fulltext">16009995</pubid>
                  <pubid idtype="doi">10.1104/pp.105.060459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Gene Ontology: tool for the unification of biology</p>
            </title>
            <aug>
               <au>
                  <cnm>The Gene Ontology Consortium</cnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <issue>1</issue>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
                  <pubid idtype="doi">10.1038/75556</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
