<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2006-7-8-r70</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Statistical assessment of the global regulatory role of histone acetylation in <it>Saccharomyces cerevisiae</it></p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Yuan</snm>
               <fnm>Guo-Cheng</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>gyuan@cgr.harvard.edu</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Ma</snm>
               <fnm>Ping</fnm>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <email>pingma@uiuc.edu</email>
            </au>
            <au id="A3">
               <snm>Zhong</snm>
               <fnm>Wenxuan</fnm>
               <insr iid="I1"/>
               <email>wenxuan@stat.harvard.edu</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Liu</snm>
               <mi>S</mi>
               <fnm>Jun</fnm>
               <insr iid="I1"/>
               <email>jliu@stat.harvard.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Statistics, Harvard University, Cambridge, MA 02138, USA</p>
            </ins>
            <ins id="I2">
               <p>Bauer Center for Genomics Research, Harvard University, Cambridge, MA 02138, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Statistics, University of Illinois, Champaign, IL 61820, USA</p>
            </ins>
            <ins id="I4">
               <p>Institute for Genomic Biology, University of Illinois, Champaign, IL 61820, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>8</issue>
         <fpage>R70</fpage>
         <url>http://genomebiology.com/2006/7/8/R70</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16884527</pubid>
               <pubid idtype="doi">10.1186/gb-2006-7-8-r70</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>5</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>5</day>
               <month>6</month>
               <year>2006</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>2</day>
               <month>8</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>2</day>
               <month>8</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Yuan et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Histone acetylation in yeast</p>
      </shorttitle>
      <shortabs>
         <p>An analysis of genome-wide histone acetylation data using a few complementary statistical models gives support to a cumulative effect model for global histone acetylation.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Histone acetylation plays important but incompletely understood roles in gene regulation. A comprehensive understanding of the regulatory role of histone acetylation is difficult because many different histone acetylation patterns exist and their effects are confounded by other factors, such as transcription factor binding motif information and nucleosome occupancy.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We analyzed recent genome-wide histone acetylation data using a few complementary statistical models and tested the validity of a cumulative model in approximating the global regulatory effect of histone acetylation. Confounding effects due to transcription factor binding sequence information were estimated by using two independent motif-based algorithms followed by a variable selection method. We found that the sequence information has a significant role in regulating transcription, and we also found a clear additional histone acetylation effect. Our model fits well with observed genome-wide data. Strikingly, including more complicated combinatorial effects does not improve the model's performance. Through a statistical analysis of conditional independence, we found that H4 acetylation may not have a significant direct impact on global gene expression.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Decoding the combinatorial complexity of histone modification requires not only new data but also new methods to analyze the data. Our statistical analysis confirms that histone acetylation has a significant effect on gene transcription rates in addition to that attributable to upstream sequence motifs. Our analysis also suggests that a cumulative effect model for global histone acetylation is justified, although a more complex histone code may be important at specific gene loci. We also found that the regulatory roles among different histone acetylation sites have important differences.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Gene activities in eukaryotic cells are concertedly regulated by transcription factors and chromatin structure. The basic repeating unit of chromatin is the nucleosome, an octamer containing two copies each of four core histone proteins. Recent microarray-based studies <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp> have begun to uncover the global regulatory role of nucleosome positioning and modifications. While nucleosome occupancy in promoter regions typically occludes transcription factor binding, thereby repressing global gene expression <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>, the role of histone modification is more complex <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Histone tails can be modified in various ways, including acetylation, methylation, phosphorylation, and ubiquitination. Even the regulatory role of histone acetylation, the best characterized modification to date, is still not fully understood <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>Each of the four core histones contains several acetylable sites on its amino-terminal tail. Genome-wide histone acetylation data from <it>Saccharomyces cerevisiae </it><abbrgrp><abbr bid="B2">2</abbr><abbr bid="B8">8</abbr></abbrgrp> have offered new opportunities for us to evaluate the regulatory effects of histone acetylation at these lysine sites. In particular, both H3 and H4 acetylation levels were found to be positively correlated with gene transcription rates. However, a subtle but important issue in analyzing such data is that the effects of other potentially important factors not included in the analysis, generally termed confounding factors, cannot be revealed by simple correlation plots. It is unclear, for example, how much regulatory information associated with histone acetylation is redundant with the genomic sequence information. To gain insights into this, we conducted a statistical analysis by combining acetylation <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B4">4</abbr><abbr bid="B8">8</abbr></abbrgrp>, nucleosome occupancy <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B3">3</abbr><abbr bid="B8">8</abbr></abbrgrp>, gene upstream sequence information <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, and gene expression data <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> to investigate the effect of histone acetylation in the context of other regulatory factors in <it>S. cerevisiae</it>.</p>
         <p>A related question is whether different histone acetylation sites play similar roles in gene regulation. It is commonly postulated that H3 and H4 acetylation are both associated with global gene activation. Indeed, the acetylation levels of H3 and H4 across gene promoters have been shown to be highly correlated <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. However, other experimental studies have also suggested that H3 and H4 acetylations have different regulatory roles <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. We investigated the validity of a cumulative model for the regulatory effect of histone acetylation and also compared the regulatory effects of H3 and H4 acetylation in a coherent statistical framework.</p>
         <p>Another interesting question is whether combinatorial patterns of histone acetylation code for distinct regulatory information at a global level <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B11">11</abbr></abbrgrp>, with each pattern being recognized by a specific regulatory protein. If such codes exist, a large number of codes may result from combinations of different histone acetylation sites. On the other hand, if the effect is cumulative, multiple histone acetylation sites may be used to gradually control the interaction between nucleosomes or the stability of the regulatory proteins. Recent mutagenesis studies <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> have suggested that multiple H4 acetylation sites have a cumulative effect. Here we revisit this question using a statistical approach to combine available genome-wide data. Our analysis suggests that the simple additive-effect model is sufficient for fitting the available data.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Effect of histone acetylation on gene transcription rate</p>
            </st>
            <sec>
               <st>
                  <p>Standard analysis</p>
               </st>
               <p>We analyzed two recent genome-wide histone acetylation datasets <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B8">8</abbr></abbrgrp> (see Materials and methods for details about the data sources). Due to space limits, here we only present the results for Pokholok <it>et al</it>.'s data <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, with the discussion of Kurdistani <it>et al</it>.'s data <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> in Additional data file 1. Pokholok <it>et al</it>. measured acetylation levels at three different sites, H3K9, H3K14, and H4, with the last referring to non-specific acetylation on any of the four acetylable lysines on H4 tails.</p>
               <p>A typical analysis, when both histone acetylation data on a single site (for example, H3K9) and transcription rate data are available, is to simply correlate the two sets of measurements and to report the apparent significant statistical correlation between the two. When data on multiple acetylation sites are available, a slightly more formal analysis is to fit a linear regression model of the form:</p>
               <p>
                  <m:math name="gb-2006-7-8-R70-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>y</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mi>&#945;</m:mi>
                           <m:mo>+</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>j</m:mi>
                              </m:munder>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#946;</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:msub>
                              </m:mrow>
                           </m:mstyle>
                           <m:msub>
                              <m:mi>x</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>+</m:mo>
                           <m:msub>
                              <m:mi>&#949;</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mtext>equation&#160;</m:mtext>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8tuQ8FMI8Gi=hEeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciGacaGaaeqabaqadeqadaaakeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaOGaeyypa0JaeqySdeMaey4kaSYaaabuaeaacqaHYoGydaWgaaWcbaGaamOAaaqabaaabaGaamOAaaqab0GaeyyeIuoakiaadIhadaWgaaWcbaGaamyAaiaadQgaaeqaaOGaey4kaSIaeqyTdu2aaSbaaSqaaiaadMgaaeqaaOGaaiilaiaaxMaacaWLjaWaaeWaceaacaqGLbGaaeyCaiaabwhacaqGHbGaaeiDaiaabMgacaqGVbGaaeOBaiaabccacaaIXaaacaGLOaGaayzkaaaaaa@51BB@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>where <it>y</it><sub><it>i </it></sub>is the transcription rate of gene <it>i</it>, and <it>x</it><sub><it>ij</it></sub>, for <it>j </it>= 1,2,3, is the histone acetylation level of H3K9, H3K14, and H4, respectively. All data were log-transformed before analysis. This model is highly statistically significant for both intergenic (<it>p </it>value &lt; 2.0 &#215; 10<sup>-16</sup>) and coding regions (<it>p </it>value &lt; 2.0 &#215; 10<sup>-16</sup>). The association between gene expression and intergenic histone acetylation is commonly interpreted as a regulatory effect, whereas the correlation between gene expression and coding-region histone acetylation is believed to result from the passage of the transcriptional machinery through active genes.</p>
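               <p>As a minimal sketch of how a model of the form of equation 1 can be fitted, the following Python code uses ordinary least squares on synthetic data. The function name, the simulated covariates, and the coefficients are illustrative assumptions; they are not the data or the software used in this study.</p>

```python
import numpy as np

def fit_linear_model(x, y):
    """Fit y_i = alpha + sum_j beta_j * x_ij + eps_i by ordinary least squares.

    x has one row per gene and one column per covariate; y has one value per gene.
    Returns (alpha, beta, r_squared), where r_squared is the unadjusted R^2.
    """
    n = x.shape[0]
    design = np.column_stack([np.ones(n), x])  # intercept column, then covariates
    coef = np.linalg.lstsq(design, y, rcond=None)[0]
    residuals = y - design @ coef
    ss_res = float(np.sum(residuals ** 2))
    ss_tot = float(np.sum((y - y.mean()) ** 2))
    return coef[0], coef[1:], 1.0 - ss_res / ss_tot

# Synthetic stand-in for three log-transformed acetylation levels (assumed values).
rng = np.random.default_rng(0)
x_demo = rng.normal(size=(500, 3))
y_demo = 0.5 + x_demo @ np.array([0.4, 0.3, 0.0]) + rng.normal(size=500)
alpha_hat, beta_hat, r2_demo = fit_linear_model(x_demo, y_demo)
```

               <p>In this sketch the three columns of <it>x</it> play the role of the H3K9, H3K14, and H4 acetylation levels; further covariates can be accommodated by appending columns to the same array.</p>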
            </sec>
            <sec>
               <st>
                  <p>Significant confounding factors</p>
               </st>
               <p>Gene regulation is a complex process involving many contributing factors. Probably the best characterized factor for controlling gene transcription is the upstream sequence information. Although histone acetyltransferases (HATs) and histone deacetylases (HDACs) do not have obvious sequence specificity themselves, they may be recruited by transcription factors that recognize specific sequences. Thus, sequence information is an important confounding factor. Our main interest here is to delineate the roles of these factors and investigate whether histone acetylation provides any additional information on gene transcription. In the past decade, numerous computational methods have been developed to identify target sequences of transcription factors and to use such information to predict gene expression <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>.</p>
               <p>Another well-characterized property of the chromatin structure, the nucleosome occupancy, also plays an important role in gene regulation. Histone acetylation and nucleosome positioning are closely related events. Genome-scale, high-resolution nucleosome positioning data have led to the observation that transcription factor binding sites tend to be nucleosome-depleted <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Although genome-wide, high-resolution nucleosome positioning data are still unavailable, lower resolution data have already shown that gene expression levels are reciprocally correlated with nucleosome occupancy <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B3">3</abbr><abbr bid="B8">8</abbr></abbrgrp>. Therefore, nucleosome occupancy may also be an important confounding factor in explaining gene regulation.</p>
            </sec>
            <sec>
               <st>
                  <p>Refined analysis</p>
               </st>
               <p>We tested two different sequence motif-based methods to account for the <it>cis </it>regulatory information (see Materials and methods for details). As shown in Additional data file 1, the two methods gave remarkably consistent results. Here we present results from using MDscan <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, which infers sequence motif information <it>de novo</it>. The combined transcriptional control by transcription factor binding motifs (TFBMs), nucleosome occupancy, and histone acetylation is modeled as:</p>
               <p>
                  <m:math name="gb-2006-7-8-R70-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>y</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mi>&#945;</m:mi>
                           <m:mo>+</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>j</m:mi>
                              </m:munder>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#946;</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:msub>
                              </m:mrow>
                           </m:mstyle>
                           <m:msub>
                              <m:mi>x</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>+</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mi>j</m:mi>
                              </m:munder>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#951;</m:mi>
                                    <m:mi>j</m:mi>
                                 </m:msub>
                              </m:mrow>
                           </m:mstyle>
                           <m:msub>
                              <m:mi>z</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>+</m:mo>
                           <m:mi>&#948;</m:mi>
                           <m:msub>
                              <m:mi>w</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>+</m:mo>
                           <m:msub>
                              <m:mi>&#949;</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mtext>equation&#160;</m:mtext>
                                 <m:mn>2</m:mn>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8tuQ8FMI8Gi=hEeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciGacaGaaeqabaqadeqadaaakeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaOGaeyypa0JaeqySdeMaey4kaSYaaabuaeaacqaHYoGydaWgaaWcbaGaamOAaaqabaaabaGaamOAaaqab0GaeyyeIuoakiaadIhadaWgaaWcbaGaamyAaiaadQgaaeqaaOGaey4kaSYaaabuaeaacqaH3oaAdaWgaaWcbaGaamOAaaqabaaabaGaamOAaaqab0GaeyyeIuoakiaadQhadaWgaaWcbaGaamyAaiaadQgaaeqaaOGaey4kaSIaeqiTdqMaam4DamaaBaaaleaacaWGPbaabeaakiabgUcaRiabew7aLnaaBaaaleaacaWGPbaabeaakiaacYcacaWLjaGaaCzcamaabmGabaGaaeyzaiaabghacaqG1bGaaeyyaiaabshacaqGPbGaae4Baiaab6gacaqGGaGaaGOmaaGaayjkaiaawMcaaaaa@602F@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>where the <it>x</it><sub><it>ij </it></sub>values are the three histone acetylation levels (corresponding to H3K9, H3K14, and H4, respectively), the <it>z</it><sub><it>ij </it></sub>values are the scores for the 33 selected motifs, and <it>w</it><sub><it>i </it></sub>is the nucleosome occupancy level. Table <tblr tid="T1">1</tblr> shows the R<sup>2 </sup>(referring to the adjusted R-square statistic, which measures the fraction of explained variance after an adjustment for the number of parameters fitted; see page 231 of <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>) of the various linear models. One can see that a simple regression of transcription rates against histone acetylation without considering any other factors gave an R<sup>2 </sup>of 0.1841 (Table <tblr tid="T1">1</tblr>), implying that about 18% of the variation of the transcription rates is attributable to histone acetylation. In contrast, the regression of transcription rates against motif scores and nucleosome density levels (no histone acetylation) gave an R<sup>2 </sup>of 0.1997. The comprehensive model with all the variables we considered (equation 2) raised the R<sup>2 </sup>to 0.3262, indicating that histone acetylation does have a significant effect on the transcription rate, although a smaller one than the na&#239;ve model suggests.</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>Model performance (adjusted R<sup>2</sup>) with different covariates</p>
                  </caption>
                  <tblbdy cols="9">
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c cspan="4" ca="center">
                           <p>Intergenic regions</p>
                        </c>
                        <c cspan="4" ca="center">
                           <p>Coding regions</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Acetylation sites included</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                        <c ca="center">
                           <p>Seq</p>
                        </c>
                        <c ca="center">
                           <p>Nuc</p>
                        </c>
                        <c ca="center">
                           <p>Seq/Nuc</p>
                        </c>
                        <c ca="center">
                           <p>-</p>
                        </c>
                        <c ca="center">
                           <p>Seq</p>
                        </c>
                        <c ca="center">
                           <p>Nuc</p>
                        </c>
                        <c ca="center">
                           <p>Seq/Nuc</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>-</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                        <c ca="center">
                           <p>0.1387</p>
                        </c>
                        <c ca="center">
                           <p>0.1145</p>
                        </c>
                        <c ca="center">
                           <p>0.1997</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                        <c ca="center">
                           <p>0.1315</p>
                        </c>
                        <c ca="center">
                           <p>0.1440</p>
                        </c>
                        <c ca="center">
                           <p>0.2185</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>H3K9 and H3K14</p>
                        </c>
                        <c ca="center">
                           <p>0.1808</p>
                        </c>
                        <c ca="center">
                           <p>0.2700</p>
                        </c>
                        <c ca="center">
                           <p>0.2641</p>
                        </c>
                        <c ca="center">
                           <p>0.3208</p>
                        </c>
                        <c ca="center">
                           <p>0.1014</p>
                        </c>
                        <c ca="center">
                           <p>0.2059</p>
                        </c>
                        <c ca="center">
                           <p>0.2515</p>
                        </c>
                        <c ca="center">
                           <p>0.3068</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>H4</p>
                        </c>
                        <c ca="center">
                           <p>0.0849</p>
                        </c>
                        <c ca="center">
                           <p>0.2086</p>
                        </c>
                        <c ca="center">
                           <p>0.2487</p>
                        </c>
                        <c ca="center">
                           <p>0.3085</p>
                        </c>
                        <c ca="center">
                           <p>0.0222</p>
                        </c>
                        <c ca="center">
                           <p>0.1522</p>
                        </c>
                        <c ca="center">
                           <p>0.2131</p>
                        </c>
                        <c ca="center">
                           <p>0.2774</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>H3K9, H3K14, and H4</p>
                        </c>
                        <c ca="center">
                           <p>0.1841</p>
                        </c>
                        <c ca="center">
                           <p>0.2706</p>
                        </c>
                        <c ca="center">
                           <p>0.2704</p>
                        </c>
                        <c ca="center">
                           <p>0.3262</p>
                        </c>
                        <c ca="center">
                           <p>0.1957</p>
                        </c>
                        <c ca="center">
                           <p>0.2627</p>
                        </c>
                        <c ca="center">
                           <p>0.2619</p>
                        </c>
                        <c ca="center">
                           <p>0.3131</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>The adjusted R<sup>2 </sup>for the linear regression model (equation 2) containing different regulatory factors (Nuc, nucleosome occupancy; Seq, sequence information). (The adjusted R<sup>2 </sup>is related to the (unadjusted) R<sup>2 </sup>as <m:math name="gb-2006-7-8-R70-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>R</m:mi><m:mrow><m:mi>a</m:mi><m:mi>d</m:mi><m:mi>j</m:mi><m:mi>u</m:mi><m:mi>s</m:mi><m:mi>t</m:mi><m:mi>e</m:mi><m:mi>d</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup><m:mo>=</m:mo><m:mn>1</m:mn><m:mo>&#8722;</m:mo><m:mo stretchy="false">[</m:mo><m:mo stretchy="false">(</m:mo><m:mi>n</m:mi><m:mo>&#8722;</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo><m:mo>/</m:mo><m:mo stretchy="false">(</m:mo><m:mi>n</m:mi><m:mo>&#8722;</m:mo><m:mi>p</m:mi><m:mo>&#8722;</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">]</m:mo><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo>&#8722;</m:mo><m:msubsup><m:mi>R</m:mi><m:mrow><m:mi>u</m:mi><m:mi>n</m:mi><m:mi>a</m:mi><m:mi>d</m:mi><m:mi>j</m:mi><m:mi>u</m:mi><m:mi>s</m:mi><m:mi>t</m:mi><m:mi>e</m:mi><m:mi>d</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8tuQ8FMI8Gi=hEeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciGacaGaaeqabaqadeqadaaakeaacaWGsbWaa0baaSqaaiaadggacaWGKbGaamOAaiaadwhacaWGZbGaamiDaiaadwgacaWGKbaabaGaaGOmaaaakiabg2da9iaaigdacqGHsislcaGGBbGaaiikaiaad6gacqGHsislcaaIXaGaaiykaiaac+cacaGGOaGaamOBaiabgkHiTiaadchacqGHsislcaaIXaGaaiykaiaac2facaGGOaGaaGymaiabgkHiTiaadkfadaqhaaWcbaGaamyDaiaad6gacaWGHbGaamizaiaadQgacaWG1bGaam4CaiaadshacaWGLbGaamizaaqaaiaaikdaaaGccaGGPaaaaa@5992@</m:annotation></m:semantics></m:math>, where <it>n </it>is the sample size, and <it>p </it>is the number of explanatory variables in the linear regression model.)</p>
                  </tblfn>
               </tbl>
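               <p>The adjustment given in the footnote of Table <tblr tid="T1">1</tblr> can be illustrated with a minimal Python sketch. The function name adjusted_r2 and the numerical inputs below are illustrative assumptions; they are not code or values from this study.</p>
               <p><![CDATA[
```python
def adjusted_r2(r2: float, n: int, p: int) -> float:
    """Adjusted R^2 = 1 - [(n - 1) / (n - p - 1)] * (1 - R^2).

    n is the sample size and p is the number of explanatory variables.
    """
    if n - p - 1 <= 0:
        raise ValueError("the formula requires n > p + 1")
    return 1.0 - (n - 1) / (n - p - 1) * (1.0 - r2)


# Hypothetical inputs chosen only to show the size of the penalty.
print(round(adjusted_r2(0.5, n=101, p=10), 4))
```
]]></p>
               <p>Because the factor (n - 1)/(n - p - 1) is larger than 1 whenever p is positive, the adjusted value falls below the unadjusted one unless the fit is perfect, which is why it is the more conservative quantity for comparing models with different numbers of covariates.</p>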
               <p>To confirm that the above results indicate intrinsic statistical associations rather than artifacts of the statistical procedure, we validated our model using two independent methods. First, we tested whether applying the above procedure to random inputs would yield substantially worse performance than applying it to the real data. We generated 50 independent samples by random permutation (see Materials and methods). The R<sup>2 </sup>values for these randomized data are much smaller than those for the real data. For example, considering equation 2 fitted with sequence motif information only, the largest R<sup>2 </sup>for coding regions among the 50 randomly permuted samples is 0.0378 (Figure <figr fid="F1">1</figr>), compared to an R<sup>2 </sup>of 0.1315 for the real data. The differences between the R<sup>2 </sup>values are even larger if we also include histone acetylation and nucleosome occupancy in the model. Therefore, our model is able to extract real statistical associations. Second, we tested for overfitting using a five-fold cross-validation procedure (see Materials and methods). The root mean square (rms) errors for the training data were 1.500 (for intergenic regions) and 1.483 (for coding regions), whereas the rms errors for the testing data were 1.519 (for intergenic regions) and 1.498 (for coding regions). In both cases, the difference between the in-sample and out-of-sample errors is less than 2%, suggesting that overfitting is not an issue here.</p>
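               <p>The logic of the permutation check can be sketched in Python as follows. This is a simplified illustration under stated assumptions: it uses a single covariate (so that R<sup>2 </sup>equals the squared Pearson correlation) rather than the full equation 2, it shuffles the response values, and it runs on synthetic data. The exact permutation scheme used in this study is the one described in Materials and methods; all function and variable names here are hypothetical.</p>
               <p><![CDATA[
```python
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def r2_simple(x, y):
    """R^2 of a one-covariate linear regression (squared correlation)."""
    return pearson(x, y) ** 2


def permutation_r2(x, y, n_perm=50, seed=0):
    """R^2 values obtained after randomly permuting the response."""
    rng = random.Random(seed)
    values = []
    for _ in range(n_perm):
        y_perm = y[:]          # copy so the real data stay untouched
        rng.shuffle(y_perm)    # break any association with x
        values.append(r2_simple(x, y_perm))
    return values


# Synthetic data with a genuine linear association (assumption).
rng = random.Random(1)
x = [rng.gauss(0, 1) for _ in range(500)]
y = [0.5 * a + rng.gauss(0, 1) for a in x]

real_r2 = r2_simple(x, y)
null_r2 = permutation_r2(x, y)
print(f"real R^2 = {real_r2:.4f}, largest permuted R^2 = {max(null_r2):.4f}")
```
]]></p>
               <p>If the fitted association were an artifact of the procedure, the R<sup>2 </sup>for the real data would fall within the range of the permuted values instead of far above it.</p>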
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p>Model validation by comparing the R<sup>2 </sup>for the real versus randomly permuted datasets</p>
                  </caption>
                  <text>
                     <p>Model validation by comparing the R<sup>2 </sup>for the real versus randomly permuted datasets. The R<sup>2 </sup>values were obtained by applying the motif selection and equation 2 fitting procedures (with sequence motif information only) to randomly permuted and real data. The histogram is based on 50 randomly permuted samples. The arrow on the right marks the R<sup>2 </sup>for the real data. Results for the coding regions are shown here. See the main text for details.</p>
                  </text>
                  <graphic file="gb-2006-7-8-r70-1"/>
               </fig>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Multiple histone acetylation sites have cumulative regulatory effects</p>
            </st>
            <p>With the foregoing regression framework, we further investigated whether the combined effect of various histone acetylation sites could be approximated by a simple cumulative model, or whether a more complex 'combinatorial histone code' is needed. To gain a qualitative overview, we grouped genes according to their upstream histone acetylation patterns, each corresponding to a combination of high (greater than the 60th percentile) or low (less than the 40th percentile) histone acetylation levels at three acetylation sites. To avoid ambiguity due to measurement noise, genes falling in the middle 20% were not included in any group. This coarse-grained partition results in eight groups of genes with distinct upstream acetylation patterns. For example, one of these eight groups contains genes with high H3K9, high H3K14, and low H4 acetylation levels in their upstream intergenic regions. A higher H3K9 acetylation level is associated with a higher mean transcription rate (Table <tblr tid="T2">2</tblr>), regardless of the acetylation level at other sites. A similar but weaker pattern can be seen for H3K14 acetylation. In contrast, an increase in H4 acetylation level is associated with both elevated and reduced transcription rates. One possible explanation is that the regulatory effect of H4 acetylation depends on the acetylation level at other sites; another is that the H4 acetylation effect is weak overall. These relationships are not sensitive to the cutoff threshold for removing ambiguous genes.</p>
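            <p>The coarse-grained partition described above can be sketched in Python as follows. The nearest-rank percentile definition, the function names, and the toy input are assumptions made for illustration; the study does not specify how percentiles were computed.</p>
            <p><![CDATA[
```python
def percentile(values, q):
    """Nearest-rank percentile for an integer q in 1..100 (assumed definition)."""
    s = sorted(values)
    k = max(1, (q * len(s) + 99) // 100)  # integer ceiling of q * n / 100
    return s[k - 1]


def label(value, low_cut, high_cut):
    """Return 'Low', 'High', or None for the ambiguous middle band."""
    if value < low_cut:
        return "Low"
    if value > high_cut:
        return "High"
    return None


def group_genes(levels):
    """Group gene indices by their Low/High pattern across acetylation sites.

    levels maps a site name to a list of acetylation levels, one per gene.
    Genes that are ambiguous at any site are left out of every group.
    """
    sites = list(levels)
    cuts = {s: (percentile(levels[s], 40), percentile(levels[s], 60)) for s in sites}
    n_genes = len(levels[sites[0]])
    groups = {}
    for i in range(n_genes):
        pattern = tuple(label(levels[s][i], *cuts[s]) for s in sites)
        if None in pattern:
            continue
        groups.setdefault(pattern, []).append(i)
    return groups


# Toy input with ten genes (hypothetical values).
toy = {
    "H3K9": list(range(10)),
    "H3K14": list(range(10)),
    "H4": list(range(9, -1, -1)),
}
groups = group_genes(toy)
for pattern, genes in sorted(groups.items()):
    print(pattern, genes)
```
]]></p>
            <p>With three sites and two labels per site, at most eight patterns can occur; averaging the log-transformed transcription rates within each group would then give a summary of the kind reported in Table <tblr tid="T2">2</tblr>.</p>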
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Mean transcription rates (log-transformed) for genes with similar histone acetylation patterns</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>H3K9ac Low</p>
                     </c>
                     <c ca="center">
                        <p>H3K9ac High</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>H3K14ac Low</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>H4ac Low</p>
                     </c>
                     <c ca="center">
                        <p>-0.850</p>
                     </c>
                     <c ca="center">
                        <p>0.207</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>H4ac High</p>
                     </c>
                     <c ca="center">
                        <p>-0.522</p>
                     </c>
                     <c ca="center">
                        <p>0.307</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>H3K14ac High</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>H4ac Low</p>
                     </c>
                     <c ca="center">
                        <p>-0.454</p>
                     </c>
                     <c ca="center">
                        <p>0.816</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>H4ac High</p>
                     </c>
                     <c ca="center">
                        <p>-0.126</p>
                     </c>
                     <c ca="center">
                        <p>0.460</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>ac, acetylation.</p>
               </tblfn>
            </tbl>
            <p>As a quantitative validation of the above observation, we re-examined the validity of equation 2 in modeling the regulatory role of histone acetylation. We observed that including all quadratic interaction terms among the three histone acetylation covariates in the regression model does not appreciably improve the model fit (R<sup>2 </sup>= 0.3262 and 0.3278 without and with the interaction terms, respectively, for intergenic regions, and R<sup>2 </sup>= 0.3131 and 0.3132, respectively, for coding regions). The same conclusion holds when we do not include the sequence motif information and nucleosome occupancy data as covariates (R<sup>2 </sup>= 0.1841 and 0.1925, respectively, for intergenic regions). These observations suggest that the combinatorial effect is, at best, undetectable from the current data and that the simple cumulative model (equation 2) is sufficient. Similar results were obtained using the acetylation data in Kurdistani <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> (Additional data file 1).</p>
         </sec>
         <sec>
            <st>
               <p>H3 and H4 acetylation play different roles in gene regulation</p>
            </st>
            <p>A statistical measure of the effect of an individual factor/covariate on the response variable is the partial correlation (Materials and methods), which roughly reflects the 'pure' relationship between two variables while controlling for other factors. As shown in Table <tblr tid="T3">3</tblr>, the partial correlations between transcription rates and intergenic H3K9 and H3K14 acetylation levels, while controlling for the sequence information and H4 acetylation levels, are 0.25 and 0.21, respectively, whereas that between transcription rates and intergenic H4 acetylation levels is insignificant (-0.03). In addition, the difference between the effects of H3 and H4 acetylation is visually evident (Figure <figr fid="F2">2</figr>). The same phenomenon can be observed by comparing different regression models. As shown in Table <tblr tid="T1">1</tblr>, the adjusted R<sup>2 </sup>for the model without the H4 acetylation information is comparable to that for the full model, whereas the performance of the model without H3 acetylation is significantly poorer. Interestingly, the transcription rate is negatively correlated with coding region H4 acetylation. These observations suggest that while H3 acetylation plays an important role in global gene activation, H4 acetylation in the intergenic region has little global effect. Similar results were also obtained for the acetylation data in Kurdistani <it>et al</it>. (Additional data file 1).</p>
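            <p>As an illustration of what a partial correlation measures, the following Python sketch uses the standard first-order formula for a single control variable. It is not the computation used in this study (which is described in Materials and methods and also handles several control variables at once); the function names and the synthetic data are assumptions.</p>
            <p><![CDATA[
```python
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def partial_corr(x, y, z):
    """First-order partial correlation of x and y, controlling for one variable z."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / ((1 - rxz ** 2) * (1 - ryz ** 2)) ** 0.5


# Synthetic example: x and y are related only through the shared factor z.
rng = random.Random(0)
z = [rng.gauss(0, 1) for _ in range(2000)]
x = [v + rng.gauss(0, 1) for v in z]
y = [v + rng.gauss(0, 1) for v in z]

print(f"plain correlation   = {pearson(x, y):.3f}")
print(f"partial correlation = {partial_corr(x, y, z):.3f}")
```
]]></p>
            <p>In this synthetic case the plain correlation is clearly positive while the partial correlation is close to zero, which is the sense in which a partial correlation removes the contribution of a shared confounder.</p>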
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Partial correlation between covariate and transcription rates</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Intergenic regions</p>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Coding regions</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Covariate</p>
                     </c>
                     <c ca="center">
                        <p>Control variable</p>
                     </c>
                     <c ca="center">
                        <p>Partial correlation</p>
                     </c>
                     <c ca="center">
                        <p>Control variables</p>
                     </c>
                     <c ca="center">
                        <p>Partial correlation</p>
                     </c>
                     <c ca="center">
                        <p>Control variable</p>
                     </c>
                     <c ca="center">
                        <p>Partial correlation</p>
                     </c>
                     <c ca="center">
                        <p>Control variables</p>
                     </c>
                     <c ca="center">
                        <p>Partial correlation</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>H3K9</p>
                     </c>
                     <c ca="center">
                        <p>H4</p>
                     </c>
                     <c ca="center">
                        <p>0.3015</p>
                     </c>
                     <c ca="center">
                        <p>H4 and Seq</p>
                     </c>
                     <c ca="center">
                        <p>0.2507</p>
                     </c>
                     <c ca="center">
                        <p>H4</p>
                     </c>
                     <c ca="center">
                        <p>0.2439</p>
                     </c>
                     <c ca="center">
                        <p>H4 and Seq</p>
                     </c>
                     <c ca="center">
                        <p>0.2038</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>H3K14</p>
                     </c>
                     <c ca="center">
                        <p>H4</p>
                     </c>
                     <c ca="center">
                        <p>0.2359</p>
                     </c>
                     <c ca="center">
                        <p>H4 and Seq</p>
                     </c>
                     <c ca="center">
                        <p>0.2105</p>
                     </c>
                     <c ca="center">
                        <p>H4</p>
                     </c>
                     <c ca="center">
                        <p>0.4070</p>
                     </c>
                     <c ca="center">
                        <p>H4 and Seq</p>
                     </c>
                     <c ca="center">
                        <p>0.3473</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>H4</p>
                     </c>
                     <c ca="center">
                        <p>H3K9, H3K14</p>
                     </c>
                     <c ca="center">
                        <p>-0.0656</p>
                     </c>
                     <c ca="center">
                        <p>H3K9, H3K14 and Seq</p>
                     </c>
                     <c ca="center">
                        <p>-0.0344</p>
                     </c>
                     <c ca="center">
                        <p>H3K9, H3K14</p>
                     </c>
                     <c ca="center">
                        <p>-0.3245</p>
                     </c>
                     <c ca="center">
                        <p>H3K9, H3K14 and Seq</p>
                     </c>
                     <c ca="center">
                        <p>-0.2678</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The partial correlation between transcription rates and H3 (or H4) acetylation levels while controlling for the effects of H4 (or H3) acetylation and sequence information (Seq).</p>
               </tblfn>
            </tbl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Dependency of transcription rates on histone acetylation levels (ac) after controlling for confounding effects</p>
               </caption>
               <text>
                  <p>Dependency of transcription rates on histone acetylation levels (ac) after controlling for confounding effects. <b>(a) </b>Transcription rates versus intergenic H3K9 and K14 acetylation levels controlling for H4 acetylation levels. <b>(b) </b>Transcription rates versus intergenic H4 acetylation levels controlling for H3K9 and K14 acetylation levels. <b>(c) </b>Same as (a) except that coding region histone acetylation data are used. <b>(d) </b>Same as (b) except that coding region histone acetylation data are used. All data are log-transformed. Genes are sorted by transcription levels. A sliding smoothing window of 20 genes is applied to the transcription rates and histone acetylation data.</p>
               </text>
               <graphic file="gb-2006-7-8-r70-2"/>
            </fig>
            <p>Gcn5 is the catalytic component of the SAGA complex and preferentially acetylates H3 lysines, including K9, K14, and K18. Esa1 is the catalytic component of the NuA4 complex and preferentially acetylates H4 lysines. Based on the above analysis, we predicted that global gene expression would be significantly affected by the abundance of Gcn5 but not by that of Esa1. Genome-wide occupancy of Gcn5 and Esa1 has been measured previously <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Both enzymes were found to be globally associated with active genes, and the Pearson correlation between the occupancy levels of the two is as high as 0.7890, probably because they share a common component, Tra1, for recognizing targets <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. We modified equation 2 to estimate the association between Pol II occupancy <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and these two HATs. In this case, the response variable <it>y</it><sub><it>i </it></sub>is the log-ratio of Pol II binding, whereas the <it>x</it><sub><it>ij </it></sub>(<it>j </it>= 1,2) are the log-transformed binding ratios of Gcn5 and Esa1, respectively. Fitting the model using both Gcn5 and Esa1 data yields an R<sup>2 </sup>of 0.2365. If we remove Gcn5 from the model, the R<sup>2 </sup>is reduced to 0.1808. However, removing Esa1 causes little change in model performance (R<sup>2 </sup>= 0.2366). In addition, the partial correlation between the occupancy levels of Pol II and Gcn5 while controlling for Esa1 occupancy is 0.2690, whereas the partial correlation drops to only 0.0154 if the roles of Gcn5 and Esa1 are exchanged. Taken together, these results show that, indeed, Esa1 occupancy contributes only marginally to the global association with Pol II binding.</p>
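            <p>The nested-model comparison used for the two HATs can be illustrated with a small Python sketch. For a regression with an intercept and exactly two covariates, the unadjusted R<sup>2 </sup>can be written in terms of the three pairwise Pearson correlations, which keeps the example free of matrix algebra. The data are synthetic: the variable names hat_a, hat_b, and pol2 are hypothetical stand-ins, the correlation of 0.79 between the two covariates is chosen only to mimic a strongly correlated pair, and the response is constructed to depend on the first covariate alone.</p>
            <p><![CDATA[
```python
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def r2_two_predictors(y, x1, x2):
    """Unadjusted R^2 of y regressed on x1 and x2 (with intercept)."""
    r1, r2, r12 = pearson(y, x1), pearson(y, x2), pearson(x1, x2)
    return (r1 * r1 + r2 * r2 - 2 * r1 * r2 * r12) / (1 - r12 * r12)


# Synthetic data (assumption): two correlated covariates, response driven by the first.
rng = random.Random(0)
n = 2000
hat_a = [rng.gauss(0, 1) for _ in range(n)]
hat_b = [0.79 * a + (1 - 0.79 ** 2) ** 0.5 * rng.gauss(0, 1) for a in hat_a]
pol2 = [0.5 * a + rng.gauss(0, 1) for a in hat_a]

full = r2_two_predictors(pol2, hat_a, hat_b)
only_a = pearson(pol2, hat_a) ** 2  # model without hat_b
only_b = pearson(pol2, hat_b) ** 2  # model without hat_a
print(f"full = {full:.3f}, without hat_b = {only_a:.3f}, without hat_a = {only_b:.3f}")
```
]]></p>
            <p>Even though the redundant covariate is strongly correlated with the response on its own, dropping it leaves the fit almost unchanged, whereas dropping the informative covariate lowers R<sup>2 </sup>noticeably. This is the pattern that distinguishes a direct association from one inherited through a correlated partner.</p>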
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The global regulatory role of histone acetylation is still not fully understood and contradictory results have been reported in the literature <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B5">5</abbr><abbr bid="B8">8</abbr></abbrgrp>. Part of this inconsistency has been attributed to data analysis procedures <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. In this paper, we analyzed two recent ChIP-chip datasets <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B8">8</abbr></abbrgrp> in a statistically coherent framework. Our model isolates the regulatory role of individual acetylation sites and systematically controls for the effects of important confounding factors, thus resulting in a more detailed evaluation of the regulatory role of histone acetylation than previous studies <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B8">8</abbr></abbrgrp>. Interestingly, our analyses of the two aforementioned datasets yielded similar results, even though the biological interpretations in the original papers were drastically different.</p>
         <p>We found that the regulatory effect of histone acetylation can be well approximated by a simple regression model. In contrast to Kurdistani <it>et al</it>.'s claims <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, our results suggest that the currently available data support a simple cumulative effect model instead of a combinatorial code model of histone modifications as originally proposed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, consistent with a recent mutagenesis study <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> showing that three of the four acetylable sites on H4 tails are functionally redundant. It is worth noting that these results do not exclude the possibility that combinatorial control is critical at specific gene loci, but it is unlikely that a fully combinatorial code regulates global gene expression.</p>
         <p>We also quantitatively analyzed the regulatory effects due to individual acetylation sites. To our surprise, we found that the overall effects of H3 and H4 acetylation were quite different, at least statistically. In particular, while elevated H3 acetylation in promoter regions appears to be responsible for activating global gene expression, H4 acetylation seems to play a less important role. Levels of H3 and H4 acetylation in intergenic regions are closely coordinated by the binding of Gcn5 and Esa1, both of which have been found to bind to actively transcribed genes <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. However, our analysis suggests that Esa1 may not be important for global regulation, consistent with previous experimental studies by Kevin Struhl's group <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. In these studies, the authors showed that depletion of Esa1 causes a global decrease in H4 acetylation, but that only a small subset of genes responds with a significant change in transcription <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. They also found that the effect of H4 acetylation may be highly transcription factor specific <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. It will be interesting to further investigate whether there is any biological benefit for the co-recruitment of Esa1 and Gcn5 to activate genes.</p>
         <p>Histone modification in coding regions is often viewed as demarcating recent transcriptional events rather than playing a regulatory role. In this view, our analysis suggests that, along with methylation, acetylation also serves as a potent marker for transcription activities. On the other hand, H4 acetylation in coding regions may also have important regulatory roles. For example, the binding of the HDAC protein Hos2 to coding regions is important for active transcription <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B20">20</abbr></abbrgrp>. The negative partial correlation between transcriptional activities and H4 acetylation levels is consistent with the aforementioned experimental results.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Data sources</p>
            </st>
            <p>Datasets analyzed in this study include those for histone acetylation, nucleosome occupancy, gene expression, and genome sequence. In two recently published papers, genome-wide histone acetylation levels at eleven sites <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and three sites <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> in yeast were measured using ChIP-chip. A major difference in experimental procedure between these two studies is that the acetylated DNA was hybridized against nucleosomal DNA on microarrays in Pokholok <it>et al</it>.'s study <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, but was hybridized against the genomic DNA in Kurdistani <it>et al</it>.'s study <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Since in the latter dataset histone acetylation was confounded with nucleosome occupancy, our discussion in the main text is focused on analyzing Pokholok <it>et al</it>.'s data. To compare the results from the two experiments, we repeated our analysis procedure on a normalized version of Kurdistani <it>et al</it>.'s data after removing its dependency on nucleosome occupancy. We found that the main conclusions remain the same. Detailed analysis of Kurdistani <it>et al</it>.'s data is presented in Additional data file 1.</p>
            <p>In addition, several groups have measured genome-wide nucleosome occupancy in yeast <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B3">3</abbr><abbr bid="B8">8</abbr></abbrgrp>. We chose to utilize Pokholok <it>et al</it>.'s nucleosome occupancy data in our analysis as well, since nucleosome occupancy has a clear effect on gene regulation <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. We used Bernstein <it>et al</it>.'s transcription rate data <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> as the response variable in our study of the relationship between gene transcription and histone acetylation. These transcription rates were estimated by dividing the transcription levels by the corresponding half-lives <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Due to concern that measured microarray data may vary significantly among different microarray platforms or research groups <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, we repeated our analysis using an independent dataset <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The results obtained from the two gene transcription datasets are similar (Additional data file 1).</p>
            <p>After removing genes (with their corresponding intergenic and coding regions) that have missing data in any of the above datasets, we merged all the data into a single dataset, which contains 3,049 intergenic and 3,384 coding regions. The genomic sequence of <it>S. cerevisiae </it>was downloaded from the <it>Saccharomyces </it>Genome Database <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The promoter sequences (up to 800 base pairs (bp) upstream of the translation start site of each gene) were extracted for <it>cis </it>regulatory signal analysis.</p>
         </sec>
         <sec>
            <st>
               <p>Delineating <it>cis </it>regulatory information</p>
            </st>
            <p>Transcription factors regulate genes by binding to transcription factor binding sites (TFBSs), which are short sequence segments (approximately 10 bp) located near genes' transcription start sites (TSSs). In yeast, these binding sites are mostly within 500 bp upstream of each gene's TSS. It has been shown that a gene's expression pattern can be predicted to a great extent by its upstream sequence information <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. We took two different approaches to accommodate sequence information in our analysis of the histone acetylation effect.</p>
            <p>In our first approach, we conducted <it>de novo </it>TFBS predictions using MDscan <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> among the upstream sequences of the genes that were transcribed at high rates <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In particular, this algorithm searched for enriched sequence motifs of widths 5 to 15 in the promoter sequences, resulting in 580 statistically significant, possibly overlapping, candidate TFBMs (<it>p </it>value &lt; 0.05). We then used these motif patterns to scan all promoter regions for matches so as to compute a motif score for each TFBM at each promoter <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. To avoid overfitting, we selected a subset of 33 functional motifs based on the association of the motif score of a promoter with the transcription rate of the corresponding gene. In particular, we used both a linear regression procedure, Motif Regressor <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, and a model-free method, regularized sliced inverse regression (RSIR) <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, as explained below.</p>
            <p>Our second approach to account for the <it>cis </it>regulatory information was to directly use the 666 transcription factor binding motifs reported by Beer and Tavazoie <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, which combine computational predictions made using AlignACE <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> with 51 experimentally derived motifs <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. Since these motifs have been shown to have a high predictive power for gene expression patterns, they may also be informative for predicting transcription rate. Out of these 666 motifs, our linear regression and RSIR procedures (see below) found 15 that are highly relevant to predicting gene transcription rates.</p>
         </sec>
         <sec>
            <st>
               <p>Model-free motif selection</p>
            </st>
            <p>RSIR <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> is a statistical method for dimension reduction and variable selection. It assumes that gene <it>i</it>'s transcription rate <it>y</it><sub><it>i </it></sub>and its sequence motif scores <b>x</b><sub><it>i </it></sub>= (<it>x</it><sub><it>i</it>1</sub>,...,<it>x</it><sub><it>iM</it></sub>)<sup><it>T </it></sup>are related as:</p>
            <p>
               <m:math name="gb-2006-7-8-R70-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:msub>
                           <m:mi>y</m:mi>
                           <m:mi>i</m:mi>
                        </m:msub>
                        <m:mo>=</m:mo>
                        <m:mi>f</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:msubsup>
                           <m:mi>&#946;</m:mi>
                           <m:mn>1</m:mn>
                           <m:mi>T</m:mi>
                        </m:msubsup>
                        <m:msub>
                           <m:mi>x</m:mi>
                           <m:mi>i</m:mi>
                        </m:msub>
                        <m:mo>,</m:mo>
                        <m:msubsup>
                           <m:mi>&#946;</m:mi>
                           <m:mn>2</m:mn>
                           <m:mi>T</m:mi>
                        </m:msubsup>
                        <m:msub>
                           <m:mi>x</m:mi>
                           <m:mi>i</m:mi>
                        </m:msub>
                        <m:mo>,</m:mo>
                        <m:mn>...</m:mn>
                        <m:mo>,</m:mo>
                        <m:msubsup>
                           <m:mi>&#946;</m:mi>
                           <m:mi>k</m:mi>
                           <m:mi>T</m:mi>
                        </m:msubsup>
                        <m:msub>
                           <m:mi>x</m:mi>
                           <m:mi>i</m:mi>
                        </m:msub>
                        <m:mo>,</m:mo>
                        <m:msub>
                           <m:mi>&#949;</m:mi>
                           <m:mi>i</m:mi>
                        </m:msub>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>,</m:mo>
                        <m:mtext>&#160;&#160;&#160;&#160;&#160;</m:mtext>
                        <m:mrow>
                           <m:mo>(</m:mo>
                           <m:mrow>
                              <m:mtext>equation&#160;</m:mtext>
                              <m:mn>3</m:mn>
                           </m:mrow>
                           <m:mo>)</m:mo>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfeBSjuyZL2yd9gzLbvyNv2Caerbhv2BYDwAHbqedmvETj2BSbqee0evGueE0jxyaibaiKI8=vI8tuQ8FMI8Gi=hEeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciGacaGaaeqabaqadeqadaaakeaacaWG5bWaaSbaaSqaaiaadMgaaeqaaOGaeyypa0JaamOzaiaacIcaiiqacqWFYoGydaqhaaWcbaGaaGymaaqaaiaadsfaaaacbeGccaGF4bWaaSbaaSqaaiaadMgaaeqaaOGaaiilaiab=j7aInaaDaaaleaacaaIYaaabaGaamivaaaakiaa+HhadaWgaaWcbaGaamyAaaqabaGccaGGSaGaaiOlaiaac6cacaGGUaGaaiilaiab=j7aInaaDaaaleaacaWGRbaabaGaamivaaaakiaa+HhadaWgaaWcbaGaamyAaaqabaGccqaH1oqzdaWgaaWcbaGaamyAaaqabaGccaGGPaGaaiilaiaaxMaacaWLjaWaaeWaceaacaqGLbGaaeyCaiaabwhacaqGHbGaaeiDaiaabMgacaqGVbGaaeOBaiaabccacaaIZaaacaGLOaGaayzkaaaaaa@5CBB@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where <it>f</it>() is an unknown (and possibly nonlinear) function, <b>&#946;</b><sub><it>l </it></sub>= (<it>&#946;</it><sub><it>l</it>1</sub>,...,<it>&#946;</it><sub><it>lM</it></sub>)<sup><it>T</it></sup>, (<it>l </it>= 1,...,<it>k</it>), are vectors of linear coefficients, and <it>&#949;</it><sub><it>i </it></sub>represents the noise. The number <it>k </it>is called the dimension of the model. A linear regression model is the special case of equation 3 with <it>k </it>= 1 and a linear <it>f</it>(). RSIR estimates both <it>k </it>and the <b>&#946;</b><sub><it>l </it></sub>values without estimating <it>f</it>(). Many of the <it>&#946;</it><sub><it>lj </it></sub>values are close to zero, which implies that the corresponding motif scores contribute very little in equation 2; we therefore retain only those motifs whose coefficient <it>&#946;</it><sub><it>lj </it></sub>is significantly nonzero.</p>
            <p>We applied RSIR to the 580 candidate motifs selected by MDscan and the 666 motifs from <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, with the transcription rate as the response variable. In both cases, <it>k </it>was estimated as 1, and <it>f</it>() showed a strong linear pattern. We found 104 and 69 motifs, respectively, that have significantly nonzero coefficients in our RSIR model.</p>
            <p>Previous applications suggest that RSIR is conservative in selecting variables <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. We applied the stepwise regression algorithm (a recursive method commonly used for variable selection; see page 347 of <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>) to further reduce the number of motifs. In the end, a total of 33 motifs from MDscan and 15 motifs from <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> were retained for further use. These motifs represent our summary of sequence-specific information on gene transcription rates.</p>
         </sec>
         <sec>
            <st>
               <p>Model validation</p>
            </st>
            <p>To assess the significance of our model for controlling the confounding effects due to sequence information, we randomly permuted the transcription rate data 50 times and repeated the same statistical procedure on each permuted data set: identifying motif candidates using MDscan, selecting the most significant motifs using RSIR, and fitting the linear regression model. The distribution of R<sup>2 </sup>obtained for these randomized data was used as a baseline to evaluate the significance of our statistical procedure.</p>
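            <p>The permutation baseline described above can be illustrated with a minimal Python sketch. It is a simplification under stated assumptions: a single predictor stands in for the full MDscan, RSIR and regression pipeline, and the function names <it>r_squared</it> and <it>permutation_baseline</it> are hypothetical rather than taken from the original analysis.</p>

```python
import random


def r_squared(xs, ys):
    """R^2 of a one-predictor least-squares fit (squared Pearson correlation)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy * sxy / (sxx * syy)


def permutation_baseline(xs, ys, n_perm=50, seed=0):
    """R^2 values obtained after shuffling the response n_perm times."""
    rng = random.Random(seed)
    baseline = []
    for _ in range(n_perm):
        shuffled = list(ys)
        rng.shuffle(shuffled)  # breaks any real link between xs and ys
        # In the full procedure, every model-building step would be
        # repeated here on the shuffled response (assumption: one step).
        baseline.append(r_squared(xs, shuffled))
    return baseline


if __name__ == "__main__":
    gen = random.Random(1)
    xs = [gen.gauss(0.0, 1.0) for _ in range(200)]
    ys = [0.8 * x + gen.gauss(0.0, 1.0) for x in xs]
    observed = r_squared(xs, ys)
    baseline = permutation_baseline(xs, ys)
    print("observed R^2:", observed)
    print("largest permuted R^2:", max(baseline))
```

            <p>Comparing the observed R<sup>2 </sup>with the permuted values indicates how much of the fit could arise by chance from the model-building procedure alone.</p>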
            <p>We also performed a five-fold cross-validation procedure to test whether equation 2 overfits the data. In particular, the full set of genes (see the Data sources section above) was randomly partitioned into five subsets of equal size. Each subset was used for testing in turn, with the rest used for training. For each training subset, the sequence motifs were inferred using MDscan, RSIR, and stepwise regression methods. We fit the model in equation 2 using the training data and then evaluated the out-of-sample error by applying it to the testing data. The in-sample and out-of-sample root mean square errors were then compared.</p>
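            <p>A minimal Python sketch of the five-fold comparison of in-sample and out-of-sample root mean square errors follows. It assumes a one-predictor linear model in place of equation 2 and omits motif selection; the helper names <it>fit_line</it>, <it>rmse</it> and <it>five_fold_rmse</it> are hypothetical.</p>

```python
import math
import random


def fit_line(xs, ys):
    """Least-squares intercept and slope for one predictor."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return my - slope * mx, slope


def rmse(model, xs, ys):
    """Root mean square error of the fitted line on the given points."""
    intercept, slope = model
    sq = [(y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys)]
    return math.sqrt(sum(sq) / len(sq))


def five_fold_rmse(xs, ys, n_folds=5, seed=0):
    """Return one (in-sample RMSE, out-of-sample RMSE) pair per fold."""
    order = list(range(len(xs)))
    random.Random(seed).shuffle(order)
    folds = [order[i::n_folds] for i in range(n_folds)]
    results = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in order if i not in held_out]
        train_x = [xs[i] for i in train_idx]
        train_y = [ys[i] for i in train_idx]
        # Any variable selection must also be done here, on training data only,
        # otherwise the held-out error is optimistically biased.
        model = fit_line(train_x, train_y)
        in_err = rmse(model, train_x, train_y)
        out_err = rmse(model, [xs[i] for i in test_idx], [ys[i] for i in test_idx])
        results.append((in_err, out_err))
    return results


if __name__ == "__main__":
    gen = random.Random(2)
    xs = [gen.gauss(0.0, 1.0) for _ in range(100)]
    ys = [1.5 * x + gen.gauss(0.0, 0.5) for x in xs]
    for in_err, out_err in five_fold_rmse(xs, ys):
        print(round(in_err, 3), round(out_err, 3))
```

            <p>If the out-of-sample errors are close to the in-sample errors across folds, there is little evidence of overfitting.</p>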
         </sec>
         <sec>
            <st>
               <p>Partial correlation</p>
            </st>
            <p>Let <it>X </it>and <it>Y </it>represent two random variables and let <it>Z </it>= (<it>Z</it><sub>1</sub>,<it>Z</it><sub>2</sub>,...,<it>Z</it><sub><it>p</it></sub>) be a set of control random variables. The linear relationship between <it>X </it>and <it>Z </it>can be estimated via a linear regression model <it>X </it>= <it>&#945;</it><sub><it>X </it></sub>+ <it>Z&#946;</it><sub><it>X </it></sub>+ <it>&#949;</it><sub><it>X</it></sub>, and similarly for that between <it>Y </it>and <it>Z</it>. The residuals <it>&#949;</it><sub><it>X </it></sub>and <it>&#949;</it><sub><it>Y </it></sub>contain the information left unexplained by <it>Z</it>. The partial correlation between <it>X </it>and <it>Y </it>while controlling for <it>Z </it>is defined as the Pearson correlation between <it>&#949;</it><sub><it>X </it></sub>and <it>&#949;</it><sub><it>Y</it></sub>.</p>
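            <p>The definition above translates directly into code. The following Python sketch regresses each variable on the controls by ordinary least squares and correlates the two sets of residuals. It is an illustration only: the function names are hypothetical, and the small Gaussian-elimination solver assumes the control variables are not collinear.</p>

```python
import math


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations act on both sides at once.
    m = [list(a[r]) + [b[r]] for r in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def ols_residuals(v, z_rows):
    """Residuals of regressing v on an intercept plus the columns of z_rows."""
    design = [[1.0] + list(row) for row in z_rows]
    p = len(design[0])
    # Normal equations: (D^T D) coef = D^T v.
    dtd = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    dtv = [sum(d[i] * vi for d, vi in zip(design, v)) for i in range(p)]
    coef = solve_linear_system(dtd, dtv)
    return [vi - sum(c * di for c, di in zip(coef, d)) for d, vi in zip(design, v)]


def pearson(u, w):
    """Pearson correlation between two equal-length sequences."""
    mu, mw = sum(u) / len(u), sum(w) / len(w)
    cov = sum((a - mu) * (b - mw) for a, b in zip(u, w))
    var_u = sum((a - mu) ** 2 for a in u)
    var_w = sum((b - mw) ** 2 for b in w)
    return cov / math.sqrt(var_u * var_w)


def partial_correlation(x, y, z_rows):
    """Correlation between x and y after removing the linear effect of z_rows."""
    return pearson(ols_residuals(x, z_rows), ols_residuals(y, z_rows))


if __name__ == "__main__":
    z = [[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]]
    e = [1.0, -1.0, 2.0, -2.0, 0.5, -0.5]
    x = [zi[0] + ei for zi, ei in zip(z, e)]
    y = [zi[0] - ei for zi, ei in zip(z, e)]
    print("raw correlation:", pearson(x, y))
    print("partial correlation:", partial_correlation(x, y, z))
```

            <p>In this toy example the two variables share the control but have opposite residual components, so the partial correlation differs sharply from the raw correlation.</p>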
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper: Additional data file <supplr sid="S1">1</supplr> contains supporting text, figures, and tables. The adequacy of the linear regression, normalization of the Kurdistani <it>et al</it>. data, and sensitivity issues are discussed in further detail in the text. The figures and tables included demonstrate and compare the Pokholok <it>et al</it>. and Kurdistani <it>et al</it>. data.</p>
         <suppl id="S1">
            <title>
               <p>Additional File 1</p>
            </title>
            <caption>
               <p>Supporting text, figures, and tables</p>
            </caption>
            <text>
               <p>Supporting text, figures, and tables relating to the Pokholok <it>et al</it>. and Kurdistani <it>et al</it>. data.</p>
            </text>
            <file name="gb-2006-7-8-r70-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Oliver Rando for helpful discussions. GY was partially supported by the Bauer Center for Genomics Research. PM was partially supported by a research board grant from the University of Illinois. JSL acknowledges support from NSF DMS-0204674 and a grant (10228102) from NSF China.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Global nucleosome occupancy in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Humphrey</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Perlstein</snm>
                  <fnm>EO</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R62</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">522869</pubid>
                  <pubid idtype="pmpid" link="fulltext">15345046</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-9-r62</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Mapping global histone acetylation patterns to gene expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Kurdistani</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Tavazoie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grunstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2004</pubdate>
            <volume>117</volume>
            <fpage>721</fpage>
            <lpage>733</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2004.05.023</pubid>
                  <pubid idtype="pmpid" link="fulltext">15186774</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Evidence for nucleosome depletion at active regulatory regions genome-wide.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Strahl</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Lieb</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2004</pubdate>
            <volume>36</volume>
            <fpage>900</fpage>
            <lpage>905</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1400</pubid>
                  <pubid idtype="pmpid" link="fulltext">15247917</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Global position and recruitment of HATs and HDACs in the yeast genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Pokholok</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Chandy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rolfe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Workman</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Gifford</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2004</pubdate>
            <volume>16</volume>
            <fpage>199</fpage>
            <lpage>209</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2004.09.021</pubid>
                  <pubid idtype="pmpid" link="fulltext">15494307</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genomic characterization reveals a simple histone H4 acetylation code.</p>
            </title>
            <aug>
               <au>
                  <snm>Dion</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Altschuler</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Rando</snm>
                  <fnm>OJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>5501</fpage>
            <lpage>5506</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">555684</pubid>
                  <pubid idtype="pmpid" link="fulltext">15795371</pubid>
                  <pubid idtype="doi">10.1073/pnas.0500136102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Genome-scale identification of nucleosome positions in <it>S. cerevisiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Yuan</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>YJ</fnm>
               </au>
               <au>
                  <snm>Dion</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Slack</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Altschuler</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Rando</snm>
                  <fnm>OJ</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>309</volume>
            <fpage>626</fpage>
            <lpage>630</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1112178</pubid>
                  <pubid idtype="pmpid" link="fulltext">15961632</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Single-nucleosome mapping of histone modifications in <it>S. cerevisiae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Kaplan</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Buratowski</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rando</snm>
                  <fnm>OJ</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e328</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1195719</pubid>
                  <pubid idtype="pmpid" link="fulltext">16122352</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030328</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Genome-wide map of nucleosome acetylation and methylation in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Pokholok</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rolfe</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Herbolsheimer</snm>
                  <fnm>E</fnm>
               </au>
               <etal/>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>122</volume>
            <fpage>517</fpage>
            <lpage>527</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.06.026</pubid>
                  <pubid idtype="pmpid" link="fulltext">16122420</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>The language of covalent histone modifications.</p>
            </title>
            <aug>
               <au>
                  <snm>Strahl</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>41</fpage>
            <lpage>45</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/47412</pubid>
                  <pubid idtype="pmpid" link="fulltext">10638745</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Cellular memory and the histone code.</p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>BM</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>111</volume>
            <fpage>285</fpage>
            <lpage>291</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)01080-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12419240</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Signaling network model of chromatin.</p>
            </title>
            <aug>
               <au>
                  <snm>Schreiber</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>BE</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>111</volume>
            <fpage>771</fpage>
            <lpage>778</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)01196-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">12526804</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Histone acetyltransferases.</p>
            </title>
            <aug>
               <au>
                  <snm>Roth</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Denu</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Allis</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>2001</pubdate>
            <volume>70</volume>
            <fpage>81</fpage>
            <lpage>120</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.70.1.81</pubid>
                  <pubid idtype="pmpid" link="fulltext">11395403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Histone acetylation and deacetylation in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Kurdistani</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Grunstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>276</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1075</pubid>
                  <pubid idtype="pmpid" link="fulltext">12671650</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p><it>Saccharomyces </it>Genome Database</p>
            </title>
            <url>http://www.yeastgenome.org/</url>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Dissecting the regulatory circuitry of a eukaryotic genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Holstege</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Jennings</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Wyrick</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Hengartner</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>717</fpage>
            <lpage>728</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)81641-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9845373</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Precision and functional specificity in mRNA decay.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Herschlag</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>5860</fpage>
            <lpage>5865</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">122867</pubid>
                  <pubid idtype="pmpid" link="fulltext">11972065</pubid>
                  <pubid idtype="doi">10.1073/pnas.092538799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Histone acetylation at promoters is differentially affected by specific activators and repressors.</p>
            </title>
            <aug>
               <au>
                  <snm>Deckert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Struhl</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2001</pubdate>
            <volume>21</volume>
            <fpage>2726</fpage>
            <lpage>2735</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">86903</pubid>
                  <pubid idtype="pmpid" link="fulltext">11283252</pubid>
                  <pubid idtype="doi">10.1128/MCB.21.8.2726-2735.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Requirement of Hos2 histone deacetylase for gene activity in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kurdistani</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Grunstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>1412</fpage>
            <lpage>1414</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1077790</pubid>
                  <pubid idtype="pmpid" link="fulltext">12434058</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Genome-wide binding map of the histone deacetylase Rpd3 in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Kurdistani</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Robyr</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tavazoie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grunstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>248</fpage>
            <lpage>254</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng907</pubid>
                  <pubid idtype="pmpid" link="fulltext">12089521</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Genomewide analysis of nucleosome density histone acetylation and HDAC function in fission yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Wiren</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Silverstein</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Sinha</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Walfridsson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Laurenson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pillus</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Robyr</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Grunstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ekwall</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2005</pubdate>
            <volume>24</volume>
            <fpage>2906</fpage>
            <lpage>2918</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1187943</pubid>
                  <pubid idtype="pmpid" link="fulltext">16079916</pubid>
                  <pubid idtype="doi">10.1038/sj.emboj.7600758</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Bayesian models for multiple local sequence alignment and Gibbs sampling strategies.</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Neuwald</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>J Am Stat Assoc</source>
            <pubdate>1995</pubdate>
            <volume>90</volume>
            <fpage>1156</fpage>
            <lpage>1170</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2291508</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation.</p>
            </title>
            <aug>
               <au>
                  <snm>Roth</snm>
                  <fnm>FP</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Estep</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>1998</pubdate>
            <volume>16</volume>
            <fpage>939</fpage>
            <lpage>945</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1098-939</pubid>
                  <pubid idtype="pmpid" link="fulltext">9788350</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Regulatory element detection using correlation with expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Bussemaker</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Siggia</snm>
                  <fnm>ED</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>27</volume>
            <fpage>167</fpage>
            <lpage>171</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/84792</pubid>
                  <pubid idtype="pmpid" link="fulltext">11175784</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments.</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Brutlag</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2002</pubdate>
            <volume>20</volume>
            <fpage>835</fpage>
            <lpage>839</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12101404</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Integrating regulatory motif discovery and genome-wide expression analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Conlon</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>XS</fnm>
               </au>
               <au>
                  <snm>Lieb</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>3339</fpage>
            <lpage>3344</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">152294</pubid>
                  <pubid idtype="pmpid" link="fulltext">12626739</pubid>
                  <pubid idtype="doi">10.1073/pnas.0630591100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Predicting gene expression from sequence.</p>
            </title>
            <aug>
               <au>
                  <snm>Beer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tavazoie</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2004</pubdate>
            <volume>117</volume>
            <fpage>185</fpage>
            <lpage>198</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(04)00304-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">15084257</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <aug>
               <au>
                  <snm>Neter</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kutner</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Wasserman</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Nachtsheim</snm>
                  <fnm>CJ</fnm>
               </au>
            </aug>
            <source>Applied Linear Statistical Models</source>
            <publisher>Singapore; Boston: McGraw-Hill</publisher>
            <edition>4</edition>
            <pubdate>1996</pubdate>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Recruitment of HAT complexes by direct activator interactions with the ATM-related Tra1 subunit.</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sousa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Alley</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Carrozza</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Tan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Workman</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>292</volume>
            <fpage>2333</fpage>
            <lpage>2337</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1060214</pubid>
                  <pubid idtype="pmpid" link="fulltext">11423663</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>A new map for navigating the yeast epigenome.</p>
            </title>
            <aug>
               <au>
                  <snm>Schübeler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>BM</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>122</volume>
            <fpage>489</fpage>
            <lpage>492</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2005.08.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">16122415</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Coordinate regulation of yeast ribosomal protein genes is associated with targeted recruitment of Esa1 histone acetylase.</p>
            </title>
            <aug>
               <au>
                  <snm>Reid</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>VR</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Struhl</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2000</pubdate>
            <volume>6</volume>
            <fpage>1297</fpage>
            <lpage>1307</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1097-2765(00)00128-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">11163204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Eaf3 regulates the global pattern of histone acetylation in <it>Saccharomyces cerevisiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Reid</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Moqtaderi</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Struhl</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2004</pubdate>
            <volume>24</volume>
            <fpage>757</fpage>
            <lpage>764</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">343795</pubid>
                  <pubid idtype="pmpid" link="fulltext">14701747</pubid>
                  <pubid idtype="doi">10.1128/MCB.24.2.757-764.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Standardizing global gene expression analysis between laboratories and across platforms.</p>
            </title>
            <aug>
               <au>
                  <snm>Bammler</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Beyer</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Bhattacharya</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Boorman</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Boyles</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bradford</snm>
                  <fnm>BU</fnm>
               </au>
               <au>
                  <snm>Bumgarner</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Bushel</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Chaturvedi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>D</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Methods</source>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <fpage>351</fpage>
            <lpage>356</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nmeth754</pubid>
                  <pubid idtype="pmpid" link="fulltext">15846362</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Multiple-laboratory comparison of microarray platforms.</p>
            </title>
            <aug>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Warren</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Spencer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>IF</fnm>
               </au>
               <au>
                  <snm>Biswal</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Frank</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Gabrielson</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Garcia</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Geoghegan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Germino</snm>
                  <fnm>G</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Methods</source>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <fpage>345</fpage>
            <lpage>350</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nmeth756</pubid>
                  <pubid idtype="pmpid" link="fulltext">15846361</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Independence and reproducibility across microarray platforms.</p>
            </title>
            <aug>
               <au>
                  <snm>Larkin</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Frank</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Gavras</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sultana</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nat Methods</source>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <fpage>337</fpage>
            <lpage>344</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nmeth757</pubid>
                  <pubid idtype="pmpid" link="fulltext">15846360</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>RSIR: regularized sliced inverse regression for motif discovery.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>4169</fpage>
            <lpage>4175</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti680</pubid>
                  <pubid idtype="pmpid" link="fulltext">16166098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Computational identification of cis-regulatory elements associated with groups of functionally related genes in <it>Saccharomyces cerevisiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Estep</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Tavazoie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>296</volume>
            <fpage>1205</fpage>
            <lpage>1214</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.3519</pubid>
                  <pubid idtype="pmpid" link="fulltext">10698627</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TI</fnm>
               </au>
               <au>
                  <snm>Rinaldi</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Odom</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Bar-Joseph</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Hannett</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Harbison</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>799</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075090</pubid>
                  <pubid idtype="pmpid" link="fulltext">12399584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
