<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-32</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Microarray analysis of the <it>in vivo </it>sequence preferences of a minor groove binding drug</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Eckdahl</snm>
               <mi>T</mi>
               <fnm>Todd</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>eckdahl@missouriwestern.edu</email>
            </au>
            <au id="A2">
               <snm>Brown</snm>
               <mi>D</mi>
               <fnm>Adam</fnm>
               <insr iid="I1"/>
               <email>adamdbrown84@hotmail.com</email>
            </au>
            <au id="A3">
               <snm>Hart</snm>
               <mi>N</mi>
               <fnm>Steven</fnm>
               <insr iid="I1"/>
               <email>shart3@kumc.edu</email>
            </au>
            <au id="A4">
               <snm>Malloy</snm>
               <mi>J</mi>
               <fnm>Kelly</fnm>
               <insr iid="I1"/>
               <email>kjm6938@missouriwestern.edu</email>
            </au>
            <au id="A5">
               <snm>Shott</snm>
               <fnm>Martha</fnm>
               <insr iid="I2"/>
               <email>mashott@math.ucdavis.edu</email>
            </au>
            <au id="A6">
               <snm>Yiu</snm>
               <fnm>Gloria</fnm>
               <insr iid="I3"/>
               <email>gloria.yiu@pomona.edu</email>
            </au>
            <au id="A7">
               <snm>Hoopes</snm>
               <mnm>L Mays</mnm>
               <fnm>Laura</fnm>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <email>lhoopes@pomona.edu</email>
            </au>
            <au id="A8">
               <snm>Heyer</snm>
               <mi>J</mi>
               <fnm>Laurie</fnm>
               <insr iid="I2"/>
               <insr iid="I4"/>
               <email>laheyer@davidson.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Biology Department, Missouri Western State University, Saint Joseph, MO, 64507, USA</p>
            </ins>
            <ins id="I2">
               <p>Mathematics Department, Davidson College, Davidson, NC, 28035, USA</p>
            </ins>
            <ins id="I3">
               <p>Biology Department, Pomona College, Claremont, CA, 91711, USA</p>
            </ins>
            <ins id="I4">
               <p>Genome Consortium for Active Teaching, Davidson College, Davidson, NC, 28035, USA</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>32</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/32</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18215295</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-32</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>16</day>
               <month>9</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>23</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>23</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Eckdahl et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Minor groove binding drugs (MGBDs) interact with DNA in a sequence-specific manner and can cause changes in gene expression at the level of transcription. They serve as valuable models for protein interactions with DNA and form an important class of antitumor, antiviral, antitrypanosomal and antibacterial drugs. There is a need to extend knowledge of the sequence requirements for MGBDs from <it>in vitro </it>DNA binding studies to living cells.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here we describe the use of microarray analysis to discover yeast genes that are affected by treatment with the MGBD berenil, thereby allowing the investigation of its sequence requirements for binding <it>in vivo</it>. A novel approach to sequence analysis allowed us to address hypotheses about genes that were directly or indirectly affected by drug binding. The results show that the sequence features of A/T richness and heteropolymeric character discovered by <it>in vitro </it>berenil binding studies are found upstream of genes hypothesized to be directly affected by berenil but not upstream of those hypothesized to be indirectly affected or those shown to be unaffected.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The data support the conclusion that effects of berenil on gene expression in yeast cells can be explained by sequence patterns discovered by <it>in vitro </it>binding experiments. The results shed light on the sequence and structural rules by which berenil binds to DNA and affects the transcriptional regulation of genes and contribute generally to the development of MGBDs as tools for basic and applied research.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Improved understanding of the sequence rules by which small molecules bind to DNA and alter patterns of gene expression advances both basic and applied research. In both of these contexts, molecules that bind noncovalently in the DNA minor groove with sequence-selective recognition have drawn considerable attention <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Minor groove binding drugs (MGBDs) have served as useful models for protein components of the transcriptional machinery since they can be more experimentally tractable than their macromolecular counterparts. For example, the understanding of the mechanism of action of TATA box binding protein, a general transcription factor required for proper initiation of transcription by RNA polymerase II, has been furthered using the MGBDs distamycin A, Hoechst 33258, and netropsin <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. The observation that the MGBD berenil affects mitochondrial function and aerobic respiration in yeast suggests that it alters genome-wide patterns of gene expression <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. A long-standing goal in drug development has been to create agents that can target specific genes in cells, altering patterns of gene expression in a clinically relevant manner. Advances in synthetic organic chemistry, molecular biology, and biochemistry, and specifically in genomics and functional genomics, have brought the goal of developing more effective drugs within reach. MGBDs have attracted attention because of their demonstrated antitumor, antiviral, antibacterial, and antitrypanosomal activities <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. 
For example, brostallicin, a derivative of distamycin A, has been shown to be cytotoxic to tumor cells <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and underwent a phase I clinical investigation involving patients with advanced solid tumors <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. MGBDs in the category of lexitropsins have been shown to have anticancer properties as well; moderate cytotoxicity in human MCF-7 breast cancer cells was exhibited by analogues of bis-netropsin <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Berenil is a member of a family of MGBDs found to be useful in the treatment of infectious diseases caused by <it>Pneumocystis jiroveci </it>and trypanosomes <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, diseases of concern in AIDS patients. The design of MGBDs as agents that have more potency but fewer side effects relies on a more thorough understanding of the ways in which MGBDs effect changes at the level of transcription by interacting with promoter DNA and transcription factors of specific genes in the context of chromatin.</p>
         <p>MGBDs have cationic charges, a narrow molecular cross section, and a concave shape, allowing them to fit into the narrow minor groove of DNA <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. MGBD binding depends on the spine of hydration, van der Waals interactions with the floor or walls of the minor groove, charge interactions between cationic drug groups and negative electrostatic sequences, the shapes of base pairs, and the width of the minor groove. Because these features vary in a sequence-dependent manner, MGBDs exhibit sequence selectivity properties that have been the subject of intense study using <it>in vitro </it>binding to oligonucleotides and polymeric DNA. MGBDs have been shown to exhibit a preference for short tracts of A/T-rich sequences <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The length of DNA protected by MGBDs varies from 4 to 6 bp <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The sequence specificity of the MGBD berenil was investigated by measurement of its binding affinity to hairpin oligonucleotides containing all 512 possible 5-mer sequences <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The results showed that, of the 512 sequences studied, all the sequences that were entirely A+T ranked among the fifty best binding sequences, supporting the conclusion that berenil prefers A/T-rich binding sites. Berenil is also apparently able to discriminate among sequences composed only of A+T. The two best binding sequences found in Boger <it>et al. </it>were ATATT and AATAT. Binding of berenil to sequences of the form GGGG(A/T)<sub>4</sub>GGGG was studied using electrospray ionization mass spectrometry and it was found to bind sequences in the order ATAT > AATT > AAAA <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. DNase I footprinting studies revealed the binding of berenil to ATAT, AATT, TAAT, TTAA, and TATA <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. 
Rotational viscometry measurements found the sites of highest berenil binding strength to be alternating helical A/T segments <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. An overarching conclusion from these studies is that berenil prefers to bind to A/T-rich sequences that are heteropolymeric, with A and T alternating on the same strand.</p>
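         <p>The two sequence features named in this conclusion, A/T richness and heteropolymeric character, can be made concrete with a small scoring sketch. The Python functions below are illustrative assumptions, not a method taken from the cited binding studies: the names at_fraction and alternation_fraction are hypothetical, and measuring heteropolymeric character as the share of A/T dinucleotide steps that read AT or TA is only one simple choice among several.</p>
         <p>
```python
def at_fraction(seq):
    """Fraction of bases in seq that are A or T (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return sum(base in "AT" for base in seq) / len(seq)


def alternation_fraction(seq):
    """Share of A/T dinucleotide steps on one strand that switch base.

    Only steps in which both bases are A or T are considered. A step
    counts as heteropolymeric when it reads AT or TA, and as
    homopolymeric when it reads AA or TT. Returns 0.0 when the
    sequence contains no A/T step at all.
    """
    seq = seq.upper()
    steps = [seq[i:i + 2] for i in range(len(seq) - 1)]
    at_steps = [step for step in steps if set(step).issubset({"A", "T"})]
    if not at_steps:
        return 0.0
    return sum(step in ("AT", "TA") for step in at_steps) / len(at_steps)


if __name__ == "__main__":
    # Tetramers from the mass spectrometry comparison described in the text.
    for site in ("ATAT", "AATT", "AAAA"):
        print(site, at_fraction(site), alternation_fraction(site))
```
         </p>
         <p>Under these definitions the tetramers ATAT, AATT, and AAAA are all fully A/T but have alternation fractions of 1, 1/3, and 0, the same order as the binding preference reported from the mass spectrometry study. The agreement is illustrative only, since the measure ignores groove geometry and flanking sequence.</p>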
         <p>There is a need to extend the study of the sequence requirements for MGBDs to the context of living cells. We chose to contribute to this effort by investigating the effects of berenil on yeast gene expression, enabling an examination of its <it>in vivo </it>sequence binding requirements. Our experimental data and analysis promise to contribute to the body of knowledge of MGBDs with basic and applied research applications.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <sec>
            <st>
               <p>Measuring effects of berenil on yeast mRNA levels</p>
            </st>
            <p>The approach we took to finding putative <it>in vivo </it>binding sites for berenil was to consider the yeast genome as a bank of DNA sequences that, in the context of chromatin and the environment of the nucleus, have various affinities for the drug. Among these are sequences whose role in the regulation of transcription could be affected by berenil binding. Changes in gene expression at the level of transcription could therefore occur directly through mechanisms such as interfering with transcription factor binding or altering chromatin structure <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B17">17</abbr></abbrgrp>. Indirect effects on transcript levels may also occur through gene regulatory networks or general stress response <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. We sought to use microarray analysis to determine the set of yeast genes that are directly or indirectly affected at the level of transcription by berenil. We then addressed the significant challenge of distinguishing between these two categories during analysis of the upstream regions of affected genes.</p>
         </sec>
         <sec>
            <st>
               <p>Generation of an affected gene list</p>
            </st>
            <p>Our experimental method was to culture yeast in the presence and absence of berenil, isolate total RNA, and conduct microarray hybridizations to measure changes in steady state transcript levels for all yeast genes. We gathered data suitable for analysis from five experiments, with two whole-genome microarrays in each experiment and two of the experiments conducted as dye swaps of the other three. We analyzed microarray data using MicroArray Genome Imaging and Clustering (MAGIC) Tool <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Once MAGIC Tool produced files of foreground and background intensities for all of the microarray experiments, we used Excel to analyze the data. Differences in the performance of the two dyes were accounted for by normalization <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. A series of filtering criteria were used to ensure that only reliable data were used in the production of an affected gene list. As described in the Methods section, the criteria for inclusion of the data for a given ORF were 1) the sum of the intensities for the two channels had to be greater than the minimum median of that sum for all the experiments, 2) the foreground for at least one channel had to be double the background, 3) data for a feature had to pass the first two filters in at least 5 of the 10 measurements of that feature, 4) data for a feature had to pass the first two filters in both Cy3- and Cy5-labelled samples, and 5) the coefficient of variation of log transformed ratios across experiments had to be less than one. Applying these criteria produced a list of ORFs for which we had reliable microarray data, and from it a final list of 52 genes whose mRNA levels decreased and two whose levels increased upon berenil treatment. Notably, 52 of the 54 reliably affected genes were negatively affected; this observation is likely to be relevant to discovery of the mechanism of action of berenil and other MGBDs.</p>
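            <p>The five inclusion criteria can be sketched in code. The Python below is a minimal illustration under stated assumptions, not the Excel workflow used in this study: the Measurement record, its field names, and the min_sum threshold argument are hypothetical; criterion 4 is interpreted as requiring at least one passing measurement in each dye orientation; intensities are used without background subtraction or dye normalization; and the coefficient of variation is taken as the sample standard deviation divided by the absolute mean of the log2 ratios.</p>
            <p>
```python
import math
import statistics
from collections import namedtuple

# Hypothetical record for one spot on one array. treated_dye names the dye
# ("Cy3" or "Cy5") that labelled the berenil-treated sample on that array.
Measurement = namedtuple(
    "Measurement", ["cy3_fg", "cy3_bg", "cy5_fg", "cy5_bg", "treated_dye"]
)


def passes_spot_filters(m, min_sum):
    """Criteria 1 and 2 for a single measurement."""
    # Extra guard (an assumption): both foregrounds must be positive so
    # that a log ratio can be computed later.
    positive = m.cy3_fg > 0 and m.cy5_fg > 0
    # Criterion 1: summed channel intensity exceeds the supplied threshold.
    bright_enough = (m.cy3_fg + m.cy5_fg) > min_sum
    # Criterion 2: foreground is at least double background in one channel.
    above_background = (m.cy3_fg >= 2 * m.cy3_bg) or (m.cy5_fg >= 2 * m.cy5_bg)
    return positive and bright_enough and above_background


def log2_ratio(m):
    """log2(treated / untreated) for one measurement, honouring dye swaps."""
    if m.treated_dye == "Cy3":
        treated, untreated = m.cy3_fg, m.cy5_fg
    else:
        treated, untreated = m.cy5_fg, m.cy3_fg
    return math.log2(treated / untreated)


def feature_is_reliable(measurements, min_sum, min_passing=5, max_cv=1.0):
    """Apply criteria 3 to 5 to all measurements of one feature."""
    passing = [m for m in measurements if passes_spot_filters(m, min_sum)]
    # Criterion 3: enough passing measurements (at least 2 for a std. dev.).
    if not len(passing) >= max(min_passing, 2):
        return False
    # Criterion 4 (as interpreted here): both dye orientations represented.
    if {m.treated_dye for m in passing} != {"Cy3", "Cy5"}:
        return False
    # Criterion 5: coefficient of variation of the log ratios below max_cv.
    ratios = [log2_ratio(m) for m in passing]
    mean = statistics.mean(ratios)
    if mean == 0:
        return False  # CV is undefined when the mean log ratio is zero
    return max_cv > statistics.stdev(ratios) / abs(mean)
```
            </p>
            <p>In this sketch the threshold for criterion 1 is passed in rather than computed, because deriving the minimum median of the summed intensities requires the full data set for every array.</p>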
         </sec>
         <sec>
            <st>
               <p>Real time PCR validation of microarray results</p>
            </st>
            <p>To validate the microarray results by an independent method, we performed real time PCR measurements for selected genes. Quantitative reverse transcription real time PCR with SYBR Green reporting was used to generate the data presented in Table <tblr tid="T1">1</tblr>. The gene TUB1 was used as a standard as described <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and was also unaffected in our microarray experiments. The genes STF2, HSP78, and SPI1 were found to have lower steady state levels of mRNA according to the microarray data and also had lower levels according to real time PCR. Although the genes appeared in the same order with regard to the magnitude of the effect, the real time PCR data showed changes of greater magnitude (more negative log<sub>2</sub> ratios), an effect that may be due to signal saturation in the microarray experimental approach.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Validation of microarray data with real time PCR</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Gene</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Microarray</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Real Time PCR</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>TUB1</p>
                     </c>
                     <c ca="center">
                        <p>-0.11</p>
                     </c>
                     <c ca="center">
                        <p>0.00 (standard)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>STF2</p>
                     </c>
                     <c ca="center">
                        <p>-1.14</p>
                     </c>
                     <c ca="center">
                        <p>-1.81</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>HSP78</p>
                     </c>
                     <c ca="center">
                        <p>-1.42</p>
                     </c>
                     <c ca="center">
                        <p>-2.60</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>SPI1</p>
                     </c>
                     <c ca="center">
                        <p>-1.51</p>
                     </c>
                     <c ca="center">
                        <p>-2.74</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Expression ratios (log<sub>2</sub>-transformed, treated to untreated) of mRNA levels for selected genes inferred from microarray analysis versus real time PCR.</p>
               </tblfn>
            </tbl>
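            <p>Because Table <tblr tid="T1">1</tblr> reports log<sub>2</sub>-transformed ratios, it can help to convert them back to linear fold changes and to check the rank order of the genes directly. The short Python sketch below does this with the values copied from the table; the function names are hypothetical and the code only illustrates the arithmetic.</p>
            <p>
```python
# log2(treated / untreated) ratios copied from Table 1,
# stored as (microarray, real time PCR) for each gene.
TABLE1_LOG2 = {
    "TUB1": (-0.11, 0.00),
    "STF2": (-1.14, -1.81),
    "HSP78": (-1.42, -2.60),
    "SPI1": (-1.51, -2.74),
}


def fold_change(log2_ratio):
    """Convert a log2 ratio to a linear treated/untreated ratio."""
    return 2.0 ** log2_ratio


def rank_order(column):
    """Gene names sorted from most to least decreased for one method.

    column 0 selects the microarray values, column 1 the real time PCR values.
    """
    return sorted(TABLE1_LOG2, key=lambda gene: TABLE1_LOG2[gene][column])


if __name__ == "__main__":
    for gene, (array_log2, pcr_log2) in TABLE1_LOG2.items():
        print(gene, round(fold_change(array_log2), 2), round(fold_change(pcr_log2), 2))
    # True when both methods rank the genes identically.
    print(rank_order(0) == rank_order(1))
```
            </p>
            <p>A log<sub>2</sub> ratio of -1 corresponds to a halving of the transcript level, so STF2, HSP78, and SPI1 each fell to less than half of the untreated level by both methods.</p>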
         </sec>
         <sec>
            <st>
               <p>Direct and indirect effects of berenil</p>
            </st>
            <p>The list of genes whose steady state transcript levels were shown by our microarray analysis to be reliably affected by berenil includes two genes whose levels increased. According to the Saccharomyces Genome Database (SGD), their functions are in phosphate metabolism and processing of 20S pre-rRNA <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Of the 52 genes whose mRNA levels decreased, 14 are involved in stress response, nine in carbohydrate metabolism, four in electron transport, two in meiosis and mitosis, one in regulation of redox homeostasis, one in regulation of proteolysis, one in salinity response, one in vacuole fusion, one in response to metals, one in phosphate metabolism, and one in DNA repair, according to the SGD. The remaining 16 genes have not had functions assigned to them by SGD. The list of affected genes is likely to include some that are directly affected by berenil. Genes in this category are expected to have upstream transcriptional control regions that include berenil binding sites. Genes whose transcript levels changed by indirect drug effects are also likely to be in the list. Such genes would not be expected to contain berenil binding sites in their upstream control regions. Although we cannot determine which genes may be indirectly affected through gene regulatory networks, we note that there are a number of genes that function in stress response. We chose to test the hypothesis that these 14 genes are indirectly affected by berenil and that the remaining genes are directly affected. Table <tblr tid="T2">2</tblr> lists the 40 genes hypothesized to be directly affected while Table <tblr tid="T3">3</tblr> lists those hypothesized to be indirectly affected, each with an expression ratio, function, and molecular process, if known.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Direct hypothesis category of yeast genes shown by microarray analysis to be affected by berenil treatment</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>
                           <b>ORF</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Gene</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Ratio</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Process</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YBR233W-A</p>
                     </c>
                     <c ca="left">
                        <p>DAD3</p>
                     </c>
                     <c ca="center">
                        <p>-3.45</p>
                     </c>
                     <c ca="left">
                        <p>mitosis</p>
                     </c>
                     <c ca="left">
                        <p>protein binding activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Q0130</p>
                     </c>
                     <c ca="left">
                        <p>OLI1</p>
                     </c>
                     <c ca="center">
                        <p>-2.75</p>
                     </c>
                     <c ca="left">
                        <p>ATP synthesis coupled proton transport</p>
                     </c>
                     <c ca="left">
                        <p>ATP synthase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YDR070C</p>
                     </c>
                     <c ca="left">
                        <p>FMP16</p>
                     </c>
                     <c ca="center">
                        <p>-2.43</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YEL039C</p>
                     </c>
                     <c ca="left">
                        <p>CYC7</p>
                     </c>
                     <c ca="center">
                        <p>-2.41</p>
                     </c>
                     <c ca="left">
                        <p>electron transport</p>
                     </c>
                     <c ca="left">
                        <p>electron carrier activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YLR327C</p>
                     </c>
                     <c ca="left">
                        <p>TMA10</p>
                     </c>
                     <c ca="center">
                        <p>-2.35</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YJL156W-A</p>
                     </c>
                     <c ca="left">
                        <p>YJL156W-A</p>
                     </c>
                     <c ca="center">
                        <p>-2.25</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YMR105C</p>
                     </c>
                     <c ca="left">
                        <p>PGM2</p>
                     </c>
                     <c ca="center">
                        <p>-2.23</p>
                     </c>
                     <c ca="left">
                        <p>glucose 1-phosphate utilization</p>
                     </c>
                     <c ca="left">
                        <p>phosphoglucomutase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YGR248W</p>
                     </c>
                     <c ca="left">
                        <p>SOL4</p>
                     </c>
                     <c ca="center">
                        <p>-2.00</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YHR087W</p>
                     </c>
                     <c ca="left">
                        <p>YHR087W</p>
                     </c>
                     <c ca="center">
                        <p>-1.95</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YPR160W</p>
                     </c>
                     <c ca="left">
                        <p>GPH1</p>
                     </c>
                     <c ca="center">
                        <p>-1.76</p>
                     </c>
                     <c ca="left">
                        <p>glycogen catabolism</p>
                     </c>
                     <c ca="left">
                        <p>glycogen phosphorylase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YLR178C</p>
                     </c>
                     <c ca="left">
                        <p>TFS1</p>
                     </c>
                     <c ca="center">
                        <p>-1.76</p>
                     </c>
                     <c ca="left">
                        <p>regulation of proteolysis</p>
                     </c>
                     <c ca="left">
                        <p>lipid binding activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YOR173W</p>
                     </c>
                     <c ca="left">
                        <p>DCS2</p>
                     </c>
                     <c ca="center">
                        <p>-1.72</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YEL011W</p>
                     </c>
                     <c ca="left">
                        <p>GLC3</p>
                     </c>
                     <c ca="center">
                        <p>-1.66</p>
                     </c>
                     <c ca="left">
                        <p>glycogen metabolism</p>
                     </c>
                     <c ca="left">
                        <p>1,4-alpha-glucan branching enzyme activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YMR081C</p>
                     </c>
                     <c ca="left">
                        <p>ISF1</p>
                     </c>
                     <c ca="center">
                        <p>-1.64</p>
                     </c>
                     <c ca="left">
                        <p>aerobic respiration</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YER150W</p>
                     </c>
                     <c ca="left">
                        <p>SPI1</p>
                     </c>
                     <c ca="center">
                        <p>-1.51</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YER067W</p>
                     </c>
                     <c ca="left">
                        <p>YER067W</p>
                     </c>
                     <c ca="center">
                        <p>-1.50</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YOR031W</p>
                     </c>
                     <c ca="left">
                        <p>CRS5</p>
                     </c>
                     <c ca="center">
                        <p>-1.45</p>
                     </c>
                     <c ca="left">
                        <p>response to metal ion</p>
                     </c>
                     <c ca="left">
                        <p>copper ion binding activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YIL136W</p>
                     </c>
                     <c ca="left">
                        <p>OM45</p>
                     </c>
                     <c ca="center">
                        <p>-1.43</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YPL230W</p>
                     </c>
                     <c ca="left">
                        <p>YPL230W</p>
                     </c>
                     <c ca="center">
                        <p>-1.41</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YOR178C</p>
                     </c>
                     <c ca="left">
                        <p>GAC1</p>
                     </c>
                     <c ca="center">
                        <p>-1.38</p>
                     </c>
                     <c ca="left">
                        <p>meiosis</p>
                     </c>
                     <c ca="left">
                        <p>protein phosphatase type 1 activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YFR053C</p>
                     </c>
                     <c ca="left">
                        <p>HXK1</p>
                     </c>
                     <c ca="center">
                        <p>-1.35</p>
                     </c>
                     <c ca="left">
                        <p>fructose metabolism</p>
                     </c>
                     <c ca="left">
                        <p>hexokinase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YOR120W</p>
                     </c>
                     <c ca="left">
                        <p>GCY1</p>
                     </c>
                     <c ca="center">
                        <p>-1.33</p>
                     </c>
                     <c ca="left">
                        <p>salinity response</p>
                     </c>
                     <c ca="left">
                        <p>aldo-keto reductase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YFR017C</p>
                     </c>
                     <c ca="left">
                        <p>YFR017C</p>
                     </c>
                     <c ca="center">
                        <p>-1.32</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YOR374W</p>
                     </c>
                     <c ca="left">
                        <p>ALD4</p>
                     </c>
                     <c ca="center">
                        <p>-1.31</p>
                     </c>
                     <c ca="left">
                        <p>ethanol metabolism</p>
                     </c>
                     <c ca="left">
                        <p>aldehyde dehydrogenase (NAD+) activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YJR096W</p>
                     </c>
                     <c ca="left">
                        <p>YJR096W</p>
                     </c>
                     <c ca="center">
                        <p>-1.29</p>
                     </c>
                     <c ca="left">
                        <p>arabinose metabolism</p>
                     </c>
                     <c ca="left">
                        <p>oxidoreductase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YFR015C</p>
                     </c>
                     <c ca="left">
                        <p>GSY1</p>
                     </c>
                     <c ca="center">
                        <p>-1.27</p>
                     </c>
                     <c ca="left">
                        <p>glycogen metabolism</p>
                     </c>
                     <c ca="left">
                        <p>glycogen (starch) synthase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YDR453C</p>
                     </c>
                     <c ca="left">
                        <p>TSA2</p>
                     </c>
                     <c ca="center">
                        <p>-1.22</p>
                     </c>
                     <c ca="left">
                        <p>regulation of redox homeostasis</p>
                     </c>
                     <c ca="left">
                        <p>thioredoxin peroxidase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YHL021C</p>
                     </c>
                     <c ca="left">
                        <p>FMP12</p>
                     </c>
                     <c ca="center">
                        <p>-1.21</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YKL151C</p>
                     </c>
                     <c ca="left">
                        <p>YKL151C</p>
                     </c>
                     <c ca="center">
                        <p>-1.15</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YGR008C</p>
                     </c>
                     <c ca="left">
                        <p>STF2</p>
                     </c>
                     <c ca="center">
                        <p>-1.14</p>
                     </c>
                     <c ca="left">
                        <p>ATP synthesis</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YDL130W-A</p>
                     </c>
                     <c ca="left">
                        <p>STF1</p>
                     </c>
                     <c ca="center">
                        <p>-1.09</p>
                     </c>
                     <c ca="left">
                        <p>ATP synthesis</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YCL042W</p>
                     </c>
                     <c ca="left">
                        <p>YCL042W</p>
                     </c>
                     <c ca="center">
                        <p>-1.08</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YOR385W</p>
                     </c>
                     <c ca="left">
                        <p>YOR385W</p>
                     </c>
                     <c ca="center">
                        <p>-1.07</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YIL045W</p>
                     </c>
                     <c ca="left">
                        <p>PIG2</p>
                     </c>
                     <c ca="center">
                        <p>-1.07</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                     <c ca="left">
                        <p>protein phosphatase regulator activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YLR258W</p>
                     </c>
                     <c ca="left">
                        <p>GSY2</p>
                     </c>
                     <c ca="center">
                        <p>-1.06</p>
                     </c>
                     <c ca="left">
                        <p>glycogen metabolism</p>
                     </c>
                     <c ca="left">
                        <p>glycogen (starch) synthase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YMR173W</p>
                     </c>
                     <c ca="left">
                        <p>DDR48</p>
                     </c>
                     <c ca="center">
                        <p>-1.04</p>
                     </c>
                     <c ca="left">
                        <p>DNA repair</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YNL015W</p>
                     </c>
                     <c ca="left">
                        <p>PBI2</p>
                     </c>
                     <c ca="center">
                        <p>-1.03</p>
                     </c>
                     <c ca="left">
                        <p>vacuole fusion (non-autophagic)</p>
                     </c>
                     <c ca="left">
                        <p>endopeptidase inhibitor activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YCL040W</p>
                     </c>
                     <c ca="left">
                        <p>GLK1</p>
                     </c>
                     <c ca="center">
                        <p>-1.02</p>
                     </c>
                     <c ca="left">
                        <p>carbohydrate metabolism</p>
                     </c>
                     <c ca="left">
                        <p>glucokinase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YAR071W</p>
                     </c>
                     <c ca="left">
                        <p>PHO11</p>
                     </c>
                     <c ca="center">
                        <p>1.02</p>
                     </c>
                     <c ca="left">
                        <p>phosphate metabolism</p>
                     </c>
                     <c ca="left">
                        <p>acid phosphatase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YKL099C</p>
                     </c>
                     <c ca="left">
                        <p>UTP11</p>
                     </c>
                     <c ca="center">
                        <p>1.18</p>
                     </c>
                     <c ca="left">
                        <p>processing of 20S pre-rRNA</p>
                     </c>
                     <c ca="left">
                        <p>snoRNA binding activity</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Genes hypothesized to be directly affected by berenil treatment are listed with their expression ratios (log<sub>2 </sub>transformed treated to untreated) and functions, if known.</p>
               </tblfn>
            </tbl>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Indirect hypothesis category of yeast genes shown by microarray analysis to be affected by berenil treatment</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>
                           <b>ORF</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Gene</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Ratio</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Process</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YOL052C-A</p>
                     </c>
                     <c ca="left">
                        <p>DDR2</p>
                     </c>
                     <c ca="center">
                        <p>-2.44</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YGR088W</p>
                     </c>
                     <c ca="left">
                        <p>CTT1</p>
                     </c>
                     <c ca="center">
                        <p>-2.35</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>catalase activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YBR072W</p>
                     </c>
                     <c ca="left">
                        <p>HSP26</p>
                     </c>
                     <c ca="center">
                        <p>-2.32</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>chaperone activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YML100W</p>
                     </c>
                     <c ca="left">
                        <p>TSL1</p>
                     </c>
                     <c ca="center">
                        <p>-2.17</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>enzyme regulator activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YCR021C</p>
                     </c>
                     <c ca="left">
                        <p>HSP30</p>
                     </c>
                     <c ca="center">
                        <p>-2.17</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>heat shock protein activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YFL014W</p>
                     </c>
                     <c ca="left">
                        <p>HSP12</p>
                     </c>
                     <c ca="center">
                        <p>-1.93</p>
                     </c>
                     <c ca="left">
                        <p>response to oxidative stress</p>
                     </c>
                     <c ca="left">
                        <p>heat shock protein activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YMR250W</p>
                     </c>
                     <c ca="left">
                        <p>GAD1</p>
                     </c>
                     <c ca="center">
                        <p>-1.66</p>
                     </c>
                     <c ca="left">
                        <p>response to oxidative stress</p>
                     </c>
                     <c ca="left">
                        <p>glutamate decarboxylase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YMR169C</p>
                     </c>
                     <c ca="left">
                        <p>ALD3</p>
                     </c>
                     <c ca="center">
                        <p>-1.64</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>aldehyde dehydrogenase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YNL160W</p>
                     </c>
                     <c ca="left">
                        <p>YGP1</p>
                     </c>
                     <c ca="center">
                        <p>-1.42</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>unknown</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YDR258C</p>
                     </c>
                     <c ca="left">
                        <p>HSP78</p>
                     </c>
                     <c ca="center">
                        <p>-1.42</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>chaperone activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YDR074W</p>
                     </c>
                     <c ca="left">
                        <p>TPS2</p>
                     </c>
                     <c ca="center">
                        <p>-1.28</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>trehalose phosphatase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YKL026C</p>
                     </c>
                     <c ca="left">
                        <p>GPX1</p>
                     </c>
                     <c ca="center">
                        <p>-1.24</p>
                     </c>
                     <c ca="left">
                        <p>response to oxidative stress</p>
                     </c>
                     <c ca="left">
                        <p>glutathione peroxidase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YLL026W</p>
                     </c>
                     <c ca="left">
                        <p>HSP104</p>
                     </c>
                     <c ca="center">
                        <p>-1.18</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>heat shock protein activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YLL039C</p>
                     </c>
                     <c ca="left">
                        <p>UBI4</p>
                     </c>
                     <c ca="center">
                        <p>-1.18</p>
                     </c>
                     <c ca="left">
                        <p>response to stress</p>
                     </c>
                     <c ca="left">
                        <p>protein tagging activity</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Genes hypothesized to be indirectly affected by berenil treatment are listed with their expression ratios (log<sub>2 </sub>transformed treated to untreated) and functions, if known.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Sequences found upstream of affected genes</p>
            </st>
            <p>We sought to analyze the upstream regions of the hypothesized direct and indirect categories of genes for the occurrence of sequence features and to consider the results in light of <it>in vitro </it>berenil binding studies. In this way, we can address the validity of our categorization of the affected genes and extend knowledge of berenil binding preferences to the environment of cells. As reviewed above, studies conducted <it>in vitro </it>have shown that berenil binds 5&#8211;6 nucleotide A/T-rich tracts that tend toward heteropolymeric (alternating A and T) character <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. If these binding preferences extend to the cellular context, then the upstream sequences of the 40 yeast genes hypothesized to be directly affected by berenil and listed in Table <tblr tid="T2">2</tblr> are expected to contain 5&#8211;6 nucleotide sequence elements that are A/T-rich and heteropolymeric. For the 14 genes hypothesized to be indirectly affected and listed in Table <tblr tid="T3">3</tblr>, the signal of berenil binding sequences should fall to the background level found in the upstream sequences of all yeast genes. Our challenge was to find ways to analyze the sequences in order to uncover any existing sequence patterns. We reasoned that if the effect of berenil on transcription levels is due to binding sites, then those sites would be found upstream of directly affected genes as 5-mer and 6-mer sequences. In order to reduce the background noise, we limited our search to the 200 nt upstream of the translation start site and to the sense strand only. We developed two measures for determining whether the occurrence rate of a given sequence element is unusually high in the upstream regions of genes. The first is the difference between the percentage of affected gene upstream regions that contain a sequence and the percentage of unaffected gene upstream regions that contain it. The second is the ratio of the number of occurrences of a sequence in the affected gene upstream regions to that in the unaffected gene upstream regions.</p>
            <p>For use as a control group, we assembled a list of 56 genes that remained reliably unaffected by berenil treatment in the course of our microarray experiments. Sense strand sequences from the 200 bp upstream of these genes, the 40 directly affected genes, and the 14 indirectly affected genes were measured for the occurrence of all possible 5-mer and 6-mer sequences. The percentage of genes with an occurrence of each sequence was determined for each of the three categories, and the difference in percentage was calculated between each of the two affected categories and the unaffected category.</p>
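            <p>The difference criterion described above can be illustrated with a minimal Python sketch. It is not the software used in this study; the function names and the short toy sequences are hypothetical and stand in for the real 200 bp sense strand upstream regions. The sketch assumes that a region counts once for a sequence no matter how many times the sequence occurs in it.</p>

```python
from itertools import product

def all_kmers(k):
    """Return every DNA word of length k over the alphabet A, C, G, T."""
    return ["".join(p) for p in product("ACGT", repeat=k)]

def percent_containing(regions, kmer):
    """Percentage of upstream regions that contain kmer at least once."""
    hits = sum(1 for seq in regions if kmer in seq.upper())
    return 100.0 * hits / len(regions)

def difference_ranking(affected, unaffected, k, top=10):
    """Rank all k-mers by percent(affected) minus percent(unaffected).

    Returns a list of (kmer, percent_affected, difference) tuples,
    largest difference first.
    """
    scored = []
    for kmer in all_kmers(k):
        pa = percent_containing(affected, kmer)
        pu = percent_containing(unaffected, kmer)
        scored.append((kmer, pa, pa - pu))
    scored.sort(key=lambda row: row[2], reverse=True)
    return scored[:top]

# Hypothetical toy regions (assumption: real input would be 200 bp each).
affected = ["GGAATAACC", "CCAATAAGG", "TTTGGGCCC", "AATAATATA"]
unaffected = ["GGGCCCGGG", "CCAATAACC", "GCGCGCGCG", "CCCCGGGGC", "ACGTACGTA"]

best = difference_ranking(affected, unaffected, k=5, top=1)[0]
print(best)
```

            <p>In this toy input, AATAA is present in three of the four affected regions and in one of the five unaffected regions, so it ranks first with a difference of 55 percentage points. Applying the same function with k set to 6 would give the corresponding 6-mer ranking.</p>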
            <p>Table <tblr tid="T4">4</tblr> shows the top ten 5-mer and 6-mer sequences from the direct category according to this criterion. For example, AATAA occurred upstream of 71% of the directly affected genes, but upstream of only 40% of the unaffected ones, for a difference of 31 percentage points. The sequences listed occur in an average of 61% of the direct gene upstream regions. All of them occur more frequently upstream of direct category genes than of the unaffected genes; the average occurrence is 23 percentage points higher. We also measured the number of occurrences of each sequence in the directly affected, indirectly affected, and unaffected gene upstream regions. The ratio of the number of occurrences in the affected category to that in the unaffected one was calculated and the sequences were ranked according to this ratio, with the top ten sequences listed in Table <tblr tid="T5">5</tblr>. For example, the sequence ATAAG occurs 30 times in the 40 affected gene upstream regions, a rate that is 2.3 times that found in the unaffected gene regions. The sequences listed occur an average of 28 times in the 40 directly affected genes, an average of 2.1 times the occurrence rate found in the 56 unaffected gene regions.</p>
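            <p>The ratio criterion can be sketched in the same way. The Python below is an illustration only, not the authors' code. It assumes that overlapping occurrences are counted separately and that counts are normalized by the number of regions in each category before the ratio is taken, since the two categories differ in size. The function names and toy sequences are hypothetical.</p>

```python
def count_occurrences(regions, kmer):
    """Total occurrences of kmer across all regions, overlaps included."""
    total = 0
    k = len(kmer)
    for seq in regions:
        seq = seq.upper()
        # Slide a window of width k along the region.
        for i in range(len(seq) - k + 1):
            if seq[i:i + k] == kmer:
                total += 1
    return total

def rate_ratio(affected, unaffected, kmer):
    """Per-region occurrence rate in affected divided by that in unaffected.

    Returns None when the k-mer never occurs in the unaffected regions,
    because the ratio is undefined in that case.
    """
    count_unaffected = count_occurrences(unaffected, kmer)
    if count_unaffected == 0:
        return None
    rate_affected = count_occurrences(affected, kmer) / len(affected)
    rate_unaffected = count_unaffected / len(unaffected)
    return rate_affected / rate_unaffected

# Hypothetical toy regions (assumption: real input would be 200 bp each).
affected_toy = ["ATAAGATAAG", "CCATAAGCC"]
unaffected_toy = ["ATAAGCCCCC", "GGGGGGGGGG", "CCCCCCCCCC", "TTTTTTTTTT"]

ratio = rate_ratio(affected_toy, unaffected_toy, "ATAAG")
print(ratio)
```

            <p>Here ATAAG occurs three times in two affected regions (1.5 per region) and once in four unaffected regions (0.25 per region), giving a ratio of 6.0. Sequences absent from the unaffected regions return no ratio, and how such cases should be ranked is left open in this sketch.</p>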
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Difference criterion sequences in direct gene category</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>
                           <b>5-mer</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Directly Affected</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Difference</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>6-mer</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Directly Affected</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Difference</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>aataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>71%</p>
                     </c>
                     <c ca="center">
                        <p>31%</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>tatata</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>61%</p>
                     </c>
                     <c ca="center">
                        <p>33%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>ataag</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>atataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>32</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>agaat</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>aaaaga</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>45</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>aacaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>aaataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>45</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>aaata</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>aaaata</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>atata</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>tataag</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>tataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>taataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>32</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>cataa</p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>gaaata</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>gtaaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>47</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>aaagaa</p>
                     </c>
                     <c ca="center">
                        <p>47</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>aaaga</p>
                     </c>
                     <c ca="center">
                        <p>66</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>aataat</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Sequences found to be most overrepresented in the upstream regions of genes hypothesized to be directly affected by berenil compared to unaffected genes. The percentage of affected genes and the difference in percentage between affected and unaffected genes having each sequence are listed. Bolded sequences are shared with Table 5.</p>
               </tblfn>
            </tbl>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Ratio criterion sequences in direct gene category</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>
                           <b>5-mer</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Directly Affected</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Ratio</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>6-mer</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Directly Affected</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Ratio</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>ataag</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="center">
                        <p>2.3</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>tataag</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>2.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>aacaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c ca="center">
                        <p>2.1</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>atataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>2.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>aataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                     <c ca="center">
                        <p>2.1</p>
                     </c>
                     <c ca="center">
                        <p>aatata</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>2.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>gtaaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                     <c ca="center">
                        <p>2.1</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>gaaata</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>2.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>taata</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>1.8</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>taataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>2.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>acata</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                     <c ca="center">
                        <p>1.8</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>tatata</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>2.2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>agaat</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>1.8</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aataat</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>2.2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>atata</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>56</p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aaataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>2.1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>tataa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                     <c ca="center">
                        <p>ataata</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>2.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>tatat</p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                     <c ca="center">
                        <p>1.5</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aaaaga</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>2.0</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Sequences ranked according to the number of occurrences in upstream regions of genes hypothesized to be directly affected compared to unaffected genes. The number of occurrences in affected genes and the ratio of occurrences in affected genes to unaffected genes are listed for each sequence. Bolded sequences are shared with Table 4.</p>
               </tblfn>
            </tbl>
            <p>Table <tblr tid="T6">6</tblr> shows the top ten sequences according to the difference criterion for the indirect gene category. For example, ACCTC occurs in 50% of the indirect gene regions but only 2% of the unaffected ones, for a difference of 48%. The sequences listed occur in an average of 50% of the 14 genes hypothesized to be indirectly affected, with an average difference of 35% between the affected and unaffected gene regions. Table <tblr tid="T7">7</tblr> shows the results of ranking sequences in the upstream regions of genes in the indirect category using the ratio criterion. For example, the sequence AATCT occurs nine times in the indirect sequence regions, a rate that is 3.3 times higher than that found for the unaffected genes. The sequences listed occur an average of 7.8 times in the upstream regions of the set of 14 indirectly affected genes, a rate that is on average 4.4 times higher than that found for the 56 unaffected genes.</p>
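            <p>The two ranking criteria can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the overlapping single-strand counting of k-mers, and the per-gene normalization of the ratio are assumptions made for illustration, and the toy regions are invented. Sequences that never occur in the unaffected set are simply skipped by the ratio.</p>

```python
from collections import Counter


def kmers(seq, k):
    """Return every overlapping k-mer in seq, lower-cased."""
    seq = seq.lower()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def difference_scores(affected, unaffected, k):
    """Percent of affected regions containing each k-mer minus the
    percent of unaffected regions containing it."""
    def percent_with(regions):
        counts = Counter()
        for region in regions:
            # set() so each region contributes at most once per k-mer
            counts.update(set(kmers(region, k)))
        return {kmer: 100.0 * n / len(regions) for kmer, n in counts.items()}

    pa = percent_with(affected)
    pu = percent_with(unaffected)
    return {kmer: pa[kmer] - pu.get(kmer, 0.0) for kmer in pa}


def ratio_scores(affected, unaffected, k):
    """Occurrences per gene in affected regions divided by occurrences
    per gene in unaffected regions (per-gene rate is an assumption)."""
    def rate(regions):
        counts = Counter()
        for region in regions:
            counts.update(kmers(region, k))
        return {kmer: n / len(regions) for kmer, n in counts.items()}

    ra = rate(affected)
    ru = rate(unaffected)
    # k-mers absent from the unaffected set have no defined ratio here
    return {kmer: ra[kmer] / ru[kmer] for kmer in ra if kmer in ru}


# Invented toy upstream regions, for illustration only.
toy_affected = ["aatctaatct", "ggaatctgg"]
toy_unaffected = ["aatctccccc", "cccccccccc", "gggggggggg", "tttttttttt"]

print(difference_scores(toy_affected, toy_unaffected, 5)["aatct"])  # 75.0
print(ratio_scores(toy_affected, toy_unaffected, 5)["aatct"])       # 6.0
```

            <p>In the toy data, aatct is present in both affected regions (100%) and in one of four unaffected regions (25%), giving a difference of 75%. It occurs three times in two affected regions and once in four unaffected regions, giving a per-gene rate ratio of 6.0.</p>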
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Difference criterion sequences in indirect gene category</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>
                           <b>5-mer</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Indirectly Affected</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Difference</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>6-mer</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Indirectly Affected</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Difference</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>acctc</p>
                     </c>
                     <c ca="center">
                        <p>50%</p>
                     </c>
                     <c ca="center">
                        <p>48%</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>ctgaaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>42%</p>
                     </c>
                     <c ca="center">
                        <p>38%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>aatct</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>taagga</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>38</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>ctaat</p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="center">
                        <p>atataa</p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>ctcac</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aaacaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>32</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>cttat</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aaacca</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>31</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>gatta</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aataca</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>31</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>aacaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>67</p>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>tctttc</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>31</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>ataca</p>
                     </c>
                     <c ca="center">
                        <p>67</p>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c ca="center">
                        <p>ataaat</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>aaagc</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>acacat</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>acaca</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>ctcacc</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Sequences found to be most overrepresented in the upstream regions of genes hypothesized to be indirectly affected by berenil compared to unaffected genes. The percentage of affected genes and the difference in percentage between affected and unaffected genes having each sequence are listed. Bolded sequences are shared with Table 7.</p>
               </tblfn>
            </tbl>
            <tbl id="T7">
               <title>
                  <p>Table 7</p>
               </title>
               <caption>
                  <p>Ratio criterion sequences in indirect gene category</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>
                           <b>5-mer</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Indirectly Affected</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Ratio</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>6-mer</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Indirectly Affected</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Ratio</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>aatct</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>3.3</p>
                     </c>
                     <c ca="center">
                        <p>cattct</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>20.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>aacaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>2.7</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>ctgaaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>10.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>caaca</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>2.5</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>taagga</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>10.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>caata</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>2.3</p>
                     </c>
                     <c ca="center">
                        <p>aacaac</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>5.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>aatac</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>2.1</p>
                     </c>
                     <c ca="center">
                        <p>acaaca</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>5.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>tctct</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>2.1</p>
                     </c>
                     <c ca="center">
                        <p>tataag</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>3.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>acaca</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>2.0</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aaacca</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>3.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>tataa</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>2.0</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aataca</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>3.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>ataag</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>1.8</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>tctttc</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>3.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>gaata</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>1.6</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>aaacaa</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>3.0</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Sequences ranked according to the number of occurrences in upstream regions of genes hypothesized to be indirectly affected compared to unaffected genes. The number of occurrences in affected genes and the ratio of occurrences in affected genes to unaffected genes are listed for each sequence. Bolded sequences are shared with Table 6.</p>
               </tblfn>
            </tbl>
            <p>The difference and ratio criteria for choosing overrepresented sequences yield more similar results for the direct category than for the indirect category. Of the 20 sequences from the direct category listed in Table <tblr tid="T4">4</tblr>, 15 are found in Table <tblr tid="T5">5</tblr>, shown in bold. Only nine indirect category sequences are shared between Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr>. The extent to which the difference and ratio lists share sequences can be attributed to three causes. First, the two measures are related: the number of occurrences of a given sequence affects the number of genes with which it is associated, and vice versa. Second, an average of one shared sequence is expected at random. Third, shared sequences may occur because they reflect sequence characteristics of the data set. Since the first two of these causes are not expected to differ between the direct and indirect categories, the higher level of shared sequences for the direct category is telling. It is likely to arise from characteristics of the upstream regions of the direct category genes. To investigate these characteristics, we formed sets of unique 5- and 6-mers for the direct and indirect categories that included each shared sequence only once and conducted several types of sequence analysis.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence analysis</p>
            </st>
            <p>Several observations about the sequences presented in Tables <tblr tid="T4">4</tblr>, <tblr tid="T5">5</tblr>, <tblr tid="T6">6</tblr>, <tblr tid="T7">7</tblr> bear on our hypotheses about genes that are directly or indirectly affected by berenil and on the extent to which the rules for drug binding <it>in vitro </it>can be extended to the cellular context. These observations concern the A+T content and the extent of heteropolymeric character of the listed sequences and are outlined in the following sections.</p>
            <sec>
               <st>
                  <p>1. Overall A/T Richness in Direct Category Sequences</p>
               </st>
               <p>The most obvious characteristic of the 5- and 6-mer sequences found to be overrepresented upstream of the direct category genes is A/T richness. The set of unique 5-mers listed for the direct category in Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr> is 89% A+T while the unique 6-mers listed are 94% A+T. For comparison, the A+T content of the 200 nt upstream regions of 5869 yeast genes averages 65%. The results of a Z-test showed a high level of significance (p &lt; .0001) for the A+T content of both the 5- and 6-mers compared to the set of all yeast genes. The 5- and 6-mer sequences from the indirect category in Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr> average 72% and 73% A+T, respectively. These values are not significantly different from all yeast genes, with p-values of 0.19 and 0.14. These observations support the conclusion that A/T richness is a characteristic of the direct category sequences much more than it is of the indirect category sequences.</p>
            </sec>
            <sec>
               <st>
                  <p>2. A/T Richness of Individual Direct Category Sequences</p>
               </st>
               <p>Another observation is that the A+T content levels of individual members of the direct category lists are uniformly high. In Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr>, 100% of the 5- and 6-mers are at least 80% A+T. In the upstream regions of 5869 yeast genes, by comparison, only 35% of 5-mers and 19% of 6-mers reach this level. By contrast, of the indirect category sequences of Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr>, only 65% of the 17 unique 5-mers and 43% of the 14 unique 6-mers are at least 80% A+T. The high A+T content of individual 5- and 6-mers from the direct category means that none of them contains more than one C or G nucleotide. Interestingly, each time a C or G occurs, it is either at the end of the sequence or it disrupts a 2&#8211;5 nt homopolymeric A stretch.</p>
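               <p>The two measures used in this section, the A+T fraction of a single k-mer and the share of k-mers in a list that are at least 80% A+T, can be sketched as follows. The function names are hypothetical, and the input list is only a handful of sequences taken from Table 7 for illustration; it is not one of the unique sets analyzed in the text.</p>
               <p>
```python
def at_fraction(seq):
    """Return the fraction of A and T nucleotides in a sequence."""
    seq = seq.upper()
    return sum(base in "AT" for base in seq) / len(seq)


def share_at_rich(kmers, threshold=0.8):
    """Return the share of k-mers whose A+T fraction meets the threshold."""
    rich = [k for k in kmers if at_fraction(k) >= threshold]
    return len(rich) / len(kmers)


# Illustrative input only: a few sequences that appear in Table 7.
example = ["tataag", "acaca", "aaacca", "tataa", "tctttc"]
for kmer in example:
    print(kmer, round(at_fraction(kmer), 2))
print("share at least 80% A+T:", share_at_rich(example))
```
               </p>
               <p>With a threshold of 80%, a 5-mer may contain at most one C or G and a 6-mer likewise, which is why the threshold and the "no more than one C or G" statement coincide for these lengths.</p>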
            </sec>
            <sec>
               <st>
                  <p>3. High Rate of AT and TA Dinucleotides in Direct Category</p>
               </st>
               <p>The occurrence of heteropolymeric AT and TA dinucleotides is unusually high in the direct category compared to the indirect category. Among the 25 unique 5- and 6-mers in Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr> from the direct category, 52% of the dinucleotides are AT and TA. Based on the number of As and Ts in the sequences, a level of 33% is expected by chance. Of the dinucleotides in the 31 unique indirect category sequences of Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr>, 25% are AT and TA, while 23% are expected by the number of As and Ts. For the analogous region upstream of 5869 yeast genes, 18% of dinucleotides are AT or TA, with an expected value of 21% based on As and Ts.</p>
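               <p>One way to obtain an expected AT plus TA dinucleotide rate from base composition is to assume that adjacent positions are independent, which gives 2 x p(A) x p(T). The sketch below uses that assumption, which is ours and not a method stated in the text, together with the further assumption that the 65% A+T of yeast upstream regions is split evenly between A and T. Under these assumptions the result is consistent with the 21% quoted for yeast genes. The function names are hypothetical.</p>
               <p>
```python
def expected_at_ta(p_a, p_t):
    """Expected AT plus TA dinucleotide fraction if adjacent bases are independent."""
    return 2 * p_a * p_t


def observed_at_ta(seqs):
    """Observed fraction of overlapping dinucleotides that are AT or TA."""
    hits = 0
    total = 0
    for seq in seqs:
        seq = seq.upper()
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] in ("AT", "TA")
    return hits / total


# Assumption: 65% A+T split evenly, so p(A) = p(T) = 0.325.
print(round(expected_at_ta(0.325, 0.325), 2))

# Fully alternating sequences contain only AT and TA dinucleotides.
print(observed_at_ta(["ATATA", "TATAT"]))
```
               </p>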
            </sec>
            <sec>
               <st>
                  <p>4. Occurrence of Completely Heteropolymeric A/T Sequences</p>
               </st>
               <p>The completely A/T heteropolymeric sequences ATATA, TATAT, ATATAT, and TATATA occur at high rates in the direct category. Of eight possible sequences that are 100% alternating As and Ts that could have occurred in Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr>, five appear. Based on A+T content, only 1.1 occurrences would be expected at random. By contrast, none of these sequences occurs in the indirect category lists of Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr>. This observation is best explained by lower A+T content; an average of 0.34 occurrences would be expected based on A+T content.</p>
            </sec>
            <sec>
               <st>
                  <p>5. Direct Category Sequences Enriched for Heteropolymeric A/T Tracts</p>
               </st>
               <p>There is a high rate of occurrence of 2&#8211;6 nt heteropolymeric A/T tracts among the set of unique 5- and 6-mers from the direct gene category. To assess the statistical significance of this observation, we compared the rate of occurrence of heteropolymeric A/T tracts for both the direct and indirect categories to that found in the 200 nt upstream regions of 5869 yeast genes, and the results are listed in Table <tblr tid="T8">8</tblr>. Heteropolymeric tracts of 2&#8211;5 nt in length occur an average of 7.9 times more often in the unique direct category 5-mers than in the set of 5869 yeast genes and 8.6 times more often in the unique direct category 6-mers. Among the unique indirect category 5- and 6-mers, the heteropolymeric tracts occur at average rates of only 1.3 and 1.9 times higher than in the 5869 yeast genes. We also conducted a one-sided Z-test of the rates of occurrence of the heteropolymeric A/T tracts in both the direct and indirect categories compared to the set of all yeast genes. Although the significance levels are inflated by the lack of independence in overlapping heteropolymeric A/T tracts, this affects both the direct and indirect categories equally, so the resulting p-values can be fairly compared. As shown in Table <tblr tid="T8">8</tblr>, p-values of less than 0.0001 indicate very high levels of statistical significance for each of the nine direct category comparisons. These results indicate that heteropolymeric A/T tracts of 2&#8211;6 nt occur at a higher rate in the direct category sequences than in yeast genes in general. Enrichment of heteropolymeric A/T tracts in the indirect category compared to the 5869 yeast genes is far less significant, and the significance levels decrease as the tract length increases.</p>
               <tbl id="T8">
                  <title>
                     <p>Table 8</p>
                  </title>
                  <caption>
                     <p>Analysis of A/T heteropolymeric sequence occurrences</p>
                  </caption>
                  <tblbdy cols="6">
                     <r>
                        <c ca="left">
                           <p>
                              <b>A/T Heteropolymeric Sequences</b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c cspan="2" ca="center">
                           <p>
                              <b>5-mers</b>
                           </p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>
                              <b>6-mers</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="right">
                           <p>
                              <b>Direct</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>Indirect</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>Direct</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>Indirect</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>AT, TA</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>observed</p>
                        </c>
                        <c ca="right">
                           <p>0.500</p>
                        </c>
                        <c ca="right">
                           <p>0.294</p>
                        </c>
                        <c ca="right">
                           <p>0.533</p>
                        </c>
                        <c ca="right">
                           <p>0.214</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="right">
                           <p>0.180</p>
                        </c>
                        <c ca="right">
                           <p>0.180</p>
                        </c>
                        <c ca="right">
                           <p>0.180</p>
                        </c>
                        <c ca="right">
                           <p>0.180</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>p-value</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.007</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.228</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>ATA, TAT</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>observed</p>
                        </c>
                        <c ca="right">
                           <p>0.359</p>
                        </c>
                        <c ca="right">
                           <p>0.157</p>
                        </c>
                        <c ca="right">
                           <p>0.396</p>
                        </c>
                        <c ca="right">
                           <p>0.125</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="right">
                           <p>0.065</p>
                        </c>
                        <c ca="right">
                           <p>0.065</p>
                        </c>
                        <c ca="right">
                           <p>0.065</p>
                        </c>
                        <c ca="right">
                           <p>0.065</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>p-value</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.004</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.033</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>ATAT, TATA</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>observed</p>
                        </c>
                        <c ca="right">
                           <p>0.192</p>
                        </c>
                        <c ca="right">
                           <p>0.029</p>
                        </c>
                        <c ca="right">
                           <p>0.222</p>
                        </c>
                        <c ca="right">
                           <p>0.071</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="right">
                           <p>0.024</p>
                        </c>
                        <c ca="right">
                           <p>0.024</p>
                        </c>
                        <c ca="right">
                           <p>0.024</p>
                        </c>
                        <c ca="right">
                           <p>0.024</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>p-value</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.418</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.022</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>ATATA, TATAT</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>observed</p>
                        </c>
                        <c ca="right">
                           <p>0.154</p>
                        </c>
                        <c ca="right">
                           <p>0.000</p>
                        </c>
                        <c ca="right">
                           <p>0.167</p>
                        </c>
                        <c ca="right">
                           <p>0.036</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="right">
                           <p>0.010</p>
                        </c>
                        <c ca="right">
                           <p>0.010</p>
                        </c>
                        <c ca="right">
                           <p>0.010</p>
                        </c>
                        <c ca="right">
                           <p>0.010</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>p-value</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.662</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.090</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>ATATAT, TATATA</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>observed</p>
                        </c>
                        <c ca="right">
                           <p>-</p>
                        </c>
                        <c ca="right">
                           <p>-</p>
                        </c>
                        <c ca="right">
                           <p>0.083</p>
                        </c>
                        <c ca="right">
                           <p>0.000</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="right">
                           <p>-</p>
                        </c>
                        <c ca="right">
                           <p>-</p>
                        </c>
                        <c ca="right">
                           <p>0.010</p>
                        </c>
                        <c ca="right">
                           <p>0.010</p>
                        </c>
                     </r>
                     <r>
                        <c>
                           <p/>
                        </c>
                        <c ca="left">
                           <p>p-value</p>
                        </c>
                        <c ca="right">
                           <p>-</p>
                        </c>
                        <c ca="right">
                           <p>-</p>
                        </c>
                        <c ca="right">
                           <p>
                              <b>&lt; .0001</b>
                           </p>
                        </c>
                        <c ca="right">
                           <p>0.579</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>The rate of occurrence (observed) of 2&#8211;6 nt A/T heteropolymeric sequences in the unique 5-mer and 6-mer sequences listed in Tables 4&#8211;7 for the direct and indirect gene categories was compared to the rate of occurrence (yeast) in the 200 nt upstream region of 5869 yeast genes using a one-sided Z-test. Highly significant p-values are shown in bold.</p>
                  </tblfn>
               </tbl>
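               <p>The average fold enrichments quoted in the text can be approximated from the rounded entries of Table 8 by dividing each observed rate by the corresponding yeast rate and averaging over the tract lengths available for each k-mer size. The sketch below is a reading of the table, not the authors' own calculation; the dictionary labels and the function name are hypothetical, and small differences from the quoted averages may arise because the table values are rounded.</p>
               <p>
```python
from statistics import mean

# Rates copied from Table 8; tract lengths run from 2 nt upward.
yeast = [0.180, 0.065, 0.024, 0.010, 0.010]
observed = {
    "5-mer direct": [0.500, 0.359, 0.192, 0.154],
    "5-mer indirect": [0.294, 0.157, 0.029, 0.000],
    "6-mer direct": [0.533, 0.396, 0.222, 0.167, 0.083],
    "6-mer indirect": [0.214, 0.125, 0.071, 0.036, 0.000],
}


def mean_fold(rates, background):
    """Average the observed/background ratio over the available tract lengths."""
    return mean(obs / bg for obs, bg in zip(rates, background))


for label, rates in observed.items():
    print(label, round(mean_fold(rates, yeast), 1))
```
               </p>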
            </sec>
            <sec>
               <st>
                  <p>6. Basis for Heteropolymeric A/T Tracts</p>
               </st>
               <p>Clearly, the occurrence of heteropolymeric A/T tracts is higher in the upstream regions of the direct category genes than in the corresponding regions of the indirect category genes or of yeast genes in general. But to what extent can this be attributed to the A/T richness of these regions or to the high rate of occurrence of A/T tracts of any type? We sought to address these questions by conducting an analysis of the rate of occurrence of heteropolymeric sequences compared to the rate expected by either A+T content or the occurrence of 100% A/T tracts. We first tabulated the number of occurrences of 3&#8211;6 nt A/T heteropolymeric tracts in each of the eight lists of Tables <tblr tid="T4">4</tblr>, <tblr tid="T5">5</tblr>, <tblr tid="T6">6</tblr>, <tblr tid="T7">7</tblr>. We then used two different means to establish an expected number of occurrences of these sequences. One was simply A+T content, with the consequence that higher content results in more expected occurrences of the A/T heteropolymeric tracts. The other involved using the number of 100% A/T tracts that occurs in a given list to determine the expected number of A/T heteropolymeric tracts. For 3 nt A/T tracts, two out of eight are expected at random to be ATA or TAT. For 4 nt tracts, the expected rate is two of 16, for 5 nt two of 32, and for 6 nt two of 64. For the direct sequences listed in Table <tblr tid="T4">4</tblr>, A/T heteropolymeric tracts of all sizes occur at rates greater than expected by A+T content, with an average ratio of observed to expected of 3.3. The same is true for the direct category sequences in Table <tblr tid="T5">5</tblr>, with an average ratio of 4.2. However, A/T heteropolymeric tracts in the sequences from the indirect lists in Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr> occur near the expected frequencies, with ratios of observed to expected of 1.8 and 0.6, respectively. 
Using the expected values from the occurrence of 100% A/T tracts, the sequences of Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr> still display unusually high occurrences of A/T heteropolymeric tracts, with ratios of observed to expected of 3.0 and 3.6, respectively. However, the indirect sequence category has the expected sequence properties since the ratio of observed to expected for Table <tblr tid="T6">6</tblr> is 1.2 and the ratio for Table <tblr tid="T7">7</tblr> is 0.8. We also conducted a Chi-squared analysis of the occurrence of 3&#8211;6 nt A/T heteropolymeric sequences in each of the lists from Tables <tblr tid="T4">4</tblr>, <tblr tid="T5">5</tblr>, <tblr tid="T6">6</tblr>, <tblr tid="T7">7</tblr>. Strikingly, for the unique direct category sequences there is a high degree of statistical significance for the occurrence of A/T heteropolymeric tracts based on A+T content in every one of the seven combinations of tract length and 5-mer versus 6-mer (p-values range from &lt; .0001 to .018). All seven direct category combinations also yielded a high degree of significance when the expected values were based on A/T tracts (p-values from &lt; .0001 to .013). Equally striking is the result that for the unique indirect category sequences, none of the fourteen analyses showed statistical significance (p-values from .13 to .89), indicating that A/T heteropolymeric tracts occur at expected frequencies.</p>
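               <p>The second expectation described above follows from counting: of the 2^n possible tracts of length n made only of A and T, exactly two strictly alternate, so the expected fraction is 2/2^n. A minimal sketch of that arithmetic and of the observed-to-expected ratio is given below. The function names are hypothetical, and the counts in the final line are invented for illustration and are not data from this study.</p>
               <p>
```python
def expected_alternating_fraction(n):
    """Fraction of all-A/T tracts of length n that strictly alternate A and T."""
    return 2 / 2 ** n


def observed_to_expected(n_alternating, n_at_tracts, n):
    """Ratio of observed alternating tracts to the number expected at random."""
    expected = n_at_tracts * expected_alternating_fraction(n)
    return n_alternating / expected


# Two of 8, two of 16, two of 32 and two of 64 for 3 to 6 nt tracts.
for n in (3, 4, 5, 6):
    print(n, expected_alternating_fraction(n))

# Hypothetical counts: 6 alternating tracts among 16 all-A/T tracts of 3 nt.
print(observed_to_expected(6, 16, 3))
```
               </p>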
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The results of our microarray experiments and associated sequence analysis provide insight into the sequence patterns required for binding of the minor groove binder berenil in the environment of yeast cells. They support the conclusion that the upstream regions of the genes hypothesized to be directly affected by berenil contain sequence features that are in good accord with those discovered by <it>in vitro </it>berenil binding studies. This was established by observation of sequence characteristics of the upstream regions of the direct category genes: high A+T content, A/T richness of the most frequently found 5- and 6-mers, and a high rate of occurrence of 2&#8211;6 nt heteropolymeric A/T sequences. By contrast, these sequence features were not apparent in the upstream regions of the genes hypothesized to be indirectly affected by berenil or of those shown to be unaffected by the treatment.</p>
         <p>Our hypotheses about which genes were directly affected by berenil and which were indirectly affected were based solely on information about their functions. However, each of the ways that we analyzed the sequences found upstream of the genes supported the conclusion that the direct category upstream regions contained sequence characteristics found by <it>in vitro </it>binding studies while the indirect category regions did not. These hypotheses would benefit from further experimentation on individual genes and on the mechanism by which direct and indirect effects are manifested.</p>
         <p>Our observation that 52 of the 54 affected yeast genes were negatively affected by berenil may have important implications for the mechanism of action of the drug, directing us to several possible mechanisms of drug binding that can be expressed as testable hypotheses. One hypothesis is that the drug interferes with the binding of transcription factors to DNA upstream of affected genes. This hypothesis is supported by several studies. For example, MGBDs have been shown to compete with the transcription factor NF-Y for binding to the DNA minor groove <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and to both prevent and disrupt binding of TBP to it <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Of the 25 unique sequences listed in Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr> from the upstream regions of the direct category genes, five are exact matches to the TBP consensus binding site of TATAWAW <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. There are also three matches to the TBP consensus among the indirect category sequences of Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr>. TBP was found to bind the TATA box sequence TATATAAA from the yeast CYC1 gene <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Seven exact matches to this sequence are found in all four direct category lists while only two are found in the indirect lists. HAP1 is a zinc finger transcription factor of the Zn(2)-Cys(6) binuclear cluster domain type that is known to make minor groove contact with the sequence GCTAATAGCGATAATAGCGAGGG <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. This sequence includes two exact matches to the unique direct sequences listed in Tables <tblr tid="T4">4</tblr> and <tblr tid="T5">5</tblr> and only one match to the unique indirect sequences in Tables <tblr tid="T6">6</tblr> and <tblr tid="T7">7</tblr>. It is also found in the upstream region of CYC7, a gene whose expression level was found to be lowered by berenil in our study.</p>
         <p>A second hypothesis is that berenil is able to affect the initiation of transcription by altering the conformation of DNA in promoter sequences. Evidence points to the ability of MGBDs to bind to the narrowed minor groove of A/T tracts spaced at a periodicity that produces intrinsic DNA curvature; uncurving of naked DNA by MGBDs has also been demonstrated <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. In order to make predictions of DNA curvature for the upstream regions of our affected genes, we used bend.it <sup>&#174; </sup><abbrgrp><abbr bid="B29">29</abbr></abbrgrp> to calculate predicted curvature in 500 bp regions upstream of the 54 genes shown by our study to be affected by berenil <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Regions of strongly predicted curvature were notably absent; the ma