<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2005-6-11-r96</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Method</dochead>
		<bibl>
			<title>
				<p>Chipper: discovering transcription-factor targets from chromatin immunoprecipitation microarrays using variance stabilization</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Gibbons</snm>
					<mi>D</mi>
					<fnm>Francis</fnm>
					<insr iid="I1"/>
					<email>fgibbons@hms.harvard.edu</email>
				</au>
				<au id="A2">
					<snm>Proft</snm>
					<fnm>Markus</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
					<email>mproft@ibmcp.upv.es</email>
				</au>
				<au id="A3">
					<snm>Struhl</snm>
					<fnm>Kevin</fnm>
					<insr iid="I1"/>
					<email>kevin@hms.harvard.edu</email>
				</au>
				<au id="A4" ca="yes">
					<snm>Roth</snm>
					<mi>P</mi>
					<fnm>Frederick</fnm>
					<insr iid="I1"/>
					<email>fritz_roth@hms.harvard.edu</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Longwood Avenue, Boston, MA 02115, USA</p>
				</ins>
				<ins id="I2">
					<p>Instituto de Biolog&#237;a Molecular y Celular de Plantas (IBMCP), Universidad Polit&#233;cnica de Valencia, Camino de Vera s/n, 46022 Valencia, Spain</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2005</pubdate>
			<volume>6</volume>
			<issue>11</issue>
			<fpage>R96</fpage>
			<url>http://genomebiology.com/2005/6/11/R96</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">16277751</pubid><pubid idtype="doi">10.1186/gb-2005-6-11-r96</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>23</day>
					<month>3</month>
					<year>2005</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>1</day>
					<month>8</month>
					<year>2005</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>30</day>
					<month>9</month>
					<year>2005</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>1</day>
					<month>11</month>
					<year>2005</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2005</year>
			<collab>Gibbons et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>Discovering transcription-factor targets from chromatin immunoprecipitation microarrays</p>
		</shorttitle>
		<shortabs>
			<p>A new method, implemented in software as 'Chipper', is described that 
allows genome-wide determination of protein-DNA binding sites from 
chromatin immunoprecipitation microarrays.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>Chromatin immunoprecipitation combined with microarray technology (Chip<sup>2</sup>) allows genome-wide determination of protein-DNA binding sites. The current standard method for analyzing Chip<sup>2 </sup>data requires additional control experiments that are subject to systematic error. We developed methods to assess significance using variance stabilization, learning error-model parameters without external control experiments. The method was validated experimentally, shows greater sensitivity than the current standard method, and incorporates false-discovery rate analysis. The corresponding software ('Chipper') is freely available. The method described here should help reveal an organism's transcription-regulatory 'wiring diagram'.</p>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>A major goal in understanding cellular behavior is to reveal the 'wiring' of transcriptional regulation, through which transcription factors (TFs) bind target-gene promoters to control gene expression. Promoter regions contain sequence elements - typically 5 to 12 nucleotides (nt) in length - at which TFs bind specifically. By enhancing or inhibiting transcription or recruiting complexes that remodel chromatin structure, TFs regulate expression of the genes whose promoters they bind. Chromatin immunoprecipitation (ChIP) is an experimental technique for identifying those regions of DNA bound by a particular protein, and is, therefore, a useful method for determining which genes have their promoters bound by a TF. In outline, the method consists of the following steps. The TF under study is crosslinked to DNA which is subsequently extracted and sheared into fragments approximately 500 nt long (1,000 nt resolution is usually sufficient to assign binding to the regulation of a specific gene, so it is rare to exceed this length <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>). The fragments are immunoprecipitated with an antibody specific to that TF (or to a peptide affinity tag fused to that TF), whereupon the crosslinks are reversed, the DNA precipitate amplified, and the intergenic regions (IGRs) containing the binding site(s) are determined by examining the relative abundance of each immunoprecipitated DNA fragment. The combination of ChIP with microarray technology is often called 'ChIP-chip' <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and is referred to here as 'Chip<sup>2</sup>'. It has turned ChIP into a high-throughput technique for efficiently mapping gene regulatory networks <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
			<p>Two-channel microarrays use hybridization to compare the abundance of specific nucleic acid sequences in one mixture to abundance of the same sequences in another control mixture. The choice of control mixture may greatly affect the outcome of the experiment. A typical choice is fragmented genomic DNA, which controls for the relative abundance and non-specific hybridization potential of genomic DNA fragments. Genomic DNA may be purified from 'whole-cell extract', which itself is sometimes used as a control. As some DNA fragments may be 'stickier' than others, a more stringent and laborious mock control (containing fragments recovered nonspecifically by immunoprecipitation (IP)) is sometimes performed, in which the TF does not have a fused affinity tag.</p>
			<p>The change in abundance of a particular sequence between two mixtures is often measured in terms of 'fold-change' between the two channels (ratio) or, alternatively, the logarithm of fold-change (log-ratio). The IP channel serves as numerator, while the control is the denominator. The array surface between regions with spotted DNA is never completely 'dark', due to the combined effects of residual DNA fragments bound non-specifically to the array surface, and the experimentalist's control of the visual amplification ('gain') in the image analysis software. It is customary to subtract this 'background' from each spot because it reveals nothing about the protein-DNA binding. This subtraction raises the possibility, however, that the denominator could become negative or zero, in which case the log-ratio is undefined. Common strategies for handling zero or negative values are either to threshold them or to discard the data points altogether, neither of which is entirely satisfactory. A further, and perhaps more serious, problem is the practice of interpreting fold-change as a measure of significance, although it has no statistical basis as such. Small random fluctuations in signals close to background, particularly in the denominator, are amplified, leading to spuriously high levels of 'fold-change' <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In other words, we should have less confidence in a twofold change between signals that are each near the background noise than in a twofold change between strong signals. Because we are generally more interested in whether a region is specifically bound at all than we are in the degree of its binding (occupancy), there is a need for an accurate measure of confidence in each measurement.</p>
			<p>A statistical approach for analysis of mRNA abundance microarrays has been developed in which a 'single-array' error model accounts for variation in the background level for each microarray, while a 'gene-specific' error model describes variation of a single gene across replicate arrays. These two complementary models can be combined to estimate the error in each log-ratio measurement <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. A variant of the single-array approach (in which there is gene-specific normalization) has been applied to transcription-factor binding site identification by means of Chip<sup>2 </sup>in yeast <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Unfortunately, it requires one or more separate control experiments to determine error model parameters, in which identical nucleic acid mixtures are compared. This adds to the expense of the experiment; furthermore, error model parameters derived from a separate microarray are potential sources of systematic error, since quality can vary between microarrays.</p>
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st>
			<p>Here we describe a new approach for assessing statistical significance of TF-binding from Chip<sup>2 </sup>data. We illustrate our method using a Chip<sup>2 </sup>analysis of Sko1 (also known as Acr1), a TF of the basic leucine zipper (bZIP) family (CREB sub-family) that regulates the expression of osmotic stress inducible genes <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. We also use independent confirmation experiments of individual IGRs to validate our method.</p>
			<sec>
				<st>
					<p>Combining replicates</p>
				</st>
				<p>We distinguish two kinds of repeated experiment. When the same IGR is spotted onto an array in more than one location, we term these measurements 'duplicates,' and we consider them as two spatially separated parts of the same 'spot'. Though other approaches have been described <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, for simplicity we average duplicate signals before analyzing them, giving us a single value that is less susceptible to physical blemishes on the slide. When the same IGRs are spotted onto two or more distinct microarrays, we term them 'replicates.' We consider each replicate as an independent measurement of the binding affinity or 'occupancy' of the IGRs.</p>
			</sec>
			<sec>
				<st>
					<p>Variance stabilization</p>
				</st>
				<p>It is common to replicate genome-wide experiments several times, to improve confidence in the results, which may be degraded by array imperfections or by handling errors. Additional replicates can compensate for random error in individual measurements, and the typical number of replicates is likely to increase as the cost of microarrays falls <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Sometimes the most significantly enhanced IGRs are those with low signal-to-noise ratio, yet applying log-ratios to such signals has the potential to introduce many false positives because minor variations in a small denominator value can have a large effect on a ratio. A single-array error model can account for this variation in calculating significance for each IGR. The log-ratios themselves are difficult to interpret, however, because two IGRs with the same log-ratio may differ in significance, and a greater log-ratio does not necessarily indicate increased significance. An alternative approach, the method of variance stabilization, was described by two groups <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> and made available as part of the BioConductor project <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> in the package 'vsn' <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. It uses a regression algorithm that is robust to outliers to scale and offset each channel independently, in such a way that the variance between channels is independent of signal strength. The transformation of the signal <it>y</it><sub><it>i </it></sub>in the <it>i</it>th channel (<it>i </it>= 1 for IP, or <it>i </it>= 2 for control) can be expressed as:</p>
				<p>
					<graphic file="gb-2005-6-11-r96-i1.gif"/>
				</p>
				<p>where <it>&#945;</it><sub><it>i </it></sub>and <it>&#955;</it><sub><it>i </it></sub>represent the background and noise in the <it>i</it>th channel, respectively. Because ln(<it>a</it>) - ln(<it>b</it>) = ln(<it>a</it>/<it>b</it>), the difference between the two transformed channels (&#916;<it>h </it>&#8801; <it>h</it><sub>1 </sub>- <it>h</it><sub>2</sub>) is then a generalized log-ratio that is asymptotically equivalent to the log-ratio of the original channels when both are high (<it>y</it><sub><it>i </it></sub>&gt;&gt; <it>&#945;</it><sub><it>i</it></sub>), yet transforms smoothly to the difference between channels when both are low. This allows direct comparison between any two datapoints, even when they belong to opposite ends of the microarray's dynamic range. Two IGRs with the same <it>&#916;h </it>are equally significant, and greater <it>&#916;h </it>implies a more significantly bound IGR.</p>
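				<p>The limiting behavior of the generalized log-ratio can be illustrated with a minimal Python sketch. It assumes an inverse hyperbolic sine (arsinh) transformation of the kind used by 'vsn', written here as arsinh((<it>y </it>- <it>&#945;</it>)/<it>&#955;</it>); this parameterization, the function names, and the numerical values of <it>&#945; </it>and <it>&#955; </it>are assumptions made for illustration only, not parameters fitted to any array.</p>
				<p><![CDATA[
```python
import math

def glog(y, alpha, lam):
    """Assumed vsn-style transform: arsinh of the background-corrected, noise-scaled signal."""
    return math.asinh((y - alpha) / lam)

def delta_h(y_ip, y_ctrl, alpha=(100.0, 100.0), lam=(50.0, 50.0)):
    """Generalized log-ratio between the IP channel and the control channel.

    alpha and lam hold hypothetical (IP, control) background and noise values.
    """
    return glog(y_ip, alpha[0], lam[0]) - glog(y_ctrl, alpha[1], lam[1])

# Strong signals: delta_h approaches the ordinary log-ratio, here ln(2).
strong = delta_h(200100.0, 100100.0)

# Signals near background: delta_h stays finite, whereas a log-ratio of the
# background-subtracted values (20 and -10) would be undefined.
weak = delta_h(120.0, 90.0)

print(strong, math.log(2.0))
print(weak)
```
]]></p>
				<p>In this sketch the strong-signal pair reproduces the conventional log-ratio, while the near-background pair yields a value close to the scaled difference between the channels.</p>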
			</sec>
			<sec>
				<st>
					<p>Deriving error model parameters internally</p>
				</st>
				<p>Binding of protein to DNA is a dynamic, stochastic process in equilibrium. While every TF is likely to be bound to every IGR at least some fraction of the time, our goal here is to perform binary classification of the IGRs. We therefore consider IGRs to fall into two categories: those that are specifically bound by the TF and those that are not. We wish to compute a <it>p </it>value that expresses our degree of surprise at seeing a particular <it>&#916;h </it>score for a given IGR, under the null hypothesis that the IGR is not bound. The 'vsn' package can be used to variance-stabilize each array separately, or all of them simultaneously; we used the former method. Having computed the inter-channel variance-stabilized difference (<it>&#916;h</it>) for each spot, we may plot a histogram of all scores from a chip. We expect that most regions are not bound. Therefore, the distribution of <it>&#916;h </it>scores should be largely determined by random binding and measurement errors <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. A smaller number of regions are bound, and those will tend to have positive scores, indicating higher occupancies in the IP channel than the whole-cell extract/mock control. Measurements in the negative portion of the <it>&#916;h </it>distribution should, therefore, be more completely dominated by unbound IGRs. By fitting a parametric curve to the region of the observed <it>&#916;h </it>distribution left of the mode, we obtain an estimate of the null distribution in the positive region of the <it>&#916;h </it>distribution. This is an essential feature of our method, because it allows us to estimate the distribution expected of unbound IGRs without performing an external control experiment in which an identical mixture is examined in both channels of a separate microarray. It is this null distribution that permits calculation of significance for each observed <it>&#916;h </it>value. 
The symmetric nature of the null distribution is an assumption of our model, and is based on our own experience and that of others <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
				<p>Specifically, a parametric distribution is fitted by minimizing the negative log-likelihood of the data to the left of the mode (found after smoothing the data using gaussian kernel-based density estimation) <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. Three possible distributions were initially considered (normal, Cauchy, and Gumbel), but the normal distribution consistently obtains the best log-likelihood score. Goodness-of-fit for the fitted normal distributions was verified with a &#967;<sup>2 </sup>test, and all passed with <it>p </it>&lt; 10<sup>-20</sup>. The <it>&#916;h </it>scores from all replicates are standardized (centered to have zero mean and re-scaled to have unit variance) yielding a score <it>z</it><sub><it>i </it></sub>= (<it>&#916;h</it><sub><it>i </it></sub>- <it>&#956;</it><sub><it>i</it></sub>)/<it>&#963;</it><sub><it>i</it></sub>, where <it>&#956;</it><sub><it>i </it></sub>and <it>&#963;</it><sub><it>i </it></sub>represent the mean and standard deviation, respectively, of the <it>&#916;h </it>values obtained from replicate <it>i</it>. Figure <figr fid="F1">1a-c</figr> shows <it>&#916;h </it>distributions for three replicates <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. We expect the distribution of <it>&#916;h </it>scores to be centered about zero; as shown by the vertical dotted lines in Figure <figr fid="F1">1</figr>, this is true to a very good approximation. Variance stabilization attempts to transform the data such that measurement error is uniform for each spot on a given array, and if replicate arrays were identical, one would expect to see the same variance in each array; large discrepancies between arrays might indicate problems with the quality of some of the arrays. Standardization is necessary to account for minor (on the order of 10%) differences in variance between arrays. 
Standardized scores are averaged to give an overall score (<graphic file="gb-2005-6-11-r96-i2.gif"/>), the distribution of which is shown in Figure <figr fid="F1">1d</figr>. This distribution is again smoothed with a gaussian kernel, and fitted as described above. Finally, a <it>p </it>value for each IGR is computed on the <graphic file="gb-2005-6-11-r96-i2.gif"/> score, according to the null hypothesis that all IGRs are described by this fitted normal distribution, that is, they are not bound by the TF.</p>
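				<p>The fitting step can be sketched in Python under simplifying assumptions. The sketch below handles a single score distribution rather than the average over replicates, pins the mean of the null distribution at the kernel-smoothed mode so that the maximum-likelihood standard deviation has a closed form (instead of minimizing the negative log-likelihood numerically), and uses synthetic scores. The function names, the bandwidth, and the simulated data are hypothetical and are not taken from the Chipper software.</p>
				<p><![CDATA[
```python
import math
import random

def kde_mode(values, bandwidth, grid_size=400):
    """Locate the mode of a Gaussian kernel density estimate on a regular grid."""
    lo, hi = min(values), max(values)
    best_x, best_density = lo, -1.0
    for k in range(grid_size + 1):
        x = lo + (hi - lo) * k / grid_size
        density = sum(math.exp(-0.5 * ((x - v) / bandwidth) ** 2) for v in values)
        if density > best_density:
            best_x, best_density = x, density
    return best_x

def fit_left_normal(values, bandwidth=0.3):
    """Fit a normal null using only the data at or left of the mode.

    With the mean pinned at the mode and symmetry assumed, the maximum-likelihood
    standard deviation is the root mean squared distance of those points from the mode.
    """
    mode = kde_mode(values, bandwidth)
    left = [v for v in values if v <= mode]
    sigma = math.sqrt(sum((v - mode) ** 2 for v in left) / len(left))
    return mode, sigma

def upper_tail_p(score, mu, sigma):
    """One-sided p value: probability that an unbound IGR scores at least this high."""
    return 0.5 * math.erfc((score - mu) / (sigma * math.sqrt(2.0)))

# Synthetic example: 950 unbound IGRs plus 50 bound IGRs shifted to positive scores.
rng = random.Random(0)
scores = [rng.gauss(0.0, 1.0) for _ in range(950)]
scores += [rng.gauss(4.0, 1.0) for _ in range(50)]

mu, sigma = fit_left_normal(scores)
p_values = [upper_tail_p(s, mu, sigma) for s in scores]
```
]]></p>
				<p>Because only the left half of the distribution enters the fit, the bound IGRs in the positive tail do not inflate the estimated width of the null distribution.</p>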
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Three replicate two-channel Chip<sup>2 </sup>experiments performed on Sko1 [22] were variance-stabilized</p>
					</caption>
					<text>
						<p>Three replicate two-channel Chip<sup>2 </sup>experiments performed on Sko1 [22] were variance-stabilized. <b>(a-c) </b>Distributions of the <it>&#916;h </it>values obtained. Shaded gray areas indicate kernel-smoothed densities estimated from data. Magenta curves estimate the distribution of scores expected of unbound intergenic regions (IGRs) by fitting a normal distribution to the negative <it>&#916;h </it>side of the distribution. Sufficient statistics (mean, variance) of each fitted distribution are used to standardize the <it>&#916;h </it>distributions to a score <it>z</it><sub><it>i </it></sub>for each replicate. <b>(d) </b>The distribution of the average score <graphic file="gb-2005-6-11-r96-i2.gif"/> over all three replicates. We computed a <it>p </it>value for each IGR under the null hypothesis that it is unbound, using the curve fitted to the negative portion of the empirical <graphic file="gb-2005-6-11-r96-i2.gif"/> distribution.</p>
					</text>
					<graphic file="gb-2005-6-11-r96-1"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Experimental verification of our dataset and evaluation of <it>p </it>value accuracy</p>
				</st>
				<p>The distribution of computed <it>p </it>values is shown in Figure <figr fid="F2">2a</figr>. It clearly shows near-ideal behavior: uniform distribution across most of the interval (0,1) arising from the vast majority of unbound IGRs, and a peak close to <it>p = 0</it>, arising from bound IGRs. Figure <figr fid="F2">2b</figr> shows the distribution of <it>q </it>values. As expected, most IGRs have a high <it>q </it>value, consistent with the assumption that most are unbound. False discovery rates, as represented by <it>q </it>values <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, are particularly useful when the goal is discovery of TF-bound IGRs. For example, the <it>q </it>values for Sko1 (see Additional data file 1) indicate that scientists willing to accept a list of targets in which 33% are false positives should examine the top 224 entries using a more-accurate experimental method, while those only willing to tolerate a false-discovery rate of 20% should restrict themselves to the top 91.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Observed distributions of <it>p </it>and <it>q </it>values</p>
					</caption>
					<text>
						<p>Observed distributions of <it>p </it>and <it>q </it>values. <b>(a) </b>The distribution of <it>p </it>values for the same data as in Figure 1. They are relatively uniformly distributed on the interval (0,1), except for a slight peak close to <it>p </it>= 0, indicating a small fraction of specifically bound intergenic regions (IGRs). <b>(b) </b>Corresponding <it>q </it>values, but with a log scale on the vertical axis. As one descends the ranked list of IGRs the <it>q </it>value rapidly approaches unity. That most IGRs have <it>q </it>close to 1 is expected given that the list of tested IGRs is long, and the number of true targets is generally small.</p>
					</text>
					<graphic file="gb-2005-6-11-r96-2"/>
				</fig>
				<p>We independently validated 35 target genes spread widely across the top 350 in our list using targeted ChIP analysis. Considering only the 35 targets for which follow-up testing was performed, ranking of IGRs by the <it>p </it>values of Lee <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> (see Additional data file 4) shows an ability similar to our method ('Chipper') at placing true positives above false positives. When considering all IGRs, however, there is little correlation between rank by our method and rank by the Lee <it>et al</it>. approach. In other words, top-ranking targets by one method are not top-ranking by the other. Thus, although our validation experiments are consistent with Chipper achieving the same sensitivity at a lower false-positive rate, it is also possible that the two methods are each adept at identifying different subsets of targets. The discrepancy may be due to some systematic error in determination of the parameters of the error model. As the error model parameters are not provided explicitly with their data, we could not investigate this possibility further. Inaccurate determination of error-model parameters can lead to unjustified confidence in differences based on noisy measurements. Therefore, in the task of ranking IGRs by the likelihood of being TF-bound, Chipper is on par with and complementary to the Lee <it>et al</it>. approach and may outperform it. Furthermore, the Chipper algorithm uses an internally determined error model and thus is not subject to systematic errors that may arise via the separate control experiments required of the methods in Lee <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Below we show that Chipper allows increased sensitivity at a given significance threshold.</p>
				<p>Chip<sup>2 </sup>experiments cannot distinguish the strand on which binding occurs, only the location at which it takes place. When binding is assigned to an IGR less than 2,000 nt in size, which happens to separate two genes on opposite strands, it is not possible to determine, on the basis of Chip<sup>2 </sup>alone, which one is the target of a TF. For example, as illustrated in Table <tblr tid="T1">1</tblr>, <it>FAA1 </it>and <it>COT1 </it>are divergently transcribed genes separated by a 1,800 nt IGR. The IGR is split into <it>FAA1</it>-proximal and <it>COT1</it>-proximal IGR segments. The primers used for targeted ChIP (about 200 nt) are smaller than the sheared fragments used in the microarray experiments (500 nt), which gives them a greater spatial resolution. As the primers are designed for a specific promoter, and amplified by polymerase chain reaction, they are strand-specific. Only <it>FAA1 </it>is found to bind Sko1 in a targeted ChIP experiment, yet because both IGR segments overlap Sko1-bound fragments in the Chip<sup>2 </sup>experiment, a spurious positive result is generated for <it>COT1</it>. We score correctly identified IGRs as true positives, even when only a single gene is verified in the targeted experiment. The Sko1 data, along with further study of Sko1 targets, are published elsewhere in the context of a focused study of Sko1 <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Divergently transcribed genes, grouped in pairs of which at least one is a target of Sko1, according to a targeted ChIP assay</p>
					</caption>
					<tblbdy cols="3">
						<r>
							<c ca="left">
								<p>Gene</p>
							</c>
							<c ca="center">
								<p>Promoter</p>
							</c>
							<c ca="left">
								<p>Target?</p>
							</c>
						</r>
						<r>
							<c cspan="3">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>FAA1</it>
								</p>
							</c>
							<c ca="center">
								<p>-827/-576</p>
							</c>
							<c ca="left">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>COT1</it>
								</p>
							</c>
							<c ca="center">
								<p>-1,743/-1,561</p>
							</c>
							<c ca="left">
								<p>No</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>PUT4</it>
								</p>
							</c>
							<c ca="center">
								<p>-617/-372</p>
							</c>
							<c ca="left">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>CIN1</it>
								</p>
							</c>
							<c ca="center">
								<p>-1,007/ -</p>
							</c>
							<c ca="left">
								<p>No</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>RPI1</it>
								</p>
							</c>
							<c ca="center">
								<p>-606/-451</p>
							</c>
							<c ca="left">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>RHO3</it>
								</p>
							</c>
							<c ca="center">
								<p>-1,611/-1,336</p>
							</c>
							<c ca="left">
								<p>No</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>SPO20</it>
								</p>
							</c>
							<c ca="center">
								<p>-449/-211</p>
							</c>
							<c ca="left">
								<p>Yes</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<it>SOK2</it>
								</p>
							</c>
							<c ca="center">
								<p>-1,896/-</p>
							</c>
							<c ca="left">
								<p>No</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>Promoter distances are measured in nucleotides from the start codon of gene 1. Both genes of a pair are counted as positives in evaluating the algorithm described here, since distinguishing members of these pairs is beyond the resolution of Chip<sup>2 </sup>experimental technology. ChIP, chromatin immunoprecipitation.</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>False discovery rate analysis</p>
				</st>
				<p>A common measure of significance used in hypothesis testing is the <it>p </it>value. In large-scale experiments like these, random chance can cause some IGRs to have <it>p </it>values that will be considered significant. Multiple hypothesis corrections (that is, corrections for the fact that a hypothesis is being tested multiple times, once for each IGR) are a popular approach in which the significance threshold is made more stringent (or, equivalently, each <it>p </it>value is adjusted upward) as a function of the number of IGRs. Bonferroni-type <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> corrections are often conservative, in that many true positives may be classified as non-significant ('false negatives'). This is borne out in our analysis of Sko1 Chip<sup>2 </sup>data, in which, after multiple-hypothesis correction, only a small number of IGRs (&lt;10) were significant, at an experimentwise <it>p </it>value = 0.05 or lower (equivalent to <it>p </it>= 1.06 &#215; 10<sup>-5 </sup>before multiple-hypothesis correction). However, the motivation of most Chip<sup>2 </sup>users is not to cautiously establish a list of binding sites that are known with near-certainty. The attraction of Chip<sup>2 </sup>is its high-throughput nature, which allows the experimentalist to rapidly generate a list of potential binding sites for subsequent study. A relatively recent alternative to the <it>p </it>value is the <it>q </it>value, which is a measure of false discovery rate (FDR) that has proven useful when the aim of an experiment is hypothesis generation rather than hypothesis testing <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. Despite the fact that Chip<sup>2 </sup>experiments are typically used for hypothesis generation, no previously reported analysis of Chip<sup>2 </sup>experiments has employed an FDR approach. 
Figure <figr fid="F3">3</figr> shows that the <it>q </it>values computed from our <it>p </it>values (broken line) agree quite well with our empirical FDR (solid line). As the first verified false positive ranks just above 100, our empirical FDR is zero to that point. Thereafter, it tracks the computed FDR quite closely until all true positives have been discovered.</p>
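				<p>The comparison between computed and empirical FDR can be sketched in Python. The sketch is a simplified stand-in for the QVALUE software: it assumes that the proportion of unbound IGRs is 1 (a Benjamini-Hochberg-style calculation), whereas QVALUE estimates that proportion from the data, so the values here are somewhat conservative. The function names and the example inputs are hypothetical.</p>
				<p><![CDATA[
```python
def bh_q_values(p_values):
    """Benjamini-Hochberg-style q values (assumes the null proportion pi0 = 1)."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    q = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down so that each q value is the smallest
    # estimated FDR among all thresholds that would still include this IGR.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q[i] = running_min
    return q

def empirical_fdr(validated_labels):
    """Running fraction of false positives among validated IGRs, in rank order."""
    fdr, false_count = [], 0
    for n, is_true_target in enumerate(validated_labels, start=1):
        false_count += 0 if is_true_target else 1
        fdr.append(false_count / n)
    return fdr

# Hypothetical inputs: five p values and four validation outcomes in rank order.
example_q = bh_q_values([0.0001, 0.004, 0.03, 0.2, 0.9])
example_fdr = empirical_fdr([True, True, False, True])
```
]]></p>
				<p>The empirical curve remains at zero until the first validated false positive is reached, which is the behavior described above for the Sko1 list.</p>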
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Agreement between predicted and empirical false-discovery rate for Sko1</p>
					</caption>
					<text>
						<p>Agreement between predicted and empirical false-discovery rate for Sko1. The broken curve shows <it>q </it>values computed from the ranked list of <it>p </it>values, using QVALUE software [32]. The solid curve shows the false-discovery rate (FDR) computed using only targeted chromatin immunoprecipitation experiments (35 targets).</p>
					</text>
					<graphic file="gb-2005-6-11-r96-3"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Validation with publicly available datasets</p>
				</st>
				<p>We obtained the raw data used by Lee <it>et al. </it><abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and compared the <it>p </it>values produced by our algorithm with the published <it>p </it>values. The 7,200 IGRs were ranked using the appropriate score for each method, and the ranked lists were evaluated for the presence of targets annotated as bound by the TF of interest in the Yeast Proteome Database (YPD) <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Data for two TFs (Ino4 and Sko1) are shown in Figure <figr fid="F4">4</figr>, and analysis of another six TFs is shown in Additional data file 5. In Figure <figr fid="F4">4a</figr> we show the receiver-operating characteristic (ROC) curve for Ino4, which tracks the sensitivity of an algorithm (its ability to find true positives (TPs)) as a function of its tendency to turn up false positives (FPs). An optimal algorithm would rank all TPs at the top. Its ROC curve would begin at the lower left-hand corner (FP = 0, TP = 0), move vertically to the upper left-hand corner (FP = 0, TP = 1), and then across the top of the chart to the upper right-hand corner (FP = 1, TP = 1). As this is a hypothesis-generation technique, only those targets near the top of a ranked list are likely to be of interest; we therefore show only the region from FP = 0 to FP = 0.1. The ranking performance of each algorithm is good in this case, and there appears little to choose between methods: either one can achieve a sensitivity of almost 1.0 with a false-positive rate of about 0.05.</p>
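				<p>The construction of such a curve from a ranked list can be sketched in Python. The function name and the toy inputs are hypothetical; the sketch assumes that lower scores rank first (as for <it>p </it>values), that the list contains at least one known target and at least one non-target, and it does not treat tied scores specially.</p>
				<p><![CDATA[
```python
def roc_points(scores, is_target):
    """Return (false-positive rate, true-positive rate) pairs down a ranked list.

    Lower scores rank first, as when IGRs are ordered by p value. Assumes at
    least one target and one non-target, so neither denominator is zero.
    """
    ranked = sorted(zip(scores, is_target), key=lambda pair: pair[0])
    positives = sum(1 for flag in is_target if flag)
    negatives = len(is_target) - positives
    tp = fp = 0
    points = [(0.0, 0.0)]
    for _, flag in ranked:
        if flag:
            tp += 1
        else:
            fp += 1
        points.append((fp / negatives, tp / positives))
    return points

# Toy ranking: both known targets receive the two smallest p values, so the
# curve reaches full sensitivity before any false positive is admitted.
curve = roc_points([0.001, 0.002, 0.3, 0.6, 0.9],
                   [True, True, False, False, False])
```
]]></p>
				<p>For this toy ranking the curve follows the optimal path described above: up the left-hand edge to full sensitivity, then across the top of the chart.</p>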
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Performance of our algorithm on publicly available Chip<sup>2 </sup>data [2] is evaluated using the Yeast Proteome Database collection of transcription factor targets [28,29] and compared with another popular means of computing <it>p </it>values [2]</p>
					</caption>
					<text>
						<p>Performance of our algorithm on publicly available Chip<sup>2 </sup>data [2] is evaluated using the Yeast Proteome Database collection of transcription factor targets [28,29] and compared with another popular means of computing <it>p </it>values [2]. <b>(a) </b>Receiver-operating characteristic curves for our method (black, 'Chipper') and that of Lee <it>et al</it>. [2] (green, 'Lee') using three replicate experiments for the transcription factor Ino4, made publicly available by Lee <it>et al. </it><b>(b) </b>Sensitivity as a function of significance threshold. The broken line represents the performance of choosing potential targets at random. <b>(c,d) </b>Analogous curves for the transcription factor Sko1. FP, false positive; TP, true positive.</p>
					</text>
					<graphic file="gb-2005-6-11-r96-4"/>
				</fig>
				<p>In practice, however, it is common to consider only those IGRs passing a standard threshold of significance (<it>p </it>&lt; 10<sup>-3 </sup>in Lee <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and Harbison <it>et al</it>. <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>). Therefore, we evaluated the same data, but rather than focusing on simple ranking ability, we examined the <it>p </it>value of each call (results for Ino4 shown in Figure <figr fid="F4">4b</figr>). We constructed the graph by choosing a significance threshold (<it>&#945;</it>) and asking what fraction of the known true positives exceed the threshold (that is, have <it>p </it>values less than <it>&#945;</it>). At <it>&#945;</it> = 1, any algorithm will have perfect sensitivity because it calls all IGRs significant; this comes at the cost of specificity, as it is unable to distinguish between true and false positives. The <it>p </it>values reported by Lee <it>et al. </it><abbrgrp><abbr bid="B2">2</abbr></abbrgrp> are shown in green; those from our method are shown in black. The vertical dotted line indicates a threshold <it>&#945;</it><sub>7 </sub>= 10<sup>-3 </sup>at which we would expect approximately 7 out of 7,200 intergenic regions to achieve significant scores purely by chance, even if none were bound by the TF. The vertical dashed line indicates the threshold <it>&#945;</it><sub>1 </sub>= 1.6 &#215; 10<sup>-4</sup>, which we expect to be exceeded by chance for only one out of 7,200 IGRs. The unshaded area to the right of <it>&#945;</it><sub>1 </sub>indicates the region in which fewer than one IGR would be expected to exceed the threshold by chance. The higher an algorithm's sensitivity in this region (that is, the more true positives it puts here), the better. As we decrease the threshold, the sensitivity decreases slowly at first, for both methods. For the <it>p </it>values of Lee <it>et al. </it><abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, there is then a rapid reduction in sensitivity. At an <it>&#945;</it> threshold such that only one false positive is expected, our method can recover more than half the known targets while Lee <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> find none.</p>
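				<p>The two quantities plotted in this kind of graph can be sketched in a few lines of Python. The function names and the four example <it>p </it>values below are hypothetical and serve only as an illustration; the count of 7,200 IGRs and the two thresholds are taken from the text. Under the null hypothesis of no binding, the expected number of regions passing a threshold <it>&#945;</it> is simply <it>&#945;</it> multiplied by the number of regions tested.</p>

```python
def sensitivity(target_p_values, alpha):
    """Fraction of known targets whose p value is less than alpha."""
    hits = sum(1 for p in target_p_values if alpha > p)
    return hits / len(target_p_values)


def expected_chance_hits(alpha, n_regions):
    """Expected number of regions passing alpha if none were bound."""
    return alpha * n_regions


N_IGRS = 7200  # number of intergenic regions quoted in the text

# Hypothetical p values for four known targets (illustration only).
example_targets = [1e-6, 5e-5, 2e-3, 0.4]

print(expected_chance_hits(1e-3, N_IGRS))    # roughly 7 regions by chance
print(expected_chance_hits(1.6e-4, N_IGRS))  # roughly 1 region by chance
print(sensitivity(example_targets, 1.6e-4))  # half of the example targets pass
```

				<p>Sweeping the threshold from 1 downwards and recording the sensitivity at each step traces out one curve per method; the comparison of interest is the sensitivity retained once the expected number of chance hits falls to about one.</p>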
				<p>In Figure <figr fid="F4">4c</figr>, we show an ROC curve for the transcription factor Sko1, for which nine targets are annotated in the YPD. The error model of Lee <it>et al. </it><abbrgrp><abbr bid="B2">2</abbr></abbrgrp> ranks the targets slightly better than our method of average <it>z </it>scores. Yet, as shown in Figure <figr fid="F4">4d</figr>, for any given significance threshold, our algorithm returns more of those targets. Of all the TFs examined, Ino4 showed the most striking improvement in sensitivity (Figure <figr fid="F4">4b</figr>). However, for each of the eight TFs we examined (Figure <figr fid="F4">4</figr> and Additional data file 5) our method called an equal or greater number of targets significant at the level of <it>&#945;</it><sub>1 </sub>than did the method of Lee <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Thus, for all TFs examined, our method yields sensitivity either markedly better than or similar to that of the <it>de facto </it>standard method.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st>
			<p>We have developed a method for analyzing results from chromatin-immunoprecipitation/microarray (Chip<sup>2</sup>) experiments that computes <it>p </it>values without needing a separate control for developing a model of measurement error. The method proposed here successfully combines multiple replicates (separate arrays) and duplicates (same array) to produce a single overall <it>p </it>value for each IGR. By using variance stabilization rather than log ratios, we eliminate the need to threshold low-signal spots, obtaining an alternative measure, <it>&#916;h</it>, which interpolates between a difference and a log-ratio and is monotonically related to significance. In addition, by averaging the resulting <it>z </it>score over replicates, an IGR that scores highly in a single replicate, but has no usable data in other replicates, may score well in the overall rankings. This is desirable in hypothesis generation: the algorithm should not be conservative; rather, it should be sensitive and provide accurate <it>p </it>values by which the false-positive rate can be judged. The <it>p </it>values produced by our algorithm behave as one would expect <it>p </it>values to: they follow a broadly uniform distribution over the full range, but with enrichment near <it>p </it>= 0. Experimentalists can use the <it>q </it>values computed from these <it>p </it>values to generate a short list that is customized to their tolerance for false discoveries. We have evaluated our algorithm using the transcription factor Sko1 by performing targeted ChIP on 35 selected genes. Additionally, we have compared performance of our algorithm with that of a previous error model <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, using data from a public database of transcription-factor targets <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>. Generally, discrimination of true positives, as measured by ROC curves, is comparable for both methods. However, our method returns targets with more significant <it>p </it>values. We find that the observed false-discovery rate on these putative targets generally tracks that predicted by the <it>q </it>values, thereby validating the accuracy of the <it>p </it>values and <it>q </it>values produced by our method. To parameterize error models, the method presented here requires no external control microarray experiments (which may introduce systematic error), giving it a distinct advantage over others in current use. Software implementing the algorithm is available either in web-based form for online use, or for download by non-commercial users, from our website <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<p>Chip<sup>2 </sup>analysis on Sko1 was performed using three microarrays, each with duplicate spots. Genomic DNA was used as a negative control. We used targeted ChIP experiments on 35 putative targets of Sko1 to validate how well our algorithm finds TF binding sites. We selected targets distributed throughout the top-ranking 350 IGRs. Primers were specifically designed for each IGR, and each region was assayed three times both with and without the hemagglutinin (HA) epitope tag, and the results averaged. The <it>POL1 </it>open reading frame (ORF) and an ORF-free region were used as negative controls, since Sko1 is not expected to bind there. Each IGR was scored according to the ratio of its IP efficiency with the HA epitope tag compared to that of <it>POL1 </it>ORF (non-specific control). Based on prior experience, we chose a threshold of 2.0, above which we considered Sko1 to have bound to the IGR, and below which we considered it not to have bound. By this criterion, we found 21 bound IGRs, with the remaining 7 tested IGRs not bound. (The number of IGRs tested is less than the number of target genes because some IGRs are associated with more than one gene.) Of those scoring &gt;2.0, we found that six (<it>ICY1</it>, <it>HOR7</it>, <it>YPR127W</it>, <it>DPM1</it>, <it>POS5</it>, and <it>RSN1</it>) also scored highly (above 2.0) without the tag, indicating that they bind non-specifically. In fact, only <it>POS5 </it>scored in the top 100 by our method. Further details on Chip<sup>2 </sup>analysis of Sko1 and validation experiments are published elsewhere in the context of a focused study of Sko1 <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. The complete dataset is available from the Gene Expression Omnibus (GEO) <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> under series accession number GSE3335.</p>
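			<p>The scoring rule used for the targeted ChIP validation can be summarized in a short Python sketch. The function name <it>classify</it>, its three labels and the example ratios are assumptions made for illustration; only the threshold of 2.0 and the tagged versus untagged comparison come from the description above. Reporting regions that pass without the tag under a separate label is a choice made in this sketch, whereas the text counts them among those scoring above the threshold.</p>

```python
THRESHOLD = 2.0  # cutoff on the IP-efficiency ratio stated in the text


def classify(ratio_tagged, ratio_untagged, threshold=THRESHOLD):
    """Label one IGR from its IP-efficiency ratios relative to the POL1 ORF.

    ratio_tagged: ratio measured with the HA epitope tag.
    ratio_untagged: ratio measured without the tag.
    """
    if ratio_tagged > threshold and ratio_untagged > threshold:
        # High signal even without the tag suggests non-specific binding.
        return "non-specific"
    if ratio_tagged > threshold:
        return "bound"
    return "not bound"


# Hypothetical ratios, not measurements from the paper.
print(classify(5.3, 1.1))  # bound
print(classify(4.0, 3.2))  # non-specific
print(classify(1.2, 0.9))  # not bound
```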
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> is a tab-delimited file containing the results of our analysis for all IGRs studied in our experiments. Additional data file <supplr sid="S2">2</supplr> contains a detailed description of the comparison between the targets of Sko1 identified by Chipper when applied both to the data presented here and to other Chip<sup>2 </sup>data <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and previously published <it>p </it>values using a single-array error model <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Additional data files <supplr sid="S3">3</supplr> and <supplr sid="S4">4</supplr> are figures illustrating these comparisons. Additional data file <supplr sid="S5">5</supplr> is a figure comparing the two methods as applied to results from six additional transcription factors. Additional data file <supplr sid="S6">6</supplr> lists the IGRs identified as targets <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
			<suppl id="S1">
				<title>
					<p>Additional data File 1</p>
				</title>
				<caption>
					<p>A tab-delimited file containing the results of our analysis for all intergenic regions studied in our experiments</p>
				</caption>
				<text>
					<p>A tab-delimited file containing the results of our analysis for all intergenic regions studied in our experiments.</p>
				</text>
				<file name="gb-2005-6-11-r96-S1.txt">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S2">
				<title>
					<p>Additional data File 2</p>
				</title>
				<caption>
					<p>A detailed description of the comparison between the targets of Sko1 identified by Chipper when applied both to the data presented here and to other Chip<sup>2 </sup>data <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and previously published <it>p </it>values using a single-array error model <abbrgrp><abbr bid="B2">2</abbr></abbrgrp></p>
				</caption>
				<text>
					<p>A detailed description of the comparison between the targets of Sko1 identified by Chipper when applied both to the data presented here and to other Chip<sup>2 </sup>data <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and previously published <it>p </it>values using a single-array error model <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
				</text>
				<file name="gb-2005-6-11-r96-S2.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S3">
				<title>
					<p>Additional data File 3</p>
				</title>
				<caption>
					<p>A figure illustrating the comparisons made in Additional data file 2</p>
				</caption>
				<text>
					<p>A figure illustrating the comparisons made in Additional data file 2.</p>
				</text>
				<file name="gb-2005-6-11-r96-S3.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S4">
				<title>
					<p>Additional data File 4</p>
				</title>
				<caption>
					<p>A figure illustrating the comparisons made in Additional data file 2</p>
				</caption>
				<text>
					<p>A figure illustrating the comparisons made in Additional data file 2.</p>
				</text>
				<file name="gb-2005-6-11-r96-S4.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S5">
				<title>
					<p>Additional data File 5</p>
				</title>
				<caption>
					<p>A figure comparing the two methods described in Additional data file 2 as applied to results from six additional transcription factors.</p>
				</caption>
				<text>
					<p>A figure comparing the two methods described in Additional data file 2 as applied to results from six additional transcription factors.</p>
				</text>
				<file name="gb-2005-6-11-r96-S5.pdf">
					<p>Click here for file</p>
				</file>
			</suppl>
			<suppl id="S6">
				<title>
					<p>Additional data File 6</p>
				</title>
				<caption>
					<p>A list of the intergenic regions identified as targets <abbrgrp><abbr bid="B29">29</abbr></abbrgrp></p>
				</caption>
				<text>
					<p>A list of the intergenic regions identified as targets <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
				</text>
				<file name="gb-2005-6-11-r96-S6.txt">
					<p>Click here for file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We thank J Geisberg, M Damelin, P Silver, Z Moqtaderi and J Wade for helpful discussions, and J Geisberg and J Casolari for 'beta-testing' the website and algorithm. F.D.G. and F.P.R. were supported in part by Funds for Discovery provided by John Taplin and by an institutional grant from the HHMI Biomedical Research Support Program for Medical Schools. M.P., F.D.G., and K.S. were supported by NIH/NIGMS grants GM30186, GM53720, and NIH/NHGRI grant HG003147. M.P. was supported by an EMBO Long Term Fellowship and the 'Ram&#243;n y Cajal' program of the Spanish Ministry of Science.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>ChIP-chip: considerations for the design, analysis, and application of genome-wide chromatin immunoprecipitation experiments.</p>
				</title>
				<aug>
					<au>
						<snm>Buck</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Lieb</snm>
						<fnm>JD</fnm>
					</au>
				</aug>
				<source>Genomics</source>
				<pubdate>2004</pubdate>
				<volume>83</volume>
				<fpage>349</fpage>
				<lpage>360</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.ygeno.2003.11.004</pubid>
						<pubid idtype="pmpid">14986705</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Transcriptional regulatory networks in <it>Saccharomyces cerevisiae</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Lee</snm>
						<fnm>TI</fnm>
					</au>
					<au>
						<snm>Rinaldi</snm>
						<fnm>NJ</fnm>
					</au>
					<au>
						<snm>Robert</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Odom</snm>
						<fnm>DT</fnm>
					</au>
					<au>
						<snm>Bar-Joseph</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Gerber</snm>
						<fnm>GK</fnm>
					</au>
					<au>
						<snm>Hannett</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Harbison</snm>
						<fnm>CT</fnm>
					</au>
					<au>
						<snm>Thompson</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Simon</snm>
						<fnm>I</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2002</pubdate>
				<volume>298</volume>
				<fpage>799</fpage>
				<lpage>804</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1075090</pubid>
						<pubid idtype="pmpid" link="fulltext">12399584</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF.</p>
				</title>
				<aug>
					<au>
						<snm>Iyer</snm>
						<fnm>VR</fnm>
					</au>
					<au>
						<snm>Horak</snm>
						<fnm>CE</fnm>
					</au>
					<au>
						<snm>Scafe</snm>
						<fnm>CS</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Snyder</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>PO</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2001</pubdate>
				<volume>409</volume>
				<fpage>533</fpage>
				<lpage>538</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35054095</pubid>
						<pubid idtype="pmpid" link="fulltext">11206552</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Promoter-specific binding of Rap1 revealed by genome-wide maps of protein-DNA association.</p>
				</title>
				<aug>
					<au>
						<snm>Lieb</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>PO</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2001</pubdate>
				<volume>28</volume>
				<fpage>327</fpage>
				<lpage>334</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng569</pubid>
						<pubid idtype="pmpid" link="fulltext">11455386</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Genome-wide location and function of DNA binding proteins.</p>
				</title>
				<aug>
					<au>
						<snm>Ren</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Robert</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Wyrick</snm>
						<fnm>JJ</fnm>
					</au>
					<au>
						<snm>Aparicio</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Jennings</snm>
						<fnm>EG</fnm>
					</au>
					<au>
						<snm>Simon</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Zeitlinger</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Schreiber</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Hannett</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Kanin</snm>
						<fnm>E</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2000</pubdate>
				<volume>290</volume>
				<fpage>2306</fpage>
				<lpage>2309</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.290.5500.2306</pubid>
						<pubid idtype="pmpid" link="fulltext">11125145</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Genome-wide analysis of protein-DNA interactions in living cells.</p>
				</title>
				<aug>
					<au>
						<snm>Pugh</snm>
						<fnm>BF</fnm>
					</au>
					<au>
						<snm>Gilmour</snm>
						<fnm>DS</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2001</pubdate>
				<volume>2</volume>
				<fpage>reviews1013.1</fpage>
				<lpage>1013.3</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/gb-2001-2-4-reviews1013</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Genome-wide location and regulated recruitment of the RSC nucleosome-remodeling complex.</p>
				</title>
				<aug>
					<au>
						<snm>Ng</snm>
						<fnm>HH</fnm>
					</au>
					<au>
						<snm>Robert</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Young</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Struhl</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Genes Dev</source>
				<pubdate>2002</pubdate>
				<volume>16</volume>
				<fpage>806</fpage>
				<lpage>819</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">186327</pubid>
						<pubid idtype="pmpid" link="fulltext">11937489</pubid>
						<pubid idtype="doi">10.1101/gad.978902</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Transcriptional regulatory code of a eukaryotic genome.</p>
				</title>
				<aug>
					<au>
						<snm>Harbison</snm>
						<fnm>CT</fnm>
					</au>
					<au>
						<snm>Gordon</snm>
						<fnm>DB</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>TI</fnm>
					</au>
					<au>
						<snm>Rinaldi</snm>
						<fnm>NJ</fnm>
					</au>
					<au>
						<snm>MacIsaac</snm>
						<fnm>KD</fnm>
					</au>
					<au>
						<snm>Danford</snm>
						<fnm>TW</fnm>
					</au>
					<au>
						<snm>Hannett</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Tagne</snm>
						<fnm>JB</fnm>
					</au>
					<au>
						<snm>Reynolds</snm>
						<fnm>DB</fnm>
					</au>
					<au>
						<snm>Yoo</snm>
						<fnm>J</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>431</volume>
				<fpage>99</fpage>
				<lpage>104</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature02800</pubid>
						<pubid idtype="pmpid" link="fulltext">15343339</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs.</p>
				</title>
				<aug>
					<au>
						<snm>Cawley</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Bekiranov</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ng</snm>
						<fnm>HH</fnm>
					</au>
					<au>
						<snm>Kapranov</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Sekinger</snm>
						<fnm>EA</fnm>
					</au>
					<au>
						<snm>Kampa</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Piccolboni</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Sementchenko</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Cheng</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Williams</snm>
						<fnm>AJ</fnm>
					</au>
					<etal/>
				</aug>
				<source>Cell</source>
				<pubdate>2004</pubdate>
				<volume>116</volume>
				<fpage>499</fpage>
				<lpage>509</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0092-8674(04)00127-8</pubid>
						<pubid idtype="pmpid" link="fulltext">14980218</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Functional discovery via a compendium of expression profiles.</p>
				</title>
				<aug>
					<au>
						<snm>Hughes</snm>
						<fnm>TR</fnm>
					</au>
					<au>
						<snm>Marton</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>AR</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Stoughton</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Armour</snm>
						<fnm>CD</fnm>
					</au>
					<au>
						<snm>Bennett</snm>
						<fnm>HA</fnm>
					</au>
					<au>
						<snm>Coffey</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Dai</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>He</snm>
						<fnm>YD</fnm>
					</au>
					<etal/>
				</aug>
				<source>Cell</source>
				<pubdate>2000</pubdate>
				<volume>102</volume>
				<fpage>109</fpage>
				<lpage>126</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0092-8674(00)00015-5</pubid>
						<pubid idtype="pmpid">10929718</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Yeast SKO1 gene encodes a bZIP protein that binds to the CRE motif and acts as a repressor of transcription.</p>
				</title>
				<aug>
					<au>
						<snm>Nehlin</snm>
						<fnm>JO</fnm>
					</au>
					<au>
						<snm>Carlberg</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Ronne</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1992</pubdate>
				<volume>20</volume>
				<fpage>5271</fpage>
				<lpage>5278</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">334331</pubid>
						<pubid idtype="pmpid">1437546</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Repressors and upstream repressing sequences of the stress-regulated <it>ENA1</it> gene in <it>Saccharomyces cerevisiae</it>: bZIP protein Sko1p confers HOG-dependent osmotic regulation.</p>
				</title>
				<aug>
					<au>
						<snm>Proft</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Serrano</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>1999</pubdate>
				<volume>19</volume>
				<fpage>537</fpage>
				<lpage>546</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">83911</pubid>
						<pubid idtype="pmpid" link="fulltext">9858577</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>ACR1, a yeast ATF/CREB repressor.</p>
				</title>
				<aug>
					<au>
						<snm>Vincent</snm>
						<fnm>AC</fnm>
					</au>
					<au>
						<snm>Struhl</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>1992</pubdate>
				<volume>12</volume>
				<fpage>5394</fpage>
				<lpage>5405</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">360477</pubid>
						<pubid idtype="pmpid">1448073</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Use of within-array replicate spots for assessing differential expression in microarray experiments.</p>
				</title>
				<aug>
					<au>
						<snm>Smyth</snm>
						<fnm>GK</fnm>
					</au>
					<au>
						<snm>Michaud</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Scott</snm>
						<fnm>HS</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<fpage>2067</fpage>
				<lpage>2075</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti270</pubid>
						<pubid idtype="pmpid" link="fulltext">15657102</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Variance stabilization applied to microarray data calibration and to the quantification of differential expression.</p>
				</title>
				<aug>
					<au>
						<snm>Huber</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>von Heydebreck</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>S&#252;ltmann</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Poustka</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Vingron</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18 Suppl 1</volume>
				<fpage>S96</fpage>
				<lpage>S104</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">12169536</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>A variance-stabilizing transformation for gene-expression microarray data.</p>
				</title>
				<aug>
					<au>
						<snm>Durbin</snm>
						<fnm>BP</fnm>
					</au>
					<au>
						<snm>Hardin</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Hawkins</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Rocke</snm>
						<fnm>DM</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18 Suppl 1</volume>
				<fpage>S105</fpage>
				<lpage>S110</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">12169537</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Bioconductor: open software development for computational biology and bioinformatics.</p>
				</title>
				<aug>
					<au>
						<snm>Gentleman</snm>
						<fnm>RC</fnm>
					</au>
					<au>
						<snm>Carey</snm>
						<fnm>VJ</fnm>
					</au>
					<au>
						<snm>Bates</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Bolstad</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Dettling</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Dudoit</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ellis</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Gautier</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Ge</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Gentry</snm>
						<fnm>J</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<fpage>R80</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">545600</pubid>
						<pubid idtype="pmpid" link="fulltext">15461798</pubid>
						<pubid idtype="doi">10.1186/gb-2004-5-10-r80</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>A model for measurement error for gene expression arrays.</p>
				</title>
				<aug>
					<au>
						<snm>Rocke</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Durbin</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>J Comput Biol</source>
				<pubdate>2001</pubdate>
				<volume>8</volume>
				<fpage>557</fpage>
				<lpage>569</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/106652701753307485</pubid>
						<pubid idtype="pmpid" link="fulltext">11747612</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Parameter estimation for the calibration and variance stabilization of microarray data.</p>
				</title>
				<aug>
					<au>
						<snm>Huber</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>von Heydebreck</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>S&#252;ltmann</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Poustka</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Vingron</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Stat Appl Genet Mol Biol</source>
				<pubdate>2003</pubdate>
				<volume>2</volume>
				<fpage>3.1</fpage>
				<lpage>3.22</lpage>
			</bibl>
			<bibl id="B20">
				<aug>
					<au>
						<snm>Dennis</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Schnabel</snm>
						<fnm>RB</fnm>
					</au>
				</aug>
				<source>Numerical Methods for Unconstrained Optimization and Nonlinear Equations</source>
				<publisher>Englewood Cliffs, NJ: Prentice-Hall</publisher>
				<pubdate>1983</pubdate>
			</bibl>
			<bibl id="B21">
				<aug>
					<au>
						<snm>Press</snm>
						<fnm>WH</fnm>
					</au>
					<au>
						<snm>Flannery</snm>
						<fnm>BP</fnm>
					</au>
					<au>
						<snm>Teukolsky</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Vetterling</snm>
						<fnm>WT</fnm>
					</au>
				</aug>
				<source>Numerical Recipes</source>
				<publisher>Cambridge, UK: Cambridge University Press</publisher>
				<edition>1</edition>
				<pubdate>1986</pubdate>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Genomewide identification of Sko1 target promoters reveals a regulatory network that operates in response to osmotic stress in <it>Saccharomyces cerevisiae</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Proft</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Gibbons</snm>
						<fnm>FD</fnm>
					</au>
					<au>
						<snm>Copeland</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Roth</snm>
						<fnm>FP</fnm>
					</au>
					<au>
						<snm>Struhl</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Eukaryotic Cell</source>
				<pubdate>2005</pubdate>
				<volume>4</volume>
				<fpage>1343</fpage>
				<lpage>1352</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1214534</pubid>
						<pubid idtype="pmpid" link="fulltext">16087739</pubid>
						<pubid idtype="doi">10.1128/EC.4.8.1343-1352.2005</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>The positive false discovery rate: a Bayesian interpretation and the q-value.</p>
				</title>
				<aug>
					<au>
						<snm>Storey</snm>
						<fnm>JD</fnm>
					</au>
				</aug>
				<source>Ann Stat</source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<fpage>2013</fpage>
				<lpage>2035</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1214/aos/1074290335</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<aug>
					<au>
						<snm>Sokal</snm>
						<fnm>RR</fnm>
					</au>
					<au>
						<snm>Rohlf</snm>
						<fnm>FJ</fnm>
					</au>
				</aug>
				<source>Biometry: The Principles and Practice of Statistics in Biological Research</source>
				<publisher>New York: WH Freeman &amp; Company</publisher>
				<edition>3</edition>
				<pubdate>1995</pubdate>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Controlling the false discovery rate: a practical and powerful approach to multiple testing.</p>
				</title>
				<aug>
					<au>
						<snm>Benjamini</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Hochberg</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>J R Stat Soc Ser B</source>
				<pubdate>1995</pubdate>
				<volume>57</volume>
				<fpage>289</fpage>
				<lpage>300</lpage>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Statistical significance for genomewide studies.</p>
				</title>
				<aug>
					<au>
						<snm>Storey</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Tibshirani</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<fpage>9440</fpage>
				<lpage>9445</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">170937</pubid>
						<pubid idtype="pmpid" link="fulltext">12883005</pubid>
						<pubid idtype="doi">10.1073/pnas.1530509100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Yeast Protein Database (YPD): a database for the complete proteome of <it>Saccharomyces cerevisiae</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Payne</snm>
						<fnm>WE</fnm>
					</au>
					<au>
						<snm>Garrels</snm>
						<fnm>JI</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1997</pubdate>
				<volume>25</volume>
				<fpage>57</fpage>
				<lpage>62</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">146399</pubid>
						<pubid idtype="pmpid" link="fulltext">9016505</pubid>
						<pubid idtype="doi">10.1093/nar/25.1.57</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>The yeast proteome database (YPD) and <it>Caenorhabditis elegans</it> proteome database (WormPD): comprehensive resources for the organization and comparison of model organism protein information.</p>
				</title>
				<aug>
					<au>
						<snm>Costanzo</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Hogan</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Cusick</snm>
						<fnm>ME</fnm>
					</au>
					<au>
						<snm>Davis</snm>
						<fnm>BP</fnm>
					</au>
					<au>
						<snm>Fancher</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Hodges</snm>
						<fnm>PE</fnm>
					</au>
					<au>
						<snm>Kondu</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Lengieza</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Lew-Smith</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Lingner</snm>
						<fnm>C</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2000</pubdate>
				<volume>28</volume>
				<fpage>73</fpage>
				<lpage>76</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">102421</pubid>
						<pubid idtype="pmpid" link="fulltext">10592185</pubid>
						<pubid idtype="doi">10.1093/nar/28.1.73</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Three yeast proteome databases: YPD, PombePD, and CalPD (MycoPathPD).</p>
				</title>
				<aug>
					<au>
						<snm>Csank</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Costanzo</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Hirschman</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Hodges</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Kranz</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Mangan</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>O'Neill</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Robertson</snm>
						<fnm>LS</fnm>
					</au>
					<au>
						<snm>Skrzypek</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Brooks</snm>
						<fnm>J</fnm>
					</au>
					<etal/>
				</aug>
				<source>Methods Enzymol</source>
				<pubdate>2002</pubdate>
				<volume>350</volume>
				<fpage>347</fpage>
				<lpage>373</lpage>
				<xrefbib>
					<pubid idtype="pmpid">12073323</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Chipper</p>
				</title>
				<url>http://llama.med.harvard.edu/Software.html</url>
			</bibl>
			<bibl id="B31">
				<title>
					<p>NCBI GEO: mining millions of expression profiles - database and tools.</p>
				</title>
				<aug>
					<au>
						<snm>Barrett</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Suzek</snm>
						<fnm>TO</fnm>
					</au>
					<au>
						<snm>Troup</snm>
						<fnm>DB</fnm>
					</au>
					<au>
						<snm>Wilhite</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Ngau</snm>
						<fnm>WC</fnm>
					</au>
					<au>
						<snm>Ledoux</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Rudnev</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Lash</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Fujibuchi</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Edgar</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<volume>33 Database issue</volume>
				<fpage>D562</fpage>
				<lpage>D566</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">539976</pubid>
						<pubid idtype="pmpid" link="fulltext">15608262</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>QVALUE: The Manual. Version 1.0</p>
				</title>
				<url>http://faculty.washington.edu/~jstorey/qvalue/manual.pdf</url>
			</bibl>
		</refgrp>
	</bm>
</art>
