<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-10-38</ui>
   <ji>1471-2105</ji>
   <fm>
		<dochead>Research article</dochead>
		<bibl>
			<title>
				<p>On reliable discovery of molecular signatures</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Nilsson</snm>
					<fnm>Roland</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
					<email>rnilsson@broad.mit.edu</email>
				</au>
				<au id="A2">
					<snm>Bj&#246;rkegren</snm>
					<fnm>Johan</fnm>
					<insr iid="I2"/>
					<email>johan.bjorkegren@ki.se</email>
				</au>
				<au id="A3">
					<snm>Tegn&#233;r</snm>
					<fnm>Jesper</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
					<email>jesper.tegner@ki.se</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Computational Biology, Department of Physics, Link&#246;ping University, SE58183 Link&#246;ping, Sweden</p>
				</ins>
				<ins id="I2">
					<p>Unit of Computational Medicine, King Gustav V Research Institute, Department of Medicine, Karolinska Institutet, SE17176 Stockholm, Sweden</p>
				</ins>
			</insg>
			<source>BMC Bioinformatics</source>
			<issn>1471-2105</issn>
			<pubdate>2009</pubdate>
			<volume>10</volume>
			<issue>1</issue>
			<fpage>38</fpage>
			<url>http://www.biomedcentral.com/1471-2105/10/38</url>
			<xrefbib>
				<pubidlist>
					<pubid idtype="pmpid">19178740</pubid>
					<pubid idtype="doi">10.1186/1471-2105-10-38</pubid>
				</pubidlist>
			</xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>02</day>
					<month>4</month>
					<year>2008</year>
				</date>
			</rec>
			<acc>
				<date>
					<day>29</day>
					<month>1</month>
					<year>2009</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>29</day>
					<month>1</month>
					<year>2009</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2009</year>
			<collab>Nilsson et al; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Molecular signatures are sets of genes, proteins, genetic variants or other variables that can be used as markers for a particular phenotype. Reliable signature discovery methods could yield valuable insight into cell biology and mechanisms of human disease. However, it is currently not clear how to control error rates such as the false discovery rate (FDR) in signature discovery. Moreover, signatures for cancer gene expression have been shown to be unstable, that is, difficult to replicate in independent studies, casting doubts on their reliability.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>We demonstrate that with modern prediction methods, signatures that yield accurate predictions may still have a high FDR. Further, we show that even signatures with low FDR may fail to replicate in independent studies due to limited statistical power. Thus, neither stability nor predictive accuracy are relevant when FDR control is the primary goal. We therefore develop a general statistical hypothesis testing framework that for the first time provides FDR control for signature discovery. Our method is demonstrated to be correct in simulation studies. When applied to five cancer data sets, the method was able to discover molecular signatures with 5% FDR in three cases, while two data sets yielded no significant findings.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>Our approach enables reliable discovery of molecular signatures from genome-wide data with current sample sizes. The statistical framework developed herein is potentially applicable to a wide range of prediction problems in bioinformatics.</p>
				</sec>
			</sec>
		</abs>
	</fm>
   <bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>Molecular signatures are sets of genes, mRNA transcripts, proteins, genetic variants or other variables that can be used as markers for a particular cell or tissue phenotype, such as a cancerous or diabetic state. Signatures have a two-fold purpose: they may be useful for disease diagnosis or risk assessment (<it>prediction</it>), but they may also implicate molecules not previously known to be involved in the underlying molecular pathology (<it>discovery</it>), as illustrated in Figure <figr fid="F1">1A</figr>. Signature discovery differs from simple correlation or differential expression testing in that molecular signatures may account for multivariate effects and consists only of the variables most directly correlated with given phenotype. The signature approach has been especially popular for cancer diagnosis based on gene expression profiling, and several studies have proposed signatures for specific cancer types <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
					<abbr bid="B3">3</abbr>
					<abbr bid="B4">4</abbr>
					<abbr bid="B5">5</abbr>
				</abbrgrp>. A prominent example is the breast cancer signature discovered by van't Veer <it>et al</it>. <abbrgrp>
					<abbr bid="B4">4</abbr>
				</abbrgrp>, which is currently being validated in a clinical trial <abbrgrp>
					<abbr bid="B6">6</abbr>
				</abbrgrp>.</p>
			<fig id="F1">
				<title>
					<p>Figure 1</p>
				</title>
				<caption>
					<p>Signature discovery</p>
				</caption>
				<text>
					<p>
						<b>Signature discovery</b>. Molecular signatures (1) are markers for a particular cell or tissue phenotype. Signatures are discovered from a given set of molecular profiles (e.g., gene expression profiles) together with phenotype labels (2). Signatures have dual uses, both as predictive models (3) and for discovery of molecular mechanisms (4). While it is well-known how to assess predictive accuracy (5), the method proposed herein is the first to control signature FDR (6), enabling reliably discovery.</p>
				</text>
				<graphic file="1471-2105-10-38-1"/>
			</fig>
			<p>Unfortunately, existing computational approaches often fail to distinguish between the different objectives of prediction and discovery. If molecular signatures are to be used for discovery, then the primary objective is to control the false discovery rate (FDR) with respect to the optimal (true) signature. On the other hand, if the end goal is an accurate predictor, then the FDR of the gene signature is not important in itself. However, it has hitherto not been possible to directly address FDR control, since an operational definition of the optimal signature (a "gold standard") has not been available. Therefore, current methods for signature discovery resort to optimizing prediction accuracy, implicitly assuming that the FDR is thereby kept reasonably low, even though there is no <it>a priori </it>reason to assume that this is the case. Recently, the <it>stability </it>of a signature, that is, the expected overlap between signatures derived from replicated experiments, has been suggested as an alternative quality measure <abbrgrp>
					<abbr bid="B7">7</abbr>
					<abbr bid="B8">8</abbr>
				</abbrgrp>. Signatures derived from cancer gene expression data have been found to be unstable, raising concerns that existing signature discovery methods may not be sound <abbrgrp>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
				</abbrgrp>. While the stability measure seems intuitively reasonable and cleverly avoids the gold standard problem, it has not been shown that low stability actually indicates high FDR.</p>
			<p>In this paper, we build upon a recently discovered operational definition of the optimal signature to study the actual FDR in signature discovery. First, we demonstrate that high FDR can occur even with very accurate predictors. Therefore, current methods for signature discovery that focus on optimizing prediction accuracy offer no means of controlling the FDR. Second, we show that signatures can be highly unstable even when the FDR is kept low. Thus, reliable signature discovery may be possible in spite of the recent reports of unstable signatures in cancer <abbrgrp>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
				</abbrgrp>. Third, we propose a novel hypothesis testing procedure based on our definition of the optimal signature that for the first time directly addresses signature FDR. We show that our method achieves FDR control on simulated data. Application to well-known cancer data sets uncovers three novel molecular signatures for leukemia, colon and breast cancer.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>The optimal signature</p>
				</st>
				<p>For simplicity, we will consider a two-class prediction setting throughout, although the methods could be generalized to other prediction problems as well. A <it>predictor </it>is then a function <it>g </it>:<inline-formula>
						<m:math name="1471-2105-10-38-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mi mathvariant="script">X</m:mi>
									<m:mo>&#8614;</m:mo>
									<m:mi mathvariant="script">Y</m:mi>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiqaacqWFxepwcqWIMgsycqWFyeFwaaa@3AE3@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula>, where we take <inline-formula>
						<m:math name="1471-2105-10-38-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mi mathvariant="script">X</m:mi>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiqaacqWFxepwaaa@3743@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> = &#8477;<sup>
						<it>n </it>
					</sup>and <inline-formula>
						<m:math name="1471-2105-10-38-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mi mathvariant="script">Y</m:mi>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiqaacqWFyeFwaaa@3745@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> = {-1, +1}. The <it>accuracy </it>of a predictor <it>g </it>is 1 minus the probability of error or <it>risk R</it>(<it>g</it>) = <it>P</it>(<it>g</it>(<it>X</it>) &#8800; <it>Y</it>). An optimal predictor, denoted <it>g</it>* is one with maximal accuracy. An optimal signature can be defined as a minimal set of variables <it>S</it>* such that the optimal predictor obtained using only these variables is at least as accurate as any predictor obtained with any other set, that is,</p>
				<p>
					<display-formula id="M1">
						<m:math name="1471-2105-10-38-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mo>&#8704;</m:mo>
									<m:mi>S</m:mi>
									<m:mo>:</m:mo>
									<m:mo>&#8704;</m:mo>
									<m:msub>
										<m:mi>g</m:mi>
										<m:mi>S</m:mi>
									</m:msub>
									<m:mo>:</m:mo>
									<m:mi>R</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:msubsup>
										<m:mi>g</m:mi>
										<m:mrow>
											<m:msup>
												<m:mi>S</m:mi>
												<m:mo>&#8727;</m:mo>
											</m:msup>
										</m:mrow>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>&#8804;</m:mo>
									<m:mi>R</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:msub>
										<m:mi>g</m:mi>
										<m:mi>S</m:mi>
									</m:msub>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>,</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeyiaIiIaem4uamLaeiOoaOJaeyiaIiIaem4zaC2aaSbaaSqaaiabdofatbqabaGccqGG6aGocqWGsbGucqGGOaakcqWGNbWzdaqhaaWcbaGaem4uam1aaWbaaWqabeaacqGHxiIkaaaaleaacqGHxiIkaaGccqGGPaqkcqGHKjYOcqWGsbGucqGGOaakcqWGNbWzdaWgaaWcbaGaem4uamfabeaakiabcMcaPiabcYcaSaaa@4389@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>where <it>gS </it>denotes a predictor on the subspace <inline-formula>
						<m:math name="1471-2105-10-38-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mi mathvariant="script">X</m:mi>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiqaacqWFxepwaaa@3743@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula>
					<sub>
						<it>S </it>
					</sub>of <inline-formula>
						<m:math name="1471-2105-10-38-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mi mathvariant="script">X</m:mi>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiqaacqWFxepwaaa@3743@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> corresponding to the variable set <it>S</it>. Unfortunately, this criterion does not yield a unique <it>S</it>* in general, and there are examples of data distributions such that no tractable (polynomial-time) algorithms exist for computing <it>S</it>* [<abbrgrp>
						<abbr bid="B11">11</abbr>
					</abbrgrp>, pp. 562]. Consequently, most research has focused on heuristic algorithms for discovering approximate signatures with near-optimal prediction accuracy <abbrgrp>
						<abbr bid="B12">12</abbr>
					</abbrgrp>.</p>
				<p>While this approach has been largely successful at attaining good predictive accuracy, the lack of a "gold standard" has rendered direct evaluation of error rates for signature discovery algorithms impossible. To address this problem, we have recently shown <abbrgrp>
						<abbr bid="B13">13</abbr>
					</abbrgrp> that using a mild restriction on the class of data distributions, the set <it>S</it>* becomes unique and can be expressed as</p>
				<p>
					<display-formula id="M2">
						<m:math name="1471-2105-10-38-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msup>
										<m:mi>S</m:mi>
										<m:mo>&#8727;</m:mo>
									</m:msup>
									<m:mo>=</m:mo>
									<m:mo>{</m:mo>
									<m:mi>i</m:mi>
									<m:mo>:</m:mo>
									<m:mi>R</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:msubsup>
										<m:mi>g</m:mi>
										<m:mrow>
											<m:mo>{</m:mo>
											<m:mn>1</m:mn>
											<m:mo>,</m:mo>
											<m:mn>...</m:mn>
											<m:mo>,</m:mo>
											<m:mi>i</m:mi>
											<m:mo>&#8722;</m:mo>
											<m:mn>1</m:mn>
											<m:mo>,</m:mo>
											<m:mi>i</m:mi>
											<m:mo>+</m:mo>
											<m:mn>1</m:mn>
											<m:mo>,</m:mo>
											<m:mn>...</m:mn>
											<m:mo>,</m:mo>
											<m:mi>n</m:mi>
											<m:mo>}</m:mo>
										</m:mrow>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>></m:mo>
									<m:mi>R</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:msubsup>
										<m:mi>g</m:mi>
										<m:mrow>
											<m:mo>{</m:mo>
											<m:mn>1</m:mn>
											<m:mo>,</m:mo>
											<m:mn>...</m:mn>
											<m:mo>,</m:mo>
											<m:mi>n</m:mi>
											<m:mo>}</m:mo>
										</m:mrow>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>}</m:mo>
									<m:mo>.</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4uam1aaWbaaSqabeaacqGHxiIkaaGccqGH9aqpcqGG7bWEcqWGPbqAcqGG6aGocqWGsbGucqGGOaakcqWGNbWzdaqhaaWcbaGaei4EaSNaeGymaeJaeiilaWIaeiOla4IaeiOla4IaeiOla4IaeiilaWIaemyAaKMaeyOeI0IaeGymaeJaeiilaWIaemyAaKMaey4kaSIaeGymaeJaeiilaWIaeiOla4IaeiOla4IaeiOla4IaeiilaWIaemOBa4MaeiyFa0habaGaey4fIOcaaOGaeiykaKIaeyOpa4JaemOuaiLaeiikaGIaem4zaC2aa0baaSqaaiabcUha7jabigdaXiabcYcaSiabc6caUiabc6caUiabc6caUiabcYcaSiabd6gaUjabc2ha9bqaaiabgEHiQaaakiabcMcaPiabc2ha9jabc6caUaaa@60AC@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>That is, <it>S</it>* consists precisely of the variables <it>i </it>such that the error probability of the optimal predictor <it>g</it>* increases when <it>i </it>is removed. The required restriction is that the data density <it>f </it>(<it>x</it>) is everywhere strictly positive. This condition is satisfied by nearly all commonly used statistical models, including the exponential family, and we believe that it is reasonable for biological data. A formal proof of the correctness of (2) is given in Additional File <supplr sid="S1">1</supplr>.</p>
				<suppl id="S1">
					<title>
						<p>Additional file 1</p>
					</title>
					<text>
						<p>
							<b>Proofs.</b> This document provides proofs of uniqueness and optimality of the optimal signature <it>S</it>*.</p>
					</text>
					<file name="1471-2105-10-38-S1.pdf">
						<p>Click here for file</p>
					</file>
				</suppl>
				<p>Note that <it>S</it>* may be quite different from the set of variables that are marginally correlated with the phenotype (<it>e.g</it>., differentially expressed genes). This is because some correlated variables may be "redundant" for prediction: while these do contain information about the phenotype, that information is also present in other variables, so that the redundant variables are excluded from <it>S</it>*. Indeed, it can be proved that <it>S</it>* only contains variables <it>X</it>
					<sub>
						<it>i </it>
					</sub>that are conditionally dependent on <it>Y </it>regardless of what other variable set is conditioned on <abbrgrp>
						<abbr bid="B13">13</abbr>
					</abbrgrp>. In this sense, <it>S* </it>constitutes the variables "directly" correlated with <it>Y</it>. Moreover, some variables may be predictive only when considered together with certain other variables in a multivariate fashion, and thus <it>S</it>* may contain variables that are not detectable by standard univariate methods <abbrgrp>
						<abbr bid="B14">14</abbr>
					</abbrgrp>.</p>
				<p>The simple form (2) immediately suggests a general, linear-time, asymptotically correct algorithm for discovering <it>S</it>* from data, as described elsewhere <abbrgrp>
						<abbr bid="B13">13</abbr>
					</abbrgrp>. Here, we make use of the fact that (2) permits <it>S</it>* to be calculated for any given data distribution, thus providing the gold standard required for evaluating signature discovery methods and developing hypothesis testing procedures.</p>
			</sec>
			<sec>
				<st>
					<p>Accurate predictions despite high signature FDR</p>
				</st>
				<p>First, we tested whether high prediction accuracy implies a low false discovery rate with respect to <it>S</it>*. We performed a simulation study on a simple two-class prediction problem using a multivariate normal distribution with <it>n </it>= 1, 000 variables, of which 10% were in <it>S</it>* (see Methods for details). In each run, a signature <it>S </it>was chosen to achieve a given power and FDR with respect to <it>S</it>*, whereafter a Support Vector Machine (SVM) classifier was trained on a sample from the corresponding subspace of the data distribution. We found that FDR as high as 50% did not degrade predictive accuracy discernably, provided that statistical power was sufficient (Figure <figr fid="F2">2</figr>). Thus, prediction accuracy is not a valid measure of the reliability of a signature in terms of false positives.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Good predictive accuracy despite high FDR</p>
					</caption>
					<text>
						<p>
							<b>Good predictive accuracy despite high FDR</b>. Probability of prediction error for the Support Vector Machine (gray level) as a function of signature false discovery rate (FDR) and statistical power (fraction of true positives). Nearly horizontal level curves indicate weak dependence on FDR.</p>
					</text>
					<graphic file="1471-2105-10-38-2"/>
				</fig>
				<p>The likely reason for this behavior is that modern predictive methods such as the SVM have internal mechanisms for suppressing noise (regularization). They are therefore rather insensitive to false positives within the signature. For prediction purposes, it is more important that the signature does contain some true positives genes, while a large fraction of irrelevant genes may be tolerated without degrading predictive accuracy. As a consequence, discovering signatures by optimizing prediction accuracy should not be expected control FDR, as we will further demonstrate below.</p>
			</sec>
			<sec>
				<st>
					<p>Unstable Signatures with Low FDR</p>
				</st>
				<p>To investigate the relation between signature stability and FDR, we conducted a second simulation experiment, again with <it>n </it>= 1, 000 variables. Here, each variable was conditionally independent of all others within each class, so that <it>S</it>* has the form</p>
				<p>
					<display-formula>
						<m:math name="1471-2105-10-38-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msup>
										<m:mi>S</m:mi>
										<m:mo>&#8727;</m:mo>
									</m:msup>
									<m:mo>=</m:mo>
									<m:mo>{</m:mo>
									<m:mi>i</m:mi>
									<m:mo>:</m:mo>
									<m:mi mathvariant="double-struck">E</m:mi>
									<m:mo stretchy="false">[</m:mo>
									<m:msub>
										<m:mi>x</m:mi>
										<m:mi>i</m:mi>
									</m:msub>
									<m:mo>|</m:mo>
									<m:mi>Y</m:mi>
									<m:mo>=</m:mo>
									<m:mo>+</m:mo>
									<m:mn>1</m:mn>
									<m:mo stretchy="false">]</m:mo>
									<m:mo>&#8800;</m:mo>
									<m:mi mathvariant="double-struck">E</m:mi>
									<m:mo stretchy="false">[</m:mo>
									<m:msub>
										<m:mi>X</m:mi>
										<m:mi>i</m:mi>
									</m:msub>
									<m:mo>|</m:mo>
									<m:mi>Y</m:mi>
									<m:mo>=</m:mo>
									<m:mo>&#8722;</m:mo>
									<m:mn>1</m:mn>
									<m:mo stretchy="false">]</m:mo>
									<m:mo>}</m:mo>
									<m:mo>,</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4uam1aaWbaaSqabeaacqGHxiIkaaGccqGH9aqpcqGG7bWEcqWGPbqAcqGG6aGotuuDJXwAK1uy0HMmaeHbfv3ySLgzG0uy0HgiuD3BaGabaiab=ri8fjabcUfaBjabdIha4naaBaaaleaacqWGPbqAaeqaaOGaeiiFaWNaemywaKLaeyypa0Jaey4kaSIaeGymaeJaeiyxa0LaeyiyIKRae8hHWxKaei4waSLaemiwaG1aaSbaaSqaaiabdMgaPbqabaGccqGG8baFcqWGzbqwcqGH9aqpcqGHsislcqaIXaqmcqGGDbqxcqGG9bqFcqGGSaalaaa@5B9F@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>and can be discovered by simply testing the marginal distributions for a nonzero mean difference. For this we used Student's t-test with the Benjamini-Hochberg correction for FDR control, since the t-test has optimal power in this case and the FDR can be controlled exactly <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp>. Nevertheless, we found that the resulting signatures can be very unstable (Figure <figr fid="F3">3</figr>). For small effect sizes where power was low, stability was also low, despite a stringent FDR. Conversely, with strong effects and high power, stability was high, even with a high FDR. Also, the dependence of stability on FDR was different between low- and high-power regimes, indicating that the relationship between these measures is complicated and data-dependent. Clearly, unstable signatures need not contain many false positives. In the low power regime, the situation is rather that small signatures are being selected more or less at random from a large set of true positives, resulting in small overlap between experiments (Figure <figr fid="F3">3</figr>). Hence, in situations where many genes are weakly associated with a given phenotype and power is limited, it is simply not feasible to reproduce molecular signatures in independent experiments, even with the most stringent and correct methods. This implies that the lack of reproducibility observed for cancer gene expression signatures <abbrgrp>
						<abbr bid="B7">7</abbr>
						<abbr bid="B8">8</abbr>
					</abbrgrp> is not necessarily problematic. The same mechanism may also account for the low reproducibility of whole-genome association studies of complex diseases <abbrgrp>
						<abbr bid="B16">16</abbr>
					</abbrgrp>, where many genes are believed to be weakly associated with a given disease trait.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Signatures with low FDR may be unstable</p>
					</caption>
					<text>
						<p>
							<b>Signatures with low FDR may be unstable</b>. Left, statistical power <it>vs</it>. effect size (arbitrary units) for varying FDR. Middle, stability, defined as the average normalized overlap between two signatures <it>vs</it>. effect size and FDR. Right, illustration of how power affects stability.</p>
					</text>
					<graphic file="1471-2105-10-38-3"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>A Statistical Framework for Signature Discovery</p>
				</st>
				<p>The above results show that neither predictive accuracy nor stability are relevant measures of signature FDR. To directly control false discovery rates for signature discovery, we instead propose a general method for directly testing the hypothesis <it>i </it>&#8712; <it>S</it>* for each variable. From equation (2) it follows that a generally applicable test statistic is</p>
				<p>
					<display-formula>
						<m:math name="1471-2105-10-38-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msub>
										<m:mi>T</m:mi>
										<m:mi>i</m:mi>
									</m:msub>
									<m:mo>=</m:mo>
									<m:mover accent="true">
										<m:mi>R</m:mi>
										<m:mo>^</m:mo>
									</m:mover>
									<m:mo stretchy="false">(</m:mo>
									<m:msubsup>
										<m:mi>g</m:mi>
										<m:mrow>
											<m:mo>{</m:mo>
											<m:mn>1</m:mn>
											<m:mo>,</m:mo>
											<m:mn>...</m:mn>
											<m:mo>,</m:mo>
											<m:mi>i</m:mi>
											<m:mo>&#8722;</m:mo>
											<m:mn>1</m:mn>
											<m:mo>,</m:mo>
											<m:mi>i</m:mi>
											<m:mo>+</m:mo>
											<m:mn>1</m:mn>
											<m:mo>,</m:mo>
											<m:mn>...</m:mn>
											<m:mo>,</m:mo>
											<m:mi>n</m:mi>
											<m:mo>}</m:mo>
										</m:mrow>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>&#8722;</m:mo>
									<m:mover accent="true">
										<m:mi>R</m:mi>
										<m:mo>^</m:mo>
									</m:mover>
									<m:mo stretchy="false">(</m:mo>
									<m:msubsup>
										<m:mi>g</m:mi>
										<m:mrow>
											<m:mo>{</m:mo>
											<m:mn>1</m:mn>
											<m:mo>,</m:mo>
											<m:mn>...</m:mn>
											<m:mo>,</m:mo>
											<m:mi>n</m:mi>
											<m:mo>}</m:mo>
										</m:mrow>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>,</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemivaq1aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpcuWGsbGugaqcaiabcIcaOiabdEgaNnaaDaaaleaacqGG7bWEcqaIXaqmcqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWGPbqAcqGHsislcqaIXaqmcqGGSaalcqWGPbqAcqGHRaWkcqaIXaqmcqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWGUbGBcqGG9bqFaeaacqGHxiIkaaGccqGGPaqkcqGHsislcuWGsbGugaqcaiabcIcaOiabdEgaNnaaDaaaleaacqGG7bWEcqaIXaqmcqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaalcqWGUbGBcqGG9bqFaeaacqGHxiIkaaGccqGGPaqkcqGGSaalaaa@5BC3@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>where <inline-formula>
						<m:math name="1471-2105-10-38-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mover accent="true">
									<m:mi>R</m:mi>
									<m:mo>^</m:mo>
								</m:mover>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmOuaiLbaKaaaaa@2D12@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> is an estimated error probability, for example a cross-validated error estimate. This statistic is asymptotically correct for any data distribution, that is, with a sufficiently large sample size, the globally optimal solution will always be found <abbrgrp>
						<abbr bid="B13">13</abbr>
					</abbrgrp>. However, the sample sizes required for reasonable performance could be very large, since the error rate estimate <inline-formula>
						<m:math name="1471-2105-10-38-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mover accent="true">
									<m:mi>R</m:mi>
									<m:mo>^</m:mo>
								</m:mover>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmOuaiLbaKaaaaa@2D12@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> is uncertain. For particular types of predictors, it is therefore preferable to develop specialized statistics. As we are interested in applications to gene expression data, where simple prediction rules tend to work well, we here consider linear classifiers of the form <it>g</it>(<it>x</it>) = sign (&#8721;<sub>
						<it>i</it>
					</sub>
					<it>w</it>
					<sub>
						<it>i</it>
					</sub>
					<it>x</it>
					<sub>
						<it>i</it>
					</sub>). It is easy to see that in this case, equation (2) reduces to</p>
				<p>
					<display-formula>
						<m:math name="1471-2105-10-38-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msup>
										<m:mi>S</m:mi>
										<m:mo>&#8727;</m:mo>
									</m:msup>
									<m:mo>=</m:mo>
									<m:mo>{</m:mo>
									<m:mi>i</m:mi>
									<m:mo>:</m:mo>
									<m:msubsup>
										<m:mi>w</m:mi>
										<m:mi>i</m:mi>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
									<m:mo>&#8800;</m:mo>
									<m:mn>0</m:mn>
									<m:mo>}</m:mo>
									<m:mo>,</m:mo>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4uam1aaWbaaSqabeaacqGHxiIkaaGccqGH9aqpcqGG7bWEcqWGPbqAcqGG6aGocqWG3bWDdaqhaaWcbaGaemyAaKgabaGaey4fIOcaaOGaeyiyIKRaeGimaaJaeiyFa0NaeiilaWcaaa@3C62@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>where <inline-formula>
						<m:math name="1471-2105-10-38-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msubsup>
										<m:mi>w</m:mi>
										<m:mi>i</m:mi>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aa0baaSqaaiabdMgaPbqaaiabgEHiQaaaaaa@2FC3@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> denote the weights of the optimal classifier. Assuming that the classifier used is consistent, we have that <inline-formula>
						<m:math name="1471-2105-10-38-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msubsup>
										<m:mi>w</m:mi>
										<m:mi>i</m:mi>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aa0baaSqaaiabdMgaPbqaaiabgEHiQaaaaaa@2FC3@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula>[<it>w</it>
					<sub>
						<it>i</it>
					</sub>] &#8594; <inline-formula>
						<m:math name="1471-2105-10-38-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msubsup>
										<m:mi>w</m:mi>
										<m:mi>i</m:mi>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aa0baaSqaaiabdMgaPbqaaiabgEHiQaaaaaa@2FC3@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> as sample size increases. Hence, in this case we can equivalently test the null hypothesis <inline-formula>
						<m:math name="1471-2105-10-38-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mi mathvariant="double-struck">E</m:mi>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOjdaryqr1ngBPrginfgDObcv39gaiqaacqWFecFraaa@37B5@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula>[<it>w</it>
					<sub>
						<it>i</it>
					</sub>] = 0. More complicated parametric forms such as polynomials in <it>x</it>
					<sub>
						<it>i </it>
					</sub>could be used in a similar way, although the number of weights would increase accordingly.</p>
				<p>Since the statistical distribution of <it>w</it>
					<sub>
						<it>i </it>
					</sub>is unknown, we used a bootstrap technique to test whether <inline-formula>
						<m:math name="1471-2105-10-38-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mi mathvariant="double-struck">E</m:mi>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOjdaryqr1ngBPrginfgDObcv39gaiqaacqWFecFraaa@37B5@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula>[<it>w</it>
					<sub>
						<it>i</it>
					</sub>] = 0. By sampling with replacement from the given data set and re-training the classifier on each sample, we obtain <it>B </it>vectors <inline-formula>
						<m:math name="1471-2105-10-38-i12" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msubsup>
										<m:mi>w</m:mi>
										<m:mn>1</m:mn>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
									<m:mo>,</m:mo>
									<m:mn>...</m:mn>
									<m:mo>,</m:mo>
									<m:msubsup>
										<m:mi>w</m:mi>
										<m:mi>B</m:mi>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aa0baaSqaaiabigdaXaqaaiabgEHiQaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabcYcaSiabdEha3naaDaaaleaacqWGcbGqaeaacqGHxiIkaaaaaa@376E@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula>. For each variable <it>i</it>, the corresponding <inline-formula>
						<m:math name="1471-2105-10-38-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msubsup>
										<m:mi>w</m:mi>
										<m:mrow>
											<m:mn>1</m:mn>
											<m:mi>i</m:mi>
										</m:mrow>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
									<m:mo>,</m:mo>
									<m:mn>...</m:mn>
									<m:msubsup>
										<m:mi>w</m:mi>
										<m:mrow>
											<m:mi>B</m:mi>
											<m:mi>i</m:mi>
										</m:mrow>
										<m:mo>&#8727;</m:mo>
									</m:msubsup>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aa0baaSqaaiabigdaXiabdMgaPbqaaiabgEHiQaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabdEha3naaDaaaleaacqWGcbGqcqWGPbqAaeaacqGHxiIkaaaaaa@3944@</m:annotation>
							</m:semantics>
						</m:math>
					</inline-formula> are then used to obtain a bootstrap confidence interval for <it>w</it>
					<sub>
						<it>i</it>
					</sub>. This interval is inverted to obtain a bootstrap <it>p</it>-values <it>p</it>
					<sub>
						<it>i </it>
					</sub>for each variable <it>i </it>(that is, the null hypothesis is rejected at level <it>&#945; </it>if the (1 - <it>&#945;</it>) confidence interval does not cover zero). Importantly, this procedure preserves the full dependency structure of the data distribution. Finally, FDR control was performed using the Benjamini-Hochberg procedure <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Simulation Experiments</p>
				</st>
				<p>To validate our method, we conducted simulations using two-class data with 1, 000 variables and 100 samples. To model the variable dependencies often present in gene expression data, we used a class-conditional multivariate Gaussian distribution with precision matrices generated randomly as previously described <abbrgrp>
						<abbr bid="B17">17</abbr>
					</abbrgrp>. For this distribution class, it is straightforward to calculate <it>S</it>* (see methods). We chose sampling parameters so that <it>S</it>* constituted approx. 200 variables on average (since <it>S</it>* depends on the randomly chosen covariance matrix, its size fluctuates somewhat). We evaluated three linear classification methods: the Support Vector Machine (SVM) <abbrgrp>
						<abbr bid="B18">18</abbr>
					</abbrgrp>, the Kernel Fisher Discriminant (KFD) <abbrgrp>
						<abbr bid="B19">19</abbr>
					</abbrgrp> and the Weighted Voting (WV) algorithm of Golub <it>et al</it>. <abbrgrp>
						<abbr bid="B2">2</abbr>
					</abbrgrp>. Since the results were highly similar for all of these, we here only present results for the SVM (see Additional File <supplr sid="S2">2</supplr> for KFD and VW). For each learning method and across a range of effect sizes, our bootstrap test produced correct <it>p</it>-values, while power increased with increasing effect size (Figure <figr fid="F4">4A</figr>). This demonstrates that the bootstrap test is sound. After correcting for multiplicity using the procedure of Benjamini and Hochberg <abbrgrp>
						<abbr bid="B15">15</abbr>
					</abbrgrp>, we verified that our method did control FDR at nominal levels (Figure <figr fid="F4">4B</figr>). Power was limited however, especially for predictors with low accuracy. We therefore expect that for high-dimensional data, predictors must be quite accurate in order to yield reliable signatures. We also verified that the power of our bootstrap method approaches 1 as sample size increases, as one would expect (see Additional File <supplr sid="S2">2</supplr>). However, power depends on a number of distribution properties, and it is difficult to make predictions about the sample sizes required in practise from simulations.</p>
				<suppl id="S2">
					<title>
						<p>Additional file 2</p>
					</title>
					<text>
						<p>
							<b>KFD and WV methods, and convergence with increasing sample size.</b> This figure shows the results corresponding to Figure <figr fid="F4">4</figr> for the Kernel Fisher Discriminant (A-B) and Weighted Voting classification methods (C-D). Also shown is the convergence of the bootstrap method for the SVM classifier (E), where power approaches 1 as sample size increases.</p>
					</text>
					<file name="1471-2105-10-38-S2.pdf">
						<p>Click here for file</p>
					</file>
				</suppl>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Controlling error rates for gene signatures</p>
					</caption>
					<text>
						<p>
							<b>Controlling error rates for gene signatures</b>. <b>A: </b>Realized level and power for the bootstrap test at 5% nominal level. <b>B: </b>Realized FDR, power and stability for signatures selected by the bootstrap test after Benjamini-Hochberg (BH) correction. Here the nominal FDR was set at 5%. <b>C: </b>Same as (B) for signatures selected by recursive feature elimination (RFE). <b>D: </b>Same as (B) for signatures selected as the top 200 genes. Acc, classifier accuracy.</p>
					</text>
					<graphic file="1471-2105-10-38-4"/>
				</fig>
				<p>We repeated the simulation study using the popular Recursive Feature Elimination (RFE) method <abbrgrp>
						<abbr bid="B20">20</abbr>
					</abbrgrp> to discover signatures. While this method did produce accurate predictive models (data not shown), we observed that FDR was high (above 40% in this experiment) and depended on the effect size in an unpredictable manner. Indeed, optimizing prediction accuracy by RFE does not guarantee a reliable signature. High FDR was also observed when choosing the signature <it>S </it>as a fixed-size "top list" by the rank according to the <it>w</it>
					<sub>
						<it>g </it>
					</sub>statistics (Figure <figr fid="F4">4D</figr>). We have also previously observed high FDR for other methods that optimize the signature for prediction accuracy <abbrgrp>
						<abbr bid="B21">21</abbr>
					</abbrgrp>. Often, these methods attempt to include more variables in the signature when the prediction problem is harder, thus sacrificing FDR control for better predictive accuracy. Conversely, for less difficult prediction problems, many true positives may be removed from the signature because they do not influence predictive power discernably.</p>
			</sec>
			<sec>
				<st>
					<p>Application to Cancer Gene Expression</p>
				</st>
				<p>We applied our method together with the SVM prediction method to analyze a number of publicly available cancer gene expression data sets (Table <tblr tid="T1">1</tblr>). For the data sets by van't Veer <abbrgrp>
						<abbr bid="B4">4</abbr>
					</abbrgrp> and Wang <abbrgrp>
						<abbr bid="B5">5</abbr>
					</abbrgrp> where the SVM had poor accuracy, the bootstrap method did not call any genes significant. Note that these signatures may still be useful for prediction; the fact that no genes are called significant merely demonstrates that it is not possible to ascertain which genes are responsible for the predictive accuracy. For the remaining data sets, we found that higher predictive accuracy tends to result in greater power, in accordance with our simulation results. The largest signature, obtained for the data set by Golub <it>et al</it>. <abbrgrp>
						<abbr bid="B2">2</abbr>
					</abbrgrp>, contained over 500 genes at 5% FDR (see Additional Files <supplr sid="S3">3</supplr>, <supplr sid="S4">4</supplr> and <supplr sid="S5">5</supplr> for complete gene lists).</p>
				<suppl id="S3">
					<title>
						<p>Additional file 3</p>
					</title>
					<text>
						<p>
							<b>Gene signature for the Alon data set.</b> Excel file detailing the gene signature discovered by the bootstrap method using the SVM classifier. The corresponding signature from Recursive Features elimination is also provided for reference.</p>
					</text>
					<file name="1471-2105-10-38-S3.xls">
						<p>Click here for file</p>
					</file>
				</suppl>
				<suppl id="S4">
					<title>
						<p>Additional file 4</p>
					</title>
					<text>
						<p>
							<b>Gene signature for the Golub data set.</b> Excel file detailing the gene signature discovered by the bootstrap method using the SVM classifier. The corresponding signature from Recursive Features elimination is also provided for reference.</p>
					</text>
					<file name="1471-2105-10-38-S4.xls">
						<p>Click here for file</p>
					</file>
				</suppl>
				<suppl id="S5">
					<title>
						<p>Additional file 5</p>
					</title>
					<text>
						<p>
							<b>Gene signature for the Singh data set. </b>Excel file detailing the gene signature discovered by the bootstrap method using the SVM classifier. The corresponding signature from Recursive Features elimination is also provided for reference.</p>
					</text>
					<file name="1471-2105-10-38-S5.xls">
						<p>Click here for file</p>
					</file>
				</suppl>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Results on cancer gene expression data</p>
					</caption>
					<tblbdy cols="10">
						<r>
							<c ca="left">
								<p>Data set (ref.)</p>
							</c>
							<c ca="center">
								<p>
									<it>n</it>
								</p>
							</c>
							<c ca="center">
								<p>MCF,%</p>
							</c>
							<c ca="right">
								<p>CV,%</p>
							</c>
							<c ca="center">
								<p>TA,%(ref.)</p>
							</c>
							<c ca="center">
								<p>BS</p>
							</c>
							<c ca="center">
								<p>BS<sub>0</sub>
								</p>
							</c>
							<c ca="center">
								<p>RFE</p>
							</c>
							<c ca="center">
								<p>RFE<sub>0</sub>
								</p>
							</c>
							<c ca="center">
								<p>DE</p>
							</c>
						</r>
						<r>
							<c cspan="10">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Golub (2)</p>
							</c>
							<c ca="center">
								<p>72</p>
							</c>
							<c ca="center">
								<p>32</p>
							</c>
							<c ca="right">
								<p>97.0 &#177; 4.2</p>
							</c>
							<c ca="center">
								<p>99.3 (28)</p>
							</c>
							<c ca="center">
								<p>537</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>35</p>
							</c>
							<c ca="center">
								<p>154</p>
							</c>
							<c ca="center">
								<p>1007</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Singh (4)</p>
							</c>
							<c ca="center">
								<p>136</p>
							</c>
							<c ca="center">
								<p>43</p>
							</c>
							<c ca="right">
								<p>92.6 &#177; 3.0</p>
							</c>
							<c ca="center">
								<p>81.1 (27)</p>
							</c>
							<c ca="center">
								<p>99</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>48</p>
							</c>
							<c ca="center">
								<p>312</p>
							</c>
							<c ca="center">
								<p>3807</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Alon (1)</p>
							</c>
							<c ca="center">
								<p>62</p>
							</c>
							<c ca="center">
								<p>35</p>
							</c>
							<c ca="right">
								<p>81 &#177; 7.2</p>
							</c>
							<c ca="center">
								<p>97.9 (29)</p>
							</c>
							<c ca="center">
								<p>19</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>55</p>
							</c>
							<c ca="center">
								<p>94</p>
							</c>
							<c ca="center">
								<p>303</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Wang (6)</p>
							</c>
							<c ca="center">
								<p>286</p>
							</c>
							<c ca="center">
								<p>37</p>
							</c>
							<c ca="right">
								<p>65 &#177; 4.3</p>
							</c>
							<c ca="center">
								<p>N/A</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>261</p>
							</c>
							<c ca="center">
								<p>1250</p>
							</c>
							<c ca="center">
								<p>106</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>van't Veer (5)</p>
							</c>
							<c ca="center">
								<p>97</p>
							</c>
							<c ca="center">
								<p>47</p>
							</c>
							<c ca="right">
								<p>62 &#177; 8.4</p>
							</c>
							<c ca="center">
								<p>N/A</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>0</p>
							</c>
							<c ca="center">
								<p>42</p>
							</c>
							<c ca="center">
								<p>153</p>
							</c>
							<c ca="center">
								<p>1</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>Results are ordered by prediction accuracy. <it>n</it>, number of samples; MCF, minority class frequency; CVA, balanced cross-validated prediction accuracy, mean &#177; std.dev.; TA, balanced prediction accuracy of bootstrap signature on an independent test set (reference given in parentheses); BS, significant genes using the bootstrap with SVM at 5% FDR; RFE, genes chosen by recursive feature elimination; BS<sub>0 </sub>and RFE<sub>0</sub>, gene chosen by the bootstrap and RFE methods respectively on randomized data. DE, differentially expressed genes using the t-test at 5% FDR.</p>
					</tblfn>
				</tbl>
				<p>As a negative control, we applied our bootstrap test on randomized versions of each original data set where the phenotype values were randomly permuted, corresponding to the complete null hypothesis. This yielded zero significant genes in each case, confirming that we do not obtain spurious findings. In contrast, when applying the RFE method to randomized data, we consistently obtained even larger signatures than with the real data sets. We also tested each signature on an independent data set, confirming that the signatures are indeed predictive.</p>
				<p>For comparison, we performed a conventional differential expression test for each data set using the t-test statistic with the Benjamini-Hochberg correction (Table <tblr tid="T1">1</tblr>). This identified a substantially larger set of genes than the bootstrap method &#8211; in one case, more than half of the genes tested were significant. This illustrates the ability of the gene signature approach to distinguish the genes directly related to the phenotype variable from a much larger set of differentially expressed genes: many of the latter turn out to be "redundant" for prediction, meaning that they are correlated with the phenotype only indirectly, through genes in <it>S</it>*.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>Molecular signatures offer a systematic way to focus on the genes most directly associated with a given phenotype, and may yield valuable insights into the underlying biological system. It is therefore unfortunate that the reliability of signatures <it>per se </it>is poorly understood. Since no gold standard for signature discovery has been available, validation of discovered signatures often amounts to mining the scientific literature for documented connections between the phenotype being studied and the elements (genes) of a hypothesized signature. However, this approach is necessarily biased and rather speculative: it is by no means clear that a gene should be included in a predictive signature simply because it is somehow "related" to the phenotype. For example, approximately 25% of all known human genes have some documented relation to cancer <abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp>, but it is unlikely that all of these should be included in an optimal signature for cancer prediction.</p>
			<p>To address this issue, we have herein developed a statistical method for signature discovery based on a formal definition of the "gold standard" optimal signature. This allows for assessing the reliability of signatures without detailed knowledge of the biological system. To our knowledge, our method is the first to provide statistical guarantees for the reliability of molecular signatures, although we note that random forests are similar to our bootstrap testing scheme and also give indications of what variables are important for prediction.</p>
			<p>For two of the gene expression data sets investigated, including the well-studied cancer data by van't Veer <it>et al</it>. <abbrgrp>
					<abbr bid="B4">4</abbr>
				</abbrgrp>, our method did not call any genes significant, indicating that these data sets did not contain sufficient information to uncover gene signatures at the specified false discovery rate (5%). We emphasize that this does not necessarily mean that it is infeasible to construct predictive models for these studies, but merely that it is difficult to determine which genes are responsible for the predictive accuracy. In this sense, discovering reliable gene signatures can be a harder problem than obtaining accurate predictors. Prediction and signature discovery are two separate problems, and must be treated differently.</p>
			<p>For simplicity, we have here restricted our analysis to two-class problems and linear predictors. However, the proposed method is applicable to any learning method for which a reasonably well-powered statistic can be derived to test the signature null hypothesis. Continuous phenotype variables can easily be addressed by substituting the classification methods used herein for regression methods, such as ridge regression <abbrgrp>
					<abbr bid="B22">22</abbr>
				</abbrgrp> or the relevance vector machine <abbrgrp>
					<abbr bid="B23">23</abbr>
				</abbrgrp>. General methods for handling non-linear dependencies have also been described <abbrgrp>
					<abbr bid="B13">13</abbr>
					<abbr bid="B24">24</abbr>
				</abbrgrp>, although it is unclear whether signature discovery from gene expression data would benefit from these more complex models with currently available sample sizes.</p>
			<p>Some technical issues remain to be considered. First, testing the null hypothesis <inline-formula>
					<m:math name="1471-2105-10-38-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mi mathvariant="double-struck">E</m:mi>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOjdaryqr1ngBPrginfgDObcv39gaiqaacqWFecFraaa@37B5@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula>[<it>w</it>
				<sub>
					<it>i</it>
				</sub>] = 0 is technically correct only in the limit of large samples where <inline-formula>
					<m:math name="1471-2105-10-38-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mi mathvariant="double-struck">E</m:mi>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOjdaryqr1ngBPrginfgDObcv39gaiqaacqWFecFraaa@37B5@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula>[<it>w</it>
				<sub>
					<it>i</it>
				</sub>] &#8594; <inline-formula>
					<m:math name="1471-2105-10-38-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msubsup>
									<m:mi>w</m:mi>
									<m:mi>i</m:mi>
									<m:mo>&#8727;</m:mo>
								</m:msubsup>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aa0baaSqaaiabdMgaPbqaaiabgEHiQaaaaaa@2FC3@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula>. While our simulation studies indicate correct behavior for the sample sizes tested, this issue warrants further study. Second, bootstrap hypothesis testing is known to provide only approximate <it>p</it>-values, satisfying the inequality <it>P</it>(<it>p </it>&#8804; <it>&#945;</it>) &#8804; <it>&#945; </it>+ <inline-formula>
					<m:math name="1471-2105-10-38-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mi mathvariant="script">O</m:mi>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiqaacqWFoe=taaa@3731@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula>(1/<it>l</it>), where <it>l </it>is the sample size <abbrgrp>
					<abbr bid="B25">25</abbr>
				</abbrgrp>. While the additional term <inline-formula>
					<m:math name="1471-2105-10-38-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mi mathvariant="script">O</m:mi>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiqaacqWFoe=taaa@3731@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula>(1/<it>l</it>) was negligible in our simulations, this should be verified in each particular case before applying bootstrap testing. A possible future improvement could be to estimate this term from simulations and correct the bootstrap <it>p</it>-values accordingly, thereby "calibrating" the method.</p>
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>As we have shown, neither predictive accuracy nor stability constitute valid measures of FDR for molecular signatures. Indeed, highly accurate predictions may be obtained despite an FDR as high as 50% (Figure <figr fid="F2">2</figr>), while in situations where many weak effects are present and statistical power is low, signatures can be unstable at an FDR as low as 2.5% (Figure <figr fid="F3">3</figr>). This analysis explains at least some of the difficulties with reproducing cancer gene expression signatures <abbrgrp>
					<abbr bid="B7">7</abbr>
					<abbr bid="B8">8</abbr>
				</abbrgrp> and possibly also the similar reproducibility problems of recent association studies in complex diseases <abbrgrp>
					<abbr bid="B16">16</abbr>
				</abbrgrp>. Moreover, it suggests that this lack of reproducibility need not be problematic.</p>
			<p>We have developed and validated a statistical hypothesis testing framework that for the first time provides false discovery rates control for signature discovery. In application to cancer gene expression, we have showed that reliable signature discovery is feasible with currently available sample sizes. Many important problems in bioinformatics are prediction problems and may benefit from reliable signature discovery. We therefore hope that our method will be of general interest.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<p>Signature stability is defined as the normalized expected overlap between two signatures <it>S, S' </it>derived from independent, replicate experimental data sets,</p>
			<p>
				<display-formula id="M3">
					<m:math name="1471-2105-10-38-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:mtext>Stability</m:mtext>
								<m:mo>=</m:mo>
								<m:msub>
									<m:mi mathvariant="double-struck">E</m:mi>
									<m:mrow>
										<m:mi>S</m:mi>
										<m:mo>,</m:mo>
										<m:msup>
											<m:mi>S</m:mi>
											<m:mo>&#8242;</m:mo>
										</m:msup>
									</m:mrow>
								</m:msub>
								<m:mrow>
									<m:mo>[</m:mo>
									<m:mrow>
										<m:mfrac>
											<m:mrow>
												<m:mrow>
													<m:mo>|</m:mo>
													<m:mrow>
														<m:mi>S</m:mi>
														<m:mo>&#8745;</m:mo>
														<m:msup>
															<m:mi>S</m:mi>
															<m:mo>&#8242;</m:mo>
														</m:msup>
													</m:mrow>
													<m:mo>|</m:mo>
												</m:mrow>
											</m:mrow>
											<m:mrow>
												<m:mi>max</m:mi>
												<m:mo>&#8289;</m:mo>
												<m:mo>{</m:mo>
												<m:mrow>
													<m:mo>|</m:mo>
													<m:mrow>
														<m:mi>S</m:mi>
														<m:mo>&#8746;</m:mo>
														<m:msup>
															<m:mi>S</m:mi>
															<m:mo>&#8242;</m:mo>
														</m:msup>
													</m:mrow>
													<m:mo>|</m:mo>
												</m:mrow>
												<m:mo>,</m:mo>
												<m:mn>1</m:mn>
												<m:mo>}</m:mo>
											</m:mrow>
										</m:mfrac>
									</m:mrow>
									<m:mo>]</m:mo>
								</m:mrow>
								<m:mo>,</m:mo>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaee4uamLaeeiDaqNaeeyyaeMaeeOyaiMaeeyAaKMaeeiBaWMaeeyAaKMaeeiDaqNaeeyEaKNaeyypa0Zefv3ySLgznfgDOjdaryqr1ngBPrginfgDObcv39gaiqaacqWFecFrdaWgaaWcbaGaem4uamLaeiilaWIafm4uamLbauaaaeqaaOWaamWaaKqbagaadaWcaaqaamaaemaabaGaem4uamLaeyykICSafm4uamLbauaaaiaawEa7caGLiWoaaeaacyGGTbqBcqGGHbqycqGG4baEcqGG7bWEdaabdaqaaiabdofatjabgQIiilqbdofatzaafaaacaGLhWUaayjcSdGaeiilaWIaeGymaeJaeiyFa0haaaGccaGLBbGaayzxaaGaeiilaWcaaa@6378@</m:annotation>
						</m:semantics>
					</m:math>
				</display-formula>
			</p>
			<p>where <inline-formula>
					<m:math name="1471-2105-10-38-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mi mathvariant="double-struck">E</m:mi>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWefv3ySLgznfgDOjdaryqr1ngBPrginfgDObcv39gaiqaacqWFecFraaa@37B5@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula> denotes statistical expectation. The stability is always between 0 (no expected overlap) and 1 (complete overlap).</p>
			<p>Simulations were performed with data drawn from two-class multivariate Gaussian distributions <it>f </it>(<it>x </it>| <it>y</it>) = <it>N </it>(<it>y&#956;</it>, &#931;) with equal class frequencies, covariance matrix &#931; independent of the class (phenotype) variable <it>y </it>and varying degrees of class separation to achieve different effect sizes. Results were averaged over 100 randomly selected Gaussian distributions. for each parameter setting tested. We measure the effect size of the resulting prediction problem by the expected SVM accuracy. Here the accuracy was computed exactly for each SVM from the data density: for any given <it>&#956; </it>and &#931;, a separating hyperplane with normal vector <it>w </it>has classification accuracy</p>
			<p>
				<display-formula>
					<m:math name="1471-2105-10-38-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:mtext>Acc</m:mtext>
								<m:mo stretchy="false">(</m:mo>
								<m:mi>w</m:mi>
								<m:mo stretchy="false">)</m:mo>
								<m:mo>=</m:mo>
								<m:mfrac>
									<m:mn>1</m:mn>
									<m:mn>2</m:mn>
								</m:mfrac>
								<m:mrow>
									<m:mo>[</m:mo>
									<m:mrow>
										<m:mn>1</m:mn>
										<m:mo>+</m:mo>
										<m:mtext>erf</m:mtext>
										<m:mrow>
											<m:mo>(</m:mo>
											<m:mrow>
												<m:mfrac>
													<m:mrow>
														<m:msup>
															<m:mi>w</m:mi>
															<m:mi>T</m:mi>
														</m:msup>
														<m:mi>&#956;</m:mi>
													</m:mrow>
													<m:mrow>
														<m:msqrt>
															<m:mrow>
																<m:mn>2</m:mn>
																<m:msup>
																	<m:mi>w</m:mi>
																	<m:mi>T</m:mi>
																</m:msup>
																<m:mi>&#931;</m:mi>
																<m:mi>w</m:mi>
															</m:mrow>
														</m:msqrt>
													</m:mrow>
												</m:mfrac>
											</m:mrow>
											<m:mo>)</m:mo>
										</m:mrow>
									</m:mrow>
									<m:mo>]</m:mo>
								</m:mrow>
								<m:mo>,</m:mo>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeeyqaeKaee4yamMaee4yamMaeiikaGIaem4DaCNaeiykaKIaeyypa0tcfa4aaSaaaeaacqaIXaqmaeaacqaIYaGmaaGcdaWadaqaaiabigdaXiabgUcaRiabbwgaLjabbkhaYjabbAgaMnaabmaajuaGbaWaaSaaaeaacqWG3bWDdaahaaqabeaacqWGubavaaGaeqiVd0gabaWaaOaaaeaacqaIYaGmcqWG3bWDdaahaaqabeaacqWGubavaaGaeu4OdmLaem4DaChabeaaaaaakiaawIcacaGLPaaaaiaawUfacaGLDbaacqGGSaalaaa@4CAA@</m:annotation>
						</m:semantics>
					</m:math>
				</display-formula>
			</p>
			<p>where <inline-formula>
					<m:math name="1471-2105-10-38-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:mtext>erf</m:mtext>
								<m:mo stretchy="false">(</m:mo>
								<m:mi>x</m:mi>
								<m:mo stretchy="false">)</m:mo>
								<m:mo>=</m:mo>
								<m:mn>2</m:mn>
								<m:msup>
									<m:mi>&#960;</m:mi>
									<m:mrow>
										<m:mo>&#8722;</m:mo>
										<m:mn>1</m:mn>
										<m:mo>/</m:mo>
										<m:mn>2</m:mn>
									</m:mrow>
								</m:msup>
								<m:mstyle displaystyle="true">
									<m:mrow>
										<m:msubsup>
											<m:mo>&#8747;</m:mo>
											<m:mn>0</m:mn>
											<m:mi>x</m:mi>
										</m:msubsup>
										<m:mrow>
											<m:msup>
												<m:mi>e</m:mi>
												<m:mrow>
													<m:mo>&#8722;</m:mo>
													<m:msup>
														<m:mi>t</m:mi>
														<m:mn>2</m:mn>
													</m:msup>
												</m:mrow>
											</m:msup>
										</m:mrow>
									</m:mrow>
								</m:mstyle>
								<m:mi>d</m:mi>
								<m:mi>t</m:mi>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeeyzauMaeeOCaiNaeeOzayMaeiikaGIaemiEaGNaeiykaKIaeyypa0JaeGOmaiJaeqiWda3aaWbaaSqabeaacqGHsislcqaIXaqmcqGGVaWlcqaIYaGmaaGcdaWdXaqaaiabdwgaLnaaCaaaleqabaGaeyOeI0IaemiDaq3aaWbaaWqabeaacqaIYaGmaaaaaaWcbaGaeGimaadabaGaemiEaGhaniabgUIiYdGccqWGKbazcqWG0baDaaa@470A@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula> is the error function.</p>
			<p>To evaluate signature error rates, we used the fact that for <it>f </it>(<it>x </it>| <it>y</it>) = <it>N </it>(<it>y&#956;</it>, &#931;), the optimal separating hyperplane has normal vector <it>w</it>* = &#931;<sup>-1 </sup>
				<it>&#956;</it>, and so the optimal set <it>S</it>* can be determined as the nonzero components <inline-formula>
					<m:math name="1471-2105-10-38-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msubsup>
									<m:mi>w</m:mi>
									<m:mi>i</m:mi>
									<m:mo>&#8727;</m:mo>
								</m:msubsup>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aa0baaSqaaiabdMgaPbqaaiabgEHiQaaaaaa@2FC3@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula> of this vector.</p>
			<p>For hypothesis testing, we used a parametric bootstrap with <it>B </it>= 50 repetitions, fitting a Gaussian distribution <it>N </it>(<it>&#956;</it>
				<sub>
					<it>i</it>
				</sub>, <it>&#963;</it>
				<sub>
					<it>i</it>
				</sub>) to the observed <inline-formula>
					<m:math name="1471-2105-10-38-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msubsup>
									<m:mi>w</m:mi>
									<m:mrow>
										<m:mn>1</m:mn>
										<m:mi>g</m:mi>
									</m:mrow>
									<m:mo>&#8727;</m:mo>
								</m:msubsup>
								<m:mo>,</m:mo>
								<m:mn>...</m:mn>
								<m:msubsup>
									<m:mi>w</m:mi>
									<m:mrow>
										<m:mi>B</m:mi>
										<m:mi>g</m:mi>
									</m:mrow>
									<m:mo>&#8727;</m:mo>
								</m:msubsup>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aa0baaSqaaiabigdaXiabdEgaNbqaaiabgEHiQaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabdEha3naaDaaaleaacqWGcbGqcqWGNbWzaeaacqGHxiIkaaaaaa@393C@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula> prior to computing two-sided <it>p</it>-values. In preliminary studies, the difference between this method and a nonparametric bootstrap with <it>B </it>= 1000 was negligible, while the parametric version is computationally more efficient since a much smaller <it>B </it>can be used. The SVM <abbrgrp>
					<abbr bid="B18">18</abbr>
				</abbrgrp>, KFD <abbrgrp>
					<abbr bid="B19">19</abbr>
				</abbrgrp> and VW <abbrgrp>
					<abbr bid="B2">2</abbr>
				</abbrgrp> methods were implemented as previously described. In all experiments, the SVM <it>C</it>-parameter and the KFD regularization parameter were set to 1. Recursive Feature Elimination (RFE) was performed as previously described <abbrgrp>
					<abbr bid="B20">20</abbr>
				</abbrgrp>, using the radius-margin bound <abbrgrp>
					<abbr bid="B26">26</abbr>
				</abbrgrp> as accuracy measure and removing 20% of the genes in each iteration.</p>
			<p>Microarray data sets <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
					<abbr bid="B3">3</abbr>
					<abbr bid="B4">4</abbr>
					<abbr bid="B5">5</abbr>
				</abbrgrp> were preprocessed by removing genes displaying small variation, keeping the 5,000 most variable genes in each case, except for the data sets by van't Veer <it>et al</it>. <abbrgrp>
					<abbr bid="B4">4</abbr>
				</abbrgrp> and Alon <it>et al</it>. <abbrgrp>
					<abbr bid="B1">1</abbr>
				</abbrgrp> which were preprocessed in a similar fashion by the original authors. Genes were normalized to zero mean and unit standard deviation prior to SVM training, following standard practise for kernel methods. Independent test data sets <abbrgrp>
					<abbr bid="B27">27</abbr>
					<abbr bid="B28">28</abbr>
					<abbr bid="B29">29</abbr>
				</abbrgrp> were normalized in the same fashion. No other preprocessing was done prior to classifier training or testing.</p>
			<p>Since many data sets were had low minor class frequencies are (Table <tblr tid="T1">1</tblr>), performance was evaluated with the balanced accuracy measure</p>
			<p>
				<display-formula>
					<m:math name="1471-2105-10-38-i19" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msub>
									<m:mrow>
										<m:mtext>Acc</m:mtext>
									</m:mrow>
									<m:mrow>
										<m:mtext>balanced</m:mtext>
									</m:mrow>
								</m:msub>
								<m:mo>=</m:mo>
								<m:mfrac>
									<m:mrow>
										<m:msub>
											<m:mrow>
												<m:mtext>Acc</m:mtext>
											</m:mrow>
											<m:mo>+</m:mo>
										</m:msub>
										<m:mo>+</m:mo>
										<m:msub>
											<m:mrow>
												<m:mtext>Acc</m:mtext>
											</m:mrow>
											<m:mo>&#8722;</m:mo>
										</m:msub>
									</m:mrow>
									<m:mn>2</m:mn>
								</m:mfrac>
								<m:mo>,</m:mo>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeeyqaeKaee4yamMaee4yam2aaSbaaSqaaiabbkgaIjabbggaHjabbYgaSjabbggaHjabb6gaUjabbogaJjabbwgaLjabbsgaKbqabaGccqGH9aqpjuaGdaWcaaqaaiabbgeabjabbogaJjabbogaJnaaBaaabaGaey4kaScabeaacqGHRaWkcqqGbbqqcqqGJbWycqqGJbWydaWgaaqaaiabgkHiTaqabaaabaGaeGOmaidaaOGaeiilaWcaaa@4841@</m:annotation>
						</m:semantics>
					</m:math>
				</display-formula>
			</p>
			<p>where Acc<sub>+ </sub>and Acc<sub>- </sub>are the accuracy measures for each class. Except for the independent test sets, these were measured by cross-validation, where in each round a randomized set consisting of 2/3 of the samples was used for training, and the remaining 1/3 was used for testing. Splits were balanced so that class frequencies were equal between training/test data. Mean and standard deviation of the balanced test error over 50 cross-validation repetitions are reported.</p>
		</sec>
		<sec>
			<st>
				<p>Authors' contributions</p>
			</st>
			<p>RN, JB and JT designed research; RN performed research; RN and JT wrote the paper.</p>
		</sec>
	</bdy>
   <bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>The authors would like to thank Drs. Jos&#233; M. Pe&#241;a and Albert Compte for helpful discussions. This work was supported by grants from the Ph.D. Programme in Medical Bioinformatics at Karolinska Institutet (RN), Clinical Gene Networks AB, Vinnova (JT), Swedish Research Council (JT) and Link&#246;ping University.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays</p>
				</title>
				<aug>
					<au>
						<snm>Alon</snm>
						<fnm>U</fnm>
					</au>
					<au>
						<snm>Barkai</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Notterman</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Gish</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Ybarra</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Mack</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Levine</snm>
						<fnm>AJ</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci U S A</source>
				<pubdate>1999</pubdate>
				<volume>96</volume>
				<issue>12</issue>
				<fpage>6745</fpage>
				<lpage>6750</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">10359783</pubid>
						<pubid idtype="pmcid">21986</pubid>
						<pubid idtype="doi">10.1073/pnas.96.12.6745</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Molecular classifiation of cancer: class discovery and class prediction by gene expression monitoring</p>
				</title>
				<aug>
					<au>
						<snm>Golub</snm>
						<fnm>TR</fnm>
					</au>
					<au>
						<snm>Slonim</snm>
						<fnm>DK</fnm>
					</au>
					<au>
						<snm>Tamayo</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Huard</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Gaasenbeek</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Mesirov</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Coller</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Loh</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Downing</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Caligiuri</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Bloomfield</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Lander</snm>
						<fnm>ES</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1999</pubdate>
				<volume>286</volume>
				<issue>5439</issue>
				<fpage>531</fpage>
				<lpage>537</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">10521349</pubid>
						<pubid idtype="doi">10.1126/science.286.5439.531</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Gene expression correlates of clinical prostate cancer behavior</p>
				</title>
				<aug>
					<au>
						<snm>Singh</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Febbo</snm>
						<fnm>PG</fnm>
					</au>
					<au>
						<snm>Ross</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Jackson</snm>
						<fnm>DG</fnm>
					</au>
					<au>
						<snm>Manola</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ladd</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Tamayo</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Renshaw</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>D'Amico</snm>
						<fnm>AV</fnm>
					</au>
					<au>
						<snm>Richie</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Lander</snm>
						<fnm>ES</fnm>
					</au>
					<au>
						<snm>Loda</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kanto3</snm>
						<fnm>PW</fnm>
					</au>
					<au>
						<snm>Golub</snm>
						<fnm>TR</fnm>
					</au>
					<au>
						<snm>Sellers</snm>
						<fnm>WR</fnm>
					</au>
				</aug>
				<source>Cancer Cell</source>
				<pubdate>2002</pubdate>
				<volume>1</volume>
				<issue>2</issue>
				<fpage>203</fpage>
				<lpage>209</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">12086878</pubid>
						<pubid idtype="doi">10.1016/S1535-6108(02)00030-2</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Gene expression profiling predicts clinical outcome of breast cancer</p>
				</title>
				<aug>
					<au>
						<snm>van't Veer</snm>
						<fnm>LJ</fnm>
					</au>
					<au>
						<snm>Dai</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Vijver</snm>
						<mnm>Van De</mnm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>He</snm>
						<fnm>YD</fnm>
					</au>
					<au>
						<snm>Hart</snm>
						<fnm>AAM</fnm>
					</au>
					<au>
						<snm>Mao</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Peterse</snm>
						<fnm>HL</fnm>
					</au>
					<au>
						<snm>Kooy</snm>
						<mnm>Van Der</mnm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Marton</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Witteveen</snm>
						<fnm>AT</fnm>
					</au>
					<au>
						<snm>Schreiber</snm>
						<fnm>GJ</fnm>
					</au>
					<au>
						<snm>Kerkhoven</snm>
						<fnm>RM</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Linsley</snm>
						<fnm>PS</fnm>
					</au>
					<au>
						<snm>Bernards</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Friend</snm>
						<fnm>SH</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>415</volume>
				<issue>6871</issue>
				<fpage>530</fpage>
				<lpage>536</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">11823860</pubid>
						<pubid idtype="doi">10.1038/415530a</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer</p>
				</title>
				<aug>
					<au>
						<snm>Wang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Klijn</snm>
						<fnm>JG</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Sieuwerts</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Look</snm>
						<fnm>MP</fnm>
					</au>
					<au>
						<snm>Yang</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Talantov</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Timmermans</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Meijer-van Gelder</snm>
						<fnm>ME</fnm>
					</au>
					<au>
						<snm>Yu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Jatkoe</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Berns</snm>
						<fnm>EM</fnm>
					</au>
					<au>
						<snm>Atkins</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Foekens</snm>
						<fnm>JA</fnm>
					</au>
				</aug>
				<source>Lancet</source>
				<pubdate>2005</pubdate>
				<volume>365</volume>
				<issue>9460</issue>
				<fpage>671</fpage>
				<lpage>679</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15721472</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Gene signature evaluation as a prognostic tool: challenges in the design of the MINDACT trial</p>
				</title>
				<aug>
					<au>
						<snm>Bogaerts</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Cardoso</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Buyse</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Braga</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Loi</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Harrison</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Bines</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Mook</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Decker</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Ravdin</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Therasse</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Rutgers</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>van't Veer</snm>
						<fnm>LJ</fnm>
					</au>
					<au>
						<snm>Piccart</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Nat Clin Pract Oncol</source>
				<pubdate>2006</pubdate>
				<volume>3</volume>
				<issue>10</issue>
				<fpage>540</fpage>
				<lpage>551</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">17019432</pubid>
						<pubid idtype="doi">10.1038/ncponc0591</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Prediction of cancer outcome with microarrays: a multiple random validation strategy</p>
				</title>
				<aug>
					<au>
						<snm>Michiels</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Koscielny</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Hill</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Lancet</source>
				<pubdate>2005</pubdate>
				<volume>365</volume>
				<issue>9458</issue>
				<fpage>488</fpage>
				<lpage>492</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15705458</pubid>
						<pubid idtype="doi">10.1016/S0140-6736(05)17866-0</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Outcome signature genes in breast cancer: is there a unique set?</p>
				</title>
				<aug>
					<au>
						<snm>Ein-Dor</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Kela</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Getz</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Givol</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Domany</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>2</issue>
				<fpage>171</fpage>
				<lpage>178</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15308542</pubid>
						<pubid idtype="doi">10.1093/bioinformatics/bth469</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer</p>
				</title>
				<aug>
					<au>
						<snm>Ein-Dor</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Zuk</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Domany</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci U S A</source>
				<pubdate>2006</pubdate>
				<volume>103</volume>
				<issue>15</issue>
				<fpage>5923</fpage>
				<lpage>5928</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">16585533</pubid>
						<pubid idtype="pmcid">1458674</pubid>
						<pubid idtype="doi">10.1073/pnas.0601231103</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Microarrays and molecular research: noise discovery?</p>
				</title>
				<aug>
					<au>
						<snm>Ioannidis</snm>
						<fnm>JP</fnm>
					</au>
				</aug>
				<source>Lancet</source>
				<pubdate>2005</pubdate>
				<volume>365</volume>
				<issue>9458</issue>
				<fpage>454</fpage>
				<lpage>455</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15705441</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<aug>
					<au>
						<snm>Devroye</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Gy&#246;rfi</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Lugosi</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>A probabilistic theory of pattern recognition. Applications of mathematics</source>
				<publisher>New York: Springer-Verlag</publisher>
				<pubdate>1996</pubdate>
			</bibl>
			<bibl id="B12">
				<title>
					<p>An introduction to variable and feature selection</p>
				</title>
				<aug>
					<au>
						<snm>Guyon</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Elisseeff</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Journ Mach Learn Res</source>
				<pubdate>2003</pubdate>
				<volume>3</volume>
				<fpage>1157</fpage>
				<lpage>1182</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1162/153244303322753616</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Consistent feature selection for pattern recognition in polyomial time</p>
				</title>
				<aug>
					<au>
						<snm>Nilsson</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Pe&#241;a</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Bj&#246;rkegren</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Tegn&#233;r</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Jour of Mach Learn Res</source>
				<pubdate>2007</pubdate>
				<volume>8</volume>
				<fpage>589</fpage>
				<lpage>612</lpage>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Detecting multivariate differentially expressed genes</p>
				</title>
				<aug>
					<au>
						<snm>Nilsson</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Pe&#241;a</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Bj&#246;rkegren</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Tegn&#233;r</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2007</pubdate>
				<volume>8</volume>
				<fpage>150</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">17490475</pubid>
						<pubid idtype="pmcid">1885271</pubid>
						<pubid idtype="doi">10.1186/1471-2105-8-150</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Controlling the false discovery rate: a practical and powerful approach to multiple testing</p>
				</title>
				<aug>
					<au>
						<snm>Benjamini</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Hochberg</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>J R Statist Soc B</source>
				<pubdate>1995</pubdate>
				<volume>57</volume>
				<fpage>289</fpage>
				<lpage>300</lpage>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Genome-wide association studies provide new insights into type 2 diabetes aetiology</p>
				</title>
				<aug>
					<au>
						<snm>Frayling</snm>
						<fnm>TM</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2007</pubdate>
				<volume>8</volume>
				<issue>9</issue>
				<fpage>657</fpage>
				<lpage>662</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">17703236</pubid>
						<pubid idtype="doi">10.1038/nrg2178</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>An empirical Bayes approach to inferring large-scale gene association networks</p>
				</title>
				<aug>
					<au>
						<snm>Sch&#228;fer</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Strimmer</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>6</issue>
				<fpage>754</fpage>
				<lpage>764</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15479708</pubid>
						<pubid idtype="doi">10.1093/bioinformatics/bti062</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Support-Vector Networks</p>
				</title>
				<aug>
					<au>
						<snm>Cortes</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Vapnik</snm>
						<fnm>V</fnm>
					</au>
				</aug>
				<source>Mach Learn</source>
				<pubdate>1995</pubdate>
				<volume>20</volume>
				<issue>3</issue>
				<fpage>273</fpage>
				<lpage>297</lpage>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Fisher Discriminant Analysis with Kernels</p>
				</title>
				<aug>
					<au>
						<snm>Mika</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ratsch</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Weston</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Scholkopf</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Muller</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Proceedings of IEEE Neural Networks for Signal Processing Workshop</source>
				<editor>Hen YH, Larsen J, Wilson E</editor>
				<pubdate>1999</pubdate>
				<fpage>41</fpage>
				<lpage>48</lpage>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Gene Selection for Cancer Classification using Support Vector Machines</p>
				</title>
				<aug>
					<au>
						<snm>Guyon</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Weston</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Barnhill</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Vapnik</snm>
						<fnm>V</fnm>
					</au>
				</aug>
				<source>Mach Learn</source>
				<pubdate>2002</pubdate>
				<volume>46</volume>
				<fpage>389</fpage>
				<lpage>422</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1023/A:1012487302797</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Evaluating feature selection for SVMs in high dimensions</p>
				</title>
				<aug>
					<au>
						<snm>Nilsson</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Pe&#241;a</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Bj&#246;rkegren</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Tegn&#233;r</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Proceedings of the 17th European Conference on Machine Learning</source>
				<pubdate>2006</pubdate>
				<fpage>719</fpage>
				<lpage>726</lpage>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Ridge regression: biased estimation of nonorthogonal problems</p>
				</title>
				<aug>
					<au>
						<snm>Heorl</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Kennard</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Technometrics</source>
				<pubdate>1970</pubdate>
				<volume>12</volume>
				<fpage>69</fpage>
				<lpage>82</lpage>
				<xrefbib>
					<pubid idtype="doi">10.2307/1267352</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Sparse Bayesian learning and the relevance vector machine</p>
				</title>
				<aug>
					<au>
						<snm>Tipping</snm>
						<fnm>ME</fnm>
					</au>
				</aug>
				<source>Journ Mach Learn Res</source>
				<pubdate>2001</pubdate>
				<volume>1</volume>
				<fpage>211</fpage>
				<lpage>244</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1162/15324430152748236</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>From LASSO regression to feature vector machine</p>
				</title>
				<aug>
					<au>
						<snm>Li</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Yang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Xing</snm>
						<fnm>EP</fnm>
					</au>
				</aug>
				<source>Advances in Neural Information Processing Systems 18</source>
				<publisher>MIT Press, Cambridge</publisher>
				<editor>Weiss Y</editor>
				<pubdate>2005</pubdate>
				<fpage>411</fpage>
				<lpage>418</lpage>
			</bibl>
			<bibl id="B25">
				<aug>
					<au>
						<snm>Efron</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Tibshirani</snm>
						<fnm>RJ</fnm>
					</au>
				</aug>
				<source>An introduction to the bootstrap</source>
				<publisher>Chapman &amp; Hall, Inc. New York</publisher>
				<pubdate>1993</pubdate>
			</bibl>
			<bibl id="B26">
				<aug>
					<au>
						<snm>Vapnik</snm>
						<fnm>VN</fnm>
					</au>
				</aug>
				<source>Statistical Learning Theory</source>
				<publisher>John Wiley and Sons, Inc. New Jersey</publisher>
				<pubdate>1998</pubdate>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Gene expression alterations in prostate cancer predicting tumor aggression and preceding development of malignancy</p>
				</title>
				<aug>
					<au>
						<snm>Yu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Landsittel</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Jing</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Nelson</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ren</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>McDonald</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Thomas</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Dhir</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Finkelstein</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Michalopoulos</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Becich</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Luo</snm>
						<fnm>JH</fnm>
					</au>
				</aug>
				<source>J Clin Oncol</source>
				<pubdate>2004</pubdate>
				<volume>22</volume>
				<issue>14</issue>
				<fpage>2790</fpage>
				<lpage>2799</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">15254046</pubid>
						<pubid idtype="doi">10.1200/JCO.2004.05.158</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>New data on robustness of gene expression signatures in leukemia: comparison of three distinct total RNA preparation procedures</p>
				</title>
				<aug>
					<au>
						<snm>Campo Dell'Orto</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Zangrando</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Trentin</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>te Kronnie</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Basso</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kohlmann</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>BMC Genomics</source>
				<pubdate>2007</pubdate>
				<volume>8</volume>
				<fpage>188</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">17587440</pubid>
						<pubid idtype="pmcid">1925098</pubid>
						<pubid idtype="doi">10.1186/1471-2164-8-188</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Available from the NCBI Gene Expression Omnibus, accession GSE10960</p>
				</title>
				<url>http://www.ncbi.nlm.nih.gov/geo/</url>
			</bibl>
		</refgrp>
	</bm>
</art>

