<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2004-5-8-r59</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Method</dochead>
		<bibl>
			<title>
				<p>Statistical modeling for selecting housekeeper genes</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Szabo</snm>
					<fnm>Aniko</fnm>
					<insr iid="I1"/>
					<email>aniko.szabo@hci.utah.edu</email>
				</au>
				<au id="A2">
					<snm>Perou</snm>
					<mi>M</mi>
					<fnm>Charles</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A3">
					<snm>Karaca</snm>
					<fnm>Mehmet</fnm>
					<insr iid="I2"/>
				</au>
				<au id="A4">
					<snm>Perreard</snm>
					<fnm>Laurent</fnm>
					<insr iid="I3"/>
				</au>
				<au id="A5">
					<snm>Quackenbush</snm>
					<mi>F</mi>
					<fnm>John</fnm>
					<insr iid="I4"/>
				</au>
				<au id="A6">
					<snm>Bernard</snm>
					<mi>S</mi>
					<fnm>Philip</fnm>
					<insr iid="I3"/>
					<insr iid="I4"/>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Department of Oncological Sciences, Huntsman Cancer Institute, Salt Lake City, UT 84112, USA</p>
				</ins>
				<ins id="I2">
					<p>Lineberger Comprehensive Cancer Center and Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA</p>
				</ins>
				<ins id="I3">
					<p>ARUP Laboratories Inc., Salt Lake City, UT 84108, USA</p>
				</ins>
				<ins id="I4">
					<p>Department of Pathology, University of Utah, Salt Lake City, UT 84112, USA</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2004</pubdate>
			<volume>5</volume>
			<issue>8</issue>
			<fpage>R59</fpage>
			<url>http://genomebiology.com/2004/5/8/R59</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15287981</pubid><pubid idtype="doi">10.1186/gb-2004-5-8-r59</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>16</day>
					<month>1</month>
					<year>2004</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>13</day>
					<month>5</month>
					<year>2004</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>11</day>
					<month>6</month>
					<year>2004</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>29</day>
					<month>7</month>
					<year>2004</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2004</year>
			<collab>Szabo et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
		</cpyrt>
		<shorttitle>
			<p>Statistical modeling for selecting housekeeper genes</p>
		</shorttitle>
		<shortabs>
			<p>Statistical models are presented for selecting the best housekeepers to normalize quantitative data within a given tissue type and across different types of tissue samples.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>There is a need for statistical methods to identify genes that have minimal variation in expression across a variety of experimental conditions. These 'housekeeper' genes are widely employed as controls for quantification of test genes using gel analysis and real-time RT-PCR. Using real-time quantitative RT-PCR, we analyzed 80 primary breast tumors for variation in expression of six putative housekeeper genes (<it>MRPL19 </it>(mitochondrial ribosomal protein L19), <it>PSMC4 </it>(proteasome (prosome, macropain) 26S subunit, ATPase, 4), <it>SF3A1 </it>(splicing factor 3a, subunit 1, 120 kDa), <it>PUM1 </it>(pumilio homolog 1 (<it>Drosophila</it>)), <it>ACTB </it>(actin, beta) and <it>GAPD </it>(glyceraldehyde-3-phosphate dehydrogenase)). We present appropriate models for selecting the best housekeepers to normalize quantitative data within a given tissue type (for example, breast cancer) and across different types of tissue samples.</p>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010002">Bioinformatics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010013">Methods</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>Genes that exhibit minimal variation in messenger RNA (mRNA) quantity across a variety of cell types and biological conditions provide valuable controls for relative quantification. Normalizing quantitative data with housekeeper genes has many applications, from identifying genes regulated during embryogenesis to developing new cancer diagnostics. Although finding biological significance in gene-expression data can rely heavily on the performance of the housekeeper genes, there is a paucity of information on testing these genes for their suitability for this role.</p>
			<p>The copy number of a housekeeper gene should be proportional to the amount of poly(A) RNA present in the sample and this proportion should be maintained across a variety of experimental conditions. As nucleic acids show high absorbance at 260 nm (A<sub>260</sub>), spectrophotometers provide approximate amounts of total DNA/RNA present in a sample. Using absorbance methods alone, however, gives no information about the type of nucleic acid (for example DNA versus RNA) or contributions from different nucleic acid fractions (for example rRNA versus mRNA). It is assumed that mRNA comprises approximately 1-3% of the total RNA. However, this contribution may change depending on the extraction method used. For instance, column extraction methods provide better exclusion of ribosomal RNA than solvent extraction methods <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. By combining capillary electrophoresis with absorbance, it is possible to accurately quantify these different fractions <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
			<p>Traditionally, housekeepers have been used in Northern blot analysis to represent the amount of mRNA in the sample and to control for sample loading, blot transfer and probe hybridization. Highly expressed genes serving fundamental roles in the cell are commonly used for this purpose but may not be optimal under certain experimental conditions <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. For example, the sensitivity and accuracy of northern blot analysis with densitometry may be decreased using a highly expressed housekeeper gene that can saturate the autoradiographic signal <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. To resolve this problem and compensate for limitations in dynamic range, control genes may be chosen to have a level of gene expression similar to the gene(s) of interest (that is, the test genes).</p>
			<p>Microarrays are more practical for genome-wide expression analysis than northern blots <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. With cDNA microarrays, a common reference sample is usually used to compare the expression of each gene across many experimental sample(s) <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. Because each gene in the experimental sample is directly compared to the same gene in the common reference, housekeeper genes are not necessary for normalization. Microarrays are commonly applied to finding genes with differential expression across experimental conditions, however the data may also be used to identify stably expressed genes that can serve as important controls for northern blot analysis, ribonuclease protection assays and quantitative reverse transcription PCR (RT-PCR). In turn, these other quantitative methods are often used to verify differentially expressed genes identified by microarray <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>.</p>
			<p>Housekeeper genes are often adopted from the literature and used across a variety of experimental conditions, some of which may induce differences in their expression. If unrecognized, unexpected changes in housekeeper expression could result in erroneous conclusions about real biological effects such as responses to drugs. In addition, this type of change would be difficult to detect because most experiments only include a single housekeeper gene. It is difficult to determine whether a given gene has the constitutive property of a housekeeper when the true amount of mRNA in a sample is unknown. As a way round this dilemma, Vandesompele <it>et al</it>. postulate that gene pairs that have stable expression patterns relative to each other are proper control genes <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. An alternative method for quantitative analysis of RT-PCR data that does not require housekeeper genes for normalization is to use global pattern recognition (GPR). For instance, Akilesh <it>et al</it>. used a GPR algorithm to search for eligible normalizing genes within an assay plate and then used those genes as controls to identify differentially expressed genes <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
			<p>Although relative quantification using housekeeper genes is a practical method of estimating the expression level of a test gene, the transcript amount in the sample is a summation and the method does not consider transcript differences on a cell-to-cell basis. Fluorescence <it>in situ </it>hybridization (FISH) is clinically used to determine absolute DNA copy number (for example, <it>HER2 </it>amplification) in a cell, but these methods still average the copy number after counting many cells and the technique is expensive and laborious <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <it>In situ </it>methods for detecting RNA transcripts have been developed but the assays are semiquantitative and subjective <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
			<p>In the work presented here, we applied several models to selecting the best housekeeper genes for breast cancer and give algorithms that can be generalized to find housekeeper genes that are appropriate for normalizing quantitative data within and between tissue types.</p>
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st>
			<sec>
				<st>
					<p>One tissue type</p>
				</st>
				<p>The genes <it>MRPL19</it>, <it>PSMC4</it>, <it>SF3A1</it>, <it>PUM1</it>, <it>ACTB </it>and <it>GAPD </it>were analyzed by real-time quantitative RT-PCR. Starting copy numbers for the six candidate housekeeping genes were measured across 80 primary breast tumor samples. The data are available as Additional data file 1 with the online version of this article. Plots of the raw and log-scaled expression levels (all logarithms in this paper are natural base (e) logarithms) are shown in Figure <figr fid="F1">1</figr>. The breast tumor samples are ordered according to the mean of the log-expression levels of all the genes. It is evident from the plot that for the raw data the variability of within-sample measurements increases with the mean expression, whereas the variability stays approximately the same for all the samples with the log-transformation. In addition, the log-transformation allows us to model fold changes in expression levels in an additive way.</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Relative levels of expression determined by real-time quantitative RT-PCR are shown for 6 housekeeper genes in 80 breast tumors</p>
					</caption>
					<text>
						<p>Relative levels of expression determined by real-time quantitative RT-PCR are shown for 6 housekeeper genes in 80 breast tumors. The top panel displays the raw data and the bottom panel displays a log-scale of the data.</p>
					</text>
					<graphic file="gb-2004-5-8-r59-1"/>
				</fig>
				<p>To select the best housekeepers for normalizing data across a single tissue type, we tested three variations of a model (Model 1, a-c) with real-time quantitative RT-PCR data generated from primary breast samples (see Materials and methods for details).</p>
				<sec>
					<st>
						<p>Model 1a</p>
					</st>
					<p>We model the expression <it>y</it><sub><it>ij </it></sub>of gene <it>j </it>in sample <it>I </it>by</p>
					<p>log <it>y</it><sub><it>ij </it></sub>= <it>&#956; </it>+ <it>T</it><sub><it>i </it></sub>+ <it>G</it><sub><it>j </it></sub>+ <it>&#949;</it><sub><it>ij</it></sub>, where <it>&#949;</it><sub><it>ij </it></sub>~ <it>N</it>(0,<it>&#963;</it><graphic file="gb-2004-5-8-r59-i1.gif"/>)</p>
					<p>where <it>&#956; </it>is the overall mean (log-) expression, <it>T</it><sub><it>i </it></sub>is the difference of the <it>i</it>th tissue sample from the overall average and <it>G</it><sub><it>j </it></sub>is the difference of the <it>j</it>th gene from the overall average. The key feature of this model that makes it different from a traditional ANOVA model is that it allows heteroscedastic errors to account for different variability in the genes <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. The variability around the gene-specific mean log-expression <it>&#956; </it>+ <it>T</it><sub><it>i </it></sub>+ <it>G</it><sub><it>j </it></sub>is quantified by the error standard deviation <it>&#963;</it><sub><it>j</it></sub>. The Bayesian information criterion (BIC) was used to avoid overfitting the data <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Model 1a had the best BIC value and was selected from a range of competing models that included a method with equal error variances (Model 1b in Materials and methods) and a more complex method with correlated errors (Model 1c in Materials and methods).</p>
					<p>Using Model 1a, standard deviations were determined to select the best control genes for breast cancer. Table <tblr tid="T1">1</tblr> shows that <it>MRPL19 </it>has the smallest variability across the breast cancer samples and would be the best choice for a single housekeeper control. Although some of the confidence intervals overlap, a direct comparison between the genes selected from the microarray (<it>MRPL19</it>, <it>PSMC4</it>, <it>PUM1</it>, <it>SF3A1</it>) to the classical housekeepers (<it>GAPD </it>and <it>ACTB</it>) shows significant difference (<it>p </it>= 0.0014).</p>
					<tbl id="T1">
						<title>
							<p>Table 1</p>
						</title>
						<caption>
							<p>Standard deviation estimates of log expression using Model 1a for selecting the single best housekeeper gene for breast cancer</p>
						</caption>
						<tblbdy cols="3">
							<r>
								<c ca="left">
									<p>Gene</p>
								</c>
								<c ca="center">
									<p>Estimated standard deviation</p>
								</c>
								<c ca="center">
									<p>95% confidence interval</p>
								</c>
							</r>
							<r>
								<c cspan="3">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>
										<it>MRPL19</it>
									</p>
								</c>
								<c ca="center">
									<p>0.218</p>
								</c>
								<c ca="center">
									<p>(0.168, 0.284)</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>
										<it>PUM1</it>
									</p>
								</c>
								<c ca="center">
									<p>0.265</p>
								</c>
								<c ca="center">
									<p>(0.215, 0.328)</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>
										<it>PSMC4</it>
									</p>
								</c>
								<c ca="center">
									<p>0.288</p>
								</c>
								<c ca="center">
									<p>(0.235, 0.352)</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>
										<it>SF3A1</it>
									</p>
								</c>
								<c ca="center">
									<p>0.393</p>
								</c>
								<c ca="center">
									<p>(0.327, 0.472)</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>
										<it>ACTB</it>
									</p>
								</c>
								<c ca="center">
									<p>0.448</p>
								</c>
								<c ca="center">
									<p>(0.376, 0.533)</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>
										<it>GAPD</it>
									</p>
								</c>
								<c ca="center">
									<p>0.519</p>
								</c>
								<c ca="center">
									<p>(0.439, 0.613)</p>
								</c>
							</r>
						</tblbdy>
					</tbl>
					<p>As the biological function of many genes is still unknown, it is difficult to predict how different experimental conditions may affect the expression of putative housekeeper genes. Thus, a safer approach is to use an average expression of several genes that show small variance across conditions. On the basis of the selected model, the estimate of the variance of the log-average of the expression of several genes can be calculated (see Materials and methods for details). Table <tblr tid="T2">2</tblr> shows the standard deviations of the log-average of the best gene set for each possible set size (that is, 1-6). These standard deviation values are approximately equal to the coefficient of variation in the original scale. From the estimates, the four-gene set of <it>PSMC4</it>, <it>MRPL19</it>, <it>PUM1 </it>and <it>SF3A1 </it>provides the lowest overall variability when choosing a combination of genes. However, this four-gene set is barely different from the three-gene combination of <it>MRPL19</it>, <it>PUM1 </it>and <it>PSMC4</it>, which in turn is far better than the best two-gene combination. For economy, and because <it>SF3A1 </it>had a relatively high individual variability compared to others in the set, our choice for the normalizing set is the geometric mean of the expressions of <it>MRPL19</it>, <it>PUM1 </it>and <it>PSMC4</it>.</p>
					<tbl id="T2">
						<title>
							<p>Table 2</p>
						</title>
						<caption>
							<p>Standard deviation estimates of log expression using Model 1a for selecting the best housekeeper gene(s) for breast cancer</p>
						</caption>
						<tblbdy cols="3">
							<r>
								<c ca="left">
									<p>Set size</p>
								</c>
								<c ca="left">
									<p>Gene set</p>
								</c>
								<c ca="center">
									<p>Standard deviation</p>
								</c>
							</r>
							<r>
								<c cspan="3">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>1</p>
								</c>
								<c ca="left">
									<p>
										<it>MRPL19</it>
									</p>
								</c>
								<c ca="center">
									<p>0.2182</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>2</p>
								</c>
								<c ca="left">
									<p><it>PUM1</it>, <it>MRPL19</it></p>
								</c>
								<c ca="center">
									<p>0.1718</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>3</p>
								</c>
								<c ca="left">
									<p><it>PSMC4</it>, <it>MRPL19</it>, <it>PUM1</it></p>
								</c>
								<c ca="center">
									<p>0.1494</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>4</p>
								</c>
								<c ca="left">
									<p><it>PSMC4</it>, <it>MRPL19</it>, <it>PUM1</it>, <it>SP3A1</it></p>
								</c>
								<c ca="center">
									<p>0.1490</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>5</p>
								</c>
								<c ca="left">
									<p><it>PSMC4</it>, <it>MRPL19</it>, <it>SF3A1</it>, <it>PUM1</it>, <it>ACTB</it></p>
								</c>
								<c ca="center">
									<p>0.1491</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>6</p>
								</c>
								<c ca="left">
									<p><it>PSMC4</it>, <it>MRPL19</it>, <it>SF3A1</it>, <it>PUM1</it>, <it>GAPD</it>, <it>ACTB</it></p>
								</c>
								<c ca="center">
									<p>0.1513</p>
								</c>
							</r>
						</tblbdy>
					</tbl>
					<p>These findings illustrate the importance of performing an unbiased and genome-wide search for housekeepers rather than relying on traditional housekeeper genes. We used microarray data to select genes with low variability in expression across breast tumors and cell lines. Because the quantitative differences between the microarray and RT-PCR platforms are relative, genes with low variability in expression across tumors by microarray should also show low variability in expression by RT-PCR. Although the quantitative data from microarray tends to have an overall smaller dynamic range compared to RT-PCR, this is primarily due to loss of information from genes expressed at low levels. Our microarray dataset was filtered to remove genes with signals near background noise.</p>
					<p>The result is very similar using Vandesompele <it>et al</it>.'s <it>M </it>value method, with only the positions of <it>PUM1 </it>and <it>PSMC4 </it>changing in stability rank. It should be noted that the <it>M-</it>value method does not order the two best genes (<it>MRPL19 </it>and <it>PSMC4</it>). Their best gene-set selection approach would suggest using the (log-scale) average of these two best genes as a control. Such a concordance is not surprising given the close relationship between the <it>M </it>value and our model using the variability of the average of several genes (see Materials and methods for details). A benefit of our approach is the ability to compare the variability of individual genes to that of an average of several genes.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Multiple tissue types</p>
				</st>
				<p>Gene(s) with minimal variation in expression across different cell types serve as good 'universal' housekeepers. A universal control may be a single gene or combination of genes. While the former should display both low variability within a given tissue type and consistent basal levels of expression across tissue types, the latter may comprise a gene set with individually different, but complementary, basal expression levels across tissue types.</p>
				<p>To test our models for selecting universal housekeepers, we used published data from Vandesompele <it>et al</it>. <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. They measured the expression level of 10 genes in neuroblastoma cell lines (NB), cultured normal fibroblasts (FIB), normal leukocytes (LEU) and cells from normal bone marrow (BM). In addition, normal tissues from pooled organs (breast, brain, fetal brain, heart, kidney, uterus, lung, trachea and small intestine) were also profiled. A plot of these housekeepers across the different tissues is shown in Figure <figr fid="F2">2</figr>. It is notable that a gene can have stable expression within a given tissue type but can change rank position compared to other housekeepers across tissues. For example, <it>GAPD </it>has relatively high expression in fibroblasts compared to other housekeepers but low expression in leukocytes. Thus, <it>GAPD </it>may be a good single housekeeper within certain tissue types but may not be an optimal universal housekeeper unless it is used within a complementary gene set.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Dataset from Vandesompele et al. [13] in log scale-showing expression levels of 10 putative housekeeper genes by sample and tissue type</p>
					</caption>
					<text>
						<p>Dataset from Vandesompele et al. [13] in log-scale showing expression levels of 10 putative housekeeper genes by sample and tissue type. Tissue types analyzed included normal bone marrow (BM), cultured normal fibroblasts (FIB), normal leukocytes (LEU), neuroblastoma cell lines (NB), and pooled normal tissue from breast, brain, fetal brain, heart, kidney, uterus, lung, trachea and small intestine (POOL).</p>
					</text>
					<graphic file="gb-2004-5-8-r59-2"/>
				</fig>
				<sec>
					<st>
						<p>Model 2</p>
					</st>
					<p>To compare the performance of housekeepers within and between different tissues, we made a Model 2 (see Materials and methods for further details) that models the expression of gene <it>j </it>in the <it>i</it>th sample of tissue-type <it>k </it>by</p>
					<p>log <it>y</it><sub><it>i(k)j </it></sub>= <it>&#956; </it>+ <it>C</it><sub><it>k </it></sub>+ <it>T</it><sub><it>i(k) </it></sub>+ <it>G</it><sub><it>j </it></sub>+ (<it>CG</it>)<sub><it>kj </it></sub>+ <it>&#949;</it><sub><it>i(k)j</it></sub>, where <it>&#949;</it><sub><it>i(k)j </it></sub>~ <it>N</it>(0,<graphic file="gb-2004-5-8-r59-i2.gif"/>)</p>
					<p>where <it>&#956; </it>denotes the overall mean (log-) expression, <it>C</it><sub><it>k </it></sub>is the difference of the <it>k</it>th tissue type from the overall average, <it>T</it><sub><it>i</it>(<it>k</it>) </sub>is the specific effect of the <it>i</it>th sample of tissue-type <it>k</it>, <it>G</it><sub><it>j </it></sub>is the difference of the <it>j</it>th gene from the overall average and (<it>CG</it>)<sub><it>kj </it></sub>is the tissue-type specific effect of gene <it>j</it>. Variability in calculation comes from two sources: the specific gene (<it>&#963;</it><sub><it>j</it></sub>) and the tissue-type (&#962;<sub><it>k</it></sub>). The estimates of these parameters are given in Table <tblr tid="T3">3</tblr>. The single gene with the overall lowest variability within each tissue type is <it>GAPD</it>, followed closely by <it>UBC </it>(ubiquitin C), <it>HPRT1 </it>(hypoxanthine phosphoribosyltransferase 1) and <it>YWHAZ </it>(tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, zeta polypeptide). This result correlates closely with Vandesompele <it>et al.</it>'s approach. That is, the top five genes have exactly the same order when we rank the genes within each tissue type according to their <it>M</it>-value. Here we assign a rank of 1.5 to the unordered best pair and then average the ranks to obtain an overall ordering of the genes.</p>
					<tbl id="T3">
						<title>
							<p>Table 3</p>
						</title>
						<caption>
							<p>Components of the standard deviation estimates of the log-expression of the data of Vandesompele <it>et al. </it>[13]</p>
						</caption>
						<tblbdy cols="10">
							<r>
								<c cspan="10" ca="left">
									<p>Standard deviation of genes (<it>&#963;</it><sub><it>j</it></sub>)</p>
								</c>
							</r>
							<r>
								<c cspan="10">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>
										<it>GAPD</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>UBC</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>HPRT1</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>YWHAZ</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>SDHA</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>RPL13A</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>TBP</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>HMBS</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>ACTB</it>
									</p>
								</c>
								<c ca="left">
									<p>
										<it>B2M</it>
									</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>0.211</p>
								</c>
								<c ca="left">
									<p>0.226</p>
								</c>
								<c ca="left">
									<p>0.227</p>
								</c>
								<c ca="left">
									<p>0.232</p>
								</c>
								<c ca="left">
									<p>0.255</p>
								</c>
								<c ca="left">
									<p>0.339</p>
								</c>
								<c ca="left">
									<p>0.339</p>
								</c>
								<c ca="left">
									<p>0.431</p>
								</c>
								<c ca="left">
									<p>0.460</p>
								</c>
								<c ca="left">
									<p>0.562</p>
								</c>
							</r>
							<r>
								<c cspan="10">
									<p/>
								</c>
							</r>
							<r>
								<c cspan="10" ca="left">
									<p>Tissue-type specific multipliers (<it>&#962;</it><sub><it>k</it></sub>)</p>
								</c>
							</r>
							<r>
								<c cspan="10">
									<hr/>
								</c>
							</r>
							<r>
								<c cspan="2" ca="left">
									<p>Bone marrow</p>
								</c>
								<c cspan="2" ca="left">
									<p>Cultured normal fibroblasts</p>
								</c>
								<c cspan="2" ca="left">
									<p>Neuroblastoma cell lines</p>
								</c>
								<c cspan="2" ca="left">
									<p>Normal leukocytes</p>
								</c>
								<c cspan="2" ca="left">
									<p>Pooled*</p>
								</c>
							</r>
							<r>
								<c cspan="2" ca="left">
									<p>1.000</p>
								</c>
								<c cspan="2" ca="left">
									<p>1.204</p>
								</c>
								<c cspan="2" ca="left">
									<p>1.582</p>
								</c>
								<c cspan="2" ca="left">
									<p>1.879</p>
								</c>
								<c cspan="2" ca="left">
									<p>2.014</p>
								</c>
							</r>
						</tblbdy>
						<tblfn>
							<p>*Pooled normal tissue from breast, brain, fetal brain, heart, kidney, uterus, lung, trachea and small intestine.</p>
						</tblfn>
					</tbl>
					<p>The risk of normalizing data to a housekeeper gene with variable overall expression level across different tissues can be represented mathematically as bias error. A housekeeper that has low bias for a particular tissue has an expression level that is near its mean expression across tissues. In our second model, the term (<it>CG</it>)<sub><it>kj </it></sub>represents this tissue-type specific bias. The measure of variability around an intended value when bias is present is called the mean squared error (MSE): MSE = bias<sup>2 </sup>+ variance. Thus, to find a set of genes for normalization across the various tissue types we use a minimax MSE criterion: minimizing the largest MSE of the combination. Table <tblr tid="T4">4</tblr> provides a list for the best gene set of each size along with the minimax-MSE value. Although <it>GAPD </it>has relatively low overall variability within each tissue type, its basal expression changes across tissue types making it a poor choice for a single universal control. The data shows that <it>RPL13A </it>(ribosomal protein L13a) is the best single universal housekeeper, but it is clear that no single gene is optimal for a universal housekeeper. Actually, choosing all the candidates provides the smallest MSE, which is not surprising as the set of all 10 genes is unbiased by definition. For routine application it is reasonable to limit the number of control genes, as the cost of assaying additional genes needs to balance the extra precision obtained. With this in mind, it is instructive to note that the three-member set of <it>HPRT1</it>, <it>RPL13A </it>and <it>UBC </it>is an excellent choice because it maintains a priority ranking even when selection is open to including four- or five-element sets. The housekeeper genes we tested by RT-PCR on breast tumor samples were not assayed across other tissue types and thus could not be evaluated as universal controls. Nevertheless, it is likely that our results in breast tissue would hold up across other tissue types as our genes were initially selected from microarray data that included 17 different and diverse cell lines as well as primary breast tumors <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
					<tbl id="T4">
						<title>
							<p>Table 4</p>
						</title>
						<caption>
							<p>Minimax MSE optimal gene sets for each set size</p>
						</caption>
						<tblbdy cols="3">
							<r>
								<c ca="left">
									<p>Maximum number of members</p>
								</c>
								<c ca="left">
									<p>Gene set</p>
								</c>
								<c ca="center">
									<p>Maximal MSE</p>
								</c>
							</r>
							<r>
								<c cspan="3">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>1</p>
								</c>
								<c ca="left">
									<p>
										<it>RPL13A</it>
									</p>
								</c>
								<c ca="center">
									<p>0.544</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>2</p>
								</c>
								<c ca="left">
									<p><it>HPRT1</it>, <it>UBC</it></p>
								</c>
								<c ca="center">
									<p>0.328</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>3</p>
								</c>
								<c ca="left">
									<p><it>HPRT1</it>, <it>RPL13A</it>, <it>UBC</it></p>
								</c>
								<c ca="center">
									<p>0.136</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>4</p>
								</c>
								<c ca="left">
									<p><it>HPRT1</it>, <it>RPL13A</it>, <it>UBC</it></p>
								</c>
								<c ca="center">
									<p>0.136</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>5</p>
								</c>
								<c ca="left">
									<p><it>HPRT1</it>, <it>RPL13A</it>, <it>UBC</it></p>
								</c>
								<c ca="center">
									<p>0.136</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>6</p>
								</c>
								<c ca="left">
									<p><it>ACTB</it>, <it>HPRT1</it>, <it>SDHA</it>, <it>TBP</it>, <it>UBC</it>, <it>YWHAZ</it></p>
								</c>
								<c ca="center">
									<p>0.131</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>7</p>
								</c>
								<c ca="left">
									<p><it>ACTB</it>, <it>HPRT1</it>, <it>RPL13A</it>, <it>SDHA</it>, <it>TBP</it>, <it>UBC</it>, <it>YWHAZ</it></p>
								</c>
								<c ca="center">
									<p>0.064</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>8</p>
								</c>
								<c ca="left">
									<p><it>ACTB</it>, <it>HPRT1</it>, <it>RPL13A</it>, <it>SDHA</it>, <it>TBP</it>, <it>UBC</it>, <it>YWHAZ</it></p>
								</c>
								<c ca="center">
									<p>0.064</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>9</p>
								</c>
								<c ca="left">
									<p><it>ACTB</it>, <it>HPRT1</it>, <it>RPL13A</it>, <it>SDHA</it>, <it>TBP</it>, <it>UBC</it>, <it>YWHAZ</it></p>
								</c>
								<c ca="center">
									<p>0.064</p>
								</c>
							</r>
							<r>
								<c ca="left">
									<p>10</p>
								</c>
								<c ca="left">
									<p><it>ACTB</it>, <it>B2M</it>, <it>GAPD</it>, <it>HMBS</it>, <it>HPRT1</it>, <it>RPL13A</it>, <it>SDHA</it>, <it>TBP</it>, <it>UBC</it>, <it>YWHAZ</it></p>
								</c>
								<c ca="center">
									<p>0.049</p>
								</c>
							</r>
						</tblbdy>
					</tbl>
					<p>Figure <figr fid="F3">3</figr> shows the MSE of each gene broken down into the squared-bias and variance components. The direction of each bar shows the sign of the bias. It is apparent that the large bias dominates the large values of MSE. The use of the (log-) average of several genes tends to reduce the variance, due to the effect of bias reduction where opposite biases cancel each other out. For example, both <it>ACTB </it>and <it>TBP </it>(TATA box binding protein) have a large bias in the pooled normal samples, but in opposing directions. The mean squared error of the (log-) average of <it>ACTB </it>and <it>TBP </it>in these samples is only 0.35, which is much lower than their individual MSEs above 6.</p>
					<fig id="F3">
						<title>
							<p>Figure 3</p>
						</title>
						<caption>
							<p>Mean squared error (MSE) of each gene by tissue-type</p>
						</caption>
						<text>
							<p>Mean squared error (MSE) of each gene by tissue-type. The sign is determined by the direction of the bias. The MSE is broken down into the contributing components of the squared bias (Bias<sup>2</sup>) and the variance (Sigma<sup>2</sup>). Dataset from Vandesompele <it>et al</it>. [13]. Tissue types analyzed included normal bone marrow (BM), cultured normal fibroblasts (FIB), normal leukocytes (LEU), neuroblastoma cell lines (NB), and pooled normal tissue from breast, brain, fetal brain, heart, kidney, uterus, lung, trachea and small intestine (POOL).</p>
						</text>
						<graphic file="gb-2004-5-8-r59-3"/>
					</fig>
					<p>In summary, we have modeled the performance of putative housekeepers to test their goodness-of-fit in serving as normalization controls for relative insert quantification. A major advantage of a model approach is that the terms are placed within a solid statistical framework and are not <it>ad hoc</it>, which allows the algorithm to be generalized to a variety of different experimental conditions. The genes and algorithms that we have selected for normalization should have broad utility for diagnostics and research.</p>
				</sec>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>Pre-selection of assayed genes from microarray experiments</p>
				</st>
				<p>Four candidate housekeepers (<it>PSMC4</it>, <it>MRPL19</it>, <it>PUM1 </it>and <it>SF3A1</it>) were selected from a microarray dataset containing 40 different breast tumors, three normal breast samples and 19 cell lines representing 17 different cell lines of diverse nature including lymphocytes, fibroblasts and epithelial cells <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. All experiments were done using a common reference strategy in which all experimental samples are compared to the same reference comprised of a pool of RNAs isolated from 11 diverse human cell lines <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
				<p>To select housekeepers, we first 'filtered' the microarray data to select genes with Cy3 and Cy5 signal intensities greater than 500 units across at least 75% of the experiments. This requirement ensures that the gene is well expressed not only in the experimental samples, but also in the common reference sample. Next, we used the SAS/STAT Analysis Package Version 8 (SAS Institute Inc., Cary, NC) to identify a set of genes that showed a small range of expression across sample types and the least variance of the array-mean normalized log-ratios. For real-time RT-PCR, we selected four of the top six genes - <it>PUM1</it>, <it>PSMC4</it>, <it>MRPL19 </it>and <it>SF3A</it>. The two other low-variability genes identified in the data were <it>IER3 </it>(immediate early response 3) and <it>SRY </it>((sex determining region Y)-box 2). We did not select these genes because of their potential for being differentially regulated under other conditions. However, we did include <it>GAPD </it>and <it>ACTB</it>, which are commonly used reference genes <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, in the set of candidate genes for comparison to the microarray selection.</p>
			</sec>
			<sec>
				<st>
					<p>Samples and cDNA preparation</p>
				</st>
				<p>Breast samples were acquired under informed consent and received at the Huntsman Cancer Institute (Salt Lake City, UT) for gene expression analysis (University of Utah, IRB #8533). All specimens were expediently processed in pathology upon arrival from surgery. Samples were grossly dissected, procured by flash freezing in liquid nitrogen, and stored at -80&#176;C until RNA extraction. Approximately 50-100 mg cancer tissue was homogenized from each sample, and total RNA was prepared using the RNeasy midi kit (Qiagen). The integrity of RNA was determined using the RNA 6000 Nano LabChip kit (Agilent Technologies) and an Agilent 2100 Bioanalyzer. Two microliters of total RNA (50 ng/&#956;l) were heated to 70&#176;C and 1 &#956;l was loaded on the column. Degradation was evaluated using the signal of the 18S and 28S ribosomal peaks <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
				<p>First-strand cDNA was synthesized from 1 &#956;g total RNA using oligo(dT) primers and Superscript III reverse transcriptase following manufacturer's instructions (Superscript III First-Strand Synthesis System, Invitrogen Life Technologies). Briefly, the reaction was held at 48&#176;C for 50 min, followed by a 15 min step at 70&#176;C. The cDNA was washed on QIAquick PCR purification column (Qiagen) and eluted in 2 &#215; 50 &#956;l of elution buffer. The cDNA was then diluted in TE' (10 mM Tris, 0.1 mM EDTA, pH 8.0), aliquoted and stored at -80&#176;C for further use.</p>
			</sec>
			<sec>
				<st>
					<p>Real-time quantitative PCR</p>
				</st>
				<p>All PCR reactions were performed on the LightCycler. Each 20 &#956;l reaction included 1 &#215; PCR buffer with 3 mM MgCl<sub>2 </sub>(Idaho Technology), 0.2 mM each of dATP, dCTP, and dGTP (Roche), 0.1 mM dTTP (Roche), 0.3 mM dUTP (Roche), 1 U of Platinum taq (Invitrogen Life Technologies), 1/40000 SYBR Green I (Molecular Probes), approximately 5 ng cDNA, and 0.4 &#956;M of each primer. The primers used for the RNA control genes are shown in Table <tblr tid="T5">5</tblr>.</p>
				<tbl id="T5">
					<title>
						<p>Table 5</p>
					</title>
					<caption>
						<p>Primers for housekeeper genes</p>
					</caption>
					<tblbdy cols="4">
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>Length (bases)</p>
							</c>
							<c ca="center">
								<p>GC(%)</p>
							</c>
							<c ca="center">
								<p>*Tm (&#176;C)</p>
							</c>
						</r>
						<r>
							<c cspan="4">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p><it>PSMC4 </it>(UniGene reference Hs.211594 - Gene ID: 5704)</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>GGCATGGACATCCAGAAG</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>55.6</p>
							</c>
							<c ca="center">
								<p>60</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CCACGACCCGGATGAAT</p>
							</c>
							<c ca="center">
								<p>17</p>
							</c>
							<c ca="center">
								<p>58.8</p>
							</c>
							<c ca="center">
								<p>61</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Amplified fragment</p>
							</c>
							<c ca="center">
								<p>190</p>
							</c>
							<c ca="center">
								<p>61</p>
							</c>
							<c ca="center">
								<p>90</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p><it>MRPL19 </it>(UniGene reference Hs.44024 - Gene ID: 9801)</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>GGGATTTGCATTCAGAGATCAG</p>
							</c>
							<c ca="center">
								<p>22</p>
							</c>
							<c ca="center">
								<p>45</p>
							</c>
							<c ca="center">
								<p>62</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>GGAAGGGCATCTCGTAAG</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>56</p>
							</c>
							<c ca="center">
								<p>61</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Amplified fragment</p>
							</c>
							<c ca="center">
								<p>182</p>
							</c>
							<c ca="center">
								<p>44</p>
							</c>
							<c ca="center">
								<p>83</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p><it>PUM1 </it>(UniGene reference Hs.153834 - Gene ID:9698)</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>TGAGGTGTGCACCATGAAC</p>
							</c>
							<c ca="center">
								<p>19</p>
							</c>
							<c ca="center">
								<p>53</p>
							</c>
							<c ca="center">
								<p>61</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CAGAATGTGCTTGCCATAGG</p>
							</c>
							<c ca="center">
								<p>20</p>
							</c>
							<c ca="center">
								<p>50</p>
							</c>
							<c ca="center">
								<p>61</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Amplified fragment</p>
							</c>
							<c ca="center">
								<p>187</p>
							</c>
							<c ca="center">
								<p>53</p>
							</c>
							<c ca="center">
								<p>87</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p><it>SF3A1 </it>(UniGene reference Hs.406277 -Gene ID:10291)</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>GGAGGATTCTGCACCTTCTAA</p>
							</c>
							<c ca="center">
								<p>21</p>
							</c>
							<c ca="center">
								<p>47</p>
							</c>
							<c ca="center">
								<p>61</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>GCGGTAGTAGGCATGGTAA</p>
							</c>
							<c ca="center">
								<p>19</p>
							</c>
							<c ca="center">
								<p>52</p>
							</c>
							<c ca="center">
								<p>60</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Amplified fragment</p>
							</c>
							<c ca="center">
								<p>196</p>
							</c>
							<c ca="center">
								<p>48</p>
							</c>
							<c ca="center">
								<p>85</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p><it>ACTB </it>(UniGene reference Hs.426930 - Gene ID:60)</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>TTCCTGGGCATGGAGTC</p>
							</c>
							<c ca="center">
								<p>17</p>
							</c>
							<c ca="center">
								<p>59</p>
							</c>
							<c ca="center">
								<p>60</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>CAGGTCTTTGCGGATGTC</p>
							</c>
							<c ca="center">
								<p>18</p>
							</c>
							<c ca="center">
								<p>55</p>
							</c>
							<c ca="center">
								<p>60</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Amplified fragment</p>
							</c>
							<c ca="center">
								<p>84</p>
							</c>
							<c ca="center">
								<p>55</p>
							</c>
							<c ca="center">
								<p>84</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p><sup>&#8224;</sup><it>GAPD </it>(UniGene reference Hs.169476 - Gene ID:2597)</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>AACAGCCTCAAGATCATCAGC</p>
							</c>
							<c ca="center">
								<p>21</p>
							</c>
							<c ca="center">
								<p>48</p>
							</c>
							<c ca="center">
								<p>63</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>GGATGATGTTCTGGAGAGCC</p>
							</c>
							<c ca="center">
								<p>20</p>
							</c>
							<c ca="center">
								<p>55</p>
							</c>
							<c ca="center">
								<p>62</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Amplified fragment</p>
							</c>
							<c ca="center">
								<p>198</p>
							</c>
							<c ca="center">
								<p>56</p>
							</c>
							<c ca="center">
								<p>89</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>*Primers Tm determined using Tm Utility with algorithms adapted from Santa Lucia [23]. The Tm for the amplified fragment is the empirical Tm. <sup>&#8224;</sup>Primers for <it>GAPD </it>[20].</p>
					</tblfn>
				</tbl>
				<p>PCR was done using the following protocol: initial denaturation 95&#176;C for 1 min 30 sec, then 50 cycles at 94&#176;C for 1 sec for denaturation, 60&#176;C for 5 sec (20&#176;C/sec transition) for annealing, 72&#176;C for 8 sec (2&#176;C/sec transition) for extension. Fluorescence emission of SYBR Green I (channel 1, 530 nm) was acquired each cycle after the extension step. A melting step was performed after PCR to determine product purity. For melting curve analysis, the reactions were rapidly (20&#176;C/sec) cooled from 95&#176;C to 60&#176;C and then slowly heated (0.1&#176;C/sec) back to 95&#176;C while continuously monitoring fluorescence.</p>
			</sec>
			<sec>
				<st>
					<p>Relative quantification</p>
				</st>
				<p>Copy number was determined using the crossing point (Cp) value, which is automatically calculated using the LightCycler 3.5 software (Roche Molecular Biochemicals). The Cp value is reported as a fractional cycle number that is determined from the second derivative maximum (point of maximum acceleration) on the PCR amplification curve (fluorescence versus cycle number) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. A relative starting copy number was determined for each housekeeper using a calibration curve done with the same batch of master mix. Efficiency (<it>E</it>) of PCR was calculated from a plot of Cp versus log ng cDNA <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
				<p><it>E </it>= 10<sup>-1/<it>slope</it></sup></p>
			</sec>
			<sec>
				<st>
					<p>Modeling expression data</p>
				</st>
				<p>As the effects of interest are fold changes, we modeled the log-transformed expression Model 1a.</p>
				<p>log <it>y</it><sub><it>ij </it></sub>= <it>&#956; </it>+ <it>T</it><sub><it>i </it></sub>+ <it>G</it><sub><it>j </it></sub>+ <it>&#949;</it><sub><it>ij</it></sub>,</p>
				<p>where <graphic file="gb-2004-5-8-r59-i3.gif"/> independent</p>
				<p>where <it>&#956; </it>denotes the overall mean (log) expression, <it>T</it><sub><it>i </it></sub>is the difference of the <it>i</it>th tissue sample from the overall average and <it>G</it><sub><it>j </it></sub>is the difference of the <it>j</it>th gene from the overall average. The key feature of this model that makes it different from a traditional ANOVA model is that it allows heteroscedastic errors: the variability of the genes is different.</p>
				<p>We fitted the model using the gls routine of the nlme library for R, however other commonly available software such as PROC MIXED from SAS could have been used.</p>
				<p>Based on the model, the variability of the logarithm of the geometric mean</p>
				<p>
					<graphic file="gb-2004-5-8-r59-i4.gif"/>
				</p>
				<p>of a gene-set <it>S </it>was estimated as</p>
				<p>
					<graphic file="gb-2004-5-8-r59-i5.gif"/>
				</p>
				<p>Vandesompele <it>et al</it>.'s <it>M-</it>value is the average of relative standard deviations of the log-expression levels. Under Model 1, the <it>M</it>-value of the gene is closely related to its variance (under Models 2 and 3 below, the similar relationships can be derived):</p>
				<p>
					<graphic file="gb-2004-5-8-r59-i6.gif"/>
				</p>
				<p>We tested the assumption of unequal variances by fitting Model 1b that forces all the genes to have the same variability (this is the classical ANOVA model).</p>
				<p>log <it>y</it><sub><it>ij </it></sub>= <it>&#956; </it>+ <it>T</it><sub><it>i </it></sub>+ <it>G</it><sub><it>j </it></sub>+ <it>&#949;</it><sub><it>ij</it></sub>,</p>
				<p>where <graphic file="gb-2004-5-8-r59-i7.gif"/> independent</p>
				<p>Model 1c with a correlated error structure can be used to assess the assumption of (conditional) independence of the genes given the sample mean. If warranted, a more complicated correlation structure can be imposed.</p>
				<p>log <it>y</it><sub><it>ij </it></sub>= <it>&#956; </it>+ <it>T</it><sub><it>i </it></sub>+ <it>G</it><sub><it>j </it></sub>+ <it>&#949;</it><sub><it>ij</it></sub>,</p>
				<p>where</p>
				<p>
					<graphic file="gb-2004-5-8-r59-i8.gif"/>
				</p>
				<p>For the multiple tissue-type set-up the notation and the model need to be extended. We will denote the expression level of gene <it>j </it>of in the <it>i</it>th sample of type <it>k </it>by <it>y</it><sub><it>i</it>(<it>k</it>)<it>j</it></sub>, <it>i </it>= 1,...<it>n</it><sub><it>k</it></sub>, <it>j </it>= 1,...,<it>g </it>and <it>k </it>= 1,...,<it>m</it>. The best-fitting model for the data, which we call Model 2, had the form</p>
				<p>log <it>y</it><sub><it>i(k)j </it></sub>= <it>&#956; </it>+ <it>C</it><sub><it>k </it></sub>+ <it>T</it><sub><it>i(k) </it></sub>+ <it>G</it><sub><it>j </it></sub>+ (<it>CG</it>)<sub><it>kj </it></sub>+ <it>&#949;</it><sub><it>i(k)j</it></sub>,</p>
				<p>where <graphic file="gb-2004-5-8-r59-i9.gif"/></p>
				<p>Thus the errors are independent and their variability is decomposed into a gene-specific and tissue-type specific multiplicative components. The last restriction ensures the uniqueness of the solution. Simpler models that we considered used uniform error variance, equal error variance for tissue types, and equal error variance for genes. We also considered more complex models that used exchangeable correlation structure for the errors and unstructured error variance (each gene-tissue-type combination has a variance parameter). The BIC was used as a basis for model selection.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>Additional data available with this paper online is an Excel file with the relative copy numbers of six genes in the 80 breast cancer samples used in this study (Additional data file <supplr sid="s1">1</supplr>).</p>
			<suppl id="s1">
				<title>
					<p>Additional data file 1</p>
				</title>
				<caption>
					<p>An Excel file with the relative copy numbers of six genes in the 80 breast cancer samples used in this study</p>
				</caption>
				<text>
					<p>An Excel file with the relative copy numbers of six genes in the 80 breast cancer samples used in this study</p>
				</text>
				<file name="gb-2004-5-8-r59-s1.xls">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>This work has been supported in part by the National Cancer Institute (R33 CA097769-01).</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Methods to optimize the generation of cDNA from postmortem human brain tissue.</p>
				</title>
				<aug>
					<au>
						<snm>Miller</snm>
						<fnm>CL</fnm>
					</au>
					<au>
						<snm>Yolken</snm>
						<fnm>RH</fnm>
					</au>
				</aug>
				<source>Brain Res Brain Res Protoc</source>
				<pubdate>2003</pubdate>
				<volume>10</volume>
				<fpage>156</fpage>
				<lpage>167</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1385-299X(02)00214-3</pubid>
						<pubid idtype="pmpid" link="fulltext">12565686</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Evaluation of DNA fragment sizing and quantification by the Agilent 2100 bioanalyzer.</p>
				</title>
				<aug>
					<au>
						<snm>Panaro</snm>
						<fnm>NJ</fnm>
					</au>
					<au>
						<snm>Yuen</snm>
						<fnm>PK</fnm>
					</au>
					<au>
						<snm>Sakazume</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Fortina</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Kricka</snm>
						<fnm>LJ</fnm>
					</au>
					<au>
						<snm>Wilding</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Clin Chem</source>
				<pubdate>2000</pubdate>
				<volume>46</volume>
				<fpage>1851</fpage>
				<lpage>1853</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11067828</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Control selection for RNA quantitation.</p>
				</title>
				<aug>
					<au>
						<snm>Suzuki</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Crawford</snm>
						<fnm>DR</fnm>
					</au>
				</aug>
				<source>Biotechniques</source>
				<pubdate>2000</pubdate>
				<volume>29</volume>
				<fpage>332</fpage>
				<lpage>337</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10948434</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Comparison of glyceraldehyde-3-phosphate dehydrogenase and 28S-ribosomal RNA gene expression as RNA loading controls for northern blot analysis of cell lines of varying malignant potential.</p>
				</title>
				<aug>
					<au>
						<snm>Bhatia</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Taylor</snm>
						<fnm>WR</fnm>
					</au>
					<au>
						<snm>Greenberg</snm>
						<fnm>AH</fnm>
					</au>
					<au>
						<snm>Wright</snm>
						<fnm>JA</fnm>
					</au>
				</aug>
				<source>Anal Biochem</source>
				<pubdate>1994</pubdate>
				<volume>216</volume>
				<fpage>223</fpage>
				<lpage>226</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/abio.1994.1028</pubid>
						<pubid idtype="pmpid" link="fulltext">8135355</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Problems related to the interpretation of autoradiographic data on gene expression using common constitutive transcripts as controls.</p>
				</title>
				<aug>
					<au>
						<snm>Spanakis</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1993</pubdate>
				<volume>21</volume>
				<fpage>3809</fpage>
				<lpage>3819</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8367299</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Relative quantitative RT-PCR protocol for TrkB expression in neuroblastoma using GAPD as an internal control.</p>
				</title>
				<aug>
					<au>
						<snm>Eggert</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Brodeur</snm>
						<fnm>GM</fnm>
					</au>
					<au>
						<snm>Ikegaki</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Biotechniques</source>
				<pubdate>2000</pubdate>
				<volume>28</volume>
				<fpage>681</fpage>
				<lpage>691</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10769746</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Quantitative monitoring of gene expression patterns with a complementary DNA microarray.</p>
				</title>
				<aug>
					<au>
						<snm>Schena</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Shalon</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Davis</snm>
						<fnm>RW</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>PO</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1995</pubdate>
				<volume>270</volume>
				<fpage>467</fpage>
				<lpage>470</lpage>
				<xrefbib>
					<pubid idtype="pmpid">7569999</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Molecular portraits of human breast tumours.</p>
				</title>
				<aug>
					<au>
						<snm>Perou</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Sorlie</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Eisen</snm>
						<fnm>MB</fnm>
					</au>
					<au>
						<snm>van de Rijn</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Jeffrey</snm>
						<fnm>SS</fnm>
					</au>
					<au>
						<snm>Rees</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Pollack</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Ross</snm>
						<fnm>DT</fnm>
					</au>
					<au>
						<snm>Johnsen</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Akslen</snm>
						<fnm>LA</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2000</pubdate>
				<volume>406</volume>
				<fpage>747</fpage>
				<lpage>752</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35021093</pubid>
						<pubid idtype="pmpid" link="fulltext">10963602</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>A gene-expression signature as a predictor of survival in breast cancer.</p>
				</title>
				<aug>
					<au>
						<snm>van de Vijver</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>He</snm>
						<fnm>YD</fnm>
					</au>
					<au>
						<snm>van't Veer</snm>
						<fnm>LJ</fnm>
					</au>
					<au>
						<snm>Dai</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Hart</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Voskuil</snm>
						<fnm>DW</fnm>
					</au>
					<au>
						<snm>Schreiber</snm>
						<fnm>GJ</fnm>
					</au>
					<au>
						<snm>Peterse</snm>
						<fnm>JL</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Marton</snm>
						<fnm>MJ</fnm>
					</au>
					<etal/>
				</aug>
				<source>N Engl J Med</source>
				<pubdate>2002</pubdate>
				<volume>347</volume>
				<fpage>1999</fpage>
				<lpage>2009</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1056/NEJMoa021967</pubid>
						<pubid idtype="pmpid" link="fulltext">12490681</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Delineation of prognostic biomarkers in prostate cancer.</p>
				</title>
				<aug>
					<au>
						<snm>Dhanasekaran</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Barrette</snm>
						<fnm>TR</fnm>
					</au>
					<au>
						<snm>Ghosh</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Shah</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Varambally</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Kurachi</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Pienta</snm>
						<fnm>KJ</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Chinnaiyan</snm>
						<fnm>AM</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2001</pubdate>
				<volume>412</volume>
				<fpage>822</fpage>
				<lpage>826</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35090585</pubid>
						<pubid idtype="pmpid" link="fulltext">11518967</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Analysis of gene expression profiles in normal and neoplastic ovarian tissue samples identifies candidate molecular markers of epithelial ovarian cancer.</p>
				</title>
				<aug>
					<au>
						<snm>Welsh</snm>
						<fnm>JB</fnm>
					</au>
					<au>
						<snm>Zarrinkar</snm>
						<fnm>PP</fnm>
					</au>
					<au>
						<snm>Sapinoso</snm>
						<fnm>LM</fnm>
					</au>
					<au>
						<snm>Kern</snm>
						<fnm>SG</fnm>
					</au>
					<au>
						<snm>Behling</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Monk</snm>
						<fnm>BJ</fnm>
					</au>
					<au>
						<snm>Lockhart</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Burger</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Hampton</snm>
						<fnm>GM</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2001</pubdate>
				<volume>98</volume>
				<fpage>1176</fpage>
				<lpage>1181</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1073/pnas.98.3.1176</pubid>
						<pubid idtype="pmpid" link="fulltext">11158614</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Molecular analysis of glioblastoma: pathway profiling and its implications for patient therapy.</p>
				</title>
				<aug>
					<au>
						<snm>Mischel</snm>
						<fnm>PS</fnm>
					</au>
					<au>
						<snm>Nelson</snm>
						<fnm>SF</fnm>
					</au>
					<au>
						<snm>Cloughesy</snm>
						<fnm>TF</fnm>
					</au>
				</aug>
				<source>Cancer Biol Ther</source>
				<pubdate>2003</pubdate>
				<volume>2</volume>
				<fpage>242</fpage>
				<lpage>247</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12878856</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes.</p>
				</title>
				<aug>
					<au>
						<snm>Vandesompele</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>De Preter</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Pattyn</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Poppe</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Van Roy</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>De Paepe</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Speleman</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>research0034.1</fpage>
				<lpage>0034.11</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">12184808</pubid>
						<pubid idtype="doi">10.1186/gb-2002-3-7-research0034</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Customized molecular phenotyping by quantitative gene expression and pattern recognition analysis.</p>
				</title>
				<aug>
					<au>
						<snm>Akilesh</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Shaffer</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Roopenian</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>1719</fpage>
				<lpage>1727</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1101/gr.533003</pubid>
						<pubid idtype="pmpid" link="fulltext">12840047</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Discrepancies in clinical laboratory testing of eligibility for trastuzumab therapy: apparent immunohistochemical false-positives do not get the message.</p>
				</title>
				<aug>
					<au>
						<snm>Tubbs</snm>
						<fnm>RR</fnm>
					</au>
					<au>
						<snm>Pettay</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Roche</snm>
						<fnm>PC</fnm>
					</au>
					<au>
						<snm>Stoler</snm>
						<fnm>MH</fnm>
					</au>
					<au>
						<snm>Jenkins</snm>
						<fnm>RB</fnm>
					</au>
					<au>
						<snm>Grogan</snm>
						<fnm>TM</fnm>
					</au>
				</aug>
				<source>J Clin Oncol</source>
				<pubdate>2001</pubdate>
				<volume>19</volume>
				<fpage>2714</fpage>
				<lpage>2721</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11352964</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Overexpression of cyclin D1 mRNA in colorectal carcinomas and relationship to clinicopathological features: an <it>in situ </it>hybridization analysis.</p>
				</title>
				<aug>
					<au>
						<snm>Kristt</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Turner</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Koren</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Ramadan</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Gal</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Pathol Oncol Res</source>
				<pubdate>2000</pubdate>
				<volume>6</volume>
				<fpage>65</fpage>
				<lpage>70</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10749591</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<aug>
					<au>
						<snm>Pinheiro</snm>
						<fnm>JCBD</fnm>
					</au>
				</aug>
				<source>Mixed-effects Models in S and S-PLUS</source>
				<publisher>New York: Springer</publisher>
				<pubdate>2000</pubdate>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Estimating the dimension of a model.</p>
				</title>
				<aug>
					<au>
						<snm>Schwarz</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Annls Stat</source>
				<pubdate>1978</pubdate>
				<volume>6</volume>
				<fpage>461</fpage>
				<lpage>464</lpage>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Tumor classification using gene expression patterns from DNA microarrays.</p>
				</title>
				<aug>
					<au>
						<snm>Perou</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>PO</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>New Technologies for Life Sciences: A Trends Guide</source>
				<pubdate>2000</pubdate>
				<fpage>67</fpage>
				<lpage>76</lpage>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Effects of prostaglandins on human hematopoietic osteoclast precursors.</p>
				</title>
				<aug>
					<au>
						<snm>Roux</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Pichaud</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Quinn</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Lalande</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Morieux</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Jullienne</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>de Vernejoul</snm>
						<fnm>MC</fnm>
					</au>
				</aug>
				<source>Endocrinology</source>
				<pubdate>1997</pubdate>
				<volume>138</volume>
				<fpage>1476</fpage>
				<lpage>1482</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1210/en.138.4.1476</pubid>
						<pubid idtype="pmpid" link="fulltext">9075705</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>Profiling breast cancer using real-time quantitative PCR.</p>
				</title>
				<aug>
					<au>
						<snm>Frank</snm>
						<fnm>SG</fnm>
					</au>
					<au>
						<snm>Bernard</snm>
						<fnm>PS</fnm>
					</au>
				</aug>
				<source>In Rapid Cycle Real-Time PCR: Methods and Applications</source>
				<publisher>Heidelberg: Springer</publisher>
				<editor>Wittwer CT, Meuer S, Nakagawara K</editor>
				<pubdate>2003</pubdate>
				<fpage>95</fpage>
				<lpage>106</lpage>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Quantification on the LightCycler.</p>
				</title>
				<aug>
					<au>
						<snm>Rasmussen</snm>
						<fnm>RP</fnm>
					</au>
				</aug>
				<source>In Rapid Cycle Real-Time PCR: Methods and Applications</source>
				<publisher>Heidelberg: Springer</publisher>
				<editor>Wittwer CT, Meuer S, Nakagawara K</editor>
				<pubdate>2003</pubdate>
				<fpage>21</fpage>
				<lpage>34</lpage>
			</bibl>
			<bibl id="B23">
				<title>
					<p>A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics.</p>
				</title>
				<aug>
					<au>
						<snm>SantaLucia</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1998</pubdate>
				<volume>95</volume>
				<fpage>1460</fpage>
				<lpage>1465</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1073/pnas.95.4.1460</pubid>
						<pubid idtype="pmpid" link="fulltext">9465037</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
