Log on / register
Feedback | Support | My details
Open AccessResearch article

Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays

Leah Barrera1,2 email, Chris Benner1,2 email, Yong-Chuan Tao3 email, Elizabeth Winzeler1,4 email and Yingyao Zhou1 email

1Genomics Institute of the Novartis Research Foundation, 10675 John Jay Hopkins Drive, California 92121, USA

2Bioinformatics Graduate Program, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA

3Novartis Institutes for Biomedical Research, 100 Technology Square, Cambridge, MA 02139, USA

4The Scripps Research Institute, La Jolla, California 92037, USA

author email corresponding author email

BMC Bioinformatics 2004, 5:42doi:10.1186/1471-2105-5-42

Published: 20 April 2004

Abstract

Background

To identify differentially expressed genes across experimental conditions in oligonucleotide microarray experiments, existing statistical methods commonly use a summary of probe-level expression data for each probe set and compare replicates of these values across conditions using a form of the t-test or rank sum test. Here we propose the use of a statistical method that takes advantage of the built-in redundancy architecture of high-density oligonucleotide arrays.

Results

We employ parametric and nonparametric variants of two-way analysis of variance (ANOVA) on probe-level data to account for probe-level variation, and use the false-discovery rate (FDR) to account for simultaneous testing on thousands of genes (multiple testing problem). Using publicly available data sets, we systematically compared the performance of parametric two-way ANOVA and the nonparametric Mack-Skillings test to the t-test and Wilcoxon rank-sum test for detecting differentially expressed genes at varying levels of fold change, concentration, and sample size. Using receiver operating characteristic (ROC) curve comparisons, we observed that two-way methods with FDR control on sample sizes with 2–3 replicates exhibits the same high sensitivity and specificity as a t-test with FDR control on sample sizes with 6–9 replicates in detecting at least two-fold change.

Conclusions

Our results suggest that the two-way ANOVA methods using probe-level data are substantially more powerful tests for detecting differential gene expression than corresponding methods for probe-set level data.


© 1999-2008 BioMed Central Ltd unless otherwise stated