Log on / register
Feedback | Support | My details
Open AccessResearch article

Detecting differential expression in microarray data: comparison of optimal procedures

Elena Perelman1* email, Alexander Ploner1* email, Stefano Calza1,2 email and Yudi Pawitan1 email

Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, 17177 Stockholm, Sweden

Department of Biomedical Sciences and Biotechnologies, Brescia, Italy

author email corresponding author email* Contributed equally

BMC Bioinformatics 2007, 8:28doi:10.1186/1471-2105-8-28

Published: 26 January 2007

Abstract

Background

Many procedures for finding differentially expressed genes in microarray data are based on classical or modified t-statistics. Due to multiple testing considerations, the false discovery rate (FDR) is the key tool for assessing the significance of these test statistics. Two recent papers have generalized two aspects: Storey et al. (2005) have introduced a likelihood ratio test statistic for two-sample situations that has desirable theoretical properties (optimal discovery procedure, ODP), but uses standard FDR assessment; Ploner et al. (2006) have introduced a multivariate local FDR that allows incorporation of standard error information, but uses the standard t-statistic (fdr2d). The relationship and relative performance of these methods in two-sample comparisons is currently unknown.

Methods

Using simulated and real datasets, we compare the ODP and fdr2d procedures. We also introduce a new procedure called S2d that combines the ODP test statistic with the extended FDR assessment of fdr2d.

Results

For both simulated and real datasets, fdr2d performs better than ODP. As expected, both methods perform better than a standard t-statistic with standard local FDR. The new procedure S2d performs as well as fdr2d on simulated data, but performs better on the real data sets.

Conclusion

The ODP can be improved by including the standard error information as in fdr2d. This means that the optimality enjoyed in theory by ODP does not hold for the estimated version that has to be used in practice. The new procedure S2d has a slight advantage over fdr2d, which has to be balanced against a significantly higher computational effort and a less intuititive test statistic.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.