Evaluation of normalization methods for microarray data
1 Department of Statistics, Seoul National University, Seoul, Korea
2 Department of Applied Mathematics, Sejong University, Seoul, Korea
3 Department of Biochemistry, Hanyang University College of Medicine, Seoul, Korea
4 Biometric Research Branch, Division of Cancer Treatment & Diagnosis National Cancer Institute, Bethesda MD, USA
BMC Bioinformatics 2003, 4:33 doi:10.1186/1471-2105-4-33Published: 2 September 2003
Microarray technology allows the monitoring of expression levels for thousands of genes simultaneously. This novel technique helps us to understand gene regulation as well as gene by gene interactions more systematically. In the microarray experiment, however, many undesirable systematic variations are observed. Even in replicated experiment, some variations are commonly observed. Normalization is the process of removing some sources of variation which affect the measured gene expression levels. Although a number of normalization methods have been proposed, it has been difficult to decide which methods perform best. Normalization plays an important role in the earlier stage of microarray data analysis. The subsequent analysis results are highly dependent on normalization.
In this paper, we use the variability among the replicated slides to compare performance of normalization methods. We also compare normalization methods with regard to bias and mean square error using simulated data.
Our results show that intensity-dependent normalization often performs better than global normalization methods, and that linear and nonlinear normalization methods perform similarly. These conclusions are based on analysis of 36 cDNA microarrays of 3,840 genes obtained in an experiment to search for changes in gene expression profiles during neuronal differentiation of cortical stem cells. Simulation studies confirm our findings.