Email updates

Keep up to date with the latest news and content from BMC Bioinformatics and BioMed Central.

Open Access Highly Accessed Research article

Evaluation of normalization methods for microarray data

Taesung Park1*, Sung-Gon Yi1, Sung-Hyun Kang1, SeungYeoun Lee2, Yong-Sung Lee3 and Richard Simon4

Author Affiliations

1 Department of Statistics, Seoul National University, Seoul, Korea

2 Department of Applied Mathematics, Sejong University, Seoul, Korea

3 Department of Biochemistry, Hanyang University College of Medicine, Seoul, Korea

4 Biometric Research Branch, Division of Cancer Treatment & Diagnosis National Cancer Institute, Bethesda MD, USA

For all author emails, please log on.

BMC Bioinformatics 2003, 4:33  doi:10.1186/1471-2105-4-33

Published: 2 September 2003

Abstract

Background

Microarray technology allows the monitoring of expression levels for thousands of genes simultaneously. This novel technique helps us to understand gene regulation as well as gene by gene interactions more systematically. In the microarray experiment, however, many undesirable systematic variations are observed. Even in replicated experiment, some variations are commonly observed. Normalization is the process of removing some sources of variation which affect the measured gene expression levels. Although a number of normalization methods have been proposed, it has been difficult to decide which methods perform best. Normalization plays an important role in the earlier stage of microarray data analysis. The subsequent analysis results are highly dependent on normalization.

Results

In this paper, we use the variability among the replicated slides to compare performance of normalization methods. We also compare normalization methods with regard to bias and mean square error using simulated data.

Conclusions

Our results show that intensity-dependent normalization often performs better than global normalization methods, and that linear and nonlinear normalization methods perform similarly. These conclusions are based on analysis of 36 cDNA microarrays of 3,840 genes obtained in an experiment to search for changes in gene expression profiles during neuronal differentiation of cortical stem cells. Simulation studies confirm our findings.