Log on / register
Feedback | Support | My details
Open AccessMethodology article

Empirical validation of the S-Score algorithm in the analysis of gene expression data

Richard E Kennedy1 email, Kellie J Archer1,4 email and Michael F Miles2,3,4 email

1Department of Biostatistics, Virginia Commonwealth University, Richmond, VA 23298, USA

2Department of Pharmacology and Toxicology, Virginia Commonwealth University, Richmond, VA 23298, USA

3Department of Neurology, Virginia Commonwealth University, Richmond, VA 23298, USA

4Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, VA 23298, USA

author email corresponding author email

BMC Bioinformatics 2006, 7:154doi:10.1186/1471-2105-7-154

Published: 17 March 2006

Abstract

Background

Current methods of analyzing Affymetrix GeneChip® microarray data require the estimation of probe set expression summaries, followed by application of statistical tests to determine which genes are differentially expressed. The S-Score algorithm described by Zhang and colleagues is an alternative method that allows tests of hypotheses directly from probe level data. It is based on an error model in which the detected signal is proportional to the probe pair signal for highly expressed genes, but approaches a background level (rather than 0) for genes with low levels of expression. This model is used to calculate relative change in probe pair intensities that converts probe signals into multiple measurements with equalized errors, which are summed over a probe set to form the S-Score. Assuming no expression differences between chips, the S-Score follows a standard normal distribution, allowing direct tests of hypotheses to be made. Using spike-in and dilution datasets, we validated the S-Score method against comparisons of gene expression utilizing the more recently developed methods RMA, dChip, and MAS5.

Results

The S-score showed excellent sensitivity and specificity in detecting low-level gene expression changes. Rank ordering of S-Score values more accurately reflected known fold-change values compared to other algorithms.

Conclusion

The S-score method, utilizing probe level data directly, offers significant advantages over comparisons using only probe set expression summaries.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.