BMC Bioinformatics

official impact factor 3.03

Open Access Methodology article

RCMAT: a regularized covariance matrix approach to testing gene sets

Phillip D Yates and Mark A Reimers*

Author Affiliations

Department of Biostatistics, Virginia Commonwealth University, Richmond, Virginia 23298, USA

For all author emails, please log on.

BMC Bioinformatics 2009, 10:300 doi:10.1186/1471-2105-10-300

Published: 21 September 2009

Abstract

Background

Gene sets are widely used to interpret genome-scale data. Analysis techniques that make better use of the correlation structure of microarray data while addressing practical "n<p" concerns could provide a real increase in power. However correlation structure is hard to estimate with typical genomics sample sizes. In this paper we present an extension of a classical multivariate procedure that confronts this challenge by the use of a regularized covariance matrix.

Results

We evaluated our testing procedure using both simulated data and a widely analyzed diabetes data set. We compared our approach to another popular multivariate test for both sets of data. Our results suggest an increase in power for detecting gene set differences can be obtained using our approach relative to the popular multivariate test with no increase in the false positive rate.

Conclusion

Our regularized covariance matrix multivariate approach to gene set testing showed promise in both real and simulated data comparisons. Our findings are consistent with the recent literature in gene set methodology.