RCMAT: a regularized covariance matrix approach to testing gene sets
* Corresponding author: Mark A Reimers mreimers@vcu.edu
Department of Biostatistics, Virginia Commonwealth University, Richmond, Virginia 23298, USA
BMC Bioinformatics 2009, 10:300 doi:10.1186/1471-2105-10-300
Published: 21 September 2009
Abstract
Background
Gene sets are widely used to interpret genome-scale data. Analysis techniques that make better use of the correlation structure of microarray data while addressing practical "n<p" concerns (fewer samples than genes) could provide a real increase in power. However, correlation structure is hard to estimate with typical genomics sample sizes. In this paper we present an extension of a classical multivariate procedure that confronts this challenge by using a regularized covariance matrix.
Results
We evaluated our testing procedure on both simulated data and a widely analyzed diabetes data set, comparing it with another popular multivariate test on each. Our results suggest that our approach detects gene set differences with greater power than the popular multivariate test, with no increase in the false positive rate.
Conclusion
Our regularized covariance matrix multivariate approach to gene set testing showed promise in both real and simulated data comparisons. Our findings are consistent with the recent literature in gene set methodology.
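To make the idea of a regularized covariance matrix test concrete, the following is a minimal sketch and not the authors' implementation. It assumes the classical procedure is a two-sample Hotelling-type statistic, that regularization is a simple shrinkage of the pooled covariance toward its diagonal, and that significance is assessed by permuting sample labels; none of these choices is specified in the abstract. The function names and the shrinkage weight `lam` are hypothetical.

```python
import numpy as np


def shrunk_covariance(X, Y, lam):
    """Pooled covariance shrunk toward its diagonal; lam in (0, 1] (assumed scheme)."""
    n1, n2 = X.shape[0], Y.shape[0]
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    S = (Xc.T @ Xc + Yc.T @ Yc) / (n1 + n2 - 2)
    # With lam > 0 and positive variances the result is invertible even when n < p.
    return (1.0 - lam) * S + lam * np.diag(np.diag(S))


def regularized_t2(X, Y, lam=0.5):
    """Hotelling-type statistic using the shrunk covariance in place of the pooled one."""
    n1, n2 = X.shape[0], Y.shape[0]
    d = X.mean(axis=0) - Y.mean(axis=0)
    S = shrunk_covariance(X, Y, lam)
    return (n1 * n2) / (n1 + n2) * float(d @ np.linalg.solve(S, d))


def permutation_pvalue(X, Y, lam=0.5, n_perm=999, seed=0):
    """Permutation p-value obtained by shuffling sample labels (assumed null scheme)."""
    rng = np.random.default_rng(seed)
    observed = regularized_t2(X, Y, lam)
    Z = np.vstack([X, Y])
    n1 = X.shape[0]
    exceed = 0
    for _ in range(n_perm):
        idx = rng.permutation(Z.shape[0])
        stat = regularized_t2(Z[idx[:n1]], Z[idx[n1:]], lam)
        exceed += int(stat >= observed)
    return (exceed + 1) / (n_perm + 1)


# Illustrative data: 10 samples per group, a 30-gene set (n < p), shifted means.
rng = np.random.default_rng(1)
X = rng.normal(size=(10, 30))
Y = rng.normal(size=(10, 30)) + 1.0
print(regularized_t2(X, Y), permutation_pvalue(X, Y, n_perm=199))
```

Shrinking toward the diagonal keeps the matrix invertible when the gene set has more genes than samples, which is the situation in which the unregularized statistic cannot be computed.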