Consistent Differential Expression Pattern (CDEP) on microarray to identify genes related to metastatic behavior
1 Bioinformatics Graduate Program, Department of Biochemistry and Molecular Biology, Medical University of South Carolina, 135 Cannon St. Charleston, SC 29425, USA
2 Department of Statistics, Florida State University, 117 N. Woodward Ave. Tallahassee, FL 32306, USA
3 Division of Bioinformatics, Department of Biochemistry and Molecular Biology, Medical University of South Carolina, 135 Cannon St. Charleston, SC 29425, USA
BMC Bioinformatics 2011, 12:438 doi:10.1186/1471-2105-12-438Published: 11 November 2011
To utilize the large volume of gene expression information generated from different microarray experiments, several meta-analysis techniques have been developed. Despite these efforts, there remain significant challenges to effectively increasing the statistical power and decreasing the Type I error rate while pooling the heterogeneous datasets from public resources. The objective of this study is to develop a novel meta-analysis approach, Consistent Differential Expression Pattern (CDEP), to identify genes with common differential expression patterns across different datasets.
We combined False Discovery Rate (FDR) estimation and the non-parametric RankProd approach to estimate the Type I error rate in each microarray dataset of the meta-analysis. These Type I error rates from all datasets were then used to identify genes with common differential expression patterns. Our simulation study showed that CDEP achieved higher statistical power and maintained low Type I error rate when compared with two recently proposed meta-analysis approaches. We applied CDEP to analyze microarray data from different laboratories that compared transcription profiles between metastatic and primary cancer of different types. Many genes identified as differentially expressed consistently across different cancer types are in pathways related to metastatic behavior, such as ECM-receptor interaction, focal adhesion, and blood vessel development. We also identified novel genes such as AMIGO2, Gem, and CXCL11 that have not been shown to associate with, but may play roles in, metastasis.
CDEP is a flexible approach that borrows information from each dataset in a meta-analysis in order to identify genes being differentially expressed consistently. We have shown that CDEP can gain higher statistical power than other existing approaches under a variety of settings considered in the simulation study, suggesting its robustness and insensitivity to data variation commonly associated with microarray experiments.