Open Access Highly Accessed Methodology article

Identification of differentially expressed subnetworks based on multivariate ANOVA

Taeyoung Hwang1 and Taesung Park1,2*

Author Affiliations

1 Interdisciplinary Program of Bioinformatics, Seoul National University, Seoul, Republic of Korea

2 Department of Statistics, Seoul National University, Seoul, Republic of Korea

For all author emails, please log on.

BMC Bioinformatics 2009, 10:128 doi:10.1186/1471-2105-10-128

Published: 30 April 2009

Abstract

Background

Since high-throughput protein-protein interaction (PPI) data has recently become available for humans, there has been a growing interest in combining PPI data with other genome-wide data. In particular, the identification of phenotype-related PPI subnetworks using gene expression data has been of great concern. Successful integration for the identification of significant subnetworks requires the use of a search algorithm with a proper scoring method. Here we propose a multivariate analysis of variance (MANOVA)-based scoring method with a greedy search for identifying differentially expressed PPI subnetworks.

Results

Given the MANOVA-based scoring method, we performed a greedy search to identify the subnetworks with the maximum scores in the PPI network. Our approach was successfully applied to human microarray datasets. Each identified subnetwork was annotated with the Gene Ontology (GO) term, resulting in the phenotype-related functional pathway or complex. We also compared these results with those of other scoring methods such as t statistic- and mutual information-based scoring methods. The MANOVA-based method produced subnetworks with a larger number of proteins than the other methods. Furthermore, the subnetworks identified by the MANOVA-based method tended to consist of highly correlated proteins.

Conclusion

This article proposes a MANOVA-based scoring method to combine PPI data with expression data using a greedy search. This method is recommended for the highly sensitive detection of large subnetworks.