Log on / register
Feedback | Support | My details

This article is part of the supplement: NIPS workshop on New Problems and Methods in Computational Biology

Open AccessHighly AccessProceedings

ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context

Adam A Margolin1,2 email, Ilya Nemenman2 email, Katia Basso3 email, Chris Wiggins2,4 email, Gustavo Stolovitzky5 email, Riccardo Dalla Favera3 email and Andrea Califano1,2 email

1Department of Biomedical Informatics, Columbia University, New York, NY 10032

2Joint Centers for Systems Biology, Columbia University, New York, NY 10032

3Institute for Cancer Genetics, Columbia University, New York, NY 10032

4Department of Applied Physics and Applied Mathematics, Columbia University, New York, NY 10032

5IBM T.J. Watson Research Center, Yorktown Heights, NY 10598

author email corresponding author email

BMC Bioinformatics 2006, 7(Suppl 1):S7doi:10.1186/1471-2105-7-S1-S7

Published: 20 March 2006

Abstract

Background

Elucidating gene regulatory networks is crucial for understanding normal cell physiology and complex pathologic phenotypes. Existing computational methods for the genome-wide "reverse engineering" of such networks have been successful only for lower eukaryotes with simple genomes. Here we present ARACNE, a novel algorithm, using microarray expression profiles, specifically designed to scale up to the complexity of regulatory networks in mammalian cells, yet general enough to address a wider range of network deconvolution problems. This method uses an information theoretic approach to eliminate the majority of indirect interactions inferred by co-expression methods.

Results

We prove that ARACNE reconstructs the network exactly (asymptotically) if the effect of loops in the network topology is negligible, and we show that the algorithm works well in practice, even in the presence of numerous loops and complex topologies. We assess ARACNE's ability to reconstruct transcriptional regulatory networks using both a realistic synthetic dataset and a microarray dataset from human B cells. On synthetic datasets ARACNE achieves very low error rates and outperforms established methods, such as Relevance Networks and Bayesian Networks. Application to the deconvolution of genetic networks in human B cells demonstrates ARACNE's ability to infer validated transcriptional targets of the cMYC proto-oncogene. We also study the effects of misestimation of mutual information on network reconstruction, and show that algorithms based on mutual information ranking are more resilient to estimation errors.

Conclusion

ARACNE shows promise in identifying direct transcriptional interactions in mammalian cellular networks, a problem that has challenged existing reverse engineering algorithms. This approach should enhance our ability to use microarray data to elucidate functional mechanisms that underlie cellular processes and to identify molecular targets of pharmacological compounds in mammalian cellular networks.


© 1999-2008 BioMed Central Ltd unless otherwise stated