Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Research article

Investigation of factors affecting prediction of protein-protein interaction networks by phylogenetic profiling

Anis Karimpour-Fard1, Lawrence Hunter1 and Ryan T Gill2*

Author Affiliations

1 Center for Computational Pharmacology, University of Colorado School of Medicine, Aurora, Colorado 80045, USA

2 Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80309, USA

For all author emails, please log on.

BMC Genomics 2007, 8:393  doi:10.1186/1471-2164-8-393

Published: 29 October 2007

Abstract

Background

The use of computational methods for predicting protein interaction networks will continue to grow with the number of fully sequenced genomes available. The Co-Conservation method, also known as the Phylogenetic profiles method, is a well-established computational tool for predicting functional relationships between proteins.

Results

Here, we examined how various aspects of this method affect the accuracy and topology of protein interaction networks. We have shown that the choice of reference genome influences the number of predictions involving proteins of previously unknown function, the accuracy of predicted interactions, and the topology of predicted interaction networks. We show that while such results are relatively insensitive to the E-value threshold used in defining homologs, predicted interactions are influenced by the similarity metric that is employed. We show that differences in predicted protein interactions are biologically meaningful, where judicious selection of reference genomes, or use of a new scoring scheme that explicitly considers reference genome relatedness, produces known protein interactions as well as predicted protein interactions involving coordinated biological processes that are not accessible using currently available databases.

Conclusion

These studies should prove valuable for future studies seeking to further improve phylogenetic profiling methodologies as well for efforts to efficiently employ such methods to develop new biological insights.