Email updates

Keep up to date with the latest news and content from BMC Bioinformatics and BioMed Central.

Open Access Research article

Enhancing the prediction of protein pairings between interacting families using orthology information

Jose MG Izarzugaza1, David Juan1, Carles Pons24, Florencio Pazos3* and Alfonso Valencia14

Author Affiliations

1 Structural Bioinformatics Group, Spanish National Cancer Research Centre (CNIO), C/Melchor Fernández Almagro, 3. 28029, Madrid, Spain

2 Barcelona Supercomputing Centre, C/Jordi Girona, 29. 08034, Barcelona, Spain

3 Computational Systems Biology Group, National Centre for Biotechnology (CNB-CSIC), C/Darwin, 3. Cantoblanco, 28049, Madrid, Spain

4 Spanish National Bioinformatics Institute (INB), C/Melchor Fernández Almagro, 3. 28029, Madrid, Spain

For all author emails, please log on.

BMC Bioinformatics 2008, 9:35  doi:10.1186/1471-2105-9-35

Published: 23 January 2008

Abstract

Background

It has repeatedly been shown that interacting protein families tend to have similar phylogenetic trees. These similarities can be used to predicting the mapping between two families of interacting proteins (i.e. which proteins from one family interact with which members of the other). The correct mapping will be that which maximizes the similarity between the trees. The two families may eventually comprise orthologs and paralogs, if members of the two families are present in more than one organism. This fact can be exploited to restrict the possible mappings, simply by impeding links between proteins of different organisms. We present here an algorithm to predict the mapping between families of interacting proteins which is able to incorporate information regarding orthologues, or any other assignment of proteins to "classes" that may restrict possible mappings.

Results

For the first time in methods for predicting mappings, we have tested this new approach on a large number of interacting protein domains in order to statistically assess its performance. The method accurately predicts around 80% in the most favourable cases. We also analysed in detail the results of the method for a well defined case of interacting families, the sensor and kinase components of the Ntr-type two-component system, for which up to 98% of the pairings predicted by the method were correct.

Conclusion

Based on the well established relationship between tree similarity and interactions we developed a method for predicting the mapping between two interacting families using genomic information alone. The program is available through a web interface.