Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Research article

Comprehensive analysis of co-occurring domain sets in yeast proteins

Inbar Cohen-Gihon1, Ruth Nussinov12 and Roded Sharan3*

Author Affiliations

1 Sackler Institute of Molecular Medicine, Department of Human Genetics, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel

2 Center for Cancer Research Nanobiology Program, SAIC-Frederick, Inc., NCI-Frederick, Frederick, Maryland 21702, USA

3 School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel

For all author emails, please log on.

BMC Genomics 2007, 8:161  doi:10.1186/1471-2164-8-161

Published: 11 June 2007

Abstract

Background

Protein domains are fundamental evolutionary units of protein architecture, composing proteins in a modular manner. Combinations of two or more, possibly non-adjacent, domains are thought to play specific functional roles within proteins. Indeed, while the number of potential co-occurring domain sets (CDSs) is very large, only a few of these occur in nature. Here we study the principles governing domain content of proteins, using yeast as a model species.

Results

We design a novel representation of proteins and their constituent domains as a protein-domain network. An analysis of this network reveals 99 CDSs that occur in proteins more than expected by chance. The identified CDSs are shown to preferentially include ancient domains that are conserved from bacteria or archaea. Moreover, the protein sets spanned by these combinations were found to be highly functionally coherent, significantly match known protein complexes, and enriched with protein-protein interactions. These observations serve to validate the biological significance of the identified CDSs.

Conclusion

Our work provides a comprehensive list of co-occurring domain sets in yeast, and sheds light on their function and evolution.