Open Access Highly Accessed Research article

Emergence of novel domains in proteins

Macarena Toll-Riera1,3 and M Mar Albà1,2*

Author Affiliations

1 Evolutionary Genomics Group, Research Programme on Biomedical Informatics (GRIB) - Hospital del Mar Research Institute (IMIM), Universitat Pompeu Fabra (UPF), Barcelona, Spain

2 Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain

3 Current address: Department of Zoology, University of Oxford, Oxford, UK

BMC Evolutionary Biology 2013, 13:47  doi:10.1186/1471-2148-13-47

Published: 20 February 2013

Additional files

Additional file 1:

Figure S1. Length distribution of domains of different age.
Table S2. Non-synonymous to synonymous (dN/dS) substitution rates for domains classified in different age classes defined using different Hmmpfam E-value cut-offs.
Table S3. List of the most abundant domains in each age group by total number of domain occurrences.
Table S4. Relationship between evolutionary rates and protein domain age.
Figure S5. Distribution of the non-synonymous to synonymous (dN/dS) substitution rates for each domain type.
Figure S6. Examples of human proteins containing Vertebrate and Old domains.
Figure S7. Distribution of non-synonymous to synonymous (dN/dS) values for D. melanogaster protein domains classified in different age groups.
Table S8. Relationship between evolutionary rate and protein domain age in D. melanogaster proteins.

Format: PDF Size: 1.6MB

Open Data

Additional file 2:

Additional Pfam annotations for proteins containing Pfam domains of different age, including Clan Accession number, Clan Identifier and Pfam Description.

Format: XLS Size: 198KB

Open Data

Additional file 3:

Complete list of proteins and Pfam domains employed in this study, including Ensembl Protein identifier, Pfam Accession number, age class, sequence length and non-synonymous (dN) and synonymous (dS) substitution rates.

Format: XLS Size: 11.9MB

Open Data