Table 1

Computational coverage of the protein sequence space
Sequence spacea All proteins Protein regions All regions
LC TM CC SP
Total sequence space aa 5.64E + 09 4.14E + 08 3.74E + 08 6.78E + 07 5.43E + 07 9.10E + 08
% 100 7.3 6.6 1.2 1.0 16.1
Domain space aa 2.90E + 09 2.72E + 08 1.20E + 08 4.65E + 07 4.62E + 07 4.84E + 08
% 51.4 9.4b 4.1b 1.6b 1.6b 16.7b

aData for nr December 2011 is shown. Abbreviations: LC, regions of low complexity; TM, transmembrane regions; CC, coiled coils; SP, signal peptides.

bShown as relative percentage with respect to the 51.4% of domain space.

Rekapalli et al.

Rekapalli et al. BMC Genomics 2012 13:634   doi:10.1186/1471-2164-13-634

Open Data