Table 2

Percentage of residues in the different datasets.

Dataset

Number of residues with disorder annotation (%)

Number of residues with order annotation (%)

Non-annotated residues (%)

Total number of residues in dataset


DisProt r4.5

(Jul 2008)

24.7

1.2

74.1

239120

(in 520 proteins)

Remark 465

7.2

53.7

39.1

164793

(in 364 proteins)

SL

26.3

33.0

40.7

239120

(in 520 proteins)


The SL dataset comprises the DisProt release 4.5 data in addition to residues in the same proteins annotated as having an ordered 3D structure found by similarity searches among sequences of known tertiary structure. Since some of these structures contain unidentified segments (Remark 465 regions), the number of residues with disorder annotation in SL is slightly larger than in DisProt

Sirota et al. BMC Genomics 2010 11(Suppl 1):S15   doi:10.1186/1471-2164-11-S1-S15

Open Data