Table 3

Statistics of the EPI corpus

Item

Train

Devel

Test

Total


Abstract

600

200

400

1,200

Word

127,312

43,497

82,819

253,628

Protein

7,595

2,499

5,096

15,190

Event

1,852

601

1,261

3,714

Modification

173

79

117

369


Test set statistics shown only for the primary test data.

Pyysalo et al. BMC Bioinformatics 2012 13(Suppl 11):S2   doi:10.1186/1471-2105-13-S11-S2

Open Data