Table 1. Data set size

Data Set              Sentences   Gene Mentions
training              7500        9000
(development) test    2500        3000
(final) test          5000        6000

Yeh et al. BMC Bioinformatics 2005 6(Suppl 1):S2   doi:10.1186/1471-2105-6-S1-S2