Table 1

Basic statistics of the JNLPBA dataset


# abstracts
# sentences
# words

Training Set
2,000
18,546
472,006 (236.00/abs) (22.97/sen)
Test Set
404
3,856
96,780 (239.55/abs) (22.72/sen)

Tsai et al. BMC Bioinformatics 2006 7:92   doi:10.1186/1471-2105-7-92