Table 1

KL Divergence on Training, Cross Validation and Testing Set

Unigram Feature

Relevant Probability

Irrelevant Probability


Training Set Vs Cross Validation Set

0.0216

0.0703

Training Set Vs Testing Set

0.0369

0.9926


(Top 50 features according to Chi-Square statistics)

Wang et al. BMC Bioinformatics 2008 9(Suppl 3):S4   doi:10.1186/1471-2105-9-S3-S4

Open Data