Table 6

Performance changes on the ACT development set by varying feature types.

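| Used Features         | Avg Prec | Precision | Recall | F1 score |
|-----------------------|----------|-----------|--------|----------|
| Baseline              | 0.7073   | 0.6403    | 0.6290 | 0.6346   |
| –Gene Anonymization   | 0.7017   | 0.6166    | 0.6320 | 0.6242   |
| –Multi-words          | 0.7035   | 0.6358    | 0.6349 | 0.6354   |
| –Sub-strings          | 0.7019   | 0.6329    | 0.6320 | 0.6324   |
| –MeSH Terms           | 0.7009   | 0.6410    | 0.6334 | 0.6372   |
| Baseline+Higher Order | 0.7077   | 0.6311    | 0.6496 | 0.6402   |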


The baseline is the result obtained from our system pipeline with the same settings used for Run 4. Each row prefixed with "–" shows the evaluation results when that feature type is removed from the baseline. The last row, in contrast, shows the results when higher-order features are added to the baseline.
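As a sanity check on the table above, the following minimal sketch recomputes the F1 column from the reported precision and recall and lists each row's F1 change relative to the baseline. It assumes that the F1 score is the usual harmonic mean of the two reported columns, which the source does not state explicitly. The names `TABLE6`, `f1_score` and `f1_deltas` are hypothetical helpers introduced here for illustration and are not part of the authors' system.

```python
# Values copied from Table 6: (precision, recall, reported F1).
# A leading "-" marks a feature type removed from the baseline.
TABLE6 = {
    "Baseline": (0.6403, 0.6290, 0.6346),
    "-Gene Anonymization": (0.6166, 0.6320, 0.6242),
    "-Multi-words": (0.6358, 0.6349, 0.6354),
    "-Sub-strings": (0.6329, 0.6320, 0.6324),
    "-MeSH Terms": (0.6410, 0.6334, 0.6372),
    "Baseline+Higher Order": (0.6311, 0.6496, 0.6402),
}


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (assumed definition of the F1 column)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def f1_deltas(table, reference="Baseline"):
    """Reported F1 of every other row minus the reported F1 of the reference row."""
    ref_f1 = table[reference][2]
    return {
        name: round(values[2] - ref_f1, 4)
        for name, values in table.items()
        if name != reference
    }


if __name__ == "__main__":
    for name, (p, r, reported) in TABLE6.items():
        print(f"{name:24s} recomputed F1={f1_score(p, r):.4f}  reported={reported:.4f}")
    for name, delta in f1_deltas(TABLE6).items():
        print(f"{name:24s} F1 change vs. baseline: {delta:+.4f}")
```

Because the table reports precision and recall rounded to four decimals, a recomputed F1 can differ from the reported one in the last digit. In the reported values, removing gene anonymization gives the largest F1 drop (0.6346 to 0.6242), while adding higher-order features gives the largest gain (0.6346 to 0.6402).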

Kim and Wilbur. BMC Bioinformatics 2011, 12(Suppl 8):S9. doi:10.1186/1471-2105-12-S8-S9
