Table 1

Features Used Description of the Full Feature Set Used In the Closed Section Submission.

Word Features

wi

wi-1

wi+1

Last "real" word

Next "real" word

Disjunction of 4 previous words

Disjunction of 4 next words


Bigrams

wi + wi-1

wi + wi+1


TnT POS

POSi

POSi-1

POSi+1


Character Substrings

Up to a length of 6


Abbreviations

abbri

abbri-1 + abbri

abbri + abbri+1

abbri-1 + abbri + abbri+1


Word Shape

shapei

shapei-1

shapei+1

shapei-1 + shapei

shapei + shapei+1

shapei-1 + shapei + shapei+1


Previous NE

NEi-1

NEi-2 + NEi-1


Previous NE + Word

NEi-1 + wi


Previous NE + POS

NEi-1 + POSi-1 + POSi

NEi-2 + NEi-1 + POSi-2 + POSi-1 + POSi


Previous NE + Shape

NEi-1 + shapei

NEi-1 + shapei+1

NEi-1 + shapei-1 + shapei

NEi-2 + NEi-1 + shapei-2 + shapei-1 + shapei


Paren-Matching

A feature that signals when one parentheses in a pair has been assigned a different tag than the other in a window of 4 words


Finkel et al. BMC Bioinformatics 2005 6(Suppl 1):S5   doi:10.1186/1471-2105-6-S1-S5

Open Data