Table 1 |
|
|
Word lists used to generate features |
|
|
Dictionary |
Source |
|
|
|
|
Proper name |
|
|
Common word |
Ispell, GNU spell-checker dictionary; inspired by Thomas et al. [8] |
|
Stop word |
Generic list of very common English words |
|
Medical word |
|
|
Drug word |
Generated from the Cerner Multum Drug Lexicon, (Denver, CO) |
|
Honorific |
Compiled by hand (e.g., mr., mrs, dr.) |
|
All user |
Users that have posted to this message board, generated from “author” field of each message |
|
User variant |
Users who have posted to this particular thread, with variants of these names derived automatically (strip digits, split by delimiters/camel case/known names and words) |
|
|
|
|
Benton et al. BMC Bioinformatics 2011 12(Suppl 3):S2 doi:10.1186/1471-2105-12-S3-S2 |
|