Additional file 1.

EntrezGene official symbols with PubMed abstracts and their aliases classified by the algorithm. Description of data: 73 randomly chosen official gene symbols that produced text corpora of PubMed abstracts and their aliases. Aliases were classified by the algorithm as “synonyms”, “ambiguous”, “aliases with PubMed abstract but not passing the filters”, or “aliases without PubMed abstracts”.

Format: XLS Size: 43KB Download file

This file can be viewed with: Microsoft Excel Viewer

Coimbra et al. BMC Genomics 2010 11(Suppl 5):S3   doi:10.1186/1471-2164-11-S5-S3