BMC Bioinformatics

official impact factor 3.03

Open Access Research article

Statistical modeling of biomedical corpora: mining the Caenorhabditis Genetic Center Bibliography for genes related to life span

DM Blei1*, K Franks2, MI Jordan3,4* and IS Mian2*

Author Affiliations

1 Computer Science Department, Princeton University, Princeton, New Jersey 08540 USA

2 Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720-8265, USA

3 Department of Statistics, University of California Berkeley, Berkeley, California 94720, USA

4 Department of EECS, University of California Berkeley, Berkeley, California 94720, USA

For all author emails, please log on.

BMC Bioinformatics 2006, 7:250 doi:10.1186/1471-2105-7-250

Published: 8 May 2006

Additional files

Additional File 1:

Results for each of the LDA topics specified by a 50-topic model estimated from a corpus of 5,225 documents and a 28,971 word vocabulary.

Format: PDF Size: 1.3MB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data