Log on / register
Feedback | Support | My details
Open AccessHighly AccessSoftware

Literature Lab: a method of automated literature interrogation to infer biology from microarray analysis

Phillip G Febbo1,2,3 email, Mike G Mulligan4 email, David A Slonina4 email, Kimberly Stegmaier5 email, Dolores Di Vizio6 email, Paul R Martinez4 email, Massimo Loda6 email and Stephen C Taylor4 email

1Institute for Genome Science and Policy, Duke University, Durham, North Carolina, USA

2Department of Medicine, Duke University School of Medicine, Durham, North Carolina, USA

3Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, North Carolina, USA

4Acumenta Corporation, Boston, Massachusetts, USA

5Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts, USA

6Department of Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts, USA

author email corresponding author email

BMC Genomics 2007, 8:461doi:10.1186/1471-2164-8-461

Published: 18 December 2007

Abstract

Background

The biomedical literature is a rich source of associative information but too vast for complete manual review. We have developed an automated method of literature interrogation called "Literature Lab" that identifies and ranks associations existing in the literature between gene sets, such as those derived from microarray experiments, and curated sets of key terms (i.e. pathway names, medical subject heading (MeSH) terms, etc).

Results

Literature Lab was developed using differentially expressed gene sets from three previously published cancer experiments and tested on a fourth, novel gene set. When applied to the genesets from the published data including an in vitro experiment, an in vivo mouse experiment, and an experiment with human tumor samples, Literature Lab correctly identified known biological processes occurring within each experiment. When applied to a novel set of genes differentially expressed between locally invasive and metastatic prostate cancer, Literature Lab identified a strong association between the pathway term "FOSB" and genes with increased expression in metastatic prostate cancer. Immunohistochemistry subsequently confirmed increased nuclear FOSB staining in metastatic compared to locally invasive prostate cancers.

Conclusion

This work demonstrates that Literature Lab can discover key biological processes by identifying meritorious associations between experimentally derived gene sets and key terms within the biomedical literature.


© 1999-2008 BioMed Central Ltd unless otherwise stated