Open Access Open Badges Research article

Literature-based discovery of diabetes- and ROS-related targets

Junguk Hur12, Kelli A Sullivan2, Adam D Schuyler4, Yu Hong2, Manjusha Pande23, David J States5, H V Jagadish13 and Eva L Feldman123*

Author Affiliations

1 Bioinformatics Program, University of Michigan, Ann Arbor, MI 48109, USA

2 Department of Neurology, University of Michigan, Ann Arbor, MI 48109, USA

3 National Center for Integrative Biomedical Informatics, University of Michigan, Ann Arbor, MI 48109, USA

4 Department of Molecular, Microbial and Structural Biology, University of Connecticut Health Center, Farmington, CT 06030, USA

5 School of Health Information Science, University of Texas, Houston, TX 77030, USA

For all author emails, please log on.

BMC Medical Genomics 2010, 3:49  doi:10.1186/1755-8794-3-49

Published: 27 October 2010



Reactive oxygen species (ROS) are known mediators of cellular damage in multiple diseases including diabetic complications. Despite its importance, no comprehensive database is currently available for the genes associated with ROS.


We present ROS- and diabetes-related targets (genes/proteins) collected from the biomedical literature through a text mining technology. A web-based literature mining tool, SciMiner, was applied to 1,154 biomedical papers indexed with diabetes and ROS by PubMed to identify relevant targets. Over-represented targets in the ROS-diabetes literature were obtained through comparisons against randomly selected literature. The expression levels of nine genes, selected from the top ranked ROS-diabetes set, were measured in the dorsal root ganglia (DRG) of diabetic and non-diabetic DBA/2J mice in order to evaluate the biological relevance of literature-derived targets in the pathogenesis of diabetic neuropathy.


SciMiner identified 1,026 ROS- and diabetes-related targets from the 1,154 biomedical papers ( webcite). Fifty-three targets were significantly over-represented in the ROS-diabetes literature compared to randomly selected literature. These over-represented targets included well-known members of the oxidative stress response including catalase, the NADPH oxidase family, and the superoxide dismutase family of proteins. Eight of the nine selected genes exhibited significant differential expression between diabetic and non-diabetic mice. For six genes, the direction of expression change in diabetes paralleled enhanced oxidative stress in the DRG.


Literature mining compiled ROS-diabetes related targets from the biomedical literature and led us to evaluate the biological relevance of selected targets in the pathogenesis of diabetic neuropathy.