Open Access Highly Accessed Database

Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network

Helge G Roider1, Nadia Pavlova2, Ivaylo Kirov2, Stoyan Slavov2, Todor Slavov2, Zlatyo Uzunov2 and Bertram Weiss1*

Author Affiliations

1 Bayer Pharma AG, Müllerstr 178, 13342 Berlin, Germany

2 Metalife AG, Im Metapark 1, 79297 Winden, Germany

For all author emails, please log on.

BMC Bioinformatics 2014, 15:68  doi:10.1186/1471-2105-15-68

Published: 11 March 2014



Information about drug-target relations is at the heart of drug discovery. There are now dozens of databases providing drug-target interaction data with varying scope, and focus. Therefore, and due to the large chemical space, the overlap of the different data sets is surprisingly small. As searching through these sources manually is cumbersome, time-consuming and error-prone, integrating all the data is highly desirable. Despite a few attempts, integration has been hampered by the diversity of descriptions of compounds, and by the fact that the reported activity values, coming from different data sets, are not always directly comparable due to usage of different metrics or data formats.


We have built Drug2Gene, a knowledge base, which combines the compound/drug-gene/protein information from 19 publicly available databases. A key feature is our rigorous unification and standardization process which makes the data truly comparable on a large scale, allowing for the first time effective data mining in such a large knowledge corpus. As of version 3.2, Drug2Gene contains 4,372,290 unified relations between compounds and their targets most of which include reported bioactivity data. We extend this set with putative (i.e. homology-inferred) relations where sufficient sequence homology between proteins suggests they may bind to similar compounds. Drug2Gene provides powerful search functionalities, very flexible export procedures, and a user-friendly web interface.


Drug2Gene v3.2 has become a mature and comprehensive knowledge base providing unified, standardized drug-target related information gathered from publicly available data sources. It can be used to integrate proprietary data sets with publicly available data sets. Its main goal is to be a ‘one-stop shop’ to identify tool compounds targeting a given gene product or for finding all known targets of a drug. Drug2Gene with its integrated data set of public compound-target relations is freely accessible without restrictions at webcite.

Drug-target relations; Compound-protein relations; Drug development; Drug discovery; Drug repositioning; Knowledge base; Tool compounds; Biological effect; Bioactivity