This article is part of the supplement: Selected articles from the Third International Symposium on Optimization and Systems Biology
Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces
1 Bioinformatics and Bioengineering Program, The Methodist Hospital Research Institute, Weill Medical College, Cornell University, Houston, TX 77030, USA
2 Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100080, China
BMC Systems Biology 2010, 4(Suppl 2):S6 doi:10.1186/1752-0509-4-S2-S6Published: 13 September 2010
Predicting drug-protein interactions from heterogeneous biological data sources is a key step for in silico drug discovery. The difficulty of this prediction task lies in the rarity of known drug-protein interactions and myriad unknown interactions to be predicted. To meet this challenge, a manifold regularization semi-supervised learning method is presented to tackle this issue by using labeled and unlabeled information which often generates better results than using the labeled data alone. Furthermore, our semi-supervised learning method integrates known drug-protein interaction network information as well as chemical structure and genomic sequence data.
Using the proposed method, we predicted certain drug-protein interactions on the enzyme, ion channel, GPCRs, and nuclear receptor data sets. Some of them are confirmed by the latest publicly available drug targets databases such as KEGG.
We report encouraging results of using our method for drug-protein interaction network reconstruction which may shed light on the molecular interaction inference and new uses of marketed drugs.