BMC Bioinformatics Volume 10
|
Viewing options:Associated material:Related literature:- Articles citing this article
- Other articles by authors
- Related articles/pages
Tools: Post to:
|
 Research articleDisease candidate gene identification and prioritization using protein interaction networksJing Chen1,2 , Bruce J Aronow1,2,3 and Anil G Jegga1,3  1Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA 2Department of Biomedical Engineering, University of Cincinnati, Cincinnati, OH, USA 3Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA author email corresponding author email
BMC Bioinformatics 2009,
10:73doi:10.1186/1471-2105-10-73
|
|
| Published: |
27 February 2009 |
Abstract
Background
Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN) analyses.
Results
For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds"), and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance.
Conclusion
Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization. |