Log on / register
Feedback | Support | My details
Open AccessResearch article

A network-based integrative approach to prioritize reliable hits from multiple genome-wide RNAi screens in Drosophila

Li Wang1 email, Zhidong Tu2 email and Fengzhu Sun1,3 email

1Molecular and Computational Biology Program, University of Southern California, Los Angeles, CA 90089, USA

2Rosetta Inpharmatics, a wholly owned subsidiary of Merck & Co., Inc, 401 Terry Ave N, Seattle WA 98109, USA

3MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST/Department of Automation, Tsinghua University, Beijing, PR China

author email corresponding author email

BMC Genomics 2009, 10:220doi:10.1186/1471-2164-10-220

Published: 12 May 2009

Abstract

Background

The recently developed RNA interference (RNAi) technology has created an unprecedented opportunity which allows the function of individual genes in whole organisms or cell lines to be interrogated at genome-wide scale. However, multiple issues, such as off-target effects or low efficacies in knocking down certain genes, have produced RNAi screening results that are often noisy and that potentially yield both high rates of false positives and false negatives. Therefore, integrating RNAi screening results with other information, such as protein-protein interaction (PPI), may help to address these issues.

Results

By analyzing 24 genome-wide RNAi screens interrogating various biological processes in Drosophila, we found that RNAi positive hits were significantly more connected to each other when analyzed within a protein-protein interaction network, as opposed to random cases, for nearly all screens. Based on this finding, we developed a network-based approach to identify false positives (FPs) and false negatives (FNs) in these screening results. This approach relied on a scoring function, which we termed NePhe, to integrate information obtained from both PPI network and RNAi screening results. Using a novel rank-based test, we compared the performance of different NePhe scoring functions and found that diffusion kernel-based methods generally outperformed others, such as direct neighbor-based methods. Using two genome-wide RNAi screens as examples, we validated our approach extensively from multiple aspects. We prioritized hits in the original screens that were more likely to be reproduced by the validation screen and recovered potential FNs whose involvements in the biological process were suggested by previous knowledge and mutant phenotypes. Finally, we demonstrated that the NePhe scoring system helped to biologically interpret RNAi results at the module level.

Conclusion

By comprehensively analyzing multiple genome-wide RNAi screens, we conclude that network information can be effectively integrated with RNAi results to produce suggestive FPs and FNs, and to bring biological insight to the screening results.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.