Three factors underlying incorrect in silico predictions of essential metabolic genes
Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive 0412, La Jolla, CA 92093, USA
BMC Systems Biology 2008, 2:14 doi:10.1186/1752-0509-2-14Published: 4 February 2008
The indispensability of certain genes in an organism is important for studies of microorganism physiology, antibiotic targeting, and the engineering of minimal genomes. Time and resource intensive genome-wide experimental screens can be conducted to determine which genes are likely essential. For metabolic genes, a reconstructed metabolic network can be used to predict which genes are likely essential. The success rate of these predictions is less than desirable, especially with regard to comprehensively locating essential genes.
We show that genes that are falsely predicted to be non-essential (for growth) share three characteristics across multiple organisms and growth media. First, these genes are on average connected to fewer reactions in the network than correctly predicted essential genes, suggesting incomplete knowledge of the functions of these genes. Second, they are more likely to be blocked (their associated reactions are prohibited from carrying flux in the given condition) than other genes, implying incomplete knowledge of metabolism surrounding these genes. Third, they are connected to less overcoupled metabolites.
The results presented herein indicate genes that cannot be correctly predicted as essential have commonalities in different organisms. These elucidated failure modes can be used to better understand the biology of individual organisms and to improve future predictions.