Email updates

Keep up to date with the latest news and content from BMC Systems Biology and BioMed Central.

Open Access Highly Accessed Research article

A community effort towards a knowledge-base and mathematical model of the human pathogen Salmonella Typhimurium LT2

Ines Thiele12, Daniel R Hyduke3, Benjamin Steeb4, Guy Fankam3, Douglas K Allen5, Susanna Bazzani6, Pep Charusanti3, Feng-Chi Chen7, Ronan MT Fleming18, Chao A Hsiung7, Sigrid CJ De Keersmaecker9, Yu-Chieh Liao7, Kathleen Marchal9, Monica L Mo3, Emre Özdemir10, Anu Raghunathan11, Jennifer L Reed12, Sook-Il Shin11, Sara Sigurbjörnsdóttir13, Jonas Steinmann13, Suresh Sudarsan14, Neil Swainston1516, Inge M Thijs9, Karsten Zengler3, Bernhard O Palsson3, Joshua N Adkins17 and Dirk Bumann4*

Author Affiliations

1 Center for Systems Biology, University of Iceland, Reykjavik, Iceland

2 Faculty of Industrial Engineering, Mechanical Engineering & Computer Science University of Iceland, Reykjavik, Iceland

3 Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA

4 Infection Biology, Biozentrum, University of Basel, Basel, Switzerland

5 USDA-ARS, Plant Genetics Research Unit, Donald Danforth Plant Science Center, St Louis, MO, USA

6 Technical University Braunschweig, Institute for Bioinformatics & Biochemistry, Braunschweig, Germany

7 Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan

8 Science Institute, University of Iceland, Reykjavik, Iceland

9 Centre of Microbial and Plant Genetics, Department of Microbial & Molecular Systems, Katholieke Universiteit Leuven, Leuven, Belgium

10 Laboratory of Computational Systems Biotechnology, Ecole Polytechnique Fédérale de Lausanne, Swiss Institute of Bioinformatics, Lausanne, Switzerland

11 Department of Infectious Diseases, Mount Sinai School of Medicine, New York City, NY, USA

12 Department of Chemical & Biological Engineering, University of Wisconsin-Madison, Madison, WI, USA

13 Faculty of Life & Environmental Sciences, University of Iceland, Reykjavik, Iceland

14 Department of Biochemical and Chemical Engineering, Technische Universität Dortmund, Dortmund, Germany

15 School of Computer Science, The University of Manchester, Manchester, UK

16 The Manchester Centre for Integrative Systems Biology, Manchester Interdisciplinary Biocentre, The University of Manchester, Manchester, UK

17 Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA

For all author emails, please log on.

BMC Systems Biology 2011, 5:8  doi:10.1186/1752-0509-5-8


The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1752-0509/5/8


Received:26 May 2010
Accepted:18 January 2011
Published:18 January 2011

© 2011 Thiele et al; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Metabolic reconstructions (MRs) are common denominators in systems biology and represent biochemical, genetic, and genomic (BiGG) knowledge-bases for target organisms by capturing currently available information in a consistent, structured manner. Salmonella enterica subspecies I serovar Typhimurium is a human pathogen, causes various diseases and its increasing antibiotic resistance poses a public health problem.

Results

Here, we describe a community-driven effort, in which more than 20 experts in S. Typhimurium biology and systems biology collaborated to reconcile and expand the S. Typhimurium BiGG knowledge-base. The consensus MR was obtained starting from two independently developed MRs for S. Typhimurium. Key results of this reconstruction jamboree include i) development and implementation of a community-based workflow for MR annotation and reconciliation; ii) incorporation of thermodynamic information; and iii) use of the consensus MR to identify potential multi-target drug therapy approaches.

Conclusion

Taken together, with the growing number of parallel MRs a structured, community-driven approach will be necessary to maximize quality while increasing adoption of MRs in experimental design and interpretation.

Background

The evolution of antibiotic resistance by a variety of human pathogens is a looming public health threat [1,2]. Salmonella is a major human pathogen and a model organism for bacterial pathogenesis research [3]. S. enterica subspecies I serovar Typhimurium (S. Typhimurium) is the principle subspecies employed in molecular biology and its variants are causative agents in gastroenteritis in humans. The publication of the annotated genome for S. Typhimurium LT2 provided a foundation for numerous applications, such as drug discovery [4]. Previous efforts to systematically identify candidate drug targets within metabolism did not result in a plethora of new candidates, due to the robustness and redundancy of S. Typhimurium's metabolic network [5]. Since new single protein targets are missing, we need to target multiple proteins conjointly. Unfortunately, antibiotic regimens, which require multiple targets to be hit simultaneously, have an increased probability of the pathogen evolving resistance relative to a single target therapy. However, the continuous clinical success of the combination of beta-lactams and beta-lactamase inhibitors actually demonstrates that inhibitor combinations can be successful even if each individual inhibitor is non-effective on its own. The robustness inherent to S. Typhimurium's metabolic network imposes combinatorial challenges for in vitro and in vivo approaches to identify synthetic lethal genes sets (i.e., experimental enumeration of all synthetic lethal pairs in S. Typhimurium would require the creation of ~500,000 double gene deletion strains, see below). Employing a systems biology network perspective could facilitate their identification.

GEnome scale Network REconstructions (GENRE) [6] represent biochemical, genetic, and genomic (BiGG) knowledge-bases [7] for target organisms; and have been developed for expression [8,9], metabolic [6,10], regulatory [11], and signaling [12,13] networks. Metabolic reconstructions (MRs) are the most developed out of the four GENRES. The metabolic network reconstruction process is well established [14] and has been used for various biotechnological and biomedical applications [15,16]. Given the rapidly growing interest in MRs and modeling, parallel reconstruction efforts for the same target organism have arisen and resulted in alternative MRs for a number of organisms [17-23]. These parallel MRs may vary in content and format due to differences in reconstruction approaches, literature interpretation, and domain expertise of the reconstructing group. Subsequent network comparison and discoveries are hampered by these differences. Consequently, the need for a community approach to divide the substantial effort required in reconciling and expanding these MRs has been formulated [17].

Results and Discussion

Salmonella, a reconstruction jamboree for an infectious disease agent

In June 2008, it became apparent that two MRs were being assembled by two different research groups [20] (Bumann, unpublished data). Subsequently, a Salmonella reconstruction jamboree was held at the University of Iceland, Reykjavik, from September 5th to 6th, 2008. The jamboree team consisted of over 20 experts in microbiology, proteomics, Salmonella physiology, and computational modeling. Based on the experience with the yeast reconstruction jamboree [17], a methodology was devised to increase the efficiency of community-based network reconstruction [24] and applied to the Salmonella reconstruction jamboree.

The goal of a network reconstruction jamboree is to provide a 2-D genome annotation that is of higher quality than it may be achieved by bioinformatic analyses alone [24,25]. The objective of this jamboree was to re-evaluate, reconcile, and expand the currently available MRs for S. Typhimurium with a focus on virulence. Furthermore, we aimed to include standard identifiers for reconstruction metabolites, reactions, and genes to facilitate subsequent mapping of 'omics' data. The starting MRs were AJRecon (a variant is published in [20]) and BRecon (D. Bumann, unpublished data), which were derived from published E. coli MRs, iJR904 [26] and iAF1260 [27], respectively, and their contents were modified to account for Salmonella-specific properties; i.e., transport and enzymatic reactions not present in Salmonella were removed and the proteins associated with the reactions were modified to contain proteins present in S. Typhimurium LT2.

Comparison of two metabolic reconstructions for S. Typhimurium

We developed an automatic approach to initiate the reconciliation of the two MRs by converting their metabolites and reactions into a common language (Figure 1). The MR contents were grouped into three categories: (1) identical, (2) similar, and (3) dissimilar. A similar reaction was one, in which there was a minor discrepancy, such as reaction reversibility, a missing reactant or product, or a difference in associated enzyme(s). Dissimilar reactions were those with distinct sets of reactants and products, and often represented metabolic reactions that were not included in one of the starting MRS. The identical content was transferred to the consensus MR without further evaluation. The similar and dissimilar content was evaluated at the jamboree. Genes and proteins associated with the reactions were also carefully compared and refined where necessary. At its end, the meeting yielded an approximately 80% reconciled consensus reconstruction. The remaining discrepancies were manually curated by the Bumann and Palsson groups following the jamboree meeting.

thumbnailFigure 1. Approach to reconcile two metabolic reconstructions (MR). This figure illustrates the automated comparison tool developed for the Salmonella reconstruction jamboree. Both MRs are translated into a common language (based on KEGG [44]). Metabolites and reactions that could not be mapped onto KEGG were subject to manual evaluation by the jamboree team. The overlapping part of the MRs was directly moved into the consensus MR while reactions and metabolites unique to a MR were evaluated manually. This approach can be readily applied to comparison of any two MRs.

Initial comparison revealed that there were 760 reactions common to the starting MRs while 521 and 1684 reactions were unique to AJRecon and BRecon, respectively (Additional file 1 Table S1). Some of these differences could be explained by changes introduced to the E. coli MR when it was converted from its earlier version, iJR904 [26], to the most recent version, iAF1260 [27] (i.e., explicit definition of a periplasm compartment; more detailed fatty acid metabolism).

Additional file 1. Consensus MR. This xlsx file contains the consensus reconstruction and simulation setup/results. - Table S1. Statistics for automated reconciliation of starting reconstructions. - Table S2. Consensus Reconstruction in SBML format. - Table S3a. M9/glc. - Table S3b. LB. - Table S3c. Biomass. - Table S4. Growth benchmark errors. - Table S5a. All Lethal deletion predictions. - Table S5b. Single Deletion/Virulence. - Table S6a. LB Synthetic Lethal. - Table S6b. M9 Synthetic Lethal

Format: XLSX Size: 926KB Download fileOpen Data

Characteristics of the Salmonella Consensus Reconstruction

The resulting knowledge-base, STM_v1.0 (Table 1; Additional file 2; Additional file 1 Table S2), represents the final product of a community-effort to develop a detailed MR of S. Typhimurium. STM_v1.0 integrates the novel and common features of the starting MRs into a vetted, well-documented consensus knowledge-base, capturing currently available BiGG knowledge about S. Typhimurium. Key features of STM_v1.0 include i) accounting for the periplasm as a compartment between the extracellular space and cytoplasm; ii) Salmonella-specific virulence characteristics, such as iron chelation by salmochelin and serovar Typhimurium LT2 O-antigen production; iii) the possibility to employ the consensus MR as mathematical, predictive model; and iv) comprehensive support data for reactions and associated genes (Additional file 1 Table S2a). Some information was excluded from STM_v1.0, such as the 26 dipeptide and tripeptide transport/digestion reactions that are present in AJRecon, as they represent generic compounds. Accounting for all potential consumable oligopeptides would make computational analysis intractable or unnecessarily difficult. Appropriate oligopeptides may be manually added to STM_v1.0 to represent a specific growth environment. We also attempted to exclude reactions that were included to fit some growth data [28], but were contrary to other observations [20,29] as was the case for growth with D-aspartic acid [30] as the sole carbon source which requires an unknown transporter and an unknown aspartate racemase [31].

Table 1. Basic Statistics for the original and the consensus reconstructions.

Additional file 2. Consensus MR in SBML format. Consensus MR as a computational model in SBML format.

Format: ZIP Size: 149KB Download fileOpen Data

Additionally, we evaluated the reaction directionality of consensus MR reactions by considering thermodynamic properties of participating metabolites. In the case that a thermodynamic prediction was inconsistent with experimental evidence, the experimental evidence was followed. Thermodynamic predictions are made using the knowledge that is available [45], and incorrect predictions highlight gaps in our knowledge of biology.

A bacterial MR often includes a biomass reaction that lists all known biomass precursors and their fractional contribution necessary to produce a new bacterial cell in a given environment. The individual biomass constituents of a S. Typhimurium cell have been measured [20], and adapted for the consensus reconstruction by accounting for the changes in naming and compartments introduced during reconciliation (Additional file 1 Table S3c).

Comparison with a third metabolic reconstruction of S. Typhimurium

After finishing the consensus reconstruction, a third metabolic reconstruction (iMA945) was published [21]. Similar to one of our starting MRs (BRecon), iMA945 was built by using homology, and other bioinformatics criteria [32], starting from the E. coli metabolic reconstruction (iAF1260). Gaps in iMA945 were detected and filled with GapFind and GapFill, respectively [33]; and iMA945's content was further augmented by the GrowMatch algorithm [34] to fit experimental measurements. These automated optimization methods are excellent tools for identifying gaps in network reconstructions and proposing candidate reactions to fill these gaps and fit the model to growth data, however, they often do not associate genes with the candidate reactions. The candidate reactions are typically taken from a universal reaction database (such as KEGG) that includes pathways from all domains of life, thus candidate reactions proposed by these methods should be taken as hypotheses and require additional validation from published literature or direct experimental evidence.

We performed a preliminary comparison between STM_v1.0 and iMA945. However, we did not reconcile iMA945 with the consensus reconstruction, as this will require detailed evaluation of the discrepancies in a subsequent jamboree meeting. Overall, 2,057 reactions were present in both the consensus reconstruction and iMA945, of which 1,706 reactions have identical gene-protein-reaction (GPR) associations (Additional file 1 Table S2d). A total of 26 reactions had identical reaction identifiers but different reactions (e.g., different reactants, products, stoichiometry, or directionality: reversible, forward only, backward only) and GPR associations. There were a total of 629 distinct reaction ids between STM_v1.0 and iMA945: 446 were unique to STM_v1.0 and 183 to iMA945. Of the 183 reactions flagged as unique to iMA945, the majority represents reactions that were intentionally excluded from the consensus reaction (e.g., 45 dipeptide exchange, transport, and peptidase reactions and >60 additional exchange, transport, and enzymatic reactions not supported by literature). Some of the distinct reactions, such as adenosylcobalamin phosphate synthase, were due to different metabolite and reaction identifiers. No bibliomic data were included in iMA945, so it was not possible to assess whether the reactions were inserted by the automated gap-filling methods or supported by additional evidence. The 446 reactions unique to STM_v1.0 include Salmonella-specific chelators, O-antigens, and lipid modifications that were not present in the starting network derived from the E. coli MR (iAF1260). Overall, the core metabolic network is similar between STM_v1.0 and iMA945, which is expected as the draft scaffolds for both MRs were derived from E. coli MRs and S. Typhimurium has a notable metabolic homology with E. coli; however, STM_v1.0 includes over 300 more genes than iMA945 and includes a variety of Salmonella-specific reactions that are essential for virulence and could serve as coupling points for constructing a host-pathogen model.

Metabolic Network Reconstruction Assessment

To assess the utility of a mathematical approximation of reality, it is essential to determine the consistency of the model's predictions with real-world benchmarks. In the case of MRs, comparing experimental growth data with predicted biomass production is a commonly employed metric in benchmarking metabolic models [14]. Although biomass production is a commonly employed metric, the results should always be taken with a grain of salt; for instance, it is possible to improve the fitting of a model's predictions to growth data by including enzymatic reactions for which no evidence exists or which are contrary to published experimental observations. The reconstruction committee chose not to include invalidated enzymatic reactions that improved the fit between growth predictions and experimental observations; the failings of the model's predictions highlight areas where knowledge is lacking and experimental undertakings could identify new knowledge.

For S. Typhimurium, there is a wealth of experimental growth data [29]. Overall, we found good agreement between the qualitative growth phenotype predictions and the experimental data (Table 2 Additional file 1 Table S4); with the notable exception of sulfur metabolism where the prediction accuracy was about 40%. As we are becoming increasingly aware of the importance of sulfur-related metabolism in host-pathogen interactions [35-38], the deficiencies in our knowledge highlighted by this analysis represent viable targets for experimental enquiry. For the carbon and nitrogen sources accessible by AJRecon our results were comparable, however STM_v1.0 has the ability to metabolize 20 carbon sources and 15 nitrogen sources not accessible to AJRecon. The additional metabolic capabilities of STM_v1.0 are due, in part, to the presence of ~200 additional gene products in STM_v1.0.

Table 2. Growth benchmark results

Gene Essentiality Simulations

To combat the rise in antibiotic-resistant pathogens, it is crucial to identify new drug targets. Genes or sets of genes that are essential for growth are potential drug targets. To identify novel drug targets in STM_v1.0, we performed single and double gene deletion studies. We identified 201 essential genes in M9/glc, 144 of which were also essential in LB (Additional file 1 Table S5a). The synthetic lethal gene pair simulations were performed using only genes that were found to be non-essential in the condition of interest (Additional file 1 Table S6). In M9/glc, there were 87 synthetic lethal gene-pairs comprised of 102 unique genes. For E. coli, Suthers et al. [39] predicted 86 synthetic lethal gene-pairs, however, there were only 83 unique genes involved. In LB, there were 56 synthetic lethal gene-pairs comprised of 76 unique genes. Interestingly, 10 of LB synthetic lethal genes were also essential in M9/glc and were members of 12 of the LB synthetic lethal gene-pairs. The very small fraction of essential synthetic lethal gene pairs (< 100 synthetic lethalities out of >500,000 possibilities - assuming approx. 1000 non-essential metabolic genes) emphasizes the robustness of S. Typhimurium's metabolic network, which has previously been noted [5].

Candidate drug targets

Our observed, very small number of synthetic lethal pairs in STM_v1.0 indicates that antimicrobial regimens may need to target more than two elements to be effective. Unfortunately, it will take less time for a pathogen to evolve a solution to a conjoint two-target antimicrobial strategy compared to a single-target strategy. To reduce the probability of a pathogen evolving resistance to a conjoint two-target strategy, it may be plausible to employ a combination of two-target strategies. Although a combination approach may be suitable for dealing with antibiotic resistance, there are potential shortcomings associated with clearance and toxicity because all the components of a regimen must reach a target at a specific time with the requisite concentrations. Despite these difficulties, multi-component, multi-target drugs are becoming standard therapeutics for complex diseases, including cancer, diabetes, and infectious diseases [40]. Experimental identification and characterization of therapeutic strategies that require multiple targets for effectiveness is a resource intensive undertaking (e.g., creating over 500,000 double mutant strains). An in silico approach using an MR, such as STM_v1.0, could be implemented to prioritize the experiments by indicating which multi-target therapies would adversely affect the pathogen's metabolic capabilities.

As mentioned above, the synthetic gene deletion analysis yielded 56 synthetic lethal gene pairs disrupting growth of S. Typhimurium in silico. We grouped these gene pairs based on different criteria to assess their potential value as multi-drug targets (Figure 2). It is notable that five gene pairs are between protein complexes while a further three gene pairs are between genes involved in the same pathway - this indicates the presence of a layer of 'redundancy' for the enzyme or pathway that confers protection against a single-target therapy. Moreover, three of the genes involved in gene pairs are known to be essential for virulence, but not for growth, and have known inhibitors based on BRENDA [41]. This structured overview of in silico synthetic lethal gene pairs identified numerous candidate drug targets many of which have known inhibitors. In subsequent studies, these model-generated hypotheses need to be tested and validated.

thumbnailFigure 2. Candidate drug targets. The figure contains all predicted synthetic lethal interactions for STM_v1.0 in LB medium. A line connecting two genes represents a synthetic lethal pair. A group of genes surrounded by a dashed box represents a protein complex requiring all enclosed elements to function. Yellow background means associated with sulfur metabolism. Blue background indicates that the synthetic lethal pairs are functional homolog's. Red gene means that there is a chemical known to inhibit the gene-product in STM_v1.0.

Additional gene products shown to play a central role in virulence yet are not essential for growth in laboratory conditions or do not have an unequivocal functional annotation represent additional therapeutic targets. These gene products could serve as potential points for manipulating host metabolism [38], could be essential for metabolism in the host environment (e.g., Salmonella-containing vacuoles are nutrient poor)[42], and will represent an energy and materials demand when creating integrated metabolic and expression reconstructions [8,9]. Recent examples of relevant gene products that have not been annotated but are crucial for virulence include gene products STM3117-STM3120 [43]; as the metabolic functions of unannotated genes are elucidated, they will need to be incorporated into future revisions of the MR.

Conclusions

Taken together, the community-developed consensus MR is a curated reconstruction with the combined properties of the starting MRs and new information that was added during and after the reconstruction jamboree. The expanded metabolic versatility with a focus on virulence, updated annotation, including corrections, and curation of hundreds of additional reactions, genes, and metabolites by a community of experts present in STM_v1.0 highlights the value of a community-based approach. Another MR for S. Typhimurium was published after the jamboree [21], which was also based on an E. coli MR[27]. The reconciliation with this third MR will need to be done in subsequent jamboree meetings, which will also lead to a further expansion of knowledge and data included in the consensus knowledge-base. The publication of the third MR for S. Typhimurium emphasizes the importance and the value of the effort presented in this report as well as the need for additional outreach when assembling jamboree committees.

Methods

Metabolic network reconstructions of Salmonella enterica serovar Typhimurium LT2

The starting reconstructions, AJRecon and BRecon, were built on scaffolds derived from published E. coli MRS. AJRecon is a pre-publication version of iRR1083 [20], and was based on iJR904 [26]. For its scaffold, BRecon (Bumann, unpublished) employed iAF1260 [27]- a direct descendent of iJR904. The two reconstructions, differ in content due to: (1) different components being targeted for manual curation (e.g., BRecon extended Fe chelation and AJRecon extended lipid production), and (2) differences in E. coli MRs that were used as comparative genomics scaffolds for initializing the Salmonella MRs (e.g., iAF1260 accounted for the periplasm whereas its ancestor did not).

Method for community-based network reconstruction

There are three essential phases for community-based MR development: (1) preparation, (2) jamboree, and (3) reconstruction finalization [24]. The preparation and finalization phases are carried out by a small contingent of researchers, whereas, the collective knowledge of the community is harnessed during the jamboree. In the preparation phase, the two MRs were compared as described below in terms of metabolites, reactions, and gene-protein-reaction associations (GPRs). Overlapping content between both original MRs was directly moved into the consensus MR (Additional file 1 Table S1). Discrepancies in the listed three areas were presented to the jamboree team, which was split into three groups: metabolite curation, reaction curation group, and GPR curation group. The metabolites group curated the list of all metabolites present in either original MR for i) protonation state of metabolites at physiological pH, ii) missing metabolite identifiers: KEGGID, PubChemID, ChEBI ID, and iii) comparison of neutral formulae in reconstruction and metabolite databases. The reaction group was responsible for identifying evidence for orphan reactions in either original MR with and without a KEGG reaction ID. Reactions without a KEGG ID had to be extensively audited as there were no database evidences for the correctness of the reaction mechanisms. The GPR group had to resolve the discrepancies in GPR assignments using genome databases and literature. Each team evaluates their problem set based on evidence within the consensus MR and available resources (literature, databases, and annotations). Items that are not adequately addressed during the jamboree are subject to extensive manual curation during the MR finalization phase. The finalization phase includes: (1) manual curation, (2) benchmarking the consensus MR against experimentally-derived phenotypic data, and (3) MR dissemination. The consensus MR is expected to be maintained, updated and expanded in subsequent reconstruction jamborees.

Metabolic Reconstruction Reconciliation

Reconciling multiple MRs requires that the MRs' contents employ a common nomenclature so that the contents may be compared. For this work, we employed the KEGG database [44] as the source of common identifiers (Figure 1); although all of the reactions and metabolites in KEGG may not be accurate or complete, KEGG has the benefit of being an extensive, freely accessible resource used by the broader biological community. The complete consensus reconstruction can be found in Additional file 1 Table S6 and in Additional file 2 as an SBML file.

Thermodynamic directionality

Thermodynamic directionality for each reaction was calculated as described in [45]. Briefly, assuming a temperature of at 310.15 K, intracellular pH of 7.7, extracellular/periplasmic pH of 7.0, and a concentration range of 0.01-20 mM, we calculated upper and lower bounds on transformed reaction Gibbs energy, and assigned reaction directionality accordingly. Transport reactions were not subject to thermodynamic consistency analysis as there is still uncertainty associated with the directionality prediction of transmembrane transport.

Conversion of reconstruction into a mathematical model

The conversion of a reconstruction into a mathematical model has been described in detail elsewhere [14]. The unit of reaction fluxes was defined as mmol/gDW/hr.

Phenotypic assessment

Flux balance analysis [46] was employed to assess the STM_v1.0 model's ability to correctly predict biomass production in a variety of limiting conditions. The accuracy of the model was assessed by comparing the predictions to benchmarks drawn from experimental data [20,29]. In this assessment, there are four possible observations: (1) STM_v1.0 model correctly predicts growth (G/G), (2) STM_v1.0 model incorrectly predicts growth (G/NG), (3) STM_v1.0 model correctly predicts no growth (NG/NG), and (4) STM_v1.0 model incorrectly predicts no growth (NG/G). For a prediction to be counted as a true positive (G/G) or true negative (NG/NG), the prediction needed to match one or more experimental observations. The predictions were first compared with the Biolog phenotype microarray (PM) data http://www.biolog.com webcite. False positive predictions (G/NG) and false negative predictions (NG/G) were then compared with the data from Gutnick et al. [29] and references cited in Ragunathan et al. [20]. For limiting conditions not represented in the PM, predictions were only compared with data from Gutnick et al. [29] or cited in Ragunathan et al. [20].

Gene essentiality analysis

The gene deletion studies were performed by converting STM_v1.0 into a stoichiometric model and performing flux balance analysis [46]. For each gene, or gene pair, the associated reaction(s) were disabled (vmin, i = vmax, i = 0 mmol.gDW-1.hr-1) and the ability of the model to produce biomass was assessed, i.e., the biomass reaction was chosen as the objective function and maximized.

All simulations were performed using the COBRA Toolbox v2.0 [47] using Matlab (Mathworks, Inc) as the programming environment, and Tomlab (TomOpt, Inc) as the linear programming solver.

Authors' contributions

IT, DRH, BOP, JNA, and DB conceived the study. BS and DB compiled the BRecon. IT and DRH compiled the consensus MR. IT, DRH, BOP, and DB wrote the manuscript. GF and IT designed and performed initial MR comparisons. RMTF and DRH performed thermodynamic directionality analysis. DHR and IT carried out the computational analysis of the consensus MR. IT, BOP, DB, BS, DKA, SB, PC, FCC, RMTF, CAH, SCJK, YCL, KM, MLM, EÖ, AR, JLR, SIS, SS, JS, SS, NS, IMT, KZ, BOP, JNA, DB actively participated during and/or after the metabolic reconstruction jamboree to generate content for the consensus MR. All authors read and approved the final manuscript.

Acknowledgements

The authors would like to thank R. Archila and K.C. Soh for participation at the opening day of the reconstruction jamboree. I.T. would like to thank M. Herrgard for the valuable discussions. This work was supported in part by the National Institute of Allergy and Infectious Diseases NIH/DHHS through interagency agreement Y1-AI-8401-01. I.T. was supported in part by a Marie Curie International Reintegration Grant within the 7th European Community Framework Program (PIRG05-GA-2009-249261).

References

  1. Bjorkman J, Hughes D, Andersson DI: Virulence of antibiotic-resistant Salmonella typhimurium.

    Proceedings of the National Academy of Sciences of the United States of America 1998, 95(7):3949-3953. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  2. Norrby SR, Nord CE, Finch R: Lack of development of new antimicrobial drugs: a potential serious threat to public health.

    The Lancet infectious diseases 2005, 5(2):115-119. PubMed Abstract | Publisher Full Text OpenURL

  3. Ohl ME, Miller SI: Salmonella: a model for bacterial pathogenesis.

    Annual review of medicine 2001, 52:259-274. PubMed Abstract | Publisher Full Text OpenURL

  4. McClelland M, Sanderson KE, Spieth J, Clifton SW, Latreille P, Courtney L, Porwollik S, Ali J, Dante M, Du F, Hou S, Layman D, Leonard S, Nguyen C, Scott K, Holmes A, Grewal N, Mulvaney E, Ryan E, Sun H, Florea L, Miller W, Stoneking T, Nhan M, Waterston R, Wilson RK: Complete genome sequence of Salmonella enterica serovar Typhimurium LT2.

    Nature 2001, 413(6858):852-856. PubMed Abstract | Publisher Full Text OpenURL

  5. Becker D, Selbach M, Rollenhagen C, Ballmaier M, Meyer TF, Mann M, Bumann D: Robust Salmonella metabolism limits possibilities for new antimicrobials.

    Nature 2006, 440(7082):303-307. PubMed Abstract | Publisher Full Text OpenURL

  6. Feist AM, Herrgard MJ, Thiele I, Reed JL, Palsson BO: Reconstruction of biochemical networks in microorganismS.

    Nature reviews 2009, 7(2):129-143. PubMed Abstract | Publisher Full Text OpenURL

  7. Palsson BO: Systems biology: properties of reconstructed networks. New York: Cambridge University Press; 2006.

  8. Thiele I, Jamshidi N, Fleming RM, Palsson BO: Genome-scale reconstruction of Escherichia coli's transcriptional and translational machinery: a knowledge base, its mathematical formulation, and its functional characterization.

    PLoS Comput Biol 2009, 5(3):e1000312. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  9. Thiele I, Fleming RM, Bordbar A, Schellenberger J, Palsson BO: Functional characterization of alternate optimal solutions of Escherichia coli's transcriptional and translational machinery.

    Biophysical journal 2010, 98(10):2072-2081. PubMed Abstract | Publisher Full Text OpenURL

  10. Reed JL, Famili I, Thiele I, Palsson BO: Towards multidimensional genome annotation.

    Nature reviews 2006, 7(2):130-141. PubMed Abstract | Publisher Full Text OpenURL

  11. Gianchandani EP, Joyce AR, Palsson BO, Papin JA: Functional States of the genome-scale Escherichia coli transcriptional regulatory system.

    PLoS Comput Biol 2009, 5(6):e1000403. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  12. Hyduke DR, Palsson BO: Towards genome-scale signalling-network reconstructions.

    Nature reviews 2010, 11(4):297-307. PubMed Abstract | Publisher Full Text OpenURL

  13. Li F, Thiele I, Jamshidi N, Palsson BO: Identification of potential pathway mediation targets in Toll-like receptor signaling.

    PLoS Comput Biol 2009, 5(2):e1000292. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  14. Thiele I, Palsson BO: A protocol for generating a high-quality genome-scale metabolic reconstruction.

    Nature protocols 2010, 5(1):93-121. PubMed Abstract | Publisher Full Text OpenURL

  15. Feist AM, Palsson BO: The growing scope of applications of genome-scale metabolic reconstructions using Escherichia coli.

    Nat Biotech 2008, 26(6):659-667. Publisher Full Text OpenURL

  16. Oberhardt MA, Palsson BO, Papin JA: Applications of genome-scale metabolic reconstructionS.

    Molecular systems biology 2009, 5:320. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  17. Herrgard MJ, Swainston N, Dobson P, Dunn WB, Arga KY, Arvas M, Bluthgen N, Borger S, Costenoble R, Heinemann M, Hucka M, Le Novère N, Li P, Liebermeister W, Mo ML, Oliveira AP, Petranovic D, Pettifer S, Simeonidis E, Smallbone K, Spasić I, Weichart D, Brent R, Broomhead DS, Westerhoff HV, Kirdar B, Penttilä M, Klipp E, Palsson BØ, Sauer U, Oliver SG, Mendes P, Nielsen J, Kell DB: A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology.

    Nat Biotechnol 2008, 26(10):1155-1160. PubMed Abstract | Publisher Full Text OpenURL

  18. Jamshidi N, Palsson BO: Investigating the metabolic capabilities of Mycobacterium tuberculosis H37Rv using the in silico strain iNJ661 and proposing alternative drug targetS.

    BMC systems biology 2007, 1:26. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  19. Beste DJ, Hooper T, Stewart G, Bonde B, Avignone-Rossa C, Bushell ME, Wheeler P, Klamt S, Kierzek AM, McFadden J: GSMN-TB: a web-based genome-scale network model of Mycobacterium tuberculosis metabolism.

    Genome biology 2007, 8(5):R89. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  20. Raghunathan A, Reed J, Shin S, Palsson B, Daefler S: Constraint-based analysis of metabolic capacity of Salmonella typhimurium during host-pathogen interaction.

    BMC systems biology 2009, 3:38. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  21. Abuoun M, Suthers PF, Jones GI, Carter BR, Saunders MP, Maranas CD, Woodward MJ, Anjun MF: Genome scale reconstruction of a Salmonella metabolic model: comparison of similarity and differences with a commensal Escherichia coli strain.

    J Biol Chem 2009, 284(43):29480-8. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  22. Duarte NC, Becker SA, Jamshidi N, Thiele I, Mo ML, Vo TD, Srivas R, Palsson BO: Global reconstruction of the human metabolic network based on genomic and bibliomic data.

    Proceedings of the National Academy of Sciences of the United States of America 2007, 104(6):1777-1782. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  23. Ma H, Sorokin A, Mazein A, Selkov A, Selkov E, Demin O, Goryanin I: The Edinburgh human metabolic network reconstruction and its functional analysis.

    Molecular systems biology 2007, 3:135. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  24. Thiele I, Palsson BO: Reconstruction annotation jamborees: a community approach to systems biology.

    Molecular systems biology 2010, 6:361. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  25. Palsson BO: Two-dimensional annotation of genomes.

    Nat Biotechnol 2004, 22(10):1218-1219. PubMed Abstract | Publisher Full Text OpenURL

  26. Reed JL, Vo TD, Schilling CH, Palsson BO: An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR).

    Genome biology 2003, 4(9):R54.51-R54.12. BioMed Central Full Text OpenURL

  27. Feist AM, Henry CS, Reed JL, Krummenacker M, Joyce AR, Karp PD, Broadbelt LJ, Hatzimanikatis V, Palsson BO: A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information.

    Molecular systems biology 2007, 3:121. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  28. Biolog Inc [http://www.biolog.com/] webcite

  29. Gutnick D, Calvo JM, Klopotowski T, Ames BN: Compounds which serve as the sole source of carbon or nitrogen for Salmonella typhimurium LT-2.

    Journal of bacteriology 1969, 100(1):215-219. PubMed Abstract | PubMed Central Full Text OpenURL

  30. Bayne HG, Stokes JL: Amino acid metabolism of Salmonellae.

    Journal of bacteriology 1961, 81:126-129. PubMed Abstract | PubMed Central Full Text OpenURL

  31. Oikawa T: Alanine, Aspartate, and Asparagine Metabolism in Microorganisms. In Amino Acid Biosynthesis ~ Pathways, Regulation and Metabolic Engineering. Volume 5. Edited by Wendisch V. Berlin/Heidelberg: Springer; 2007::273-288. Publisher Full Text OpenURL

  32. Suthers PF, Dasika MS, Kumar VS, Denisov G, Glass JI, Maranas CD: A genome-scale metabolic reconstruction of Mycoplasma genitalium, iPS189.

    PLoS Comput Biol 2009, 5(2):e1000285. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  33. Satish Kumar V, Dasika MS, Maranas CD: Optimization based automated curation of metabolic reconstructions.

    BMC Bioinformatics 2007, 8:212. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  34. Kumar VS, Maranas CD: GrowMatch: an automated method for reconciling in silico/in vivo growth predictions.

    PLoS Comput Biol 2009, 5(3):e1000308. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  35. Justino MC, Almeida CC, Goncalves VL, Teixeira M, Saraiva LM: Escherichia coli YtfE is a di-iron protein with an important function in assembly of iron-sulphur clusters.

    FEMS microbiology letters 2006, 257(2):278-284. PubMed Abstract | Publisher Full Text OpenURL

  36. Hyduke DR, Jarboe LR, Tran LM, Chou KJ, Liao JC: Integrated network analysis identifies nitric oxide response networks and dihydroxyacid dehydratase as a crucial target in Escherichia coli.

    Proceedings of the National Academy of Sciences of the United States of America 2007, 104(20):8484-8489. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  37. Jarboe LR, Hyduke DR, Tran LM, Chou KJ, Liao JC: Determination of the Escherichia coli S-nitrosoglutathione response network using integrated biochemical and systems analysiS.

    The Journal of biological chemistry 2008, 283(8):5148-5157. PubMed Abstract | Publisher Full Text OpenURL

  38. Winter SE, Thiennimitr P, Winter MG, Butler BP, Huseby DL, Crawford RW, Russell JM, Bevins CL, Adams LG, Tsolis RM, Roth JR, Bäumler AJ: Gut inflammation provides a respiratory electron acceptor for Salmonella.

    Nature 2010, 467(7314):426-429. PubMed Abstract | Publisher Full Text OpenURL

  39. Suthers PF, Zomorrodi A, Maranas CD: Genome-scale gene/reaction essentiality and synthetic lethality analysis.

    Molecular systems biology 2009, 5:301. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  40. Zimmermann GR, Lehar J, Keith CT: Multi-target therapeutics: when the whole is greater than the sum of the parts.

    Drug discovery today 2007, 12(1-2):34-42. PubMed Abstract | Publisher Full Text OpenURL

  41. Barthelmes J, Ebeling C, Chang A, Schomburg I, Schomburg D: BRENDA, AMENDA and FRENDA: the enzyme information system in 2007.

    Nucleic Acids Res 2007, (35 Database):D511-514. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  42. Garcia-del Portillo F, Nunez-Hernandez C, Eisman B, Ramos-Vivas J: Growth control in the Salmonella-containing vacuole.

    Current opinion in microbiology 2008, 11(1):46-52. PubMed Abstract | Publisher Full Text OpenURL

  43. Shi L, Adkins JN, Coleman JR, Schepmoes AA, Dohnkova A, Mottaz HM, Norbeck AD, Purvine SO, Manes NP, Smallwood HS, Wang H, Forbes J, Gros P, Uzzau S, Rodland KD, Heffron F, Smith RD, Squier TC: Proteomic analysis of Salmonella enterica serovar typhimurium isolated from RAW 264.7 macrophages: identification of a novel protein that contributes to the replication of serovar typhimurium inside macrophages.

    J Biol Chem 2006, 281(39):29131-29140. PubMed Abstract | Publisher Full Text OpenURL

  44. Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, Yamanishi Y: KEGG for linking genomes to life and the environment.

    Nucleic Acids Res 2008, (36 Database):D480-484. PubMed Abstract | PubMed Central Full Text OpenURL

  45. Fleming RM, Thiele I, Nasheuer HP: Quantitative assignment of reaction directionality in constraint-based models of metabolism: application to Escherichia coli.

    Biophys Chem 2009, 145(2-3):47-56. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  46. Varma A, Palsson BO: Metabolic Flux Balancing: Basic concepts, Scientific and Practical Use.

    Nat Biotechnol 1994, 12:994-998. Publisher Full Text OpenURL

  47. Schellenberger J, Que R, Fleming RMT, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NER, Kang J, Hydruke D, Palsson BO: Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0.

    Nat Prot, in press. OpenURL