<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1752-0509-6-9</ui><ji>1752-0509</ji><fm>
<dochead>Research article</dochead>
<bibl>
<title>
<p>Predicting outcomes of steady-state <sup>13</sup>C isotope tracing experiments using Monte Carlo sampling</p>
</title>
<aug>
<au id="A1"><snm>Schellenberger</snm><fnm>Jan</fnm><insr iid="I1"/><email>j.schellenberger@gmail.com</email></au>
<au id="A2"><snm>Zielinski</snm><mi>C</mi><fnm>Daniel</fnm><insr iid="I2"/><email>dczielin@ucsd.edu</email></au>
<au id="A3"><snm>Choi</snm><fnm>Wing</fnm><insr iid="I2"/><email>wing.choi@gmail.com</email></au>
<au id="A4"><snm>Madireddi</snm><fnm>Sunthosh</fnm><insr iid="I2"/><email>msunthosh@gmail.com</email></au>
<au id="A5"><snm>Portnoy</snm><fnm>Vasiliy</fnm><insr iid="I2"/><email>vasiliy.portnoy@gmail.com</email></au>
<au id="A6"><snm>Scott</snm><mi>A</mi><fnm>David</fnm><insr iid="I3"/><email>dscott@sanfordburnham.org</email></au>
<au id="A7"><snm>Reed</snm><mi>L</mi><fnm>Jennifer</fnm><insr iid="I4"/><email>reed@cae.wisc.edu</email></au>
<au id="A8"><snm>Osterman</snm><mi>L</mi><fnm>Andrei</fnm><insr iid="I3"/><email>osterman@sanfordburnham.org</email></au>
<au id="A9" ca="yes"><snm>Palsson</snm><mi>&#8709;</mi><fnm>Bernhard</fnm><insr iid="I2"/><email>palsson@ucsd.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Bioinformatics and Systems Biology Program, University of California - San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0419 USA</p></ins>
<ins id="I2"><p>Department of Bioengineering, University of California - San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0412 USA</p></ins>
<ins id="I3"><p>Bioinformatics and Systems Biology Program, Sanford-Burnham Institute, 10901 North Torrey Pines Road, La Jolla, CA, 92037 USA</p></ins>
<ins id="I4"><p>Department of Chemical and Biological Engineering, University of Wisconsin Madison, 1415 Engineering Drive, Madison, WI, 53706-1607 USA</p></ins>
</insg>
<source>BMC Systems Biology</source>
<section><title><p>Synthetic biology</p></title></section>
<issn>1752-0509</issn>
<pubdate>2012</pubdate>
<volume>6</volume>
<issue>1</issue>
<fpage>9</fpage>
<url>http://www.biomedcentral.com/1752-0509/6/9</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1752-0509-6-9</pubid><pubid idtype="pmpid">22289253</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>4</day><month>8</month><year>2011</year></date></rec><acc><date><day>30</day><month>1</month><year>2012</year></date></acc><pub><date><day>30</day><month>1</month><year>2012</year></date></pub></history>
<cpyrt><year>2012</year><collab>Schellenberger et al. ; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>Carbon-13 (<sup>13</sup>C) analysis is a commonly used method for estimating reaction rates in biochemical networks. The choice of carbon labeling pattern is an important consideration when designing these experiments. We present a novel Monte Carlo algorithm for finding the optimal substrate input label for a particular experimental objective (flux or flux ratio). Unlike previous work, this method does not require assumption of the flux distribution beforehand.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>Using a large <it>E. coli </it>isotopomer model, different commercially available substrate labeling patterns were tested computationally for their ability to determine reaction fluxes. The choice of optimal labeled substrate was found to be dependent upon the desired experimental objective. Many commercially available labels are predicted to be outperformed by complex labeling patterns. Based on Monte Carlo Sampling, the dimensionality of experimental data was found to be considerably less than anticipated, suggesting that effectiveness of <sup>13</sup>C experiments for determining reaction fluxes across a large-scale metabolic network is less than previously believed.</p>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>While <sup>13</sup>C analysis is a useful tool in systems biology, high redundancy in measurements limits the information that can be obtained from each experiment. It is however possible to compute potential limitations before an experiment is run and predict whether, and to what degree, the rate of each reaction can be resolved.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>
<it>In vivo </it>metabolic reaction flux data provides insight into the dynamic function of the cell <abbrgrp>
<abbr bid="B1">1</abbr>
<abbr bid="B2">2</abbr>
<abbr bid="B3">3</abbr>
</abbrgrp>. One widely-used experimental method for measuring <it>in vivo </it>reaction fluxes is steady-state substrate <sup>13</sup>C isotope labeling <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B5">5</abbr>
<abbr bid="B6">6</abbr>
</abbrgrp>. An overview of the general <sup>13</sup>C methods is described in Figure <figr fid="F1">1</figr>. Isotopomers, or isomers created from inserting labeled isotopes (often <sup>13</sup>C) at different positions in a molecule, provide a unique way to track the progress of carbon through a metabolic network. By measuring the enrichment for <sup>13</sup>C in metabolite pools after growing on a <sup>13</sup>C labeled substrate, inferences about the internal flux state can be made. The approach can be summarized as a data fitting problem between simulated and experimentally measured <sup>13</sup>C labeled metabolite concentrations. An isotopomer model, describing the positional transfer of carbon atoms for all or a subset of reactions in the network, is used to simulate data (Figure <figr fid="F1">1a</figr>). For a specified carbon input label, an isotopomer model enables the calculation of an isotopomer distribution vector (IDV) corresponding to a particular simulated steady-state flux distribution (Figure <figr fid="F1">1b</figr>). Mass spectrometry (MS) experiments on <sup>13</sup>C-labeled metabolites (e.g. macromolecules) generate fractional <sup>13</sup>C enrichments from fragmented macromolecules, forming a mass distribution vector (MDV) (Figure <figr fid="F1">1c</figr>). The error between the measured MDV and the MDV corresponding to the simulated IDV summarizes how well the presumed flux distribution fits the <sup>13</sup>C experiment. The flux distribution <it>v </it>that minimizes this error can be computed by solving a non-linear optimization problem. Simulating <sup>13</sup>C enrichment given a flux distribution is computationally inexpensive; however, the inverse problem of calculating the flux distribution that best fits a <sup>13</sup>C experiment is both of greater interest and significantly more computationally difficult (Figure <figr fid="F1">1d</figr>). A review of these methods and associated challenges can be found in <abbrgrp>
<abbr bid="B6">6</abbr>
<abbr bid="B7">7</abbr>
<abbr bid="B8">8</abbr>
</abbrgrp>.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Isotopomer Overview</p></caption><text>
   <p><b>Isotopomer Overview</b>. a) definition of the network, including carbon fates b) isotopomer balance equations - solving these equations yields the Isotopomer Distribution Vector (IDV) c) experimental data are compared to computed Mass Distribution Vectors (MDV) yielding experimental fit. d) two types of possible computations - the forward computation uses a flux distribution as input to compute the MDV, while the inverse problem attempts to find the flux distribution that minimizes the experimental discrepancy.</p>
</text><graphic file="1752-0509-6-9-1" hint_layout="double"/></fig>
<p>There are several distinct sources of variability in a <sup>13</sup>C experiment that limit the confidence with which particular reactions can be determined. First, due to experimental accuracy limitations and biological variability, uncertainty arises in the experimentally measured MDV. Second, due to alternate pathways present in metabolic networks, the mass balance equations underlying a metabolic steady-state are significantly under-determined <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>. While the full network flux distribution may not be resolved at high confidence by a given experiment, certain labeling patterns may resolve fluxes through certain pathways with greater confidence than other labeling patterns, as has previously been shown <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B10">10</abbr>
</abbrgrp>.</p>
<p>For a given n-carbon compound, there are 2<it>
<sup>n </sup>
</it>possible <sup>13</sup>C labeling states (as well as mixtures), and the choice of label is known to affect the ability to determine reactions fluxes <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>. As <sup>13</sup>C methods are based upon computational modeling of isotopomer distributions, it is possible to computationally optimize the choice of substrate labeling pattern to enhance the information gained from an experiment. There are two primary motivations that drive such an endeavor. First, <sup>13</sup>C experiments are expensive, so choosing the best experiment <it>a priori </it>is desirable. Second, we can assess the capability of the steady-state <sup>13</sup>C labeling approach towards determining reaction fluxes in an unbiased manner. The issue of optimization of <sup>13</sup>C labeling experiments has been addressed in the literature <abbrgrp>
<abbr bid="B4">4</abbr>
<abbr bid="B10">10</abbr>
<abbr bid="B11">11</abbr>
</abbrgrp>. However, the use of flux sampling for optimal isotopomer experiment prediction has not been explored previously, and this approach presents several unique advantages over previous methods.</p>
<p>We describe a Monte Carlo sampling-based method for choosing the optimal substrate label, based upon the Constraint-Based Reconstruction and Analysis (COBRA) computational platform <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
</abbrgrp>. COBRA methods use manually-curated biochemical network reconstructions of known reaction stoichiometries and measurable nutrient uptake and secretion rates to define feasible ranges for internal reaction fluxes. Many of these reconstructions have been generated <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp> and the procedure is well-established <abbrgrp>
<abbr bid="B13">13</abbr>
<abbr bid="B15">15</abbr>
</abbrgrp>. These models can be used for methods such as computing growth rates <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B17">17</abbr>
</abbrgrp>, predicting the effects of gene knockouts <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B18">18</abbr>
<abbr bid="B19">19</abbr>
</abbrgrp>, predicting the endpoint of adaptive evolutions <abbrgrp>
<abbr bid="B20">20</abbr>
</abbrgrp>, and designing strains for industrial production <abbrgrp>
<abbr bid="B21">21</abbr>
<abbr bid="B22">22</abbr>
</abbrgrp>. A review of these methods can be found here <abbrgrp>
<abbr bid="B12">12</abbr>
<abbr bid="B13">13</abbr>
<abbr bid="B23">23</abbr>
</abbrgrp>. Monte Carlo sampling of constraint-based metabolic models can be used to generate sets of biochemically feasible flux distributions that obey measured uptake and secretion rate constraints <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>. IDVs generated from these flux distributions in an isotopomer model can then be compared against simulated <sup>13</sup>C data to evaluate the ability of the experiment to determine reaction fluxes. Monte Carlo sampling takes advantage of the speed with which IDVs can be simulated from putative flux distributions, making this approach suitable for large-scale analysis of <it>in silico </it>experiments.</p>
<p>A Monte Carlo sampling approach was implemented using a newly developed isotopomer model to evaluate the efficiency of different carbon labeling patterns toward determining reaction fluxes in <it>E. coli</it>. The dimensionality of simulated <sup>13</sup>C data was calculated using singular value decomposition (SVD) for different substrate labeling patterns and compared to the number of undetermined dimensions in the network. <sup>13</sup>C experiments were performed for three substrate labeling patterns to validate the prior theoretical analysis. The methods developed represent a flexible computational analysis that can be applied to various biological systems and experimental setups to estimate, <it>a priori</it>, the efficiency of isotopomer experiments in determining reaction fluxes.</p>
</sec>
<sec>
<st>
<p>Results and Discussion</p>
</st>
<sec>
<st>
<p>Expanded Isotopomer Model</p>
</st>
<p>An isotopomer model was constructed in two phases. First, a central metabolic isotopomer model that accounts for 85 reactions including glycolysis, the TCA cycle, the pentose phosphate pathway, oxidative phosphorylation, pyruvate metabolism, and anaplerotic reactions was derived from the iJR904 <it>E. coli </it>reconstruction <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>. This initial model was equivalent in reaction content to commonly used isotopomer models for <it>E. coli </it>
<abbrgrp>
<abbr bid="B25">25</abbr>
<abbr bid="B26">26</abbr>
</abbrgrp>.</p>
<p>An expanded model was then constructed that includes both central and biosynthetic pathways. The iMC1010 metabolic network <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp> was evaluated to determine which reactions can sustain non-zero fluxes during growth on glucose, acetate, or lactate when only certain by-products are allowed to be secreted (acetate, formate, D-lactate, pyruvate, succinate, glycerol, CO<sub>2</sub>, and ethanol). Blocked reactions, which must have zero net flux at steady state, were subsequently omitted from consideration. Groups of reactions that could be merged together without affecting model results (e.g. linear pathways) were combined in order to reduce the number of variables. Large sets of biosynthesis reactions that produce phospholipids, nucleotides, co-factors were also combined, since there no experimental measurements existed for these high-carbon metabolites. However, by-products resulting from high-carbon metabolite production (e.g. CO<sub>2</sub>, formate, succinate, fumarate, and pyruvate) that could enter back into the metabolic network were tracked. Of the original 932 reactions in the complete metabolic iMC1010 network, nearly a third were represented in the biosynthetic isotopomer model, either individually or as grouped reactions.</p>
<p>The final isotopomer model accounts for a total of 313 irreversible reactions, including 278 which track carbon. Inclusion of these additional pathways is likely important for accurate assessment of the flux-resolving power of <sup>13</sup>C experiments both within and beyond central metabolism <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>. A complete listing of the reactions and metabolites in the biosynthetic network can be found in the Additional File <supplr sid="S1">1</supplr>.</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>Z-scores, confidence intervals, and isotopomer model</b>. This file contains the absolute and relative Z-scores for individual reactions across all glucose labeling patterns tested, confidence intervals calculated using experimental <sup>13</sup>C tracing data, and details of the model that was used in calculations.</p>
</text>
<file name="1752-0509-6-9-S1.XLSX">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Monte Carlo Sampling Approach</p>
</st>
<p>To compute possible flux distributions of the <it>E. coli </it>model, the network was sampled using a Markov Chain, Monte Carlo (MCMC) sampling algorithm (see Methods). The steady-state mass balance and uptake rate constraints for the metabolic network create a convex hyperspace that contains all biochemically feasible steady-state flux distributions <abbrgrp>
<abbr bid="B27">27</abbr>
</abbrgrp>. Monte Carlo sampling generates a set of flux distributions that are spread uniformly throughout the feasible space. The inclusion of <sup>13</sup>C experimental data reduces the feasible space in which the true flux state must lie by requiring that the IDV calculated from the putative flux distribution must match the experimental data within error. While the space of feasible flux distributions depends only on reaction stoichiometry, the space of resulting simulated IDVs differs depending on the input substrate labeling pattern. Hence, different labeling patterns can have differing abilities to resolve each reaction flux.</p>
<p>Here, we used Monte Carlo sampling of flux distributions to analyze the degree to which reaction fluxes can be determined by steady-state <sup>13</sup>C labeling experiments in terms of several possible experimental objectives. For example, one possible experimental objective is to determine whether a particular reaction has a flux above or below a specified value. For this objective, a well-designed labeling pattern would be one in which flux distributions that have an objective reaction flux greater than the specified value can be easily distinguished from flux distributions with an objective reaction flux less than the specified value. As seen in Figure <figr fid="F2">2</figr>, a hypothetical experiment 1 produces measurement distributions which overlap whereas experiment 2 shows greater separation. If one were interested in differentiating between the two partitions, experiment 2 would be much preferable. This method allows for the scoring of any label for any given experimental objective without first knowing the true cellular flux distribution <it>v</it>.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Method Overview</p></caption><text>
   <p><b>Method Overview</b>. a) The space of flux distributions is partitioned in two parts corresponding to 'high' flux versus 'low' flux. A uniform random sample is drawn from the space and is also partitioned into partition 1 and partition 2. b) For each point in the space the distribution of experimental measurements is simulated. Hypothetical experiment 1 and experiment 2 with different glucose label mixtures produce different measurement distributions. The distributions from experiment 2 are more separated, indicating parameters of experiment 2 are more conducive for differentiating between the high and low partition.</p>
</text><graphic file="1752-0509-6-9-2" hint_layout="double"/></fig>
</sec>
<sec>
<st>
<p>
<b>Generating and Evaluating </b>
<sup>13</sup>
<b>C Experimental Hypotheses</b>
</p>
</st>
<p>An experimental hypothesis is defined as a partition of the sampled flux distribution set. While many possible hypotheses could be considered, two rational hypotheses were studied. The first case attempts to elucidate whether a reaction has high or low flux (hi-lo). The solution space is partitioned into all points with <it>v<sub>j </sub>&gt; </it>threshold versus <it>v<sub>j </sub>&lt; </it>threshold. A different hypothesis is generated for each reaction <it>j</it>. The threshold was chosen to be the median of all <it>v<sub>j </sub>
</it>so that half of all points would be in each of the two partitions. The second set of hypotheses tested consisted of biologically relevant flux ratios. For each point the ratio of two reactions, <it>v<sub>i</sub>
</it>/<it>v<sub>j</sub>
</it>, was determined to be above or below some threshold that formed a partition.</p>
<p>Intuitively, a hypothesis score should be high if the isotopomer distributions coming from one partition are distinguishable from distributions in the other partition. While there are several ways of doing this, we chose a heuristic metric based on a Z-score, which is commonly used to determine the difference between two samples. A Z-score was calculated for each fragment (element) of the calculated MDV for each simulated flux distribution:</p>
<p>
<display-formula>
<m:math name="1752-0509-6-9-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:msub>
      <m:mrow>
         <m:mi>Z</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
      </m:mrow>
   </m:msub>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mfrac>
      <m:mrow>
         <m:mo class="MathClass-rel">|</m:mo>
         <m:msub>
            <m:mrow>
               <m:mover accent="true">
                  <m:mrow>
                     <m:mi>x</m:mi>
                  </m:mrow>
                  <m:mo class="MathClass-op"> &#772;</m:mo>
               </m:mover>
            </m:mrow>
            <m:mrow>
               <m:mi>h</m:mi>
               <m:mi>i</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-bin">-</m:mo>
         <m:msub>
            <m:mrow>
               <m:mover accent="true">
                  <m:mrow>
                     <m:mi>x</m:mi>
                  </m:mrow>
                  <m:mo class="MathClass-op"> &#772;</m:mo>
               </m:mover>
            </m:mrow>
            <m:mrow>
               <m:mi>l</m:mi>
               <m:mi>o</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-rel">|</m:mo>
      </m:mrow>
      <m:mrow>
         <m:msqrt>
            <m:mrow>
               <m:msubsup>
                  <m:mrow>
                     <m:mi>s</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mi>h</m:mi>
                     <m:mi>i</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mn>2</m:mn>
                  </m:mrow>
               </m:msubsup>
               <m:mo class="MathClass-bin">+</m:mo>
               <m:msubsup>
                  <m:mrow>
                     <m:mi>s</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mi>l</m:mi>
                     <m:mi>o</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mn>2</m:mn>
                  </m:mrow>
               </m:msubsup>
               <m:mo class="MathClass-bin">+</m:mo>
               <m:msup>
                  <m:mrow>
                     <m:mi>&#963;</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mn>2</m:mn>
                  </m:mrow>
               </m:msup>
            </m:mrow>
         </m:msqrt>
      </m:mrow>
   </m:mfrac>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>where <inline-formula>
<m:math name="1752-0509-6-9-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:msub>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi>x</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op"> &#772;</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:mi>h</m:mi>
      <m:mi>i</m:mi>
   </m:mrow>
</m:msub>
</m:math>
</inline-formula> and <inline-formula>
<m:math name="1752-0509-6-9-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:msub>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi>x</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op"> &#772;</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:mi>l</m:mi>
      <m:mi>o</m:mi>
   </m:mrow>
</m:msub>
</m:math>
</inline-formula> are the average fragment enrichments for the upper and lower partitions, respectively, <inline-formula>
<m:math name="1752-0509-6-9-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:msubsup>
   <m:mrow>
      <m:mi>s</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>h</m:mi>
      <m:mi>i</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>2</m:mn>
   </m:mrow>
</m:msubsup>
</m:math>
</inline-formula> and <inline-formula>
<m:math name="1752-0509-6-9-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:msubsup>
   <m:mrow>
      <m:mi>s</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>l</m:mi>
      <m:mi>o</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>2</m:mn>
   </m:mrow>
</m:msubsup>
</m:math>
</inline-formula> are the variances of fragment enrichments for the upper and lower partitions, respectively, and the <it>&#945; </it>is a constant equal to 0.014. <it>&#945; </it>is on the order of magnitude of the uncertainty in measurements. This slight modification to the standard Z-score puts a lower bound on the expected experimental variation. The Z-score of each fragment is added together to give the Z-score of the experiment.</p>
<p>
<display-formula>
<m:math name="1752-0509-6-9-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi>Z</m:mi>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:munder class="msub">
      <m:mrow>
         <m:mo mathsize="big"> &#8721;</m:mo>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
         <m:mo class="MathClass-rel">&#8712;</m:mo>
         <m:mi>f</m:mi>
         <m:mi>r</m:mi>
         <m:mi>a</m:mi>
         <m:mi>g</m:mi>
         <m:mi>m</m:mi>
         <m:mi>e</m:mi>
         <m:mi>n</m:mi>
         <m:mi>t</m:mi>
      </m:mrow>
   </m:munder>
   <m:msub>
      <m:mrow>
         <m:mi>Z</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
      </m:mrow>
   </m:msub>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>Using this approach, candidate flux states were sampled uniformly and experimental hypotheses tested. Z-scores were calculated for the hi-lo hypothesis corresponding to 1) individual reactions 2) reaction ratios and 3) two 'random' reactions or ratios. Random hypotheses were tested to estimate the level of noise associated with the set of flux distributions. Raw and normalized Z-scores are given in the Additional File <supplr sid="S1">1</supplr>. Z-scores varied from the level of noise to a maximum of <it>&gt;</it>20-fold the level of noise.</p>
<p>To illustrate the differences in label-dependent reaction resolving capacity, two sets of Z-scores corresponding to [1-<sup>13</sup>C] glucose and [6-<sup>13</sup>C] glucose are plotted in Figure <figr fid="F3">3</figr>. Lighter colors indicate higher Z-scores and ease of measurement. In this case, [6-<sup>13</sup>C] glucose scores higher at measuring the pentose phosphate pathway and most of lower glycolysis, whereas [1-<sup>13</sup>C] glucose glucose scores much higher at measuring the glyoxylate shunt. The results suggest that there is no single label that yields a high score for all experimental objectives. For example, the exchange of formate (EX_for) could be easiest measured with a [1,2-<sup>13</sup>C] glucose; however, this labeling pattern is bested by [1-<sup>13</sup>C] glucose for the measurement of reaction formyltetrahydrofolate deformylase (FTHFD) (Figure <figr fid="F4">4</figr>). This non-universality of labels is in line with expectations, as it has been previously shown that the choice of labels can affect the flux resolution. For many reactions, the best experiment that could be performed involves hypothetical (non-commercially available) labels. One example is the ratio of phosphofructokinase (PFK) flux to fructose bisphosphate aldolase (FBA) flux. The best label for determining this ratio is [1,2,3-<sup>13</sup>C] glucose (Z = 28.0), which gives a much higher Z-score than the best commercially available label, [1,2-<sup>13</sup>C] glucose (Z = 18.5). Thus, there may be motivation to synthesize compounds with more complex labeling patterns than commonly used. Additionally, there are certain reactions which are predicted to be difficult to measure with any labeling pattern. For example, the Z-scores for each possible labeled glucose substrate for the reaction pyruvate oxidase (POX) all lie within the level of noise, as determined by the comparison with random hi-lo experiment Z-scores.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Simulated Z-Scores</p></caption><text>
   <p><b>Simulated Z-Scores</b>. Two possible glucose label patterns show different strengths in evaluating different parts of the network. Brighter colors indicate more easily determined fluxes. a) [1-<sup>13</sup>C] glucose Z-scores illustrates flux determinability with 100% [1-<sup>13</sup>C] glucose. b) [6-<sup>13</sup>C] glucose Z-scores shows the same network evaluated with [6-<sup>13</sup>C] glucose. It is observed, for example, that [6-<sup>13</sup>C] glucose is predicted to elucidate the pentose phosphate pathway more easily, while [1-<sup>13</sup>C] glucose better elucidates the glyoxylate shunt.</p>
</text><graphic file="1752-0509-6-9-3" hint_layout="double"/></fig>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Computational Evaluation of Glucose</p></caption><text>
   <p><b>Computational Evaluation of Glucose</b>. Potential glucose labels are evaluated based on both absolute Z-scores and Z-scores normalized with respect to the labeling pattern with the highest score. Glucose labels are listed on top including hypothetical labeling patterns (e.g. [1,2,3-<sup>13</sup>C]) and commercially available labels (e.g. [1-<sup>13</sup>C]). [U-<sup>13</sup>C] = uniform labeled and [<sup>12</sup>C] = unlabeled. Reaction and reaction ratio hypotheses are listed on the left. The 'random' hypotheses, as described in the methods, shows the level of noise.</p>
</text><graphic file="1752-0509-6-9-4" hint_layout="double"/></fig>
<p>In addition to label-specific reaction flux elucidation properties, <sup>13</sup>C experiments show a clear pathway bias regardless of labeling pattern. The maximum Z-score of all labeling patterns was found for each reaction, giving a metric for the maximum potential for reaction flux determination using <sup>13</sup>C-labeled glucose (Figure <figr fid="F5">5A</figr>). Then, the fraction of reactions that had a maximum potential at least twice the noise level was found for each subsystem (Figure <figr fid="F5">5b</figr>). Reactions that were stoichiometrically fixed by the measured constraints on acetate, glucose, D-lactate, oxygen, growth rate, and ATP maintenance were also categorized by subsystem. Stoichiometrically fixed (i.e. constraint-determined) reactions have a confidence interval of zero, and thus are label-independent and receive no additional knowledge from <sup>13</sup>C experiments. It was found that histidine, valine, leucine, and isoleucine metabolism fluxes are completely identified solely based on the flux constraints. On the other hand, prior constraints fix none of the fluxes in central carbon metabolic systems such as glycolysis, citric acid cycle, pentose phosphate pathway, and anaplerotic reactions; however, fluxes in these pathways are all predicted to be identifiable with a <sup>13</sup>C experiment using optimal labeling patterns for each reaction. This result is expected as these identifiable pathways are the typical pathways being studied using <sup>13</sup>C analysis. Other pathways, such as cysteine, threonine, and lysine metabolism, are completely identifiable through a combination of prior stoichiometric constraints combined with well-chosen <sup>13</sup>C experiments. However, many of the remaining subsystems have a fraction of reactions that cannot be determined using any <sup>13</sup>C-labeling pattern of glucose. In particular, no additional information can be obtained from <sup>13</sup>C-labeled glucose experiments about certain biosynthetic pathways, nucleotide salvage pathways, reductive citric acid cycle reactions, and certain alternate pathways peripheral to glycolysis, such as an alternate pathways from DHAP to D-lactate. Measuring metabolites other than amino acids may give more information on these pathways. Note that in this discussion of identifiability, the Z-score metric indicates that an experiment can significantly reduce the confidence interval of a particular reaction but does not specifically predict the value of the confidence interval. Confidence intervals are directly calculated for experimental data sets in a later section and compared to the Z-scores for the same labeling patterns.</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Pathway Analysis of Optimal <sup>13</sup>C Experiments</p></caption><text>
   <p><b>Pathway Analysis of Optimal <sup>13</sup>C Experiments</b>. a) The maximum Z-score of all labeling patterns for each non-fixed reaction was found. Notably, greater than 50% of reactions have no experiment with a predicted Z-score greater than twice the noise level. This indicates that many reactions will be difficult to elucidate. b) Reactions separated by subsystem. Reactions with fluxes fixed by constraints are termed "constraint determined." The reactions with maximum Z-scores at least twice the noise level were termed "<sup>13</sup>C determinable." The difference between determinable/determined fluxes and 1 is the fraction of reactions that cannot be determined through either <sup>13</sup>C experiments or constraints.</p>
</text><graphic file="1752-0509-6-9-5" hint_layout="double"/></fig>
</sec>
<sec>
<st>
<p>Dimensionality of Isotopomer Data</p>
</st>
<p>The Monte Carlo sampling approach enables the determination of the dimensionality of simulated <sup>13</sup>C experiments for a particular substrate labeling pattern. The dimensionality gives an indication of the degree to which a particular substrate labeling pattern can specify the free dimensions inherent in a network structure, given a set experimental error. In an extreme case, if all the data falls on one point (zero dimensions), no additional information is given from the data. Similarly, a dimensionality of one indicates that the data can specify one degree of freedom. Singular value decomposition (SVD) is a data reduction technique that allows the estimation of data dimensionality (Figure <figr fid="F6">6a</figr>). A data matrix <it>M </it>of size (<it>n<sub>fragments </sub>
</it>x <it>n<sub>points</sub>
</it>), consisting of all sample points generated from Monte Carlo Sampling, is decomposed into <it>M </it>= <it>U </it>&#183; &#931; &#183; <it>V <sup>T </sup>
</it>where <it>U </it>and <it>V </it>are orthonormal bases and &#931; is a diagonal matrix containing singular values in descending order. The singular values are effectively weightings that describe the information content of the corresponding vectors in <it>U </it>and <it>V </it>towards reconstructing the full matrix <it>M</it>. A partial reconstruction of <it>M </it>is possible by taking only a subset of the singular values greater than some threshold. These thresholds have a direct interpretation as the uncertainty with which a data point can be measured. For example, a threshold cutoff of 0.01 indicates that the remaining uncertainty of the data falls within 0.01 or 1% error in the measurement of isotope enrichment.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Data Dimensionality with SVD</p></caption><text>
   <p><b>Data Dimensionality with SVD</b>. The linear dimensionality of experimental data space is measured with Singular Value Decomposition. a) The E. coli model has 313 reactions and 139 degrees of freedom. The isotopomer fragments were computed for a random sample of flux distributions and plotted in the 186 dimensional space of simulated measurements. The upper bound on the number of degrees of freedom in this space was determined by singular value decomposition on the samples. The number of singular values was counted until the magnitude of the next singular value fell below the instrument threshold. b) The number of significant singular values at different levels of experimental error is tabulated for various labeling patterns, with values ranging from 26 to 100.</p>
</text><graphic file="1752-0509-6-9-6" hint_layout="double"/></fig>
<p>To determine the dimensionality of the isotopomer data, SVD was performed on <sup>13</sup>C fragments derived from uniformly sampled flux distribution sets for several glucose labels. The results are summarized in Figure <figr fid="F6">6b</figr>. Globally, the choice of glucose labels affects the dimensionality of the resulting isotopomer data set. At the 1% (0.01) threshold, the label with the highest dimensionality was hypothetical [1,2,5-<sup>13</sup>C] glucose with 73 dimensions. The three labels for which experimental data was measured in the subsequent section, [1-<sup>13</sup>C] glucose, [6-<sup>13</sup>C] glucose, and 20% [U-<sup>13</sup>C] glucose, had dimensions 53, 53 and 35, respectively, at this cutoff. These values are all significantly lower than the best label, and, in particular, the uniform labeled experiment only produces half of the dimensionality as the optimal experiment. This result is significant. While 139 dimensions (the number of undetermined dimensions for the model used) are required to specify a unique flux vector, the dimensionality of the <sup>13</sup>C data for each label is significantly lower. The best labeling experiment specifies just over half (73/139 = 0.52) the degrees of freedom required, and 20% [U-<sup>13</sup>C] glucose only specifies about one fourth of the possible degrees of freedom (0.26). It is worth noting that SVD is a linear operation used to approximate properties of a non-linear system and the true degrees of freedom may be even lower than reported. SVD serves as a useful upper bound on the dimensionality of data for non-linear systems, but the difference between SVD dimensionality and true dimensionality may grow to be unacceptable for large systems. For the system studied here, SVD was found to be of practical use.</p>
</sec>
<sec>
<st>
<p>Experimental Validation</p>
</st>
<p>In order to assess the agreement of computationally predicted flux elucidation capacity with experimental data, we took fluxomic measurements for three labeling patterns in <it>E. coli</it>. Flux distributions that best explain each set of <sup>13</sup>C data were calculated using a non-linear optimization problem:</p>
<p>
<display-formula>
<m:math name="1752-0509-6-9-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mtable equalrows="false" columnlines="none none none none none none none none none none none none none none none none none none none" equalcolumns="false" class="array">
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:munder class="msub">
               <m:mrow>
                  <m:mtext>min</m:mtext>
               </m:mrow>
               <m:mrow>
                  <m:mi>v</m:mi>
               </m:mrow>
            </m:munder>
            <m:mtext>Error</m:mtext>
            <m:mrow>
               <m:mo class="MathClass-open">(</m:mo>
               <m:mrow>
                  <m:mi>v</m:mi>
               </m:mrow>
               <m:mo class="MathClass-close">)</m:mo>
            </m:mrow>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:mtext>subject</m:mtext>
            <m:mspace width="2.77695pt" class="tmspace"/>
            <m:mtext>to</m:mtext>
            <m:mo class="MathClass-rel">:</m:mo>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:msub>
               <m:mrow>
                  <m:mi>v</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>m</m:mi>
                  <m:mi>i</m:mi>
                  <m:mi>n</m:mi>
               </m:mrow>
            </m:msub>
            <m:mo class="MathClass-rel">&lt;</m:mo>
            <m:mi>v</m:mi>
            <m:mo class="MathClass-rel">&lt;</m:mo>
            <m:msub>
               <m:mrow>
                  <m:mi>v</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>m</m:mi>
                  <m:mi>a</m:mi>
                  <m:mi>x</m:mi>
               </m:mrow>
            </m:msub>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:mi>S</m:mi>
            <m:mo class="MathClass-bin">&#8901;</m:mo>
            <m:mi>v</m:mi>
            <m:mo class="MathClass-rel">=</m:mo>
            <m:mn>0</m:mn>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center"/>
      </m:mtr>
   </m:mtable>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>The function Error(v) is a score of how well a given flux distribution fits the experimental data. It is defined as:</p>
<p>
<display-formula>
<m:math name="1752-0509-6-9-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mtext>Error(</m:mtext>
   <m:mi>v</m:mi>
   <m:mtext>)</m:mtext>
   <m:mspace width="2.77695pt" class="tmspace"/>
   <m:mtext>&#160;=&#160;</m:mtext>
   <m:munder class="msub">
      <m:mrow>
         <m:mo mathsize="big"> &#8721;</m:mo>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
         <m:mo class="MathClass-rel">&#8712;</m:mo>
         <m:mspace width="2.77695pt" class="tmspace"/>
         <m:mtext>fragments</m:mtext>
      </m:mrow>
   </m:munder>
   <m:mfrac>
      <m:mrow>
         <m:msup>
            <m:mrow>
               <m:mrow>
                  <m:mo class="MathClass-open">(</m:mo>
                  <m:mrow>
                     <m:mtext>fragmen</m:mtext>
                     <m:msub>
                        <m:mrow>
                           <m:mtext>t</m:mtext>
                        </m:mrow>
                        <m:mrow>
                           <m:mi>i</m:mi>
                        </m:mrow>
                     </m:msub>
                     <m:mrow>
                        <m:mo class="MathClass-open">(</m:mo>
                        <m:mrow>
                           <m:mi>v</m:mi>
                        </m:mrow>
                        <m:mo class="MathClass-close">)</m:mo>
                     </m:mrow>
                     <m:mo class="MathClass-bin">-</m:mo>
                     <m:mtext>measure</m:mtext>
                     <m:msub>
                        <m:mrow>
                           <m:mtext>d</m:mtext>
                        </m:mrow>
                        <m:mrow>
                           <m:mi>i</m:mi>
                        </m:mrow>
                     </m:msub>
                  </m:mrow>
                  <m:mo class="MathClass-close">)</m:mo>
               </m:mrow>
            </m:mrow>
            <m:mrow>
               <m:mn>2</m:mn>
            </m:mrow>
         </m:msup>
      </m:mrow>
      <m:mrow>
         <m:msup>
            <m:mrow>
               <m:mi>&#963;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mn>2</m:mn>
            </m:mrow>
         </m:msup>
      </m:mrow>
   </m:mfrac>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>where measured <it>
<sub>i </sub>
</it>is the measured fractional enrichment of fragment <it>i</it>, fragment <it>
<sub>i</sub>
</it>(<it>v</it>) is the computed fractional enrichment of fragment <it>i </it>as a function of the flux distribution <it>v</it>, and <it>&#945; </it>= 0<it>:</it>014 is the standard deviation of the fragments as calculated from experimental replicates.</p>
<p>Reaction flux confidence ranges were then computed for all reactions using all three sets of <sup>13</sup>C data and all combinations thereof. Confidence intervals for reaction rates were computed by maximizing and minimizing the value of each reaction in turn subject to a slightly relaxed score.</p>
<p>
<display-formula>
<m:math name="1752-0509-6-9-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mtable equalrows="false" columnlines="none none none none none none none none none none none none none none none none none none none" equalcolumns="false" class="array">
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:munder class="msub">
               <m:mrow>
                  <m:mtext>min</m:mtext>
               </m:mrow>
               <m:mrow>
                  <m:mi>&#945;</m:mi>
               </m:mrow>
            </m:munder>
            <m:mo class="MathClass-bin">/</m:mo>
            <m:munder class="msub">
               <m:mrow>
                  <m:mtext>max</m:mtext>
               </m:mrow>
               <m:mrow>
                  <m:mi>&#945;</m:mi>
               </m:mrow>
            </m:munder>
            <m:msubsup>
               <m:mrow>
                  <m:mi>c</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>i</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>T</m:mi>
               </m:mrow>
            </m:msubsup>
            <m:mo class="MathClass-bin">&#8901;</m:mo>
            <m:mi>N</m:mi>
            <m:mo class="MathClass-bin">&#8901;</m:mo>
            <m:mi>&#945;</m:mi>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:mstyle class="text">
               <m:mtext class="textsf" mathvariant="sans-serif">subject</m:mtext>
            </m:mstyle>
            <m:mspace width="2.77695pt" class="tmspace"/>
            <m:mstyle class="text">
               <m:mtext class="textsf" mathvariant="sans-serif">to</m:mtext>
            </m:mstyle>
            <m:mo class="MathClass-rel">:</m:mo>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:msub>
               <m:mrow>
                  <m:mi>v</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>m</m:mi>
                  <m:mi>i</m:mi>
                  <m:mi>n</m:mi>
               </m:mrow>
            </m:msub>
            <m:mo class="MathClass-rel">&#8804;</m:mo>
            <m:mi>N</m:mi>
            <m:mo class="MathClass-bin">&#8901;</m:mo>
            <m:mi>&#945;</m:mi>
            <m:mo class="MathClass-rel">&#8804;</m:mo>
            <m:msub>
               <m:mrow>
                  <m:mi>v</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>m</m:mi>
                  <m:mi>a</m:mi>
                  <m:mi>x</m:mi>
               </m:mrow>
            </m:msub>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:mstyle class="text">
               <m:mtext class="textsf" mathvariant="sans-serif">Error&#160;(</m:mtext>
            </m:mstyle>
            <m:mi>N</m:mi>
            <m:mo class="MathClass-bin">&#8901;</m:mo>
            <m:mi>&#945;</m:mi>
            <m:mstyle class="text">
               <m:mtext class="textsf" mathvariant="sans-serif">)</m:mtext>
            </m:mstyle>
            <m:mo class="MathClass-rel">&#8804;</m:mo>
            <m:mstyle class="text">
               <m:mtext class="textsf" mathvariant="sans-serif">Erro</m:mtext>
            </m:mstyle>
            <m:msub>
               <m:mrow>
                  <m:mstyle class="text">
                     <m:mtext class="textsf" mathvariant="sans-serif">r</m:mtext>
                  </m:mstyle>
               </m:mrow>
               <m:mrow>
                  <m:mi>m</m:mi>
                  <m:mi>a</m:mi>
                  <m:mi>x</m:mi>
               </m:mrow>
            </m:msub>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center"/>
      </m:mtr>
   </m:mtable>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>where <it>c<sub>i </sub>
</it>= (0, 0, ...0, 1, 0...0) is a vector of all zeros with a 1 in position i, and <it>Error<sub>max </sub>
</it>was set based on the confidence value. Because different data sets provide different levels of consistency, Error<it>
<sub>max </sub>
</it>was chosen to be 30 units greater than the minimum error found.</p>
<p>These intervals were compared with Z-scores calculated through Monte Carlo methods to assess the ability of the Z-scores to predict the size of experimental reaction ranges in a label-specific manner (Figure <figr fid="F7">7a</figr>). The Z-scores were found to be correlated with the relative flux ranges in a statistically significant manner (Student's T-test, p <it>&lt; </it>8.6 &#215; 10<sup>-34</sup>). A receiver operating characteristic (ROC) curve suggests that the Z-scores can identify with both sensitivity and specificity the reactions that can be elucidated in a label-specific manner, with better performance predicting ranges that are restricted more by data (Figure <figr fid="F7">7b</figr>). These findings indicate that the Z-score is indeed a useful predictor of the degree a flux range will be constrained by a particular <sup>13</sup>C experiment and provide experimental support for the computational approach taken.</p>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Comparison of Calculated Z-scores and Experimental Flux Ranges</p></caption><text>
   <p><b>Comparison of Calculated Z-scores and Experimental Flux Ranges</b>. a) For several reactions, the computed Z scores are compared to the resulting measured flux ranges. Z-scores show (color coded) Z-scores for each of the 12 reactions and three glucose labels. FVA indicates the absolute allowable flux ranges for three glucoses ([1-<sup>13</sup>C], [6-<sup>13</sup>C], [U-<sup>13</sup>C]) as well as the range if no <sup>13</sup>C data is imposed ('none'). A normalized version of this table is also presented where all flux ranges are divided by the FVA range thus showing the fraction of flux range remaining. This quantity ranges from 0 (range fully specified) to 1 (no additional information). b) A receiver operating characteristic (ROC) curve for the ability of Z-scores to predict confidence interval reduction by different fractions. Z-scores are observed to predict larger interval reductions (e.g. ratio<it>&lt;</it>0.2) with higher specificity and sensitivity than smaller reductions (e.g. ratio<it>&lt;</it>0.8).</p>
</text><graphic file="1752-0509-6-9-7" hint_layout="double"/></fig>
<p>The number of reactions elucidated at particular confidence intervals was then found (Figure <figr fid="F8">8</figr>). Using different labels provides different levels of reaction confidence (Additional File <supplr sid="S1">1</supplr>). Including no <sup>13</sup>C data generates the largest flux ranges (lower black line), while adding <sup>13</sup>C data reduces the ranges and shifts the curve left. With almost no exception, including one experiment yields larger confidence intervals than any combination of two carbon sources which in turn is a larger range than including all three sets. Of the single experiment curves, the 20% [U-<sup>13</sup>C] glucose curve provides notably worse ranges than the other two experiments, consistent with the finding that 20% [U-<sup>13</sup>C] substrate provides data with the smallest number of dimensions.</p>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>Reaction Confidence Intervals with <sup>13</sup>C Data</p></caption><text>
   <p><b>Reaction Confidence Intervals with <sup>13</sup>C Data</b>. The number of reactions determined at a particular confidence given different data is plotted. The range of allowable fluxes (<it>v<sub>max </sub></it>- <it>v<sub>min</sub></it>) were computed for each reaction, constrained by none, one, two or three sets of <sup>13</sup>C data. Constraints alone are seen to determine a significant number of fluxes, while using all three <sup>13</sup>C data sets is seen to provide the most elucidated reactions for a particular confidence interval.</p>
</text><graphic file="1752-0509-6-9-8" hint_layout="double"/></fig>
<p>At a reaction confidence of 1 mmol &#183; gDW<sup>-1</sup>&#183;h<sup>-1 </sup>(a relatively non-stringent cutoff), 85 reactions are specified simply from uptake rate data without any <sup>13</sup>C data. Performing the least informative <sup>13</sup>C experiment, using 20% [U-<sup>13</sup>C] substrate, yields 105 reactions that meet the confidence criterion, whereas the combination of all three <sup>13</sup>C experiments yields 125 reactions that meet the criterion. In other words, performing all three experiments will increase the number of elucidated reactions by 40 reactions or about 50%. As the model used contains 278 carbon-tracking reactions and reaction groups, the increase in knowledge at 1 mmol &#183; gDW<sup>-1</sup>&#183;h<sup>-1 </sup>confidence from 85 to 125 reactions from using <sup>13</sup>C data indicates that a large gap in the knowledge remains. It seems apparent that other methods must be developed to obtain flux information at the genome scale from single experiments, as would ultimately be desirable. However, as noted in the above section, the majority of reactions that are elucidated by <sup>13</sup>C-labeled glucose experiments lie in central metabolic pathways, which tend to be both of high interest and not-specifiable by constraints alone.</p>
</sec>
</sec>
<sec>
<st>
<p>Conclusions</p>
</st>
<p>We introduce a new framework for calculating the uncertainty inherent to <sup>13</sup>C experiments using Monte Carlo Sampling. This allows us to predict the success of experiments before performing them. The method used here 1) does not require experimental identification of the 'real' flux state <it>a priori </it>
<abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp> and 2) reports scores for the resolution capability for each reaction as opposed to Boolean identification calls <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>. This framework reveals several key findings:</p>
<p indent="1">&#8226; The choice of input label is important, as different labels perform better than others. In particular, the commonly-used 20% mixture of uniform label + 80% natural label was shown computationally and experimentally to resolve significantly fewer reaction fluxes than either [1-<sup>13</sup>C] glucose or [6-<sup>13</sup>C] glucose. Thus, the amount of information likely to be obtained by a <sup>13</sup>C experiment can be predicted in a reaction-specific manner before having to carry out an experiment.</p>
<p indent="1">&#8226; There is no universally best label. The best label depends on the experimental objective. Certain reactions are more precisely measured with some labels than others, and no label is best at elucidating all reactions. Certain hypothetical <sup>13</sup>C labels of glucose, for example [1,2,3-<sup>13</sup>C] glucose and [1,2,5-<sup>13</sup>C] glucose, are predicted to perform better than commercially available single labels for many reactions.</p>
<p indent="1">&#8226; The <sup>13</sup>C data dimensionality is less than anticipated. Whereas each <sup>13</sup>C experiment can measure 186 pieces of information at a time, there is a high degree of interdependence. We measured the true data dimensionality to be between 35 and 50 dimensions for commercially available labels and as high as 73 for exotic labels. This high data redundancy can partially explain why <sup>13</sup>C experiments yield many reaction rates with high uncertainties.</p>
<p>This study suggests limitations of steady-state <sup>13</sup>C analysis using solely amino acids due to the lower than expected dimensionality of the isotopomer data. However, steady-state <sup>13</sup>C analysis is clearly still useful elucidating reaction fluxes in <it>E. coli </it>metabolism. Notably, as the study was conducted using only protein-derived amino acids, it would be of immediate interest to determine the additional benefit of measuring other classes of labeled metabolites, as well as the benefit of more recently developed experimental techniques such as multi-substrate <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp> and dynamic flux labeling <abbrgrp>
<abbr bid="B28">28</abbr>
<abbr bid="B29">29</abbr>
</abbrgrp> experiments. Monte Carlo methods are generic and thus are well suited to be adapted to the experimental setup of interest. Additionally, Monte Carlo methods are amenable to biasing the sampling space based on known data to improve results; however, this is expected to incur a corresponding cost in convergence time. Possible sources of error in the current method include unwanted bias in the sampled flux space, possible inadequacy of the Z-score as a statistical metric over more sophisticated tests such as the Kolmogorov-Smirnov test, and inadequacy of the distribution median as a threshold for use in hi-lo hypotheses. Conducting Monte Carlo isotopomer analysis on new systems will become more accessible with the availability of the open source COBRA Toolbox v2.0 for MATLAB, which includes the algorithms presented here <abbrgrp>
<abbr bid="B30">30</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Methods</p>
</st>
<sec>
<st>
<p>Isotopomer Network Description</p>
</st>
<p>The isotopomer network was derived from the iMC1010 <it>E. coli </it>reconstruction <abbrgrp>
<abbr bid="B19">19</abbr>
</abbrgrp>. The content is reported in the Additional File <supplr sid="S1">1</supplr>. There are a total of 313 irreversible reactions including 278 that track carbon. All carbon tracking reactions are broken into elementary forward and reverse reactions.</p>
<p>First, a central metabolic isotopomer model was generated that includes a total of 85 reactions, including a biomass production reaction, which drains the precursor metabolites used to make biomass, and 14 system boundary exchange fluxes (for glucose, oxygen, phosphate, NO<sub>2</sub>, NO<sub>3</sub>, acetate, CO<sub>2</sub>, ethanol, formate, fumarate, glycerol, D-lactate, pyruvate, and succinate). The biomass composition is based on one that was reported previously <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B31">31</abbr>
</abbrgrp> and used in the biosynthetic isotopomer model (see details below), but where the biomass components are replaced by the amount of ATP, NADH, NADPH, and central metabolic precursors needed to synthesize the biomass components (Additional File <supplr sid="S1">1</supplr>). The remaining 70 reactions participate in glycolysis, TCA cycle, pentose phosphate pathway, oxidative phosphorylation, pyruvate metabolism, and anaplerotic metabolism. The central metabolic isotopomer model includes linear mass balance equations for 67 metabolites. Carbon atoms are tracked through 46 metabolites in the core metabolic network. Changes to the central metabolic reactions include assigning fumarate reductase to utilize menaquinone and demethylmenaquinone rather than ubiquinone and adding a phosphate transport reaction coupled to proton symport.</p>
<p>Aside from the central metabolic reactions contained in the central isotopomer model, the biosynthetic model also includes a number of other catabolic and anabolic reactions. The fluxes were calculated with an additional constraint that flux through formyltetrahydrofolate deformylase (which removes the C1 unit from 10-formyltetrahydrofolate) was less than or equal to the measured formate secretion flux. When higher flux through this reaction was allowed the minimum error improved by only 0.3%, but the flux through this reaction was high (around half the glucose uptake rate). To adjust this aberrant behavior, the optimal flux distributions and confidence intervals were calculated with this additional constraint on the formyltetrahydrofolate deformylase flux. The resulting biosynthetic isotopomer model includes 189 metabolites (126 of which have tracked carbon atoms), 313 irreversible metabolic reactions (63 of which are reversible and involve tracked carbon atoms), and 8,612 isotopomer variables (which is equal to the number of non-linear isotopomer mass balance constraints). The model also includes a biomass reaction and 19 system boundary exchange reactions. The biomass reaction was altered to include amino acids, nucleotides, co-factors, and macromolecules rather than their precursor metabolites. In addition, the biosynthetic model balances intracellular protons as well as water molecules similar to iJR904 <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Monte Carlo Sampling</p>
</st>
<p>With traditional MCMC, a point is selected within the space which is then iteratively moved around. At each step, a random direction is chosen and the next point is chosen uniformly along this line. The set of points that this algorithm visits will converge to a uniformly distributed set. Two modifications were made: 1) Artificial Centering <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp> - Because these biological spaces tend to be elongated in one direction, it is often beneficial to choose directions along the "long" direction rather than uniformly. This can be done by choosing the direction based on previously visited points. At each step, the direction is chosen by drawing a vector from the center of the previous points to one of the previous points chosen at random. 2) In place sampling - Instead of moving just one point throughout the space, many points are moved simultaneously. In this way, no "history" is kept, only the updated position of all the points. This method is described in greater detail in other literature <abbrgrp>
<abbr bid="B33">33</abbr>
<abbr bid="B34">34</abbr>
</abbrgrp>.</p>
</sec>
<sec>
<st>
<p>Computing the Isotopomer Distribution</p>
</st>
<p>Each flux distribution and glucose input results in a unique isotopomer distribution. The cumomer method <abbrgrp>
<abbr bid="B35">35</abbr>
</abbrgrp> and the elementary metabolite unit (EMU) method <abbrgrp>
<abbr bid="B36">36</abbr>
</abbrgrp> were implemented in Matlab and utilized in calculating isotopomer distributions. These methods involve solving several linear systems of equations to compute different groups of isotopomers. For numerical reasons, a routine is introduced which checks whether all parts of the network are still connected at every step. Disconnected components can occur when fluxes to and from the component are zero, making it impossible to compute a unique isotopomer distribution within this subnetwork as many isotopomers satisfy the balance equations. By removing these components first, the other metabolites can be solved in a numerically stable fashion. The resulting isotopomers for the amino acids for each flux distribution are transformed to a mass distribution. This way, each experiment is abstracted to a (number of distributions) &#215; (number of fragments) matrix.</p>
</sec>
<sec>
<st>
<p>
<b>Sample Preparation and </b>
<sup>13</sup>
<b>C Measurement</b>
</p>
</st>
<sec>
<st>
<p>Culture labeling</p>
</st>
<p>Prior to labeling, single colonies of <it>E. coli </it>K12 MG1655 were selected from stock plates and inoculated directly into 250 ml M9 medium in 500 Erlenmeyer flasks aerated by stirring at 1000 rpm. Cells were grown overnight, harvested, washed twice with water and used to inoculate 50 ml flasks containing 25 ml medium with 2 g/L 13C-labeled D-glucose, with initial OD600 0.005-0.01. Glucose was supplied as either 100% [1-<sup>13</sup>C]-labeled, 100% [6-<sup>13</sup>C]-labeled, or a mixture of 20% uniformly [U-<sup>13</sup>C]-labeled with 80% natural glucose (which is randomly 1% 13C). Cells were grown to mid-log phase, corresponding to OD600 of 0.6. 3 ml of each culture was harvested by centrifugation at 4&#176;C. The media was aspirated and analyzed with HPLC to determine the remaining glucose concentration. Cell pellets were placed at -80&#176;C prior to further analysis.</p>
</sec>
<sec>
<st>
<p>Derivatization and GC-MS analysis</p>
</st>
<p>Cells were resuspended in 0.1 ml 6 M HCl and transferred to glass vials. Protein was digested into amino acids under a nitrogen atmosphere for 18 hr at 105&#176;C in an Eldex H/D Work Station. Digested samples were dried to remove residual HCl, resuspended with 75 <it>&#956;</it>l each of tetrahydrofuran and <it>N-tert</it>-butyldimethylsilyl-<it>N</it>-methyltrifluoroacetamide (Aldrich), and incubated for 1 hr at 80&#176;C to derivatize amino acids. Samples were filtered through 0.2 <it>&#956;</it>m PVDF filters and injected into a Shimadzu QP2010 Plus GC-MS (0.5 <it>&#956;</it>l with 1:50 split ratio). GC injection temperature was 250&#176;C and the GC oven temperature was initially 130&#176;C for 4 min, rising to 230&#176;C at 4&#176;C/min and to 280&#176;C at 20&#176;C/min with a final hold at this temperature for 2 min. GC flow rate with helium carrier gas was 50 cm/s. The GC column used was a 15 m &#215; 0.25 mm &#215; 0.25 m SHRXI-5ms (Shimadzu). GC-MS interface temperature was 300 degrees with 70 eV ionization voltage. The mass spectrometer was set to scan an m/z range of 50 to 600.</p>
</sec>
<sec>
<st>
<p>Processing of GC-MS data</p>
</st>
<p>Mass data were retrieved from the GC-MS for fragments of 14 derivatized amino acids: cysteine and tryptophan were degraded during amino acid hydrolysis; asparagine and glutamine were converted respectively to aspartate and glutamate; arginine was not stable to the derivatization procedure. For each fragment, these data comprised mass intensities for the base isotopomer (without any heavy isotopes, M+0), and isotopomers with increasing unit mass (up to M+6) relative to that of M+0. These mass distributions were normalized by dividing by the sum of M+0 to M+6, and corrected for naturally-occurring heavy isotopes of the elements H, N, O, Si, S, and (in moieties from the derivatizing reagent) C, using matrix-based probabilistic methods as described <abbrgrp>
<abbr bid="B37">37</abbr>
<abbr bid="B38">38</abbr>
</abbrgrp> implemented in Microsoft Excel. Data were also corrected for carry-over of unlabeled inoculum <abbrgrp>
<abbr bid="B37">37</abbr>
</abbrgrp>.</p>
</sec>
</sec>
<sec>
<st>
<p>
<b>Computing Reaction Rates from </b>
<sup>13</sup>
<b>C Data</b>
</p>
</st>
<p>Reaction rates were computed from <sup>13</sup>C data as described in the Results. In calculating the best fit flux values from experimental data, a small variation was introduced to reduce the number of variables and remove constraints. Let N be a basis for the null space of <it>S</it>. Then all valid fluxes can be written as:</p>
<p>
<display-formula>
<m:math name="1752-0509-6-9-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mtable equalrows="false" columnlines="none none none none none none none none none none none none none none none none none none none" equalcolumns="false" class="array">
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:mi>v</m:mi>
            <m:mo class="MathClass-rel">=</m:mo>
            <m:mi>N</m:mi>
            <m:mo class="MathClass-bin">&#8901;</m:mo>
            <m:mi>&#945;</m:mi>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:munder class="msub">
               <m:mrow>
                  <m:mtext>min</m:mtext>
               </m:mrow>
               <m:mrow>
                  <m:mi>&#945;</m:mi>
               </m:mrow>
            </m:munder>
            <m:mstyle class="text">
               <m:mtext class="textsf" mathvariant="sans-serif">Error&#160;</m:mtext>
            </m:mstyle>
            <m:mi>N</m:mi>
            <m:mo class="MathClass-bin">&#8901;</m:mo>
            <m:mi>&#945;</m:mi>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:mstyle class="text">
               <m:mtext class="textsf" mathvariant="sans-serif">subject</m:mtext>
            </m:mstyle>
            <m:mspace width="2.77695pt" class="tmspace"/>
            <m:mstyle class="text">
               <m:mtext class="textsf" mathvariant="sans-serif">to</m:mtext>
            </m:mstyle>
            <m:mo class="MathClass-rel">:</m:mo>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center">
            <m:msub>
               <m:mrow>
                  <m:mi>v</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>m</m:mi>
                  <m:mi>i</m:mi>
                  <m:mi>n</m:mi>
               </m:mrow>
            </m:msub>
            <m:mo class="MathClass-rel">&lt;</m:mo>
            <m:mi>N</m:mi>
            <m:mo class="MathClass-bin">&#8901;</m:mo>
            <m:mi>&#945;</m:mi>
            <m:mo class="MathClass-rel">&lt;</m:mo>
            <m:msub>
               <m:mrow>
                  <m:mi>v</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>m</m:mi>
                  <m:mi>a</m:mi>
                  <m:mi>x</m:mi>
               </m:mrow>
            </m:msub>
         </m:mtd>
      </m:mtr>
      <m:mtr>
         <m:mtd class="array" columnalign="center"/>
      </m:mtr>
   </m:mtable>
</m:mrow>
</m:math>
</display-formula>
</p>
<p>This reduced the number of variables from |<it>v</it>| = 335 to |<it>&#945;</it>| = 139. Optimization was performed with the Tomlab/SNOPT package. This method is an iterative local optimization and is therefore not guaranteed to find the optimal solution. To address the issue of local minima, the procedure was run with many randomly generated starting points and the lowest minimum was taken.</p>
</sec>
<sec>
<st>
<p>Code and Equipment</p>
</st>
<p>The code was written in the MATLAB environment and the COBRA toolbox. Linear Programming was done with the Tomlab/CPLEX package and nonlinear optimization with the TOMLAB/SNOPT interface. The EMU and cumomer method were written in native Matlab but generated in Perl. Computations were performed on a Dell Studio XPS desktops (2.6 Ghz core i7 with 9-12 GB ram) and a custom Rocks cluster (100 dual Xeon 5500 series nodes).</p>
</sec>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>JS devised methods and drafted manuscript. DZ performed additional analysis and completed manuscript. WC and SM implemented methods in Matlab and performed computational analyses. VP grew <it>E. coli </it>samples. DS performed MS fragmentation and natural abundance correction. JLR generated the <sup>13</sup>C <it>E. coli </it>model with <sup>13</sup>C mappings. AO and BOP were responsible for the strategic vision. All authors have read and approved the manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>This work was supported by National Institutes of Health grants GM068837, GM057089, and GM062791.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Understanding the Warburg Effect: The Metabolic Requirements of Cell Proliferation</p></title><aug><au><snm>Heiden</snm><fnm>MGV</fnm></au><au><snm>Cantley</snm><fnm>LC</fnm></au><au><snm>Thompson</snm><fnm>CB</fnm></au></aug><source>Science</source><pubdate>2009</pubdate><volume>324</volume><issue>5930</issue><fpage>1029</fpage><lpage>1033</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1160809</pubid><pubid idtype="pmcid">2849637</pubid><pubid idtype="pmpid" link="fulltext">19460998</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Identification of genome-scale metabolic network models using experimentally measured flux profiles</p></title><aug><au><snm>Herrgard</snm><fnm>MJ</fnm></au><au><snm>Fong</snm><fnm>SS</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2006</pubdate><volume>2</volume><issue>7</issue><fpage>e72</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.0020072</pubid><pubid idtype="pmcid">1487183</pubid><pubid idtype="pmpid" link="fulltext">16839195</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Using Metabolic Flux Data to Further Constrain the Metabolic Solution Space and Predict Internal Flux Patterns: The Escherichia coli alpha-Spectrum</p></title><aug><au><snm>Wiback</snm><fnm>S</fnm></au><au><snm>Mahadevan</snm><fnm>R</fnm></au><au><snm>Palsson</snm><fnm>B</fnm></au></aug><source>Biotechnology and Bioengineering</source><pubdate>2004</pubdate><volume>86</volume><issue>3</issue><fpage>317</fpage><lpage>31</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bit.20011</pubid><pubid idtype="pmpid" link="fulltext">15083512</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Bidirectional Reaction Steps in Metabolic Networks: IV. Optimal Design of Isotopomer Labeling Experiments</p></title><aug><au><snm>Mollney</snm><fnm>M</fnm></au><au><snm>Wiechert</snm><fnm>W</fnm></au><au><snm>Kownatzki</snm><fnm>D</fnm></au><au><snm>de Graaf</snm><fnm>AA</fnm></au></aug><source>Biotechnology and Bioengineering</source><pubdate>1999</pubdate><volume>66</volume><issue>2</issue><fpage>86</fpage><lpage>103</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/(SICI)1097-0290(1999)66:2&lt;86::AID-BIT2&gt;3.0.CO;2-A</pubid><pubid idtype="pmpid" link="fulltext">10567067</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Metabolic networks in motion: 13C-based flux analysis</p></title><aug><au><snm>Sauer</snm><fnm>U</fnm></au></aug><source>Molecular Systems Biology</source><pubdate>2006</pubdate></bibl><bibl id="B6"><title><p>13C metabolic flux analysis</p></title><aug><au><snm>Wiechert</snm><fnm>W</fnm></au></aug><source>Metab Eng</source><pubdate>2001</pubdate><volume>3</volume><issue>3</issue><fpage>195</fpage><lpage>206</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/mben.2001.0187</pubid><pubid idtype="pmpid" link="fulltext">11461141</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Metabolic flux elucidation for large-scale models using 13C labeled isotopes</p></title><aug><au><snm>Suthers</snm><fnm>PF</fnm></au><au><snm>Burgard</snm><fnm>AP</fnm></au><au><snm>Dasika</snm><fnm>MS</fnm></au><au><snm>Nowroozi</snm><fnm>F</fnm></au><au><snm>Van Dien</snm><fnm>S</fnm></au><au><snm>Keasling</snm><fnm>JD</fnm></au><au><snm>Maranas</snm><fnm>CD</fnm></au></aug><source>Metab Eng</source><pubdate>2007</pubdate><volume>9</volume><issue>5-6</issue><fpage>387</fpage><lpage>405</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ymben.2007.05.005</pubid><pubid idtype="pmcid">2121621</pubid><pubid idtype="pmpid" link="fulltext">17632026</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>(13)C-based metabolic flux analysis</p></title><aug><au><snm>Zamboni</snm><fnm>N</fnm></au><au><snm>Fendt</snm><fnm>SM</fnm></au><au><snm>Ruhl</snm><fnm>M</fnm></au><au><snm>Sauer</snm><fnm>U</fnm></au></aug><source>Nat Protoc</source><pubdate>2009</pubdate><volume>4</volume><issue>6</issue><fpage>878</fpage><lpage>92</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nprot.2009.58</pubid><pubid idtype="pmpid" link="fulltext">19478804</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Modeling Isotopomer Distributions in Biochemical Reaction Networks Using Isotopomer Mapping Matrices</p></title><aug><au><snm>Schmidt</snm><fnm>K</fnm></au><au><snm>Carlsen</snm><fnm>M</fnm></au><au><snm>Nielsen</snm><fnm>J</fnm></au><au><snm>Villadsen</snm><fnm>J</fnm></au></aug><source>Biotechnology and Bioengineering</source><pubdate>1997</pubdate><volume>55</volume><issue>6</issue><fpage>831</fpage><lpage>40</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/(SICI)1097-0290(19970920)55:6&lt;831::AID-BIT2&gt;3.0.CO;2-H</pubid><pubid idtype="pmpid" link="fulltext">18636594</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Evaluation of 13C isotopic tracers for metabolic flux analysis in mammalian cells</p></title><aug><au><snm>Metallo</snm><fnm>CM</fnm></au><au><snm>Walther</snm><fnm>JL</fnm></au><au><snm>Stephanopoulos</snm><fnm>G</fnm></au></aug><source>J Biotechnol</source><pubdate>2009</pubdate><volume>144</volume><issue>3</issue><fpage>167</fpage><lpage>74</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jbiotec.2009.07.010</pubid><pubid idtype="pmcid">3026314</pubid><pubid idtype="pmpid" link="fulltext">19622376</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Identification of Optimal Measurement Sets for Complete Flux Elucidation in Metabolic Flux Analysis Experiments</p></title><aug><au><snm>Chang</snm><fnm>Y</fnm></au><au><snm>Suthers</snm><fnm>PF</fnm></au><au><snm>Maranas</snm><fnm>CD</fnm></au></aug><source>Biotechnology and Bioengineering</source><pubdate>2008</pubdate><volume>100</volume><issue>6</issue><fpage>1039</fpage><lpage>1049</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bit.21926</pubid><pubid idtype="pmpid" link="fulltext">18553391</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Genome-scale models of microbial cells: evaluating the consequences of constraints</p></title><aug><au><snm>Price</snm><fnm>ND</fnm></au><au><snm>Reed</snm><fnm>JL</fnm></au><au><snm>Palsson</snm><fnm>B</fnm></au></aug><source>Nat Rev Microbiol</source><pubdate>2004</pubdate><volume>2</volume><issue>11</issue><fpage>886</fpage><lpage>897</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrmicro1023</pubid><pubid idtype="pmpid" link="fulltext">15494745</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Towards multidimensional genome annotation</p></title><aug><au><snm>Reed</snm><fnm>JL</fnm></au><au><snm>Famili</snm><fnm>I</fnm></au><au><snm>Thiele</snm><fnm>I</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2006</pubdate><volume>7</volume><issue>2</issue><fpage>130</fpage><lpage>41</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg1769</pubid><pubid idtype="pmpid" link="fulltext">16418748</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions</p></title><aug><au><snm>Schellenberger</snm><fnm>J</fnm></au><au><snm>Park</snm><fnm>JO</fnm></au><au><snm>Conrad</snm><fnm>TM</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2010</pubdate><volume>11</volume><fpage>213</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-11-213</pubid><pubid idtype="pmcid">2874806</pubid><pubid idtype="pmpid" link="fulltext">20426874</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>A protocol for generating a high-quality genome-scale metabolic reconstruction</p></title><aug><au><snm>Thiele</snm><fnm>I</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>Nat Protoc</source><pubdate>2010</pubdate><volume>5</volume><fpage>93</fpage><lpage>121</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3125167</pubid><pubid idtype="pmpid" link="fulltext">20057383</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR)</p></title><aug><au><snm>Reed</snm><fnm>J</fnm></au><au><snm>Vo</snm><fnm>T</fnm></au><au><snm>Schilling</snm><fnm>CH</fnm></au><au><snm>Palsson</snm><fnm>B</fnm></au></aug><source>Genome Biology</source><pubdate>2003</pubdate><volume>4</volume><issue>9</issue><fpage>R54.1</fpage><lpage>R54.12</lpage></bibl><bibl id="B17"><title><p>The biomass objective function</p></title><aug><au><snm>Feist</snm><fnm>AM</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>Curr Opin Microbiol</source><pubdate>2010</pubdate></bibl><bibl id="B18"><title><p>The Escherichia coli MG1655 in silico metabolic genotype: Its definition, characteristics, and capabilities</p></title><aug><au><snm>Edwards</snm><fnm>J</fnm></au><au><snm>Palsson</snm><fnm>B</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2000</pubdate><volume>97</volume><issue>10</issue><fpage>5528</fpage><lpage>5533</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.97.10.5528</pubid><pubid idtype="pmcid">25862</pubid><pubid idtype="pmpid" link="fulltext">10805808</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Integrating high-throughput and computational data elucidates bacterial networks</p></title><aug><au><snm>Covert</snm><fnm>MW</fnm></au><au><snm>Knight</snm><fnm>EM</fnm></au><au><snm>Reed</snm><fnm>JL</fnm></au><au><snm>Herrgard</snm><fnm>MJ</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>Nature</source><pubdate>2004</pubdate><volume>429</volume><issue>6987</issue><fpage>92</fpage><lpage>6</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature02456</pubid><pubid idtype="pmpid" link="fulltext">15129285</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Metabolic analysis of adaptive evolution for in silico designed lactate-producing strains</p></title><aug><au><snm>Hua</snm><fnm>Q</fnm></au><au><snm>Joyce</snm><fnm>AR</fnm></au><au><snm>Fong</snm><fnm>SS</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>Biotechnol Bioeng</source><pubdate>2006</pubdate><volume>95</volume><issue>5</issue><fpage>992</fpage><lpage>1002</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bit.21073</pubid><pubid idtype="pmpid" link="fulltext">16807925</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Optknock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization</p></title><aug><au><snm>Burgard</snm><fnm>AP</fnm></au><au><snm>Pharkya</snm><fnm>P</fnm></au><au><snm>Maranas</snm><fnm>CD</fnm></au></aug><source>Biotechnol Bioeng</source><pubdate>2003</pubdate><volume>84</volume><issue>6</issue><fpage>647</fpage><lpage>57</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bit.10803</pubid><pubid idtype="pmpid" link="fulltext">14595777</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Model-driven evaluation of the production potential for growth-coupled products of Escherichia coli</p></title><aug><au><snm>Feist</snm><fnm>AM</fnm></au><au><snm>Zielinski</snm><fnm>DC</fnm></au><au><snm>Orth</snm><fnm>JD</fnm></au><au><snm>Schellenberger</snm><fnm>J</fnm></au><au><snm>Herrgard</snm><fnm>MJ</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>Metab Eng</source><pubdate>2009</pubdate></bibl><bibl id="B23"><title><p>The growing scope of applications of genome-scale metabolic reconstructions using Escherichia coli</p></title><aug><au><snm>Feist</snm><fnm>AM</fnm></au><au><snm>Palsson</snm><fnm>B</fnm></au></aug><source>Nat Biotech</source><pubdate>2008</pubdate><volume>26</volume><issue>6</issue><fpage>659</fpage><lpage>667</lpage><xrefbib><pubid idtype="doi">10.1038/nbt1401</pubid></xrefbib></bibl><bibl id="B24"><title><p>Use of randomized sampling for analysis of metabolic networks</p></title><aug><au><snm>Schellenberger</snm><fnm>J</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>J Biol Chem</source><pubdate>2009</pubdate><volume>284</volume><issue>9</issue><fpage>5457</fpage><lpage>61</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">18940807</pubid></xrefbib></bibl><bibl id="B25"><title><p>High-throughput metabolic flux analysis based on gas chromatography-mass spectrometry derived 13C constraints</p></title><aug><au><snm>Fischer</snm><fnm>E</fnm></au><au><snm>Zamboni</snm><fnm>N</fnm></au><au><snm>Sauer</snm><fnm>U</fnm></au></aug><source>Anal Biochem</source><pubdate>2004</pubdate><volume>325</volume><issue>2</issue><fpage>308</fpage><lpage>16</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ab.2003.10.036</pubid><pubid idtype="pmpid" link="fulltext">14751266</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Metabolic flux analysis of Escherichia coli K12 grown on 13C-labeled acetate and glucose using GC-MS and powerful flux calculation method</p></title><aug><au><snm>Zhao</snm><fnm>J</fnm></au><au><snm>Shimizu</snm><fnm>K</fnm></au></aug><source>J Biotechnol</source><pubdate>2003</pubdate><volume>101</volume><issue>2</issue><fpage>101</fpage><lpage>17</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0168-1656(02)00316-4</pubid><pubid idtype="pmpid" link="fulltext">12568740</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Metabolic pathway analysis: basic concepts and scientific applications in the post-genomic era</p></title><aug><au><snm>Schilling</snm><fnm>CH</fnm></au><au><snm>Schuster</snm><fnm>S</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au><au><snm>Heinrich</snm><fnm>R</fnm></au></aug><source>Biotechnology Progress</source><pubdate>1999</pubdate><volume>15</volume><issue>3</issue><fpage>296</fpage><lpage>303</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/bp990048k</pubid><pubid idtype="pmpid" link="fulltext">10356246</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>Metabolic flux analysis in a nonstationary system: Fed-batch fermentation of a high yielding strain of E. coli producing 1,3-propanediol</p></title><aug><au><snm>Antoniewicz</snm><fnm>MR</fnm></au><au><snm>Kraynie</snm><fnm>DF</fnm></au><au><snm>Laffend</snm><fnm>LA</fnm></au><au><snm>Gonzalez-Lergier</snm><fnm>J</fnm></au><au><snm>Kelleher</snm><fnm>JK</fnm></au><au><snm>Stephanopoulos</snm><fnm>G</fnm></au></aug><source>Metab Eng</source><pubdate>2007</pubdate></bibl><bibl id="B29"><title><p>Kinetic flux profiling for quantitation of cellular metabolic fluxes</p></title><aug><au><snm>Yuan</snm><fnm>J</fnm></au><au><snm>Bennett</snm><fnm>BD</fnm></au><au><snm>Raobinowitz</snm><fnm>JD</fnm></au></aug><source>Nature Protocols</source><pubdate>2008</pubdate><volume>3</volume><fpage>1328</fpage><lpage>1340</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nprot.2008.131</pubid><pubid idtype="pmcid">2710581</pubid><pubid idtype="pmpid" link="fulltext">18714301</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0</p></title><aug><au><snm>Schellenberger</snm><fnm>J</fnm></au><au><snm>Que</snm><fnm>R</fnm></au><au><snm>Fleming</snm><fnm>RMT</fnm></au><au><snm>Thiele</snm><fnm>I</fnm></au><au><snm>Orth</snm><fnm>JD</fnm></au><au><snm>Feist</snm><fnm>AM</fnm></au><au><snm>Zielinski</snm><fnm>DC</fnm></au><au><snm>Bordbar</snm><fnm>A</fnm></au><au><snm>Lewis</snm><fnm>NE</fnm></au><au><snm>Rahmanian</snm><fnm>S</fnm></au><au><snm>Kang</snm><fnm>J</fnm></au><au><snm>Hyduke</snm><fnm>DR</fnm></au><au><snm>Palsson</snm><fnm>B</fnm></au></aug><source>Nature Protocols</source><pubdate>2011</pubdate><volume>6</volume><fpage>1290</fpage><lpage>1307</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nprot.2011.308</pubid><pubid idtype="pmpid" link="fulltext">21886097</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Multiple steady states in kinetic models of red cell metabolism</p></title><aug><au><snm>Edwards</snm><fnm>JS</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>Journal of Theoretical Biology</source><pubdate>2000</pubdate><volume>207</volume><fpage>125</fpage><lpage>7</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/jtbi.2000.2165</pubid><pubid idtype="pmpid" link="fulltext">11027485</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Direction choice for accelerated convergence in hit-and-run sampling</p></title><aug><au><snm>Kaufmann DE</snm><fnm>SR</fnm></au></aug><source>Operations Research</source><pubdate>1998</pubdate><volume>46</volume><fpage>84</fpage><lpage>95</lpage><xrefbib><pubid idtype="doi">10.1287/opre.46.1.84</pubid></xrefbib></bibl><bibl id="B33"><title><p>Large-scale in silico modeling of metabolic interactions between cell types in the human brain</p></title><aug><au><snm>Lewis</snm><fnm>NE</fnm></au><au><snm>Schramm</snm><fnm>G</fnm></au><au><snm>Bordbar</snm><fnm>A</fnm></au><au><snm>Schellenberger</snm><fnm>J</fnm></au><au><snm>Andersen</snm><fnm>MP</fnm></au><au><snm>Cheng</snm><fnm>JK</fnm></au><au><snm>Patel</snm><fnm>N</fnm></au><au><snm>Yee</snm><fnm>A</fnm></au><au><snm>Lewis</snm><fnm>RA</fnm></au><au><snm>Eils</snm><fnm>R</fnm></au><au><snm>Konig</snm><fnm>R</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au></aug><source>Nature Biotechnology</source><pubdate>2010</pubdate><volume>28</volume><fpage>1279</fpage><lpage>1285</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt.1711</pubid><pubid idtype="pmcid">3140076</pubid><pubid idtype="pmpid" link="fulltext">21102456</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Insight into human alveolar macrophage and M. tuberculosis interactions via metabolic reconstructions</p></title><aug><au><snm>Bordbar</snm><fnm>A</fnm></au><au><snm>Lewis</snm><fnm>NE</fnm></au><au><snm>Schellenberger</snm><fnm>J</fnm></au><au><snm>Palsson</snm><fnm>BO</fnm></au><au><snm>Jamshidi</snm><fnm>N</fnm></au></aug><source>Molecular Systems Biology</source><pubdate>2010</pubdate><volume>6</volume><issue>422</issue></bibl><bibl id="B35"><title><p>Bidirectional reaction steps in metabolic networks: III. Explicit solution and analysis of isotopomer labeling systems</p></title><aug><au><snm>Wiechert</snm><fnm>W</fnm></au><au><snm>Mollney</snm><fnm>M</fnm></au><au><snm>Isermann</snm><fnm>N</fnm></au><au><snm>Wurzel</snm><fnm>M</fnm></au><au><snm>de Graaf</snm><fnm>AA</fnm></au></aug><source>Biotechnol Bioeng</source><pubdate>1999</pubdate><volume>66</volume><issue>2</issue><fpage>69</fpage><lpage>85</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/(SICI)1097-0290(1999)66:2&lt;69::AID-BIT1&gt;3.0.CO;2-6</pubid><pubid idtype="pmpid" link="fulltext">10567066</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Elementary metabolite units (EMU): a novel framework for modeling isotopic distributions</p></title><aug><au><snm>Antoniewicz</snm><fnm>MR</fnm></au><au><snm>Kelleher</snm><fnm>JK</fnm></au><au><snm>Stephanopoulos</snm><fnm>G</fnm></au></aug><source>Metab Eng</source><pubdate>2007</pubdate><volume>9</volume><fpage>68</fpage><lpage>86</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ymben.2006.09.001</pubid><pubid idtype="pmcid">1994654</pubid><pubid idtype="pmpid" link="fulltext">17088092</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Determination of metabolic flux ratios from 13C-experiments and gas chromatography-mass spectrometry data: protocol and principles</p></title><aug><au><snm>Nanchen</snm><fnm>A</fnm></au><au><snm>Fuhrer</snm><fnm>T</fnm></au><au><snm>Sauer</snm><fnm>U</fnm></au></aug><source>Methods Mol Biol</source><pubdate>2007</pubdate><volume>358</volume><fpage>177</fpage><lpage>97</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/978-1-59745-244-1_11</pubid><pubid idtype="pmpid">17035687</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Correcting mass isotopomer distributions for naturally occurring isotopes</p></title><aug><au><snm>van Winden</snm><fnm>WA</fnm></au><au><snm>Wittmann</snm><fnm>C</fnm></au><au><snm>Heinzle</snm><fnm>E</fnm></au><au><snm>Heijnen</snm><fnm>JJ</fnm></au></aug><source>Biotechnol Bioeng</source><pubdate>2002</pubdate><volume>80</volume><issue>4</issue><fpage>477</fpage><lpage>9</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/bit.10393</pubid><pubid idtype="pmpid" link="fulltext">12325156</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>