Abstract
Motivation
Within Flux Balance Analysis, the investigation of complex subtasks, such as finding the optimal perturbation of the network or finding an optimal combination of drugs, often requires to set up a bilevel optimization problem. In order to keep the linearity and convexity of these nested optimization problems, an ON/OFF description of the effect of the perturbation (i.e. Boolean variable) is normally used. This restriction may not be realistic when one wants, for instance, to describe the partial inhibition of a reaction induced by a drug.
Results
In this paper we present a formulation of the bilevel optimization which overcomes the oversimplified ON/OFF modeling while preserving the linear nature of the problem. A case study is considered: the search of the best multidrug treatment which modulates an objective reaction and has the minimal perturbation on the whole network. The drug inhibition is described and modulated through a convex combination of a fixed number of Boolean variables. The results obtained from the application of the algorithm to the core metabolism of E.coli highlight the possibility of finding a broader spectrum of drug combinations compared to a simple ON/OFF modeling.
Conclusions
The method we have presented is capable of treating partial inhibition inside a bilevel optimization, without loosing the linearity property, and with reasonable computational performances also on large metabolic networks. The more finegraded representation of the perturbation allows to enlarge the repertoire of synergistic combination of drugs for tasks such as selective perturbation of cellular metabolism. This may encourage the use of the approach also for other cases in which a more realistic modeling is required.
Background
In recent years, genomescale metabolic networks have represented an important paradigm of systems biology, well describing how interesting (and relevant) biological features can be deduced in spite of the complexity of the model [14]. Thanks to the use of genomic techniques, metabolic networks have been reconstructed for many organisms, ranging from small bacteria up to the human cells. In parallel, the development of quantitative descriptions of these large and complex systems based on simple computational framework such as Flux Balance Analysis (FBA) [5,6] has increased both their characterization [79] and the spectrum of applications. Two important examples are (i) strain improvement [10,11], i.e. the identification of the best knockout or gene manipulation maximizing the biosynthesis of a key metabolite, (ii) support to drug discovery through the identification of new inhibition targets [1215] or of new drug therapies for various medical purposes [16,17]. All the studies just mentioned are based on the FBA formalism.
FBA is a linear constraintbased framework for stoichiometric models of metabolic
networks; the network is described by the stoichiometric matrix S = (s_{i,j}), where s_{i,j} represents the stoichiometric coefficient of the ith metabolite in the jth reaction (with i = 1,…,m, j = 1,…,r), and by the reaction fluxes denoted by the vector
(this holds for all the metabolites since the vector v includes all the processes). Thermodynamical constraints and availability of nutrients add further constraints, such as finite lower (L_{i}) and upperbounds (U_{i}) on the fluxes:
The constraints (1) and (2) generate a convex and bounded set
When applications like strain engineering or drug target identification are treated with this formalism, an additional function, Ψ[v(h)], is normally introduced, where h are the variables we can control and which describe, for example, the knockouts or the drug inhibitions inducing a reduction of the set W to W(h) (see later for its more precise definition). The corresponding secondary optimization problem can consist, for instance, of the maximization of the production of a certain metabolite [10] or of the maximal inhibition of a target reaction [16]. The problem becomes therefore a bilevel optimization [20]:
where arg min_{x∈Q}f(x) stands for the set of x for which the function f attains its minimum value in Q (or equivalently, its maximum value for "arg max"). Therefore, (3) says that the output of the bilevel optimization is the vector h such that the corresponding vector v(h), which minimizes Φ on W(h), maximizes the function Ψ. The reformulation of a bilevel optimization as a single optimization is commonly obtained through duality theory [21,22]. When performing this reformulation, in order to save the linear nature of the optimization procedure, the variables h (the real output of the algorithm) have to be Boolean, rather than continuous [2,10,13]. While this approach is correct for gene knockout, in the case of drug treatment (where the enzymes are inhibited by drug) or gene modification (on which one changes the enzymes activity) it represents only a rough approximation which may not constitute a realistic description of the biological effect. It is in fact more plausible to assume that a drug acting on an enzyme leads to a partial loss of functionality of the latter, and hence to a partial inhibition of the corresponding reaction(s). In order to treat the inhibition as a variable to be optimized and to avoid the ON/OFF oversimplification it is necessary to reformulate the bilevel optimization as a nonlinear (nonconvex) single optimization problem [23] but this leads to a more complicated situation from a numerical point of view.
Although a complete inhibition of a diseasecausing target may not represent the right therapeutic solution (in healthy cells, the level of each metabolite must be in a finite range) and although expected to be a potential strategy in a multitarget approach [24], partial inhibition has been considered only in a few computational works; moreover, studies like [25,26] dealt with a small part of the network, modeling the kinetic reactions explicitly and solving then numerically. The partial inhibition then amounts, for instance, to a modulation of one or more kinetic parameters. Because of the complexity of the metabolic networks and because of the impossibility of knowing the kinetic parameters of all biochemical reactions, this approach cannot be applied at a genomewide level. On the other hand, the authors of [27] consider the whole network, but the (partial) inhibition is given as initial fixed parameter of the model and only the effect of the perturbation is quantified. A different approach is presented in [28] in the context of the prediction of new drug targets; these targets are identified though a twostages FBA (which differs from a bilevel formulation because the two optimizations are not nested). However, the potential targets obtained with this method must be verified exhaustively, which may represent a problem for networks with more than the 26 reactions of the human hyperuricemia metabolic pathway considered in [28].
Therefore, the aim of the present paper is to describe a novel algorithm which allows to provide a more realistic description of the partial inhibition induced by the drugs on large networks while still remaining within the framework of Linear Programming (LP). In order to introduce the algorithm, we refer to a realistic case where a bilevel minimization is used. Namely we consider the search for the optimal combination of drugs capable, through a synergistic effect, to inhibit (or enhance) an objective reaction (i.e. a putative target for a disease) while inducing the minimal perturbation on the rest of the network. Indeed, the selectivity of the therapy is one of the most important aspects of any drug discovery project. Replacing a single Boolean variable by a convex combination of a fixed number of Boolean variables, we are able to model the inhibition as any number belonging to a discretized representation of the interval [0,1]. This approach preserves the linear nature of the final problem. Notice that the method we propose can be extended to any bilevel optimization which needs to deviate from the simple ON/OFF description.
The paper is organized as follows. We first formalize our example about drug combinations; then, within this case study, we describe why Boolean variables are necessary in the reformulation of a bilevel optimization problem via the strong duality theorem of LP. The presentation of the basic idea of the proposed algorithm and the discussion of its limits conclude the Methods Section. Results and Discussion sections present and comment the outcomes obtained on a benchmark application to the E.coli core metabolism and to some other larger networks. Final considerations are then reported in the Conclusion.
Methods
Optimal drug combination: a guiding example
In FBA the vector v of the metabolic fluxes is obtained through the optimization of a certain function Φ(v). For unperturbed networks, the production of the macromolecular building blocks for the biomass (the growth rate) is often maximized [18]: we denote by v^{ut} (ut="untreated"; all symbols and variables are listed in Table 1) the reaction fluxes obtained after this optimization. This fluxes can be nonunique [29]: an analysis of the case in which v^{ut} has degenerate values is reported in the Additional file 1. In any case, throughout the paper these unperturbed fluxes are considered as given parameters of the problem. In the following all reactions are irreversible (L_{i} = 0, ∀i = 1,…,r): indeed, by decomposing any reversible reaction in a couple of irreversible reactions, we can always assume that fluxes have nonnegative values.
Table 1. Symbol, value range, meaning and type of all quantities used in the algorithm description
Additional file 1. Nonuniquess. This pdf file presents the behaviour of the algorithm in case of nonunique solution of the untreated network (v^{ut}).
Format: PDF Size: 31KB Download file
This file can be viewed with: Adobe Acrobat Reader
The guiding example we introduce here consists in the search of the most selective
combination of drugs: in particular, we suppose to have a metabolic network and a
set of drugs which inhibit some reactions of this network (the set of targets of the
kth drug is indicated by
where U_{i} is the upperbound of the flux v_{i} and where, for modeling with partial inhibition, h_{k} ∈ [0,1]. Through this formalism we do not consider the allosteric interaction between two (or more) drugs on the same enzyme: indeed, our model simply takes the maximum inhibition over the set of drugs which affect the enzyme of reaction i:
Then, the vector h ∈ [0,1]^{d} represents the drug treatment, i.e. the inhibition due to the drugs: for example, for d = 3, the vector h = [0.5, 0, 0.8] indicates that drug 2 is not used (h_{2} = 0) while drugs 1 and 3 are used at dosages which cause respectively a 50% and 80% inhibition of their targets (hence, in (4), the reduction of their upperbounds to 50% and 20% in the original values). For each choice of h these inhibitions reduce the set W (generated by the constraints (1) and (2)) to a subset W(h):
The determination of the reaction fluxes v^{tr}(h) (tr="treated") for the drugtreated network is obtained through the MOMA problem (Minimization Of Metabolic Adjustment) which has been shown to generate reasonable and realistic results for perturbed metabolism [4,19,3032]. In order to apply the theory of linear programming, we use the definition of MOMA in terms of norm L^{1}[33]. Then
In the following the side effect of a drug treatment is quantified in terms of the distance ∥v^{tr}(h)  v^{ut}∥_{1} used in (5): the greater the distance, the bigger the impact of the drugs on the whole network.
The problem can be stated as follows:
Given:
• a metabolic network, which means a stoichiometric matrix
• the unperturbed fluxesv^{ut};
• the setof drugs together with their inhibition targets
• the index (denoted by "mod") of the objective reaction whose flux (v_{mod}) must be modulated;
• a threshold τ ∈ [0,1) for the modulation constraint on v_{mod};
we want to find the inhibitionh ∈ [0,1]^{d} such that
According to (5), for a given set of drugs (i.e., for a given inhibition vector h), we can calculate both v^{tr}(h) (and then check whether
The bilevel optimization (6) is a minmin linear program. The inner problem adjusts the fluxes so as to achieve the minimal metabolic adjustment, subject to the drug inhibitions imposed by the outer problem and to the stoichiometric constraints. The outer problem selects the combination of drugs which has the minimum side effect and guarantees a modulated flux lower than the desired threshold.
Since we are looking for a minimum, the absolute value operation
The sum of a_{i} (i.e.
Then, the detailed equations of the bilevel optimization (6) are the following:
The strong duality theorem and the need of Boolean variables
This bilevel optimization is commonly solved by applying the strong duality theorem of LP [10,11,34] which states: "Let A be a matrix, and letb andc be vectors. Then
providing that both sets are not empty"[22], where x are the primal variables and θ are the dual variables (note that, because of the use of the transpose of the matrix A in the dual problem, there is a dual variable for each constraint of the primal problem). Therefore, the application of this theorem to the inner problem consists in appending a list of constraints, A^{T} θ ≥ c, corresponding to the dual form of the constraints of the inner problem and setting the inner objective function equal to its dual c^{T}x = b^{T}θ (see Additional file 2 for a depiction of the structure of the matrix eventually obtained). Since, from the strong duality theorem, this equality holds only at the optimal points of both primal and dual problems, the resulting set of constraints is equivalent to selecting only the solutions of the inner problem.
Additional file 2. Matrix of linear constraints. This pdf file provides the details about the construction of the matrix of the final single optmiziation problem, including primal, dual and Boolean variables of the drugs.
Format: PDF Size: 613KB Download file
This file can be viewed with: Adobe Acrobat Reader
This leads to the following single minimization, in which Greek letters refer to dual variables (for clarity, we differentiate them according to the associated constraints of the primal problem, as detailed in Table 1):
where (7a) specifies the objective function of the outer problem; equations (7b)–(7f) refer to the primal inner problem (the constraints of the original inner problem); (7g) and (7h) are the dual constraints; (7i) imposes the outer problem contraint on v_{mod} and (7j) is the duality theorem equality (namely c^{T}x = b^{T}θ). However, this last equation is no longer a linear constraint since it contains the product between the outer problem variable h_{k} and the dual variable δ_{i}; hence, the problem can no longer be solved by a linear optimization. It is common to overcome this complication by restricting the h_{k} variables to Boolean values. In this case, in fact, the nonlinear terms δ_{i}h_{k} can be exactly linearized as follows:
where
The restriction to Boolean variables saves the linear nature of the problem (which however requires now Mixed Integer Linear Programming) but it implies the assumption that drugs can only act as switches on the reactions, or equivalently, that we are considering only an ON/OFF model.
Partial inhibition
In this Section we propose a solution which can still use the duality theorem for solving the bilevel optimization while including the possibility of inducing a partial inhibition of the reactions targeted by the drugs. This requires to create a discretization of the interval [0,1] and to replace the ON/OFF action of each drug with P + 1 Boolean variables describing this discretization (P is a fixed parameter of the problem). For the kth drug (k = 1,…,d) we introduce the set of Boolean variables {x_{k,n}}_{n=0,…,P} and define an inhibition coefficient h_{k} by the following convex combination:
In (9) the integer P is related to the desired accuracy of the [0,1] discretization. Indeed, the factor h_{k} assumes values between 0 and 1 with precision 2^{P}. Notice that for P = 0 we have the ON/OFF model of the previous section. We can replace now (4) with the following inequality:
When the strong duality theorem is applied, the nonlinear terms are the δ_{i}h_{k} products. Expanding the product according to the definition in Eq. (9):
the nonlinearity is now spread over the products δ_{i}x_{k,n} with again x_{k,n} a Boolean variables. Similarly to (8), we can write an equivalent set of linear inequalities:
Notice that any "representation" of partial inhibition values can be used in place of (9). Let us imagine, for instance, that we would like h_{k} to have the same values obtained in the doseresponse experiments for the determination of the half maximal inhibitory concentration (IC_{50}) of drug k (see Figure 1):
(with
with a series of inequalities
Of course, the discretization (9) is the most efficient because, for a given number of Boolean variables (P + 1), it generates the maximum precision (2^{P}). For this reason, in the following we will refer to (9) only.
Figure 1. Constructing the inhibition h from the experimental doseresponse curve: an example. The points of the curve are hypothetical experimental measurements of the effect of the drug on the activity of the enzyme. The discretization of the curve can be used as basis for the discretization of the interval [0,1]: therefore, referring to (10), we may define h_{k} = 0.10x_{k,0} + 0.15x_{k,1} + 0.50x_{k,2} + 0.15x_{k,3} + 0.10x_{k,4}.
Inhibitions and activation of the objective reaction
The evaluation of the effect on the fluxes induced by the drugs is performed through the MOMA formalism. It is known that this approach describes well the spreading across the network of the effect of the perturbation: many processes are downregulated or upregulated in order to adjust and compensate the effect of the perturbation (see for example [4]). For the same reason, a second intervention may amplify the deactivation (recovery) of a certain metabolic function that was downregulated (activated) after the first perturbation [30]. In terms of multiple drug effect, this means that a drug synergism may reinforce both the inhibition and the activation of the reaction fluxes.
In our algorithm, one can selects between these two situations through a different constraint on v_{mod} in the outer problem: indeed, imposing as in the previous Section
(for 0 ≤ τ < 1) the algorithm identifies synergistic inhibitions, whereas requiring
(for τ > 1) the algorithm generates drug interactions that upregulates the objective reaction.
In the following, both versions are applied.
Cases of multiple equivalent solutions (nonuniqueness)
Apart from the nonuniqueness of the unpertubed fluxes v^{ut} analyzed in the Additional file 1, there are also other cases of degeneracy of the outcome of the algorithm, which we present here. Indeed, it is worth noting that the use of the norm L^{1} does not guarantee the uniqueness of the solution: indeed balls in L^{1} and the polytope W(h) are convex but not strictly convex sets. Unfortunately this limit can be overcome only passing to the L^{2} formulation with the consequent loss of the linearity of the problem. However, we expect that such a type of situations are quite rare since they appear only when the hyperplane (or the intersection of some of them) which defines W(h) and which realizes the minimum distance with respect to the vector v^{ut}, is parallel to an edge (or face) of the L^{1}ball (see Figure 2).
Figure 2. Convexity of L^{1} and L^{2}formulations and uniqueness of the solution. The two pictures report the unperturbed fluxes (v^{ut}) in the original set W. When the inhibition h is applied (dotted blue line), the set reduces to W(h). (a) With the L^{1}norm, balls are not strictly convex, therefore cases of multiple equivalent solutions may appear. As one can see, these situations occur only when the hyperplane of W(h) which realizes the minimum distance with respect to v^{ut} is parallel to an edge of the L^{1}ball. (b) Conversely, L^{2} balls are strictly convex and always lead to a unique solution.
Other (more common) cases of multiple equivalent solutions (i.e. solutions with the
same network perturbation ∥v^{ut}v^{tr}∥_{1}) are avoided through a specific correction mechanism. For instance, if there exists
a pair of drugs k and l such that
where the parameter b is chosen small enough (10^{3} in our computations, much smaller than 10^{3}, the common upperbounds of the fluxes) in order to keep this term smaller than the difference in the side effects and therefore not to change the order between nonequivalent solutions.
A similar approach is used to solve the redundancy in the definition of h_{k}. Indeed, since in (9) x_{k,0} and x_{k,1} have the same coefficient, the inhibition h_{k} does not change when swapping the values of these two variables. To avoid this degeneracy, we require that x_{k,0} be one only if all the other x_{k,j} (for j = 1,…,P) are one. This is obtained through the following extra linear constraint:
Since the problem is not strictly convex and since equivalent drug combinations may always appear (for instance, the combination of drug A which targets reaction 1 plus drug B which targets reactions 2 and 3 is equivalent to the combination of drug C which targets reaction 2 plus drug D which targets reactions 1 and 3), the problem of multiple equivalent solutions needs to be considered. In all these cases, because of the numerical implementation of the algorithm, a random choice of one of these optimal solutions is taken. Since, by definition, all these equivalent solutions fulfill the conditions on the objective reaction and on the minimization of the side effect, their differences are irrelevant: indeed they concern only some other fluxes on which we do not have any specific requirements. For this reason any solution chosen by the implementation of the algorithm can be considered acceptable.
Results
Computational performaces
The proposed algorithm has been implemented in MATLAB (2012R) and the optimization has been performed using ILOGIBM CPLEX 12.1 (http://www01.ibm.com/software/commerce/optimization/cplexoptimizer/ webcite) under academic license.
First, the impact of the parameter P on the computational cost is evaluated on the core metabolism of E.coli (see Table 2 for the main features of the corresponding network and the number of drugs that have been selected from online databases): we choose ribose5phosphate isomerase as reaction to be modulated (τ = 0.35) and run the algorithm with different values of P from 0 to 5, recording the computational time required to find the solution. Since there are 8 drugs, the extremal values of P correspond to 8 and 48 Boolean variables in the whole problem. Notice that for P = 5, the accuracy on the definition of h_{k} is quite high (2^{P} = 1/32 < 5%). In addition, we estimate the time needed to perform the evaluation of the inhibitory effect (MOMA) of a single drug combination as an average over 20 random subsets of the 8 available drugs. From this value we can predict the approximate computational cost of an exhaustive search over all possible drug combinations (and dosages). The comparison of the performaces of the algorithm with this estimation is plotted in panel (a) of Figure 3.
Table 2. Main properties of the metabolic networks used in this paper (from BIGG http://bigg.ucsd.edu/ webcite)
Figure 3. Algorithm performances. (a) Computational time of the algorithm compared with an estimation of the exhaustive search for different values of P on the E.coli core metabolism. (b) The influence of the size of the network on the performances is here reported as distribution of the computational time for the 6 metabolic networks listed in Table 2. Each distribution is built over 100 random samples obtained by changing the targets of a fixed number of drugs and keeping the precision parameter and the threshold constant (P = 2 and τ = 0.35). (c): Averaged computational time (same color code of the corresponding distributions reported in panel (b)) and error bar as a function of the size of the network (expressed as number of reactions, see Table 2). All tests have been carried out using the software ILOGIBM CPLEX on a 2.3 GHz CPU.
Moreover, we run the algorithm on the metabolic network of the six microorganisms listed in Table 2. Our scope is to evaluate the impact of the size of the network (parametrized by the number of reactions r) on the computational performaces. In order to limit the interference of other parameters, these calculations are carried out with the same objective reaction (in particular we still keep ribose5phosphate isomerase since it appears on all networks we have considered) at constant precision (P = 2) and threshold (τ = 0.35), with the same number of drugs (d = 8), and choosing their inhibition targets in a random manner (unfeasible problems are ignored). However, since on very large networks it is quite unlikely to induce the sought modulation on the objective reaction when only 8 targets are inhibited, the number of targets of each drug is proportionally increased (on average the total number of inhibitions is approximately 6% of the total number of reactions). Because of the randomness in the choice of the targets, the computational times may present a significant variation. Therefore, Figure 3 (b) shows the whole distributions of the computational time over 100 runs for each one of the six metabolic networks we considered. Finally, Figure 3 (c) reports the mean and the standard deviation of these distributions as a function of the size of the network. On average, also for very large networks, the computational time is approximately one hour (on a 2.3 GHz CPU). All these characterizations show the good performaces of the algorithm.
Screening for optimal drug combinations
The main scope of these calculations is to show the advantage given by the use of value of P higher than zero, i.e. of passing from the ON/OFF to a more accurate description. In order to better characterize its behavior (performing a large number of tests), we run the algorithm on the small network of the core metabolism of E.coli. A sketch of this network is depicted in Figure 4.
Figure 4. Network representation of E. coli core metabolism. This reconstruction refers to the work of Orth et al.[35] and maintains the same notation. Metabolites are in ellipses (yellow color: external nutrients; blue color: cytoplasmatic compounds) and reactions in boxes (dark gray for irreversible and light gray for reversible processes). Nodes highlighted with red border belong to Krebs cycle. For a better readability, common species like water, Oxygen, H^{+}, Ammonium, CO_{2}, phosphate, NAD (in all forms), CoenzymeA, AMP, ADP and ATP, as well as all transport and exchange processes, have been excluded from the representation.
A set of tests have been carried out combining different values of τ and P, in particular:
For each pair, we perform a screening that considers each metabolic reaction as objective process to be modulated (down or upregulated depending on the value of τ) and finds the most selective drug combination. The following characterization of the solutions is performed. For a given P and for a given objective reaction v_{mod}, we consider the solutions h at different values of τ. When the same drug combination is found for two values of τ (for example τ_{1} = 0.1 and τ_{2} = 0.5), the solution is considered valid only for the most stringent constraint (τ_{1} = 0.1, in the example; similarly, if τ_{1} = 1.5 and τ_{2} = 2.0 then the solution is associated to τ_{2} = 2.0 only). This procedure allows to considered only cases when passing to a weaker constraint on v_{mod} the severity (for instance the dosage) of the corresponding optimal drug treatment is reduced too. We analyze the results by looking at the following four indices.
Number of solutions: Figure 5 (a) shows the total number of solutions we have found in the screening of all reactions at different P and τ. One can see that, when a complete stop of the objective reaction is required (τ = 0) there is no significant advantage in increasing the precision P. However, when it is necessary to induce a more accurate modulation of the flux (inhibitory when 0 < τ < 1), higher values of P allow to find a larger number of solutions. Through the partial inhibition, indeed, we can find solutions which are closer to the desired threshold, whereas the simple ON/OFF model can mostly induce a complete stop of the objective reaction. A similar improvement can be also identified while passing from τ = 1.5 to τ = 2.0.
Figure 5. Effect of the precision parameter P (results from the reactions screening). (a) The plots show the number of solutions that are found for different values of τ (see legend) as function of P (color codes as in the following plots). As one can see, for τ > 0, increasing the precision P, it is nearly always more frequent to find a drug combination which induces the sought modulation. (b) The plot is a detail of the curve in panel (a) for the case τ = 0.5, and shows the frequency of the drug cardinality of the solutions (i.e. the number of drugs used in the solution). For larger P, the distribution is slightly shifted to higher cardinality (data for each τ are reported in Table 3). (c) Beside the number of solutions, higher precision produces more selective outcomes, i.e. with a lower side effect (as clearly shown by the mean values reported in the legend). The counting is performed over all different values of the threshold τ. (d) This plot shows the histogram of the values of the nonlinearity index η(h) (as reported in (11)) calculated for all the solutions of the screening (still regardless the value of τ). From the curves and from the mean values reported in the legend, it is possible to see that higher amount of nonlinearity are obtained when P is increased.
Table 3. Averaged cardinality of the drug combinations from the screening
Cardinality of the solutions: More details are presented in panel (b) of Figure 5, which reports the histogram of the cardinality of the solutions and their mean values for the case of τ = 0.5 (averages for each value of τ are reported in Table 3). When the precision increases, the distribution of the cardinality shifts slightly to higher values, meaning that multiple drug treatments are (slightly) preferred.
Perturbation induced by the solutions: For each solution that we have identified during this screening, also the corresponding perturbation (i.e. the side effect ∥v^{tr}v^{ut}∥_{1}) can be evaluated. We calculate the frequency of these perturbation values (regardless of the value of τ). The result is shown in Figure 5 (c). We notice that at higher precision, smaller perturbations become slightly more probable: as expected, for high values of P, the algorithm can modulate the inhibition more accurately and therefore reduce the impact on the network, while still satisfying the request on the flux of the objective reaction.
Nonlinearity exploited by the solutions: The interaction between drugs is normally interpreted as the deviation of the effect
of combined drugs with respect to the linear superposition of the single drug perturbations.
Therefore, similarly to the scaled epistasis measure presented in literature
[42], a index of nonlinearity η(h) can be defined on the basis of the flux of the objective reaction as follows. Let
since it holds
Drug interaction surfaces: three case studies
For three of the solutions found through the screening procedure, we detail now the drug interactions exploited by the algorithm. In particular, we considered the synergisms in the inhibition of transketolase and of ribose5phosphate isomerase, and the synergism in the upregulation of glutamate dehydrogenase. Each of the first two solutions contains a pair of drugs (Fomepizole plus Halofantrine and Fomepizole plus Hexachlorophene respectively). We explore the drug interaction surface changing the amount of inhibition induced by each compound, as could correspond in experiments to using different drug dosages (the interval [0,1] has been discretized using (9) with P = 4). The 2D surfaces are reported in Figure 6 panels (a) and (d). In the third case (upregulation of glutamate dehydrogenase, panel (g)) the synergism is obtained combining three drugs (Nitrofurazone, Halofantrine and Pemetrexen); therefore, in order to have the 2D surface of interaction, the first drug is kept at the optimal inhibition value (h_{1} = 1) and the combinations are explored changing the dosages of the remaining two drugs. Figure 6 reports also the nonlinearity index η(h) as defined in (11); panels (b), (e) and (h) show that in all cases there is a clear enhancement of the effect when the drugs are combined.
Figure 6. Surfaces of drug interaction. Axes x and y report the inhibition coefficients h (dosage) for the two drugs; the zaxis reports the percentage of the flux through the modulated reaction after drug treatment calculated with L^{1} formulation of MOMA with respect to the untreated value (left panels a, d and g), or the relative deviation, η(h), of the effect induced by the drug combination from the linear superposition of the effects due to the single drugs as expressed by (11) (middle column, panels b, e and h). The right panels (c, f and j) show the calculations made with MOMA based on the L^{2}norm. The higher smoothness of the surface makes this formulation more reliable: however the surfaces obtained with our method (L^{1}norm) well reproduce the main characteristic of the synergism between the drugs. In the first two rows inhibitory synergisms are shown (τ < 1), whereas the third row is an example of activating synergism (τ > 1). Notice that for the sake of readability, in the plots of the last row, both x and y axes have been inverted.
These calculations has been performed with MOMA based on the L^{1}norm because, as already mentioned, it allows a definition of a linear function in the optimization problem. When compared to the surfaces obtained from the original quadratic formulation of MOMA (last column: panels (c), (f) and (j) of Figure 6), the results from L^{1}norm show some irregularity of the surfaces (which makes the original L^{2} version more reliable) but the main features of the drugdrug interaction are still well described.
Discussion
Optimization is a concept widely used in many scientific fields; for instance, in systems biology, FBA makes use of it for discriminating reaction fluxes in large metabolic networks. Following the same philosophy, in order to cope with more complex situations, multiple optimization criteria can be needed simultaneously leading in some situations, like the one discussed in this paper, to a bilevel optimization problem. The bilevel approach is promising for studying several features and applications of metabolic networks, for instance for identifying metabolic objective functions [23] or for studying perturbations around a nominal optimum [10,11]. In the context of drug combinatorics, in order to efficiently solve the bilevel optimization, Boolean variables are commonly used in the outer problem. However, this ON/OFF description of the corresponding biological quantities may represent a very rough approximation, as it is the case for the (partial) inhibition induced by drugs acting on the enzymes of a metabolic network.
In order to overcome this limitation, we propose an improvement on the formulation of the bilevel optimization in which a single Boolean variable is replaced by a convex combination of several Boolean quantities: in this manner the convex and linear nature of the problem is preserved and the description of the inhibitory effects becomes more realistic. Since the problem contains Boolean variables, the optimization falls in the Mixed Interger Linear Programming (MILP) class: compared to LP, the NPhard complexity of MILP [22] makes the new algorithm more expensive from a computational point of view. For the tasks at hand (see Figure 3), the algorithm behaves well also for large metabolic networks. The logarithm of the computational time scales linearly with the number of reactions, but with a small slope, so that on average the solution is found in a reasonable computational time, also for networks with around 2500 reactions and for P = 2.
For testing purposes, we run the algorithm on the central carbon metabolism of E.coli screening all reactions. We have found that increasing the number of Boolean variables used in the convex combination (the precision parameter P), it is more likely to find a solution which succeeds on the modulation of the objective reaction (see Figure 5 (a)). In particular, partial inhibitions (i.e. modulations of the dosage of the drugs) are more frequent for multicomponent solutions (panel (b) of Figure 5): this result may be interpreted as a wider possibility, offered by the synergism, to calibrate a drug treatment according to the specific needs. Moreover our computations represent a confirmation on large networks of the expected, but still not verified, higher efficiency of multiple targets drug treatments in presence of partial inhibition [24]. In this perspective, the results show that this approach may also lead to treatments which are more selective (panel (c) of Figure 5 and Table 3).
A possible explanation can be found in the unexpected or hardly predictable drug synergism which are typical of complex systems such as metabolic networks, even in a simplified framework like FBA. In particular, concerning the synergistic interactions between drugs, the analysis done through the drugdrug interaction surface (Figure 6) reveals that nonlinear effects, not explained by superposition of the single drug perturbation, are significant and can be captured and exploited by the method proposed, unlike with a more coarsegrained ON/OFF description. We should mention, that the three case studies presented in Figure 6 do not pretend to have any clinical value: they have been selected only for the purpose of illustrating the method and the advantages it may give in the context of drug synergism and drug reprofiling for reconstructed metabolic networks.
Conclusion
The purpose of this paper is to present a novel algorithm, able to relax the assumption on the variables of a bilevel optimization problem from ON/OFF type to more finegraded description. This setting is of interest in the context of FBA of metabolic networks and in particular in the modulation of the fluxes by means of drugs, capable of reducing (but not suppressing completely) the activity of the metabolic enzymes on which they are acting. With our algorithm, the problem can be formulated as a MILP problem of moderate practical complexity. Indeed we have shown that the algorithm performs well also on large metabolic networks at a more finegraded level of resolution than an ON/OFF representation. Furthermore, it is capable of exploiting with higher efficiency the peculiar nonlinearity which originates from the topology of the network, of finding more selective solutions and, therefore, of offering a larger spectrum of drug combinations. These features become more evident when the modulation required for the objective reaction is itself a fraction (τ ≠ 0) of the nominal flux, rather than a simple complete switch off (τ = 0). Indeed, if a disease corresponds to an anomalous biosynthesis of certain compounds, most often the cure consists in regulating back those fluxes to an healthy range, not to a complete inhibition.
It is worth noting that the problem of drug synergism we presented in this paper must be read as a guiding example for a more general class of situations: indeed, the idea we have proposed for treating bilevel optimization can be applied to any other case which requires a more realistic modeling with respect to the oversimplified ON/OFF description, in biology as well as in all the other fields where LP is already used.
Abbreviations
FBA: Flux balance analysis; MOMA: Minimization of metabolic adjustment; LP: Linear programming; MILP: Mixed integer linear programming.
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
GF developed the method and performed the calculations. CA and GF participated at the discussions of the results, have been involved in drafting the manuscript and approved the final manuscript’s preparation.
Acknowledgments
We would like to thank the Reviewers for their valuable and costructive comments; we thank also Mattia Zampieri (ETH Zurich) for usefull discussion on the method. This work is supported by a grant from MIUR (Ministero dell’Istruzione, Università e Ricerca).
References

Bordbar A, Feist A, UsaiteBlack R, Woodcock J, Palsson B, Famili I: A multitissue type genomescale metabolic network for analysis of wholebody systems physiology.
BMC Syst Biol 2011, 5:180. PubMed Abstract  BioMed Central Full Text  PubMed Central Full Text

Folger O, Jerby L, Frezza C, Gottlieb E, Ruppin E, Shlomi T: Predicting selective drug targets in cancer through metabolic networks.
Mol Syst Biol 2011, 7:501. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Snitkin E, Segrè D, Mackay T: Epistatic interaction maps relative to multiple metabolic phenotypes.
PLoS Genetics 2011, 7(2):e1001294. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Cornelius S, Lee J, Motter A: Dispensability of Escherichia coli’s latent pathways.
Proc Natl Acad Sci 2011, 108(8):3124. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Raman K, Chandra N: Flux balance analysis of biological systems: applications and challenges.
Brief Bioinform 2009, 10(4):435449. PubMed Abstract  Publisher Full Text

Lee J, Gianchandani E, Papin J: Flux balance analysis in the era of metabolomics.
Brief Bioinform 2006, 7(2):140150. PubMed Abstract  Publisher Full Text

Jeong H, Tombor B, Albert R, Oltvai Z, Barabási A: The largescale organization of metabolic networks.
Nature 2000, 407(6804):651654. PubMed Abstract  Publisher Full Text

Deutscher D, Meilijson I, Kupiec M, Ruppin E: Multiple knockout analysis of genetic robustness in the yeast metabolic network.
Nat Gen 2006, 38(9):993998. Publisher Full Text

Klamt S, Gilles E: Minimal cut sets in biochemical reaction networks.
Bioinformatics 2004, 20(2):226234. PubMed Abstract  Publisher Full Text

Burgard A, Pharkya P, Maranas C: OptKnock: A bilevel programming framework for identifying gene knockout strategies for microbial strain optimization.
Biotechnol Bioeng 2003, 84(6):647657. PubMed Abstract  Publisher Full Text

Kim J, Reed J: OptORF: Optimal metabolic and regulatory perturbations for metabolic engineering of microbial strains.
BMC Syst Biol 2010, 4:53. PubMed Abstract  BioMed Central Full Text  PubMed Central Full Text

Jamshidi N, Palsson B: Investigating the metabolic capabilities of Mycobacterium tuberculosis H37Rv using the in silico strain iNJ661 and proposing alternative drug targets.
BMC Syst Biol 2007, 1:26. PubMed Abstract  BioMed Central Full Text  PubMed Central Full Text

Huthmacher C, Hoppe A, Bulik S, Holzhutter H: Antimalarial drug targets in Plasmodium falciparum predicted by stagespecific metabolic network analysis.
BMC Syst Biol 2010, 4:120. PubMed Abstract  BioMed Central Full Text  PubMed Central Full Text

Bazzani S, Hoppe A, Holzhütter H: Networkbased assessment of the selectivity of metabolic drug targets in Plasmodium falciparum with respect to human liver metabolism.
BMC Syst Biol 2012, 6:118. PubMed Abstract  BioMed Central Full Text  PubMed Central Full Text

Chavali A, Hewlett E, Pearson R, Papin J, et al.: A metabolic network approach for the identification and prioritization of antimicrobial drug targets.
Trends Microbiol 2012, 20:113123. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Suthers P, Zomorrodi A, Maranas C: Genomescale gene/reaction essentiality and synthetic lethality analysis.

Facchetti G, Zampieri M, Altafini C: Predicting and characterizing selective multiple drug treatments for metabolic diseases and cancer.
BMC Syst Biol 2012.
doi:10.1186/175205096115

Palsson B, Varma A: Metabolic capabilities of Escherichia coli II: optimal growth pattern.
J Theor Biol 1993, 165:503522. Publisher Full Text

Segré D, Vitkup D, Church G: Analysis of optimality in natural and perturbed metabolic networks.
Proc Natl Acad Sci USA 2002, 99(23):1511215117. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Dempe S: Foundations of bilevel programming. New York: Kluwer Academic Publisher; 2010.

Matoušek J, Gärtner B: Understanding and Using Linear Programming. Berlin: Springer; 2000.

Schrijver A: Theory of Linear and Integer Programming. New York: John Wiley & Sons; 1986.

Burgard A, Maranas C: Optimizationbased framework for inferring and testing hypothesized metabolic objective functions.
Biotechnol Bioeng 2003, 82(6):670677. PubMed Abstract  Publisher Full Text

Csermely P, Agoston V, Pongor S: The efficiency of multitarget drugs: the network approach might help drug design.
Trends Pharmacol Sci 2005, 26:178182. PubMed Abstract  Publisher Full Text

Peng H, Wen J, Li H, Chang J, Zhou X: Drug inhibition profile prediction for NFκB pathway in multiple myeloma.
PloS one 2011, 6(3):e14750. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Salvador A: Synergism analysis of biochemical systems. I. Conceptual framework.
Math Biosci 2000, 163(2):105129. PubMed Abstract  Publisher Full Text

Holzhütter H: The generalized fluxminimization method and its application to metabolic networks affected by enzyme deficiencies.
Biosystems 2006, 83(2):98107. PubMed Abstract  Publisher Full Text

Li Z, Wang R, Zhang X: Twostage flux balance analysis of metabolic networks for drug target identification.
BMC Syst Biol 2011, 5(Suppl 1):S11. BioMed Central Full Text

Mahadevan R, Schilling C, et al.: The effects of alternate optimal solutions in constraintbased genomescale metabolic models.
Metab Eng 2003, 5(4):264. PubMed Abstract  Publisher Full Text

Motter A, Gulbahce N, Almaas E, Barabási A: Predicting synthetic rescues in metabolic networks.
Mol Syst Biol 2008, 4:168. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Boghigian BA, Armando J, Salas D, Pfeifer BA: Computational identification of gene overexpression targets for metabolic engineering of taxadiene production.
Appl Microbiol Biotechnol 2012, 93(5):20632073. PubMed Abstract  Publisher Full Text

Snitkin ES, Segrè D: Epistatic interaction maps relative to multiple metabolic phenotypes.
PLoS Genet 2011, 7(2):e1001294. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Becker S, Feist A, Mo M, Hannum G, Palsson B, Herrgard M: Quantitative prediction of cellular metabolism with constraintbased models: the COBRA Toolbox.
Nat Protoc 2007, 2(3):727738. PubMed Abstract  Publisher Full Text

Ranganathan S, Suthers P, Maranas C: OptForce: an optimization procedure for identifying all genetic manipulations leading to targeted overproductions.
PLoS Comput Biol 2010, 6(4):e1000744. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Orth J, Fleming R, Palsson B: Reconstruction and use of microbial metabolic networks: the core Escherichia coli metabolic model as an educational guide.
EcoSal 2009,.
doi: 10.1128/ecosal.10.2.1

Becker S, Palsson B: Genomescale reconstruction of the metabolic network in Staphylococcus aureus N315: an initial draft to the twodimensional annotation.
BMC Microbiol 2005, 5(1):8. PubMed Abstract  BioMed Central Full Text  PubMed Central Full Text

Feist A, Scholten JCM, Palsson B, Brockman F, Ideker T: Modeling methanogenesis with a genomescale metabolic reconstruction of Methanosarcina barkeri.
Molecular Syst Biol 2006., 2
doi:10.1038/msb4100046

Pinchuk G, Hill E, De Ingeniis J, Zhang X, Osterman A, et al.: Constraintbased model of Shewanella oneidensis MR1 Metabolism: a tool for data analysis and hypothesis generation.
PLoS Comput Biol 2010, 6:e1000822. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Edwards JS, Palsson BO: The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities.
Proc Natl Acad Sci USA 2000, 97(10):55285533,.
[ http://www.pnas.org/content/97/10/5528.abstract webcite]
PubMed Abstract  Publisher Full Text  PubMed Central Full Text 
Raghunathan A, Reed J, Shin S, Palsson B, Daefler S: Constraintbased analysis of metabolic capacity of Salmonella typhimurium during hostpathogen interaction.
BMC Syst Biol 2009, 3(1):38. PubMed Abstract  BioMed Central Full Text  PubMed Central Full Text

Knox C, Law V, Jewison T, Liu P, Ly S, Frolkis A, Pon A, Banco K, Mak C, Neveu V, et al.: DrugBank 3.0: a comprehensive resource for 'omics’ research on drugs.
Nucleic Acids Res 2011, 39(suppl 1):D1035—D1041. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Segré D, De Luna A, Church G, Kishony R: Modular epistasis in yeast metabolism.
Nat Genet 2005, 37(1):7783. PubMed Abstract  Publisher Full Text