Abstract
The Steady State (SS) probability distribution is an important quantity needed to characterize the steady state behavior of many stochastic biochemical networks. In this paper, we propose an efficient and accurate approach to calculating an approximate SS probability distribution from solution of the Chemical Master Equation (CME) under the assumption of the existence of a unique deterministic SS of the system. To find the approximate solution to the CME, a truncated statespace representation is used to reduce the statespace of the system and translate it to a finite dimension. The subsequent illposed eigenvalue problem of a linear system for the finite statespace can be converted to a wellposed system of linear equations and solved. The proposed strategy yields efficient and accurate estimation of noise in stochastic biochemical systems. To demonstrate the approach, we applied the method to characterize the noise behavior of a set of biochemical networks of ligandreceptor interactions for Bone Morphogenetic Protein (BMP) signaling. We found that recruitment of type II receptors during the receptor oligomerization by itself doesn't not tend to lower noise in receptor signaling, but regulation by a secreted cofactor may provide a substantial improvement in signaling relative to noise. The steady state probability approximation method shortened the time necessary to calculate the probability distributions compared to earlier approaches, such as Gillespie's Stochastic Simulation Algorithm (SSA) while maintaining high accuracy.
Introduction
Many biological networks exhibit stochasticity due to a combinatorial effect of low molecular concentrations and slow system dynamics. One important biological context where stochastic events likely have a large impact is the Bone Morphogenetic Protein (BMP) signaling pathway. BMPs make up the largest subfamily of the Transforming Growth Factorβ superfamily and are involved in numerous processes including growth, differentiation and diseases [1]. Due to their potency at driving development, they are also of great value for stemcell differentiations in cell culture. BMPs activate near maximal signaling at 1nM concentration, have very slow binding kinetics and require oligomerization between multiple receptor subunits [1]. These properties naturally lead to conditions for significant and longduration stochastic fluctuations in cellular signaling. Interestingly, variability of BMP signaling appears to be very low in vivo, while it is very high in stem cell culture studies [2]. To understand the differences between in vivo and in vitro signaling and determine how various receptor oligomerization events might alter the signal and noise, a more efficient means of solving the steady state distributions for stochastic model was needed that would allow for continuation of both parameters and levels of the BMP pathway components.
Stochastic regulation can negatively impact the robustness of the system [3,4] or instead, constructively contribute to the phenotypic variation [57] in a species. In stochastic reaction networks, the state of a species traverses different trajectories in a probabilistic manner and the distributions of states can be difficult to predict. As more biological data is available, stochastic modeling is becoming increasingly popular to estimate properties in networks where the time evolution of the system is unpredictable and dependent on unavoidable randomness inherent to the system. The complete solution can be calculated from a Chemical Master Equation (CME) [810], that is based on a Markovian approach that captures the inherent randomness of biochemical systems.
The Chemical Master Equation (CME) describes the dynamics of the probability distribution of a species of chemical reactions. Precisely, the CME captures the rate of change of probability that a system will be in state X at time t for all the species of the system. Solution of the CME is practically intractable due to the curse of dimensionality, as the statespace of the system becomes enormously large with increases in the species number and concentrations (number of states n^{N}, for N → species, n → copies of each species). Moreover, the system often involves interactions between different timescales (slow and fast reactions, frequent and infrequent transitions between states) [11], which add further complexity. Instead, numerical approaches are commonly used [1214] to realize the CME of the stochastic system.
In the analysis of stochastic biochemical networks, steady state probability distributions for each species in the system are determined to measure variability about the deterministic steady state. The deviation around the solution contributes to stochastic noise that can be quantified by measuring the coefficient of variation (the ratio between the standard deviation σ and the mean level of species concentration μ) obtained by solving the CME [9].
Frequently, Montecarlo based simulation approaches [9,13] are used to solve stochastic problems. But, there are drawbacks in this approach for networks where the dynamics of different intermediate states of the system are unknown and continuation of several parameters is necessary to explore the system's dependency. As a screen of parameter values becomes necessary in such a scenario, the Montecarlo based approach doesn't prove to be efficient, as it generally takes longer time to numerically simulate the process and satisfy the imposed conditions. Moreover, simulation times increase with increases in the total number of molecules, species and the number of interactions between species.
In order to ameliorate the computational cost and complexity, we present a method here to approximate the steady state probability distribution by 1) reducing the system's statespace to a finite dimension using truncated statespace method [15] and 2) subsequently, translating an eigenvalue problem associated with a CME to a system of linear equations. We illustrate that the eigenvector (for an eigenvalue = 0) that represents the steady state probability distribution can be solved algebraically by approximating it as a system of linear equations. Previously, the influence of stochastic fluctuations on system behavior was studied also in [16], where a moment closure approach was applied to reduce the complexity associated with the identification of distributions. In contrast to the previous studies, here we use a truncated statespace for steady state probability distribution approximation, which is arguably more general since we make no assumptions on the relationships of the moments of the distribution.
The usefulness of the proposed method is illustrated by considering the example networks of BMP signaling, as described earlier in [17]. Here we examine two potential mechanisms of BMP signaling: 1) regulation between the type I and type II receptors, and 2) regulation by secreted cofactors, such as Crossveinless2 (Cv2). First, we apply the approach to the recruitment of Type II receptor into a BMPbound type I receptor complex to see if such step of receptor oligomerization contributes qualitatively to the noise profile of the system. Following this, we extend our earlier work that focused on extracellular regulation of BMP signaling by SBPs to evaluate the calculation approach and compare results to the Type I/Type II receptor system.
Unlike SBPs, which tend to improve the signal to noise ratio, we did not see a significant change in stochastic variability with inclusion of Type II receptors. This result supports a previous assumption made in [4] where the recruitment of Type II receptor was excluded to simplify the modeling steps while characterizing the noise profile of a SBP regulated BMP signaling system.
Methods
Proposed approximation method
The Chemical Master Equation (CME), which is a set of first order differential (ODE) equations, demonstrates loss and gain of probabilities of discrete states of a system [10] and is often applicable to represent the stochasticity of the system. Consider a wellstirred system at thermal equilibrium of N different species {S_{1}, S_{2}, . . . , S_{N}} with {X_{1}, X_{2}, . . . , X_{N}} molecules respectively, participating in a total of M biochemical reactions R_{μ}, where μ = 1, 2, . . . M. The state of such a system is represented by the copy number (X_{n} molecules) of each species (S_{n}) at any given time t and is represented as X = [X_{1}(t), X_{2}(t), . . . , X_{N}(t)]. Unless a nonzero initial state is assigned, the default initial species concentrations are always zero (X_{n}(t = 0) = 0, where 1 ≤ n ≤ N).
Two other quantities are further required to construct the system: 1) a statechange vector ν_{μ }and 2) propensity functions [8,12,14,18] for the reactions R_{μ}, μ = {1, 2, . . . , M}. The statechange vector ν_{µ }for reactions R_{μ }is defined as ν_{μ }= [ν_{1μ}, ν_{2μ}, . . . , ν_{Nμ}]^{T}, where ν_{nμ }represents the change in concentration of species S_{n}, caused by the occurrence of R_{μ }reaction of the underlying biochemical system. These equations fully define the system and the time evolution of the probability function P(X, t) can be obtained by the solution of the Chemical Master Equation(CME) [8,14,18]:
Here, [(X +ν_{μ}) ∈ Ω] is 1 if X +ν_{μ }∈ Ω and 0 otherwise. The CME representing the rate of change of probability P(X, t) in an in finitely large statespace X ∈ Ω is given by taking Ω to be the nontruncated space: Ω = ℕ^{N}, ℕ = {0, 1, 2 . . .}
In Eq.1, a_{μ }represents the propensity function to account for transition from a given state X to any other state, and ν_{μ }indicates the stoichiometry of the reaction μ that results in such a transition. Eq.1 is a linear system of differential equations and may be rewritten as follows:
where P is the probability distribution P(X, t) for a vector X = [X_{1}, X_{2}, . . . , X_{N}] and L is the time independent connection operator. For the steady state (SS) distribution P_{ss}, we have:
We assume that the deterministic steady state (SS) is unique. The nontruncated statespace Ω can be replaced with a truncated statespace [15,19] to approximate the probability distribution P(X, t). We define the truncated space as:
where α_{i }and β_{i }are extendable left and right boundaries of the truncated statespace. This approach is similar to that in [20], in which it is shown that the approximation based on the truncated space converges to the true steady state distribution as the limits of the truncated statespace converge to the limits of the original space.
The truncated statespace representation implies that given some ε > 0, for a sufficiently large β_{i }> 0 and sufficiently small α_{i }≥ 0, the steady state probability distribution P_{ss }(X) is approximated to within ε:
For an optimal SS probability approximation, ε should be made as small as possible. In the truncated statespace, Eq.3(iii) is represented as:
where is a matrix of propensities in . To get the entries in we use Eq.1 modified so that . In the truncated statespace , Eq.5 is an eigenvalue problem for eigenvalue λ = 0 and the eigenvector can be obtained algebraically, or with an iterative algorithm for a large, sparse matrix .
Instead of finding the eigenvector, which can be an illconditioned problem when there are nonzero eigenvalues close to 0, we translate the problem to a wellconditioned system of linear equations as follows.
We first evaluate the deterministic steady state (Y_{0}) of the system, and then select state X_{0} of the discrete system closest to this deterministic steady state, where X_{0 }= round(Y_{0}). Taking to be the solution of Eq. 5 and using the fact that the deterministic steady state solution is unique, we observe that (X_{0}) is among the largest entries of . The states in are labeled as 1, 2, . . . , K with state X_{0} denoted by j.
Then
where , k = 1 . . . K, . With Eq.5 now becomes . Let be the k^{th }column of . Expanding by column and rearranging gives the following wellconditioned problem:
In Eq.7, is the matrix with column j removed and is with entry removed. The error criterion for the system is checked for the calculation of until a satisfactory value is obtained (see algorithm 1 for further details).
Application
In order to demonstrate the usability of the proposed steady state probability approximation method, we present here two example networks (example network 1 and 2) from Bone Morphogenetic Protein(BMP) mediated signaling, and characterize the stochastic behavior of the systems. In the example network 1, we consider the role of a specific extracellular protein, Crossveineless2(Cv2), which is part of a class of proteins known as Surfaceassociated BMPbinding Proteins (SBPs) [4]. Cv2 has the ability to regulate the stochastic noise in BMP signaling, and in this example we demonstrate that the role of Cv2 is heavily dependent on reaction kinetics of the network: for some sets of parameter values, Cv2 increases the coefficient of variation of the steady state signaling distribution, while for other parameter values it decreases the coefficient of variation.
In the second example network, we consider a model simplification strategy as used in [4,17]. This strategy is to omit a Type II receptor recruitment step from the receptor oligomerization in a BMP patterning model, under the assumption that the simplification step does not affect the outcome of a BMPmediated patterning model. The obtained results by the use of steady state probability approximation method provide a numerical justification for the aforementioned simplification.
Background
During embryonic development, positional information is transduced by morphogens to underlying cells that respond to the concentration gradient of morphogen and eventually differentiate into distinct cell types [21]. For example, Decapentaplegic (Dpp), a drosophila homologue of BMP2/4, forms a spatial profile to pattern dorsal tissues in Drosophila development [21]. In a canonical BMP signaling pathway, dimeric ligands bind to receptors and form a heterotetrameric complex that consists of two Type I and two Type II receptors. The heterotetrameric receptor complex then phosphorylates the intracellular signal transducer Smad and the phosphorylated Smad forms a complex with a coSmad. Subsequently, the Smad/CoSmad complex accumulates in the nucleus and regulates gene expressions in a concentration dependent manner [17,22].
BMP regulation occurs at many points along the pathway, and a lot of focus has been on identifying and understanding how the ligand activity is regulated in the extracellular environment by secreted binding proteins. These include molecules such as Cv2, HSPGs, among other reviewed in [1]. A focus of this work is to gain a better understanding of how regulation in the extracellular region impacts cell signaling noise, and eventually celltocell variability.
Example networks
In many biochemical networks, where dynamics of the intermediate interactions of different species (proteins) and molecular complexes are largely unknown, screening plays a significant role in the classification of dynamicsdependent network behavior. For example:
1. In a biochemical network where a class of secreted, surfaceassociated BMP binding proteins (SBPs) such as, Crossveinless2 (Cv2, node D as in Figure 1) [4] is allowed to regulate BMP signaling, the intermediate dynamics of the system that result in the formation and decoupling of a transient state BMP:Type I:Cv2 (node M as in Figure 1) are largely unknown.
Figure 1. Example network of BMP signaling. BMP signaling is mediated by
 B
 R
2. In the patterning modeling of BMP signaling pathways, it is often argued as a simplification strategy that omitting the step of recruitment of a Type II receptor to a bound BMP:Type I receptor complex doesn't affect the outcomes of patterning models [4,17,23,24]. While valid in the deterministic sense, it is not clear how this reduction impacts our estimates for noise in the sytem.
In these systems, we apply our SS probability approximation method to evaluate the probability distribution of different species and calculate the mean (μ), standard deviation (σ) and the coefficient of variation (; defined as the ratio between the standard deviation and the mean of any species) of the species distribution. Together with this information, we can screen the network for largely unknown dynamics of the intermediate interactions and classify solutions according to a model's ability to meet specific performance objectives.
BMPsignaling regulation by SBPs
Signaling network
The singlecell local stochastic model that includes extracellular BMP(A), receptors (B), and SBPs such as, Cv2 (D) with biochemical interactions, rate parameters, and connectivity is based on the network shown in Figure 1. Mass balance equations are listed below:
Out of all complexes (C =
 B
 R
 B
 R
The simplified model as obtained from reactions R_{1} to R_{10}, has 5 species {S_{1}, S_{2}, . . . , S_{5}} and is described completely by a total of 8 different chemical reactions. Time evolution of all species quantities are specified by a state vector X(t) = [X_{1}(t), X_{2}(t), . . . , X_{5}(t)]^{T }and statechange vector v_{μ }(μ = 1, 2, . . . 8), corresponding to all reactions that describe the system. For example, when μ = 1, v_{1} is [1 +1 0 0 0]^{T }for reaction of the simplified system. In this example network, techniques like those in [25] were used to verify that there is a unique steady state equilibrium and this ensures the applicability of the algorithm for this example network. Numerically, we used the polynomial root finding package hom4ps2 to ensure that there was only one equilibrium in the positive orthant [26]. It's worthwhile to mention that a similar approach is adopted in example network 2 to ensure the unique deterministic steady state. For both the networks, we numerically determined the deterministic steady state value Y_{0} using Newton's method as incorporated in SBTOOLBOX2 [27]. A generalized algorithm for simulation according to the steady state approximation as outlined in Methods section is given in algorithm 1.
Algorithm 1 Evaluate steady state (SS) distribution:
Require: Unique deterministic SS solution X_{0}
1. Reaction Networks with N Reaction R_{1}, . . . , R_{N}
2. Choose: ε, γ_{0}, γ
3. Solve: ODE for steady state(SS) = Y_{0} and find discrete state X_{0} closest to Y_{0}, where X_{0} = round(Y_{0}).
4. Initiate, α_{i}, β_{i}; where ,
5. Determine: Truncated statespace as shown in Eq.4 and as described after Eq.5
6. Determine: Column j of corresponding to X_{0}
7. Form and as describe after Eq. 7.
9. Find , where and η > 0 and is chosen so that
10. Verify:
if , for δ_{i }= α_{i}, or δ_{i }= β_{i }then
α_{i ← αi } γ
β_{i }_{← }β_{i }+ γ
Return to 5
end if
In the algorithm, the values of γ_{0}, γ are problem dependent based on the anticipated spread of the steady state (SS) distribution. Larger γ and γ_{0} favor a larger , with correspondingly better accuracy, but this comes at the expense of a larger statespace and more time required to solve the Eq.7. The parameter ε also controls the accuracy of the solution. In the truncated statespace, the tail of the distribution is essentially pushed in to the main part of the distribution and smaller ε means that less of the tail is changed.
Simulation and discussion
The binding kinetics between BMPs (species A, Figure 1) + receptors (species B), and BMPs + Cv2 (species C) are largely known from the biacore analysis data [28,29]. However, the kinetic data associated with the intermediate tripartite complex BMP:Cv2:Type I receptor (species Z, Figure 1) are currently unknown. In order to better understand the dependence of the steady state distribution on the kinetic parameters, we performed a parameter screen for the forward and reverse reaction rates (k_{±s}, s = 3, 4) for the formation and decoupling of species Z. For each of these four parameters, we use 5 evenlyspaced points on a logarithmic scale with the range [10^{1 }to 10^{1}] nM^{1}s^{1} for the forward rates and [10^{3 }to 10^{0}] s^{1} for the reverse reaction rates. This produces a parameter grid that contains a total of 625 different parameter vectors.
For an appropriate comparison of the noise attenuation both in the presence and in the absence of Cv  2, species C (BMP: Type I Receptor = BR) concentration should remain the same regardless of the intermediate dynamics. During simulation, the amount of available receptors (B) was fixed at 100, and a maximum of 30% receptor occupancy was allowed for the screening of the network. To ensure an equal amount of BR formation for each parameter vector, we modified the level of free ligand (A). For computational tractability, the screen is limited to a maximum continuation of 200 Cv2 molecules, which allowed us to capture responses for all 625 different parameter sets.
In order to quantify noise in the system we measured the coefficient of variation that relates the standard deviation (σ) to the mean (μ) level of bound receptors. The parameter screen on the intermediate dynamics yields three primary qualitative subclasses for Cv2 behavior in regulation of extracellular BR (C) fluctuation amplitude: i) reduced amplitude ii) increased amplitude and iii) mixed amplitude behavior [4]. Three primary types of responses of Cv2 action on BMP fluctuations are shown in Figure 2ac. As seen from Figure 2a, Cv2 leads to a reduction of BR noise amplitude, and this is true for all Cv2 ∈ [0,200]. The value of the coefficient of variation (Λ) decreases for both increases in the level of bound receptor and the level of Cv2. The subclass of increased amplitude demonstrates that increasing the level of Cv2 in the system increases the value of the coefficient of variation (Λ_{Cv}_{2 ≠ 0 }> Λ_{Cv}_{2 = 0}) and is valid for the range of Cv2 values considered in the screen (for a detailed discussion on this, interested readers can refer [4]). Lastly, mixed amplitude is classified as type iii, which demonstrates that Cv2 can both increase and decrease the level of stochastic noise in the system (Figure 2c).
Figure 2. Screening result. a, b, c) Coefficient of variation for BR (C) (1 to 30 molecules) formation is shown both in the presence of Cv  2 = 105 molecules (gray) and in the absence of Cv2 (black), where a, b, c represents the noise attenuation, noise amplification and biphasic subclasses of Cv2 behavior respectively. d) Histogram of noise change for all 625 parameter sets demonstrates that the majority of solutions result in noise attenuation.
To clarify Cv2 action further, we calculated percentage change in the amplitude using for each parameter set output for a given amount of BR production (30 complexes) and Cv2 value (105 molecules). Based on the percent change of the coefficient of variation, we classify the screening outcome: negative percent change corresponds to noise attenuation whereas the positive change gives noise amplification. A histogram of all 625 parameters are shown in Figure 2d and in [4]. The implication of the screening result is that Cv2 clearly reduces the variability of receptor activation throughout the range of Cv2 tested. However, as demonstrated in Figure 2d, such a phenomenon as exhibited by SBPs like Cv2 is found to be highly parameter dependent.
During the simulation, the kinetic rate constants for the intermediate complex BMP:receptor:Cv2 (node Z, Figure 1) formation and decoupling were chosen from the parameter grid and representative kinetic rate constants for three different type of Cv2 responses are enumerated in Table 1.
Table 1. Kinetic rates, Figure 2(a,b,c)
Analysis of Type II receptor recruitment process
In the signaling network shown in Figure 3, recruitment of Type II (= R_{1}) receptors can happen in two different ways: 1) BMP binds with Type I (= R_{1}) first and subsequently, recruits Type II receptors to form a tripartite complex BMP:Type I: Type II (BR_{1}R_{2}), and 2) BMP directly interacts with Type I and Type II separately, and an intermediate state forms a tripartite BMP:Type I: Type II complex. In both situations, BMP:Type I:Type II complex (BR_{1}R_{2}) is the sole signaling complex responsible for the activation of downstream pathways.
Figure 3. Network cases for Type II recruitment analysis in context of Drosophila melanogaster development. Case I) Recruitment of Type II is overlooked here and it imitates the simplified model used in previous studies. In this type of network, BMP:Type I complex (BR_{1}) acts as the sole signaling complex. Case II) Upon the formation of a BMP: Type I complex, subsequent recruitment of Type II receptor is considered here. But a direct interaction between BMP and Type II receptor doesn't happen in the network. Here, a tripartite complex BMP:Type I:Type II (BR_{1}R_{2}) activates the downstream pathways. Case III) Similar to Case II, but with an exception that a direct interaction between BMP and Type II receptor is allowed to form a BMP:Type II complex (BR_{2}) by changing BR_{1} to BR_{2}. The kinetic equations are equivalent to the SBP system investigated in [4].
All possible biochemical interactions that represent the ligand binding with Type I receptors and further recruitment of Type II receptors are:
The chemical interaction of Case II can easily be obtained from the interactions (r_{1} to r_{8}) of Case III (Figure 3) by equaling the kinetic rate constants k_{±2} and k_{±4} of Case III to zero. For the kinetics, we relied on the published data [1]. The rate at which a Type II receptor is recruited upon formation of a BMP:Type I complex (BR_{1}) is comparatively faster than the rate of BMPs and Type I receptors interactions [17]. However, exact values of the rates of formulation of (BR_{1}R_{2}) complex are not readily available, and hence, parameters were screened over the physiological ranges with values between [10^{1 }to 10^{1}] nM^{1 }s^{1} for the forward rates and [10^{3 }to 10^{0}] s^{1} for the reverse reaction rates.
Simulation and discussion
To simulate the networks (as shown in Figure 3) for the calculation of the coefficient of variation Λ, we applied the truncated statespace approximation. During the simulation, a target of 1 to 30 signaling complexes (BR_{1} for Case I and BR_{1}R_{2} for Case II, Case III) in the extracellular region is considered so a direct comparison can be made for the coefficient of variation between BR_{1} and BR_{1}R_{2}.
The coefficient of variation (Λ) for BR_{1}R_{2} remains very close to the coefficient of variation of BR_{1} as shown in Figure 4a. Proximity in the coefficient of variation between BR_{1} and BR_{1}R_{2} (as shown in Figure 4a) demonstrates that the stochastic variability of the system is not affected by the recruitment of the Type II receptor. It is also found that increasing the concentration of R_{2} brings the coefficient of variation of BR_{1}R_{2} into very close agreement with the coefficient of variation of BR_{1} as shown in Figure 4b. A similar outcome is obtained from the simulation of Case III of the Figure 3 and the result is shown in Figure 4c. Finally, all the outcomes are summarized in Figure 4d, where it is shown that regardless of the different cases as shown in Figure 3 the coefficient of variation (Λ) of BR_{1}R_{2} is approximately equal to that of BR_{1}.
Figure 4. Comparison of Λ. a) The coefficient of variation of BR_{1} (calculated from Case I Figure 3)and BR_{1}R_{2} complexes (calculated from Case II Figure 3) is compared. The variability of the system seems to be invariant in the presence of Type II. b) Concentration dependency of Λ as a function of R_{2}. c) Same as plot "a", however, direct interaction of BMP and Type II is allowed as in Case III, Figure 3. It's clear that the stochasticity of the system does not change over the range of values tested. d) Summary of BR_{1}R_{2} formation and its impact on signaling noise.
Additionally, it is also found from the simulated result that the rate at which the BMP:Type I recruits Type II receptor (Case II in Figure 3) also decides the effect of Type II recruitment process on the stochastic variability of the system. With a comparatively slower rate, the coefficient of variation tends to oscillate as observed in Figure 5b. When the recruitment rate is slower than the formation rate of BMP:Type I complex, free Type II receptors fail to get frequent access to BMPs via the BMP:Type I:Type II tripartite complex, and can cause the concentration of BMP:Type I:Type II complex to oscillate more than the case with a comparatively faster dynamics. Thus, mitigating noise is not a natural output of receptor oligomerization + transudction and instead, requires another cofactor such as Cv2 [4].
Figure 5. Comparison between SSA and Direct SS method. a) In Gillespie's method larger 'End Time' (ET) is required (which translates into a higher processing cost and time) to ensure the accuracy of outcome. Three different ET: 280 hrs, 2800 hrs, 28000 hrs are shown. b) Effect of kinetics associated with BR_{1} interacting with R_{2}. The steps of interactions are clearly shown in Case II of Figure 3.
Benchmarking of Direct SS approximation method
Carrying out largescale stochastic simulation can be time consuming but calculation of the approximate solution via a truncated statespace can greatly improve the speed. In order to show the performance improvement in terms of computational cost and time of direct SS approximation in the analysis of stochastic biochemical networks, we benchmarked the method by comparing it with Gillespie's stochastic simulation algorithm (SSA) method [9] for numerical calculations of stochastic biochemical networks. In the benchmarking, the processing time taken by each method was calculated based on the steps in the blue box as mentioned in the flow chart diagram (Figure 6). The sample problem was calculated for both methods on the same hardware and software configuration: Processor: Intel(R) Xeon (R) CPU E5405, 2.00 GHz (quadcore), RAM: 16 GB, SBTOOLBOX2 [27] and Matlab R2010a with SiMBiology 3.0.
Figure 6. Benchmarking of Direct SS approximation method. Benchmarking of Direct SS approximation method.
The processing time and computed Λ values for a target BR_{1}R_{2} = 20 for Case II, Figure 3, is provided in Table 2 to show the accuracy and time gain that can be obtained if the proposed direct SS distribution approximation method is used. Gillespie's SSA takes longer to generate an output that contains enough information to calculate the distribution as compared to the time taken by Direct SS approximation method. The problem becomes severe when continuation of a multiple parameters are necessary to explore the system's parameter dependency as done previously in [4].
Table 2. Benchmarking: Gillespie's SSA and Direct SS approximation for a target BR_{1}R_{2} = 20
In Table 2, the term 'End time (ET) in Gillespie's SSA' corresponds to the amount of time the system dynamics were allowed to evolve. The accuracy of the Gillespie's SSA approach depends on the 'End time in Gillespie's SSA'(directly contributes to the processing time) set in the model simulation, and is shown clearly in Figure 5a and Table 2. Very low propensities require long simulation times in Gillespie's SSA due to the infrequency of events. Accuracy of Gillespie's method for the sample example increases as the 'End time in Gillespie's SSA' is increased. This large simulation time in turn directly impacts the processing time, resulting in a large computational cost to achieve the desired accuracy (Table 2).
Conclusions
In this study, we illustrate an approach of determining the steady state probability distribution efficiently to carry on continuation in multiple variables within a largescale parameter screen. The approach is demonstrated further with a couple of applications, where we investigated 1) the dynamic dependency of a class of proteins, known as SBPs, in the regulation of BMP signaling, and 2) the binding of Type II receptor in BMP signaling. The results suggest that the recruitment of a type II receptor in BMP signaling doesn't affect the stochasticity of the system over the range of concentration and parameters investigated. Direct calculation of the SS probability distribution can be successfully applied to systems with a unique deterministic SS solution, and future work will investigate similar approaches for other biochemical systems.
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
MSK designed the research, contributed to algorithm development, performed simulation and data analysis, and wrote the manuscript. GB contributed to algorithm development, discussed the results and wrote the manuscript. DU contributed in research design, discussed the results and wrote the paper. All authors contributed to replying to reviewers' comments and approved the final manuscript of the paper.
Acknowledgements
Based on “Steady state probability approximation applied to stochastic model of biological network”, by Shahriar Karim, David M Umulis and Gregery T Buzzard which appeared in Genomic Signal Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on. © 2011 IEEE [30].
We would like to thank Purdue University, West Lafayette, IN 47907, for financial support.
This article has been published as part of BMC Genomics Volume 13 Supplement 6, 2012: Selected articles from the IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS) 2011. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcgenomics/supplements/13/S6.
References

Umulis D, O'Connor M, Blair S: The extracellular regulation of bone morphogenetic protein signaling.
Development 2009, 136(22):3715. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Fox J: Human iPSC and ESC translation potential debated.
Nature Biotechnology 2011, 29(5):375376. PubMed Abstract  Publisher Full Text

Lander A, Lo W, Nie Q, Wan F: The measure of success: constraints, objectives, and tradeoffs in morphogenmediated patterning.
Cold Spring Harbor Perspectives in Biology 2009, 1:a002022. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Karim M, Buzzard G, Umulis D: Secreted, receptorassociated bone morphogenetic protein regulators reduce stochastic noise intrinsic to many extracellular morphogen distributions.
J R Soc Interface 2012, 9:10731083. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Raj A, van Oudenaarden A: Nature, nurture, or chance: stochastic gene expression and its consequences.
Cell 2008, 135(2):216226. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Samoilov M, Price G, Arkin A: From fluctuations to phenotypes: the physiology of noise.
Science's STKE 2006, 2006:(366). PubMed Abstract  Publisher Full Text

Thattai M, Van Oudenaarden A: Intrinsic noise in gene regulatory networks.
proceedings of the national academy of sciences of the united states of America 2001, 98(15):8614. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Gillespie D: A rigorous derivation of the chemical master equation.
Physica A: Statistical Mechanics and its Applications 1992, 188(13):404425. Publisher Full Text

Gillespie D: A general method for numerically simulating the stochastic time evolution of coupled chemical reactions.
Journal of computational physics 1976, 22(4):403434. Publisher Full Text

van Kampen N: Stochastic processes in physics and chemistry. North Holland; 2007.

Peleš S, Munsky B, Khammash M: Reduction and solution of the chemical master equation using time scale separation and finite state projection.
The Journal of chemical physics 2006, 125:204104. PubMed Abstract  Publisher Full Text

Gillespie D, Petzold L: Improved leapsize selection for accelerated stochastic simulation.
The Journal of Chemical Physics 2003, 119:8229. Publisher Full Text

Gillespie D: Exact stochastic simulation of coupled chemical reactions.
The journal of physical chemistry 1977, 81(25):23402361. Publisher Full Text

Hegland M, Burden C, Santoso L, MacNamara S, Booth H: A solver for the stochastic master equation applied to gene regulatory networks.
Journal of Computational and Applied Mathematics 2007, 205(2):708724. Publisher Full Text

Goutsias J: Classical versus stochastic kinetics modeling of biochemical reaction systems.
Biophysical journal 2007, 92(7):23502365. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Serpe M, Umulis D, Ralston A, Chen J, Olson D, Avanesov A, Othmer H, O'Connor M, Blair S: The BMPbinding protein Crossveinless 2 is a shortrange, concentrationdependent, biphasic modulator of BMP signaling in Drosophila.
Developmental cell 2008, 14(6):940953. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Gillespie D: Approximate accelerated stochastic simulation of chemically reacting systems.
The Journal of Chemical Physics 2001, 115:1716. Publisher Full Text

Hegland M, Hellander A, L "otstedt P: Sparse grids and hybrid methods for the chemical master equation.
BIT Numerical Mathematics 2008, 48(2):265283. Publisher Full Text

Munsky B, Khammash M: The finite state projection algorithm for the solution of the chemical master equation.
The Journal of chemical physics 2006, 124:044104. PubMed Abstract  Publisher Full Text

Wolpert L: Princiles of Development. UK: Oxford University Press; 2006.

Schmierer B, Tournier A, Bates P, Hill C: Mathematical modeling identifies Smad nucleocytoplasmic shuttling as a dynamic signalinterpreting system.
Proc Natl Acad Sci U S A 2008, 105(18):66086613. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

BenZvi D, Shilo B, Fainsod A, Barkai N: Scaling of the BMP activation gradient in Xenopus embryos.
Nature 2008, 453:12051211. PubMed Abstract  Publisher Full Text

Mizutani CM, Nie Q, Wan FY, Zhang YT, Vilmos P, SousaNeves R, Bier E, Marsh JL, Lander AD: Formation of the BMP activity gradient in the Drosophila embryo.
Dev Cell 2005, 8(6):91524. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Craciun G, Helton J, Williams R: Homotopy methods for counting reaction network equilibria.
Mathematical biosciences 2008, 216(2):140149. PubMed Abstract  Publisher Full Text

Lee T, Li T, Tsai C: HOM4PS2.0: a software package for solving polynomial systems by the polyhedral homotopy continuation method.
Computing 2008, 83(2):109133. Publisher Full Text

Schmidt H, Jirstrand M: Systems Biology Toolbox for MATLAB: a computational platform for research in systems biology.
Bioinformatics 2006, 22(4):514515. PubMed Abstract  Publisher Full Text

Kirsch T, Nickel J, Sebald W: BMP2 antagonists emerge from alterations in the lowaffinity binding epitope for receptor BMPRII.
The EMBO Journal 2000, 19(13):33143324. PubMed Abstract  Publisher Full Text  PubMed Central Full Text

Rentzsch F, Zhang J, Kramer C, Sebald W, Hammerschmidt M: Crossveinless 2 is an essential positive feedback regulator of Bmp signaling during zebrafish gastrulation.
Development 2006, 133(5):801. PubMed Abstract  Publisher Full Text

Karim S, Umulis DM, Buzzard GT: Steady state probability approximation applied to stochastic model of biological network.
Genomic Signal Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on: 46 December 2011 2011, 5659. Publisher Full Text