Computing preimages of Boolean networks

Klotz, Johannes Georg; Bossert, Martin; Schober, Steffen

doi:10.1186/1471-2105-14-S10-S4

Volume 14 Supplement 10

Selected articles from the 10th International Workshop on Computational Systems Biology (WCSB) 2013: Bioinformatics

Research
Open access
Published: 12 August 2013

Computing preimages of Boolean networks

Johannes Georg Klotz¹,
Martin Bossert¹ &
Steffen Schober¹

BMC Bioinformatics volume 14, Article number: S4 (2013) Cite this article

2189 Accesses
3 Citations
Metrics details

Abstract

In this paper we present an algorithm based on the sum-product algorithm that finds elements in the preimage of a feed-forward Boolean networks given an output of the network. Our probabilistic method runs in linear time with respect to the number of nodes in the network. We evaluate our algorithm for randomly constructed Boolean networks and a regulatory network of Escherichia coli and found that it gives a valid solution in most cases.

Introduction

In systems and computational biology Boolean networks (BN) are widely used to model regulative dependencies of organisms [1, 2]. We consider networks, which map a set of environmental conditions to the presence of proteins and finally to actual chemical reactions, which are often modeled as fluxes of a flux-balance analysis [3]. Hence, these networks are used to make in silico predictions of behavior of organisms in a certain environment [4].

In this paper we address the inverse problem, i.e., we want to predict environmental conditions that allow certain reactions to take place, and others not. Hence, in general, we need to find a set of possible inputs that lead to a given output. This so called predecessor problem or preimage problem has been addressed by Wuensche in [5] and has been shown to NP-hard in general [6], which makes it infeasible to solve it for large networks. In [7] an algorithm with reduced complexity for BNs with canalizing Boolean functions has been introduced. However, the problem is infeasible under certain conditions. Both algorithms are designed to find the whole set of preimages, i.e., all inputs to the BN with lead to a certain, desired, output.

In some applications, knowledge of the whole preimage set is not important, merely it can be sufficient to know a subset of the preimage set. Here, we propose a probabilistic algorithm, which solves this problem in linear time with respect to the number of nodes in the network, based on a variation of the well known Sum-Product Algorithm (SPA) [8], which is used for a variety of tasks, including decoding error correction codes in communication engineering [9].

Methods

Boolean networks and main idea

We consider networks like shown in Figure 1, mapping the values of the N in-nodes $I = {1, 2, 3}$ to the M out-nodes $O = {12, 13, 14, 15, 16}$ , i.e., we can represent this BN as a function mapping the N input values uniquely to the M output values:

f : {0, 1}^{N} \to {0, 1}^{M} .

The network itself consists of n nodes, and a set of directed edges connecting these nodes. Each node i has a certain state, which can be either zero or one, represented by a variable x_i. Its value is determined by evaluating a Boolean function (BF) f_i. Further, lets define the set ñ(f_j) as the incoming nodes of node j. For example in Figure 1, ñ(f₅) = {1, 3}. The BF f_j is a function mapping k_j = |ñ(f_j)| values of {0, 1}^k to {0, 1}, where k is also called the in-degree of node j. The number of edges emerging from a node is called out-degree.

Given a vector of input values x ∈ {0, 1}^N , x = (x₁, x₂, . . . , x_N ) the corresponding output of f is y = f (x), y ∈ {0, 1}^M. In general there does not exist a unique inverse function f⁻¹. Instead the cardinality of the set Ω_y := {x : f(x) = y} will be larger one. We call Ω_y the set of preimages of y. In this paper we are interested to find at least parts of Ω_y. Suppose there is a probability distribution P_y on {0, 1}^N such that

P_{y} {x} = \{\begin{matrix} \frac{1}{| Ω_{y} |} & if x \in Ω_{y} \\ 0 & else \end{matrix} .

If we knew the probability distribution P_y, we would have solved the problem. But as explained, this is too difficult in general. Our main idea now is to approximate P_y by the product of the marginal distributions P_i on the individual x_i, i.e.,

P_{y} \approx \prod_{i = 1}^{N} P_{i},

as the well-known SPA can be used to compute the marginals efficiently. If the approximation is good enough sampling out the product of the marginals will yield an element in Ω_y with reasonable probability.

Proposed algorithm

In this section we will first discuss the basic principles of factor graphs and the SPA. Then we will describe the BN as factor graph and will formulate the actual algorithm to find the marginals. Finally, the sampling is described.

Factor graphs and sum-product algorithm

Assume some function g(x₁, . . . , x_n) defined on some domain $A^{n}$ , which can be factorized in m local functions h_j, j ∈ [m] := {1, 2, . . . , m}, i.e.,

g (x_{1}, \dots, x_{n}) = \prod_{j} h_{j} (X_{j}),

where X_j is the subset of [n] containing the argument of h_j . We can then define a factor graph [8] as a bipartite graph consisting of n nodes representing variables {x₁, . . . , x_n} (variable nodes) and of m nodes representing functions {h₁, . . . h_m} (function node). Edges exist between a function node and a variable node if and only if x_i is an input to function h_j .

The marginal function g_i(x_i) is defined as [8]

g_{i} (x_{i}) = \sum_{~ \{x_{j}\}} g (x_{1}, \dots, x_{n}),

where $\sum_{~ \{x_{j}\}} g (x_{1}, \dots, x_{n})$ is defined as

\begin{array}{l} \sum_{~ {x_{i}}} g (x_{1}, \dots, x_{n}) \\ = \sum_{x_{1} \in A} \dots \sum_{x_{i - 1} \in A} \sum_{x_{i + 1} \in A} \dots \sum_{x_{n} \in A} g (x_{1}, \dots, x_{n}), \end{array}

In general the computation of the g_i is difficult, but due to the factorization of g the task can be efficiently solved using the the so called Sum-Product Algorithm (SPA) [8]. The algorithm iteratively passes messages between the nodes of the graph. At each iteration the messages µ are sent from the function nodes to the variable nodes, containing the corresponding marginal function of the local function. These messages are computed as follows [8]:

Function to variable node

μ_{h \to x} (x) = \sum_{~ {x}} (h (n (h)) \prod_{y \in n (h) \ {x}} λ_{y \to h} (y)),

where n(i) give the set of neighboring nodes of node i.

At the variable nodes, these messages are then combined to a marginal function λ and sent back to the function nodes [8]:

Variable to function node

λ_{x \to h} (x) = \prod_{q \in n (x) \ {h}} μ_{q \to x} (x) .

The Boolean network as factor graph

We apply the concept of factor graphs to BNs. Each node in the network represents one variable x_i ∈ {0, 1}, i ∈ [n] of the factor graph, hence we have n variable nodes. Each BF f_j of the BN $(j \in [n] \ I)$ is a function node and is connected to the node j and the incoming nodes ñ(f_j). Lets to define ${\tilde{X}}_{j}$ as the variables of the incoming nodes of node j, i.e. the argument of the BN f_j. Further, we define ${\tilde{X}}_{j}^{(i)}$ as ${\tilde{X}}_{j}$ without the node i.

Finally, if we consider the variables as each node as random variables, we have a common distribution of all variables nodes described by the density function,

g_{x_{1}, \dots, x_{n}} (x_{1}, \dots, x_{n}) \equiv g (x_{1}, \dots, x_{n}),

For sake of readability we will omit the subscripts of the density function, if they are obvious from context. We are interested in finding the marginal distributions of the in-nodes, which can be described by the density functions

g_{x_{i}} (x_{i}) = \sum_{~ x_{i}} g_{x_{1}, . . ., x_{n}} (x_{1}, \dots, x_{n}) \forall i \in I .

This problem is an instance of the problem described in Section Factor Graphs and Sum-Product Algorithm, hence we apply the same methods here.

Update rule: function to variable node

If we focus on one function node $j \in [n] \ I$ there exists a common distribution of all variables relevant for this node. Namely, these relevant variables are the ones located in ${\tilde{X}}_{j}$ of the BF f_j, and the value of node j. We can write the density of this distribution as:

p (x_{j}, {\tilde{X}}_{j}) .

Lets define ñ(f_j) as the set of indices of the input nodes of the BF f_j.

We need to send the local marginal distribution of each variable i ∈ {j} ∪ ñ(f_j) back to the variable node, or more formally:

μ_{j \to i} (x_{i}) = \sum_{~ {x_{i}}} p (x_{j}, {\tilde{X}}_{j}) = \sum_{~ {x_{i}}} p (x_{j}, x_{i}, {\tilde{X}}_{j}^{(i)})

(1)

If i = j, i.e. if the message is designated for the node containing the output of the BF, the density of the marginal distribution becomes:

\begin{gathered} μ_{j \to j} (x_{j}) = \sum_{~ {x_{j}}} p (x_{j} | {\tilde{X}}_{j}) \cdot ({\tilde{X}}_{j}) \\ = \sum_{~ {x_{j}}} f_{j} ({\tilde{X}}_{j}) \cdot ({\tilde{X}}_{j}) \end{gathered}

which is the probability distribution of the functions output. We can assume that the elements of ${\tilde{X}}_{j}$ are pairwise independent, hence we can write:

p ({\tilde{X}}_{j}) = \prod_{l \in ñ (f_{j})} λ_{l} (x_{l}),

where λ_l is the probability distribution of variable node l and is defined in Eq. 3.

In the other cases, i.e., i ≠ j, Eq. (1) becomes:

μ_{j} \to i (x_{i}) = \sum_{~ {x_{i}}} p (x_{i} | x_{j}, {\tilde{X}}_{j}^{(i)}) \cdot (x_{j}, {\tilde{X}}_{j}^{(i)}) .

We still can assume that the elements of ${\tilde{X}}_{j}^{(i)}$ are pairwise independent, hence we can write:

\begin{array}{l} p (x_{j}, n (f_{j}) \ x_{i}) & = p (x_{j} | {\tilde{X}}_{j}^{(i)}) \cdot ({\tilde{X}}_{j}^{(i)}) \\ = p (x_{j} | {\tilde{X}}_{j}^{(i)}) \prod_{l \in ñ (f_{j}) \ {i}} λ_{l} (x_{l}) . \end{array}

If the Boolean functions output $x_{j} = f_{j} ({\tilde{X}}_{j})$ is already completely determined by ${\tilde{X}}_{j}^{(i)}$ , i.e., if the variable x_i has no influence on the output for this particular choice of the other variables, we assume x_i to be uniformly distributed:

p (x_{i} | x_{j}, {\tilde{X}}_{j}^{(i)}) = \frac{1}{2} p_{x_{j}} (f ({\tilde{X}}_{j}^{(i)}, x_{i}) = x_{j})

and since x_j is completely determined by ${\tilde{X}}_{j}^{(i)}$

p (x_{j}, {\tilde{X}}_{j}^{(i)}) = \prod_{l \in \tilde{n} (f_{j}) / {i}} λ_{l} (x_{l}) .

Otherwise, x_i is totally determined by x_j and the other variables, i.e., x_i is 0 or 1 depending on BF. Hence, we can write

p (x_{i} | x_{j}, n (f_{j}) \ x_{i}) = p_{x_{j}} (f ({\tilde{X}}_{j}^{(i)}, x_{i}) = x_{j}),

where $p_{x_{j}} (f ({\tilde{X}}_{j}^{(i)}, x_{i}) = x_{j})$ is either 0 or 1. Further we can assume x_j independent of ${\tilde{X}}_{j}^{(i)}$ , hence

p (x_{j}, {\tilde{X}}_{j}^{(i)}) = λ_{j} (x_{j}) \prod_{l \in ñ (f_{j}) \ {i}} λ_{l} (x_{l}) .

Finally, we can summarize for i ≠ j:

μ_{j \to i} (x_{i}) = \sum_{~ {x_{i}}} ξ_{i, j} p_{x_{j}} (f ({\tilde{X}}_{j}^{(i)}, x_{i}) = x_{j}) \prod_{l \in ñ (f_{j}) \ {i}} λ_{l} (x_{l}),

(2)

with

ξ_{i, j} = \{\begin{matrix} \frac{1}{2} & , if f_{j} ({\tilde{X}}_{j}^{(i)}, x_{i} = 0) = f_{j} ({\tilde{X}}_{j}^{(i)}, x_{i} = 1) \\ λ_{j} (x_{j}) & , else \end{matrix} .

Update rule: variable to function node

The update rule is the same for all variable nodes j ∈ [n] and is independent of the function node to which they are directed.

λ_{j} (x_{j}) = \prod_{l \in S_{j}} μ_{l \to j} (x_{j}),

where $S_{j}$ is the set of all function nodes, which have node j as input.

Finding the input distributions

In our algorithm, we use the well known log-likelihood ratio (LLR) to represent the probability distribution of binary variables [10]. It is defined as:

L_{X} = ln \frac{p (x = 0)}{p (x = 1)} .

(4)

A scheme of the algorithm is given in Algorithm 1.

The probability distribution of each node j ∈ [n] at iteration t is given as $L_{j}^{(i)}$ and are initialized with $L_{j}^{(0)} = 0$ , which is equivalent to the uniform distribution. Then we set the LLRs for the out-nodes to either −∞ or +∞ depending on the desired output y of the BN. At each iteration the algorithm can be split in two steps. The first step iterates over all function nodes $(j \in [n] \ I)$ and all input variables i ∈ ñ(f_j ) calculating the LLR $L_{j \to i}^{(t)}$ using Eq. (2) and Eq. (4).

In the second step we update all variables-nodes, where the LLRs L_j represents the distributions λ_j and, hence the product of Eq. 3 becomes a summation. Please note, that the LLR of the previous iteration is also added to the sum, in order to prevent rapid changes of the distributions.

After performing a certain number of iterations t_max, the desired marginal distributions of the input variables are found.

Algorithm 1

Initialize $L_{j}^{(0)} = 0$ for all nodes

Set the desired LLRs of the out-nodes, i.e., $L_{j}^{(0)}$ is either −∞ or +∞, for all out-nodes $j \in O$ .

t = 0

repeat

t=t+1

for each non-in-node $(j \in [n] \ I)$ do

for each input variable i ∈ ñ(f_j) do

calculate L_j→i using Eq. (2) and Eq. (4)

end for

for each non-out-node v do

$L_{j}^{(t)} = L_{j}^{(t - 1)} + \sum_{l \in S_{j}} L_{l \to j}^{(t)}$

end for

until maximum number of iterations reached

Sampling

The sampling part of our approach is straightforward. Using the marginal distributions $L_{j}^{(t_{m a x})}$ , $j \in I$ we randomly draw vectors x and check if they fulfill y = f(x). If so, they are added to the set ${\tilde{Ω}}_{y}$ . This procedure is repeated for a certain number of samples.

Simulation results and discussion

We tested our algorithm with randomly generated networks and the regulatory network of Escherichia coli (E-coli) [2]. The random networks consist of 2400 nodes with N = 200 and M = 1200. We have chosen the BFs from:

· all functions with k ≤ 15 (Type A)

unate, i.e. locally monotone, functions with k ≤ 15 (Type B)

After generating a network we draw a certain number T of uniformly distributed input vectors x and obtain y = f(x). For each y we applied then Algorithm 1 to obtain the marginal distributions $L_{j}^{(t_{m a x})}$ , $j \in I$ . To investigate the convergence behavior with respect to t_max we first apply hard-decision to evaluate a good choice for t_max, i.e., we generate an estimate $\tilde{x}$ by setting

{\tilde{x}}_{j} = \{\begin{matrix} 0 & if L_{j}^{(t_{m a x})} > 0 \\ 1 & if L_{j}^{(t_{m a x})} < 0 \end{matrix}

Then we evaluate the network $ỹ = f (\tilde{x})$ , and measure the similarity between y and $ỹ$ by counting the equal entries and divide them by the length of y. We did so for 100 networks of Type A and B, and set T = 100. The averaged results can be seen in Figure 2.

One can see, that for t_max ≥ 14 there is almost no improvement in the similarity. This number is equal to two times the number of nodes between input and output, i.e., it seems to be sufficient that the messages travel once through the network and back. Thus, the following simulations have been perform setting t_max = 14.

Next, we apply sampling as described in Section Sampling. We did so for 100 different networks of Type A and B, and the E-coli network. For each random network we did T = 100 runs, for E-coli T = 1000. The results can be viewed in Table 1. We depict the percentage of solved networks, i.e. the portion of networks we found at least one valid x ∈ Ω_y . Further, we give the average number of valid x and the average number of unique x.

Table 1 Simulation results for different networks

Full size table

One can see from the results, that in general for most networks and y s at least one preimage can be found. It is worth mentioning, that for the E-coli network every sampled solution was unique. This is due to the fact, that there exist a few inputs, who completely determine the output. The other input variables have then no influence and hence a marginal distribution of 0.5. Further, the results for the network of type B are much better than for type A. It seems that the marginal distributions for unate functions give better estimation of the actual distribution than the marginal distributions for non-unate functions.

Conclusions

In this work, we proposed a probabilistic algorithm to address the preimage problem of Boolean networks. This is of interest when designing experiments, in which certain regulators are supposed to be in a specific state. Performing a series of simulations with Random networks we showed, that the algorithm works not only for unate functions, of which most biologically motivated networks consist, but for any kind of Boolean functions. By replacing the fixed output values of the network by probabilities one can simply apply the algorithm to networks, whose designated output is described by probability distributions. Further, the algorithm may be easily adjusted to work on stochastic, e.g. Bayesian, networks, where the nodes contain only transition probabilities instead of Boolean function. Therefore, it is needed to adapt the update rules accordingly. It remains an open question, which influence topographic properties, such as number of layers and number of nodes in these layers, have to the performance of the proposed algorithms, since we only investigated networks which are similar to the regulatory network of E-coli.

Abbreviations

E-coli :: Escherichia coli
BF:: Boolean Function
BN:: Boolean Network
Eq:: Equation
LLR:: Log-Likelihood Ratio
SPA:: Sum-Product Algorithm

References

Kauffman SA, Peterson C, Samuelsson B, Troein C: Random Boolean network models and the yeast transcriptional network. Proceedings of the National Academy of Sciences of the United States of America. 2003, 100 (25): 14796-14799. 10.1073/pnas.2036429100.
Article PubMed Central CAS PubMed Google Scholar
Covert MW, Knight EM, Reed JL, Herrgard MJ, Palsson BO: Integrating high-throughput and computational data elucidates bacterial networks. Nature. 2004, 429 (6987): 92-96. 10.1038/nature02456.
Article CAS PubMed Google Scholar
Feist AM, Henry CS, Reed JL, Krummenacker M, Joyce AR, Karp PD, Broadbelt LJ, Hatzimanikatis V, Palsson BO: A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information. Molecular Systems Biology. 2007, 3: 121-
Article PubMed Central PubMed Google Scholar
Feuer R, Gottlieb K, Viertel G, Klotz J, Schober S, Bossert M, Sawodny O, Sprenger G, Ederer M: Model-based analysis of an adaptive evolution experiment with Escherichia coli in a pyruvate limited continuous culture with glycerol. EURASIP Journal on Bioinformatics and Systems Biology. 2012, 2012: 14-10.1186/1687-4153-2012-14.
Article PubMed Central PubMed Google Scholar
Wuensche A: The Ghost in the Machine: Basins of Attraction of Random Boolean Networks. Artificial Life III Proceedings, Santa Fe Institute Studies in the Sciences of Complexity. 1994, Addison-Wesley
Google Scholar
Akutsu T, Hayashida M, Zhang SQ, Ching WK, Ng MK: Analyses and Algorithms for Predecessor and Control Problems for Boolean Networks of Bounded Indegree. Information and Media Technologies. 2009, 4 (2): 338-349.
Google Scholar
Klotz JG, Schober S, Bossert M: On the Predecessor Problem in Boolean Network Models of Regulatory Networks. International Journal of Computers and Their Applications. 2012, 19 (2): 93-100.
Google Scholar
Kschischang F, Frey BJ, Loeliger HA: Factor Graphs and the Sum-Product Algorithm. IEEE Transactions on Information Theory. 2001, 47: 498-519. 10.1109/18.910572.
Article Google Scholar
Gallager RG: Low-Density Parity-Check Codes. 1963, Cambridge: M.I.T. Press
Google Scholar
Hagenauer J, Offer E, Papke L: Iterative decoding of binary block and convolutional codes. Information Theory, IEEE Transactions on. 1996, 42 (2): 429-445. 10.1109/18.485714.
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank Shrief Rizkalla for implementing parts of the simulation.

Declarations

This work was supported by the German research council " Deutsche Forschungsgemeinschaft" (DFG) under Grant Bo 867/25-2 and the Ulm University.

This article has been published as part of BMC Bioinformatics Volume 14 Supplement 10, 2013: Selected articles from the 10th International Workshop on Computational Systems Biology (WCSB) 2013: Bioinformatics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/14/S10

Author information

Authors and Affiliations

Institute of Communications Engineering, Ulm University, Albert-Einstein-Allee 43, 89081, Ulm, Germany
Johannes Georg Klotz, Martin Bossert & Steffen Schober

Authors

Johannes Georg Klotz
View author publications
You can also search for this author in PubMed Google Scholar
Martin Bossert
View author publications
You can also search for this author in PubMed Google Scholar
Steffen Schober
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Johannes Georg Klotz.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

Idea and Concept: JK, SS. Design of the overall project: MB. Scientific mentor of JK and SS: MB. Implementation and Evaluation: JK. Wrote Paper: JK and SS. All authors discussed the results and implications and commented on the manuscript at all stages.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Klotz, J.G., Bossert, M. & Schober, S. Computing preimages of Boolean networks. BMC Bioinformatics 14 (Suppl 10), S4 (2013). https://doi.org/10.1186/1471-2105-14-S10-S4

Download citation

Published: 12 August 2013
DOI: https://doi.org/10.1186/1471-2105-14-S10-S4

Selected articles from the 10th International Workshop on Computational Systems Biology (WCSB) 2013: Bioinformatics

Computing preimages of Boolean networks

Abstract

Introduction