<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1471-2105-8-S6-S5</ui>
	<ji>1471-2105</ji>
	<fm>
		<dochead>Review</dochead>
		<bibl>
			<title>
				<p>Inferring cellular networks &#8211; a review</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Markowetz</snm>
					<fnm>Florian</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
					<email>florian@genomics.princeton.edu</email>
				</au>
				<au id="A2">
					<snm>Spang</snm>
					<fnm>Rainer</fnm>
					<insr iid="I1"/>
					<insr iid="I3"/>
					<email>rainer.spang@klinik.uni-regensburg.de</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany</p>
				</ins>
				<ins id="I2">
					<p>Princeton University, Lewis-Sigler Institute for Integrative Genomics and Dept. of Computer Science, Princeton, NJ 08544, USA</p>
				</ins>
				<ins id="I3">
					<p>Present affiliation: University Regensburg, Institute of Functional Genomics, Josef-Engert-Str. 9, 93053 Regensburg, Germany</p>
				</ins>
			</insg>
			<source>BMC Bioinformatics</source>
			<supplement>
				<title>
					<p>Otto Warburg International Summer School and Workshop on Networks and Regulation</p>
				</title>
				<editor>Peter F Arndt and Martin Vingron</editor>
				<note>Reviews</note>
			</supplement>
			<issn>1471-2105</issn>
			<pubdate>2007</pubdate>
			<volume>8</volume>
			<issue>Suppl 6</issue>
			<fpage>S5</fpage>
			<url>http://www.biomedcentral.com/1471-2105/8/S6/S5</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">17903286</pubid><pubid idtype="doi">10.1186/1471-2105-8-S6-S5</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<pub>
				<date>
					<day>27</day>
					<month>9</month>
					<year>2007</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2007</year>
			<collab>Markowetz and Spang; licensee BioMed Central Ltd.</collab>
			<note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>In this review we give an overview of computational and statistical methods to reconstruct cellular networks. Although this area of research is vast and fast developing, we show that most currently used methods can be organized by a few key concepts. The first part of the review deals with <it>conditional independence </it>models including Gaussian graphical models and Bayesian networks. The second part discusses probabilistic and graph-based methods for data from experimental <it>interventions and perturbations</it>.</p>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Introduction</p>
			</st>
			<p>The success of genome sequencing projects has led to the identification of almost all the genes responsible for the biological complexity of several organisms. The next important task is to assign a function to each of these genes. Genes and gene products do not work in isolation; rather, they are connected in highly structured networks of information flow through the cell. The inference of such cellular networks using computational and statistical methods is the topic of this review.</p>
			<p>The first two sections describe models based on correlation and statistical dependence using the concept of <it>conditional independence</it>. The objective of this kind of modeling is to explain observed correlations between genes by the presence of other genes. The set of models can be ordered according to how deeply correlations are resolved: correlation graphs are the most simple example, and Bayesian networks the most sophisticated.</p>
			<p>The last section discusses computational approaches building on observed effects of external interventions into the cell. In modern biology, perturbation experiments are key to inferring gene function and regulatory pathways. A common biological technique is to perturb a gene of interest and to study which other genes' activities are affected. Either, the effects of interventions can be included into conditional independence models, or the observed cause-effect relations can be used directly. We also discuss the situation when instead of direct observations of intervention effects on pathway components, only secondary effects on downstream genes are available.</p>
			<sec>
				<st>
					<p>Notation</p>
				</st>
				<p>Let <it>V </it>be a set of <it>p </it>network components. The biological meaning of a "network component" depends on what kind of data we analyze. We mostly speak of network components as genes, since the primary data for inference is microarray data and the network is a transcriptional gene regulatory network. However, the methods are general and can also be applied to protein data <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. In probabilistic models we treat each component <it>v </it>&#8712; <it>V </it>as a random variable <it>X</it><sub><it>v </it></sub>and the set of all components in the model as a random vector <b><it>X </it></b>= (<it>X</it><sub>1</sub>,..., <it>X</it><sub><it>p</it></sub>). The dataset <b><it>D </it></b>consists of <it>N </it>measurements, that is, realizations <b><it>x</it></b><sup>1</sup>,..., <b><it>x</it></b><sup><it>N </it></sup>of the random vector <b><it>X</it></b>. Network components are identified with nodes in a graph. The goal is to find an edge set <inline-formula><m:math name="1471-2105-8-S6-S5-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8496;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaaimaacqWFWesraaa@3785@</m:annotation></m:semantics></m:math></inline-formula> representing the dependency structure of the network components. We call the graph <it>T </it>= (<it>V</it>, <inline-formula><m:math name="1471-2105-8-S6-S5-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8496;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaaimaacqWFWesraaa@3785@</m:annotation></m:semantics></m:math></inline-formula>) the topology of the cellular network. Depending on the model, <it>T </it>can be directed or undirected, cyclic or acyclic. In the important special case, where <it>T </it>is a directed acyclic graph (DAG), we call it <it>G</it>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conditional independence models</p>
			</st>
			<p>The first section describes statistical models to resolve the correlation structure of genes. The methods can be distinguished by the way they remove influences of other genes from the observed correlations. We first introduce coexpression networks, then discuss models building on statements of full conditional independence or low-order conditional independence.</p>
			<sec>
				<st>
					<p>Coexpression networks</p>
				</st>
				<p>Biological processes result from the concerted action of interacting molecules. This general observation suggests a simple idea, which has already motivated the first approaches to clustering expression profiles <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> and is still widely used in functional genomics. It is called the <it>guilt-by-association heuristic</it>: if two genes show similar expression profiles, they are supposed to follow the same regulatory regime. To put it more pointedly: coexpression hints at coregulation. Coexpression networks (also known as relevance networks) are constructed by computing a similarity score for each pair of genes. If similarity is above a certain threshold, the gene pair gets connected in the graph, if not, it remains unconnected. Wolfe <it>et al</it>. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> argue that networks of coexpressed genes provide a widely applicable framework for assigning gene function. They show that coexpression agrees well with functional similarity as it is encoded in the Gene Ontology <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Examining coexpression is an integral part of most methods combining diverse data sources to uncover biological relationships between genes or proteins <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
				<sec>
					<st>
						<p>Building coexpression networks</p>
					</st>
					<p>The first critical point in building a coexpression network is how to formalize the notion of similarity of expression profiles. Several measures have been proposed, the most simple of which is correlation. In a Gaussian model, zero correlation corresponds to statistical independence. Correlation networks are easy to interpret and can be accurately estimated even if <it>p </it>&#8811; <it>N</it>, that is, the number of genes is much larger than the number of samples. Stuart <it>et al</it>. <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> have used this approach to build a graph from coexpression across multiple organisms (humans, flies, worms and yeast), finding that many coexpression relationships are conserved over evolution. Correlation networks can be enhanced in several ways. Bickel <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> generalizes them to time series data by introducing a time-lag for correlation. Kostka and Spang <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> introduce the concept of <it>differential coexpression</it>, which can be interpreted as the gain or loss of a regulatory mechanism. They formulate a method to find sets of genes, which are highly correlated under one condition (e.g. in healthy cells), but show random behavior under a second condition (e.g. in tumor cells).</p>
					<p>Correlation is a linear measure of independence, non-linear dependencies between genes are not necessarily found. This problem can be addressed by using networks built from more flexible similarity measures like pair-wise mutual information <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, or non-linear kernel-functions <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Yamanishi <it>et al</it>. <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> use kernel functions for supervised network reconstruction by tuning kernel parameters in known parts of a protein-interaction graph and then using them to infer unknown parts. Kato <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> weight different data sources according to noise and information content and combine them into a single kernel function.</p>
					<p>The second critical step in building coexpression networks is assessing the significance of results. Many pairs of genes show similar behavior in expression profiles by chance even though they are not biologically related. A practical, though time-consuming strategy consists of permuting the data matrix and comparing the network obtained from real data with the distribution of similarity scores achieved in the permutations, as Bickel <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> does to estimate the false discovery rate of spurious connections. In the supervised setting of Yamanishi <it>et al</it>. <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> cross-validation can be applied to choose optimal parameters.</p>
				</sec>
				<sec>
					<st>
						<p>Problems of coexpression based approaches</p>
					</st>
					<p>Even high similarity of expression tells us little about the underlying biological mechanisms. Coexpression networks include regulatory relationships, but we cannot distinguish direct from indirect dependencies based on the similarity of expression patterns. Fig. <figr fid="F1">1</figr> exemplifies this problem on a small set of three highly coexpressed genes, which form a clique (a completely connected subgraph) in a coexpression network. The figure shows that several regulatory mechanism can explain this observation, and from coexpression data alone we have no way of choosing between them. There are two possible solutions. Functional genomics has a long tradition of perturbing the natural state of a cell and inferring a gene's function from the observed effects. These interventions allow us to distinguish between the three models in Fig. <figr fid="F1">1</figr>, because each model results in different predictions of effects, which can be compared to those obtained in experiments. For example, perturbing gene <it>Y </it>in the cascade <it>X </it>&#8594; <it>Y </it>&#8594; <it>Z </it>will only have an effect on gene <it>Z </it>but none on gene <it>X</it>. In the case where <it>Y </it>regulates both <it>X </it>and <it>Z</it>, perturbing it will result in changes at both regulatees. In the last case, where all three genes are regulated by a hidden regulator, perturbing one of them will not lead to changes at the other two. Methods formalizing these considerations are covered in the second part of this review. In the absence of perturbation data statistical methods may be used to find which of the possibilities is most likely. The theoretical background is the concept of <it>conditional independence</it>.</p>
					<fig id="F1">
						<title>
							<p>Figure 1</p>
						</title>
						<caption>
							<p>Different mechanisms can explain coexpression</p>
						</caption>
						<text>
							<p><b>Different mechanisms can explain coexpression</b>. The left plot in the dashed box shows three coexpressed genes forming a clique in the coexpression graph. The other three plots show possible regulatory relationships that can explain coexpression: The genes could be regulated in a cascade (left), or one regulates both others (middle), or there is a common "hidden" regulator (right), which is not part of the model.</p>
						</text>
						<graphic file="1471-2105-8-S6-S5-1"/>
					</fig>
				</sec>
				<sec>
					<st>
						<p>Conditional independence</p>
					</st>
					<p>Let <it>X</it>, <it>Y</it>, <it>Z </it>be random variables with joint distribution <it>P</it>. We say that <it>X is conditionally independent of Y given Z </it>(and write <it>X </it>&#8869; <it>Y </it>| <it>Z</it>) if and only if</p>
					<p>
						<display-formula><it>P</it>(<it>X </it>= <it>x</it>, <it>Y </it>= <it>y </it>| <it>Z </it>= <it>z</it>) = <it>P</it>(<it>X </it>= <it>x </it>| <it>Z </it>= <it>z</it>)&#183;<it>P</it>(<it>Y </it>= <it>y </it>| <it>Z </it>= <it>z</it>).</display-formula>
					</p>
					<p>This is the same as saying</p>
					<p>
						<display-formula><it>P</it>(<it>X </it>= <it>x </it>| <it>Y </it>= <it>y</it>, <it>Z </it>= <it>z</it>) = <it>P</it>(<it>X </it>= <it>x </it>| <it>Z </it>= <it>z</it>)</display-formula>
					</p>
					<p>and is a direct generalization of the independence condition for <it>X </it>and <it>Y</it>,</p>
					<p>
						<display-formula><it>P</it>(<it>X </it>= <it>x</it>, <it>Y </it>= <it>y</it>) = <it>P</it>(<it>X </it>= <it>x</it>)&#183;<it>P</it>(<it>Y </it>= <it>y</it>).</display-formula>
					</p>
					<p>The same definitions hold if conditioning is not on a single variable <it>Z </it>but on a set of variables <b><it>Z</it></b>. For an interpretation, we can think of random variables as abstract pieces of knowledge obtained from, say, reading books <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Then <it>X </it>&#8869; <it>Y </it>| <it>Z </it>means: "Knowing <it>Z</it>, reading <it>Y </it>is irrelevant for reading <it>X</it>"; or in other words: "If I already know <it>Z</it>, then <it>Y </it>offers me no new information to understand <it>X</it>." Variable <it>Z </it>can explain the correlation between <it>X </it>and <it>Y</it>.</p>
					<p>The statistical models we discuss in the following all build on the concept of conditional independence. Instead of retrieving sets of coexpressed genes &#8211; as coexpression networks do &#8211; they try to recover the regulatory relationships between genes. To decide on an edge between <it>X </it>and <it>Y </it>in the graph, statistical models ask questions of the form "Is <it>X </it>independent of <it>Y </it>given <b><it>Z</it></b>?", but differ with respect to what <b><it>Z </it></b>stands for: either all other variables except for <it>X </it>and <it>Y</it>, or single third variables, or any subset of all the other variables. Coexpression networks can be seen as the special case <b><it>Z </it></b>= &#8709;, which encodes marginal dependencies.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Full conditional models</p>
				</st>
				<p>Full conditional models (also called Markov networks, undirected graphical models, or Markov random fields) ask: "Can the correlation observed between two genes be explained by <it>all other genes </it>in the model?" Nodes <it>i </it>and <it>j </it>are connected by an edge if and only if</p>
				<p>
					<display-formula><it>X</it><sub><it>i </it></sub><m:math name="1471-2105-8-S6-S5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:menclose notation="updiagonalstrike"><m:mo>&#8869;</m:mo></m:menclose><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuGHLkIxgaGcaaaa@2E6E@</m:annotation></m:semantics></m:math><it>X</it><sub><it>j </it></sub>| <b><it>X</it></b><sub>rest</sub>,</display-formula>
				</p>
				<p>where "rest" denotes the set of all variables in <it>V </it>without <it>i </it>and <it>j</it>. Full conditional models become especially simple in a Gaussian setting. Assume that <b><it>X </it></b>~N (<it>&#956;</it>, &#931;), where <it>&#956;</it>, is the mean vector and the covariance matrix &#931; is invertible. Let <it>K </it>= &#931;<sup>-1 </sup>be the <it>concentration matrix </it>of the distribution (also called the <it>precision matrix</it>). The value -<it>k</it><sub><it>ij</it></sub>/<inline-formula><m:math name="1471-2105-8-S6-S5-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msqrt><m:mrow><m:msub><m:mi>k</m:mi><m:mrow><m:mi>i</m:mi><m:mi>i</m:mi></m:mrow></m:msub><m:msub><m:mi>k</m:mi><m:mrow><m:mi>j</m:mi><m:mi>j</m:mi></m:mrow></m:msub></m:mrow></m:msqrt></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaGcaaqaaiabdUgaRnaaBaaaleaacqWGPbqAcqWGPbqAaeqaaOGaem4AaS2aaSbaaSqaaiabdQgaQjabdQgaQbqabaaabeaaaaa@354C@</m:annotation></m:semantics></m:math></inline-formula> is called the <it>partial correlation coefficient </it>between genes <it>i </it>and <it>j </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Then it holds for <it>i</it>, <it>j </it>&#8712; <it>V </it>with <it>i </it>&#8800; <it>j </it>that</p>
				<p>
					<display-formula><it>X</it><sub><it>i </it></sub>&#8869; <it>X</it><sub><it>j </it></sub>| <b><it>X</it></b><sub>rest </sub>&#8660; <it>k</it><sub><it>ij </it></sub>= 0.</display-formula>
				</p>
				<p>This relation is used to define Gaussian graphical models (GGMs) <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. A GGM is an undirected graph on vertex set <it>V</it>. Each vertex <it>i </it>&#8712; <it>V </it>corresponds with a random variable <it>X</it><sub><it>i </it></sub>&#8712; <b><it>X</it></b>. The edge set of a GGM is defined by non-zero partial correlations. Vertices <it>i </it>and <it>j </it>are adjacent if and only if <it>k</it><sub><it>ij </it></sub>&#8800; 0. An example is shown in Fig. <figr fid="F2">2</figr>.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>A small Gaussian graphical model</p>
					</caption>
					<text>
						<p><b>A small Gaussian graphical model</b>. Example of a full conditional model. Missing edges between nodes indicate independencies of two genes given all the other genes in the model. We can read from the graph that <it>X </it>&#8869; <it>W </it>| {<it>Y</it>, <it>Z</it>} and <it>Y </it>&#8869; <it>W </it>| {<it>X</it>, <it>Z</it>} and <it>X </it>&#8869; <it>Z |</it> {<it>Y</it>, <it>W</it>}.</p>
					</text>
					<graphic file="1471-2105-8-S6-S5-2"/>
				</fig>
				<p>To estimate a GGM from data we need to know which elements of the precision matrix <it>K </it>are zero. This can be either done jointly for all edges, or by using non-exhaustive search algorithms like forward and backward search <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Alternatively, hypothesis testing-based model selection can be pursued by testing each edge separately for inclusion <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
				<sec>
					<st>
						<p>Comparison to correlation networks</p>
					</st>
					<p>Correlation graphs visualize the structure encoded in the correlation matrix &#931;, which tells us about the similarity of expression profiles. In GGMs, we model using the precision matrix <it>K </it>= &#931;<sup>-1</sup>, which shows correlation after correcting for the influence of all other genes. GGMs not only filter out all high correlations, which can be attributed to other genes, but may also draw attention to genes which are only very weakly correlated with a gene of interest, but highly related in terms of partial correlations in the context of the other neighboring genes in the GGM. These genes can be overlooked in correlation networks <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. GGMs have a second clear advantage over correlation networks. Whether directly or indirectly, almost all genes will be correlated. Thus, the <it>correlation coefficient </it>is a weak criterion for dependence, but zero correlation is a strong indicator for independence. On the other hand, <it>partial correlation coefficients </it>usually vanish. They provide a strong measure of dependence and, correspondingly, only a weak criterion of independence <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
				</sec>
				<sec>
					<st>
						<p>Problems with GGMs</p>
					</st>
					<p>Full conditional relationships can only be accurately estimated if the number of samples <it>N </it>is relatively large compared to the number of variables <it>p</it>. If the number of genes to be analyzed exceeds the number of distinct expression measurements (that is, if <it>p </it>&#8811; <it>N</it>), the correlation matrix of expression profiles between genes does not have full rank and cannot be inverted <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The <it>p </it>&#8811; <it>N</it>-situation is true for almost all genomic applications of graphical models. Therefore, one must either improve the estimators of partial correlations or resort to a simpler model. The basic idea in all of these approaches is that biological data are high-dimensional but <it>sparse </it>in the sense that only a small number of genes will regulate one specific gene of interest.</p>
					<p>Several papers suggest ways to estimate GGMs in a <it>p </it>&#8811; <it>N</it>-situation. Kishino and Waddell <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> propose gene selection by setting very low partial correlation coefficents to zero. As they state, the estimate still remains unstable. In one study, Sch&#228;fer and Strimmer <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> use bootstrap resampling together with the Moore-Penrose pseudoinverse and false discovery rate multiple testing, while in another <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, they discuss a linear shrinkage approach to regularization. Li and Gui <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> prospose a threshold gradient descent regularization procedure for estimating a sparse precision matrix.</p>
				</sec>
				<sec>
					<st>
						<p>Heuristic regression-based estimation</p>
					</st>
					<p>Full conditional independence models are closely related to a class of graphical models called dependency networks <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Dependency networks are built using sparse regression models to regress each gene <it>X</it><sub><it>i </it></sub>onto the remaining genes <b><it>X</it></b><sub><it>V </it>\ <it>i</it></sub>. The genes, which predict the state of gene <it>X</it><sub><it>i </it></sub>well, are connected to it by (directed) edges in the graph. In general, dependency networks may be inconsistent, <it>i.e</it>. the local regression models may not consistently specify a joint distribution over all genes. Thus, the resulting model is only an approximation of the true full conditional model. Still, dependency networks are widely used because of their flexibility and the computational advantage compared to structure learning in full conditional independence models. When learning dependency networks, a variety of sparse classification/regression techniques may be used to estimate the local distributions, including linear models with an <it>L</it><sub>1</sub>-penalty on model parameters <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B26">26</abbr></abbrgrp>, classification trees <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B27">27</abbr></abbrgrp>, or sparse Bayesian regression <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B28">28</abbr></abbrgrp>. We will see later that these approaches are very similar to the local regression models used in Bayesian networks. The difference is that in Bayesian networks an additional order of the variables is enforced.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Low-order conditional independence</p>
				</st>
				<p>Full conditional models are hard to estimate if the number of samples is small compared to the number of genes. While the last section described different statistical techniques for <it>p </it>&#8811; <it>N</it>-situations, this section introduces the complementary approach. Instead of enhancing the estimation procedure, one can ask a simpler question: "Can the correlation between two genes be explained by a <it>single third gene</it>?" In contrast to GGMs, low-order conditional independence models condition not on the rest of the genes, but only on single third genes. An edge between vertices <it>i </it>and <it>j</it>(<it>i </it>&#8800; <it>j</it>) is drawn if the correlation coefficient <it>&#961;</it><sub><it>ij </it></sub>&#8800; 0 and no third gene can explain the correlation:</p>
				<p>
					<display-formula><it>X</it><sub><it>i </it></sub><m:math name="1471-2105-8-S6-S5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:menclose notation="updiagonalstrike"><m:mo>&#8869;</m:mo></m:menclose><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuGHLkIxgaGcaaaa@2E6E@</m:annotation></m:semantics></m:math><it>X</it><sub><it>j </it></sub>| <it>X</it><sub><it>k </it></sub>for all <it>k </it>&#8712; <it>V</it>\{<it>i</it>, <it>j</it>}.</display-formula>
				</p>
				<p>This general idea can be implemented in different ways: In a Gaussian setting, first order conditional independence models were proposed by several authors <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. Testing for first order conditional independence involves only triples of genes at a time; thus, the problem for GGMs in high dimensions no longer exists. Wille <it>et al</it>. <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> use sparse Gaussian graphical modelling to identify modules of closely related genes and candidate genes for cross-talk between pathways in the Isoprenoid gene network in <it>Arabidopsis thaliana</it>.</p>
				<p>In another approach, Margolin <it>et al</it>. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> use conditional mutual information to test for first-order independence. The resulting method is called ARACNe and has been applied to expression profiles of human B cells <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The advantage of this approach is that the Gaussian assumption is dropped. However, this increased flexibility comes at a prize: ARACNe involves a number of computational approximations and Monte Carlo simulations, which could make the method unstable.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Bayesian networks</p>
			</st>
			<p>In the last sections we have seen methods to build graphs from marginal dependencies (<it>X</it><sub><it>i </it></sub><inline-formula><m:math name="1471-2105-8-S6-S5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:menclose notation="updiagonalstrike"><m:mo>&#8869;</m:mo></m:menclose><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuGHLkIxgaGcaaaa@2E6E@</m:annotation></m:semantics></m:math></inline-formula><it>X</it><sub><it>j</it></sub>), full conditional dependencies (<it>X</it><sub><it>i </it></sub><inline-formula><m:math name="1471-2105-8-S6-S5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:menclose notation="updiagonalstrike"><m:mo>&#8869;</m:mo></m:menclose><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuGHLkIxgaGcaaaa@2E6E@</m:annotation></m:semantics></m:math></inline-formula><it>X</it><sub><it>j </it></sub>| <b><it>X</it></b><sub>rest</sub>), or first order dependencies (<it>X</it><sub><it>i </it></sub><inline-formula><m:math name="1471-2105-8-S6-S5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:menclose notation="updiagonalstrike"><m:mo>&#8869;</m:mo></m:menclose><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuGHLkIxgaGcaaaa@2E6E@</m:annotation></m:semantics></m:math></inline-formula><it>X</it><sub><it>j </it></sub>| <it>X</it><sub><it>k </it></sub>for all <it>k </it>&#8712; rest). The logical next step is to ask for independencies <it>of all orders</it>. In the resulting graph, two vertices <it>i </it>and <it>j </it>are connected if <it>no subset </it>of the other variables can explain the correlation, that is, if</p>
			<p>
				<display-formula><it>X</it><sub><it>i </it></sub><m:math name="1471-2105-8-S6-S5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:menclose notation="updiagonalstrike"><m:mo>&#8869;</m:mo></m:menclose><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuGHLkIxgaGcaaaa@2E6E@</m:annotation></m:semantics></m:math><it>X</it><sub><it>j </it></sub>| <b><it>X</it></b><sub><it>S </it></sub>for all <it>S </it>&#8838; <it>V</it>\{<it>i</it>, <it>j</it>}.</display-formula>
			</p>
			<p>This includes testing marginal, first order and full conditional independencies. Thus, the number of edges will be smaller compared to the models in the previous sections. The graph encoding the above independence statements for all pairs of nodes is still undirected. It can be shown that knowing independences of all orders gives an even higher resolved representation of the correlation structure. The collection of independence statements already implies directions of some of the edges in the graph <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. The resulting directed probabilistic model is called a <it>Bayesian network</it>.</p>
			<sec>
				<st>
					<p>Definition of a Bayesian network</p>
				</st>
				<p>A (static) Bayesian network is a graphical representation of the dependency structure between the components of a random vector <b><it>X</it></b>. The individual random variables are associated with the vertices of a directed acyclic graph (DAG) <it>G</it>, which describes the dependency structure. Each node is described by a local probability distribution (LPD) and the joint distribution <it>p</it>(<b><it>x</it></b>) over all nodes factors as</p>
				<p>
					<display-formula>
						<m:math name="1471-2105-8-S6-S5-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mi>p</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:mi>x</m:mi>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>=</m:mo>
									<m:mstyle displaystyle="true">
										<m:munder>
											<m:mo>&#8719;</m:mo>
											<m:mrow>
												<m:mi>v</m:mi>
												<m:mo>&#8712;</m:mo>
												<m:mi>V</m:mi>
											</m:mrow>
										</m:munder>
										<m:mrow>
											<m:mi>p</m:mi>
											<m:mo stretchy="false">(</m:mo>
											<m:msub>
												<m:mi>x</m:mi>
												<m:mi>v</m:mi>
											</m:msub>
											<m:mo>|</m:mo>
											<m:msub>
												<m:mi>x</m:mi>
												<m:mrow>
													<m:mi>p</m:mi>
													<m:mi>a</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>v</m:mi>
													<m:mo stretchy="false">)</m:mo>
												</m:mrow>
											</m:msub>
											<m:mo>,</m:mo>
											<m:msub>
												<m:mi>&#952;</m:mi>
												<m:mi>v</m:mi>
											</m:msub>
											<m:mo stretchy="false">)</m:mo>
											<m:mo>,</m:mo>
										</m:mrow>
									</m:mstyle>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGOaakieWacqWF4baEcqGGPaqkcqGH9aqpdaqeqbqaaiabdchaWjabcIcaOiabdIha4naaBaaaleaacqWG2bGDaeqaaOGaeiiFaWNae8hEaG3aaSbaaSqaaiabdchaWjabdggaHjabcIcaOiabdAha2jabcMcaPaqabaGccqGGSaaliiGacqGF4oqCdaWgaaWcbaGaemODayhabeaakiabcMcaPiabcYcaSaWcbaGaemODayNaeyicI4SaemOvayfabeqdcqGHpis1aaaa@4CFA@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>where <it>&#952;</it><sub><it>v </it></sub>denotes the parametrization of the local distribution and <b><it>x</it></b><sub><it>pa</it>(<it>v</it>) </sub>is the vector of parent states denoting the activity levels of a gene's regulators. The DAG structure implies an ordering of the variables. The parents of each node are those variables that render it independent of all other predecessors. The factorization of the joint distribution is the key property of Bayesian networks. It allows to segment the set of variables into families, which can be treated individually. This basic definition of Bayesian networks poses a number of further questions, which are addressed in the following: (1.) How do the local probability distributions <it>p</it>(<it>x</it><sub><it>v </it></sub>| <b><it>x</it></b><sub><it>pa</it>(<it>v</it>)</sub>, <it>&#952;</it><sub><it>v</it></sub>) look like? (2.) How is conditional independence defined for DAGs? (3.) How can we learn a Bayesian network structure from data? (4.) Are there natural limits to structure learning?</p>
			</sec>
			<sec>
				<st>
					<p>Local probability distributions (LPDs)</p>
				</st>
				<p>Bayesian network models differ with respect to assumptions about the local probability distributions <it>p</it>(<it>x</it><sub><it>v </it></sub>| <b><it>x</it></b><sub><it>pa</it>(<it>v</it>)</sub>, <it>&#952;</it><sub><it>v</it></sub>) attached to each node <it>v </it>&#8712; <it>V</it>. There are two types of parametric LPDs used in practice: multinomial distributions for discrete nodes and Gaussian distributions (normal distributions) for continuous nodes. A discrete node with discrete parents follows a multinomial distribution parametrized by a set of probability vectors, one for each parent configuration. A continuous node with continuous parents follows a Gaussian distribution, where the mean is a linear combination of parent states. Conditional Gaussian (CG) networks are a combination of discrete and Gaussian networks. Continuous nodes follow a Gaussian distribution and are allowed discrete and continuous parents, while discrete nodes follow a multinomial distribution and are restricted to discrete parents. CG networks constitute the general class of graphical models studied in statistics <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
				<p>Another kind of parametric LPDs are regression trees, as used by Segal <it>et al</it>. <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. Each regression tree is a rooted binary tree with parents in the DAG as internal nodes and leaf nodes associated with univariate Gaussian distributions. The regression trees capture the local structure in the data, whereas the DAG describes the global structure <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>.</p>
				<p>Instead of the parametric approaches discussed so far, the relationship between parents and children in the DAG can also be modeled by non-parametric regression models <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>. The result is a non-linear continuous model. This is an advantage over multinomial or Gaussian Bayesian networks, which are either discrete or linear.</p>
				<p>Bulashevska and Eils <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> constrain LPDs to noisy logic functions modelling activatory or inhibitory parent-child relations. This has the advantage of simplifying and regularizing the model, while at the same time making it easier to interpret.</p>
				<p>Nachman <it>et al</it>. <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> use non-linear Michaelis-Mentens dynamics to model how the transcription rate of a gene depends on its regulators. This approach combines Bayesian networks with a biochemically realistic quantitative model of gene regulation.</p>
			</sec>
			<sec>
				<st>
					<p>Conditional independence in directed graphs</p>
				</st>
				<p>In Fig. <figr fid="F2">2</figr> we saw how to read off independence statements from a full conditional independence graph. How does this work in the case of Bayesian networks? The answer is given by the definition of <it>d-separation </it><abbrgrp><abbr bid="B36">36</abbr></abbrgrp> ("d" for directed), also called the <it>directed Global Markov condition </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The three archetypical situations of d-separation (chain, fork, and collider) can be seen in Fig. <figr fid="F3">3</figr>. In a chain <it>X </it>&#8594; <it>Y </it>&#8594; <it>Z</it>, the middle node <it>Y </it>blocks the information flow between <it>X </it>and <it>Z </it>and thus it holds that <it>X </it>&#8869; <it>Z </it>| <it>Y</it>. In a fork, where <it>X </it>and <it>Z </it>are both regulated by <it>Y</it>, knowing the state of the regulator renders the regulatees conditionally independent and thus again <it>X </it>&#8869; <it>Z </it>| <it>Y</it>. The last case is more surprising: If <it>X </it>and <it>Z </it>are <it>independent </it>regulators with a common target <it>Y</it>, then the state of <it>Y </it>gives us information about <it>X </it>and <it>Z</it>. For example, imagine that <it>Y </it>is only expressed if only one of its regulators is active, then seeing <it>Y </it>expressed and <it>X </it>active implies <it>Z </it>being inactive. Thus, in the collider <it>X </it>&#8594; <it>Y </it>&#8592; <it>Z </it>the middle node <it>Y </it>"unblocks" the path between <it>X </it>and <it>Z </it>and thus <it>X </it><inline-formula><m:math name="1471-2105-8-S6-S5-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:menclose notation="updiagonalstrike"><m:mo>&#8869;</m:mo></m:menclose><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuGHLkIxgaGcaaaa@2E6E@</m:annotation></m:semantics></m:math></inline-formula><it>Z </it>| <it>Y</it>.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Conditional indendence in directed graphs</p>
					</caption>
					<text>
						<p><b>Conditional indendence in directed graphs</b>. The three archetypal situations in the definition of d-separation. In the chain and the fork, conditioning on the middle node makes the others independent. In a collider, <it>X </it>and <it>Z </it>are marginally independent, but become dependent once <it>Y </it>is known.</p>
					</text>
					<graphic file="1471-2105-8-S6-S5-3"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Markov equivalence</p>
				</st>
				<p>Many Bayesian networks may represent the same statements of conditional independence. They are statistically undistinguishable and we call them <it>Markov equivalent</it>. All equivalent networks share the same underlying undirected graph (called the <it>skeleton</it>) but may differ in the direction of edges that are not part of a collider (also called a <it>v-structure</it>) <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Markov equivalence poses a theoretical limit on structure learning from data: even with infinitely many samples, we cannot resolve the structures in an equivalence class. In biological terms this means: even if we find two genes to be related it may not be clear which one is the regulator and which one is the regulatee. Without perturbation experiments this situation can not be further resolved.</p>
			</sec>
			<sec>
				<st>
					<p>Acyclicity in a cyclic world</p>
				</st>
				<p>Bayesian networks allow the highest resolution of correlation structure. Still, they suffer from a severe shortcoming: they are acyclic. With cycles, we cannot decompose the joint distribution as in the definition of Bayesian networks. Biological networks are all known to contain feedback loops and cycles <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Modeling the cell cycle with an acyclic model <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> can only be a preliminary step. One extension of Bayesian networks that encompasses cyclic structures is the factor graph network model of Gat-Viks <it>et. al</it>. <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. A second way to address the cycle problem is by assuming that the system evolves over time. This is shown in Fig. <figr fid="F4">4</figr>. We no longer model a static random vector <b><it>X </it></b>but a time series <b><it>X </it></b>[1],..., <b><it>X </it></b>[<it>T</it>] of observing <b><it>X </it></b>at <it>T </it>timepoints. If we assume that <it>X</it><sub><it>v </it></sub>at time <it>t </it>+ 1 can only have parents at time <it>t</it>, then cycles "unroll" and the resulting model is again acyclic and tractable: it is called a <it>Dynamic Bayesian network </it>(DBN) <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. DBNs have found many applications in network reconstruction <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. They are often augmented with hidden nodes <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, which can describe transcription factor activity <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> or any other kind of environmental or non-transcriptional effects in the cell <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. In summary, DBNs provide one of the most flexible frameworks for modeling cellular networks.</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Cycles unroll over time</p>
					</caption>
					<text>
						<p><b>Cycles unroll over time</b>. The cycle unrolls into an acyclic graph over different time slices.</p>
					</text>
					<graphic file="1471-2105-8-S6-S5-4"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Score based structure learning</p>
				</st>
				<p>In correlation networks, GGMs and sparse GGMs we use statistical tests for each gene pair to decide whether the data support an edge or not. The number of tests to be done in these models is limited, even though it can be big in the case of sparse GGMs. For Bayesian networks we need to test independence of a gene pair for every subset of the other genes. This is called <it>constraint-based </it>learning of Bayesian networks <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. For problems with more than a handful of variables testing becomes infeasible very quickly. In applications in computational biology the network structure is therefore mostly estimated by score based techniques. In the following we review maximum likelihood scores and Bayesian scores to evaluate model fit to data. Once the score is defined, model selection is posed as an optimization problem over the discrete space of possible model structures. Additional topics are how to fight overfitting and how to encode prior information.</p>
				<sec>
					<st>
						<p>Maximum likelihood</p>
					</st>
					<p>A straight-forward idea for model selection is to choose the DAG <it>G</it>, which allows the best fit to data <b><it>D</it></b>. The best fit for a given DAG <it>G </it>is determined by maximizing the likelihood <it>p</it>(<b><it>D</it></b>|<it>G</it>, <it>&#952;</it>) as a function of <it>&#952;</it>, the parameters of the local probability distributions. A score for DAG <it>G </it>is then given by</p>
					<p>
						<display-formula>
							<m:math name="1471-2105-8-S6-S5-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
								<m:semantics>
									<m:mrow>
										<m:msub>
											<m:mrow>
												<m:mtext>score</m:mtext>
											</m:mrow>
											<m:mrow>
												<m:mi>M</m:mi>
												<m:mi>L</m:mi>
											</m:mrow>
										</m:msub>
										<m:mo stretchy="false">(</m:mo>
										<m:mi>G</m:mi>
										<m:mo stretchy="false">)</m:mo>
										<m:mo>=</m:mo>
										<m:munder>
											<m:mrow>
												<m:mi>max</m:mi>
												<m:mo>&#8289;</m:mo>
											</m:mrow>
											<m:mi>&#952;</m:mi>
										</m:munder>
										<m:mi>p</m:mi>
										<m:mo stretchy="false">(</m:mo>
										<m:mi>D</m:mi>
										<m:mo>|</m:mo>
										<m:mi>G</m:mi>
										<m:mo>,</m:mo>
										<m:mi>&#952;</m:mi>
										<m:mo stretchy="false">)</m:mo>
										<m:mo>.</m:mo>
									</m:mrow>
									<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqqGZbWCcqqGJbWycqqGVbWBcqqGYbGCcqqGLbqzdaWgaaWcbaGaemyta0KaemitaWeabeaakiabcIcaOiabdEeahjabcMcaPiabg2da9maaxababaGagiyBa0MaeiyyaeMaeiiEaGhaleaaiiGacqWF4oqCaeqaaOGaemiCaaNaeiikaGccbmGae4hraqKaeiiFaWNaem4raCKaeiilaWIae8hUdeNaeiykaKIaeiOla4caaa@4A35@</m:annotation>
								</m:semantics>
							</m:math>
						</display-formula>
					</p>
					<p>Unfortunately, the likelihood is not an appropriate score to decide between models since it tends to overfitting. Richer models with more edges will have a better likelihood than simpler ones, since the additional number of parameters allows a better fit to the data. A standard solution to this problem is to penalize the maximum likelihood score according to model complexity. An often used example of this general strategy is scoring with the Bayesian information criterion.</p>
				</sec>
				<sec>
					<st>
						<p>Bayesian information criterion (BIC)</p>
					</st>
					<p>Contrary to what the name suggests, the BIC score <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> is not a Bayesian score. It is a regularized maximum likelihood estimate, which controls overfitting by penalizing the maximal likelihood of the model with respect to the number of model parameters. It is defined as</p>
					<p>
						<display-formula>
							<m:math name="1471-2105-8-S6-S5-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
								<m:semantics>
									<m:mrow>
										<m:msub>
											<m:mrow>
												<m:mtext>score</m:mtext>
											</m:mrow>
											<m:mrow>
												<m:mi>B</m:mi>
												<m:mi>I</m:mi>
												<m:mi>C</m:mi>
											</m:mrow>
										</m:msub>
										<m:mo stretchy="false">(</m:mo>
										<m:mi>G</m:mi>
										<m:mo stretchy="false">)</m:mo>
										<m:mo>=</m:mo>
										<m:munder>
											<m:mrow>
												<m:mi>max</m:mi>
												<m:mo>&#8289;</m:mo>
											</m:mrow>
											<m:mi>&#952;</m:mi>
										</m:munder>
										<m:mi>p</m:mi>
										<m:mo stretchy="false">(</m:mo>
										<m:mi>D</m:mi>
										<m:mo>|</m:mo>
										<m:mi>G</m:mi>
										<m:mo>,</m:mo>
										<m:mi>&#952;</m:mi>
										<m:mo stretchy="false">)</m:mo>
										<m:mo>&#8722;</m:mo>
										<m:mfrac>
											<m:mi>d</m:mi>
											<m:mn>2</m:mn>
										</m:mfrac>
										<m:mi>log</m:mi>
										<m:mo>&#8289;</m:mo>
										<m:mi>N</m:mi>
										<m:mo>,</m:mo>
									</m:mrow>
									<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqqGZbWCcqqGJbWycqqGVbWBcqqGYbGCcqqGLbqzdaWgaaWcbaGaemOqaiKaemysaKKaem4qameabeaakiabcIcaOiabdEeahjabcMcaPiabg2da9maaxababaGagiyBa0MaeiyyaeMaeiiEaGhaleaaiiGacqWF4oqCaeqaaOGaemiCaaNaeiikaGccbmGae4hraqKaeiiFaWNaem4raCKaeiilaWIae8hUdeNaeiykaKIaeyOeI0YaaSaaaeaacqWGKbazaeaacqaIYaGmaaGagiiBaWMaei4Ba8Maei4zaCMaemOta4KaeiilaWcaaa@53A7@</m:annotation>
								</m:semantics>
							</m:math>
						</display-formula>
					</p>
					<p>where <it>d </it>is the number of parameters and the factor log <it>N </it>scales the penalty with respect to the likelihood. The BIC score can also be used to learn Bayesian networks with missing values or hidden variables. The likelihood has then to be maximized via the Expectation-Maximization (EM) algorithm. In such a scenario, the BIC score is used by Nachman <it>et al</it>. <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> to learn kinetic models of transcription factors and their targets. They treat protein activities and kinetic constants as hidden variables.</p>
				</sec>
				<sec>
					<st>
						<p>Bayesian scores</p>
					</st>
					<p>In most cases a full Bayesian approach is preferred over ML or BIC. In Bayesian structure learning we evaluate the posterior probability of model topology <it>G </it>given data <b><it>D</it></b>:</p>
					<p>
						<display-formula>
							<m:math name="1471-2105-8-S6-S5-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
								<m:semantics>
									<m:mrow>
										<m:msub>
											<m:mrow>
												<m:mtext>score</m:mtext>
											</m:mrow>
											<m:mrow>
												<m:mi>B</m:mi>
												<m:mi>a</m:mi>
												<m:mi>y</m:mi>
												<m:mi>e</m:mi>
												<m:mi>s</m:mi>
											</m:mrow>
										</m:msub>
										<m:mo>=</m:mo>
										<m:mi>p</m:mi>
										<m:mo stretchy="false">(</m:mo>
										<m:mi>G</m:mi>
										<m:mo>|</m:mo>
										<m:mi>D</m:mi>
										<m:mo stretchy="false">)</m:mo>
										<m:mo>=</m:mo>
										<m:mfrac>
											<m:mrow>
												<m:mi>p</m:mi>
												<m:mo stretchy="false">(</m:mo>
												<m:mi>D</m:mi>
												<m:mo>|</m:mo>
												<m:mi>G</m:mi>
												<m:mo stretchy="false">)</m:mo>
												<m:mo>&#8901;</m:mo>
												<m:mi>p</m:mi>
												<m:mo stretchy="false">(</m:mo>
												<m:mi>G</m:mi>
												<m:mo stretchy="false">)</m:mo>
											</m:mrow>
											<m:mrow>
												<m:mi>p</m:mi>
												<m:mo stretchy="false">(</m:mo>
												<m:mi>D</m:mi>
												<m:mo stretchy="false">)</m:mo>
											</m:mrow>
										</m:mfrac>
										<m:mo>.</m:mo>
									</m:mrow>
									<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqqGZbWCcqqGJbWycqqGVbWBcqqGYbGCcqqGLbqzdaWgaaWcbaGaemOqaiKaemyyaeMaemyEaKNaemyzauMaem4Camhabeaakiabg2da9iabdchaWjabcIcaOiabdEeahjabcYha8Hqadiab=reaejabcMcaPiabg2da9maalaaabaGaemiCaaNaeiikaGIae8hraqKaeiiFaWNaem4raCKaeiykaKIaeyyXICTaemiCaaNaeiikaGIaem4raCKaeiykaKcabaGaemiCaaNaeiikaGIae8hraqKaeiykaKcaaiabc6caUaaa@5580@</m:annotation>
								</m:semantics>
							</m:math>
						</display-formula>
					</p>
					<p>The denominator <it>p</it>(<b><it>D</it></b>) is an average of data likelihoods over all possible models. This normalizing constant is the same for all models, and thus we do not need compute it to decide between competing models. The two main terms to consider in the Bayesian score are the prior over model structures, <it>p</it>(<it>G</it>), and the marginal likelihood <it>p</it>(<b><it>D</it></b>|<it>G</it>).</p>
				</sec>
				<sec>
					<st>
						<p>Marginal likelihood of network structure</p>
					</st>
					<p>The marginal likelihood <it>p</it>(<b><it>D</it></b>|<it>G</it>) is the key component of Bayesian scoring metrics. It equals the full model likelihood averaged over parameters of local probability distributions, that is,</p>
					<p>
						<display-formula>
							<m:math name="1471-2105-8-S6-S5-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
								<m:semantics>
									<m:mrow>
										<m:mi>p</m:mi>
										<m:mo stretchy="false">(</m:mo>
										<m:mi>D</m:mi>
										<m:mo>|</m:mo>
										<m:mi>G</m:mi>
										<m:mo stretchy="false">)</m:mo>
										<m:mo>=</m:mo>
										<m:mstyle displaystyle="true">
											<m:mrow>
												<m:msub>
													<m:mo>&#8747;</m:mo>
													<m:mi>&#920;</m:mi>
												</m:msub>
												<m:mrow>
													<m:mi>p</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>D</m:mi>
													<m:mo>|</m:mo>
													<m:mi>G</m:mi>
													<m:mo>,</m:mo>
													<m:mi>&#952;</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mi>p</m:mi>
													<m:mo stretchy="false">(</m:mo>
													<m:mi>&#952;</m:mi>
													<m:mo>|</m:mo>
													<m:mi>G</m:mi>
													<m:mo stretchy="false">)</m:mo>
													<m:mtext>d</m:mtext>
													<m:mi>&#920;</m:mi>
													<m:mo>.</m:mo>
												</m:mrow>
											</m:mrow>
										</m:mstyle>
									</m:mrow>
									<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGOaakieWacqWFebarcqGG8baFcqWGhbWrcqGGPaqkcqGH9aqpdaWdraqaaiabdchaWjabcIcaOiab=reaejabcYha8jabdEeahjabcYcaSGGaciab+H7aXjabcMcaPiabdchaWjabcIcaOiab+H7aXjabcYha8jabdEeahjabcMcaPiabbsgaKjabfI5arjabc6caUaWcbaGaeuiMdefabeqdcqGHRiI8aaaa@4C65@</m:annotation>
								</m:semantics>
							</m:math>
						</display-formula>
					</p>
					<p>Marginalization is the reason why the LPD parameters <it>&#952; </it>do not enter the definition of the posterior above. They are treated as <it>nuisance parameters </it>and have been integrated out. It is important to note that the LPD parameters were not maximized as it would be done in a maximum likelihood estimate or in a BIC score. Averaging instead of maximizing prevents the Bayesian score from overfitting.</p>
					<p>Computation of marginal likelihood depends on the choice of local probability distributions and local priors in the Bayesian network model. To compute the marginal likelihood analytically, the prior <it>p</it>(<it>&#952;</it>|<it>G</it>) must fit to the likelihood <it>p</it>(<b><it>D</it></b>|<it>G</it>, <it>&#952;</it>). Statistically, this fit is called "conjugacy". A prior distribution is called <it>conjugate </it>to a likelihood, if the posterior is of the same distributional form as the prior <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. If no conjugate prior is available, the marginal likelihood has to be approximated.</p>
					<p>The marginal likelihood for discrete Bayesian networks was first computed by Cooper and Herskovits <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> and is further discussed by Heckerman <it>et al</it>. <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. The conjugate prior for the multinomial distribution is the Dirichlet prior <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. Assuming independence of the prior for each node and each parent configuration, the score decomposes into independent contributions for each family of nodes. Corresponding results exist for Gaussion networks using a Normal-Wishart prior <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. The marginal likelihood again decomposes into node-wise contributions. Conditional Gaussian networks are a mix of discrete and Gaussian nodes <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. The marginal likelihood decomposes into a Gaussian part and a discrete part.</p>
					<p>For tree LPDs, the marginal likelihood at each node of the DAG further splits into independent components for each leaf of the local regression tree. Conjugate analysis and analytic results are possible using normal-gamma priors for each leaf node <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>.</p>
					<p>For non-parametrix LPDs, Boolean logic LPDs, and kinetic modelling LPDs, conjugate analysis and analytic computation of the marginal likelihood are not possible. Imoto <it>et al</it>. <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> use a Laplace approximation to approach the true marginal likelihood. Bulashevska and Eils <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> use Gibbs sampling to estimate the model posterior <it>p</it>(<it>G</it>|<b><it>D</it></b>) and the parameter posterior <it>p</it>(<it>&#952;</it>|<b><it>D</it></b>). Nachman <it>et al</it>. <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> use the BIC score for model selection.</p>
				</sec>
				<sec>
					<st>
						<p>Structure prior</p>
					</st>
					<p>Structure priors <it>p</it>(<it>G</it>) help to focus inference on reasonable models by including biological prior knowledge or integrating different data sources. Very often the biological prior knowledge can be encoded in a prior network, which is then to be refined by statistical structure learning. The first idea is to restrict the search space to a &#8211; conveniently defined &#8211; vicinity <inline-formula><m:math name="1471-2105-8-S6-S5-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">V</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaaimaacqWFveVvaaa@384B@</m:annotation></m:semantics></m:math></inline-formula> (<inline-formula><m:math name="1471-2105-8-S6-S5-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mtext mathvariant="script">P</m:mtext><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaacqqGqbauaaa@3786@</m:annotation></m:semantics></m:math></inline-formula>) of the prior network <inline-formula><m:math name="1471-2105-8-S6-S5-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mtext mathvariant="script">P</m:mtext><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaacqqGqbauaaa@3786@</m:annotation></m:semantics></m:math></inline-formula>. All the DAGs in the restricted search space are considered equally likely. This can be interpreted as a rigid structure prior of the form</p>
					<p>
						<display-formula>
							<m:math name="1471-2105-8-S6-S5-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
								<m:semantics>
									<m:mrow>
										<m:mi>p</m:mi>
										<m:mo stretchy="false">(</m:mo>
										<m:mi>G</m:mi>
										<m:mo stretchy="false">)</m:mo>
										<m:mo>=</m:mo>
										<m:mrow>
											<m:mo>{</m:mo>
											<m:mrow>
												<m:mtable columnalign="left">
													<m:mtr columnalign="left">
														<m:mtd columnalign="left">
															<m:mrow>
																<m:mn>1</m:mn>
																<m:mo>/</m:mo>
																<m:mo>|</m:mo>
																<m:mi mathvariant="script">V</m:mi>
																<m:mo stretchy="false">(</m:mo>
																<m:mtext mathvariant="script">P</m:mtext>
																<m:mo stretchy="false">)</m:mo>
																<m:mo>|</m:mo>
															</m:mrow>
														</m:mtd>
														<m:mtd columnalign="left">
															<m:mrow>
																<m:mtext>if&#160;</m:mtext>
																<m:mi>G</m:mi>
																<m:mo>&#8712;</m:mo>
																<m:mi mathvariant="script">V</m:mi>
																<m:mo stretchy="false">(</m:mo>
																<m:mtext mathvariant="script">P</m:mtext>
																<m:mo stretchy="false">)</m:mo>
																<m:mo>,</m:mo>
															</m:mrow>
														</m:mtd>
													</m:mtr>
													<m:mtr columnalign="left">
														<m:mtd columnalign="left">
															<m:mn>0</m:mn>
														</m:mtd>
														<m:mtd columnalign="left">
															<m:mrow>
																<m:mtext>else</m:mtext>
																<m:mo>.</m:mo>
															</m:mrow>
														</m:mtd>
													</m:mtr>
												</m:mtable>
											</m:mrow>
										</m:mrow>
									</m:mrow>
									<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGOaakcqWGhbWrcqGGPaqkcqGH9aqpdaGabaqaauaabaqaciaaaeaacqaIXaqmcqGGVaWlcqGG8baFt0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaacqWFveVvcqGGOaakcqqGqbaucqGGPaqkcqGG8baFaeaacqqGPbqAcqqGMbGzcqqGGaaicqWGhbWrcqGHiiIZcqWFveVvcqGGOaakcqqGqbaucqGGPaqkcqGGSaalaeaacqaIWaamaeaacqqGLbqzcqqGSbaBcqqGZbWCcqqGLbqzcqGGUaGlaaaacaGL7baaaaa@5942@</m:annotation>
								</m:semantics>
							</m:math>
						</display-formula>
					</p>
					<p>A smoother way to guarantee that DAGs similar to the prior network <inline-formula><m:math name="1471-2105-8-S6-S5-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mtext mathvariant="script">P</m:mtext><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaacqqGqbauaaa@3786@</m:annotation></m:semantics></m:math></inline-formula> get higher prior probability is the following. We measure the confidence of edge (<it>v</it>, <it>w</it>) by a value 0 &lt;<it>&#954;</it><sub><it>vw </it></sub>&#8804; 1. A structure prior can then be defined proportional to a product of weights <it>&#954;</it><sub><it>vw </it></sub>over all edges (<it>v</it>, <it>w</it>):</p>
					<p>
						<display-formula>
							<m:math name="1471-2105-8-S6-S5-i12" xmlns:m="http://www.w3.org/1998/Math/MathML">
								<m:semantics>
									<m:mrow>
										<m:mi>p</m:mi>
										<m:mo stretchy="false">(</m:mo>
										<m:mi>G</m:mi>
										<m:mo stretchy="false">)</m:mo>
										<m:mo>&#8733;</m:mo>
										<m:mstyle displaystyle="true">
											<m:munder>
												<m:mo>&#8719;</m:mo>
												<m:mrow>
													<m:mi>v</m:mi>
													<m:mo>,</m:mo>
													<m:mi>w</m:mi>
													<m:mo>&#8712;</m:mo>
													<m:mi>V</m:mi>
												</m:mrow>
											</m:munder>
											<m:mrow>
												<m:msub>
													<m:mi>&#954;</m:mi>
													<m:mrow>
														<m:mi>v</m:mi>
														<m:mi>w</m:mi>
													</m:mrow>
												</m:msub>
											</m:mrow>
										</m:mstyle>
										<m:mo>.</m:mo>
									</m:mrow>
									<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGOaakcqWGhbWrcqGGPaqkcqGHDisTdaqeqbqaaGGaciab=P7aRnaaBaaaleaacqWG2bGDcqWG3bWDaeqaaaqaaiabdAha2jabcYcaSiabdEha3jabgIGiolabdAfawbqab0Gaey4dIunakiabc6caUaaa@40A9@</m:annotation>
								</m:semantics>
							</m:math>
						</display-formula>
					</p>
					<p>The normalization constant, which would be necessary to make the right-hand side a density, is the same for all models and can be ignored when computing relative posterior probabilities. What are smart choices of <it>&#954;</it><sub><it>vw</it></sub>? There are several approaches suggested in the literature. Heckerman <it>et al</it>. <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> assume constant penalty <it>&#954;</it><sub><it>vw </it></sub>&#8801; <it>&#954; </it>for all edges in which <it>G </it>and <inline-formula><m:math name="1471-2105-8-S6-S5-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mtext mathvariant="script">P</m:mtext><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaacqqGqbauaaa@3786@</m:annotation></m:semantics></m:math></inline-formula> differ. Thus, <it>p</it>(<it>D</it>) &#8733; <it>&#954;</it><sup><it>&#949; </it></sup>where <it>&#949; </it>is the number of edges in which <it>G </it>differs from the prior DAG <inline-formula><m:math name="1471-2105-8-S6-S5-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mtext mathvariant="script">P</m:mtext><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaacqqGqbauaaa@3786@</m:annotation></m:semantics></m:math></inline-formula>. Another approach by Imoto <it>et al</it>. <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> and Tamada <it>et. al</it>. <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> uses a network prior in an iterative scheme. They construct a Bayesian network from microarray data, propose putative transcription factors from the network structure, and search for common motifs in the DNA sequences of children and grand-children of transcription factors. They then re-learn the network by penalizing edges without motif evidence harder than edges with motif evidence. Bernard <it>et al</it>. <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> define weights from <it>p</it>-values of binding location data. They assume that <it>p</it>-values follow an exponential distribution if the edge is present and a uniform distribution if it is not. By Bayes' rule they derive probabilities for an edge to be present given the <it>p</it>-values from the location data. The free parameter of the exponential distribution is then integrated out and the final probabilities <inline-formula><m:math name="1471-2105-8-S6-S5-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mtext mathvariant="script">P</m:mtext><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaacqqGqbauaaa@3786@</m:annotation></m:semantics></m:math></inline-formula><sub><it>vw </it></sub>are used as weights in a structure prior. Fig. <figr fid="F5">5</figr> shows a comparison of these three prior definitions. They can be organized by the weights <it>&#954;</it><sub><it>vw </it></sub>they give for the presence or absence of an edge given prior information.</p>
					<fig id="F5">
						<title>
							<p>Figure 5</p>
						</title>
						<caption>
							<p>An overview of structure priors</p>
						</caption>
						<text>
							<p><b>An overview of structure priors</b>. Comparison of edge weights suggested by Heckerman <it>et al</it>. [65], Imoto <it>et al</it>. [43] and Bernard <it>et al</it>. [55]. Rows correspond to prior information. In the left two examples the prior can be described as binary, while on the right it is expressed as a <it>p</it>-value derived from a second data set. The entries in the table are the weights <it>&#954;</it><sub><it>vw </it></sub>for each edge depending on whether <it>G </it>agrees with <inline-formula><m:math name="1471-2105-8-S6-S5-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mtext mathvariant="script">P</m:mtext><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaacqqGqbauaaa@3786@</m:annotation></m:semantics></m:math></inline-formula> or not. The middle table holds <it>&#958;</it><sub>1 </sub>&lt;<it>&#958;</it><sub>2</sub>, i.e. edges with motif evidence contribute more than edges without. The structure prior is then a product over the weights for all edges.</p>
						</text>
						<graphic file="1471-2105-8-S6-S5-5"/>
					</fig>
				</sec>
				<sec>
					<st>
						<p>Discretization</p>
					</st>
					<p>In most applications the Bayesian score for discrete data is used. When learning gene regulatory networks from microarray data, it is necessary to preprocess the continuous gene expression values and discretize them. In general, discretization may be carried out for computational efficiency, or because background knowledge suggests that the underlying variables are indeed discrete. Discretizing continuous variables results in a loss of information, although it can also reduce noise, since discretized data can be more stable with respect to random variations of the mRNA measurements. Several methods to discretize microarray data have been proposed in the literature: Friedman <it>et al</it>. <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> discretize expression values into three categories, depending on whether the expression rate is significantly lower than, similar to, or greater than control, respectively. Pe'er <it>et al</it>. <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> introduce an adaptive discretization procedure. They model the expression level of a gene in different experiments as samples from a mixture of normal distributions, where each normal component corresponds to a specific state. They then use standard <it>k</it>-means clustering for inference. Hartemink <it>et al</it>. <abbrgrp><abbr bid="B69">69</abbr></abbrgrp> use a discretization coalescence method, which incrementally reduces the number of discretization levels for each gene while preserving as much total mutual information between genes as possible. In the previous three approaches, expression levels were discretized before and independently of structure learning. Suboptimal discretization algorithms can lead to degraded network structure. To avoid this, Steck and Jaakkola <abbrgrp><abbr bid="B70">70</abbr></abbrgrp> derive a scoring function to efficiently <it>jointly </it>optimize the discretization policy and the structure of the graphical model.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Regularization</p>
				</st>
				<p>Regularization is a technique used in machine learning to ensure uniqueness of solution and to fight overfitting by constraining admissible models <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B71">71</abbr></abbrgrp>. Regularization is always needed in <it>p </it>&#8811; <it>N</it>-situations. We already saw examples of regularization when Gaussian graphical models were adapted to the <it>p </it>&#8811; <it>N</it>-situation. Different methods were proposed for Bayesian networks. Steck and Jaakkola <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> show that scaling down the parameters of the Dirichlet prior used for the computation of the marginal likelihood leads to a strong regularization of the model structure and a sparse graph. Another way to regularize Bayesian networks is to constrain the local probability distributions. Bulashevska and Eils <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> suggest learning noisy logic gates instead of unconstrained multinomial LPDs. The drawback is that Bayesian conjugate analysis, which leads to the analytic solution of the marginal likelihood, is no longer possible and Gibbs sampling has to be applied. Module networks <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp> constrain the number of parameters in the model by assuming that groups of genes (so called <it>modules</it>) share the same dependence on regulators. Learning module networks involves an iteration of assigning genes to modules and searching for dependencies between modules.</p>
			</sec>
			<sec>
				<st>
					<p>Model selection and assessment</p>
				</st>
				<p>To search for the DAG with highest score is mathematically trivial: compute the score for every possible DAG and choose the one that achieves the highest value. What makes exhaustive search computationally infeasible in almost all applications is the huge number of DAGs. The number of DAGs on <it>n </it>edges is</p>
				<p>
					<display-formula>
						<m:math name="1471-2105-8-S6-S5-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:msub>
										<m:mi>a</m:mi>
										<m:mi>n</m:mi>
									</m:msub>
									<m:mo>=</m:mo>
									<m:mstyle displaystyle="true">
										<m:munderover>
											<m:mo>&#8721;</m:mo>
											<m:mrow>
												<m:mi>k</m:mi>
												<m:mo>=</m:mo>
												<m:mn>1</m:mn>
											</m:mrow>
											<m:mi>n</m:mi>
										</m:munderover>
										<m:mrow>
											<m:msup>
												<m:mrow>
													<m:mo stretchy="false">(</m:mo>
													<m:mo>&#8722;</m:mo>
													<m:mn>1</m:mn>
													<m:mo stretchy="false">)</m:mo>
												</m:mrow>
												<m:mrow>
													<m:mi>k</m:mi>
													<m:mo>&#8722;</m:mo>
													<m:mn>1</m:mn>
												</m:mrow>
											</m:msup>
										</m:mrow>
									</m:mstyle>
									<m:mrow>
										<m:mo>(</m:mo>
										<m:mrow>
											<m:mtable>
												<m:mtr>
													<m:mtd>
														<m:mi>n</m:mi>
													</m:mtd>
												</m:mtr>
												<m:mtr>
													<m:mtd>
														<m:mi>k</m:mi>
													</m:mtd>
												</m:mtr>
											</m:mtable>
										</m:mrow>
										<m:mo>)</m:mo>
									</m:mrow>
									<m:msup>
										<m:mn>2</m:mn>
										<m:mrow>
											<m:mi>k</m:mi>
											<m:mo stretchy="false">(</m:mo>
											<m:mi>n</m:mi>
											<m:mo>&#8722;</m:mo>
											<m:mi>k</m:mi>
											<m:mo stretchy="false">)</m:mo>
										</m:mrow>
									</m:msup>
									<m:msub>
										<m:mi>a</m:mi>
										<m:mrow>
											<m:mi>n</m:mi>
											<m:mo>&#8722;</m:mo>
											<m:mi>k</m:mi>
										</m:mrow>
									</m:msub>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGHbqydaWgaaWcbaGaemOBa4gabeaakiabg2da9maaqahabaGaeiikaGIaeyOeI0IaeGymaeJaeiykaKYaaWbaaSqabeaacqWGRbWAcqGHsislcqaIXaqmaaaabaGaem4AaSMaeyypa0JaeGymaedabaGaemOBa4ganiabggHiLdGcdaqadaqaauaabeqaceaaaeaacqWGUbGBaeaacqWGRbWAaaaacaGLOaGaayzkaaGaeGOmaiZaaWbaaSqabeaacqWGRbWAcqGGOaakcqWGUbGBcqGHsislcqWGRbWAcqGGPaqkaaGccqWGHbqydaWgaaWcbaGaemOBa4MaeyOeI0Iaem4AaSgabeaaaaa@4FF8@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>with <it>a</it><sub>0 </sub>= 1 <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. The number of DAGs increases explosively, as the first few steps in the recursion show: 1, 1, 3, 25, 543, 29 281, 3 781 503, 1 138 779 265. That means, we must use heuristic strategies to find high-scoring Bayesian networks without enumerating all possible DAGs.</p>
				<sec>
					<st>
						<p>Defining search space</p>
					</st>
					<p>First we need to decide how to describe models of interest. This defines the model space, in which we search for models describing the data well. To apply search heuristics we have to equip the search space with a neighborhood relation, that is, operators to move from one point of the search space to the next one. The most simple search space results from defining a neighborhood relation on DAGs. Two DAGs are neighbors if they differ by one edge, which is either missing in one of them or directed the other way round. Madigan <it>et al</it>. <abbrgrp><abbr bid="B74">74</abbr></abbrgrp> and Chickering <abbrgrp><abbr bid="B75">75</abbr></abbrgrp> restrict the search space to Markov equivalence classes of DAGs which uniquely describe a joint distribution. Thus, no time is lost in evaluating DAG models which are equivalent anyway. Friedman and Koller <abbrgrp><abbr bid="B76">76</abbr></abbrgrp> search over orders of nodes rather than over network structures. They argue that the space of orders is smaller and more regular than the space of structures and has a much smoother posterior landscape.</p>
				</sec>
				<sec>
					<st>
						<p>Search heuristics</p>
					</st>
					<p>Most of the following search algorithms can be applied to all search spaces, even though they are usually applied to DAGs. They return a single best network. A simple and fast but still powerful method is <it>hill-climbing </it>by greedy search. First, choose a point in search space to start from, e.g. a random graph or the empty graph. Compute the posterior probability for all graphs in the neighborhood of the current graph and select the graph with highest score. Iterate until no graph in the neighborhood has a larger score than the current graph. This procedure finds local maxima of the Bayesian scoring metric. The K2-algorithm <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> is a variant of greedy search, which assumes that the order of nodes is known. Several approaches have been suggested to speed up model search. The <it>sparse candidate algorithm </it><abbrgrp><abbr bid="B77">77</abbr></abbrgrp> restricts the number of possible parents for each node by searching for pairs of nodes which are highly dependent. The <it>ideal parent algorithm </it><abbrgrp><abbr bid="B47">47</abbr><abbr bid="B78">78</abbr></abbrgrp> constructs a parent profile perfectly explaining the child behaviour and uses it to guide parent selection and to restrict the search space. Pe&#241;a <it>et al</it>. <abbrgrp><abbr bid="B79">79</abbr></abbrgrp> grow Bayesian networks starting from a target gene of interest. They iteratively add to the Bayesian network parents and children of all the genes already included in it. The algorithm stops after a predefined number of steps and thus, intuitively, highlights the surrounding area of the seed gene without having to compute the complete Bayesian network over all genes. Friedman <abbrgrp><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr></abbrgrp> introduces the <it>structural EM algorithm </it>to learn Bayesian networks in the presence of missing values or hidden variables. It is an extension of the Expectation-Maximization (EM) algorithm that performs structure search <it>inside </it>the EM procedure and shows improvements in terms of speed and accuracy.</p>
				</sec>
				<sec>
					<st>
						<p>Assessing uncertainty</p>
					</st>
					<p>The problem with optimal models is, as Edwards <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> puts it: "Any method (or statistician) that takes a complex multivariate dataset and, from it, claims to identify one true model, is both naive and misleading". The emphasis is on "one true model". A better strategy than choosing a single best model is exploring the whole posterior distribution. Unfortunately, direct sampling from the posterior is intractable. The most we know about the data distribution is the empirical distribution of observations in the dataset. A classical approach to assess variability in the data is bootstrapping <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>. The strategy is to sample with replacement from the observations in the data set to get a number of bootstrap datasets, and then learn a network on every bootstrap dataset. The relative frequency of network features in the resulting network structures can be used as a measure of reliability <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B68">68</abbr></abbrgrp>. Bootstrap samples can contain multiple copies of identical data points. This implies strong statistical dependencies between variables when given a small dataset. As a consequence, the resulting network structure can be considerably biased towards denser graphs. Steck and Jaakkola <abbrgrp><abbr bid="B83">83</abbr></abbrgrp> propose a correction for this bias. As another simple way to avoid the bootstrap-bias Steck and Jaakkola <abbrgrp><abbr bid="B70">70</abbr></abbrgrp> use the <it>leave-k-out </it>method. Instead of resampling with replacement, <it>k </it>cases are left out of the dataset when estimating a model. Repeating this many times also gives an estimate of model variability.</p>
					<p>Markov Chain Monte Carlo (MCMC) is a simulation technique which can be also used to sample from the posterior <it>p</it>(<it>G|<b>D</b></it>). Given a network structure, a new neighboring structure is proposed. This new structure is accepted with the Metropolis Hastings acceptance criterion <abbrgrp><abbr bid="B84">84</abbr></abbrgrp>. The iteration of this procedure produces a Markov chain that under fairly general conditions converges in distribution to the true posterior. MCMC is used by Husmeier <abbrgrp><abbr bid="B85">85</abbr></abbrgrp> to learn dynamic Bayesian networks. Madigan <it>et al</it>. <abbrgrp><abbr bid="B74">74</abbr></abbrgrp> use MCMC over Markov equivalence classes and Friedman and Koller <abbrgrp><abbr bid="B76">76</abbr></abbrgrp> over orders of nodes.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Benchmarking</p>
				</st>
				<p>Graphical models visualize a multivariate dependency structure. They can only answer biological questions if they succeed in reliably and accurately reconstructing biologically relevant features of cellular networks. Unfortunately, rigorous assessment and benchmarking of methods are still rare. One of the first evaluation studies is by Smith <it>et al</it>. <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>. They sample data from a songbird's brain model and report excellent recovery success when learning a Bayesian network from it. Zak <it>et al</it>. <abbrgrp><abbr bid="B87">87</abbr></abbrgrp> develop a realistic 10 gene network, where the biological processes at the different levels of transcription, translation and post-translational modifications were modeled with systems of differential equations. They show that linear and log-linear methods fail to recover the network structure. Husmeier <abbrgrp><abbr bid="B85">85</abbr></abbrgrp> uses the same simulation network <abbrgrp><abbr bid="B87">87</abbr></abbrgrp> to specify sensitivity and specificity of dynamic Bayesian networks. He demonstrates how the network inference performance varies with the training set size, the degree of inadequacy of prior assumptions, and the experimental sampling strategy. By analyzing ROC curves Husmeier can show fair performance of DBNs. Wimberly <it>et al</it>. <abbrgrp><abbr bid="B88">88</abbr></abbrgrp> test 10 algorithms, including Boolean and Bayesian networks, on a simulation <abbrgrp><abbr bid="B89">89</abbr></abbrgrp> of the genetic network of the sea urchin embryo <abbrgrp><abbr bid="B90">90</abbr></abbrgrp>. They report that reconstruction is unreliable with all methods and that the performance of the better algorithms quickly degrades as simulations become more realistic. Basso <it>et al</it>. <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> show that their own method, ARACNe, compares favorably against static Bayesian networks on a simulated network with 19 nodes <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> &#8211; but only if the dataset includes several hundreds of observations. On the other hand, Hartemink <abbrgrp><abbr bid="B91">91</abbr></abbrgrp> finds <it>dynamic </it>Bayesian networks to be even more accurate than ARACNe on datasets of the same size. In a recent comparative evaluation, Werhli <it>et al</it>. <abbrgrp><abbr bid="B92">92</abbr></abbrgrp> find no significant differences between coexpression networks, Gaussian graphical models and Bayesian networks, if they are used on nonlinear simulated data and real data. They conclude that the higher computational cost of inferring Bayesian networks over GGMs and coexpression networks is often not justified.</p>
				<p>All in all the results show severe limitations of graphical models for microarray data: They need a large sample size and capture only parts of biologically relevant networks. One reason for this shortcoming is that the models we discussed so far all use purely observational data, where the cellular network was not perturbed experimentally. In simulations <abbrgrp><abbr bid="B92">92</abbr><abbr bid="B93">93</abbr><abbr bid="B94">94</abbr></abbrgrp> and on real data <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B92">92</abbr></abbrgrp> it was shown that data from perturbation experiments greatly improve performance in network reconstruction. In Bayesian networks this improvement is especially pronounced <abbrgrp><abbr bid="B92">92</abbr></abbrgrp>. Accordingly, the following section introduces methodology for learning from effects of interventions in a probabilistic framework.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Learning from experimental interventions</p>
			</st>
			<p>Physicist Richard Feynman once said: "What I cannot create, I do not understand". This quote stresses the importance of action for understanding. A complex system is not understood solely by passive contemplation, it needs active manipulation by the researcher. In biology this fact is long known. Functional genomics has a long tradition of inferring the inner working of a cell by breaking it. "What I cannot break, I do not understand" is the credo of functional genomics research.</p>
			<sec>
				<st>
					<p>Linking causes with effects</p>
				</st>
				<p>Rung <it>et al</it>. <abbrgrp><abbr bid="B95">95</abbr></abbrgrp> build a directed disruption graph by drawing an edge (<it>i,j</it>) if perturbing gene <it>i </it>results in a significant expression change at gene <it>j</it>. The authors focus on features of the disruption network that are robust over a range of significance cutoffs. Disruption networks do not distinguish between direct and indirect effects (and are in this sence similar to co-expression networks). Fig. <figr fid="F6">6</figr> shows the difference between a causal network and a disruption network.</p>
				<fig id="F6">
					<title>
						<p>Figure 6</p>
					</title>
					<caption>
						<p>Causal and discruption network</p>
					</caption>
					<text>
						<p><b>Causal and discruption network</b>. From the causal network (left) it is easy to deduce how effects spread through the pathway (right). The harder problem is to deduce the causal pathway from observing effects of interventions (going from right to left).</p>
					</text>
					<graphic file="1471-2105-8-S6-S5-6"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Distinguishing direct from indirect effects</p>
				</st>
				<p>Disruption networks can be used as a starting point for further analysis. Wagner <abbrgrp><abbr bid="B96">96</abbr><abbr bid="B97">97</abbr><abbr bid="B98">98</abbr></abbrgrp> uses graph-theoretic methods of <it>transitive reduction </it><abbrgrp><abbr bid="B99">99</abbr><abbr bid="B100">100</abbr></abbrgrp> to find parsimonious subgraphs explaining all the effects in the disruption network. These methods are deterministic and do not take measurement noise into account. Wang and Cooper <abbrgrp><abbr bid="B101">101</abbr></abbrgrp> describe a Bayesian generalization of the Wagner algorithm <abbrgrp><abbr bid="B98">98</abbr></abbrgrp> yielding a distribution over possible causal relationships between genes.</p>
			</sec>
			<sec>
				<st>
					<p>Boolean networks</p>
				</st>
				<p>A simple deterministic model of regulatory networks are Boolean networks: they are defined by a directed (and possibly cyclic) graph. For each node exists a boolean function relating parent states to the child state. Perturbations allow us to infer the structure and the logic of Boolean networks <abbrgrp><abbr bid="B102">102</abbr><abbr bid="B103">103</abbr><abbr bid="B104">104</abbr></abbrgrp>.</p>
				<p>Unfortunately, deterministic models can not compensate for the noise inherent in biological systems. This is the reason why also for perturbation data probabilistic models are preferred.</p>
				<sec>
					<st>
						<p>Correlation</p>
					</st>
					<p>Rice <it>et al</it>. <abbrgrp><abbr bid="B105">105</abbr></abbrgrp> build correlation graphs using knockout data. They assume that the data contain measurements of the unperturbed cell and several replicates of measurements for every gene knockout. For each gene <it>i</it>, they concatenate the wild-type data with the intervention data of this gene and compute on the joint data the correlation of gene <it>i </it>to all other genes. In the final graph, there is an arrow (<it>i, j</it>) if gene <it>j </it>was highly correlated to gene <it>i</it>. Since the correlation was computed on knockout data, the graph encodes causation and not only correlation. The big disadvantage of this method is the need for many (&gt; 10) replicates of knockout experiments for <it>every </it>gene in the model, which is unrealistic for most real-world applications. Several regression methods make more efficient use of data.</p>
				</sec>
				<sec>
					<st>
						<p>Regression</p>
					</st>
					<p>Rogers and Girolami <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> use sparse Bayesian regression based on a Gaussian linear model to estimate a dependency network from knock-out data. They regress each gene onto all other genes by combining all the data corresponding to knockouts of genes other than the particular gene of interest. The measurements of the knockout gene are ignored when predicting this gene's expression from the other genes. We will see below that this strategy is the same as Pearl's <it>ideal interventions </it>used in Bayesian networks <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. A prior on model parameters constrains most regression coefficients to zero and enforces a sparse solution. Non-zero regression coefficients are indicated by arrows in the regulation network. Other regression methods for network reconstruction are derived from a branch of engineering called <it>system identification </it><abbrgrp><abbr bid="B106">106</abbr></abbrgrp>. Functional relations between network components are inferred from measurements of system dynamics. Several papers <abbrgrp><abbr bid="B107">107</abbr><abbr bid="B108">108</abbr><abbr bid="B109">109</abbr><abbr bid="B110">110</abbr></abbrgrp> use multiple regression to model the response of genes and proteins to external perturbations.</p>
				</sec>
			</sec>
			<sec>
				<st>
					<p>Ideal interventions</p>
				</st>
				<p>Pearl <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> proposes an idealized model of interventions in Bayesian networks. He assumes that an external manipulation completely controls the target node <it>v</it>. The influence of parent nodes <it>pa</it>(<it>v</it>) is cut and the LPD of <it>X</it><sub><it>v </it></sub>degenerates to a point mass at the target state <it>x'</it><sub><it>v</it></sub>, that is,</p>
				<p>
					<display-formula>
						<m:math name="1471-2105-8-S6-S5-i14" xmlns:m="http://www.w3.org/1998/Math/MathML">
							<m:semantics>
								<m:mrow>
									<m:mi>p</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:msub>
										<m:mi>x</m:mi>
										<m:mi>v</m:mi>
									</m:msub>
									<m:mo>|</m:mo>
									<m:msub>
										<m:mi>x</m:mi>
										<m:mrow>
											<m:mi>p</m:mi>
											<m:mi>a</m:mi>
											<m:mo stretchy="false">(</m:mo>
											<m:mi>v</m:mi>
											<m:mo stretchy="false">)</m:mo>
											<m:mo>,</m:mo>
										</m:mrow>
									</m:msub>
									<m:msub>
										<m:mi>&#952;</m:mi>
										<m:mi>v</m:mi>
									</m:msub>
									<m:mo stretchy="false">)</m:mo>
									<m:mover>
										<m:mo>&#8594;</m:mo>
										<m:mrow>
											<m:mtext>intervention</m:mtext>
										</m:mrow>
									</m:mover>
									<m:mi>p</m:mi>
									<m:mo stretchy="false">(</m:mo>
									<m:msub>
										<m:mi>x</m:mi>
										<m:mi>v</m:mi>
									</m:msub>
									<m:mo stretchy="false">)</m:mo>
									<m:mo>=</m:mo>
									<m:mrow>
										<m:mo>{</m:mo>
										<m:mrow>
											<m:mtable columnalign="left">
												<m:mtr columnalign="left">
													<m:mtd columnalign="left">
														<m:mn>1</m:mn>
													</m:mtd>
													<m:mtd columnalign="left">
														<m:mrow>
															<m:mtext>if&#160;</m:mtext>
															<m:msub>
																<m:mi>x</m:mi>
																<m:mi>v</m:mi>
															</m:msub>
															<m:mo>=</m:mo>
															<m:msub>
																<m:msup>
																	<m:mi>x</m:mi>
																	<m:mo>&#8242;</m:mo>
																</m:msup>
																<m:mi>v</m:mi>
															</m:msub>
														</m:mrow>
													</m:mtd>
												</m:mtr>
												<m:mtr columnalign="left">
													<m:mtd columnalign="left">
														<m:mn>0</m:mn>
													</m:mtd>
													<m:mtd columnalign="left">
														<m:mrow>
															<m:mtext>else</m:mtext>
															<m:mo>.</m:mo>
														</m:mrow>
													</m:mtd>
												</m:mtr>
											</m:mtable>
										</m:mrow>
									</m:mrow>
								</m:mrow>
								<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGOaakcqWG4baEdaWgaaWcbaGaemODayhabeaakiabcYha8Hqabiab=Hha4naaBaaaleaacqWGWbaCcqWGHbqycqGGOaakcqWG2bGDcqGGPaqkcqGGSaalaeqaaGGacOGae4hUde3aaSbaaSqaaiabdAha2bqabaGccqGGPaqkdaGdKaWcbaGaeeyAaKMaeeOBa4MaeeiDaqNaeeyzauMaeeOCaiNaeeODayNaeeyzauMaeeOBa4MaeeiDaqNaeeyAaKMaee4Ba8MaeeOBa4gabeGccaGLsgcacqWGWbaCcqGGOaakcqWG4baEdaWgaaWcbaGaemODayhabeaakiabcMcaPiabg2da9maaceaabaqbaeaabiGaaaqaaiabigdaXaqaaiabbMgaPjabbAgaMjabbccaGiabdIha4naaBaaaleaacqWG2bGDaeqaaOGaeyypa0JafmiEaGNbauaadaWgaaWcbaGaemODayhabeaaaOqaaiabicdaWaqaaiabbwgaLjabbYgaSjabbohaZjabbwgaLjabc6caUaaaaiaawUhaaaaa@6DE3@</m:annotation>
							</m:semantics>
						</m:math>
					</display-formula>
				</p>
				<p>Ideal interventions are easily introduced into the computation of marginal likelihood. The key observation is that fixing a variable to a state tells us nothing about its "natural" behaviour. Cooper and Yoo <abbrgrp><abbr bid="B111">111</abbr></abbrgrp> show that only cases in which a node was not fixed by an external manipulation enter into this node's contribution to the marginal likelihood. Ideal interventions were applied in Bayesian networks <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B68">68</abbr><abbr bid="B94">94</abbr></abbrgrp>, factor graphs <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> and dependency networks <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. In simulations <abbrgrp><abbr bid="B92">92</abbr><abbr bid="B93">93</abbr><abbr bid="B94">94</abbr></abbrgrp> and on real data <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B92">92</abbr></abbrgrp> it was found that interventions are critical for effective inference, particularly to establish directionality of the connections.</p>
			</sec>
			<sec>
				<st>
					<p>Soft interventions</p>
				</st>
				<p>Pearl's model of ideal interventions contains a number of idealizations. The most important of these are that manipulations only affect single genes and that results can be controlled deterministically. The first assumption may not be true if there are compensatory effects involving other genes. The second assumption is also very limiting in realistic biological scenarios. Often the experimentalist lacks knowledge about the exact size of perturbation effects. To cope with this uncertainty, Markowetz <it>et al</it>. <abbrgrp><abbr bid="B112">112</abbr></abbrgrp> introduced soft interventions as a generalization of ideal interventions. Variables are "pushed" in the direction of target states without fixing them. This idea is formalized in a Bayesian framework based on Conditional Gaussian networks.</p>
			</sec>
			<sec>
				<st>
					<p>Physical network models</p>
				</st>
				<p>Yeang <it>et al</it>. <abbrgrp><abbr bid="B113">113</abbr></abbrgrp> find the most likely annotated molecular interaction graph given a variety of data sources including gene-knockouts. Knock-out data is <it>functional </it>in nature and provides only indirect evidence about network structure. Yeang <it>et al</it>. associate each observed knock-out effect in the deletion mutant data with molecular cascades that could in principle explain the effect.</p>
			</sec>
			<sec>
				<st>
					<p>Genetic interactions</p>
				</st>
				<p>Genetic interactions are defined by comparing the phenotypes of two single gene perturbations with the phenotype of the double gene perturbation. One example for a genetic interaction is epistasis <abbrgrp><abbr bid="B114">114</abbr></abbrgrp>; it is defined as one gene masking the effect of another gene. Driessche <it>et al</it>. <abbrgrp><abbr bid="B115">115</abbr></abbrgrp> use expression profiles as phenotypes and partly reconstruct a developmental pathway in <it>D. discoideum</it>. Another example of a genetic interaction is synthetic lethality, where two genes with a viable phenotype show a lethal phenotype in a double perturbation <abbrgrp><abbr bid="B116">116</abbr></abbrgrp>. Wong <it>et al</it>. <abbrgrp><abbr bid="B117">117</abbr></abbrgrp> propose a classification method to predict synthetic lethal interactions. Epistasis and synthetic lethality are just two examples of a broad range of possible genetic interactions. Drees <it>et al</it>. <abbrgrp><abbr bid="B118">118</abbr></abbrgrp> define nine modes of genetic interactions for quantitative phenotypes that can be described by inequality constraints between the phenotypic values. They show that all modes of genetic interactions can be identified in agar-invasion phenotypes of mutant yeast.</p>
			</sec>
			<sec>
				<st>
					<p>Nested effects models</p>
				</st>
				<p>A key obstacle to inferring genetic networks from perturbation screens is that phenotypic profiles generally offer only indirect information on how genes interact. Cell morphology or sensitivity to stresses are global features of the cell, which are hard to relate directly to the genes contributing to them. Gene expression phenotypes also offer only an indirect view of pathway structure due to the high number of non-transcriptional regulatory events like protein modifications. For example, when silencing a kinase we might not be able to observe changes in the activation states of other proteins involved in the pathway; the only information we may get is that genes downstream of the pathway show expression changes. Thus, phenotypic profiles may provide only indirect information about information flow and pathway structure.</p>
				<p>A recent approach <abbrgrp><abbr bid="B119">119</abbr></abbrgrp> especially designed to learning from indirect information and high-dimensional phenotypes are <it>Nested Effects Models </it>that reconstruct features of the internal organization of the cell from the nested structure of observed perturbation effects. Perturbing some genes may have an influence on a global process, while perturbing others affects sub-processes of it. Imagine, for example, a signaling pathway activating several transcription factors. Blocking the entire pathway will affect all targets of the transcription factors, while perturbing a single downstream transcription factor will only affect its direct targets, which are a subset of the phenotype obtained by blocking the complete pathway. Figure <figr fid="F7">7</figr> shows a schematic plot of how the position of perturbed genes in a pathway corresponds to a nested structure of observed effects. Markowetz <it>et al</it>. <abbrgrp><abbr bid="B119">119</abbr></abbrgrp> demonstrate the power of Nested Effects Models in the controlled setting of simulation studies and explain its practical use in the context of an RNAi data set investigating the response to microbial challenge in <it>Drosophila melanogaster</it>.</p>
				<fig id="F7">
					<title>
						<p>Figure 7</p>
					</title>
					<caption>
						<p>Nested effects models</p>
					</caption>
					<text>
						<p><b>Nested effects models</b>. Markowetz <it>et al</it>. [119] introduce a probabilistic model to infer a pathway structure (left) from the observed downstream effects of interventions (right). The model predicts that genes high up in the pathway (like <b><it>A</it></b>) will have a broader set of effects than genes more downstream (like <b><it>B </it></b>to <b><it>E</it></b>). The branching in the pathway below <b><it>B </it></b>corresponds to (partly) disjoint effect subsets for <b><it>C </it></b>and <b><it>D</it></b>. The intersection of effect sets for <b><it>C </it></b>and <b><it>D </it></b>can be attributed to <b><it>E </it></b>and is explained by the collider at <b><it>E </it></b>in the pathway.</p>
					</text>
					<graphic file="1471-2105-8-S6-S5-7"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Experimental design</p>
				</st>
				<p>Statistical analysis of high-throughput data sets aims at generating <it>hypotheses </it>about the functions and regulatory roles of genes and proteins. Small-scale traditional experimental techniques are needed to verify the statistical predictions and inferred pathways. Experimental design or <it>active learning </it>deals with deciding which interventions to perform to discriminate optimally between alternative models. For reconstruction of regulatory networks, a number of methods have been proposed in different frameworks: for Bayesian networks <abbrgrp><abbr bid="B120">120</abbr><abbr bid="B121">121</abbr></abbrgrp>, physical network models <abbrgrp><abbr bid="B122">122</abbr></abbrgrp>, Boolean networks <abbrgrp><abbr bid="B102">102</abbr></abbrgrp>, and dynamical modeling <abbrgrp><abbr bid="B123">123</abbr></abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Summary and Outlook</p>
			</st>
			<p>Fig. <figr fid="F8">8</figr> organizes the network reconstruction methods we discussed in this review with respect to a few basic questions: Does the data include gene knockout or knockdown experiments? If not, we call it <it>purely observational data</it>; if yes, we call it <it>interventional data</it>. Is the model probabilistic or deterministic? Does the model allow for changes over time? If yes, we call it <it>dynamic</it>, otherwise <it>static</it>. Does the model describe transcriptional regulatory networks? And if so, are additional non-transcriptional effects taken into account? In the leaf nodes of the decision tree modeling techniques fall together that are methodologically similar. Fig. <figr fid="F8">8</figr> shows representative examples. Some branches in the tree are missing, where we found no research on a given approach.</p>
			<fig id="F8">
				<title>
					<p>Figure 8</p>
				</title>
				<caption>
					<p>A guide to the literature on network reconstruction</p>
				</caption>
				<text>
					<p><b>A guide to the literature on network reconstruction</b>. Methodological similar approaches are clustered together by a decision tree built on five basic questions: Do the data contain knock-out or knock-down samples? Is the model deterministic or probabilistic? Does the model account for changes over time or is it static? Does the model focus on transcriptional gene regulation networks? And if so, does the model take additional non-transcriptional effects into account? In the leaf nodes of the tree, a few representative references are shown.</p>
				</text>
				<graphic file="1471-2105-8-S6-S5-8"/>
			</fig>
			<sec>
				<st>
					<p>The need for a holistic view</p>
				</st>
				<p>The internal organization of the cell comprises many layers. The <it>genome </it>refers to the collection of information stored in the DNA, while the <it>transcriptome </it>includes all gene transcripts. On the next level the <it>proteome </it>covers the set of all proteins. The <it>metabolome </it>contains small molecules &#8211; sugars, salts, amino acids, and nucleotides &#8211; that participate in metabolic reactions required for the maintenance and normal function of a cell. Results of internal reactions are features of the cell like growth or viability, which can be taken as <it>phenotypes </it>to study gene function. To understand the complexity of living cells future research will need to build models including all these layers. Statistical inference on parts of the system will not provide the mechanistic insights functional genomics is seeking for. Recent research concentrates on combining information from genome, transcriptome and proteome, <it>e.g</it>. for reconstructing signaling pathways <abbrgrp><abbr bid="B113">113</abbr><abbr bid="B124">124</abbr></abbrgrp> and networks of functionally related proteins <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B125">125</abbr></abbrgrp>. This is a necessary step in the right direction. However, these models will still be fragmentary if they do not include and predict phenotypical changes of interventions perturbing the normal course of action in the cell. We will only understand what we can break.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Further reading</p>
			</st>
			<p>There are a number of other reviews and method comparisons available: some focus on applications and biological interpretations <abbrgrp><abbr bid="B126">126</abbr><abbr bid="B127">127</abbr><abbr bid="B128">128</abbr></abbrgrp>, and some concentrate on methodology <abbrgrp><abbr bid="B129">129</abbr><abbr bid="B130">130</abbr><abbr bid="B131">131</abbr><abbr bid="B132">132</abbr><abbr bid="B133">133</abbr><abbr bid="B134">134</abbr><abbr bid="B135">135</abbr><abbr bid="B136">136</abbr><abbr bid="B137">137</abbr></abbrgrp>. Even more sources can be found in a regularly updated bibliography <abbrgrp><abbr bid="B138">138</abbr></abbrgrp>.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>This paper grew out of lectures given at the Institute for Theoretical Physics and Mathematics (IPM) in Tehran, Iran, in April 2005 and a tutorial given at the German Conference on Bioinformatics (GCB) in Hamburg, October 2005. F.M. thanks the organizers of both conferences for inviting him and giving him the opportunity to present. The authors thank Stefanie Scheid, Dennis Kostka and Patrick Bradley for their thorough readings of drafts of the manuscript.</p>
				<p>This article has been published as part of <it>BMC Bioinformatics</it> Volume 8 Supplement 6, 2007: Otto Warburg International Summer School and Workshop on Networks and Regulation. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1471-2105/8?issue=S6</url></p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Protein network inference from multiple genomic data: a supervised approach</p>
				</title>
				<aug>
					<au>
						<snm>Yamanishi</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Vert</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Kanehisa</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>Suppl 1</issue>
				<fpage>i363</fpage>
				<lpage>370</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth910</pubid>
						<pubid idtype="pmpid" link="fulltext">15262821</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Selective integration of multiple biological data for supervised network inference</p>
				</title>
				<aug>
					<au>
						<snm>Kato</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Tsuda</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Asai</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>10</issue>
				<fpage>2488</fpage>
				<lpage>95</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti339</pubid>
						<pubid idtype="pmpid" link="fulltext">15728114</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Causal protein-signaling networks derived from multiparameter single-cell data</p>
				</title>
				<aug>
					<au>
						<snm>Sachs</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Perez</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Pe'er</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Lauffenburger</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Nolan</snm>
						<fnm>GP</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2005</pubdate>
				<volume>308</volume>
				<issue>5721</issue>
				<fpage>523</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1105809</pubid>
						<pubid idtype="pmpid" link="fulltext">15845847</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Cluster analysis and display of genome-wide expression patterns</p>
				</title>
				<aug>
					<au>
						<snm>Eisen</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Spellman</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1998</pubdate>
				<volume>95</volume>
				<issue>25</issue>
				<fpage>14863</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">24541</pubid>
						<pubid idtype="pmpid" link="fulltext">9843981</pubid>
						<pubid idtype="doi">10.1073/pnas.95.25.14863</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization</p>
				</title>
				<aug>
					<au>
						<snm>Spellman</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Sherlock</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Iyer</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Anders</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Eisen</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Futcher</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Mol Biol Cell</source>
				<pubdate>1998</pubdate>
				<volume>9</volume>
				<issue>12</issue>
				<fpage>3273</fpage>
				<lpage>97</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">25624</pubid>
						<pubid idtype="pmpid" link="fulltext">9843569</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks</p>
				</title>
				<aug>
					<au>
						<snm>Wolfe</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Kohane</snm>
						<fnm>IS</fnm>
					</au>
					<au>
						<snm>Butte</snm>
						<fnm>AJ</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<fpage>227</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1186/1471-2105-6-227</pubid>
						<pubid idtype="pmpid" link="fulltext">16162296</pubid>
						<pubid idtype="pmcid">1239911</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
				</title>
				<aug>
					<au>
						<snm>Ashburner</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Ball</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Blake</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Butler</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Cherry</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Davis</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Dolinski</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Dwight</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Eppig</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Harris</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hill</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Issel-Tarver</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Kasarskis</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Lewis</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Matese</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Richardson</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ringwald</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Sherlock</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2000</pubdate>
				<volume>25</volume>
				<fpage>25</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/75556</pubid>
						<pubid idtype="pmpid" link="fulltext">10802651</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Discovery of biological networks from diverse functional genomic data</p>
				</title>
				<aug>
					<au>
						<snm>Myers</snm>
						<fnm>CL</fnm>
					</au>
					<au>
						<snm>Robson</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Wible</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hibbs</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Chiriac</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Theesfeld</snm>
						<fnm>CL</fnm>
					</au>
					<au>
						<snm>Dolinski</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Troyanskaya</snm>
						<fnm>OG</fnm>
					</au>
				</aug>
				<source>Genome Biology</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<issue>R114</issue>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1414113</pubid>
						<pubid idtype="pmpid" link="fulltext">16420673</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae)</p>
				</title>
				<aug>
					<au>
						<snm>Troyanskaya</snm>
						<fnm>OG</fnm>
					</au>
					<au>
						<snm>Dolinski</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Owen</snm>
						<fnm>AB</fnm>
					</au>
					<au>
						<snm>Altman</snm>
						<fnm>RB</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<issue>14</issue>
				<fpage>8348</fpage>
				<lpage>8353</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">166232</pubid>
						<pubid idtype="pmpid" link="fulltext">12826619</pubid>
						<pubid idtype="doi">10.1073/pnas.0832373100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>A gene-coexpression network for global discovery of conserved genetic modules</p>
				</title>
				<aug>
					<au>
						<snm>Stuart</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Segal</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Koller</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>SK</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>302</volume>
				<issue>5643</issue>
				<fpage>249</fpage>
				<lpage>55</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1087447</pubid>
						<pubid idtype="pmpid" link="fulltext">12934013</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Probabilities of spurious connections in gene networks: application to expression time series</p>
				</title>
				<aug>
					<au>
						<snm>Bickel</snm>
						<fnm>DR</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>7</issue>
				<fpage>1121</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti140</pubid>
						<pubid idtype="pmpid" link="fulltext">15546939</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Finding disease specific alterations in the co-expression of genes</p>
				</title>
				<aug>
					<au>
						<snm>Kostka</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Spang</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>Suppl 1</issue>
				<fpage>I194</fpage>
				<lpage>I199</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth909</pubid>
						<pubid idtype="pmpid" link="fulltext">15262799</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements</p>
				</title>
				<aug>
					<au>
						<snm>Butte</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Kohane</snm>
						<fnm>I</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2000</pubdate>
				<fpage>418</fpage>
				<lpage>29</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10902190</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<aug>
					<au>
						<snm>Sch&#246;lkopf</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Smola</snm>
						<fnm>AJ</fnm>
					</au>
				</aug>
				<source>Learning with kernels</source>
				<publisher>Cambridge, MA: The MIT Press</publisher>
				<pubdate>2002</pubdate>
			</bibl>
			<bibl id="B15">
				<aug>
					<au>
						<snm>Lauritzen</snm>
						<fnm>SL</fnm>
					</au>
				</aug>
				<source>Graphical Models</source>
				<publisher>Oxford: Clarendon Press</publisher>
				<pubdate>1996</pubdate>
			</bibl>
			<bibl id="B16">
				<aug>
					<au>
						<snm>Edwards</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Introduction to Graphical Modelling</source>
				<publisher>Springer</publisher>
				<pubdate>2000</pubdate>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Edge exclusion tests for graphical Gaussian models</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>PWF</fnm>
					</au>
					<au>
						<snm>Whittaker</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Learning in Graphical Models</source>
				<publisher>MIT Press</publisher>
				<editor>Jordan M</editor>
				<pubdate>1999</pubdate>
				<fpage>555</fpage>
				<lpage>574</lpage>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Model Selection for Gaussian Concentration Graphs</p>
				</title>
				<aug>
					<au>
						<snm>Drton</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Perlman</snm>
						<fnm>MD</fnm>
					</au>
				</aug>
				<source>Biometrika</source>
				<pubdate>2004</pubdate>
				<volume>91</volume>
				<issue>3</issue>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Sparse graphical models for exploring gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Dobra</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hans</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Nevins</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Yao</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>West</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Journal of Multivariate Analysis</source>
				<pubdate>2004</pubdate>
				<volume>90</volume>
				<fpage>196</fpage>
				<lpage>212</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1016/j.jmva.2004.02.009</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>High dimensional graphs and variable selection with the Lasso</p>
				</title>
				<aug>
					<au>
						<snm>Meinshausen</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>B&#252;hlmann</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Annals of Statistics</source>
				<pubdate>2005</pubdate>
				<volume>34</volume>
				<issue>3</issue>
				<fpage>1436</fpage>
				<lpage>1462</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1214/009053606000000281</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>An empirical Bayes approach to inferring large-scale gene association networks</p>
				</title>
				<aug>
					<au>
						<snm>Sch&#228;fer</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Strimmer</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>6</issue>
				<fpage>754</fpage>
				<lpage>64</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti062</pubid>
						<pubid idtype="pmpid" link="fulltext">15479708</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Correspondence Analysis of Genes and Tissue Types and Finding Genetic Links from Microarray Data</p>
				</title>
				<aug>
					<au>
						<snm>Kishino</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Waddell</snm>
						<fnm>PJ</fnm>
					</au>
				</aug>
				<source>Genome Informatics</source>
				<publisher>Tokyo: Universal Academy Press</publisher>
				<editor>Dunker A, Konagaya A, Miyano S, TTakagi</editor>
				<pubdate>2000</pubdate>
			</bibl>
			<bibl id="B23">
				<title>
					<p>A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics</p>
				</title>
				<aug>
					<au>
						<snm>Sch&#228;fer</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Strimmer</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>Stat Appl Genet Mol Biol</source>
				<pubdate>2005</pubdate>
				<volume>4</volume>
				<fpage>Article32</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.2202/1544-6115.1175</pubid>
						<pubid idtype="pmpid">16646851</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Gradient Directed Regularization for Sparse Gaussian Concentration Graphs, with Applications to Inference of Genetic Networks</p>
				</title>
				<aug>
					<au>
						<snm>Li</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Gui</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Biostatistics</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<issue>2</issue>
				<fpage>302</fpage>
				<lpage>317</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/biostatistics/kxj008</pubid>
						<pubid idtype="pmpid" link="fulltext">16326758</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Dependency Networks for Inference, Collaborative Filtering, and Data Visualization</p>
				</title>
				<aug>
					<au>
						<snm>Heckerman</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Chickering</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Meek</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Rounthwaite</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Kadie</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Journal of Machine Learning Research</source>
				<pubdate>2000</pubdate>
				<volume>1</volume>
				<issue>Oct</issue>
				<fpage>49</fpage>
				<lpage>75</lpage>
			</bibl>
			<bibl id="B26">
				<title>
					<p>The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo</p>
				</title>
				<aug>
					<au>
						<snm>Bonneau</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Reiss</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Shannon</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Facciotti</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hood</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Baliga</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Thorsson</snm>
						<fnm>V</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<issue>5</issue>
				<fpage>R36</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1779511</pubid>
						<pubid idtype="pmpid" link="fulltext">16686963</pubid>
						<pubid idtype="doi">10.1186/gb-2006-7-5-r36</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Towards reconstruction of gene networks from expression data by supervised learning</p>
				</title>
				<aug>
					<au>
						<snm>Soinov</snm>
						<fnm>LA</fnm>
					</au>
					<au>
						<snm>Krestyaninova</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Brazma</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Genome Biology</source>
				<pubdate>2003</pubdate>
				<volume>4</volume>
				<fpage>R6</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">151290</pubid>
						<pubid idtype="pmpid" link="fulltext">12540298</pubid>
						<pubid idtype="doi">10.1186/gb-2003-4-1-r6</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>A Bayesian regression approach to the inference of regulatory networks from gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Rogers</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Girolami</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>14</issue>
				<fpage>3131</fpage>
				<lpage>7</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti487</pubid>
						<pubid idtype="pmpid" link="fulltext">15879452</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Low-Order Conditional Independence Graphs for Inferring Genetic Networks</p>
				</title>
				<aug>
					<au>
						<snm>Wille</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>B&#252;hlmann</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Stat Appl Genet Mol Biol</source>
				<pubdate>2006</pubdate>
				<volume>5</volume>
				<fpage>Article1</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.2202/1544-6115.1170</pubid>
						<pubid idtype="pmpid">16646863</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana</p>
				</title>
				<aug>
					<au>
						<snm>Wille</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Zimmermann</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Vranov&#225;</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>F&#252;rholz</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Laule</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Bleuler</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Hennig</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Prelic</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>von Rohr</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Thiele</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Zitzler</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Gruissem</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>B&#252;hlmann</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<issue>11</issue>
				<fpage>R92</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">545783</pubid>
						<pubid idtype="pmpid" link="fulltext">15535868</pubid>
						<pubid idtype="doi">10.1186/gb-2004-5-11-r92</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Estimating genomic coexpression networks using first-order conditional independence</p>
				</title>
				<aug>
					<au>
						<snm>Magwene</snm>
						<fnm>PM</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<issue>12</issue>
				<fpage>R100</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">545795</pubid>
						<pubid idtype="pmpid" link="fulltext">15575966</pubid>
						<pubid idtype="doi">10.1186/gb-2004-5-12-r100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Discovery of meaningful associations in genomic data using partial correlation coefficients</p>
				</title>
				<aug>
					<au>
						<snm>de la Fuente</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Bing</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Hoeschele</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Mendes</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>18</issue>
				<fpage>3565</fpage>
				<lpage>3574</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth445</pubid>
						<pubid idtype="pmpid" link="fulltext">15284096</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context</p>
				</title>
				<aug>
					<au>
						<snm>Margolin</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Nemenman</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Basso</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Wiggins</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Stolovitzky</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Favera</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Califano</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<issue>Suppl 1</issue>
				<fpage>S7</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1810318</pubid>
						<pubid idtype="pmpid" link="fulltext">16723010</pubid>
						<pubid idtype="doi">10.1186/1471-2105-7-S1-S7</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Reverse engineering of regulatory networks in human B cells</p>
				</title>
				<aug>
					<au>
						<snm>Basso</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Margolin</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Stolovitzky</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Klein</snm>
						<fnm>U</fnm>
					</au>
					<au>
						<snm>Dalla-Favera</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Califano</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2005</pubdate>
				<volume>37</volume>
				<issue>4</issue>
				<fpage>382</fpage>
				<lpage>90</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1532</pubid>
						<pubid idtype="pmpid" link="fulltext">15778709</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<aug>
					<au>
						<snm>Pearl</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Probabilistic Reasoning in Intelligent Systems: networks of plausible inference</source>
				<publisher>Morgan Kaufmann</publisher>
				<pubdate>1988</pubdate>
			</bibl>
			<bibl id="B36">
				<aug>
					<au>
						<snm>Pearl</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Causality: Models, Reasoning and Inference</source>
				<publisher>Cambridge: Cambridge University Press</publisher>
				<pubdate>2000</pubdate>
			</bibl>
			<bibl id="B37">
				<aug>
					<au>
						<snm>Spirtes</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Glymour</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Scheines</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Causation, Prediction, and Search</source>
				<publisher>Cambridge MA: MIT Press</publisher>
				<edition>2</edition>
				<pubdate>2000</pubdate>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Learning Module Networks</p>
				</title>
				<aug>
					<au>
						<snm>Segal</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Pe'er</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Regev</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Koller</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Journal of Machine Learning Research</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<issue>Apr</issue>
				<fpage>557</fpage>
				<lpage>588</lpage>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data</p>
				</title>
				<aug>
					<au>
						<snm>Segal</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Shapira</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Regev</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Pe'er</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Botstein</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Koller</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Nature Genetics</source>
				<pubdate>2003</pubdate>
				<volume>34</volume>
				<issue>2</issue>
				<fpage>166</fpage>
				<lpage>176</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12740579</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Learning Bayesian Networks with Local Structure</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Goldszmidt</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Learning in Graphical Models</source>
				<publisher>Cambridge, MA: MIT Press</publisher>
				<editor>Jordan MI</editor>
				<pubdate>1998</pubdate>
				<fpage>421</fpage>
				<lpage>459</lpage>
			</bibl>
			<bibl id="B41">
				<title>
					<p>A Bayesian approach to learning Bayesian networks with local structure</p>
				</title>
				<aug>
					<au>
						<snm>Chickering</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Heckerman</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Meek</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Proceedings of Thirteenth Conference on Uncertainty in Artificial Intelligence</source>
				<publisher>Providence, RI: Morgan Kaufmann</publisher>
				<pubdate>1997</pubdate>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Estimation of genetic networks and functional structures between genes by using Bayesian network and nonparametric regression</p>
				</title>
				<aug>
					<au>
						<snm>Imoto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Goto</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Miyano</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2002</pubdate>
				<volume/>
				<fpage>175</fpage>
				<lpage>186</lpage>
				<xrefbib>
					<pubid idtype="pmpid">11928473</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Combining microarrays and biological knowledge for estimating gene networks via Bayesian networks</p>
				</title>
				<aug>
					<au>
						<snm>Imoto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Higuchi</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Goto</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Tashiro</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Kuhara</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Miyano</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Proc 2nd Computational Systems Bioinformatics</source>
				<pubdate>2003</pubdate>
				<fpage>104</fpage>
				<lpage>113</lpage>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Bayesian network and nonparametric heteroscedastic regression for nonlinear modeling of genetic network</p>
				</title>
				<aug>
					<au>
						<snm>Imoto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Goto</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Miyano</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Aburatani</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Tashiro</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Kuhara</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>J Bioinform Comput Biol</source>
				<pubdate>2003</pubdate>
				<volume>1</volume>
				<issue>2</issue>
				<fpage>231</fpage>
				<lpage>52</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1142/S0219720003000071</pubid>
						<pubid idtype="pmpid" link="fulltext">15290771</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Estimating gene networks from gene expression data by combining Bayesian network model with promoter element detection</p>
				</title>
				<aug>
					<au>
						<snm>Tamada</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Bannai</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Imoto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Tashiro</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Kuhara</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Miyano</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<issue>Suppl 2</issue>
				<fpage>II227</fpage>
				<lpage>II236</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btg1082</pubid>
						<pubid idtype="pmpid" link="fulltext">14534194</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B46">
				<title>
					<p>Inferring genetic regulatory logic from expression data</p>
				</title>
				<aug>
					<au>
						<snm>Bulashevska</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Eils</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>11</issue>
				<fpage>2706</fpage>
				<lpage>13</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti388</pubid>
						<pubid idtype="pmpid" link="fulltext">15784747</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B47">
				<title>
					<p>Inferring quantitative models of regulatory networks from expression data</p>
				</title>
				<aug>
					<au>
						<snm>Nachman</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Regev</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>suppl 1</issue>
				<fpage>i248</fpage>
				<lpage>256</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth941</pubid>
						<pubid idtype="pmpid" link="fulltext">15262806</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>Equivalence and Synthesis of Causal Models</p>
				</title>
				<aug>
					<au>
						<snm>Verma</snm>
						<fnm>TS</fnm>
					</au>
					<au>
						<snm>Pearl</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Proc Sixth Conf on Uncertainty in Artificial Intelligence</source>
				<publisher>Amsterdam: North-Holland</publisher>
				<editor>Bonissone PB, Henrion M, Kanal LN, Lemmer JF</editor>
				<pubdate>1990</pubdate>
				<fpage>255</fpage>
				<lpage>268</lpage>
			</bibl>
			<bibl id="B49">
				<aug>
					<au>
						<snm>Alberts</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Johnson</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Lewis</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Raff</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Walter</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Molecular Biology of the Cell</source>
				<publisher>New York: Garland Science</publisher>
				<edition>4</edition>
				<pubdate>2002</pubdate>
			</bibl>
			<bibl id="B50">
				<title>
					<p>Using Bayesian networks to analyze expression data</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Linial</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Nachman</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Pe'er</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Journal of Computational Biology</source>
				<pubdate>2000</pubdate>
				<volume>7</volume>
				<issue>3</issue>
				<fpage>601</fpage>
				<lpage>620</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/106652700750050961</pubid>
						<pubid idtype="pmpid" link="fulltext">11108481</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B51">
				<title>
					<p>A probabilistic methodology for integrating knowledge and experiments on biological networks</p>
				</title>
				<aug>
					<au>
						<snm>Gat-Viks</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Tanay</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Raijman</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Shamir</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>J Comput Biol</source>
				<pubdate>2006</pubdate>
				<volume>13</volume>
				<issue>2</issue>
				<fpage>165</fpage>
				<lpage>181</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/cmb.2006.13.165</pubid>
						<pubid idtype="pmpid" link="fulltext">16597233</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<title>
					<p>Learning the Structure of Dynamic Probabilistic Networks</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Murphy</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Russell</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Proceedings of the 14th Annual Conference on Uncertainty in Artificial Intelligence (UAI-98)</source>
				<publisher>San Francisco, CA: Morgan Kaufmann Publishers</publisher>
				<pubdate>1998</pubdate>
				<fpage>139</fpage>
				<lpage>147</lpage>
			</bibl>
			<bibl id="B53">
				<title>
					<p>Modelling gene expression data using dynamic Bayesian networks</p>
				</title>
				<aug>
					<au>
						<snm>Murphy</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Mian</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<publisher>Tech. rep., Computer Science Division, University of California, Berkeley, CA</publisher>
				<pubdate>1999</pubdate>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Advances to Bayesian network inference for generating causal networks from observational biological data</p>
				</title>
				<aug>
					<au>
						<snm>Yu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>VA</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>PP</fnm>
					</au>
					<au>
						<snm>Hartemink</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Jarvis</snm>
						<fnm>ED</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>18</issue>
				<fpage>3594</fpage>
				<lpage>603</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth448</pubid>
						<pubid idtype="pmpid" link="fulltext">15284094</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>Informative structure priors: joint learning of dynamic regulatory networks from multiple types of data</p>
				</title>
				<aug>
					<au>
						<snm>Bernard</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hartemink</snm>
						<fnm>AJ</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2005</pubdate>
				<fpage>459</fpage>
				<lpage>70</lpage>
				<xrefbib>
					<pubid idtype="pmpid">15759651</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B56">
				<title>
					<p>A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data</p>
				</title>
				<aug>
					<au>
						<snm>Zou</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Conzen</snm>
						<fnm>SD</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<fpage>71</fpage>
				<lpage>79</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth463</pubid>
						<pubid idtype="pmpid" link="fulltext">15308537</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B57">
				<title>
					<p>Gene networks inference using dynamic Bayesian networks</p>
				</title>
				<aug>
					<au>
						<snm>Perrin</snm>
						<fnm>BE</fnm>
					</au>
					<au>
						<snm>Ralaivola</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Mazurie</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Bottani</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Mallet</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>d'Alche Buc</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<issue>Suppl 2</issue>
				<fpage>II138</fpage>
				<lpage>II148</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btg1071</pubid>
						<pubid idtype="pmpid" link="fulltext">14534183</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B58">
				<title>
					<p>A Bayesian approach to reconstructing genetic regulatory networks with hidden factors</p>
				</title>
				<aug>
					<au>
						<snm>Beal</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Falciani</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Ghahramani</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Rangel</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Wild</snm>
						<fnm>DL</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>3</issue>
				<fpage>349</fpage>
				<lpage>356</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti014</pubid>
						<pubid idtype="pmpid" link="fulltext">15353451</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B59">
				<title>
					<p>Modeling T-cell activation using gene expression profiling and state-space models</p>
				</title>
				<aug>
					<au>
						<snm>Rangel</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Angus</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ghahramani</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Lioumi</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Sotheran</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Gaiba</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Wild</snm>
						<fnm>DL</fnm>
					</au>
					<au>
						<snm>Falciani</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>9</issue>
				<fpage>1361</fpage>
				<lpage>1372</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth093</pubid>
						<pubid idtype="pmpid" link="fulltext">14962938</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B60">
				<title>
					<p>Modeling biological responses using gene expression profiling and linear dynamical systems</p>
				</title>
				<aug>
					<au>
						<snm>Rangel</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Wild</snm>
						<fnm>DL</fnm>
					</au>
					<au>
						<snm>Falciani</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Ghahramani</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Gaiba</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Proceedings of the 2nd International Conference on Systems Biology</source>
				<publisher>Madison, WI: Omnipress</publisher>
				<pubdate>2001</pubdate>
				<fpage>248</fpage>
				<lpage>256</lpage>
			</bibl>
			<bibl id="B61">
				<title>
					<p>Modelling regulatory pathways in E. coli from time series expression profiles</p>
				</title>
				<aug>
					<au>
						<snm>Ong</snm>
						<fnm>IM</fnm>
					</au>
					<au>
						<snm>Glasner</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Page</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<issue>Suppl 1</issue>
				<fpage>S241</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12169553</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B62">
				<title>
					<p>Estimating the Dimension of a Model</p>
				</title>
				<aug>
					<au>
						<snm>Schwarz</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Annals of Statistics</source>
				<pubdate>1978</pubdate>
				<volume>6</volume>
				<issue>2</issue>
				<fpage>461</fpage>
				<lpage>464</lpage>
			</bibl>
			<bibl id="B63">
				<aug>
					<au>
						<snm>Gelman</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Carlin</snm>
						<fnm>JB</fnm>
					</au>
					<au>
						<snm>Stern</snm>
						<fnm>HS</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>DB</fnm>
					</au>
				</aug>
				<source>Bayesian Data Analysis</source>
				<publisher>Chapman and Hall-CRC</publisher>
				<pubdate>1996</pubdate>
			</bibl>
			<bibl id="B64">
				<title>
					<p>A Bayesian Method for the Induction of Probabilistic Networks from Data</p>
				</title>
				<aug>
					<au>
						<snm>Cooper</snm>
						<fnm>GF</fnm>
					</au>
					<au>
						<snm>Herskovits</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Machine Learning</source>
				<pubdate>1992</pubdate>
				<volume>9</volume>
				<fpage>309</fpage>
				<lpage>347</lpage>
			</bibl>
			<bibl id="B65">
				<title>
					<p>Learning Bayesian Networks: The Combination of Knowledge and Statistical Data</p>
				</title>
				<aug>
					<au>
						<snm>Heckerman</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Geiger</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Chickering</snm>
						<fnm>DM</fnm>
					</au>
				</aug>
				<source>Machine Learning</source>
				<pubdate>1995</pubdate>
				<volume>20</volume>
				<issue>3</issue>
				<fpage>197</fpage>
				<lpage>243</lpage>
			</bibl>
			<bibl id="B66">
				<title>
					<p>Learning Gaussian Networks</p>
				</title>
				<aug>
					<au>
						<snm>Geiger</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Heckerman</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proceedings of the Tenth Annual Conference on Uncertainty in Artificial Intelligence</source>
				<publisher>Washington, USA: Morgan Kaufmann</publisher>
				<editor>de M&#225;ntaras RL, Poole D, Seattle</editor>
				<pubdate>1994</pubdate>
				<fpage>235</fpage>
				<lpage>243</lpage>
			</bibl>
			<bibl id="B67">
				<title>
					<p>Learning Bayesian Networks with Mixed Variables</p>
				</title>
				<aug>
					<au>
						<snm>B&#248;ttcher</snm>
						<fnm>SG</fnm>
					</au>
				</aug>
				<source>PhD thesis</source>
				<publisher>Aalborg University, Denmark</publisher>
				<pubdate>2004</pubdate>
			</bibl>
			<bibl id="B68">
				<title>
					<p>Inferring subnetworks from perturbed expression profiles</p>
				</title>
				<aug>
					<au>
						<snm>Pe'er</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Regev</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Elidan</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<issue>Suppl 1</issue>
				<fpage>S215</fpage>
				<lpage>S224</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11473012</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B69">
				<title>
					<p>Combining Location and Expression Data for Principled Discovery of Genetic Regulatory Network Models</p>
				</title>
				<aug>
					<au>
						<snm>Hartemink</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Gilford</snm>
						<fnm>DK</fnm>
					</au>
					<au>
						<snm>Jaakkola</snm>
						<fnm>TS</fnm>
					</au>
					<au>
						<snm>Young</snm>
						<fnm>RA</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2002</pubdate>
				<volume/>
				<fpage>437</fpage>
				<lpage>449</lpage>
				<xrefbib>
					<pubid idtype="pmpid">11928497</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B70">
				<title>
					<p>(Semi-)Predictive Discretization during Model Selection</p>
				</title>
				<aug>
					<au>
						<snm>Steck</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Jaakkola</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<publisher>Tech. Rep. AI Memo AIM-2003-002, MIT</publisher>
				<pubdate>2003</pubdate>
			</bibl>
			<bibl id="B71">
				<title>
					<p>Molecular Diagnosis: classification, model selection, and performance evaluation</p>
				</title>
				<aug>
					<au>
						<snm>Markowetz</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Spang</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Methods of Information in Medicine</source>
				<pubdate>2005</pubdate>
				<volume>44</volume>
				<issue>3</issue>
				<fpage>438</fpage>
				<lpage>43</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">16113770</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B72">
				<title>
					<p>On the Dirichlet Prior and Bayesian Regularization</p>
				</title>
				<aug>
					<au>
						<snm>Steck</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Jaakkola</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Advances in Neural Information Processing Systems 15</source>
				<publisher>Cambridge, MA: MIT Press</publisher>
				<pubdate>2002</pubdate>
			</bibl>
			<bibl id="B73">
				<title>
					<p>Counting labeled acyclic digraphs</p>
				</title>
				<aug>
					<au>
						<snm>Robinson</snm>
						<fnm>RW</fnm>
					</au>
				</aug>
				<source>New Directions in the Theory of Graphs</source>
				<publisher>New York: Academic Press</publisher>
				<editor>Harary F</editor>
				<pubdate>1973</pubdate>
				<fpage>239</fpage>
				<lpage>273</lpage>
			</bibl>
			<bibl id="B74">
				<title>
					<p>Bayesian model averaging and model selection for Markov equivalence classes of acyclic graphs</p>
				</title>
				<aug>
					<au>
						<snm>Madigan</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Andersson</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Perlman</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Volinsky</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Communications in Statistics: Theory and Methods</source>
				<pubdate>1996</pubdate>
				<volume>25</volume>
				<fpage>2493</fpage>
				<lpage>2519</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1080/03610929608831853</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B75">
				<title>
					<p>Learning Equivalence Classes of Bayesian Network Structures</p>
				</title>
				<aug>
					<au>
						<snm>Chickering</snm>
						<fnm>DM</fnm>
					</au>
				</aug>
				<source>Proceedings of Twelfth Conference on Uncertainty in Artificial Intelligence, Portland, OR, Morgan Kaufmann</source>
				<pubdate>1996</pubdate>
				<fpage>150</fpage>
				<lpage>157</lpage>
			</bibl>
			<bibl id="B76">
				<title>
					<p>Being Bayesian about Network Structure: A Bayesian Approach to Structure Discovery in Bayesian Networks</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Koller</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Machine Learning</source>
				<pubdate>2003</pubdate>
				<volume>50</volume>
				<fpage>95</fpage>
				<lpage>126</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1023/A:1020249912095</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B77">
				<title>
					<p>Learning Bayesian Network Structures from Massive Datasets: The Sparse Candidate Algorithm</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Nachman</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Peer</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc of Uncertainty in Artificial Intelligence</source>
				<pubdate>1999</pubdate>
			</bibl>
			<bibl id="B78">
				<title>
					<p>"Ideal Parent" structure learning for continuous variable networks</p>
				</title>
				<aug>
					<au>
						<snm>Nachman</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Elidan</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Proceedings of the 20th conference on Uncertainty in artificial intelligence</source>
				<publisher>Arlington, Virginia, United States: AUAI Press</publisher>
				<pubdate>2004</pubdate>
				<fpage>400</fpage>
				<lpage>409</lpage>
			</bibl>
			<bibl id="B79">
				<title>
					<p>Growing Bayesian network models of gene networks from seed genes</p>
				</title>
				<aug>
					<au>
						<snm>Pe&#241;a</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bj&#246;rkegren</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Tegn&#233;r</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>Suppl 2</issue>
				<fpage>ii224</fpage>
				<lpage>ii229</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti1137</pubid>
						<pubid idtype="pmpid" link="fulltext">16204109</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B80">
				<title>
					<p>Learning Belief Networks in the Presence of Missing Values and Hidden Variables</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Proc of the Fourteenth Inter Conf on Machine Learning (ICML97)</source>
				<publisher>San Francisco, CA: Morgan Kaufmann</publisher>
				<editor>Fisher D</editor>
				<pubdate>1997</pubdate>
				<fpage>125</fpage>
				<lpage>133</lpage>
			</bibl>
			<bibl id="B81">
				<title>
					<p>The Bayesian Structural EM Algorithm</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Proc of the Fourteenth Conf on Uncertainty in Artificial Intelligence (UAI'98)</source>
				<publisher>San Francisco, CA: Morgan Kaufmann</publisher>
				<editor>Cooper GF, Moral S</editor>
				<pubdate>1998</pubdate>
				<fpage>129</fpage>
				<lpage>138</lpage>
			</bibl>
			<bibl id="B82">
				<aug>
					<au>
						<snm>Efron</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Tibshirani</snm>
						<fnm>RJ</fnm>
					</au>
				</aug>
				<source>An introduction to the boostrap</source>
				<publisher>Chapman and Hall</publisher>
				<pubdate>1993</pubdate>
			</bibl>
			<bibl id="B83">
				<title>
					<p>Bias-Corrected Bootstrap and Model Uncertainty</p>
				</title>
				<aug>
					<au>
						<snm>Steck</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Jaakkola</snm>
						<fnm>TS</fnm>
					</au>
				</aug>
				<source>Advances in Neural Information Processing Systems 16</source>
				<publisher>Cambridge, MA: MIT Press</publisher>
				<editor>Thrun S, Saul L, Sch&#246;lkopf B</editor>
				<pubdate>2004</pubdate>
			</bibl>
			<bibl id="B84">
				<title>
					<p>Monte Carlo sampling methods using Markov chains and their applications</p>
				</title>
				<aug>
					<au>
						<snm>Hastings</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Biometrika</source>
				<pubdate>1970</pubdate>
				<volume>57</volume>
				<fpage>97</fpage>
				<lpage>109</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1093/biomet/57.1.97</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B85">
				<title>
					<p>Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks</p>
				</title>
				<aug>
					<au>
						<snm>Husmeier</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<issue>17</issue>
				<fpage>2271</fpage>
				<lpage>2282</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btg313</pubid>
						<pubid idtype="pmpid" link="fulltext">14630656</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B86">
				<title>
					<p>Evaluating functional network inference using simulations of complex biological</p>
				</title>
				<aug>
					<au>
						<snm>Smith</snm>
						<fnm>VA</fnm>
					</au>
					<au>
						<snm>Jarvis</snm>
						<fnm>ED</fnm>
					</au>
					<au>
						<snm>Hartemink</snm>
						<fnm>AJ</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<issue>Suppl 1</issue>
				<fpage>S216</fpage>
				<lpage>S224</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12169550</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B87">
				<title>
					<p>Simulation studies for the identification of genetic networks from cDNA array and regulatory activity data</p>
				</title>
				<aug>
					<au>
						<snm>Zak</snm>
						<fnm>DE</fnm>
					</au>
					<au>
						<snm>Doyle</snm>
						<fnm>FJ</fnm>
					</au>
					<au>
						<snm>Gonye</snm>
						<fnm>GE</fnm>
					</au>
					<au>
						<snm>Schwaber</snm>
						<fnm>JS</fnm>
					</au>
				</aug>
				<source>Proceedings of the Second International Conference on Systems Biology</source>
				<pubdate>2001</pubdate>
				<fpage>231</fpage>
				<lpage>238</lpage>
			</bibl>
			<bibl id="B88">
				<title>
					<p>Experiments on the Accuracy of Algorithms for Inferring the Structure of Genetic Regulatory Networks from Microarray Expression Levels. In</p>
				</title>
				<aug>
					<au>
						<snm>Wimberly</snm>
						<fnm>FC</fnm>
					</au>
					<au>
						<snm>Heiman</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Ramsey</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Glymour</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Proc IJCAI 2003 Bioinformatics Workshop</source>
				<pubdate>2003</pubdate>
			</bibl>
			<bibl id="B89">
				<title>
					<p>New computational approaches for analysis of cis-regulatory networks</p>
				</title>
				<aug>
					<au>
						<snm>Brown</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Rust</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Clarke</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Pan</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Schilstra</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Buysscher</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Griffin</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Wold</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Cameron</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Davidson</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Bolouri</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Developmental Biology</source>
				<pubdate>2002</pubdate>
				<volume>246</volume>
				<fpage>86</fpage>
				<lpage>102</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/dbio.2002.0619</pubid>
						<pubid idtype="pmpid" link="fulltext">12027436</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B90">
				<title>
					<p>A Genomic Regulatory Network for Development</p>
				</title>
				<aug>
					<au>
						<snm>Davidson</snm>
						<fnm>EH</fnm>
					</au>
					<au>
						<snm>Rast</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Oliveri</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Ransick</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Calestani</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Yuh</snm>
						<fnm>CH</fnm>
					</au>
					<au>
						<snm>Minokawa</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Amore</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Hinman</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Arenas-Mena</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Otim</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>CT</fnm>
					</au>
					<au>
						<snm>Livi</snm>
						<fnm>CB</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>PY</fnm>
					</au>
					<au>
						<snm>Revilla</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Rust</snm>
						<fnm>AG</fnm>
					</au>
					<au>
						<snm>jun Pan</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Schilstra</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Clarke</snm>
						<fnm>PJC</fnm>
					</au>
					<au>
						<snm>Arnone</snm>
						<fnm>MI</fnm>
					</au>
					<au>
						<snm>Rowen</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Cameron</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>McClay</snm>
						<fnm>DR</fnm>
					</au>
					<au>
						<snm>Hood</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Bolouri</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2002</pubdate>
				<volume>295</volume>
				<issue>5560</issue>
				<fpage>1669</fpage>
				<lpage>1678</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1069883</pubid>
						<pubid idtype="pmpid" link="fulltext">11872831</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B91">
				<title>
					<p>Reverse engineering gene regulatory networks</p>
				</title>
				<aug>
					<au>
						<snm>Hartemink</snm>
						<fnm>AJ</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2005</pubdate>
				<volume>23</volume>
				<issue>5</issue>
				<fpage>554</fpage>
				<lpage>5</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt0505-554</pubid>
						<pubid idtype="pmpid" link="fulltext">15877071</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B92">
				<title>
					<p>Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical Gaussian models and Bayesian networks</p>
				</title>
				<aug>
					<au>
						<snm>Werhli</snm>
						<fnm>AV</fnm>
					</au>
					<au>
						<snm>Grzegorczyk</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Husmeier</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2006</pubdate>
				<volume>22</volume>
				<fpage>2523</fpage>
				<lpage>2531</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btl391</pubid>
						<pubid idtype="pmpid" link="fulltext">16844710</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B93">
				<title>
					<p>Importance of Input Perturbations and Stochastic Gene Expression in the Reverse Engineering of Genetic Regulatory Networks: Insights From an Identiflability Analysis of an In Silico Network</p>
				</title>
				<aug>
					<au>
						<snm>Zak</snm>
						<fnm>DE</fnm>
					</au>
					<au>
						<snm>Gonye</snm>
						<fnm>GE</fnm>
					</au>
					<au>
						<snm>Schwaber</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Doyle</snm>
						<fnm>FJ</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<issue>11</issue>
				<fpage>2396</fpage>
				<lpage>2405</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">403758</pubid>
						<pubid idtype="pmpid" link="fulltext">14597654</pubid>
						<pubid idtype="doi">10.1101/gr.1198103</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B94">
				<title>
					<p>Evaluating the Effect of Perturbations in Reconstructing Network Topologies</p>
				</title>
				<aug>
					<au>
						<snm>Markowetz</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Spang</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003)</source>
				<editor>Hornik K, Leisch F, Zeileis A</editor>
				<pubdate>2003</pubdate>
			</bibl>
			<bibl id="B95">
				<title>
					<p>Building and analysing genome-wide gene disruption networks</p>
				</title>
				<aug>
					<au>
						<snm>Rung</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Schlitt</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Brazma</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Freivalds</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Vilo</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<issue>Suppl 2</issue>
				<fpage>S202</fpage>
				<lpage>S210</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12386004</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B96">
				<title>
					<p>Reconstructing Pathways in Large Genetic Networks from Genetic Perturbations</p>
				</title>
				<aug>
					<au>
						<snm>Wagner</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Journal of Computational Biology</source>
				<pubdate>2004</pubdate>
				<volume>11</volume>
				<fpage>53</fpage>
				<lpage>60</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/106652704773416885</pubid>
						<pubid idtype="pmpid" link="fulltext">15072688</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B97">
				<title>
					<p>Estimating Coarse Gene Network Structure from Large-Scale Gene Perturbation Data</p>
				</title>
				<aug>
					<au>
						<snm>Wagner</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2002</pubdate>
				<volume>12</volume>
				<issue>2</issue>
				<fpage>309</fpage>
				<lpage>315</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">155274</pubid>
						<pubid idtype="pmpid" link="fulltext">11827950</pubid>
						<pubid idtype="doi">10.1101/gr.193902</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B98">
				<title>
					<p>How to reconstruct a large genetic network from n gene perturbations in fewer than <it>n</it><sup>2 </sup>easy steps</p>
				</title>
				<aug>
					<au>
						<snm>Wagner</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>17</volume>
				<issue>12</issue>
				<fpage>1183</fpage>
				<lpage>1197</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1093/bioinformatics/17.12.1183</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B99">
				<title>
					<p>The transitive reduction of a directed graph</p>
				</title>
				<aug>
					<au>
						<snm>Aho</snm>
						<fnm>AV</fnm>
					</au>
					<au>
						<snm>Garey</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Ullman</snm>
						<fnm>JD</fnm>
					</au>
				</aug>
				<source>SIAM J Comput</source>
				<pubdate>1972</pubdate>
				<volume>1</volume>
				<issue>2</issue>
				<fpage>131</fpage>
				<lpage>137</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1137/0201008</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B100">
				<title>
					<p>Graph algorithms</p>
				</title>
				<aug>
					<au>
						<snm>van Leeuwen</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Handbook of Theoretical Computer Science</source>
				<editor>van Leeuwen J, Elsevier</editor>
				<pubdate>1990</pubdate>
				<fpage>525</fpage>
				<lpage>632</lpage>
			</bibl>
			<bibl id="B101">
				<title>
					<p>An Bayesian Method for Biological Pathway Discovery from High-Throughput Experimental Data</p>
				</title>
				<aug>
					<au>
						<snm>Wang</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Cooper</snm>
						<fnm>GF</fnm>
					</au>
				</aug>
				<source>Proc 3rd International IEEE Computer Society Computational Systems Bioinformatics Conference (CSB 2004)</source>
				<publisher>Stanford, CA, USA: IEEE Computer Society</publisher>
				<pubdate>2004</pubdate>
				<fpage>645</fpage>
				<lpage>646</lpage>
			</bibl>
			<bibl id="B102">
				<title>
					<p>Discovery of regulatory interactions through perturbation: inference and experimental design</p>
				</title>
				<aug>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Thorsson</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Karp</snm>
						<fnm>RM</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2000</pubdate>
				<volume/>
				<fpage>302</fpage>
				<lpage>313</lpage>
			</bibl>
			<bibl id="B103">
				<title>
					<p>Identification of Gene Regulatory Networks by Strategic Gene Disruptions and Gene Overexpressions</p>
				</title>
				<aug>
					<au>
						<snm>Akutsu</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Kuhara</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Maruyama</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Miyano</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Proc 9th Annual ACM-SIAM Symposium on Discrete Algorithms</source>
				<pubdate>1998</pubdate>
				<fpage>695</fpage>
				<lpage>702</lpage>
			</bibl>
			<bibl id="B104">
				<title>
					<p>A System for Identifying Genetic Networks from Gene Expression Patterns Produced by Gene Disruptions and Overexpressions</p>
				</title>
				<aug>
					<au>
						<snm>Akutsu</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Kuhara</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Maruyama</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Miyano</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Genome Informatics 9</source>
				<publisher>Tokyo: Universal Academy Press</publisher>
				<editor>Miyano S, Takagi T</editor>
				<pubdate>1998</pubdate>
				<fpage>151</fpage>
				<lpage>160</lpage>
			</bibl>
			<bibl id="B105">
				<title>
					<p>Reconstructing biological networks using conditional correlation analysis</p>
				</title>
				<aug>
					<au>
						<snm>Rice</snm>
						<fnm>JJ</fnm>
					</au>
					<au>
						<snm>Tu</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Stolovitzky</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<fpage>765</fpage>
				<lpage>773</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti064</pubid>
						<pubid idtype="pmpid" link="fulltext">15486043</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B106">
				<aug>
					<au>
						<snm>Ljung</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>System Identification &#8211; Theory for the User</source>
				<publisher>Prentice Hall</publisher>
				<edition>2</edition>
				<pubdate>1999</pubdate>
			</bibl>
			<bibl id="B107">
				<title>
					<p>Reverse engineering gene networks using singular value decomposition and robust regression</p>
				</title>
				<aug>
					<au>
						<snm>Yeung</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Tegn&#233;r</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Collins</snm>
						<fnm>JJ</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<issue>9</issue>
				<fpage>6163</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">122920</pubid>
						<pubid idtype="pmpid" link="fulltext">11983907</pubid>
						<pubid idtype="doi">10.1073/pnas.092576199</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B108">
				<title>
					<p>Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling</p>
				</title>
				<aug>
					<au>
						<snm>Gardner</snm>
						<fnm>TS</fnm>
					</au>
					<au>
						<snm>di Bernardo</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Lorenz</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Collins</snm>
						<fnm>JJ</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>301</volume>
				<issue>5629</issue>
				<fpage>102</fpage>
				<lpage>105</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1081900</pubid>
						<pubid idtype="pmpid" link="fulltext">12843395</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B109">
				<title>
					<p>Robust identification of large genetic networks</p>
				</title>
				<aug>
					<au>
						<snm>di Bernardo</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>S</snm>
						<fnm>GT</fnm>
					</au>
					<au>
						<snm>Collins</snm>
						<fnm>JJ</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2004</pubdate>
				<fpage>486</fpage>
				<lpage>97</lpage>
				<xrefbib>
					<pubid idtype="pmpid">14992527</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B110">
				<title>
					<p>Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks</p>
				</title>
				<aug>
					<au>
						<snm>di Bernardo</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Thompson</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Gardner</snm>
						<fnm>TS</fnm>
					</au>
					<au>
						<snm>Chobot</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Eastwood</snm>
						<fnm>EL</fnm>
					</au>
					<au>
						<snm>Wojtovich</snm>
						<fnm>AP</fnm>
					</au>
					<au>
						<snm>Elliott</snm>
						<fnm>SJ</fnm>
					</au>
					<au>
						<snm>Schaus</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Collins</snm>
						<fnm>JJ</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2005</pubdate>
				<volume>23</volume>
				<issue>3</issue>
				<fpage>377</fpage>
				<lpage>83</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt1075</pubid>
						<pubid idtype="pmpid" link="fulltext">15765094</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B111">
				<title>
					<p>Causal Discovery from a Mixture of Experimental and Observational Data</p>
				</title>
				<aug>
					<au>
						<snm>Cooper</snm>
						<fnm>GF</fnm>
					</au>
					<au>
						<snm>Yoo</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Proc. Fifthteenth Conference on Uncertainty in Artificial Intelligence (UAI '99)</source>
				<publisher>San Francisco, Calif.: Morgan Kaufman</publisher>
				<editor>Laskey K, Prade H</editor>
				<pubdate>1999</pubdate>
				<fpage>116</fpage>
				<lpage>125</lpage>
			</bibl>
			<bibl id="B112">
				<title>
					<p>Probabilistic soft interventions in Conditional Gaussian networks</p>
				</title>
				<aug>
					<au>
						<snm>Markowetz</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Grossmann</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Spang</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Proc Tenth International Workshop on Artificial Intelligence and Statistics</source>
				<editor>Cowell R, Ghahramani Z</editor>
				<pubdate>2005</pubdate>
			</bibl>
			<bibl id="B113">
				<title>
					<p>Physical Network Models</p>
				</title>
				<aug>
					<au>
						<snm>Yeang</snm>
						<fnm>CH</fnm>
					</au>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Jaakkola</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Journal of Computational Biology</source>
				<pubdate>2004</pubdate>
				<volume>11</volume>
				<issue>2</issue>
				<fpage>243</fpage>
				<lpage>262</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/1066527041410382</pubid>
						<pubid idtype="pmpid" link="fulltext">15285891</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B114">
				<title>
					<p>Ordering gene function: the interpretation of epistasis in regulatory hierarchies</p>
				</title>
				<aug>
					<au>
						<snm>Avery</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Wasserman</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>1992</pubdate>
				<volume>8</volume>
				<issue>9</issue>
				<fpage>312</fpage>
				<lpage>6</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0168-9525(92)90263-4</pubid>
						<pubid idtype="pmpid" link="fulltext">1365397</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B115">
				<title>
					<p>Epistasis analysis with global transcriptional phenotypes</p>
				</title>
				<aug>
					<au>
						<snm>Driessche</snm>
						<fnm>NV</fnm>
					</au>
					<au>
						<snm>Demsar</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Booth</snm>
						<fnm>EO</fnm>
					</au>
					<au>
						<snm>Hill</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Juvan</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Zupan</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Kuspa</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Shaulsky</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2005</pubdate>
				<volume>37</volume>
				<issue>5</issue>
				<fpage>471</fpage>
				<lpage>7</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1545</pubid>
						<pubid idtype="pmpid" link="fulltext">15821735</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B116">
				<title>
					<p>Global mapping of the yeast genetic interaction network</p>
				</title>
				<aug>
					<au>
						<snm>Tong</snm>
						<fnm>AHY</fnm>
					</au>
					<au>
						<snm>Lesage</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Bader</snm>
						<fnm>GD</fnm>
					</au>
					<au>
						<snm>Ding</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Xu</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Xin</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Young</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Berriz</snm>
						<fnm>GF</fnm>
					</au>
					<au>
						<snm>Brost</snm>
						<fnm>RL</fnm>
					</au>
					<au>
						<snm>Chang</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Cheng</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Chua</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Friesen</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Goldberg</snm>
						<fnm>DS</fnm>
					</au>
					<au>
						<snm>Haynes</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Humphries</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>He</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Hussein</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ke</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Krogan</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Levinson</snm>
						<fnm>JN</fnm>
					</au>
					<au>
						<snm>Lu</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>M&#233;nard</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Munyana</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Parsons</snm>
						<fnm>AB</fnm>
					</au>
					<au>
						<snm>Ryan</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Tonikian</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Sdicu</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Shapiro</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Sheikh</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Suter</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Wong</snm>
						<fnm>SL</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>LV</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Burd</snm>
						<fnm>CG</fnm>
					</au>
					<au>
						<snm>Munro</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Sander</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Rine</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Greenblatt</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Peter</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Bretscher</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Bell</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Roth</snm>
						<fnm>FP</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>GW</fnm>
					</au>
					<au>
						<snm>Andrews</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Bussey</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Boone</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>303</volume>
				<issue>5659</issue>
				<fpage>808</fpage>
				<lpage>813</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1091317</pubid>
						<pubid idtype="pmpid" link="fulltext">14764870</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B117">
				<title>
					<p>Combining biological networks to predict genetic interactions</p>
				</title>
				<aug>
					<au>
						<snm>Wong</snm>
						<fnm>SL</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>LV</fnm>
					</au>
					<au>
						<snm>Tong</snm>
						<fnm>AHY</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Goldberg</snm>
						<fnm>DS</fnm>
					</au>
					<au>
						<snm>King</snm>
						<fnm>OD</fnm>
					</au>
					<au>
						<snm>Lesage</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Vidal</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Andrews</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Bussey</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Boone</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Roth</snm>
						<fnm>FP</fnm>
					</au>
				</aug>
				<source>PNAS</source>
				<pubdate>2004</pubdate>
				<volume>101</volume>
				<issue>44</issue>
				<fpage>15682</fpage>
				<lpage>15687</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">524818</pubid>
						<pubid idtype="pmpid" link="fulltext">15496468</pubid>
						<pubid idtype="doi">10.1073/pnas.0406614101</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B118">
				<title>
					<p>Derivation of genetic interaction networks from quantitative phenotype data</p>
				</title>
				<aug>
					<au>
						<snm>Drees</snm>
						<fnm>BL</fnm>
					</au>
					<au>
						<snm>Thorsson</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Carter</snm>
						<fnm>GW</fnm>
					</au>
					<au>
						<snm>Rives</snm>
						<fnm>AW</fnm>
					</au>
					<au>
						<snm>Raymond</snm>
						<fnm>MZ</fnm>
					</au>
					<au>
						<snm>Avila-Campillo</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Shannon</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Galitski</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<issue>4</issue>
				<fpage>R38</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1088966</pubid>
						<pubid idtype="pmpid" link="fulltext">15833125</pubid>
						<pubid idtype="doi">10.1186/gb-2005-6-4-r38</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B119">
				<title>
					<p>Non-transcriptional pathway features reconstructed from secondary effects of RNA interference</p>
				</title>
				<aug>
					<au>
						<snm>Markowetz</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Bloch</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Spang</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>21</issue>
				<fpage>4026</fpage>
				<lpage>4032</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bti662</pubid>
						<pubid idtype="pmpid" link="fulltext">16159925</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B120">
				<title>
					<p>Reconstruction of gene networks using Bayesian learning and manipulation experiments</p>
				</title>
				<aug>
					<au>
						<snm>Pournara</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Wernisch</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>17</issue>
				<fpage>2934</fpage>
				<lpage>2942</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth337</pubid>
						<pubid idtype="pmpid" link="fulltext">15180938</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B121">
				<title>
					<p>Evaluation of a System that Recommends Microarray Experiments to Perform to Discover Gene-Regulation Pathways</p>
				</title>
				<aug>
					<au>
						<snm>Yoo</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Cooper</snm>
						<fnm>GF</fnm>
					</au>
				</aug>
				<source>Artif Intell Med</source>
				<pubdate>2004</pubdate>
				<volume>31</volume>
				<issue>2</issue>
				<fpage>169</fpage>
				<lpage>182</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.artmed.2004.01.018</pubid>
						<pubid idtype="pmpid" link="fulltext">15219293</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B122">
				<title>
					<p>Validation and refinement of gene-regulatory pathways on a network of physical interactions</p>
				</title>
				<aug>
					<au>
						<snm>Yeang</snm>
						<fnm>CH</fnm>
					</au>
					<au>
						<snm>Mak</snm>
						<fnm>HC</fnm>
					</au>
					<au>
						<snm>McCuine</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Workman</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Jaakkola</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Ideker</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<fpage>R62</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1175993</pubid>
						<pubid idtype="pmpid" link="fulltext">15998451</pubid>
						<pubid idtype="doi">10.1186/gb-2005-6-7-r62</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B123">
				<title>
					<p>Reverse engineering gene networks: integrating genetic perturbations with dynamical modeling</p>
				</title>
				<aug>
					<au>
						<snm>Tegner</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Yeung</snm>
						<fnm>MKS</fnm>
					</au>
					<au>
						<snm>Hasty</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Collins</snm>
						<fnm>JJ</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<issue>10</issue>
				<fpage>5944</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">156306</pubid>
						<pubid idtype="pmpid" link="fulltext">12730377</pubid>
						<pubid idtype="doi">10.1073/pnas.0933416100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B124">
				<title>
					<p>A data integration methodology for systems biology: experimental verification</p>
				</title>
				<aug>
					<au>
						<snm>Hwang</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>JJ</fnm>
					</au>
					<au>
						<snm>Leslie</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Weston</snm>
						<fnm>AD</fnm>
					</au>
					<au>
						<snm>Rust</snm>
						<fnm>AG</fnm>
					</au>
					<au>
						<snm>Ramsey</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>de Atauri</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Siegel</snm>
						<fnm>AF</fnm>
					</au>
					<au>
						<snm>Bolouri</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Aitchison</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Hood</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2005</pubdate>
				<volume>102</volume>
				<issue>48</issue>
				<fpage>17302</fpage>
				<lpage>17307</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1297683</pubid>
						<pubid idtype="pmpid" link="fulltext">16301536</pubid>
						<pubid idtype="doi">10.1073/pnas.0508649102</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B125">
				<title>
					<p>A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data</p>
				</title>
				<aug>
					<au>
						<snm>Jansen</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Yu</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Greenbaum</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Kluger</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Krogan</snm>
						<fnm>NJ</fnm>
					</au>
					<au>
						<snm>Chung</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Emili</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Snyder</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Greenblatt</snm>
						<fnm>JF</fnm>
					</au>
					<au>
						<snm>Gerstein</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>302</volume>
				<issue>5644</issue>
				<fpage>449</fpage>
				<lpage>453</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1087361</pubid>
						<pubid idtype="pmpid" link="fulltext">14564010</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B126">
				<title>
					<p>Reconstruction of cellular signalling networks and analysis of their properties</p>
				</title>
				<aug>
					<au>
						<snm>Papin</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Hunter</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Palsson</snm>
						<fnm>BO</fnm>
					</au>
					<au>
						<snm>Subramaniam</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Nat Rev Mol Cell Biol</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<issue>2</issue>
				<fpage>99</fpage>
				<lpage>111</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrm1570</pubid>
						<pubid idtype="pmpid" link="fulltext">15654321</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B127">
				<title>
					<p>From signatures to models: understanding cancer using microarrays</p>
				</title>
				<aug>
					<au>
						<snm>Segal</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Kaminski</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Regev</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Koller</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2005</pubdate>
				<volume>37</volume>
				<issue>Suppl</issue>
				<fpage>S38</fpage>
				<lpage>45</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1561</pubid>
						<pubid idtype="pmpid" link="fulltext">15920529</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B128">
				<title>
					<p>Inferring Cellular Networks Using Probabilistic Graphical Models</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>303</volume>
				<issue>5659</issue>
				<fpage>799</fpage>
				<lpage>805</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1094068</pubid>
						<pubid idtype="pmpid" link="fulltext">14764868</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B129">
				<title>
					<p>Graph-based methods for analysing networks in cell biology</p>
				</title>
				<aug>
					<au>
						<snm>Aittokallio</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Schwikowski</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Brief Bioinform</source>
				<pubdate>2006</pubdate>
				<volume>7</volume>
				<issue>3</issue>
				<fpage>243</fpage>
				<lpage>255</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bib/bbl022</pubid>
						<pubid idtype="pmpid" link="fulltext">16880171</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B130">
				<title>
					<p>Inference in Bayesian networks</p>
				</title>
				<aug>
					<au>
						<snm>Needham</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Bradford</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Bulpitt</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Westhead</snm>
						<fnm>DR</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2006</pubdate>
				<volume>24</volume>
				<fpage>51</fpage>
				<lpage>3</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt0106-51</pubid>
						<pubid idtype="pmpid" link="fulltext">16404397</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B131">
				<title>
					<p>Inferring network interactions within a cell</p>
				</title>
				<aug>
					<au>
						<snm>Carter</snm>
						<fnm>GW</fnm>
					</au>
				</aug>
				<source>Brief Bioinform</source>
				<pubdate>2005</pubdate>
				<volume>6</volume>
				<issue>4</issue>
				<fpage>380</fpage>
				<lpage>389</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bib/6.4.380</pubid>
						<pubid idtype="pmpid" link="fulltext">16420736</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B132">
				<title>
					<p>Gene Network Inference From Large-Scale Expression Data</p>
				</title>
				<aug>
					<au>
						<snm>Filkov</snm>
						<fnm>V</fnm>
					</au>
				</aug>
				<source>Handbook of Computational Molecular Biology (Chapman &amp; All/Crc Computer and Information Science Series)</source>
				<publisher>Chapman &amp; Hall/CRC</publisher>
				<editor>Aluru S</editor>
				<pubdate>2005</pubdate>
			</bibl>
			<bibl id="B133">
				<title>
					<p>Bayesian Network Analysis of Signaling Networks: A Primer</p>
				</title>
				<aug>
					<au>
						<snm>Pe'er</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Science's STKE</source>
				<pubdate>2005</pubdate>
				<volume>2005</volume>
				<issue>281</issue>
				<fpage>pl4.</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/stke.2812005pl4</pubid>
						<pubid idtype="pmpid" link="fulltext">15855409</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B134">
				<title>
					<p>Representing bioinformatics causality</p>
				</title>
				<aug>
					<au>
						<snm>Mandel</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Palfreyman</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Lopez</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Dubitzky</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Brief Bioinform</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<issue>3</issue>
				<fpage>270</fpage>
				<lpage>83</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bib/5.3.270</pubid>
						<pubid idtype="pmpid" link="fulltext">15383213</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B135">
				<title>
					<p>Modeling and Simulation of Genetic Regulatory Systems: A Literature Review</p>
				</title>
				<aug>
					<au>
						<snm>De Jong</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Journal of Computational Biology</source>
				<pubdate>2002</pubdate>
				<volume>9</volume>
				<fpage>67</fpage>
				<lpage>103</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1089/10665270252833208</pubid>
						<pubid idtype="pmpid" link="fulltext">11911796</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B136">
				<title>
					<p>A comparison of genetic network models</p>
				</title>
				<aug>
					<au>
						<snm>Wessels</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>van Someren</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Reinders</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Pac Symp Biocomput</source>
				<pubdate>2001</pubdate>
				<fpage>508</fpage>
				<lpage>519</lpage>
				<xrefbib>
					<pubid idtype="pmpid">11262968</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B137">
				<title>
					<p>Genetic network inference: from co-expression clustering to reverse engineering</p>
				</title>
				<aug>
					<au>
						<snm>D'haeseleer</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Liang</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Somogyi</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2000</pubdate>
				<volume>16</volume>
				<issue>8</issue>
				<fpage>707</fpage>
				<lpage>726</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/16.8.707</pubid>
						<pubid idtype="pmpid" link="fulltext">11099257</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B138">
				<title>
					<p>A bibliography on learning causal networks of gene interactions</p>
				</title>
				<aug>
					<au>
						<snm>Markowetz</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<pubdate>2006</pubdate>
				<url>http://genomics.princeton.edu/~florian/</url>
			</bibl>
		</refgrp>
	</bm>
</art>
