<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>1471-2105-13-53</ui>
<ji>1471-2105</ji>
<fm>
<dochead>Methodology article</dochead>
<bibl>
<title><p>Mouse obesity network reconstruction with a variational Bayes algorithm to employ aggressive false positive control</p></title>
<aug>
<au id="A1"><snm>Logsdon</snm><mi>A</mi><fnm>Benjamin</fnm><insr iid="I1"/><insr iid="I2"/><email>blogsdon@fhcrc.org</email></au>
<au id="A2"><snm>Hoffman</snm><mi>E</mi><fnm>Gabriel</fnm><insr iid="I2"/><email>gh258@cornell.edu</email></au>
<au ca="yes" id="A3"><snm>Mezey</snm><mi>G</mi><fnm>Jason</fnm><insr iid="I2"/><insr iid="I3"/><email>jgm45@cornell.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA</p></ins>
<ins id="I2"><p>Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, USA</p></ins>
<ins id="I3"><p>Department of Genetic Medicine, Weill Cornell Medical College, New York, New York, USA</p></ins>
</insg>
<source>BMC Bioinformatics</source>
<section><title><p>Networks analysis</p></title></section><issn>1471-2105</issn>
<pubdate>2012</pubdate>
<volume>13</volume>
<issue>1</issue>
<fpage>53</fpage>
<url>http://www.biomedcentral.com/1471-2105/13/53</url>
<xrefbib><pubidlist><pubid idtype="pmpid">22471599</pubid><pubid idtype="doi">10.1186/1471-2105-13-53</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>14</day><month>10</month><year>2011</year></date></rec><acc><date><day>2</day><month>4</month><year>2012</year></date></acc><pub><date><day>2</day><month>4</month><year>2012</year></date></pub></history><cpyrt><year>2012</year><collab>Logsdon et al; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec><st><p>Abstract</p></st>
<sec><st><p>Background</p></st>
<p>We propose a novel variational Bayes network reconstruction algorithm to extract the most relevant disease factors from high-throughput genomic data-sets. Our algorithm is the only scalable method for regularized network recovery that employs Bayesian model averaging and that can internally estimate an appropriate level of sparsity to ensure few false positives enter the model without the need for cross-validation or a model selection criterion. We use our algorithm to characterize the effect of genetic markers and liver gene expression traits on mouse obesity related phenotypes, including weight, cholesterol, glucose, and free fatty acid levels, in an experiment previously used for discovery and validation of network connections: an F2 intercross between the C57BL/6 J and C3H/HeJ mouse strains, where apolipoprotein E is null on the background.</p>
</sec>
<sec><st><p>Results</p></st>
<p>We identified eleven genes, Gch1, Zfp69, Dlgap1, Gna14, Yy1, Gabarapl1, Folr2, Fdft1, Cnr2, Slc24a3, and Ccl19, and a quantitative trait locus directly connected to weight, glucose, cholesterol, or free fatty acid levels in our network. None of these genes were identified by other network analyses of this mouse intercross data-set, but all have been previously associated with obesity or related pathologies in independent studies. In addition, through both simulations and data analysis we demonstrate that our algorithm achieves superior performance in terms of power and type I error control than other network recovery algorithms that use the lasso and have bounds on type I error control.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Our final network contains 118 previously associated and novel genes affecting weight, cholesterol, glucose, and free fatty acid levels that are excellent obesity risk candidates.</p>
</sec>
</sec>
</abs>
</fm>
<bdy>
<sec><st><p>Background</p></st>
<p>Network analysis algorithms have been applied to genome-wide polymorphism and gene activity data to identify molecular pathways that mediate risk for complex diseases <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Such analyses have led to the discovery of novel network connections that have been subsequently validated by experiment. For example, Yang et al. <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> validated three novel genes involved in obesity and obesity related phenotypes in an F<sub>2 </sub>mouse cross, based on predictions made from network analysis of genome-wide data. While there have been a few successful validations of this type <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, it has been noted that the false discovery rates of most network analysis techniques are still unacceptably high, given the significant time, financial, and resource investment required for such validation experiments <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. This is a problem for all current statistical network modeling approaches, whether focused on ensemble behavior of groups of genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B4">4</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>, specific conditional network interactions among genes <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>, or directed networks <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. For both broad pattern and specific network modeling methods, there can be high false discovery rates due to random noise and systematic error among samples, unless these are correctly accounted for in the experimental design or underlying statistical modeling framework <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
<p>We propose a novel algorithm that is able to directly control both systematic error and over-fitting sources of high false discovery rates in network reconstruction. The method balances the need for a network modeling methodology with an aggressively controlled false discovery rate, that is capable of representing rich statistical dependencies. To control false discovery rate, the method uses a regularized regression framework for undirected network inference <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B22">22</abbr></abbrgrp> by employing a spike-and-slab prior on the regression coefficients <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> along with a probabilistic consistency bound on the model size <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The spike-and-slab has been conjectured to approach optimal estimation for sparse models <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, and does not suffer from the irrepresentability condition that is a property of many popular penalties, such as the lasso <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, where the wrong model can be returned even asymptotically <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. By using a Bayesian framework, the mixture proportions of the prior are estimated directly from the data, negating the need for penalty selection by cross-validation or information theoretic model selection, as with other penalized approaches <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B28">28</abbr></abbrgrp>. For scaling purposes, the full algorithm makes use of a variational Bayes approximation to allow Bayesian model averaging when considering very large sets of putative network features (i.e. tens of thousands to millions) <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. This approach results in the algorithm returning a sparse network model in which all connections have strong statistical support, instead of a model where only the top few are expected to have a low false discovery rate. To control possible sources of systematic, or confounding error, our method also incorporates the top eigenvectors from a principal component analysis as unpenalized coefficients, an error controlling approach that has been successful in related applications <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B30">30</abbr></abbrgrp>.</p>
<p>We demonstrate the strength of our methodology by analyzing genotype, gene expression, and downstream phenotype data from the F2 intercross generated from mouse strains C57BL/6 J and C3H/HeJ with apolipoprotein E as null on the background (BxH.ApoE<sup>-/-</sup>) <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B31">31</abbr></abbrgrp> to identify network connections among genes and obesity related phenotypes. The genome-wide data from this cross have been used to generate large-scale network predictions of genetic interactions affecting metabolic syndrome associated phenotypes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B6">6</abbr><abbr bid="B9">9</abbr><abbr bid="B31">31</abbr></abbrgrp> and have been used as a foundation for experimental validation of predicted network connections between genes and obesity <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. On a practical level, this experiment has a sufficient sample size (298 F<sub>2 </sub>progeny) to justify the use of a rich statistical model. We demonstrate that our algorithm performs better than the popular lasso <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and the adaptive lasso <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> penalized regression approaches, by demonstrating that neither approach can return a sparse model, where all variables are strongly supported, when using approaches to bound Type I errors or a standard cross-validation choice of penalty parameter. The improved control of false positives with our variational Bayes algorithm is a direct consequence of the forced sparsity of the solution which is induced by the specification of a probabilistic bound on the model size within the algorithm as a function of the sample size and number of variables.</p>
<sec><st><p>Theory - undirected network models</p></st>
<p>A probabilistic undirected network model is defined as an undirected graph with an associated probability measure <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. An undirected graph <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i1"><m:mi mathvariant="script">G</m:mi>
</m:math></inline-formula> is specified as a pair <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i2"><m:mi mathvariant="script">G</m:mi>
<m:mo class="MathClass-rel">=</m:mo>
<m:mfenced close="}" open="{" separators="">
   <m:mrow>
      <m:mi mathvariant="script">V</m:mi>
      <m:mo class="MathClass-punc">,</m:mo>
      <m:mi mathvariant="script">E</m:mi>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>,with <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i3"><m:mrow>
   <m:mi mathvariant="script">V</m:mi>
</m:mrow>
</m:math></inline-formula> a set of vertices (i.e. gene expression products, single nucleotide polymorphisms, or downstream phenotypes), and <it>&#8496; </it>a set of unordered pairs of vertices, specifying undirected edges between vertices <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. We focus on a type of Gaussian graphical model (GGM) defined with respect to a conditional multivariate normal distribution, where we model the distribution of expression traits, conditional on the genotypic states (see Methods for parametric details of a conditional GGM). This particular type of model only captures linear interactions between variables (we do not consider nonlinear or epistatic interactions in this paper). The goal of any network discovery algorithm is to determine the relevant set of edges for each vertex in the graph (i.e. the neighborhood of other vertices connected to each vertex), based on the observed genetic polymorphism, expression, and downstream phenotype data. Most practical genomics applications have far more variables (i.e. vertices) than samples, and therefore require a carefully constrained solution to the neighborhood selection problem, to prevent over-fitting and high false discovery rates <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B12">12</abbr></abbrgrp>. We therefore focus our attention on the regularized solutions to this problem <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B22">22</abbr></abbrgrp>, where the identified neighborhood size can be restricted based on the choice of penalization, and where the neighborhood selection problem is solved through penalized regression of each phenotype on all other phenotypes and genotypes <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Bayesian spike-and-slab prior sparse feature selection and accounting for systematic error</p></st>
<p>In our algorithm, we treat the neighborhood selection problem for any given phenotype <it>y </it>as a regularized regression problem. More specifically, this neighborhood selection problem involves identifying a subset of expression products or genotypes with non-zero regression coefficients in a multiple regression equation <it>y<sub>i </sub></it>= <it>&#956; </it>+ &#8721;<it><sub>j </sub>x<sub>ij </sub>&#946;<sub>j </sub></it>+ <it>e<sub>i</sub></it>, where a penalty is defined over the regression coefficients (<it>&#946;</it><sub>1</sub>,...,<it>&#946;</it><sub><it>p </it>+ <it>m</it></sub>) for the <it>i<sup>th </sup></it>sample with <it>j </it>= 1,...,<it>p </it>+ <it>m </it>possible expression phenotypes and genotypes. We use a mixture spike-and-slab prior as our penalty, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i4"><m:msub>
   <m:mrow>
      <m:mi>&#946;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>j</m:mi>
   </m:mrow>
</m:msub>
<m:mo class="MathClass-rel">~</m:mo>
<m:msub>
   <m:mrow>
      <m:mi>p</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>&#946;</m:mi>
      <m:mo class="MathClass-rel">=</m:mo>
      <m:mn>0</m:mn>
   </m:mrow>
</m:msub>
<m:mstyle class="text">
   <m:mtext class="textsf" mathvariant="sans-serif">I</m:mtext>
</m:mstyle>
<m:mfenced close="]" open="[" separators="">
   <m:mrow>
      <m:mi>&#946;</m:mi>
      <m:mo class="MathClass-rel">=</m:mo>
      <m:mn>0</m:mn>
   </m:mrow>
</m:mfenced>
<m:mo class="MathClass-bin">+</m:mo>
<m:msub>
   <m:mrow>
      <m:mi>p</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>&#946;</m:mi>
      <m:mo class="MathClass-rel">&#8800;</m:mo>
      <m:mn>0</m:mn>
   </m:mrow>
</m:msub>
<m:mi mathvariant="script">N</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mn>0</m:mn>
      <m:mo class="MathClass-punc">,</m:mo>
      <m:msubsup>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>2</m:mn>
         </m:mrow>
      </m:msubsup>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>, with the spike (<it>p</it><sub>&#946; = 0</sub>I[<it>&#946; </it>= 0]) being related to an <it>l</it><sub>0 </sub>type penalty which drives the sparse feature selection and the slab <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i5"><m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
            <m:mo class="MathClass-rel">&#8800;</m:mo>
            <m:mn>0</m:mn>
         </m:mrow>
      </m:msub>
      <m:mi mathvariant="script">N</m:mi>
      <m:mfenced close=")" open="(" separators="">
         <m:mrow>
            <m:mn>0</m:mn>
            <m:mo class="MathClass-punc">,</m:mo>
            <m:msubsup>
               <m:mrow>
                  <m:mi>&#963;</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mi>&#946;</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mn>2</m:mn>
               </m:mrow>
            </m:msubsup>
         </m:mrow>
      </m:mfenced>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula> being related to a ridge or <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i6"><m:msubsup>
   <m:mrow>
      <m:mi>l</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>2</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mn>2</m:mn>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> type penalty which effectively smooths the identified sparse model. The combination of the two different types of penalization is similar in principle to the elastic net penalty <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, which incorporates a combination of <it>l</it><sub>1 </sub>and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i6"><m:msubsup><m:mrow><m:mi>l</m:mi></m:mrow><m:mrow><m:mn>2</m:mn></m:mrow><m:mrow><m:mn>2</m:mn></m:mrow></m:msubsup></m:math></inline-formula> penalties. There is both theoretical <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and empirical evidence <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> that the spike-and-slab prior is more effective than other penalties such as the lasso <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> at generating sparse solutions with low false discovery rates, for ultra-high-dimensional problems when p &#8811; n, where <it>p </it>is the number of variables, and <it>n </it>is the sample size.</p>
<p>We employ a fully Bayesian framework to handle the spike-and-slab prior, where the mixture proportions are estimated directly from the data. The algorithm finds an appropriate level of sparsity supported by the data via the probabilistic bound on model size without relying on cross-validation type approaches or information theoretic model selection criterion. Specifically, this is done by constraining the model dimension such that the number of selected features (<it>s</it>) for any given problem are on the order <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i7"><m:mi>s</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mi>n</m:mi>
   </m:mrow>
</m:mfenced>
<m:mo class="MathClass-rel">=</m:mo>
<m:mi mathvariant="script">O</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msqrt>
         <m:mrow>
            <m:mi>n</m:mi>
         </m:mrow>
      </m:msqrt>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>. This is done by truncating the distribution of <it>p</it><sub><it>&#946;</it>&#8800;0 </sub>such that <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i8"><m:msub>
   <m:mrow>
      <m:mi>p</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>&#946;</m:mi>
      <m:mo class="MathClass-rel">&#8800;</m:mo>
      <m:mn>0</m:mn>
   </m:mrow>
</m:msub>
<m:mo class="MathClass-rel">&#8804;</m:mo>
<m:msqrt>
   <m:mrow>
      <m:mi>n</m:mi>
   </m:mrow>
</m:msqrt>
<m:mo class="MathClass-bin">/</m:mo>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mi>m</m:mi>
      <m:mo class="MathClass-bin">+</m:mo>
      <m:mi>p</m:mi>
      <m:mo class="MathClass-bin">-</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula> for <it>p </it>gene expression or downstream phenotypes and <it>m </it>genotypes. Given mild regularity conditions it has been shown in the context of linear regression that consistency can be established for both <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i9"><m:mi>s</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mi>n</m:mi>
   </m:mrow>
</m:mfenced>
<m:mo class="MathClass-rel">=</m:mo>
<m:mi>o</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msqrt>
         <m:mrow>
            <m:mi>n</m:mi>
         </m:mrow>
      </m:msqrt>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula> <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> and under further mild assumptions <it>s</it>(<it>n</it>) = <it>o</it>(<it>n</it>/log(<it>n</it>)) <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Note that the latter bound is a much weaker constraint on the model size as a function of the number of observations than the former bound. In addition, Zhang et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> show that the <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i7"><m:mi>s</m:mi> <m:mfenced close=")" open="(" separators=""><m:mrow><m:mi>n</m:mi></m:mrow></m:mfenced> <m:mo class="MathClass-rel">=</m:mo> <m:mi mathvariant="script">O</m:mi><m:mfenced close=")" open="(" separators=""><m:mrow><m:msqrt><m:mrow><m:mi>n</m:mi></m:mrow></m:msqrt></m:mrow></m:mfenced></m:math></inline-formula> choice of model size will asymptotically lead to minimum prediction error at a rate <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i10"><m:mi mathvariant="script">O</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msup>
         <m:mrow>
            <m:mi>n</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mo class="MathClass-bin">-</m:mo>
            <m:mn>1</m:mn>
            <m:mo class="MathClass-bin">/</m:mo>
            <m:mn>2</m:mn>
         </m:mrow>
      </m:msup>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>. This justifies our choice of the strength of the penalization to be sufficiently conservative in terms of ensuring few irrelevant features enter the model when p &#8811; n, especially for data with at least hundreds of observations, because of the optimal rate. This bound is also consistent with the results from the simulations within this paper, as well as results from previous applications of this bound <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B29">29</abbr></abbrgrp>. The Bayesian framework also allows the algorithm to take advantage of the multiple modality of the posterior with Bayesian model averaging, a particularly valuable approach when any well-supported sparse solution is expected to capture only a portion of the true network connections.</p>
<p>Another feature of our algorithm is that we also simultaneously correct for systematic error, or other large scale confounding factors among samples, based upon the expression data, by including the top twenty eigenvectors from a principal component analysis as unpenalized fixed effects in our model selection procedure. Therefore, the previous multiple regression equation becomes <it>y<sub>i </sub></it>= <it>&#956; </it>+ &#8721;<it><sub>j</sub> x<sub>ij </sub>&#946;<sub>j </sub></it>+ &#8721;<it><sub>k </sub>t<sub>ik </sub>&#945;<sub>k </sub></it>+ <it>e<sub>i</sub></it>, with <it>t</it><sub>1</sub>,...,<it>t</it><sub>20 </sub>being the top 20 across sample eigenvectors obtained from a standard principal component analysis of the joint gene expression data. The motivation for this is analogous to the use of eigenvectors from principal component analysis to correct for confounding population structure in genetic association analyses <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B30">30</abbr></abbrgrp>, which aims to remove any potentially confounding effects from the inference of the neighborhood of any given phenotype.</p>
</sec>
<sec><st><p>Variational Bayes approximate inference</p></st>
<p>As in Logsdon et al. <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, we use a variational Bayes approximate inference approach to solve the high-dimensional feature selection problem. For the feature selection problem, the variational Bayes approximation is a good tradeoff between speed, since it is much faster than alternative exact inference approaches, and quality of the identified solutions, where empirical evidence shows that it performs well for underlying sparse models <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The variational Bayes approximation consists of minimizing the Kullback-Leibler divergence between an approximate factorized posterior distribution <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i11"><m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>1</m:mn>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>1</m:mn>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
<m:mo class="MathClass-rel">&#8943;</m:mo>
<m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>m</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>m</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
<m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
            <m:mo class="MathClass-rel">&#8800;</m:mo>
            <m:mn>0</m:mn>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
            <m:mo class="MathClass-rel">&#8800;</m:mo>
            <m:mn>0</m:mn>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
<m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>e</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>e</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
<m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula> and the full posterior distribution <it>p</it>(<it>&#946;</it><sub>1</sub>,...,<it>&#946;<sub>m</sub></it>, <it>p</it><sub><it>&#946; </it>&#8800; 0</sub>, <it>&#963;<sub>e</sub></it>, <it>&#963;<sub>&#946;</sub></it>), using iterative expectation-type steps as in an Expectation-Maximization algorithm <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. The relevant statistic for the <it>j<sup>th </sup></it>expression product or genotype produced by the algorithm for the problem of feature selection is the posterior probability of inclusion in the model denoted a <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i12"><m:msub>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op">^</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:mi>j</m:mi>
   </m:mrow>
</m:msub>
</m:math></inline-formula> from thereon. This <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i12"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub></m:math></inline-formula> parameter comes from the approximate posterior inference of the mixture parameters in the spike-and-slab prior. A detailed description of this statistic in terms of the other model parameters is given in the Additional file <supplr sid="S1">1</supplr>, Equations 2, 4, and 17. In our approach we perform a two-step reconstruction of the joint genotype, expression, and downstream phenotype network, where we first perform neighborhood selection for each downstream phenotype individually on all expression traits and genotypes. Then, in the second step, we perform neighborhood selection for each expression trait on all other expression traits and genotypes. The procedure is split into two steps because of the primary interest in the neighborhoods of the downstream phenotypes, followed by interest in the expression Quantitative Trait Loci (eQTL) networks associated with the neighborhoods of the downstream phenotypes. To resolve discrepancies in neighborhoods identified in two directions of regression, we average the <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i12"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub></m:math></inline-formula> scores across both directions of regression at a cutoff of <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i13"><m:msub>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op">^</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:mi>j</m:mi>
   </m:mrow>
</m:msub>
<m:mo class="MathClass-rel">&gt;</m:mo>
<m:mn>0</m:mn>
<m:mi>.</m:mi>
<m:mn>99</m:mn>
</m:math></inline-formula>. This approach is supported by the significant improvement in results obtained from simulations (see Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>). We then combine the neighborhoods of the first and second step of the algorithm through a simple union operation.</p>
<suppl id="S1">
<title><p>Additional file 1</p></title>
<text><p><b>Supplementary methods and results</b>. A supplementary file containing additional descriptions of the variational method, and additional results from simulations and data analysis.</p></text>
<file name="1471-2105-13-53-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Precision-recall curves for simulated networks</p></caption><text>
   <p><b>Precision-recall curves for simulated networks</b>. Precision-recall curves for different strategies for setting the significance threshold for the variational Bayes method as a function of the posterior probability and the randomized lasso with stability selection as a function of the empirical recovery probabilities for stability selection. Twenty replicate networks with 1000 variables, 300 observations, and an average of 1.47 edges per node were simulated (see Methods for further details). The network reconstruction methods compared are as follows, vb<it><sup>&#945;</sup></it>: variational Bayes method with posterior probabilities averaged in both directions of regression, vb<it><sup>&#946;</sup></it>: variational Bayes method with posterior probabilities not averaged, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i14"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>&#945;</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>: randomized lasso with stability selection with the number of false positives bounded below 1 and recovery probabilities averaged in both directions of regression, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i15"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>&#946;</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>: same as <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i14"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mi>&#945;</m:mi></m:mrow></m:msubsup></m:math></inline-formula>, except without averaging, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i16"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>&#947;</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>: randomized lasso with stability selection with the penalty parameter chosen such that the number of false positives are bounded below 1000 and recovery probabilities averaged in both directions of regression, and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i17"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>&#948;</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>: same as <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i16"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mi>&#947;</m:mi></m:mrow></m:msubsup></m:math></inline-formula>, except without averaging.</p>
</text><graphic file="1471-2105-13-53-1" hint_layout="single"/></fig>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Type I errors and power for simulated networks</p></caption><text>
   <p><b>Type I errors and power for simulated networks</b>. The left panel shows the average number of false positives per replicate simulation (out of twenty simulated networks as in Figure 1) for different strategies for setting the significance threshold for both the variational Bayes method and the lasso methods. The right panel shows the corresponding power for all of the methods. The methods compared are as follows, vb<sup>a </sup>and vb<sup>b</sup>: variational Bayes methods without and with averaging in both directions of regression, and a posterior probability threshold of <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i13"><m:msub>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op">^</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:mi>j</m:mi>
   </m:mrow>
</m:msub>
<m:mo class="MathClass-rel">&gt;</m:mo>
<m:mn>0</m:mn>
<m:mi>.</m:mi>
<m:mn>99</m:mn>
</m:math></inline-formula>, vb<sup>c </sup>and vb<sup>d</sup>: variational Bayes methods without and with averaging, and a posterior probability threshold of <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i18"><m:msub>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op">^</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:mi>j</m:mi>
   </m:mrow>
</m:msub>
<m:mo class="MathClass-rel">&gt;</m:mo>
<m:mn>0</m:mn>
<m:mi>.</m:mi>
<m:mn>5</m:mn>
</m:math></inline-formula>, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i19"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mstyle class="text">
         <m:mtext class="textsf" mathvariant="sans-serif">a</m:mtext>
      </m:mstyle>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i20"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mstyle class="text">
         <m:mtext class="textsf" mathvariant="sans-serif">b</m:mtext>
      </m:mstyle>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>: randomized lasso with stability selection with the number of false positives bounded below 1 for the entire network, without and with averaging, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i21"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mstyle class="text">
         <m:mtext class="textsf" mathvariant="sans-serif">c</m:mtext>
      </m:mstyle>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i22"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mstyle class="text">
         <m:mtext class="textsf" mathvariant="sans-serif">d</m:mtext>
      </m:mstyle>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>: randomized lasso with stability selection with the number of false positives bounded below 1000 for the entire network, without and with averaging, and finally <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i23"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mstyle class="text">
         <m:mtext class="textsf" mathvariant="sans-serif">e</m:mtext>
      </m:mstyle>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i24"><m:msubsup>
   <m:mrow>
      <m:mi>&#8467;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mstyle class="text">
         <m:mtext class="textsf" mathvariant="sans-serif">f</m:mtext>
      </m:mstyle>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>: regular lasso with the number of false positive bounded below 1 and 1000 for the entire network.</p>
</text><graphic file="1471-2105-13-53-2" hint_layout="double"/></fig>
</sec>
</sec>
<sec><st><p>Results</p></st>
<sec><st><p>Simulation results</p></st>
<p>We performed a simulation study to compare our approach to other comparable methods that use the lasso with mechanisms for bounding the number of type I errors, as well as to lasso methods using a standard cross-validation approach, shrinkage estimation, partial least squares estimation, and ridge estimation methods. For the bounded type I error lasso methodologies, this included the randomized lasso with stability selection <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and the regular lasso with the penalization chosen to bound the number of type I errors as in Meinshausen and B&#252;hlmann <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. We simulated twenty networks with a random underlying topology, <it>p </it>= 1000 variables, <it>n </it>= 300 observations, and on average 1.47 edges per variable (further details of the simulation are presented in the Methods). In Figure <figr fid="F1">1</figr> we show the precision-recall curves for two different strategies for defining the posterior probability of edge inclusion for the variational Bayes methodology: vb<it><sup>&#945; </sup></it>where the posterior probabilities are averaged in both directions of regression, and vb<it><sup>&#946;</sup></it>, where the posterior probabilities are not averaged in both directions of regression. We also show four different strategies for defining the empirical selection probabilities defined by the randomized lasso with stability selection: <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i19"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">a</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula> where the penalization parameter is chosen as in Meinshausen and B&#252;hlmann <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> to bound the number of false positives to be less than one, and the empirical selection probabilities are averaged in both directions, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i15"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mi>&#946;</m:mi></m:mrow></m:msubsup></m:math></inline-formula> where the penalty parameter is chosen similarly, but the empirical selection probabilities are not averaged, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i16"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mi>&#947;</m:mi></m:mrow></m:msubsup></m:math></inline-formula> where the penalization parameter is chosen as in Meinshausen and B&#252;hlmann <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> to bound the number of false positives to be less than 1000, and finally <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i17"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mi>&#948;</m:mi></m:mrow></m:msubsup></m:math></inline-formula>, where the penalty parameter is chosen similarly, but the empirical selection probabilities are not averaged. All curves are generated as a function of the threshold for declaring significance based on the associated probability statistics. We see that the variational Bayes approach significantly outperforms the randomized lasso with stability selection in terms of both power and type I error control for the averaged and non-averaged (vb<it><sup>&#945;</sup></it>, vb<it><sup>&#946;</sup></it>) posterior probability statistics across most thresholds for declaring significance.</p>
<p>In Figure <figr fid="F2">2</figr>, we illustrate the performance in terms of the average number of false positives observed in the entire network per replicate network (the left panel), and the overall power (the right panel) for specific thresholds of the variational Bayes and lasso approaches. Specifically, we investigate the variational Bayes approach for the conservative posterior probability thresholds of <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i13"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub> <m:mo class="MathClass-rel">&gt;</m:mo> <m:mn>0</m:mn><m:mi>.</m:mi><m:mn>99</m:mn></m:math></inline-formula>, not averaged (vb<sup>a</sup>), and averaged (vb<sup>b</sup>), as well as for the more liberal <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i18"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub> <m:mo class="MathClass-rel">&gt;</m:mo> <m:mn>0</m:mn><m:mi>.</m:mi><m:mn>5</m:mn></m:math></inline-formula>, not averaged (vb<sup>c</sup>), and averaged (vb<sup>d</sup>) as described above. We also show the randomized lasso with stability selection for the more conservative strategy where the number of false positives is bounded to be less than 1, not averaged (<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i19"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">a</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>), and averaged (<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i20"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">b</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>), as well as for the more liberal strategy of the number of false positives bounded to be less than 1000, not averaged (<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i21"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">c</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>), and averaged (<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i22"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">d</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>). Finally, we show the method of choosing the penalization of the regular lasso to bound the number of false positives to be less than 1 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> (<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i23"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">e</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>), and less than 1000 (<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i24"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">f</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>). Across all of these results, we see that the more conservative variational Bayes approaches (vb<sup>a </sup>and vb<sup>b</sup>) outperforms the conservative lasso approaches (<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i19"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">a</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i20"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">b</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>, and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i23"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">e</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>), as well as most of the more liberal lasso approaches (except with vb<sup>b </sup>and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i21"><m:msubsup><m:mrow><m:mi>&#8467;</m:mi></m:mrow><m:mrow><m:mn>1</m:mn></m:mrow><m:mrow><m:mstyle class="text"><m:mtext class="textsf" mathvariant="sans-serif">c</m:mtext></m:mstyle></m:mrow></m:msubsup></m:math></inline-formula>), while at the same time recovering far fewer false positives. At the more liberal threshold of <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i18"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub> <m:mo class="MathClass-rel">&gt;</m:mo> <m:mn>0</m:mn><m:mi>.</m:mi><m:mn>5</m:mn></m:math></inline-formula>, the performance of the variational Bayes algorithm is further improved, especially for the averaged solution, vb<sup>d</sup>, which has a comparable number of false positives to any of the liberal lasso solutions, but has much greater power. The lasso methods had the most comparable performance to our algorithm based on additional simulated data considering five competing methods: lasso, adaptive lasso, shrinkage estimation <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, partial least squares estimator <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, and a ridge estimator <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> (Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>, Additional file <supplr sid="S1">1</supplr>: Figure S4, Additional file <supplr sid="S1">1</supplr>: Figure S5, Additional file <supplr sid="S1">1</supplr>: figure S6, Additional file <supplr sid="S1">1</supplr>: figure S7) (see the Additional file <supplr sid="S1">1</supplr> for a detailed description of additional simulations and network reconstruction methods). Therefore we only compared the lasso approaches to our algorithm when analyzing the experimental data.</p>
</sec>
<sec><st><p>Mouse downstream phenotype neighborhood identification</p></st>
<p>For the data analysis, we analyzed the F<sub>2 </sub>progeny of a cross between the C57BL/6 J (B6) and C3H/HeJ (C3H) strains on an apolipoprotein E null (ApoE -/-) background (BXH.ApoE<sup>-/-</sup>), as presented in Ghazalpour et al. and Wang et al. <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B31">31</abbr></abbrgrp>. We focused on the gene expression data that was collected in the liver of the mice where expression was assayed on 23,574 custom probes <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. In addition, there were 22 downstream phenotypes that were assayed, including weight, cholesterol, glucose, free fatty acid, among other metabolic phenotypes, as well as 1,347 genetic markers <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. A total of 298 individuals were retained after filtering down to those for which both expression and genetic markers were collected. Previous authors have shown the antagonistic sex effects within this data <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, i.e. the effect of a risk locus is opposite between males and females. To address the sex specific effects, as well as other possibly confounding factors, we included both the sex and the 20 first eigenvectors from a principal component analysis computed across samples for expression phenotypes as unpenalized fixed effects in our linear model for all methods that we compared.</p>
<p>We first ran our variational algorithm on each of the 22 obesity related downstream phenotypes individually, where we performed sparse feature selection on all gene products and genetic markers. Our variational algorithm produced a sparse set of expression and genetic markers for each downstream phenotype, with the phenotypes with more than seven expression or genotype features identified shown in Additional file <supplr sid="S1">1</supplr>: Table S1. We ran the randomized lasso with stability selection to bound the number of false positives to be less than 1 <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, and the regular lasso with the choice of penalty to bound the false positives to be less than 1 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, and found that no expression traits or genotypes were identified as having non-zero effects by either approach. We also ran the lasso and adaptive lasso with ten-fold cross-validation for the same set of downstream phenotypes, as shown in Additional file <supplr sid="S1">1</supplr>: Table S1. The number of identified genetic interactions of each downstream phenotype was on average much larger for the lasso and the adaptive lasso with ten-fold cross-validation. The variational algorithm identifies additional features, with only 55% overlap with the lasso, and 46% overlap with the adaptive lasso for the seven phenotypes shown in Additional file <supplr sid="S1">1</supplr>: Table S1.</p>
<p>To assess the statistical confidence of our initial mouse obesity analysis, we determined the confidence intervals for each of the downstream phenotype network connections recovered with each feature selection method, in an independent, non-penalized linear multiple regression model. Both the lasso and the adaptive lasso contained many features that were not statistically significant at the <it>P </it>&lt; 0.05 significance level, indicating that the use of cross-validation as a method to control the sparsity of the model for the lasso or adaptive lasso allows an unacceptable number of false positives to be included in the model. We depict the network model and confidence intervals for the network connections identified by the lasso and adaptive lasso for weight in Additional file <supplr sid="S1">1</supplr>: Figure S1 and Additional file <supplr sid="S1">1</supplr>: Figure S2. For all phenotypes, the features identified as having a downstream network connection by the variational algorithm were all significant (the models and confidence intervals are depicted in Figures <figr fid="F3">3</figr> and <figr fid="F4">4</figr>). We also recapitulated a similar result in additional simulations where we demonstrate that at the <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i13"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub> <m:mo class="MathClass-rel">&gt;</m:mo> <m:mn>0</m:mn><m:mi>.</m:mi><m:mn>99</m:mn></m:math></inline-formula> cutoff, the variational Bayes method returns fewer false positives and tighter confidence intervals on all predicted network connections as opposed to the lasso and adaptive lasso (Additional file <supplr sid="S1">1</supplr>: Figure S3). Given the increased performance in terms of learning a statistically robust model and the appropriate sparsity of that model, we proceeded with only the variational Bayes algorithm for the expanded network analysis.</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Local networks for obesity phenotypes</p></caption><text>
   <p><b>Local networks for obesity phenotypes</b>. Network graph models of the genetic markers and gene expression traits identified by the variational Bayes algorithm as being associated with the downstream phenotypes: weight, high-density lipoprotein cholesterol levels (HDL), total cholesterol levels (TC), unesterified cholesterol (UC) levels. The plots below present the ordered estimates and 0.95 confidence intervals for the parameters from an independently fit, unpenalized multiple regression model of the downstream phenotypes on the genetic markers and gene expression trait in each of the identified models.</p>
</text><graphic file="1471-2105-13-53-3" hint_layout="double"/></fig>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Local networks for obesity phenotypes</p></caption><text>
   <p><b>Local networks for obesity phenotypes</b>. Network graph models and confidence interval plots as described in Figure 3 for free fatty acid (FFA) levels, glucose levels, and low-density lipoprotein + very low-density lipoprotein (LDL + VLDL) levels.</p>
</text><graphic file="1471-2105-13-53-4" hint_layout="double"/></fig>
</sec>
<sec><st><p>Expanded undirected network reconstruction</p></st>
<p>In the second step of our analysis, we used our variational Bayes algorithm to generate an undirected network among genotypes, expressed genes, and downstream phenotypes, by solving the neighborhood selection problem for each gene expression product individually, against all other genes and genetic markers. We resolved the neighborhoods of the networks very conservatively, by averaging the <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i12"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub></m:math></inline-formula> scores in both directions of regression for the expression phenotypes, and only declaring an interaction between genes present in the model if the averaged <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i12"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub></m:math></inline-formula> scores were greater than 0.99. To determine the most relevant aspects of this sparse network with respect to weight and other related phenotypes, we combined the neighborhoods produced for each of the downstream phenotypes from the first phase of the analysis and the second phase expression undirected network, to depict the local sub-networks associated with each downstream phenotype, for weight, total cholesterol, high density lipoprotein (HDL) cholesterol, unesterified cholesterol (UC), free fatty acids (FFA), glucose levels, and low density lipoprotein + very low density lipoprotein (LDL + VLDL) levels (Figure <figr fid="F5">5</figr>). Table <tblr tid="T1">1</tblr> summarizes identified genes which have been previously implicated in obesity, or related diseases and pathologies (Additional file <supplr sid="S1">1</supplr>: Table S2 is a version of this table with references available in the Additional file <supplr sid="S1">1</supplr>).</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Integrated obesity network</p></caption><text>
   <p><b>Integrated obesity network</b>. An undirected obesity and metabolic network reconstructed from the mouse F<sub>2 </sub>intercross data using the variational Bayes algorithm. The network depicts genetic markers (blue), gene expression traits (red), and the downstream phenotypes (green) weight, high-density lipoprotein cholesterol levels (HDL), total cholesterol levels (TC), unesterified cholesterol (UC) levels, free fatty Acid (FFA) levels, glucose levels, and low-density lipoprotein + very low-density lipoprotein (LDL + VLDL) levels.</p>
</text><graphic file="1471-2105-13-53-5" hint_layout="double"/></fig>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Obesity related interactions.</p></caption><tblbdy cols="3">
      <r>
         <c ca="left">
            <p>
               <b>Gene/SNP</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Disease</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Organism(s)</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Zfp69</p>
         </c>
         <c ca="left">
            <p>Candidate gene for diabetes associated with obesity</p>
         </c>
         <c ca="left">
            <p>Mouse and Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Gna14</p>
         </c>
         <c ca="left">
            <p>Association study of hypertension</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>F11r</p>
         </c>
         <c ca="left">
            <p>Induces hypertension in the brain stem</p>
         </c>
         <c ca="left">
            <p>Rat</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Gabarapl1</p>
         </c>
         <c ca="left">
            <p>Regulator of insulin dependent</p>
            <p>hepatic autophagy</p>
         </c>
         <c ca="left">
            <p>Mouse</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Wisp1</p>
         </c>
         <c ca="left">
            <p>Association study of hypertension</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Fdft1</p>
         </c>
         <c ca="left">
            <p>Squalene (cholesterol) biosynthesis gene</p>
         </c>
         <c ca="left">
            <p>Mouse and Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Ier2</p>
         </c>
         <c ca="left">
            <p>Induced gene in insulin signalling pathways</p>
         </c>
         <c ca="left">
            <p>Rat</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Slc24a3</p>
         </c>
         <c ca="left">
            <p>Down regulated in diet sensitive</p>
            <p>obesity</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Crhr1</p>
         </c>
         <c ca="left">
            <p>Candidate obesity gene possibly affecting feeding behavior</p>
         </c>
         <c ca="left">
            <p>Mouse and Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Qpctl</p>
         </c>
         <c ca="left">
            <p>Association study identified</p>
            <p>candidate obesity gene</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Vcam-1</p>
         </c>
         <c ca="left">
            <p>Atherosclerotic plaque</p>
            <p>associated gene</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Gch1</p>
         </c>
         <c ca="left">
            <p>Identified in linkage studies</p>
            <p>of maximal sedentary oxygen uptake</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Dlgap1</p>
         </c>
         <c ca="left">
            <p>Type-2 diabetes associated gene</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Yy1</p>
         </c>
         <c ca="left">
            <p>Type-1 diabetes associated gene</p>
         </c>
         <c ca="left">
            <p>Rat</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Ccl19</p>
         </c>
         <c ca="left">
            <p>Adipocyte inflammation</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Cnr2</p>
         </c>
         <c ca="left">
            <p>Obesity associated</p>
            <p>adipocyte inflammation</p>
         </c>
         <c ca="left">
            <p>Mouse</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Atp10a/rs3664823</p>
         </c>
         <c ca="left">
            <p>Obesity associated gene</p>
         </c>
         <c ca="left">
            <p>Mouse</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Folr2</p>
         </c>
         <c ca="left">
            <p>Up-regulated in obesity associated adipose tissue</p>
         </c>
         <c ca="left">
            <p>Human</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Interactions identified by the variational method with previous evidence of being associated with obesity, or obesity related phenotypes</p>
   </tblfn></tbl>
<p>The network recovered by our algorithm is enriched for interactions that have been previously associated with these phenotypes: a total of 18 out of 118 recovered. While this may appear modest, it still suggests that this list of 118 variables is enriched for good candidate genes for follow up studies. From the first step of the analysis we find eleven genes (Gch1, Zfp69, Dlgap1, Gna14, Yy1, Gabarapl1, Folr2, Fdft1, Cnr2, Slc24a3, and Ccl19), as well as a single nucleotide polymorphism (rs3664823) that are directly linked to the downstream metabolic phenotypes and have independent evidence of being associated with obesity or obesity related pathologies (along with 52 novel genetic variables). We further identify six genes that feed into the genes that directly interact with the metabolic phenotypes (Ier2, H11r, Wisp1, Crhr1, Qpctl, Vcam-1), as well as an additional 54 novel interactions. These other novel interactions included Dcamkl1, Ercc1, and Cyp7a1, implicating a possible connection with weight, intestinal stem cell lineage <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, and DNA damage repair <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, along with a connection between the levels of free fatty acids and bile acid production <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>.</p>
</sec>
</sec>
<sec><st><p>Discussion</p></st>
<p>Our variational Bayes algorithm is designed as a scalable and robust method for recovering a sparse network from the analysis of genome-wide data where only statistically relevant features are returned. This makes it particularly well suited to analyses aimed towards experimental validation of predicted biological interactions, for which the burden of false positives is costly <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. While our algorithm does not provide a large-scale picture of network topology that is the goal of the majority of network analysis methodologies <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, it nevertheless provides a short list of very statistically significant features, an outcome advantageous to the experimentalist interested in following up on the highest quality predictions. In addition, we analyzed the entire network of 24,921 variables in 48 hours on a single machine with dual quad-core Intel Xeon processors (fitting a model with 3.08 &#215; 10<sup>8 </sup>possible linear interactions). Recent work has shown that the variational Bayes approach for the spike-and-slab regression model is orders of magnitude faster than the corresponding Markov chain Monte Carlo approach <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, indicating that we are able to solve high-dimensional problems much faster than the corresponding exact inference approach.</p>
<p>Our algorithm focuses on the reconstruction of undirected networks because they are more amenable to highly scalable, sparse feature selection methods <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Only a few of the undirected graphical algorithms that have been proposed, such as the lasso, that can simultaneously return sparse network models and analyze the entirety of genome-wide variables without requiring a step-wise procedure (e.g. the PC-algorithm <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>). We demonstrated through simulations that the lasso with type I error control <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B40">40</abbr></abbrgrp> does not perform as well as the variational spike-and-slab approach that we propose, as shown in Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>. In addition, when we applied the lasso with type I error control to the mouse data, we saw that the performance was even worse than in the simulations, with both stability selection and a choice of penalization to bound the type I error being severely under-powered (returning the null model for all analyses). This was not entirely surprising, given similar results in the context of genome-wide association studies for the lasso with stability selection <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, where high-dimensional data with p &#8811; n and significant correlations among variables caused stability selection to perform very poorly in terms of power (likely because of the difficulty in deciding which variable within a given correlated block of variables should be included in the model). However, the lasso and adaptive lasso with cross-validation were able to identify non-null models for the downstream phenotypes, though these methods may not necessarily produce solutions that are highly enriched for true positives. We find that while the lasso and adaptive lasso with cross-validation can produce some quality predictions of biological interactions, they also include a majority of statistically less supported results compared to our variational Bayes algorithm, which is able to estimate both a strongly statistically supported model, and the degree of sparsity of the model.</p>
<p>It should be noted that despite the similarities between the model we propose in this paper, and previous work for variational Bayes algorithms with spike-and-slab priors <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B45">45</abbr></abbrgrp>, there remains some important distinctions. First, as opposed to Logsdon et al. <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, we only consider a single non-zero component for the slab in the spike-and-slab prior, instead of two truncated normal distributions, therefore reducing the dimensionality of the parameter space. Second, we incorporate a Bayesian model averaging procedure that increases the stability of the solutions identified across the space of possible models. By down-weighting variables that are inconsistent between models with similar fits to the data, Bayesian model averaging may also address some of the concerns of Carbonetto and Stephens <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> with regards to the variational Bayes approach incorrectly identifying false positives that are correlated with true positives. Third, we simplify the estimation of the effect of unpenalized covariates by treating them as non-random parameters whose effects are estimated through the maximization of the lower bound. Finally, because we are reconstructing networks among multiple phenotypes, we show we can improve the performance of our approach by averaging the posterior probability statistics in both directions of regression (Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>).</p>
<p>In addition, 5 of the 11 genes and single nucleotide polymorphism with independent evidence of possible metabolic functionality in the downstream phenotype analysis were uniquely identified by the variational spike-and-slab method but not found by either the lasso or adaptive lasso with cross-validation (Gabarapl1, Dlgap1, Folr2, Cnr2, and rs3664823). This indicates that even when the lasso is tuned to be more liberal, as with cross-validation, the variational spike-and-slab methodology can identify additional high confidence results within a particular data-set. There is evidence that this is the case because of both the non-convex nature of the spike-and-slab penalty, which does not over-penalize true effects as severely as the lasso <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, and because of the additional regularization associated with Bayesian model averaging. Bayesian model averaging effectively regularizes over the ensemble of identified solutions to only include effects with strong evidence across model space.</p>
<p>We identified 18 genetic variables that have been previously linked to obesity or obesity related phenotypes using our variational method with a strict control of false discovery rate. These variables include genes related to cholesterol biosynthesis such as the gene farnesyl diphosphate farnesyl transferase 1 (Fdft1). This is a known squalene (i.e. cholesterol) synthesis gene where high levels of this gene are known to be associated with visceral obesity <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. It has also been shown to be up-regulated in mice on a high fat diet <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> and is directly linked to the unesterified cholesterol levels. We also found genes related to neurological regulation of appetite including Cnr2 and Crhr1 <abbrgrp><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr></abbrgrp>, and variables involved in insulin signaling pathways including the gene immediate early response 2 (Ier2) also known as Pip92, which is known to be induced by insulin signaling <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, and is linked through BC021367 (a transmembrane protein also known as Tmem161a) to the levels of free fatty acids in our model. We also see three genes implicated in hypertension: Wisp1, Gna14, and F11r. The gene WNT1 induced signaling pathway protein 1 (Wisp1) is connected with HDL levels through the N-myc downstream regulated gene 1 (Ndrg1) gene in our model. In addition, guanine nucleotide binding protein, alpha 14 (Gna14) is directly linked to weight. Finally, the gene F11r, also known as junctional adhesion molecule-1 (JAM-1) is related to both the total cholesterol levels in our model, as well as the combined LDL and VLDL cholesterol levels, through the 1190002J23Ri expression probe i.e. kelch domain containing 9 (Klhdc9) gene.</p>
<p>We also observe previously identified obesity associated genes, such as Zfp69, Slc24a3, Qpctl, Atp10a, and Folr2. This coverage of a broad spectrum of previously identified etiologies underlying obesity indicates the quality of the data as well as the predictions, given the complex nature of the obesity phenotype. In addition, through the network construction we were able to generate novel predictions, such as a <it>cis</it>-eQTL near rs3686646 may interact with Cytochrome c assembly, which in turn may have an impact on weight. In addition, we predict that the gene Slc24a3 modulates the effect of Crhr1 and Qpctl on the levels of glucose in the blood.</p>
<p>These additional network inferences also provide information with respect to how the effect of a given gene on a downstream metabolic phenotype may be mediated (Figure <figr fid="F5">5</figr>). For example, solute carrier family 24, member 3 (Slc24a3), which has been previously identified as having significantly decreased expression in diet-sensitive obese women and is directly linked to glucose levels in our network <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, has both the genes corticotropin releasing hormone receptor 1 (Crhr1) and glutaminyl-peptide cyclotransferase-like (Qpctl) directly linked to it. Both of these genes have been previously implicated as candidate obesity genes <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B53">53</abbr></abbrgrp>. This suggests that the effect of the obesity risk associated with Qpctl andCrhr1 are mediated by Slc24a3's effect on the levels of glucose in the bloodstream.</p>
<p>The network connections recovered by our method identified a number of novel features important for obesity related phenotypes not previously identified by network analysis of these data. This includes the zinc-fingered protein 69 (Zfp69), which is directly linked to weight in our model. This gene has previously been identified as a candidate gene, for the diabetogenic effect of the Nidd/SJL loci in obese mice <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. Another variable is the expression of cannabinoid receptor 2 (Cnr2), which is connected to both free fatty acids and glucose in our model, and that has been shown to mediate an innate immune response leading to inflammation in obese mouse adipocytes <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Methods</p></st>
<sec><st><p>The network model</p></st>
<p>Similarly to Yin and Li <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, we assume that the expression data for the <it>i<sup>th </sup></it>sample (<b>y<sub>i</sub></b>) conditioned on a set of genotypes and unpenalized covariates (<b>x<sub>i</sub></b>) is distributed normally <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i25"><m:mrow>
   <m:msub>
      <m:mrow>
         <m:mi mathvariant="bold">y</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi mathvariant="bold">i</m:mi>
      </m:mrow>
   </m:msub>
   <m:mo class="MathClass-rel">|</m:mo>
   <m:msubsup>
      <m:mrow>
         <m:mi mathvariant="bold">x</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi mathvariant="bold">i</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
         <m:mi>i</m:mi>
         <m:mi>d</m:mi>
      </m:mrow>
   </m:msubsup>
   <m:mo class="MathClass-rel">~</m:mo>
   <m:mi mathvariant="script">N</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:mi mathvariant="text">&#915;</m:mi>
         <m:msub>
            <m:mrow>
               <m:mi mathvariant="bold">x</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi mathvariant="bold">i</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:msubsup>
            <m:mrow>
               <m:mi mathvariant="text">&#920;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi mathvariant="bold">y</m:mi>
               <m:mi>y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mn>1</m:mn>
            </m:mrow>
         </m:msubsup>
      </m:mrow>
   </m:mfenced>
</m:mrow>
</m:math></inline-formula>, with means determined by possibly sparse linear functions of genotypes and unpenalized covariates (&#915;<b>x<sub>i</sub></b>) as well as a sparse precision matrix (&#920;<b><sub>yy</sub></b>) (i.e. the inverse covariance matrix). Yet, in contrast to Yin and Li <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, we define an alternative parameterization of the mean effects &#915;, such that <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i26"><m:mi mathvariant="text">&#915;</m:mi>
<m:mo class="MathClass-rel">=</m:mo>
<m:msubsup>
   <m:mrow>
      <m:mi mathvariant="text">&#920;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi mathvariant="bold">y</m:mi>
      <m:mi mathvariant="bold">x</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi mathvariant="bold">T</m:mi>
   </m:mrow>
</m:msubsup>
<m:msubsup>
   <m:mrow>
      <m:mi mathvariant="text">&#920;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi mathvariant="bold">y</m:mi>
      <m:mi mathvariant="bold">y</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mo class="MathClass-bin">-</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>. This lets us consider not only the conditional independencies among phenotypes correcting for the effect of genotypes and covariates as in Yin and Li <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, but this also allows us to identify a set of genotypic effects <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i27"><m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msubsup>
         <m:mrow>
            <m:mi mathvariant="text">&#920;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi mathvariant="bold">y</m:mi>
            <m:mi mathvariant="bold">x</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi mathvariant="bold">T</m:mi>
         </m:mrow>
      </m:msubsup>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula> that directly takes into account the conditional independence structure among the expression phenotypes. The log-likelihood defined by this model is as follows <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>:</p>
<p><display-formula id="M1"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i28"><m:mrow>
   <m:mstyle class="text">
      <m:mtext class="textsf" mathvariant="sans-serif">log</m:mtext>
   </m:mstyle>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:mi mathvariant="bold">Y</m:mi>
         <m:mo class="MathClass-rel">|</m:mo>
         <m:mi mathvariant="bold">X</m:mi>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:mi mathvariant="text">&#920;</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-rel">&#8733;</m:mo>
   <m:mstyle class="text">
      <m:mtext class="textsf" mathvariant="sans-serif">log</m:mtext>
   </m:mstyle>
   <m:mfenced close="}" open="{" separators="">
      <m:mrow>
         <m:mstyle class="text">
            <m:mtext class="textsf" mathvariant="sans-serif">det</m:mtext>
         </m:mstyle>
         <m:mfenced close=")" open="(" separators="">
            <m:mrow>
               <m:msub>
                  <m:mrow>
                     <m:mi mathvariant="text">&#920;</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mi mathvariant="bold">y</m:mi>
                     <m:mi mathvariant="bold">y</m:mi>
                  </m:mrow>
               </m:msub>
            </m:mrow>
         </m:mfenced>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-bin">-</m:mo>
   <m:mstyle class="text">
      <m:mtext class="textsf" mathvariant="sans-serif">Tr</m:mtext>
   </m:mstyle>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:mi mathvariant="bold">S</m:mi>
         <m:mi mathvariant="text">&#920;</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>where:</p>
<p><display-formula id="M2"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i29"><m:mrow>
   <m:mi mathvariant="text">&#920;</m:mi>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mfenced close="]" open="[" separators="">
      <m:mrow>
         <m:mtable class="array" columnlines="none" equalcolumns="false" equalrows="false">
            <m:mtr>
               <m:mtd class="array" columnalign="center">
                  <m:msub>
                     <m:mrow>
                        <m:mi mathvariant="text">&#920;</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">y</m:mi>
                        <m:mi mathvariant="bold">y</m:mi>
                     </m:mrow>
                  </m:msub>
               </m:mtd>
               <m:mtd class="array" columnalign="center">
                  <m:msub>
                     <m:mrow>
                        <m:mi mathvariant="text">&#920;</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">y</m:mi>
                        <m:mi mathvariant="bold">x</m:mi>
                     </m:mrow>
                  </m:msub>
               </m:mtd>
            </m:mtr>
            <m:mtr>
               <m:mtd class="array" columnalign="center">
                  <m:msubsup>
                     <m:mrow>
                        <m:mi mathvariant="text">&#920;</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">y</m:mi>
                        <m:mi mathvariant="bold">x</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">T</m:mi>
                     </m:mrow>
                  </m:msubsup>
               </m:mtd>
               <m:mtd class="array" columnalign="center">
                  <m:msub>
                     <m:mrow>
                        <m:mi mathvariant="text">&#920;</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">x</m:mi>
                        <m:mi mathvariant="bold">x</m:mi>
                     </m:mrow>
                  </m:msub>
               </m:mtd>
            </m:mtr>
            <m:mtr>
               <m:mtd class="array" columnalign="center"/>
            </m:mtr>
         </m:mtable>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>and</p>
<p><display-formula id="M3"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i30"><m:mrow>
   <m:mi mathvariant="bold">S</m:mi>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mfrac>
      <m:mrow>
         <m:mn>1</m:mn>
      </m:mrow>
      <m:mrow>
         <m:mi>n</m:mi>
      </m:mrow>
   </m:mfrac>
   <m:mfenced close="]" open="[" separators="">
      <m:mrow>
         <m:mtable class="array" columnlines="none" equalcolumns="false" equalrows="false">
            <m:mtr>
               <m:mtd class="array" columnalign="center">
                  <m:msup>
                     <m:mrow>
                        <m:mi mathvariant="bold">Y</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">T</m:mi>
                     </m:mrow>
                  </m:msup>
                  <m:mi mathvariant="bold">Y</m:mi>
               </m:mtd>
               <m:mtd class="array" columnalign="center">
                  <m:msup>
                     <m:mrow>
                        <m:mi mathvariant="bold">Y</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">T</m:mi>
                     </m:mrow>
                  </m:msup>
                  <m:mi mathvariant="bold">X</m:mi>
               </m:mtd>
            </m:mtr>
            <m:mtr>
               <m:mtd class="array" columnalign="center">
                  <m:msup>
                     <m:mrow>
                        <m:mi mathvariant="bold">X</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">T</m:mi>
                     </m:mrow>
                  </m:msup>
                  <m:mi mathvariant="bold">Y</m:mi>
               </m:mtd>
               <m:mtd class="array" columnalign="center">
                  <m:msup>
                     <m:mrow>
                        <m:mi mathvariant="bold">X</m:mi>
                     </m:mrow>
                     <m:mrow>
                        <m:mi mathvariant="bold">T</m:mi>
                     </m:mrow>
                  </m:msup>
                  <m:mi mathvariant="bold">X</m:mi>
               </m:mtd>
            </m:mtr>
            <m:mtr>
               <m:mtd class="array" columnalign="center"/>
            </m:mtr>
         </m:mtable>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>being the sample covariance matrix, <b>X </b>and <b>Y </b>mean-centered, <b>X </b>being an <it>n </it>&#215; <it>m </it>matrix of genotypes and fixed effects, and <b>Y </b>being an <it>n </it>&#215; <it>p </it>matrix of expression or downstream phenotypes. The elements <it>&#952;<sub>ij </sub></it>of the matrix &#920; represent the pairwise Markov dependencies of the random variables <b>Y </b><abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Intuitively, the set of non-zero <it>&#952;<sub>ij </sub></it>parameters for a given random variable <it>y<sub>i</sub></it>, defines the set of other phenotypes once conditioned on, make <it>y<sub>i </sub></it>probabilistically independent from the rest of the variables in the model (also known as the neighborhood of <it>y<sub>i</sub></it>). In this model everything is conditional on the state of the entire set of genotypes and fixed effects. The non-zero structure of the &#920;<b><sub>yy </sub></b>sub-matrix specifies a conditional Markov random field among the expression or downstream phenotypes. Accordingly, the element <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i31"><m:msubsup>
   <m:mrow>
      <m:mi>&#952;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>y</m:mi>
      <m:mi>y</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>i</m:mi>
      <m:mi>j</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> for <it>i </it>&#8800; <it>j </it>of the &#920;<b><sub>yy </sub></b>matrix is zero iff</p>
<p><display-formula id="M4"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i32"><m:mrow>
   <m:mi>p</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>i</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi>y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>j</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-rel">|</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi mathvariant="bold">Y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mrow>
                  <m:mo class="MathClass-open">(</m:mo>
                  <m:mrow>
                     <m:mi>i</m:mi>
                     <m:mo class="MathClass-punc">,</m:mo>
                     <m:mi>j</m:mi>
                  </m:mrow>
                  <m:mo class="MathClass-close">)</m:mo>
               </m:mrow>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:mi mathvariant="bold">X</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mi>p</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>i</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-rel">|</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi mathvariant="bold">Y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mrow>
                  <m:mo class="MathClass-open">(</m:mo>
                  <m:mrow>
                     <m:mi>i</m:mi>
                     <m:mo class="MathClass-punc">,</m:mo>
                     <m:mi>j</m:mi>
                  </m:mrow>
                  <m:mo class="MathClass-close">)</m:mo>
               </m:mrow>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:mi mathvariant="bold">X</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mi>p</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>j</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-rel">|</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi mathvariant="bold">Y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mrow>
                  <m:mo class="MathClass-open">(</m:mo>
                  <m:mrow>
                     <m:mi>i</m:mi>
                     <m:mo class="MathClass-punc">,</m:mo>
                     <m:mi>j</m:mi>
                  </m:mrow>
                  <m:mo class="MathClass-close">)</m:mo>
               </m:mrow>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:mi mathvariant="bold">X</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>i.e. the probability distribution satisfies the local Markov property <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> with respect to an undirected graph <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i33"><m:mi mathvariant="script">G</m:mi>
<m:mo class="MathClass-rel">=</m:mo>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mstyle class="text">
         <m:mtext class="textsf" mathvariant="sans-serif">V</m:mtext>
      </m:mstyle>
      <m:mo class="MathClass-punc">,</m:mo>
      <m:mstyle class="text">
         <m:mtext class="textsf" mathvariant="sans-serif">E</m:mtext>
      </m:mstyle>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>, with <b>Y</b><sub>-(<it>i</it>, <it>j</it>) </sub>indicating the set of other phenotypes, excluding the single variables <it>y<sub>i </sub></it>and <it>y<sub>j</sub></it>. Since this is a Markov random field conditioned on <b>X </b>assuming an underlying linear model, the non-zero structure of the &#920;<b><sub>xy </sub></b>sub-matrix does not imply a factorization over an underlying probability density, but the element <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i34"><m:msubsup>
   <m:mrow>
      <m:mi>&#952;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>x</m:mi>
      <m:mi>y</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>i</m:mi>
      <m:mi>j</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> is zero iff</p>
<p><display-formula id="M5"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i35"><m:mrow>
   <m:mi>p</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>j</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-rel">|</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi mathvariant="bold">X</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mi mathvariant="bold">i</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi mathvariant="bold">Y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mi mathvariant="bold">j</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi>x</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>i</m:mi>
            </m:mrow>
         </m:msub>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mi>p</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>j</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-rel">|</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi mathvariant="bold">X</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mi mathvariant="bold">i</m:mi>
            </m:mrow>
         </m:msub>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:msub>
            <m:mrow>
               <m:mi mathvariant="bold">Y</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mi mathvariant="bold">j</m:mi>
            </m:mrow>
         </m:msub>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>i.e. the conditional distribution of <it>y<sub>j </sub></it>is the same, when conditioning on <b>X</b><sub>-<b>i </b></sub>and <b>Y</b><sub>-<b>j</b></sub>, whether one conditions on <it>x<sub>i </sub></it>or not. Finally, since this is a conditional Markov random field, the rank of the matrix &#920; is <it>p </it>and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i36"><m:msub>
   <m:mrow>
      <m:mi mathvariant="text">&#920;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi mathvariant="bold">x</m:mi>
      <m:mi mathvariant="bold">x</m:mi>
   </m:mrow>
</m:msub>
<m:mo class="MathClass-rel">=</m:mo>
<m:msub>
   <m:mrow>
      <m:mi mathvariant="text">&#920;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi mathvariant="bold">x</m:mi>
      <m:mi mathvariant="bold">y</m:mi>
   </m:mrow>
</m:msub>
<m:msubsup>
   <m:mrow>
      <m:mi mathvariant="text">&#920;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi mathvariant="bold">y</m:mi>
      <m:mi mathvariant="bold">y</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mo class="MathClass-bin">-</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
</m:msubsup>
<m:msub>
   <m:mrow>
      <m:mi mathvariant="text">&#920;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi mathvariant="bold">y</m:mi>
      <m:mi mathvariant="bold">x</m:mi>
   </m:mrow>
</m:msub>
</m:math></inline-formula>.</p>
</sec>
<sec><st><p>Undirected network inference</p></st>
<p>To infer the structure of the underlying undirected graph, many authors have proposed putting different forms of element-wise penalties on the &#920; matrix, such as the lasso (<it>l</it><sub>1 </sub>norm) <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B56">56</abbr></abbrgrp>. Additionally, as other authors have noted <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, the positive-semi definite constraint on &#920; imposed by the log{det} function in the log likelihood makes optimization of the full likelihood problem challenging for large scale problems, especially when the number of phenotypes and genotypes <it>p </it>+ <it>m </it>greatly exceeds the sample size <it>n</it>. Therefore, instead of solving the full likelihood optimization problem, we follow the general strategy of Meinshausen and B&#252;hlmann, Zhou et al., and Kraemer et al. <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B22">22</abbr></abbrgrp>, and treat the structure learning problem as a neighborhood identification problem; i.e. we perform model selection on a set of uncoupled regression equations, where each expression phenotype is regressed on every other phenotype, and genotype. At the end of this process we resolve the neighborhoods of each gene expression product by averaging the posterior probabilities of edge inclusion in both directions of regression.</p>
<p>We define a given multiple regression equation as:</p>
<p><display-formula id="M6"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i37"><m:mrow>
   <m:msub>
      <m:mrow>
         <m:mi>y</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
      </m:mrow>
   </m:msub>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mi>&#956;</m:mi>
   <m:mo class="MathClass-bin">+</m:mo>
   <m:munderover accent="false" accentunder="false">
      <m:mrow>
         <m:mo mathsize="big"> &#8721;</m:mo>
      </m:mrow>
      <m:mrow>
         <m:mi>j</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>p</m:mi>
         <m:mo class="MathClass-bin">-</m:mo>
         <m:mn>1</m:mn>
      </m:mrow>
   </m:munderover>
   <m:mspace class="thinspace" width="0.3em"/>
   <m:msub>
      <m:mrow>
         <m:mi>z</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
         <m:mi>j</m:mi>
      </m:mrow>
   </m:msub>
   <m:msubsup>
      <m:mrow>
         <m:mi>&#946;</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>j</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>y</m:mi>
      </m:mrow>
   </m:msubsup>
   <m:mo class="MathClass-bin">+</m:mo>
   <m:munderover accent="false" accentunder="false">
      <m:mrow>
         <m:mo mathsize="big"> &#8721;</m:mo>
      </m:mrow>
      <m:mrow>
         <m:mi>l</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>m</m:mi>
      </m:mrow>
   </m:munderover>
   <m:mspace class="thinspace" width="0.3em"/>
   <m:msub>
      <m:mrow>
         <m:mi>x</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
         <m:mi>l</m:mi>
      </m:mrow>
   </m:msub>
   <m:msubsup>
      <m:mrow>
         <m:mi>&#946;</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>l</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>x</m:mi>
      </m:mrow>
   </m:msubsup>
   <m:mo class="MathClass-bin">+</m:mo>
   <m:munderover accent="false" accentunder="false">
      <m:mrow>
         <m:mo mathsize="big"> &#8721;</m:mo>
      </m:mrow>
      <m:mrow>
         <m:mi>k</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>l</m:mi>
      </m:mrow>
   </m:munderover>
   <m:mspace class="thinspace" width="0.3em"/>
   <m:msub>
      <m:mrow>
         <m:mi>t</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
         <m:mi>k</m:mi>
      </m:mrow>
   </m:msub>
   <m:msub>
      <m:mrow>
         <m:mi>&#945;</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>k</m:mi>
      </m:mrow>
   </m:msub>
   <m:mo class="MathClass-bin">+</m:mo>
   <m:msub>
      <m:mrow>
         <m:mi>e</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
      </m:mrow>
   </m:msub>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>where <it>y<sub>i </sub></it>is <it>i<sup>th </sup></it>sample of a given phenotype, the population mean is modeled as a fixed effect <it>&#956;</it>, <it>z<sub>ij </sub></it>is the <it>i<sup>th </sup></it>sample of the <it>j<sup>th </sup></it>phenotype, excluding the phenotype <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i38"><m:mi>y</m:mi>
<m:mo class="MathClass-punc">,</m:mo>
<m:msubsup>
   <m:mrow>
      <m:mi>&#946;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>j</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>y</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> is the effect of the <it>j<sup>th </sup></it>phenotype, <it>x<sub>il </sub></it>is the <it>i<sup>th </sup></it>sample of the <it>l<sup>th </sup></it>genotype, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i39"><m:msubsup>
   <m:mrow>
      <m:mi>&#946;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>l</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>x</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> is the effect of the <it>l<sup>th </sup></it>genotype, <it>t<sub>ik </sub></it>is the <it>i<sup>th </sup></it>sample of the <it>k<sup>th </sup></it>non-penalized effect, <it>&#945;<sub>k </sub></it>is the effect of this <it>k<sup>th </sup></it>feature, and <it>e<sup>i </sup></it>is the residual error term, assumed to be normally distributed with mean zero, and variance <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i40"><m:msubsup>
   <m:mrow>
      <m:mi>&#963;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>e</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>2</m:mn>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>. In general we include the top 20 eigenvectors from a principal component analysis of the expression phenotypes as unpenalized covariates <it>t<sub>ik </sub></it>for each penalized regression model.</p>
</sec>
<sec><st><p>Connection between penalized regression solutions and the network model</p></st>
<p>While the likelihood defined in Equation 1 corresponds to a conditional GGM corresponding to the joint distribution of the gene expression phenotypes (<b>Y</b>) conditional on some set of genotypes and fixed effects (<b>X</b>), our approach focuses on solving a set of penalized regression equations for each phenotype (as in Equation 6). Our assumption (as in Meinshausen and B&#252;hlmann <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>) is that the set of variables that are selected in a particular regression model for a given phenotype (e.g. which <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i41"><m:msubsup>
   <m:mrow>
      <m:mi>&#946;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>j</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>y</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i39"><m:msubsup><m:mrow><m:mi>&#946;</m:mi></m:mrow><m:mrow><m:mi>l</m:mi></m:mrow><m:mrow><m:mi>x</m:mi></m:mrow></m:msubsup></m:math></inline-formula> are non-zero will exactly specify which set of elements of &#920; are non-zero). For example, if in the penalized regression model for the 5<it><sup>th </sup></it>phenotype we find that <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i42"><m:msubsup>
   <m:mrow>
      <m:mi>&#946;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>1</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>y</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i43"><m:msubsup>
   <m:mrow>
      <m:mi>&#946;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>3</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>y</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>, and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i44"><m:msubsup>
   <m:mrow>
      <m:mi>&#946;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>4</m:mn>
   </m:mrow>
   <m:mrow>
      <m:mi>x</m:mi>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> are non-zero, then this would indicate that the corresponding elements of &#920;, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i45"><m:msubsup>
   <m:mrow>
      <m:mi>&#952;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>y</m:mi>
      <m:mi>y</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>15</m:mn>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i46"><m:msubsup>
   <m:mrow>
      <m:mi>&#952;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>y</m:mi>
      <m:mi>y</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>35</m:mn>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula>, and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i47"><m:msubsup>
   <m:mrow>
      <m:mi>&#952;</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>y</m:mi>
      <m:mi>x</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>54</m:mn>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> would be non-zero, and the corresponding conditional independence properties implied by Equation (4) and Equation (5) would be true for these variables.</p>
</sec>
<sec><st><p>Bayesian hierarchical model for sparse feature selection</p></st>
<p>Given the regression equation defined in Equation 6, we define the following hierarchical model, similar in vein to Zhang et al. and Logsdon et al. <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B29">29</abbr></abbrgrp>:</p>
<p><display-formula id="M7"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i48"><m:mrow>
   <m:msub>
      <m:mrow>
         <m:mi>&#946;</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>j</m:mi>
      </m:mrow>
   </m:msub>
   <m:mo class="MathClass-rel">~</m:mo>
   <m:msub>
      <m:mrow>
         <m:mi>p</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>&#946;</m:mi>
         <m:mo class="MathClass-rel">=</m:mo>
         <m:mn>0</m:mn>
      </m:mrow>
   </m:msub>
   <m:mstyle class="text">
      <m:mtext class="textsf" mathvariant="sans-serif">I</m:mtext>
   </m:mstyle>
   <m:mfenced close="]" open="[" separators="">
      <m:mrow>
         <m:mi>&#946;</m:mi>
         <m:mo class="MathClass-rel">=</m:mo>
         <m:mn>0</m:mn>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-bin">+</m:mo>
   <m:msub>
      <m:mrow>
         <m:mi>p</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>&#946;</m:mi>
         <m:mo class="MathClass-rel">&#8800;</m:mo>
         <m:mn>0</m:mn>
      </m:mrow>
   </m:msub>
   <m:mi mathvariant="script">N</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:mn>0</m:mn>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:msubsup>
            <m:mrow>
               <m:mi>&#963;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>&#946;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mn>2</m:mn>
            </m:mrow>
         </m:msubsup>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p><display-formula id="M8"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i49"><m:mrow>
   <m:msub>
      <m:mrow>
         <m:mi>p</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>&#946;</m:mi>
         <m:mo class="MathClass-rel">=</m:mo>
         <m:mn>0</m:mn>
      </m:mrow>
   </m:msub>
   <m:mo class="MathClass-punc">,</m:mo>
   <m:msub>
      <m:mrow>
         <m:mi>p</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>&#946;</m:mi>
         <m:mo class="MathClass-rel">&#8800;</m:mo>
         <m:mn>0</m:mn>
      </m:mrow>
   </m:msub>
   <m:mo class="MathClass-rel">~</m:mo>
   <m:mstyle class="text">
      <m:mtext class="textsf" mathvariant="sans-serif">Beta</m:mtext>
   </m:mstyle>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:mn>1</m:mn>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:mn>1</m:mn>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p><display-formula id="M9"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i50"><m:mrow>
   <m:msubsup>
      <m:mrow>
         <m:mi>&#963;</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>&#946;</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mo class="MathClass-bin">-</m:mo>
         <m:mn>2</m:mn>
      </m:mrow>
   </m:msubsup>
   <m:mo class="MathClass-rel">~</m:mo>
   <m:mi mathvariant="text">&#915;</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:mn>2</m:mn>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:mn>1</m:mn>
         <m:mo class="MathClass-bin">/</m:mo>
         <m:mn>2</m:mn>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p><display-formula id="M10"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i51"><m:mrow>
   <m:msubsup>
      <m:mrow>
         <m:mi>&#963;</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>e</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mo class="MathClass-bin">-</m:mo>
         <m:mn>2</m:mn>
      </m:mrow>
   </m:msubsup>
   <m:mo class="MathClass-rel">~</m:mo>
   <m:mi mathvariant="text">&#915;</m:mi>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:mn>2</m:mn>
         <m:mo class="MathClass-punc">,</m:mo>
         <m:mn>1</m:mn>
         <m:mo class="MathClass-bin">/</m:mo>
         <m:mn>2</m:mn>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>with the additional truncation restriction on the prior distribution over <it>p</it><sub><it>&#946; </it>&#8800; 0 </sub>of <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i52"><m:msub>
   <m:mrow>
      <m:mi>p</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>&#946;</m:mi>
      <m:mo class="MathClass-rel">&#8800;</m:mo>
      <m:mn>0</m:mn>
   </m:mrow>
</m:msub>
<m:mo class="MathClass-rel">&#8804;</m:mo>
<m:msqrt>
   <m:mrow>
      <m:mi>n</m:mi>
   </m:mrow>
</m:msqrt>
<m:mo class="MathClass-bin">/</m:mo>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mi>m</m:mi>
      <m:mo class="MathClass-bin">+</m:mo>
      <m:mi>p</m:mi>
      <m:mo class="MathClass-bin">-</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>.</p>
<p>This mixture penalty in a Bayesian framework has attractive theoretical properties, including bounded shrinkage and indications that it may approach optimal efficiency for sparse underlying parameter spaces <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> and may still be model selection consistent when the irrepresentability condition is not met <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. One of the main advantages of this approach is that the hierarchical model can adaptively shrink the penalty to match the sparsity of the underlying parameter space, without having to resort to prediction based metrics like cross-validation which can overestimate model size or possibly heuristic model complexity measures based on information criterion such as Akaike's Information Criterion (AIC) or Bayesian Information Criterion (BIC).</p>
<p>Because the mixture penalty is non-convex, the posterior surface can be highly multi-modal and each mode in the posterior density can represent a different set of identified features (i.e. neighborhood). A well characterized weakness of the <it>l</it><sub>0 </sub>type penalty (i.e. best subset selection) is the instability of the identified solutions <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. One of the most important novel contributions of our algorithm is a Bayesian model averaging step <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. We perform Bayesian model averaging across the identified modes by re-weighting the posterior probability of inclusion for each feature (<inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i12"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub></m:math></inline-formula>), proportional to the estimated volume underneath each identified mode (a measure of the relative evidence of a given model) based on the lower bound, an approach similar to bagging <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. Because the algorithm is very fast, we can run it many times (up to thousands) and identify many models, along with the relative evidence of each model identified, based on the lower bound (Equation 17 in the Supporting Information), and integrate the evidence across the models. This approach is effective at integrating out model uncertainty, and generating the best estimates of which interactions are most strongly supported by the data.</p>
</sec>
<sec><st><p>Variational spike-and-slab algorithm</p></st>
<p>The variational Bayes approximation consists of minimizing the Kullback-Leibler divergence between an approximate factorized posterior distribution <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i53"><m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>1</m:mn>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>1</m:mn>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
<m:mo class="MathClass-rel">&#8943;</m:mo>
<m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>m</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>m</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
<m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
            <m:mo class="MathClass-rel">&#8800;</m:mo>
            <m:mn>0</m:mn>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>p</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
            <m:mo class="MathClass-rel">&#8800;</m:mo>
            <m:mn>0</m:mn>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
<m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>e</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>e</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
<m:msub>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:msub>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>&#946;</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula> and the full posterior distribution <it>p</it>(<it>&#946;</it><sub>1</sub>,...,<it>&#946;<sub>m</sub></it>, <it>p</it><sub><it>&#946; </it>&#8800; 0</sub>, <it>&#963;<sub>e</sub></it>, <it>&#963;<sub>&#946;</sub></it>), using iterative expectation type steps as in an Expectation-Maximization algorithm <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. Given this optimization procedure, the variational Bayes distributional approximation for the posterior distribution of an arbitrary parameter <it>&#952; </it>(i.e. <it>&#946;</it><sub>1</sub>,...,<it>&#946;</it><sub><it>p </it>+ <it>m</it>-1</sub>, <it>&#963;<sub>e</sub></it>, <it>&#963;<sub>&#946;</sub></it>, <it>p</it><sub><it>&#946; </it>&#8800; 0</sub>) is given as follows:</p>
<p><display-formula id="M11"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i54"><m:mrow>
   <m:msubsup>
      <m:mrow>
         <m:mi>q</m:mi>
      </m:mrow>
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>&#952;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>j</m:mi>
            </m:mrow>
         </m:msub>
      </m:mrow>
      <m:mrow>
         <m:mi>t</m:mi>
         <m:mo class="MathClass-bin">+</m:mo>
         <m:mn>1</m:mn>
      </m:mrow>
   </m:msubsup>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>&#952;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>j</m:mi>
            </m:mrow>
         </m:msub>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:mfrac>
      <m:mrow>
         <m:mn>1</m:mn>
      </m:mrow>
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>Z</m:mi>
            </m:mrow>
            <m:mrow>
               <m:msub>
                  <m:mrow>
                     <m:mi>&#952;</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mi>j</m:mi>
                  </m:mrow>
               </m:msub>
            </m:mrow>
         </m:msub>
      </m:mrow>
   </m:mfrac>
   <m:mstyle class="text">
      <m:mtext class="textsf" mathvariant="sans-serif">exp</m:mtext>
   </m:mstyle>
   <m:mfenced close="}" open="{" separators="">
      <m:mrow>
         <m:mo class="MathClass-op">&#8747;</m:mo>
         <m:msubsup>
            <m:mrow>
               <m:mi>q</m:mi>
            </m:mrow>
            <m:mrow>
               <m:msub>
                  <m:mrow>
                     <m:mi>&#952;</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mo class="MathClass-bin">-</m:mo>
                     <m:mi>j</m:mi>
                  </m:mrow>
               </m:msub>
            </m:mrow>
            <m:mrow>
               <m:mi>t</m:mi>
            </m:mrow>
         </m:msubsup>
         <m:mfenced close=")" open="(" separators="">
            <m:mrow>
               <m:msub>
                  <m:mrow>
                     <m:mi>&#952;</m:mi>
                  </m:mrow>
                  <m:mrow>
                     <m:mo class="MathClass-bin">-</m:mo>
                     <m:mi>j</m:mi>
                  </m:mrow>
               </m:msub>
            </m:mrow>
         </m:mfenced>
         <m:mi>d</m:mi>
         <m:msub>
            <m:mrow>
               <m:mi>&#952;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mo class="MathClass-bin">-</m:mo>
               <m:mi>j</m:mi>
            </m:mrow>
         </m:msub>
         <m:mstyle class="text">
            <m:mtext class="textsf" mathvariant="sans-serif">log</m:mtext>
         </m:mstyle>
         <m:mfenced close="}" open="{" separators="">
            <m:mrow>
               <m:mi>p</m:mi>
               <m:mfenced close=")" open="(" separators="">
                  <m:mrow>
                     <m:mi>&#952;</m:mi>
                     <m:mo class="MathClass-rel">|</m:mo>
                     <m:mi mathvariant="bold">y</m:mi>
                  </m:mrow>
               </m:mfenced>
            </m:mrow>
         </m:mfenced>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>where a factorization is defined over the joint approximate posterior distribution of parameters:</p>
<p><display-formula id="M12"><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i55"><m:mrow>
   <m:msub>
      <m:mrow>
         <m:mi>q</m:mi>
      </m:mrow>
      <m:mrow>
         <m:mi>&#952;</m:mi>
      </m:mrow>
   </m:msub>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:mi>&#952;</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-rel">=</m:mo>
   <m:munder class="msub">
      <m:mrow>
         <m:mo mathsize="big"> &#8719;</m:mo>
      </m:mrow>
      <m:mrow>
         <m:mi>i</m:mi>
      </m:mrow>
   </m:munder>
   <m:mspace class="thinspace" width="0.3em"/>
   <m:msub>
      <m:mrow>
         <m:mi>q</m:mi>
      </m:mrow>
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>&#952;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>i</m:mi>
            </m:mrow>
         </m:msub>
      </m:mrow>
   </m:msub>
   <m:mfenced close=")" open="(" separators="">
      <m:mrow>
         <m:msub>
            <m:mrow>
               <m:mi>&#952;</m:mi>
            </m:mrow>
            <m:mrow>
               <m:mi>i</m:mi>
            </m:mrow>
         </m:msub>
      </m:mrow>
   </m:mfenced>
   <m:mo class="MathClass-punc">,</m:mo>
</m:mrow>
</m:math>
</display-formula></p>
<p>and the integral in Equation 11 at iteration <it>t </it>is taken with respect to every approximate distribution except <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i56"><m:msubsup>
   <m:mrow>
      <m:mi>q</m:mi>
   </m:mrow>
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#952;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>j</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
   <m:mrow>
      <m:mi>t</m:mi>
   </m:mrow>
</m:msubsup>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:msub>
         <m:mrow>
            <m:mi>&#952;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mi>j</m:mi>
         </m:mrow>
      </m:msub>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>. The posterior density <it>p</it>(<it>&#952;</it>|<b>y</b>) in Equation 11 is defined based on multiplying the likelihood for the model in Equation 6 with the priors in Equation 7-10. The details of this approximation for each density are presented in the Additional file <supplr sid="S1">1</supplr>. This factorization is required to solve for closed form iterative updates associated with the spike-and-slab prior distribution. A probability of inclusion statistic, <it>p<sub>j </sub></it>is computed after the algorithm converges, and this statistic is averaged across all models identified (i.e. modes in the posterior surface), based on the total evidence for each model (i.e. Equation 17 in the Supporting Information). This model averaged probability of inclusion statistic, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i12"><m:msub><m:mrow><m:mover accent="true"><m:mrow><m:mi>p</m:mi></m:mrow><m:mo class="MathClass-op">^</m:mo></m:mover></m:mrow><m:mrow><m:mi>j</m:mi></m:mrow></m:msub></m:math></inline-formula> is used to determine whether the <it>j<sup>th </sup></it>feature is included in the model, at a given threshold.</p>
</sec>
<sec><st><p>Lasso with type I error control</p></st>
<p>We compared our variational spike-and-slab algorithm with two alternative methods proposed to bound the number of type I errors when using lasso penalized regression. The first method we compared was the randomized lasso with stability selection, as described by Meinshausen and B&#252;hlmann <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. As with our method, we performed this approach by solving a penalized regression model for each phenotype in the network individually, then afterwards we compared different methods for combining the results. As in Meinshausen and B&#252;hlmann <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, we implemented the randomized lasso with sampling weights for the variables in the regression sampled from a Unif(0.2, 1.0), then we performed the stability selection procedure by sampling <it>n</it>/2 observations, and running 100 replicate instances of this two-level randomized algorithm. This was run using the glmnet package in R <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> on a grid of 100 logarithmically spaced penalty parameters <it>&#955; </it>= {10<sup>-2</sup>,...,10<sup>2</sup>} (this range of penalization was sufficient such that the approach never chose a level of penalization on the boundaries). As in Meinshausen and B&#252;hlmann <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, we focused on bounding the number of type I errors by choosing an empirical probability of selection cutoff of 0.9, then choosing the level of penalization that returns the expected number of selected variables which satisfies the bound on the number of false positives <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. We looked at bounds of both 1 and 1000 expected false positives for the entire network, to explore both conservative and liberal choices of the cutoffs.</p>
<p>The alternative approach we used to bound the number of type I errors was based on the original Meinshausen and B&#252;hlmann network algorithm <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, where they bound the number of type I errors between connectivity components in a graph. Intuitively, a connectivity component is just the set of variables for any given variable that can be reached through some path within the graph. We used their choice of penalty parameter <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i57"><m:mi>&#955;</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mi>&#945;</m:mi>
   </m:mrow>
</m:mfenced>
<m:mo class="MathClass-rel">=</m:mo>
<m:mn>2</m:mn>
<m:mfrac>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi>&#963;</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op">^</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:msqrt>
         <m:mrow>
            <m:mi>n</m:mi>
         </m:mrow>
      </m:msqrt>
   </m:mrow>
</m:mfrac>
<m:msup>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi mathvariant="text">&#934;</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op">&#771;</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:mo class="MathClass-bin">-</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
</m:msup>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mfrac>
         <m:mrow>
            <m:mi>&#945;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mn>2</m:mn>
            <m:msup>
               <m:mrow>
                  <m:mi>p</m:mi>
               </m:mrow>
               <m:mrow>
                  <m:mn>2</m:mn>
               </m:mrow>
            </m:msup>
         </m:mrow>
      </m:mfrac>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>, where <it>&#945; </it>is the probability of making a type I error, <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i58"><m:mover accent="true">
   <m:mrow>
      <m:mi>&#963;</m:mi>
   </m:mrow>
   <m:mo class="MathClass-op">^</m:mo>
</m:mover>
<m:mo class="MathClass-rel">=</m:mo>
<m:msup>
   <m:mrow>
      <m:mi>n</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mo class="MathClass-bin">-</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
</m:msup>
<m:msub>
   <m:mrow>
      <m:mo class="MathClass-op"> &#8721;</m:mo>
   </m:mrow>
   <m:mrow>
      <m:mi>i</m:mi>
   </m:mrow>
</m:msub>
<m:mspace class="thinspace" width="0.3em"/>
<m:msubsup>
   <m:mrow>
      <m:mi>y</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mi>i</m:mi>
   </m:mrow>
   <m:mrow>
      <m:mn>2</m:mn>
   </m:mrow>
</m:msubsup>
</m:math></inline-formula> for any given phenotype <it>y</it>, <it>p </it>is the number of variables in the model, and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i59"><m:msup>
   <m:mrow>
      <m:mover accent="true">
         <m:mrow>
            <m:mi mathvariant="text">&#934;</m:mi>
         </m:mrow>
         <m:mo class="MathClass-op">&#771;</m:mo>
      </m:mover>
   </m:mrow>
   <m:mrow>
      <m:mo class="MathClass-bin">-</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
</m:msup>
<m:mo class="MathClass-rel">=</m:mo>
<m:mn>1</m:mn>
<m:mo class="MathClass-bin">-</m:mo>
<m:mi mathvariant="text">&#934;</m:mi>
</m:math></inline-formula> (with &#934; the c.d.f. of the standard normal density function). We investigated bounds of both 1 and 1000 expected false positives for the entire network, by running the regular lasso penalized regression method with the glmnet package in R <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> and this level of penalization.</p>
</sec>
<sec><st><p>Simulations</p></st>
<p>For the simulations depicted in Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>, we simulated twenty replicate networks through the following procedure: first, a random directed graph with <it>p </it>= 1000 variables and adjacency matrix <b>A </b>was generated by sampling edges between variables with probability 1/<it>p </it>= 10<sup>-3 </sup>for all <it>p</it>(<it>p</it>-1) edges. Next, each edge was weighted by a <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i60"><m:mi mathvariant="script">N</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mn>0</m:mn>
      <m:mo class="MathClass-punc">,</m:mo>
      <m:mn>1</m:mn>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula> random variable. Third, the diagonal of <b>A </b>was set to one. Finally, we constructed the precision matrix &#920; = <b>AA<sup>T </sup></b>(i.e. inverse covariance matrix), and sampled a data-set <b>Y </b>from the multivariate normal distribution <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-13-53-i61"><m:mi mathvariant="script">N</m:mi>
<m:mfenced close=")" open="(" separators="">
   <m:mrow>
      <m:mn>0</m:mn>
      <m:mo class="MathClass-punc">,</m:mo>
      <m:msup>
         <m:mrow>
            <m:mi mathvariant="text">&#920;</m:mi>
         </m:mrow>
         <m:mrow>
            <m:mo class="MathClass-bin">-</m:mo>
            <m:mn>1</m:mn>
         </m:mrow>
      </m:msup>
   </m:mrow>
</m:mfenced>
</m:math></inline-formula>, with 300 independent observations. Because of the moralization (i.e. edges induced between nodes that share parents <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>) produced by converting the directed graph <b>A </b>into an undirected graph &#920;, the average number of edges per node was 1.47 instead of &#8776;1. It is important to note that we did not include any unpenalized covariates or genotypes in this particular simulation. Additional algorithms used for comparison with analysis of further simulated data are shown in the Additional file <supplr sid="S1">1</supplr>.</p>
</sec>
<sec><st><p>Data analysis</p></st>
<p>We analyzed the F<sub>2 </sub>progeny of a cross between the C57BL/6 J (B6) and C3H/HeJ (C3H) strains on an apolipoprotein E null (ApoE -/-) background (BXH.ApoE<sup>-/-</sup>) <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B31">31</abbr></abbrgrp>. After further filtering the data to a set of shared samples across all variables, we were left with 298 individuals, 22 downstream phenotypes, 1,347 genetic markers, and 23,574 expression probes. We included sex as well as the 20 first eigenvectors from a principal component analysis computed across samples for expression phenotypes as fixed effects, and ran the first phase of the algorithm (i.e. feature selection on the downstream phenotypes) with 1,000 random restarts of the algorithm to get good coverage of the posterior probability surface associated with model uncertainty for each downstream phenotype. We then ran the second phase of the algorithm between just expression traits and genotypes, still incorporating the 20 eigenvectors from principal component analysis and sex as fixed effects, with 50 random restarts. All network diagrams in Figures <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr> were generated with the network package in R <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>.</p>
</sec>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>BAL derived and implemented the algorithm, and performed the simulations and data analysis. BAL and JGM designed the simulations and data analysis. GEH implemented and ran the lasso algorithm for the data analysis. BAL and JGM wrote the manuscript. All authors read and approved the final version of this manuscript.</p>
</sec>
</bdy>
<bm>
<ack>
<sec><st><p>Acknowledgements</p></st>
<p>We thank Larsson Omberg, Rami Mahdi, Thomas Vincent, and Jean-Luc Jannik for discussion and for their comments on this manuscript. We would also like to thank two anonymous reviewers for suggestions which significantly improved the quality of this manuscript. This work was supported by a fellowship from the Center for Vertebrate Genomics and Center for Comparative and Population Genomics at Cornell University and by National Science Foundation Grant DEB0922432.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Variations in DNA elucidate molecular networks that cause disease</p></title><aug><au><snm>Chen</snm><fnm>Y</fnm></au><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Lum</snm><fnm>P</fnm></au><au><snm>Yang</snm><fnm>X</fnm></au><au><snm>Pinto</snm><fnm>S</fnm></au><au><snm>MacNeil</snm><fnm>D</fnm></au><au><snm>Zhang</snm><fnm>C</fnm></au><au><snm>Lamb</snm><fnm>J</fnm></au><au><snm>Edwards</snm><fnm>S</fnm></au><au><snm>Sieberts</snm><fnm>S</fnm></au><etal/></aug><source>Nature</source><pubdate>2008</pubdate><volume>452</volume><issue>7186</issue><fpage>429</fpage><lpage>435</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature06757</pubid><pubid idtype="pmcid">2841398</pubid><pubid idtype="pmpid" link="fulltext">18344982</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Genetics of gene expression and its effect on disease</p></title><aug><au><snm>Emilsson</snm><fnm>V</fnm></au><au><snm>Thorleifsson</snm><fnm>G</fnm></au><au><snm>Zhang</snm><fnm>B</fnm></au><au><snm>Leonardson</snm><fnm>A</fnm></au><au><snm>Zink</snm><fnm>F</fnm></au><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Carlson</snm><fnm>S</fnm></au><au><snm>Helgason</snm><fnm>A</fnm></au><au><snm>Walters</snm><fnm>G</fnm></au><au><snm>Gunnarsdottir</snm><fnm>S</fnm></au><etal/></aug><source>Nature</source><pubdate>2008</pubdate><volume>452</volume><issue>7186</issue><fpage>423</fpage><lpage>428</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature06758</pubid><pubid idtype="pmpid" link="fulltext">18344981</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Molecular networks as sensors and drivers of common human diseases</p></title><aug><au><snm>Schadt</snm><fnm>E</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>461</volume><issue>7261</issue><fpage>218</fpage><lpage>223</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature08454</pubid><pubid idtype="pmpid" link="fulltext">19741703</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks</p></title><aug><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Zhang</snm><fnm>B</fnm></au><au><snm>Smith</snm><fnm>E</fnm></au><au><snm>Drees</snm><fnm>B</fnm></au><au><snm>Brem</snm><fnm>R</fnm></au><au><snm>Kruglyak</snm><fnm>L</fnm></au><au><snm>Bumgarner</snm><fnm>R</fnm></au><au><snm>Schadt</snm><fnm>E</fnm></au></aug><source>Nat Genet</source><pubdate>2008</pubdate><volume>40</volume><issue>7</issue><fpage>854</fpage><lpage>861</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.167</pubid><pubid idtype="pmcid">2573859</pubid><pubid idtype="pmpid" link="fulltext">18552845</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Reverse engineering the genotype-phenotype map with natural genetic variation</p></title><aug><au><snm>Rockman</snm><fnm>M</fnm></au></aug><source>Nature</source><pubdate>2008</pubdate><volume>456</volume><issue>7223</issue><fpage>738</fpage><lpage>744</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07633</pubid><pubid idtype="pmpid" link="fulltext">19079051</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Validation of candidate causal genes for obesity that affect shared metabolic pathways and networks</p></title><aug><au><snm>Yang</snm><fnm>X</fnm></au><au><snm>Deignan</snm><fnm>J</fnm></au><au><snm>Qi</snm><fnm>H</fnm></au><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Qian</snm><fnm>S</fnm></au><au><snm>Zhong</snm><fnm>J</fnm></au><au><snm>Torosyan</snm><fnm>G</fnm></au><au><snm>Majid</snm><fnm>S</fnm></au><au><snm>Falkard</snm><fnm>B</fnm></au><au><snm>Kleinhanz</snm><fnm>R</fnm></au><etal/></aug><source>Nat Genet</source><pubdate>2009</pubdate><volume>41</volume><issue>4</issue><fpage>415</fpage><lpage>423</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.325</pubid><pubid idtype="pmcid">2837947</pubid><pubid idtype="pmpid" link="fulltext">19270708</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Causal protein-signaling networks derived from multiparameter single-cell data</p></title><aug><au><snm>Sachs</snm><fnm>K</fnm></au><au><snm>Perez</snm><fnm>O</fnm></au><au><snm>Pe'er</snm><fnm>D</fnm></au><au><snm>Lauffenburger</snm><fnm>D</fnm></au><au><snm>Nolan</snm><fnm>G</fnm></au></aug><source>Science</source><pubdate>2005</pubdate><volume>308</volume><issue>5721</issue><fpage>523</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1105809</pubid><pubid idtype="pmpid">15845847</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>A general framework for weighted gene co-expression network analysis</p></title><aug><au><snm>Zhang</snm><fnm>B</fnm></au><au><snm>Horvath</snm><fnm>S</fnm></au></aug><source>Stat Appl Genet Mol Biol</source><pubdate>2005</pubdate><volume>4</volume><fpage>1128</fpage></bibl><bibl id="B9"><title><p>Integrating genetic and network analysis to characterize genes related to mouse weight</p></title><aug><au><snm>Ghazalpour</snm><fnm>A</fnm></au><au><snm>Doss</snm><fnm>S</fnm></au><au><snm>Zhang</snm><fnm>B</fnm></au><au><snm>Wang</snm><fnm>S</fnm></au><au><snm>Plaisier</snm><fnm>C</fnm></au><au><snm>Castellanos</snm><fnm>R</fnm></au><au><snm>Brozell</snm><fnm>A</fnm></au><au><snm>Schadt</snm><fnm>E</fnm></au><au><snm>Drake</snm><fnm>T</fnm></au><au><snm>Lusis</snm><fnm>A</fnm></au><etal/></aug><source>PLoS Genet</source><pubdate>2006</pubdate><volume>2</volume><issue>8</issue><fpage>e130</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0020130</pubid><pubid idtype="pmcid">1550283</pubid><pubid idtype="pmpid" link="fulltext">16934000</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Weighted gene coexpression network analysis strategies applied to mouse weight</p></title><aug><au><snm>Fuller</snm><fnm>T</fnm></au><au><snm>Ghazalpour</snm><fnm>A</fnm></au><au><snm>Aten</snm><fnm>J</fnm></au><au><snm>Drake</snm><fnm>T</fnm></au><au><snm>Lusis</snm><fnm>A</fnm></au><au><snm>Horvath</snm><fnm>S</fnm></au></aug><source>Mamm Genome</source><pubdate>2007</pubdate><volume>18</volume><issue>6</issue><fpage>463</fpage><lpage>472</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00335-007-9043-3</pubid><pubid idtype="pmcid">1998880</pubid><pubid idtype="pmpid" link="fulltext">17668265</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context</p></title><aug><au><snm>Margolin</snm><fnm>A</fnm></au><au><snm>Nemenman</snm><fnm>I</fnm></au><au><snm>Basso</snm><fnm>K</fnm></au><au><snm>Wiggins</snm><fnm>C</fnm></au><au><snm>Stolovitzky</snm><fnm>G</fnm></au><au><snm>Favera</snm><fnm>R</fnm></au><au><snm>Califano</snm><fnm>A</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2006</pubdate><volume>7</volume><issue>Suppl 1</issue><fpage>S7</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-7-S1-S7</pubid></xrefbib></bibl><bibl id="B12"><title><p>High-dimensional graphs and variable selection with the lasso</p></title><aug><au><snm>Meinshausen</snm><fnm>N</fnm></au><au><snm>B&#252;hlmann</snm><fnm>P</fnm></au></aug><source>Ann Stat</source><pubdate>2006</pubdate><volume>34</volume><issue>3</issue><fpage>1436</fpage><lpage>1462</lpage><xrefbib><pubid idtype="doi">10.1214/009053606000000281</pubid></xrefbib></bibl><bibl id="B13"><title><p>Adaptive Lasso for high dimensional regression and Gaussian graphical modeling</p></title><aug><au><snm>Zhou</snm><fnm>S</fnm></au><au><snm>Van De Geer</snm><fnm>S</fnm></au><au><snm>B&#252;hlmann</snm><fnm>P</fnm></au></aug><source>Arxiv preprint arXiv:0903.2515</source><pubdate>2009</pubdate></bibl><bibl id="B14"><title><p>An empirical Bayes approach to inferring large-scale gene association networks</p></title><aug><au><snm>Schafer</snm><fnm>J</fnm></au><au><snm>Strimmer</snm><fnm>K</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><issue>6</issue><fpage>754</fpage><lpage>764</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti062</pubid><pubid idtype="pmpid" link="fulltext">15479708</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Using Bayesian networks to analyze expression data</p></title><aug><au><snm>Friedman</snm><fnm>N</fnm></au><au><snm>Linial</snm><fnm>M</fnm></au><au><snm>Nachman</snm><fnm>I</fnm></au><au><snm>Pe'er</snm><fnm>D</fnm></au></aug><source>J Comput Biol</source><pubdate>2000</pubdate><volume>7</volume><issue>3-4</issue><fpage>601</fpage><lpage>620</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1089/106652700750050961</pubid><pubid idtype="pmpid" link="fulltext">11108481</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Inferring subnetworks from perturbed expression profiles</p></title><aug><au><snm>Pe'er</snm><fnm>D</fnm></au><au><snm>Regev</snm><fnm>A</fnm></au><au><snm>Elidan</snm><fnm>G</fnm></au><au><snm>Friedman</snm><fnm>N</fnm></au></aug><source>Bioinformatics</source><pubdate>2001</pubdate><volume>17</volume><issue>Suppl 1</issue><fpage>S215</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/17.suppl_1.S215</pubid><pubid idtype="pmpid" link="fulltext">11473012</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Estimating high-dimensional directed acyclic graphs with the PC-algorithm</p></title><aug><au><snm>Kalisch</snm><fnm>M</fnm></au><au><snm>B&#252;hlmann</snm><fnm>P</fnm></au></aug><source>J Mach Learn Res</source><pubdate>2007</pubdate><volume>8</volume><fpage>613</fpage><lpage>636</lpage></bibl><bibl id="B18"><title><p>Increasing the Power to Detect Causal Associations by Combining Genotypic and Expression Data in Segregating Populations</p></title><aug><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Wiener</snm><fnm>M</fnm></au><au><snm>Zhang</snm><fnm>C</fnm></au><au><snm>Fridman</snm><fnm>A</fnm></au><au><snm>Minch</snm><fnm>E</fnm></au><au><snm>Lum</snm><fnm>P</fnm></au><au><snm>Sachs</snm><fnm>J</fnm></au><au><snm>Schadt</snm><fnm>E</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2007</pubdate><volume>3</volume><issue>4</issue><fpage>e69</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.0030069</pubid><pubid idtype="pmcid">1851982</pubid><pubid idtype="pmpid" link="fulltext">17432931</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>An integrative genomics approach to infer causal associations between gene expression and disease</p></title><aug><au><snm>Schadt</snm><fnm>E</fnm></au><au><snm>Lamb</snm><fnm>J</fnm></au><au><snm>Yang</snm><fnm>X</fnm></au><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Edwards</snm><fnm>S</fnm></au><au><snm>GuhaThakurta</snm><fnm>D</fnm></au><au><snm>Sieberts</snm><fnm>S</fnm></au><au><snm>Monks</snm><fnm>S</fnm></au><au><snm>Reitman</snm><fnm>M</fnm></au><au><snm>Zhang</snm><fnm>C</fnm></au><etal/></aug><source>Nat Genet</source><pubdate>2005</pubdate><volume>37</volume><issue>7</issue><fpage>710</fpage><lpage>717</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1589</pubid><pubid idtype="pmcid">2841396</pubid><pubid idtype="pmpid" link="fulltext">15965475</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Gene expression network reconstruction by convex feature selection when incorporating genetic perturbations</p></title><aug><au><snm>Logsdon</snm><fnm>B</fnm></au><au><snm>Mezey</snm><fnm>J</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2010</pubdate><volume>6</volume><issue>12</issue><fpage>429</fpage><lpage>435</lpage></bibl><bibl id="B21"><title><p>Correction for hidden confounders in the genetic analysis of gene expression</p></title><aug><au><snm>Listgarten</snm><fnm>J</fnm></au><au><snm>Kadie</snm><fnm>C</fnm></au><au><snm>Schadt</snm><fnm>E</fnm></au><au><snm>Heckerman</snm><fnm>D</fnm></au></aug><source>Proc Natl Acad Sci Unit States Am</source><pubdate>2010</pubdate><volume>107</volume><issue>38</issue><fpage>16465</fpage><xrefbib><pubid idtype="doi">10.1073/pnas.1002425107</pubid></xrefbib></bibl><bibl id="B22"><title><p>Regularized estimation of large-scale gene association networks using graphical Gaussian models</p></title><aug><au><snm>Kraemer</snm><fnm>N</fnm></au><au><snm>Schafer</snm><fnm>J</fnm></au><au><snm>Boulesteix</snm><fnm>A</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2009</pubdate><volume>10</volume><fpage>384</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-10-384</pubid></xrefbib></bibl><bibl id="B23"><title><p>Variable selection via Gibbs sampling</p></title><aug><au><snm>George</snm><fnm>E</fnm></au><au><snm>McCulloch</snm><fnm>R</fnm></au></aug><source>J Am Stat Assoc</source><pubdate>1993</pubdate><volume>88</volume><issue>423</issue><fpage>881</fpage><lpage>889</lpage><xrefbib><pubid idtype="doi">10.2307/2290777</pubid></xrefbib></bibl><bibl id="B24"><title><p>Mapping multiple quantitative trait loci by Bayesian classification</p></title><aug><au><snm>Zhang</snm><fnm>M</fnm></au><au><snm>Montooth</snm><fnm>K</fnm></au><au><snm>Wells</snm><fnm>M</fnm></au><au><snm>Clark</snm><fnm>A</fnm></au><au><snm>Zhang</snm><fnm>D</fnm></au></aug><source>Genetics</source><pubdate>2005</pubdate><volume>169</volume><issue>4</issue><fpage>2305</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1534/genetics.104.034181</pubid><pubid idtype="pmcid">1449613</pubid><pubid idtype="pmpid" link="fulltext">15520261</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Variable selection for large p small n regression models with incomplete data: mapping QTL with epistases</p></title><aug><au><snm>Zhang</snm><fnm>M</fnm></au><au><snm>Zhang</snm><fnm>D</fnm></au><au><snm>Wells</snm><fnm>M</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2008</pubdate><volume>9</volume><fpage>251</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-9-251</pubid></xrefbib></bibl><bibl id="B26"><title><p>On model selection consistency of Lasso</p></title><aug><au><snm>Zhao</snm><fnm>P</fnm></au><au><snm>Yu</snm><fnm>B</fnm></au></aug><source>J Mach Learn Res</source><pubdate>2006</pubdate><volume>7</volume><fpage>2541</fpage><lpage>2563</lpage></bibl><bibl id="B27"><title><p>A majorization-minimization approach to variable selection using spike and slab priors</p></title><aug><au><snm>Yen</snm><fnm>T</fnm></au></aug><source>Ann Stat</source><pubdate>2011</pubdate><volume>39</volume><issue>3</issue><fpage>1748</fpage><lpage>1774</lpage><xrefbib><pubid idtype="doi">10.1214/11-AOS884</pubid></xrefbib></bibl><bibl id="B28"><title><p>On the degrees of freedom of the lasso</p></title><aug><au><snm>Zou</snm><fnm>H</fnm></au><au><snm>Hastie</snm><fnm>T</fnm></au><au><snm>Tibshirani</snm><fnm>R</fnm></au></aug><source>Ann Stat</source><pubdate>2007</pubdate><volume>35</volume><issue>5</issue><fpage>2173</fpage><lpage>2192</lpage><xrefbib><pubid idtype="doi">10.1214/009053607000000127</pubid></xrefbib></bibl><bibl id="B29"><title><p>A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis</p></title><aug><au><snm>Logsdon</snm><fnm>B</fnm></au><au><snm>Hoffman</snm><fnm>G</fnm></au><au><snm>Mezey</snm><fnm>J</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2010</pubdate><volume>11</volume><fpage>58</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-11-58</pubid></xrefbib></bibl><bibl id="B30"><title><p>Principal components analysis corrects for stratification in genome-wide association studies</p></title><aug><au><snm>Price</snm><fnm>A</fnm></au><au><snm>Patterson</snm><fnm>N</fnm></au><au><snm>Plenge</snm><fnm>R</fnm></au><au><snm>Weinblatt</snm><fnm>M</fnm></au><au><snm>Shadick</snm><fnm>N</fnm></au><au><snm>Reich</snm><fnm>D</fnm></au></aug><source>Nat Genet</source><pubdate>2006</pubdate><volume>38</volume><issue>8</issue><fpage>904</fpage><lpage>909</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1847</pubid><pubid idtype="pmpid" link="fulltext">16862161</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Genetic and genomic analysis of a fat mass trait with complex inheritance reveals marked sex specificity</p></title><aug><au><snm>Wang</snm><fnm>S</fnm></au><au><snm>Yehya</snm><fnm>N</fnm></au><au><snm>Schadt</snm><fnm>E</fnm></au><au><snm>Wang</snm><fnm>H</fnm></au><au><snm>Drake</snm><fnm>T</fnm></au><au><snm>Lusis</snm><fnm>A</fnm></au></aug><source>PLoS Genet</source><pubdate>2006</pubdate><volume>2</volume><issue>2</issue><fpage>e15</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0020015</pubid><pubid idtype="pmcid">1359071</pubid><pubid idtype="pmpid" link="fulltext">16462940</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Regression shrinkage and selection via the lasso</p></title><aug><au><snm>Tibshirani</snm><fnm>R</fnm></au></aug><source>J R Stat Soc B Met</source><pubdate>1996</pubdate><volume>58</volume><fpage>267</fpage><lpage>288</lpage></bibl><bibl id="B33"><title><p>The adaptive lasso and its oracle properties</p></title><aug><au><snm>Zou</snm><fnm>H</fnm></au></aug><source>J Am Stat Assoc</source><pubdate>2006</pubdate><volume>101</volume><issue>476</issue><fpage>1418</fpage><lpage>1429</lpage><xrefbib><pubid idtype="doi">10.1198/016214506000000735</pubid></xrefbib></bibl><bibl id="B34"><aug><au><snm>Lauritzen</snm><fnm>S</fnm></au></aug><source>Graphical models</source><publisher>NY, NY: Oxford University Press</publisher><pubdate>1996</pubdate></bibl><bibl id="B35"><title><p>Regularization and variable selection via the elastic net</p></title><aug><au><snm>Zou</snm><fnm>H</fnm></au><au><snm>Hastie</snm><fnm>T</fnm></au></aug><source>J R Stat Soc B Met</source><pubdate>2005</pubdate><volume>67</volume><issue>2</issue><fpage>301</fpage><lpage>320</lpage><xrefbib><pubid idtype="doi">10.1111/j.1467-9868.2005.00503.x</pubid></xrefbib></bibl><bibl id="B36"><title><p>Robust regression: asymptotics, conjectures and Monte Carlo</p></title><aug><au><snm>Huber</snm><fnm>P</fnm></au></aug><source>Ann Stat</source><pubdate>1973</pubdate><volume>1</volume><issue>5</issue><fpage>799</fpage><lpage>821</lpage><xrefbib><pubid idtype="doi">10.1214/aos/1176342503</pubid></xrefbib></bibl><bibl id="B37"><title><p>Asymptotic behavior of -estimators of regression parameters when <sup>2 </sup>is large. I. Consistency</p></title><aug><au><snm>Portnoy</snm><fnm>S</fnm></au></aug><source>Ann Stat</source><pubdate>1984</pubdate><volume>12</volume><issue>4</issue><fpage>1298</fpage><lpage>1309</lpage><xrefbib><pubid idtype="doi">10.1214/aos/1176346793</pubid></xrefbib></bibl><bibl id="B38"><title><p>Maximum likelihood from incomplete data via the EM algorithm</p></title><aug><au><snm>Dempster</snm><fnm>A</fnm></au><au><snm>Laird</snm><fnm>N</fnm></au><au><snm>Rubin</snm><fnm>D</fnm></au></aug><source>J R Stat Soc B Met</source><pubdate>1977</pubdate><volume>39</volume><fpage>1</fpage><lpage>38</lpage></bibl><bibl id="B39"><aug><au><snm>Bishop</snm><fnm>C</fnm></au></aug><source>Pattern Recognition and Machine Learning</source><publisher>New York: Springer</publisher><pubdate>2006</pubdate><volume>4</volume></bibl><bibl id="B40"><title><p>Stability selection</p></title><aug><au><snm>Meinshausen</snm><fnm>N</fnm></au><au><snm>B&#252;hlmann</snm><fnm>P</fnm></au></aug><source>J R Stat Soc B Met</source><pubdate>2010</pubdate><volume>72</volume><issue>4</issue><fpage>417</fpage><lpage>473</lpage><xrefbib><pubid idtype="doi">10.1111/j.1467-9868.2010.00740.x</pubid></xrefbib></bibl><bibl id="B41"><title><p>Gene association networks from microarray data using a regularized estimation of partial correlation based on PLS regression</p></title><aug><au><snm>Tenenhaus</snm><fnm>A</fnm></au><au><snm>Guillemot</snm><fnm>V</fnm></au><au><snm>Gidrol</snm><fnm>X</fnm></au><au><snm>Frouin</snm><fnm>V</fnm></au></aug><source>IEEE IEEE/ACM Transactions on Computational Biology and Bioinformatics</source><pubdate>2008</pubdate><volume>7</volume><fpage>251</fpage><lpage>262</lpage></bibl><bibl id="B42"><title><p>Identification of a novel putative gastrointestinal stem cell and adenoma stem cell marker, Doublecortin and CaM Kinase-Like-1, following radiation injury and in adenomatous polyposis Coli/multiple intestinal neoplasia mice</p></title><aug><au><snm>May</snm><fnm>R</fnm></au><au><snm>Riehl</snm><fnm>T</fnm></au><au><snm>Hunt</snm><fnm>C</fnm></au><au><snm>Sureban</snm><fnm>S</fnm></au><au><snm>Anant</snm><fnm>S</fnm></au><au><snm>Houchen</snm><fnm>C</fnm></au></aug><source>Stem Cells</source><pubdate>2008</pubdate><volume>26</volume><issue>3</issue><fpage>630</fpage><lpage>637</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1634/stemcells.2007-0621</pubid><pubid idtype="pmpid" link="fulltext">18055444</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>DNA repair by ERCC1 in non-small-cell lung cancer and cisplatin-based adjuvant chemotherapy</p></title><aug><au><snm>Olaussen</snm><fnm>K</fnm></au><au><snm>Dunant</snm><fnm>A</fnm></au><au><snm>Fouret</snm><fnm>P</fnm></au><au><snm>Brambilla</snm><fnm>E</fnm></au><au><snm>Andr&#233;</snm><fnm>F</fnm></au><au><snm>Haddad</snm><fnm>V</fnm></au><au><snm>Taranchon</snm><fnm>E</fnm></au><au><snm>Filipits</snm><fnm>M</fnm></au><au><snm>Pirker</snm><fnm>R</fnm></au><au><snm>Popper</snm><fnm>H</fnm></au><etal/></aug><source>New Engl J Med</source><pubdate>2006</pubdate><volume>355</volume><issue>10</issue><fpage>983</fpage><lpage>991</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1056/NEJMoa060570</pubid><pubid idtype="pmpid" link="fulltext">16957145</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Down-regulation of cholesterol 7&#945;-hydroxylase (CYP7A1) gene expression by bile acids in primary rat hepatocytes is mediated by the c-Jun N-terminal kinase pathway</p></title><aug><au><snm>Gupta</snm><fnm>S</fnm></au><au><snm>Stravitz</snm><fnm>R</fnm></au><au><snm>Dent</snm><fnm>P</fnm></au><au><snm>Hylemon</snm><fnm>P</fnm></au></aug><source>J Biol Chem</source><pubdate>2001</pubdate><volume>276</volume><issue>19</issue><fpage>15816</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M010878200</pubid><pubid idtype="pmpid" link="fulltext">11278771</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>Scalable variational inference for Bayesian variable selection in regression, and its accuracy in genetic association studies</p></title><aug><au><snm>Carbonetto</snm><fnm>P</fnm></au><au><snm>Stephens</snm><fnm>M</fnm></au></aug><source>Bayesian Analysis</source><inpress/></bibl><bibl id="B46"><title><p>Stability selection for genome-wide association</p></title><aug><au><snm>Alexander</snm><fnm>D</fnm></au><au><snm>Lange</snm><fnm>K</fnm></au></aug><source>Genet Epidemiol</source><pubdate>2011</pubdate><volume>35</volume><issue>7</issue><fpage>722</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/gepi.20623</pubid><pubid idtype="pmpid" link="fulltext">22009793</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Visceral Obesity is Associated with High Levels of Serum Squalene</p></title><aug><au><snm>Peltola</snm><fnm>P</fnm></au><au><snm>Pihlajam&#228;ki</snm><fnm>J</fnm></au><au><snm>Koutnikova</snm><fnm>H</fnm></au><au><snm>Ruotsalainen</snm><fnm>E</fnm></au><au><snm>Salmenniemi</snm><fnm>U</fnm></au><au><snm>Vauhkonen</snm><fnm>I</fnm></au><au><snm>Kainulainen</snm><fnm>S</fnm></au><au><snm>Gylling</snm><fnm>H</fnm></au><au><snm>Miettinen</snm><fnm>T</fnm></au><au><snm>Auwerx</snm><fnm>J</fnm></au><etal/></aug><source>Obesity</source><pubdate>2006</pubdate><volume>14</volume><issue>7</issue><fpage>1155</fpage><lpage>1163</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/oby.2006.132</pubid><pubid idtype="pmpid" link="fulltext">16899796</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Hepatic gene expression profiles in a long-term high-fat diet-induced obesity mouse model</p></title><aug><au><snm>Kim</snm><fnm>S</fnm></au><au><snm>Sohn</snm><fnm>I</fnm></au><au><snm>Ahn</snm><fnm>J</fnm></au><au><snm>Lee</snm><fnm>K</fnm></au><au><snm>Lee</snm><fnm>Y</fnm></au><au><snm>Lee</snm><fnm>Y</fnm></au></aug><source>Gene</source><pubdate>2004</pubdate><volume>340</volume><fpage>99</fpage><lpage>109</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.gene.2004.06.015</pubid><pubid idtype="pmpid" link="fulltext">15556298</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Cannabinoid CB2 receptor potentiates obesity-associated inflammation, insulin resistance and hepatic steatosis</p></title><aug><au><snm>Deveaux</snm><fnm>V</fnm></au><au><snm>Cadoudal</snm><fnm>T</fnm></au><au><snm>Ichigotani</snm><fnm>Y</fnm></au><au><snm>Teixeira-Clerc</snm><fnm>F</fnm></au><au><snm>Louvet</snm><fnm>A</fnm></au><au><snm>Manin</snm><fnm>S</fnm></au><au><snm>Nhieu</snm><fnm>J</fnm></au><au><snm>Belot</snm><fnm>M</fnm></au><au><snm>Zimmer</snm><fnm>A</fnm></au><au><snm>Even</snm><fnm>P</fnm></au><etal/></aug><source>PLoS One</source><pubdate>2009</pubdate><volume>4</volume><issue>6</issue><fpage>e5844</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0005844</pubid><pubid idtype="pmcid">2688760</pubid><pubid idtype="pmpid" link="fulltext">19513120</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Editorial: The Corticotropin-Releasing Hormone System and Feeding Behavior-A Complex Web Begins to Unravel</p></title><aug><au><snm>Cone</snm><fnm>R</fnm></au></aug><source>Endocrinology</source><pubdate>2000</pubdate><volume>141</volume><issue>8</issue><fpage>2713</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1210/en.141.8.2713</pubid><pubid idtype="pmpid" link="fulltext">10919254</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Insulin signal transduction pathways and insulin-induced gene expression</p></title><aug><au><snm>Keeton</snm><fnm>A</fnm></au><au><snm>Amsler</snm><fnm>M</fnm></au><au><snm>Venable</snm><fnm>D</fnm></au><au><snm>Messina</snm><fnm>J</fnm></au></aug><source>J Biol Chem</source><pubdate>2002</pubdate><volume>277</volume><issue>50</issue><fpage>48565</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M207837200</pubid><pubid idtype="pmpid" link="fulltext">12364332</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Distinct skeletal muscle fiber characteristics and gene expression in diet-sensitive versus diet-resistant obesity</p></title><aug><au><snm>Gerrits</snm><fnm>M</fnm></au><au><snm>Ghosh</snm><fnm>S</fnm></au><au><snm>Kavaslar</snm><fnm>N</fnm></au><au><snm>Hill</snm><fnm>B</fnm></au><au><snm>Tour</snm><fnm>A</fnm></au><au><snm>Seifert</snm><fnm>E</fnm></au><au><snm>Beauchamp</snm><fnm>B</fnm></au><au><snm>Gorman</snm><fnm>S</fnm></au><au><snm>Stuart</snm><fnm>J</fnm></au><au><snm>Dent</snm><fnm>R</fnm></au><etal/></aug><source>J Lipid Res</source><pubdate>2010</pubdate><volume>51</volume><issue>8</issue><fpage>2394</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1194/jlr.P005298</pubid><pubid idtype="pmcid">2903798</pubid><pubid idtype="pmpid" link="fulltext">20332421</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>The human obesity gene map: the 2004 update</p></title><aug><au><snm>Perusse</snm><fnm>L</fnm></au><au><snm>Rankinen</snm><fnm>T</fnm></au><au><snm>Zuberi</snm><fnm>A</fnm></au><au><snm>Chagnon</snm><fnm>Y</fnm></au><au><snm>Weisnagel</snm><fnm>S</fnm></au><au><snm>Argyropoulos</snm><fnm>G</fnm></au><au><snm>Walts</snm><fnm>B</fnm></au><au><snm>Snyder</snm><fnm>E</fnm></au><au><snm>Bouchard</snm><fnm>C</fnm></au></aug><source>Obesity</source><pubdate>2005</pubdate><volume>13</volume><issue>3</issue><fpage>381</fpage><lpage>490</lpage><xrefbib><pubid idtype="doi">10.1038/oby.2005.50</pubid></xrefbib></bibl><bibl id="B54"><title><p>Positional cloning of zinc finger domain transcription factor Zfp69, a candidate gene for obesity-associated diabetes contributed by mouse locus Nidd/SJL</p></title><aug><au><snm>Scherneck</snm><fnm>S</fnm></au><au><snm>Nestler</snm><fnm>M</fnm></au><au><snm>Vogel</snm><fnm>H</fnm></au><au><snm>Bl&#252;her</snm><fnm>M</fnm></au><au><snm>Block</snm><fnm>M</fnm></au><au><snm>Diaz</snm><fnm>M</fnm></au><au><snm>Herzig</snm><fnm>S</fnm></au><au><snm>Schulz</snm><fnm>N</fnm></au><au><snm>Teichert</snm><fnm>M</fnm></au><au><snm>Tischer</snm><fnm>S</fnm></au><etal/></aug><source>PLoS Genet</source><pubdate>2009</pubdate><volume>5</volume><issue>7</issue><fpage>593</fpage><lpage>596</lpage></bibl><bibl id="B55"><title><p>A sparse conditional Gaussian graphical model for analysis of genetical genomics data</p></title><aug><au><snm>Yin</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>H</fnm></au></aug><source>Ann Appl Stat</source><pubdate>2011</pubdate><volume>5</volume><issue>4</issue><fpage>2630</fpage><lpage>2650</lpage><xrefbib><pubid idtype="doi">10.1214/11-AOAS494</pubid></xrefbib></bibl><bibl id="B56"><title><p>Sparse inverse covariance estimation with the graphical lasso</p></title><aug><au><snm>Friedman</snm><fnm>J</fnm></au><au><snm>Hastie</snm><fnm>T</fnm></au><au><snm>Tibshirani</snm><fnm>R</fnm></au></aug><source>Biostatistics</source><pubdate>2008</pubdate><volume>9</volume><issue>3</issue><fpage>432</fpage><lpage>441</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3019769,3019769</pubid><pubid idtype="pmpid" link="fulltext">18079126</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>Efficient computation of the l1 regularized solution path in Gaussian graphical models</p></title><aug><au><snm>Yuan</snm><fnm>M</fnm></au></aug><source>J Comput Graph Stat</source><pubdate>2006</pubdate><volume>17</volume><fpage>809</fpage><lpage>826</lpage></bibl><bibl id="B58"><title><p>Needles and straw in haystacks: Empirical Bayes estimates of possibly sparse sequences</p></title><aug><au><snm>Johnstone</snm><fnm>I</fnm></au><au><snm>Silverman</snm><fnm>B</fnm></au></aug><source>Ann Stat</source><pubdate>2004</pubdate><volume>32</volume><fpage>1594</fpage><lpage>1649</lpage><xrefbib><pubid idtype="doi">10.1214/009053604000000030</pubid></xrefbib></bibl><bibl id="B59"><title><p>Better subset regression using the nonnegative garrote</p></title><aug><au><snm>Breiman</snm><fnm>L</fnm></au></aug><source>Technometrics</source><pubdate>1995</pubdate><volume>37</volume><issue>4</issue><fpage>373</fpage><lpage>384</lpage><xrefbib><pubid idtype="doi">10.2307/1269730</pubid></xrefbib></bibl><bibl id="B60"><title><p>Bayesian model averaging for linear regression models</p></title><aug><au><snm>Raftery</snm><fnm>A</fnm></au><au><snm>Madigan</snm><fnm>D</fnm></au><au><snm>Hoeting</snm><fnm>J</fnm></au></aug><source>J Am Stat Assoc</source><pubdate>1997</pubdate><volume>92</volume><issue>437</issue><fpage>179</fpage><lpage>191</lpage><xrefbib><pubid idtype="doi">10.2307/2291462</pubid></xrefbib></bibl><bibl id="B61"><title><p>Bagging predictors</p></title><aug><au><snm>Breiman</snm><fnm>L</fnm></au></aug><source>Mach Learn</source><pubdate>1996</pubdate><volume>24</volume><issue>2</issue><fpage>123</fpage><lpage>140</lpage></bibl><bibl id="B62"><title><p>Regularization paths for generalized linear models via coordinate descent</p></title><aug><au><snm>Friedman</snm><fnm>J</fnm></au><au><snm>Hastie</snm><fnm>T</fnm></au><au><snm>Tibshirani</snm><fnm>R</fnm></au></aug><source>J Stat Softw</source><pubdate>2010</pubdate><volume>33</volume><fpage>1</fpage><xrefbib><pubidlist><pubid idtype="pmcid">2929880</pubid><pubid idtype="pmpid">20808728</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>Network: a package for managing relational data in R</p></title><aug><au><snm>Butts</snm><fnm>C</fnm></au></aug><source>J Stat Softw</source><pubdate>2008</pubdate><volume>24</volume><fpage>1</fpage><lpage>36</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2443947</pubid><pubid idtype="pmpid">18612375</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm>
</art>