<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1752-0509-1-37</ui>
   <ji>1752-0509</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Opgen-Rhein</snm>
               <fnm>Rainer</fnm>
               <insr iid="I1"/>
               <email>opgen-rhein@stat.uni-muenchen.de</email>
            </au>
            <au id="A2">
               <snm>Strimmer</snm>
               <fnm>Korbinian</fnm>
               <insr iid="I2"/>
               <email>strimmer@uni-leipzig.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Statistics, Ludwig-Maximilians-Universit&#228;t M&#252;nchen, Ludwigstra&#223;e 33, D-80539 M&#252;nchen, Germany</p>
            </ins>
            <ins id="I2">
               <p>Institute for Medical Informatics, Statistics and Epidemiology (IMISE), University of Leipzig, H&#228;rtelstr. 16-18, 04107 Leipzig, Germany</p>
            </ins>
         </insg>
         <source>BMC Systems Biology</source>
         <issn>1752-0509</issn>
         <pubdate>2007</pubdate>
         <volume>1</volume>
         <issue>1</issue>
         <fpage>37</fpage>
         <url>http://www.biomedcentral.com/1752-0509/1/37</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17683609</pubid>
               <pubid idtype="doi">10.1186/1752-0509-1-37</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>06</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>06</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Opgen-Rhein and Strimmer; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The use of correlation networks is widespread in the analysis of gene expression and proteomics data, even though it is known that correlations not only confound direct and indirect associations but also provide no means to distinguish between cause and effect. For "causal" analysis typically the inference of a directed graphical model is required. However, this is rather difficult due to the curse of dimensionality.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We propose a simple heuristic for the statistical learning of a high-dimensional "causal" network. The method first converts a correlation network into a partial correlation graph. Subsequently, a partial ordering of the nodes is established by multiple testing of the log-ratio of standardized partial variances. This allows identifying a directed acyclic causal network as a subgraph of the partial correlation network. We illustrate the approach by analyzing a large <it>Arabidopsis thaliana </it>expression data set.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The proposed approach is a heuristic algorithm that is based on a number of approximations, such as substituting lower order partial correlations by full order partial correlations. Nevertheless, for small samples and for sparse networks the algorithm not only yield sensible first order approximations of the causal structure in high-dimensional genomic data but is also computationally highly efficient.</p>
            </sec>
            <sec>
               <st>
                  <p>Availability and Requirements</p>
               </st>
               <p>The method is implemented in the "GeneNet" R package (version 1.2.0), available from CRAN and from <url>http://strimmerlab.org/software/genets/</url>. The software includes an R script for reproducing the network analysis of the <it>Arabidopsis thaliana </it>data.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Correlation networks are widely used to explore and visualize high-dimensional data, for instance in finance <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>, ecology <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, gene expression analysis <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>, or metabolomics <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Their popularity is owed to a large extent to the ease with which a correlation network can be constructed, as this requires only two simple steps: i) the computation of all pairwise correlations for the investigated variables, and ii) a thresholding or filtering procedure <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> to identify significant correlations, and hence edges, of the network.</p>
         <p>However, for shedding light on the causal processes underlying the observed data, correlation networks are only of limited use. This is due to the fact that correlations not only confound direct and indirect associations but also provide no means to distinguish between response variables and covariates (and thus between cause and effect).</p>
         <p>Therefore, causal analysis requires tools different from correlation networks: much of the work in this area has focused on Bayesian networks <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> or related regression models such as systems of recursive equations <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp> or influence diagrams <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. All of these models have in common that they describe causal relations by an underlying directed acyclic graph (DAG).</p>
         <p>There already exist numerous methods for learning DAGs from observational data &#8211; see for instance the summarizing review in <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and the references therein. However, with few exceptions [e.g., the PC algorithm, <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>] virtually all of these methods have been devised for comparatively small numbers of variables and with large sample size in mind. For instance, the numerical example of the recently proposed algorithm described in <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> uses <it>n </it>= 10,000 observations for <it>p </it>= 7 variables. Unfortunately, the data that would be most interesting to explore with causal methods, namely those commonly visualized by correlation networks (see above), have completely different characteristics, in particular they are likely of high dimension.</p>
         <p>In this paper we follow <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and focus on modeling large-scale linear recursive systems. Specifically, we present a simple discovery algorithm that enables the inference of causal relations from small sampled data and for large numbers of variables. It proceeds in two steps as follows:</p>
         <p>&#8226; First, the correlation network is transformed into a partial correlation network, which is essentially an undirected graph that displays the direct linear associations only. This type of network model is also known under the names of graphical Gaussian model (GGM), concentration graph, covariance selection graph, conditional independence graph (CIG), or Markov random field. Note that there is a simple relationship between correlation and partial correlation. Moreover, in recent years there has been much progress with regard to statistical methodology for learning large-scale partial correlation graphs from small samples [e.g., <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>]. Here we employ the approach described in <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
         <p>&#8226; Second, the undirected GGM is converted into a <it>partially </it>directed graph. This is done by estimating a pairwise ordering of the nodes from the data using multiple testing of the log-ratios of standardized partial variances, and by subsequent projection of this partial ordering onto the GGM. The inferred causal network is the subgraph containing all the directed edges.</p>
         <p>Note that this algorithm is similar to the PC algorithm in that edges are being removed from the independence graph to obtain the underlying DAG. However, our criterion for eliminating an edge is distinctly different from that of the PC algorithm.</p>
         <p>The remainder of the paper is organized as follows. First, we describe the methodology. Second we consider its statistical interpretation and further properties. Subsequently, we illustrate the approach by analyzing an 800 gene data set from a large-scale <it>Arabidopsis thaliana </it>gene expression experiment. Finally, we conclude with some discussion of the method, commenting also on the limitations of the approach.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Theoretical basis</p>
            </st>
            <p>Consider a linear regression with <it>Y </it>as response and <it>X</it><sub>1</sub>, ..., <it>X</it><sub><it>k</it></sub>, ..., <it>X</it><sub><it>K </it></sub>as covariates. We assume that <it>X</it><sub><it>k </it></sub>and <it>Y </it>are random variables with known variances var(<it>Y</it>) and var(<it>X</it><sub><it>k</it></sub>) and with covariance cov(<it>Y</it>, <it>X</it><sub><it>k</it></sub>). The best linear predictor of <it>Y </it>in terms of the <it>X</it><sub><it>k </it></sub>that minimizes the MSE of 
&#8721;<sub><it>k </it></sub><it>&#946;</it><sub><it>k</it></sub><it>X</it><sub><it>k </it></sub>- <it>Y </it>is given by [e.g. ref. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, p. 206]</p>
            <p>
               <display-formula id="M1">
                  <m:math name="1752-0509-1-37-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msubsup>
                              <m:mi>&#946;</m:mi>
                              <m:mi>k</m:mi>
                              <m:mi>y</m:mi>
                           </m:msubsup>
                           <m:mo>=</m:mo>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>&#961;</m:mi>
                                 <m:mo>&#732;</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>y</m:mi>
                                 <m:mi>k</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:msqrt>
                              <m:mrow>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mover accent="true">
                                             <m:mi>&#963;</m:mi>
                                             <m:mo>&#732;</m:mo>
                                          </m:mover>
                                          <m:mi>y</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:msubsup>
                                          <m:mover accent="true">
                                             <m:mi>&#963;</m:mi>
                                             <m:mo>&#732;</m:mo>
                                          </m:mover>
                                          <m:mi>k</m:mi>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mrow>
                           </m:msqrt>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaem4AaSgabaGaemyEaKhaaOGaeyypa0Jaf8xWdiNbaGaadaWgaaWcbaGaemyEaKNaem4AaSgabeaakmaakaaabaWaaSaaaeaacuWFdpWCgaacamaaDaaaleaacqWG5bqEaeaacqaIYaGmaaaakeaacuWFdpWCgaacamaaDaaaleaacqWGRbWAaeaacqaIYaGmaaaaaaqabaGccqGGSaalaaa@410B@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <inline-formula><m:math name="1752-0509-1-37-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#961;</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>y</m:mi><m:mi>k</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFbpGCgaacamaaBaaaleaacqWG5bqEcqWGRbWAaeqaaaaa@3188@</m:annotation></m:semantics></m:math></inline-formula> and is the <it>partial </it>correlation between <it>Y </it>and <it>X</it><sub><it>k</it></sub>, and <inline-formula><m:math name="1752-0509-1-37-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>&#732;</m:mo></m:mover><m:mi>y</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacamaaDaaaleaacqWG5bqEaeaacqaIYaGmaaaaaa@311F@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1752-0509-1-37-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>&#732;</m:mo></m:mover><m:mi>k</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacamaaDaaaleaacqWGRbWAaeaacqaIYaGmaaaaaa@3103@</m:annotation></m:semantics></m:math></inline-formula> are the respective <it>partial </it>variances. </p>
            <p>The partial correlation is the correlation that remains between two variables if the effect of the other variables has been regressed away. Likewise, the partial variance is the variance that remains if the influences of all other variables are taken into account. Table <tblr tid="T1">1</tblr> lists the definitions and formulas for the computation of these quantities (note that in our notation a tilde on top of a symbol indicates ''partial'').</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Formulas for computing partial variances and partial correlations</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Definition</p>
                     </c>
                     <c ca="left">
                        <p>True value</p>
                     </c>
                     <c ca="left">
                        <p>Estimate</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Covariance matrix:</p>
                     </c>
                     <c ca="left">
                        <p>cov(<it>X</it><sub><it>k</it></sub>, <it>X</it><sub><it>l</it></sub>) = <it>&#963;</it><sub><it>kl</it></sub></p>
                     </c>
                     <c ca="left">
                        <p><b>&#931; </b>= (<it>&#963;</it><sub><it>kl</it></sub>)</p>
                     </c>
                     <c ca="left">
                        <p><b><it>S </it></b>= (<it>s</it><sub><it>kl</it></sub>)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Concentration matrix:</p>
                     </c>
                     <c ca="left">
                        <p><b>&#937; </b>= <b>&#931;</b><sup>-1</sup></p>
                     </c>
                     <c ca="left">
                        <p><b>&#937; </b>= (<it>&#969;</it><sub><it>kl</it></sub>)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Variances:</p>
                     </c>
                     <c ca="left">
                        <p>var(<it>X</it><sub><it>k</it></sub>) = <it>&#963;</it><sub><it>kk </it></sub>= <inline-formula><m:math name="1752-0509-1-37-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>k</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaem4AaSgabaGaeGOmaidaaaaa@30F4@</m:annotation></m:semantics></m:math></inline-formula></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>&#963;</it>
                           <sub>
                              <it>kk</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>s</it>
                           <sub>
                              <it>kk</it>
                           </sub>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Partial variances</p>
                     </c>
                     <c ca="left">
                        <p>var(<it>X</it><sub><it>k</it></sub>|<it>X</it><sub>&#8800;<it>k</it></sub>) = <inline-formula><m:math name="1752-0509-1-37-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>k</m:mi><m:mi>k</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacamaaBaaaleaacqWGRbWAcqWGRbWAaeqaaaaa@316F@</m:annotation></m:semantics></m:math></inline-formula> = <inline-formula><m:math name="1752-0509-1-37-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>&#732;</m:mo></m:mover><m:mi>k</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacamaaDaaaleaacqWGRbWAaeaacqaIYaGmaaaaaa@3103@</m:annotation></m:semantics></m:math></inline-formula> = <inline-formula><m:math name="1752-0509-1-37-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#969;</m:mi><m:mrow><m:mi>k</m:mi><m:mi>k</m:mi></m:mrow><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFjpWDdaqhaaWcbaGaem4AaSMaem4AaSgabaGaeyOeI0IaeGymaedaaaaa@3348@</m:annotation></m:semantics></m:math></inline-formula></p>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <m:math name="1752-0509-1-37-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>&#963;</m:mi>
                                             <m:mo>&#732;</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>k</m:mi>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacamaaBaaaleaacqWGRbWAcqWGRbWAaeqaaaaa@316F@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <m:math name="1752-0509-1-37-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mover accent="true">
                                             <m:mi>s</m:mi>
                                             <m:mo>&#732;</m:mo>
                                          </m:mover>
                                          <m:mrow>
                                             <m:mi>k</m:mi>
                                             <m:mi>k</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGZbWCgaacamaaBaaaleaacqWGRbWAcqWGRbWAaeqaaaaa@3114@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </inline-formula>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Correlations:</p>
                     </c>
                     <c ca="left">
                        <p>corr(<it>X</it><sub><it>k</it></sub>, <it>X</it><sub><it>l</it></sub>) = <it>&#961;</it><sub><it>kl </it></sub>= <it>&#963;</it><sub><it>kl </it></sub>(<it>&#963;</it><sub><it>kk </it></sub><it>&#963;</it><sub><it>ll</it></sub>)<sup>-1/2</sup></p>
                     </c>
                     <c ca="left">
                        <p><b><it>P </it></b>= (<it>&#961;</it><sub><it>kl</it></sub>)</p>
                     </c>
                     <c ca="left">
                        <p><b><it>R </it></b>= (<it>r</it><sub><it>kl</it></sub>)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Partial correlations:</p>
                     </c>
                     <c ca="left">
                        <p>corr(<it>X</it><sub><it>k</it></sub>, <it>X</it><sub><it>l</it></sub>|<it>X</it><sub>&#8800;<it>k</it>, <it>l</it></sub>) = <inline-formula><m:math name="1752-0509-1-37-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>&#961;</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>k</m:mi><m:mi>l</m:mi></m:mrow></m:msub><m:mo>=</m:mo><m:mo>&#8722;</m:mo><m:msub><m:mi>&#969;</m:mi><m:mrow><m:mi>k</m:mi><m:mi>l</m:mi></m:mrow></m:msub><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>&#969;</m:mi><m:mrow><m:mi>k</m:mi><m:mi>k</m:mi></m:mrow></m:msub><m:msub><m:mi>&#969;</m:mi><m:mrow><m:mi>l</m:mi><m:mi>l</m:mi></m:mrow></m:msub><m:mo stretchy="false">)</m:mo></m:mrow><m:mrow><m:mo>&#8722;</m:mo><m:mn>1</m:mn><m:mo>/</m:mo><m:mn>2</m:mn></m:mrow></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFbpGCgaacamaaBaaaleaacqWGRbWAcqWGSbaBaeqaaOGaeyypa0JaeyOeI0Iae8xYdC3aaSbaaSqaaiabdUgaRjabdYgaSbqabaGccqGGOaakcqWFjpWDdaWgaaWcbaGaem4AaSMaem4AaSgabeaakiab=L8a3naaBaaaleaacqWGSbaBcqWGSbaBaeqaaOGaeiykaKYaaWbaaSqabeaacqGHsislcqaIXaqmcqGGVaWlcqaIYaGmaaaaaa@4739@</m:annotation></m:semantics></m:math></inline-formula><it/><sub/><it/><sub/><it/><sub/><sup/></p>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <m:math name="1752-0509-1-37-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mi>P</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mo>=</m:mo>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mover accent="true">
                                                   <m:mi>&#961;</m:mi>
                                                   <m:mo>&#732;</m:mo>
                                                </m:mover>
                                                <m:mrow>
                                                   <m:mi>k</m:mi>
                                                   <m:mi>l</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieWacuWFqbaugaacaiabg2da9maabmaabaacciGaf4xWdiNbaGaadaWgaaWcbaGaem4AaSMaemiBaWgabeaaaOGaayjkaiaawMcaaaaa@3546@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <m:math name="1752-0509-1-37-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
                                 <m:semantics>
                                    <m:mrow>
                                       <m:mover accent="true">
                                          <m:mi>R</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mo>=</m:mo>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mover accent="true">
                                                   <m:mi>r</m:mi>
                                                   <m:mo>&#732;</m:mo>
                                                </m:mover>
                                                <m:mrow>
                                                   <m:mi>k</m:mi>
                                                   <m:mi>l</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                    <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaieWacuWFsbGugaacaiabg2da9maabmaabaGafmOCaiNbaGaadaWgaaWcbaGaem4AaSMaemiBaWgabeaaaOGaayjkaiaawMcaaaaa@34F1@</m:annotation>
                                 </m:semantics>
                              </m:math>
                           </inline-formula>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Index <it>i </it>runs from 1 to <it>n </it>(sample size), and indices <it>k </it>and <it>l </it>run from 1 to <it>p </it>(dimension). A tilde denotes a "partial" quantity.</p>
               </tblfn>
            </tbl>
            <p>From Equation 1 it is immediately clear that the complete linear system and thus all <inline-formula><m:math name="1752-0509-1-37-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#946;</m:mi><m:mi>k</m:mi><m:mi>y</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaem4AaSgabaGaemyEaKhaaaaa@315B@</m:annotation></m:semantics></m:math></inline-formula> are determined by the joint covariance matrix of <it>Y </it>and <it>X</it><sub><it>k </it></sub>[see also, e.g., <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B24">24</abbr></abbrgrp>]. For only a single dependent variable Equation 1 reduces to the well-known relation <inline-formula><m:math name="1752-0509-1-37-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#946;</m:mi><m:mi>x</m:mi><m:mi>y</m:mi></m:msubsup><m:mo>=</m:mo><m:msub><m:mi>&#961;</m:mi><m:mrow><m:mi>y</m:mi><m:mi>x</m:mi></m:mrow></m:msub><m:msqrt><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>y</m:mi><m:mn>2</m:mn></m:msubsup><m:mo>/</m:mo><m:msubsup><m:mi>&#963;</m:mi><m:mi>x</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow></m:msqrt></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaemiEaGhabaGaemyEaKhaaOGaeyypa0Jae8xWdi3aaSbaaSqaaiabdMha5jabdIha4bqabaGcdaGcaaqaaiab=n8aZnaaDaaaleaacqWG5bqEaeaacqaIYaGmaaGccqGGVaWlcqWFdpWCdaqhaaWcbaGaemiEaGhabaGaeGOmaidaaaqabaaaaa@4118@</m:annotation></m:semantics></m:math></inline-formula>, which contains only the unconditioned correlation and variances (without the tilde).</p>
            <p>We emphasize that Equation 1 has a direct relation with the usual ordinary least squares (OLS) estimator for the regression coefficient. This is recovered if the empirical covariance matrix is plugged into Equation 1. However, note that Equation 1 also remains valid if other estimates of the covariance are used, such as penalized or shrinkage estimators (note that there is no hat on <inline-formula><m:math name="1752-0509-1-37-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#946;</m:mi><m:mi>k</m:mi><m:mi>y</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaem4AaSgabaGaemyEaKhaaaaa@315B@</m:annotation></m:semantics></m:math></inline-formula>).</p>
            <p>For the following it is important that Equation 1 can be further rewritten by introducing a scale factor. Specifically, by abbreviating the standardized partial variance <inline-formula><m:math name="1752-0509-1-37-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>&#732;</m:mo></m:mover><m:mi>k</m:mi><m:mn>2</m:mn></m:msubsup><m:mo>/</m:mo><m:msubsup><m:mi>&#963;</m:mi><m:mi>k</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacamaaDaaaleaacqWGRbWAaeaacqaIYaGmaaGccqGGVaWlcqWFdpWCdaqhaaWcbaGaem4AaSgabaGaeGOmaidaaaaa@362F@</m:annotation></m:semantics></m:math></inline-formula> by SPV<sub><it>k</it></sub>, we can decompose the regression coefficient into the simple product</p>
            <p>
               <display-formula id="M2">
                  <m:math name="1752-0509-1-37-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msubsup>
                              <m:mi>&#946;</m:mi>
                              <m:mi>k</m:mi>
                              <m:mi>y</m:mi>
                           </m:msubsup>
                           <m:mo>=</m:mo>
                           <m:munder>
                              <m:munder>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mover accent="true">
                                          <m:mi>&#961;</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mrow>
                                          <m:mi>y</m:mi>
                                          <m:mi>k</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                 </m:mrow>
                                 <m:mo stretchy="true">&#65080;</m:mo>
                              </m:munder>
                              <m:mi mathvariant="script">A</m:mi>
                           </m:munder>
                           <m:munder>
                              <m:munder>
                                 <m:mrow>
                                    <m:msqrt>
                                       <m:mrow>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mrow>
                                                      <m:mtext>SPV</m:mtext>
                                                   </m:mrow>
                                                   <m:mi>y</m:mi>
                                                </m:msub>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msub>
                                                   <m:mrow>
                                                      <m:mtext>SPV</m:mtext>
                                                   </m:mrow>
                                                   <m:mi>k</m:mi>
                                                </m:msub>
                                             </m:mrow>
                                          </m:mfrac>
                                       </m:mrow>
                                    </m:msqrt>
                                 </m:mrow>
                                 <m:mo stretchy="true">&#65080;</m:mo>
                              </m:munder>
                              <m:mi>&#8492;</m:mi>
                           </m:munder>
                           <m:munder>
                              <m:munder>
                                 <m:mrow>
                                    <m:msqrt>
                                       <m:mrow>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:msubsup>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mi>y</m:mi>
                                                   <m:mn>2</m:mn>
                                                </m:msubsup>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:msubsup>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mi>k</m:mi>
                                                   <m:mn>2</m:mn>
                                                </m:msubsup>
                                             </m:mrow>
                                          </m:mfrac>
                                       </m:mrow>
                                    </m:msqrt>
                                 </m:mrow>
                                 <m:mo stretchy="true">&#65080;</m:mo>
                              </m:munder>
                              <m:mi mathvariant="script">C</m:mi>
                           </m:munder>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaem4AaSgabaGaemyEaKhaaOGaeyypa0ZaaGbaaeaacuWFbpGCgaacamaaBaaaleaacqWG5bqEcqWGRbWAaeqaaaqaamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaGabaiab+bq8bbGccaGL44padaagaaqaamaakaaabaWaaSaaaeaacqqGtbWucqqGqbaucqqGwbGvdaWgaaWcbaGaemyEaKhabeaaaOqaaiabbofatjabbcfaqjabbAfawnaaBaaaleaacqWGRbWAaeqaaaaaaeqaaaqaaiab+XsicbGccaGL44padaagaaqaamaakaaabaWaaSaaaeaacqWFdpWCdaqhaaWcbaGaemyEaKhabaGaeGOmaidaaaGcbaGae83Wdm3aa0baaSqaaiabdUgaRbqaaiabikdaYaaaaaaabeaaaeaacqGFce=qaOGaayjo+dGaeiOla4caaa@5F66@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Note that SPV<sub><it>y </it></sub>and SPV<sub><it>k </it></sub>take on values from 0 to 1. All three factors have an immediate and intuitive interpretation:</p>
            <p><inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula> : This factor determines whether there is a direct association between <it>Y </it>and the covariate <it>X</it><sub><it>k</it></sub>. If the partial correlation between <it>X</it><sub><it>k </it></sub>and <it>Y </it>vanishes, so will also the two corresponding regression coefficients <inline-formula><m:math name="1752-0509-1-37-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#946;</m:mi><m:mi>k</m:mi><m:mi>y</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaem4AaSgabaGaemyEaKhaaaaa@315B@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1752-0509-1-37-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#946;</m:mi><m:mi>y</m:mi><m:mi>k</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaemyEaKhabaGaem4AaSgaaaaa@315B@</m:annotation></m:semantics></m:math></inline-formula>. In a partial correlation graph an edge is drawn between two nodes <it>Y </it>and <it>X</it><sub><it>k </it></sub>if <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula> &#8800; 0.</p>
            <p><inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> : This factor adjusts the regression coefficient for the relative reduction in variance of <it>Y </it>and <it>X</it><sub><it>k </it></sub>due to the respective other covariates. In the algorithm outlined below a test of log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) establishes the directionality of edges of a partially causal network.</p>
            <p><inline-formula><m:math name="1752-0509-1-37-i19" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">C</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=jq8dbaa@382F@</m:annotation></m:semantics></m:math></inline-formula> : This is a scale factor correcting for different units in <it>Y </it>and <it>X</it><sub><it>k</it></sub>.</p>
            <p>The product <inline-formula><m:math name="1752-0509-1-37-i20" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi mathvariant="script">A</m:mi><m:mi>&#8492;</m:mi><m:mo>=</m:mo><m:msubsup><m:mi>&#946;</m:mi><m:mi>k</m:mi><m:mi>y</m:mi></m:msubsup><m:msqrt><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mi>k</m:mi><m:mn>2</m:mn></m:msubsup><m:mo>/</m:mo><m:msubsup><m:mi>&#963;</m:mi><m:mi>y</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow></m:msqrt></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaacqWFaeFqcqWFSeIqcqGH9aqpiiGacqGFYoGydaqhaaWcbaGaem4AaSgabaGaemyEaKhaaOWaaOaaaeaacqGFdpWCdaqhaaWcbaGaem4AaSgabaGaeGOmaidaaOGaei4la8Iae43Wdm3aa0baaSqaaiabdMha5bqaaiabikdaYaaaaeqaaaaa@4884@</m:annotation></m:semantics></m:math></inline-formula> is also known as the standardized regression coefficient. Note that for computing both <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> only the correlation matrix is needed, as the variance information is already accounted for by the third factor <inline-formula><m:math name="1752-0509-1-37-i19" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">C</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=jq8dbaa@382F@</m:annotation></m:semantics></m:math></inline-formula>.</p>
            <p>In this context it is also helpful to recall the diverse statistical interpretations of SPV:</p>
            <p>&#8226; SPV is the <it>proportion </it>of variance that remains (unexplained) after regressing against all other variables.</p>
            <p>&#8226; For the OLS estimator SPV is equal to 1 - <it>R</it><sup>2</sup>, where <it>R </it>is the usual coefficient of determination.</p>
            <p>&#8226; SPV is the inverse of the diagonal of the inverse of the <it>correlation </it>matrix. Thus, if there is no correlation (unit diagonal correlation matrix) the partial variance equals the variance, and hence SPV = 1.</p>
            <p>&#8226; SPV may also be estimated by 1/VIF, where VIF is the usual variance inflation factor [cf. <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>].</p>
         </sec>
         <sec>
            <st>
               <p>Heuristic algorithm for discovering approximate causal networks</p>
            </st>
            <p>The above decomposition (Equation 2) suggests the following simple strategy for statistical learning of causal networks. First, by multiple testing of <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula> = 0 we determine the network topology, i.e. we identify those edges for which the corresponding partial correlation is not vanishing. Second, by subsequent multiple testing of log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) = 0 we establish a partial ordering of the nodes, which in turn imposes a partial directionality upon the edges.</p>
            <p>In more detail, we propose the following five-step algorithm:</p>
            <p>1. First, it is essential to determine an accurate and positive definite estimate <b><it>R </it></b>of the correlation matrix. Only if the sample size is large with many more observations than variables (<it>n </it>> > <it>p</it>) the usual empirical correlation estimate will be suitable. In all other instances, the use of a regularized estimator is absolutely vital (e.g., the Stein-type shrinkage estimator of <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>) in order to improve efficiency and to guarantee positive definiteness. In addition, if the samples are longitudinal it may be necessary to adjust for autocorrelation <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
            <p>2. From the estimated correlations we compute the partial variances and correlations (see Table <tblr tid="T1">1</tblr>), and from those in turn plug-in estimates of the factors <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> of Equation 2 for all possible edges. Note that in this calculation each variable assumes in turn the role of the response <it>Y </it>. An efficient way to calculate the various <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> is given by taking the square root of the diagonal of the inverse of the estimated correlation matrix, and computing the corresponding pairwise ratios.</p>
            <p>3. Subsequently, we infer the partial correlation graph following the algorithm described in <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Essentially, we perform multiple testing of all partial correlation coefficients <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula>. Note that for high dimensions (large <it>p</it>) the null distribution of partial correlations across edges can be determined from the data, which in turn allows the adaptive computation of corresponding false discovery rates <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
            <p>4. In a similar fashion we then conduct multiple testing of all log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>). As <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> is the ratio of two variances with the same degrees of freedom, it is implicit that log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) is approximately normally distributed <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, with an unknown variance parameter <it>&#952;</it>. Thus, the observed <it>z </it>= log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) across all edges follow a mixture distribution</p>
            <p>
               <display-formula id="M3"><it>f</it>(<it>z</it>) = <it>&#951;</it><sub>0 </sub><it>N</it>(0, <it>&#952;</it>) + (1 - <it>&#951;</it><sub>0</sub>) <it>f</it><sub><it>A </it></sub>(<it>z</it>).</display-formula>
            </p>
            <p>Assuming that most <it>z </it>belong to the null model, i.e. that most edges are undirected, it is possible to infer non-parametrically the alternative distribution <it>f</it><sub><it>A </it></sub>(<it>z</it>), the proportion <it>&#951;</it><sub>0</sub>, as well as the variance parameter <it>&#952; </it>&#8211; for an algorithm see <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. From the resulting densities and distribution functions local and tail-area-based false discovery rates for the test log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) = 0 are computed. Note that in this procedure we include all edges, regardless of the corresponding value of <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula> or the outcome of the test <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula> = 0.</p>
            <p>5. Finally, a partially directed network is constructed as follows. All edges in the correlation graph with significant log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) &#8800; 0 are directed in such a fashion that the direction of the arrow points from the node with the larger standardized partial variance (the more "exogenous" variable) to the node with the smaller standardized partial variance (the more "endogenous" variable). The other edges with log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) &#8776; 0 remain undirected. The subgraph consisting of all directed edges constitutes the inferred causal network. Note that this does not necessarily include all nodes that are contained in the GGM network.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Interpretation of the resulting graph</p>
            </st>
            <p>The above algorithm returns a partially directed partial correlation graph, whose directed edges form a causal network.</p>
            <p>This procedure can be motivated by the following connection between partial correlation graph and a system of linear equations, where each node is in turn taken as a response variable and regressed against all other remaining nodes. In this setting the partial correlation coefficient is the geometric mean of <inline-formula><m:math name="1752-0509-1-37-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#946;</m:mi><m:mi>k</m:mi><m:mi>y</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaem4AaSgabaGaemyEaKhaaaaa@315B@</m:annotation></m:semantics></m:math></inline-formula> and the corresponding reciprocal coefficient <inline-formula><m:math name="1752-0509-1-37-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#946;</m:mi><m:mi>y</m:mi><m:mi>k</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFYoGydaqhaaWcbaGaemyEaKhabaGaem4AaSgaaaaa@315B@</m:annotation></m:semantics></m:math></inline-formula>, i.e.</p>
            <p>
               <display-formula id="M4">
                  <m:math name="1752-0509-1-37-i21" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msqrt>
                              <m:mrow>
                                 <m:msubsup>
                                    <m:mi>&#946;</m:mi>
                                    <m:mi>y</m:mi>
                                    <m:mi>k</m:mi>
                                 </m:msubsup>
                                 <m:msubsup>
                                    <m:mi>&#946;</m:mi>
                                    <m:mi>k</m:mi>
                                    <m:mi>y</m:mi>
                                 </m:msubsup>
                              </m:mrow>
                           </m:msqrt>
                           <m:mo>=</m:mo>
                           <m:mrow>
                              <m:mo>|</m:mo>
                              <m:mrow>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#961;</m:mi>
                                       <m:mo>&#732;</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>y</m:mi>
                                       <m:mi>k</m:mi>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                              <m:mo>|</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaGcaaqaaGGaciab=j7aInaaDaaaleaacqWG5bqEaeaacqWGRbWAaaGccqWFYoGydaqhaaWcbaGaem4AaSgabaGaemyEaKhaaaqabaGccqGH9aqpdaabdaqaaiqb=f8aYzaaiaWaaSbaaSqaaiabdMha5jabdUgaRbqabaaakiaawEa7caGLiWoaaaa@3F24@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>[see also equation 16 of ref. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>]. In this light, an undirected edge between two nodes A and B in a partial correlation graph may also be interpreted as bidirected edge, in the sense that A influences B and vice versa in the underlying system of regression. Therefore, the test <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> = 1 can be understood as <it>removing </it>one of these two directions, where Equation 2 suggests that only the relative variance reduction between the two involved nodes needs to be considered for establishing the final direction.</p>
         </sec>
         <sec>
            <st>
               <p>Reconstruction efficiency and approximations underlying the algorithm</p>
            </st>
            <sec>
               <st>
                  <p>Topology of the network</p>
               </st>
               <p>The proposed algorithm is an extension of the GGM inference approach of <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Its accuracy of correctly recovering the <it>topology </it>of the partial correlation graph has been established, e.g., in <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. </p>
               <p>However, it is well known that a directed Bayesian network and the corresponding undirected graph are not necessarily topologically identical: in the undirected graph for computing the partial correlations one conditions on all other nodes whereas in the directed graph one conditions only on a subset of nodes, in order to avoid conditioning "on the future" (i.e. on the dependent nodes). Therefore, it is critical to evaluate to what extent full order partial correlations are reasonable approximations for lower order partial correlations. This has already been investigated intensively by <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> who showed that in certain situations (sparse graphs, faithfulness assumption etc.) lower order partial correlations may be used as approximate substitute of full conditional correlations. Therefore, in the proposed algorithm we adopt the very same argument but apply it in the different direction, i.e. we approximate lower order partial correlation by full order partial correlation.</p>
            </sec>
            <sec>
               <st>
                  <p>Node ordering</p>
               </st>
               <p>A second approximation implicit in our algorithm concerns the determination of the ordering of the nodes, which is done by multiple testing of pairwise ratios of standardized partial variances. We have conducted a number of numerical simulations (data not shown) that indicate that for randomly simulated DAGs the ordering of the nodes is indeed well reflected in the partial variances, as expected.</p>
               <p>However, from variable selection in linear models it is also known that the partial variance (or the related <it>R</it><sup>2</sup>) may not always be a reliable indicator for variable importance. Nevertheless, the partial ordering of nodes according to SPV and the implicit model selection in the underlying regressions is a very different procedure in comparison to the standard variable selection approaches, in which the increase or decrease of the <it>R</it><sup>2 </sup>is taken as indicator of whether or not a variable is to be included, or a decomposition of <it>R</it><sup>2 </sup>is sought [for a review see, e.g., <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>]. The distinctive feature of our procedure is that by performing all tests log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) &#8800; 0 simultaneously we consider all <it>p </it>regression equations at once, even if the final feature selection occurs only locally on the level of an individual regression.</p>
               <p>It is also noteworthy that, as we impose directionality from the less well explained variable (large SPV, "exogenous", "independent") to the one with relatively lower SPV (well explained, "endogenous", "dependent" variable), we effectively choose the direction with the relatively <it>smaller </it>regression coefficient (conditional that the corresponding partial correlation is also significant).</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Further properties of the heuristic algorithm and of the resulting graphs</p>
            </st>
            <p>The simple heuristic network discovery algorithm exhibits a number of further properties worth noting:</p>
            <p>1. The estimated partially directed network cannot contain any (partially) directed cycles. For instance, it is not possible for a graph to contain a pattern such as <it>A </it>&#8594; <it>B </it>&#8594; <it>A</it>. This example would imply SPV<sub><it>A </it></sub>> SPV<sub><it>B </it></sub>> SPV<sub><it>A</it></sub>, which is a contradiction. As a consequence, the subgraph containing the directed edges only is also acyclic (and hence a DAG).</p>
            <p>2. The assignment of directionality is transitive. If there is a directed edge from <it>A </it>to <it>B </it>and from <it>B </it>to <it>C </it>then there must also be a directed edge from <it>A </it>to <it>C</it>. Note however, that actual inclusion of a directed edge into the causal network is conditional on a non-zero partial correlation coefficient.</p>
            <p>3. As the algorithm relies on correlations as input, causal processes that produce the same correlation matrix lead to the same inferred graph, and hence are indistinguishable. The existence of such equivalence classes is well known for SEMs <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> and also for Bayesian belief networks <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
            <p>4. The proposed algorithm is scale-invariant by construction. Hence, a (linear) change in any of units of the data has no effect on the overall estimated partially directed network, and the implied causal relations.</p>
            <p>5. We emphasize that the partially directed network is <it>not </it>the chain graph representing the equivalence class of the causal network that is obtained by considering only its directed edges &#8211; see <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
            <p>6. The computational complexity of the algorithm is <it>O</it>(<it>p</it><sup>3</sup>). Hence, it is no more expensive than computing the partial correlation graph, and thus allows for estimation of networks containing in the order of thousands and more nodes.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of a plant expression data set</p>
            </st>
            <p>To illustrate our algorithm for discovering causal structure, we applied the approach to a real world data example. Specifically, we reanalyzed expression time series resulting from an experiment investigating the impact of the diurnal cycle on the starch metabolism of <it>Arabidopsis thaliana </it><abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. This is the same data set we used in a sister paper concerning the estimation of a vector autoregressive model <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
            <p>The data are gene expression time series measurements collected at 11 different time points (0, 1, 2, 4, 8, 12, 13, 14, 16, 20, and 24 hours after the start of the experiment). The corresponding calibrated signal intensities for 22,814 genes/probe sets and for two biological replicates are available from the NASCArrays repository, experiment no. 60 <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. After log-transforming the data we filtered out all genes containing missing values and whose maximum signal intensity value was lower than 5 on a log-base 2 scale. Subsequently, we applied the periodicity test of <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> to identify the probes associated with the day-night cycle. As a result, a subset of 800 genes remained for further analysis.</p>
            <p>In order to estimate the correlation matrix for the 800 genes described by the data set we employed the dynamical correlation shrinkage estimator of <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> as this takes account of the autocorrelation. The corresponding correlation graph is displayed in Figure <figr fid="F1">1</figr>. It shows the 150 edges with the largest absolute values of correlation. This graph is very hard to interpret, the branches do not have any immediate or intuitive meaning (a complete annotation of the nodes can be found along with the dataset itself in the R package "GeneNet" <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>). For instance, there are no hubs as typically observed in biological networks <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Correlation network inferred from the <it>Arabidopsis thaliana </it>data</p>
               </caption>
               <text>
                  <p>Correlation network inferred from the <it>Arabidopsis thaliana </it>data. The solid and dotted lines indicate positive and negative correlation coefficients, respectively, and the line intensity denotes their strength. The network displays the 150 edges with the largest absolute correlation. For annotation of the nodes in this graph see the electronic information contained in the R package "GeneNet" [40] and the original data paper [35].</p>
               </text>
               <graphic file="1752-0509-1-37-1"/>
            </fig>
            <p>This is in great contrast to the partially directed partial correlation graph. For this specific data set, by multiple testing of the factor <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula> we identified 6, 102 significant edges connecting 669 nodes. For the second factor <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>, determined whether edges are directed, the distribution of log(<inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>) is displayed in Figure <figr fid="F2">2</figr>. The null distribution (dashed line) follows a normal distribution and characterizes the edges that cannot be directed. The alternative distribution (solid line) coincides with the directed edges. In total, we found 15, 928 significant directions.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Distribution of log <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> for the <it>Arabidopsis thaliana </it>data</p>
               </caption>
               <text>
                  <p>Distribution of log <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> for the <it>Arabidopsis thaliana </it>data. The null distribution is depicted by the dashed line; it follows a normal distribution with zero mean and a standard deviation of 0.014. The solid line signifies the alternative distribution. The empirical distribution (indicated by the histogram) is composed of the null distribution (<it>&#951;</it><sub>0 </sub>= 0.8995) and of the alternative distribution (<it>&#951;</it><sub><it>A </it></sub>= 0.1005).</p>
               </text>
               <graphic file="1752-0509-1-37-2"/>
            </fig>
            <p>To construct the network, we projected upon the significant edges (factor <inline-formula><m:math name="1752-0509-1-37-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi mathvariant="script">A</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=bq8bbaa@382B@</m:annotation></m:semantics></m:math></inline-formula>) the significant directions (factor <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula>). In the network of significant associations, 1,216 directions were significant. Note that the fraction of significant directions is by far greater in the subset of the significant partial correlations than in the complete set of all partial correlations. This agrees with the intuitive notion, that causal influences can only be attributed to existing connections between variables.</p>
            <p>The resulting partially causal network is shown in Figure <figr fid="F3">3</figr>. For reasons of clarity we show only the subnetwork containing the 150 most significant edges, which connect 107 nodes. This graph exhibits a clear "hub" connectivity structure (nodes filled with red color). A prominent example for this is node 570, others are 81, 558, 783 and a few more genes. We see that many of the hub nodes have mostly outgoing arcs, which is indicative for key regulatory genes. This applies, e.g., to node 570, an AP2 transcription factor, or to node 81, a gene involved in DNA-directed RNA polymerase. An interesting aspect of the partially causal network is the web of highly connected genes (colored yellow in the lower right corner of Figure <figr fid="F3">3</figr>), which we hypothesize to constitute some form of a functional module. In this module, it is not possible to determine any directions, which could be due to complex interactions among the nodes of the module. Node 627 is another hub in the network that connects the functional module with the rest of the network and which according to the annotation of <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> encodes a protein of unknown function.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Partially causal network inferred from the <it>Arabidopsis thaliana </it>data by the method introduced in this paper &#8211; note the difference to the correlation network of Figure 1</p>
               </caption>
               <text>
                  <p>Partially causal network inferred from the <it>Arabidopsis thaliana </it>data by the method introduced in this paper &#8211; note the difference to the correlation network of Figure 1. The topology of the partially causal network is identical to that of a partial correlation graph (GGM, CIG). However, edges with significant directionality (as indicated by a factor <inline-formula><m:math name="1752-0509-1-37-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mi>&#8492;</m:mi><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaaliab=Xsicbaa@3788@</m:annotation></m:semantics></m:math></inline-formula> that is significantly smaller or larger than one) are oriented.</p>
               </text>
               <graphic file="1752-0509-1-37-3"/>
            </fig>
            <p>We also see that the partially directed network contains both directed and undirected nodes. This is a distinct advantage of the present approach. Unlike, e.g., a vector autoregressive model <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, it does not <it>force </it>directions onto the edges.</p>
            <p>Finally, in order to investigate the stability of the inferred partial causal network, we randomly removed data points from the sample, and repeatedly reconstructed the network from the reduced data set. In all cases the general topological structure of the network remained intact, which indicates that this is a signal inherent in the data. This is also confirmed by the analysis using vector autoregressions <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Methods for exploring causal structures in high-dimensional data are growing in importance, particularly in the study of complex biological, medical and financial systems. As a first (and often only) analysis step these data are explored using correlation networks.</p>
         <p>Here we have suggested a simple heuristic algorithm that, starting from a (positive definite) correlation matrix, infers a partially directed network that in turn allows generating causal hypotheses of how the data were generated. Our approach is approximate, but it allows analysis of high-dimensional small sampled data, and its computational complexity is very modest. Thus, our heuristic is likely to be applicable whenever a correlation network is computed, and therefore is suitable for screening large-scale data set for causal structure.</p>
         <p>Nevertheless, there a several lines along which this method could be extended. For instance, non-linear effects could be accounted for by employing entropy criteria, or by using higher order moments <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Furthermore, more sophisticated algorithms may be used to enhance the approximation of lower order partial correlations or the inference of the ordering of the nodes. However, ultimately this would lead to a method similar to the PC algorithm <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>Note that the PC algorithm is more refined than our algorithm, primarily due to additional steps that aim at removing spurious edges (i.e. those edges that are induced between otherwise uncorrelated parent nodes by conditioning on a common child node). However, these iterative refinements may be very time consuming, in particular for high-dimensional graphs.</p>
         <p>In contrast, our procedure is non-iterative and therefore both computationally and algorithmically (nearly) as simple as a correlation network. Nevertheless, it still enables the discovery of partially directed processes underlying the data.</p>
         <p>In summary, we recommend our approach as a procedure for exploratory screening for causal mechanisms. Subsequently, the resulting hypotheses may then form the basis for more refined analyzes, such as full Bayesian network modeling.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>Both authors participated in the development of the methodology and wrote the manuscript. RO carried out all analyzes. All authors approved of the final version of the manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>The method is implemented in the "GeneNet" R package (version 1.2.0), available from CRAN and from <url>http://strimmerlab.org/software/genets/</url>. The software includes an R script for reproducing the network analysis of the <it>Arabidopsis thaliana </it>data.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was in part supported by an "Emmy Noether" excellence grant of the Deutsche Forschungsgemeinschaft (to K.S.).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Mantegna</snm>
                  <fnm>RN</fnm>
               </au>
               <au>
                  <snm>Stanley</snm>
                  <fnm>HE</fnm>
               </au>
            </aug>
            <source>An Introduction to Econophysics: Correlations and Complexity in Finance</source>
            <publisher>Cambridge, UK: Cambridge University Press</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Clustering and information in correlation based financial networks</p>
            </title>
            <aug>
               <au>
                  <snm>Onnela</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Kaski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kert&#233;sz</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Eur Phys J B</source>
            <pubdate>2004</pubdate>
            <volume>38</volume>
            <fpage>353</fpage>
            <lpage>362</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1140/epjb/e2004-00128-7</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Statistical analysis of financial networks</p>
            </title>
            <aug>
               <au>
                  <snm>Boginski</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Butenko</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pardalos</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>Comp Stat Data Anal</source>
            <pubdate>2005</pubdate>
            <volume>48</volume>
            <fpage>431</fpage>
            <lpage>443</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.csda.2004.02.004</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <aug>
               <au>
                  <snm>Shipley</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Cause and Correlation in Biology</source>
            <publisher>Cambridge University Press</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks</p>
            </title>
            <aug>
               <au>
                  <snm>Butte</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Tamayo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Slonim</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Kohane</snm>
                  <fnm>IS</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>12182</fpage>
            <lpage>12186</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17315</pubid>
                  <pubid idtype="pmpid" link="fulltext">11027309</pubid>
                  <pubid idtype="doi">10.1073/pnas.220392197</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Conservation and evolution of gene coexpression networks in human and chimpanzee brains</p>
            </title>
            <aug>
               <au>
                  <snm>Oldham</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Horvath</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Geschwind</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <fpage>17973</fpage>
            <lpage>17978</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1693857</pubid>
                  <pubid idtype="pmpid" link="fulltext">17101986</pubid>
                  <pubid idtype="doi">10.1073/pnas.0605938103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>On the analysis and interpretation of correlations in metabolomic data</p>
            </title>
            <aug>
               <au>
                  <snm>Steuer</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2006</pubdate>
            <volume>151</volume>
            <fpage>151</fpage>
            <lpage>158</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/bib/bbl009</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A tool for filtering information in complex systems</p>
            </title>
            <aug>
               <au>
                  <snm>Tumminello</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aste</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Di Matteo</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mantegna</snm>
                  <fnm>RN</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sc USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>10421</fpage>
            <lpage>10426</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.0500298102</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <aug>
               <au>
                  <snm>Pearl</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Causality: Models, Reasoning, and Inference</source>
            <publisher>Cambridge, UK: Cambridge University Press</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B10">
            <aug>
               <au>
                  <snm>Freedman</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Statistical Models: Theory and Practice</source>
            <publisher>Cambridge, UK: Cambridge University Press</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Linear recursive equations, covariance selection, and path analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Wermuth</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>J Amer Statist Assoc</source>
            <pubdate>1980</pubdate>
            <volume>75</volume>
            <fpage>963</fpage>
            <lpage>972</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2287189</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Gaussian influence diagrams</p>
            </title>
            <aug>
               <au>
                  <snm>Schachter</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Kenley</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Management Sci</source>
            <pubdate>1989</pubdate>
            <volume>35</volume>
            <fpage>527</fpage>
            <lpage>550</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The max-min hill-climbing Bayesian network structure learning algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Tsamardinos</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Aliferis</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>Machine Learning</source>
            <pubdate>2006</pubdate>
            <volume>65</volume>
            <fpage>31</fpage>
            <lpage>78</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s10994-006-6889-7</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <aug>
               <au>
                  <snm>Spirtes</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Glymour</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Scheines</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Causation, Prediction, and Search</source>
            <publisher>MIT Press</publisher>
            <edition>2</edition>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Estimating high-dimensional directed acyclic graphs with the PC-algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Kalisch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>B&#252;hlmann</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Machine Learn Res</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>613</fpage>
            <lpage>636</lpage>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A linear non-Gaussian acyclic model for causal discovery</p>
            </title>
            <aug>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hoyer</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Hyv&#228;rinen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kerminen</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Machine Learn Res</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>2003</fpage>
            <lpage>2030</lpage>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Discovery of meaningful associations in genomic data using partial correlation coefficients</p>
            </title>
            <aug>
               <au>
                  <snm>de la Fuente</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bing</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hoeschele</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Mendes</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>3565</fpage>
            <lpage>3574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth445</pubid>
                  <pubid idtype="pmpid" link="fulltext">15284096</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Sparse graphical models for exploring gene expression data</p>
            </title>
            <aug>
               <au>
                  <snm>Dobra</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hans</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Nevins</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>West</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Multiv Anal</source>
            <pubdate>2004</pubdate>
            <volume>90</volume>
            <fpage>196</fpage>
            <lpage>212</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.jmva.2004.02.009</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>An empirical Bayes approach to inferring large-scale gene association networks</p>
            </title>
            <aug>
               <au>
                  <snm>Sch&#228;fer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>754</fpage>
            <lpage>764</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti062</pubid>
                  <pubid idtype="pmpid" link="fulltext">15479708</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Sch&#228;fer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Statist Appl Genet Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>4</volume>
            <fpage>32</fpage>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Low-order conditional independence graphs for inferring genetic networks</p>
            </title>
            <aug>
               <au>
                  <snm>Wille</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>B&#252;hlmann</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Statist Appl Genet Mol Biol</source>
            <pubdate>2006</pubdate>
            <volume>5</volume>
            <fpage>1</fpage>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gui</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Biostatistics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>302</fpage>
            <lpage>317</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biostatistics/kxj008</pubid>
                  <pubid idtype="pmpid" link="fulltext">16326758</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Linear dependencies represented by chain graphs</p>
            </title>
            <aug>
               <au>
                  <snm>Cox</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Wermuth</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Statistical Science</source>
            <pubdate>1993</pubdate>
            <volume>8</volume>
            <fpage>204</fpage>
            <lpage>218</lpage>
         </bibl>
         <bibl id="B24">
            <aug>
               <au>
                  <snm>Whittaker</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Graphical Models in Applied Multivariate Statistics</source>
            <publisher>New York: Wiley</publisher>
            <pubdate>1990</pubdate>
         </bibl>
         <bibl id="B25">
            <aug>
               <au>
                  <snm>Studen&#253;</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Probabilistic Conditional Independence Structures</source>
            <publisher>Springer</publisher>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Collinearity and least squares regression (with discussion)</p>
            </title>
            <aug>
               <au>
                  <snm>Stewart</snm>
                  <fnm>GW</fnm>
               </au>
            </aug>
            <source>Statist Sci</source>
            <pubdate>1987</pubdate>
            <volume>2</volume>
            <fpage>68</fpage>
            <lpage>100</lpage>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Inferring gene dependency networks from genomic longitudinal data: a functional data approach</p>
            </title>
            <aug>
               <au>
                  <snm>Opgen-Rhein</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>REVSTAT</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>53</fpage>
            <lpage>65</lpage>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Large-scale simultaneous hypothesis testing: the choice of a null hypothesis</p>
            </title>
            <aug>
               <au>
                  <snm>Efron</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Amer Statist Assoc</source>
            <pubdate>2004</pubdate>
            <volume>99</volume>
            <fpage>96</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1198/016214504000000089</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>On a distribution yielding the error functions of several well known statistics</p>
            </title>
            <aug>
               <au>
                  <snm>Fisher</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Proc Intl Congr Math</source>
            <pubdate>1924</pubdate>
            <volume>2</volume>
            <fpage>805</fpage>
            <lpage>813</lpage>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Comparative evaluation of reverse engineering gene regulatory networks with relevance networks, graphical Gaussian models and Bayesian networks</p>
            </title>
            <aug>
               <au>
                  <snm>Werhli</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Grzegorczyk</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Husmeier</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>2523</fpage>
            <lpage>2531</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl391</pubid>
                  <pubid idtype="pmpid" link="fulltext">16844710</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>A robust procedure for Gaussian graphical model search from microarray data with <it>p </it>larger than <it>n</it></p>
            </title>
            <aug>
               <au>
                  <snm>Castelo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Roverato</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Machine Learn Res</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Relative importance in linear regression in R: the package relaimpo</p>
            </title>
            <aug>
               <au>
                  <snm>Gr&#246;mping</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>J Statist Soft</source>
            <pubdate>2006</pubdate>
            <volume>17</volume>
            <fpage>1</fpage>
         </bibl>
         <bibl id="B33">
            <aug>
               <au>
                  <snm>Bollen</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>Structural Equations With Latent Variables</source>
            <publisher>John Wiley &amp; Sons</publisher>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Learning equivalence classes of Bayesian-network structures</p>
            </title>
            <aug>
               <au>
                  <snm>Chickering</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>J Machine Learn Res</source>
            <pubdate>2002</pubdate>
            <volume>2</volume>
            <fpage>445</fpage>
            <lpage>498</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1162/153244302760200696</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Diurnal changes in the transcriptom encoding enzymes of starch metabolism provide evidence for both transcriptionaland posttranscriptional regulation of starch metabolism inArabidopsis leaves</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Chia</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Thorneycroft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chapple</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dunstan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hylton</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>SCZAM</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>136</volume>
            <fpage>2687</fpage>
            <lpage>2699</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523333</pubid>
                  <pubid idtype="pmpid" link="fulltext">15347792</pubid>
                  <pubid idtype="doi">10.1104/pp.104.044347</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Learning causal networks from systems biology time course data: an effective model selection procedure for the vector autoregressive process</p>
            </title>
            <aug>
               <au>
                  <snm>Opgen-Rhein</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <issue>Suppl 2</issue>
            <fpage>S3</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1892072</pubid>
                  <pubid idtype="pmpid" link="fulltext">17493252</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-8-S2-S3</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>NASCArrays: the Nottingham Arabidopsis Stock Centre's microarray database</p>
            </title>
            <url>http://affymetrix.arabidopsis.info/narrays/experimentbrowse.pl</url>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Identifying periodically expressed transcripts in microarray time series data</p>
            </title>
            <aug>
               <au>
                  <snm>Wichert</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fokianos</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>5</fpage>
            <lpage>20</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg364</pubid>
                  <pubid idtype="pmpid" link="fulltext">14693803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Using regularized dynamic correlation to infer gene dependency networks from time-series microarray data</p>
            </title>
            <aug>
               <au>
                  <snm>Opgen-Rhein</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Proceedings of the 4th International Workshop on Computational Systems Biology (WCSB 2006), Tampere</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>73</fpage>
            <lpage>76</lpage>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Reverse engineering genetic networks using the "GeneNet" package</p>
            </title>
            <aug>
               <au>
                  <snm>Sch&#228;fer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Opgen-Rhein</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>R News</source>
            <pubdate>2006</pubdate>
            <volume>6/5</volume>
            <fpage>50</fpage>
            <lpage>53</lpage>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Hierarchical organsation of modularity in metabolic networks</p>
            </title>
            <aug>
               <au>
                  <snm>Ravasz</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Somera</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Mongru</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>A-L</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>297</volume>
            <fpage>1551</fpage>
            <lpage>1555</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1073374</pubid>
                  <pubid idtype="pmpid" link="fulltext">12202830</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Network biology: understanding the cell's functional organization</p>
            </title>
            <aug>
               <au>
                  <snm>Barab&#225;si</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Oltvai</snm>
                  <fnm>ZN</fnm>
               </au>
            </aug>
            <source>Nature Rev Genetics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>101</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/nrg1272</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
