<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1752-0509-2-34</ui>
   <ji>1752-0509</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Using genetic markers to orient the edges in quantitative trait networks: The NEO software</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Aten</snm>
               <mi>E</mi>
               <fnm>Jason</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>j.e.aten@gmail.com</email>
            </au>
            <au id="A2">
               <snm>Fuller</snm>
               <mi>F</mi>
               <fnm>Tova</fnm>
               <insr iid="I1"/>
               <email>msmudphud@gmail.com</email>
            </au>
            <au id="A3">
               <snm>Lusis</snm>
               <mi>J</mi>
               <fnm>Aldons</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>jlusis@mednet.ucla.edu</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Horvath</snm>
               <fnm>Steve</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>shorvath@mednet.ucla.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, USA</p>
            </ins>
            <ins id="I2">
               <p>Biomathematics, David Geffen School of Medicine, University of California, Los Angeles, USA</p>
            </ins>
            <ins id="I3">
               <p>Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, USA</p>
            </ins>
            <ins id="I4">
               <p>Biostatistics, School of Public Health, University of California, Los Angeles, USA</p>
            </ins>
         </insg>
         <source>BMC Systems Biology</source>
         <issn>1752-0509</issn>
         <pubdate>2008</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>34</fpage>
         <url>http://www.biomedcentral.com/1752-0509/2/34</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18412962</pubid>
               <pubid idtype="doi">10.1186/1752-0509-2-34</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>08</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>15</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>15</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Aten et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Systems genetic studies have been used to identify genetic loci that affect transcript abundances and clinical traits such as body weight. The pairwise correlations between gene expression traits and/or clinical traits can be used to define undirected trait networks. Several authors have argued that genetic markers (e.g expression quantitative trait loci, eQTLs) can serve as causal anchors for orienting the edges of a trait network. The availability of hundreds of thousands of genetic markers poses new challenges: how to relate (anchor) traits to multiple genetic markers, how to score the genetic evidence in favor of an edge orientation, and how to weigh the information from multiple markers.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We develop and implement Network Edge Orienting (NEO) methods and software that address the challenges of inferring unconfounded and directed gene networks from microarray-derived gene expression data by integrating mRNA levels with genetic marker data and Structural Equation Model (SEM) comparisons. The NEO software implements several manual and automatic methods for incorporating genetic information to anchor traits. The networks are oriented by considering each edge separately, thus reducing error propagation. To summarize the genetic evidence in favor of a given edge orientation, we propose Local SEM-based Edge Orienting (LEO) scores that compare the fit of several competing causal graphs. SEM fitting indices allow the user to assess local and overall model fit. The NEO software allows the user to carry out a robustness analysis with regard to genetic marker selection. We demonstrate the utility of NEO by recovering known causal relationships in the sterol homeostasis pathway using liver gene expression data from an F2 mouse cross. Further, we use NEO to study the relationship between a disease gene and a biologically important gene co-expression module in liver tissue.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The NEO software can be used to orient the edges of gene co-expression networks or quantitative trait networks if the edges can be anchored to genetic marker data. R software tutorials, data, and supplementary material can be downloaded from: <url>http://www.genetics.ucla.edu/labs/horvath/aten/NEO</url>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The pairwise relationships between different clinical traits (e.g. cholesterol level) and/or gene expression traits (e.g. mRNA levels) have been successfully described with undirected gene co-expression networks <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. While gene expression traits (profiles) and clinical traits represent different quantities, both can be described in undirected <it>trait networks</it>. By definition, these undirected networks cannot be used to describe causal relationships between the traits. Causal information can be encoded by directed networks where <it>A </it>&#8594; <it>B </it>if trait <it>A </it>causally influences trait <it>B</it>. We refer to the process of assigning a causal direction to at least some of the edges in a trait network as 'edge orienting'. Experimental edge orienting approaches include transgenic modifications, viral-mediated over-expression, and chemical perturbation of genes. Edge orienting methods can also be based on various approaches that involve multiple perturbations, such as genetic- and time series experiments <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, or by integrating protein interaction and gene expression data <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>Using genetic markers for orienting the edges of trait networks generated in genetic experiments provides significant statistical power and specificity for recovering directed edges <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Since randomization is the most convincing method for establishing causal relationships between two traits <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>, it is natural to make use of genetically randomized genotypes (implied by Mendel's laws) to derive causality tests that are less susceptible to confounding by hidden variables <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>. If a trait <it>A </it>is significantly associated with a genetic marker <it>M</it>, variation in <it>M </it>must be a cause of variation in <it>A </it>(denoted by <it>M </it>&#8594; <it>A</it>) since the randomization of marker alleles during meiosis precedes their effect on trait <it>A</it>. Since the orientation of the edge between <it>M </it>and <it>A </it>is unambiguous, <it>M </it>is referred to as a causal anchor of <it>A </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>We follow the convention of path analysis to represent a causal model by a directed graph. For example, the directed graph <it>M </it>&#8594; <it>A </it>&#8594; <it>B </it>implies that the genetic marker <it>M </it>has a causal effect on trait <it>A</it>, which in turn has a causal effect on trait <it>B</it>. A causal graph encodes independencies between variables. Conditional independence can be determined by the graphical property of d-separation <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. If two traits <it>A </it>and <it>B </it>are d-separated in the graph by a set of variables <it>S</it>, then the two traits are independent given the variables in <it>S</it>. For example, <it>M </it>&#8594; <it>A </it>&#8594; <it>B </it>implies that <it>M </it>and <it>B </it>are independent after conditioning on <it>A</it>.</p>
         <p>D-separation predicts the correlational consequences of conditioning in the causal graph <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. By testing the correlational predictions and assuming no false independencies (faithfulness assumption), one can sometimes orient edges using observational data alone <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Correlation-based tests of causal models</p>
            </st>
            <p>For simplicity, we assume that the genetic markers are single nucleotide polymorphisms (SNPs). For a given sample (e.g. a mouse), a bi-allelic SNP can take on one of three possible genotypes. By default, we assume an additive genetic effect and encode these genotypes as 0, 1, or 2, but alternative marker codings could also be considered. To quantify the linear relationship between a SNP marker <it>M </it>and a trait <it>A</it>, we use the correlation coefficient <it>cor</it>(<it>M</it>, <it>A</it>). Ordinal variables are routinely used in path analysis and structural equation modelling <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B40">40</abbr></abbrgrp>.</p>
            <p>To determine whether trait <it>A </it>mediates the effect of marker <it>M </it>on trait <it>B </it>(<it>M </it>&#8594; <it>A </it>&#8594; <it>B</it>) one can assess how conditioning on <it>A </it>affects the correlation between <it>M </it>and <it>B</it>. To quantify the linear relationship between <it>M </it>and <it>B </it>after conditioning on <it>A</it>, we use the partial correlation coefficient:</p>
            <p>
               <display-formula id="M1">
                  <m:math name="1752-0509-2-34-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>c</m:mi>
                           <m:mi>o</m:mi>
                           <m:mi>r</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>M</m:mi>
                           <m:mo>,</m:mo>
                           <m:mi>B</m:mi>
                           <m:mo>|</m:mo>
                           <m:mi>A</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>c</m:mi>
                                 <m:mi>o</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>M</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>B</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mi>c</m:mi>
                                 <m:mi>o</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>M</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>A</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mi>c</m:mi>
                                 <m:mi>o</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mi>B</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>A</m:mi>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:msqrt>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>c</m:mi>
                                       <m:mi>o</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>M</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>c</m:mi>
                                       <m:mi>o</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:msup>
                                          <m:mrow>
                                             <m:mo stretchy="false">(</m:mo>
                                             <m:mi>B</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>A</m:mi>
                                             <m:mo stretchy="false">)</m:mo>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:msqrt>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4yamMaem4Ba8MaemOCaiNaeiikaGIaemyta0KaeiilaWIaemOqaiKaeiiFaWNaemyqaeKaeiykaKIaeyypa0tcfa4aaSaaaeaacqWGJbWycqWGVbWBcqWGYbGCcqGGOaakcqWGnbqtcqGGSaalcqWGcbGqcqGGPaqkcqGHsislcqWGJbWycqWGVbWBcqWGYbGCcqGGOaakcqWGnbqtcqGGSaalcqWGbbqqcqGGPaqkcqWGJbWycqWGVbWBcqWGYbGCcqGGOaakcqWGcbGqcqGGSaalcqWGbbqqcqGGPaqkaeaadaGcaaqaaiabcIcaOiabigdaXiabgkHiTiabdogaJjabd+gaVjabdkhaYjabcIcaOiabd2eanjabcYcaSiabdgeabjabcMcaPmaaCaaabeqaaiabikdaYaaacqGGPaqkcqGGOaakcqaIXaqmcqGHsislcqWGJbWycqWGVbWBcqWGYbGCcqGGOaakcqWGcbGqcqGGSaalcqWGbbqqcqGGPaqkdaahaaqabeaacqaIYaGmaaGaeiykaKcabeaaaaGccqGGUaGlaaa@70AD@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>If the causal model <it>M </it>&#8594; <it>A </it>&#8594; <it>B </it>is correct, then the partial correlation coefficient <it>cor</it>(<it>M</it>, <it>B</it>|<it>A</it>) is expected to be 0.</p>
            <p>We use Fisher's Z transform to assess the statistical significance of a sample correlation coefficient <it>r </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp>:</p>
            <p>
               <display-formula>
                  <m:math name="1752-0509-2-34-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>Z</m:mi>
                              <m:mrow>
                                 <m:mi>F</m:mi>
                                 <m:mi>i</m:mi>
                                 <m:mi>s</m:mi>
                                 <m:mi>h</m:mi>
                                 <m:mi>e</m:mi>
                                 <m:mi>r</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>r</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mn>0.5</m:mn>
                           <m:msqrt>
                              <m:mrow>
                                 <m:mi>N</m:mi>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>3</m:mn>
                              </m:mrow>
                           </m:msqrt>
                           <m:mtext>&#160;</m:mtext>
                           <m:mi>l</m:mi>
                           <m:mi>o</m:mi>
                           <m:mi>g</m:mi>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:mn>1</m:mn>
                                       <m:mo>+</m:mo>
                                       <m:mi>r</m:mi>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mi>r</m:mi>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                           <m:mo>,</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOwaO1aaSbaaSqaaiabdAeagjabdMgaPjabdohaZjabdIgaOjabdwgaLjabdkhaYbqabaGccqGGOaakcqWGYbGCcqGGPaqkcqGH9aqpcqaIWaamcqGGUaGlcqaI1aqndaGcaaqaaiabd6eaojabgkHiTiabiodaZaWcbeaakiabbccaGiabdYgaSjabd+gaVjabdEgaNnaabmaabaWaaSaaaeaacqaIXaqmcqGHRaWkcqWGYbGCaeaacqaIXaqmcqGHsislcqWGYbGCaaaacaGLOaGaayzkaaGaeiilaWcaaa@4D90@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>N </it>denotes the sample size; <it>Z</it><sub><it>Fisher</it></sub>(<it>r</it>) asymptotically follows a normal distribution (<it>Normal</it>(<it>&#956;</it>, 1) with mean <it>&#956; </it>and variance 1. Under the null hypothesis of zero correlation, <it>&#956; </it>= 0 and <it>Z</it><sub><it>Fisher</it></sub>(<it>r</it>) follows a standard normal distribution. For brevity, we denote the Fisher transformations of the correlation coefficients <it>cor</it>(<it>A</it>, <it>B</it>) and <it>cor</it>(<it>M</it>, <it>B</it>|<it>A</it>) by <it>Z</it>(<it>A</it>, <it>B</it>) = <it>Z</it><sub><it>Fisher</it></sub>(<it>cor</it>(<it>A</it>, <it>B</it>)) and <it>Z</it>(<it>M</it>, <it>B</it>|<it>A</it>) = <it>Z</it><sub><it>Fisher</it></sub>(<it>cor</it>(<it>M</it>, <it>B</it>|<it>A</it>)), respectively.</p>
            <p>If the causal graph <it>M </it>&#8594; <it>A </it>&#8594; <it>B </it>(Figure <figr fid="F1">1a</figr>) is correct, <it>Z</it>(<it>M</it>, <it>B</it>|<it>A</it>) follows a standard normal distribution. Thus, if the p-value corresponding to <it>Z</it>(<it>M</it>, <it>B</it>|<it>A</it>) is high (non-significant), the data fit the assumed causal graph. Using path analysis rules, the causal graph <it>M </it>&#8594; <it>A </it>&#8594; <it>B </it>(Figure <figr fid="F1">1a</figr>) implies the following relationships between the correlation coefficients</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Approaches for genetic marker-based causal inference</p>
               </caption>
               <text>
                  <p><b>Approaches for genetic marker-based causal inference</b>. Here we contrast different approaches for causality testing based on genetic markers. (a) single marker edge orienting involving a candidate pleiotropic anchor (CPA) <it>M</it>. The upper half of (a) shows the starting point of network edge orienting based on a single genetic marker <it>M </it>which is associated with traits <it>A </it>and <it>B</it>. The undirected edge between <it>A </it>and <it>B </it>indicates a significant correlation <it>cor</it>(<it>A</it>, <it>B</it>) between the two traits. The causal model in the lower half of (a) implies the following relationship between the correlation coefficients <it>cor</it>(<it>M</it>, <it>B</it>) = <it>cor</it>(<it>M</it>, <it>A</it>) &#215; <it>cor</it>(<it>A</it>, <it>B</it>). Further it implies that the absolute value of the correlations |<it>cor</it>(<it>M</it>, <it>A</it>)| and |<it>cor</it>(<it>M</it>, <it>B</it>)| are high whereas the partial correlation |<it>cor</it>(<it>M</it>, <it>B</it>|<it>A</it>)| (Eq. 1) is low. Figure (b) generalizes the single marker situation to the case of multiple genetic markers <inline-formula><m:math name="1752-0509-2-34-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>M</m:mi><m:mi>A</m:mi></m:msub><m:mo>=</m:mo><m:mo>{</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>2</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mn>...</m:mn><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aaSbaaSqaaiabdgeabbqabaGccqGH9aqpcqGG7bWEcqWGnbqtdaqhaaWcbaGaemyqaeeabaGaeiikaGIaeGymaeJaeiykaKcaaOGaeiilaWIaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabikdaYiabcMcaPaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabc2ha9baa@40BB@</m:annotation></m:semantics></m:math></inline-formula>. In this case, it is straightforward to generalize single edge orienting scores to multi-marker scores. Figure (c) describes a situation when a set of genetic markers <inline-formula><m:math name="1752-0509-2-34-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>M</m:mi><m:mi>B</m:mi></m:msub><m:mo>=</m:mo><m:mo>{</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>2</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mn>...</m:mn><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aaSbaaSqaaiabdkeacbqabaGccqGH9aqpcqGG7bWEcqWGnbqtdaqhaaWcbaGaemOqaieabaGaeiikaGIaeGymaeJaeiykaKcaaOGaeiilaWIaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabikdaYiabcMcaPaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabc2ha9baa@40C1@</m:annotation></m:semantics></m:math></inline-formula> is also available for trait <it>B</it>. We refer to the <it>M</it><sub><it>B </it></sub>markers as orthogonal causal anchors (OCA) since <inline-formula><m:math name="1752-0509-2-34-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:mi>A</m:mi><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mi>j</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4yamMaem4Ba8MaemOCaiNaeiikaGIaemyqaeKaeiilaWIaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabdQgaQjabcMcaPaaakiabcMcaPaaa@390B@</m:annotation></m:semantics></m:math></inline-formula> is expected to be 0 under the causal model <it>M</it><sub><it>A </it></sub>&#8594; <it>A </it>&#8594; <it>B </it>&#8594; <it>M</it><sub><it>B</it></sub>, the correlation. Using simulation studies, we find that edge scores based on OCAs can be more powerful than those based on CPAs (see Additional File <supplr sid="S1">1</supplr>).</p>
               </text>
               <graphic file="1752-0509-2-34-1"/>
            </fig>
            <p>
               <display-formula id="M2"><it>cor</it>(<it>M</it>, <it>B</it>) = <it>cor</it>(<it>M</it>, <it>A</it>)<it>cor</it>(<it>A, B</it>)</display-formula>
            </p>
            <p>We refer to the marker <it>M </it>as a <it>candidate common pleiotropic anchor </it>(CPA) of <it>A </it>and <it>B</it>. If the expected values of <it>cor</it>(<it>M</it>, <it>A</it>) and <it>cor</it>(<it>A</it>, <it>B</it>) are non-zero and the causal model holds, Eq. (2) implies that the genetic marker <it>M </it>will be significantly correlated with both <it>A </it>and <it>B</it>. Thus, the marker <it>M </it>can be confirmed as a pleiotropic anchor of <it>A </it>and <it>B </it>by confirming the fit of the causal model <it>M </it>&#8594; <it>A </it>&#8594; <it>B</it>. We will now consider a situation where the correlation between <it>A </it>and <it>B </it>stems from a hidden confounder <it>C</it>, i.e. <it>M </it>&#8594; <it>A </it>&#8592; <it>C </it>&#8594; <it>B</it>. The graph implies that <it>A </it>and <it>B </it>are correlated due to the shared confounder <it>C</it>. The correlation <it>cor</it>(<it>M</it>, <it>B</it>) is expected to be 0 since the arrows between <it>M </it>and <it>B </it>collide at <it>A</it>, i.e, <it>M </it>and <it>B </it>are d-separated without conditioning. In this situation <it>Z(M, B) = Z<sub>Fisher</sub>(cor(M, B))</it> follows a standard normal distribution. If the p-value corresponding to Z(M, B) is high (non-significant), the data fit a confounded model. In contrast, the partial correlation <it>cor</it>(<it>M</it>, <it>B</it>|<it>A</it>) is expected to be non-zero since conditioning on <it>A </it>'activates' the causal flow through the collider node, i.e. it induces conditional dependence <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
            <p>The opposite (reactive) causal graph <it>M </it>&#8594; <it>A </it>&#8592; <it>B </it>also implies that the expected value of <it>cor</it>(<it>M</it>, <it>B</it>) is zero since the causal paths collide at <it>A</it>. Conditioning on <it>A </it>activates this collider node, and the partial correlation <it>cor</it>(<it>M</it>, <it>B</it>|<it>A</it>) is expected to be non-zero.</p>
            <p>Similarly, one can show that the model <it>A </it>&#8592; <it>M </it>&#8594; <it>B </it>implies that <it>cor</it>(<it>A</it>, <it>B</it>|<it>M</it>) is expected to be zero. Under this causal model, <it>Z</it>(<it>A</it>, <it>B</it>|<it>M</it>) asymptotically follows a standard normal distribution. In contrast, <it>cor</it>(<it>M</it>, <it>B</it>) is expected to be non-zero.</p>
            <p>These considerations illustrate that one can test the predicted correlational consequences of a causal model and thus evaluate its fit.</p>
            <p>We will now consider the situation of multiple markers (Figure <figr fid="F1">1b</figr>). Denote by</p>
            <p>
               <display-formula id="M3">
                  <m:math name="1752-0509-2-34-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>M</m:mi>
                              <m:mi>A</m:mi>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mo>{</m:mo>
                           <m:msubsup>
                              <m:mi>M</m:mi>
                              <m:mi>A</m:mi>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:msubsup>
                           <m:mo>,</m:mo>
                           <m:msubsup>
                              <m:mi>M</m:mi>
                              <m:mi>A</m:mi>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:mn>2</m:mn>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:msubsup>
                           <m:mo>,</m:mo>
                           <m:mn>...</m:mn>
                           <m:msubsup>
                              <m:mi>M</m:mi>
                              <m:mi>A</m:mi>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mi>K</m:mi>
                                    <m:mi>A</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                           </m:msubsup>
                           <m:mo>}</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aaSbaaSqaaiabdgeabbqabaGccqGH9aqpcqGG7bWEcqWGnbqtdaqhaaWcbaGaemyqaeeabaGaeiikaGIaeGymaeJaeiykaKcaaOGaeiilaWIaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabikdaYiabcMcaPaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabd2eannaaDaaaleaacqWGbbqqaeaacqGGOaakcqWGlbWsdaWgaaadbaGaemyqaeeabeaaliabcMcaPaaakiabc2ha9baa@4782@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>a set of candidate common pleiotropic anchors of <it>A </it>and <it>B</it>. Analogous to Eq. (2), the causal model <it>M</it><sub><it>A </it></sub>&#8594; <it>A </it>&#8594; <it>B </it>implies <inline-formula><m:math name="1752-0509-2-34-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mi>i</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo><m:mo>=</m:mo><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mi>i</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mi>A</m:mi><m:mo stretchy="false">)</m:mo><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:mi>A</m:mi><m:mo>,</m:mo><m:mi>B</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4yamMaem4Ba8MaemOCaiNaeiikaGIaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabdMgaPjabcMcaPaaakiabcYcaSiabdkeacjabcMcaPiabg2da9iabdogaJjabd+gaVjabdkhaYjabcIcaOiabd2eannaaDaaaleaacqWGbbqqaeaacqGGOaakcqWGPbqAcqGGPaqkaaGccqGGSaalcqWGbbqqcqGGPaqkcqWGJbWycqWGVbWBcqWGYbGCcqGGOaakcqWGbbqqcqGGSaalcqWGcbGqcqGGPaqkaaa@500E@</m:annotation></m:semantics></m:math></inline-formula>. The model implies that the partial correlations <inline-formula><m:math name="1752-0509-2-34-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mi>i</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mi>B</m:mi><m:mo>|</m:mo><m:mi>A</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4yamMaem4Ba8MaemOCaiNaeiikaGIaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabdMgaPjabcMcaPaaakiabcMcaPiabcYcaSiabdkeacjabcYha8jabdgeabjabcMcaPaaa@3C6E@</m:annotation></m:semantics></m:math></inline-formula> are expected to be zero, i.e. <inline-formula><m:math name="1752-0509-2-34-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>Z</m:mi><m:mrow><m:mi>f</m:mi><m:mi>i</m:mi><m:mi>s</m:mi><m:mi>h</m:mi><m:mi>e</m:mi><m:mi>r</m:mi></m:mrow></m:msub><m:mo stretchy="false">(</m:mo><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mi>i</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mi>B</m:mi><m:mo>|</m:mo><m:mi>A</m:mi><m:mo stretchy="false">)</m:mo><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOwaO1aaWbaaSqabeaaieWacqWFMbGzcqWFPbqAcqWFZbWCcqWFObaAcqWFLbqzcqWFYbGCaaGccqGGOaakcqWGJbWycqWGVbWBcqWGYbGCcqGGOaakcqWGnbqtdaqhaaWcbaGaemyqaeeabaGaeiikaGIaemyAaKMaeiykaKcaaOGaeiykaKIaeiilaWIaemOqaiKaeiiFaWNaemyqaeKaeiykaKIaeiykaKcaaa@47C0@</m:annotation></m:semantics></m:math></inline-formula> is predicted to follow a standard normal distributions.</p>
            <p>Frequently an additional set of markers <it>M</it><sub><it>B </it></sub>is also available for trait <it>B </it>(Figure <figr fid="F1">1c</figr>). For example, when one marker is available for each trait, i.e. <inline-formula><m:math name="1752-0509-2-34-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>&#8594;</m:mo><m:mi>A</m:mi><m:mo>&#8594;</m:mo><m:mi>B</m:mi><m:mo>&#8592;</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabigdaXiabcMcaPaaakiabgkziUkabdgeabjabgkziUkabdkeacjabgcziSkabd2eannaaDaaaleaacqWGcbGqaeaacqGGOaakcqaIXaqmcqGGPaqkaaaaaa@3DB6@</m:annotation></m:semantics></m:math></inline-formula>, the correlation <inline-formula><m:math name="1752-0509-2-34-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:mi>A</m:mi><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4yamMaem4Ba8MaemOCaiNaeiikaGIaemyqaeKaeiilaWIaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabigdaXiabcMcaPaaakiabcMcaPaaa@389E@</m:annotation></m:semantics></m:math></inline-formula> is expected to be 0 since the causal arrows 'collide' at <it>B </it><abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Geometrically speaking, the expected zero correlation between <it>A </it>and <inline-formula><m:math name="1752-0509-2-34-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabigdaXiabcMcaPaaaaaa@30D4@</m:annotation></m:semantics></m:math></inline-formula> implies that the corresponding standardized vectors are orthogonal. Therefore, we refer to marker <inline-formula><m:math name="1752-0509-2-34-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabigdaXiabcMcaPaaaaaa@30D4@</m:annotation></m:semantics></m:math></inline-formula> as an <b>orthogonal causal anchor </b>(OCA) with respect to the edge <it>A </it>&#8594; <it>B</it>. We will argue that the availability of orthogonal causal anchors significantly improves the recovery of the causal signal (see the simulations in Additional File <supplr sid="S1">1</supplr>). If the model <inline-formula><m:math name="1752-0509-2-34-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>&#8594;</m:mo><m:mi>A</m:mi><m:mo>&#8594;</m:mo><m:mi>B</m:mi><m:mo>&#8592;</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabigdaXiabcMcaPaaakiabgkziUkabdgeabjabgkziUkabdkeacjabgcziSkabd2eannaaDaaaleaacqWGcbGqaeaacqGGOaakcqaIXaqmcqGGPaqkaaaaaa@3DB6@</m:annotation></m:semantics></m:math></inline-formula> is correct, <inline-formula><m:math name="1752-0509-2-34-i13" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mi>B</m:mi><m:mo>|</m:mo><m:mi>A</m:mi><m:mo stretchy="false">)</m:mo><m:mo>,</m:mo><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:mi>A</m:mi><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4yamMaem4Ba8MaemOCaiNaeiikaGIaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabigdaXiabcMcaPaaakiabcYcaSiabdkeacjabcYha8jabdgeabjabcMcaPiabcYcaSiabdogaJjabd+gaVjabdkhaYjabcIcaOiabdgeabjabcYcaSiabd2eannaaDaaaleaacqWGcbGqaeaacqGGOaakcqaIXaqmcqGGPaqkaaGccqGGPaqkaaa@48D2@</m:annotation></m:semantics></m:math></inline-formula> are expected to be zero and <inline-formula><m:math name="1752-0509-2-34-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>Z</m:mi><m:mo stretchy="false">(</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mi>B</m:mi><m:mo>|</m:mo><m:mi>A</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOwaOLaeiikaGIaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabigdaXiabcMcaPaaakiabcYcaSiabdkeacjabcYha8jabdgeabjabcMcaPaaa@3843@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1752-0509-2-34-i15" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>Z</m:mi><m:mo stretchy="false">(</m:mo><m:mi>A</m:mi><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOwaOLaeiikaGIaemyqaeKaeiilaWIaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabigdaXiabcMcaPaaakiabcMcaPaaa@35B8@</m:annotation></m:semantics></m:math></inline-formula> asymptotically follow standard normal distributions.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Single edge simulation study</b>. This document describes our single edge simulation studies involving the LEO.NB.CPA score (Eq. 5) and the LEO.NB.OCA score (Eq. 6). We describe the parameters used in the single edge <it>A </it>&#8592; <it>B </it>simulation model. A hidden confounder <it>C </it>affects the correlation between <it>A </it>and <it>B</it>. The effect of SNP markers on traits <it>A </it>and <it>B </it>is parameterized with the restricted heritabilities. The single edge simulation model is used i) to study the choice of thresholds for the LEO.NB scores, ii) to compare the LEO.NB.CPA with LEO.NB.OCA scores, and iii) to evaluate automatic SNP selection methods.</p>
               </text>
               <file name="1752-0509-2-34-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Figure <figr fid="F1">1(c)</figr> depicts a situation where two sets of genetic markers <inline-formula><m:math name="1752-0509-2-34-i16" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>M</m:mi><m:mi>A</m:mi></m:msub><m:mo>=</m:mo><m:mo>{</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>2</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mn>...</m:mn><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>K</m:mi><m:mi>A</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aaSbaaSqaaiabdgeabbqabaGccqGH9aqpcqGG7bWEcqWGnbqtdaqhaaWcbaGaemyqaeeabaGaeiikaGIaeGymaeJaeiykaKcaaOGaeiilaWIaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabikdaYiabcMcaPaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabd2eannaaDaaaleaacqWGbbqqaeaacqGGOaakcqWGlbWsdaWgaaadbaGaemyqaeeabeaaliabcMcaPaaakiabc2ha9baa@4734@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1752-0509-2-34-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>M</m:mi><m:mi>B</m:mi></m:msub><m:mo>=</m:mo><m:mo>{</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>2</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mn>...</m:mn><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>K</m:mi><m:mi>B</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aaSbaaSqaaiabdkeacbqabaGccqGH9aqpcqGG7bWEcqWGnbqtdaqhaaWcbaGaemOqaieabaGaeiikaGIaeGymaeJaeiykaKcaaOGaeiilaWIaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabikdaYiabcMcaPaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabd2eannaaDaaaleaacqWGcbGqaeaacqGGOaakcqWGlbWsdaWgaaadbaGaemOqaieabeaaliabcMcaPaaakiabc2ha9baa@473E@</m:annotation></m:semantics></m:math></inline-formula> influence traits <it>A </it>and <it>B</it>, respectively. In this case, the correlational consequences become increasingly complicated, which is why we use structural equation models (SEMs) to evaluate the fit of different causal scenarios.</p>
         </sec>
         <sec>
            <st>
               <p>Local SEM-based edge orienting scores</p>
            </st>
            <p>While SEMs can be used to study the fit of multi-trait causal models <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B20">20</abbr></abbrgrp> we only consider the <it>local causal models </it>depicted in Figure <figr fid="F2">2</figr> since the proposed NEO method evaluates the orientation of each edge separately based on the best causal anchors available. The fit of each single marker model in Figure <figr fid="F2">2(a)</figr> can be tested using a chi-square test with 1 degree of freedom. We refer to the resulting p-value as the model p-value. In the Methods section, we review and discuss the use of model p-values for quantifying the fit of a causal model. The main point is that the <it>higher </it>the model p-value, the better the causal model fits the data.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Illustrating the single genetic marker versus multi-marker local SEMs used in the definition of the LEO.NB score</p>
               </caption>
               <text>
                  <p><b>Illustrating the single genetic marker versus multi-marker local SEMs used in the definition of the LEO.NB score</b>. The single genetic marker is denoted by <it>M </it>in (a) and the multiple genetic markers are denoted by <inline-formula><m:math name="1752-0509-2-34-i18" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mi>i</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabdMgaPjabcMcaPaaaaaa@313D@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1752-0509-2-34-i19" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mi>j</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabdQgaQjabcMcaPaaaaaa@3141@</m:annotation></m:semantics></m:math></inline-formula> in (b) and (c). By definition, <it>LEO.NB</it>(<it>A</it>&#707;<it>B</it>) = log<sub>10</sub>{<it>P </it>(model 1)}/{max<sub><it>i</it>>1</sub>{<it>P </it>(model <it>i</it>)}} for a candidate <it>A </it>&#8594; <it>B </it>edge orientation, where the models in the definition are pictured in (a) for single marker LEO.NB scores, and in (b) for multiple marker LEO.NB scores. In (b) we show the orthomarker models used for the LEO.NB.OCA marker aggregation method. The hidden confounder <it>C </it>in model 4 is the causal parent of both <it>A </it>and <it>B</it>, i.e. <it>A </it>&#8592; <it>C </it>&#8594; <it>B</it>. The simulation studies in Additional File <supplr sid="S1">1</supplr> show that the LEO.NB.OCA score can be significantly more powerful than the LEO.NB.CPA score.</p>
               </text>
               <graphic file="1752-0509-2-34-2"/>
            </fig>
            <p>To summarize the genetic evidence in favor of a given edge orientation <it>A </it>&#8594; <it>B</it>, we propose the use of edge orienting scores. The higher the value of an edge orienting score for the orientation <it>A </it>&#8594; <it>B</it>, the stronger genetic evidence favors this causal model.</p>
            <p>In the following, we propose local SEM-based edge orienting (LEO) scores for orientation <it>A </it>&#8594; <it>B</it>. For a single genetic marker <it>M </it>and traits <it>A </it>and <it>B</it>, we consider the 5 different local causal models depicted in Figure <figr fid="F2">2(a)</figr>. Additional single marker models are possible. However, under the constraint that the markers are causal anchors (graphically, arrows flow only from M and not into M), then the five models pictured for nodes (M, A, B) in Figure <figr fid="F2">2(a)</figr> exhaust all possible three node models that both (1) explain <it>A </it>&#8211; <it>B </it>and (<it>M </it>&#8211; <it>A </it>or <it>M </it>&#8211; <it>B</it>) associations and (2) can be tested. The critical technical issue is having degrees of freedom (d.f.) remaining after estimating the model parameters. If the degrees of freedom are 0, the model p-values cannot be calculated. The 5 different local causal models depicted in Figure <figr fid="F2">2(a)</figr> are used to compute the following model p-values: <it>P</it>(<it>M </it>&#8594; <it>A </it>&#8594; <it>B</it>), <it>P</it>(<it>A </it>&#8592; <it>B </it>&#8592; <it>M</it>), <it>P</it>(<it>A </it>&#8592; <it>M </it>&#8594; <it>B</it>), <it>P</it>(<it>M </it>&#8594; <it>A </it>&#8592; <it>B</it>), and <it>P</it>(<it>A </it>&#8594; <it>B </it>&#8592; <it>M</it>). While a detailed analysis should consider all model p-values, we find it useful to summarize the genetic evidence in favor of a given orientation <it>A </it>&#8594; <it>B </it>(model 1) using a single number: the Local SEM-based Edge Orienting Next Best (LEO.NB) score. The LEO.NB score is defined by dividing the model p-value for <it>A </it>&#8594; <it>B </it>by the p-value of the best fitting alternative model, i.e. the best of models 2&#8211;5 in Figure <figr fid="F2">2(a)</figr>. The chi-square test p-value of the best fitting alternative model is the maximum p-value of the alternative causal models. Specifically, we define the single-marker LEO.NB score as follows:</p>
            <p>
               <display-formula id="M4">
                  <m:math name="1752-0509-2-34-i20" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>L</m:mi>
                           <m:mi>E</m:mi>
                           <m:mi>O</m:mi>
                           <m:mo>.</m:mo>
                           <m:mi>N</m:mi>
                           <m:mi>B</m:mi>
                           <m:mo>.</m:mo>
                           <m:mi>S</m:mi>
                           <m:mi>i</m:mi>
                           <m:mi>n</m:mi>
                           <m:mi>g</m:mi>
                           <m:mi>l</m:mi>
                           <m:mi>e</m:mi>
                           <m:mi>M</m:mi>
                           <m:mi>a</m:mi>
                           <m:mi>r</m:mi>
                           <m:mi>k</m:mi>
                           <m:mi>e</m:mi>
                           <m:mi>r</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>A</m:mi>
                           <m:mo>&#8594;</m:mo>
                           <m:mi>B</m:mi>
                           <m:mo>|</m:mo>
                           <m:mi>M</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:msub>
                              <m:mrow>
                                 <m:mi>log</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mn>10</m:mn>
                              </m:mrow>
                           </m:msub>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mfrac>
                                    <m:mrow>
                                       <m:mi>P</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mtext>model&#160;</m:mtext>
                                       <m:mn>1</m:mn>
                                       <m:mo>:</m:mo>
                                       <m:mi>M</m:mi>
                                       <m:mo>&#8594;</m:mo>
                                       <m:mi>A</m:mi>
                                       <m:mo>&#8594;</m:mo>
                                       <m:mi>B</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                    <m:mrow>
                                       <m:mi>max</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:mtable columnalign="left">
                                                <m:mtr columnalign="left">
                                                   <m:mtd columnalign="left">
                                                      <m:mrow>
                                                         <m:mi>P</m:mi>
                                                         <m:mo stretchy="false">(</m:mo>
                                                         <m:mtext>model&#160;</m:mtext>
                                                         <m:mn>2</m:mn>
                                                         <m:mo>:</m:mo>
                                                         <m:mi>M</m:mi>
                                                         <m:mo>&#8594;</m:mo>
                                                         <m:mi>B</m:mi>
                                                         <m:mo>&#8594;</m:mo>
                                                         <m:mi>A</m:mi>
                                                         <m:mo stretchy="false">)</m:mo>
                                                         <m:mo>,</m:mo>
                                                      </m:mrow>
                                                   </m:mtd>
                                                </m:mtr>
                                                <m:mtr columnalign="left">
                                                   <m:mtd columnalign="left">
                                                      <m:mrow>
                                                         <m:mi>P</m:mi>
                                                         <m:mo stretchy="false">(</m:mo>
                                                         <m:mtext>model&#160;</m:mtext>
                                                         <m:mn>3</m:mn>
                                                         <m:mo>:</m:mo>
                                                         <m:mi>A</m:mi>
                                                         <m:mo>&#8592;</m:mo>
                                                         <m:mi>M</m:mi>
                                                         <m:mo>&#8594;</m:mo>
                                                         <m:mi>B</m:mi>
                                                         <m:mo stretchy="false">)</m:mo>
                                                         <m:mo>,</m:mo>
                                                      </m:mrow>
                                                   </m:mtd>
                                                </m:mtr>
                                                <m:mtr columnalign="left">
                                                   <m:mtd columnalign="left">
                                                      <m:mrow>
                                                         <m:mi>P</m:mi>
                                                         <m:mo stretchy="false">(</m:mo>
                                                         <m:mtext>model&#160;</m:mtext>
                                                         <m:mn>4</m:mn>
                                                         <m:mo>:</m:mo>
                                                         <m:mi>M</m:mi>
                                                         <m:mo>&#8594;</m:mo>
                                                         <m:mi>A</m:mi>
                                                         <m:mo>&#8592;</m:mo>
                                                         <m:mi>B</m:mi>
                                                         <m:mo stretchy="false">)</m:mo>
                                                         <m:mo>,</m:mo>
                                                      </m:mrow>
                                                   </m:mtd>
                                                </m:mtr>
                                                <m:mtr columnalign="left">
                                                   <m:mtd columnalign="left">
                                                      <m:mrow>
                                                         <m:mi>P</m:mi>
                                                         <m:mo stretchy="false">(</m:mo>
                                                         <m:mtext>model&#160;</m:mtext>
                                                         <m:mn>5</m:mn>
                                                         <m:mo>:</m:mo>
                                                         <m:mi>M</m:mi>
                                                         <m:mo>&#8594;</m:mo>
                                                         <m:mtext>B</m:mtext>
                                                         <m:mo>&#8592;</m:mo>
                                                         <m:mtext>A)</m:mtext>
                                                      </m:mrow>
                                                   </m:mtd>
                                                </m:mtr>
                                             </m:mtable>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mfrac>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemitaWKaemyrauKaem4ta8KaeiOla4IaemOta4KaemOqaiKaeiOla4Iaem4uamLaemyAaKMaemOBa4Maem4zaCMaemiBaWMaemyzauMaemyta0KaemyyaeMaemOCaiNaem4AaSMaemyzauMaemOCaiNaeiikaGIaemyqaeKaeyOKH4QaemOqaiKaeiiFaWNaemyta0KaeiykaKIaeyypa0JagiiBaWMaei4Ba8Maei4zaC2aaSbaaSqaaiabigdaXiabicdaWaqabaGcdaqadaqcfayaamaalaaabaGaemiuaaLaeiikaGIaeeyBa0Maee4Ba8MaeeizaqMaeeyzauMaeeiBaWMaeeiiaaIaeGymaeJaeiOoaOJaemyta0KaeyOKH4QaemyqaeKaeyOKH4QaemOqaiKaeiykaKcabaGagiyBa0MaeiyyaeMaeiiEaG3aaeWaaeaafaqaaeabbaaaaeaacqWGqbaucqGGOaakcqqGTbqBcqqGVbWBcqqGKbazcqqGLbqzcqqGSbaBcqqGGaaicqaIYaGmcqGG6aGocqWGnbqtcqGHsgIRcqWGcbGqcqGHsgIRcqWGbbqqcqGGPaqkcqGGSaalaeaacqWGqbaucqGGOaakcqqGTbqBcqqGVbWBcqqGKbazcqqGLbqzcqqGSbaBcqqGGaaicqaIZaWmcqGG6aGocqWGbbqqcqGHqgcRcqWGnbqtcqGHsgIRcqWGcbGqcqGGPaqkcqGGSaalaeaacqWGqbaucqGGOaakcqqGTbqBcqqGVbWBcqqGKbazcqqGLbqzcqqGSbaBcqqGGaaicqaI0aancqGG6aGocqWGnbqtcqGHsgIRcqWGbbqqcqGHqgcRcqWGcbGqcqGGPaqkcqGGSaalaeaacqWGqbaucqGGOaakcqqGTbqBcqqGVbWBcqqGKbazcqqGLbqzcqqGSbaBcqqGGaaicqaI1aqncqGG6aGocqWGnbqtcqGHsgIRcqqGcbGqcqGHqgcRcqqGbbqqcqqGPaqkaaaacaGLOaGaayzkaaaaaaGccaGLOaGaayzkaaGaeiOla4caaa@BF7D@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>A positive <it>LEO.NB</it>(<it>A </it>&#8594; <it>B</it>) score indicates that the p-value in favor of model <it>A </it>&#8594; <it>B </it>is higher than that of any of the competing models in Figure <figr fid="F2">2(a)</figr>. A negative LEO.NB score indicates that the <it>A </it>&#8594; <it>B </it>model is inferior to at least one alternative model. In our simulations, we use a threshold of 1 for <it>LEO.NB.SingleMarker</it>(<it>A </it>&#8594; <it>B</it>|<it>M</it>).</p>
            <sec>
               <st>
                  <p>Multi-marker LEO.NB score</p>
               </st>
               <p>It is straightforward to generalize the single marker LEO.NB score (Eq. 4) to a set of genetic markers <inline-formula><m:math name="1752-0509-2-34-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>M</m:mi><m:mi>A</m:mi></m:msub><m:mo>=</m:mo><m:mo>{</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>A</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>2</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mn>...</m:mn><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aaSbaaSqaaiabdgeabbqabaGccqGH9aqpcqGG7bWEcqWGnbqtdaqhaaWcbaGaemyqaeeabaGaeiikaGIaeGymaeJaeiykaKcaaOGaeiilaWIaemyta00aa0baaSqaaiabdgeabbqaaiabcIcaOiabikdaYiabcMcaPaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabc2ha9baa@40BB@</m:annotation></m:semantics></m:math></inline-formula> (Figure <figr fid="F1">1b</figr>). We refer to the resulting edge orienting score as the LEO.NB.CPA score since it is based on the set of candidate pleiotropic anchors <it>M</it><sub><it>A </it></sub>(Eq. 3):</p>
               <p>
                  <display-formula id="M5">
                     <m:math name="1752-0509-2-34-i21" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mi>L</m:mi>
                              <m:mi>E</m:mi>
                              <m:mi>O</m:mi>
                              <m:mo>.</m:mo>
                              <m:mi>N</m:mi>
                              <m:mi>B</m:mi>
                              <m:mo>.</m:mo>
                              <m:mi>C</m:mi>
                              <m:mi>P</m:mi>
                              <m:mi>A</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>A</m:mi>
                              <m:mo>&#8594;</m:mo>
                              <m:mi>B</m:mi>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>M</m:mi>
                                 <m:mi>A</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:msub>
                                 <m:mrow>
                                    <m:mi>log</m:mi>
                                    <m:mo>&#8289;</m:mo>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mn>10</m:mn>
                                 </m:mrow>
                              </m:msub>
                              <m:mrow>
                                 <m:mo>(</m:mo>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mi>P</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mtext>model&#160;</m:mtext>
                                          <m:mn>1</m:mn>
                                          <m:mo>:</m:mo>
                                          <m:msub>
                                             <m:mi>M</m:mi>
                                             <m:mi>A</m:mi>
                                          </m:msub>
                                          <m:mo>&#8594;</m:mo>
                                          <m:mi>A</m:mi>
                                          <m:mo>&#8594;</m:mo>
                                          <m:mi>B</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mi>max</m:mi>
                                          <m:mo>&#8289;</m:mo>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtable columnalign="left">
                                                   <m:mtr columnalign="left">
                                                      <m:mtd columnalign="left">
                                                         <m:mrow>
                                                            <m:mi>P</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mtext>model&#160;</m:mtext>
                                                            <m:mn>2</m:mn>
                                                            <m:mo>:</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>A</m:mi>
                                                            </m:msub>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>B</m:mi>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>A</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mo>,</m:mo>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr columnalign="left">
                                                      <m:mtd columnalign="left">
                                                         <m:mrow>
                                                            <m:mi>P</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mtext>model&#160;</m:mtext>
                                                            <m:mn>3</m:mn>
                                                            <m:mo>:</m:mo>
                                                            <m:mi>A</m:mi>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>A</m:mi>
                                                            </m:msub>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>B</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mo>,</m:mo>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr columnalign="left">
                                                      <m:mtd columnalign="left">
                                                         <m:mrow>
                                                            <m:mi>P</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mtext>model&#160;</m:mtext>
                                                            <m:mn>4</m:mn>
                                                            <m:mo>:</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>A</m:mi>
                                                            </m:msub>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>A</m:mi>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:mi>B</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mo>,</m:mo>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr columnalign="left">
                                                      <m:mtd columnalign="left">
                                                         <m:mrow>
                                                            <m:mi>P</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mtext>model&#160;</m:mtext>
                                                            <m:mn>5</m:mn>
                                                            <m:mo>:</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>A</m:mi>
                                                            </m:msub>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mtext>B</m:mtext>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:mtext>A)</m:mtext>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                </m:mtable>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                                 <m:mo>)</m:mo>
                              </m:mrow>
                              <m:mo>.</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemitaWKaemyrauKaem4ta8KaeiOla4IaemOta4KaemOqaiKaeiOla4Iaem4qamKaemiuaaLaemyqaeKaeiikaGIaemyqaeKaeyOKH4QaemOqaiKaeiiFaWNaemyta00aaSbaaSqaaiabdgeabbqabaGccqGGPaqkcqGH9aqpcyGGSbaBcqGGVbWBcqGGNbWzdaWgaaWcbaGaeGymaeJaeGimaadabeaakmaabmaajuaGbaWaaSaaaeaacqWGqbaucqGGOaakcqqGTbqBcqqGVbWBcqqGKbazcqqGLbqzcqqGSbaBcqqGGaaicqaIXaqmcqGG6aGocqWGnbqtdaWgaaqaaiabdgeabbqabaGaeyOKH4QaemyqaeKaeyOKH4QaemOqaiKaeiykaKcabaGagiyBa0MaeiyyaeMaeiiEaG3aaeWaaeaafaqaaeabbaaaaeaacqWGqbaucqGGOaakcqqGTbqBcqqGVbWBcqqGKbazcqqGLbqzcqqGSbaBcqqGGaaicqaIYaGmcqGG6aGocqWGnbqtdaWgaaqaaiabdgeabbqabaGaeyOKH4QaemOqaiKaeyOKH4QaemyqaeKaeiykaKIaeiilaWcabaGaemiuaaLaeiikaGIaeeyBa0Maee4Ba8MaeeizaqMaeeyzauMaeeiBaWMaeeiiaaIaeG4mamJaeiOoaOJaemyqaeKaeyiKHWQaemyta00aaSbaaeaacqWGbbqqaeqaaiabgkziUkabdkeacjabcMcaPiabcYcaSaqaaiabdcfaqjabcIcaOiabb2gaTjabb+gaVjabbsgaKjabbwgaLjabbYgaSjabbccaGiabisda0iabcQda6iabd2eannaaBaaabaGaemyqaeeabeaacqGHsgIRcqWGbbqqcqGHqgcRcqWGcbGqcqGGPaqkcqGGSaalaeaacqWGqbaucqGGOaakcqqGTbqBcqqGVbWBcqqGKbazcqqGLbqzcqqGSbaBcqqGGaaicqaI1aqncqGG6aGocqWGnbqtdaWgaaqaaiabdgeabbqabaGaeyOKH4QaeeOqaiKaeyiKHWQaeeyqaeKaeeykaKcaaaGaayjkaiaawMcaaaaaaOGaayjkaiaawMcaaiabc6caUaaa@B9E9@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>Note that the multi-marker models used in the definition of LEO.NB.CPA correspond to the single marker models of Figure <figr fid="F2">2(b)</figr> with <it>M </it>replaced by <it>M</it><sub><it>A</it></sub>.</p>
               <p>If an additional genetic marker set <inline-formula><m:math name="1752-0509-2-34-i17" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>M</m:mi><m:mi>B</m:mi></m:msub><m:mo>=</m:mo><m:mo>{</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:mn>2</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>,</m:mo><m:mn>...</m:mn><m:msubsup><m:mi>M</m:mi><m:mi>B</m:mi><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>K</m:mi><m:mi>B</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:msubsup><m:mo>}</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyta00aaSbaaSqaaiabdkeacbqabaGccqGH9aqpcqGG7bWEcqWGnbqtdaqhaaWcbaGaemOqaieabaGaeiikaGIaeGymaeJaeiykaKcaaOGaeiilaWIaemyta00aa0baaSqaaiabdkeacbqaaiabcIcaOiabikdaYiabcMcaPaaakiabcYcaSiabc6caUiabc6caUiabc6caUiabd2eannaaDaaaleaacqWGcbGqaeaacqGGOaakcqWGlbWsdaWgaaadbaGaemOqaieabeaaliabcMcaPaaakiabc2ha9baa@473E@</m:annotation></m:semantics></m:math></inline-formula> associated with trait <it>B </it>is available (Figure <figr fid="F1">1c</figr>), we propose to use another edge orienting score. According to our consistency assumption, <it>M</it><sub><it>A </it></sub>contains markers that are more strongly correlated with <it>A </it>than with <it>B</it>. Similarly, <it>M</it><sub><it>B </it></sub>holds markers more strongly correlated with <it>B </it>than with <it>A</it>. If the orientation <it>A </it>&#8594; <it>B </it>is correct, then each of the <it>M</it><sub><it>A </it></sub>markers has a pleiotropic effect by impacting first <it>A </it>and subsequently <it>B</it>. Furthermore, we refer to the markers in <it>M</it><sub><it>B </it></sub>as candidate orthogonal causal anchors (OCAs) since the model <it>A </it>&#8594; <it>B </it>implies that these markers impact <it>B</it>, but are independent of both <it>A </it>and <it>M</it><sub><it>A</it></sub>. We define the likelihood-based orthogonal causal anchor (OCA) score by assessing whether the model <it>M</it><sub><it>A </it></sub>&#8594; <it>A </it>&#8594; <it>B </it>&#8592; <it>M</it><sub><it>B </it></sub>has a higher p-value than the alternative models depicted in Figure <figr fid="F2">2(b)</figr>. Specifically, we define</p>
               <p>
                  <display-formula id="M6">
                     <m:math name="1752-0509-2-34-i22" xmlns:m="http://www.w3.org/1998/Math/MathML">
                        <m:semantics>
                           <m:mrow>
                              <m:mi>L</m:mi>
                              <m:mi>E</m:mi>
                              <m:mi>O</m:mi>
                              <m:mo>.</m:mo>
                              <m:mi>N</m:mi>
                              <m:mi>B</m:mi>
                              <m:mo>.</m:mo>
                              <m:mi>O</m:mi>
                              <m:mi>C</m:mi>
                              <m:mi>A</m:mi>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>A</m:mi>
                              <m:mo>&#8594;</m:mo>
                              <m:mi>B</m:mi>
                              <m:mo>|</m:mo>
                              <m:msub>
                                 <m:mi>M</m:mi>
                                 <m:mi>A</m:mi>
                              </m:msub>
                              <m:mo>,</m:mo>
                              <m:msub>
                                 <m:mi>M</m:mi>
                                 <m:mi>B</m:mi>
                              </m:msub>
                              <m:mo stretchy="false">)</m:mo>
                              <m:mo>=</m:mo>
                              <m:msub>
                                 <m:mrow>
                                    <m:mi>log</m:mi>
                                    <m:mo>&#8289;</m:mo>
                                 </m:mrow>
                                 <m:mrow>
                                    <m:mn>10</m:mn>
                                 </m:mrow>
                              </m:msub>
                              <m:mrow>
                                 <m:mo>(</m:mo>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mi>P</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mtext>model&#160;</m:mtext>
                                          <m:mn>1</m:mn>
                                          <m:mo>:</m:mo>
                                          <m:msub>
                                             <m:mi>M</m:mi>
                                             <m:mi>A</m:mi>
                                          </m:msub>
                                          <m:mo>&#8594;</m:mo>
                                          <m:mi>A</m:mi>
                                          <m:mo>&#8594;</m:mo>
                                          <m:mi>B</m:mi>
                                          <m:mo>&#8592;</m:mo>
                                          <m:msub>
                                             <m:mi>M</m:mi>
                                             <m:mi>B</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mi>max</m:mi>
                                          <m:mo>&#8289;</m:mo>
                                          <m:mrow>
                                             <m:mo>(</m:mo>
                                             <m:mrow>
                                                <m:mtable columnalign="left">
                                                   <m:mtr columnalign="left">
                                                      <m:mtd columnalign="left">
                                                         <m:mrow>
                                                            <m:mi>P</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mtext>model&#160;</m:mtext>
                                                            <m:mn>2</m:mn>
                                                            <m:mo>:</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>A</m:mi>
                                                            </m:msub>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>A</m:mi>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:mi>B</m:mi>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>B</m:mi>
                                                            </m:msub>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mo>,</m:mo>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr columnalign="left">
                                                      <m:mtd columnalign="left">
                                                         <m:mrow>
                                                            <m:mi>P</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mtext>model&#160;</m:mtext>
                                                            <m:mn>3</m:mn>
                                                            <m:mo>:</m:mo>
                                                            <m:mi>B</m:mi>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>A</m:mi>
                                                            </m:msub>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>A</m:mi>
                                                            <m:mo>;</m:mo>
                                                            <m:mi>A</m:mi>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>B</m:mi>
                                                            </m:msub>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>B</m:mi>
                                                            <m:mo stretchy="false">)</m:mo>
                                                            <m:mo>,</m:mo>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                   <m:mtr columnalign="left">
                                                      <m:mtd columnalign="left">
                                                         <m:mrow>
                                                            <m:mi>P</m:mi>
                                                            <m:mo stretchy="false">(</m:mo>
                                                            <m:mtext>model&#160;</m:mtext>
                                                            <m:mn>4</m:mn>
                                                            <m:mo>:</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>A</m:mi>
                                                            </m:msub>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>A</m:mi>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:mi>C</m:mi>
                                                            <m:mo>&#8594;</m:mo>
                                                            <m:mi>B</m:mi>
                                                            <m:mo>&#8592;</m:mo>
                                                            <m:msub>
                                                               <m:mi>M</m:mi>
                                                               <m:mi>B</m:mi>
                                                            </m:msub>
                                                            <m:mo stretchy="false">)</m:mo>
                                                         </m:mrow>
                                                      </m:mtd>
                                                   </m:mtr>
                                                </m:mtable>
                                             </m:mrow>
                                             <m:mo>)</m:mo>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                                 <m:mo>)</m:mo>
                              </m:mrow>
                              <m:mo>.</m:mo>
                           </m:mrow>
                           <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemitaWKaemyrauKaem4ta8KaeiOla4IaemOta4KaemOqaiKaeiOla4Iaem4ta8Kaem4qamKaemyqaeKaeiikaGIaemyqaeKaeyOKH4QaemOqaiKaeiiFaWNaemyta00aaSbaaSqaaiabdgeabbqabaGccqGGSaalcqWGnbqtdaWgaaWcbaGaemOqaieabeaakiabcMcaPiabg2da9iGbcYgaSjabc+gaVjabcEgaNnaaBaaaleaacqaIXaqmcqaIWaamaeqaaOWaaeWaaKqbagaadaWcaaqaaiabdcfaqjabcIcaOiabb2gaTjabb+gaVjabbsgaKjabbwgaLjabbYgaSjabbccaGiabigdaXiabcQda6iabd2eannaaBaaabaGaemyqaeeabeaacqGHsgIRcqWGbbqqcqGHsgIRcqWGcbGqcqGHqgcRcqWGnbqtdaWgaaqaaiabdkeacbqabaGaeiykaKcabaGagiyBa0MaeiyyaeMaeiiEaG3aaeWaaeaafaqaaeWabaaabaGaemiuaaLaeiikaGIaeeyBa0Maee4Ba8MaeeizaqMaeeyzauMaeeiBaWMaeeiiaaIaeGOmaiJaeiOoaOJaemyta00aaSbaaeaacqWGbbqqaeqaaiabgkziUkabdgeabjabgcziSkabdkeacjabgcziSkabd2eannaaBaaabaGaemOqaieabeaacqGGPaqkcqGGSaalaeaacqWGqbaucqGGOaakcqqGTbqBcqqGVbWBcqqGKbazcqqGLbqzcqqGSbaBcqqGGaaicqaIZaWmcqGG6aGocqWGcbGqcqGHqgcRcqWGnbqtdaWgaaqaaiabdgeabbqabaGaeyOKH4QaemyqaeKaei4oaSJaemyqaeKaeyiKHWQaemyta00aaSbaaeaacqWGcbGqaeqaaiabgkziUkabdkeacjabcMcaPiabcYcaSaqaaiabdcfaqjabcIcaOiabb2gaTjabb+gaVjabbsgaKjabbwgaLjabbYgaSjabbccaGiabisda0iabcQda6iabd2eannaaBaaabaGaemyqaeeabeaacqGHsgIRcqWGbbqqcqGHqgcRcqWGdbWqcqGHsgIRcqWGcbGqcqGHqgcRcqWGnbqtdaWgaaqaaiabdkeacbqabaGaeiykaKcaaaGaayjkaiaawMcaaaaaaOGaayjkaiaawMcaaiabc6caUaaa@C09B@</m:annotation>
                        </m:semantics>
                     </m:math>
                  </display-formula>
               </p>
               <p>Note that model 4 in the denominator involves a hidden confounder <it>C</it>. The use of two independent genetic marker sets (<it>M</it><sub><it>A </it></sub>and <it>M</it><sub><it>B</it></sub>) alleviates the problem of model identifiability that may plague a CPA based edge orienting score.</p>
               <p>Model equivalence is also a key consideration in choosing which models to compare. From the standpoint of model equivalence, we note that the multiple anchor models presented in Figure <figr fid="F2">2(b)</figr> include a model with a hidden (latent) variable connecting <it>A </it>and <it>B</it>, and that no such model is included in the single anchor model comparisons. Such a model was found to be indistinguishable from the models with a collider node, such as single anchor models 4 and model 5. In the single marker case, both the collider node and the hidden variable models test for independence in the marginal relationship between the anchor and the more distal trait node. Future research may lead to an understanding of what type of data allow one to consider additional alternative models for the edge score computation. It should be straightforward to adapt the proposed LEO score to additional models as long as their model p-values can be calculated. Correlated markers, which are frequently encountered in practice such as in haplotype blocks, may compromise the performance of edge orienting scores. LEO scoring allows multiple parents of a node to be correctly accounted for within each model. Moreover, the parents (causal anchors) of a model are allowed to co-vary. By contrast, the orthogonal causal anchor set is, by definition, penalized for any covariation with the pleiotropic anchors.</p>
            </sec>
            <sec>
               <st>
                  <p>Thresholds for the edge orienting scores</p>
               </st>
               <p>For the single marker score <it>LEO.NB.SingleMarker</it>, we use a threshold of 1, which implies that the model p-value of the causal model is 10<sup>1 </sup>= 10 fold higher than that of the next best model. For the LEO.NB.CPA and the LEO.NB.OCA, we use lower thresholds of 0.8 and 0.3, respectively. Using simulation studies presented in the Additional File, we found that these thresholds lead to false positive rates that are often substantially below 0.05. Similar to other statistical procedures, NEO is susceptible to the pitfalls of multiple testing that may inflate the false positive rate. Permutation procedures and data dependent schemes (e.g. based on the false discovery rate) may inform the user on how to pick a threshold for a particular application. Further, we provide R software code for carrying out both single edge and multi-edge simulation studies. Simulation studies can be used to determine the power and false positive rates in different settings (sample size, causal signal, confounders, etc).</p>
               <p>In practice, one often observes strong dependence relationships between genetic markers. Our simulations show that correlations between genetic markers can reduce the power of edge orienting scores. Further, we mention that the NEO software implements an option for removing redundant markers that are highly correlated with each other. The removal of redundant markers may alleviate the loss of power.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Overview of network edge orienting with NEO</p>
            </st>
            <p>We now provide a detailed step-by-step description of a typical NEO analysis. An overview is also provided in Figure <figr fid="F3">3</figr>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Overview of the network edge orienting method</p>
               </caption>
               <text>
                  <p><b>Overview of the network edge orienting method</b>. The steps of the network overview analysis are described in the text.</p>
               </text>
               <graphic file="1752-0509-2-34-3"/>
            </fig>
            <sec>
               <st>
                  <p>Step 1: Integrate traits (gene expression traits and clinical traits) and SNPs</p>
               </st>
               <p>NEO takes trait and genetic marker data as input. Traits can include microarray gene expression data, clinical phenotypes, or other quantitative variables. Each SNP or trait is a node in the network, and the NEO software evaluates and scores the edge between traits <it>A </it>and <it>B </it>if the absolute correlation |<it>cor</it>(<it>A</it>, <it>B</it>)| lies above a user-specified threshold. For each edge <it>A </it>&#8211; <it>B</it>, NEO generates edge orienting scores for both possible orientations: <it>A </it>&#8594; <it>B </it>and <it>B </it>&#8594; <it>A</it>. If an erroneous edge exists between two traits, then it is meaningless to orient it. The NEO software can be used to orient any edge that the user chooses to consider. To allow the user to judge whether the existence of an edge is supported by the data, the NEO software outputs a Wald test statistic of the path coefficient, the corresponding p-value, and the correlation between the two traits. If the Wald test p-value is insignificant, orienting the edge may be meaningless.</p>
            </sec>
            <sec>
               <st>
                  <p>Step 2: Genetic marker selection and assignment to traits</p>
               </st>
               <p>Edge orienting scores will only be generated for edges whose traits have been anchored to at least one genetic marker. Two basic approaches for anchoring traits to markers are implemented in the NEO software: a manual selection by the user or an automatic selection by the software itself.</p>
               <sec>
                  <st>
                     <p>Manual SNP selection</p>
                  </st>
                  <p>NEO provides great flexibility to the user on how to anchor traits to markers. For example, the user can manually assign SNPs to the traits (see the example in Figure <figr fid="F4">4</figr>). This flexibility entails that the user carefully studies what constitutes a significant relationship between traits and markers and between the traits in the data set. The user may wish to anchor traits to SNPs that have been implicated by prior genetic analyses. For example, results from previous quantitative trait locus studies may implicate genetic markers associated with a trait. Multiple comparison issues are just starting to be addressed in the SEM literature <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>. Edge scores cannot be computed when an overly strict multiple testing control results in no causal anchors. On the other hand, an overly lax multiple testing control may result in spurious causal anchors which may lead to erroneous edge scores. We recommend that conservative measures of genome-wide QTL significance <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> and false discovery rate be applied when selecting the initial causal anchor(s). Once a causal anchor has been established as obtaining genome-wide significance, NEO can be used to evaluate the fit of different causal models.</p>
                  <fig id="F4">
                     <title>
                        <p>Figure 4</p>
                     </title>
                     <caption>
                        <p>Manual SNP selection to study Insig1 &#8594; Dhrc7 and Insig1 &#8594; Fdft1 in mouse liver</p>
                     </caption>
                     <text>
                        <p><b>Manual SNP selection to study Insig1 &#8594; Dhrc7 and Insig1 &#8594; Fdft1 in mouse liver</b>. Using female liver gene expression data and SNP markers from the BxH mouse intercross, NEO retrieves known causal relationships in the cholesterol biosynthesis pathway: <it>Insig1 &#8594; Dhrc7 </it>and <it>Insig1 &#8594; Fdft1</it>. The single marker LOD score curves in (a) motivate our choice of manually selected SNPs (one SNP on chromosome 16 and another on chromosome 8). These SNP markers can also be used to screen for genes that are reactive to <it>Insig1</it>, see Table 2. Figures (b) and (c) show the causal models used to compute the model p-values in favor of edge orientations <it>Insig</it>1 &#8594; <it>Dhcr</it>7 and <it>Insig</it>1 &#8594; <it>Fdft</it>1, respectively. More details on the individual edges are presented in Table 1.</p>
                     </text>
                     <graphic file="1752-0509-2-34-4"/>
                  </fig>
               </sec>
               <sec>
                  <st>
                     <p>Automatic SNP selection</p>
                  </st>
                  <p>NEO can also be used to automatically relate (anchor) traits to SNPs. The automated SNP selection methods consider each trait <it>A </it>in isolation from the other traits when defining a <it>preliminary genetic marker set </it>(denoted by <it>M'</it><sub><it>A</it></sub>). Toward this end, the user can choose 1) a greedy approach based on univariate linear regression models, 2) a forward-stepwise approach based on multivariate linear regression models, or 3) both. The greedy SNP selection approach defines <it>M'</it><sub><it>A </it></sub>as the set of markers with the <it>K </it>highest absolute correlations with <it>A</it>. The greedy approach is equivalent to using univariate linear regression models to relate <it>A </it>to each marker separately and subsequently picking the <it>K </it>most significant markers.</p>
                  <p>For creating multivariate linear QTL models, NEO also implements forward-stepwise marker selection. The forward-stepwise marker selection method may avoid a pitfall that plagues the greedy SNP selection: if several genetic markers are located very close to each other (and are highly correlated), the greedy SNP selection may pick all of them before considering SNPs at other loci associated with the same trait. For this reason, we recommend combining greedy and forward-stepwise SNP selection methods.</p>
                  <p>Once the preliminary sets of markers <it>M</it>'<sub><it>A </it></sub>and <it>M'</it><sub><it>B </it></sub>are obtained, NEO evaluates the consistency of each set. We utilize a <it>marker assignment consistency heuristic: </it>a genetic marker can only serve as causal anchor for one trait. To fulfill this heuristic, a SNP is moved from <it>M'</it><sub><it>A </it></sub>to <it>M'</it><sub><it>B </it></sub>if its correlation with <it>B </it>is stronger than that with trait <it>A</it>. We denote the resulting <it>consistent genetic marker sets </it>by <it>M"</it><sub><it>A </it></sub>and <it>M"</it><sub><it>B</it></sub>. The resulting consistent genetic marker sets may be comprised of dozens of SNPs. Therefore, it can be useful to further filter the SNPs according to their joint predictive power for the trait. Toward this end, we use the Akaike Information Criterion (AIC) in conjunction with multivariate regression models to select genetic markers from within the consistent genetic marker sets <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Specifically, to define the <it>final genetic marker set M</it><sub><it>A </it></sub>for trait <it>A</it>, we use the AIC criterion to find a parsimonious multivariate regression model of <it>A </it>using predictors from within <it>M</it>"<sub><it>A</it></sub>. The final sets of markers <it>M</it><sub><it>A </it></sub>and <it>M</it><sub><it>B </it></sub>are thus comprised of consistent genetic markers that according to the AIC criterion best predict their respective traits; we use these final sets as causal anchors in computing the edge orienting scores.</p>
                  <p>The forward-stepwise approach based on multivariate linear regression models is akin to a legal courtroom where two cases are built, weighed, and judged. Broadly, the strongest genetic support (multivariate eQTL models) for the genetic influence on <it>A </it>and <it>B </it>are built independently, using AIC-based halting criteria. After consistency checks, these multivariate eQTL models are weighed by embedding them in causal models (one principal causal model in favor of edge orientation <it>A </it>&#8594; <it>B </it>and alternative causal models) and models are then compared using SEM fitting indices. When candidate CPA markers can be found for <it>A </it>and OCAs for <it>B</it>, the NEO method provides stringent consistency checks and balances against over-fitting. We consider automated SNP selection particularly useful when no prior evidence suggests causal anchors for the traits.</p>
               </sec>
            </sec>
            <sec>
               <st>
                  <p>Step 3: Compute local edge orienting scores for aggregating the genetic evidence in favor of a causal orientation</p>
               </st>
               <p>Both LEO.NB.CPA and LEO.NB.OCA scores are computed for each edge orientation (<it>A </it>&#8594; <it>B </it>and <it>B </it>&#8594; <it>A</it>). We recommend using the LEO.NB.OCA score (Eq. 6) as the primary edge orienting score if markers affect both <it>A </it>and <it>B</it>. However, if the results of the LEO.NB.CPA score strongly disagree with those of the LEO.NB.OCA score, the latter should not be trusted. As described in the next step, all fitting indices should be considered before calling an edge causal.</p>
            </sec>
            <sec>
               <st>
                  <p>Step 4: For each edge, evaluate the fit of the underlying local SEM models</p>
               </st>
               <p>Edges with high edge orienting scores may not necessarily correspond to causal relationships. Although edge orienting scores flag interesting edges, they are no substitute for carefully evaluating the fit of the underlying local SEMs. Since a LEO.NB score is defined as a ratio of two model p-values, it is advisable to check whether both p-values are small, as this would indicate poor fit of either model. If the model p-value of the confounded model <it>A </it>&#8592; <it>C </it>&#8594; <it>B </it>is high, the correlation between <it>A </it>and <it>B </it>may be largely due to a hidden confounder <it>C</it>. NEO (using the underlying <it>sem </it>R package) also report a Wald test statistic for the path coefficient from <it>A </it>&#8594; <it>B</it>. If the Wald test for an edge is significant, the data support its existence. Apart from the model p-value, many other SEM model fitting indices have been defined by contrasting the observed covariance matrix <it>S</it><sub><it>m </it>&#215; <it>m </it></sub>with the fitted covariance matrix &#931;(<inline-formula><m:math name="1752-0509-2-34-i23" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#952;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafqiUdeNbaKaaaaa@2D9B@</m:annotation></m:semantics></m:math></inline-formula>) as detailed in the Methods section. The NEO software reports the standard SEM fitting indices <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B45">45</abbr></abbrgrp> that are implemented in the R package <it>sem </it><abbrgrp><abbr bid="B46">46</abbr></abbrgrp> including the Root Mean Square Error of Approximation (RMSEA), Comparative Fit Index (CFI), Standardized Root Mean Square Residual (SRMSR), BIC. Since a single fitting index reflects only a particular aspect of model fit, a favorable value of that index does not by itself demonstrate good model fit; it is important to assess the model fit based on multiple indices. We follow the following standard guidelines for interpreting these indices <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Before calling an edge <it>A </it>&#8594; <it>B </it>causal, we recommend verifying that the corresponding causal model has a high model p-value (say > 0.05), a low RMSEA score (say &#8804; 0.05), a low SRMSR (say &#8804; 0.10), a high CFI (say &#8805; 0.90), and a significant Wald test p-value (say <it>p </it>&#8804; 0.05).</p>
            </sec>
            <sec>
               <st>
                  <p>Step 5: Robustness analysis with respect to SNP selection parameters</p>
               </st>
               <p>Since the edge orienting scores for an edge <it>A </it>&#8211; <it>B </it>critically depend on the input genetic marker sets <it>M</it><sub><it>A </it></sub>and <it>M</it><sub><it>B</it></sub>, we also recommend carrying out a robustness analysis with respect to different marker sets. In particular, the automated SNP selection results should be carefully evaluated with regard to the threshold parameters that were used to define the marker sets. For example, when using a greedy SNP selection strategy, it is advisable to study how the LEO.NB score is affected by altering the number of most highly correlated SNPs. For a given edge and a given edge orienting score (e.g. LEO.NB.OCA), NEO implements a robustness analysis with respect to automatic marker selection (see Figures <figr fid="F5">5</figr>, <figr fid="F6">6</figr>, and <figr fid="F7">7</figr>). A robustness plot shows how the LEO.NB.OCA score (y-axis) depends on sets of automatically selected SNP markers (x-axis). When using the default SNP selection method (combined greedy and forward stepwise method), robustness step <it>K </it>corresponds to choosing the top <it>K </it>SNPs by greedy and forward selection for each trait. Since the greedy and forward SNP selection may select the same SNPs, step <it>K </it>typically involves fewer than 2<it>K </it>SNPs per trait. The edge orienting results should be relatively robust with respect to different choices of <it>K</it>.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>Automatic SNP selection to score Insig1 &#8594; Dhrc7 and Insig1 &#8594; Fdft1 in female and male mouse livers</p>
                  </caption>
                  <text>
                     <p><b>Automatic SNP selection to score Insig1 &#8594; Dhrc7 and Insig1 &#8594; Fdft1 in female and male mouse livers</b>. These robustness plots show how the LEO.NB scores (y-axis) depend on sets of automatically selected SNP markers (x-axis). Here we use the default SNP selection method: combined greedy and forward stepwise method. Step <it>K </it>corresponds to choosing the top <it>K </it>greedy and top <it>K </it>forward selected SNPs for each trait. Since the greedy and the forward SNP selection may select the same SNPs, step <it>K </it>typically involves fewer than 2<it>K </it>SNPs per trait. Figures (a, b, top row) and (c, d) correspond to female and male BxH mice, respectively. Figures (a) and (c) report the results for edge <it>Insig</it>1 &#8594; <it>Dhrc</it>7 in female and male mouse livers, respectively. Figures (b) and (d) report the analogous results for <it>Insig</it>1 &#8594; <it>Fdft</it>1. NEO robustly retrieves the known causal relationship between these genes.</p>
                  </text>
                  <graphic file="1752-0509-2-34-5"/>
               </fig>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p><it>Fsp27 </it>is a causal driver of a biologically important co-expression module</p>
                  </caption>
                  <text>
                     <p><b><it>Fsp27 </it>is a causal driver of a biologically important co-expression module</b>. Prior work using mouse liver expression data found the 'blue' co-expression module to be biologically important [7]. Here we used automatic SNP selection to determine whether <it>Fsp27 </it>is causal of the blue module gene expression profiles. The expression profiles of the blue module were summarized by their first principal component (referred to as module eigengene). The blue module eigengene <it>MEblue </it>can be considered as the most representative gene expression profile of the blue module. The figure shows the results of a robustness analysis regarding <it>LEO.NB</it>(<it>Fsp</it>27 &#8594; <it>MEblue</it>) (y-axis) with respect to different choices of genetic markers sets (x-axis). Both LEO.NB.CPA and LEO.NB.OCA scores show that the relationship is causal, i.e. the <it>Fsp</it>27 is upstream of the blue module expressions.</p>
                  </text>
                  <graphic file="1752-0509-2-34-6"/>
               </fig>
               <fig id="F7">
                  <title>
                     <p>Figure 7</p>
                  </title>
                  <caption>
                     <p>Multi-edge simulation study involving 5 gene expression traits (<it>E</it>1-<it>E</it>5) and one clinical trait <it>Trait</it></p>
                  </caption>
                  <text>
                     <p><b>Multi-edge simulation study involving 5 gene expression traits (<it>E</it>1-<it>E</it>5) and one clinical trait <it>Trait</it></b>. The heatmap plot in (a) depicts the true causal model. Note that a red square in the i-th row and j-th column indicates that trait <it>i </it>causally affects trait <it>j</it>, e.g. <it>E</it>1 &#8594; <it>E</it>2. The rows and columns of the heatmap are ordered according to a hierarchical clustering tree, which was constructed using average linkage hierarchical clustering based on the pairwise correlations of the traits. Figure (b) depicts the corresponding heatmap of the observed network that was reconstructed using the LEO.NB.OCA score. Figure (c) shows an alternative output graph of NEO. Blue edges indicate significant correlations and a LEO.NB.OCA score is added to each edges whose LEO.NB.OCA score passes a user-supplied threshold. We find that all true causal edges are correctly retrieved at the recommended LEO.NB.OCA threshold of 0.3. Figure (d) shows the results of a robustness analysis for the LEO.NB.OCA and LEO.NB.CPA scores for the edge orientation <it>E</it>4 &#8594; <it>Trait</it>. The LEO.NB.OCA scores exceed the recommended threshold of 0.3 (red horizontal line), i.e. they retrieve the orientation correctly. Similarly, the LEO.NB.CPA scores exceed the threshold of 0.8.</p>
                  </text>
                  <graphic file="1752-0509-2-34-7"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Step 6: Repeat the analysis for the next A-B trait-trait edge and apply edge score thresholds to orient the network</p>
               </st>
               <p>NEO orients each edge separately in an undirected input trait network. The results are order-independent. For each edge, NEO repeats steps 1&#8211;3 until all edges have been assigned edge orienting scores. Once each edge has been scored, the user can generate a global, directed network by choosing an edge score (e.g. LEO.NB.OCA) and a corresponding threshold (Figure <figr fid="F7">7</figr>).</p>
            </sec>
            <sec>
               <st>
                  <p>NEO output and R software</p>
               </st>
               <p>The primary output of NEO is an Excel spreadsheet which reports likelihood-based edge scores (LEO.NB.CPA, LEO.NB.OCA) and other edge scores that are described in the NEO manual. For each edge, the NEO spreadsheet also contains hyperlinks that allows the user to access the log file for each edge. The log file contains a host of information regarding computation of the edge orienting scores including SEM model p-values, Wald test statistics for each path coefficient, and the SNP identifiers for the causal anchor sets <it>M</it><sub><it>A </it></sub>and <it>M</it><sub><it>B</it></sub>.</p>
               <p>Although the main output of NEO are scores for every edge orientation, one can construct a global directed network by thresholding an edge orienting score. NEO uses the R software package <it>sem </it><abbrgrp><abbr bid="B46">46</abbr></abbrgrp> to compute model p-values and other fitting indices. The NEO software is documented in a series of separate tutorials that illustrate real data applications and simulation studies. These tutorials and the real data can be downloaded from our webpage.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Applications</p>
            </st>
            <sec>
               <st>
                  <p>Research goals that can be addressed with NEO</p>
               </st>
               <p>NEO can be used to address the following four research goals. (1) On the simplest level, NEO can be used to assign edge orienting scores to a single edge using manually chosen genetic markers (see the example in Table <tblr tid="T1">1</tblr>). (2) When dealing with a single edge and multiple genetic markers, the NEO software can <it>automatically </it>select markers for edge orienting. Since the automatic marker selection entails certain parameter choices, we recommend carrying out a robustness analysis with respect to adding or removing genetic markers. (3) When dealing with a single trait <it>A </it>and manually selected genetic markers, the software can be used to screen for other traits that are causal or reactive to trait <it>A</it>. For example, in Table <tblr tid="T2">2</tblr> we screen for genes that are reactive to gene expression trait <it>Insig1</it>. (4) When dealing with multiple edges, NEO can be used to arrive at a global directed network. This can be done by thresholding a chosen edge score. If the resulting global network is acyclic (i.e., it does not contain loops) then d-separation <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and standard SEM model fitting indices can be used to evaluate the fit of the global causal model to the data.</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>NEO analysis using manually specified genetic markers for computing edge scores.</p>
                  </caption>
                  <tblbdy cols="11">
                     <r>
                        <c ca="center">
                           <p>Edge no.</p>
                        </c>
                        <c ca="center">
                           <p>Edge</p>
                        </c>
                        <c ca="center">
                           <p>LEO. NB.OCA</p>
                        </c>
                        <c ca="center">
                           <p>Cor <it>&#961;</it></p>
                        </c>
                        <c ca="center">
                           <p>Path coef</p>
                        </c>
                        <c ca="center">
                           <p>Path SE</p>
                        </c>
                        <c ca="center">
                           <p>Path Z</p>
                        </c>
                        <c ca="center">
                           <p>Model prob</p>
                        </c>
                        <c ca="center">
                           <p>Model df</p>
                        </c>
                        <c ca="center">
                           <p><it>&#967;</it><sup>2 </sup>stat</p>
                        </c>
                        <c ca="center">
                           <p>RMSEA</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="11">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>rs3705921 &#8594; <it>Insig1</it></p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>0.22</p>
                        </c>
                        <c ca="center">
                           <p>0.18</p>
                        </c>
                        <c ca="center">
                           <p>0.081</p>
                        </c>
                        <c ca="center">
                           <p>2.2</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>rs3670293 &#8594; <it>Insig1</it></p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>-0.33</p>
                        </c>
                        <c ca="center">
                           <p>-0.31</p>
                        </c>
                        <c ca="center">
                           <p>0.081</p>
                        </c>
                        <c ca="center">
                           <p>-3.8</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>rs3675054 &#8594; <it>Dhcr7</it></p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>-0.26</p>
                        </c>
                        <c ca="center">
                           <p>-0.15</p>
                        </c>
                        <c ca="center">
                           <p>0.049</p>
                        </c>
                        <c ca="center">
                           <p>-3.1</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>4</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>Insig1 &#8594; Dhcr7</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.2</p>
                        </c>
                        <c ca="center">
                           <p>0.81</p>
                        </c>
                        <c ca="center">
                           <p>0.79</p>
                        </c>
                        <c ca="center">
                           <p>0.049</p>
                        </c>
                        <c ca="center">
                           <p>16.1</p>
                        </c>
                        <c ca="center">
                           <p>0.24</p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>6.8</p>
                        </c>
                        <c ca="center">
                           <p>0.051</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>Insig1 &#8594; Fdft1</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>1.4</p>
                        </c>
                        <c ca="center">
                           <p>0.67</p>
                        </c>
                        <c ca="center">
                           <p>0.64</p>
                        </c>
                        <c ca="center">
                           <p>0.06</p>
                        </c>
                        <c ca="center">
                           <p>10.7</p>
                        </c>
                        <c ca="center">
                           <p>0.75</p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>2.7</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>6</p>
                        </c>
                        <c ca="center">
                           <p>rs3664397 &#8594; <it>Fdft1</it></p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>0.34</p>
                        </c>
                        <c ca="center">
                           <p>0.27</p>
                        </c>
                        <c ca="center">
                           <p>0.06</p>
                        </c>
                        <c ca="center">
                           <p>4.5</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                      