<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-10-S1-S66</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Partial correlation analysis indicates causal relationships between GC-content, exon density and recombination rate in the human genome</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Freudenberg</snm>
               <fnm>Jan</fnm>
               <insr iid="I1"/>
               <email>jan.freudenberg@gmail.com</email>
            </au>
            <au id="A2">
               <snm>Wang</snm>
               <fnm>Mingyi</fnm>
               <insr iid="I2"/>
               <email>mwang@noble.org</email>
            </au>
            <au id="A3">
               <snm>Yang</snm>
               <fnm>Yaning</fnm>
               <insr iid="I3"/>
               <email>ynyang@ustc.edu.cn</email>
            </au>
            <au ca="yes" id="A4">
               <snm>Li</snm>
               <fnm>Wentian</fnm>
               <insr iid="I1"/>
               <email>wli@nslij-genetics.org</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>The Robert S Boas Center for Genomics and Human Genetics, Feinstein Institute for Medical Research, North Shore LIJ Health System, Manhasset, NY 11030, USA</p>
            </ins>
            <ins id="I2">
               <p>Plant Biology Division, The Samuel Roberts Noble Foundation, Ardmore, OK 73401, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Statistics and Finance, University of Science and Technology of China, Anhui 230026, Hefei, PR China</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <supplement>
            <title>
               <p>Selected papers from the Seventh Asia-Pacific Bioinformatics Conference (APBC 2009)</p>
            </title>
            <editor>Michael Q Zhang, Michael S Waterman and Xuegong Zhang</editor>
            <note>Research</note>
         </supplement>
         <conference>
            <title>
               <p>The Seventh Asia Pacific Bioinformatics Conference (APBC 2009)</p>
            </title>
            <location>Beijing, China</location>
            <date-range>13&#8211;16 January 2009</date-range>
            <url>http://bioinfo.au.tsinghua.edu.cn/apbc2009/</url>
         </conference>
         <issn>1471-2105</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>Suppl 1</issue>
         <fpage>S66</fpage>
         <url>http://www.biomedcentral.com/1471-2105/10/S1/S66</url>
         <xrefbib>
            
         <pubidlist><pubid idtype="pmpid">19208170</pubid><pubid idtype="doi">10.1186/1471-2105-10-S1-S66</pubid></pubidlist></xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>30</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Freudenberg et al; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Several features are known to correlate with the GC-content in the human genome, including recombination rate, gene density and distance to telomere. However, by testing for pairwise correlation only, it is impossible to distinguish direct associations from indirect ones and to distinguish between causes and effects.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We use partial correlations to construct partially directed graphs for the following four variables: GC-content, recombination rate, exon density and distance-to-telomere. Recombination rate and exon density are unconditionally uncorrelated, but become inversely correlated by conditioning on GC-content. This pattern indicates a model where recombination rate and exon density are two independent causes of GC-content variation.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Causal inference and graphical models are useful methods to understand genome evolution and the mechanisms of isochore evolution in the human genome.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>GC-content (% of guanine (G) or cytosine (C) bases) is known to vary along human chromosomes. To describe large genomic regions of homogeneous GC%, the term "isochore" was coined in the 1980s <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Since then, the question of why genomes contain GC-high and GC-low isochore regions has been intensively debated. The initially proposed hypothesis was that GC-rich isochores constitute an adaptation to homeothermy in warm-blooded species <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and that they provide favorable bendability and B-Z helix transitions that lead to more open chromatin and ease transcription <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. This explanation fits well with the correlation between GC-content and gene density <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. The second hypothesis to explain variation in GC-content is a mutation bias related to processes like DNA replication and repair <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The third explanation arose from the later discovery that local GC-content and recombination rate (number of crossing-over events per meiosis per unit sequence length) are strongly correlated <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. The molecular basis for this explanation is recombination-associated biased gene conversion (BGC), which may act to increase GC-content <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. The availability of full genome sequences now allows us to draw a more complex picture of GC-content variation than one that only separates the genome into a set of discrete isochore categories. 
Soon after completion of the first human genome draft sequence, it was observed that a seemingly homogeneous region at one length scale may not be homogeneous at shorter length scales and that it is possible to have "domains within a domain" <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. More recently, a fine-grained picture also arose for the variation of recombination rate along human chromosomes <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. This facilitated the study of the relationship between GC-content and recombination rate on a much finer scale, showing that recombination hotspots are associated with local increases in GC-content <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> but do not significantly influence the local substitution rate. In parallel, the BGC hypothesis has been supported by several additional lines of evidence <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. In a most recent study, recombination rate was found to be the major determinant of limiting GC-content &#8211; the stationary GC-content towards which the human genome is currently evolving <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, strongly supporting recombination-associated BGC as a major determinant of GC-content.</p>
         <p>Nevertheless, it is not entirely clear how the two correlations of GC-content with both recombination rate and gene density relate to each other. In the simplest case, a third correlation between gene density and recombination rate would exist. In this case one could test whether increased GC-content in gene dense regions were a consequence of increased recombination. In the absence of a correlation between recombination rate and gene density, their shared relationship with GC-content remains to be explained. In particular, the correlation between GC-content and gene density is less understood. Thus, the true model of the evolution of genome-wide and regional GC-content may have a neutral (non-Darwinian) and additionally a (positive and negative) selection component <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp> or it may be void of this selection component. Because the correlation with gene density has been a major argument of evolutionary models that explain local GC-content as result of selection, a better understanding of the correlations between these variables is an important task.</p>
         <p>To understand the relationship between recombination rate, gene density and GC-content, it is further important to note that even if BGC were the only reason for GC-content variation, this would not necessarily imply a purely neutral model of isochore evolution, because local recombination rate may itself evolve under the influence of natural selection. For instance, it has been observed that recombination is increased at human central nervous system genes and immune-system genes <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. These gene categories had been observed before to be subject to accelerated or faster sequence evolution, respectively <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Because more recombination at a genetic locus may increase the effective strength of selection, this led to the suggestion that gene selection intensity might be one determinant of local recombination rate variation <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>.</p>
         <p>In the present study, we aim to assign the labels "direct" and "indirect", as well as "cause" and "effect", whenever possible, to variables that are informative about local GC-content. We note that many previous analyses are based on statistical correlation, whereas the causal relationships between the variables remain undecided. For instance, for researchers who are interested in understanding the causes of recombination rate variation or gene sequence evolution, GC-content itself or hidden variables associated with GC-content may be seen as possible confounding factors. On the other hand, for researchers who are interested in GC-content variation, recombination and the associated gene conversion, and possibly mutation events, are <it>a priori </it>treated as causal variables.</p>
         <p>When dealing with several correlated variables, a widely used statistical method is multiple regression. However, multiple regression is not always a good method to test for causal relationships, because the equality sign in a regression equation does not have a direction. Thus, one can move an independent variable from the right-hand side of the equation to the left-hand side, where it becomes a dependent variable <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Moreover, two unconditionally independent variables can become correlated when conditioning on a common causal child, and such conditioning is exactly what is carried out in a multiple regression <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Therefore, we propose to use techniques for inferring causal relationships by conditional correlation analysis to understand the relationship between GC-content, recombination rate, and gene density in the human genome.</p>
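         <p>The effect of conditioning on a common causal child can be illustrated with a small simulation. The sketch below is not part of the analysis in this paper: the variables x, y and z, the noise level, the sample size and the seed are all hypothetical choices made for illustration. Two independent causes x and y jointly determine z; the unconditional correlation of x and y is close to zero, whereas their partial correlation given z is clearly negative.</p>

```python
import math
import random


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    var_a = sum((u - mean_a) ** 2 for u in a)
    var_b = sum((v - mean_b) ** 2 for v in b)
    return cov / math.sqrt(var_a * var_b)


def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y given z."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def simulate_collider(n=20000, noise_sd=0.5, seed=1):
    """Simulate x -> z and y -> z with independent x and y (hypothetical model)."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [rng.gauss(0, 1) for _ in range(n)]
    z = [xi + yi + rng.gauss(0, noise_sd) for xi, yi in zip(x, y)]
    r_xy = pearson(x, y)
    return r_xy, partial_corr(r_xy, pearson(x, z), pearson(y, z))


r_xy, r_xy_given_z = simulate_collider()
print(f"unconditional r(x, y) = {r_xy:.3f}")
print(f"partial r(x, y | z)  = {r_xy_given_z:.3f}")
```

         <p>Under these assumed parameters the expected partial correlation is about -0.8, so a regression that includes z as a covariate would suggest an inverse relationship between x and y even though neither influences the other.</p>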
         <p>To this end, we start by representing a group of pairwise correlated variables by an undirected graph structure: nodes/vertices represent variables and links/edges represent observed statistical correlations. In the next step, we remove all links that are inferred to be indirect associations, based on the absence of conditional correlation. Finally, we apply causal inference rules to assign causal arrows, if possible. In cases where the complete causal model cannot be inferred from the data, the result is a partially directed graph that optimally characterizes the relationship among the tested variables. Similar inference techniques have been previously applied to other genomics problems <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and to studying relationships between human-disease related intermediate phenotypes <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
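         <p>The three steps above can be sketched for the simplest case of exactly three variables. This is a minimal illustration under stated assumptions, not the procedure used to produce the results in this paper: the function name, the toy correlation values and the fixed cutoff are hypothetical, and the cutoff merely stands in for a proper significance test of each correlation.</p>

```python
import math
from itertools import combinations


def partial_corr(r_ab, r_ac, r_bc):
    """First-order partial correlation of a and b given c."""
    return (r_ab - r_ac * r_bc) / math.sqrt((1 - r_ac ** 2) * (1 - r_bc ** 2))


def infer_three_node_graph(names, corr, cutoff=0.05):
    """Return (undirected_edges, arrows) for exactly three variables.

    corr maps frozenset({a, b}) to the Pearson correlation of a and b.
    cutoff is a hypothetical stand-in for a significance test.
    """
    edges, colliders = set(), []
    for a, b in combinations(names, 2):
        (c,) = [v for v in names if v not in (a, b)]
        r_ab = corr[frozenset((a, b))]
        p_ab = partial_corr(r_ab, corr[frozenset((a, c))], corr[frozenset((b, c))])
        marginal = abs(r_ab) > cutoff
        conditional = abs(p_ab) > cutoff
        if marginal and conditional:
            edges.add(frozenset((a, b)))   # direct association: keep the edge
        elif conditional:
            colliders.append((a, c, b))    # dependence appears only given c
        # otherwise the association is absent or indirect: no edge a-b
    arrows = set()
    for a, c, b in colliders:
        # Orient a -> c and b -> c if both edges survived the pruning step.
        if frozenset((a, c)) in edges and frozenset((b, c)) in edges:
            edges -= {frozenset((a, c)), frozenset((b, c))}
            arrows |= {(a, c), (b, c)}
    return edges, arrows


# Toy input: x and y are uncorrelated, but both correlate with z.
toy = {
    frozenset(("x", "y")): 0.0,
    frozenset(("x", "z")): 0.6,
    frozenset(("y", "z")): 0.6,
}
edges, arrows = infer_three_node_graph(["x", "y", "z"], toy)
print("undirected edges:", edges)
print("arrows:", arrows)
```

         <p>For the toy input, the x-y link is never drawn, and because x and y become dependent only after conditioning on z, both remaining edges are oriented towards z. If all three pairs stay correlated after conditioning, the function returns the full undirected triangle and no arrows.</p>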
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Three variables: GC%, recombination rate, and distance to telomere</p>
            </st>
            <p>In a recent study, it was shown by Arndt and Duret <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> that besides the positive correlation with recombination rate (RR), GC-content (GC%) is negatively correlated with the distance to telomere (DT). These results were mainly based on the analysis of noncoding sequence in 1 Mb sized windows that have high quality finished sequence available both in the chimpanzee and the macaque genome <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. We start our analysis by using both their data and our own dataset of the same 1 Mb windows for the human genome sequence, regardless of coding and noncoding status or the existence of quality sequence in other organisms. The GC% in these two datasets is not totally identical, but highly correlated (<it>&#961; </it>= 0.98). Similarly, the HapMap estimate of RR <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> in the two datasets is correlated with <it>&#961; </it>= 0.82. We discarded windows if the number of HapMap single-nucleotide polymorphisms (SNPs) was less than 20 or if more than 30% of the genomic sequence was missing. In total, 2647 and 2668 1 Mb windows are available with information on GC%, RR and DT for the two datasets. We performed a log-transformation of distance to telomere (DT), because the scatter plot showed a non-linear correlation between DT and the other two variables, and then multiplied it by -1 to change the negative correlation with GC% to positive. The unconditional and conditional Pearson's correlation coefficients between GC%, RR and DT are shown in Table <tblr tid="T1">1</tblr>. All correlation coefficients are highly significant (<it>p</it>-value = 0) and results from both datasets are highly similar. 
Because an earlier study had observed that the correlation between RR and GC% is maximal when both variables are measured in 50 kb windows <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, we also looked at a dataset where GC%, RR and DT are measured using a window size of 50 kb. Due to the smaller window size (1/20 of the 1 Mb window), RR fluctuates over a much wider range, as can be seen from the quantile values in Table <tblr tid="T2">2</tblr>. We also note that a square-root transformation of RR in the 50 kb windows leads to a slightly better linear correlation with GC%, and a larger correlation coefficient (result not shown).</p>
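            <p>For three variables, the conditional (partial) correlation can be obtained directly from the three pairwise coefficients. The sketch below assumes the standard first-order partial correlation formula and applies it to the rounded pairwise values of Table 1(A); the function and variable names are illustrative and not taken from the original analysis.</p>

```python
import math


def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y given z."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Pairwise Pearson correlations from Table 1(A), 1 Mb windows (rounded values).
r_gc_rr, r_gc_dt, r_rr_dt = 0.38, 0.47, 0.49

partials = {
    "GC%-RR | DT": partial_corr(r_gc_rr, r_gc_dt, r_rr_dt),
    "GC%-DT | RR": partial_corr(r_gc_dt, r_gc_rr, r_rr_dt),
    "RR-DT | GC%": partial_corr(r_rr_dt, r_gc_rr, r_gc_dt),
}
for label, value in partials.items():
    print(f"{label}: {value:.2f}")
```

            <p>The computed values are roughly 0.19, 0.35 and 0.38, which agree with the partial correlations of 0.20, 0.35 and 0.38 listed in Table 1(A) to within the rounding of the input coefficients.</p>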
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Correlation and partial correlation at 1 Mb windows. Correlation and partial correlation between GC%, recombination rate (RR), and distance to telomere (DT) (negative log-transformed) for 1 Mb windows. Conditioning is performed on the respective third variable. (A) regardless of coding status; (B) non-coding only.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left" cspan="3">
                        <p>(A)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>&#961;</it>/partial-<it>&#961;</it></p>
                     </c>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c ca="center">
                        <p>-log(DT)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GC%</p>
                     </c>
                     <c ca="center">
                        <p>0.38/0.20</p>
                     </c>
                     <c ca="center">
                        <p>0.47/0.35</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.49/0.38</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left" cspan="3">
                        <p>(B)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>&#961;</it>/partial-<it>&#961;</it></p>
                     </c>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c ca="center">
                        <p>-log(DT)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GC%</p>
                     </c>
                     <c ca="center">
                        <p>0.39/0.20</p>
                     </c>
                     <c ca="center">
                        <p>0.46/0.33</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.52/0.42</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Quantile values of recombination rates. Quantile values for RR for the three datasets: 1 Mb non-coding, 1 Mb and 50 kb (in cM/Mb).</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>dataset</p>
                     </c>
                     <c ca="center">
                        <p>0%</p>
                     </c>
                     <c ca="center">
                        <p>25%</p>
                     </c>
                     <c ca="center">
                        <p>50%</p>
                     </c>
                     <c ca="center">
                        <p>75%</p>
                     </c>
                     <c ca="center">
                        <p>100%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1 Mb, nc</p>
                     </c>
                     <c ca="center">
                        <p>0.012</p>
                     </c>
                     <c ca="center">
                        <p>0.80</p>
                     </c>
                     <c ca="center">
                        <p>1.19</p>
                     </c>
                     <c ca="center">
                        <p>1.82</p>
                     </c>
                     <c ca="center">
                        <p>4.97</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1 Mb</p>
                     </c>
                     <c ca="center">
                        <p>0.033</p>
                     </c>
                     <c ca="center">
                        <p>0.90</p>
                     </c>
                     <c ca="center">
                        <p>1.40</p>
                     </c>
                     <c ca="center">
                        <p>2.10</p>
                     </c>
                     <c ca="center">
                        <p>7.47</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>50 kb</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.26</p>
                     </c>
                     <c ca="center">
                        <p>0.72</p>
                     </c>
                     <c ca="center">
                        <p>1.89</p>
                     </c>
                     <c ca="center">
                        <p>27.55</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The correlations and partial correlations between the three variables for the 50 kb windows are shown in Table <tblr tid="T3">3</tblr>. In contrast to <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, we found the correlation between GC% and RR to be higher using the 1 Mb sized window than the 50 kb window. This discrepancy may result from the threefold higher SNP density provided by HapMap phase II <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Importantly, the correlation between GC% and DT is less affected by the change of window size, although the RR-DT correlation is far weaker in the 50 kb window than in the 1 Mb window. This change in the strength of the correlation of RR with GC% and DT from one window size to another may be related to the "domains within domains" phenomenon that had been found for GC-content variation and that may exist for fine-scale recombination rate variation too.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Correlation and partial correlation at 50 kb windows. Correlation and partial correlation between GC%, RR, and DT (negative log transformed) for 50 kb windows.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="center">
                        <p><it>&#961;</it>/partial-<it>&#961;</it></p>
                     </c>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c ca="center">
                        <p>-log(<it>DT</it>)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GC%</p>
                     </c>
                     <c ca="center">
                        <p>0.25/0.17</p>
                     </c>
                     <c ca="center">
                        <p>0.40/0.36</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.22/0.14</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Because none of the pairwise correlations between GC%, RR and DT is rendered insignificant by conditioning on the third variable, it is not possible to remove any edge in the relationship graph for GC%, RR, and DT (Figure <figr fid="F1">1(A)</figr>).</p>
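            <p>Whether a partial correlation is still significant after conditioning can be checked, for example, with Fisher's z-transformation. The text does not state which test was applied, so the sketch below is only one possible choice; the function name is hypothetical, the partial correlations are the rounded values from Table 1(A), and the sample size is the smaller of the two window counts reported above.</p>

```python
import math


def fisher_z_pvalue(r, n, n_conditioning=0):
    """Two-sided p-value for H0: (partial) correlation = 0, via Fisher's z."""
    z = math.atanh(r) * math.sqrt(n - n_conditioning - 3)
    return math.erfc(abs(z) / math.sqrt(2))


n_windows = 2647  # smaller of the two sample sizes reported in the text
table1a_partials = [
    ("GC%-RR | DT", 0.20),
    ("GC%-DT | RR", 0.35),
    ("RR-DT | GC%", 0.38),
]
for label, r in table1a_partials:
    print(label, fisher_z_pvalue(r, n_windows, n_conditioning=1))
```

            <p>With several thousand windows, even the smallest partial correlation of 0.20 yields a p-value far below any conventional threshold under this test, which is consistent with all three edges being retained.</p>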
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Causal graph models or their skeleton for GC-content, recombination-rate, number-of-exons, and distance-to-telomere</p>
               </caption>
               <text>
                  <p><b>Causal graph models or their skeleton for GC-content, recombination-rate, number-of-exons, and distance-to-telomere</b>. (A) Relationship graph for GC%, RR, -log(<it>DT</it>) that is inferred from the correlations in Table 1. (B) Causal graph for GC%, RR, NE that is inferred from the correlations in Table 5. (C) Partially directed graph for GC%, RR, -log(<it>DT</it>), and NE that is consistent with the result in Table <tblr tid="T6">6</tblr>. All edges/arrows are highly significant. (D) A hypothetical model including an extra variable NCO/R: proportion of non-crossing-over events. This model may help to orient the previously undirected edges.</p>
               </text>
               <graphic file="1471-2105-10-S1-S66-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Chromosome-specific correlation and partial correlations</p>
            </st>
            <p>In the next step, we checked the chromosome-specific correlations and partial correlations between the three variables. Table <tblr tid="T4">4</tblr> shows these results in the form of correlation and partial correlation coefficients (and the p-value if it is larger than 0.01) for our main dataset (1 Mb windows including all available human genome sequence, irrespective of its coding status). There are several notable observations: (1) The RR-log(1/<it>DT</it>) correlation is unchanged by conditioning on GC% for non-acrocentric chromosomes, indicating that the position of the window already explains RR, rendering GC% unlikely to be causal. (2) For acrocentric chromosomes (13, 14, 15, 21, 22), the position of the window (DT) is only marginally correlated with RR. In contrast, DT is correlated with GC% for all chromosomes including the acrocentric chromosomes. (3) For some (3, 7, 8, 9, 10, 11, 12, 18, 19), but not for all, chromosomes, the correlation between GC% and RR is weakened by conditioning on DT. (4) For chromosome 2, the positive correlation between RR and DT is not turned negative by conditioning on GC%. This result is interesting, because chromosome 2 is known to result from a relatively recent fusion event of different chromosomes <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Chromosome-specific correlation and partial correlation. Chromosome-specific correlation and partial correlation between GC%, RR, and DT (negative log-transformed) using the 1 Mb window. A p-value for testing zero-correlation is included only when the correlation is not significant. <it>n </it>is the number of windows per chromosome (i.e., sample size). Acrocentric chromosomes are marked by *.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>chr</p>
                     </c>
                     <c ca="center">
                        <p>GC%-RR</p>
                     </c>
                     <c ca="center">
                        <p>GC%-log(1/<it>DT</it>)</p>
                        <p><it>&#961; </it>(<it>p</it>-value)/partial-<it>&#961; </it>(<it>p</it>-value)</p>
                     </c>
                     <c ca="center">
                        <p>RR-log(1/<it>DT</it>)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>n</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.37/0.24</p>
                     </c>
                     <c ca="center">
                        <p>0.47/0.38</p>
                     </c>
                     <c ca="center">
                        <p>0.37/0.24</p>
                     </c>
                     <c ca="center">
                        <p>224</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.43/0.32</p>
                     </c>
                     <c ca="center">
                        <p>0.38/0.24</p>
                     </c>
                     <c ca="center">
                        <p>0.43/0.32</p>
                     </c>
                     <c ca="center">
                        <p>238</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.23/0.17(0.016)</p>
                     </c>
                     <c ca="center">
                        <p>0.17(0.015)/0.065(0.37)</p>
                     </c>
                     <c ca="center">
                        <p>0.51/0.49</p>
                     </c>
                     <c ca="center">
                        <p>194</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.51/0.34</p>
                     </c>
                     <c ca="center">
                        <p>0.52/0.37</p>
                     </c>
                     <c ca="center">
                        <p>0.48/0.29</p>
                     </c>
                     <c ca="center">
                        <p>187</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.58/0.43</p>
                     </c>
                     <c ca="center">
                        <p>0.56/0.40</p>
                     </c>
                     <c ca="center">
                        <p>0.47/0.21</p>
                     </c>
                     <c ca="center">
                        <p>175</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.29/0.40</p>
                     </c>
                     <c ca="center">
                        <p>0.51/0.21</p>
                     </c>
                     <c ca="center">
                        <p>0.65/0.50</p>
                     </c>
                     <c ca="center">
                        <p>166</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0.20(0.01)/0.054(0.50)</p>
                     </c>
                     <c ca="center">
                        <p>0.34/0.28</p>
                     </c>
                     <c ca="center">
                        <p>0.47/0.43</p>
                     </c>
                     <c ca="center">
                        <p>154</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>0.41/0.17(0.048)</p>
                     </c>
                     <c ca="center">
                        <p>0.58/0.46</p>
                     </c>
                     <c ca="center">
                        <p>0.52/0.38</p>
                     </c>
                     <c ca="center">
                        <p>142</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>0.29/0.11(0.24)</p>
                     </c>
                     <c ca="center">
                        <p>0.39/0.30</p>
                     </c>
                     <c ca="center">
                        <p>0.51/0.45</p>
                     </c>
                     <c ca="center">
                        <p>114</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>0.37/0.21(0.015)</p>
                     </c>
                     <c ca="center">
                        <p>0.41/0.28</p>
                     </c>
                     <c ca="center">
                        <p>0.50/0.41</p>
                     </c>
                     <c ca="center">
                        <p>131</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>0.31/0.22(0.01)</p>
                     </c>
                     <c ca="center">
                        <p>0.23/0.063(0.48)</p>
                     </c>
                     <c ca="center">
                        <p>0.59/0.56</p>
                     </c>
                     <c ca="center">
                        <p>130</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>0.39/0.14(0.12)</p>
                     </c>
                     <c ca="center">
                        <p>0.51/0.37</p>
                     </c>
                     <c ca="center">
                        <p>0.58/0.48</p>
                     </c>
                     <c ca="center">
                        <p>129</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>13*</p>
                     </c>
                     <c ca="center">
                        <p>0.54/0.53</p>
                     </c>
                     <c ca="center">
                        <p>0.26/0.23(0.025)</p>
                     </c>
                     <c ca="center">
                        <p>0.13(0.20)/-0.012(0.91)</p>
                     </c>
                     <c ca="center">
                        <p>95</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>14*</p>
                     </c>
                     <c ca="center">
                        <p>0.36/0.41</p>
                     </c>
                     <c ca="center">
                        <p>0.67/0.69</p>
                     </c>
                     <c ca="center">
                        <p>0.086(0.42)/-0.23(0.035)</p>
                     </c>
                     <c ca="center">
                        <p>87</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>15*</p>
                     </c>
                     <c ca="center">
                        <p>-0.097(0.38)/-0.14(0.19)</p>
                     </c>
                     <c ca="center">
                        <p>0.19(0.096)/0.21(0.054)</p>
                     </c>
                     <c ca="center">
                        <p>0.22(0.04)/0.25(0.024)</p>
                     </c>
                     <c ca="center">
                        <p>82</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>0.18(0.11)/-0.14(0.22)</p>
                     </c>
                     <c ca="center">
                        <p>0.64/0.63</p>
                     </c>
                     <c ca="center">
                        <p>0.44/0.43</p>
                     </c>
                     <c ca="center">
                        <p>77</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>0.22(0.04)/0.012(0.92)</p>
                     </c>
                     <c ca="center">
                        <p>0.45/0.40</p>
                     </c>
                     <c ca="center">
                        <p>0.48/0.43</p>
                     </c>
                     <c ca="center">
                        <p>77</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>0.35/0.24(0.04)</p>
                     </c>
                     <c ca="center">
                        <p>0.27(0.02)/0.051(0.66)</p>
                     </c>
                     <c ca="center">
                        <p>0.67/0.63</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>0.45/0.18(0.18)</p>
                     </c>
                     <c ca="center">
                        <p>0.47/0.24(0.082)</p>
                     </c>
                     <c ca="center">
                        <p>0.71/0.63</p>
                     </c>
                     <c ca="center">
                        <p>54</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>0.089(0.50)/-0.13(0.31)</p>
                     </c>
                     <c ca="center">
                        <p>0.26(0.046)/0.28(0.033)</p>
                     </c>
                     <c ca="center">
                        <p>0.70/0.70</p>
                     </c>
                     <c ca="center">
                        <p>59</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>21*</p>
                     </c>
                     <c ca="center">
                        <p>0.13(0.48)/0.13(0.50)</p>
                     </c>
                     <c ca="center">
                        <p>0.92/0.92</p>
                     </c>
                     <c ca="center">
                        <p>0.088(0.61)/-0.079(0.67)</p>
                     </c>
                     <c ca="center">
                        <p>32</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>22*</p>
                     </c>
                     <c ca="center">
                        <p>0.32(0.06)/0.27(0.13)</p>
                     </c>
                     <c ca="center">
                        <p>0.40(0.02)/0.35(0.04)</p>
                     </c>
                     <c ca="center">
                        <p>0.21(0.22)/0.099(0.58)</p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>ave</p>
                     </c>
                     <c ca="center">
                        <p>0.38/0.20</p>
                     </c>
                     <c ca="center">
                        <p>0.47/0.35</p>
                     </c>
                     <c ca="center">
                        <p>0.49/0.38</p>
                     </c>
                     <c ca="center">
                        <p>2655</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>To examine the robustness of these chromosome-specific correlations (Table <tblr tid="T4">4</tblr>), we repeated the correlation analysis using 1 Mb windows of noncoding sequence <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and 50 kb windows (Figure <figr fid="F2">2</figr>). Most of the correlations in Table <tblr tid="T4">4</tblr> are confirmed in these two additional datasets. One interesting observation in Figure <figr fid="F2">2</figr> is that the correlation between RR and DT is weaker for the 50 kb windows, probably because finer details of recombination rate variation are revealed at this length scale and the dependence of RR on DT is no longer monotonic. Thus DT is primarily correlated with large-scale recombination rate variation, which could relate to the proposed conservation of large-scale rates over longer time scales <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Chromosome-specific correlations and partial correlations</p>
               </caption>
               <text>
                  <p><b>Chromosome-specific correlations and partial correlations</b>. Chromosome-specific correlations and partial correlations for GC%-RR (top), GC%-log(1/<it>DT</it>) (middle), and RR-log(1/<it>DT</it>) (bottom) in three datasets: 1 Mb, non-coding (black) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>; 1 Mb, disregarding coding/non-coding status (blue); and 50 kb, disregarding coding/non-coding status (red). Acrocentric chromosomes are marked by yellow bars.</p>
               </text>
               <graphic file="1471-2105-10-S1-S66-2"/>
            </fig>
            <p>An example of chromosome-specific patterns of recombination rate was recently discussed in the context of a putative gene that controls the overall recombination rate <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. In that study, one SNP increases the female recombination rate by almost the same amount on all chromosomes, with the exception of chromosome 21, whereas another SNP reduces the male recombination rate to varying degrees on different chromosomes <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Three variables: GC%, recombination rate, and number of exons</p>
            </st>
            <p>Gene density is a further variable that is known to be strongly correlated with GC% <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. To better understand this relationship, we counted the number of exons within each 1 Mb window, as this count reflects both the number of genes and the number of introns. The correlations and partial correlations between GC%, RR, and the number of exons (NE) are listed in Table <tblr tid="T5">5</tblr>. Unlike the previous analysis of the three variables RR, DT and GC%, replacing DT with NE yields an observation that allows us to infer a causal relationship: although no significant pairwise correlation exists between RR and NE, a negative correlation between RR and NE emerges after conditioning on GC%.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Adding the number-of-exons variable. Correlations and partial correlations between GC%, RR, and number of exons (NE) in 1 Mb windows.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="center">
                        <p><it>&#961;</it>/partial-<it>&#961;</it></p>
                     </c>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c ca="center">
                        <p>NE</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GC%</p>
                     </c>
                     <c ca="center">
                        <p>0.38/0.49</p>
                     </c>
                     <c ca="center">
                        <p>0.69/0.73</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.04/-0.34</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>This result (Table <tblr tid="T5">5</tblr>) suggests the causal model in Figure <figr fid="F1">1(B)</figr>. In this causal model, RR and NE are two independent causes of GC%. The inference of this causal structure rests on the known fact that conditioning on a common child variable creates a correlation between two previously uncorrelated causes of this child variable <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. More specifically, the relationship between NE and RR can be understood as follows: normally the two variables RR and NE do not contain any information about each other and are therefore uncorrelated. However, once GC-content is known as a third variable, this situation changes and RR and NE become mutually informative. This mutual informativeness of NE and RR given GC% is explained by a model in which both RR and NE are independent causes of GC%. When GC% in a region is high and RR is low, NE is more likely to be high. Conversely, when GC% is high and NE is low, RR is more likely to be high. Thus, given the status of GC%, a previously invisible relationship between RR and NE emerges due to the causal influence of both variables on GC%.</p>
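            <p>The effect of conditioning on a common child variable can be illustrated with a small simulation. The following Python sketch draws two independent variables, builds a third variable as their noisy sum, and compares the unconditional correlation of the two causes with their partial correlation given the child, computed here as the correlation of least-squares residuals. The coefficients 0.5 and 0.8, the noise level, the sample size, and all function names are arbitrary assumptions chosen for illustration only; they are not estimates from the genomic data.</p>

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def residuals(ys, zs):
    """Residuals of a least-squares straight-line fit of ys on zs."""
    n = len(ys)
    my, mz = sum(ys) / n, sum(zs) / n
    szy = sum((z - mz) * (y - my) for y, z in zip(ys, zs))
    szz = sum((z - mz) ** 2 for z in zs)
    slope = szy / szz
    return [(y - my) - slope * (z - mz) for y, z in zip(ys, zs)]


def simulate(n=20000, seed=0):
    """Simulate two independent causes (rr, ne) of a common child (gc_content)."""
    rng = random.Random(seed)
    rr = [rng.gauss(0, 1) for _ in range(n)]
    ne = [rng.gauss(0, 1) for _ in range(n)]
    # Assumed toy model: the child is a noisy linear combination of both causes.
    gc_content = [0.5 * r + 0.8 * e + rng.gauss(0, 0.5) for r, e in zip(rr, ne)]
    unconditional = pearson(rr, ne)
    # Partial correlation given the child: correlate what is left of each
    # cause after removing its linear dependence on the child.
    conditional = pearson(residuals(rr, gc_content), residuals(ne, gc_content))
    return unconditional, conditional


unconditional, conditional = simulate()
print(f"correlation of the two causes:       {unconditional:+.3f}")
print(f"partial correlation given the child: {conditional:+.3f}")
```

            <p>In this toy model the unconditional correlation stays close to zero, whereas the partial correlation given the child is clearly negative, which is the qualitative pattern seen for RR and NE in Table <tblr tid="T5">5</tblr>. For linear relationships, the residual-based value coincides with the first-order partial correlation formula given in Methods.</p>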
            <p>Consistent with our present observation, a negative correlation between gene density and RR was observed earlier in a multiple regression analysis of 3 Mb windows, despite the fact that the unconditional correlation between RR and gene count was weakly positive <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Window size may be a factor that influences the magnitude of the observed correlations. Recombination events tend to occur more often in physical proximity to genes than in intergenic regions; on the other hand, they also tend to occur away from exons on a finer scale <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. This subtle variation of RR at different length scales might explain why the correlation between RR and NE is insignificant at the 1 Mb scale, but was weakly positive at the 3 Mb scale.</p>
            <p>Nevertheless, when we repeated the chromosome-specific analysis using the variable NE (instead of DT), the overall pattern of correlation between RR and NE was confirmed. Unconditionally, the correlation is not significant and can be either positive or negative. However, the partial correlations between NE and RR conditional on GC% are all negative, and most of them are significant (results not shown). In principle, the absence of an unconditional correlation between RR and NE could also result from a phenomenon termed suppression <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. Suppression refers to the situation in which two paths from the same starting node to the same ending node carry effects of opposite sign. However, the observed change of the correlation from insignificant to significant is inconsistent with suppression, because this conditional dependence indicates that both the link between NE and GC% and the link between RR and GC% point towards GC%.</p>
         </sec>
         <sec>
            <st>
               <p>Four variables: GC%, recombination rate, distance to telomere, and number of exons</p>
            </st>
            <p>In the final step, we extended our 3-variable analysis to a 4-variable analysis, which includes GC%, RR, -log(<it>DT</it>), and NE. Besides the previously calculated first-order partial correlations (conditional on one variable), we also calculated the second-order partial correlations (conditional on two other variables). The results are shown in Table <tblr tid="T6">6</tblr>. When comparing the second-order partial correlations to the first-order partial correlations, we found that conditioning on GC% is mostly responsible for any change of correlation status. Conditioning on DT, RR or NE has only a quantitative effect and does not introduce any qualitative changes into the pairwise and first-order correlations. This implies a central position of GC% among these variables.</p>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Correlation/partial correlation between GC%, RR, DT and NE. In addition, the first-order partial correlations for the RR-NE and DT-NE pairs are shown, whereas the first-order partial correlations between the other variables were already shown above.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p><it>&#961;</it>/partial-<it>&#961;</it></p>
                     </c>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c ca="center">
                        <p>-log(<it>DT</it>)</p>
                     </c>
                     <c ca="center">
                        <p>NE</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>GC%</p>
                     </c>
                     <c ca="center">
                        <p>0.38/0.33</p>
                     </c>
                     <c ca="center">
                        <p>0.47/0.33</p>
                     </c>
                     <c ca="center">
                        <p>0.69/0.72</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>RR</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.49/0.33</p>
                     </c>
                     <c ca="center">
                        <p>0.036/-0.28 (cond. on GC% and log(1/DT))</p>
                        <p>/-0.34 (cond. on GC%)</p>
                        <p>/-0.056 (cond. on log(1/DT))</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>log(1/<it>DT</it>)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>0.17/-0.12 (cond. on GC% and RR)</p>
                        <p>/-0.23 (cond. on GC%)</p>
                        <p>/0.18 (cond. on RR)</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
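            <p>As an illustration of how the higher-order values in Table <tblr tid="T6">6</tblr> relate to the pairwise ones, the following minimal Python sketch applies the standard recursion for partial correlations to the rounded pairwise correlations from the table. The dictionary PAIRWISE and the helper names rho and partial_rho are illustrative choices, not part of the original analysis, and the label DT stands for the transformed variable log(1/<it>DT</it>). Because the inputs are rounded, the results can only be expected to agree with the table to about two decimals.</p>

```python
import math

# Pairwise correlations as listed in Table 6 (rounded values).
# "DT" denotes the transformed variable log(1/DT).
PAIRWISE = {
    frozenset(("GC", "RR")): 0.38,
    frozenset(("GC", "DT")): 0.47,
    frozenset(("GC", "NE")): 0.69,
    frozenset(("RR", "DT")): 0.49,
    frozenset(("RR", "NE")): 0.036,
    frozenset(("DT", "NE")): 0.17,
}


def rho(x, y):
    """Look up the pairwise correlation of x and y, in either order."""
    return PAIRWISE[frozenset((x, y))]


def partial_rho(x, y, given=()):
    """Partial correlation of x and y given the variables in `given`.

    Each recursion step removes one conditioning variable z by applying
    the first-order formula to correlations that are already conditional
    on the remaining variables.
    """
    if not given:
        return rho(x, y)
    z, rest = given[0], tuple(given[1:])
    r_xy = partial_rho(x, y, rest)
    r_xz = partial_rho(x, z, rest)
    r_yz = partial_rho(y, z, rest)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# First-order and second-order partial correlations for the RR-NE pair.
print(round(partial_rho("RR", "NE", ("GC",)), 2))
print(round(partial_rho("RR", "NE", ("GC", "DT")), 2))
```

            <p>With these rounded inputs, the sketch gives about -0.34 for RR-NE conditional on GC% and about -0.27 conditional on both GC% and log(1/<it>DT</it>), close to the values -0.34 and -0.28 listed in Table <tblr tid="T6">6</tblr>.</p>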
            <p>Figure <figr fid="F1">1(C)</figr> depicts a partially directed graph that is consistent with the results in Table <tblr tid="T6">6</tblr>. Importantly, the inclusion of DT does not alter the causal relationships that were inferred above in the 3-variable analysis of RR, NE and GC%. Also, the above correlations between RR, DT and GC% remain largely unaltered by the inclusion of NE. As mentioned above, telomere distance is inversely correlated with GC% and RR. Additionally, we see in the unconditional pairwise analysis that telomere distance is inversely correlated with NE too, although this correlation is of smaller magnitude. This correlation between DT and NE does not change substantially when conditioning on RR. However, when conditioning on GC%, the correlation between DT and NE changes its sign. Following a similar line of reasoning as above, this suggests a model where DT and NE are two independent causes of GC%. By contrast, this cannot be said for the influence of RR and DT on GC%, because the correlation between RR and DT does not depend on conditioning on GC%.</p>
            <p>To find the missing orientations of the links between RR, DT and GC% in the 4-variable model, we next applied the TETRAD program <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, which implements the PC-algorithm to create a causal model by a systematic search strategy (see Methods for details) <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>. The graphical result that we obtained from running TETRAD is essentially the same as the one depicted in Figure <figr fid="F1">1(C)</figr> and confirmed the direction of the two arrows that we had inferred for the causative influence of both RR and NE on GC%. However, the additionally proposed orientations of the links RR &#8594; -log(<it>DT</it>) and NE &#8594; -log(<it>DT</it>) are biologically counterintuitive, because telomere distance is unlikely to be an effect of any of the other variables. To explain the difficulty of inferring the directions of these causal links between RR, DT and GC%, we hypothesize the causal model in Figure <figr fid="F1">1(D)</figr>. This model includes as a fifth, hidden variable the proportion of recombination events that are resolved exclusively as gene conversion events without any crossing-over event (NCO/R), a variable that was recently suggested to be important <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. In the model in Figure <figr fid="F1">1(D)</figr>, NCO/R is a cause of GC% that does not fully depend on RR, but is influenced in its magnitude by RR. A similar relationship might connect NCO/R with NE. On the other hand, distance to telomere, similar to other variables measuring position or time, might play the role of providing a common environment for several other variables. In other words, one can draw a directed arrow from DT to all other variables under discussion. A similar situation is seen for the linkage disequilibrium between two neighboring genetic markers, where the position can be considered a "cause" of both markers. However, we could not test the validity of the model in Figure <figr fid="F1">1(D)</figr> because NCO/R data are not available.</p>
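            <p>The collider rule used by the PC-algorithm to orient links can be sketched in a few lines of Python. This is a simplified illustration of a single rule, not TETRAD's implementation: the function name orient_colliders, the edge list, and the separating sets are hypothetical inputs that mirror the 3-variable case of RR, NE and GC%, where RR and NE are not linked because they are unconditionally uncorrelated.</p>

```python
from itertools import combinations


def orient_colliders(edges, sepsets):
    """Orient each unshielded triple x - z - y as a collider at z.

    `edges` lists the undirected links that survived the independence tests;
    `sepsets` maps each non-adjacent pair to the set of variables that made
    the pair independent. If z links x and y but is absent from their
    separating set, both links are pointed towards z.
    Returns a sorted list of (tail, head) arrows.
    """
    adjacent = {frozenset(e) for e in edges}
    nodes = sorted({v for e in edges for v in e})
    arrows = set()
    for x, y in combinations(nodes, 2):
        if frozenset((x, y)) in adjacent:
            continue  # only non-adjacent pairs can form an unshielded triple
        for z in nodes:
            if z in (x, y):
                continue
            linked = frozenset((x, z)) in adjacent and frozenset((y, z)) in adjacent
            if linked and z not in sepsets[frozenset((x, y))]:
                arrows.add((x, z))
                arrows.add((y, z))
    return sorted(arrows)


# Hypothetical toy input: RR and NE are each linked to GC but not to each
# other, and their separating set is empty (independent without conditioning).
edges = [("RR", "GC"), ("NE", "GC")]
sepsets = {frozenset(("RR", "NE")): set()}
print(orient_colliders(edges, sepsets))
```

            <p>For this toy input both links are oriented towards GC, corresponding to the structure RR &#8594; GC% and NE &#8594; GC%. If GC were instead a member of the separating set of RR and NE, no arrow would be produced and the links would stay undirected.</p>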
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We applied partial correlation analysis and graphical probabilistic model inference to several genomic variables that are correlated with GC-content in the human genome. Our results indicate that recombination rate and exon density are two independent causes of GC% as measured on the 1 Mb scale. This observation adds some support to models that complement the influence of recombination rate on GC-content with a component involving selection. In addition, it appears unlikely that GC% variation is a cause of variation in recombination rate or exon density. We observe some heterogeneity in the human genome, such as differences between acrocentric and non-acrocentric chromosomes in the correlation of RR with the distance to telomere. We also see indications of a window-size-dependent correlation pattern, which may reflect subtle differences in the distribution of recombination near and within genes.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Terminology in relationship and causal graphs</p>
            </st>
            <p>A graph G = (V, E) consists of a vertex (node) set V and an edge (link) set E &#8838; <it>V </it>&#215; <it>V</it>. An edge (<it>i</it>, <it>j</it>) &#8712; <it>E </it>is "directed" if (<it>j</it>, <it>i</it>) &#8713; <it>E</it>, and "undirected" if (<it>j</it>, <it>i</it>) &#8712; <it>E</it>. If there is an edge between nodes <it>i </it>and <it>j</it>, either directed or undirected, we say there is a "direct association/relationship" between the two nodes. If there is no edge between node <it>i </it>and node <it>j</it>, the two are still connected through a path of several edges, because all our nodes belong to one single graph; we then say that the two nodes are "indirectly associated".</p>
            <p>If all edges are directed, the graph is said to be a "directed graph" (e.g. Figure <figr fid="F1">1(B)</figr>). If all edges are undirected, the graph is an "undirected graph" (e.g. Figure <figr fid="F1">1(A)</figr>). If some edges are directed and other edges are undirected, the graph is a "partially directed graph" (e.g. Figure <figr fid="F1">1(C)</figr>).</p>
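            <p>These definitions translate directly into a small data structure. The following Python sketch stores E as a set of ordered pairs and classifies a graph as directed, undirected, or partially directed. The function name classify_graph and the two example edge sets are illustrative assumptions; the second edge set is a hypothetical chain and is not meant to reproduce any panel of Figure <figr fid="F1">1</figr>.</p>

```python
def classify_graph(edges):
    """Classify a graph whose edge set E is given as ordered pairs (i, j).

    An edge (i, j) is directed if (j, i) is not in E, and undirected if
    (j, i) is also in E.
    """
    E = set(edges)
    directed = {(i, j) for (i, j) in E if (j, i) not in E}
    undirected = {frozenset((i, j)) for (i, j) in E if (j, i) in E}
    if directed and undirected:
        kind = "partially directed"
    elif directed:
        kind = "directed"
    else:
        kind = "undirected"
    return kind, directed, undirected


# Collider model in which RR and NE both point to GC.
collider = {("RR", "GC"), ("NE", "GC")}
# Hypothetical chain; an undirected edge is stored as both (i, j) and (j, i).
chain = {("RR", "GC"), ("GC", "RR"), ("GC", "NE"), ("NE", "GC")}

print(classify_graph(collider)[0])
print(classify_graph(chain)[0])
```

            <p>Storing an undirected edge as the two ordered pairs (<it>i</it>, <it>j</it>) and (<it>j</it>, <it>i</it>) keeps the sketch faithful to the set-based definition above, at the cost of some redundancy.</p>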
         </sec>
         <sec>
            <st>
               <p>Partial correlations</p>
            </st>
            <p>In many situations, conditional correlation is equivalent to partial correlation <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, which is defined as follows (with one control variable <it>z</it>):</p>
            <p>
               <display-formula id="M1">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-10-S1-S66-i1">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>&#961;</m:mi>
                              <m:mrow>
                                 <m:mi>x</m:mi>
                                 <m:mi>y</m:mi>
                                 <m:mo>.</m:mo>
                                 <m:mi>z</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#961;</m:mi>
                                    <m:mrow>
                                       <m:mi>x</m:mi>
                                       <m:mi>y</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#961;</m:mi>
                                    <m:mrow>
                                       <m:mi>x</m:mi>
                                       <m:mi>z</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:msub>
                                    <m:mi>&#961;</m:mi>
                                    <m:mrow>
                                       <m:mi>y</m:mi>
                                       <m:mi>z</m:mi>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                              <m:mrow>
                                 <m:msqrt>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#961;</m:mi>
                                          <m:mrow>
                                             <m:mi>x</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#961;</m:mi>
                                          <m:mrow>
                                             <m:mi>y</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:msqrt>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeqyWdi3aaSbaaSqaaiabdIha4jabdMha5jabc6caUiabdQha6bqabaGccqGH9aqpjuaGdaWcaaqaaiabeg8aYnaaBaaabaGaemiEaGNaemyEaKhabeaacqGHsislcqaHbpGCdaWgaaqaaiabdIha4jabdQha6bqabaGaeqyWdi3aaSbaaeaacqWG5bqEcqWG6bGEaeqaaaqaamaakaaabaGaeiikaGIaeGymaeJaeyOeI0IaeqyWdi3aa0baaeaacqWG4baEcqWG6bGEaeaacqaIYaGmaaGaeiykaKIaeiikaGIaeGymaeJaeyOeI0IaeqyWdi3aa0baaeaacqWG5bqEcqWG6bGEaeaacqaIYaGmaaGaeiykaKcabeaaaaGaeiOla4caaa@582C@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-10-S1-S66-i2"><m:semantics><m:mrow><m:mi/><m:mo stretchy="false"/><m:msub><m:mi>&#961;</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow></m:msub><m:mo>=</m:mo><m:mi>c</m:mi><m:mi>o</m:mi><m:mi>v</m:mi><m:mo stretchy="false">(</m:mo><m:mi>x</m:mi><m:mo>,</m:mo><m:mi>y</m:mi><m:mo stretchy="false">)</m:mo><m:mo>/</m:mo><m:msqrt><m:mrow><m:mi>v</m:mi><m:mi>a</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:mi>x</m:mi><m:mo stretchy="false">)</m:mo><m:mi>v</m:mi><m:mi>a</m:mi><m:mi>r</m:mi><m:mo stretchy="false">(</m:mo><m:mi>y</m:mi><m:mo stretchy="false">)</m:mo></m:mrow></m:msqrt></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOEaONaeiikaGIaeqyWdi3aaSbaaSqaaiabdIha4jabdMha5bqabaGccqGH9aqpcqWGJbWycqWGVbWBcqWG2bGDcqGGOaakcqWG4baEcqGGSaalcqWG5bqEcqGGPaqkcqGGVaWldaGcaaqaaiabdAha2jabdggaHjabdkhaYjabcIcaOiabdIha4jabcMcaPiabdAha2jabdggaHjabdkhaYjabcIcaOiabdMha5jabcMcaPaWcbeaaaaa@4D7E@</m:annotation></m:semantics></m:math></inline-formula> is the Pearson product-moment correlation coefficient. In the linear regression framework, the partial correlation is the correlation between the residuals of <it>x </it>and <it>y </it>that remain after each has been regressed on <it>z</it>:</p>
            <p>
               <display-formula id="M2">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-10-S1-S66-i3">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mi>x</m:mi>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>a</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                       <m:mi>z</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>&#1013;</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mi>y</m:mi>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>a</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                       <m:mi>z</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>&#1013;</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#961;</m:mi>
                                          <m:mrow>
                                             <m:mi>x</m:mi>
                                             <m:mi>y</m:mi>
                                             <m:mo>.</m:mo>
                                             <m:mi>z</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mo>=</m:mo>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>C</m:mi>
                                       <m:mi>o</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>&#1013;</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#1013;</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabmWaaaqaaiabdIha4bqaaiabg2da9aqaaiabdggaHnaaBaaaleaacqWG4baEaeqaaOGaey4kaSIaemOyai2aaSbaaSqaaiabdIha4bqabaGccqWG6bGEcqGHRaWktqvzynutnfgDOLeDHXwAJbqegmwBTLwmWaaceiGae8x9di7aaSbaaSqaaiabdIha4bqabaaakeaacqWG5bqEaeaacqGH9aqpaeaacqWGHbqydaWgaaWcbaGaemyEaKhabeaakiabgUcaRiabdkgaInaaBaaaleaacqWG5bqEaeqaaOGaemOEaONaey4kaSIae8x9di7aaSbaaSqaaiabdMha5bqabaaakeaacqaHbpGCdaWgaaWcbaGaemiEaGNaemyEaKNaeiOla4IaemOEaOhabeaaaOqaaiabg2da9aqaaiabdoeadjabd+gaVjabdkhaYjabcIcaOiab=v=aYoaaBaaaleaacqWG4baEaeqaaOGaeiilaWIae8x9di7aaSbaaSqaaiabdMha5bqabaGccqGGPaqkaaaaaa@69AE@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Partial correlation <it>&#961;</it><sub><it>xy</it>.<it>z </it></sub>is often lower than <it>&#961;</it><sub><it>xy</it></sub>, and a significantly lower partial correlation is an indication that the <it>x </it>- <it>y </it>correlation is indirect.</p>
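            <p>As an illustration only (it is not part of the original analysis), the following minimal Python sketch computes the first-order partial correlation both from the closed form in Eq.(1) and from the regression residuals in Eq.(2). The simulated data, in which <it>z </it>drives both <it>x </it>and <it>y</it>, and all function names are assumptions made for this example.</p>

```python
import math
import random


def pearson(u, v):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    var_u = sum((a - mu) ** 2 for a in u)
    var_v = sum((b - mv) ** 2 for b in v)
    return cov / math.sqrt(var_u * var_v)


def residuals(u, z):
    """Residuals of the least-squares fit u = a + b*z + eps."""
    n = len(u)
    mu, mz = sum(u) / n, sum(z) / n
    slope = (sum((zi - mz) * (ui - mu) for ui, zi in zip(u, z))
             / sum((zi - mz) ** 2 for zi in z))
    intercept = mu - slope * mz
    return [ui - (intercept + slope * zi) for ui, zi in zip(u, z)]


def partial_corr_formula(x, y, z):
    """Eq.(1): closed form built from the three pairwise correlations."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz ** 2) * (1 - ryz ** 2))


def partial_corr_residual(x, y, z):
    """Eq.(2): correlation of the residuals after regressing on z."""
    return pearson(residuals(x, z), residuals(y, z))


# Hypothetical synthetic data: z drives both x and y, so the x-y
# correlation is entirely indirect.
rng = random.Random(0)
z = [rng.gauss(0, 1) for _ in range(2000)]
x = [zi + rng.gauss(0, 1) for zi in z]
y = [zi + rng.gauss(0, 1) for zi in z]

r_xy = pearson(x, y)
r_formula = partial_corr_formula(x, y, z)
r_resid = partial_corr_residual(x, y, z)
print(r_xy, r_formula, r_resid)
```

            <p>On data of this kind the two estimates should agree up to rounding error, and the partial correlation should be close to zero even though the unconditional correlation is clearly positive.</p>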
            <p>With more than three variables (e.g. <it>x</it>, <it>y</it>, <it>z</it>, <it>w</it>), the partial correlation can be defined conditional on one variable (e.g. <it>z</it>; first order) or on two variables (<it>z</it>, <it>w</it>; second order). Both Eq.(1) and Eq.(2) can be extended to calculate the second-order partial correlation:</p>
            <p>
               <display-formula id="M3">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-10-S1-S66-i4">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>&#961;</m:mi>
                              <m:mrow>
                                 <m:mi>x</m:mi>
                                 <m:mi>y</m:mi>
                                 <m:mo>.</m:mo>
                                 <m:mi>z</m:mi>
                                 <m:mi>w</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:msub>
                                    <m:mi>&#961;</m:mi>
                                    <m:mrow>
                                       <m:mi>x</m:mi>
                                       <m:mi>y</m:mi>
                                       <m:mo>.</m:mo>
                                       <m:mi>z</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msub>
                                    <m:mi>&#961;</m:mi>
                                    <m:mrow>
                                       <m:mi>x</m:mi>
                                       <m:mi>w</m:mi>
                                       <m:mo>.</m:mo>
                                       <m:mi>z</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:msub>
                                    <m:mi>&#961;</m:mi>
                                    <m:mrow>
                                       <m:mi>y</m:mi>
                                       <m:mi>w</m:mi>
                                       <m:mo>.</m:mo>
                                       <m:mi>z</m:mi>
                                    </m:mrow>
                                 </m:msub>
                              </m:mrow>
                              <m:mrow>
                                 <m:msqrt>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#961;</m:mi>
                                          <m:mrow>
                                             <m:mi>x</m:mi>
                                             <m:mi>w</m:mi>
                                             <m:mo>.</m:mo>
                                             <m:mi>z</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mn>1</m:mn>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msubsup>
                                          <m:mi>&#961;</m:mi>
                                          <m:mrow>
                                             <m:mi>y</m:mi>
                                             <m:mi>w</m:mi>
                                             <m:mo>.</m:mo>
                                             <m:mi>z</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo stretchy="false">)</m:mo>
                                    </m:mrow>
                                 </m:msqrt>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeqyWdi3aaSbaaSqaaiabdIha4jabdMha5jabc6caUiabdQha6jabdEha3bqabaGccqGH9aqpjuaGdaWcaaqaaiabeg8aYnaaBaaabaGaemiEaGNaemyEaKNaeiOla4IaemOEaOhabeaacqGHsislcqaHbpGCdaWgaaqaaiabdIha4jabdEha3jabc6caUiabdQha6bqabaGaeqyWdi3aaSbaaeaacqWG5bqEcqWG3bWDcqGGUaGlcqWG6bGEaeqaaaqaamaakaaabaGaeiikaGIaeGymaeJaeyOeI0IaeqyWdi3aa0baaeaacqWG4baEcqWG3bWDcqGGUaGlcqWG6bGEaeaacqaIYaGmaaGaeiykaKIaeiikaGIaeGymaeJaeyOeI0IaeqyWdi3aa0baaeaacqWG5bqEcqWG3bWDcqGGUaGlcqWG6bGEaeaacqaIYaGmaaGaeiykaKcabeaaaaaaaa@648C@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>and</p>
            <p>
               <display-formula id="M4">
                  <m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-10-S1-S66-i5">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>x</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:msub>
                                          <m:mi>a</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                       <m:mi>z</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>c</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                       <m:mi>w</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>&#1013;</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mi>y</m:mi>
                                       <m:mo>=</m:mo>
                                       <m:msub>
                                          <m:mi>a</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>b</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                       <m:mi>z</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>c</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                       <m:mi>w</m:mi>
                                       <m:mo>+</m:mo>
                                       <m:msub>
                                          <m:mi>&#1013;</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>&#961;</m:mi>
                                          <m:mrow>
                                             <m:mi>x</m:mi>
                                             <m:mi>y</m:mi>
                                             <m:mo>.</m:mo>
                                             <m:mi>z</m:mi>
                                             <m:mi>w</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:mi>C</m:mi>
                                       <m:mi>o</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:msub>
                                          <m:mi>&#1013;</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>&#1013;</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:msub>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:mo>.</m:mo>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeWabmqaaaqaaiabdIha4jabg2da9iabdggaHnaaBaaaleaacqWG4baEaeqaaOGaey4kaSIaemOyai2aaSbaaSqaaiabdIha4bqabaGccqWG6bGEcqGHRaWkcqWGJbWydaWgaaWcbaGaemiEaGhabeaakiabdEha3jabgUcaRmbvLH1qn1uy0Hws0fgBPngaryWyT1wAXadaiqGacqWF1pGSdaWgaaWcbaGaemiEaGhabeaaaOqaaiabdMha5jabg2da9iabdggaHnaaBaaaleaacqWG5bqEaeqaaOGaey4kaSIaemOyai2aaSbaaSqaaiabdMha5bqabaGccqWG6bGEcqGHRaWkcqWGJbWydaWgaaWcbaGaemyEaKhabeaakiabdEha3jabgUcaRiab=v=aYoaaBaaaleaacqWG5bqEaeqaaaGcbaGaeqyWdi3aaSbaaSqaaiabdIha4jabdMha5jabc6caUiabdQha6jabdEha3bqabaGccqGH9aqpcqWGdbWqcqWGVbWBcqWGYbGCcqGGOaakcqWF1pGSdaWgaaWcbaGaemiEaGhabeaakiabcYcaSiab=v=aYoaaBaaaleaacqWG5bqEaeqaaOGaeiykaKIaeiOla4caaaaa@76B3@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>Higher-order partial correlations can be defined in an analogous fashion.</p>
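            <p>The second-order definitions can be checked numerically in the same spirit. The sketch below is a hedged illustration, not the authors' code: it evaluates the recursion in Eq.(3) and compares it with the residual form in Eq.(4). For the latter it regresses on <it>z </it>first and then on the part of <it>w </it>not explained by <it>z</it>, which yields the same residuals as a joint least-squares regression on <it>z </it>and <it>w</it>. The data and all names are assumptions.</p>

```python
import math
import random


def pearson(u, v):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    var_u = sum((a - mu) ** 2 for a in u)
    var_v = sum((b - mv) ** 2 for b in v)
    return cov / math.sqrt(var_u * var_v)


def residuals(u, z):
    """Residuals of the least-squares fit u = a + b*z + eps."""
    n = len(u)
    mu, mz = sum(u) / n, sum(z) / n
    slope = (sum((zi - mz) * (ui - mu) for ui, zi in zip(u, z))
             / sum((zi - mz) ** 2 for zi in z))
    intercept = mu - slope * mz
    return [ui - (intercept + slope * zi) for ui, zi in zip(u, z)]


def first_order(a, b, c):
    """Eq.(1): partial correlation of a and b given c."""
    rab, rac, rbc = pearson(a, b), pearson(a, c), pearson(b, c)
    return (rab - rac * rbc) / math.sqrt((1 - rac ** 2) * (1 - rbc ** 2))


def second_order_recursive(x, y, z, w):
    """Eq.(3): recursion on the first-order partial correlations given z."""
    rxy_z = first_order(x, y, z)
    rxw_z = first_order(x, w, z)
    ryw_z = first_order(y, w, z)
    return (rxy_z - rxw_z * ryw_z) / math.sqrt(
        (1 - rxw_z ** 2) * (1 - ryw_z ** 2))


def second_order_residual(x, y, z, w):
    """Eq.(4): correlate the residuals of x and y given both z and w."""
    ex, ey, ew = residuals(x, z), residuals(y, z), residuals(w, z)
    return pearson(residuals(ex, ew), residuals(ey, ew))


# Hypothetical data: x and y share two independent drivers, z and w.
rng = random.Random(0)
n = 2000
z = [rng.gauss(0, 1) for _ in range(n)]
w = [rng.gauss(0, 1) for _ in range(n)]
x = [a + b + rng.gauss(0, 1) for a, b in zip(z, w)]
y = [a + b + rng.gauss(0, 1) for a, b in zip(z, w)]

r_first = first_order(x, y, z)
r_rec = second_order_recursive(x, y, z, w)
r_res = second_order_residual(x, y, z, w)
print(r_first, r_rec, r_res)
```

            <p>In this set-up, conditioning on <it>z </it>alone should leave a clearly positive partial correlation (the shared driver <it>w </it>is still present), whereas conditioning on both <it>z </it>and <it>w </it>should bring it close to zero.</p>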
         </sec>
         <sec>
            <st>
               <p>Establishing undirected, partially directed and directed graphs</p>
            </st>
            <p>Figure <figr fid="F3">3</figr> illustrates how relationships and a causal graph can be inferred from data for three variables <it>x</it>, <it>y</it>, <it>z</it>. In Figure <figr fid="F3">3(A)</figr>, we assume that all pairwise correlations are significant, so every node is linked to every other node. If the correlation between two of the variables is not due to a direct cause-effect relationship but is mediated via a third variable, then the correlation between the two conditional on that third variable will be greatly reduced. Accordingly, we end up with Figure <figr fid="F3">3(B)</figr> if we assume that the partial correlation <it>Cor</it>(<it>x</it>, <it>y</it>|<it>z</it>) becomes insignificant while the other two partial correlations remain significant. In that case, partial correlation cannot determine the orientation of the causal arrows: all causal models in Figure <figr fid="F3">3(C)</figr> except the first one at the top are possible. However, in a special situation, a directed causal model can be inferred uniquely. Suppose <it>Cor</it>(<it>x</it>, <it>z</it>) and <it>Cor</it>(<it>y</it>, <it>z</it>) are both significant but <it>Cor</it>(<it>x</it>, <it>y</it>) is insignificant; then the unconditional analysis already gives the undirected graph in Figure <figr fid="F3">3(B)</figr>. Further suppose that <it>Cor</it>(<it>x</it>, <it>y</it>|<it>z</it>), <it>Cor</it>(<it>x</it>, <it>z</it>|<it>y</it>) and <it>Cor</it>(<it>y</it>, <it>z</it>|<it>x</it>) are all significant. Then, by the rule of d-separation <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, only the top model in Figure <figr fid="F3">3(C)</figr> is consistent with these assumptions.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Illustration of the procedures in establishing undirected, directed, or partially directed graphs</p>
               </caption>
               <text>
                  <p><b>Illustration of the procedures in establishing undirected, directed, or partially directed graphs</b>. (A) If the correlation <it>Cor</it>(<it>i</it>, <it>j</it>) is significant, draw an edge between node <it>i </it>and node <it>j </it>(<it>i</it>, <it>j </it>&#8712; (<it>x</it>, <it>y</it>, <it>z</it>)). (B) If the conditional correlation <it>Cor</it>(<it>x</it>, <it>y</it>|<it>z</it>) is insignificant, remove the edge (<it>x</it>, <it>y</it>). (C) Use other information to select the one or few causal models that are consistent with the data.</p>
               </text>
               <graphic file="1471-2105-10-S1-S66-3"/>
            </fig>
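            <p>The two situations discussed in this section can be mimicked with simulated data. The following sketch is an illustration under stated assumptions (the linear Gaussian models, the sample size and all names are hypothetical): it contrasts a chain, in which <it>x </it>influences <it>y </it>only through <it>z</it>, with a collider, in which <it>x </it>and <it>y </it>independently influence <it>z</it>.</p>

```python
import math
import random


def pearson(u, v):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    var_u = sum((a - mu) ** 2 for a in u)
    var_v = sum((b - mv) ** 2 for b in v)
    return cov / math.sqrt(var_u * var_v)


def partial_corr(x, y, z):
    """First-order partial correlation Cor(x, y | z), as in Eq.(1)."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz ** 2) * (1 - ryz ** 2))


rng = random.Random(1)
n = 5000

# Chain x -> z -> y: x and y are correlated only through z.
x1 = [rng.gauss(0, 1) for _ in range(n)]
z1 = [a + rng.gauss(0, 1) for a in x1]
y1 = [c + rng.gauss(0, 1) for c in z1]
chain_raw = pearson(x1, y1)
chain_partial = partial_corr(x1, y1, z1)

# Collider x -> z and y -> z: x and y are independent causes of z.
x2 = [rng.gauss(0, 1) for _ in range(n)]
y2 = [rng.gauss(0, 1) for _ in range(n)]
z2 = [a + b + rng.gauss(0, 1) for a, b in zip(x2, y2)]
collider_raw = pearson(x2, y2)
collider_partial = partial_corr(x2, y2, z2)

print("chain:   ", chain_raw, chain_partial)
print("collider:", collider_raw, collider_partial)
```

            <p>For the chain, the unconditional correlation should be substantial and the partial correlation close to zero, which corresponds to removing the edge (<it>x</it>, <it>y</it>). For the collider the pattern is reversed: <it>x </it>and <it>y </it>are nearly uncorrelated unconditionally but become clearly (here negatively) correlated once <it>z </it>is conditioned on. This is the signature that allows the arrows to be oriented.</p>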
         </sec>
         <sec>
            <st>
               <p>TETRAD program for inference of causal models from partial correlations</p>
            </st>
            <p>The TETRAD program <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> implements the PC-algorithm to automatically infer causal relationships from partial correlation analysis <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. This algorithm can be broken into two phases: an adjacency phase and an orientation phase. In the adjacency phase, a complete undirected graph over the variables is constructed, and an edge <it>X </it>- <it>Y </it>is then removed if some set <it>S </it>among either the adjacents of <it>X </it>or the adjacents of <it>Y </it>can be found such that <it>I</it>(<it>X</it>, <it>Y</it>|<it>S</it>), i.e. <it>X </it>and <it>Y </it>are independent conditional on <it>S</it>. The orientation phase then begins. Its first step examines unshielded triples and considers whether to orient them as colliders. An unshielded triple is a triple (<it>X</it>, <it>Y</it>, <it>Z</it>) where <it>X </it>is adjacent to <it>Y</it>, <it>Y </it>is adjacent to <it>Z</it>, but <it>X </it>is not adjacent to <it>Z</it>. Since <it>X </it>is not adjacent to <it>Z</it>, the edge <it>X </it>- <it>Z </it>must have been removed during the adjacency search by conditioning on some set <it>S</it><sub><it>xz</it></sub>; (<it>X</it>, <it>Y</it>, <it>Z</it>) is oriented as a collider <it>X </it>&#8594; <it>Y </it>&#8592; <it>Z </it>if and only if <it>Y </it>is not in this <it>S</it><sub><it>xz</it></sub>. Once all such unshielded triples have been oriented as colliders, a series of rules orients any edge whose orientation is implied by previous orientations.</p>
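            <p>The two phases can be sketched in a few lines of Python. This is a much simplified illustration and not TETRAD's implementation: the independence test (a Fisher z-transform of the partial correlation), its threshold, the simulated data and all names are assumptions, and the final series of orientation rules is omitted.</p>

```python
import itertools
import math
import random


def pearson(u, v):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    var_u = sum((a - mu) ** 2 for a in u)
    var_v = sum((b - mv) ** 2 for b in v)
    return cov / math.sqrt(var_u * var_v)


def pcorr(data, i, j, cond):
    """Partial correlation of any order, by the recursion of Eq.(1)/(3)."""
    if not cond:
        return pearson(data[i], data[j])
    k, rest = cond[0], cond[1:]
    rij, rik, rjk = (pcorr(data, i, j, rest), pcorr(data, i, k, rest),
                     pcorr(data, j, k, rest))
    return (rij - rik * rjk) / math.sqrt((1 - rik ** 2) * (1 - rjk ** 2))


def independent(data, i, j, cond, crit=3.29):
    """Assumed test: Fisher z statistic; crit=3.29 is roughly alpha=0.001."""
    n = len(data[i])
    r = pcorr(data, i, j, tuple(cond))
    stat = 0.5 * math.log((1 + r) / (1 - r)) * math.sqrt(n - len(cond) - 3)
    return crit >= abs(stat)


def pc_sketch(data):
    names = sorted(data)
    # Adjacency phase: start from the complete undirected graph.
    adj = {v: set(names) - {v} for v in names}
    sepset = {}
    level = 0
    while any(len(adj[v]) - 1 >= level for v in names):
        for x, y in itertools.permutations(names, 2):
            if y not in adj[x]:
                continue
            # Try conditioning sets S of the current size among adjacents of x.
            for s in itertools.combinations(sorted(adj[x] - {y}), level):
                if independent(data, x, y, s):
                    adj[x].discard(y)
                    adj[y].discard(x)
                    sepset[frozenset((x, y))] = set(s)
                    break
        level += 1
    # Orientation phase, first step only: unshielded triples (x, y, z) with
    # y outside the separating set of x and z become colliders x -> y, z -> y.
    colliders = set()
    for y in names:
        for x, z in itertools.combinations(sorted(adj[y]), 2):
            if z not in adj[x] and y not in sepset[frozenset((x, z))]:
                colliders.add((x, y, z))
    return adj, colliders


# Hypothetical data: X and Z are independent causes of Y; W depends on Y only.
rng = random.Random(0)
n = 2000
X = [rng.gauss(0, 1) for _ in range(n)]
Z = [rng.gauss(0, 1) for _ in range(n)]
Y = [a + b + rng.gauss(0, 1) for a, b in zip(X, Z)]
W = [c + rng.gauss(0, 1) for c in Y]
adj, colliders = pc_sketch({"X": X, "Y": Y, "Z": Z, "W": W})
print(adj, colliders)
```

            <p>With data generated in this way, one would expect the adjacency phase to retain only the edges <it>X </it>- <it>Y</it>, <it>Z </it>- <it>Y </it>and <it>Y </it>- <it>W</it>, and the first orientation step to mark (<it>X</it>, <it>Y</it>, <it>Z</it>) as the only collider, because <it>X </it>and <it>Z </it>are separated by the empty set whereas <it>W </it>is separated from <it>X </it>and from <it>Z </it>by <it>Y</it>.</p>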
         </sec>
      </sec>
      <sec>
         <st>
            <p>List of abbreviations used</p>
         </st>
         <p>BGC: biased gene conversion; DT: distance to telomere; GC%: guanine and cytosine content; NCO/R: non-crossing-over events among recombination events; NE: number of exons; PC-algorithm: Peter (Spirtes) and Clark (Glymour) algorithm; RR: recombination rate; SNP: single nucleotide polymorphism.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>W.L. proposed the project, carried out the correlation and partial correlation calculations, and wrote the initial draft of the manuscript; J.F. prepared the data, contributed most of the biological discussion, and wrote the final version of the manuscript; M.W. ran the TETRAD program and contributed to the theoretical aspects of causal inference; Y.Y. contributed to the theoretical aspects of partial correlation.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Peter Arndt for sharing the data he used in his paper. Yaning Yang's work was supported by the Chinese Natural Science Foundation (No. 10671189).</p>
            <p>This article has been published as part of <it>BMC Bioinformatics </it>Volume 10 Supplement 1, 2009: Proceedings of The Seventh Asia Pacific Bioinformatics Conference (APBC) 2009. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/1471-2105/10?issue=S1</url></p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The isochore organization of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Bernardi</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Ann Rev Genet</source>
            <pubdate>1989</pubdate>
            <volume>23</volume>
            <fpage>637</fpage>
            <lpage>661</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.ge.23.120189.003225</pubid>
                  <pubid idtype="pmpid" link="fulltext">2694946</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Isochores and the evolutionary genomics of vertebrates</p>
            </title>
            <aug>
               <au>
                  <snm>Bernardi</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2000</pubdate>
            <volume>241</volume>
            <fpage>3</fpage>
            <lpage>17</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(99)00485-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">10607893</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>DNA helix: the importance of being GC-rich</p>
            </title>
            <aug>
               <au>
                  <snm>Vinogradov</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>1838</fpage>
            <lpage>1844</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">152811</pubid>
                  <pubid idtype="pmpid" link="fulltext">12654999</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg296</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Initial sequencing and analysis of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Lander</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <cnm>(International Human Genome Sequencing Consortium)</cnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>860</fpage>
            <lpage>921</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35057062</pubid>
                  <pubid idtype="pmpid" link="fulltext">11237011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Misunderstandings about isochores. Part 1</p>
            </title>
            <aug>
               <au>
                  <snm>Bernardi</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2001</pubdate>
            <volume>276</volume>
            <fpage>3</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(01)00644-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">11591466</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Recombination and mammalian genome evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Royal Soc Biol Sci</source>
            <pubdate>1993</pubdate>
            <volume>252</volume>
            <fpage>237</fpage>
            <lpage>243</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1098/rspb.1993.0071</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The evolution of isochores</p>
            </title>
            <aug>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>549</fpage>
            <lpage>555</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35080577</pubid>
                  <pubid idtype="pmpid" link="fulltext">11433361</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Local rates of recombination are positively correlated with GC content in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Fullerton</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Carvalho</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>1139</fpage>
            <lpage>1142</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11371603</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>GC-content evolution in mammalian genomes: the biased gene conversion hypothesis</p>
            </title>
            <aug>
               <au>
                  <snm>Galtier</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Piganeau</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mouchiroud</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Duret</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2001</pubdate>
            <volume>159</volume>
            <fpage>907</fpage>
            <lpage>911</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461818</pubid>
                  <pubid idtype="pmpid" link="fulltext">11693127</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The elevated GC content at exonic third sites is not evidence against neutralist models of isochore evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Duret</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>757</fpage>
            <lpage>762</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11319260</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Recombination explains isochores in mammalian genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Montoya-Burgos</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Boursot</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Galtier</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>128</fpage>
            <lpage>130</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(03)00021-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">12615004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Delineating relative homogeneous G+C domains in DNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2001</pubdate>
            <volume>276</volume>
            <fpage>57</fpage>
            <lpage>72</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(01)00672-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">11591472</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Are isochore sequences homogeneous?</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2002</pubdate>
            <volume>300</volume>
            <fpage>129</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(02)00847-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">12468094</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex</p>
            </title>
            <aug>
               <au>
                  <snm>Jeffreys</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Kauppi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Neumann</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>217</fpage>
            <lpage>222</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1001-217</pubid>
                  <pubid idtype="pmpid" link="fulltext">11586303</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The fine-scale structure of recombination rate variation in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>McVean</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Deloukas</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bentley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <fpage>581</fpage>
            <lpage>584</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1092500</pubid>
                  <pubid idtype="pmpid" link="fulltext">15105499</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A fine-scale map of recombination rates and hotspots across the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Myers</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bottolo</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Freeman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>McVean</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>310</volume>
            <fpage>321</fpage>
            <lpage>324</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1117196</pubid>
                  <pubid idtype="pmpid" link="fulltext">16224025</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans</p>
            </title>
            <aug>
               <au>
                  <snm>Coop</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Ober</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Przeworski</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2008</pubdate>
            <volume>319</volume>
            <fpage>1395</fpage>
            <lpage>1398</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1151851</pubid>
                  <pubid idtype="pmpid" link="fulltext">18239090</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The influence of recombination on human genetic diversity</p>
            </title>
            <aug>
               <au>
                  <snm>Spencer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Deloukas</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mullikin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Silverman</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bentley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McVean</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e148</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1575889</pubid>
                  <pubid idtype="pmpid" link="fulltext">17044736</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0020148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Recombination drives the evolution of GC-content in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Meunier</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Duret</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>984</fpage>
            <lpage>990</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh070</pubid>
                  <pubid idtype="pmpid" link="fulltext">14963104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Fixation biases affecting human SNPs</p>
            </title>
            <aug>
               <au>
                  <snm>Webster</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>122</fpage>
            <lpage>126</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.01.005</pubid>
                  <pubid idtype="pmpid">15049304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Male-driven biased gene conversion governs the evolution of base composition in human Alu repeats</p>
            </title>
            <aug>
               <au>
                  <snm>Webster</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hultin-Rosenberg</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Arndt</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ellegren</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>1468</fpage>
            <lpage>1474</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi136</pubid>
                  <pubid idtype="pmpid" link="fulltext">15772377</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The impact of recombination on nucleotide substitutions in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Duret</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Arndt</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <fpage>e1000071</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2346554</pubid>
                  <pubid idtype="pmpid" link="fulltext">18464896</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.1000071</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Isochores exhibit evidence of genes interacting with the large-scale genomic environment</p>
            </title>
            <aug>
               <au>
                  <snm>Press</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Robins</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>174</volume>
            <fpage>1029</fpage>
            <lpage>1040</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1534/genetics.105.054445</pubid>
                  <pubid idtype="pmpid" link="fulltext">16951086</pubid>
                  <pubid idtype="pmcid">1602094</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The neoselectionist theory of genome evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Bernardi</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>8385</fpage>
            <lpage>8390</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1866311</pubid>
                  <pubid idtype="pmpid" link="fulltext">17494746</pubid>
                  <pubid idtype="doi">10.1073/pnas.0701652104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Both selective and neutral processes drive GC content evolution in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Pozzoli</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Menozzi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fumagalli</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cereda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Comi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cagliani</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bresolin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sironi</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2008</pubdate>
            <volume>8</volume>
            <fpage>99</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2292697</pubid>
                  <pubid idtype="pmpid" link="fulltext">18371205</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-8-99</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Enrichment of HapMap recombination hotspot predictions around human nervous system genes: evidence for positive selection?</p>
            </title>
            <aug>
               <au>
                  <snm>Freudenberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ptacek</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Eur J Hum Genet</source>
            <pubdate>2007</pubdate>
            <volume>15</volume>
            <fpage>1071</fpage>
            <lpage>1078</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.ejhg.5201876</pubid>
                  <pubid idtype="pmpid" link="fulltext">17568387</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>A second generation human haplotype map of over 3.1 million SNPs</p>
            </title>
            <aug>
               <au>
                  <snm>Frazer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <cnm>(International HapMap Consortium)</cnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>449</volume>
            <fpage>851</fpage>
            <lpage>861</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06258</pubid>
                  <pubid idtype="pmpid" link="fulltext">17943122</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Initial sequence of the chimpanzee genome and comparison with the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Waterston</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <cnm>(The Chimpanzee Sequencing and Analysis Consortium)</cnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>69</fpage>
            <lpage>87</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04072</pubid>
                  <pubid idtype="pmpid" link="fulltext">16136131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <aug>
               <au>
                  <snm>Shipley</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Cause and Correlation in Biology</source>
            <publisher>Cambridge, UK: Cambridge University Press</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Discovery of meaningful associations in genomic data using partial correlation coefficients</p>
            </title>
            <aug>
               <au>
                  <snm>de la Fuente</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bing</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hoeschele</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Mendes</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>3565</fpage>
            <lpage>3574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth445</pubid>
                  <pubid idtype="pmpid" link="fulltext">15284096</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Inferring causal relationships among intermediate phenotypes and biomarkers: a case study of rheumatoid arthritis</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Irigoyen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gregersen</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>1503</fpage>
            <lpage>1507</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl100</pubid>
                  <pubid idtype="pmpid" link="fulltext">16551663</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Biased clustered substitutions in the human genome: the footprints of male-driven biased gene conversion</p>
            </title>
            <aug>
               <au>
                  <snm>Dreszer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wall</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pollard</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1420</fpage>
            <lpage>1430</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1987345</pubid>
                  <pubid idtype="pmpid" link="fulltext">17785536</pubid>
                  <pubid idtype="doi">10.1101/gr.6395807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Origin of human chromosome 2: an ancestral telomere-telomere fusion</p>
            </title>
            <aug>
               <au>
                  <snm>Ijdo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baldini</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Reeders</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wells</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci</source>
            <pubdate>1991</pubdate>
            <volume>88</volume>
            <fpage>9051</fpage>
            <lpage>9055</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">52649</pubid>
                  <pubid idtype="pmpid" link="fulltext">1924367</pubid>
                  <pubid idtype="doi">10.1073/pnas.88.20.9051</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Sequence variants in the RNF212 gene associate with genome-wide recombination rate</p>
            </title>
            <aug>
               <au>
                  <snm>Kong</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2008</pubdate>
            <volume>319</volume>
            <fpage>1398</fpage>
            <lpage>1401</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1152422</pubid>
                  <pubid idtype="pmpid" link="fulltext">18239089</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>A high-resolution recombination map of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Kong</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>241</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12053178</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>A cautionary tale of two statistics: partial correlation and standardized partial regression</p>
            </title>
            <aug>
               <au>
                  <snm>Cramer</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Psychol</source>
            <pubdate>2003</pubdate>
            <volume>137</volume>
            <fpage>507</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubid idtype="pmpid">14629080</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Suppression and enhancement in bivariate regression</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Escobar</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Statistician</source>
            <pubdate>1986</pubdate>
            <volume>35</volume>
            <fpage>17</fpage>
            <lpage>26</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2988294</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Simpson's Paradox, Lord's Paradox, and Suppression Effects are the same phenomenon &#8211; the reversal paradox</p>
            </title>
            <aug>
               <au>
                  <snm>Tu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gunnell</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gilthorpe</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Emerg Themes Epidemiol</source>
            <pubdate>2008</pubdate>
            <volume>5</volume>
            <fpage>2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2254615</pubid>
                  <pubid idtype="pmpid" link="fulltext">18211676</pubid>
                  <pubid idtype="doi">10.1186/1742-7622-5-2</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>TETRAD</p>
            </title>
            <url>http://www.phil.cmu.edu/projects/tetrad/</url>
         </bibl>
         <bibl id="B40">
            <title>
               <p>An algorithm for fast recovery of sparse causal graphs</p>
            </title>
            <aug>
               <au>
                  <snm>Spirtes</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Glymour</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Soc Sci Comp Rev</source>
            <pubdate>1991</pubdate>
            <volume>9</volume>
            <fpage>62</fpage>
            <lpage>72</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1177/089443939100900106</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <aug>
               <au>
                  <snm>Spirtes</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Glymour</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Scheines</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Causation, Prediction and Search</source>
            <publisher>Cambridge, MA: MIT Press</publisher>
            <edition>2</edition>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Partial correlation and conditional correlation as measures of conditional independence</p>
            </title>
            <aug>
               <au>
                  <snm>Baba</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sibuya</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Aus New Zealand J Stat</source>
            <pubdate>2004</pubdate>
            <volume>46</volume>
            <fpage>657</fpage>
            <lpage>664</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1467-842X.2004.00360.x</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>