<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-8-428</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Gaunt</snm>
               <mi>R</mi>
               <fnm>Tom</fnm>
               <insr iid="I1"/>
               <email>tom.gaunt@bristol.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Rodr&#237;guez</snm>
               <fnm>Santiago</fnm>
               <insr iid="I1"/>
               <email>santi.rodriguez@bristol.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Day</snm>
               <mi>NM</mi>
               <fnm>Ian</fnm>
               <insr iid="I1"/>
               <email>ian.day@bristol.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bristol Genetic Epidemology Laboratories (BGEL) and MRC Centre for Causal Analyses in Translational Epidemiology (CAiTE), Department of Social Medicine, University of Bristol, Canynge Hall, Whiteladies Road, Bristol, BS8 2PR, UK</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>428</fpage>
         <url>http://www.biomedcentral.com/1471-2105/8/428</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17980034</pubid>
               <pubid idtype="doi">10.1186/1471-2105-8-428</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>05</day>
               <month>2</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>02</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>02</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Gaunt et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The frequency of a haplotype comprising one allele at each of two loci can be expressed as a cubic equation (the 'Hill equation'), the solution of which gives that frequency. Most haplotype and linkage disequilibrium analysis programs use iteration-based algorithms which substitute an estimate of haplotype frequency into the equation, producing a new estimate which is repeatedly fed back into the equation until the values converge to a maximum likelihood estimate (expectation-maximisation).</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We present a program, "CubeX", which calculates the biologically possible exact solution(s) and provides estimated haplotype frequencies, D', r<sup>2 </sup>and <it>&#967;</it><sup>2 </sup>values for each. CubeX provides a "complete" analysis of haplotype frequencies and linkage disequilibrium for a pair of biallelic markers under situations where sampling variation and genotyping errors distort sample Hardy-Weinberg equilibrium, potentially causing more than one biologically possible solution. We also present an analysis of simulations and real data using the algebraically exact solution, which indicates that under perfect sample Hardy-Weinberg equilibrium there is only one biologically possible solution, but that under other conditions there may be more.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our analyses demonstrate that lower allele frequencies, lower sample numbers, population stratification and a possible |D'| value of 1 are particularly susceptible to distortion of sample Hardy-Weinberg equilibrium, which has significant implications for calculation of linkage disequilibrium in small sample sizes (eg HapMap) and rarer alleles (eg paucimorphisms, q &lt; 0.05) that may have particular disease relevance and require improved approaches for meaningful evaluation.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="refman"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Linkage disequilibrium (LD) describes the condition that occurs when alleles at different loci are non-randomly associated in a given population. Under LD the frequency (<it>f</it><sub>11</sub>) of a haplotype (<it>h</it><sub>11</sub>) representing the "1" allele at two loci is significantly more or less than the product of the respective allele frequencies. Characterisation of LD is important in medical genetics, influencing association mapping of trait loci and providing information on interactions between genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. LD is the result of a shared history of mutation and recombination, and other factors including: genetic drift, population growth, admixture, population structure, the ages of the polymorphisms, the physical distance separating them and the effects of selective pressure <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
         <p>For unrelated individuals the estimation of LD relies on the estimation of haplotype frequencies. In a 3 &#215; 3 table for a biallelic marker the haplotype phase of all individuals is known with the exception of the centre cell (representing individuals heterozygous at both loci). The estimated frequency, <inline-formula><m:math name="1471-2105-8-428-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaaaaa@2F44@</m:annotation></m:semantics></m:math></inline-formula>, of the haplotype <it>h</it><sub>11 </sub>is described by a cubic equation of the form</p>
         <p>
            <display-formula id="M1">
               <m:math name="1471-2105-8-428-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>a</m:mi>
                        <m:msubsup>
                           <m:mover accent="true">
                              <m:mi>f</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mrow>
                              <m:mn>11</m:mn>
                           </m:mrow>
                           <m:mn>3</m:mn>
                        </m:msubsup>
                        <m:mo>+</m:mo>
                        <m:mi>b</m:mi>
                        <m:msubsup>
                           <m:mover accent="true">
                              <m:mi>f</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mrow>
                              <m:mn>11</m:mn>
                           </m:mrow>
                           <m:mn>2</m:mn>
                        </m:msubsup>
                        <m:mo>+</m:mo>
                        <m:mi>c</m:mi>
                        <m:msub>
                           <m:mover accent="true">
                              <m:mi>f</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mrow>
                              <m:mn>11</m:mn>
                           </m:mrow>
                        </m:msub>
                        <m:mo>+</m:mo>
                        <m:mi>d</m:mi>
                        <m:mo>=</m:mo>
                        <m:mn>0</m:mn>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyyaeMafmOzayMbaKaadaqhaaWcbaGaeGymaeJaeGymaedabaGaeG4mamdaaOGaey4kaSIaemOyaiMafmOzayMbaKaadaqhaaWcbaGaeGymaeJaeGymaedabaGaeGOmaidaaOGaey4kaSIaem4yamMafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaakiabgUcaRiabdsgaKjabg2da9iabicdaWaaa@424C@</m:annotation>
                  </m:semantics>
               </m:math>
            </display-formula>
         </p>
         <p>that is adapted from Hill's equation (4) <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> with the constants defined under Methods. With <inline-formula><m:math name="1471-2105-8-428-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaaaaa@2F44@</m:annotation></m:semantics></m:math></inline-formula> and the allele frequencies, all four haplotype frequencies can be calculated, thus estimating the unknown proportions of the middle cell.</p>
         <p>Several approaches exist for solving equation (1), the solution of which enables estimation of haplotype frequencies and LD coefficients. The first approach uses iteration-based algorithms. An initial estimate of haplotype frequency (either random, or based on the known haplotype numbers) is substituted into the equation, providing a new estimate. This is then fed back into the equation and the expectation-maximisation (EM) process repeated until the values converge. This is the basis both of the algorithm described by Hill in 1974 for the estimation of pairwise haplotype frequencies <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, and of other EM algorithms that enable the estimation of multilocus haplotype frequencies. Many programs exist that utilise variations on this approach, including: GOLD <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, GOLDsurfer <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, MIDAS <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, Haploview <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and many others reviewed in <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. The potential problem for these approaches is that algorithms may converge on one of the alternative roots of the cubic equation (a local maximum rather than the global maximum).</p>
         <p>Other approaches include parsimony, eg HAPAR <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and Bayesian algorithms, eg PHASE <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Parsimony and Bayesian methods are both better suited to estimating individual haplotypes than EM approaches, while Bayesian and EM methods are useful for estimating population frequencies <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>An alternative approach would be exact solution, such as <it>Cardan's solution </it><abbrgrp><abbr bid="B17">17</abbr></abbrgrp> of the generalized cubic equation (of which equation (1) is an example). This provides all roots to the cubic equation, from which we can select those that are both <it>real </it>(i.e. not a complex number) and <it>biologically possible</it>. If more than one solution exists then the likelihoods of the different solutions can be compared and an informed evaluation made of the result. Theoretically, the non-iterative approach may be computationally less intensive and more accurate, but computational efficiency and accuracy will be software and platform dependent.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <p>Hill assumed random mating and Hardy Weinberg Equilibrium (HWE) <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Rearranging terms for consequent diplotype frequency expectations for two biallelic loci Luo and Suhai <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> obtained equation 1 given in the introduction (here redefining <inline-formula><m:math name="1471-2105-8-428-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaaaaa@2F44@</m:annotation></m:semantics></m:math></inline-formula> as <it>x</it>, <it>a3 </it>as <it>a</it>, <it>a2 </it>as <it>b</it>, <it>a1 </it>as <it>c </it>and <it>a </it>as <it>d </it>for convenience): <it>ax</it><sup>3 </sup>+ <it>bx</it><sup>2 </sup>+ <it>cx </it>+ <it>d </it>= 0, where <it>a </it>= 4<it>n</it>; <it>b </it>= 2<it>n </it>(1 - 2<it>p </it>- 2<it>q</it>) - 2(2<it>n</it><sub>11 </sub>+ <it>n</it><sub>12 </sub>+ <it>n</it><sub>21</sub>) - <it>n</it><sub>22</sub>; <it>c </it>= 2<it>npq </it>- (2<it>n</it><sub>11 </sub>+ <it>n</it><sub>12 </sub>+ <it>n</it><sub>21</sub>)(1 - 2<it>p </it>- 2<it>q</it>) - <it>n</it><sub>22</sub>(1 - <it>p </it>- <it>q</it>); <it>d </it>= -(2<it>n</it><sub>11 </sub>+ <it>n</it><sub>12 </sub>+ <it>n</it><sub>21</sub>)<it>pq</it>; <it>n = </it>number of subjects; <it>p </it>= common allele freq of locus 1; <it>q </it>= common allele freq of locus 2; <it>n</it><sub>11 </sub>is the number of subjects who are homozygous for the commoner allele at both loci; <it>n</it><sub>12 </sub>are common homozygous at locus 1 and heterozygous at locus 2; <it>n</it><sub>21 </sub>are heterozygous at locus 1 and common homozygous at locus 2; <it>n</it><sub>22 </sub>are heterozygous at both loci <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Equation 1 can be solved exactly for <it>x </it>(with 1 to 3 real number solutions).</p>
         <p>We have adopted the Nickalls treatment of the Cardan solution of the generalized cubic equation <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, and written a Python <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> program "CubeX" to solve equation 1 exactly. In CubeX, after calculation of constants <it>a</it>-<it>d </it>from diplotypic data the following are calculated:</p>
         <p>
            <display-formula><it>x</it><sub><it>N </it></sub>= -<it>b</it>/(<it>3a</it>); <it>&#948;</it><sup>2 </sup>= <it>(b</it><sup>2 </sup>-<it>3ac)/9a</it><sup>2</sup>; <it>h</it><sup>2 </sup>= 4<it>a</it><sup>2</sup><it>&#948;</it><sup>6</sup>; <it>y</it><sub><it>N </it></sub>= <it>ax</it><sub><it>N</it></sub><sup>3 </sup>+ <it>bx</it><sub><it>N</it></sub><sup>2 </sup>+ <it>cx</it><sub><it>N </it></sub>+ <it>d</it>.</display-formula>
         </p>
         <p>The discriminant &#916;<sub>3 </sub>= <it>y</it><sub><it>N</it></sub><sup>2 </sup>- <it>h</it><sup>2 </sup>is then used to determine the outcome in real roots (without having to go through complex number intermediates or ambiguities), with three possible outcomes:</p>
         <p>Outcome 1: if <it>y</it><sub><it>N</it></sub><sup>2 </sup>> <it>h</it><sup>2 </sup>there will be only one real root (<it>&#945;</it>) given by</p>
         <p>
            <display-formula id="M2">
               <m:math name="1471-2105-8-428-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>&#945;</m:mi>
                        <m:mo>=</m:mo>
                        <m:msub>
                           <m:mi>x</m:mi>
                           <m:mi>N</m:mi>
                        </m:msub>
                        <m:mo>+</m:mo>
                        <m:mroot>
                           <m:mrow>
                              <m:mfrac>
                                 <m:mn>1</m:mn>
                                 <m:mrow>
                                    <m:mn>2</m:mn>
                                    <m:mi>a</m:mi>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mrow>
                                 <m:mo>(</m:mo>
                                 <m:mrow>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>y</m:mi>
                                       <m:mi>N</m:mi>
                                    </m:msub>
                                    <m:mo>+</m:mo>
                                    <m:msqrt>
                                       <m:mrow>
                                          <m:msubsup>
                                             <m:mi>y</m:mi>
                                             <m:mi>N</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msubsup>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msup>
                                             <m:mi>h</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:msqrt>
                                 </m:mrow>
                                 <m:mo>)</m:mo>
                              </m:mrow>
                           </m:mrow>
                           <m:mn>3</m:mn>
                        </m:mroot>
                        <m:mo>+</m:mo>
                        <m:mroot>
                           <m:mrow>
                              <m:mfrac>
                                 <m:mn>1</m:mn>
                                 <m:mrow>
                                    <m:mn>2</m:mn>
                                    <m:mi>a</m:mi>
                                 </m:mrow>
                              </m:mfrac>
                              <m:mrow>
                                 <m:mo>(</m:mo>
                                 <m:mrow>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>y</m:mi>
                                       <m:mi>N</m:mi>
                                    </m:msub>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msqrt>
                                       <m:mrow>
                                          <m:msubsup>
                                             <m:mi>y</m:mi>
                                             <m:mi>N</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msubsup>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msup>
                                             <m:mi>h</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:msqrt>
                                 </m:mrow>
                                 <m:mo>)</m:mo>
                              </m:mrow>
                           </m:mrow>
                           <m:mn>3</m:mn>
                        </m:mroot>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGae8xSdeMaeyypa0JaemiEaG3aaSbaaSqaaiabd6eaobqabaGccqGHRaWkdaGcbaqaaKqbaoaalaaabaGaeGymaedabaGaeGOmaiJaemyyaegaaOWaaeWaaeaacqGHsislcqWG5bqEdaWgaaWcbaGaemOta4eabeaakiabgUcaRmaakaaabaGaemyEaK3aa0baaSqaaiabd6eaobqaaiabikdaYaaakiabgkHiTiabdIgaOnaaCaaaleqabaGaeGOmaidaaaqabaaakiaawIcacaGLPaaaaSqaaiabiodaZaaakiabgUcaRmaakeaabaqcfa4aaSaaaeaacqaIXaqmaeaacqaIYaGmcqWGHbqyaaGcdaqadaqaaiabgkHiTiabdMha5naaBaaaleaacqWGobGtaeqaaOGaeyOeI0YaaOaaaeaacqWG5bqEdaqhaaWcbaGaemOta4eabaGaeGOmaidaaOGaeyOeI0IaemiAaG2aaWbaaSqabeaacqaIYaGmaaaabeaaaOGaayjkaiaawMcaaaWcbaGaeG4mamdaaaaa@582E@</m:annotation>
                  </m:semantics>
               </m:math>
            </display-formula>
         </p>
         <p>Outcome 2: if <it>y</it><sub><it>N</it></sub><sup>2 </sup>= <it>h</it><sup>2 </sup>there are three real roots (<it>&#945;</it>, <it>&#946; </it>and <it>&#947;</it>) and <it>&#945; </it>and <it>&#946; </it>are equal. For a value of <inline-formula><m:math name="1471-2105-8-428-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>&#956;</m:mi><m:mo>=</m:mo><m:mroot><m:mrow><m:mfrac><m:mrow><m:msub><m:mi>y</m:mi><m:mi>N</m:mi></m:msub></m:mrow><m:mrow><m:mn>2</m:mn><m:mi>a</m:mi></m:mrow></m:mfrac></m:mrow><m:mn>3</m:mn></m:mroot></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGae8hVd0Maeyypa0ZaaOqaaeaajuaGdaWcaaqaaiabdMha5naaBaaabaGaemOta4eabeaaaeaacqaIYaGmcqWGHbqyaaaaleaacqaIZaWmaaaaaa@3541@</m:annotation></m:semantics></m:math></inline-formula>:</p>
         <p>
            <display-formula id="M3"><it>&#945; </it>= <it>x</it><sub><it>N </it></sub>+ <it>&#956;</it></display-formula>
         </p>
         <p>
            <display-formula id="M4"><it>&#946; </it>= <it>x</it><sub><it>N </it></sub>+ <it>&#956;</it></display-formula>
         </p>
         <p>
            <display-formula id="M5"><it>&#947; </it>= <it>x</it><sub><it>N </it></sub>- 2<it>&#956;</it></display-formula>
         </p>
         <p>Outcome 3: if <it>y</it><sub><it>N</it></sub><sup>2 </sup>&lt;<it>h</it><sup>2 </sup>there are three real roots (<it>&#945;</it>, <it>&#946; </it>and <it>&#947;</it>). Where <inline-formula><m:math name="1471-2105-8-428-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>&#952;</m:mi><m:mo>=</m:mo><m:mfrac><m:mrow><m:mi>arccos</m:mi><m:mo>&#8289;</m:mo><m:mo stretchy="false">(</m:mo><m:mo>&#8722;</m:mo><m:msub><m:mi>y</m:mi><m:mi>N</m:mi></m:msub><m:mo>/</m:mo><m:mi>h</m:mi><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>3</m:mn></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGae8hUdeNaeyypa0tcfa4aaSaaaeaacyGGHbqycqGGYbGCcqGGJbWycqGGJbWycqGGVbWBcqGGZbWCcqGGOaakcqGHsislcqWG5bqEdaWgaaqaaiabd6eaobqabaGaei4la8IaemiAaGMaeiykaKcabaGaeG4mamdaaaaa@3FEF@</m:annotation></m:semantics></m:math></inline-formula>:</p>
         <p>
            <display-formula id="M6"><it>&#945; </it>= <it>x</it><sub><it>N </it></sub>+ <it>2&#948;</it>cos<it>&#952;</it></display-formula>
         </p>
         <p>
            <display-formula id="M7"><it>&#946; </it>= <it>x</it><sub><it>N </it></sub>+ <it>2&#948;</it>cos(2<it>&#960;</it>/3 + <it>&#952;</it>)</display-formula>
         </p>
         <p>
            <display-formula id="M8"><it>&#947; </it>= <it>x</it><sub><it>N </it></sub>+ <it>2&#948;</it>cos(4&#960;/3 + <it>&#952;</it>)</display-formula>
         </p>
         <p>Values for D' and r<sup>2 </sup>are calculated as previously described <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>:</p>
         <p>
            <display-formula>
               <m:math name="1471-2105-8-428-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mtext>D</m:mtext>
                        <m:mo>=</m:mo>
                        <m:mo stretchy="false">(</m:mo>
                        <m:msub>
                           <m:mover accent="true">
                              <m:mi>f</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mrow>
                              <m:mn>11</m:mn>
                           </m:mrow>
                        </m:msub>
                        <m:mo>&#215;</m:mo>
                        <m:msub>
                           <m:mover accent="true">
                              <m:mi>f</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mrow>
                              <m:mn>22</m:mn>
                           </m:mrow>
                        </m:msub>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>&#8722;</m:mo>
                        <m:mo stretchy="false">(</m:mo>
                        <m:msub>
                           <m:mover accent="true">
                              <m:mi>f</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mrow>
                              <m:mn>12</m:mn>
                           </m:mrow>
                        </m:msub>
                        <m:mo>&#215;</m:mo>
                        <m:msub>
                           <m:mover accent="true">
                              <m:mi>f</m:mi>
                              <m:mo>^</m:mo>
                           </m:mover>
                           <m:mrow>
                              <m:mn>21</m:mn>
                           </m:mrow>
                        </m:msub>
                        <m:mo stretchy="false">)</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaeeiraqKaeyypa0JaeiikaGIafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaakiabgEna0kqbdAgaMzaajaWaaSbaaSqaaiabikdaYiabikdaYaqabaGccqGGPaqkcqGHsislcqGGOaakcuWGMbGzgaqcamaaBaaaleaacqaIXaqmcqaIYaGmaeqaaOGaey41aqRafmOzayMbaKaadaWgaaWcbaGaeGOmaiJaeGymaedabeaakiabcMcaPaaa@44A9@</m:annotation>
                  </m:semantics>
               </m:math>
            </display-formula>
         </p>
         <p>D<sub>max </sub>= min [<it>p</it>(1-<it>q</it>),(1-<it>p</it>)<it>q</it>] <b>if </b>D > 0 <b>or </b>D<sub>max </sub>= min [<it>pq</it>, (1-<it>p</it>)(1-<it>q</it>)] <b>if </b>D &lt; 0</p>
         <p>
            <display-formula id="M9">D' = D/D<sub>max </sub></display-formula>
         </p>
         <p>
            <display-formula id="M10">r<sup>2 </sup>= D<sup>2</sup>/(<it>p</it>(1-<it>p</it>)<it>q</it>(1-<it>q</it>))</display-formula>
         </p>
         <p>Diplotype frequencies based on the estimated haplotype frequencies are compared to the input diplotype frequencies by a <it>&#967;</it><sup>2 </sup>test, which effectively tests sample deviation from the null hypothesis of HWE for the diplotypes formed of the four haplotypes. The number of degrees of freedom is equal to the number of observations (diplotype counts) minus four estimated parameters which are either three haplotypes (the fourth can be inferred) and D, or one haplotype, two allele frequencies and D. If nine different diplotypes are observed the number of degrees of freedom is therefore five. For each empty cell in the 3 &#215; 3 the number of degrees of freedom is reduced by one. If the user knows there are only three haplotypes present (and therefore six diplotypes) then there are only three estimated parameters (D is inferred by the three haplotype frequencies) and 3 df. It is important to note that in the latter case neither cubic solution nor iteration is necessary as the haplotype frequencies can be directly counted from the diplotype data. If the user believes that there are only three alleles and hence six diplotypes, but there are non-zero values for any of the other three possible diplotypes, then reconsideration of the technical veracity of the data and of the homogeneity of the population sample would be wise.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>Solutions are considered biologically possible when <inline-formula><m:math name="1471-2105-8-428-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaaaaa@2F44@</m:annotation></m:semantics></m:math></inline-formula> and the derived <inline-formula><m:math name="1471-2105-8-428-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>12</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGOmaidabeaaaaa@2F46@</m:annotation></m:semantics></m:math></inline-formula>, <inline-formula><m:math name="1471-2105-8-428-i8" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>21</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGOmaiJaeGymaedabeaaaaa@2F46@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1471-2105-8-428-i9" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>22</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGOmaiJaeGOmaidabeaaaaa@2F48@</m:annotation></m:semantics></m:math></inline-formula> all fall within the range 0 to 1 (i.e. <inline-formula><m:math name="1471-2105-8-428-i10" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub><m:mo>,</m:mo><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>12</m:mn></m:mrow></m:msub><m:mo>,</m:mo><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>21</m:mn></m:mrow></m:msub><m:mo>,</m:mo><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>22</m:mn></m:mrow></m:msub><m:mo>&#8712;</m:mo><m:mo stretchy="false">[</m:mo><m:mn>0</m:mn><m:mo>,</m:mo><m:mn>1</m:mn><m:mo stretchy="false">]</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaakiabcYcaSiqbdAgaMzaajaWaaSbaaSqaaiabigdaXiabikdaYaqabaGccqGGSaalcuWGMbGzgaqcamaaBaaaleaacqaIYaGmcqaIXaqmaeqaaOGaeiilaWIafmOzayMbaKaadaWgaaWcbaGaeGOmaiJaeGOmaidabeaakiabgIGiolabcUfaBjabicdaWiabcYcaSiabigdaXiabc2faDbaa@4329@</m:annotation></m:semantics></m:math></inline-formula>) and add up to 1. This constraint is tighter than those described elsewhere <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> as it relies on the inherent assumption of representative sampling and HWE, an extreme chance distortion of which could lead to three solutions at SNP allele frequencies of 0.5 in sample data drawn from a population (if all samples are heterozygous at both loci the following are possible: all could be diplotype 11/22, all could be diplotype 12/21, or there could be a combination of both).</p>
         <sec>
            <st>
               <p>Number of solutions to the cubic equation with simulated data</p>
            </st>
            <p>We have calculated the number of possible solutions to the cubic equation for genotypes of simulated pairs of SNPs with a range of allele frequencies for a range of sample sizes. The genotype numbers were calculated assuming HWE with a wide range of LD situations for the two SNPs. This was achieved by simulating all combinations of haplotype frequencies between 0 and 1, at intervals of 1/55, that add up to 1. These haplotype frequencies were then converted to diplotype frequencies according to Hardy-Weinberg equilibrium. The results are plotted in Figure <figr fid="F1">1</figr>. Small samples result in minor deviations from sample HWE, allowing more than one solution. The smaller the sample size, the greater the range of allele frequencies over which this occurs. A sample of 10 subjects allows more than one biologically possible solution at a wide range of allele frequencies (Figure <figr fid="F1">1A</figr>). With 60 individuals a broad range of allele frequencies is still affected (Figure <figr fid="F1">1B</figr>) &#8211; this has implications for analyses based on the HapMap CEU dataset of 60 unrelated individuals <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. At 100 individuals (Figure <figr fid="F1">1C</figr>) the problem is limited to allele frequencies below 15% (Figure <figr fid="F1">1C</figr>), while the plot for 1000 individuals shows no condition under which there is more than one biologically possible solution (Figure <figr fid="F1">1D</figr>). This last observation is because under perfect sample HWE (infinite samples) the number of <it>biologically possible </it>solutions is always 1, despite the number of <it>real </it>solutions exceeding 1 at lower allele frequencies (data not shown).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Simulated data in which HWE is observed to the limit of rounding errors (whole number values for counts of individuals)</p>
               </caption>
               <text>
                  <p><b>Simulated data in which HWE is observed to the limit of rounding errors (whole number values for counts of individuals)</b>. (A) Number of biologically possible solutions to the cubic equation in (A) 10 individuals; (B) 60 individuals; (C) 100 individuals (D) 1000 individuals. x-axis: allele frequency of SNP1, y-axis: allele frequency of SNP2. Black = more than one solution. Grey = one solution.</p>
               </text>
               <graphic file="1471-2105-8-428-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Number of solutions to the cubic equation with real data</p>
            </st>
            <p>We have also calculated the number of solutions to equation 1 for a set of real data from the HapMap project <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. These were a selection of SNPs from the <it>ACE</it>-<it>GH1 </it>region of chromosome 17 for the CEU population (60 unrelated individuals). Figure <figr fid="F2">2A</figr> shows that at the lower allele frequencies the possibility of more than one real solution to the cubic equation begins to arise. This is consistent with the simulated data for 60 samples (Figure <figr fid="F1">1B</figr>), except that a broader range of allele frequencies is affected. This is probably due to the inherent errors of real data increasing the deviations from HWE relative to near-perfect simulated data. In most cases of multiple solutions only two of the three real roots are biologically possible. Figure <figr fid="F2">2B</figr> compares these two values, indicating that in most cases the differences in estimated haplotype are small. In the minority of cases with three solutions these fit the same pattern. However, this can have major consequences for the calculation of D' (as illustrated in Figure <figr fid="F3">3</figr>). Note that D' and r<sup>2 </sup>behave quite differently in this respect, and r<sup>2 </sup>is much less affected. However, as a |D'| of 1 indicates the existence of three or less haplotypes (r<sup>2 </sup>of 1 indicates two haplotypes), |D'| is a good indicator of haplotype block structure, with a value of exactly 1 suggesting little or no recombination between two loci, and a value less than 1 supporting a break-down of LD. In fact CubeX provides both D' and r<sup>2</sup>, allowing the user to select their measure of preference. Figure <figr fid="F4">4</figr> illustrates the relationship between these two measures in the simulated and real datasets, which clarifies how a large |D'| value can be observed with a low r<sup>2 </sup>value, but the key point is that a |D'| of 1 indicates complete LD (i.e. three or less haplotypes) despite a low r<sup>2</sup>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Evaluation of number of solutions for real data</p>
               </caption>
               <text>
                  <p><b>Evaluation of number of solutions for real data</b>. (A) Number of biologically possible solutions over a range of allele frequencies using a large sample of SNP data (Chr. 17:60 to 60.5 MB, 121 SNPs) from the HapMap project [23,24]. x-axis: allele frequency of SNP1, y-axis: allele frequency of SNP2. Black = more than one solution. Grey = one solution. (B) Comparison of two solutions within the dataset. x-axis: higher value solution, y-axis: lower value solution.</p>
               </text>
               <graphic file="1471-2105-8-428-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Screenshot of results screen from CubeX online analysis program</p>
               </caption>
               <text>
                  <p><b>Screenshot of results screen from CubeX online analysis program</b>. In this example there are two biologically possible solutions. Results for both are shown (upper table), and observed (input values) and expected diplotype frequencies (for the two solutions) displayed for comparison (lower table).</p>
               </text>
               <graphic file="1471-2105-8-428-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The range of LD in datasets using the CubeX tool to calculate r2 and D'</p>
               </caption>
               <text>
                  <p>The range of LD in datasets using the CubeX tool to calculate r2 and D'. (A) Simulated data. D' on x-axis, r<sup>2 </sup>on y axis. (B) Real SNP data (Chr. 17:60 to 60.5 MB, 121 SNPs) from the HapMap project [23,24]. D' on x-axis, r<sup>2 </sup>on y axis.</p>
               </text>
               <graphic file="1471-2105-8-428-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Comparison of the cubic exact solution with other approaches</p>
            </st>
            <p>For the purposes of comparison we have analysed two datasets with PHASE <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, MIDAS <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> (Hill EM) and CubeX. The first is a dataset of directly haplotyped samples comprising 80 subjects from 3 ethnic groups (Asian, African and Caucasian) for <it>APOE </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Although all but one SNP was in Hardy-Weinberg equilibrium, this dataset has the potential to invalidate some of the assumptions of the programs due to the mixture of ethnicity. However, this provides a useful substrate on which to test the influence of stratification on the outcome of the cubic exact solution. The second dataset is a set of multi-locus phased data from HapMap CEU samples <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> for the <it>IGF2 </it>gene region. Although these have not been directly haplotyped, the multi-locus phased haplotypes are expected to be very accurate, and this dataset comprises Caucasians, so will not suffer from the same stratification issues. We tested the programs on pair-wise subsets of these data.</p>
            <p>For the <it>APOE </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp> dataset the data are presented in Additional File <supplr sid="S1">1</supplr>, with a selected summary in Table <tblr tid="T1">1</tblr>. The subset in Table <tblr tid="T1">1</tblr> demonstrate the advantage of being provided with all possible solutions by CubeX, but also demonstrates that all three approaches can be wrong. To summarise the outcome, PHASE <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and MIDAS <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> (Hill EM) both matched the real counts in 28 of 36 SNP pairs, while CubeX matched real counts in 33 of 36 SNP pairs (for one of its solutions). However, in five of those cases the user would need to determine which of the two CubeX solutions to use based on their prior knowledge of the LD structure in the region (i.e. do they expect three or four haplotypes). This comparison confirms the risk of EM finding a local maximum when there is more than one biologically possible solution, and suggests that CubeX may offer advantages in stratified datasets or datasets with low SNP minor allele frequencies (confirming the results from simulated data above).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Comparisons of PHASE, MIDAS and CubeX on <it>APOE </it>data (from <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>). A comparison of PHASE, MIDAS and CubeX for pairwise analysis of genotype data derived from directly observed multi-locus haplotypes.</p>
               </text>
               <file name="1471-2105-8-428-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Illustrative examples of comparison of CubeX with PHASE [16] and MIDAS [7] (Hill EM).</p>
               </caption>
               <tblbdy cols="15">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="6" ca="center">
                        <p>
                           <b>Haplotype frequencies (rounded to 5 decimal places)</b>
                        </p>
                     </c>
                     <c cspan="6" ca="center">
                        <p>
                           <b>Haplotype numbers (rounded to nearest haplotype)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="15">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Example</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>SNP pair</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Haplotype</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>REAL frequency</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>PHASE frequency</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MIDAS frequency</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CUBEX alpha</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CUBEX beta</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CUBEX gamma</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>REAL number</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>PHASE number</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MIDAS number</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CUBEX alpha</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CUBEX beta</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>CUBEX gamma</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="15">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>Pair1_2</p>
                     </c>
                     <c ca="center">
                        <p>AC</p>
                     </c>
                     <c ca="center">
                        <p>0.0875</p>
                     </c>
                     <c ca="center">
                        <p>0.08689</p>
                     </c>
                     <c ca="center">
                        <p>0.0875</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.0875</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>14</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_2</p>
                     </c>
                     <c ca="center">
                        <p>AT</p>
                     </c>
                     <c ca="center">
                        <p>0.725</p>
                     </c>
                     <c ca="center">
                        <p>0.72561</p>
                     </c>
                     <c ca="center">
                        <p>0.725</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.725</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>116</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>116</p>
                     </c>
                     <c ca="center">
                        <p>116</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>116</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_2</p>
                     </c>
                     <c ca="center">
                        <p>TC</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.00061</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_2</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>0.1875</p>
                     </c>
                     <c ca="center">
                        <p>0.18689</p>
                     </c>
                     <c ca="center">
                        <p>0.1875</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.1875</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>30</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>Pair1_5</p>
                     </c>
                     <c ca="center">
                        <p>AG</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                     <c ca="center">
                        <p>0.75318</p>
                     </c>
                     <c ca="center">
                        <p>0.75478</p>
                     </c>
                     <c ca="center">
                        <p>0.75478</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.75</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>120</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>121 *</p>
                     </c>
                     <c ca="center">
                        <p>121 *</p>
                     </c>
                     <c ca="center">
                        <p>121 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>120</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_5</p>
                     </c>
                     <c ca="center">
                        <p>AA</p>
                     </c>
                     <c ca="center">
                        <p>0.0625</p>
                     </c>
                     <c ca="center">
                        <p>0.05932</p>
                     </c>
                     <c ca="center">
                        <p>0.05772</p>
                     </c>
                     <c ca="center">
                        <p>0.05772</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.0625</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>10</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>9 *</p>
                     </c>
                     <c ca="center">
                        <p>9 *</p>
                     </c>
                     <c ca="center">
                        <p>9 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_5</p>
                     </c>
                     <c ca="center">
                        <p>TG</p>
                     </c>
                     <c ca="center">
                        <p>0.1875</p>
                     </c>
                     <c ca="center">
                        <p>0.18432</p>
                     </c>
                     <c ca="center">
                        <p>0.18272</p>
                     </c>
                     <c ca="center">
                        <p>0.18272</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.1875</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>30</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>29 *</p>
                     </c>
                     <c ca="center">
                        <p>29 *</p>
                     </c>
                     <c ca="center">
                        <p>29 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_5</p>
                     </c>
                     <c ca="center">
                        <p>TA</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.00318</p>
                     </c>
                     <c ca="center">
                        <p>0.00478</p>
                     </c>
                     <c ca="center">
                        <p>0.00478</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1 *</p>
                     </c>
                     <c ca="center">
                        <p>1 *</p>
                     </c>
                     <c ca="center">
                        <p>1 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>Pair1_9</p>
                     </c>
                     <c ca="center">
                        <p>AT</p>
                     </c>
                     <c ca="center">
                        <p>0.05625</p>
                     </c>
                     <c ca="center">
                        <p>0.06477</p>
                     </c>
                     <c ca="center">
                        <p>0.05633</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.0563</p>
                     </c>
                     <c ca="center">
                        <p>0.075</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>9</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>10 *</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>12 *</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_9</p>
                     </c>
                     <c ca="center">
                        <p>AC</p>
                     </c>
                     <c ca="center">
                        <p>0.75625</p>
                     </c>
                     <c ca="center">
                        <p>0.74773</p>
                     </c>
                     <c ca="center">
                        <p>0.75617</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.7562</p>
                     </c>
                     <c ca="center">
                        <p>0.7375</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>121</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>120 *</p>
                     </c>
                     <c ca="center">
                        <p>121</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>121</p>
                     </c>
                     <c ca="center">
                        <p>118 *</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_9</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>0.01875</p>
                     </c>
                     <c ca="center">
                        <p>0.01023</p>
                     </c>
                     <c ca="center">
                        <p>0.01867</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.0187</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2 *</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0 *</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair1_9</p>
                     </c>
                     <c ca="center">
                        <p>TC</p>
                     </c>
                     <c ca="center">
                        <p>0.16875</p>
                     </c>
                     <c ca="center">
                        <p>0.17727</p>
                     </c>
                     <c ca="center">
                        <p>0.16883</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.1688</p>
                     </c>
                     <c ca="center">
                        <p>0.1875</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>27</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>28 *</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>30 *</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>Pair2_3</p>
                     </c>
                     <c ca="center">
                        <p>CG</p>
                     </c>
                     <c ca="center">
                        <p>0.05625</p>
                     </c>
                     <c ca="center">
                        <p>0.04724</p>
                     </c>
                     <c ca="center">
                        <p>0.0465</p>
                     </c>
                     <c ca="center">
                        <p>0.0465</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>9</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>8 *</p>
                     </c>
                     <c ca="center">
                        <p>7 *</p>
                     </c>
                     <c ca="center">
                        <p>7 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair2_3</p>
                     </c>
                     <c ca="center">
                        <p>CT</p>
                     </c>
                     <c ca="center">
                        <p>0.03125</p>
                     </c>
                     <c ca="center">
                        <p>0.04026</p>
                     </c>
                     <c ca="center">
                        <p>0.041</p>
                     </c>
                     <c ca="center">
                        <p>0.041</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>5</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>6 *</p>
                     </c>
                     <c ca="center">
                        <p>7 *</p>
                     </c>
                     <c ca="center">
                        <p>7 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair2_3</p>
                     </c>
                     <c ca="center">
                        <p>TG</p>
                     </c>
                     <c ca="center">
                        <p>0.48125</p>
                     </c>
                     <c ca="center">
                        <p>0.49026</p>
                     </c>
                     <c ca="center">
                        <p>0.491</p>
                     </c>
                     <c ca="center">
                        <p>0.491</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>77</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>78 *</p>
                     </c>
                     <c ca="center">
                        <p>79 *</p>
                     </c>
                     <c ca="center">
                        <p>79 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Pair2_3</p>
                     </c>
                     <c ca="center">
                        <p>TT</p>
                     </c>
                     <c ca="center">
                        <p>0.43125</p>
                     </c>
                     <c ca="center">
                        <p>0.42224</p>
                     </c>
                     <c ca="center">
                        <p>0.4215</p>
                     </c>
                     <c ca="center">
                        <p>0.4215</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>69</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>68 *</p>
                     </c>
                     <c ca="center">
                        <p>67 *</p>
                     </c>
                     <c ca="center">
                        <p>67 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>pair5_9</p>
                     </c>
                     <c ca="center">
                        <p>GT</p>
                     </c>
                     <c ca="center">
                        <p>0.075</p>
                     </c>
                     <c ca="center">
                        <p>0.07313</p>
                     </c>
                     <c ca="center">
                        <p>0.06664</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.0666</p>
                     </c>
                     <c ca="center">
                        <p>0.075</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>12</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>11 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>11 *</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>pair5_9</p>
                     </c>
                     <c ca="center">
                        <p>GC</p>
                     </c>
                     <c ca="center">
                        <p>0.8625</p>
                     </c>
                     <c ca="center">
                        <p>0.86437</p>
                     </c>
                     <c ca="center">
                        <p>0.87086</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.8709</p>
                     </c>
                     <c ca="center">
                        <p>0.8625</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>138</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>138</p>
                     </c>
                     <c ca="center">
                        <p>139 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>139 *</p>
                     </c>
                     <c ca="center">
                        <p>138</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>pair5_9</p>
                     </c>
                     <c ca="center">
                        <p>AT</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.00187</p>
                     </c>
                     <c ca="center">
                        <p>0.00836</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.0084</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>1 *</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>pair5_9</p>
                     </c>
                     <c ca="center">
                        <p>AC</p>
                     </c>
                     <c ca="center">
                        <p>0.0625</p>
                     </c>
                     <c ca="center">
                        <p>0.06063</p>
                     </c>
                     <c ca="center">
                        <p>0.05414</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>0.0541</p>
                     </c>
                     <c ca="center">
                        <p>0.0625</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>10</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>9 *</p>
                     </c>
                     <c ca="center">
                        <p>na</p>
                     </c>
                     <c ca="center">
                        <p>9 *</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>A selection of comparisons using direct haplotyped APOE data [25]. Full data are present as a additional table. For haplotype numbers (rounded to the nearest number) incorrect answers are marked '*', correct answers are unmarked. Examples: (1) Phase, MIDAS and CubeX (1 solution) give correct answer. (2) Only CubeX gives the correct answer as one of its two solutions. (3) MIDAS and CubeX give the correct answer, PHASE and the other CubeX solution are wrong. (4) All three approaches are wrong. (5) PHASE and CubeX give the correct answer, MIDAS and the other CubeX solution are wrong.</p>
               </tblfn>
            </tbl>
            <p>For the HapMap <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp><it> IGF2 </it>region data (comprising SNPs rs3802971, rs734351, rs3213221, rs4244808, rs1003483, rs3741208, rs1004446, rs4320932 and rs7924316) CubeX gives only one solution in all cases, and there is little difference between the outcome of the three approaches (Additional File <supplr sid="S2">2</supplr>). This confirms that in situations of higher allele frequencies there is less of an issue with multiple biologically possible solutions to the cubic equation, and iterative approaches are completely acceptable.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Comparisons of PHASE, MIDAS and CubeX on HapMap <it>IGF2 </it>region data (from <url>http://www.hapmap.org</url>, <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>). A comparison of PHASE, MIDAS and CubeX for pairwise analysis of genotype data derived from statistically inferred long-range multi-locus haplotypes.</p>
               </text>
               <file name="1471-2105-8-428-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>We have written an online program, "CubeX", to enable simple analysis of the biologically possible estimated haplotypes for pairs of biallelic markers. This program takes data from a pair of markers as a standard 3 &#215; 3 table of nine diplotypes, generates cubic exact solutions to equation 1 and generates output in the format shown in Figure <figr fid="F3">3</figr>. The number of possible solutions is shown, followed by haplotype frequencies and LD statistics for those solutions. Below that a duplicate of the 3 &#215; 3 input table is displayed with the addition of expected absolute diplotype frequencies calculated from the haplotype frequencies. The difference between these and the input data are subjected to a <it>&#967;</it><sup>2 </sup>test, which effectively tests sample deviation from the null hypothesis of HWE for the diplotypes formed of the four haplotypes. However, the interpretation of solutions depends on the prior hypothesis. In the example in Figure <figr fid="F3">3</figr>, although solution <it>&#947; </it>exhibits a slightly worse <it>&#967;</it><sup>2 </sup>fit than solution <it>&#946;</it>, the former is consistent with a prior hypothesis of only three of the four haplotypes existing (see Figure 5 in reference <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>), which is biologically likely in the absence of recombination between any two loci. In fact, in all tested cases in Figure <figr fid="F2">2</figr> generating more than one solution, the diplotype data included zero values in at least one corner cell and the two adjacent edge cells of the 3 &#215; 3 (i.e. where one possible solution has a |D'| = 1, although it should be noted that more than one solution can occur without zero values if double heterozygotes are greatly over-represented). This suggests that the principal issue is whether three or four haplotypes exist, and in these cases the prior hypothesis (based on distance and recombination rates) is of utmost importance. If input data for individual SNPs are significantly out of HWE a warning message is given at the top of the page. For completeness, the biologically impossible real number solutions are displayed at the bottom, along with minimum and maximum biologically possible values for <inline-formula><m:math name="1471-2105-8-428-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaaaaa@2F44@</m:annotation></m:semantics></m:math></inline-formula> and allele frequencies. This program provides a convenient utility for researchers to both analyse data for haplotype frequencies and LD statistics and to check previous analyses for potential problems caused by multiple solutions.</p>
         <p>Under perfect sample HWE the frequencies of all haplotypes can be directly inferred from the corresponding corner diplotypes of the 3 &#215; 3. For example: <inline-formula><m:math name="1471-2105-8-428-i11" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>n</m:mi><m:mrow><m:mn>11</m:mn></m:mrow></m:msub><m:mo>=</m:mo><m:mtext>n&#160;</m:mtext><m:msubsup><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOBa42aaSbaaSqaaiabigdaXiabigdaXaqabaGccqGH9aqpcqqGUbGBcqqGGaaicuWGMbGzgaqcamaaDaaaleaacqaIXaqmcqaIXaqmaeaacqaIYaGmaaaaaa@36E2@</m:annotation></m:semantics></m:math></inline-formula>, so <inline-formula><m:math name="1471-2105-8-428-i12" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub><m:mo>=</m:mo><m:msqrt><m:mrow><m:mfrac><m:mrow><m:msub><m:mi>n</m:mi><m:mrow><m:mn>11</m:mn></m:mrow></m:msub></m:mrow><m:mi>n</m:mi></m:mfrac></m:mrow></m:msqrt></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaakiabg2da9maakaaabaqcfa4aaSaaaeaacqWGUbGBdaWgaaqaaiabigdaXiabigdaXaqabaaabaGaemOBa4gaaaWcbeaaaaa@35D8@</m:annotation></m:semantics></m:math></inline-formula>. That being the case there are only two possible values for <inline-formula><m:math name="1471-2105-8-428-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaaaaa@2F44@</m:annotation></m:semantics></m:math></inline-formula>, one positive and one negative, the latter being biologically impossible. Perfect sample HWE therefore results in only a single biologically possible solution to the cubic equation. In the case of extreme sample HWD where all samples fall within the middle cell of the 3 &#215; 3, <inline-formula><m:math name="1471-2105-8-428-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>f</m:mi><m:mo>^</m:mo></m:mover><m:mrow><m:mn>11</m:mn></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGafmOzayMbaKaadaWgaaWcbaGaeGymaeJaeGymaedabeaaaaa@2F44@</m:annotation></m:semantics></m:math></inline-formula> can contribute either a half, a quarter or none of the haplotypes to the middle cell. There are therefore three biologically possible solutions under conditions of extreme sample HWD. The results from real data confirm that in some cases more than one biologically possible solution to the cubic equation for haplotype frequency can exist. The simulations suggest that this occurs where small sample size, sampling errors or non-random mating result in a distortion of sample HWE, and demonstrates the importance of testing HWE before haplotype analyses. The greater the distortion of sample HWE the higher the allele frequency at which more than one solution can occur (hence, as described above, three solutions can occur at allele frequencies of 0.5 if all samples are heterozygous at both loci). In these cases the cubic exact algorithm gives all possible solutions and a test of HWE, while an iteration-based method would only give one. This supports the hypothesis that the cubic exact approach is superior to iteration-based methods in real-world datasets where sample data rarely fit exactly to HWE (note that sample may differ from population in HWE statistics &#8211; here we refer to sample HWE). This is particularly important in the analysis of low frequency SNPs and paucimorphisms <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>, for which different solutions can significantly distort D' results, despite the relatively similar solutions giving similar r<sup>2 </sup>results. In all the observed data with two solutions there were no occasions in which r<sup>2 </sup>exceeded 0.3 for any biologically possible solution, and in most cases there is only a small difference in r<sup>2 </sup>between biologically possible solutions. The largest effect is on D'. On the basis of empirical data and using different approaches to inference Wong <it>et al </it>showed that coding SNPs with minor allele frequencies &lt;0.06 are likely to be of functional importance <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, and rarer alleles, haplotypes and diplotypes of causal importance have emerged in numerous disease contexts (eg. inflammatory bowel disease, hemochromatosis). In addition to being applicable and giving exact evaluation for D' analysis of common SNPs, the cubic exact solution may prove of particular value for evaluating "post-HapMap" and "post-dbSNP" rarer haplotypes, for fully evaluating D' estimates from datasets with greater deviations from the random mating and HWE assumptions and for fully evaluating LD in small datasets.</p>
         <p>Finally, we have demonstrated by comparison with PHASE <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and MIDAS <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> (Hill EM) that in certain situations (low minor allele frequency, population stratification) the cubic exact approach can perform better for pair-wise analyses than alternative approaches by indicating the existence of multiple solutions. However, our findings confirm that in most other situations iterative approaches are robust and accurate.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We present a comprehensive analysis of the consequences of different variables on the number of solutions to the cubic equation for haplotype frequency. Our analyses demonstrate that lower allele frequencies, lower sample numbers and a possible |D'| value of 1 can result in more than one solution. This has significant implications for the calculation of LD in small sample sizes and with rarer alleles that may have particular disease relevance. This evaluation provides essential information for an understanding of the limitations of LD estimation, which is particularly relevant for genome-wide analyses (where sample sizes and allele frequencies can be low). Finally, we present a program "CubeX", freely available as an online program, which provides each of the biologically possible cubic exact solution(s) to equation 1 for haplotype frequency, enabling the user to identify the solution that best fits their prior hypothesis for number of haplotypes.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and Requirements</p>
         </st>
         <p>Project name: CubeX</p>
         <p>Project home page: <url>http://www.oege.org/software/cubex</url></p>
         <p>Operating system(s): Platform independent (web-based)</p>
         <p>Programming language: Python <url>http://www.python.org</url></p>
         <p>Licence: CubeX licence available from <url>http://www.oege.org/software/cubex</url></p>
         <p>Any restrictions to use by non-academics: royalty-free use allowed within terms of licence</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>EM &#8211; Expectation-Maximisation</p>
         <p>HWE &#8211; Hardy-Weinberg Equilibrium</p>
         <p>LD &#8211; Linkage Disequilibrium</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>TRG wrote the CubeX program, ran the simulations and analyses and drafted the manuscript. SR advised on LD calculation and output format, tested the program and contributed to the manuscript. INMD drafted the solution to the cubic equation, advised on methods, tested the program and contributed to the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>TRG is funded by a BHF (British Heart Foundation) Intermediate Fellowship (FS/05/065/19497), SR by a HOPE (Wessex Medical Trust) fellowship and work in our laboratory by the Medical Research Council (UK) (Programme Grant G9800748). We thank an anonymous reviewer for their suggestion of a comparison with PHASE on the <it>APOE </it>dataset <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Linkage disequilibrium and the mapping of complex human traits</p>
            </title>
            <aug>
               <au>
                  <snm>Weiss</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Trends in Genetics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>19</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(01)02550-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">11750696</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Shaking the tree: mapping complex disease genes with linkage disequilibrium</p>
            </title>
            <aug>
               <au>
                  <snm>Palmer</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Cardon</snm>
                  <fnm>LR</fnm>
               </au>
            </aug>
            <source>The Lancet</source>
            <pubdate>2005</pubdate>
            <volume>366</volume>
            <fpage>1223</fpage>
            <lpage>1234</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0140-6736(05)67485-5</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Patterns of linkage disequilibrium in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Ardlie</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Seielstad</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>299</fpage>
            <lpage>309</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg777</pubid>
                  <pubid idtype="pmpid" link="fulltext">11967554</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Estimation of linkage disequilibrium in randomly mating populations</p>
            </title>
            <aug>
               <au>
                  <snm>Hill</snm>
                  <fnm>WG</fnm>
               </au>
            </aug>
            <source>Heredity</source>
            <pubdate>1974</pubdate>
            <volume>33</volume>
            <fpage>229</fpage>
            <lpage>239</lpage>
            <xrefbib>
               <pubid idtype="pmpid">4531429</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>GOLD--graphical overview of linkage disequilibrium</p>
            </title>
            <aug>
               <au>
                  <snm>Abecasis</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Cookson</snm>
                  <fnm>WO</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>182</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.2.182</pubid>
                  <pubid idtype="pmpid" link="fulltext">10842743</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>GOLDsurfer: three dimensional display of linkage disequilibrium</p>
            </title>
            <aug>
               <au>
                  <snm>Pettersson</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Jonsson</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Cardon</snm>
                  <fnm>LR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>3241</fpage>
            <lpage>3243</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth341</pubid>
                  <pubid idtype="pmpid" link="fulltext">15201180</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>MIDAS: software for analysis and visualisation of interallelic disequilibrium between multiallelic markers</p>
            </title>
            <aug>
               <au>
                  <snm>Gaunt</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Rodriguez</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zapata</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Day</snm>
                  <fnm>IN</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>227</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1479374</pubid>
                  <pubid idtype="pmpid" link="fulltext">16643648</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-227</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Haploview: analysis and visualization of LD and haplotype maps</p>
            </title>
            <aug>
               <au>
                  <snm>Barrett</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Fry</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Maller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>263</fpage>
            <lpage>265</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth457</pubid>
                  <pubid idtype="pmpid" link="fulltext">15297300</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Linkage disequilibrium and the search for complex disease genes</p>
            </title>
            <aug>
               <au>
                  <snm>Jorde</snm>
                  <fnm>LB</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>1435</fpage>
            <lpage>1444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.144500</pubid>
                  <pubid idtype="pmpid" link="fulltext">11042143</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Linkage disequilibrium for different scales and applications</p>
            </title>
            <aug>
               <au>
                  <snm>Mueller</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>355</fpage>
            <lpage>364</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/5.4.355</pubid>
                  <pubid idtype="pmpid" link="fulltext">15606972</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A survey of current software for haplotype phase inference</p>
            </title>
            <aug>
               <au>
                  <snm>Weale</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>Hum Genomics</source>
            <pubdate>2004</pubdate>
            <volume>1</volume>
            <fpage>141</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15601542</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A comprehensive literature review of haplotyping software and methods for use with unrelated individuals</p>
            </title>
            <aug>
               <au>
                  <snm>Salem</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Wessel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schork</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Human Genomics</source>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <fpage>39</fpage>
            <lpage>66</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15814067</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Haplotype inference by maximum parsimony</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>1773</fpage>
            <lpage>1780</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg239</pubid>
                  <pubid idtype="pmpid" link="fulltext">14512348</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation</p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Scheet</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2005</pubdate>
            <volume>76</volume>
            <fpage>449</fpage>
            <lpage>462</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1196397</pubid>
                  <pubid idtype="pmpid" link="fulltext">15700229</pubid>
                  <pubid idtype="doi">10.1086/428594</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A comparison of bayesian methods for haplotype reconstruction from population genotype data</p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2003</pubdate>
            <volume>73</volume>
            <fpage>1162</fpage>
            <lpage>1169</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1180495</pubid>
                  <pubid idtype="pmpid" link="fulltext">14574645</pubid>
                  <pubid idtype="doi">10.1086/379378</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A new statistical method for haplotype reconstruction from population data</p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2001</pubdate>
            <volume>68</volume>
            <fpage>978</fpage>
            <lpage>989</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1275651</pubid>
                  <pubid idtype="pmpid" link="fulltext">11254454</pubid>
                  <pubid idtype="doi">10.1086/319501</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A new approach to solving the cubic: Cardan's solution revealed</p>
            </title>
            <aug>
               <au>
                  <snm>Nickalls</snm>
                  <fnm>RWD</fnm>
               </au>
            </aug>
            <source>The Mathematical Gazette</source>
            <pubdate>1993</pubdate>
            <volume>77</volume>
            <fpage>354</fpage>
            <lpage>359</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/3619777</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Estimating Linkage Disequilibrium Between a Polymorphic Marker Locus and a Trait Locus in Natural Populations</p>
            </title>
            <aug>
               <au>
                  <snm>Luo</snm>
                  <fnm>ZW</fnm>
               </au>
               <au>
                  <snm>Suhai</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1999</pubdate>
            <volume>151</volume>
            <fpage>359</fpage>
            <lpage>371</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460467</pubid>
                  <pubid idtype="pmpid" link="fulltext">9872973</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The Python Programming Language</p>
            </title>
            <aug>
               <au>
                  <snm>Foundation</snm>
                  <fnm>PS</fnm>
               </au>
            </aug>
            <pubdate>2006</pubdate>
            <url>http://www.python.org</url>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The interaction of selection and linkage. I. General considerations; heterotic models</p>
            </title>
            <aug>
               <au>
                  <snm>Lewontin</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1964</pubdate>
            <volume>49</volume>
            <fpage>49</fpage>
            <lpage>67</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1210557</pubid>
                  <pubid idtype="pmpid" link="fulltext">17248194</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Linkage disequilibrium in finite populations</p>
            </title>
            <aug>
               <au>
                  <snm>Hill</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>Robertson</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Theor Appl Genet</source>
            <pubdate>1968</pubdate>
            <fpage>135</fpage>
            <lpage>156</lpage>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Notes on the Maximum Likelihood Estimation of Haplotype Frequencies</p>
            </title>
            <aug>
               <au>
                  <snm>Mano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yasuda</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Katoh</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tounai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Inoko</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Imanishi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tamiya</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Annals of Human Genetics</source>
            <pubdate>2004</pubdate>
            <volume>68</volume>
            <fpage>257</fpage>
            <lpage>264</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1529-8817.2003.00088.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15180706</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The International HapMap Project</p>
            </title>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>426</volume>
            <fpage>789</fpage>
            <lpage>796</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02168</pubid>
                  <pubid idtype="pmpid" link="fulltext">14685227</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>A haplotype map of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Consortium</snm>
                  <fnm>TIHM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>1299</fpage>
            <lpage>1320</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1880871</pubid>
                  <pubid idtype="pmpid" link="fulltext">16255080</pubid>
                  <pubid idtype="doi">10.1038/nature04226</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Analysis and Exploration of the Use of Rule-Based Algorithms and Consensus Methods for the Inferral of Haplotypes</p>
            </title>
            <aug>
               <au>
                  <snm>Orzack</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Gusfield</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nesbitt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Subrahmanyan</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Stanton</snm>
                  <fnm>VP</fnm>
                  <suf>Jr.</suf>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>165</volume>
            <fpage>915</fpage>
            <lpage>928</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462785</pubid>
                  <pubid idtype="pmpid" link="fulltext">14573498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Paucimorphic Alleles versus Polymorphic Alleles and Rare Mutations in Disease Causation: Theory, Observation and Detection</p>
            </title>
            <aug>
               <au>
                  <snm>Day</snm>
                  <fnm>INM</fnm>
               </au>
               <au>
                  <snm>Alharbi</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Aldahmesh</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Lotery</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Pante-de-Sousa</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Eccles</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Cross</snm>
                  <fnm>NCP</fnm>
               </au>
               <au>
                  <snm>Fox</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Rodriguez</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Current Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>431</fpage>
            <lpage>438</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2174/1389202043349156</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Mutation scanning by meltMADGE: Validations using BRCA1 and LDLR, and demonstration of the potential to identify severe, moderate, silent, rare, and paucimorphic mutations in the general population</p>
            </title>
            <aug>
               <au>
                  <snm>Alharbi</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Aldahmesh</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Spanakis</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Haddad</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Whittall</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Rassoulian</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Sillibourne</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Briggs</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>DIW</fnm>
               </au>
               <au>
                  <snm>Lawlor</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Ye</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Humphries</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Ebrahim</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Eccles</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Day</snm>
                  <fnm>INM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>967</fpage>
            <lpage>977</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1172041</pubid>
                  <pubid idtype="pmpid" link="fulltext">15998910</pubid>
                  <pubid idtype="doi">10.1101/gr.3313405</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Prevalence and functionality of paucimorphic and private MC4R mutations in a large, unselected European British population, scanned by meltMADGE</p>
            </title>
            <aug>
               <au>
                  <snm>Alharbi</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Spanakis</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Aldahmesh</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>O'Dell</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Sayer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Lawlor</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Ebrahim</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Davey Smith</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>O'Rahilly</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Farooqi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>DI</fnm>
               </au>
               <au>
                  <snm>Day</snm>
                  <fnm>IN</fnm>
               </au>
            </aug>
            <source>Hum Mutat</source>
            <pubdate>2007</pubdate>
            <volume>28</volume>
            <issue>3</issue>
            <fpage>294</fpage>
            <lpage>302</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/humu.20404</pubid>
                  <pubid idtype="pmpid" link="fulltext">17072869</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>A Population Threshold for Functional Polymorphisms</p>
            </title>
            <aug>
               <au>
                  <snm>Wong</snm>
                  <fnm>GKS</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Passey</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Kibukawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Paddock</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Bolund</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>1873</fpage>
            <lpage>1879</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403778</pubid>
                  <pubid idtype="pmpid" link="fulltext">12902381</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
