<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2003-4-4-r24</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Haplotypic analysis of the <it>TNF</it> locus by association efficiency and entropy</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Ackerman</snm>
               <fnm>Hans</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <insr iid="I6"/>
               <email>ackerman@fas.harvard.edu</email>
            </au>
            <au id="A2">
               <snm>Usen</snm>
               <fnm>Stanley</fnm>
               <insr iid="I4"/>
            </au>
            <au id="A3">
               <snm>Mott</snm>
               <fnm>Richard</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A4">
               <snm>Richardson</snm>
               <fnm>Anna</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
            </au>
            <au id="A5">
               <snm>Sisay-Joof</snm>
               <fnm>Fatoumatta</fnm>
               <insr iid="I4"/>
            </au>
            <au id="A6">
               <snm>Katundu</snm>
               <fnm>Pauline</fnm>
               <insr iid="I5"/>
            </au>
            <au id="A7">
               <snm>Taylor</snm>
               <fnm>Terrie</fnm>
               <insr iid="I5"/>
            </au>
            <au id="A8">
               <snm>Ward</snm>
               <fnm>Ryk</fnm>
               <insr iid="I2"/>
               <insr iid="I7"/>
            </au>
            <au id="A9">
               <snm>Molyneux</snm>
               <fnm>Malcolm</fnm>
               <insr iid="I5"/>
            </au>
            <au id="A10">
               <snm>Pinder</snm>
               <fnm>Margaret</fnm>
               <insr iid="I4"/>
            </au>
            <au id="A11">
               <snm>Kwiatkowski</snm>
               <mi>P</mi>
               <fnm>Dominic</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Wellcome Trust for Human Genetics, Roosevelt Drive, Oxford OX3 7BN, UK</p>
            </ins>
            <ins id="I2">
               <p>Institute of Biological Anthropology, 58 Banbury Road, Oxford, OX2 9QS, UK</p>
            </ins>
            <ins id="I3">
               <p>University Department of Paediatrics, John Radcliffe Hospital, Oxford OX3 9DU, UK</p>
            </ins>
            <ins id="I4">
               <p>MRC Laboratories, Fajara, The Gambia</p>
            </ins>
            <ins id="I5">
               <p>Welcome Trust Research Laboratories and Malaria Project, College of Medicine, University of Malawi, Blantyre, Malawi</p>
            </ins>
            <ins id="I6">
               <p>Current address: Harvard Medical School, 64 Linnaean Street, Cambridge, MA 02138, USA</p>
            </ins>
            <ins id="I7">
               <p>
                  Deceased
     </p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2003</pubdate>
         <volume>4</volume>
         <issue>4</issue>
         <fpage>R24</fpage>
         <url>http://genomebiology.com/2003/4/4/R24</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2003-4-4-r24</pubid>
               <pubid idtype="pmpid">12702205</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>12</day>
               <month>7</month>
               <year>2002</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>8</day>
               <month>12</month>
               <year>2002</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>24</day>
               <month>1</month>
               <year>2003</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>17</day>
               <month>3</month>
               <year>2003</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2003</year>
         <collab>Ackerman et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <shorttitle>
         <p>Haplotypic analysis of the <it>TNF</it> locus by association efficiency and entropy</p>
      </shorttitle>
      <shortabs>
         <p>To understand the causal basis of <it>TNF </it>associations with disease, it is necessary to understand the haplotypic structure of this locus. The <it>TNF</it> locus in  Gambian and Malawi human samples is haplotypically diverse and has a rich history of intragenic recombination. As a consequence, a large proportion of TNF single nucleotide polymorphisms (SNPs) must be typed to detect a disease-modifying SNP at this locus. The most informative subset of SNPs to genotype differs between the two populations.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>To understand the causal basis of <it>TNF </it>associations with disease, it is necessary to understand the haplotypic structure of this locus. We genotyped 12 single-nucleotide polymorphisms (SNPs) distributed over 4.3 kilobases in 296 healthy, unrelated Gambian and Malawian adults. We generated 592 high-quality haplotypes by integrating family- and population-based reconstruction methods.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We found 32 different haplotypes, of which 13 were shared between the two populations. Both populations were haplotypically diverse (gene diversity = 0.80, Gambia; 0.85, Malawi) and significantly differentiated (<it>p </it>&lt; 10<sup>-5 </sup>by exact test). More than a quarter of marker pairs showed evidence of intragenic recombination (29% Gambia; 27% Malawi). We applied two new methods of analyzing haplotypic data: association efficiency analysis (AEA), which describes the ability of each SNP to detect every other SNP in a case-control scenario; and the entropy maximization method (EMM), which selects the subset of SNPs that most effectively dissects the underlying haplotypic structure. AEA revealed that many SNPs in <it>TNF </it>are poor markers of each other. The EMM showed that 8 of 12 SNPs (Gambia) and 7 of 12 SNPs (Malawi) are required to describe 95% of the haplotypic diversity.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The <it>TNF </it>locus in the Gambian and Malawi sample is haplotypically diverse and has a rich history of intragenic recombination. As a consequence, a large proportion of <it>TNF </it>SNPs must be typed to detect a disease-modifying SNP at this locus. The most informative subset of SNPs to genotype differs between the two populations.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010013">Methods</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010011">Immunology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010012">Medicine</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The <it>TNF </it>locus (MIM *191160) has been associated with susceptibility to a wide range of infectious and inflammatory diseases, including malaria, typhoid, leishmaniasis, meningococcal sepsis, trachoma, asthma, multiple sclerosis, and inflammatory bowel disease <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Thus far, these associations have not been mapped in any detail, and as <it>TNF </it>lies in the central part of the major histocompatibility complex (MHC), there are many candidate genes that could potentially be responsible. A strong candidate is <it>TNF </it>itself, as it encodes the potent pro-inflammatory cytokine TNF, and there is a significant body of clinical and experimental data suggesting a causal role for it in the pathogenesis of many of the diseases with which it has been associated. Furthermore, most of the reported associations are with polymorphisms located in the <it>TNF </it>promoter region, and cellular studies of gene regulation <it>in vitro </it>suggest that the molecular basis of the disease association maybe, at least in some cases, a direct effect of the polymorphism on levels of gene expression <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>To pursue the causal origin of these <it>TNF </it>disease associations we must begin with a detailed understanding of the allelic associations between different <it>TNF </it>SNPs. This is particularly important considering that within a few hundred base pairs, there are several potentially functional polymorphisms which show apparently independent disease associations with severe malaria <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Alternatively, these <it>TNF </it>SNPs may be serving as neutral markers of functional polymorphisms elsewhere in the central MHC. In order to understand, first, how the <it>TNF </it>SNPs relate to each other, and second, which SNPs are the best markers of the <it>TNF </it>locus in general, we applied two new analytical techniques to our haplotypic data. The first, association efficiency analysis (AEA), precisely defines the ability of one SNP to detect association at every other SNP in a case-control scenario. The second technique, entropy maximization method (EMM), selects those SNPs that most effectively dissect the underlying haplotypic structure of a locus. The results of these analyses allow us to prioritize SNPs for genotyping in future disease-association studies.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Haplotyping the <it>TNF </it>locus</p>
            </st>
            <p>Twelve SNPs spanning 4.3 kb (Figure <figr fid="F1">1</figr>) were genotyped in 212 Gambian and 84 Malawian adults with no missing data. Allele frequencies for each SNP are listed in Table <tblr tid="T1">1</tblr>. The Gambian genotypic data had 354/2,544 (14%) sites where gametic phase was unknown, and the Malawian data had 188/1,008 (19%) sites where gametic phase was unknown. Where available, genotypes from offspring of the adults were used to phase these data (using the program PHAMILY), which reduced the number to 127/2,544 (5%) phase-unknown sites in the Gambian dataset and 75/1,008 (7%) phase-unknown sites in the Malawian dataset. The data were pooled and the program PHASE was then used to assign the remaining phase-unknown sites. After inferring haplotypes, only 6/2,544 (0.2%) phase assignments were less than 90% certain in the Gambian dataset, and only 2/1,008 (0.2%) phase assignments were less than 90% certain in the Malawian dataset. All other assignments (121/2,544 and 73/1,008) were greater than 90% certain. These 424 Gambian haplotypes and 168 Malawian haplotypes (Table <tblr tid="T2">2</tblr>) were the basis of subsequent analyses reported.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Diagram of the <it>TNF </it>locus drawn to scale with SNPs indicated</p>
               </caption>
               <text>
                  <p>Diagram of the <it>TNF </it>locus drawn to scale with SNPs indicated. Filled boxes represent exons and the open boxes represent the 3' untranslated region (3' UTR). Positions are given in number of base pairs relative to the transcriptional start of <it>TNF</it>. The SNP at position -3025 is also referred to as an LT&#945; <it>Nco</it>I restriction fragment length polymorphism in some references.</p>
               </text>
               <graphic file="gb-2003-4-4-r24-1"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Allele frequencies of 12 SNPs at the <it>TNF </it>populations locus in two</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Polymorphism</p>
                     </c>
                     <c ca="center">
                        <p>Gambia (<it>n </it>= 424)</p>
                     </c>
                     <c ca="center">
                        <p>Malawi (<it>n </it>= 168)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-3025 A/G</p>
                     </c>
                     <c ca="center">
                        <p>0.382</p>
                     </c>
                     <c ca="center">
                        <p>0.446</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-1073 T/C</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="center">
                        <p>0.030</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-1031 T/C</p>
                     </c>
                     <c ca="center">
                        <p>0.108</p>
                     </c>
                     <c ca="center">
                        <p>0.250</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-863 C/A</p>
                     </c>
                     <c ca="center">
                        <p>0.057</p>
                     </c>
                     <c ca="center">
                        <p>0.161</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-857 C/T</p>
                     </c>
                     <c ca="center">
                        <p>0.054</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-376 G/A</p>
                     </c>
                     <c ca="center">
                        <p>0.014</p>
                     </c>
                     <c ca="center">
                        <p>0.048</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-308 G/A</p>
                     </c>
                     <c ca="center">
                        <p>0.193</p>
                     </c>
                     <c ca="center">
                        <p>0.107</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-244 G/A</p>
                     </c>
                     <c ca="center">
                        <p>0.021</p>
                     </c>
                     <c ca="center">
                        <p>0.060</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF-238 G/A</p>
                     </c>
                     <c ca="center">
                        <p>0.054</p>
                     </c>
                     <c ca="center">
                        <p>0.071</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF+467 G/A</p>
                     </c>
                     <c ca="center">
                        <p>0.028</p>
                     </c>
                     <c ca="center">
                        <p>0.006</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF+851 A/G</p>
                     </c>
                     <c ca="center">
                        <p>0.104</p>
                     </c>
                     <c ca="center">
                        <p>0.149</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TNF+1304 A/G</p>
                     </c>
                     <c ca="center">
                        <p>0.085</p>
                     </c>
                     <c ca="center">
                        <p>0.131</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>TNF haplotype frequencies in two populations</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Gambia</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Malawi</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Identifier</p>
                     </c>
                     <c ca="left">
                        <p>Haplotype</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>n</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Frequency</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>n</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Frequency</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i8.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>157</p>
                     </c>
                     <c ca="center">
                        <p>0.370</p>
                     </c>
                     <c ca="right">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>0.250</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i9.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>0.042</p>
                     </c>
                     <c ca="right">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>0.042</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i10.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.005</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i11.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.018</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i12.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>0.026</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.006</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i13.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.005</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i14.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.009</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i15.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>0.052</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i16.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>0.054</p>
                     </c>
                     <c ca="right">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>0.161</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i17.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>0.031</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.012</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i18.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.005</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.006</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>P</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i19.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i20.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i21.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.012</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i22.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i23.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>0.014</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.006</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i24.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.024</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i25.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.006</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i26.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.012</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i27.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>70</p>
                     </c>
                     <c ca="center">
                        <p>0.165</p>
                     </c>
                     <c ca="right">
                        <p>39</p>
                     </c>
                     <c ca="center">
                        <p>0.232</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i28.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.024</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i29.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.009</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.012</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i30.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.006</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i31.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>0.021</p>
                     </c>
                     <c ca="right">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.024</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i32.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i33.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>26</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i34.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.012</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i35.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.030</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>28</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i36.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>71</p>
                     </c>
                     <c ca="center">
                        <p>0.167</p>
                     </c>
                     <c ca="right">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>0.101</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i37.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i38.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.002</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i39.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.005</p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32</p>
                     </c>
                     <c ca="left">
                        <p>
                           <graphic file="gb-2003-4-4-r24-i40.gif"/>
                        </p>
                     </c>
                     <c ca="right">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.006</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>424</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>168</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>SNPs are in genomic order from left to right as listed in Table <tblr tid="T1">1</tblr>. The haplotype labeled "P" was found in two chimpanzees, a gorilla, and two orang utans.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Haplotype distributions in two populations</p>
            </st>
            <p>In the Gambian sample, we observed 24 different haplotypes, and the distribution was dominated by one major haplotype (37.0%) and two others (16.7%, 16.5%) (Table <tblr tid="T2">2</tblr>, Figure <figr fid="F2">2</figr>). Thirteen haplotypes of low frequency made up another 28%, and 2% of alleles were singletons (haplotypes observed only once in the sample). The gene diversity, or probability that two haplotypes chosen at random from the sample are different, was 0.80 (standard deviation (SD) 0.01). There is one nonrecombinant haplotype that is found only in The Gambia (8).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Minimum mutation networks of the <it>TNF </it>haplotypes</p>
               </caption>
               <text>
                  <p>Minimum mutation networks of the <it>TNF </it>haplotypes. <b>(a) </b>The Gambian and <b>(b) </b>the Malawian population. Each circle represents a haplotype, and the size of the circle is proportional to the haplotype frequency. Each connecting line corresponds to a mutational event, and the position of the resulting SNP is indicated in base pairs from the transcriptional start of the gene. Broken lines indicate recombination. The numbers correspond to the haplotype indentifiers listed in Table <tblr tid="T2">2</tblr>. Filled circles represent haplotypes that were not found in that population. The haplotype labelled 'P' was found in two chimpanzees, a gorilla, and two orang utans. Haplotypes of frequency less than 1% are not shown.</p>
               </text>
               <graphic file="gb-2003-4-4-r24-2"/>
            </fig>
            <p>The Malawian sample included 21 different haplotypes, 13 of which were shared with the Gambian population (Table <tblr tid="T2">2</tblr>, Figure <figr fid="F2">2</figr>). The predominant haplotype in Malawi was not as common, at 25.0%, and there were three other haplotypes with frequencies greater than 10% (23.2%, 16.1% and 10.1%). Haplotype 9, bearing the -1031C and -863A alleles, is much more common in Malawi than in The Gambia, at 16.1% versus 5.4%. Four percent of the Malawian alleles were present as singletons. The gene diversity in Malawi was slightly greater at 0.85 (SD 0.01).</p>
            <p>Wright's <it>F </it>statistic reveals differences between populations by calculating the reduction in heterozygosity that is expected after random mating between the two populations. For the Gambian and Malawian populations the <it>F</it><sub><it>ST </it></sub>= 0.021. Permutations of the data under a null hypothesis of no differences between the populations gives a probability of 0.0001. An exact test of population differentiation finds the observed haplotype distributions to be a poor fit of a model of random mating between the Gambian and Malawian samples, with a probability less than 10<sup>-5</sup>.</p>
         </sec>
         <sec>
            <st>
               <p>Recombination</p>
            </st>
            <p>Results of the four-gamete test on pairs of SNPs in each population reveal that in The Gambia, 19/66 (29%) pairs of loci have all four possible haplotypes present; while in Malawi, 15/55 (27%) pairs of SNPs have all four haplotypes present (Figure <figr fid="F3">3</figr>). Some high-frequency (and presumably older) SNPs show evidence of recombination in both populations (TNF-3025, TNF-1031). In The Gambia, the TNF-308 SNP shows evidence of recombination with six other SNPs. In Malawi, the TNF-244 shows evidence of recombination with many other SNPs. Interestingly, the TNF-863 SNP, of relatively high frequency in Malawi, shows no evidence of recombination with any other SNPs in the Malawian sample. Recombinant haplotypes of frequency greater than 1% are represented in Figure <figr fid="F2">2</figr> with dotted lines. Although the branching structure of the <it>TNF </it>genealogy is similar in both populations, there are unique recombination events that have differentiated the two samples (haplotypes 17, 20 and 26). In Malawi, there appear to be more recombinant haplotypes of greater than 1% frequency.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Recombination in the <it>TNF </it>gene region</p>
               </caption>
               <text>
                  <p>Recombination in the <it>TNF </it>gene region. <b>(a) </b>Four-gamete test, The Gambia; <b>(b) </b>four-gamete test, Malawi. Red squares indicate pairs of SNPs in which all four gametes were observed, which is evidence for recombination or recurrent mutation.</p>
               </text>
               <graphic file="gb-2003-4-4-r24-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Linkage disequilibrium</p>
            </st>
            <p>Consistent with the evidence for recombination, linkage disequilibrium is variable across the <it>TNF </it>locus (Figure <figr fid="F4">4c,4d</figr>) and generally low. Values of pairwise linkage disequilibrium, <it>r</it><sup>2</sup>, are different between the two populations, though well correlated (<it>r </it>= 0.83). In The Gambia, 80% of SNP pairs have the minor alleles in the repulsion phase, while in Malawi, 65% of SNP pairs are in the repulsion phase. Most of the SNP pairs in the repulsion phase have very low linkage disequilibrium (<it>r</it><sup>2 </sup>= 0). There are six pairs of SNPs that are in the coupling phase in the Malawian sample, but are in the repulsion phase in The Gambian sample, reversing the allelic associations. Most of these occasions involve the TNF-244A allele, which is found on only one haplotype in the Gambia (in phase with the TNF-3025G allele), but on five different haplotypes in Malawi.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Association efficiency and linkage disequilibrium parameters for 12 SNPs at the <it>TNF </it>locus</p>
               </caption>
               <text>
                  <p>Association efficiency and linkage disequilibrium parameters for 12 SNPs at the <it>TNF </it>locus. <b>(a) </b>AEA, Gambia; <b>(b) </b>AEA, Malawi; <b>(c) </b>linkage disequilibrium, Gambia; <b>(d) </b>linkage disequilibrium, Malawi. The apparent relative risk at the marker allele indicated in the leftmost column is given when the hypothetical disease allele indicated in the bottom row is assigned a relative risk of 10. In (a,b) color indicates the magnitude of the apparent relative risk: blue indicates a relative risk of less than twofold, yellow between two- and fourfold, and red between fourfold and the maximum of 10-fold. In (c,d) red indicates <it>p </it>&lt; 0.05 by the chi-squared test, uncorrected for multiple tests.</p>
               </text>
               <graphic file="gb-2003-4-4-r24-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Association efficiency analysis</p>
            </st>
            <p>Using Equation (2) (see Materials and methods) we calculate the apparent relative risk at one marker given a relative risk of 10 at a second marker for all pairs of markers. As an example, we assign a relative risk of 10 to the TNF-376A allele (this is the relative risk of the A allele compared to the G allele) and calculate the relative risks at all other SNPs in the TNF gene using the haplotype frequencies from The Gambia (Figure <figr fid="F5">5</figr>). Log<sub>10 </sub>of the relative risk is plotted on the <it>y</it>-axis, so values above the <it>x</it>-axis correspond to disease risk while values below the <it>x</it>-axis correspond to disease protection. We find that the relative risk varies greatly across the TNF gene region. The three SNPs that are in maximum positive linkage disequilibrium (<it>D</it>' = 1) with TNF-376A show relative risks of 2.2, 3.3, and 2.2 when TNF-376A gives a relative risk of 10. The remaining eight SNPs in the TNF gene would have a relative risk between 0.8 and 0.9 when the relative risk at TNF-376A is 10 (Figure <figr fid="F4">4</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>AEA of the TNF-376 SNP</p>
               </caption>
               <text>
                  <p>AEA of the TNF-376 SNP. The TNF-376 SNP is assigned a hypothetical relative risk of 10, and the apparent relative risk at all other <it>TNF </it>SNPs is plotted on a log<sub>10 </sub>scale (log<sub>10</sub>R). The positions of the other SNPs are indicated on the <it>x</it>-axis as base pairs from the start of transcription. Eight of the SNPs in the <it>TNF </it>gene would give a relative risk between 0.8 and 0.9 when the TNF-376 SNP gives a hypothetical relative risk of 10.</p>
               </text>
               <graphic file="gb-2003-4-4-r24-5"/>
            </fig>
            <p>Data for all pairs of markers are given in Figure <figr fid="F4">4a,4b</figr>. Hypothetical disease alleles are given along the <it>x</it>-axis, and marker alleles are given on the <it>y</it>-axis. Each cell indicates the apparent relative risk (<it>R</it>) of a marker on the <it>y</it>-axis when the hypothetical disease allele indicated on the <it>x</it>-axis has a relative risk of 10. Using an arbitrary cutoff of 0.5 &lt;<it>R </it>&lt; 2.0 (indicated by blue color), some markers perform poorly: for example in The Gambia TNF-308 can only detect TNF-3025 (<it>R </it>= 2.9). Interestingly, if the TNF-308 SNP were a disease allele, most of the other SNPs in TNF would show a moderate protective effect. TNF-1031 is good at detecting associations at seven other SNPs. In Malawi, the pattern of association efficiency is different, and generally stronger. Most hypothetical disease SNPs in Malawi would be detectable by other SNPs in TNF, with the exception of TNF+476 and TNF-1073 which have very low frequency and little prospect of being detected by other markers.</p>
         </sec>
         <sec>
            <st>
               <p>Association efficiency analysis compared to linkage disequilibrium</p>
            </st>
            <p>To illustrate the relationship between the association efficiency parameter and measures of linkage disequilibrium, we plot the log<sub>10 </sub>of relative risk for all pairs of SNPs (when one SNP is assigned a relative risk of 10) against <it>r</it><sup>2 </sup>(Figure <figr fid="F6">6a,6b</figr>). Above the <it>x</it>-axis, <it>r</it><sup>2 </sup>appears to be correlated with the association efficiency parameter (correlation coefficients of <it>r </it>= 0.77 in The Gambia, and <it>r </it>= 0.91 in Malawi). Below the <it>x</it>-axis, where the marker SNP gives an apparently protective effect, the correlation is not as strong (<it>r </it>= -0.61 in The Gambia, <it>r </it>= -0.60 in Malawi). In this situation where minor alleles are in the repulsion phase, the values for <it>r</it><sup>2 </sup>remain low even when association efficiency parameter shows a strong protective effect.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Plots of the AEA parameter versus linkage disequilibrium (LD) for all pairs of SNPs</p>
               </caption>
               <text>
                  <p>Plots of the AEA parameter versus linkage disequilibrium (LD) for all pairs of SNPs. The apparent relative risk for all marker SNPs (when paired with a hypothetical disease SNP of relative risk = 10) is plotted on a log<sub>10 </sub>scale against measures of LD in <b>(a,c,e) </b>the Gambian and <b>(b,d,f) </b>the Malawian populations. The AEA parameter is plotted against (a,b) <it>r</it><sup>2</sup>, (c,d) <it>D</it>, and (e,f) <it>D</it>'. The correlation coefficient (<it>r</it>) between the AEA parameter and the measure of LD is indicated on each graph.</p>
               </text>
               <graphic file="gb-2003-4-4-r24-6"/>
            </fig>
            <p>The association efficiency parameter is correlated with <it>D</it>, the un-normalized linkage disequilibrium parameter (Figure <figr fid="F6">6c,6d</figr>). When the absolute value of <it>D </it>is great, the association efficiency tends to be high; however, many SNP pairs also have high association efficiency even when the absolute value of <it>D </it>is low.</p>
            <p>When we plot the association efficiency parameter against the normalized disequilibrium parameter, <it>D</it>', we see a strong positive correlation: <it>r </it>= 0.89 in the Gambia and <it>r </it>= 0.91 in Malawi (Figure <figr fid="F6">6e,6f</figr>). Linkage disequilibrium is necessary to detect association; however, it is not sufficient. Despite <it>D</it>' = 1.0, some pairs of SNPs are poor markers of each other, showing relative risks between 0.5 and 2.0 when the linked hypothetical disease allele gives a relative risk of 10.</p>
         </sec>
         <sec>
            <st>
               <p>Entropy maximization method</p>
            </st>
            <p>Entropy (<it>E</it>), a measure of haplotypic diversity, was calculated for the full 12-SNP haplotypes in each population (Equation (4) in Materials and methods). In the Gambian sample, <it>E </it>= 2.95; in the Malawian sample, <it>E </it>= 3.23. To determine if a subset of these 12 SNPs could account for most of the haplotypic diversity in these samples, we calculated entropy for all possible subsets of SNPs, increasing the number of SNPs in a subset from 1 to 12. In Figure <figr fid="F7">7</figr> we plot the subsets of markers with maximum entropy. We begin with a single SNP, and with each additional SNP we include, the entropy increases. The entropy plateaus when the additional markers contribute little additional information. To account for 95% of the haplotypic diversity of <it>TNF</it>, one must genotype 8 of 12 SNPs in The Gambian sample (in order of decreasing informativeness): -3025, -308, 851, -1031, -857, 467, -244, -238; and type 7 SNPs in the Malawian sample: -3025, -1031, 851, -308, 1304, -244, -1073.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Results of the entropy maximization method</p>
               </caption>
               <text>
                  <p>Results of the entropy maximization method. Plots of the percentage of haplotypic diversity accounted for by a subset of SNPs as the size of the subset goes from 1 SNP to 12 SNPs. The individual SNPs are labeled on the plot. The percentage is calculated as the entropy of the haplotype frequencies as defined by the best <it>n</it>-SNP subset divided by the entropy of the haplotype frequencies as defined by all 12 SNPs, as <it>n </it>goes from 1 to 12. Note that the order in which SNPs are included in an optimum subset differs between the two populations.</p>
               </text>
               <graphic file="gb-2003-4-4-r24-7"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The <it>TNF </it>locus has been associated with susceptibility to a wide range of infectious and inflammatory diseases. In African populations, three <it>TNF </it>promoter SNPs have been independently associated with severe malaria <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. To better understand these complex genetic associations with malaria, we have defined the haplotypic structure of the <it>TNF </it>gene region using 12 SNPs genotyped in 212 Gambian and 84 Malawian adults. Gametic phase was determined by genotyping one offspring from each adult (when available), reducing the number of phase-unknown sites by 65%. The gametic phase of the remaining sites was determined by statistical inference. This integration of family- and population-based methods of haplotype reconstruction resulted in phase assignments of high certainty. This description of the pattern of DNA sequence variation provides insight into the genetic differentiation of these two African populations, the evolution of the TNF gene itself, and the suitability of these SNPs to serve as markers of disease association.</p>
         <p>There are a number of interesting features that distinguish these two populations at the haplotypic level. Eleven haplotypes are unique to the Gambian sample, and eight are unique to the Malawian sample. The haplotype bearing the TNF-857T allele was not found in Malawi (this site is not polymorphic in the Malawian sample), whereas it has a frequency of 5.2% in The Gambia. When population differentiation is assessed by haplotype instead of by individual SNP frequencies, we observe a greater value for Wright's fixation index, <it>F</it><sub><it>ST </it></sub>= 0.021, compared to <it>F</it><sub><it>ST </it></sub>= 0.007 for individual TNF SNPs <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Intragenic recombination seems to have had an important role in diversifying the locus and in distinguishing the two populations, as evidenced by a number of population-specific recombinant haplotypes (7, 17, 20 and 26). In spite of the significant differences in haplotype distributions between the two African populations, it appears that most of the haplotypes have evolved over a period of time during which there was gene flow between the two sampled populations. That is, the basic branching structure of the gene genealogy is the same in both populations, although frequencies are different and there appear to be population specific recombination events (Figure <figr fid="F2">2</figr>).</p>
         <p>Our results have some similarities with a report of haplotype structure across 13 megabases (Mb) in a sample of Yorubans and African-Americans <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. In a haplotype block defined by 12 SNPs, four haplotypes greater than 5% frequency were found; we find five in the Gambia, and four in Malawi with frequency greater than 5%. However, we find that a minimum of 8 and 10 haplotypes make up 90% of the sample in The Gambia and Malawi, respectively, while they report an average minimum of 5 haplotypes. Furthermore, they report values for <it>D</it>' that are constant across a haplotype block, while we find variability in linkage disequilibrium, even over short distances. Our data are more consistent with previous reports of haplotypic structure where SNPs were ascertained by resequencing in a large sample of chromosomes <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>.</p>
         <p>The increased frequency of haplotype 1 in the Gambian sample has a profound impact on the allelic associations between SNPs as evaluated by chi-square test or as defined by the pairwise association efficiency parameter. The 'star-shaped' genealogy of the Gambian <it>TNF </it>locus, dominated by one central haplotype with many derived haplotypes defined by single mutational events, produces negative disequilibria between most pairs of SNPs (the rare alleles are found in the repulsion phase). The chi-square test has low power to detect negative disequilibria <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>; this may explain why in the case where <it>p </it>> 0.05, 41/42 SNP pairs have negative disequilibria. The ability of one marker to detect another in a case-control scenario is also greatly reduced when alleles are in the repulsion phase. For example, in the Gambian sample, allele pairs in the repulsion phase are much more likely to be inefficient markers (where inefficient means the apparent relative risk at the marker SNP is betwen 0.5 and 2.0 when the relative risk of the hypothetical disease SNP is 10) than alleles in the coupling phase (<it>p </it>&lt; 10<sup>-7</sup>).</p>
         <p>Although linkage disequilibrium between a disease susceptibility locus and a marker locus is required to detect an association with a complex disease, for a fixed sample size, it may not be sufficient. Even in the extreme case when linkage disequilibrium is at its theoretical maximum (<it>D</it>' = 100% and no recombination has occurred between the two SNPs), the relative risk at the marker locus will be some fraction of the relative risk at the disease susceptibility locus, unless the two alleles have the same frequency <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Using the relationship defined in Equation 2 (see Materials and methods), we can examine the haplotypic structure of a candidate locus and assess the ability of one marker to detect a disease association when a second marker contributes a given disease risk. We calculate two values of apparent relative risk for each pair of SNPs (assigning each SNP in turn a hypothetical risk of 10) and plot them against <it>D</it>' (Figure <figr fid="F6">6</figr>).</p>
         <p>Pairs of SNPs where <it>D</it>' values are reversed between the populations also show a reversal of the relative risk (that is, relative risks greater than 1 become less than 1). For example, if the risk at SNP TNF+851 was 10, then the neighboring SNP, TNF+467, would give a relative risk of 3.7 in Malawi, but in the Gambian population, the effect would be protective with a relative risk of 0.8. This suggests that even with a marker in close physical proximity to a disease-susceptibility SNP, results may differ greatly and even give apparently contradictory results between two populations. This illustrates the importance of identifying population-specific recombinant chromosomes in the study populations. We identify six pairs of SNPs that would reverse relative risks (that is, reverse the sign of <it>D</it>) between the Gambian and Malawian populations.</p>
         <p>Although the association efficiency parameter is correlated with <it>D</it>' (<it>r </it>= 0.99 for combined data), it is interesting to consider the situation where <it>D</it>' = 1.00 between the marker and hypothetical disease SNPs. In this situation, the apparent relative risk at a marker SNP may be considerably lower than at the disease SNP (Figure <figr fid="F6">6</figr>). This is especially evident in the Malawian population. These observations suggest that linkage-disequilibrium measures, as summarized by <it>D</it>', may not accurately reflect the ability of a marker to detect a disease association.</p>
         <p>It is troubling to observe that even within a relatively small gene like <it>TNF </it>(4.3 kb in this study) the SNP markers are poorly correlated: if a true disease-susceptibility allele existed in the TNF gene, many of the other SNPs would be inefficient markers of that disease allele. Our calculations reveal that even if a hypothetical disease allele gave a 10-fold increased risk, out of the 132 possible pairs of SNPs, in 92 of them (70%), the marker would show a relative risk of less than twofold. Only a small minority of SNP pairs are efficient enough to detect a strong disease-modifying SNP in the <it>TNF </it>gene region.</p>
         <p>Although it appears that individual SNPs in <it>TNF </it>are poor markers of each other, it is important to ask which SNPs serve as good markers of the <it>TNF </it>haplotypes. Depending on the haplotypic structure of a gene region, one may be able to choose a small subset of SNPs that capture most of the haplotypic diversity, thus reducing the cost of genotyping significantly with only modest reductions in information <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Two methods have been reported that select the best subset of SNPs <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B23">23</abbr></abbrgrp>. In this paper we introduce a third, the entropy maximization method (EMM), which chooses the subset of SNPs that explains the largest proportion of the underlying haplotypic diversity as measured by entropy.</p>
         <p>The single SNP with maximum entropy is the SNP with frequency closest to 0.50, in this case TNF-3025. In both populations, this SNP represents about one-third of the haplotypic diversity. With the addition of a second SNP to form a two-SNP haplotype, one-half of the haplotypic diversity is accounted for. In The Gambia, it is most informative to include the TNF-308 SNP as a second marker, whereas in Malawi, the TNF-1031 SNP contributes the most as a second marker. It is not surprising that the first few SNPs selected by the EMM are those that have frequencies closest to 0.50; however, the addition of a common SNP that is well correlated with a SNP already in the subset will contribute little additional information.</p>
         <p>For example, in the Gambian sample, the four most common SNPs are selected first by the EMM (TNF-3025 (38%), TNF-308 (19%), TNF+851 (10%), TNF-1031 (11%)). The next most common SNPs are the TNF+1304 (9%) and the TNF-863 (6%); however, the EMM does not choose these SNPs next because the TNF+1304 tends to occur in phase with the TNF+851, and the TNF-863 tends to occur in phase with the TNF-1031, both of which are already in the subset of four SNPs. So the next two choices that maximize entropy are the TNF-857 and TNF+467 SNPs, which, although less common, represent haplotypes previously unaccounted for (5 and 8). These six-SNP haplotypes now represent about 90% of the haplotypic diversity. Two more SNPs must be added (for a total of eight) before 95% of the 12-SNP haplotypic diversity is accounted for.</p>
         <p>In Malawi, the general features of marker selection are the same, though the optimal SNPs are different. Three of the four most common SNPs are selected first by the EMM (TNF-3025 (45%), TNF-1031 (25%), TNF+851 (15%)), followed by the TNF-308 (11%) and TNF+1304 (13%) SNPs. Interestingly, population-specific recombinant haplotypes make the TNF+1304 SNP more informative in Malawi; it is required to resolve haplotypes 2 and 4 and haplotypes 21 and 22, which together make up almost 10% of all the haplotypes. In Malawi seven SNPs must be included to represent 95% of the 12-SNP haplotypic diversity.</p>
         <p>EMM is especially useful when combined with AEA. The latter can be used to determine how well the recommended haplotype-tagging SNPs detect association at a particular candidate SNP. The results of AEA can then be used to adjust the power of an association study accordingly.</p>
         <p>Analysis of the <it>TNF </it>locus in two African populations reveals a haplotypically diverse locus where markers are only weakly associated with each other. Although initial detection of a disease association in <it>TNF </it>may be difficult, weak allelic associations within the <it>TNF </it>gene region will make it possible to distinguish a disease-modifying SNP from its neighbors. Future studies may determine that this is a general feature of African genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Subjects</p>
            </st>
            <p>Healthy unrelated adults were recruited in Banjul, The Gambia and in Blantyre, Malawi. All were parents of children who presented to hospital with severe malaria. All subjects gave informed consent and the study was approved by the Gambia Government/Medical Research Council Joint Ethical Committee and the College of Medicine Research Committee of the University of Malawi.</p>
            <p>After excluding erroneous pedigrees, the 187 parent-child pairs plus 45 other adults comprised the Gambian sample (a total of 212 adults), and 70 parent-child pairs plus 14 other adults (a total of 84 adults) comprised the Malawian sample. We also studied DNA from two chimpanzees, one gorilla and two orang utans.</p>
         </sec>
         <sec>
            <st>
               <p>SNP identification and genotyping</p>
            </st>
            <p>In a previous study, we described the nucleotide diversity of the <it>TNF </it>locus in 36 healthy, unrelated Gambian adults <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. There we identified 11 SNPs by forward and reverse sequencing of the entire TNF gene from -1,389 bp relative to the start of transcription to +3,004 bp on 72 chromosomes. Here we have genotyped the 11 SNPs identified in <it>TNF</it>, plus one SNP in the neighboring gene, <it>LT</it>&#945;, in a sample of 212 Gambian and 84 Malawian adults. The amplification refractory mutation system PCR (ARMS-PCR) was used to genotype the polymorphisms identified by sequencing. Methods are as previously described <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The accuracy of the ARMS-PCR method was tested on sequenced individuals before extending it to DNA samples of unknown genotype.</p>
         </sec>
         <sec>
            <st>
               <p>Haplotype construction</p>
            </st>
            <p>For most of these adults, genotypes were available from their children, raising the question of how to construct haplotypes most efficiently from a dataset where pedigree information is available for some but not all individuals. We developed a program PHAMILY, which uses information from two-generation pedigrees to construct parental haplotypes at all unambiguous sites. These partial haplotypes of unambiguous sites serve as input for PHASE <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, which uses the Stephens-Donnelly method of haplotype construction to assign the remaining phase-unknown sites among the unrelated parents. The two study populations were pooled before haplotype construction; after haplotype construction, individuals were assigned to their original populations on the basis of their sample identifiers.</p>
         </sec>
         <sec>
            <st>
               <p>Population genetic analyses</p>
            </st>
            <sec>
               <st>
                  <p>Estimating gene diversity</p>
               </st>
               <p>An estimate of gene diversity (<it>H</it>), the probability that two randomly chosen haplotypes are different in a sample, was calculated using Equation 8.5 from Nei <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>,</p>
               <p>
                  <graphic file="gb-2003-4-4-r24-i1.gif"/>
               </p>
               <p>where <it>n </it>is the number of gene copies in the sample, <it>k </it>is the number of different haplotypes, and <it>p</it><sub><it>i </it></sub>is the frequency of the <it>i</it>th haplotype. The sampling variance of this estimate was calculated according to Nei and Roychoudhury <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The computer software Arlequin was used to perform these calculations <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Construction of haplotype networks</p>
               </st>
               <p>Network2.oc was used to construct median-joining networks of the haplotypic data <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Because of the occurrence of many low-frequency recombinant haplotypes in the dataset, only haplotypes observed twice or more were represented in the network. Output from Network2.oc was critically evaluated and some modifications made to best fit a model of minimum mutation.</p>
            </sec>
            <sec>
               <st>
                  <p>Four-gamete test</p>
               </st>
               <p>All pairs of biallelic SNPs were cross-tabulated to identify the number of unique two-locus haplotypes present. An observation of four different haplotypes from a pair of biallelic loci was considered evidence for recombination or recurrent mutation.</p>
            </sec>
            <sec>
               <st>
                  <p>Population differentiation and pairwise <it>F<sub>ST</sub></it></p>
               </st>
               <p>Pairwise <it>F</it><sub><it>ST </it></sub>was calculated as the genetic variance among populations divided by the genetic variance of the total population using haplotypic data <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. An exact test of population differentiation was applied to the haplotypic data <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The computer software Arlequin was used to perform these calculations <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Linkage disequilibrium calculations</p>
               </st>
               <p>Pairwise linkage disequilibria were calculated using the <it>r</it><sup>2 </sup>statistic. Terms for allele and two-locus haplotype frequencies are given in Table <tblr tid="T3">3</tblr>. <it>r</it><sup>2 </sup>was calculated using</p>
               <tbl id="T3">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>Terms for linkage disequilibrium and association efficiency claculations</p>
                  </caption>
                  <tblbdy cols="1">
                     <r>
                        <c>
                           <p>
                              <graphic file="gb-2003-4-4-r24-i41.gif"/>
                           </p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
               <p>
                  <graphic file="gb-2003-4-4-r24-i2.gif"/>
               </p>
               <p>where <it>D </it>= &#402;<sub>11</sub>&#402;<sub>22 </sub>- &#402;<sub>12</sub>&#402;<sub>21</sub>. Normalized linkage disequilibrium parameters <it>D</it>' were calculated using the method of Lewontin <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> in Hartl and Clark <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Statistical significance of the standardized linkage disequilibrium parameter was calculated using the formula</p>
               <p>
                  <graphic file="gb-2003-4-4-r24-i3.gif"/>
               </p>
               <p>with one degree of freedom <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, where <it>N </it>is the number of chromosomes.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Association efficiency analysis (AEA)</p>
            </st>
            <p>To assess the ability of one marker to detect a given disease association at a second marker, we derived a measure we call association efficiency, which is the relative risk at one SNP by virtue of its association with a second SNP. Consider a disease-modifying SNP <it>A </it>that is in linkage disequilibrium with a neighboring nonfunctional SNP <it>B</it>. We define <it>R</it><sub><it>A </it></sub>as the relative risk associated with allele <it>A</it><sub>2 </sub>compared to allele <it>A</it><sub>1</sub>. Under a multiplicative mode of inheritance, the frequencies of the different haplotypes (A<sub>1</sub>, A<sub>1</sub>B<sub>2</sub>, A<sub>2</sub>B<sub>1</sub>, A<sub>2</sub>B<sub>2</sub>) found in diseased individuals is expected to be &#402;<sub>11</sub>, &#402;<sub>12</sub>, <it>R</it><sub><it>A</it></sub>&#402;<sub>21</sub>, and <it>R</it><sub><it>A</it></sub>&#402;<sub>22 </sub>(see Table <tblr tid="T3">3</tblr>). Given the risk associated with <it>A</it><sub>2 </sub>and the frequency of each haplotype, we can derive the frequency of allele <it>B</it><sub>2 </sub>among diseased individuals (<it>q</it><sub>2</sub>').</p>
            <p>
               <graphic file="gb-2003-4-4-r24-i4.gif"/>
            </p>
            <p>The frequency of allele <it>B</it><sub>2 </sub>in the general population is defined as <it>q</it><sub>2</sub>. Thus, in a case-control study where diseased individuals are compared to individuals drawn from the general population, we would obtain an odds ratio for allele <it>B</it><sub>2 </sub>of</p>
            <p>
               <graphic file="gb-2003-4-4-r24-i5.gif"/>
            </p>
            <p>This odds ratio provides an approximation of the relative risk that is associated with the non-functional allele <it>B</it><sub>2 </sub>by virtue of its association with the disease allele <it>A</it><sub>2</sub>.</p>
            <p>When locus B is considered to be the disease-modifying locus, with <it>R</it><sub><it>B </it></sub>as the relative risk associated with allele <it>B</it><sub>2 </sub>compared to <it>B</it><sub>1</sub>, it can be shown that the relative risk at locus <it>A </it>is</p>
            <p>
               <graphic file="gb-2003-4-4-r24-i6.gif"/>
            </p>
            <p>In this paper, we consider the possibility that the rarer allele gives a true relative risk of 10 for a given disease, and calculate the apparent relative risk at all other loci. We do this for all pairs of loci. Because this property does not commute (that is, the apparent relative risk at locus <it>B </it>given a true relative risk of 10 at locus <it>A </it>is not necessarily equal to the apparent relative risk at locus <it>A </it>given a true relative risk of 10 at locus <it>B</it>) we calculate two values for each pair of SNPs.</p>
         </sec>
         <sec>
            <st>
               <p>Entropy maximization method (EMM)</p>
            </st>
            <p>A measure of haplotypic diversity, entropy (<it>E</it>), was calculated using</p>
            <p>
               <graphic file="gb-2003-4-4-r24-i7.gif"/>
            </p>
            <p>where <it>p</it><sub><it>i </it></sub>is the frequency of the <it>i</it>th haplotype, and <it>k </it>is the number of unique haplotypes in the sample. Entropy is maximized when all haplotypes have the same frequency. We developed an algorithim that chooses SNPs, first individually, then in multi-SNP haplotypes, that maximize entropy <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Thus we can define a subset of SNPs that represent the greatest proportion of the full 12-SNP haplotypic diversity. This method provides for the selection of optimal haplotype-tagging SNPs even in the absence of a clear haplotypic block structure.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>A g-zipped file of the C source code for the program (<supplr sid="s1">Additional data file 1</supplr>) together with instructions for compiling the C source code (<supplr sid="s2">Additional data file 2</supplr>) are available. Instructions for how to run the compile program (<supplr sid="s3">Additional data file 3</supplr>), an example output file with an introduction (<supplr sid="s4">Additional data file 4</supplr>) and an example of the input file format (<supplr sid="s5">Additional data file 5</supplr>), as well as a file listing marker selection by maximum entropy 94kb (<supplr sid="s6">Additional data file 6</supplr>) can also be downloaded. The algorithm is also available from <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
         <suppl id="s1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>A g-zipped file of the C source code for the program</p>
            </caption>
            <text>
               <p>A g-zipped file of the C source code for the program</p>
            </text>
            <file name="gb-2003-4-4-r24-s1.gz">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Instructions for compiling the C source code</p>
            </caption>
            <text>
               <p>Instructions for compiling the C source code</p>
            </text>
            <file name="gb-2003-4-4-r24-s2.txt">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Instructions on how to run the compiled program</p>
            </caption>
            <text>
               <p>Instructions on how to run the compiled program</p>
            </text>
            <file name="gb-2003-4-4-r24-s3.txt">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>An example output file with an introduction</p>
            </caption>
            <text>
               <p>An example output file with an introduction</p>
            </text>
            <file name="gb-2003-4-4-r24-s4.html">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>An example of the input file format</p>
            </caption>
            <text>
               <p>An example of the input file format</p>
            </text>
            <file name="gb-2003-4-4-r24-s5.txt">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
         <suppl id="s6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>A mathematical explanation of the method</p>
            </caption>
            <text>
               <p>A mathematical explanation of the method</p>
            </text>
            <file name="gb-2003-4-4-r24-s6.pdf">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank the families and our colleagues at the Royal Victoria Hospital, Banjul and the Queen Elizabeth Central Hospital, Blantyre, who made this study possible. The authors would also like to thank Matthew Stephens for providing software for haplotype reconstruction. This work was funded by the Medical Research Council, UK. This article is dedicated to the memory of Ryk Ward, who passed away on February 14, 2003.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Variation in the TNF-alpha promoter region associated with susceptibility to cerebral malaria.</p>
            </title>
            <aug>
               <au>
                  <snm>McGuire</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Allsopp</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Greenwood</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Kwiatkowski</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1994</pubdate>
            <volume>371</volume>
            <fpage>508</fpage>
            <lpage>510</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102169</pubid>
                  <pubid idtype="pmpid" link="fulltext">7935762</pubid>
                  <pubid idtype="doi">10.1038/371508a0</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Severe malarial anemia and cerebral malaria are associated with different tumor necrosis factor promoter alleles.</p>
            </title>
            <aug>
               <au>
                  <snm>McGuire</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Allsopp</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Greenwood</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Kwiatkowski</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Infect Dis</source>
            <pubdate>1999</pubdate>
            <volume>179</volume>
            <fpage>287</fpage>
            <lpage>290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">124231</pubid>
                  <pubid idtype="pmpid" link="fulltext">9841857</pubid>
                  <pubid idtype="doi">10.1086/314533</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A polymorphism that affects OCT-1 binding to the TNF promoter region is associated with severe malaria.</p>
            </title>
            <aug>
               <au>
                  <snm>Knight</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Udalova</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Greenwood</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Peshu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Marsh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kwiatkowski</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1999</pubdate>
            <volume>22</volume>
            <fpage>145</fpage>
            <lpage>150</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/9649</pubid>
                  <pubid idtype="pmpid" link="fulltext">10369255</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Inherited variability of tumor necrosis factor production and susceptibility to infectious disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Knight</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Kwiatkowski</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Assoc Am Physicians</source>
            <pubdate>1999</pubdate>
            <volume>111</volume>
            <fpage>290</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10417736</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genes of the class II and class III major histocompatibility complex are associated with typhoid fever in Vietnam.</p>
            </title>
            <aug>
               <au>
                  <snm>Dunstan</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Blackwell</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Duc</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Lanh</snm>
                  <fnm>MN</fnm>
               </au>
               <au>
                  <snm>Dudbridge</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Phuong</snm>
                  <fnm>CX</fnm>
               </au>
               <au>
                  <snm>Luxemburger</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wain</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>VA</fnm>
               </au>
               <etal/>
            </aug>
            <source>J Infect Dis</source>
            <pubdate>2001</pubdate>
            <volume>183</volume>
            <fpage>261</fpage>
            <lpage>268</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/317940</pubid>
                  <pubid idtype="pmpid" link="fulltext">11120931</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Polymorphism in tumor necrosis factor genes associated with mucocutaneous leishmaniasis.</p>
            </title>
            <aug>
               <au>
                  <snm>Cabrera</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shaw</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Sharples</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Castes</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Convit</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Blackwell</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Exp Med</source>
            <pubdate>1995</pubdate>
            <volume>182</volume>
            <fpage>1259</fpage>
            <lpage>1264</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7595196</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Variation in the tumor necrosis factor-alpha gene promoter region may be associated with death from meningococcal disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Nadel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Newport</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Booy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Levin</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Infect Dis</source>
            <pubdate>1996</pubdate>
            <volume>174</volume>
            <fpage>878</fpage>
            <lpage>880</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8843235</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Scarring trachoma is associated with polymorphism in the tumor necrosis factor alpha (TNF-alpha) gene promoter and with elevated TNF-alpha levels in tear fluid.</p>
            </title>
            <aug>
               <au>
                  <snm>Conway</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Mahdi</snm>
                  <fnm>OS</fnm>
               </au>
               <au>
                  <snm>Jennings</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mbena</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mabey</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>1997</pubdate>
            <volume>65</volume>
            <fpage>1003</fpage>
            <lpage>1006</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">175081</pubid>
                  <pubid idtype="pmpid" link="fulltext">9038309</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Tumour necrosis factor haplotypes and asthma.</p>
            </title>
            <aug>
               <au>
                  <snm>Moffatt</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Cookson</snm>
                  <fnm>WO</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>1997</pubdate>
            <volume>6</volume>
            <fpage>551</fpage>
            <lpage>554</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/6.4.551</pubid>
                  <pubid idtype="pmpid" link="fulltext">9097957</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Primary association of a TNF gene polymorphism with susceptibility to multiple sclerosis.</p>
            </title>
            <aug>
               <au>
                  <snm>Fernandez-Arquero</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Arroyo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rubio</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Vigil</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Conejero</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Figueredo</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>de la Concha</snm>
                  <fnm>EG</fnm>
               </au>
            </aug>
            <source>Neurology</source>
            <pubdate>1999</pubdate>
            <volume>53</volume>
            <fpage>1361</fpage>
            <lpage>1363</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10522904</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Crohn's disease is associated with novel polymorphisms in the 53-flanking region of the tumor necrosis factor gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Negoro</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kinouchi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hiwatashi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Takagi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Satoh</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shimosegawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Toyota</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Gastroenterology</source>
            <pubdate>1999</pubdate>
            <volume>117</volume>
            <fpage>1062</fpage>
            <lpage>1068</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10535868</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Functional consequences of a polymorphism affecting NF-kappaB p50-p50 binding to the TNF promoter region.</p>
            </title>
            <aug>
               <au>
                  <snm>Udalova</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Denys</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ackerman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Foxwell</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kwiatkowski</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>2000</pubdate>
            <volume>20</volume>
            <fpage>9113</fpage>
            <lpage>9119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102169</pubid>
                  <pubid idtype="pmpid" link="fulltext">11094063</pubid>
                  <pubid idtype="doi">10.1128/MCB.20.24.9113-9119.2000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Nucleotide diversity of the TNF gene region in an African village.</p>
            </title>
            <aug>
               <au>
                  <snm>Richardson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sisay-Joof</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ackerman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Usen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Katundu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Molyneux</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pinder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kwiatkowski</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genes Immun</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>343</fpage>
            <lpage>348</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.gene.6363789</pubid>
                  <pubid idtype="pmpid" link="fulltext">11607791</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The structure of haplotype blocks in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Gabriel</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Schaffner</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Blumenstiel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>DeFelice</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lochner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Faggart</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>2225</fpage>
            <lpage>2229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1069424</pubid>
                  <pubid idtype="pmpid" link="fulltext">12029063</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase.</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Nickerson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Buchanan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stengard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Salomaa</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Vartiainen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Perola</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boerwinkle</snm>
                  <fnm>E</fnm>
               </au>
               <etal/>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1998</pubdate>
            <volume>63</volume>
            <fpage>595</fpage>
            <lpage>612</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/301977</pubid>
                  <pubid idtype="pmpid" link="fulltext">9683608</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Apolipoprotein E variation at the sequence haplotype level: implications for the origin and maintenance of a major human polymorphism.</p>
            </title>
            <aug>
               <au>
                  <snm>Fullerton</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Nickerson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Stengard</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Salomaa</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Vartiainen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Perola</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boerwinkle</snm>
                  <fnm>E</fnm>
               </au>
               <etal/>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2000</pubdate>
            <volume>67</volume>
            <fpage>881</fpage>
            <lpage>900</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/303070</pubid>
                  <pubid idtype="pmpid" link="fulltext">10986041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Recombinational and mutational hotspots within the human lipoprotein lipase gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Templeton</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Nickerson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Boerwinkle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sing</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2000</pubdate>
            <volume>66</volume>
            <fpage>69</fpage>
            <lpage>83</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/302699</pubid>
                  <pubid idtype="pmpid" link="fulltext">10631137</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The detection of linkage disequilibrium between closely linked markers: RFLPs at the AI-CIII apolipoprotein genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Deeb</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Motulsky</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1988</pubdate>
            <volume>42</volume>
            <fpage>113</fpage>
            <lpage>124</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2892394</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Genetic analysis of complex diseases.</p>
            </title>
            <aug>
               <au>
                  <snm>Muller-Myhsok</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Abel</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>275</volume>
            <fpage>1328</fpage>
            <lpage>1330</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9064791</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Haplotype tagging for the identification of common disease genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Johnson</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Esposito</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Barratt</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Heward</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Di Genova</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ueda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cordell</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Eaves</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Dudbridge</snm>
                  <fnm>F</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>233</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1001-233</pubid>
                  <pubid idtype="pmpid" link="fulltext">11586306</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>High-resolution haplotype structure in the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Rioux</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Schaffner</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Hudson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>229</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1001-229</pubid>
                  <pubid idtype="pmpid" link="fulltext">11586305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Genetic variation in the 5q31 cytokine gene cluster confers susceptibility to Crohn disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Rioux</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Silverberg</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Lindblad</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Steinhart</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Delmonte</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Guschwan</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>223</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1001-223</pubid>
                  <pubid idtype="pmpid" link="fulltext">11586304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>A dynamic programming algorithm for haplotype block partitioning.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>7335</fpage>
            <lpage>7339</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">124231</pubid>
                  <pubid idtype="pmpid" link="fulltext">12032283</pubid>
                  <pubid idtype="doi">10.1073/pnas.102186799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>A new statistical method for haplotype reconstruction from population data.</p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2001</pubdate>
            <volume>68</volume>
            <fpage>978</fpage>
            <lpage>989</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/319501</pubid>
                  <pubid idtype="pmpid" link="fulltext">11254454</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Mathematical Genetics Group: Software</p>
            </title>
            <url>http://www.stats.ox.ac.uk/mathgen/software.html</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Molecular Evolutionary Genetics.</p>
            </title>
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <publisher>New York: Columbia University Press</publisher>
            <pubdate>1987</pubdate>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Sampling variances of heterozygosity and genetic distance.</p>
            </title>
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Roychoudhury</snm>
                  <fnm>AK</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1974</pubdate>
            <volume>76</volume>
            <fpage>379</fpage>
            <lpage>390</lpage>
            <xrefbib>
               <pubid idtype="pmpid">4822472</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Arlequin ver. 2.000: a software for population genetics analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Schneider</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Roessli</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Excoffier</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Geneva: Genetics and Biometry Laboratory, University of Geneva</source>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Arlequin</p>
            </title>
            <url>http://lgb.unige.ch/arlequin/</url>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Free Phylogenetic Network Analysis Shareware Software</p>
            </title>
            <url>http://www.fluxus-engineering.com/sharenet.htm</url>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Evolution in Mendelian populations.</p>
            </title>
            <aug>
               <au>
                  <snm>Wright</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1931</pubdate>
            <volume>16</volume>
            <fpage>97</fpage>
            <lpage>159</lpage>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Testing heterozygote excess and deficiency.</p>
            </title>
            <aug>
               <au>
                  <snm>Rousset</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Raymond</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1995</pubdate>
            <volume>140</volume>
            <fpage>1413</fpage>
            <lpage>1419</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7498780</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The interaction of selection and linkage. I. General considerations; heterotic models.</p>
            </title>
            <aug>
               <au>
                  <snm>Lewontin</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1964</pubdate>
            <volume>49</volume>
            <fpage>49</fpage>
            <lpage>67</lpage>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Principles of Population Genetics</p>
            </title>
            <aug>
               <au>
                  <snm>Hartl</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <publisher>Sunderland, MA: Sinauer</publisher>
            <edition>3rd</edition>
            <pubdate>1997</pubdate>
         </bibl>
         <bibl id="B35">
            <title>
               <p>SNP selection page</p>
            </title>
            <url>http://www.well.ox.ac.uk/~rmott/SNPS/</url>
         </bibl>
      </refgrp>
   </bm>
</art>
