<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2156-8-48</ui>
   <ji>1471-2156</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Patterns of population differentiation of candidate genes for cardiovascular disease</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Kullo</snm>
               <mi>J</mi>
               <fnm>Iftikhar</fnm>
               <insr iid="I1"/>
               <email>kullo.iftikhar@mayo.edu</email>
            </au>
            <au id="A2">
               <snm>Ding</snm>
               <fnm>Keyue</fnm>
               <insr iid="I1"/>
               <email>ding.keyue@mayo.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Division of Cardiovascular Diseases, Mayo Clinic, Rochester MN, USA</p>
            </ins>
         </insg>
         <source>BMC Genetics</source>
         <issn>1471-2156</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>48</fpage>
         <url>http://www.biomedcentral.com/1471-2156/8/48</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17626638</pubid>
               <pubid idtype="doi">10.1186/1471-2156-8-48</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>16</day>
               <month>4</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>12</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>12</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Kullo and Ding; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (<it>F</it><sub><it>ST</it></sub>) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We calculated <it>F</it><sub><it>ST </it></sub>for 15,559 common SNPs (minor allele frequency &#8805; 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted <it>F</it><sub><it>ST </it></sub>for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean <it>F</it><sub><it>ST </it></sub>values for common putatively functional variants were significantly higher than <it>F</it><sub><it>ST </it></sub>values for nonfunctional variants. A significant variation in <it>F</it><sub><it>ST </it></sub>was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high <it>F</it><sub><it>ST</it></sub>. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of <it>F</it><sub><it>ST </it></sub>values was noted among pairwise population comparisons for different biological processes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>These results suggest a possible basis for varying susceptibility to CVD among ethnic groups.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The human population is not homogeneous in terms of disease susceptibility and substantial differences in susceptibility to common chronic diseases such as cardiovascular disease (CVD), are present between self-identified ancestral/ethnic groups <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Significant differences in CVD prevalence were noted in the Seven Countries Study <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. In the United States, African-Americans have a higher prevalence of hypertension <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and hypertensive heart disease and significantly greater cardiovascular morbidity and mortality than Whites <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, whereas Japanese-Americans are less prone to CVD than Whites <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Differences in cardiovascular 'intermediate' phenotypes also occur among populations; for example, plasma lipid levels differ significantly between African-Americans and non-Hispanic whites, and plasma levels of C-reactive protein vary substantially between people of different ethnic origins <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The basis for ethnic differences in CVD susceptibility is not fully understood but it is likely that in addition to environmental factors, genetic factors contribute either by determining type or severity of risk factors, as well as the susceptibility to environmental/lifestyle risk factors <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>Since different populations are subject to distinct environments, natural selection may produce population-specific allele frequencies. If a functional genetic variant exhibits significantly different pattern of geographic variation compared to a neutral variant, this may be indicative of different selective pressures across populations <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. For instance, a given genetic variation may be adaptive under a local environmental stressor, which would increase the allele frequencies of this selected locus in a particular population and lead to a greater level of population differentiation <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. A recent study suggested that differential susceptibility to hypertension may be due to differential exposure to selection pressures during the out-of-Africa expansion <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>Natural selection alters the amount of differentiation between or among populations within a species so that a measure quantifying the differences in allele frequencies among human populations from diverse geographical regions &#8211; the <it>F</it><sub><it>ST </it></sub>statistic &#8211; has been used to test for evidence of selection. <it>F</it><sub><it>ST </it></sub>is a measure of the correlation between alleles in subpopulations relative to the alleles in the total population <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. It is expected that local adaptation will lead to an increase in <it>F</it><sub><it>ST</it></sub>, when comparing populations under different environmental pressures <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Multilocus scans in the human genome, using either single nucleotide polymorphisms (SNPs) <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> or microsatellite markers <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, that compare different populations for several loci, can identify genomic regions carrying a variant that results in a local adaptation <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Recently, Ryan <it>et al</it>. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> investigated population differentiation among different functional classes of immunologically important genes and found significantly increased <it>F</it><sub><it>ST </it></sub>in individual nonsynonymous SNPs of the intercellular adhesion molecule 1 (<it>ICAM1</it>) and Toll-like receptors (<it>TLR</it>) genes.</p>
         <p>Population differentiation has particular relevance for studies of genetic susceptibility to complex diseases since many of the genes that are known to have been affected by natural selection are medically important <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Loci with an increased <it>F</it><sub><it>ST </it></sub>should be considered high priority candidate genes for association studies of complex diseases as well as the study of local adaptation to environmental conditions. An example &#8211; provided by Rockman <it>et al</it>. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> &#8211; is the increased frequency of high-expression allele (5T) of <it>MMP3 </it>due to positive selection in Europe but not elsewhere (i.e., a significant differentiation was noted between populations). This variant is associated with reduced arterial stiffness, resulting in lower CVD risk. Understanding genotypic difference among ethnic groups for these genes in relevant biological pathways will provide insights into ethnic differences in complex diseases that may be useful in the prevention and treatment of such diseases <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
         <p>In the present study, using genotype data for three populations from HapMap <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> &#8211; Yoruban Africans (YRI), European Whites (CEU), and East Asians (CHB + JPT) &#8211; we investigated differences in the distribution of common variants (minor allele frequency &#8805; 0.10) of 364 genes in etiologic pathways for CVD, and assessed patterns of population differentiation of these genes in the various biological processes underlying CVD. Our goal was to identify loci with high levels of population differentiation, as a step towards understanding the genetic basis of ethnic differences in cardiovascular risk.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Genotype data for CVD candidate genes</p>
            </st>
            <p>Genotype data for 35,369 SNPs in 405 of 416 genes in etiologic pathways of CVD were available from HapMap; these included 24,391 SNPs in YRI, 22,751 SNPs in CEU, and 20,965 SNPs in CHB+JPT, respectively. Pairwise population comparisons of common variants (MAF &#8805; 0.10) showed that a sizeable fraction (79% &#8211; 89%) of variants common in one population were present in another population. To avoid ascertainment bias, we focused on the co-segregating SNPs among three populations that had a minor allele frequency (MAF) &#8805; 0.10 in at least one population and were in Hardy-Weinberg equilibrium, as suggested by Weir <it>et al</it>. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. In all, 15,559 such SNPs from 364 genes (87.5%) were identified and classified as putative functional SNPs [5' untranslated regions (5' UTR); coding (nonsynonymous) sites; and 3' untranslated regions (3' UTR)] and non-functional SNPs [coding (synonymous) sites; and intronic sites], based on NCBI SNP database <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> or UCSC genome browsers <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> (Table <tblr tid="T1">1</tblr>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>The number of co-segregating SNPs for various genomic regions in three population samples from HapMap data set</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Class</p>
                     </c>
                     <c ca="center">
                        <p>Number</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5' flanking regions</p>
                     </c>
                     <c ca="center">
                        <p>1,224</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Introns</p>
                     </c>
                     <c ca="center">
                        <p>13,601</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Synonymous sites</p>
                     </c>
                     <c ca="center">
                        <p>217</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Non-synonymous sites</p>
                     </c>
                     <c ca="center">
                        <p>140</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3' flanking regions</p>
                     </c>
                     <c ca="center">
                        <p>377</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>15,559</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Patterns of <it>F</it><sub><it>ST </it></sub>of CVD candidate genes in three populations</p>
            </st>
            <sec>
               <st>
                  <p>Distribution of pairwise <it>F</it><sub><it>ST </it></sub>values</p>
               </st>
               <p>We first compared the distribution of <it>F</it><sub><it>ST </it></sub>values of 15,559 SNPs among the three populations. The mean <it>F</it><sub><it>ST </it></sub>for YRI vs. CEU was 0.139, for YRI vs. CHB + JPT, 0.158, and for CEU vs. CHB + JPT, 0.095 (Figure <figr fid="F1">1a</figr>). Thus, <it>F</it><sub><it>ST </it></sub>between Africans and East Asians was slightly higher than the genome-wide average of <it>F</it><sub><it>ST </it></sub>(0.10 ~0.15, i.e., the background <it>F</it><sub><it>ST</it></sub>) previously noted to be present between sub-Saharan Africans, Northern Europeans, and East Asians <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. The distribution of single-locus estimates of <it>F</it><sub><it>ST </it></sub>values between two populations has an approximate &#967;<sup>2 </sup>distribution (Figure <figr fid="F1">1b</figr>). The distribution of <it>F</it><sub><it>ST </it></sub>between pairwise populations was significantly different (Kolmogorov-Smirnov test, <it>P </it>&lt; 10<sup>-16</sup>). There was a higher proportion of low (&lt; 0.10) pairwise <it>F</it><sub><it>ST </it></sub>in CEU vs. CHB + JPT (63.5%), compared with YRI vs. CEU (52.5%) or CHB + JPT (49.0%). Larger <it>F</it><sub><it>ST </it></sub>values (> 0.2) for common SNPs were observed in YRI vs. CEU (25.6%) and YRI vs. CHB + JPT (30.0%), but less often in CEU vs. CHB + JPT (15.4%).</p>
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p>(a) Mean <it>F</it><sub><it>ST </it></sub>in pairwise population comparisons in the observed data</p>
                  </caption>
                  <text>
                     <p>(a) Mean <it>F</it><sub><it>ST </it></sub>in pairwise population comparisons in the observed data. (b) Distribution of <it>F</it><sub><it>ST </it></sub>values in HapMap data set. (c) Mean <it>F</it><sub><it>ST </it></sub>in pairwise population comparisons in the simulated data. (d) Distribution of <it>F</it><sub><it>ST </it></sub>values in the simulated data set. YRI, Yoruba in Ibadan, Nigeria; CEU, Utah residents with ancestry from northern and western Europe, CHB, Han Chinese in Beijing, China, and JPT, Japanese in Tokyo, Japan</p>
                  </text>
                  <graphic file="1471-2156-8-48-1"/>
               </fig>
               <p>We randomly selected 15,559 SNPs from the data generated by coalescent simulations (one MB region, 1,000 times, see Methods), which matched the characteristics of the observed data in terms of sample size, average <it>F</it><sub><it>ST </it></sub>(Figure <figr fid="F1">1c</figr>), and MAF (i.e., &#8805; 0.10 at least in one population). The simulated distribution of pairwise <it>F</it><sub><it>ST </it></sub>(Figure <figr fid="F1">1d</figr>) was significantly different compared with the observed HapMap data (Kolmogorov-Smirnov test, <it>P </it>&lt; 10<sup>-10</sup>). An excess of high-<it>F</it><sub><it>ST </it></sub>values were present in the HapMap data for the set of genes in the present study, consistent with action of either genetic drift or natural selection and local adaptation leading to an increase in allele frequencies for the selected locus in a particular population <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. It should be noted that the simulated <it>F</it><sub><it>ST </it></sub>underestimated the <it>F</it><sub><it>ST </it></sub>from the empirical data, consistent with the previous findings <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and indicating that simulation did not perfectly predict <it>F</it><sub><it>ST</it></sub>.</p>
            </sec>
            <sec>
               <st>
                  <p>Distribution of pairwise <it>F</it><sub><it>ST </it></sub>based on SNP functional classification</p>
               </st>
               <p>To assess differences in the distribution of combined <it>F</it><sub><it>ST </it></sub>values according to different categories of SNPs, we plotted the correlation between mean pairwise <it>F</it><sub><it>ST </it></sub>and MAF according to the five different SNPs categories (Figure <figr fid="F2">2</figr>). The mean <it>F</it><sub><it>ST </it></sub>values for SNPs of different categories conditioned on MAF are listed in Table S2 (see additional file <supplr sid="S1">1</supplr>). Using analysis of variance (ANOVA), we found that the pairwise mean <it>F</it><sub><it>ST </it></sub>values in CEU vs. CHB + JPT varied significantly among different SNP categories (<it>P </it>= 0.019) by analysis of variance, but not in YRI vs. CEU (<it>P </it>= 0.273) and YRI vs. CHB + JPT (<it>P </it>= 0.124) (see Methods). In addition, pairwise mean <it>F</it><sub><it>ST </it></sub>values between any two populations differed with MAF (<it>P </it>&lt; 2.2 &#215; 10<sup>-16</sup>), and there was a significant interaction of logarithm transformed MAF &#215; category (<it>P </it>&lt; 3.3 &#215; 10<sup>-5</sup>), indicating that the effect of SNP category was modified by MAF.</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Mean <it>F</it><sub><it>ST </it></sub>values for SNPs of different categories, conditioned on MAF</p>
                  </caption>
                  <text>
                     <p>Mean <it>F</it><sub><it>ST </it></sub>values for SNPs of different categories, conditioned on MAF. YRI, Yoruba in Ibadan, Nigeria; CEU, Utah residents with ancestry from northern and western Europe, CHB, Han Chinese in Beijing, China, and JPT, Japanese in Tokyo, Japan</p>
                  </text>
                  <graphic file="1471-2156-8-48-2"/>
               </fig>
               <suppl id="S1">
                  <title>
                     <p>Additional file 1</p>
                  </title>
                  <text>
                     <p>Patterns of population differentiation of candidate genes for cardiovascular disease. Table S1 describes the summary of functional pathways implicated in atherosclerosis; Table S2 shows mean <it>F</it><sub><it>ST </it></sub>values for SNPs of different categories, conditioned on MAF; and Table S3 is the summary of function for genes with a significant weighted-average <it>F</it><sub><it>ST</it></sub>.</p>
                  </text>
                  <file name="1471-2156-8-48-S1.doc">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <p>Common, putative functional SNPs (i.e., SNPs in potentially functional genomic elements such as non-synonymous sites, 5' and 3' UTR) had systematically higher mean <it>F</it><sub><it>ST </it></sub>values than SNPs in nonfunctional genomic elements (i.e., intronic and synonymous sites), although this was limited to SNPs with MAF &#8805; 0.30. For example, when comparing YRI vs. CHB + JPT, mean <it>F</it><sub><it>ST </it></sub>for common SNPs with MAF of 0.35&#8211;0.40 in non-synonymous sites (<it>F</it><sub><it>ST </it></sub>= 0.303) was higher than that in synonymous (<it>F</it><sub><it>ST </it></sub>= 0.124) and intronic sites (<it>F</it><sub><it>ST </it></sub>= 0.201) (Figure <figr fid="F2">2b</figr>), although the effect was not statistically significant (pairwise comparison by 'multcomp' library in <it>R</it>) due to the limited number of non-synonymous sites. Much higher mean <it>F</it><sub><it>ST </it></sub>values for non-synonymous SNPs with MAF of 0.45&#8211;0.50 were noted in YRI vs. CEU (<it>F</it><sub><it>ST </it></sub>= 0.505).</p>
            </sec>
            <sec>
               <st>
                  <p>Patterns of <it>F</it><sub><it>ST </it></sub>in biological processes and functional pathways</p>
               </st>
               <p>We compared the distribution of <it>F</it><sub><it>ST </it></sub>values for 364 genes in various biological processes (= 6) or functional pathways (= 37). The boxplots of <it>F</it><sub><it>ST </it></sub>values in different biological processes are shown in Figure <figr fid="F3">3</figr>. Significant variation in <it>F</it><sub><it>ST </it></sub>values was seen among different biological processes (<it>P </it>&#8804; 3.1 &#215; 10<sup>-15</sup>, Kruskal-Wallis test).</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Boxplots of <it>F</it><sub><it>ST </it></sub>according to biological processes</p>
                  </caption>
                  <text>
                     <p>Boxplots of <it>F</it><sub><it>ST </it></sub>according to biological processes. Points in the box are the mean <it>F</it><sub><it>ST </it></sub>values. Outliers are not shown. <it>P </it>values were calculated by Kruskal-Wallis test. YRI, Yoruba in Ibadan, Nigeria; CEU, Utah residents with ancestry from northern and western Europe, CHB, Han Chinese in Beijing, China, and JPT, Japanese in Tokyo, Japan</p>
                  </text>
                  <graphic file="1471-2156-8-48-3"/>
               </fig>
               <p>In general, mean <it>F</it><sub><it>ST </it></sub>for each biological process was significantly higher between Africans and non-Africans (especially between Africans and East Asians), in comparison with that between non-Africans (<it>P </it>&lt; 0.05, pairwise <it>t </it>test) (Figure <figr fid="F3">3</figr>). The patterns of <it>F</it><sub><it>ST </it></sub>suggested differential local factors operating on the selected biological processes among populations. For example, for genes in the 'blood circulation &amp; gas exchange' pathway, the mean <it>F</it><sub><it>ST </it></sub>value was 0.129 in YRI vs. CEU and 0.158 in YRI vs. CHB + JPT, but significantly lower (<it>F</it><sub><it>ST </it></sub>= 0.062) in CEU vs. CHB + JPT. A similar pattern was also noted in the 'lipoprotein metabolism' genes (YRI vs. CEU: 0.144; YRI vs. CHB + JPT: 0.165; and CEU vs. CHB + JPT: 0.082).</p>
               <p>Significant variation in <it>F</it><sub><it>ST </it></sub>was also noted among the 37 functional pathways, (data not shown; <it>P </it>&lt; 2.2 &#215; 10<sup>-16</sup>, Kruskal-Wallis test). As expected, the mean <it>F</it><sub><it>ST </it></sub>for each functional pathway was significantly higher between Africans and non-Africans, than between non-Africans. Most strikingly, genes in the 'Insulin/IGF-mitogen activated protein kinase kinase/MAP kinase cascade' pathway showed a relatively high <it>F</it><sub><it>ST </it></sub>in all pairwise population comparisons (YRI vs. CEU: 0.194, YRI vs. CHB + JPT: 0.174 and CEU vs. CHB + JPT: 0.138). Also, a relatively high <it>F</it><sub><it>ST </it></sub>between Africans and non-Africans was noted in the 'interleukin signaling pathway' genes (YRI vs. CEU: 0.192 and YRI vs. CHB + JPT: 0.210).</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Signatures of local adaptation</p>
            </st>
            <p>The HapMap data provides a genome-wide empirical distribution of <it>F</it><sub><it>ST </it></sub>against which significance of <it>F</it><sub><it>ST </it></sub>values can be evaluated, rather than based on theoretical computer simulations <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. SNPs distant from genes are good candidates for neutral mutations since genes and their regulatory elements are more likely to be under selection than non-coding DNA <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. We acquired the empirical 'neutral' distribution of <it>F</it><sub><it>ST </it></sub>values from 289 intergenic regions across the autosomal genome (14,792 SNPs) and 17 intergenic regions across the X chromosome (372 SNPs) without considering the effect of MAF. For autosomal chromosomes, the 95% upper limits of <it>F</it><sub><it>ST </it></sub>values were: YRI vs. CEU (= 0.602); YRI vs. CHB + JPT (= 0.640); and CEU vs. CHB + JPT (= 0.466) (Figure <figr fid="F4">4</figr>). The 95% upper limits of <it>F</it><sub><it>ST </it></sub>values were higher for chromosome X &#8211; 0.729 (YRI vs. CEU), 0.828 (YRI vs. CHB + JPT), and 0.707 (CEU vs. CHB + JPT), respectively.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Mean and 95% upper limits of <it>F</it><sub><it>ST </it></sub>distributions</p>
               </caption>
               <text>
                  <p>Mean and 95% upper limits of <it>F</it><sub><it>ST </it></sub>distributions. Black, dark grey, and light grey bars represent the mean of <it>F</it><sub><it>ST </it></sub>distribution found for the autosomal SNPs analyzed in Akey <it>et al</it>. [18], Izagirre <it>et al</it>. [47], and coalescent simulations, respectively. White bars represent the mean of <it>F</it><sub><it>ST </it></sub>in the 'neutral' autosomal loci (14,792 SNPs) from the intergenic regions in this study. The 95% upper limits are placed on top of the mean value of <it>F</it><sub><it>ST</it></sub>.</p>
               </text>
               <graphic file="1471-2156-8-48-4"/>
            </fig>
            <p>We first calculated the significance level of <it>F</it><sub><it>ST </it></sub>for each SNP locus. A small fraction of SNP loci showed a significantly higher <it>F</it><sub><it>ST </it></sub>(<it>P </it>&lt; 0.05) based on the empirical 'neutral' distribution of <it>F</it><sub><it>ST </it></sub>values &#8211; 238 SNPs (1.53%) in YRI vs. CEU, 325 (2.09%) in YRI vs. CHB + JPT, and 164 (1.05%) in CEU vs. CHB + JPT, respectively. The number of genes with at least one unusually high <it>F</it><sub><it>ST </it></sub>value according to biological process is shown in Table <tblr tid="T2">2</tblr>. The biological processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes (31.3% and 34.0%) with a significantly higher <it>F</it><sub><it>ST </it></sub>(Table <tblr tid="T2">2</tblr>, Figure <figr fid="F5">5</figr>).</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Number of genes in different biological processes with significantly higher <it>F</it><sub><it>ST </it></sub>(empirical <it>P </it>&#8804; 0.05) in at least one SNP</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Biological process (gene number)</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Pairwise population comparisons</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Total</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>YRI vs. CEU</p>
                     </c>
                     <c ca="center">
                        <p>YRI vs. CHB+JPT</p>
                     </c>
                     <c ca="center">
                        <p>CEU vs. CHB+JPT</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Apoptosis (147)</p>
                     </c>
                     <c ca="center">
                        <p>20 (13.6%)</p>
                     </c>
                     <c ca="center">
                        <p>32 (21.8%)</p>
                     </c>
                     <c ca="center">
                        <p>17 (11.6%)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>46 (31.3%)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Blood circulation and gas exchange (13)</p>
                     </c>
                     <c ca="center">
                        <p>1 (7.7%)</p>
                     </c>
                     <c ca="center">
                        <p>2 (15.4%)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0%)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>2 (15.4%)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Blood clotting (40)</p>
                     </c>
                     <c ca="center">
                        <p>7 (17.5%)</p>
                     </c>
                     <c ca="center">
                        <p>6 (15.0%)</p>
                     </c>
                     <c ca="center">
                        <p>5 (12.5%)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>9 (22.5%)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Homeostasis (9)</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>-</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Immune response (151)</p>
                     </c>
                     <c ca="center">
                        <p>20 (13.2%)</p>
                     </c>
                     <c ca="center">
                        <p>19 (12.6%)</p>
                     </c>
                     <c ca="center">
                        <p>10 (6.6%)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>35 (23.2%)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lipoprotein metabolism (53)</p>
                     </c>
                     <c ca="center">
                        <p>11 (20.8%)</p>
                     </c>
                     <c ca="center">
                        <p>9 (17.0%)</p>
                     </c>
                     <c ca="center">
                        <p>5 (9.4%)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>18 (34.0%)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Total (364)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>51 (14.0%)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>63 (17.3%)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>32 (8.8%)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>110 (30.2%)</b>
                        </p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Percentage of genes in different biological processes with significantly high <it>F</it><sub><it>ST </it></sub>(empirical <it>P </it>&#8804; 0.05) in at least one SNP</p>
               </caption>
               <text>
                  <p>Percentage of genes in different biological processes with significantly high <it>F</it><sub><it>ST </it></sub>(empirical <it>P </it>&#8804; 0.05) in at least one SNP.</p>
               </text>
               <graphic file="1471-2156-8-48-5"/>
            </fig>
            <p>We also calculated a weighted-average <it>F</it><sub><it>ST</it></sub>, combining information over loci <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> that summarizes the levels of interpopulation differentiation in each gene. Genes with a significantly higher weighted-average <it>F</it><sub><it>ST </it></sub>are shown in Table <tblr tid="T3">3</tblr>. In total, there were signatures of local adaptation in nine genes (2.5%) &#8211; four genes in YRI vs. CEU and three genes in YRI vs. CHB + JPT, and three genes in CEU vs. CHB + JPT. Most of the genes are involved in 'immune response' (<it>GRB2</it>, <it>IKBKB</it>, <it>IL4</it>, <it>IL6</it>) and 'apoptosis' (<it>ARHGEF1</it>, <it>RIPK1</it>, <it>BCL2L1</it>, <it>IL4</it>, <it>IL6</it>), as well as one gene each in 'blood clotting' (<it>F2</it>) and 'lipoprotein metabolism' (<it>PMVK</it>). The distribution of <it>F</it><sub><it>ST </it></sub>along the sequence for these genes is shown in Figure <figr fid="F6">6</figr>, indicating multiple SNP loci with a significantly high <it>F</it><sub><it>ST</it></sub>.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Genes with a significantly high weighted-average <it>F</it><sub><it>ST </it></sub>(<it>P </it>&#8804; 0.05)</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c ca="left">
                        <p>Symbol</p>
                     </c>
                     <c ca="center">
                        <p>Gene name</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>YRI vs. CEU</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>YRI vs. CHB + JPT</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>CEU vs. CHB + JPT</p>
                     </c>
                     <c ca="center">
                        <p>Biological processes</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <it>F</it>
                           <sub>
                              <it>ST</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>P</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>F</it>
                           <sub>
                              <it>ST</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>P</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>F</it>
                           <sub>
                              <it>ST</it>
                           </sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>P</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>GRB2</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>growth factor receptor-bound protein 2</p>
                     </c>
                     <c ca="center">
                        <p>0.720</p>
                     </c>
                     <c ca="center">
                        <p>0.035*</p>
                     </c>
                     <c ca="center">
                        <p>0.828</p>
                     </c>
                     <c ca="center">
                        <p>0.023*</p>
                     </c>
                     <c ca="center">
                        <p>0.048</p>
                     </c>
                     <c ca="center">
                        <p>0.556</p>
                     </c>
                     <c ca="center">
                        <p>Immune response</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>IKBKB</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase beta</p>
                     </c>
                     <c ca="center">
                        <p>0.658</p>
                     </c>
                     <c ca="center">
                        <p>0.043*</p>
                     </c>
                     <c ca="center">
                        <p>0.555</p>
                     </c>
                     <c ca="center">
                        <p>0.071</p>
                     </c>
                     <c ca="center">
                        <p>0.034</p>
                     </c>
                     <c ca="center">
                        <p>0.605</p>
                     </c>
                     <c ca="center">
                        <p>Immune response</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>IL4</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>interleukin 4</p>
                     </c>
                     <c ca="center">
                        <p>0.262</p>
                     </c>
                     <c ca="center">
                        <p>0.220</p>
                     </c>
                     <c ca="center">
                        <p>0.205</p>
                     </c>
                     <c ca="center">
                        <p>0.329</p>
                     </c>
                     <c ca="center">
                        <p>0.512</p>
                     </c>
                     <c ca="center">
                        <p>0.043*</p>
                     </c>
                     <c ca="center">
                        <p>Immune response/Apoptosis</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>IL6</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>interleukin 6</p>
                     </c>
                     <c ca="center">
                        <p>0.108</p>
                     </c>
                     <c ca="center">
                        <p>0.478</p>
                     </c>
                     <c ca="center">
                        <p>0.431</p>
                     </c>
                     <c ca="center">
                        <p>0.119</p>
                     </c>
                     <c ca="center">
                        <p>0.570</p>
                     </c>
                     <c ca="center">
                        <p>0.037*</p>
                     </c>
                     <c ca="center">
                        <p>Immune response/Apoptosis</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ARHGEF1</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Rho guanine nucleotide exchange factor 1</p>
                     </c>
                     <c ca="center">
                        <p>0.769</p>
                     </c>
                     <c ca="center">
                        <p>0.030*</p>
                     </c>
                     <c ca="center">
                        <p>0.532</p>
                     </c>
                     <c ca="center">
                        <p>0.078</p>
                     </c>
                     <c ca="center">
                        <p>0.149</p>
                     </c>
                     <c ca="center">
                        <p>0.263</p>
                     </c>
                     <c ca="center">
                        <p>Apoptosis</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>RIPK1</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>receptor (<it>TNFRSF</it>)-interacting serine-threonine kinase 1</p>
                     </c>
                     <c ca="center">
                        <p>0.642</p>
                     </c>
                     <c ca="center">
                        <p>0.045*</p>
                     </c>
                     <c ca="center">
                        <p>0.544</p>
                     </c>
                     <c ca="center">
                        <p>0.074</p>
                     </c>
                     <c ca="center">
                        <p>0.070</p>
                     </c>
                     <c ca="center">
                        <p>0.456</p>
                     </c>
                     <c ca="center">
                        <p>Apoptosis</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>BCL2L1</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>BCL2-like 1</p>
                     </c>
                     <c ca="center">
                        <p>0.278</p>
                     </c>
                     <c ca="center">
                        <p>0.204</p>
                     </c>
                     <c ca="center">
                        <p>0.679</p>
                     </c>
                     <c ca="center">
                        <p>0.043*</p>
                     </c>
                     <c ca="center">
                        <p>0.265</p>
                     </c>
                     <c ca="center">
                        <p>0.127</p>
                     </c>
                     <c ca="center">
                        <p>Apoptosis</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>PMVK</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>phosphomevalonate kinase</p>
                     </c>
                     <c ca="center">
                        <p>0.563</p>
                     </c>
                     <c ca="center">
                        <p>0.059*</p>
                     </c>
                     <c ca="center">
                        <p>0.652</p>
                     </c>
                     <c ca="center">
                        <p>0.048*</p>
                     </c>
                     <c ca="center">
                        <p>0.097</p>
                     </c>
                     <c ca="center">
                        <p>0.380</p>
                     </c>
                     <c ca="center">
                        <p>Lipoprotein metabolism</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>F2</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>coagulation factor II</p>
                     </c>
                     <c ca="center">
                        <p>0.413</p>
                     </c>
                     <c ca="center">
                        <p>0.109</p>
                     </c>
                     <c ca="center">
                        <p>0.536</p>
                     </c>
                     <c ca="center">
                        <p>0.077</p>
                     </c>
                     <c ca="center">
                        <p>0.473</p>
                     </c>
                     <c ca="center">
                        <p>0.049*</p>
                     </c>
                     <c ca="center">
                        <p>Blood clotting</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>* Genes with a significantly higher weighted-average <it>F</it><sub><it>ST</it></sub></p>
               </tblfn>
            </tbl>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p><it>F</it><sub><it>ST </it></sub>profile for nine genes with a significantly higher weighted-average <it>F</it><sub><it>ST</it></sub></p>
               </caption>
               <text>
                  <p><it>F</it><sub><it>ST </it></sub>profile for nine genes with a significantly higher weighted-average <it>F</it><sub><it>ST</it></sub>. The X-axis indicates the chromosomal position (kb). See Table 3 for gene names. The average recombination rate (cM/MB) for the genes is: <it>GRB2</it>, 1.78; <it>IKBKB</it>, 0.76; <it>ARHGEF1</it>, 0.81; <it>RIPK1</it>, 2.06; <it>PMVK</it>, 1.07; <it>BCL2L1</it>, 0.86; <it>IL4</it>, 0.94; IL6, 1.10; <it>F2</it>, 0.66.</p>
               </text>
               <graphic file="1471-2156-8-48-6"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The most direct way to study whether genetic risk factors vary among ethnic groups is to determine whether disease susceptibility variants differ in frequency and/or effect among groups <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Several studies have demonstrated that the genotype frequencies of SNPs in candidate genes for cardiovascular diseases (CVD) differ among populations <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B9">9</abbr><abbr bid="B37">37</abbr></abbrgrp>. Lanfear <it>et al</it>. <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> found higher frequencies of disease-associated genotypes in African-Americans than in European-Americans for polymorphisms in <it>GJA4</it>, <it>SERPINE1 </it>(<it>PAI</it>-<it>1</it>) and <it>MMP3</it>. Two nonsense mutations in <it>PCSK9 </it>that lead to lower plasma levels of low-density lipoprotein cholesterol are relatively common in African-Americans (2%) but rare in European-Americans (&lt; 0.1%) <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Significant differences in allele frequencies were noted in the polymorphisms of <it>IL2</it>, <it>IL6</it>, and <it>IL10 </it>among Blacks, Whites, and Asians <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. In a meta-analysis, Ioannidis <it>et al</it>. <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> assessed 43 validated gene-disease associations across 697 study populations of various ethnicities and found that frequencies of polymorphisms in seven cardiovascular disease genes &#8211; <it>APOE</it>, <it>ACE</it>, <it>ITGB3</it>, <it>MTHFR</it>, <it>F2</it>, <it>PON1</it>, and <it>PON2 </it>&#8211; varied significantly between ethnicities (I<sup>2 </sup>&#8805; 75%; I<sup>2 </sup>being a measure of the extent to which the heterogeneity is not due to chance) <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. These loci (except <it>APOE</it>) showed a large heterogeneity of 'race'-specific frequency of polymorphisms. In addition, a disease-associated mutation may be present at high frequency in one population but virtually absent in another, an example being a variant in the <it>SCN5A </it>gene (associated with cardiac arrhythmia) <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, which is present in African-Americans at an allele frequency of 0.132 and is not found in Europeans and Asians. However, the <it>F</it><sub><it>ST </it></sub>value may not be a 'sensitive' test and genes implicated in CVD susceptibility may not lead to exceptionally elevated <it>F</it><sub><it>ST </it></sub>values. For example, the <it>F</it><sub><it>ST </it></sub>value was not significantly higher for the nonsense mutation in <it>PCSK9 </it>when comparing African-Americans (allele frequency: ~2%) and European-Americans (frequency &lt; 0.1%).</p>
         <p>Gene variants that interact with geographic- or population-specific environmental factors may be under strong positive or diversifying selection pressure <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. A measure of population differentiation (i.e., <it>F</it><sub><it>ST</it></sub>) has been used to quantify the degree to which populations are subdivided for particular genetic variants. While other measures, such as the nearest-neighbor statistic (<it>S</it><sub><it>nn</it></sub>) <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, and <it>c </it>parameter (i.e., measuring how isolated a population has been) <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> are used to identify the level of population differentiation, the unbiased estimator of <it>F</it><sub><it>ST </it></sub>is simple and easy to calculate. We calculated <it>F</it><sub><it>ST </it></sub>for SNPs in a set of genes in causal pathways for CVD, to identify the patterns of differences for allele frequencies from one ethnic group to another.</p>
         <p>To avoid false positive results due to genotyping error and ascertainment bias <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B25">25</abbr><abbr bid="B43">43</abbr></abbrgrp>, we studied SNPs that were in Hardy-Weinberg equilibrium, had a minor allele frequency (MAF) of &#8805; 0.10 in at least one population (i.e., common SNPs), and cosegregated in all three populations <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The relatively low genotype error rate (0.3%) in HapMap data <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> has likely had a limited impact on estimates of <it>F</it><sub><it>ST </it></sub><abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Consistent with previous reports <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, most common SNPs were shared between populations in the present study and 10&#8211;20% of the common variants present in one population are not necessarily common in the other population (i.e., private SNPs), especially when comparing Africans and non-Africans. However, whether the 'private' or shared common SNPs contribute to ethnic differences of CVD risk needs further investigation. The approximate chi-square distribution of <it>F</it><sub><it>ST </it></sub>(Figure <figr fid="F1">1b</figr>) is similar to the distribution for the entire HapMap data <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The distribution of pairwise <it>F</it><sub><it>ST </it></sub>is similar for the YRI vs. CEU and YRI vs. CHB + JPT, consistent with previous reports <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B33">33</abbr><abbr bid="B44">44</abbr></abbrgrp>. The lower level of population differentiation in CEU vs. CHB + JPT supports a recent split between these two populations.</p>
         <p>Given that candidate loci with large <it>F</it><sub><it>ST </it></sub>values might have undergone local adaptation <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B45">45</abbr></abbrgrp>, we hypothesized that <it>F</it><sub><it>ST </it></sub>values would be higher in putatively functional variants than in putatively nonfunctional variants. Common functional SNPs with large <it>F</it><sub><it>ST </it></sub>values may influence variation in disease susceptibility among different populations. A high divergence of allele frequency was noted among putatively functional (e.g., nonsynonymous sites and SNPs in 5', and 3' UTRs) and non-functional SNPs (Figure <figr fid="F2">2</figr>). In addition, the pattern varied in different pairwise population comparisons and was most obvious for the Africans vs. non-Africans comparison. An extreme example is a non-synonymous SNP in <it>F2 </it>(rs5896, Met165 &#8594; Thr) with higher <it>F</it><sub><it>ST </it></sub>in YRI vs. CHB + JPT (<it>F</it><sub><it>ST </it></sub>= 1.000) and CEU vs. CHB + JPT (<it>F</it><sub><it>ST </it></sub>= 0.950), at a MAF of 0.00, 0.05, and 1.00 in YRI, CEU and CHB + JPT, respectively. This finding has also been reported in a prior analysis of the Phase I HapMap data <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
         <p>To provide a reasonable biological explanation for the <it>F</it><sub><it>ST </it></sub>values, we considered the sampling distribution of the <it>F</it><sub><it>ST </it></sub>estimates. One of the methods is based on numerical sampling or permutation procedures so that <it>F</it><sub><it>ST </it></sub>is estimated and the proportion of values larger than or equal to the one estimated from the observed data set will yield the unbiased <it>P</it>-value of the test <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Yet another method involves the use of variances in actual values of <it>F</it><sub><it>ST </it></sub>to detect regions of exceptional <it>F</it><sub><it>ST </it></sub>values, defined as population-average values more than three standard deviations from the chromosomal average <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. An alternative strategy is based on coalescent theory that simulates the histories of the samples or the populations. An expected distribution of <it>F</it><sub><it>ST </it></sub>for 15,559 simulated SNPs was generated under the calibrated demographic model <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> (Figure <figr fid="F1">1c~d</figr>). Recently, an empirical distribution has been used to test the significance level of <it>F</it><sub><it>ST </it></sub><abbrgrp><abbr bid="B18">18</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. Instead of whole-genome empirical distribution of <it>F</it><sub><it>ST </it></sub><abbrgrp><abbr bid="B18">18</abbr><abbr bid="B34">34</abbr></abbrgrp>, we used an empirical distribution of <it>F</it><sub><it>ST </it></sub>from 'neutral' loci in 289 and 17 intergenic regions from autosomal chromosomes and X chromosome, respectively. The number of SNPs for empirical neutral distribution (14,792 from autosomal chromosomes and 372 from the X chromosome) takes into account the multiple testing incurred in our evaluation of 15,559 SNPs. This 'neutral' empirical distribution of <it>F</it><sub><it>ST </it></sub>is most likely shaped by only demography and therefore the <it>P </it>values of <it>F</it><sub><it>ST </it></sub>estimated from the empirical distribution may represent a more reliable indicator of selection. The mean and 95% upper limits of this 'neutral' distribution are slightly higher than the previously used 'neutral' empirical <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B47">47</abbr></abbrgrp> or the simulated distribution (Figure <figr fid="F4">4</figr>), indicating the statistical test using the empirical distribution is more conservative. Although none of the SNPs or the genes remained significant after correction for multiple testing using false discovery rate <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, the method of population differentiation can used as an exploratory tool for detecting local adaptation <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
         <p>Genes are subjected to different evolutionary constraints depending on their biological functions and genes with a higher population differentiation are likely to have been more readily influenced by the environment <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. For instance, Grossman <it>et al</it>. <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> found that <it>F</it><sub><it>ST </it></sub>values for 'apoptosis' genes among Ashkenazi, Sephardic and Arab Israelis are low. In the present study, most striking was the high mean <it>F</it><sub><it>ST </it></sub>in the biological process of 'apoptosis' (<it>F</it><sub><it>ST </it></sub>= 0.166) and 'lipoprotein metabolism' (<it>F</it><sub><it>ST </it></sub>= 0.165) between YRI vs. CHB + JPT, but lower <it>F</it><sub><it>ST </it></sub>in CEU vs. CHB + JPT (0.091 and 0.082, respectively) (Figure <figr fid="F3">3</figr>). The patterns suggested that differential environments pressures may have accounted for the varying <it>F</it><sub><it>ST </it></sub>among biological processes for different populations.</p>
         <p>The biological processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of high <it>F</it><sub><it>ST </it></sub>values, which may be the result of local adaptation or genetic drift (Table <tblr tid="T2">2</tblr>, Figure <figr fid="F5">5</figr>). The combined information of <it>F</it><sub><it>ST </it></sub>over loci provides a means to quantify the level of population differentiation in a given gene (Figure <figr fid="F6">6</figr>) <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. A summary of functions for these genes relevant to CVD is shown in Table S3 (see additional file <supplr sid="S1">1</supplr>). For three gene loci &#8211; <it>F2, IL4</it>, and <it>IL6 </it>&#8211; significantly higher <it>F</it><sub><it>ST </it></sub>values in CEU vs. CHB + JPT were noted (Table <tblr tid="T3">3</tblr>). Previous studies also demonstrated a higher population differentiation in <it>IL4 </it>between Europeans and East Asians <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. These findings suggest that different selective factors might be exerting locus-specific effects in populations with different geographic origin. Most of the genes under local adaptation were in the 'immune response' and 'apoptosis' pathways (Table <tblr tid="T3">3</tblr>). How these biological processes may promote differential susceptibility to CVD among Africans and non-Africans need further study.</p>
         <p>Given that CVD susceptibility varies among populations, genes that are responsible for such variations should also differ among populations. Hence, regardless of whether drift or selection is responsible, the approach of looking among a set of candidate genes for those with highest <it>F</it><sub><it>ST </it></sub>values should help identify candidate genes to explain differences in CVD susceptibility. The present study identifies genes in etiological pathways of CVD with a high <it>F</it><sub><it>ST </it></sub>among populations, and should be considered as a means of generating new hypotheses to test (i.e., reprioritize candidate genes rather than identify new ones). Putative functional SNPs with a high <it>F</it><sub><it>ST </it></sub>should be investigated further for confirmation of functional effects and should be included in CVD association studies among populations. In addition, the differential patterns of <it>F</it><sub><it>ST </it></sub>among biological processes and functional pathways may provide insight into the mechanisms contributing to varying CVD susceptibility among different populations.</p>
         <p>Our study has several limitations. First, since population differentiation detects local adaptation in geographically separate populations within the last ~75,000 years <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, the present study cannot identify genes subject to natural selection before this time scale. Phylogenetic analyses <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> detect evolutionary changes preceding this time period whereas nucleotide diversity and LD-based tests <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B53">53</abbr></abbrgrp> may be helpful in further investigating genomic regions with significantly high <it>F</it><sub><it>ST </it></sub>values. Second, the fluctuation of allele frequencies due to a relatively small sample size could affect the robustness of our inferences. Thus, estimation of allele frequencies in a larger sample would be needed to confirm our results. Even though common SNPs (MAF > 0.1) that conformed to Hardy-Weinberg equilibrium (HWE) were included, we cannot completely address the issues of ascertainment bias of SNPs <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. However, SNPs with MAF &lt; 0.1 and not in HWE could have biological relevance since natural selection, not genotyping error also could lead to deviation from HWE. Third, we did not adjust for recombination rate in our analyses. We found a negative correlation between the weighted-average <it>F</it><sub><it>ST </it></sub>and recombination rate (based on the recombination map of Kong et al. <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>), although the correlation was not statistically significant (analyses not shown). The recombination rate in genes with a high <it>F</it><sub><it>ST </it></sub>listed in Figure <figr fid="F6">6</figr> (except GRB2 and PRIK1) was below the average recombination rate (1cM/MB) across the human genome. Finally, a complete catalogue of etiologic pathways implicated in CVD is yet to be established.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In summary, the present study of genes in etiologic pathways for CVD revealed greater population differentiation in putative functional SNPs in these genes, as well as significant variation in <it>F</it><sub><it>ST </it></sub>based on different biological processes relevant to CVD. The biological processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high <it>F</it><sub><it>ST</it></sub>. In addition, the pattern varied in different pairwise population comparisons. SNP loci (especially putatively functional SNPs) and genes with a significantly higher population differentiation should be considered high priority for investigating genetic factors influencing differences in CVD risk among populations.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Genes in the etiologic pathways for cardiovascular disease (CVD)</p>
            </st>
            <p>Based on a search of the literature in PUBMED <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, we identified 37 functional pathways implicated in CVD (a summary of the functional pathways, the number of genes in each pathway and corresponding references is presented in Table S1, see additional file <supplr sid="S1">1</supplr>). We explored 416 genes from these pathways using the Panther classification system <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>. These genes were classified into the following biological processes relevant to CVD: 1) apoptosis; 2) blood circulation &amp; gas exchange; 3) blood clotting; 4) homeostasis; 5) immunity and defense; and 6) lipid fatty acid &amp; steroid metabolism. Examples of candidate genes in various pathways were: caspase and TNF/TNF receptor gene family in the 'apoptosis' process; endothelin and nitric oxide synthase 3 genes in the 'blood circulation &amp; gas exchange' process; those genes involved in the intrinsic and extrinsic coagulation pathway in the 'blood clotting' process; the gene family of insulin receptor substrate in the 'homeostasis' process; those genes participating in the inflammation response, such as the interleukin gene family, in the 'immunity and defense' process; and the arachidonate-lipoxygenase, and phospholipase gene family in the 'lipid fatty acid and steroid metabolism' process.</p>
         </sec>
         <sec>
            <st>
               <p>Genotype data</p>
            </st>
            <p>Using the National Center for Biotechnology Information (NCBI) reference sequence <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, we aligned the sequence of messenger RNA of each gene with the human chromosome sequence (NCBI build 35). Based on the alignment, the genotype data for single nucleotide polymorphisms (SNPs) in each gene were obtained from HapMap database (Phase II) <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B33">33</abbr></abbrgrp>. The HapMap data includes 90 individuals (30 trios) from the Yoruba in Ibadan, Nigeria (YRI), 90 individuals (30 trios) in Utah residents with ancestry from Northern and Western Europe (CEU), 45 unrelated Han Chinese in Beijing, China (CHB) and 45 unrelated Japanese in Tokyo, Japan (JPT). For each gene, map information for SNPs was obtained from NCBI and the genome annotation database at University of California, Santa Cruz (UCSC) <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The reference mRNA sequence was annotated as 5' untranslated regions (5' UTR), coding (synonymous and non-synonymous), intronic, and 3' untranslated regions (3' UTR).</p>
         </sec>
         <sec>
            <st>
               <p>Calculation of <it>F</it><sub><it>ST</it></sub></p>
            </st>
            <p>The estimate of population differentiation, <it>F</it><sub><it>ST</it></sub>, measures relatedness of pairs of alleles within a population relative to the total populations <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B35">35</abbr></abbrgrp>. We calculated an unbiased small-sample estimator of <it>F</it><sub><it>ST </it></sub>as described by Weir <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B59">59</abbr></abbrgrp>. If there are <it>n</it><sub><it>i </it></sub>alleles sampled from the <it>i</it><sup><it>th </it></sup>of <it>r </it>populations, the sample frequency of the SNP allele <it>u </it>in the <it>i</it><sup><it>th </it></sup>subpopulation is <inline-formula><m:math name="1471-2156-8-48-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>p</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>u</m:mi></m:mrow></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGWbaCgaacamaaBaaaleaacqWGPbqAcqWG1bqDaeqaaaaa@311E@</m:annotation></m:semantics></m:math></inline-formula>, and a weighted average of <it>p</it><sub><it>u </it></sub>across population is <inline-formula><m:math name="1471-2156-8-48-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mover accent="true"><m:mi>p</m:mi><m:mo>&#175;</m:mo></m:mover><m:mi>u</m:mi></m:msub><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:mstyle displaystyle="true"><m:munder><m:mo>&#8721;</m:mo><m:mi>i</m:mi></m:munder><m:mrow><m:msub><m:mi>n</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:mstyle></m:mrow></m:mfrac><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>r</m:mi></m:munderover><m:mrow><m:msub><m:mi>n</m:mi><m:mi>i</m:mi></m:msub><m:msub><m:mover accent="true"><m:mi>p</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>u</m:mi></m:mrow></m:msub></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGWbaCgaqeamaaBaaaleaacqWG1bqDaeqaaOGaeyypa0ZaaSaaaeaacqaIXaqmaeaadaaeqbqaaiabd6gaUnaaBaaaleaacqWGPbqAaeqaaaqaaiabdMgaPbqab0GaeyyeIuoaaaGcdaaeWbqaaiabd6gaUnaaBaaaleaacqWGPbqAaeqaaOGafmiCaaNbaGaadaWgaaWcbaGaemyAaKMaemyDauhabeaaaeaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGYbGCa0GaeyyeIuoaaaa@46A2@</m:annotation></m:semantics></m:math></inline-formula>. Two mean squares were defined as, <inline-formula><m:math name="1471-2156-8-48-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>M</m:mi><m:mi>S</m:mi><m:msub><m:mi>G</m:mi><m:mi>u</m:mi></m:msub><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>r</m:mi></m:munderover><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mi>n</m:mi><m:mi>i</m:mi></m:msub><m:mo>&#8722;</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow></m:mfrac><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>r</m:mi></m:munderover><m:mrow><m:msub><m:mi>n</m:mi><m:mi>i</m:mi></m:msub><m:msub><m:mover accent="true"><m:mi>p</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>u</m:mi></m:mrow></m:msub><m:mo stretchy="false">(</m:mo><m:mn>1</m:mn><m:mo>&#8722;</m:mo><m:msub><m:mover accent="true"><m:mi>p</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>u</m:mi></m:mrow></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGnbqtcqWGtbWucqWGhbWrdaWgaaWcbaGaemyDauhabeaakiabg2da9maalaaabaGaeGymaedabaWaaabCaeaacqGGOaakcqWGUbGBdaWgaaWcbaGaemyAaKgabeaakiabgkHiTiabigdaXiabcMcaPaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaemOCaihaniabggHiLdaaaOWaaabCaeaacqWGUbGBdaWgaaWcbaGaemyAaKgabeaakiqbdchaWzaaiaWaaSbaaSqaaiabdMgaPjabdwha1bqabaGccqGGOaakcqaIXaqmcqGHsislcuWGWbaCgaacamaaBaaaleaacqWGPbqAcqWG1bqDaeqaaOGaeiykaKcaleaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGYbGCa0GaeyyeIuoaaaa@57D0@</m:annotation></m:semantics></m:math></inline-formula>, and <inline-formula><m:math name="1471-2156-8-48-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>M</m:mi><m:mi>S</m:mi><m:msub><m:mi>P</m:mi><m:mi>u</m:mi></m:msub><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:mi>r</m:mi><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mfrac><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>r</m:mi></m:munderover><m:mrow><m:msub><m:mi>n</m:mi><m:mi>i</m:mi></m:msub><m:msup><m:mrow><m:mo stretchy="false">(</m:mo><m:msub><m:mover accent="true"><m:mi>p</m:mi><m:mo>&#732;</m:mo></m:mover><m:mrow><m:mi>i</m:mi><m:mi>u</m:mi></m:mrow></m:msub><m:mo>&#8722;</m:mo><m:msub><m:mover accent="true"><m:mi>p</m:mi><m:mo>&#175;</m:mo></m:mover><m:mi>u</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow><m:mn>2</m:mn></m:msup></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGnbqtcqWGtbWucqWGqbaudaWgaaWcbaGaemyDauhabeaakiabg2da9maalaaabaGaeGymaedabaGaemOCaiNaeyOeI0IaeGymaedaamaaqahabaGaemOBa42aaSbaaSqaaiabdMgaPbqabaGccqGGOaakcuWGWbaCgaacamaaBaaaleaacqWGPbqAcqWG1bqDaeqaaOGaeyOeI0IafmiCaaNbaebadaWgaaWcbaGaemyDauhabeaakiabcMcaPmaaCaaaleqabaGaeGOmaidaaaqaaiabdMgaPjabg2da9iabigdaXaqaaiabdkhaYbqdcqGHris5aaaa@4C6F@</m:annotation></m:semantics></m:math></inline-formula>, where <it>MSG</it><sub><it>u </it></sub>and <it>MSP</it><sub><it>u </it></sub>denote the observed mean square errors for loci with populations and between populations, respectively. The moment estimator of <it>F</it><sub><it>ST </it></sub>was defined as, <inline-formula><m:math name="1471-2156-8-48-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>F</m:mi><m:mrow><m:mi>S</m:mi><m:mi>T</m:mi></m:mrow></m:msub><m:mo>=</m:mo><m:mfrac><m:mrow><m:mi>M</m:mi><m:mi>S</m:mi><m:msub><m:mi>P</m:mi><m:mi>u</m:mi></m:msub><m:mo>&#8722;</m:mo><m:mi>M</m:mi><m:mi>S</m:mi><m:msub><m:mi>G</m:mi><m:mi>u</m:mi></m:msub></m:mrow><m:mrow><m:mi>M</m:mi><m:mi>S</m:mi><m:msub><m:mi>P</m:mi><m:mi>u</m:mi></m:msub><m:mo>+</m:mo><m:mo stretchy="false">(</m:mo><m:msub><m:mi>n</m:mi><m:mi>c</m:mi></m:msub><m:mo>&#8722;</m:mo><m:mn>1</m:mn><m:mo stretchy="false">)</m:mo><m:mi>M</m:mi><m:mi>S</m:mi><m:msub><m:mi>G</m:mi><m:mi>u</m:mi></m:msub></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGgbGrdaWgaaWcbaGaem4uamLaemivaqfabeaakiabg2da9maalaaabaGaemyta0Kaem4uamLaemiuaa1aaSbaaSqaaiabdwha1bqabaGccqGHsislcqWGnbqtcqWGtbWucqWGhbWrdaWgaaWcbaGaemyDauhabeaaaOqaaiabd2eanjabdofatjabdcfaqnaaBaaaleaacqWG1bqDaeqaaOGaey4kaSIaeiikaGIaemOBa42aaSbaaSqaaiabdogaJbqabaGccqGHsislcqaIXaqmcqGGPaqkcqWGnbqtcqWGtbWucqWGhbWrdaWgaaWcbaGaemyDauhabeaaaaaaaa@4E17@</m:annotation></m:semantics></m:math></inline-formula>, where, <it>n</it><sub><it>c </it></sub>is the average sample size across samples that also incorporates and corrects for the variance in sample size over subpopulations, <inline-formula><m:math name="1471-2156-8-48-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:mi>n</m:mi><m:mi>c</m:mi></m:msub><m:mo>=</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:mi>r</m:mi><m:mo>&#8722;</m:mo><m:mn>1</m:mn></m:mrow></m:mfrac><m:mo stretchy="false">(</m:mo><m:mstyle displaystyle="true"><m:munderover><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>r</m:mi></m:munderover><m:mrow><m:msub><m:mi>n</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:mstyle><m:mo>&#8722;</m:mo><m:mfrac><m:mrow><m:mstyle displaystyle="true"><m:msubsup><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>r</m:mi></m:msubsup><m:mrow><m:msubsup><m:mi>n</m:mi><m:mi>i</m:mi><m:mn>2</m:mn></m:msubsup></m:mrow></m:mstyle></m:mrow><m:mrow><m:mstyle displaystyle="true"><m:msubsup><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>r</m:mi></m:msubsup><m:mrow><m:msub><m:mi>n</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:mstyle></m:mrow></m:mfrac><m:mo stretchy="false">)</m:mo></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGUbGBdaWgaaWcbaGaem4yamgabeaakiabg2da9maalaaabaGaeGymaedabaGaemOCaiNaeyOeI0IaeGymaedaaiabcIcaOmaaqahabaGaemOBa42aaSbaaSqaaiabdMgaPbqabaaabaGaemyAaKMaeyypa0JaeGymaedabaGaemOCaihaniabggHiLdGccqGHsisldaWcaaqaamaaqadabaGaemOBa42aa0baaSqaaiabdMgaPbqaaiabikdaYaaaaeaacqWGPbqAcqGH9aqpcqaIXaqmaeaacqWGYbGCa0GaeyyeIuoaaOqaamaaqadabaGaemOBa42aaSbaaSqaaiabdMgaPbqabaaabaGaemyAaKMaeyypa0JaeGymaedabaGaemOCaihaniabggHiLdaaaOGaeiykaKcaaa@55C9@</m:annotation></m:semantics></m:math></inline-formula>.</p>
            <p><it>F</it><sub><it>ST </it></sub>was estimated for each SNP locus and a weighted-average <it>F</it><sub><it>ST </it></sub>was estimated for each gene <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B60">60</abbr></abbrgrp>. <it>F</it><sub><it>ST </it></sub>can be negative when levels of differentiation are close to zero and/or sample sizes are small, indicating no population differentiation at these loci <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. In our analysis, we assigned a value of zero to negative <it>F</it><sub><it>ST </it></sub>values. The program for calculating <it>F</it><sub><it>ST </it></sub>for each SNP locus was written in Perl and is available from the authors upon request. The weighted-average <it>F</it><sub><it>ST </it></sub>value combining information over loci <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> was calculated using the 'Genepop' software (version 3.4) <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Expected distribution of <it>F</it><sub><it>ST </it></sub>under a calibrated demographic model</p>
            </st>
            <p>We used coalescent theory to obtain the expected distribution of <it>F</it><sub><it>ST </it></sub>under a calibrated demographic model for Africans, Europeans, and Asians <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Using the program 'cosi' <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>, we simulated a one megabase (MB) region 1,000 times under the 'best-fitting' population parameters for the three populations. The 'best-fitting' set of parameters yielded good agreement with all aspects (including allele frequency spectrum, fraction of alleles that are ancestral, linkage disequilibrium, and <it>F</it><sub><it>ST</it></sub>) of the observed data in the human genome <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Pairwise <it>F</it><sub><it>ST </it></sub>among populations was calculated for each simulated SNP.</p>
            <sec>
               <st>
                  <p>Significance of <it>F</it><sub><it>ST</it></sub></p>
               </st>
               <p>Population demographic history, such as migration among sub-populations, can also influence <it>F</it><sub><it>ST </it></sub><abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. By comparing the <it>F</it><sub><it>ST </it></sub>of an individual locus to the empirical distribution, it is possible to distinguish between genetic drift and natural selection without having to take population demographic history into account <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. To assess the statistical significance of <it>F</it><sub><it>ST </it></sub>values for SNPs, we selected 289 intergenic regions across the autosomal genome and 17 intergenic regions across chromosome X to obtain a neutral distribution of <it>F</it><sub><it>ST</it></sub>, based on the annotation tables of human chromosomes from UCSC database. These regions were separated by at least one MB from the closest exon and did not include centromeric regions. Each region spanned an average of 1.52 MB and in total composed 466.10 MB. Following the method of Izagirre <it>et al</it>. <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, we defined 'neutral' SNPs as follows: 1) separated by at least 25 kb from each other, 2) genotyped in all three populations, and 3) MAF &#8805; 0.10 in at least one of the three populations. In all, 14,792 SNPs for autosomal chromosomes and 372 SNPs for the X chromosome satisfied these criteria. We used the observed 'neutral' distribution to assess the significance level of the <it>F</it><sub><it>ST </it></sub>for each SNP (<it>P </it>value, one-sided) in the autosomal and X chromosome regions separately. We focused on the significantly higher values of <it>F</it><sub><it>ST </it></sub>for local adaptation, although significantly lower <it>F</it><sub><it>ST </it></sub>might result from balancing selection.</p>
               <p>All statistical analyses were performed using <it>R</it>. Analysis of variance (ANOVA) was performed to compare linear regressions of <it>F</it><sub><it>ST </it></sub>against logarithm of MAF [ln(MAF)] with and without the terms for the SNP category and a ln(MAF) &#215; category interaction.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>IJK and KD designed the project, analyzed the data, and wrote the paper. Both authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We acknowledge the Research Computing Facility of Mayo Clinic Rochester and the Supercomputing Institute of University of Minnesota, Minneapolis, for technical support.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Categorization of humans in biomedical research: genes, race and disease</p>
            </title>
            <aug>
               <au>
                  <snm>Risch</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Burchard</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ziv</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>7</issue>
            <fpage>comment2007</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">139378</pubid>
                  <pubid idtype="pmpid" link="fulltext">12184798</pubid>
                  <pubid idtype="doi">10.1186/gb-2002-3-7-comment2007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>'Racial' differences in genetic effects for complex diseases</p>
            </title>
            <aug>
               <au>
                  <snm>Ioannidis</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Ntzani</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Trikalinos</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2004</pubdate>
            <volume>36</volume>
            <issue>12</issue>
            <fpage>1312</fpage>
            <lpage>1318</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1474</pubid>
                  <pubid idtype="pmpid" link="fulltext">15543147</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Coronary heart disease in seven countries.</p>
            </title>
            <aug>
               <au>
                  <snm>Keys</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Circulation</source>
            <pubdate>1970</pubdate>
            <volume>41</volume>
            <issue>4 Suppl</issue>
            <fpage>1</fpage>
            <lpage>211</lpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Racial differences in thoracic aorta atherosclerosis among ischemic stroke patients</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Nanda</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Yesilbursa</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>WY</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Gomez</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Stroke</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <issue>2</issue>
            <fpage>408</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1161/01.STR.0000050643.32175.89</pubid>
                  <pubid idtype="pmpid" link="fulltext">12574551</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Mortality differences between black and white men in the USA: contribution of income and other risk factors among men screened for the MRFIT. MRFIT Research Group. Multiple Risk Factor Intervention Trial</p>
            </title>
            <aug>
               <au>
                  <snm>Davey Smith</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Neaton</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Wentworth</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stamler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stamler</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Lancet</source>
            <pubdate>1998</pubdate>
            <volume>351</volume>
            <issue>9107</issue>
            <fpage>934</fpage>
            <lpage>939</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9734939</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Ten-year incidence of coronary heart disease in the Honolulu Heart Program. Relationship to biologic and lifestyle characteristics</p>
            </title>
            <aug>
               <au>
                  <snm>Yano</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Reed</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>McGee</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Am J Epidemiol</source>
            <pubdate>1984</pubdate>
            <volume>119</volume>
            <issue>5</issue>
            <fpage>653</fpage>
            <lpage>666</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">6720665</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>C-reactive protein as a screening test for cardiovascular risk in a multiethnic population</p>
            </title>
            <aug>
               <au>
                  <snm>Anand</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Razak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yi</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Vuksan</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lonn</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Teo</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>McQueen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yusuf</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Arterioscler Thromb Vasc Biol</source>
            <pubdate>2004</pubdate>
            <volume>24</volume>
            <issue>8</issue>
            <fpage>1509</fpage>
            <lpage>1515</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1161/01.ATV.0000135845.95890.4e</pubid>
                  <pubid idtype="pmpid" link="fulltext">15205215</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Global burden of cardiovascular diseases: part I: general considerations, the epidemiologic transition, risk factors, and impact of urbanization</p>
            </title>
            <aug>
               <au>
                  <snm>Yusuf</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reddy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ounpuu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Anand</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Circulation</source>
            <pubdate>2001</pubdate>
            <volume>104</volume>
            <issue>22</issue>
            <fpage>2746</fpage>
            <lpage>2753</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11723030</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Genotypes associated with myocardial infarction risk are more common in African Americans than in European Americans</p>
            </title>
            <aug>
               <au>
                  <snm>Lanfear</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Marsh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cresci</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shannon</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Spertus</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>McLeod</snm>
                  <fnm>HL</fnm>
               </au>
            </aug>
            <source>J Am Coll Cardiol</source>
            <pubdate>2004</pubdate>
            <volume>44</volume>
            <issue>1</issue>
            <fpage>165</fpage>
            <lpage>167</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jacc.2004.03.053</pubid>
                  <pubid idtype="pmpid" link="fulltext">15234427</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The question of the role of ethnicity on cardiovascular risk: does it matter where we come from?</p>
            </title>
            <aug>
               <au>
                  <snm>Bolli</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Hypertens</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <issue>7</issue>
            <fpage>1331</fpage>
            <lpage>1333</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/01.hjh.0000173512.30116.52</pubid>
                  <pubid idtype="pmpid" link="fulltext">15942452</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The application of molecular genetic approaches to the study of human evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Cavalli-Sforza</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>33 Suppl</volume>
            <fpage>266</fpage>
            <lpage>275</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1113</pubid>
                  <pubid idtype="pmpid" link="fulltext">12610536</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>History and Geography of Human Genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Cavalli-Sforza</snm>
                  <fnm>LL</fnm>
                  <suf>Piazza A, Menozzi P.</suf>
               </au>
            </aug>
            <publisher>Princeton , Princeton University Press</publisher>
            <pubdate>1994</pubdate>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Clines of nuclear DNA markers suggest a largely neolithic ancestry of the European gene pool</p>
            </title>
            <aug>
               <au>
                  <snm>Chikhi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Destro-Bisol</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bertorelle</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pascali</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Barbujani</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <issue>15</issue>
            <fpage>9053</fpage>
            <lpage>9058</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">21201</pubid>
                  <pubid idtype="pmpid" link="fulltext">9671803</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.15.9053</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Differential susceptibility to hypertension is due to selection during the out-of-Africa expansion</p>
            </title>
            <aug>
               <au>
                  <snm>Young</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>YP</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Chretien</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Klag</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Ruff</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>NY</fnm>
               </au>
               <au>
                  <snm>Chakravarti</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <issue>6</issue>
            <fpage>e82</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1342636</pubid>
                  <pubid idtype="pmpid" link="fulltext">16429165</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0010082</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Analysis of population subdivision</p>
            </title>
            <aug>
               <au>
                  <snm>Excoffier</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Handbook of statistical genetics</source>
            <publisher>Chichester, West Sussex , John Wiley &amp; Sons, LTD</publisher>
            <editor>Balding DJ, Bishop M, Cannings C</editor>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Estimating F-statistics</p>
            </title>
            <aug>
               <au>
                  <snm>Weir</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>WG</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>36</volume>
            <fpage>721</fpage>
            <lpage>750</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.36.050802.093940</pubid>
                  <pubid idtype="pmpid" link="fulltext">12359738</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Distribution of gene frequency as a test of the theory of the selective neutrality of polymorphisms</p>
            </title>
            <aug>
               <au>
                  <snm>Lewontin</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Krakauer</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1973</pubdate>
            <volume>74</volume>
            <issue>1</issue>
            <fpage>175</fpage>
            <lpage>195</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1212935</pubid>
                  <pubid idtype="pmpid" link="fulltext">4711903</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Interrogating a high-density SNP map for signatures of natural selection</p>
            </title>
            <aug>
               <au>
                  <snm>Akey</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Shriver</snm>
                  <fnm>MD</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <issue>12</issue>
            <fpage>1805</fpage>
            <lpage>1814</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187574</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466284</pubid>
                  <pubid idtype="doi">10.1101/gr.631202</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A genome scan to detect candidate regions influenced by local natural selection in human populations</p>
            </title>
            <aug>
               <au>
                  <snm>Kayser</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brauer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stoneking</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2003</pubdate>
            <volume>20</volume>
            <issue>6</issue>
            <fpage>893</fpage>
            <lpage>900</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msg092</pubid>
                  <pubid idtype="pmpid" link="fulltext">12717000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Towards a molecular characterization of adaptation in local populations</p>
            </title>
            <aug>
               <au>
                  <snm>Schlotterer</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <issue>6</issue>
            <fpage>683</fpage>
            <lpage>687</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(02)00349-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">12433582</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Levels of interpopulation differentiation among different functional classes of immunologically important genes</p>
            </title>
            <aug>
               <au>
                  <snm>Ryan</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Mapp</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Moyna</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mattiangeli</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Kelleher</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bradley</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>McManus</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genes Immun</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>2</issue>
            <fpage>179</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.gene.6364266</pubid>
                  <pubid idtype="pmpid" link="fulltext">16222342</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Molecular evolution of 5' flanking regions of 87 candidate genes for atherosclerotic cardiovascular disease</p>
            </title>
            <aug>
               <au>
                  <snm>Ding</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kullo</snm>
                  <fnm>IJ</fnm>
               </au>
            </aug>
            <source>Genet Epidemiol</source>
            <pubdate>2006</pubdate>
            <volume>30</volume>
            <issue>7</issue>
            <fpage>557</fpage>
            <lpage>569</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/gepi.20169</pubid>
                  <pubid idtype="pmpid" link="fulltext">16799961</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Positive selection on MMP3 regulation has shaped heart disease risk</p>
            </title>
            <aug>
               <au>
                  <snm>Rockman</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Soranzo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Loisel</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Wray</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>17</issue>
            <fpage>1531</fpage>
            <lpage>1539</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2004.08.051</pubid>
                  <pubid idtype="pmpid" link="fulltext">15341739</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The International HapMap Project</p>
            </title>
            <url>http://www.hapmap.org</url>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Measures of human population structure show heterogeneity among genomic regions</p>
            </title>
            <aug>
               <au>
                  <snm>Weir</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Cardon</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>WG</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>11</issue>
            <fpage>1468</fpage>
            <lpage>1476</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1310634</pubid>
                  <pubid idtype="pmpid" link="fulltext">16251456</pubid>
                  <pubid idtype="doi">10.1101/gr.4398405</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>NCBI SNP database</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/projects/SNP/</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>USCS genome browser [genome.ucsc.edu]</p>
            </title>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Genetic structure of human populations</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Cann</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Zhivotovsky</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <issue>5602</issue>
            <fpage>2381</fpage>
            <lpage>2385</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1078311</pubid>
                  <pubid idtype="pmpid" link="fulltext">12493913</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Human population genetic structure and inference of group membership</p>
            </title>
            <aug>
               <au>
                  <snm>Bamshad</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Wooding</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Watkins</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Ostler</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Jorde</snm>
                  <fnm>LB</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2003</pubdate>
            <volume>72</volume>
            <issue>3</issue>
            <fpage>578</fpage>
            <lpage>589</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1180234</pubid>
                  <pubid idtype="pmpid" link="fulltext">12557124</pubid>
                  <pubid idtype="doi">10.1086/368061</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The genomic distribution of population substructure in four populations using 8,525 autosomal SNPs</p>
            </title>
            <aug>
               <au>
                  <snm>Shriver</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Parra</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Sonpar</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Akey</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>Hum Genomics</source>
            <pubdate>2004</pubdate>
            <volume>1</volume>
            <issue>4</issue>
            <fpage>274</fpage>
            <lpage>286</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15588487</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Deconstructing the relationship between genetics and race</p>
            </title>
            <aug>
               <au>
                  <snm>Bamshad</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wooding</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Salisbury</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>8</issue>
            <fpage>598</fpage>
            <lpage>609</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1401</pubid>
                  <pubid idtype="pmpid" link="fulltext">15266342</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Calibrating a coalescent simulation of human genome sequence variation</p>
            </title>
            <aug>
               <au>
                  <snm>Schaffner</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Foo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gabriel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reich</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>11</issue>
            <fpage>1576</fpage>
            <lpage>1583</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1310645</pubid>
                  <pubid idtype="pmpid" link="fulltext">16251467</pubid>
                  <pubid idtype="doi">10.1101/gr.3709305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>A haplotype map of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brooks</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Chakravarti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <issue>7063</issue>
            <fpage>1299</fpage>
            <lpage>1320</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1880871</pubid>
                  <pubid idtype="pmpid" link="fulltext">16255080</pubid>
                  <pubid idtype="doi">10.1038/nature04226</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Ancient and recent positive selection transformed opioid cis-regulation in humans</p>
            </title>
            <aug>
               <au>
                  <snm>Rockman</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Soranzo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zimprich</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Wray</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>12</issue>
            <fpage>e387</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1283535</pubid>
                  <pubid idtype="pmpid" link="fulltext">16274263</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030387</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Estimating F-statistics for the analysis of population structure</p>
            </title>
            <aug>
               <au>
                  <snm>Weir</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Chockerham</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>Evolution</source>
            <pubdate>1984</pubdate>
            <volume>38</volume>
            <fpage>1358</fpage>
            <lpage>1370</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2408641</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Genetic influences on health: does race matter?</p>
            </title>
            <aug>
               <au>
                  <snm>Bamshad</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>JAMA</source>
            <pubdate>2005</pubdate>
            <volume>294</volume>
            <issue>8</issue>
            <fpage>937</fpage>
            <lpage>946</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1001/jama.294.8.937</pubid>
                  <pubid idtype="pmpid" link="fulltext">16118384</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9</p>
            </title>
            <aug>
               <au>
                  <snm>Cohen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pertsemlidis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kotowski</snm>
                  <fnm>IK</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Garcia</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Hobbs</snm>
                  <fnm>HH</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <issue>2</issue>
            <fpage>161</fpage>
            <lpage>165</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1509</pubid>
                  <pubid idtype="pmpid" link="fulltext">15654334</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Ethnicity greatly influences cytokine gene polymorphism distribution</p>
            </title>
            <aug>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Stanley</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>DiMercurio</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Koziol</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Harlan</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Kirk</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Blair</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Am J Transplant</source>
            <pubdate>2002</pubdate>
            <volume>2</volume>
            <issue>6</issue>
            <fpage>560</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1034/j.1600-6143.2002.20611.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12118901</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Variant of SCN5A sodium channel implicated in risk of cardiac arrhythmia</p>
            </title>
            <aug>
               <au>
                  <snm>Splawski</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Timothy</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Tateyama</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clancy</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Malhotra</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Beggs</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Cappuccio</snm>
                  <fnm>FP</fnm>
               </au>
               <au>
                  <snm>Sagnella</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Kass</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Keating</snm>
                  <fnm>MT</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>297</volume>
            <issue>5585</issue>
            <fpage>1333</fpage>
            <lpage>1336</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1073569</pubid>
                  <pubid idtype="pmpid" link="fulltext">12193783</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Locus-specific genetic diversity between human populations: an analysis of the literature</p>
            </title>
            <aug>
               <au>
                  <snm>Garte</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Am J Hum Biol</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <issue>6</issue>
            <fpage>814</fpage>
            <lpage>823</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/ajhb.10215</pubid>
                  <pubid idtype="pmpid" link="fulltext">14595873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>A new statistic for detecting genetic differentiation</p>
            </title>
            <aug>
               <au>
                  <snm>Hudson</snm>
                  <fnm>RR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>155</volume>
            <issue>4</issue>
            <fpage>2011</fpage>
            <lpage>2014</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461195</pubid>
                  <pubid idtype="pmpid" link="fulltext">10924493</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Assessing population differentiation and isolation from single-nucleotide polymorphism data</p>
            </title>
            <aug>
               <au>
                  <snm>Nicholson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Jonsson</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gustafsson</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Stefanssonand</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society Series B</source>
            <pubdate>2002</pubdate>
            <volume>64</volume>
            <fpage>695</fpage>
            <lpage>715</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/1467-9868.00357</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Ascertainment bias in studies of human genome-wide polymorphism</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Hubisz</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Bustamante</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Williamson</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>11</issue>
            <fpage>1496</fpage>
            <lpage>1502</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1310637</pubid>
                  <pubid idtype="pmpid" link="fulltext">16251459</pubid>
                  <pubid idtype="doi">10.1101/gr.4107905</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans</p>
            </title>
            <aug>
               <au>
                  <snm>Carlson</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Eberle</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rieder</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Nickerson</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>33</volume>
            <issue>4</issue>
            <fpage>518</fpage>
            <lpage>521</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1128</pubid>
                  <pubid idtype="pmpid" link="fulltext">12652300</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Whole-genome patterns of common DNA variation in three human populations</p>
            </title>
            <aug>
               <au>
                  <snm>Hinds</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Stuve</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Nilsen</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Halperin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Eskin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ballinger</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Frazer</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>307</volume>
            <issue>5712</issue>
            <fpage>1072</fpage>
            <lpage>1079</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1105436</pubid>
                  <pubid idtype="pmpid" link="fulltext">15718463</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>The estimation of population differentiation with microsatellite markers</p>
            </title>
            <aug>
               <au>
                  <snm>Balloux</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lugon-Moulin</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Mol Ecol</source>
            <pubdate>2002</pubdate>
            <volume>11</volume>
            <issue>2</issue>
            <fpage>155</fpage>
            <lpage>165</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.0962-1083.2001.01436.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11856418</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>A Scan for Signatures of Positive Selection in Candidate Loci for Skin Pigmentation in Humans</p>
            </title>
            <aug>
               <au>
                  <snm>Izagirre</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Garcia</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Junquera</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>de la Rua</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Alonso</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <issue>9</issue>
            <fpage>1697</fpage>
            <lpage>1706</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msl030</pubid>
                  <pubid idtype="pmpid" link="fulltext">16757656</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>How reliable are empirical genomic scans for selective sweeps?</p>
            </title>
            <aug>
               <au>
                  <snm>Teshima</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Coop</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Przeworski</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <issue>6</issue>
            <fpage>702</fpage>
            <lpage>712</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1473181</pubid>
                  <pubid idtype="pmpid" link="fulltext">16687733</pubid>
                  <pubid idtype="doi">10.1101/gr.5105206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Statistical significance for genomewide studies</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>16</issue>
            <fpage>9440</fpage>
            <lpage>9445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">170937</pubid>
                  <pubid idtype="pmpid" link="fulltext">12883005</pubid>
                  <pubid idtype="doi">10.1073/pnas.1530509100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Genomic profiling of interpopulation diversity guides prioritization of candidate-genes for autoimmunity</p>
            </title>
            <aug>
               <au>
                  <snm>Grossman</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Avidan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Singer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Paperna</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lancet</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Beckmann</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genes Immun</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>6</issue>
            <fpage>493</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.gene.6364117</pubid>
                  <pubid idtype="pmpid" link="fulltext">15269719</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Positive selection on a human-specific transcription factor binding site regulating IL4 expression</p>
            </title>
            <aug>
               <au>
                  <snm>Rockman</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Soranzo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Wray</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>23</issue>
            <fpage>2118</fpage>
            <lpage>2123</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2003.11.025</pubid>
                  <pubid idtype="pmpid" link="fulltext">14654003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Positive natural selection in the human lineage</p>
            </title>
            <aug>
               <au>
                  <snm>Sabeti</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Schaffner</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Fry</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lohmueller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Varilly</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Shamovsky</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Palma</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mikkelsen</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>312</volume>
            <issue>5780</issue>
            <fpage>1614</fpage>
            <lpage>1620</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1124309</pubid>
                  <pubid idtype="pmpid" link="fulltext">16778047</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Comparative and Evolutionary Pharmacogenetics of ABCB1: Complex Signatures of Positive Selection on Coding and Regulatory Regions.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ding</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kullo</snm>
                  <fnm>IJ</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Pharmacogenet Genomics</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <issue>8</issue>
            <fpage>667</fpage>
            <lpage>78</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17622943</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>A high-resolution recombination map of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Kong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gudbjartsson</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Sainz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jonsdottir</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Gudjonsson</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Richardsson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sigurdardottir</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Barnard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hallbeck</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Masson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Shlien</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Palsson</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Frigge</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Thorgeirsson</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Gulcher</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Stefansson</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <issue>3</issue>
            <fpage>241</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12053178</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>PUBMED</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/pubmed</url>
         </bibl>
         <bibl id="B56">
            <title>
               <p>PANTHER classification system</p>
            </title>
            <url>http://panther.appliedbiosystems.com</url>
         </bibl>
         <bibl id="B57">
            <title>
               <p>The PANTHER database of protein families, subfamilies, functions and pathways</p>
            </title>
            <aug>
               <au>
                  <snm>Mi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lazareva-Ulitsky</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Loo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kejariwal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vandergriff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rabkin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Guo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Muruganujan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Doremieux</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Kitano</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>Database issue</issue>
            <fpage>D284</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540032</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608197</pubid>
                  <pubid idtype="doi">10.1093/nar/gki078</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>NCBI Reference sequence</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/RefSeq/</url>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Genetic data analysis II.</p>
            </title>
            <aug>
               <au>
                  <snm>Weir</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <publisher>Sunderland, MA , Sinauer Associated</publisher>
            <pubdate>1986</pubdate>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Estimation of levels of gene flow from DNA sequence data</p>
            </title>
            <aug>
               <au>
                  <snm>Hudson</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Slatkin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Maddison</snm>
                  <fnm>WP</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1992</pubdate>
            <volume>132</volume>
            <issue>2</issue>
            <fpage>583</fpage>
            <lpage>589</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1205159</pubid>
                  <pubid idtype="pmpid" link="fulltext">1427045</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Genepop software</p>
            </title>
            <url>http://wbiomed.curtin.edu.au/genepop/</url>
         </bibl>
         <bibl id="B62">
            <title>
               <p>cosi program</p>
            </title>
            <url>http://www.broad.mit.edu/~sfs/cosi</url>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Letters to the editors: Lewontin-Krakauer test for neutral genes</p>
            </title>
            <aug>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Maruyama</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1975</pubdate>
            <volume>80</volume>
            <issue>2</issue>
            <fpage>395</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1213335</pubid>
                  <pubid idtype="pmpid" link="fulltext">1132690</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Complex signatures of natural selection at the Duffy blood group locus</p>
            </title>
            <aug>
               <au>
                  <snm>Hamblin</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Di Rienzo</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2002</pubdate>
            <volume>70</volume>
            <issue>2</issue>
            <fpage>369</fpage>
            <lpage>383</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419988</pubid>
                  <pubid idtype="pmpid" link="fulltext">11753822</pubid>
                  <pubid idtype="doi">10.1086/338628</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
