<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2156-9-66</ui>
   <ji>1471-2156</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Calculating expected DNA remnants from ancient founding events in human population genetics</p>
         </title>
         <aug>
            <au ce="yes" id="A1">
               <snm>Stacey</snm>
               <fnm>Andrew</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>Andrew.Stacey@osumc.edu</email>
            </au>
            <au ca="yes" ce="yes" id="A2">
               <snm>Sheffield</snm>
               <mi>C</mi>
               <fnm>Nathan</fnm>
               <insr iid="I2"/>
               <email>ncs@byu.net</email>
            </au>
            <au id="A3">
               <snm>Crandall</snm>
               <mi>A</mi>
               <fnm>Keith</fnm>
               <insr iid="I2"/>
               <email>Keith_Crandall@byu.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Statistics, Brigham Young University, Provo, UT 84602, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Biology, Brigham Young University, Provo, UT 84602, USA</p>
            </ins>
            <ins id="I3">
               <p>Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, the Ohio State University, 700 Children's Drive, Columbus, OH 43205, USA</p>
            </ins>
         </insg>
         <source>BMC Genetics</source>
         <issn>1471-2156</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>66</fpage>
         <url>http://www.biomedcentral.com/1471-2156/9/66</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18928554</pubid>
               <pubid idtype="doi">10.1186/1471-2156-9-66</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>10</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>17</day>
               <month>10</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>17</day>
               <month>10</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Stacey et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Recent advancements in sequencing and computational technologies have led to rapid generation and analysis of high quality genetic data. Such genetic data have achieved wide acceptance in studies of historic human population origins and admixture. However, in studies relating to small, recent admixture events, genetic factors such as historic population sizes, genetic drift, and mutation can have pronounced effects on data reliability and utility. To address these issues we conducted genetic simulations targeting influential genetic parameters in admixed populations.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We performed a series of simulations, adjusting variable values to assess the affect of these genetic parameters on current human population studies and what these studies infer about past population structure. Final mean allele frequencies varied from 0.0005 to over 0.50, depending on the parameters.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The results of the simulations illustrate that, while genetic data may be sensitive and powerful in large genetic studies, caution must be used when applying genetic information to small, recent admixture events. For some parameter sets, genetic data will not be adequate to detect historic admixture. In such cases, studies should consider anthropologic, archeological, and linguistic data where possible.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="endnote" subtype="user_supplied_xml" type="bmc"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>In the past 20 years, DNA sequence data and advanced computational techniques have provided an unparalleled resource in the study of human origins<abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and migration<abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. These tools have demonstrated a Pleistocene colonization of America by Asian populations<abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> and have even prompted calculations of the size of the original human founding populations<abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Similarly, DNA sequence data have helped demonstrate the dynamics of large human populations such as primitive human migration out of Africa<abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, the American migration<abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, the Lemba migration in Africa<abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, the migratory history of the Baltic States<abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, and many others. Researchers have even used the population genetics of human disease vectors to trace human migration events<abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. It may be difficult to underestimate the value genetic data have played and will continue to play on our ability to reconstruct historic population events.</p>
         <p>But while sequence data have been used to study many forms of human migration, their utility in the study of small-scale migration is still in question. Research into small migrations like the Norse settlements in Greenland<abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, a possible Polynesian migration to the New World<abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, the North African Slave migration to America<abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, and the pre-Columbian European migration to America<abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>, have traditionally been based primarily on evidence other than DNA sequence information. However, recently, researchers have begun to apply genetic data to these smaller historical migrations and make conclusions about small historic populations using current DNA. For example, DNA information has recently been used to study the small indigenous populations of Tierra del Fuego<abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, and to analyze Caucasian admixture in specific African American populations<abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. It should be noted that genetic data have been used to study the large Norse migration to Ireland<abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, but are an afterthought when researching their short-lived occupation of Canada<abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>.</p>
         <p>This raises questions about the utility of genetic data in providing evidence for historic migrations and inferences of unknown past events. While genetic studies can provide considerable information, they are also accompanied by variation and stochasticity. Because of these limitations, even the most complete studies of human populations have been called "not unequivocal"<abbrgrp><abbr bid="B21">21</abbr></abbrgrp> or "sobering"<abbrgrp><abbr bid="B22">22</abbr></abbrgrp> by those conducting the research. Recent reports have also addressed the limited depth of current genetic studies<abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, indicating that most studies make conclusions after sequencing less than 1% of subjects' genomes, and sampling only small numbers of a population. Such methods can be especially problematic when dealing with historic admixture events that are very small. The difficulty is a function of the current architecture of genetic studies: researchers sample loci from a group of individuals and categorize individuals into groups based on which alleles they have at the loci tested<abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. These categorizations are determined based on the most prevalent or probable genetic markers in an individual's genome. The results of these studies, then, can overlook genetic markers that simply are not sampled, which is common in small admixture events. Additionally, stochastic events can lead to allele fixation and further complicate matters, particularly in small populations. It has been suggested that studies of even the largest migrations should couple genetic information with archeological, anthropological, and linguistic data<abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
         <p>As our ability to collect and analyze DNA sequence data increases, understanding the probabilities and variability associated with admixture becomes especially important. In this study, we explore the utility of DNA sequence data in small, recent human migration studies. We use forward-based genetic simulation to explore three questions: 1) what variables contribute to the presence (or absence) of historic markers in today's genomes, 2) how do these variables affect the probability of finding historically admixed DNA in today's populations, and 3) how can studies be designed to maximize information from genetic data? These questions are answered through genetic simulation and a sample size study aimed at suggesting the numbers of subjects and loci that should be sampled to successfully detect small-scale admixture. In our simulations, we assume that migrant allele frequencies are known a priori. The simulations test our ability to detect these known migrant alleles in admixed descended populations. We find that genetic parameters, the stochasticity of genetic drift, and experimental design all play an important role in the ability to find historic DNA in current admixed populations.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>We used the simuPOP software package for forward-based genetic simulations<abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. In each simulation, a "migrant" population with distinct, known alleles was admixed with a "native" population. We followed the combined population through time and recorded the frequency of migrant alleles at each generation. Because migrant genetic parameters were known a priori, these simulated allele frequencies allow us to assess how parameters affect the ability of detecting migrant alleles in an admixed descendant population. We used a generation time of 23 years as a compromise among differing estimates of human generation times <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. The simuPOP module allows numerous genetic variables to be altered and studied independently. The variables of interest in these initial simulations are basic genetic variables: native population size, migrant population size, mutation rate, time since admixture event, and initial allele frequencies. These variables allow the assessment of the role that population sizes, mutation, genetic drift, and allele frequency have on the amount of migrant DNA present in the admixed population after a number of generations. Our simulations have been designed so that total population sizes are as analogous to effective population sizes (N<sub>e</sub>) as possible. We assume that each individual has an equal expectation of obtaining progeny, that there are equal sex ratios, and that the population remains constant over time<abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. These assumptions allow the population size used in our study to be interpreted as an effective population size, though under some definitions of N<sub>e </sub>our numbers will have different values of N<sub>e </sub>than those assigned. The statistics and results in this study are based on the allele frequencies retrieved from the simuPOP software. We imported these numbers into the R statistical package for numeric and graphical analysis.</p>
         <p>In our genetic simulations, we make a number of assumptions about the populations: random mating, absence of selection, no gene flow, and constant population size from time of the migratory event to the present. Actual populations experience some gene flow with neighboring populations<abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>; however, in our simulations, we do not consider this in an attempt to create a best-case scenario for the migrant allele. If such gene flow did occur, it could only decrease the chances of detecting the migration event by lowering the frequency of the migratory allele in the admixed population. In addition, real populations often experience growth following admixture. However, assuming that the migrant allele is growing at the same rate as the other alleles (random mating), the allele frequency should not be changed directly by population size increase<abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, although the effects of drift could become less pronounced as a result of a greater population size. Further studies and simulations using population growth rates may be helpful in addressing the effects of population growth.</p>
         <sec>
            <st>
               <p>Simulations</p>
            </st>
            <p>Our simulations can be grouped into two separate categories. The first is a series of simulations designed to assess how the parameters mentioned above can influence the presence of historic migrant DNA in today's populations. More concretely, these simulations answer this question: how does each genetic parameter affect the frequency of migrant alleles in an admixed population? Our simulations tested the effect of 4 variables: size of migrant population, size of native population, time since admixture event, and mutation rate at the locus of interest. We assigned each variable a high value and a low value based on current literature and ran a total of 16 simulations using a full factorial experimental design, altering only one variable at a time. This allowed us to study variables independently and assess how they affect the frequency of the migrant allele over time. We compare the impact of each variable by holding other variables constant and comparing the frequencies of the migrant allele.</p>
            <p>We assigned high and low values for the four parameters based on actual events of historic admixture (Table <tblr tid="T1">1</tblr>). The high value for migrating population size was set at 1000, indicative of a large group like the Norse in the North of America<abbrgrp><abbr bid="B10">10</abbr></abbrgrp>; the low value was set at 40, a generic number that could represent any small group of migrants, either in a boat or a migrating family. The high value for native population size was set at 40,000, the size of a large Mayan city in 1492; the low value was 1000, the size of a small city at the same time<abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. The high value for the number of generations since the migratory event was 174 generations ago, roughly the time of the ancient Lemba migration to Africa<abbrgrp><abbr bid="B36">36</abbr></abbrgrp>; the low value of 44 generations represents the recent Norse migration<abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Our simulations represent migration events that have occurred relatively recently (in the past 3,000 years), and the results should be interpreted accordingly. Although one may be able to extrapolate our results to more distant admixture events, additional simulations could better illustrate these scenarios.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Simulation variables</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Variable</p>
                     </c>
                     <c ca="left">
                        <p>Low Value</p>
                     </c>
                     <c ca="center">
                        <p>High Value</p>
                     </c>
                     <c ca="center">
                        <p>Source</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Migrant Population Size</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>1,000</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B10">10</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Native Population Size</p>
                     </c>
                     <c ca="left">
                        <p>1,000</p>
                     </c>
                     <c ca="left">
                        <p>40,000</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B35">35</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Generations Ago</p>
                     </c>
                     <c ca="left">
                        <p>44</p>
                     </c>
                     <c ca="left">
                        <p>174</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B10">10</abbr>
                              <abbr bid="B36">36</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mutation Rate</p>
                     </c>
                     <c ca="left">
                        <p>0.0043</p>
                     </c>
                     <c ca="left">
                        <p>1.3 &#215; 10<sup>-8</sup></p>
                     </c>
                     <c ca="left">
                        <p>see Table <tblr tid="T2">2</tblr></p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The high and low values for mutation rate were chosen based on the mutation rates of the regions of the genome that are used in current genetic research. Determining which regions are preferred in genetic studies is a difficult question, as there are many possibilities; the literature involving just the human migration to America contains (but is not limited to) studies performed using autosomal genes<abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, autosomal microsatellites<abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, Y chromosome<abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, mtDNA<abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>, and SNPs<abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. To determine the mutation rates used in our simulations, we chose a high and low value among these genomic regions (Table <tblr tid="T2">2</tblr>). In our simulations, we used a high mutation rate of 0.0043 mutations/locus/generation and a low rate of 1.3 &#215; 10<sup>-8 </sup>which represent mtDNA and autosomal loci, respectively.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Mutation rates</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Genome Region</p>
                     </c>
                     <c ca="left">
                        <p>Mutation Rate</p>
                     </c>
                     <c ca="left">
                        <p>Source</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Autosomal</p>
                     </c>
                     <c ca="left">
                        <p>2.5 &#215; 10<sup>-8</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B48">48</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Y Chromosome</p>
                     </c>
                     <c ca="left">
                        <p>3 &#215; 10<sup>-3 </sup>to 1 &#215; 10<sup>-8</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B49">49</abbr>
                              <abbr bid="B50">50</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>X Chromosome</p>
                     </c>
                     <c ca="left">
                        <p>1 &#215; 10<sup>-8</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B51">51</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Microsatellites</p>
                     </c>
                     <c ca="left">
                        <p>4.5 &#215; 10<sup>-4</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B52">52</abbr>
                              <abbr bid="B53">53</abbr>
                              <abbr bid="B54">54</abbr>
                              <abbr bid="B55">55</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>mtDNA control region</p>
                     </c>
                     <c ca="left">
                        <p>4.3 &#215; 10<sup>-3</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B56">56</abbr>
                              <abbr bid="B57">57</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Common regions of the human genome used in genetic research and their mutation rates.</p>
               </tblfn>
            </tbl>
            <p>In the first simulations, we modeled only one locus per individual and assumed no recombination; one locus is adequate to assess the role of these parameters on allele frequencies. We also initialized the migrant population with the migrant allele fixed (all migrant individuals possessed the migrant allele). This is unrealistic, but provides a best-case scenario for detecting the migrant allele. We replicated each simulation 250 times.</p>
            <p>The second simulation category was a single simulation designed to mimic the genetic landscape of a true admixed population. We assigned mid-range values for migrant population size (200), native population size (5,000), and generations (100). In order to more realistically model a current study, we followed 1,000 loci on 20 different chromosomes on each individual. This represents a sample much larger than the recommended number needed in order to detect large human admixture<abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. A standard recombination rate of 1.26 cM/Mb was used<abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, though the human recombination rate has been shown to be negligible over 100 generations<abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. At the beginning of the simulation, an initial migrant allele frequency and a mutation rate were randomly generated for each of the 1,000 loci on each individual, in order to model the DNA seen in actual human genetics research. The methods of random generation are outlined below.</p>
            <p>Initial allele frequency is difficult to assign because of the variability of allele frequencies in the human genome. Alleles with frequencies less than 5% are considered rare but are the most common categorization of SNPs and some alleles demonstrate frequencies greater than 90% (though these common alleles are rarely used in genetic studies)<abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. While the majority of SNPs are found in the 5% range, we built a simulation that will provide the best-case scenario for finding migrant alleles. Accordingly, we chose a much larger level for the average of initial allele frequencies, 30%. To generate frequencies in this range, we used a Beta distribution with a mean of 0.30 (Figure <figr fid="F1">1</figr>) and assigned a random frequency to each migrant locus. We also assumed that the migrant alleles were all absent in the native populations.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Initial allele frequencies</p>
               </caption>
               <text>
                  <p><b>Initial allele frequencies</b>. Density of a Beta distribution with a mean of 0.3 and a standard deviation of 0.17. Initial allele frequencies for all alleles were randomly generated from this density.</p>
               </text>
               <graphic file="1471-2156-9-66-1"/>
            </fig>
            <p>Mutation rates depend on the region of the genome used in a study. Differing mutation rates in the literature were presented earlier (Table <tblr tid="T2">2</tblr>). There is no estimate for which region of the genome is used most often in genetic studies; we, therefore, drew random values that capture the entire distribution of mutation rates seen in today's literature. For this simulation, we drew mutation rates equally from three different uniform distributions: one representing autosomal DNA with a low mutation rate (1 &#215; 10<sup>-9</sup>, 1 &#215; 10<sup>-6 </sup>mutations/locus/generation), one representing microsatellites and some sex chromosomes (1 &#215; 10<sup>-6</sup>, 7 &#215; 10<sup>-4</sup>), and one representing mtDNA (1 &#215; 10<sup>-5</sup>, 3 &#215; 10<sup>-3</sup>) (Figure <figr fid="F2">2</figr>). We followed the migrant allele frequency at each locus through 100 generations. Final analyses and graphs were completed using the R software.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Mutation Rates</p>
               </caption>
               <text>
                  <p><b>Mutation rates</b>. Histogram demonstrating the distribution of mutation rates randomly assigned to the 1,000 simulated loci.</p>
               </text>
               <graphic file="1471-2156-9-66-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Sample size study</p>
            </st>
            <p>In order to understand what must be done to successfully study data from historic admixture, we constructed a sample size study using the data from simulation 2. Small human genetics studies test approximately 50 loci when studying populations<abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Given the calculated frequency of migrant alleles in our simulated population, we calculated the number of migrant alleles that would be seen, on average, in each human subject of a genetic study. This is accomplished using the cumulative density function (CDF) of a binomially distributed random variable where the size parameter is 50 and the probability parameter is the expected migrant allele frequency. In comparison, one of the larger human genetic studies to date sequenced 993 loci in each human subject<abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Accordingly, we followed the same protocol to investigate a study of this magnitude, using the binomial CDF with a size parameter of 993 and the same probability parameter.</p>
            <p>The most recent studies have again raised the bar as far as loci per subject, sampling 650,000 loci in each individual<abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Although sampling more loci will find a larger number of migrant alleles, the proportion of such markers in the population does not change when more samples are taken. The study conducted by Li et al. (2008) samples about 20 individuals per population group, a number similar to previous studies. Accordingly, we investigated the sample size necessary to find at least one migrant allele at each of the loci sequenced in a large genetic study.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Simulation study 1</p>
            </st>
            <p>We calculated the frequency of the migrant allele at the final generation in each of the 16 simulations. The mean and standard deviation of this frequency, among the 250 replicates, are reported for each of the 16 simulations (Figure <figr fid="F3">3</figr>). We found that the parameter set that led to the highest mean allele frequency value included a low native population size, high migrant population size, and low mutation rate and was unchanged by the time since the migration event. The parameter sets that led to the lowest mean allele frequency value were: high native population size, low migrant population size, high mutation rate, and high number of generations (highlighted in Figure <figr fid="F3">3</figr>). For these two parameter sets, we randomly selected a fifth of the 250 replicates to illustrate the stochasticity of genetic drift (Figures <figr fid="F4">4</figr> and <figr fid="F5">5</figr>). For the parameter set with the highest final mean allele frequency, we found replicates with final allele frequencies as low as 25.5% or as high as 78.5%. This parameter set also had the highest standard deviation (.1044), indicative of the wide range of final values in the different replicates. For the parameter set with the lowest final mean allele frequency, many of the replicates drifted to extinction (45.6%), while the highest allele frequency was 0.006%.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Simulation Results</p>
               </caption>
               <text>
                  <p><b>Simulation results</b>. The probability of detecting historic, migrant alleles under all combinations of 4 essential genetic parameters. The average final migrant allele frequency of 250 replications of each parameter set is reported as the mean (&#956;) frequency of migrant alleles. The standard deviation (&#963;) of the 250 replications is reported for each parameter set below the corresponding mean. The two parameter sets with the highest and lowest mean allele frequencies are in bold.</p>
               </text>
               <graphic file="1471-2156-9-66-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Genetic Drift</p>
               </caption>
               <text>
                  <p><b>Genetic drift</b>. Individual replications of the parameter sets highlighted in Figure <figr fid="F3">3</figr>. The migrant allele frequency of each replication at each generation are reported and plotted in line format, each replication is described by a single line. Only a sample of 50 replications was used, as 250 lines would be difficult to distinguish. The first parameter set is characterized by an initial allele frequency of 0.5 while the second parameter set has an initial migrant allele frequency of less than 0.001. Genetic drift and mutation cause the allele frequencies to change over time, resulting in some allele extinction and an overall distribution of frequencies at the end of the simulation.</p>
               </text>
               <graphic file="1471-2156-9-66-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Distribution of Final Allele Frequencies</p>
               </caption>
               <text>
                  <p><b>Distribution of final allele frequencies</b>. Histogram of the final allele frequencies recorded over 250 replicates in the parameter sets highlighted in Table <tblr tid="T2">2</tblr>. These histograms are a representation of the last recorded allele frequencies from Figure <figr fid="F4">4</figr>. They demonstrate the distribution of the migrant allele frequency expected to be found in today's population, given the assumed genetic parameters (A: large migrant population, small native population, low mutation rate, and more distant admixture advent. B: small migrant population, large native population, high mutation rate, and a more distant admixture advent)</p>
               </text>
               <graphic file="1471-2156-9-66-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Simulation study 2</p>
            </st>
            <p>For the second simulation study, we followed 1,000 loci through a simulation that could represent a human population (of 5,000 individuals) that experienced admixture (of 200 individuals) circa 2,000 years ago. Out of the 1,000 simulated loci, 140 (14%) drifted to extinction within 100 generations (Figure <figr fid="F6">6</figr>). These extinct alleles, combined with the effects of mutation, decreased the expected allele frequency of the final generation to 1.017%, a 16% decrease from the original value.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Distribution of Final Allele Frequencies</p>
               </caption>
               <text>
                  <p><b>Distribution of final allele frequencies</b>. A histogram showing the migrant allele frequencies found at 1,000 loci in a generic simulated population. This histogram illustrates the allele frequencies one would expect to find if 1,000 informative alleles were sampled from a current population that experienced admixture circa 2,000 years ago, given that the population had the specified genetic parameters.</p>
               </text>
               <graphic file="1471-2156-9-66-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Sample size study</p>
            </st>
            <p>The average final allele frequency of the migrant allele in our population from the second simulation was 1.017%. We calculated the cumulative density function (CDF) for a genetic study that samples 50 loci for each individual and where the probability of detecting the migrant allele is equal to the probability found in our simulations. The CDF demonstrates that in 60% of individuals sequenced for 50 loci, we would not expect to find a single migrant allele (Figure <figr fid="F7">7a</figr>). Furthermore, we will only find more than one migrant allele in 9% of the subjects examined.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Migrant Allele Expectation CDFs</p>
               </caption>
               <text>
                  <p><b>Migrant allele expectation CDFs</b>. The CDF functions for the number of migrant alleles expected to be found in an admixed population for a study sampling 50 loci from each subject (A) and a study sampling 993 loci from each subject (B) (with an expected migrant allele frequency of 1.017%).</p>
               </text>
               <graphic file="1471-2156-9-66-7"/>
            </fig>
            <p>In the case of a large study with as many as 933 loci, based upon the expected migrant allele frequency of 1.017%, almost every subject would demonstrate at least one migrant allele (Figure <figr fid="F7">7b</figr>). In fact, most subjects would demonstrate more than 9 migrant alleles. However, while large studies would expect to succeed in finding more migrant alleles in today's population, this alone cannot link the admixed population to the migrant population. The migrant alleles will still only represent, on average, 1% of every allele sequenced in the entire study. Therefore, although 9 migrant alleles may, on average, be found in each subject, it is hard to know if the migrant alleles will be redundant among loci and subjects or spread evenly throughout all the loci in the study. Additionally, these numbers could be considerably lower depending on the allele frequency in the migrating population.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Our results provide some important insights in detecting historic admixture. The simulations we present illustrate the effect that initial parameters have on the outcome of human admixture. Simple adjustments in the parameters in our simulation series changed the expected allele frequency outcome from as low as 0.0005 to over 0.50, an increase of three orders of magnitude. The results of any admixture study using genetic data, then, are highly dependent on the variables presented in these simulations (e.g., mutation rate, population sizes, and time since admixture (number of generations)).</p>
         <p>High mutation rates can decrease the expected migrant allele frequency and the variability by more than 50 percent, especially in populations that experienced earlier migrations. For example, an increased mutation rate can change the mean final allele frequency from .0243 to .0128, or from .5016 to .2699 (depending on other variables, as reported in Figure <figr fid="F3">3</figr>). Researchers should keep this in mind when selecting loci for analysis. Because some DNA mutation rates are highly variable, choice of locus can have a profound impact on the number of migrant alleles detected years later. Many studies advocate the use of mtDNA due to data collecting feasibility and other factors. However, because the mutation rate is generally higher in mtDNA, it could corrupt signal in studies addressing historic admixture, even when the time frame is relatively recent.</p>
         <p>The sizes of the migrant and native populations are fundamental for an understanding of expected allele frequency. With time since admixture as low as those we consider in our simulations, the most important factors are the sizes of the migrating and native populations. In our simulations, if the native population is large, changing the migrating population size results in a change of mean final allele frequency from .0243 to .0010. If the native population is small, those numbers change to .5016 and .0407. These are the most significant differences illustrated by our simulations and they attest to the important role of population sizes. Researchers should not expect to find many alleles from a small migratory group of 50 individuals in a large population today, even if sampling methods are exhaustive.</p>
         <p>Additionally, we see that time plays an important role. The standard deviations presented in Table <tblr tid="T1">1</tblr> demonstrate that allelic frequencies vary widely, particularly as the number of generations increases. High mutation rates combined with large time spans can reduce migrant allele frequencies significantly. When the mutation rate is low, however, the time since admixture does not affect the final mean allele frequency much (or at all), but it still has a profound impact on the standard deviation. For example, a change in time since admixture in one parameter set almost doubles the standard deviation from .0525 to .1044. As time increases, genetic drift causes the spread of final allele frequencies to increase, particularly when the population sizes are small. Thus, as the time since the admixture event increases, sample size for both loci and subjects becomes increasingly important.</p>
         <p>In our second simulation, most of the migrant alleles are present in less than 2% of the population. In a study of a population where few subjects from many human populations are studied, alleles from a small-scale admixture will usually not be recovered at all. And these rare alleles could easily be ignored in favor of haplotypes that better categorize the population into clusters.</p>
         <p>Our results demonstrate a profound and general fact: the values of these genetic parameters can drastically alter the expected frequency of migrant alleles in today's populations. Even in our simulations, where steps have been taken to ensure a best-case scenario for the migrant allele, there is often a large spread of possible outcomes. DNA data have been touted as a panacea for recovering information about the past, but their use depends so extensively on factors that are beyond our control that their application is not always appropriate. It is imperative, therefore, that researchers understand the implications of the variables we have presented and not rely solely on DNA sequence data when researching small, recent human migrations. We can only hope to understand basic details of population history when quantifying genetic data and even valid results derived from genetic data may still be misleading if viewed unilaterally, as demonstrated by Harpending et al <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>.</p>
         <p>Our results, however, are not completely ominous. Carefully designed studies should be able to draw specific and valid conclusions from genetic data. One area for major improvement is the number of individuals and loci sampled. Our results indicate that a large sample size and large number of loci are needed to obtain robust results. Studies that are unable to sample sufficiently do not have the power to draw appropriate conclusions and should be interpreted with caution. Our results give guidelines for a variety of conditions and allow researchers to analyze the benefits of increasing sample sizes given their populations of interest. Because of the real possibility that a certain allele will have drifted to extinction, even sampling 100% of a population at a single locus may not reveal a single migrant allele, even if it was fixed in the migrant population. If one is faced with the challenge of researching small-scale admixture, it is necessary to identify migrant alleles even if they show up in a very small proportion of loci and subjects. Consequently, phylogenetic methods must be created that can pinpoint very small similarities between populations. Table <tblr tid="T3">3</tblr> summarizes the genetic and experimental factors that we believe will increase the chance of detecting admixture in today's populations. One complication that arises in such situations, however, is that very recent migration and admixture will further complicate the results. Identifying migrant alleles that are rare will be very difficult, not only because of the increased sampling necessary to detect them, but because of the noise that is likely to be introduced in the time since the event under examination.</p>
         <tbl id="T3">
            <title>
               <p>Table 3</p>
            </title>
            <caption>
               <p>Improving probability of detecting historic admixture</p>
            </caption>
            <tblbdy cols="2">
               <r>
                  <c ca="left">
                     <p>
                        <b>Genetic Parameters</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Experimental Design</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="2">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>&#8226; Large Migrant Population</p>
                  </c>
                  <c ca="left">
                     <p>&#8226; Identify informative migrant alleles</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>&#8226; Small Native Population</p>
                  </c>
                  <c ca="left">
                     <p>&#8226; Test large number of loci</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>&#8226; Low mutation rate at loci of interest</p>
                  </c>
                  <c ca="left">
                     <p>&#8226; Large sample size for each population</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>&#8226; Fewer generations since admixture event</p>
                  </c>
                  <c ca="left">
                     <p>&#8226; Establish methods for detecting rare alleles</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>&#8226; Collaborative approach (Archeology, Anthropology, Linguistics)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>Perhaps most importantly, it must be remembered that drift is stochastic and that historic genetic parameters are, for the most part, unknown. Thus, the absence of specific genetic data is not conclusive evidence against historic admixture. Our results illustrate several parameter sets that would cause admixture to be either completely or practically undetectable today. To address the inconsistent results found in DNA all but the largest genetic studies need to continue to consider anthropologic, archeological, and linguistic data in order to formulate conclusions. Finally, our study demonstrates the utility of simulation studies to put bounds on parameter values and sample sizes for studies of human migration events.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The ability to detect historic admixture and make correct inferences based on genetic data depends on the interplay between population sizes, mutation rates, time, and other parameters. We explore the parameter space of historic alleles in current populations and demonstrate the broad implications of each of these genetic parameters on modern allele frequencies. Our results provide guidelines with respect to the population genetic parameters and their values needed to detect migrant alleles in an admixed population. While studies that focus on large admixture events should be able to draw specific and valid conclusions, we suggest that genetic data be used with caution when studying small admixture events. The random nature of admixed genetic data seen in these simulations demonstrates that the utility of genetic data is dependent on the context of each individual study. Increasing the number of loci and the number of individuals sampled will increase the probability of detecting small traces of signal, but other sources of evidence should always be considered where possible.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AS and NCS designed the simulations. NCS wrote and ran the simulations using simuPOP. AS analyzed and formatted the resulting data in R. AS and NCS wrote the manuscript. KAC conceived the study and provided expertise and advice throughout the process, including critical comments on simulation design and on the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Ryan Parr for comments on an earlier draft of this manuscript. This work was supported by an Eliza R. Snow Fellowship from Brigham Young University.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Mapping human genetic ancestry</p>
            </title>
            <aug>
               <au>
                  <snm>Ebersberger</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Galgoczy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Taudien</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Taenzer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Platzer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>von Haeseler</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Molecular Biology and Evolution</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <issue>10</issue>
            <fpage>2266</fpage>
            <lpage>2276</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/molbev/msm156</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Out of Africa again and again</p>
            </title>
            <aug>
               <au>
                  <snm>Templeton</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>416</volume>
            <fpage>45</fpage>
            <lpage>51</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/416045a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11882887</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The Settlement of the Americas: A Comparison of the Linguistic, Dental, and Genetic Evidence</p>
            </title>
            <aug>
               <au>
                  <snm>Greenberg</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>II</snm>
                  <fnm>CGT</fnm>
               </au>
               <au>
                  <snm>Zegura</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fox</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Laughlin</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Szathmary</snm>
                  <fnm>EJE</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Wollford</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Current Anthropology</source>
            <pubdate>1986</pubdate>
            <volume>27</volume>
            <issue>5</issue>
            <fpage>477</fpage>
            <lpage>497</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1086/203472</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Pleistocene Human Colonization of Siberia and Peopling of the Americas: An Ecological Approach</p>
            </title>
            <aug>
               <au>
                  <snm>Goebel</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Evolutionary Anthropology</source>
            <pubdate>1999</pubdate>
            <volume>8</volume>
            <issue>6</issue>
            <fpage>208</fpage>
            <lpage>227</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/(SICI)1520-6505(1999)8:6&lt;208::AID-EVAN2>3.0.CO;2-M</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>On the Number of New World Founders: A Population Genetic Portrait of the Peopling of the Americas</p>
            </title>
            <aug>
               <au>
                  <snm>Hey</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>PlOS Biology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>6</issue>
            <fpage>e193</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1131883</pubid>
                  <pubid idtype="pmpid" link="fulltext">15898833</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030193</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Statistical Evaluation of Alternative Models of Human Evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Fagundes</snm>
                  <fnm>LJR</fnm>
               </au>
               <au>
                  <snm>Ray</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Beaumont</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Neuenschwaner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Salzano</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Bonatto</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Excoffier</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <issue>45</issue>
            <fpage>17614</fpage>
            <lpage>17619</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2077041</pubid>
                  <pubid idtype="pmpid" link="fulltext">17978179</pubid>
                  <pubid idtype="doi">10.1073/pnas.0708280104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Y Chromosomes Traveling Sough: The Cohen Modal Haplotype and the Origins of the Lemba &#8211; the "Black Jews of Southern Africa"</p>
            </title>
            <aug>
               <au>
                  <snm>Thomas</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Parfill</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Skorecki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Roux</snm>
                  <fnm>Ml</fnm>
               </au>
               <au>
                  <snm>Bradman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2000</pubdate>
            <volume>66</volume>
            <fpage>674</fpage>
            <lpage>686</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1288118</pubid>
                  <pubid idtype="pmpid" link="fulltext">10677325</pubid>
                  <pubid idtype="doi">10.1086/302749</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Tranferrin Varients as Markers of Migrations and Admixtures between Populations in the Baltic Sea Region</p>
            </title>
            <aug>
               <au>
                  <snm>Beckman</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sikstrom</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Midelsaar</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Krumina</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ambrasiene</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kucinskas</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Beckman</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Human Heredity</source>
            <pubdate>1998</pubdate>
            <volume>48</volume>
            <fpage>185</fpage>
            <lpage>191</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1159/000022800</pubid>
                  <pubid idtype="pmpid" link="fulltext">9694249</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Traces of human migrations in <it>Helicobacter pylori </it>populations</p>
            </title>
            <aug>
               <au>
                  <snm>Falush</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wirth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Linz</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Stephens</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Blaser</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Graham</snm>
                  <fnm>DY</fnm>
               </au>
               <au>
                  <snm>Vacher</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Perez-Perez</snm>
                  <fnm>GI</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>299</volume>
            <fpage>1582</fpage>
            <lpage>1585</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1080857</pubid>
                  <pubid idtype="pmpid" link="fulltext">12624269</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Contact between Native Americans and the Medieval Norse: A Review of the Evidence</p>
            </title>
            <aug>
               <au>
                  <snm>McGhee</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>American Antiquity</source>
            <pubdate>1984</pubdate>
            <volume>49</volume>
            <issue>1</issue>
            <fpage>4</fpage>
            <lpage>26</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/280509</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Linguistic Evidence for a Prehistoric Polynesia-Southern California Contact Event</p>
            </title>
            <aug>
               <au>
                  <snm>Klar</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>Anthropological LInguistics</source>
            <pubdate>2005</pubdate>
            <volume>47</volume>
            <issue>4</issue>
            <fpage>369</fpage>
            <lpage>400</lpage>
         </bibl>
         <bibl id="B12">
            <title>
               <p>More on the Free Black Population of the Southern Appalachian Mountains: Speculations on the North African Connection</p>
            </title>
            <aug>
               <au>
                  <snm>Allen</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Journal of Black Studies</source>
            <pubdate>1995</pubdate>
            <volume>25</volume>
            <issue>6</issue>
            <fpage>651</fpage>
            <lpage>671</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1177/002193479502500601</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Pre-Columbian Old World Coins in America: An Examination of the Evidence</p>
            </title>
            <aug>
               <au>
                  <snm>Epstein</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Buchanan</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Buttrey</snm>
                  <fnm>TV</fnm>
               </au>
               <au>
                  <snm>Carter</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Cook;</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Covey</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jett</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mundkur</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>AC</fnm>
               </au>
               <etal/>
            </aug>
            <source>Current Anthropology</source>
            <pubdate>1980</pubdate>
            <volume>21</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>20</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1086/202398</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The Myth of the Kensington Rune Stone: The Norse Discovery of Minnesota 1362</p>
            </title>
            <aug>
               <au>
                  <snm>Quaife</snm>
                  <fnm>MM</fnm>
               </au>
            </aug>
            <source>The New England Quarterly</source>
            <pubdate>1934</pubdate>
            <volume>7</volume>
            <issue>4</issue>
            <fpage>613</fpage>
            <lpage>645</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/359189</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Lack of founding Amerindian midochondrial DNA lineages in extinct Aborigines from Tierra del Fuego-Patagonia</p>
            </title>
            <aug>
               <au>
                  <snm>Lalueza</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Perez-Perez</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Prats</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cornudella</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Turbon</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Human Molecular Genetics</source>
            <pubdate>1997</pubdate>
            <volume>6</volume>
            <issue>1</issue>
            <fpage>41</fpage>
            <lpage>46</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/6.1.41</pubid>
                  <pubid idtype="pmpid" link="fulltext">9002668</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Ancestral Proportions and Admixture Dynamics in Geographically Defined African Americans Living in South Carolina</p>
            </title>
            <aug>
               <au>
                  <snm>Parra</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Kittles</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Argyropoulos</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pfaff</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Hiester</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bonilla</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sylvester</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Parrish-Gause</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Garvey</snm>
                  <fnm>WT</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>L</fnm>
               </au>
               <etal/>
            </aug>
            <source>American Journal of Physical Anthropology</source>
            <pubdate>2001</pubdate>
            <volume>114</volume>
            <fpage>18</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/1096-8644(200101)114:1&lt;18::AID-AJPA1002>3.0.CO;2-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">11150049</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Estimating African American Admixture Proportions by Use of Population Specific Alleles</p>
            </title>
            <aug>
               <au>
                  <snm>Parra</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Marcini</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Akey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Martinson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Forrester</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Allison</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Deka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ferrell</snm>
                  <fnm>RE</fnm>
               </au>
               <etal/>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1998</pubdate>
            <volume>63</volume>
            <fpage>1839</fpage>
            <lpage>1851</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1377655</pubid>
                  <pubid idtype="pmpid" link="fulltext">9837836</pubid>
                  <pubid idtype="doi">10.1086/302148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The Scale and Nature of Viking Settlement in Ireland from Y-chromosome Admixture Analysis</p>
            </title>
            <aug>
               <au>
                  <snm>McEvoy</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brady</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>LT</fnm>
               </au>
               <au>
                  <snm>Bradley</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>European Journal of Human Genetics</source>
            <pubdate>2006</pubdate>
            <volume>14</volume>
            <fpage>1288</fpage>
            <lpage>1294</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.ejhg.5201709</pubid>
                  <pubid idtype="pmpid" link="fulltext">16957681</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The Norsemen in America</p>
            </title>
            <aug>
               <au>
                  <snm>Nansen</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>The Geographical Journal</source>
            <pubdate>1911</pubdate>
            <volume>38</volume>
            <issue>6</issue>
            <fpage>557</fpage>
            <lpage>575</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/1778837</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The Norse in Newfoundland: L'Anse aux Meadows and Vinland</p>
            </title>
            <aug>
               <au>
                  <snm>Wallace</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Newfoundland and Labrador Studies</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>1</issue>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Mitochondrial DNA Studies of Native Americans: Conceptions and MIsconceptions of the Population Prehistory of the Americas</p>
            </title>
            <aug>
               <au>
                  <snm>Eshleman</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Malhi</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Evolutionary Anthropology</source>
            <pubdate>2003</pubdate>
            <volume>12</volume>
            <fpage>7</fpage>
            <lpage>18</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/evan.10048</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Genetic Clues to Dispersal in Human Populations: Retracing the Past from the Present</p>
            </title>
            <aug>
               <au>
                  <snm>Cann</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>291</volume>
            <fpage>1742</fpage>
            <lpage>1748</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1058948</pubid>
                  <pubid idtype="pmpid" link="fulltext">11249820</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The Science and Business of Genetic Ancestry Testing</p>
            </title>
            <aug>
               <au>
                  <snm>Bolnick</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Fullwiley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Duster</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Fujimura</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Kahn</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kaufman</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Marks</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Morning</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>318</volume>
            <fpage>399</fpage>
            <lpage>400</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1150098</pubid>
                  <pubid idtype="pmpid" link="fulltext">17947567</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Clines, Clusters, and the Effect of Study Design on the Inferene of Human Population Structure</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Mahajan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ramachandran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Feldmen</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>PLOS Genetics</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <issue>6</issue>
            <fpage>e70</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1310579</pubid>
                  <pubid idtype="pmpid" link="fulltext">16355252</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0010070</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>JZ</fnm>
               </au>
               <au>
                  <snm>Absher</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Southwick</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Castro</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Ramachandran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cann</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Barsh</snm>
                  <fnm>GS</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cavalli-Sforza</snm>
                  <fnm>LL</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2008</pubdate>
            <volume>319</volume>
            <fpage>1100</fpage>
            <lpage>1103</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1153717</pubid>
                  <pubid idtype="pmpid" link="fulltext">18292342</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The Peopling of The New World: Perspectives from Molecular Anthropology</p>
            </title>
            <aug>
               <au>
                  <snm>Schurr</snm>
                  <fnm>TG</fnm>
               </au>
            </aug>
            <source>Annu Rev Anthropol</source>
            <pubdate>2004</pubdate>
            <volume>33</volume>
            <fpage>551</fpage>
            <lpage>583</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.anthro.33.070203.143932</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>simuPOP: a forward-time population genetics simulation envirnment</p>
            </title>
            <aug>
               <au>
                  <snm>Peng</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kimmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>18</issue>
            <fpage>3686</fpage>
            <lpage>3687</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti584</pubid>
                  <pubid idtype="pmpid" link="fulltext">16020469</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Alu Evolution in Human Populations: Using the Coalescent to Estimate Effective Population Size</p>
            </title>
            <aug>
               <au>
                  <snm>Sherry</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Harpending</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Stoneking</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1997</pubdate>
            <volume>147</volume>
            <fpage>1977</fpage>
            <lpage>1982</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1208362</pubid>
                  <pubid idtype="pmpid" link="fulltext">9409852</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Genomic Relationships and Speciation Times of Human, CHimpanzeee, and Gorilla Inferred from a Coalescent Hidden Markov Model</p>
            </title>
            <aug>
               <au>
                  <snm>Hobolth</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Christensen</snm>
                  <fnm>OF</fnm>
               </au>
               <au>
                  <snm>Mailund</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Schierup</snm>
                  <fnm>MH</fnm>
               </au>
            </aug>
            <source>PLOS Genetics</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <issue>2</issue>
            <fpage>e7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1802818</pubid>
                  <pubid idtype="pmpid" link="fulltext">17319744</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0030007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Estimating Mutation Rate and Generation Time from Longitudinal Samples of DNA Sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Fu</snm>
                  <fnm>Y-X</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <issue>4</issue>
            <fpage>620</fpage>
            <lpage>626</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11264414</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Genetics of Populations</p>
            </title>
            <aug>
               <au>
                  <snm>Hedrick</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <publisher>Jones &amp; Bartlett Publishers</publisher>
            <edition>2</edition>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Isolation by Distance</p>
            </title>
            <aug>
               <au>
                  <snm>Wright</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1943</pubdate>
            <volume>28</volume>
            <issue>2</issue>
            <fpage>114</fpage>
            <lpage>138</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1209196</pubid>
                  <pubid idtype="pmpid" link="fulltext">17247074</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Nested clade analyses of phylogeographic data: testing hypotheses about gene flow and population history</p>
            </title>
            <aug>
               <au>
                  <snm>Templeton</snm>
                  <fnm>AR</fnm>
               </au>
            </aug>
            <source>Molecular Ecology</source>
            <pubdate>1998</pubdate>
            <volume>7</volume>
            <fpage>381</fpage>
            <lpage>397</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-294x.1998.00308.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9627999</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Non-equilibrium theory of the allele frequency spectrum</p>
            </title>
            <aug>
               <au>
                  <snm>Evans</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Shvets</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Slatkin</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Theoretical Population Biology</source>
            <pubdate>2006</pubdate>
            <volume>71</volume>
            <fpage>109</fpage>
            <lpage>119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tpb.2006.06.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16887160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Family Size, Prehistoric Population Estimates, and the Ancient Maya</p>
            </title>
            <aug>
               <au>
                  <snm>Haviland</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>American Antiquity</source>
            <pubdate>1972</pubdate>
            <volume>37</volume>
            <issue>1</issue>
            <fpage>135</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/278895</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The Origins of the Lemba "Black Jews" of Southern Africa: Evidence from p12E2 and Other T-Chromosome Markers</p>
            </title>
            <aug>
               <au>
                  <snm>Spurdle</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Jenkins</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1996</pubdate>
            <volume>59</volume>
            <fpage>1126</fpage>
            <lpage>1133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1914832</pubid>
                  <pubid idtype="pmpid">8900243</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Admixture dynamics in Hispanics: A shift in the nuclear genetic ancestry of a South American population isolate</p>
            </title>
            <aug>
               <au>
                  <snm>Bedoya</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Montoya</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Garc&#253;</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Soto</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bourgeois</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Carvajal</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Labuda</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Alvarez</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ospina</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hedrick</snm>
                  <fnm>PW</fnm>
               </au>
               <etal/>
            </aug>
            <source>PNAS</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>19</issue>
            <fpage>7734</fpage>
            <lpage>7239</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.0508716103</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>High-Resolution SNPs and Microsatellite Haplotypes Point to a Single, Recent Entry of Native American Y Chromosomes into the Americas</p>
            </title>
            <aug>
               <au>
                  <snm>Zegura</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Karafet</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Zhivotovsky</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Hammer</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <issue>1</issue>
            <fpage>164</fpage>
            <lpage>175</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh009</pubid>
                  <pubid idtype="pmpid" link="fulltext">14595095</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Peopling of the Americas, Founded by Four Major Lineages of Mitochondrial DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Horai</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kondo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nakagawa-Hattori</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hayashi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sonoda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tajima</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1993</pubdate>
            <volume>10</volume>
            <issue>1</issue>
            <fpage>23</fpage>
            <lpage>47</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7680748</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>A single and early migration for the peopling of the Americas supported by mitochondrial DNA sequence data</p>
            </title>
            <aug>
               <au>
                  <snm>Bonatto</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Salzano</snm>
                  <fnm>FM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>1866</fpage>
            <lpage>1871</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">20009</pubid>
                  <pubid idtype="pmpid" link="fulltext">9050871</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.5.1866</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Detecting Ancient Admixture in Humans Using Sequence Polymorphism Data</p>
            </title>
            <aug>
               <au>
                  <snm>Wall</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>154</volume>
            <issue>3</issue>
            <fpage>1271</fpage>
            <lpage>1279</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460992</pubid>
                  <pubid idtype="pmpid" link="fulltext">10757768</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Comparative Recombination Rates in the Rat, Mouse, and Human Genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Jensen-Seaman</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Furey</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Payseur</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Roskin</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>C-F</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Jacob</snm>
                  <fnm>HJ</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>528</fpage>
            <lpage>538</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">383296</pubid>
                  <pubid idtype="pmpid" link="fulltext">15059993</pubid>
                  <pubid idtype="doi">10.1101/gr.1970304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>High-resolution haplotype structure in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Rioux</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Schaffner</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Hudson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Nature Genetics</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>229</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1001-229</pubid>
                  <pubid idtype="pmpid" link="fulltext">11586305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Ethnic India: A Genomic View, With Special Reference to Peopling and Structure</p>
            </title>
            <aug>
               <au>
                  <snm>Basu</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mukherjee</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sengupta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Banerjee</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chakraborty</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dey</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bhattacharyya</snm>
                  <fnm>NP</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Research</source>
            <pubdate>2007</pubdate>
            <volume>13</volume>
            <fpage>2277</fpage>
            <lpage>2290</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1101/gr.1413403</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Clines, clusters, and the effect of study design on the inference of human population structure</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Mahajan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ramachandran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>MM</fnm>
               </au>
            </aug>
            <source>PLoS Genetics</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <issue>6</issue>
            <fpage>660</fpage>
            <lpage>671</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.pgen.0010070</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Genetic Perspectives on Human Origins and Differentiation</p>
            </title>
            <aug>
               <au>
                  <snm>Harpending</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Annu Rev Genomics Hum Genet</source>
            <pubdate>2000</pubdate>
            <volume>1</volume>
            <fpage>361</fpage>
            <lpage>385</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genom.1.1.361</pubid>
                  <pubid idtype="pmpid" link="fulltext">11701634</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Genetic Traces of Ancient Demography</p>
            </title>
            <aug>
               <au>
                  <snm>Harpending</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Gurven</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jorde</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Sherry</snm>
                  <fnm>ST</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>1961</fpage>
            <lpage>1967</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">19224</pubid>
                  <pubid idtype="pmpid" link="fulltext">9465125</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.4.1961</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Estimate of the Mutation Rate per Nucleotide in Humans</p>
            </title>
            <aug>
               <au>
                  <snm>Nachman</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Crowell</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>156</volume>
            <fpage>297</fpage>
            <lpage>304</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461236</pubid>
                  <pubid idtype="pmpid" link="fulltext">10978293</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Estimating Y chromosome specific microsatellite mutation frequencies using deep rooting pedigrees</p>
            </title>
            <aug>
               <au>
                  <snm>Heyer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Puymirat</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dieltjes</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bakker</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>de Knijff</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Human Molecular Genetics</source>
            <pubdate>1997</pubdate>
            <volume>6</volume>
            <issue>5</issue>
            <fpage>799</fpage>
            <lpage>803</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/6.5.799</pubid>
                  <pubid idtype="pmpid" link="fulltext">9158156</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Characteristics and Frequency of Germline Mutations at Microsatellite Loci from the Human Y Chromosome, as Revealed by Direct Observation in Father/Son Pairs</p>
            </title>
            <aug>
               <au>
                  <snm>Kayser</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Roewer</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hedman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Henke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Henke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brauer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kruger</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Krawczak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nagy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dobosz</snm>
                  <fnm>T</fnm>
               </au>
               <etal/>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2000</pubdate>
            <volume>66</volume>
            <fpage>1580</fpage>
            <lpage>1588</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1378017</pubid>
                  <pubid idtype="pmpid" link="fulltext">10762544</pubid>
                  <pubid idtype="doi">10.1086/302905</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>DNA Variation in a 5-Mb Region of the X Chromosome and Estimates of Sex-Specific/Type-Specific Mutation Rates</p>
            </title>
            <aug>
               <au>
                  <snm>Anagnostopoulos</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Rowley</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Giannelli</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1999</pubdate>
            <volume>64</volume>
            <fpage>508</fpage>
            <lpage>517</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1377759</pubid>
                  <pubid idtype="pmpid" link="fulltext">9973287</pubid>
                  <pubid idtype="doi">10.1086/302250</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Short tandem repeat polymorphism evolution in humans</p>
            </title>
            <aug>
               <au>
                  <snm>Calafell</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Shuster</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>KK</fnm>
               </au>
            </aug>
            <source>European Journal of Human Genetics</source>
            <pubdate>1998</pubdate>
            <volume>6</volume>
            <fpage>38</fpage>
            <lpage>49</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.ejhg.5200151</pubid>
                  <pubid idtype="pmpid" link="fulltext">9781013</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Mutation Rate in Human Microsatellites: Influence of the Struture and Length of Tandem Repeat</p>
            </title>
            <aug>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Klintschar</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Neuhuber</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Huhne</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rolf</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1998</pubdate>
            <volume>62</volume>
            <fpage>1408</fpage>
            <lpage>1415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1377148</pubid>
                  <pubid idtype="pmpid" link="fulltext">9585597</pubid>
                  <pubid idtype="doi">10.1086/301869</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Likelihood-Based Estimation of Microsatellite Mutation Rates</p>
            </title>
            <aug>
               <au>
                  <snm>Whittaker</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Harbord</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Boxall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mackay</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dawson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sibly</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>164</volume>
            <fpage>781</fpage>
            <lpage>787</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462577</pubid>
                  <pubid idtype="pmpid" link="fulltext">12807796</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>A Measure of Population Subdivision Based on Microsatellite Allele Frequencies</p>
            </title>
            <aug>
               <au>
                  <snm>Slatkin</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1995</pubdate>
            <volume>139</volume>
            <fpage>457</fpage>
            <lpage>462</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1206343</pubid>
                  <pubid idtype="pmpid" link="fulltext">7705646</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>The Mutation Rate in the Human mtDNA Control Region</p>
            </title>
            <aug>
               <au>
                  <snm>Sigur&#240;ardottir</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Helgason</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gulcher</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Stefansson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2000</pubdate>
            <volume>66</volume>
            <fpage>1599</fpage>
            <lpage>1609</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1378010</pubid>
                  <pubid idtype="pmpid" link="fulltext">10756141</pubid>
                  <pubid idtype="doi">10.1086/302902</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Mitochondrial mutational spectra in human cells and tissues</p>
            </title>
            <aug>
               <au>
                  <snm>Khrapko</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Coller</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Andre</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>X-C</fnm>
               </au>
               <au>
                  <snm>Hanekamp</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Thilly</snm>
                  <fnm>WG</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>13798</fpage>
            <lpage>13803</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">28387</pubid>
                  <pubid idtype="pmpid" link="fulltext">9391107</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.25.13798</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

