<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-9-223</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>GENOMEPOP: A program to simulate genomes in populations</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Carvajal-Rodr&#237;guez</snm>
               <fnm>Antonio</fnm>
               <insr iid="I1"/>
               <email>acraaj@uvigo.es</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Departamento de Bioqu&#237;mica, Gen&#233;tica e Inmunolog&#237;a. Universidad de Vigo, 36310 Vigo, Spain</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>223</fpage>
         <url>http://www.biomedcentral.com/1471-2105/9/223</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18447924</pubid>
               <pubid idtype="doi">10.1186/1471-2105-9-223</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>05</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>30</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>30</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Carvajal-Rodr&#237;guez; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>There are several situations in population biology research where simulating DNA sequences is useful. Simulation of biological populations under different evolutionary genetic models can be undertaken using backward or forward strategies. Backward simulations, also called coalescent-based simulations, are computationally efficient. The reason is that they are based on the history of lineages with surviving offspring in the current population. On the contrary, forward simulations are less efficient because the entire population is simulated from past to present. However, the coalescent framework imposes some limitations that forward simulation does not. Hence, there is an increasing interest in forward population genetic simulation and efficient new tools have been developed recently. Software tools that allow efficient simulation of large DNA fragments under complex evolutionary models will be very helpful when trying to better understand the trace left on the DNA by the different interacting evolutionary forces. Here I will introduce GenomePop, a forward simulation program that fulfills the above requirements. The use of the program is demonstrated by studying the impact of intracodon recombination on global and site-specific <it>dN/dS </it>estimation.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>I have developed algorithms and written software to efficiently simulate, forward in time, different Markovian nucleotide or codon models of DNA mutation. Such models can be combined with recombination, at inter and intra codon levels, fitness-based selection and complex demographic scenarios.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>GenomePop has many interesting characteristics for simulating SNPs or DNA sequences under complex evolutionary and demographic models. These features make it unique with respect to other simulation tools. Namely, the possibility of forward simulation under General Time Reversible (GTR) mutation or GTR&#215;MG94 codon models with intra-codon recombination, arbitrary, user-defined, migration patterns, diploid or haploid models, constant or variable population sizes, etc. It also allows simulation of fitness-based selection under different distributions of mutational effects. Under the 2-allele model it allows the simulation of recombination hot-spots, the definition of different frequencies in different populations, etc. GenomePop can also manage large DNA fragments. In addition, it has a scaling option to save computation time when simulating large sequences and population sizes under complex demographic and evolutionary situations. These and many other features are detailed in its web page <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>There are several situations in population biology research where simulation of DNA sequences is useful. Simulations have been used to for hypothesis testing <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>, to study the impact of differing demographic scenarios on patterns of human diversity <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, or to simulate the evolution of complex diseases in human populations <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. In addition, population simulation of genetic datasets is also used to estimate population parameters <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>One of the most exciting research areas in the current context of population genetics is the HapMap project. Knowledge about patterns of linkage disequilibrium (LD) in humans is very important from a genomic point of view. The existence of linkage or haplotype blocks <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> or, at least, networks of SNPs in high LD <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, will facilitate the assembly of human genome haplotype maps <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp> that will enormously improve, among other things, the efficiency of disease gene mapping. It seems that these blocks are mainly defined by recombination hot spots <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>, but haplotype blocks can also be generated by genetic drift in regions of uniform recombination if rates is low enough <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. We have now growing empirical knowledge about haplotype block and tagSNP diversity, but less is known about the effect of population demographic history. Though important work has been undertaken in the application of population genetics to LD mapping <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp> and its relevance to human populations <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, we still have an incomplete understanding of how the combined effect of genetic drift, mutation, recombination and migration, affect LD and tagSNP patterns, although it is known that they do <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Moreover, recombination is an important evolutionary process to understand how genetic diversity is generated and maintained in populations. Jointly with positive selection, recombination allows for very high rates of evolution <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. However, the impact of recombination is dependent on other forces, such as selection and demography. Developing tools that allow simultaneous simulation of natural selection, recombination and complex demographic patterns will be of great help in trying to better understand the trace left on the DNA by the different interacting evolutionary forces.</p>
         <p>Simulation of biological populations under different evolutionary genetic models can be done following backward or forward strategies. Backward simulations, also called coalescent-based simulations, are computationally very efficient because they are based on the history of lineages with surviving offspring in the current population and ignore all individuals that are not ancestral to the present-day population <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Hence, coalescent is a sample-based theory relevant to the study of population samples and DNA sequence data. From its beginnings, the basic coalescent has been extended in several useful ways. For example, to include structured population models <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, changing population size <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>, recombination <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp> and selection <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>.</p>
         <p>On the contrary, forward simulations are less efficient because the entire population is simulated from past to present. However, the coalescent framework imposes some limitations that forward simulation does not. The first of these is the same feature that causes its efficiency, namely, the coalescent does not keep track of the complete ancestral information i.e. only takes into account ancestries that survived to form the present-day sample. Thus, if the interest is focused on the evolutionary process itself, rather than on its outcome, forward simulations should be preferred <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Second, coalescent simulations are complicated by simple genetic forces such as selection, and although different evolutionary scenarios have been incorporated (see above) it is still difficult to implement models incorporating complex evolutionary situations with selection, variable population size, recombination, complex mating schemes, and so on. In fact, we can only simulate limited forms of recombination and selection under the coalescent. It is known that recombination has a major impact for detecting positive natural selection <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>. Shriner <it>et al </it>studied the impact of recombination under a neutral model. Anisimova <it>et al </it>studied the recombination effect under a coalescent codon-based model i.e. the unit of change was the codon instead of the nucleotide. In the latter case, recombination was not simulated at the intracodon level. Therefore, we still ignore the importance of intracodon recombination under a given codon-based model. Moreover, coalescent methods cannot yet simulate realistic samples of complex human diseases <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Indeed, when simulating non-neutral scenarios and/or complex models under the coalescent, much of its computational efficiency is lost (however, see recent work by Marjoram <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> and Liang <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>). Furthermore, the coalescent model is based on specific limiting values and relationships between some important parameters <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. Hence, there is increasing interest in forward population genetic simulation and new efficient tools have been recently developed <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>. Therefore, a program that allows the simulation forward in time, of different Markovian nucleotide or codon models of DNA mutation combined with recombination, at inter and intra codon levels, fitness-based selection and complex demographic scenarios, will be of great interest. Here I will introduce the program GenomePop that fulfills the mentioned requirements.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <p>GenomePop uses a simple and efficient algorithm to perform forward simulation of populations and/or genomes. The basic idea considers an individual as the differences (mutations) between this individual and a reference or consensus genotype. Thus, each individual is no longer represented by its complete sequence or genotype but by the mutations it carries with respect to the consensus. A more detailed explanation of the algorithm is provided at the program web page. Taking advantage of the efficiency of this approach, GenomePop can simulate, forward in time, DNA sequences under specific Markov models. The program allows the simulation of recombination under both nucleotide and codon models of evolution, providing a way to simulate recombination at inter and intracodon levels under codon models. It also permits arbitrary migration models, simulation of SNPs, recombination hot-spots, fitness-based selection and many other features that are detailed in the program web-page. GenomePop has different output formats as GenePop for SNPs and Phylip or Nexus for DNA sequences.</p>
         <sec>
            <st>
               <p>Markov models of DNA mutation</p>
            </st>
            <p>Markov processes are used in molecular evolution to describe the change between nucleotides, aminoacids or codons over evolutionary time. Usually, time is measured as the number of substitutions because molecular sequence data does not allow the separate estimation of the rate and the time, but only of their product <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. In the context of forward simulation we are not interested in the transition after an arbitrary time <it>t </it>(branch length) but just in the transition from a nucleotide or codon to another, given that a mutation occurs. An advantage of this approach is that we need to compute the transition matrix just once at the beginning of the evolutionary process. Therefore, consider a given instantaneous substitution rate matrix <it>Q</it>, which allows for a complete definition of any Markovian substitution model <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, the matrix <it>M </it>= -<it>qQ </it>+ <it>I </it>is the conditional transition matrix to go from <it>i </it>to <it>j </it>provided that a substitution occurs, where <it>q </it>= diagonal (1/<it>q</it><sub><it>i</it></sub>) and <it>I </it>is the identity matrix <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. Then, given an instantaneous substitution matrix <it>Q</it>, estimated for example using PAUP <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> or Hyphy <abbrgrp><abbr bid="B56">56</abbr></abbrgrp> programs, we can obtain the corresponding transition matrix <it>M </it>that can be used to produce the necessary mutation process in a forward in time evolutionary model.</p>
         </sec>
         <sec>
            <st>
               <p>Biological models</p>
            </st>
            <p>There are two basic biological models implemented in GenomePop, namely "viral" and "non-viral". The only difference that distinguishes them is just that in the viral model the initial sequences are different in each population, as the different viruses infect different individuals. Thus, the user can define a viral model indicating the percentage of sequence identity (0&#8211;100) between the sequences of the distinct populations. By default the sequence identity is zero i.e. the sequences at each population are randomly settled. In the non-viral model the initial sequence is the same for every population (identity of 100%).</p>
         </sec>
         <sec>
            <st>
               <p>DNA models, recombination and selection</p>
            </st>
            <p>There are different DNA models implemented in GenomePop (Table <tblr tid="T1">1</tblr>). In any of them, the user can decide to allow recurrent mutation, i.e. multiple site hits or not. Models can be haploid or diploid. Population size can be constant or variable. In the four-allele models, the sequences can be generated by the program or provided by the user. In the case of the 2-allele model (SNPs) just one or several chromosomes can be considered. In this same model, recombination can be constant or a hot spot recombination model can be defined. In the latter, the recombination rate <it>r </it>is per haploid region and generation. If no hot spots are defined, the expected number of recombination events between any two sites <it>i </it>and <it>j </it>will be 2<it>rd</it><sub>ij</sub>/(<it>L</it>-1) where <it>d</it><sub>ij </sub>is the implied region length and <it>L </it>is the chromosome length. The number of recombination events between the two chromosome extremes 0 and <it>L </it>-1 will be 2<it>rd</it><sub>ij</sub>/(<it>L</it>-1) = 2<it>r</it>. In GenomePop, the effect of natural selection can be modelled in two different ways: 1) by its effects on the <it>dN/dS </it>ratio i.e. by defining a codon model, and 2) via the fitness effect of mutation on specific loci. The user can run either of two models. The codon model option runs a MG94 codon model <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> with a given <it>dN/dS </it>combined with any defined nucleotide model. This model of codon evolution will be implemented by the instantaneous rate matrix to go from codon <it>i </it>to <it>j</it>. That is, <it>Q</it><sub>ij </sub>= <it>&#952;</it><sub><it>mn</it></sub><it>k&#960;</it><sub><it>n </it></sub>where <it>&#952;</it><sub><it>mn </it></sub>accounts for biased nucleotide, <it>m </it>to <it>n </it>substitutions; <it>k </it>= 1 or &#969; for synonymous or nonsynonymous mutation rates respectively and <it>&#960;</it><sub><it>n </it></sub>is the equilibrium frequency of the target nucleotide. This corresponds to the MG94 model <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> with the restriction of &#945; = 1. Nucleotide equilibrium frequencies are used instead of codon frequencies. To simulate a given <it>dN</it>/<it>dS </it>we simply set &#969; = <it>dN</it>/<it>dS</it>. Alternatively, the user can set the codon model option to false (default option) and define specific sites under directional selection with a given selective coefficient which will apply when a mutation occurs at such site. The user can also force all sites to undergo selection. The selection coefficient, <it>s</it>, can be constant or sampled from a gamma distribution with user-defined shape parameter &#946; and scale parameter &#946;/<it>s</it>. The &#946; parameter allows for modelling of the fitness effects distribution, e.g. a low value of &#946; (0.1) will sample many mutations with low effect and few with high. A &#946; parameter of 1 corresponds to the exponential distribution. If we set &#946; to 0 then a constant effect model is applied. Moreover, GenomePop permits the combination of both kinds of models of selection, codon and fitness-based, though the biological meaning of such a mixture is not clear.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>GenomePop DNA models</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>DNA Model</p>
                     </c>
                     <c ca="left">
                        <p>GenomePop Notation</p>
                     </c>
                     <c ca="left">
                        <p>Output format</p>
                     </c>
                     <c ca="left">
                        <p>Recombination</p>
                     </c>
                     <c ca="left">
                        <p>Selective sites</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2 allele</p>
                     </c>
                     <c ca="left">
                        <p>JC2</p>
                     </c>
                     <c ca="left">
                        <p>Genepop</p>
                     </c>
                     <c ca="left">
                        <p>Hot spots</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Jukes Cantor</p>
                     </c>
                     <c ca="left">
                        <p>JC4</p>
                     </c>
                     <c ca="left">
                        <p>Phylip/Nexus</p>
                     </c>
                     <c ca="left">
                        <p>Constant</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GTR</p>
                     </c>
                     <c ca="left">
                        <p>GTR</p>
                     </c>
                     <c ca="left">
                        <p>Phylip/Nexus</p>
                     </c>
                     <c ca="left">
                        <p>Constant</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MG94 &#215; (JC/GTR)</p>
                     </c>
                     <c ca="left">
                        <p>Codon true</p>
                     </c>
                     <c ca="left">
                        <p>Phylip/Nexus</p>
                     </c>
                     <c ca="left">
                        <p>Constant</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>GTR: General Time Reversible Model [63]. MG94: Muse and Gaut [57] codon model.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Migration models</p>
            </st>
            <p>Two basic migration schemes, island model and one-dimensional stepping stone, are pre-defined in GenomePop. However, the user can define any migration model of interest (Figure <figr fid="F1">1</figr>). To do this, set the flow model to 'user' in the standard input file and then just introduce a scheme similar to that of Figure <figr fid="F1">1</figr> in a file called MigrationModel.txt. In this file, the lines beginning with '#' are comments. To indicate how individuals will migrate from a given population just begin the line with the word "pop". The order of appearance of each population in the file will correspond with its index i.e. the first population that appear is the population number one, etc. The number below "pop" refers to the migration level, i.e. the number of different migration rates defined from this population. The next line should begin with a migration rate (between 0 and 1) followed, in the same line, by the target population(s). We should have as many of these kinds of lines as the migration level indicates, i.e. if the migration level is 2 we should have two lines beginning with a migration rate. More detailed explanation and specific examples are given in the program web page.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Example of a user-defined migration model</p>
               </caption>
               <text>
                  <p>
                     <b>Example of a user-defined migration model.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-223-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Scaling</p>
            </st>
            <p>Clearly, the more complex the model defined, the slower the simulation. To avoid high computation times, GenomePop incorporates a scaling option based on the fact that, under neutral models, we can scale the population size <it>N </it>and the time <it>t</it>, provided the consequent correction to the mutation (&#956;), migration (<it>m</it>) and recombination (<it>r</it>) rates holds the corresponding compound products <it>N&#956;</it>, <it>Nr</it>, <it>Nm</it>, etc., constant.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Input file</p>
            </st>
            <p>The input file should be called GenomePopInput.txt. In this file, lines beginning with '#' are comments and will be ignored. In Figure <figr fid="F2">2</figr> we can see an example of an input file. Note that the input is flexible, i.e. the minimum input for GenomePop to work appropriately corresponds to the first line and the values below it. This line must begin with the identifier 'chromsize' and the line below with the corresponding desired values. Note that, in lines with identifiers, only the first word matters for the program.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Input file to generate 100 datasets under a GTR model</p>
               </caption>
               <text>
                  <p>
                     <b>Input file to generate 100 datasets under a GTR model.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-223-2"/>
            </fig>
            <p>Thus, the input in Figure <figr fid="F2">2</figr> generates 100 datasets under a GTR model with substitution rates typical for HIV <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Both recurrent and retromutation are allowed. The system will evolve 1 chromosome of 1 Kb under the given model over 20,000 generations. As can be seen in Figure <figr fid="F2">2</figr>, a scaling of 10 was used, which implies that both, population size and the number of generations, was divided by 10 and mutation was multiplied by the same factor. A more exhaustive explanation of the input facilities of GenomePop is provided at the program web page.</p>
         </sec>
         <sec>
            <st>
               <p>Example and validation of the Markov mutation method</p>
            </st>
            <p>For each obtained dataset from the input in Figure <figr fid="F2">2</figr>, the best-fit model of nucleotide substitution under the Akaike information criteria (AIC) was estimated with Modeltest v3.6 <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, using maximum likelihood (ML) estimates from PAUP* <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. The percentage of correct model estimation (GTR) was 97% although some datasets, about 29%, were also assigned invariable sites or rate heterogeneity among sites. The substitution pattern and equilibrium frequencies were correctly estimated.</p>
         </sec>
         <sec>
            <st>
               <p>Examples and validation of other general features</p>
            </st>
            <p>As GenomePop has many different features and models it is difficult to validate every possibility or circumstance. However, strong effort has been made to validate the program as thoroughly as possible. For example, both unscaled and scaled simulations were performed under a Jukes-Cantor model with diversity &#952; = 4<it>N&#956; </it>= 0.004 over 10<sup>4 </sup>generations and then &#952; was estimated using the finite-sites correction of Watterson &#952; <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. The accuracy was quite good, obtaining estimates of 0.0043 &#177; 0.00015 and 0.0037 &#177; 0.00016 for the unscaled and scaled cases respectively. Recombination was also tested by evolving datasets for 6<it>N </it>generations under a Jukes-Cantor 4-allele model with different values for the parameter &#961; = 4<it>NrL</it>, where <it>N </it>is population size, <it>r </it>is recombination rate per site and <it>L </it>is the DNA sequence length (the corresponding parameter in GenomePop is 'Rec' = <it>r </it>&#215; <it>L</it>). Namely, we ran cases with &#961; equal to 0, 50 and 100. Recombination was then accurately estimated using the program Kpairwise <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. GenomePop allows also studying 2-allele SNPs at different frequencies in different populations. In Figure <figr fid="F3">3</figr> we define a 2-allele model (JC2) with different initial composition at each population (viral model) and 10 independent SNPs (recombination 'Rec' = 10 &#215; 0.5 = 5). The populations have different sizes (100 and 120) and migration occurs under the island model. Note that when defining different population sizes, the original population size provided in the 'chromsize' line under the 'popsizeKmax' identifier is overwritten.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Input file to generate 10 independent SNPs at different frequencies in different populations</p>
               </caption>
               <text>
                  <p>
                     <b>Input file to generate 10 independent SNPs at different frequencies in different populations.</b>
                  </p>
               </text>
               <graphic file="1471-2105-9-223-3"/>
            </fig>
            <p>We ran this example over 200 generations and then analyze the output with the GenePop 4.0 program <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. As expected the SNPs were detected as independent. We then changed the value of recombination to 0 ('Rec' = 0) and then GenePop 4.0 tell us that the 10 SNPs are linked, as expected. Note the many possibilities that the program provides in the context of studying SNPs under complex evolutionary situations. We can define any number of populations under any user-defined migration model. We can set any number of SNPs with the desired linkage relationships. The SNPs can be set at distinct initial frequencies in the different populations, for example, 'SNPfreqs' at 1.0 and 0.0 defines the first population with allele 1 fixed and the second with allele 2 fixed.</p>
         </sec>
         <sec>
            <st>
               <p>Impact of recombination on estimation of positive selection</p>
            </st>
            <p>We performed a simple experiment to test the impact of recombination on <it>dN</it>/<it>dS </it>estimation. We ran 50 replicates, with and without population recombination per gene, 4<it>Nr </it>= 40 and 0, respectively. The runs were performed under a MG94 &#215; JC model both with <it>dN</it>/<it>dS </it>= 1 and <it>dN</it>/<it>dS </it>= 2.5 evolving 333 codons for 10<it>N </it>generations with an effective population size of <it>N </it>= 10<sup>3 </sup>to get samples of 20 sequences. The <it>dN</it>/<it>dS </it>ratio was estimated with the FEL (Fixed effects Likelihood) model of Hyphy <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> which computes global and site by site <it>dN/dS </it>ratio. A <it>p </it>value of 0.1 was used to infer sites under positive selection. As can be seen in Table <tblr tid="T2">2</tblr> a <it>dN/dS </it>of 2.5 provokes the detection of some sites under positive selection (1 or 2, not shown) in only 30% of the replicates (NSS = 0.3 in Table <tblr tid="T2">2</tblr>). Furthermore in the strictly neutral case (<it>dN/dS </it>= 1), one positive selected site was assigned in 10% of the replicates as expected given the <it>p </it>value used. If we correct by this 10% of false positive tests then positive selected sites were detected only in 20% of the replicates under a <it>dN/dS </it>value of 2.5 and no recombination. This is in agreement with the conservative nature of the FEL method <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Also noteworthy is that recombination had no impact on global <it>dN</it>/<it>dS </it>estimation but had important effects on the number of sites detected under positive selection as is evident upon inspecting Table <tblr tid="T2">2</tblr>. It seems also that the effect of intracodon recombination is negligible. Interestingly, it appears that the effect of recombination is somewhat higher under non-neutral <it>dN/dS </it>than in the neutral case. The impact of recombination on positive selection detection has already been studied <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>. However, as far as we know, the comparison of the impact of recombination under neutral or positve <it>dN/dS </it>jointly with the effect of intracodon recombination has never been studied before. The significance of this effect should be studied with more replicates and cases, which is out of the scope of the present work.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Impact of recombination on <it>dN/dS </it>estimation under a Jukes Cantor model.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center">
                        <p>4<it>Nr</it></p>
                     </c>
                     <c ca="center">
                        <p>Expected <it>&#969;</it></p>
                     </c>
                     <c ca="center">
                        <p>Estimated <it>&#969;</it></p>
                     </c>
                     <c ca="center">
                        <p>NPSS</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1.02 &#177; 0.03</p>
                     </c>
                     <c ca="center">
                        <p>0.1 &#177; 0.05</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1.06 &#177; 0.04</p>
                     </c>
                     <c ca="center">
                        <p>9.9 &#177; 0.56</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>40 ncb</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1.01 &#177; 0.03</p>
                     </c>
                     <c ca="center">
                        <p>8.8 &#177; 0.49</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2.5</p>
                     </c>
                     <c ca="center">
                        <p>2.62&#177; 0.12</p>
                     </c>
                     <c ca="center">
                        <p>0.3 &#177; 0.07</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="center">
                        <p>2.5</p>
                     </c>
                     <c ca="center">
                        <p>2.57&#177; 0.11</p>
                     </c>
                     <c ca="center">
                        <p>13.1 &#177; 0.77</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>40 ncb</p>
                     </c>
                     <c ca="center">
                        <p>2.5</p>
                     </c>
                     <c ca="center">
                        <p>2.58&#177; 0.13</p>
                     </c>
                     <c ca="center">
                        <p>12.7 &#177; 0.65</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><it>N</it>: Population size. <it>r </it>= Recombination rate per gene. <it>&#969; </it>= <it>dN</it>/<it>dS</it>. NPSS: Average number of positive selection sites. ncb: no codon break allowed.</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>GenomePop has interesting characteristics for simulating SNPs or DNA sequences under complex models of evolution and demography. These features make it unique with respect to other simulation tools. Namely, the possibility of forward simulation under GTR mutation or GTR &#215; MG94 codon models with intra-codon recombination, simulation of any user-defined migration pattern, diploid or haploid models, constant or variable population sizes, fitness-based selection, etc. Under the 2-allele model it allows the simulation of recombination hot-spots, the definition of different frequencies in different populations, etc. GenomePop can also manage large DNA fragments and has a scaling option to save computation time when simulating large sequences or population sizes under complex demographic and evolutionary situations. It has many other features that are detailed in the web page <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p><b>Project name: </b>GenomePop v. 1.0</p>
         <p>
            <b>Project home page: </b>
            <url>http://webs.uvigo.es/acraaj/GenomePop.htm</url>
         </p>
         <p><b>Operating system(s): </b>Windows and Linux (the source will be provided to compile for Mac)</p>
         <p><b>Programming language: </b>C++</p>
         <p><b>License: </b>GNU GPL.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AC-R had the original idea for the work, designed and implemented the algorithms and wrote the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>I am grateful to A. Caballero, H. Quesada, S.T. Rodr&#237;guez-Ramilo and two anonymous reviewers for discussion and comments on the manuscript. I also want to thank Sergei L Kosakovsky Pond for his help with HYPHY. This work was supported by grant CPE03-004-C2 from Instituto Nacional de Investigaci&#243;n y Tecnolog&#237;a Agraria y Alimentaria (INIA) and from Direcci&#243;n Xeral de Investigaci&#243;n e Desenvolvemento from Xunta de Galicia. AC-R is currently funded by an Isidro Parga Pondal research fellowship from Xunta de Galicia (Spain).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>GenomePop: software to simulate the evolution of genomes and populations</p>
            </title>
            <aug>
               <au>
                  <snm>Carvajal-Rodr&#237;guez</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <url>http://webs.uvigo.es/acraaj/GenomePop.htm</url>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Molecular clock-like evolution of human immunodeficiency virus type 1</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nickle</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Shriner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Gerald</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Learn</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mittler</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Mullins</snm>
                  <fnm>JI</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>2004</pubdate>
            <volume>329</volume>
            <fpage>101</fpage>
            <lpage>108</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.virol.2004.08.014</pubid>
                  <pubid idtype="pmpid" link="fulltext">15476878</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Waiting times for the appearance of cytotoxic T-lymphocyte escape mutants in chronic HIV-1 infection</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mullins</snm>
                  <fnm>JI</fnm>
               </au>
               <au>
                  <snm>Mittler</snm>
                  <fnm>JE</fnm>
               </au>
            </aug>
            <source>Virology</source>
            <pubdate>2006</pubdate>
            <volume>347</volume>
            <issue>1</issue>
            <fpage>140</fpage>
            <lpage>146</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.virol.2005.11.036</pubid>
                  <pubid idtype="pmpid" link="fulltext">16387340</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Accumulation of deleterious mutations: Additional Drosophila melanogaster estimates and a simulation of the effects of selection</p>
            </title>
            <aug>
               <au>
                  <snm>Caballero</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cusi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Garcia</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Garcia-Dorado</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Evolution</source>
            <pubdate>2002</pubdate>
            <volume>56</volume>
            <issue>6</issue>
            <fpage>1150</fpage>
            <lpage>1159</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12144016</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Quantitative variation as a tool for detecting human-induced impacts on genetic diversity</p>
            </title>
            <aug>
               <au>
                  <snm>Carvajal-Rodriguez</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rolan-Alvarez</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Caballero</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Biological Conservation</source>
            <pubdate>2005</pubdate>
            <volume>124</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.biocon.2004.12.008</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Forward-Time Simulations of Human Populations with Complex Diseases</p>
            </title>
            <aug>
               <au>
                  <snm>Peng</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Amos</snm>
                  <fnm>CI</fnm>
               </au>
               <au>
                  <snm>Kimmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <issue>3</issue>
            <fpage>e47</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1829403</pubid>
                  <pubid idtype="pmpid" link="fulltext">17381243</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0030047</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Simulations provide support for the common disease-common variant hypothesis</p>
            </title>
            <aug>
               <au>
                  <snm>Peng</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kimmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2007</pubdate>
            <volume>175</volume>
            <issue>2</issue>
            <fpage>763</fpage>
            <lpage>776</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1800600</pubid>
                  <pubid idtype="pmpid" link="fulltext">17151262</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.058164</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Inference of genome-wide mutation rates and distributions of mutation effects for fitness traits: a simulation study</p>
            </title>
            <aug>
               <au>
                  <snm>Keightley</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1998</pubdate>
            <volume>150</volume>
            <issue>3</issue>
            <fpage>1283</fpage>
            <lpage>1293</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460396</pubid>
                  <pubid idtype="pmpid" link="fulltext">9799279</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Nonequilibrium migration in human history</p>
            </title>
            <aug>
               <au>
                  <snm>Wakeley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1999</pubdate>
            <volume>153</volume>
            <issue>4</issue>
            <fpage>1863</fpage>
            <lpage>1871</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460876</pubid>
                  <pubid idtype="pmpid" link="fulltext">10581291</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The coalescent in an island model of population subdivision with variation among demes</p>
            </title>
            <aug>
               <au>
                  <snm>Wakeley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Theor Popul Biol</source>
            <pubdate>2001</pubdate>
            <volume>59</volume>
            <issue>2</issue>
            <fpage>133</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/tpbi.2000.1495</pubid>
                  <pubid idtype="pmpid" link="fulltext">11302758</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Islands of linkage disequilibrium</p>
            </title>
            <aug>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>109</fpage>
            <lpage>111</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1001-109</pubid>
                  <pubid idtype="pmpid" link="fulltext">11586289</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The effect of single-nucleotide polymorphism marker selection on patterns of haplotype blocks and haplotype frequency estimates</p>
            </title>
            <aug>
               <au>
                  <snm>Nothnagel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rohde</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2005</pubdate>
            <volume>77</volume>
            <issue>6</issue>
            <fpage>988</fpage>
            <lpage>998</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1285181</pubid>
                  <pubid idtype="pmpid" link="fulltext">16380910</pubid>
                  <pubid idtype="doi">10.1086/498175</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The International HapMap Project</p>
            </title>
            <aug>
               <au>
                  <cnm>International-HapMap-Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>426</volume>
            <issue>6968</issue>
            <fpage>789</fpage>
            <lpage>796</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02168</pubid>
                  <pubid idtype="pmpid" link="fulltext">14685227</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>A haplotype map of the human genome</p>
            </title>
            <aug>
               <au>
                  <cnm>International-HapMap-Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <issue>7063</issue>
            <fpage>1299</fpage>
            <lpage>1320</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1880871</pubid>
                  <pubid idtype="pmpid" link="fulltext">16255080</pubid>
                  <pubid idtype="doi">10.1038/nature04226</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A second generation human haplotype map of over 3.1 million SNPs</p>
            </title>
            <aug>
               <au>
                  <cnm>International-HapMap-Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>449</volume>
            <issue>7164</issue>
            <fpage>851</fpage>
            <lpage>861</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06258</pubid>
                  <pubid idtype="pmpid" link="fulltext">17943122</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Meiotic recombination hot spots and human DNA diversity</p>
            </title>
            <aug>
               <au>
                  <snm>Jeffreys</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Holloway</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Kauppi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Neumann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Slingsby</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>2004</pubdate>
            <volume>359</volume>
            <issue>1441</issue>
            <fpage>141</fpage>
            <lpage>152</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1693298</pubid>
                  <pubid idtype="pmpid" link="fulltext">15065666</pubid>
                  <pubid idtype="doi">10.1098/rstb.2003.1372</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Strong correlation between meiotic crossovers and haplotype structure in a 2.5-Mb region on the long arm of chromosome 21</p>
            </title>
            <aug>
               <au>
                  <snm>Greenawalt</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tereshchenko</snm>
                  <fnm>IV</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>JY</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Azaro</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Decoste</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Chimge</snm>
                  <fnm>NO</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Shih</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Lange</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <issue>2</issue>
            <fpage>208</fpage>
            <lpage>214</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1361716</pubid>
                  <pubid idtype="pmpid" link="fulltext">16385099</pubid>
                  <pubid idtype="doi">10.1101/gr.4641706</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Haplotype block structures show significant variation among populations</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sawyer</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Mukherjee</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pakstis</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>KK</fnm>
               </au>
               <au>
                  <snm>Brookes</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genet Epidemiol</source>
            <pubdate>2004</pubdate>
            <volume>27</volume>
            <issue>4</issue>
            <fpage>385</fpage>
            <lpage>400</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/gepi.20026</pubid>
                  <pubid idtype="pmpid" link="fulltext">15389924</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Linkage disequilibrium: what history has to tell us</p>
            </title>
            <aug>
               <au>
                  <snm>Nordborg</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tavare</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>2</issue>
            <fpage>83</fpage>
            <lpage>90</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(02)02557-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">11818140</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Genealogical trees, coalescent theory and the analysis of genetic polymorphisms</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenberg</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Nordborg</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>5</issue>
            <fpage>380</fpage>
            <lpage>390</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg795</pubid>
                  <pubid idtype="pmpid" link="fulltext">11988763</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Estimating recombination rates from population-genetic data</p>
            </title>
            <aug>
               <au>
                  <snm>Stumpf</snm>
                  <fnm>MPH</fnm>
               </au>
               <au>
                  <snm>McVean</snm>
                  <fnm>GAT</fnm>
               </au>
            </aug>
            <source>Nature Reviews Genetics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>959</fpage>
            <lpage>968</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1227</pubid>
                  <pubid idtype="pmpid" link="fulltext">14631356</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Gene genealogies, variation and evolution : a primer in coalescent theory</p>
            </title>
            <aug>
               <au>
                  <snm>Hein</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wiuf</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schierup</snm>
                  <fnm>MH</fnm>
               </au>
            </aug>
            <publisher>Oxford , Oxford University Press</publisher>
            <pubdate>2005</pubdate>
            <fpage>XIII, 276 s.</fpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Prospects for whole-genome linkage disequilibrium mapping of common disease genes</p>
            </title>
            <aug>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1999</pubdate>
            <volume>22</volume>
            <fpage>139</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/9642</pubid>
                  <pubid idtype="pmpid" link="fulltext">10369254</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Linkage disequilibrium in humans: models and data</p>
            </title>
            <aug>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Przeworski</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2001</pubdate>
            <volume>69</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1226024</pubid>
                  <pubid idtype="pmpid" link="fulltext">11410837</pubid>
                  <pubid idtype="doi">10.1086/321275</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The fine-scale structure of recombination rate variation in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>McVean</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Deloukas</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bentley</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <issue>5670</issue>
            <fpage>581</fpage>
            <lpage>584</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1092500</pubid>
                  <pubid idtype="pmpid" link="fulltext">15105499</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Significant variation in haplotype block structure but conservation in tagSNP patterns among global populations</p>
            </title>
            <aug>
               <au>
                  <snm>Gu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pakstis</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>KK</fnm>
               </au>
            </aug>
            <source>Eur J Hum Genet</source>
            <pubdate>2007</pubdate>
            <volume>15</volume>
            <issue>3</issue>
            <fpage>302</fpage>
            <lpage>312</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.ejhg.5201751</pubid>
                  <pubid idtype="pmpid" link="fulltext">17202997</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Genome evolution: recombination speeds up adaptive evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Marais</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>2</issue>
            <fpage>R68</fpage>
            <lpage>70</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-9822(02)01432-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">12546809</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Coalescence time for two genes from a subdivided population</p>
            </title>
            <aug>
               <au>
                  <snm>Bahlo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>J Math Biol</source>
            <pubdate>2001</pubdate>
            <volume>43</volume>
            <issue>5</issue>
            <fpage>397</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002850100104</pubid>
                  <pubid idtype="pmpid">11767204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Inference from gene trees in a subdivided population</p>
            </title>
            <aug>
               <au>
                  <snm>Bahlo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Theor Popul Biol</source>
            <pubdate>2000</pubdate>
            <volume>57</volume>
            <issue>2</issue>
            <fpage>79</fpage>
            <lpage>95</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/tpbi.1999.1447</pubid>
                  <pubid idtype="pmpid" link="fulltext">10792974</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Maximum likelihood estimation of a migration matrix and efective population sizes in n subpopulations by using a coalescent approach</p>
            </title>
            <aug>
               <au>
                  <snm>Beerli</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences, USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <issue>8</issue>
            <fpage>4563</fpage>
            <lpage>4568</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.081068098</pubid>
                  <pubid idtype="pmpid" link="fulltext">11287657</pubid>
                  <pubid idtype="pmcid">31874</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The coalescent and the genealogical process in geographically structured population</p>
            </title>
            <aug>
               <au>
                  <snm>Notohara</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Math Biol</source>
            <pubdate>1990</pubdate>
            <volume>29</volume>
            <fpage>59</fpage>
            <lpage>75</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00173909</pubid>
                  <pubid idtype="pmpid">2277236</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Genealogy and subpopulation differentiation under various models of population structure</p>
            </title>
            <aug>
               <au>
                  <snm>Wilkinson-Herbots</snm>
                  <fnm>HM</fnm>
               </au>
            </aug>
            <source>J Math Biol</source>
            <pubdate>1998</pubdate>
            <volume>37</volume>
            <issue>6</issue>
            <fpage>535</fpage>
            <lpage>585</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s002850050140</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Sampling theory for neutral alleles in a varying environment</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Tavare</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Philosophical Transactions of the Royal Society of London, Series B</source>
            <pubdate>1994</pubdate>
            <volume>344</volume>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1098/rstb.1994.0079</pubid>
                  <pubid idtype="pmpid" link="fulltext">7800710</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A classification of coalescent processes for haploid exchangeable population models</p>
            </title>
            <aug>
               <au>
                  <snm>Mohle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sagitov</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Annals of Probability</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>4</issue>
            <fpage>1547</fpage>
            <lpage>1562</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1214/aop/1015345761</pubid>
                  <pubid idtype="pmpid">7800710</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The effect of change in population size on DNA polymorphism</p>
            </title>
            <aug>
               <au>
                  <snm>Tajima</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1989</pubdate>
            <volume>123</volume>
            <fpage>597</fpage>
            <lpage>601</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1203832</pubid>
                  <pubid idtype="pmpid" link="fulltext">2599369</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>A coalescent estimator of the population recombination rate</p>
            </title>
            <aug>
               <au>
                  <snm>Hey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wakeley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1997</pubdate>
            <volume>145</volume>
            <fpage>833</fpage>
            <lpage>846</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1207867</pubid>
                  <pubid idtype="pmpid" link="fulltext">9055092</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The coalescent process in models with selection and recombination</p>
            </title>
            <aug>
               <au>
                  <snm>Hudson</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Kaplan</snm>
                  <fnm>NL</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1988</pubdate>
            <volume>120</volume>
            <fpage>831</fpage>
            <lpage>840</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1203560</pubid>
                  <pubid idtype="pmpid" link="fulltext">3147214</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>The coalescent process in models with selection</p>
            </title>
            <aug>
               <au>
                  <snm>Kaplan</snm>
                  <fnm>NL</fnm>
               </au>
               <au>
                  <snm>Darden</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hudson</snm>
                  <fnm>RR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1988</pubdate>
            <volume>120</volume>
            <fpage>819</fpage>
            <lpage>829</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1203559</pubid>
                  <pubid idtype="pmpid" link="fulltext">3066685</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Ancestral processes with selection</p>
            </title>
            <aug>
               <au>
                  <snm>Krone</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Neuhauser</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Theor Popul Biol</source>
            <pubdate>1997</pubdate>
            <volume>51</volume>
            <issue>3</issue>
            <fpage>210</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/tpbi.1997.1299</pubid>
                  <pubid idtype="pmpid" link="fulltext">9245777</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The genealogy of samples in models with selection</p>
            </title>
            <aug>
               <au>
                  <snm>Neuhauser</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Krone</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1997</pubdate>
            <volume>145</volume>
            <fpage>519</fpage>
            <lpage>534</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1207815</pubid>
                  <pubid idtype="pmpid" link="fulltext">9071604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Likelihoods and simulation methods for a class of nonneutral population genetics models</p>
            </title>
            <aug>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Nordborg</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Joyce</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2001</pubdate>
            <volume>159</volume>
            <issue>2</issue>
            <fpage>853</fpage>
            <lpage>867</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461835</pubid>
                  <pubid idtype="pmpid" link="fulltext">11606558</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Coalescence in a random background</p>
            </title>
            <aug>
               <au>
                  <snm>Barton</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Etheridge</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Sturm</snm>
                  <fnm>AK</fnm>
               </au>
            </aug>
            <source>Annals of Applied Probability</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>2</issue>
            <fpage>754</fpage>
            <lpage>785</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1214/105051604000000099</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Perfect simulation from nonneutral population genetic models: Variable population size and population subdivision</p>
            </title>
            <aug>
               <au>
                  <snm>Fearnhead</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>174</volume>
            <issue>3</issue>
            <fpage>1397</fpage>
            <lpage>1406</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1667080</pubid>
                  <pubid idtype="pmpid" link="fulltext">16951070</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.060681</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Haplotype evolution and linkage disequilibrium: A simulation study</p>
            </title>
            <aug>
               <au>
                  <snm>Calafell</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Grigorenko</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Chikanian</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Kidd</snm>
                  <fnm>KK</fnm>
               </au>
            </aug>
            <source>Hum Hered</source>
            <pubdate>2001</pubdate>
            <volume>51</volume>
            <issue>1-2</issue>
            <fpage>85</fpage>
            <lpage>96</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1159/000022963</pubid>
                  <pubid idtype="pmpid" link="fulltext">11096275</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Effect of Recombination on the Accuracy of the Likelihood Method for Detecting Positive Selection at Amino Acid Sites</p>
            </title>
            <aug>
               <au>
                  <snm>Anisimova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>164</volume>
            <issue>3</issue>
            <fpage>1229</fpage>
            <lpage>1236</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462615</pubid>
                  <pubid idtype="pmpid" link="fulltext">12871927</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Potential impact of recombination on sitewise approaches for detecting positive natural selection</p>
            </title>
            <aug>
               <au>
                  <snm>Shriner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nickle</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Mullins</snm>
                  <fnm>JI</fnm>
               </au>
            </aug>
            <source>Genet Res</source>
            <pubdate>2003</pubdate>
            <volume>81</volume>
            <fpage>115</fpage>
            <lpage>121</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0016672303006128</pubid>
                  <pubid idtype="pmpid">12872913</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Fast "coalescent" simulation</p>
            </title>
            <aug>
               <au>
                  <snm>Marjoram</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wall</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>BMC Genet</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>16</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1458357</pubid>
                  <pubid idtype="pmpid" link="fulltext">16539698</pubid>
                  <pubid idtype="doi">10.1186/1471-2156-7-16</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>GENOME: a rapid coalescent-based whole genome simulator</p>
            </title>
            <aug>
               <au>
                  <snm>Liang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zollner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Abecasis</snm>
                  <fnm>GR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>12</issue>
            <fpage>1565</fpage>
            <lpage>1567</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm138</pubid>
                  <pubid idtype="pmpid" link="fulltext">17459963</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>The limits of theoretical population genetics</p>
            </title>
            <aug>
               <au>
                  <snm>Wakeley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>169</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1448894</pubid>
                  <pubid idtype="pmpid" link="fulltext">15677744</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>EASYPOP (version 1.7): a computer program for population genetics simulations</p>
            </title>
            <aug>
               <au>
                  <snm>Balloux</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>J Hered</source>
            <pubdate>2001</pubdate>
            <volume>92</volume>
            <issue>3</issue>
            <fpage>301</fpage>
            <lpage>302</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/jhered/92.3.301</pubid>
                  <pubid idtype="pmpid" link="fulltext">11447253</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>simuPOP: a forward-time population genetics simulation environment</p>
            </title>
            <aug>
               <au>
                  <snm>Peng</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kimmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>18</issue>
            <fpage>3686</fpage>
            <lpage>3687</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti584</pubid>
                  <pubid idtype="pmpid" link="fulltext">16020469</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Nemo: an evolutionary and population genetics programming framework</p>
            </title>
            <aug>
               <au>
                  <snm>Guillaume</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Rougemont</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>20</issue>
            <fpage>2556</fpage>
            <lpage>2557</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl415</pubid>
                  <pubid idtype="pmpid" link="fulltext">16882649</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Adaptive Molecular Evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Balding</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bishop</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <cnm>Cannings</cnm>
               </au>
            </aug>
            <source>Handbook of Statistical Genetics</source>
            <publisher> Wiley J. and Sons Ltd.</publisher>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B54">
            <title>
               <p>A second course in stochastic processes</p>
            </title>
            <aug>
               <au>
                  <snm>Karlin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>HM</fnm>
               </au>
            </aug>
            <publisher>New York , Academic Press</publisher>
            <pubdate>1981</pubdate>
            <fpage>XVIII, 542 s.</fpage>
         </bibl>
         <bibl id="B55">
            <title>
               <p>PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods).</p>
            </title>
            <aug>
               <au>
                  <snm>Swofford</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <publisher>Sunderland, Massachusetts , Sinauer Associates</publisher>
            <edition>4</edition>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B56">
            <title>
               <p>HyPhy: hypothesis testing using phylogenies</p>
            </title>
            <aug>
               <au>
                  <snm>Kosakovsky Pond</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Frost</snm>
                  <fnm>SDW</fnm>
               </au>
               <au>
                  <snm>Muse</snm>
                  <fnm>SV</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>5</issue>
            <fpage>676</fpage>
            <lpage>679</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti079</pubid>
                  <pubid idtype="pmpid" link="fulltext">15509596</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome</p>
            </title>
            <aug>
               <au>
                  <snm>Muse</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Gaut</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <issue>5</issue>
            <fpage>715</fpage>
            <lpage>724</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7968485</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Recombination Estimation under Complex Evolutionary Models with the Coalescent Composite Likelihood Method</p>
            </title>
            <aug>
               <au>
                  <snm>Carvajal-Rodriguez</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Crandall</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Posada</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <issue>4</issue>
            <fpage>817</fpage>
            <lpage>827</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1949848</pubid>
                  <pubid idtype="pmpid" link="fulltext">16452117</pubid>
                  <pubid idtype="doi">10.1093/molbev/msj102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Modeltest: testing the model of DNA substitution</p>
            </title>
            <aug>
               <au>
                  <snm>Posada</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Crandall</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>9</issue>
            <fpage>817</fpage>
            <lpage>818</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.817</pubid>
                  <pubid idtype="pmpid" link="fulltext">9918953</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>A coalescent based-method for detecting and estimating recombination from gene sequences</p>
            </title>
            <aug>
               <au>
                  <snm>McVean</snm>
                  <fnm>GAT</fnm>
               </au>
               <au>
                  <snm>Awadalla</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Fearnhead</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2002</pubdate>
            <volume>160</volume>
            <fpage>1231</fpage>
            <lpage>1241</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462015</pubid>
                  <pubid idtype="pmpid" link="fulltext">11901136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>GENEPOP (version 1.2): population genetics software for exact tests and ecumenicism</p>
            </title>
            <aug>
               <au>
                  <snm>Raymond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rousset</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>J Heredity</source>
            <pubdate>1995</pubdate>
            <volume>86</volume>
            <fpage>248</fpage>
            <lpage>249</lpage>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Not so different after all: a comparison of methods for detecting amino acid sites under selection</p>
            </title>
            <aug>
               <au>
                  <snm>Kosakovsky Pond</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Frost</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>5</issue>
            <fpage>1208</fpage>
            <lpage>1222</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi105</pubid>
                  <pubid idtype="pmpid" link="fulltext">15703242</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>The general stochastic model of nucleotide substitution</p>
            </title>
            <aug>
               <au>
                  <snm>Rodr&#237;guez</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Mar&#237;n</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Medina</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>J Theor Biol</source>
            <pubdate>1990</pubdate>
            <volume>142</volume>
            <fpage>485</fpage>
            <lpage>501</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-5193(05)80104-3</pubid>
                  <pubid idtype="pmpid">2338834</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
