<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2156-6-S1-S24</ui>
   <ji>1471-2156</ji>
   <fm>
      <dochead>Proceedings</dochead>
      <bibl>
         <title>
            <p>Resampling methods to reduce the selection bias in genetic effect estimation in genome-wide scans</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Wu</snm>
               <mnm>Yang</mnm>
               <fnm>Long</fnm>
               <insr iid="I1"/>
               <email>lwu@mshri.on.ca</email>
            </au>
            <au id="A2">
               <snm>Lee</snm>
               <mi>SF</mi>
               <fnm>Sophia</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>slee@mshri.on.ca</email>
            </au>
            <au id="A3">
               <snm>Shi</snm>
               <mnm>Steven</mnm>
               <fnm>Haijiang</fnm>
               <insr iid="I1"/>
               <email>steven.shi@ices.on.ca</email>
            </au>
            <au id="A4">
               <snm>Sun</snm>
               <fnm>Lei</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>sun@utstat.toronto.edu</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Bull</snm>
               <mi>B</mi>
               <fnm>Shelley</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>bull@mshri.on.ca</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Samuel Lunenfeld Research Institute, Mount Sinai Hospital, 600 University Avenue, Toronto, Ontario, Canada M5G 1X5</p>
            </ins>
            <ins id="I2">
               <p>Department of Public Health Sciences, University of Toronto, 12 Queen's Park Crescent West, Toronto, Ontario, Canada M5S 1A8</p>
            </ins>
            <ins id="I3">
               <p>Hospital for Sick Children, 555 University Avenue, Toronto, Ontario, Canada M5G 1X8</p>
            </ins>
         </insg>
         <source>BMC Genetics</source>
         <supplement>
            <title>
               <p>Genetic Analysis Workshop 14: Microsatellite and single-nucleotide polymorphism</p>
            </title>
            <editor>Joan E Bailey-Wilson, Laura Almasy, Mariza de Andrade, Julia Bailey, Heike Bickeb&#246;ller, Heather J Cordell, E Warwick Daw, Lynn Goldin, Ellen L Goode, Courtney Gray-McGuire, Wayne Hening, Gail Jarvik, Brion S Maher, Nancy Mendell, Andrew D Paterson, John Rice, Glen Satten, Brian Suarez, Veronica Vieland, Marsha Wilcox, Heping Zhang, Andreas Ziegler and Jean W MacCluer</editor>
            <note>Proceedings</note>
         </supplement>
         <conference>
            <title>
               <p>Genetic Analysis Workshop 14: Microsatellite and single-nucleotide polymorphism</p>
            </title>
            <location>Noordwijkerhout, The Netherlands</location>
            <date-range>7-10 September 2004</date-range>
            <url>http://www.gaworkshop.org/</url>
         </conference>
         <issn>1471-2156</issn>
         <pubdate>2005</pubdate>
         <volume>6</volume>
         <issue>Suppl 1</issue>
         <fpage>S24</fpage>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16451633</pubid>
               <pubid idtype="doi">10.1186/1471-2156-6-S1-S24</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>30</day>
               <month>12</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Wu et al; licensee BioMed Central Ltd</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Using the simulated data of Problem 2 for Genetic Analysis Workshop 14 (GAW14), we investigated the ability of three bootstrap-based resampling estimators (a shrinkage, an out-of-sample, and a weighted estimator) to reduce the selection bias for genetic effect estimation in genome-wide linkage scans. For the given marker density in the preliminary genome scans (7 cM for microsatellite and 3 cM for SNP), we found that the two sets of markers produce comparable results in terms of power to detect linkage, localization accuracy, and magnitude of test statistic at the peak location. At the locations detected in the scan, application of the three bootstrap-based estimators substantially reduced the upward selection bias in genetic effect estimation for both true and false positives. The relative effectiveness of the estimators depended on the true genetic effect size and the inherent power to detect it. The shrinkage estimator is recommended when the power to detect the disease locus is low. Otherwise, the weighted estimator is recommended.</p>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>After a genetic marker or candidate gene has been identified from a genome-wide scan as a putative disease susceptibility locus, it is of interest to estimate the associated genetic effect on the related phenotype. However, locus-specific effect estimates are subject to upward selection bias because of stringent test criteria adopted in genome-wide scans. G&#246;ring et al. <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> formally raised this issue and argued that reliable locus-specific parameter estimates can only be obtained in an independent sample. Sun and Bull <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> proposed three resampling-based estimators that can be applied to the original sample at the location where the maximum test statistic exceeds a genome-wide significance criterion. They demonstrated effective bias reduction in analytic and simulation studies of a homogenous population with a single disease gene. In their simulation studies, they compared a catalog of resampling methods, including cross-validation and bootstrapping, and their results suggested that bootstrap methods perform best in terms of smaller mean squared error. Therefore, we focused on the bootstrap method in the current study.</p>
         <p>The simulated data of Problem 2 for Genetic Analysis Workshop 14 (GAW14) provided a microsatellite marker map of 416 markers with a resolution of 7 cM and a denser single-nucleotide polymorphism (SNP) marker map of 917 markers with 3-cM density. The disease expression was under the influence of multiple genes in a complex manner. We compared performance of the two maps in multipoint linkage analysis in terms of power and localization accuracy. The main objective of this study was to further investigate the effectiveness of bootstrap resampling methods in reducing the bias of genetic effect estimates in genome-wide linkage scans. The new methods, were applied to both the microsatellite and SNP data for selected replicates. With the knowledge of the answers to the simulated data, we were able to investigate the performance of the new methods under stratification of true and false positives.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>To evaluate the power to detect linkage, we conducted multipoint analyses in all the 100 replicates using ALLEGRO <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. There were four populations Aipotu (AI), Danacaa (DA), Karangar (KA), and New York (NY) in each replicate. AI, DA, and KA included only nuclear families, while NY had multigeneration extended pedigrees. Because some of the large NY families (size > 25 bits) required too much execution time to complete the analysis in a reasonable time, the NY population was excluded.</p>
         <p>We adopted the exponential allele-sharing model of Kong and Cox <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and used <it>Spair </it>as the scoring function for affected relatives. The genetic effect was measured by <it>&#948; </it>the excess identity-by-descent (IBD) allele-sharing parameter in this model. The genome-wide significance criterion was set to 2.2 &#215; 10<sup>-5 </sup><abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, corresponding to a <it>Zlr </it>value of 4.09, where <it>Zlr </it>is the test statistic for linkage in the exponential model. In each replicate of the three populations, we identified all loci that met the significance criterion.</p>
         <p>We implemented a simple bootstrapping method in this study. Suppose that the original dataset has <it>n </it>families; we repeatedly drew random samples of size <it>n </it>with replacement from it. In each bootstrap replication <it>b </it>(<it>b </it>= 1, ..., <it>B</it>), the selected families constitute the detection sample, and the remaining families (out-of-sample families) comprise the estimation sample, thus providing independence within each of the <it>B </it>resampling replications. To reduce the upward selection bias in the genetic effect estimates of &#948; we implemented three bootstrap-based estimators <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>: a shrinkage estimator defined by <graphic file="1471-2156-6-S1-S24-i1.gif"/>, an out-of-sample estimator <graphic file="1471-2156-6-S1-S24-i2.gif"/>, and a weighted estimator <graphic file="1471-2156-6-S1-S24-i3.gif"/>, where <it>&#969; </it>= 0.632 was analogous to Efron's 0.632 estimator <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <p>We first obtained the na&#239;ve estimate, <graphic file="1471-2156-6-S1-S24-i4.gif"/>, at location <it>m</it><sub><it>D</it></sub>, where the maximum test statistic exceeded the significance criterion in the original data. Note that the location <it>m</it><sub><it>D </it></sub>was the overall gene localization and the three bootstrap-based estimators were then applied only to genetic effect estimation at this location. The shrinkage estimator was constructed by reducing the na&#239;ve estimate <graphic file="1471-2156-6-S1-S24-i4.gif"/> by a shrinkage factor of <graphic file="1471-2156-6-S1-S24-i5.gif"/>, which was constructed by taking the average of the difference between <graphic file="1471-2156-6-S1-S24-i6.gif"/> and <graphic file="1471-2156-6-S1-S24-i7.gif"/> over <it>B</it>* bootstrap replications, with <it>B* </it>&#8804; <it>B</it>, where <it>B* </it>is the number of replications with significant results. In bootstrap replication <it>b</it>, <graphic file="1471-2156-6-S1-S24-i6.gif"/> is the genetic effect estimate at location <graphic file="1471-2156-6-S1-S24-i8.gif"/> with the maximum significant genome-wide test statistic in the detection sample; <graphic file="1471-2156-6-S1-S24-i7.gif"/> is the genetic effect estimate at the same location <graphic file="1471-2156-6-S1-S24-i8.gif"/> in the estimation sample. Note that <graphic file="1471-2156-6-S1-S24-i8.gif"/> could be different from <it>m</it><sub><it>D</it></sub>. The out-of-sample estimator was the average of <graphic file="1471-2156-6-S1-S24-i7.gif"/> at location <graphic file="1471-2156-6-S1-S24-i8.gif"/> in the estimation sample over <it>B</it>* bootstrap replications. It resembles the estimate that would have been obtained in an independent sample. The weighted estimator combined <graphic file="1471-2156-6-S1-S24-i4.gif"/> and <graphic file="1471-2156-6-S1-S24-i2.gif"/> with the weight of <it>&#969;</it>. The weight was chosen to be 0.632, which was derived from a distance argument based on the fact that bootstrap samples are supported by about 0.632<it>n </it>of the original families <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. Note that the weighted estimator can also be written as <graphic file="1471-2156-6-S1-S24-i9.gif"/>. Therefore, it can be considered as a variant of shrinkage estimator, with the amount of shrinkage depending on <it>&#969; </it>and <graphic file="1471-2156-6-S1-S24-i10.gif"/>. Although an adaptive choice of the weight is attractive, as in the 0.632+ method <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, time constraints precluded its inclusion in this study.</p>
         <p>Bias reduction of the three estimators was compared according to whether the localization was a true or false positive. We classified significant findings in the 100 replicates into true or false positives, according to the answers (disease loci D1 and D2 on chromosomes 1 and 3 for the AI, KA, and DA populations, and disease loci D3 and D4 on chromosomes 5 and 9 for AI and KA). A true positive was defined if the detection was within 10 cM of the true disease gene location. The true genetic effects were estimated by averaging corresponding estimates from all 100 replicates.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>Averaging over all 100 replicates, the genome scans based on microsatellite markers at 7-cM density yielded similar performance in power and in accuracy of location estimates to those based on SNP markers at 3-cM density (Table <tblr tid="T1">1</tblr>). The power to detect disease gene loci varied among populations (Table <tblr tid="T1">1</tblr>), and the DA population generally had the highest power among the three populations.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Comparison of linkage analysis results between microsatellite and SNP based genome scans</p>
            </caption>
            <tblbdy cols="9">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Location estimates (mode)</p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Power to detect linkage</p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Mean test statistic</p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pop</p>
                  </c>
                  <c ca="center">
                     <p>Chr.</p>
                  </c>
                  <c ca="center">
                     <p>True location (cM)</p>
                  </c>
                  <c ca="right">
                     <p>MS (cM)</p>
                  </c>
                  <c ca="right">
                     <p>SNP (cM)</p>
                  </c>
                  <c ca="right">
                     <p>MS</p>
                  </c>
                  <c ca="right">
                     <p>SNP</p>
                  </c>
                  <c ca="right">
                     <p>MS</p>
                  </c>
                  <c ca="right">
                     <p>SNP</p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>AI</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="right">
                     <p>168.98</p>
                  </c>
                  <c ca="right">
                     <p>169.85</p>
                  </c>
                  <c ca="right">
                     <p>167.4</p>
                  </c>
                  <c ca="right">
                     <p>4/100</p>
                  </c>
                  <c ca="right">
                     <p>7/100</p>
                  </c>
                  <c ca="right">
                     <p>4.33</p>
                  </c>
                  <c ca="right">
                     <p>4.56</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="right">
                     <p>299.32</p>
                  </c>
                  <c ca="right">
                     <p>293.61</p>
                  </c>
                  <c ca="right">
                     <p>295.6</p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>22/100<sup>a</sup></b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>24/100</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>4.79</p>
                  </c>
                  <c ca="right">
                     <p>4.77</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="right">
                     <p>5.45</p>
                  </c>
                  <c ca="right">
                     <p>7.34</p>
                  </c>
                  <c ca="right">
                     <p>5.94</p>
                  </c>
                  <c ca="right">
                     <p>11/100</p>
                  </c>
                  <c ca="right">
                     <p>11/100</p>
                  </c>
                  <c ca="right">
                     <p>4.53</p>
                  </c>
                  <c ca="right">
                     <p>4.63</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="right">
                     <p>5.88</p>
                  </c>
                  <c ca="right">
                     <p>4.78</p>
                  </c>
                  <c ca="right">
                     <p>5.54</p>
                  </c>
                  <c ca="right">
                     <p>10/100</p>
                  </c>
                  <c ca="right">
                     <p>9/100</p>
                  </c>
                  <c ca="right">
                     <p>4.39</p>
                  </c>
                  <c ca="right">
                     <p>4.38</p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>KA</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="right">
                     <p>168.98</p>
                  </c>
                  <c ca="right">
                     <p>169.97</p>
                  </c>
                  <c ca="right">
                     <p>168.0</p>
                  </c>
                  <c ca="right">
                     <p>2/100</p>
                  </c>
                  <c ca="right">
                     <p>2/100</p>
                  </c>
                  <c ca="right">
                     <p>4.07</p>
                  </c>
                  <c ca="right">
                     <p>4.24</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="right">
                     <p>299.32</p>
                  </c>
                  <c ca="right">
                     <p>293.75</p>
                  </c>
                  <c ca="right">
                     <p>295.9</p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>14/100</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>17/100</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>4.41</p>
                  </c>
                  <c ca="right">
                     <p>4.77</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="right">
                     <p>5.45</p>
                  </c>
                  <c ca="right">
                     <p>7.01</p>
                  </c>
                  <c ca="right">
                     <p>5.69</p>
                  </c>
                  <c ca="right">
                     <p>20/100</p>
                  </c>
                  <c ca="right">
                     <p>40/100</p>
                  </c>
                  <c ca="right">
                     <p>4.67</p>
                  </c>
                  <c ca="right">
                     <p>4.71</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="right">
                     <p>5.88</p>
                  </c>
                  <c ca="right">
                     <p>6.80</p>
                  </c>
                  <c ca="right">
                     <p>5.96</p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>52/100</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>45/100</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>4.95</p>
                  </c>
                  <c ca="right">
                     <p>4.92</p>
                  </c>
               </r>
               <r>
                  <c cspan="9">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>DA</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="right">
                     <p>168.98</p>
                  </c>
                  <c ca="right">
                     <p>169.95</p>
                  </c>
                  <c ca="right">
                     <p>168.8</p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>82/100</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>89/100</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>5.20</p>
                  </c>
                  <c ca="right">
                     <p>5.43</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="right">
                     <p>299.32</p>
                  </c>
                  <c ca="right">
                     <p>293.62</p>
                  </c>
                  <c ca="right">
                     <p>295.7</p>
                  </c>
                  <c ca="right">
                     <p>48/100</p>
                  </c>
                  <c ca="right">
                     <p>56/100</p>
                  </c>
                  <c ca="right">
                     <p>4.89</p>
                  </c>
                  <c ca="right">
                     <p>4.77</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>a </sup>Bold text indicates the cases in which linkage was detected in replicate 1 with genome-wide significance using either the microsatellite or the SNP markers (as reported in Table 2).</p>
            </tblfn>
         </tbl>
         <p>For the microsatellite marker analysis, we used replicates 1 and 35 to illustrate the application of the three bootstrap-based estimators (Table <tblr tid="T2">2</tblr>). Replicate 1 was used for our initial genome scan. Replicate 35 was chosen because it contained an unambiguous false positive for the DA population on chromosome 6, after the chromosome containing the locus with highest test statistic (i.e., a true positive) was removed. We confirmed that the na&#239;ve estimates overestimated the true genetic effects. In this example, the most severe overestimation occurred at the false positive location. Figure <figr fid="F1">1</figr> depicts the biases of na&#239;ve and bootstrap-based estimates at various levels of true genetic effect. The bootstrap-based estimates were less biased than the na&#239;ve estimate for both true and false positives. When the true genetic effect was relatively large and power was high, such as the location at 169.97 cM on chromosome 1 of DA population (Table <tblr tid="T2">2</tblr>), the three estimators gave roughly the same genetic effect estimate and had low bias. When the true genetic effect was moderate, such as the significant loci on chromosome 3 of AI population and chromosomes 5 and 9 of KA population, the three estimates were different. The shrinkage estimates overcorrected, while the weighted estimates were least biased. In the case of a false positive, i.e., the significant locus at 192.09 cM on chromosome 6 of DA population, the shrinkage estimate gave the best result in terms of bias, followed by the out-of-sample estimate and the weighted estimate.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Bias comparisons of the na&#239;ve estimate and the three resampling-based estimates for microsatellite markers</p>
            </caption>
            <text>
               <p>Bias comparisons of the na&#239;ve estimate and the three resampling-based estimates for microsatellite markers.</p>
            </text>
            <graphic file="1471-2156-6-S1-S24-1"/>
         </fig>
         <tbl id="T2">
            <title>
               <p>Table 2</p>
            </title>
            <caption>
               <p>Comparison of linkage analysis results and genetic effect estimates for the na&#239;ve and three bootstrap estimates using microsatellite markers and SNPs.</p>
            </caption>
            <tblbdy cols="10">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="4" ca="center">
                     <p>Bootstrap estimate (bias)</p>
                  </c>
               </r>
               <r>
                  <c cspan="10">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Replicate</p>
                  </c>
                  <c ca="center">
                     <p>Population</p>
                  </c>
                  <c ca="center">
                     <p>Chromosome</p>
                  </c>
                  <c ca="center">
                     <p>Highest peak (cM)</p>
                  </c>
                  <c ca="center">
                     <p>True genetic effect</p>
                  </c>
                  <c ca="center">
                     <p>T/F positive</p>
                  </c>
                  <c ca="center">
                     <p>
                        <graphic file="1471-2156-6-S1-S24-i4.gif"/>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <graphic file="1471-2156-6-S1-S24-i3.gif"/>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <graphic file="1471-2156-6-S1-S24-i2.gif"/>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <graphic file="1471-2156-6-S1-S24-i1.gif"/>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="10">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Microsatellite Markers</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>DA</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>169.97</p>
                  </c>
                  <c ca="center">
                     <p>0.49 &#177; 0.10</p>
                  </c>
                  <c ca="center">
                     <p>T</p>
                  </c>
                  <c ca="center">
                     <p>0.55 (0.06)</p>
                  </c>
                  <c ca="center">
                     <p>0.50 (0.01)</p>
                  </c>
                  <c ca="center">
                     <p>0.48 (-0.01)</p>
                  </c>
                  <c ca="center">
                     <p>0.47 (-0.03)</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>AI</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>294.68</p>
                  </c>
                  <c ca="center">
                     <p>0.33 &#177; 0.16</p>
                  </c>
                  <c ca="center">
                     <p>T</p>
                  </c>
                  <c ca="center">
                     <p>0.43 (0.10)</p>
                  </c>
                  <c ca="center">
                     <p>0.31 (-0.01)</p>
                  </c>
                  <c ca="center">
                     <p>0.25 (-0.08)</p>
                  </c>
                  <c ca="center">
                     <p>0.19 (-0.13)</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>KA</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="center">
                     <p>2.76</p>
                  </c>
                  <c ca="center">
                     <p>0.43 &#177; 0.11</p>
                  </c>
                  <c ca="center">
                     <p>T</p>
                  </c>
                  <c ca="center">
                     <p>0.49 (0.06)</p>
                  </c>
                  <c ca="center">
                     <p>0.40 (-0.04)</p>
                  </c>
                  <c ca="center">
                     <p>0.34 (-0.09)</p>
                  </c>
                  <c ca="center">
                     <p>0.24 (-0.19)</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>35</p>
                  </c>
                  <c ca="center">
                     <p>DA</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>192.09</p>
                  </c>
                  <c ca="center">
                     <p>-0.01 &#177; 0.11</p>
                  </c>
                  <c ca="center">
                     <p>F</p>
                  </c>
                  <c ca="center">
                     <p>0.41 (0.42)</p>
                  </c>
                  <c ca="center">
                     <p>0.26 (0.27)</p>
                  </c>
                  <c ca="center">
                     <p>0.17 (0.18)</p>
                  </c>
                  <c ca="center">
                     <p>0.10 (0.11)</p>
                  </c>
               </r>
               <r>
                  <c cspan="10">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>SNP Markers</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>DA</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>168.94</p>
                  </c>
                  <c ca="center">
                     <p>0.53 &#177; 0.10</p>
                  </c>
                  <c ca="center">
                     <p>T</p>
                  </c>
                  <c ca="center">
                     <p>0.53 (0.00)</p>
                  </c>
                  <c ca="center">
                     <p>0.49 (-0.04)</p>
                  </c>
                  <c ca="center">
                     <p>0.46 (-0.07)</p>
                  </c>
                  <c ca="center">
                     <p>0.43 (-0.10)</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>AI</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>304.58</p>
                  </c>
                  <c ca="center">
                     <p>0.33 &#177; 0.11</p>
                  </c>
                  <c ca="center">
                     <p>T</p>
                  </c>
                  <c ca="center">
                     <p>0.41 (0.08)</p>
                  </c>
                  <c ca="center">
                     <p>0.25 (-0.08)</p>
                  </c>
                  <c ca="center">
                     <p>0.16 (-0.17)</p>
                  </c>
                  <c ca="center">
                     <p>0.07 (-0.26)</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>KA</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>305.81</p>
                  </c>
                  <c ca="center">
                     <p>0.29 &#177; 0.11</p>
                  </c>
                  <c ca="center">
                     <p>T</p>
                  </c>
                  <c ca="center">
                     <p>0.57 (0.28)</p>
                  </c>
                  <c ca="center">
                     <p>0.48 (0.19)</p>
                  </c>
                  <c ca="center">
                     <p>0.43 (0.14)</p>
                  </c>
                  <c ca="center">
                     <p>0.33 (0.04)</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>27</p>
                  </c>
                  <c ca="center">
                     <p>DA</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="center">
                     <p>200.12</p>
                  </c>
                  <c ca="center">
                     <p>0.02 &#177; 0.13</p>
                  </c>
                  <c ca="center">
                     <p>F</p>
                  </c>
                  <c ca="center">
                     <p>0.45 (0.43)</p>
                  </c>
                  <c ca="center">
                     <p>0.32 (0.30)</p>
                  </c>
                  <c ca="center">
                     <p>0.24 (0.22)</p>
                  </c>
                  <c ca="center">
                     <p>0.16 (0.14)</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>67</p>
                  </c>
                  <c ca="center">
                     <p>DA</p>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>214.33</p>
                  </c>
                  <c ca="center">
                     <p>-0.01 &#177; 0.11</p>
                  </c>
                  <c ca="center">
                     <p>F</p>
                  </c>
                  <c ca="center">
                     <p>0.42 (0.43)</p>
                  </c>
                  <c ca="center">
                     <p>0.27 (0.28)</p>
                  </c>
                  <c ca="center">
                     <p>0.18 (0.19)</p>
                  </c>
                  <c ca="center">
                     <p>0.11 (0.12)</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>*Mean &#177; SD over 100 replicates</p>
            </tblfn>
         </tbl>
         <p>The bias reductions for the SNP marker analysis are presented in Table <tblr tid="T2">2</tblr>. We report results for replicates 1, 27, and 67. After the chromosomes with highest test statistic (true positives) were removed, we found two false positives at chromosome 9 (replicate 27) and chromosome 5 (replicate 67) for DA population. The bias reduction pattern was similar to the pattern in microsatellite markers (Figure <figr fid="F2">2</figr>). Note that selection bias in the na&#239;ve estimates was similar for microsatellite and SNP analysis, despite having twice as many markers for the latter.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Bias comparisons of the na&#239;ve estimate and the three resampling-based estimates for SNP markers</p>
            </caption>
            <text>
               <p>Bias comparisons of the na&#239;ve estimate and the three resampling-based estimates for SNP markers.</p>
            </text>
            <graphic file="1471-2156-6-S1-S24-2"/>
         </fig>
         <p>The bootstrap-based estimators reduced the upward selection bias in genetic effect estimation for both microsatellite and SNP based linkage analysis. The performance of the three estimators differed according to true or false positive status. The shrinkage estimator had the smallest bias for false positives but over-corrected for the true positives. On the other hand, the weighted estimator had the smallest bias for true positives but under-corrected for the false positives. It has been shown that the bias depends on the power to detect linkage <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. In these examples from the simulated data, we found that the shrinkage estimator had lower bias when the power was less than 20%. Otherwise, the weighted estimator provided lower bias.</p>
         <p>In this study, our bootstrap estimators focused on genetic effect estimation for the most significant locus in a genome scan, without considering other loci that also exceeded genome-wide significance criteria. However, the underlying genetic model has multiple loci. Further research is warranted to construct a joint estimator that would simultaneously handle multiple significant loci and thereby extend bias-reduction methods to more general settings.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The reliability of gene detection, the accuracy of locus-specific effect estimates, and the failure to replicate initial claims of linkage or association have emerged as major concerns in genome-wide studies. Estimation of the genetic effect for a specific locus in a genome-wide scan is subject to upward bias because of selection by strict significance criteria. This bias is most severe for locations with small genetic effect and low power. Our results indicate that, in a complex disease setting, the three bootstrap-based estimators appear to be effective in reducing the selection bias of the na&#239;ve estimator. The shrinkage estimator is recommended when the power to detect the disease loci is low. Otherwise, the weighted estimator is recommended.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>GAW: Genetic Analysis Workshop</p>
         <p>IBD: Identity by descent</p>
         <p>SNP: Single-nucleotide polymorphism</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>LYW implemented the bootstrap methods and drafted the manuscript. SSFL assisted in preparing the manuscript for publication. HSS conducted the genome-wide MS and SNP scans. LS and SBB developed the bootstrap estimators and assisted in revising the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This research was supported by research grants from the Canadian Institutes of Health Research (CIHR) and the Network of Centres of Excellence in Mathematics (MITACS). LS and SBB also received support from the Natural Sciences and Engineering Research Council (Canada). SBB holds a CIHR Senior Investigator Award.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Large upward bias in estimation of locus-specific effects from genomewide scans</p>
            </title>
            <aug>
               <au>
                  <snm>G&#246;ring</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Terwilliger</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Blangero</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2001</pubdate>
            <volume>69</volume>
            <fpage>1357</fpage>
            <lpage>1369</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/324471</pubid>
                  <pubid idtype="pmpid" link="fulltext">11593451</pubid>
                  <pubid idtype="pmcid">1235546</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Reduction of selection bias in genomewide studies by resampling</p>
            </title>
            <aug>
               <au>
                  <snm>Sun</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bull</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Genet Epidemiol</source>
            <pubdate>2005</pubdate>
            <volume>28</volume>
            <fpage>352</fpage>
            <lpage>367</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/gepi.20068</pubid>
                  <pubid idtype="pmpid" link="fulltext">15761913</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Allegro, a new computer program for multipoint linkage analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Gudbjartsson</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Jonasson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Frigge</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kong</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>12</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75514</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802644</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Allele-sharing models: LOD scores and accurate linkage tests</p>
            </title>
            <aug>
               <au>
                  <snm>Kong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1997</pubdate>
            <volume>61</volume>
            <fpage>1179</fpage>
            <lpage>1188</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/301592</pubid>
                  <pubid idtype="pmpid" link="fulltext">9345087</pubid>
                  <pubid idtype="pmcid">1376655</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genetic dissection of complex traits guidelines for interpreting and reporting linkage results</p>
            </title>
            <aug>
               <au>
                  <snm>Lander</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1995</pubdate>
            <volume>11</volume>
            <fpage>241</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/301592</pubid>
                  <pubid idtype="pmpid" link="fulltext">9345087</pubid>
                  <pubid idtype="pmcid">1376655</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Estimating the error rate of a prediction rule: some improvements on cross-validation</p>
            </title>
            <aug>
               <au>
                  <snm>Efron</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Am Statist Assoc</source>
            <pubdate>1983</pubdate>
            <volume>78</volume>
            <fpage>316</fpage>
            <lpage>331</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2288636</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Improvements on cross-validation: the .632+ bootstrap method</p>
            </title>
            <aug>
               <au>
                  <snm>Efron</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tibshirani</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Am Statist Assoc</source>
            <pubdate>1997</pubdate>
            <volume>92</volume>
            <fpage>548</fpage>
            <lpage>560</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2965703</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

