<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2156-6-7</ui>
   <ji>1471-2156</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>Comparative linkage analysis and visualization of high-density oligonucleotide SNP array data</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Leykin</snm>
               <fnm>Igor</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>ileykin@hsph.harvard.edu</email>
            </au>
            <au id="A2">
               <snm>Hao</snm>
               <fnm>Ke</fnm>
               <insr iid="I1"/>
               <email>khao@hsph.harvard.edu</email>
            </au>
            <au id="A3">
               <snm>Cheng</snm>
               <fnm>Junsheng</fnm>
               <insr iid="I3"/>
               <email>cjsuicedu@yahoo.com</email>
            </au>
            <au id="A4">
               <snm>Meyer</snm>
               <fnm>Nicole</fnm>
               <insr iid="I4"/>
               <email>nic-meyer@uiowa.edu</email>
            </au>
            <au id="A5">
               <snm>Pollak</snm>
               <mi>R</mi>
               <fnm>Martin</fnm>
               <insr iid="I5"/>
               <email>mpollak@rics.bwh.harvard.edu</email>
            </au>
            <au id="A6">
               <snm>Smith</snm>
               <mi>JH</mi>
               <fnm>Richard</fnm>
               <insr iid="I4"/>
               <email>richard-smith@uiowa.edu</email>
            </au>
            <au id="A7">
               <snm>Wong</snm>
               <mnm>Hung</mnm>
               <fnm>Wing</fnm>
               <insr iid="I6"/>
               <email>wwong@hsph.harvard.edu</email>
            </au>
            <au id="A8" ca="yes">
               <snm>Rosenow</snm>
               <fnm>Carsten</fnm>
               <insr iid="I7"/>
               <email>crosenow@rocketmail.com</email>
            </au>
            <au id="A9" ca="yes">
               <snm>Li</snm>
               <fnm>Cheng</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>cli@hsph.harvard.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Biostatistical Science, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA</p>
            </ins>
            <ins id="I4">
               <p>Molecular Otolaryngology Research Labs, University of Iowa, Iowa City, IA 52242, USA</p>
            </ins>
            <ins id="I5">
               <p>Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA</p>
            </ins>
            <ins id="I6">
               <p>Department of Statistics, Stanford University, Stanford, CA, 94305, USA</p>
            </ins>
            <ins id="I7">
               <p>Affymetrix Inc., 3380 Central Expressway, Santa Clara, CA 95051, USA</p>
            </ins>
         </insg>
         <source>BMC Genetics</source>
         <issn>1471-2156</issn>
         <pubdate>2005</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>7</fpage>
         <url>http://www.biomedcentral.com/1471-2156/6/7</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15713228</pubid>
               <pubid idtype="doi">10.1186/1471-2156-6-7</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>06</day>
               <month>7</month>
               <year>2004</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>15</day>
               <month>2</month>
               <year>2005</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>15</day>
               <month>2</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Leykin et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The identification of disease-associated genes using single nucleotide polymorphisms (SNPs) has been increasingly reported. In particular, the Affymetrix Mapping 10 K SNP microarray platform uses one PCR primer to amplify the DNA samples and determine the genotype of more than 10,000 SNPs in the human genome. This provides the opportunity for large scale, rapid and cost-effective genotyping assays for linkage analysis. However, the analysis of such datasets is nontrivial because of the large number of markers, and visualizing the linkage scores in the context of genome maps remains less automated using the current linkage analysis software packages. For example, the haplotyping results are commonly represented in the text format.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here we report the development of a novel software tool called CompareLinkage for automated formatting of the Affymetrix Mapping 10 K genotype data into the "Linkage" format and the subsequent analysis with multi-point linkage software programs such as Merlin and Allegro. The new software has the ability to visualize the results for all these programs in dChip in the context of genome annotations and cytoband information. In addition we implemented a variant of the Lander-Green algorithm in the dChipLinkage module of dChip software (V1.3) to perform parametric linkage analysis and haplotyping of SNP array data. These functions are integrated with the existing modules of dChip to visualize SNP genotype data together with LOD score curves. We have analyzed three families with recessive and dominant diseases using the new software programs and the comparison results are presented and discussed.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The CompareLinkage and dChipLinkage software packages are freely available. They provide the visualization tools for high-density oligonucleotide SNP array data, as well as the automated functions for formatting SNP array data for the linkage analysis programs Merlin and Allegro and calling these programs for linkage analysis. The results can be visualized in dChip in the context of genes and cytobands. In addition, a variant of the Lander-Green algorithm is provided that allows parametric linkage analysis and haplotyping.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The oligonucleotide Mapping 10 K arrays <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> have been used for linkage analysis <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp> and their advantages in genome coverage and information content compared to microsatellite-based assays has been demonstrated. The array contains 11,550 SNPs with an average heterozygosity rate of 0.32 and an average marker distance of 0.31 cM. However, the commonly used multi-point linkage analysis software packages such as GeneHunter <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp> and Merlin <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> are command-line programs and it is not straightforward to find genes in the regions of high linkage scores. In addition, the haplotyping results are represented commonly in a text format without any gene context.</p>
         <p>Here we report the development of a new software tool called CompareLinkage that can be used for automated conversion of Mapping 10 K genotype data into the "Linkage" format for linkage analysis in Merlin, GeneHunter and Allegro <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. In addition the program can convert the pedigree information and SNP marker information into the "Linkage" format. After performing the linkage analysis using one or more of these programs, the CompareLinkage software can export the linkage score information into the dChip software <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp> to visualize the results within a chromosome window. In addition, we implemented a variant of the Lander-Green <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B12">12</abbr></abbrgrp> algorithm into the dChipLinkage module to analyze pedigrees with up to 18 bits (bits = 2n-f ; with n = number of non-founders and f = number of founders) using the parametric linkage analysis method. We are currently testing and validating the implementation of the algorithm which will be described in detail elsewhere. The linkage score curves, genotypes and haplotypes are graphically displayed in a dChip chromosome window which has the genes, cytoband and SNP marker information included. Together the CompareLinkage and dChip software programs provide for the first time a graphical user interface (GUI) and an automated procedure for comparative linkage analysis utilizing three commonly used linkage software programs.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <sec>
            <st>
               <p>The CompareLinkage software for comparative linkage analysis using Merlin and Allegro</p>
            </st>
            <p>To analyze large pedigrees rapidly and to compare the linkage analysis results of different software packages, we developed a software tool called CompareLinkage to automate the following processes: (1) Converting of Affymetrix Mapping 10 K genotype data, pedigree files and marker information into the "Linkage" format <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, and detecting and fixing incompatibilities in pedigree genotypes. The input genotype text file for CompareLinkage can be a single text file containing genotypes for each sample or a combined text file as exported by the Affymetrix GDAS 3.0 software. (2) Automatically calling the software packages Merlin and Allegro for linkage analysis and converting the analysis results (LOD or non-parametric linkage (NPL) scores) into the input files for dChip to visualize the results in the context of genes and cytobands. (3) The SNP genotype data in the "Linkage" format can be converted into the dChip input files (genotype, pedigree and marker information files) to perform parametric linkage analysis by dChipLinkage. All steps are discussed in detail at the CompareLinkage software manual provided on the software website. All these functionalities are useful for cross-validation of linkage results and to identify concordance and discordances between different linkage analysis programs as well as between parametric and non-parametric linkage results.</p>
            <p>A graphical user interface (GUI) for Windows was also implemented in Java. In this GUI users are allowed to set their own working directory and the location of the Perl interpreter through the "Setting" menu. CompareLinkage's functions of converting file formats and getting dChip input files are incorporated through the "Convert" and "GetCurve" menu (Figure <figr fid="F1">1</figr>). Since computing is usually time-consuming, the code of calling the Perl program is executed in separate thread to provide better interaction. The output of the Perl program can be viewed in the message window (Figure <figr fid="F2">2</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The CompareLinkage GUI dialog for choosing pedigree, genome information and genotype call files</p>
               </caption>
               <text>
                  <p>The CompareLinkage GUI dialog for choosing pedigree, genome information and genotype call files.</p>
               </text>
               <graphic file="1471-2156-6-7-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The intermediate output of the CompareLinkage GUI</p>
               </caption>
               <text>
                  <p>The intermediate output of the CompareLinkage GUI.</p>
               </text>
               <graphic file="1471-2156-6-7-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The dChipLinkage software module</p>
            </st>
            <p>The Affymetrix Mapping 10 K array CEL files and genotype TXT files can be imported into dChip and visualized along cytobands and genes as previously reported <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B11">11</abbr></abbrgrp>. The information of the SNPs such as their genetic and physical distance and allele frequencies from three ethnic groups (Asian, African American and Caucasian) is obtained from the Affymetrix website <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and converted into the genome information files for dChip. The information of the reference genes and cytobands is obtained from the UCSC genome bioinformatics database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> for the matching human genome assembly (hg12 or hg15) of the SNP information, and is organized into the refGene and cytoband file provided with dChip.</p>
            <p>We implemented a variant of the Lander-Green <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B16">16</abbr></abbrgrp> algorithm in the dChipLinkage module of dChip to perform multipoint parametric linkage analysis and compute a LOD score at each SNP position. Disease allele frequencies, penetrance information and phenocopy information for dominant and recessive disease models can be selected by the user through a dialog (Figure <figr fid="F3">3</figr>). The Mendelian genotype errors inconsistent with parental genotypes are detected and set to missing genotypes. To handle other genotyping errors or wrongly mapped SNP markers, we assume a conservative genotyping error rate of 0.01 <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> (user adjustable) and regard observed genotypes as phenotypes in the likelihood computation <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. As a result, the computation of the probability of the observed genotype data at one marker given an inheritance vector <it>v </it>involves the summation over all the possible real genotypes (or equivalently the founder allele configurations):</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>The dChipLinkage dialog for specifying linkage analysis parameters</p>
               </caption>
               <text>
                  <p>The dChipLinkage dialog for specifying linkage analysis parameters.</p>
               </text>
               <graphic file="1471-2156-6-7-3"/>
            </fig>
            <p>
               <graphic file="1471-2156-6-7-i1.gif"/>
            </p>
            <p>where F<sub>i </sub>represents the <it>i</it>th of all the possible founder allele configurations and is independent of <it>v</it>. <it>P</it>(real genotypes <it>i </it>| <it>v</it>, F<sub>i</sub>) is 1 since an inheritance vector and founder allele configuration uniquely determines the real genotypes, and <it>P</it>(observed genotypes | real genotypes <it>i</it>) involves comparing the real genotype and observed genotype for all the individuals and multiplying the probability by the error rate of 0.01 (default value) for each disagreement and 0.99 for each agreement. We also use the matrix-vector multiplication algorithm and bit reduction due to founder phase symmetry described in <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, and the founder allele factoring technique reported in <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B17">17</abbr></abbrgrp> to speed up the computation of single-locus and accumulative likelihood vectors as well as the likelihood vector of disease phenotypes.</p>
            <p>We use the forward-backward computation in the Lander-Green algorithm to obtain the marginal probability distribution of inheritance vector at each SNP marker position given the data of all the markers on a chromosome. In addition the most likely inheritance vector at each marker given the genotype data of all the markers on this chromosome is calculated <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Conditioned on the most likely inheritance vector at a marker and the observed genotype data, we can find the most likely founder allele configurations. When there are competing inheritance vectors with the same largest marginal probabilities at a marker, we select the one with fewer crossover events from the last marker since the distance between adjacent markers are small (average 300 kb) and it is therefore less likely to have multiple crossover events between two markers in a pedigree <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Together these procedures give the haplotyping results of the pedigree data. dChipLinkage visualizes the haplotyping result in either the haplotype view or the ordered genotype view.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>The comparative linkage analysis using Merlin, GeneHunter, Allegro and dChipLinkage</p>
            </st>
            <p>CompareLinkage can format Affymetrix Mapping 10 K SNP genotype output files and genotype files into the "Linkage" format and convert genome information and pedigree files into the formats suitable for Merlin (Version 0.10.2), GeneHunter (Version 2.1) and Allegro (Version 1.2). CompareLinkage removes all non-informative markers and calls the PedCheck software <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> to detect genotype incompatibilities using the pedigree information. A Mendelian genotype inconsistency at a SNP is handled by setting the genotype of this SNP in all the individuals to missing. For the analysis in GeneHunter, overlapping segments of large chromosomes are prepared, with each segment containing 150 or fewer markers with 75 markers in common between adjacent segments. Linkage scores are computed as the mean of two scores for the same marker from the two overlapping fragments. We ran genome-wide linkage analysis using all the three software packages and dChipLinkage for the 10 K SNP genotype data of three families: 5026.10 (Figure <figr fid="F4">4</figr>; autosomal recessive non-syndromic deafness disease, 13 bits, Asian), CR (Figure <figr fid="F5">5</figr>; recessive, 17 bits, Asian) and ER (Figure <figr fid="F6">6</figr>; dominant, 17 bits, Caucasian). For the parametric analysis, we use a disease frequency of 0.001, a penetrance value of 0.99 and a phenocopy of 0.01 for all the families and all the software packages. For GeneHunter and Allegro we ran both nonparametric and parametric analysis. For Merlin, the NPL_all statistic is computed. The allele frequencies are calculated based on the actual genotype data in each family. The LOD score or NPL score are computed at the position of the SNP makers. After running the analysis for all chromosomes, the two chromosomes with the largest LOD scores were selected from each pedigree and compared below.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The pedigree structure of family 5026.10</p>
               </caption>
               <text>
                  <p>The pedigree structure of family 5026.10. The PED 4.2 software is used to draw the pedigrees.</p>
               </text>
               <graphic file="1471-2156-6-7-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>The pedigree structure of family CR</p>
               </caption>
               <text>
                  <p>The pedigree structure of family CR</p>
               </text>
               <graphic file="1471-2156-6-7-5"/>
            </fig>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>The pedigree structure of family ER</p>
               </caption>
               <text>
                  <p>The pedigree structure of family ER.</p>
               </text>
               <graphic file="1471-2156-6-7-6"/>
            </fig>
            <p>Figures <figr fid="F7">7</figr>, <figr fid="F8">8</figr>, <figr fid="F9">9</figr>, <figr fid="F10">10</figr>, <figr fid="F11">11</figr>, <figr fid="F12">12</figr> show the comparative LOD score and nonparametric score plots in dChip for these chromosomes analyzed with GeneHunter, Merlin, Allegro and dChipLinkage. The vertical red line in the figures indicates the significance threshold and is set to 3 for parametric analysis (LOD scores) and to 3.7 for non-parametric analysis (NPL score) based on statistical significance recommended by Lander and Kruglyak <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. The linkage scores largely agree with each other in the regions with significant LOD/NPL scores. GeneHunter, Merlin and Allegro detect the peaks in the chromosome 1 and 3 of the consanguineous family 5026.10 but compute lower LOD scores than dChipLinkage (indicated by arrows in Figure <figr fid="F7">7</figr> and <figr fid="F8">8</figr>). For another consanguineous family CR with a recessive disease, all software packages detect similar peak regions in the two chromosomes denoted as A and B (Figure <figr fid="F9">9</figr> and <figr fid="F10">10</figr>). For the family ER with a dominant disease, dChipLinkage computes similar overall patterns but reports a possible sporadic and non-significant peak (LOD &lt; 1.6) in each chromosome (indicated by arrows in Figure <figr fid="F11">11</figr> and <figr fid="F12">12</figr>).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>The comparative linkage results of the chromosome 1 of the family 5026.10 using CompareLinkage and dChipLinkage</p>
               </caption>
               <text>
                  <p>The comparative linkage results of the chromosome 1 of the family 5026.10 using CompareLinkage and dChipLinkage. The genotype calls are displayed on the left in yellow (AB), red (AA) and blue (BB), with SNPs on rows and samples on columns. The sample names and the disease status (1 = Unaffected and 2 = Affected) are displayed on the top. The linkage scores of different software are displayed on the right in the shaded box. The lower and upper limits of the shaded box (such as [-10, 6]) are in the brackets on the bottom of the curve. The red vertical line indicates the threshold of 3.0 for LOD scores and 3.7 for NPL scores. This line is user adjustable.</p>
               </text>
               <graphic file="1471-2156-6-7-7"/>
            </fig>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>The comparative linkage results of the chromosome 3 from the family 5026.10</p>
               </caption>
               <text>
                  <p>The comparative linkage results of the chromosome 3 from the family 5026.10. The figure format is the same as Figure 7.</p>
               </text>
               <graphic file="1471-2156-6-7-8"/>
            </fig>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>The comparative linkage results of the chromosome A of the family CR</p>
               </caption>
               <text>
                  <p>The comparative linkage results of the chromosome A of the family CR.</p>
               </text>
               <graphic file="1471-2156-6-7-9"/>
            </fig>
            <fig id="F10">
               <title>
                  <p>Figure 10</p>
               </title>
               <caption>
                  <p>The comparative linkage results of the chromosome B of the family CR</p>
               </caption>
               <text>
                  <p>The comparative linkage results of the chromosome B of the family CR.</p>
               </text>
               <graphic file="1471-2156-6-7-10"/>
            </fig>
            <fig id="F11">
               <title>
                  <p>Figure 11</p>
               </title>
               <caption>
                  <p>The comparative linkage results of the chromosome A of the family ER</p>
               </caption>
               <text>
                  <p>The comparative linkage results of the chromosome A of the family ER.</p>
               </text>
               <graphic file="1471-2156-6-7-11"/>
            </fig>
            <fig id="F12">
               <title>
                  <p>Figure 12</p>
               </title>
               <caption>
                  <p>The comparative linkage results of the chromosome B of the family ER</p>
               </caption>
               <text>
                  <p>The comparative linkage results of the chromosome B of the family ER.</p>
               </text>
               <graphic file="1471-2156-6-7-12"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Linkage analysis and visualization using dChipLinkage</p>
            </st>
            <p>To do parametric linkage analysis in dChipLinkage, a pedigree file is needed (Figure <figr fid="F13">13C</figr>). The file is similar to the standard pedigree file format but has an additional "Array" column matching each individual in the pedigree file to the corresponding genotype information in the genotype file through array names (the header line in Figure <figr fid="F13">13A</figr>). The data importing and analysis steps are:</p>
            <fig id="F13">
               <title>
                  <p>Figure 13</p>
               </title>
               <caption>
                  <p>The genotype file (A), genome information file (B) and pedigree file (C) used by dChipLinkage for analysis</p>
               </caption>
               <text>
                  <p>The genotype file (A), genome information file (B) and pedigree file (C) used by dChipLinkage for analysis.</p>
               </text>
               <graphic file="1471-2156-6-7-13"/>
            </fig>
            <p>1. Open dChip.</p>
            <p>2. Select the <b><it>Analysis </it></b>menu and the <b><it>Get External Data </it></b>function to read in the genotype file in the text format (Figure <figr fid="F13">13A</figr>).</p>
            <p>3. Select the genome information file downloaded from the dChip website (Figure <figr fid="F13">13B</figr>). This file is provided in three versions, each containing the SNP information like TSC SNP ID and genetic map locations but having different allele frequencies for each of the three ethnic groups (Asian, Caucasian and African Americans).</p>
            <p>4. Select the <b><it>Analysis </it></b>menu and the <b><it>Chromosome </it></b>function to display the genotype calls, genes and cytobands along the chromosome</p>
            <p>5. After the program has displayed the genotype data, select the <b><it>Chromosome </it></b>menu and the <b><it>Linkage </it></b>function to start the dChipLinkage module (Figure <figr fid="F3">3</figr>). Specify the pedigree file (Figure <figr fid="F13">13C</figr>) and other linkage parameters. Depending on whether the dChip "<b><it>Chromosome View</it></b>" displays one or all chromosomes, the linkage analysis will be performed for one or all chromosomes accordingly. For the analysis of the 5026.10 family, the recessive disease model is assumed, and a penetrance of 0.99, phenocopy of 0.01, disease allele frequency of 0.001 and a SNP marker error rate of 0.01 are used. The SNP allele frequencies in the genome information file are used and truncated to values between 0.001 and 0.999. This family has 13 bits and it takes about 20 minutes for the whole genome linkage analysis.</p>
            <p>Using dChipLinkage to analyze the 5026.10 family, we were able to identify a region on the chromosome 1 (Cytogenetic region: 1p36.32 &#8211; 1p36.22) with LOD scores of greater than 2.3 (Figure <figr fid="F7">7</figr>, indicated by arrow). The most interesting gene in this region is ESPIN, which has previously been shown to be involved in deafness in mice <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and two frameshift mutations in the gene have just recently been associated with deafness in two consanguineous families <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Sequence analysis of the locus revealed that the parents (the individual 1 and 2 in Figure <figr fid="F4">4</figr>) and the unaffected child (the individual 6) are heterozygous for the insertion mutation and the affected children are homozygous (data not shown). In addition a novel locus with a maximum LOD score of 2.77 was identified on the chromosome 3 (Figure <figr fid="F8">8</figr>, indicated by arrow). The peak region on the chromosome 3 is about 2 Mb wide (Figure <figr fid="F14">14C</figr>). Using the GeneHunter software, we compute a maximum expected LOD score of 2.78 for this family under the specified parameters. Therefore we extract the most linkage information based on the dense SNP markers in this region. Figure <figr fid="F14">14</figr> shows the LOD score curve together with genotype calls, inferred haplotypes and ordered genotypes based on haplotyping. In Figure <figr fid="F15">15</figr> the results are presented in the context of cytobands and genes. The <b><it>Chromosome/Export SNP data </it></b>function can also export the text information of the SNPs, genes and cytobands in the region with linkage scores exceeding the threshold.</p>
            <fig id="F14">
               <title>
                  <p>Figure 14</p>
               </title>
               <caption>
                  <p>(A) In the genotype view, the red, blue, yellow and white colors represent genotype call AA, BB, AB and No Call</p>
               </caption>
               <text>
                  <p>(A) In the genotype view, the red, blue, yellow and white colors represent genotype call AA, BB, AB and No Call. (B) The inferred haplotypes indicating ancestor origins are displayed in correspondence to the genotype view. The different colors represent distinct founder chromosomes. For each individual (column), the father allele haplotype is displayed on the left and mother allele haplotype on the right. (C) In the ordered genotype view, the red and blue colors represent the A and B genotype of father allele (left) and mother allele (right) in each individual (column). The LOD score curve is displayed in the shaded box on the right. The left boundary and right boundary of the box represent value of -2 and 3, and the red vertical line represents 2.</p>
               </text>
               <graphic file="1471-2156-6-7-14"/>
            </fig>
            <fig id="F15">
               <title>
                  <p>Figure 15</p>
               </title>
               <caption>
                  <p>(A) The peak LOD score region is enlarged and displayed proportionally to real chromosomal distance in the context of genes and cytobands</p>
               </caption>
               <text>
                  <p>(A) The peak LOD score region is enlarged and displayed proportionally to real chromosomal distance in the context of genes and cytobands. LOD score peaks are shown at the q-arm of chromosome 3 (114.18 -117.00 Mb, maximal LOD = 2.77). The shaded curve region has the same range as Figure 14. (B) A enlarged view of the peak region with more details of the individual SNPs and genes. The transcription starting site of the genes are used to display their positions.</p>
               </text>
               <graphic file="1471-2156-6-7-15"/>
            </fig>
            <p>After the linkage computation is finished, the inferred haplotype information can be visualized. In the haplotype view (Figure <figr fid="F14">14</figr> and <figr fid="F16">16</figr>), one can view the inference on how the founder chromosomes are crossed over and inherited by the descendants. The different colors represent distinct founder chromosomes, and for each individual, the father allele haplotype is displayed on the left and mother on the right. Since a pedigree contains no phase information of the founders <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, in the linkage computation we can assume that one child of each founder always inherits the whole grandfather-descent chromosome. This assumption does not affect the LOD score computation but reduces the number of bits in the Lander-Green algorithm by the number of founders and consequently reduces the analysis time. This is the reason that in Figure <figr fid="F14">14B</figr> the individual 1 has both father and mother haplotypes in pure color and individual 2 has only the father haplotype in pure color. By inspection of the observed genotype and the inferred haplotypes (Figure <figr fid="F16">16</figr>), one can see that only in the peak LOD score region all the affected children (individual 3, 4, 5 and 7) are homozygous and that the unaffected child (individual 6) is heterozygous. All the affected individuals share two copies of the identical chromosome segment (the pink color between the two arrows) presumably containing the disease locus. By two very close crossover events respectively in individual 6 (indicated by the black arrow) and individual 7 (indicated by the white arrow), the LOD score implicates the possible disease gene in a 2 Mb region and one can easily search the physical map for candidate disease genes in this region in the dChip chromosome view (Figure <figr fid="F15">15</figr>).</p>
            <fig id="F16">
               <title>
                  <p>Figure 16</p>
               </title>
               <caption>
                  <p>The genotypes (A) and inferred haplotypes (B) from family 5026.10 on the peak score region of chromosome 3 are shown (for more details see the legend in Figure 14)</p>
               </caption>
               <text>
                  <p>The genotypes (A) and inferred haplotypes (B) from family 5026.10 on the peak score region of chromosome 3 are shown (for more details see the legend in Figure 14). In the peak LOD score region all the affected children (3, 4, 5 and 7) inherited the same ancestral allele in the consanguineous family and the unaffected child (6) inherited two different ancestral alleles.</p>
               </text>
               <graphic file="1471-2156-6-7-16"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion and conclusions</p>
         </st>
         <p>We have developed the CompareLinkage software for easy comparison and analysis of genotype datasets with common multi-point linkage analysis software programs. It provides functions such as automated data formatting and the calling of linkage analysis software programs to facilitate comparative linkage analysis. The results can be visualized in a chromosome window in the context of genes, cytobands and SNPs in dChip's user friendly graphical interface. The linkage scores of other linkage software packages can be saved into the dChip score file format through CompareLinkage and viewed in the dChip chromosome viewer. This provides the interface to view other computed statistics such as linkage disequilibrium scores along the chromosomes. We have also implemented a variant of the Lander-Green algorithm as the dChipLinkage module for parametric linkage analysis of small pedigrees. It can analyze all chromosomes for families with up to 18 bits within one hour on a PC with one gigabyte memory. This is useful for recessive and consanguineous families whose bits are often small.</p>
         <p>The comparison analysis of three Mapping 10 K array data sets show similar results in regions with significant LOD scores across all the four software packages. The regions with concordant LOD/NPL scores should provide more confidence in the candidate disease loci. However, there are clear differences in isolated regions. This emphasizes the challenge of a comparative analysis using different linkage algorithm implementations. We hypothesize that the differences between the software programs in peak locations are attributable to:</p>
         <p>1. The specific algorithm implementation in each program.</p>
         <p>2. The difference between parametric &#8211; and non-parametric analysis.</p>
         <p>3. The existence of undetected genotype errors in the data sets which could falsely deflate LOD scores <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B22">22</abbr></abbrgrp>. dChipLinkage uses an error model to automatically handle genotype errors and avoid sporadic LOD score peaks due to undetected non-Mendelian errors, and results in a smoother LOD curve as seen in Figure <figr fid="F7">7</figr>, <figr fid="F8">8</figr>, <figr fid="F9">9</figr>, <figr fid="F10">10</figr>, <figr fid="F11">11</figr>, <figr fid="F12">12</figr>. However, this error handling algorithm involves more iterations and increases the computation time. There are further techniques to reduce the memory and time requirement of the Lander-Green algorithm <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp></p>
         <p>In light of the discordance between the results from common linkage software packages and from dChipLinkage, we will validate dChipLinkage implementation using additional datasets and the CompareLinkage software.</p>
         <p>In summary, the CompareLinkage and dChipLinkage software automate the comparative linkage analysis and visualization using multiple software packages. With these tools users will be able to increase their confidence in candidate regions and can use the visualization tools to explore the disease associated genome regions.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>Project name: The CompareLinkage software and the dChipLinkage software module</p>
         <p>Project home page: <url>http://biosun1.harvard.edu/complab/linkage</url></p>
         <p>Operating system(s): Windows (dChipLinkge); Windows (CompareLinkage and its graphical interface), Unix (CompareLinkage command line version)</p>
         <p>Programming language: Visual C++ 6.0 (dChipLinkge); Perl and Java (CompareLinkage software)</p>
         <p>Other requirements: None</p>
         <p>License: None.</p>
         <p>Any restrictions to use by non-academics: No restrictions</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>CR, CL and WHW conceived of the study, and participated in its design and coordination. NM and RJHS generated the 5026.10 family data, and MP generated the CR and ER family data. IL implemented the CompareLinkage software and performed the comparative analysis using multiple linkage analysis software packages. JC implemented its graphical user interface (GUI). CL implemented the dChipLinkage module. KH participated in the design and analysis of the study. IL, CR and CL drafted the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Hajime Matsuzaki, Patricia Dahia, Robert Sean Hill, Steven Boyden for helpful discussions. This work is supported by NIH grant 1R01HG02341 and P20-CA96470 (IL, KH and WHW), RO1-DC02842 (Richard J.H. Smith), NIH DK54931 (Martin R. Pollak), and grants from Friends of Dana-Farber Cancer Institute (CL) and Claudia Adams Barr Program in Cancer Research (CL).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Large-scale genotyping of complex DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Matsuzaki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Cao</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Di</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Ryder</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Surti</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Boyce-Jacino</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Fodor</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>KW</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2003</pubdate>
            <volume>21</volume>
            <fpage>1233</fpage>
            <lpage>1237</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt869</pubid>
                  <pubid idtype="pmpid" link="fulltext">12960966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Parallel genotyping of over 10,000 SNPs using a one-primer assay on a high-density oligonucleotide array</p>
            </title>
            <aug>
               <au>
                  <snm>Matsuzaki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Loi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>YY</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Law</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Di</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Ryder</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Marcus</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Walsh</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Shriver</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Puck</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Mei</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>414</fpage>
            <lpage>425</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">353229</pubid>
                  <pubid idtype="pmpid" link="fulltext">14993208</pubid>
                  <pubid idtype="doi">10.1101/gr.2014904</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Genomewide linkage analysis of bipolar disorder by use of a high-density single-nucleotide-polymorphism (SNP) genotyping assay: a comparison with microsatellite marker assays and finding of significant linkage to chromosome 6q22</p>
            </title>
            <aug>
               <au>
                  <snm>Middleton</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>Pato</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Gentile</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Morley</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Eisener</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Petryshen</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Kirby</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Medeiros</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Carvalho</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Macedo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dourado</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coelho</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Valente</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Soares</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Ferreira</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Lei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Azevedo</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Sklar</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pato</snm>
                  <fnm>CN</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2004</pubdate>
            <volume>74</volume>
            <fpage>886</fpage>
            <lpage>897</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/420775</pubid>
                  <pubid idtype="pmpid" link="fulltext">15060841</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Whole-genome scan, in a complex disease, using 11,245 single-nucleotide polymorphisms: comparison with microsatellites</p>
            </title>
            <aug>
               <au>
                  <snm>John</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shephard</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zeggini</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cao</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Vasavda</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mills</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Barton</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hinks</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eyre</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Ollier</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Silman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Worthington</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>GC</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2004</pubdate>
            <volume>75</volume>
            <fpage>54</fpage>
            <lpage>64</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/422195</pubid>
                  <pubid idtype="pmpid" link="fulltext">15154113</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Construction of multilocus genetic linkage maps in humans</p>
            </title>
            <aug>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1987</pubdate>
            <volume>84</volume>
            <fpage>2363</fpage>
            <lpage>2367</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">304651</pubid>
                  <pubid idtype="pmpid">3470801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Parametric and nonparametric linkage analysis: a unified multipoint approach</p>
            </title>
            <aug>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Reeve-Daly</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1996</pubdate>
            <volume>58</volume>
            <fpage>1347</fpage>
            <lpage>1363</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8651312</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Merlin--rapid analysis of dense genetic maps using sparse gene flow trees</p>
            </title>
            <aug>
               <au>
                  <snm>Abecasis</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Cherny</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Cookson</snm>
                  <fnm>WO</fnm>
               </au>
               <au>
                  <snm>Cardon</snm>
                  <fnm>LR</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>97</fpage>
            <lpage>101</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng786</pubid>
                  <pubid idtype="pmpid" link="fulltext">11731797</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Allegro, a new computer program for multipoint linkage analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Gudbjartsson</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Jonasson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Frigge</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Kong</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>12</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75514</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802644</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>dChipSNP: significance curve and clustering of SNP-array-based loss-of-heterozygosity data</p>
            </title>
            <aug>
               <au>
                  <snm>Lin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Sellers</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Lieberfarb</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>1233</fpage>
            <lpage>1240</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth069</pubid>
                  <pubid idtype="pmpid" link="fulltext">14871870</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>31</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">14539</pubid>
                  <pubid idtype="pmpid" link="fulltext">11134512</pubid>
                  <pubid idtype="doi">10.1073/pnas.011404098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>DNA-Chip Analyzer (dChip)</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>The analysis of gene expression data: methods and software</source>
            <publisher>New York, Springer</publisher>
            <editor>Parmigiani G, Garrett ES, Irizarry R and Zeger SL</editor>
            <pubdate>2003</pubdate>
            <fpage>120</fpage>
            <lpage>141</lpage>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Mathematical and statistical methods for genetic analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Lange</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <publisher>New York, Springer-Verlag</publisher>
            <edition>2</edition>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Linkage User's Guide</p>
            </title>
            <aug>
               <au>
                  <snm>Lathrop</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ott</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <url>ftp://linkage.rockefeller.edu/software/linkage</url>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Affymetrix Mapping 10K Array - Support Materials</p>
            </title>
            <aug>
               <au>
                  <cnm>Affymetrix</cnm>
               </au>
            </aug>
            <fpage>[http://www.affymetrix.com/support/technical/byproduct.affx?product=10k]</fpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>UCSC Genome Bioinformatics</p>
            </title>
            <aug>
               <au>
                  <cnm>UCSC</cnm>
               </au>
            </aug>
            <fpage>[http://genome.ucsc.edu/]</fpage>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Rapid multipoint linkage analysis of recessive traits in nuclear families, including homozygosity mapping</p>
            </title>
            <aug>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1995</pubdate>
            <volume>56</volume>
            <fpage>519</fpage>
            <lpage>527</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7847388</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Detection and integration of genotyping errors in statistical genetics</p>
            </title>
            <aug>
               <au>
                  <snm>Sobel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Papp</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Lange</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2002</pubdate>
            <volume>70</volume>
            <fpage>496</fpage>
            <lpage>508</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">384922</pubid>
                  <pubid idtype="pmpid" link="fulltext">11791215</pubid>
                  <pubid idtype="doi">10.1086/338920</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>PedCheck: a program for identification of genotype incompatibilities in linkage analysis</p>
            </title>
            <aug>
               <au>
                  <snm>O'Connell</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Weeks</snm>
                  <fnm>DE</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>1998</pubdate>
            <volume>63</volume>
            <fpage>259</fpage>
            <lpage>266</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/301904</pubid>
                  <pubid idtype="pmpid" link="fulltext">9634505</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results</p>
            </title>
            <aug>
               <au>
                  <snm>Lander</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1995</pubdate>
            <volume>11</volume>
            <fpage>241</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1195-241</pubid>
                  <pubid idtype="pmpid" link="fulltext">7581446</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The deaf jerker mouse has a mutation in the gene encoding the espin actin-bundling proteins of hair cell stereocilia and lacks espins</p>
            </title>
            <aug>
               <au>
                  <snm>Zheng</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sekerkova</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Vranich</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tilney</snm>
                  <fnm>LG</fnm>
               </au>
               <au>
                  <snm>Mugnaini</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bartles</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2000</pubdate>
            <volume>102</volume>
            <fpage>377</fpage>
            <lpage>385</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)00042-8</pubid>
                  <pubid idtype="pmpid">10975527</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Mutations of ESPN cause autosomal recessive deafness and vestibular dysfunction</p>
            </title>
            <aug>
               <au>
                  <snm>Naz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Griffith</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Riazuddin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hampton</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Battey</snm>
                  <fnm>JFJ</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Wilcox</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>TB</fnm>
               </au>
            </aug>
            <source>J Med Genet</source>
            <pubdate>2004</pubdate>
            <volume>41</volume>
            <fpage>591</fpage>
            <lpage>595</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1136/jmg.2004.018523</pubid>
                  <pubid idtype="pmpid" link="fulltext">15286153</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data</p>
            </title>
            <aug>
               <au>
                  <snm>Douglas</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Boehnke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lange</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2000</pubdate>
            <volume>66</volume>
            <fpage>1287</fpage>
            <lpage>1297</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/302861</pubid>
                  <pubid idtype="pmpid" link="fulltext">10739757</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Efficient multipoint linkage analysis through reduction of inheritance space</p>
            </title>
            <aug>
               <au>
                  <snm>Markianos</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2001</pubdate>
            <volume>68</volume>
            <fpage>963</fpage>
            <lpage>977</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/319507</pubid>
                  <pubid idtype="pmpid" link="fulltext">11254453</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Faster multipoint linkage analysis using Fourier transforms</p>
            </title>
            <aug>
               <au>
                  <snm>Kruglyak</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>1998</pubdate>
            <volume>5</volume>
            <fpage>1</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9541867</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
