Log on / register
Feedback | Support | My details
Open AccessSoftware

Merging microsatellite data: enhanced methodology and software to combine genotype data for linkage and association analysis

Angela P Presson1 email, Eric M Sobel2 email, Paivi Pajukanta2 email, Christopher Plaisier2 email, Daniel E Weeks3,4 email, Karolina Åberg4 email and Jeanette C Papp2 email

1Department of Biostatistics, University of California, Los Angeles, CA, 90095, USA

2Department of Human Genetics, University of California, Los Angeles, CA, 90095, USA

3Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA

4Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA, 15261, USA

author email corresponding author email

BMC Bioinformatics 2008, 9:317doi:10.1186/1471-2105-9-317

Published: 21 July 2008

Abstract

Background

Correctly merged data sets that have been independently genotyped can increase statistical power in linkage and association studies. However, alleles from microsatellite data sets genotyped with different experimental protocols or platforms cannot be accurately matched using base-pair size information alone. In a previous publication we introduced a statistical model for merging microsatellite data by matching allele frequencies between data sets. These methods are implemented in our software MicroMerge version 1 (v1). While MicroMerge v1 output can be analyzed by some genetic analysis programs, many programs can not analyze alignments that do not match alleles one-to-one between data sets. A consequence of such alignments is that codominant genotypes must often be analyzed as phenotypes. In this paper we describe several extensions that are implemented in MicroMerge version 2 (v2).

Results

Notably, MicroMerge v2 includes a new one-to-one alignment option that creates merged pedigree and locus files that can be handled by most genetic analysis software. Other features in MicroMerge v2 enhance the following aspects of control: 1) optimizing the algorithm for different merging scenarios, such as data sets with very different sample sizes or multiple data sets, 2) merging small data sets when a reliable set of allele frequencies are available, and 3) improving the quantity and 4) quality of merged data. We present results from simulated and real microsatellite genotype data sets, and conclude with an association analysis of three familial dyslipidemia (FD) study samples genotyped at different laboratories. Independent analysis of each FD data set did not yield consistent results, but analysis of the merged data sets identified strong association at locus D11S2002.

Conclusion

The MicroMerge v2 features will enable merging for a variety of genotype data sets, which in turn will facilitate meta-analyses for powering association analysis.


© 1999-2008 BioMed Central Ltd unless otherwise stated