Email updates

Keep up to date with the latest news and content from BMC Bioinformatics and BioMed Central.

Open Access Highly Accessed Software

Snagger: A user-friendly program for incorporating additional information for tagSNP selection

Christopher K Edlund13, Won H Lee23, Dalin Li23, David J Van Den Berg1 and David V Conti23*

Author Affiliations

1 Keck School of Medicine, University of Southern California, Los Angeles, California 90033, USA

2 Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California 90033, USA

3 Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, CA 90033, USA

For all author emails, please log on.

BMC Bioinformatics 2008, 9:174  doi:10.1186/1471-2105-9-174

Published: 27 March 2008

Abstract

Background

There has been considerable effort focused on developing efficient programs for tagging single-nucleotide polymorphisms (SNPs). Many of these programs do not account for potential reduced genomic coverage resulting from genotyping failures nor do they preferentially select SNPs based on functionality, which may be more likely to be biologically important.

Results

We have developed a user-friendly and efficient software program, Snagger, as an extension to the existing open-source software, Haploview, which uses pairwise r2 linkage disequilibrium between single nucleotide polymorphisms (SNPs) to select tagSNPs. Snagger distinguishes itself from existing SNP selection algorithms, including Tagger, by providing user options that allow for: (1) prioritization of tagSNPs based on certain characteristics, including platform-specific design scores, functionality (i.e., coding status), and chromosomal position, (2) efficient selection of SNPs across multiple populations, (3) selection of tagSNPs outside defined genomic regions to improve coverage and genotyping success, and (4) picking of surrogate tagSNPs that serve as backups for tagSNPs whose failure would result in a significant loss of data. Using HapMap genotype data from ten ENCODE regions and design scores for the Illumina platform, we show similar coverage and design score distribution and fewer total tagSNPs selected by Snagger compared to the web server Tagger.

Conclusion

Snagger improves upon current available tagSNP software packages by providing a means for researchers to select tagSNPs that reliably capture genetic variation across multiple populations while accounting for significant genotyping failure risk and prioritizing on SNP-specific characteristics.