Open Access Research article

Core Hunter: an algorithm for sampling genetic resources based on multiple genetic measures

Chris Thachuk1*, José Crossa2*, Jorge Franco2,4, Susanne Dreisigacker3, Marilyn Warburton3,5 and Guy F Davenport2

Author Affiliations

1 Department of Computer Science, University of British Columbia, 2366 Main Mall, Vancouver, BC V6T1Z4, Canada

2 Crop Research Informatics Laboratory, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, México D.F., México

3 Applied Biotechnology Center, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, México D.F., México

4 International Institute of Tropical Agriculture (IITA), Ibadan, Nigeria

5 USDA-ARS-CHPRRU, Box 9555, Mississippi State, MS 39762, USA

For all author emails, please log on.

BMC Bioinformatics 2009, 10:243 doi:10.1186/1471-2105-10-243

Published: 6 August 2009

Abstract

Background

Existing algorithms and methods for forming diverse core subsets currently address either allele representativeness (breeder's preference) or allele richness (taxonomist's preference). The main objective of this paper is to propose a powerful yet flexible algorithm capable of selecting core subsets that have high average genetic distance between accessions, or rich genetic diversity overall, or a combination of both.

Results

We present Core Hunter, an advanced stochastic local search algorithm for selecting core subsets. Core Hunter is able to find core subsets having more genetic diversity and better average genetic distance than the current state-of-the-art algorithms for all genetic distance and diversity measures we evaluated. Furthermore, Core Hunter can attempt to optimize any number of genetic measures simultaneously, based on the preference of the user. Notably, Core Hunter is able to select significantly smaller core subsets, which retain all unique alleles from a reference collection, than state-of-the-art algorithms.

Conclusion

Core Hunter is a highly effective and flexible tool for sampling genetic resources and establishing core subsets. Our implementation, documentation, and source code for Core Hunter is available at http://corehunter.org webcite