This article is part of the supplement: Proceedings of the 12th European workshop on QTL mapping and marker assisted selection

Open Access Proceedings

Pre-selection of most significant SNPS for the estimation of genomic breeding values

Nicolò PP Macciotta1*, Giustino Gaspa1, Roberto Steri1, Camillo Pieramati2, Paolo Carnier3 and Corrado Dimauro1

Author Affiliations

1 Dipartimento di Scienze Zootecniche, Università di Sassari, Italy

2 Centro di Studio del Cavallo Sportivo, Università di Perugia, Italy

3 Dipartimento di Scienze Animali, Università di Padova, Italy

For all author emails, please log on.

BMC Proceedings 2009, 3(Suppl 1):S14  doi:10.1186/1753-6561-3-S1-S14

Published: 23 February 2009


The availability of a large amount of SNP markers throughout the genome of different livestock species offers the opportunity to estimate genomic breeding values (GEBVs). However, the estimation of many effects in a data set of limited size represent a severe statistical problem. A pre-selection of SNPS based on single regression may provide a reasonable compromise between accuracy of results, number of independent variables to be considered and computing requirements.

A total of 595 and 618 SNPS were pre-selected using a simple linear regression for each SNP, based on phenotypes or polygenic EBVs, respectively, with an average distance of 9–10 cM between them. Chromosome four had the largest frequency of selected SNPS. Average correlations between GEBVs and TBVs were about 0.82 and 0.73 for the TRAINING generations when phenotypes or polygenic EBVs were considered as dependent variable, whereas they tend to decrease to 0.66 and 0.54 for the PREDICTION generations. The pre-selection of SNPs using the phenotypes as dependent variable together with a BLUP estimation of marker genotype effects using a variance contribution of each marker equal to σ2a/nsnps resulted in a remarkable accuracy of GEBV estimation (0.77) in the PREDICTION generations.