Open Access Highly Accessed Research article

Gene discovery in the horned beetle Onthophagus taurus

Jeong-Hyeon Choi1, Teiya Kijimoto2, Emilie Snell-Rood2, Hongseok Tae1, Youngik Yang3, Armin P Moczek2 and Justen Andrews2*

Author Affiliations

1 Center for Genomics and Bioinformatics, Indiana University, Bloomington, Indiana, 47405, USA

2 Department of Biology, Indiana University, Bloomington, Indiana, 47405, USA

3 School of Informatics and Computing, Indiana University, Bloomington, Indiana 47408, USA

For all author emails, please log on.

BMC Genomics 2010, 11:703  doi:10.1186/1471-2164-11-703

Published: 14 December 2010



Horned beetles, in particular in the genus Onthophagus, are important models for studies on sexual selection, biological radiations, the origin of novel traits, developmental plasticity, biocontrol, conservation, and forensic biology. Despite their growing prominence as models for studying both basic and applied questions in biology, little genomic or transcriptomic data are available for this genus. We used massively parallel pyrosequencing (Roche 454-FLX platform) to produce a comprehensive EST dataset for the horned beetle Onthophagus taurus. To maximize sequence diversity, we pooled RNA extracted from a normalized library encompassing diverse developmental stages and both sexes.


We used 454 pyrosequencing to sequence ESTs from all post-embryonic stages of O. taurus. Approximately 1.36 million reads assembled into 50,080 non-redundant sequences encompassing a total of 26.5 Mbp. The non-redundant sequences match over half of the genes in Tribolium castaneum, the most closely related species with a sequenced genome. Analyses of Gene Ontology annotations and biochemical pathways indicate that the O. taurus sequences reflect a wide and representative sampling of biological functions and biochemical processes. An analysis of sequence polymorphisms revealed that SNP frequency was negatively related to overall expression level and the number of tissue types in which a given gene is expressed. The most variable genes were enriched for a limited number of GO annotations whereas the least variable genes were enriched for a wide range of GO terms directly related to fitness.


This study provides the first large-scale EST database for horned beetles, a much-needed resource for advancing the study of these organisms. Furthermore, we identified instances of gene duplications and alternative splicing, useful for future study of gene regulation, and a large number of SNP markers that could be used in population-genetic studies of O. taurus and possibly other horned beetles.