Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Database

SoyTEdb: a comprehensive database of transposable elements in the soybean genome

Jianchang Du1, David Grant2, Zhixi Tian1, Rex T Nelson2, Liucun Zhu1, Randy C Shoemaker2* and Jianxin Ma1*

Author Affiliations

1 Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA

2 US Department of Agriculture-Agricultural Research Service, Corn Insect and Crop Genetics Research Unit, Ames, Iowa 50011, USA

For all author emails, please log on.

BMC Genomics 2010, 11:113  doi:10.1186/1471-2164-11-113

Published: 17 February 2010

Abstract

Background

Transposable elements are the most abundant components of all characterized genomes of higher eukaryotes. It has been documented that these elements not only contribute to the shaping and reshaping of their host genomes, but also play significant roles in regulating gene expression, altering gene function, and creating new genes. Thus, complete identification of transposable elements in sequenced genomes and construction of comprehensive transposable element databases are essential for accurate annotation of genes and other genomic components, for investigation of potential functional interaction between transposable elements and genes, and for study of genome evolution. The recent availability of the soybean genome sequence has provided an unprecedented opportunity for discovery, and structural and functional characterization of transposable elements in this economically important legume crop.

Description

Using a combination of structure-based and homology-based approaches, a total of 32,552 retrotransposons (Class I) and 6,029 DNA transposons (Class II) with clear boundaries and insertion sites were structurally annotated and clearly categorized, and a soybean transposable element database, SoyTEdb, was established. These transposable elements have been anchored in and integrated with the soybean physical map and genetic map, and are browsable and visualizable at any scale along the 20 soybean chromosomes, along with predicted genes and other sequence annotations. BLAST search and other infrastracture tools were implemented to facilitate annotation of transposable elements or fragments from soybean and other related legume species. The majority (> 95%) of these elements (particularly a few hundred low-copy-number families) are first described in this study.

Conclusion

SoyTEdb provides resources and information related to transposable elements in the soybean genome, representing the most comprehensive and the largest manually curated transposable element database for any individual plant genome completely sequenced to date. Transposable elements previously identified in legumes, the third largest family of flowering plants, are relatively scarce. Thus this database will facilitate structural, evolutionary, functional, and epigenetic analyses of transposable elements in soybean and other legume species.