Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Research article

Comparison of class 2 transposable elements at superfamily resolution reveals conserved and distinct features in cereal grass genomes

Yujun Han1, Shanshan Qin1 and Susan R Wessler12*

Author Affiliations

1 Department of Plant Biology, University of Georgia, Athens, GA 30602, USA

2 Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA

For all author emails, please log on.

BMC Genomics 2013, 14:71  doi:10.1186/1471-2164-14-71

Published: 31 January 2013

Abstract

Background

Class 2 transposable elements (TEs) are the predominant elements in and around plant genes where they generate significant allelic diversity. Using the complete sequences of four grasses, we have performed a novel comparative analysis of class 2 TEs. To ensure consistent comparative analyses, we re-annotated class 2 TEs in Brachypodium distachyon, Oryza sativa (rice), Sorghum bicolor and Zea mays and assigned them to one of the five cut-and-paste superfamilies found in plant genomes (Tc1/mariner, PIF/Harbinger, hAT, Mutator, CACTA). We have focused on noncoding elements because of their abundance, and compared superfamily copy number, size and genomic distribution as well as correlation with the level of nearby gene expression.

Results

Our comparison revealed both unique and conserved features. First, the average length or size distribution of elements in each superfamily is largely conserved, with the shortest always being Tc1/mariner elements, followed by PIF/Harbinger, hAT, Mutator and CACTA. This order also holds for the ratio of the copy numbers of noncoding to coding elements. Second, with the exception of CACTAs, noncoding TEs are enriched within and flanking genes, where they display conserved distribution patterns, having the highest peak in the promoter region. Finally, our analysis of microarray data revealed that genes associated with Tc1/mariner and PIF/Harbinger noncoding elements have significantly higher expression levels than genes without class 2 TEs. In contrast, genes with CACTA elements have significantly lower expression than genes without class 2 TEs.

Conclusions

We have achieved the most comprehensive annotation of class 2 TEs to date in these four grass genomes. Comparative analysis of this robust dataset led to the identification of several previously unknown features of each superfamily related to copy number, element size, genomic distribution and correlation with the expression levels of nearby genes. These results highlight the importance of distinguishing TE superfamilies when assessing their impact on gene and genome evolution.

Keywords:
Genome comparison; Plant genomes; Genome evolution; Class2 transposable elements; Features; Grass genomes