Open Access Highly Accessed Open Badges Research article

RNA-Seq Atlas of Glycine max: A guide to the soybean transcriptome

Andrew J Severin1*, Jenna L Woody1, Yung-Tsi Bolon2, Bindu Joseph1, Brian W Diers3, Andrew D Farmer4, Gary J Muehlbauer6, Rex T Nelson5, David Grant5, James E Specht7, Michelle A Graham15, Steven B Cannon15, Gregory D May4, Carroll P Vance26 and Randy C Shoemaker15

Author Affiliations

1 Department of Agronomy, Iowa State University, Ames, IA 50011, USA

2 United States Department of Agriculture-Agricultural Research Service, Plant Research Unit, St. Paul, MN 55108, USA

3 Department of Crop Sciences, University of Illinois, 1101 West Peabody Dr., Urbana, IL 61801, USA

4 National Center for Genome Resources, Santa Fe, NM 87505, USA

5 United States Department of Agriculture-Agricultural Research Service, Corn Insects and Crop Genetics Resources Unit, Ames, IA 50011, USA

6 Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA

7 Department of Agronomy, University of Nebraska-Lincoln, Lincoln, NE 68583, USA

For all author emails, please log on.

BMC Plant Biology 2010, 10:160  doi:10.1186/1471-2229-10-160

Published: 5 August 2010



Next generation sequencing is transforming our understanding of transcriptomes. It can determine the expression level of transcripts with a dynamic range of over six orders of magnitude from multiple tissues, developmental stages or conditions. Patterns of gene expression provide insight into functions of genes with unknown annotation.


The RNA Seq-Atlas presented here provides a record of high-resolution gene expression in a set of fourteen diverse tissues. Hierarchical clustering of transcriptional profiles for these tissues suggests three clades with similar profiles: aerial, underground and seed tissues. We also investigate the relationship between gene structure and gene expression and find a correlation between gene length and expression. Additionally, we find dramatic tissue-specific gene expression of both the most highly-expressed genes and the genes specific to legumes in seed development and nodule tissues. Analysis of the gene expression profiles of over 2,000 genes with preferential gene expression in seed suggests there are more than 177 genes with functional roles that are involved in the economically important seed filling process. Finally, the Seq-atlas also provides a means of evaluating existing gene model annotations for the Glycine max genome.


This RNA-Seq atlas extends the analyses of previous gene expression atlases performed using Affymetrix GeneChip technology and provides an example of new methods to accommodate the increase in transcriptome data obtained from next generation sequencing. Data contained within this RNA-Seq atlas of Glycine max can be explored at webcite.