Eucalyptus research in the post-genome era

Pappas, Georgios; de Alencar, Sergio; Silva-Junior, Orzenil B; Togawa, Roberto C; Pappas, Marilia CR; Grattapaglia, Dario

doi:10.1186/1753-6561-5-S7-I22

Volume 5 Supplement 7

IUFRO Tree Biotechnology Conference 2011: From Genomes to Integration and Delivery

Invited speaker presentation
Open access
Published: 13 September 2011

BMC Proceedings volume 5, Article number: I22 (2011)

With the efforts of the Joint Genome Institute (JGI), through the EUCAGEN Eucalyptus Genome Network, reaching the final release of a Eucalyptus grandis reference genome (BRASUZ1), this accomplishment is expected to profoundly shape future research on this global tree genus [1]. One of the first steps toward this end has been the refinement of the genome annotation. Robust models have been generated for protein-coding genes. Here we report the annotation of two other pivotal genome constituents: transposable elements (TEs) and microRNA genes. TEs are not only the most dominant elements but are also major drivers of genome plasticity. Several bioinformatic strategies were employed to perform de novo and homology-based prediction of repetitive elements [2] in the current genome release (version 1.0). A total of 53 distinct TE families were identified, with the retrotransposon super-family being widely over-represented, as observed for the majority of plant genomes sequenced.

MicroRNAs (miRNAs), key players in post-transcriptional gene regulation, were annotated using a combination of massively parallel sequencing of small RNA libraries and genome-wide computational screening for a compatible secondary structure of the precursor. Together, the experimental and in silico evidence enabled the annotation of 206 distinct miRNA loci comprising 36 different mir gene families, including several miRNA isoforms. The blueprint provided by a high-quality reference genome, both in terms of sequence completeness and annotation, will leverage efforts to better characterize the intra- and inter-specific sequence variation underlying the marked phenotypic differences among the hundreds of species comprising this genus.

Recent advances in DNA sequencing technologies permit a comprehensive interrogation of several other individuals at a fraction of the cost. In this context, we used Illumina (2 x 75 bp) short-read sequencing data of E. globulus clone X46, generated by JGI and made available through the EUCAGEN network, to carry out two comparative genomics experiments. Of the 40X raw sequence data provided, it was possible to use an equivalent of 20X coverage (~12 Gbp). We first attempted a de novo assembly using VELVET [3]. A total of 161,000 contigs were obtained, the largest being ~3.5 kb. In spite of the easy access to and low cost of next-generation sequencing technologies, these results suggest that even for relatively small forest tree genomes, current technical and computational limitations preclude comprehensive assembly, likely due to the ubiquitous occurrence of repetitive elements in such genomes. Nevertheless, when we mapped the E. globulus sequencing data against the BRASUZ1 reference genome, 55% of the reads could be mapped with high confidence. From these, approximately 800,000 high-quality single nucleotide polymorphisms (SNPs) could be identified, clearly showing the key role that the reference genome will have in future genomic undertakings. The sheer number of molecular markers discovered in this experiment not only fosters more powerful studies on the evolutionary history and population genomics of eucalypts, but also inaugurates a new era in molecular breeding of species of this genus, providing genome-wide coverage for genomic selection and association studies.
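The coverage figures quoted above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the authors' analysis: the function names are hypothetical helpers, and the inputs are the rounded values from the text (20X usable coverage, ~12 Gbp, 75 bp paired reads, ~800,000 SNPs). The SNP spacing is averaged over the whole implied genome size, which is only a rough approximation given that 55% of reads were mapped.

```python
# Back-of-the-envelope sequencing arithmetic from the rounded figures in the text.
# Function names are hypothetical; they are not from any tool used by the authors.

def implied_genome_size_bp(total_bases, coverage):
    """Genome size implied by coverage = total_bases / genome_size."""
    return total_bases / coverage

def read_pairs(total_bases, read_length_bp):
    """Number of read pairs, assuming two reads of equal length per pair."""
    return total_bases / (2 * read_length_bp)

def mean_snp_spacing_bp(genome_size_bp, snp_count):
    """Average distance between SNPs if they were spread evenly over the genome."""
    return genome_size_bp / snp_count

USABLE_BASES = 12e9      # ~12 Gbp of usable sequence (from the text)
USABLE_COVERAGE = 20     # equivalent to 20X coverage (from the text)
READ_LENGTH_BP = 75      # 2 x 75 bp Illumina reads (from the text)
SNP_COUNT = 800_000      # approximate number of SNPs (from the text)

genome_size = implied_genome_size_bp(USABLE_BASES, USABLE_COVERAGE)
pairs = read_pairs(USABLE_BASES, READ_LENGTH_BP)
spacing = mean_snp_spacing_bp(genome_size, SNP_COUNT)

print(f"Implied genome size: {genome_size / 1e6:.0f} Mbp")
print(f"Usable read pairs:   {pairs / 1e6:.0f} million")
print(f"Mean SNP spacing:    one per {spacing:.0f} bp")
```

With these rounded inputs, the sketch gives an implied genome size of about 600 Mbp, about 80 million usable read pairs, and on average one SNP per roughly 750 bp.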

References

1. Grattapaglia D, Kirst M: Eucalyptus applied genomics: from gene sequences to breeding tools. New Phytologist. 2008, 179 (4): 911-929. 10.1111/j.1469-8137.2008.02503.x.
2. Flutre T, Duprat E, Feuillet C, Quesneville H: Considering Transposable Element Diversification in De Novo Annotation Approaches. PLoS ONE. 2011, 6 (1):
3. Zerbino DR, Birney E: Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Research. 2008, 18 (5): 821-829. 10.1101/gr.074492.107.

Author information

Authors and Affiliations

EMBRAPA Genetic Resources and Biotechnology - EPqB Final W5 Norte, Brasília, DF 70770-917, Brazil
Georgios Pappas, Sergio de Alencar, Orzenil B Silva-Junior, Roberto C Togawa, Marilia CR Pappas & Dario Grattapaglia

Corresponding author

Correspondence to Georgios Pappas.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Pappas, G., de Alencar, S., Silva-Junior, O.B. et al. Eucalyptus research in the post-genome era. BMC Proc 5 (Suppl 7), I22 (2011). https://doi.org/10.1186/1753-6561-5-S7-I22
