Development of a computational strategy to compare repetitive element enrichment between experimental conditions from high-throughput sequencing datasets

Criscione, Steven; Neretti, Nicola

doi:10.1186/1753-6561-6-S6-O29

Volume 6 Supplement 6

Beyond the Genome 2012

Oral presentation
Open access
Published: 01 October 2012

Development of a computational strategy to compare repetitive element enrichment between experimental conditions from high-throughput sequencing datasets

Steven Criscione¹ &
Nicola Neretti^1,2

BMC Proceedings volume 6, Article number: O29 (2012) Cite this article

1551 Accesses
Metrics details

Repetitive and transposable elements comprise more than half of the human genome and play diverse roles in many biological processes. Mobile elements including retrotransposons are implicated in the organization of the epigenetic landscape, the progression of tumorigenesis, and the enhancement of genetic diversity. Despite the importance of repetitive and transposable elements these sequences are traditionally ignored in high-throughput sequencing analysis due to the technical difficulty of uniquely mapping reads from repeat DNA sequences. Here we report a new computational method for the analysis of repetitive elements from high-throughput sequencing datasets that accounts for all mapping reads. In our approach, we examine reads that map uniquely and to multiple locations of the genome using two separate strategies to determine a complete estimate of enrichment for repetitive elements. Included in our computational method is an output defined by reads per kilobase of repeat element per million mapped reads (similar to RPKM definition for the exon model) [1]. The calculated repeat element enrichment RPKM allows for the comparisons between repetitive elements as well as between experimental conditions. Our new method for examining repetitive elements from high-throughput sequencing datasets represents an improvement over existing methods because we do not exclude reads from the analysis and we can make comparisons between experimental conditions. To test our method we have examined repetitive element enrichment in the embryonic and adult mouse across different tissues using a variety of high-throughput mouse sequencing datasets available from the mouse ENCODE project and Shen et al. that provide a thorough snapshot of the epigenetic landscape of the embryonic and adult mouse [2]. We compare our method with an existing strategy for estimating repetitive element enrichment proposed by Day et al. [3], and demonstrate the advantages to our strategy. In addition, we test the robustness of our approach for determining differences in enrichment between experimental samples by conducting a comparison between the embryonic and adult mouse.

References

Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5: 621-628. 10.1038/nmeth.1226.
Article CAS PubMed Google Scholar
Shen Y, Yue F, McCleary DF, Ye Z, Edsall L, Kuan S, Wagner U, Dixon J, Lee L, Lobanenkov VV, Ren B: A map of the cis-regulatory sequences in the mouse genome. Nature. 2012, doi:10.1038/nature11243
Google Scholar
Day D, Luguette L, Park P, Kharchenko P: Estimating enrichment of repetitive elements from high-throughput sequence data. Genome Biol. 2010, 11: R69-10.1186/gb-2010-11-6-r69.
Article PubMed Central PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Molecular Biology, Cell Biology, and Biochemistry, Brown University, 75 Waterman Street, Providence, RI, 02912, USA
Steven Criscione & Nicola Neretti
Center for Computational Molecular Biology, and Biochemistry, Brown University, Providence, RI, 02912, USA
Nicola Neretti

Authors

Steven Criscione
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Neretti
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Criscione, S., Neretti, N. Development of a computational strategy to compare repetitive element enrichment between experimental conditions from high-throughput sequencing datasets. BMC Proc 6 (Suppl 6), O29 (2012). https://doi.org/10.1186/1753-6561-6-S6-O29

Download citation

Published: 01 October 2012
DOI: https://doi.org/10.1186/1753-6561-6-S6-O29

Beyond the Genome 2012

Development of a computational strategy to compare repetitive element enrichment between experimental conditions from high-throughput sequencing datasets

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

BMC Proceedings

Contact us

Beyond the Genome 2012

Development of a computational strategy to compare repetitive element enrichment between experimental conditions from high-throughput sequencing datasets

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Proceedings

Contact us