TRIP: a method for novel transcript reconstruction from paired-end RNA-seq reads

Mangul, Serghei; Caciula, Adrian; Brinza, Dumitru; Mandoiu, Ion I; Zelikovsky, Alex

doi:10.1186/1471-2105-13-S18-A11

Volume 13 Supplement 18

Highlights from the Eighth International Society for Computational Biology (ISCB) Student Council Symposium 2012

Meeting abstract
Open access
Published: 14 December 2012

TRIP: a method for novel transcript reconstruction from paired-end RNA-seq reads

Serghei Mangul¹,
Adrian Caciula¹,
Dumitru Brinza²,
Ion I Mandoiu³ &
…
Alex Zelikovsky¹

BMC Bioinformatics volume 13, Article number: A11 (2012) Cite this article

2700 Accesses
1 Citations
11 Altmetric
Metrics details

Background

Recent advances in DNA sequencing have made it possible to sequence the whole transcriptome by massively parallel sequencing, commonly referred as RNA-Seq. RNA-Seq is quickly becoming the technology of choice for transcriptome research and analyses. RNA-Seq allows to reduce the sequencing cost and significantly increase data throughput, but it is computationally challenging to use such RNA-Seq data for reconstructing of full length transcripts and accurately estimate their abundances across all cell types. A number of recent works have addressed the problem of transcriptome reconstruction from RNA-Seq reads. These methods fall into three categories: genome-guided, genome-independent and annotation-guided.

Methods

In this work, we propose a novel statistical genome-guided method called “T ranscriptome R econstruction using I nteger P rograming” (TRIP) that incorporates fragment length distribution into novel transcript reconstruction from paired-end RNA-Seq reads. To reconstruct novel transcripts, we create a splice graph based on inferred exon boundaries and RNA-Seq reads. A splice graph is a directed acyclic graph (DAG), whose vertices represent exons and edges represent splicing events. We enumerate all maximal paths in the splice graph using a depth-first-search (DFS) algorithm. These paths correspond to putative transcripts and are the input for the TRIP algorithm.

To solve the transcriptome reconstruction problem we must select a set of putative transcripts with the highest support from the RNA-Seq reads. We formulate this problem as an integer program. The objective to select the smallest set of putative transcripts that yields a good statistical fit between the fragment length distribution empirically determined during library preparation and fragment lengths implied by mapping read pairs to selected transcripts.

Conclusions

Preliminary experimental results on synthetic datasets generated with various sequencing parameters and distribution assumptions show that TRIP has increased transcriptome reconstruction accuracy compared to previous methods that ignore fragment length distribution information.

Author information

Authors and Affiliations

Computer Science Department, Georgia State University, University Plaza, Atlanta, Georgia, 30303, USA
Serghei Mangul, Adrian Caciula & Alex Zelikovsky
Ion Bioinformatics, Life Technologies Corporation, Foster City, CA, USA
Dumitru Brinza
Department of Computer Science & Engineering, University of Connecticut, 371 Faireld Rd., Unit 2155, Storrs, CT, 06269-2155, USA
Ion I Mandoiu

Authors

Serghei Mangul
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Caciula
View author publications
You can also search for this author in PubMed Google Scholar
Dumitru Brinza
View author publications
You can also search for this author in PubMed Google Scholar
Ion I Mandoiu
View author publications
You can also search for this author in PubMed Google Scholar
Alex Zelikovsky
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Serghei Mangul.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Mangul, S., Caciula, A., Brinza, D. et al. TRIP: a method for novel transcript reconstruction from paired-end RNA-seq reads. BMC Bioinformatics 13 (Suppl 18), A11 (2012). https://doi.org/10.1186/1471-2105-13-S18-A11

Download citation

Published: 14 December 2012
DOI: https://doi.org/10.1186/1471-2105-13-S18-A11

Highlights from the Eighth International Society for Computational Biology (ISCB) Student Council Symposium 2012

TRIP: a method for novel transcript reconstruction from paired-end RNA-seq reads

Background

Methods

Conclusions

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Bioinformatics

Contact us

Highlights from the Eighth International Society for Computational Biology (ISCB) Student Council Symposium 2012

TRIP: a method for novel transcript reconstruction from paired-end RNA-seq reads

Background

Methods

Conclusions

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us