Email updates

Keep up to date with the latest news and content from BMC Bioinformatics and BioMed Central.

This article is part of the supplement: 22nd International Conference on Genome Informatics: Bioinformatics

Open Access Proceedings

ClipCrop: a tool for detecting structural variations with single-base resolution using soft-clipping information

Shin Suzuki, Tomohiro Yasuda, Yuichi Shiraishi, Satoru Miyano and Masao Nagasaki*

Author affiliations

Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo, 108-8639, Japan

For all author emails, please log on.

Citation and License

BMC Bioinformatics 2011, 12(Suppl 14):S7  doi:10.1186/1471-2105-12-S14-S7

Published: 14 December 2011

Abstract

Background

Structural variations (SVs) change the structure of the genome and are therefore the causes of various diseases. Next-generation sequencing allows us to obtain a multitude of sequence data, some of which can be used to infer the position of SVs.

Methods

We developed a new method and implementation named ClipCrop for detecting SVs with single-base resolution using soft-clipping information. A soft-clipped sequence is an unmatched fragment in a partially mapped read. To assess the performance of ClipCrop with other SV-detecting tools, we generated various patterns of simulation data – SV lengths, read lengths, and the depth of coverage of short reads – with insertions, deletions, tandem duplications, inversions and single nucleotide alterations in a human chromosome. For comparison, we selected BreakDancer, CNVnator and Pindel, each of which adopts a different approach to detect SVs, e.g. discordant pair approach, depth of coverage approach and split read approach, respectively.

Results

Our method outperformed BreakDancer and CNVnator in both discovering rate and call accuracy in any type of SV. Pindel offered a similar performance as our method, but our method crucially outperformed for detecting small duplications. From our experiments, ClipCrop infer reliable SVs for the data set with more than 50 bases read lengths and 20x depth of coverage, both of which are reasonable values in current NGS data set.

Conclusions

ClipCrop can detect SVs with higher discovering rate and call accuracy than any other tool in our simulation data set.