BMC Bioinformatics

official impact factor 3.03

Open Access Research article

Shape based indexing for faster search of RNA family databases

Stefan Janssen, Jens Reeder and Robert Giegerich*

Author Affiliations

Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany

For all author emails, please log on.

BMC Bioinformatics 2008, 9:131 doi:10.1186/1471-2105-9-131

Published: 29 February 2008

Additional files

Additional file 1:

Effects of skipping "difficult" families on sensitivity and filtration ratio. The test- and training sets are constructed as above, but this time we choose up to 1,000 sequences for each family instead of four. Random- and gene- testsets are not considered, because we focus on the changes of the sensitivity. RNAsifter is set to kfamily = 3, kquery = 5, ε = 0.4. One has to read the rows in a accumulative fashion. In the first row no family is skipped, in the second row family RF00017 is omitted, the third row omits families RF00017 and RF00230 and so on. Note that the testset shrinks, because the sequences of a skipped family are also removed from the set.

Format: PDF Size: 40KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data