Log on / register
Feedback | Support | My details
Open AccessResearch article

Effective ambiguity checking in biosequence analysis

Janina Reeder1 email, Peter Steffen2 email and Robert Giegerich2 email

1InternationaI NRW Graduate School of Bioinformatics and Genome Research, Center of Biotechnology (CeBiTec), Bielefeld University, Postfach 10 01 31, 33501 Bielefeld, Germany

2Practical Computer Science, Faculty of Technology, Bielefeld University, Postfach 10 01 31, 33501 Bielefeld, Germany

author email corresponding author email

BMC Bioinformatics 2005, 6:153doi:10.1186/1471-2105-6-153

Published: 20 June 2005

Abstract

Background

Ambiguity is a problem in biosequence analysis that arises in various analysis tasks solved via dynamic programming, and in particular, in the modeling of families of RNA secondary structures with stochastic context free grammars. Several types of analysis are invalidated by the presence of ambiguity. As this problem inherits undecidability (as we show here) from the namely problem for context free languages, there is no complete algorithmic solution to the problem of ambiguity checking.

Results

We explain frequently observed sources of ambiguity, and show how to avoid them. We suggest four testing procedures that may help to detect ambiguity when present, including a just-in-time test that permits to work safely with a potentially ambiguous grammar. We introduce, for the special case of stochastic context free grammars and RNA structure modeling, an automated partial procedure for proving non-ambiguity. It is used to demonstrate non-ambiguity for several relevant grammars.

Conclusion

Our mechanical proof procedure and our testing methods provide a powerful arsenal of methods to ensure non-ambiguity.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.