Log on / register
Feedback | Support | My details
Open AccessHighly AccessResearch article

NLStradamus: a simple Hidden Markov Model for nuclear localization signal prediction

Alex N Nguyen Ba1,2 email, Anastassia Pogoutse1 email, Nicholas Provart1,2 email and Alan M Moses1,2 email

1Department of Cell & Systems Biology, University of Toronto, 25 Willcocks Street, Toronto, Canada

2Centre for the Analysis of Genome Evolution and Function, University of Toronto, 25 Willcocks Street, Toronto, Canada

author email corresponding author email

BMC Bioinformatics 2009, 10:202doi:10.1186/1471-2105-10-202

Published: 29 June 2009

Abstract

Background

Nuclear localization signals (NLSs) are stretches of residues within a protein that are important for the regulated nuclear import of the protein. Of the many import pathways that exist in yeast, the best characterized is termed the 'classical' NLS pathway. The classical NLS contains specific patterns of basic residues and computational methods have been designed to predict the location of these motifs on proteins. The consensus sequences, or patterns, for the other import pathways are less well-understood.

Results

In this paper, we present an analysis of characterized NLSs in yeast, and find, despite the large number of nuclear import pathways, that NLSs seem to show similar patterns of amino acid residues. We test current prediction methods and observe a low true positive rate. We therefore suggest an approach using hidden Markov models (HMMs) to predict novel NLSs in proteins. We show that our method is able to consistently find 37% of the NLSs with a low false positive rate and that our method retains its true positive rate outside of the yeast data set used for the training parameters.

Conclusion

Our implementation of this model, NLStradamus, is made available at: http://www.moseslab.csb.utoronto.ca/NLStradamus/ webcite


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.