Prediction of twin-arginine signal peptides
1 Center for Biological Sequence Analysis BioCentrum-DTU Building 208 Technical University of Denmark DK-2800 Lyngby, Denmark
2 Department of Molecular Microbiology John Innes Centre Norwich NR4 7UH UK
3 School of Biological Sciences University of East Anglia Norwich NR4 7TJ UK
BMC Bioinformatics 2005, 6:167 doi:10.1186/1471-2105-6-167Published: 2 July 2005
Proteins carrying twin-arginine (Tat) signal peptides are exported into the periplasmic compartment or extracellular environment independently of the classical Sec-dependent translocation pathway. To complement other methods for classical signal peptide prediction we here present a publicly available method, TatP, for prediction of bacterial Tat signal peptides.
We have retrieved sequence data for Tat substrates in order to train a computational method for discrimination of Sec and Tat signal peptides. The TatP method is able to positively classify 91% of 35 known Tat signal peptides and 84% of the annotated cleavage sites of these Tat signal peptides were correctly predicted. This method generates far less false positive predictions on various datasets than using simple pattern matching. Moreover, on the same datasets TatP generates less false positive predictions than a complementary rule based prediction method.
The method developed here is able to discriminate Tat signal peptides from cytoplasmic proteins carrying a similar motif, as well as from Sec signal peptides, with high accuracy. The method allows filtering of input sequences based on Perl syntax regular expressions, whereas hydrophobicity discrimination of Tat- and Sec-signal peptides is carried out by an artificial neural network. A potential cleavage site of the predicted Tat signal peptide is also reported. The TatP prediction server is available as a public web server at http://www.cbs.dtu.dk/services/TatP/ webcite.