This article is part of the supplement: ACM Conference on Bioinformatics, Computational Biology and Biomedicine 2011
Ab initio detection of fuzzy amino acid tandem repeats in protein sequences
1 Istituto di Informatica e Telematica, CNR - Consiglio Nazionale delle Ricerche, Pisa I-56124, Italy
2 Dipartimento di Ingegneria dell'Informazione, Università di Pisa, Pisa I-56122, Italy
BMC Bioinformatics 2012, 13(Suppl 3):S8 doi:10.1186/1471-2105-13-S3-S8Published: 21 March 2012
Tandem repetitions within protein amino acid sequences often correspond to regular secondary structures and form multi-repeat 3D assemblies of varied size and function. Developing internal repetitions is one of the evolutionary mechanisms that proteins employ to adapt their structure and function under evolutionary pressure. While there is keen interest in understanding such phenomena, detection of repeating structures based only on sequence analysis is considered an arduous task, since structure and function is often preserved even under considerable sequence divergence (fuzzy tandem repeats).
In this paper we present PTRStalker, a new algorithm for ab-initio detection of fuzzy tandem repeats in protein amino acid sequences. In the reported results we show that by feeding PTRStalker with amino acid sequences from the UniProtKB/Swiss-Prot database we detect novel tandemly repeated structures not captured by other state-of-the-art tools. Experiments with membrane proteins indicate that PTRStalker can detect global symmetries in the primary structure which are then reflected in the tertiary structure.
PTRStalker is able to detect fuzzy tandem repeating structures in protein sequences, with performance beyond the current state-of-the art. Such a tool may be a valuable support to investigating protein structural properties when tertiary X-ray data is not available.