Open Access
Methodology article

Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures

Mikael Bodén1, Zheng Yuan2 and Timothy L Bailey2

1 School of Information Technology and Electrical Engineering, The University of Queensland, QLD 4072, St Lucia, Australia

2 Institute of Molecular Bioscience, The University of Queensland, QLD 4072, St Lucia, Australia


BMC Bioinformatics 2006, 7:68 doi:10.1186/1471-2105-7-68

Published: 14 February 2006

Abstract

Background

The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We train our methods using data derived from high-quality NMR models.
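To make the idea of a continuum secondary structure concrete, the sketch below derives per-residue state probabilities as the fraction of NMR models in which a residue adopts each state. This is an illustration under stated assumptions, not the authors' pipeline: the three-state alphabet (H, E, C), the function name `continuum_secondary_structure`, and the toy assignment strings are all hypothetical, and the per-model state strings are assumed to come from some external secondary structure assignment step.

```python
from collections import Counter

# Assumed three-state alphabet: helix (H), strand (E), coil (C).
STATES = ("H", "E", "C")


def continuum_secondary_structure(model_assignments):
    """Return one {state: probability} dict per residue.

    model_assignments holds one state string per NMR model, all of equal
    length. Each probability is the fraction of models showing that state.
    Characters outside STATES are ignored, so a row may then sum to < 1.
    """
    if not model_assignments:
        raise ValueError("need at least one model")
    length = len(model_assignments[0])
    if any(len(m) != length for m in model_assignments):
        raise ValueError("all models must cover the same residues")

    n_models = len(model_assignments)
    profile = []
    for i in range(length):
        # Count how often each state occurs at position i across models.
        counts = Counter(m[i] for m in model_assignments)
        profile.append({s: counts[s] / n_models for s in STATES})
    return profile


# Toy ensemble of four models over four residues (made-up data).
models = ["HHHC", "HHEC", "HCEC", "HHEC"]
profile = continuum_secondary_structure(models)
```

In this toy ensemble the first and last residues are unanimous across models, whereas the two middle residues receive probability mass on more than one state.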

Results

Several probabilistic models not only successfully estimate the continuum secondary structure, but also provide a categorical output on par with models directly trained on categorical data. Importantly, models trained on the continuum secondary structure are also better than their categorical counterparts at identifying the conformational state for structurally ambivalent residues.
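One simple way to obtain a categorical output from a probabilistic prediction is to pick the most probable state for each residue. The sketch below shows that reduction, together with a hypothetical criterion for flagging structurally ambivalent residues (the largest probability falls below a threshold). The function names, the threshold, and the example profile are assumptions made for illustration; the abstract does not specify how either step is carried out.

```python
def to_categorical(profile):
    """Pick the most probable state per residue.

    Ties go to the state listed first in each residue's dict.
    """
    return "".join(max(p, key=p.get) for p in profile)


def ambivalent_positions(profile, threshold=1.0):
    """Return indices of residues whose top probability is below threshold.

    With the default threshold of 1.0, any residue that is not unanimous
    counts as ambivalent. This criterion is an assumption of the sketch.
    """
    return [i for i, p in enumerate(profile) if max(p.values()) < threshold]


# Made-up continuum profile for four residues over states H, E, C.
example_profile = [
    {"H": 1.0, "E": 0.0, "C": 0.0},
    {"H": 0.75, "E": 0.0, "C": 0.25},
    {"H": 0.25, "E": 0.75, "C": 0.0},
    {"H": 0.0, "E": 0.0, "C": 1.0},
]

categorical = to_categorical(example_profile)
ambivalent = ambivalent_positions(example_profile)
```

Evaluating a predictor only on the positions returned by `ambivalent_positions` is one way to compare continuum-trained and categorically trained models on ambivalent residues.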

Conclusion

Cascaded probabilistic neural networks trained on the continuum secondary structure exhibit better accuracy in structurally ambivalent regions of proteins, while sustaining an overall classification accuracy on par with standard, categorical prediction methods.

