Benchmarking pKa prediction
- Equal contributors
1 Edward Jenner Institute for Vaccine Research, Compton, Berkshire, RG20 7NN, UK
2 School of Crystallography, Birkbeck College, Malet Street, London WC1E 7HX, UK
BMC Biochemistry 2006, 7:18 doi:10.1186/1471-2091-7-18Published: 2 June 2006
pKa values are a measure of the protonation of ionizable groups in proteins. Ionizable groups are involved in intra-protein, protein-solvent and protein-ligand interactions as well as solubility, protein folding and catalytic activity. The pKa shift of a group from its intrinsic value is determined by the perturbation of the residue by the environment and can be calculated from three-dimensional structural data.
Here we use a large dataset of experimentally-determined pKas to analyse the performance of different prediction techniques. Our work provides a benchmark of available software implementations: MCCE, MEAD, PROPKA and UHBD. Combinatorial and regression analysis is also used in an attempt to find a consensus approach towards pKa prediction. The tendency of individual programs to over- or underpredict the pKa value is related to the underlying methodology of the individual programs.
Overall, PROPKA is more accurate than the other three programs. Key to developing accurate predictive software will be a complete sampling of conformations accessible to protein structures.