PAUL: protein structural alignment using integer linear programming and Lagrangian relaxation

Wohlers, Inken; Petzold, Lars; Domingues, Francisco S; Klau, Gunnar W

doi:10.1186/1471-2105-10-S13-P2

Volume 10 Supplement 13

Highlights from the Fifth International Society for Computational Biology (ISCB) Student Council Symposium

Poster presentation
Open access
Published: 19 October 2009

BMC Bioinformatics volume 10, Article number: P2 (2009)

Background

Protein structural alignment determines a three-dimensional superposition of protein structures by aligning the proteins' residues. It is a basic method for identifying proteins of related structure or common evolutionary origin and for measuring three-dimensional similarity. Applications include searching for proteins with similar biological function and classifying proteins by their structural features.

Methods

We present a structural alignment approach that computes an alignment based on the proteins' inter-residue distances. Building upon work on the alignment of protein contact maps by Caprara et al. [1], we use these distances to formulate the problem as an integer linear program, which is subsequently solved using Lagrangian relaxation. One advantage of the integer linear programming formulation over heuristic methods is that in many cases the computed alignments are provably optimal. The bottleneck of the integer linear programming approach is its computational complexity, which does not allow all inter-residue distances to be incorporated in the problem description. We therefore select and score inter-residue distances efficiently. We develop and optimize a scoring function inspired by Holm and Sander [2] using a set of 200 pairwise HOMSTRAD [3] alignments with a sequence identity of less than 35%. Subsequently, we use this scoring function to assess the performance of PAUL on the more challenging SISY data set of 130 alignments [4, 5]. On this data set we compare PAUL alignments to alignments computed by MATRAS [6], DALI [2], FATCAT [7], SHEBA [8], CA [9] and CE [10].
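To illustrate what scoring an alignment by inter-residue distances can look like, the following minimal Python sketch evaluates a fixed alignment with an elastic similarity term in the style of Holm and Sander [2]. It is not PAUL's scoring function, which this abstract does not specify. The function names, the parameter values (theta = 0.2, alpha = 20 Å, as commonly quoted for DALI) and the toy coordinates are assumptions made for illustration only.

```python
import math

def distance_matrix(coords):
    """Pairwise Euclidean distances between residue coordinates (e.g. C-alpha atoms)."""
    return [[math.dist(p, q) for q in coords] for p in coords]

def elastic_score(dist_a, dist_b, alignment, theta=0.2, alpha=20.0):
    """Sum a DALI-style elastic term over all pairs of aligned residue pairs.

    alignment: list of (i, k) tuples, residue i of protein A aligned to residue k of B.
    theta, alpha: illustrative values, NOT PAUL's optimized parameters.
    """
    total = 0.0
    for i, k in alignment:
        for j, l in alignment:
            d_a, d_b = dist_a[i][j], dist_b[k][l]
            mean = 0.5 * (d_a + d_b)
            if i == j or mean == 0.0:
                total += theta  # same residue pair (or coincident atoms): no deviation
                continue
            weight = math.exp(-(mean / alpha) ** 2)  # down-weight long distances
            total += (theta - abs(d_a - d_b) / mean) * weight
    return total

# Toy example: three residues on a line; structure b has one displaced residue.
a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
b = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (15.0, 0.0, 0.0)]
identity = [(0, 0), (1, 1), (2, 2)]
print(elastic_score(distance_matrix(a), distance_matrix(a), identity))
print(elastic_score(distance_matrix(a), distance_matrix(b), identity))
```

Each term of the double sum depends on two aligned residue pairs at once. This pairwise dependence is what makes finding the best alignment hard, and it is the kind of structure an integer linear programming formulation has to capture. The sketch only scores a given alignment; it does not search for an optimal one.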

Results and conclusion

Our novel, non-heuristic structural alignment algorithm is flexible and mathematically sound. On the SISY data set, PAUL alignments show higher mean and median alignment accuracies than those of all other methods (see Figure 1). In more than 30% of the cases, PAUL is the most accurate method. PAUL is thus competitive with other state-of-the-art algorithms and a beneficial tool for high-quality pairwise structural alignment.

References

1. Caprara A, Carr R, Istrail S, Lancia G, Walenz B: 1001 optimal PDB structure alignments: integer programming methods for finding the maximum contact map overlap. J Comput Biol 2004, 11(1):27–52. doi:10.1089/106652704773416876
2. Holm L, Sander C: Protein structure comparison by alignment of distance matrices. J Mol Biol 1993, 233(1):123–138. doi:10.1006/jmbi.1993.1489
3. Mizuguchi K, Deane CM, Blundell TL, Overington JP: HOMSTRAD: a database of protein structure alignments for homologous families. Protein Sci 1998, 7(11):2469–2471. doi:10.1002/pro.5560071126
4. Andreeva A, Prlic A, Hubbard TJ, Murzin AG: Sisyphus-structural alignments for proteins with non-trivial relationships. Nucleic Acids Res 2007, 35(Database issue):D253–D259. doi:10.1093/nar/gkl746
5. Mayr G, Domingues FS, Lackner P: Comparative analysis of protein structure alignments. BMC Struct Biol 2007, 7:50. doi:10.1186/1472-6807-7-50
6. Kawabata T: MATRAS: a program for protein 3D structure comparison. Nucleic Acids Res 2003, 31(13):3367–3369. doi:10.1093/nar/gkg581
7. Ye Y, Godzik A: Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics 2003, 19(Suppl 2):ii246–ii255.
8. Jung J, Lee B: Protein structure alignment using environmental profiles. Protein Eng 2000, 13(8):535–543. doi:10.1093/protein/13.8.535
9. Bachar O, Fischer D, Nussinov R, Wolfson H: A computer vision based technique for 3-D sequence-independent structural comparison of proteins. Protein Eng 1993, 6(3):279–288. doi:10.1093/protein/6.3.279
10. Shindyalov IN, Bourne PE: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng 1998, 11(9):739–747. doi:10.1093/protein/11.9.739

Author information

Authors and Affiliations

Life Sciences Group, Centrum Wiskunde & Informatica, Science Park 123, 1098 XG, Amsterdam, the Netherlands
Inken Wohlers & Gunnar W Klau
Mathematics in Life Sciences Group, Freie Universität Berlin, Arnimallee 6, 14195, Berlin, Germany
Lars Petzold
Computational Biology and Applied Algorithmics Group, Max-Planck-Institut für Informatik, 66123, Saarbrücken, Germany
Francisco S Domingues

Corresponding author

Correspondence to Inken Wohlers.

Rights and permissions

Open Access. This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Wohlers, I., Petzold, L., Domingues, F.S. et al. PAUL: protein structural alignment using integer linear programming and Lagrangian relaxation. BMC Bioinformatics 10 (Suppl 13), P2 (2009). https://doi.org/10.1186/1471-2105-10-S13-P2
