BMC Bioinformatics

official impact factor 3.03

Open Access Highly Access Research article

Mining protein loops using a structural alphabet and statistical exceptionality

Leslie Regad1*, Juliette Martin2,3, Gregory Nuel4 and Anne-Claude Camproux1

Author Affiliations

1 MTi, Inserm UMR-S 973, Université Paris Diderot- Paris 7, Paris, F-75205 Cedex 13, France

2 Unite Mathématiques Informatique et Génome UR1077, INRA, Jouy-en-Josas, F-78350, France

3 Université Lyon 1, IFR 128, CNRS, UMR 5086, IBCP, Institut de Biologie et Chimie des Protéines, Lyon, F-69367, France

4 MAP5, UMR CNRS 8145, Université Paris-Descartes,Paris, F-75006, France

For all author emails, please log on.

BMC Bioinformatics 2010, 11:75 doi:10.1186/1471-2105-11-75

Published: 4 February 2010

Additional files

Additional file 1:

Supplementary. This file is a pdf file. It contains different information about: • Extraction of words of different lengths. • Comparison of the loop length distribution in loops containing all words and loops containing only words seen 30 times. • Coverage of SCOP superfamilies by recurrent words. • Correlation between sequence specificity (Zmax) and structure variability (RMSdw) for all words in Wset≥30. • Exceptionality score Lp versus frequency for the 28274 words of the data set. • Robustness of the word statistical analysis on different data sets. • ClustalW of 3SIL sequence (P29768) and homologous sequences from UniProt

Format: PDF Size: 454KB Download file

This file can be viewed with: Adobe Acrobat Reader

Open Data