Mining protein loops using a structural alphabet and statistical exceptionality
-
* Corresponding author: Leslie Regad leslie.regad@univ-paris-diderot.fr
1 MTi, Inserm UMR-S 973, Université Paris Diderot- Paris 7, Paris, F-75205 Cedex 13, France
2 Unite Mathématiques Informatique et Génome UR1077, INRA, Jouy-en-Josas, F-78350, France
3 Université Lyon 1, IFR 128, CNRS, UMR 5086, IBCP, Institut de Biologie et Chimie des Protéines, Lyon, F-69367, France
4 MAP5, UMR CNRS 8145, Université Paris-Descartes,Paris, F-75006, France
BMC Bioinformatics 2010, 11:75 doi:10.1186/1471-2105-11-75
Published: 4 February 2010Additional files
Additional file 1:
Supplementary. This file is a pdf file. It contains different information about: • Extraction of words of different lengths. • Comparison of the loop length distribution in loops containing all words and loops containing only words seen 30 times. • Coverage of SCOP superfamilies by recurrent words. • Correlation between sequence specificity (Zmax) and structure variability (RMSdw) for all words in Wset≥30. • Exceptionality score Lp versus frequency for the 28274 words of the data set. • Robustness of the word statistical analysis on different data sets. • ClustalW of 3SIL sequence (P29768) and homologous sequences from UniProt
Format: PDF Size: 454KB Download file
This file can be viewed with: Adobe Acrobat Reader
