Automatic discovery of cross-family sequence features associated with protein function
-
* Corresponding author: Robert M MacCallum maccallr@sbc.su.se
- Equal contributors
1 Stockholm Bioinformatics Center, Stockholm University, 106 91 Stockholm, Sweden
2 Bioinformatics Research Center, University of Aarhus, 8000 Aarhus C, Denmark
3 Division of Cell and Molecular Biology, Imperial College London, London SW7 2AZ, UK
BMC Bioinformatics 2006, 7:16 doi:10.1186/1471-2105-7-16
Published: 12 January 2006Additional files
Additional File 1:
As in Figure 2 of the article, the predictors are clustered with a 8 × 8 Kohonen Self-Organising Map (SOM). In this figure, the evolved annotation word boolean expression (from the annotation_classifier subroutine) are shown in full for each of the 500 evolved function predictors. Each boolean expression is separated by a semicolon. The A-type predictors are shown with upper case to identify them.
Format: PDF Size: 304KB Download file
This file can be viewed with: Adobe Acrobat Reader
Additional File 2:
Here we show the full list of the 150 most common annotation words after manual filtering. The filtering is performed in order to remove stopwords and words that do not contain any information about protein function. The filtered words are shown with strikethrough text.
Format: HTML Size: 13KB Download file
Additional File 3:
Additional supplementary material (includes files 1 and 2); gzipped tar archive; after unpacking, please open the file brameier2005/index.html in your web browser.
Format: GZ Size: 715KB Download file
