A literature-based similarity metric for biological processes

1 Biocomputing Unit. Centro Nacional de Biotecnologia – CSIC, Madrid, Spain
2 Dpto. Arquitectura de Computadores y Automatica. Universidad Complutense de Madrid, Madrid, Spain
3 Dpto. Microbiologia II. Facultad de Farmacia. Universidad Complutense de Madrid, Madrid, Spain
4 Unidad de Proteomica UCM – Parque Cientifico de Madrid, Madrid, Spain

BMC Bioinformatics 2006, 7:363. doi:10.1186/1471-2105-7-363

Additional files

Additional file 1: Histograms. This file contains three histograms corresponding to the pair-wise similarities among biological processes in the 282 subset used for validation, as obtained by literature analysis, the GO ontology structure (using the Lin and Czekanowski-Dice formulae) and S. cerevisiae genome annotation. Format: PDF. Size: 20 KB.

Additional file 2: Comparison of ontology-based similarities. This file contains the boxplot of Lin similarity along different intervals of Czekanowski-Dice similarity. Format: PDF. Size: 14 KB.

Additional file 3: Correlation with shared genes/references. This file contains plots of the literature-based similarity against (a) the number of genes shared by any two biological processes (note that no more than 3 genes are shared by any two processes); (b) the normalised number of references shared by any two biological processes. Format: PDF. Size: 21 KB.

Additional file 4: Correlation among similarity metrics. This file contains the correlation coefficients (using the Pearson, Spearman, Kendall and uncentered dot product methods) among the four similarity metrics used for the evaluation set. Format: PDF. Size: 12 KB.

Additional file 5: Similar biological processes according to the literature. This file contains the 49 biological process pairs that are similar according to the literature but less similar than average according to both ontology-based metrics. The first 10 pairs correspond to Table 1 in the full text article. Format: PDF. Size: 17 KB.










