Log on / register
Feedback | Support | My details
Open AccessHighly AccessMethodology article

A literature-based similarity metric for biological processes

Monica Chagoyen1,2 email, Pedro Carmona-Saez1 email, Concha Gil3,4 email, Jose M Carazo1 email and Alberto Pascual-Montano2 email

Biocomputing Unit. Centro Nacional de Biotecnologia – CSIC, Madrid, Spain

Dpto. Arquitectura de Computadores y Automatica. Universidad Complutense de Madrid, Madrid, Spain

Dpto. Microbiologia II. Facultad de Farmacia. Universidad Complutense de Madrid, Madrid, Spain

Unidad de Proteomica UCM – Parque Cientifico de Madrid, Madrid, Spain

author email corresponding author email

BMC Bioinformatics 2006, 7:363doi:10.1186/1471-2105-7-363

Published: 26 July 2006

Additional files

Additional file 1:

Histograms. This file contains three histograms corresponding to the pair-wise similarities among biological processes in the 282 subset used for validation, as obtained by literature analysis, the GO ontology structure (using Lin and Czekanowski-Dice formulae) and S. cerevisiae genome annotation.

Format: PDF Size: 20KB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional file 2:

Comparison of ontology-based similarities. This file contains the boxplot of Lin similarity along different intervals of Czekanowski-Dice similarity.

Format: PDF Size: 14KB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional file 3:

Correlation with shared genes/references. This file contains plots of the literature-based similarity against (a) the number of genes shared by any two biological processes (note than no more than 3 genes are shared by any two processes); (b) the normalised number references shared by any two biological processes.

Format: PDF Size: 21KB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional file 4:

Correlation among similarity metrics. This file contains the correlation coefficients (using Pearson, Spearman, Kendall and uncentered dot product methods) among the four similarity metrics used for the evaluation set.

Format: PDF Size: 12KB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional file 5:

Similar biological processes according to the literature. This file contains the 49 biological process pairs which are similar according to literature and similar less than average according to both ontology-based metrics. The first 10 pairs correspond to Table 1 in the full text article.

Format: PDF Size: 17KB Download file

This file can be viewed with: Adobe Acrobat Reader


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.