Log on / register
Feedback | Support | My details
Open AccessResearch article

Denoising inferred functional association networks obtained by gene fusion analysis

Atanas Kamburov1,5 email, Leon Goldovsky1,2,3 email, Shiri Freilich1,2,3 email, Aliki Kapazoglou3 email, Victor Kunin1,6 email, Anton J Enright4 email, Athanasios Tsaftaris3 email and Christos A Ouzounis1,2,3,7 email

Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK

Computational Genomics Unit at the Center for Research & Technology Hellas, GR-57001 Thessalonica, Greece

Institute of Agrobiotechnology, Center for Research & Technology Hellas, GR-57001 Thessalonica, Greece

The Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK

Max-Planck Institute for Molecular Genetics, Ihnestrasse 63-73, D-14195 Berlin, Germany

DOE Joint Genome Institute, 2800 Mitchel Drive, Walnut Creek CA 94598, USA

Centre for Bioinformatics, School of Physical Sciences and Engineering, King's College London, Strand, London WC2R 2LS, UK

author email corresponding author email

BMC Genomics 2007, 8:460doi:10.1186/1471-2164-8-460

Published: 14 December 2007

Additional files

Additional File 1:

Dependence of fraction of composite (fused) genes on genome size. The x-axis represents genome size (number of genes) for a given query species, the y-axis represents the fraction of composite genes in that species. Color coding as in Figure 3.

Format: PDF Size: 67KB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional File 2:

a) Dependence of fraction of composite (fused) genes on fraction of component (un-fused) genes, as a ratio of unique instances over total number of genes, per genome. b) Distribution of fraction of distinct interacting proteins in the three domains of life (color coding as in Figure 3) and all domains (grey).

Format: PDF Size: 67KB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional File 3:

Distribution of slopes (relative rank order of reference species and number of contributed interactions) across genomes (see Figure 3c for an example). a) Distribution of slope values for all species; 25 species are excluded with slopes>12. b) Plot of slopes versus genome size. The x-axis represents the slope for a given species, the y-axis represents genome size (total number of genes) for that species. Both axes are shown on a logarithmic scale. Grey boxes represent the 25 species with slopes>12.

Format: PDF Size: 33KB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional File 4:

Distribution of composites across 184 genomes.

Format: PDF Size: 2.4MB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional File 5:

Distribution of interactions across 184 genomes.

Format: PDF Size: 2.4MB Download file

This file can be viewed with: Adobe Acrobat Reader

Additional File 6:

Analysis of predicted interactions from Chlamydia trachomatis. Examples of previously characterised interactions in C. trachomatis that are detected by gene fusion analysis. Cases in bold are presumed as known; cases in purple are artifacts from genomic paralogy.

Format: DOC Size: 46KB Download file

This file can be viewed with: Microsoft Word Viewer

Additional File 7:

Examples of predicted interactions from Arabidopsis thaliana. The species of origin is also given for the composite proteins.

Format: DOC Size: 24KB Download file

This file can be viewed with: Microsoft Word Viewer

Additional File 8:

Inferred interactions in this study. The data forms part of the COGENT++ data environment.[36].

Format: GZ Size: 10MB Download file


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.