Denoising inferred functional association networks obtained by gene fusion analysis1 Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK 2 Computational Genomics Unit at the Center for Research & Technology Hellas, GR-57001 Thessalonica, Greece 3 Institute of Agrobiotechnology, Center for Research & Technology Hellas, GR-57001 Thessalonica, Greece 4 The Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK 5 Max-Planck Institute for Molecular Genetics, Ihnestrasse 63-73, D-14195 Berlin, Germany 6 DOE Joint Genome Institute, 2800 Mitchel Drive, Walnut Creek CA 94598, USA 7 Centre for Bioinformatics, School of Physical Sciences and Engineering, King's College London, Strand, London WC2R 2LS, UK
BMC Genomics 2007, 8:460doi:10.1186/1471-2164-8-460
Additional filesAdditional File 1: Dependence of fraction of composite (fused) genes on genome size. The x-axis represents genome size (number of genes) for a given query species, the y-axis represents the fraction of composite genes in that species. Color coding as in Figure 3. Format: PDF Size: 67KB Download file This file can be viewed with: Adobe Acrobat Reader Additional File 2: a) Dependence of fraction of composite (fused) genes on fraction of component (un-fused) genes, as a ratio of unique instances over total number of genes, per genome. b) Distribution of fraction of distinct interacting proteins in the three domains of life (color coding as in Figure 3) and all domains (grey). Format: PDF Size: 67KB Download file This file can be viewed with: Adobe Acrobat Reader Additional File 3: Distribution of slopes (relative rank order of reference species and number of contributed interactions) across genomes (see Figure 3c for an example). a) Distribution of slope values for all species; 25 species are excluded with slopes>12. b) Plot of slopes versus genome size. The x-axis represents the slope for a given species, the y-axis represents genome size (total number of genes) for that species. Both axes are shown on a logarithmic scale. Grey boxes represent the 25 species with slopes>12. Format: PDF Size: 33KB Download file This file can be viewed with: Adobe Acrobat Reader Additional File 4: Distribution of composites across 184 genomes. Format: PDF Size: 2.4MB Download file This file can be viewed with: Adobe Acrobat Reader Additional File 5: Distribution of interactions across 184 genomes. Format: PDF Size: 2.4MB Download file This file can be viewed with: Adobe Acrobat Reader Additional File 6: Analysis of predicted interactions from Chlamydia trachomatis. Examples of previously characterised interactions in C. trachomatis that are detected by gene fusion analysis. Cases in bold are presumed as known; cases in purple are artifacts from genomic paralogy. Format: DOC Size: 46KB Download file This file can be viewed with: Microsoft Word Viewer Additional File 7: Examples of predicted interactions from Arabidopsis thaliana. The species of origin is also given for the composite proteins. Format: DOC Size: 24KB Download file This file can be viewed with: Microsoft Word Viewer Additional File 8: Inferred interactions in this study. The data forms part of the COGENT++ data environment.[36]. Format: GZ Size: 10MB Download file |




on Google Scholar







author email
corresponding author email