Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Software

Linking Cytoscape and the corynebacterial reference database CoryneRegNet

Jan Baumbach1* and Leonard Apeltsin2

Author Affiliations

1 Center for Biotechnology, Bielefeld University, Bielefeld, Germany

2 Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA

For all author emails, please log on.

BMC Genomics 2008, 9:184  doi:10.1186/1471-2164-9-184


The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2164/9/184


Received:21 February 2008
Accepted:21 April 2008
Published:21 April 2008

© 2008 Baumbach and Apeltsin; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Recently, the research community has seen an influx of data relating to transcriptional regulatory interactions of Corynebacteria, organisms that are highly relevant to fields of systems biology, biotechnology, and human medicine. Information derived from DNA microarray experiments, computational predictions, and literature has opened the way for the graph-based analysis, visualization, and reconstruction of transcriptional regulatory networks across entire organisms. The reference database for corynebacterial gene regulatory networks CoryneRegNet provides methods for data storage and data exchange in a well-structured manner. Additional information on the model organism Escherichia coli K12 obtained from RegulonDB has been integrated. Generally, gene regulatory networks can be visualized as graphs by drawing directed edges between nodes, where a node represents a gene and an edge corresponds to a typed regulatory interaction. Cytoscape is an open-source software project whose aim is to provide graph-based visualization and analysis for biological networks. Its architecture allows the development and integration of user-made plugins to enhance core functionalities.

Results

We introduce two novel plugins for the Cytoscape environment designed to enhance in silico studies of procaryotic transcriptional regulatory networks. Our plugins leverage the information from the cornyebacterial reference database CoryneRegNet with the graph analysis capabilities of Cytoscape. CoryneRegNetLoader queries the CoryneRegNet database to extract a gene regulatory network represented as a directed graph. Additional information is stored as node/edge attributes within the graph. COMA facilitates consistency checks for gene expression studies given a gene regulatory network. COMA tests whether all gene expression levels correlate properly with the given network topology.

Conclusion

The plugins facilitate in silico studies of procaryotic transcriptional gene regulation, particularly in Corynebacteria and E. coli, by combining the knowledge from the corynebacterial reference database with the graph analysis capabilities of Cytoscape, which is one of the most-widely used tools for biological network analyses.

Background

Recently, the research community has seen an influx of data relating to transcriptional regulatory interactions of microorganisms. Information derived from DNA microarray hybridization experiments, computational predictions, and literature-based text mining has opened the way for the graph-based analysis of gene regulatory networks across entire organisms [1,2]. The large-scale reconstruction of these networks allows for in silico simulation of cell behavior in response to changing environmental conditions [3].

Corynebacteria are highly relevant to fields of systems biology, biotechnology, and human medicine [4]. The ontology-based reference database for corynebacterial transcription factors and regulatory networks CoryneRegNet [5-8] provides methods for the storage, analysis, and reconstruction of gene regulatory networks of corynebacteria. Data on the model organism Escherichia coli K12 has been imported from RegulonDB release 5.0 [9,10] and integrated together with corynebacterial data (refer to [8] for a detailed description of the integrated data). The reconstructed gene regulatory networks can be visualized as graphs by drawing typed, directed edges between nodes, where a node represents a gene and an edge corresponds to a known gene regulatory interaction (activation or repression).

Cytoscape [11-13] is an open-source software project whose aim is to provide graph-based visualization and analysis for all biological networks. Its architecture allows the development and integration of user-made plugins to enhance core functionalities. The easy to use plugin interface has encouraged the community to develop a number of Cytoscape plugins, which are publicly available at [14].

In this short article, we introduce two novel plugins for the Cytoscape environment. The plugins facilitate in silico studies of procaryotic transcriptional gene regulation, particularly in Corynebacteria and E. coli, by combining the knowledge from the corynebacterial reference database with the graph analysis capabilities of Cytoscape, which is one of the most-widely used tools for biological network analyses.

Implementation

Both plugins are implemented in Java 5 in compliance with the Cytoscape plugin interface mechanism by extending the class CytoscapePlugin and creating Java JAR files. A tutorial on Cytoscape plugins may be found at [15]. The plugins are publicly available at the Cytoscape plugin site at [14] (COMA in section 'Analysis' and CoryneRegNetLoader in section 'Network and Attribute I/O') as JAR files, which have to be copied into the Cytoscape plugins directory. They can also be installed more easily by using the 'Plugins'/'Manage plugins' menu of Cytoscape. In order to run CoryneRegNetLoader internet access is necessary. The plugins are compatible with Cytoscape version 2.5.x and have also been tested with the current beta release 2.6. The URL for a documentation and a step-by-step description of an application example is given in Availability and Requirements. The source code of both plugins is included in the JAR files.

Results and Conclusion

CoryneRegNet plugin

The CoryneRegNetLoader is able to extract transcriptional regulatory networks stored in CoryneRegNet and to import them into Cytoscape. A directed graph is then created, with genes as nodes and regulatory interactions as directed edges. For each node, CoryneRegNetLoader imports the up-to-date gene and protein annotation data, which is stored as node attributes. For each edge, the plugin retrieves the evidence of the underlying protein-DNA interaction and whether the target genes are up- or downregulated. This data is stored as edge attributes. In [16] we described the SOAP-based Web Service server of CoryneRegNet and presented an application example that shows how the community profits from an integrated visualization of corynebacterial network data. With this plugin, we take a step forward and provide an interface to a more powerful environment for integrated biological network analyses.

COMA plugin

The COMA plugin implements a method to facilitate consistency checks for gene expression studies given a gene regulatory network. It checks whether the expression levels of all genes are as expected from the network topology.

We discretize the expression levels of genes to overexpressed ('+') and underexpressed ('-') respectively. Furthermore, we do the same for the activation or repression of a target gene by a regulator. Let g ∈ {+, -} be the gene expression of a gene G. Let t ∈ {+, -} be the gene expression of a transcription factor T, which regulates G. Finally, let r ∈ {+, -} be the known gene regulatory interaction of gene G by transcription factor T. Subsequently consider the algebraic signs in the following equation:

t·g = r

If the equation is incorrect, for instance '+' · '-' = '+', this is a putative inconsistency with the given regulatory network; also refer to [5]. Based on this, for every gene G of a given microarray experiment with expression g, all transcription factors T that regulate G are considered, and for all transcription factors the expression level t and the type of the regulatory interaction r is determined. Subsequently, COMA applies the above explained inconsistency test. Table 1 summarizes the results for all possible combinations of t, g, and r.

Table 1. Potential inconsistencies for gene regulatory interactions given a gene expression study.

Although COMA was designed to work with the CoryneRegNetLoader, it can be used with any directed biological network as long as there is a Cytoscape edge attribute of type boolean, integer, or float. This attribute is used to distinguish between up- and downregulations and can be defined by the user. Gene expression values can be imported by using the Cytoscape microarray import feature, for instance. Generally, any node attribute of type integer or float can be used for COMA. Each putative inconsistent edge is listed in a result table. If the user selects a certain node-pair, both nodes including the edge between them are highlighted in the Cytoscape graph. Figure 1 show a subset of the complete gene regulatory network of C. glutamicum, extracted from CoryneRegNet by utilizing the CoryneRegNetLoader plugin. The COMA plugin has been applied to the graph by using the results of a gene expression study provided by Brune et. al [17], which is also available in Cytoscape format as Additional file 1. With this plugin, Cytoscape now provides hints for missing regulatory interactions.

Additional file 1. Microarray result file in Cytoscape format. The results of a gene expression study provided by [17] in Cytoscape format. Listed are the m-values for those genes that have been differentially expressed significantly.

Format: TXT Size: 5KB Download fileOpen Data

thumbnailFigure 1. The complete gene regulatory network of C. glutamicum visualized with Cytoscape. This figure shows parts of the complete gene regulatory network of Corynebacterium glutamicum imported by using the CoryneRegNetLoader plugin. The two highlighted nodes in yellow are those selected from the COMA results table (right). Nodes correspond to genes, edges to gene regulatory interactions. Cytoscape has been configured to visualize activations as green edges and repressions as red edges respectively. The relative gene expression levels are continously mapped to the node colors: red – underexpressed, green -overexpressed, and blue – no significant differential gene expression.

Both plugins together facilitate in silico the study and visualization of procaryotic transcriptional gene regulation, particularly in Corynebacteria and E. coli, by combining the manually curated knowledge from the CoryneRegNet database with the power of the graph analysis and visualization capabilities of Cytoscape.

Since various chemical mechanisms take part in the regulation of the genes, in a future version of the COMA plugin we plan to consider also non-transcriptional gene regulatory interactions. Additionally, we plan to extend COMA to check for coherent and incoherent feed-forward loops [18] in a given network. We will also conside further rules to validate microarray data in the context of known regulatory networks (see e.g. [19]).

Availability and Requirements

Project name: CoryneRegNetLoader and COMA

Project home page: http://cytoscape.org/plugins webcite and http://www.coryneregnet.de webcite

Documentation and application example: https://www.cebitec.uni-bielefeld.de/groups/gi/software/coryneregnet/v4/CytoscapePlugins/index.html webcite

Operating system(s): Platform independent

Programming language: Java 5

License: Academic Free License (AFL)

Any restrictions to use by non-academics: User should contact Jan.Baumbach@CeBiTec.Uni-Bielefeld.DE.

Authors' contributions

JB and LA developed and implemented the plugins. Both authors contributed to writing and both read and approved the final manuscript.

Acknowledgements

LA wishes to thank John 'Scooter' Morris and Thomas Ferrin (CGL, UCSF) for supervising, JB wishes to thank them for their cooperation and for their assistence with the Cytoscape plugin mechanism.

Furthermore, JB acknowledges Sven Rahmann (Dortmund University) and Andreas Tauch (CeBiTec, Bielefeld University) for supervising and for their help with the CoryneRegNet database. JB thanks Heladia Salgado-Osorio for providing RegulonDB data. JB also thanks the Boehringer Ingelheim Fond for partly funding the cooperation between the CeBiTec and the UCSF. LA received support from NIH grant T32 GM067547.

References

  1. Brune I, Brinkrolf K, Kalinowski J, Pühler A, Tauch A: The individual and common repertoire of DNA-binding transcriptional regulators of Corynebacterium glutamicum, Corynebacterium efficiens, Corynebacterium diphtheriae and Corynebacterium jeikeium deduced from the complete genome sequences.

    BMC Genomics 2005, 6:86. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  2. Perez-Rueda E, Collado-Vides J: The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12.

    Nucleic Acids Res 2000, 28(8):1838-1847. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  3. Herrgård MJ, Covert MW, Palsson B: Reconstruction of microbial transcriptional regulatory networks.

    Curr Opin Biotechnol 2004, 15:70-77. PubMed Abstract | Publisher Full Text OpenURL

  4. Wendisch VF, Bott M, Kalinowski J, Oldiges M, Wiechert W: Emerging Corynebacterium glutamicum systems biology.

    J Biotechnol 2006, 124:74-92. PubMed Abstract | Publisher Full Text OpenURL

  5. Baumbach J: CoryneRegNet 4.0 – A reference database for corynebacterial gene regulatory networks.

    BMC Bioinformatics 2007, 8:429. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  6. Baumbach J, Brinkrolf K, Czaja L, Rahmann S, Tauch A: CoryneRegNet: An ontology-based data warehouse of corynebacterial transcription factors and regulatory networks.

    BMC Genomics 2006, 7:24. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  7. Baumbach J, Brinkrolf K, Wittkop T, Tauch A, Rahmann S: CoryneRegNet 2: An Integrative Bioinformatics Approach for Reconstruction and Comparison of Transcriptional Regulatory Networks in Prokaryotes.

    Journal of Integrative Bioinformatics 2006, 3(2):24. OpenURL

  8. Baumbach J, Wittkop T, Rademacher K, Rahmann S, Brinkrolf K, Tauch A: CoryneRegNet 3.0-An interactive systems biology platform for the analysis of gene regulatory networks in corynebacteria and Escherichia coli.

    J Biotechnol 2007, 129(2):279-289. PubMed Abstract | Publisher Full Text OpenURL

  9. Gama-Castro S, Jiménez-Jacinto V, Peralta-Gil M, Santos-Zavaleta A, Peñaloza-Spinola MI, Contreras-Moreira B, Segura-Salazar J, Muñiz-Rascado L, Martínez-Flores I, Salgado H, Bonavides-Martínez C, Abreu-Goodger C, Rodríguez-Penagos C, Miranda-Ríos J, Morett E, Merino E, Huerta AM, Treviño-Quintanilla L, Collado-Vides J: RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation.

    Nucleic Acids Res 2008, (36 Database):D120-D124. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  10. Salgado H, Gama-Castro S, Peralta-Gil M, Diaz-Peredo E, Sanchez-Solano F, Santos-Zavaleta A, Martinez-Flores I, Jimenez-Jacinto V, Bonavides-Martinez C, Segura-Salazar J, Martinez-Antonio A, Collado-Vides J: RegulonDB (version 5.0): Escherichia coli K-12 transcriptional regulatory network, operon organization, and growth conditions.

    Nucleic Acids Res 2006, (34 Database):D394-D397. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  11. Cline MS, Smoot M, Cerami E, Kuchinsky A, Landys N, Workman C, Christmas R, Avila-Campilo I, Creech M, Gross B, Hanspers K, Isserlin R, Kelley R, Killcoyne S, Lotia S, Maere S, Morris J, Ono K, Pavlovic V, Pico AR, Vailaya A, Wang PL, Adler A, Conklin BR, Hood L, Kuiper M, Sander C, Schmulevich I, Schwikowski B, Warner GJ, Ideker T, Bader GD: Integration of biological networks and gene expression data using Cytoscape.

    Nat Protoc 2007, 2(10):2366-2382. PubMed Abstract | Publisher Full Text OpenURL

  12. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks.

    Genome Res 2003, 13(11):2498-2504. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  13. Cytoscape website [http://cytoscape.org/] webcite

  14. Cytoscape plugins website [http://cytoscape.org/plugins/] webcite

  15. Cytoscape plugin tutorial website [http://www.cytoscape.org/cgi-bin/moin.cgi/Cytoscape Plugin Tutorial] webcite

  16. Neuweger H, Baumbach J, Albaum S, Bekel T, Dondrup M, Hueser A, Kalinowski J, Oehm S, Puehler A, Rahmann S, Weile J, Goesmann A: CoryneCenter – An online resource for the integrated analysis of corynebacterial genome and transcriptome data.

    BMC Syst Biol 2007, 1:55. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  17. Brune I, Werner H, Huser A, Kalinowski J, Puhler A, Tauch A: The DtxR protein acting as dual transcriptional regulator directs a global regulatory network involved in iron metabolism of Corynebacterium glutamicum.

    BMC Genomics 2006, 7:21. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  18. Mangan S, Alon U: Structure and function of the feed-forward loop network motif.

    Proc Natl Acad Sci USA 2003, 100(21):11980-11985. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  19. Gutiérrez-Ríos RM, Rosenblueth DA, Loza JA, Huerta AM, Glasner JD, Blattner FR, Collado-Vides J: Regulatory network of Escherichia coli: consistency between literature knowledge and microarray profiles.

    Genome Res 2003, 13(11):2435-2443. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL