Email updates

Keep up to date with the latest news and content from BMC Genomics and BioMed Central.

Open Access Highly Accessed Database

GiSAO.db: a database for ageing research

Edith Hofer12, Gerhard T Laschober3, Matthias Hackl4, Gerhard G Thallinger1, Günter Lepperdinger3, Johannes Grillari4, Pidder Jansen-Dürr3 and Zlatko Trajanoski2*

Author Affiliations

1 Institute for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, 8010 Graz, Austria

2 Division for Bioinformatics, Biocenter, Innsbruck Medical University, Schöpfstrasse 45, 6020 Innsbruck, Austria

3 Institute for Biomedical Aging Research, Austrian Academy of Sciences, Rennweg 10, 6020 Innsbruck, Austria

4 Aging and Immortalization Research, Department of Biotechnology, University of Natural Resources and Applied Life Sciences Vienna, Muthgasse 18, 1190 Vienna, Austria

For all author emails, please log on.

BMC Genomics 2011, 12:262  doi:10.1186/1471-2164-12-262

The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2164/12/262


Received:16 February 2011
Accepted:24 May 2011
Published:24 May 2011

© 2011 Hofer et al; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Age-related gene expression patterns of Homo sapiens as well as of model organisms such as Mus musculus, Saccharomyces cerevisiae, Caenorhabditis elegans and Drosophila melanogaster are a basis for understanding the genetic mechanisms of ageing. For an effective analysis and interpretation of expression profiles it is necessary to store and manage huge amounts of data in an organized way, so that these data can be accessed and processed easily.

Description

GiSAO.db (Genes involved in senescence, apoptosis and oxidative stress database) is a web-based database system for storing and retrieving ageing-related experimental data. Expression data of genes and miRNAs, annotation data like gene identifiers and GO terms, orthologs data and data of follow-up experiments are stored in the database. A user-friendly web application provides access to the stored data. KEGG pathways were incorporated and links to external databases augment the information in GiSAO.db. Search functions facilitate retrieval of data which can also be exported for further processing.

Conclusions

We have developed a centralized database that is very well suited for the management of data for ageing research. The database can be accessed at https://gisao.genome.tugraz.at webcite and all the stored data can be viewed with a guest account.

Background

Accumulated cell damage is one of the main perpetrators of ageing. The damage is caused by a variety of different factors and conditions, including somatic mutations, mitochondrial dysfunction and oxidative stress [1,2]. If damage of cellular components (proteins, nucleic acids, lipids, etc.) remains permanently and is not corrected by repair systems (e.g. DNA repair or the elimination of damaged organelles and proteins), then cellular senescence and/or apoptosis is occuring. Cellular senescence and apoptosis contribute to a characteristic ageing phenotype as well as to the development of age-related diseases [3,4]. Since the underlying mechanisms of cellular ageing, leading to senescence or perhaps to apoptosis, have not yet been fully revealed, it is indispensable to identify and study genes and miRNAs which are involved in the ageing process. An effective way to determine these genes and gene-regulatory miRNAs are genome-wide studies of expression patterns, as it is well known that expression profiles of organisms change with age [2,5]. Microarrays are well suited for this task as they are a high-throughput method for determining the expression of tens of thousands of genes in parallel [6]. Results of microarray experiments are usually validated by applying low-throughput methods for measuring gene expression, e.g. qPCR or Northern blots [7], or confirmed with protein assays like Western blots.

Genetic research into human ageing is supported by investigation of ageing in various model organisms, such as Mus musculus, Saccharomyces cerevisiae, Caenorhabditis elegans and Drosophila melanogaster. These organisms have a much shorter lifespan than humans and can be easily genetically manipulated for experimental purposes. The results obtained from model organisms can be transferred to a certain extent to Homo sapiens, since these organisms share orthologous genes [8]. However, in order to effectively analyse data generated in various experiments using different organisms, it is necessary to structure and manage this data in an organized way.

Therefore, several publicly available databases which store ageing specific gene information were developed: the Human Aging Genomic Resources (HAGR) [9], the Gene Aging Nexus (GAN) [10], the Aging Gene Database [11], the Atlas of Gene Expression in Mouse Aging Project (AGEMAP) [12], and the NetAge database [13]. However, to the best of our knowledge, there is no database which contains microarray gene expression data together with orthologous genes, ageing-related microarray miRNA expression data as well as data of follow-up experiments. We have therefore initiated the development of a database GiSAO.db (Genes involved in senescence, apoptosis and oxidative stress) to support ongoing and future studies in experimental ageing research.

Construction and content

GiSAO.db is a database for storing and managing expression data of genes involved in senescence, apoptosis and oxidative stress. It is connected to a web application which provides an easy and controlled access to this data. Specifically, the database is capable of storing four data types: expression data, annotation data, orthologous data and data of follow-up experiments (Figure 1).

thumbnailFigure 1. GiSAO overview. Four different types of data are stored in the database: gene expression data obtained from Affymetrix microarray experiments and miRNA expression data from Exiqon microarray experiments, annotations for genes and miRNAs including GO terms, orthologs data provided by Affymetrix as well as manually entered verified orthologs data and experimental data of follow-up experiments.

Expression data

Normalized gene and miRNA expression values obtained from microarray experiments investigating ageing reside in GiSAO.db. It is possible to store gene expression data from Affymetrix one-colour microarrays as well as miRNA expression data from Exiqon two-colour microarrays in the database.

Annotation data

For genes, several gene identifiers are available as annotation in GiSAO.db: GeneSymbol, Refseq Id, Gene Name, EntrezGene Id, UniProt Id, UniGene Id, SGD Id, MGI Id, FlyBase Id, RGD Id and AGI Id. These identifiers were obtained together with Gene Ontology (GO) [14] terms from Affymetrix which provides annotation data of each spot on a microarray chip [15]. In case of miRNAs, the miRNA name and the miRBase Id [16] are stored as annotations in the database.

Orthologs data

Two types of orthologous data are included in GiSAO.db: orthologs provided by Affymetrix and verified orthologs data. Pairs of orthologs probe sets from different Affymetrix microarray chips are stored. Moreover, verified orthologous gene pairs between different species retrieved from other sources, such as literature or orthologs databases can be entered manually.

Experimental data

Finally, GiSAO.db provides facilities to store data of follow-up experiments. An experiment is specified by properties such as type (e.g. qPCR, Western blot), classification (e.g. senescence, inflammation), organism and cell type. All genes that were investigated can be linked to the experiment and antibodies or primers can be defined. Moreover, it is possible to specify references, and protocols as well as result files may be uploaded and attached to experiments or their associated genes.

Database content

GiSAO.db provides annotation data for five Affymetrix microarrays: Human Genome U133 Plus 2.0 Array, Mouse Genome 430 2.0 Array, Yeast Genome 2.0 Array, C. elegans Genome Array and Drosophila Genome Array. Furthermore, annotations for two custom made human Exiqon miRNA microarrays are available. The database contains orthologs provided by Affymetrix between Homo sapiens, Mus musculus, Saccharomyces cerevisiae, Caenorhabditis elegans and Drosophila melanogaster. Currently GiSAO.db stores gene expression values of 11 experiments comprising 111 Affymetrix microarrays of three different species: Homo sapiens, Mus musculus and Saccharomyces cerevisae. Additionally there are 7 human miRNA experiments with 40 Exiqon microarrays stored in the database. Moreover, numerous verified orthologs and data of several follow-up experiments are available in GiSAO.db.

Normalization of expression data obtained from Affymetrix microarray experiments was performed using the gcrma algorithm [17] in CARMAweb [18]. Independently of the particular experiments, data of a certain cell type, e.g. HUVEC or PFF, were normalized together.

The statistical framework R [19] and the Bioconductor package limma [20] were used for background correction of miRNA expression data with the normexp algorithm as well as lowess normalization [21].

Implementation

The GiSAO.db database system was developed using the object-oriented and platform independent Java programming language [22]. Based on the Java Enterprise Edition (Java EE) platform [23], a three-tier application composed of a relational database, business logic and presentation layer was implemented.

In order to control data access and manage user data, the web application offers an integrated authentication and authorization system [24].

Utility

User Interface

The database system offers a user-friendly web interface which facilitates data input and retrieval. Results of microarray experiments can be viewed in detail as both the expression value of each spot for one-colour microarray chips, and the expression ratio between the colour channels for two-colour microarrays are displayed. Expression values and ratios are represented by colour-coded boxes which facilitate the determination of highly expressed genes, up- or down-regulated miRNAs and the comparison of expression values and ratios of different microarrays (Figure 2). The values and ratios can be displayed in a logarithmic or decimal scale, and a threshold can be defined to show only those genes or miRNAs whose expression values or ratios exceed the defined cut-off.

thumbnailFigure 2. Display of expression values obtained from Affymetrix one-colour arrays. The values are colour coded to facilitate the identification of prominent expression values and comparison between values of a gene of different arrays. Additionally, tags are displayed for a quick overview of experimental data. In this case, for the gene THBS a follow-up experiment was performed with the organism Homo sapiens (HS), experiment classification oxidative stress (OS) and experiment type qPCR (QP).

Pairs of orthologous genes can be retrieved using a simple search function which returns Affymetrix orthologs as well as verified orthologs. For experimental data of follow-up experiments, the application provides a flexible query mechanism which accepts organism, experiment classification and experiment type as parameters. Additionally, tags are displayed in gene lists to provide basic information about the different experiments performed on a gene at first glance. Tags are essentially shortcuts describing experiment classification, experiment type and organism, referencing experimental data (Figure 2). Genes or miRNAs which are of special interest for users can be assembled to favourite lists and furnished with additional information. These lists can be compared to check for common entries (Figure 3).

thumbnailFigure 3. Favourite gene list comparison. Genes of interest can be collected in lists and two or more favourite lists can be compared to determine common genes.

A comprehensive search function that takes gene and miRNA Ids as parameters provides access to all data about a gene or miRNA in GiSAO.db. The search yields expression data, annotation data, orthologs, experimental data tags and favourite lists of the specified gene or miRNA (Figure 4). An export mechanism which enables further processing of data from the database in external tools is seamlessly included into GiSAO.db. Lists of expression values, favorite genes and orthologs can be written to plain text files, PDF files or files in comma separated values (CSV) format.

thumbnailFigure 4. Gene search result. For the queried gene all the information stored in the database, i.e. tags, gene expression data, annotation data and orthologs data are listed.

Identifiers of genes and miRNAs as well as GO terms are provided to attribute a meaning to the probe (set) identifiers of the microarray spots. To enhance stored components with additional information, links to external databases are offered. Gene Ids are linked to their respective databases, e.g. RefSeq, Entrez Gene or UniProt, and to the ortholog databases HomoloGene [25] and InParanoid [26]. Moreover, pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) [27] can be accessed in GiSAO.db via a web service provided by KEGG.

Data Submission

There are two ways of entering data in the database: manual data input using web forms or data upload from files. Forms are enhanced by a data dictionary concept, which extends the functionality of select fields, facilitates data input, and prevents inconsistencies in the database caused by spelling mistakes or duplicate entries.

Verified orthologs and experimental data can be entered in GiSAO.db using specific upload forms. Experimental data can be additionally uploaded from files as well as favorite lists, expression data, annotation data and Affymetrix ortholog data to accelerate the data input process. Affymetrix and Exiqon annotation data of microarray chips as well as Affymetrix ortholog data are updated on a regular basis by the producers. To adopt these changes in GiSAO.db, update functions have been implemented.

Moreover, several protocols or result files of follow-up experiments can be uploaded at once by using a Java applet. For each file upload, a feedback report is created which allows the user to check whether the upload was successful and view error messages in case something went wrong.

Authentication and authorization

To protect the data in the database fine grained access rights were defined in the integrated authentication and authorization system [24]. Three different user roles can access GiSAO.db: administrator, user and guest. An "administrator" can add, edit and delete all data. A "user" is allowed to add data, edit and delete his/her own data or data belonging to a member of the user's institute. Finally, a user assigned the "guest" role may view all the data stored in the database but has no rights to add, edit or delete any data.

Case study

Expression data of 47 human Affymetrix microarrays stored in GiSAO.db were analyzed. The experiments were performed on different cell types and were divided into two different groups: premature senescence induced by oxidative stress and other senescence models. The derived expression profiles were compared to determine conserved patterns in the various cellular aging models. As a result 484 genes associated with oxidative stress-induced senescence, 1087 genes associated with other senescence models, and 155 genes which seem to play a role in both experimental settings were determined [28]. This information guided the selection of 93 candidate genes which were tested for their ability to modulate lifespan in a unicellular model system (yeast chronological lifespan) to study organismic ageing [28]. Thereby, several new pathways which may be important in cellular senescence were identified [28]. Additionally, GiSAO.db was used supporting another study investigating the contribution of miRNAs in ageing [21].

Up to date 87 expression data sets stored in GiSAO.db have also been published in the public repository ArrayExpress [29]: 51 Affymetrix arrays from 6 experiments (E-MEXP-2283, E-MEXP-2285, E-MEXP-2167, E-MEXP-2345, E-MEXP-1506 and E-MEXP-2683) as well as 36 Exiqon arrays from 6 experiments (E-MEXP-2386, E-MEXP-2425, E-MEXP-2393, E-MEXP-2398, E-MEXP-2455, E-MEXP-2459).

Discussion

GiSAO.db is a database for the storage and management of ageing-related data. The core of GiSAO.db consists of normalized gene expression and miRNA expression data retrieved from microarray experiments. Annotation data like gene or miRNA identifiers as well as GO terms are available to interpret the expression profiles. As many of the orthologs provided by Affymetrix are only predicted ones, a manual curation of verified orthologs was implemented. The orthologs in the database facilitate cross-species comparison of expression profiles and the detection of evolutionary conserved expression patterns. Data of follow-up experiments, e.g. qPCR or Western blot experiments complement the microarray expression data. For a quick overview on these follow-up experiments performed for a specific gene, tags which serve as links to experimental data can be added.

Additionally, links to external gene, miRNA and ortholog databases are offered as well as KEGG pathways. Data upload and update is performed asynchronously, meaning that GISAO.db can be used while the upload takes place. In web forms data dictionary fields support controlled, yet customizable data input to keep the database content consistent. Search functions deliver data from the database which can then be exported in various file formats that are suitable for direct import in programs like MS Excel that are used for further processing of the data.

Furthermore, genes or miRNAs of interest can be grouped into favorite lists which can then be compared among different research groups. A sophisticated authentication and authorization system prevents undesired manipulation of data, yet allows all users to view the entire content of the database. By using a three-tier architecture, maintenance and extension of the application is facilitated and the various layers, e.g. the underlying database system, may also be exchanged. The usability of the web application is greatly enhanced by Web 2.0 functionality which was added using AJAX technology.

Conclusions

We have developed GiSAO.db, a system for storage and management of ageing-related gene data. An intuitive user interface provides fast and organised access to these data. Additionally, references to external databases are offered to elaborate the data in the database. These features in combination with the stored gene and miRNA expression data, annotation data, orthologs data and data of follow-up experiments make it a powerful tool for genetic ageing research.

Availability

GiSAO.db is available at http://gisao.genome.tugraz.at webcite. All the data stored in the database can be viewed with a guest account. Username and password for guest users are provided on the login page of the application.

Authors' contributions

EH designed and implemented the application and drafted the manuscript. GGT contributed to conception, design, and implementation of the application. PJD contributed to the concept of the database, was involved in data upload and data quality control issues, and contributed to writing the manuscript. GL, MH, and GL contributed to the concept of the database and were involved in data upload. PJD and ZT were responsible for the overall project coordination. All authors gave final approval of the version to be published.

Acknowledgements

The authors thank Martina Pitzl for the initial development of GiSAO.db.

This work was supported by the Austria Science Fund (NFN Project S93 Proliferation, Differentiation and Apoptosis in Aging) and the GEN-AU project BIN from the Austrian Ministry for Science and Research.

References

  1. Dilman VM: Age-associated elevation of hypothalamic, threshold to feedback control, and its role in development, ageing, and disease.

    Lancet 1971, 1:1211-1219. PubMed Abstract OpenURL

  2. Weinert BT, Timiras PS: Invited review: Theories of aging.

    J Appl Physiol 2003, 95:1706-1716. PubMed Abstract | Publisher Full Text OpenURL

  3. Campisi J, d'Adda dF: Cellular senescence: when bad things happen to good cells.

    Nat Rev Mol Cell Biol 2007, 8:729-740. PubMed Abstract | Publisher Full Text OpenURL

  4. Campisi J: Cellular senescence and apoptosis: how cellular responses might influence aging phenotypes.

    Exp Gerontol 2003, 38:5-11. PubMed Abstract | Publisher Full Text OpenURL

  5. Chen LH, Chiou GY, Chen YW, Li HY, Chiou SH: microRNA and aging: a novel modulator in regulating the aging network.

    Ageing Res Rev 2010, 9(Suppl 1):S59-S66. PubMed Abstract | Publisher Full Text OpenURL

  6. Schena M: Microarray Biochip Technology. Natick: Eaton Publishing; 2000. OpenURL

  7. Chuaqui RF, Bonner RF, Best CJ, Gillespie JW, Flaig MJ, Hewitt SM, Phillips JL, Krizman DB, Tangrea MA, Ahram M, Linehan WM, Knezevic V, Emmert-Buck MR: Post-analysis follow-up and validation of microarray experiments.

    Nat Genet 2002, 32:509-514. PubMed Abstract | Publisher Full Text OpenURL

  8. Kuningas M, Mooijaart SP, van Heemst D, Zwaan BJ, Slagboom PE, Westendorp RG: Genes encoding longevity: from model organisms to humans.

    Aging Cell 2008, 7:270-280. PubMed Abstract | Publisher Full Text OpenURL

  9. de Magalhaes JP, Budovsky A, Lehmann G, Costa J, Li Y, Fraifeld V, Church GM: The Human Ageing Genomic Resources: online databases and tools for biogerontologists.

    Aging Cell 2009, 8:65-72. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  10. Pan F, Chiu CH, Pulapura S, Mehan MR, Nunez-Iglesias J, Zhang K, Kamath K, Waterman MS, Finch CE, Zhou XJ: Gene Aging Nexus: a web database and data mining platform for microarray data on aging.

    Nucleic Acids Res 2007, 35:D756-D759. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  11. Aging Gene Database [http://uwaging.org/genesdb/index.php] webcite

  12. Zahn JM, Poosala S, Owen AB, Ingram DK, Lustig A, Carter A, Weeraratna AT, Taub DD, Gorospe M, Mazan-Mamczarz K, Lakatta EG, Boheler KR, Xu X, Mattson MP, Falco G, Ko MS, Schlessinger D, Firman J, Kummerfeld SK, Wood WH, Zonderman AB, Kim SK, Becker KG: AGEMAP: a gene expression database for aging in mice.

    PLoS Genet 2007, 3:e201. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  13. Tacutu R, Budovsky A, Fraifeld VE: The NetAge database: a compendium of networks for longevity, age-related diseases and associated processes.

    Biogerontology 2010, 11:513-522. PubMed Abstract | Publisher Full Text OpenURL

  14. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

    Nat Genet 2000, 25:25-29. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  15. Liu G, Loraine AE, Shigeta R, Cline M, Cheng J, Valmeekam V, Sun S, Kulp D, Siani-Rose MA: NetAffx: Affymetrix probesets and annotations.

    Nucleic Acids Res 2003, 31:82-86. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  16. Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature.

    Nucleic Acids Res 2006, 34:D140-D144. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  17. Wu Z, Irizarry R, Gentleman R, Martinez-Murillo F, Spencer F: A Model-Based Background Adjustment for Oligonucleotide Expression Arrays.

    Journal of the American Statistical Association 99:909. OpenURL

  18. Rainer J, Sanchez-Cabo F, Stocker G, Sturn A, Trajanoski Z: CARMAweb: comprehensive R- and bioconductor-based web service for microarray data analysis.

    Nucleic Acids Res 2006, 34:W498-W503. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  19. The R Project for Statistical Computing [http://www.r-project.org/] webcite

  20. Smyth GK: Linear models and empirical bayes methods for assessing differential expression in microarray experiments.

    Stat Appl Genet Mol Biol 2004, 3:Article3. PubMed Abstract | Publisher Full Text OpenURL

  21. Hackl M, Brunner S, Fortschegger K, Schreiner C, Micutkova L, Muck C, Laschober GT, Lepperdinger G, Sampson N, Berger P, Herndler-Brandstetter D, Wieser M, Kuhnel H, Strasser A, Rinnerthaler M, Breitenbach M, Mildner M, Eckhart L, Tschachler E, Trost A, Bauer JW, Papak C, Trajanoski Z, Scheideler M, Grillari-Voglauer R, Grubeck-Loebenstein B, Jansen-Dürr P, Grillari J: miR-17, miR-19b, miR-20a, and miR-106a are down-regulated in human aging.

    Aging Cell 2010, 9:291-296. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  22. Gosling J, Joy B, Steele G, Bracha G: The Java Language Specification. 3rd edition. Amsterdam: Addison-Wesley; 2005. OpenURL

  23. The Java EE 5 Tutorial [http://download.oracle.com/javaee/5/tutorial/doc/] webcite

  24. Maurer M, Molidor R, Sturn A, Hartler J, Hackl H, Stocker G, Prokesch A, Scheideler M, Trajanoski Z: MARS: microarray analysis, retrieval, and storage system.

    BMC Bioinformatics 2005, 6:101. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  25. HomoloGene [http://www.ncbi.nlm.nih.gov/homologene] webcite

  26. O'Brien KP, Remm M, Sonnhammer EL: Inparanoid: a comprehensive database of eukaryotic orthologs.

    Nucleic Acids Res 2005, 33:D476-D480. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  27. Kanehisa M, Goto S: KEGG: kyoto encyclopedia of genes and genomes.

    Nucleic Acids Res 2000, 28:27-30. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  28. Laschober GT, Ruli D, Hofer E, Muck C, Carmona-Gutierrez D, Ring J, Hutter E, Ruckenstuhl C, Micutkova L, Brunauer R, Jamnig A, Trimmel D, Herndler-Brandstetter D, Brunner S, Zenzmaier C, Sampson N, Breitenbach M, Frohlich KU, Grubeck-Loebenstein B, Berger P, Wieser M, Grillari-Voglauer R, Thallinger GG, Grillari J, Trajanoski Z, Madeo F, Lepperdinger G, Jansen-Dürr P: Identification of evolutionarily conserved genetic regulators of cellular aging.

    Aging Cell 2010, 9:1084-1097. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  29. ArrayExpress [http://www.ebi.ac.uk/arrayexpress/] webcite