Log on / register
Feedback | Support | My details
Open AccessSoftware

DISCLOSE : DISsection of CLusters Obtained by SEries of transcriptome data using functional annotations and putative transcription factor binding sites

Evert-Jan Blom1 email, Sacha AFT van Hijum1,2,3 email, Klaas J Hofstede1 email, Remko Silvis1 email, Jos BTM Roerdink4 email and Oscar P Kuipers1 email

Molecular Genetics, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, The Netherlands

Interfacultary Institute for Genetics and Functional Genomics, Ernst-Moritz-Arndt-University, Friedrich-Ludwig-Jahnstraße, 15A 17487, Greifswald 17489, Germany

NIZO Food Research, PO Box 20, 6710 BA Ede, the Netherlands

Institute for Mathematics and Computing Science, University of Groningen, Nijenborgh 9, 9747 AG, Groningen, The Netherlands

author email corresponding author email

BMC Bioinformatics 2008, 9:535doi:10.1186/1471-2105-9-535

Published: 16 December 2008

Abstract

Background

A typical step in the analysis of gene expression data is the determination of clusters of genes that exhibit similar expression patterns. Researchers are confronted with the seemingly arbitrary choice between numerous algorithms to perform cluster analysis.

Results

We developed an exploratory application that benchmarks the results of clustering methods using functional annotations. In addition, a de novo DNA motif discovery algorithm is integrated in our program which identifies overrepresented DNA binding sites in the upstream DNA sequences of genes from the clusters that are indicative of sites of transcriptional control. The performance of our program was evaluated by comparing the original results of a time course experiment with the findings of our application.

Conclusion

DISCLOSE assists researchers in the prokaryotic research community in systematically evaluating results of the application of a range of clustering algorithms to transcriptome data. Different performance measures allow to quickly and comprehensively determine the best suited clustering approach for a given dataset.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.