Log on / register
Feedback | Support | My details
Open AccessHighly AccessMethodology article

An improved distance measure between the expression profiles linking co-expression and co-regulation in mouse

Ryung S Kim1,4 email, Hongkai Ji2 email and Wing H Wong3 email

Department of Neurology, Harvard Medical School, Boston, MA 02115, USA

Department of Statistics, Harvard University, Cambridge, MA 02138, USA

Department of Statistics, Stanford University, Stanford, CA 94305, USA

Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA02115, USA

author email corresponding author email

BMC Bioinformatics 2006, 7:44doi:10.1186/1471-2105-7-44

Published: 26 January 2006

Abstract

Background

Many statistical algorithms combine microarray expression data and genome sequence data to identify transcription factor binding motifs in the low eukaryotic genomes. Finding cis-regulatory elements in higher eukaryote genomes, however, remains a challenge, as searching in the promoter regions of genes with similar expression patterns often fails. The difficulty is partially attributable to the poor performance of the similarity measures for comparing expression profiles. The widely accepted measures are inadequate for distinguishing genes transcribed from distinct regulatory mechanisms in the complicated genomes of higher eukaryotes.

Results

By defining the regulatory similarity between a gene pair as the number of common known transcription factor binding motifs in the promoter regions, we compared the performance of several expression distance measures on seven mouse expression data sets. We propose a new distance measure that accounts for both the linear trends and fold-changes of expression across the samples.

Conclusion

The study reveals that the proposed distance measure for comparing expression profiles enables us to identify genes with large number of common regulatory elements because it reflects the inherent regulatory information better than widely accepted distance measures such as the Pearson's correlation or cosine correlation with or without log transformation.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.