Log on / register
Feedback | Support | My details
Open AccessHighly AccessResearch article

Conservation and implications of eukaryote transcriptional regulatory regions across multiple species

Lin Wan1,2 email, Dayong Li3 email, Donglei Zhang3 email, Xue Liu3 email, Wenjiang J Fu4 email, Lihuang Zhu3 email, Minghua Deng1,2 email, Fengzhu Sun5,6 email and Minping Qian1,2 email

1School of Mathematical Sciences, Peking University, Beijing 100871, PR China

2Center for Theoretical Biology, Peking University, Beijing 100871, PR China

3State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, PR China

4Department of Epidemiology, Michigan State University, East Lansing, Michigan 48824, USA

5MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST/Department of Automation, Tsinghua University, Beijing 100871, PR China

6Molecular and Computational Biology Program, University of Southern California, Los Angeles, California 90089, USA

author email corresponding author email

BMC Genomics 2008, 9:623doi:10.1186/1471-2164-9-623

Published: 20 December 2008

Abstract

Background

Increasing evidence shows that whole genomes of eukaryotes are almost entirely transcribed into both protein coding genes and an enormous number of non-protein-coding RNAs (ncRNAs). Therefore, revealing the underlying regulatory mechanisms of transcripts becomes imperative. However, for a complete understanding of transcriptional regulatory mechanisms, we need to identify the regions in which they are found. We will call these transcriptional regulation regions, or TRRs, which can be considered functional regions containing a cluster of regulatory elements that cooperatively recruit transcriptional factors for binding and then regulating the expression of transcripts.

Results

We constructed a hierarchical stochastic language (HSL) model for the identification of core TRRs in yeast based on regulatory cooperation among TRR elements. The HSL model trained based on yeast achieved comparable accuracy in predicting TRRs in other species, e.g., fruit fly, human, and rice, thus demonstrating the conservation of TRRs across species. The HSL model was also used to identify the TRRs of genes, such as p53 or OsALYL1, as well as microRNAs. In addition, the ENCODE regions were examined by HSL, and TRRs were found to pervasively locate in the genomes.

Conclusion

Our findings indicate that 1) the HSL model can be used to accurately predict core TRRs of transcripts across species and 2) identified core TRRs by HSL are proper candidates for the further scrutiny of specific regulatory elements and mechanisms. Meanwhile, the regulatory activity taking place in the abundant numbers of ncRNAs might account for the ubiquitous presence of TRRs across the genome. In addition, we also found that the TRRs of protein coding genes and ncRNAs are similar in structure, with the latter being more conserved than the former.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.