Email updates

Keep up to date with the latest news and content from BMC Evolutionary Biology and BioMed Central.

This article is part of the supplement: First International Conference on Phylogenomics

Open Access Research

Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human

Zhaolei Zhang12*, Andy Wing Chun Pang12 and Mark Gerstein3

Author Affiliations

1 Banting & Best Department of Medical Research, Donnelly CCBR, 160 College Street, University of Toronto, Toronto, ON M5S 3E1, Canada

2 Department of Medical Genetics and Microbiology, University of Toronto, Toronto, ON M5S 3E1, Canada

3 Department of Molecular Biophysics and Biochemistry (MBB), Yale University, New Haven, CT 06511, USA

For all author emails, please log on.

BMC Evolutionary Biology 2007, 7(Suppl 1):S14  doi:10.1186/1471-2148-7-S1-S14

Published: 8 February 2007

Abstract

Background

Widespread transcription activities in the human genome were recently observed in high-resolution tiling array experiments, which revealed many novel transcripts that are outside of the boundaries of known protein or RNA genes. Termed as "TARs" (Transcriptionally Active Regions), these novel transcribed regions represent "dark matter" in the genome, and their origin and functionality need to be explained. Many of these transcripts are thought to code for novel proteins or non-protein-coding RNAs. We have applied an integrated bioinformatics approach to investigate the properties of these TARs, including cross-species conservation, and the ability to form stable secondary structures. The goal of this study is to identify a list of potential candidate sequences that are likely to code for functional non-protein-coding RNAs. We are particularly interested in the discovery of those functional RNA candidates that are primate-specific, i.e. those that do not have homologs in the mouse or dog genomes but in rhesus.

Results

Using sequence conservation and the probability of forming stable secondary structures, we have identified ~300 possible candidates for primate-specific noncoding RNAs. We are currently in the process of sequencing the orthologous regions of these candidate sequences in several other primate species. We will then be able to apply a "phylogenetic shadowing" approach to analyze the functionality of these ncRNA candidates.

Conclusion

The existence of potential primate-specific functional transcripts has demonstrated the limitation of previous genome comparison studies, which put too much emphasis on conservation between human and rodents. It also argues for the necessity of sequencing additional primate species to gain a better and more comprehensive understanding of the human genome.