This article is part of the supplement: First International Conference on Phylogenomics
Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human
1 Banting & Best Department of Medical Research, Donnelly CCBR, 160 College Street, University of Toronto, Toronto, ON M5S 3E1, Canada
2 Department of Medical Genetics and Microbiology, University of Toronto, Toronto, ON M5S 3E1, Canada
3 Department of Molecular Biophysics and Biochemistry (MBB), Yale University, New Haven, CT 06511, USA
BMC Evolutionary Biology 2007, 7(Suppl 1):S14 doi:10.1186/1471-2148-7-S1-S14Published: 8 February 2007
Widespread transcription activities in the human genome were recently observed in high-resolution tiling array experiments, which revealed many novel transcripts that are outside of the boundaries of known protein or RNA genes. Termed as "TARs" (Transcriptionally Active Regions), these novel transcribed regions represent "dark matter" in the genome, and their origin and functionality need to be explained. Many of these transcripts are thought to code for novel proteins or non-protein-coding RNAs. We have applied an integrated bioinformatics approach to investigate the properties of these TARs, including cross-species conservation, and the ability to form stable secondary structures. The goal of this study is to identify a list of potential candidate sequences that are likely to code for functional non-protein-coding RNAs. We are particularly interested in the discovery of those functional RNA candidates that are primate-specific, i.e. those that do not have homologs in the mouse or dog genomes but in rhesus.
Using sequence conservation and the probability of forming stable secondary structures, we have identified ~300 possible candidates for primate-specific noncoding RNAs. We are currently in the process of sequencing the orthologous regions of these candidate sequences in several other primate species. We will then be able to apply a "phylogenetic shadowing" approach to analyze the functionality of these ncRNA candidates.
The existence of potential primate-specific functional transcripts has demonstrated the limitation of previous genome comparison studies, which put too much emphasis on conservation between human and rodents. It also argues for the necessity of sequencing additional primate species to gain a better and more comprehensive understanding of the human genome.