A computational screen for site selective A-to-I editing detects novel sites in neuron specific Hu proteins
- Equal contributors
1 Department of Molecular Biology and Functional Genomics, Stockholm University, Svante Arrheniusväg 20C, SE-10691 Stockholm, Sweden
2 Stockholm Bioinformatics Centre (SBC), Stockholm University, Albanova, Roslagstullsbacken 35, SE-10691 Stockholm, Sweden
3 School for Computer Science and Communication (CSC), Royal Institute of Technology (KTH), SE-10044 Stockholm, Sweden
4 Institute for Genome Sciences and Policy (IGSP), Deptartment of Biostatistics and Bioinformatics
5 Department of Computer Science, Duke University, 101 Science Dr, Box 3382, Durham, NC 27708, USA
BMC Bioinformatics 2010, 11:6 doi:10.1186/1471-2105-11-6Published: 4 January 2010
Several bioinformatic approaches have previously been used to find novel sites of ADAR mediated A-to-I RNA editing in human. These studies have discovered thousands of genes that are hyper-edited in their non-coding intronic regions, especially in alu retrotransposable elements, but very few substrates that are site-selectively edited in coding regions. Known RNA edited substrates suggest, however, that site selective A-to-I editing is particularly important for normal brain development in mammals.
We have compiled a screen that enables the identification of new sites of site-selective editing, primarily in coding sequences. To avoid hyper-edited repeat regions, we applied our screen to the alu-free mouse genome. Focusing on the mouse also facilitated better experimental verification. To identify candidate sites of RNA editing, we first performed an explorative screen based on RNA structure and genomic sequence conservation. We further evaluated the results of the explorative screen by determining which transcripts were enriched for A-G mismatches between the genomic template and the expressed sequence since the editing product, inosine (I), is read as guanosine (G) by the translational machinery. For expressed sequences, we only considered coding regions to focus entirely on re-coding events. Lastly, we refined the results from the explorative screen using a novel scoring scheme based on characteristics for known A-to-I edited sites. The extent of editing in the final candidate genes was verified using total RNA from mouse brain and 454 sequencing.
Using this method, we identified and confirmed efficient editing at one site in the Gabra3 gene. Editing was also verified at several other novel sites within candidates predicted to be edited. Five of these sites are situated in genes coding for the neuron-specific RNA binding proteins HuB and HuD.