<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1471-2148-7-S1-S14</ui>
	<ji>1471-2148</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Zhang</snm>
					<fnm>Zhaolei</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
					<email>Zhaolei.Zhang@utoronto.ca</email>
				</au>
				<au id="A2">
					<snm>Pang</snm>
					<mnm>Wing Chun</mnm>
					<fnm>Andy</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
				</au>
				<au id="A3">
					<snm>Gerstein</snm>
					<fnm>Mark</fnm>
					<insr iid="I3"/>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Banting &amp; Best Department of Medical Research, Donnelly CCBR, 160 College Street, University of Toronto, Toronto, ON M5S 3E1, Canada</p>
				</ins>
				<ins id="I2">
					<p>Department of Medical Genetics and Microbiology, University of Toronto, Toronto, ON M5S 3E1, Canada</p>
				</ins>
				<ins id="I3">
					<p>Department of Molecular Biophysics and Biochemistry (MBB), Yale University, New Haven, CT 06511, USA</p>
				</ins>
			</insg>
			<source>BMC Evolutionary Biology</source>
			<supplement>
				<title>
					<p>First International Conference on Phylogenomics</p>
				</title>
				<editor>Herv&#233; Philippe, Mathieu Blanchette</editor>
				<note>Proceedings</note>
			</supplement>
			<conference>
				<title>
					<p>First International Conference on Phylogenomics</p>
				</title>
				<location>Sainte-Ad&#232;le, Qu&#233;bec, Canada</location>
				<date-range>15&#8211;19 March, 2006</date-range>
				<url>http://www.bioinfo.umontreal.ca/evenements/phylogenomics.html</url>
			</conference>
			<issn>1471-2148</issn>
			<pubdate>2007</pubdate>
			<volume>7</volume>
			<issue>Suppl 1</issue>
			<fpage>S14</fpage>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">17288572</pubid><pubid idtype="doi">10.1186/1471-2148-7-S1-S14</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<pub>
				<date>
					<day>8</day>
					<month>2</month>
					<year>2007</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2007</year>
			<collab>Zhang et al; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Widespread transcription activities in the human genome were recently observed in high-resolution tiling array experiments, which revealed many novel transcripts that are outside of the boundaries of known protein or RNA genes. Termed as "TARs" (Transcriptionally Active Regions), these novel transcribed regions represent "dark matter" in the genome, and their origin and functionality need to be explained. Many of these transcripts are thought to code for novel proteins or non-protein-coding RNAs. We have applied an integrated bioinformatics approach to investigate the properties of these TARs, including cross-species conservation, and the ability to form stable secondary structures. The goal of this study is to identify a list of potential candidate sequences that are likely to code for functional non-protein-coding RNAs. We are particularly interested in the discovery of those functional RNA candidates that are primate-specific, i.e. those that do not have homologs in the mouse or dog genomes but in rhesus.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Using sequence conservation and the probability of forming stable secondary structures, we have identified ~300 possible candidates for primate-specific noncoding RNAs. We are currently in the process of sequencing the orthologous regions of these candidate sequences in several other primate species. We will then be able to apply a "phylogenetic shadowing" approach to analyze the functionality of these ncRNA candidates.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusion</p>
					</st>
					<p>The existence of potential primate-specific functional transcripts has demonstrated the limitation of previous genome comparison studies, which put too much emphasis on conservation between human and rodents. It also argues for the necessity of sequencing additional primate species to gain a better and more comprehensive understanding of the human genome.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<sec>
				<st>
					<p>Whole genome tiling array experiments</p>
				</st>
				<p>The human genome is the blueprint that encodes most of the functional components in the human body: proteins and RNAs. With the completion of sequencing of the human genome, the focus of the genomic research is shifting to identifying all the functional units encoded within the genome. A new technology, the maskless oligonucleotide tiling array, has recently emerged as a powerful tool to interrogate transcription activities on the whole genomic scale and at unprecedented high resolution <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Using known genome sequence as blueprint, short oligonucleotides were synthesized to cover or "tile" each chromosome at regular intervals. Repetitive elements and regions of low complexity are usually avoided in such tiling experiments. Biological samples such as mRNAs or cDNAs are labelled with fluorescence and hybridized to the microarray spotted with the probes. Just like regular microarray experiments, the observed fluorescence intensities are interpreted as elevated transcription activity at specific genome locations. The tiling array experiments are most useful in verifying predicted exons and identify novel exons and other transcribed sequence elements.</p>
				<p>A number of tiling array studies on the human and other genomes have been published since 2002 <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. The major differences among these studies are the resolution of the tiles (length of the oligonucleotide probes and intervals between them), and the coverage of the genome. As of early 2006, the study by Bertone et al. (2005) is the only one that covers the entire human genome. These researchers designed ~51,000,000 probes of 36 mers, positioned at every 46 nucleotide interval on average, which cover ~1.5 GB of the non-repetitive genomic DNA sequence from both strands of the human genome <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The biological sample used in this study was fluorescence-labelled cDNA, reverse-transcribed from triple-selected polyadenylated RNAs [poly(A)+] from liver tissue. In total, these researchers identified ~17,000 transcriptionally active regions (termed as TARs) in the entire genome. There are strong correlations between the TARs and the known gene annotations or predictions, e.g. 64% of the genes annotated in RefSeq and 57% in Ensembl and 70% in UniGene were observed in this study <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Widespread transcription activity in the human genome</p>
				</st>
				<p>The big surprise from the tiling array study is that transcription activities were observed in many genomic regions that do not overlap with known gene annotations. In fact, only about 40% of the TARs correspond to known exons, and a significant fraction of the TARs (6,656 or 38.7%) are more than 10 kb away from any known exons. Table <tblr tid="T1">1</tblr> divides the TARs into groups according to their distance to the nearest genes that are on the same strand and also the opposite strand of the TAR. To estimate the enrichment or depletion of the TARs in the different regions, in Table <tblr tid="T2">2</tblr> we break down the human genome into 25 categories in the same way as for the TARs and list the total length of these regions. Table <tblr tid="T3">3</tblr> lists the density of the TARs in these regions, for instance the upper-left corner indicates that on average 574 base pairs per Mb in the Distal/Distal category has evidence of transcription as observed in the Bertone experiment. In contrast, on average 34,200 base pairs per Mb (3%) has evidence of transcription. It is likely that only a fraction of the human genes are transcribed in the liver cell line, thus transcription activity is not observed for all the annotated exons in the genome.</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Distribution of transcriptionally active regions (TARs), categorized by their distances from the nearest gene annotations</p>
					</caption>
					<tblbdy cols="8">
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7" ca="center">
								<p>Annotation on the opposite strand</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Distal</p>
							</c>
							<c ca="left">
								<p>10 kb</p>
							</c>
							<c ca="left">
								<p>1 kb</p>
							</c>
							<c ca="left">
								<p>exon</p>
							</c>
							<c ca="left">
								<p>intron</p>
							</c>
							<c ca="right">
								<p>Total</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Annotation on the same strand as TAR<sup>4</sup></p>
							</c>
							<c ca="left">
								<p><sup>1 </sup>Distal</p>
							</c>
							<c ca="left">
								<p>3,695 (21.5%)</p>
							</c>
							<c ca="left">
								<p>537 (3.1%)</p>
							</c>
							<c ca="left">
								<p>149 (0.9%)</p>
							</c>
							<c ca="left">
								<p>861 (5.0%)</p>
							</c>
							<c ca="left">
								<p>1,414 (8.2%)</p>
							</c>
							<c ca="right">
								<p>6,656 (38.7%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p><sup>2</sup>10 kb</p>
							</c>
							<c ca="left">
								<p>799 (4.6%)</p>
							</c>
							<c ca="left">
								<p>224 (1.3%)</p>
							</c>
							<c ca="left">
								<p>59 (0.3%)</p>
							</c>
							<c ca="left">
								<p>265 (1.5%)</p>
							</c>
							<c ca="left">
								<p>146 (0.8%)</p>
							</c>
							<c ca="right">
								<p>1,493 (8.7%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p><sup>3</sup>1 kb</p>
							</c>
							<c ca="left">
								<p>347 (2.0%)</p>
							</c>
							<c ca="left">
								<p>92 (0.5%)</p>
							</c>
							<c ca="left">
								<p>32 (0.2%)</p>
							</c>
							<c ca="left">
								<p>52 (0.3%)</p>
							</c>
							<c ca="left">
								<p>20 (0.1%)</p>
							</c>
							<c ca="right">
								<p>543 (3.2%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>exon</p>
							</c>
							<c ca="left">
								<p>4,637 (27.0%)</p>
							</c>
							<c ca="left">
								<p>1,454 (8.4%)</p>
							</c>
							<c ca="left">
								<p>345 (2.0%)</p>
							</c>
							<c ca="left">
								<p>120 (0.7%)</p>
							</c>
							<c ca="left">
								<p>172 (1.0%)</p>
							</c>
							<c ca="right">
								<p>6,728 (39.1%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>intron</p>
							</c>
							<c ca="left">
								<p>1,554 (9.0%)</p>
							</c>
							<c ca="left">
								<p>137 (0.8%)</p>
							</c>
							<c ca="left">
								<p>24 (0.1%)</p>
							</c>
							<c ca="left">
								<p>35 (0.2%)</p>
							</c>
							<c ca="left">
								<p>28 (0.2%)</p>
							</c>
							<c ca="right">
								<p>1,778 (10.3%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Total:</p>
							</c>
							<c ca="left">
								<p>11,032 (64.1%)</p>
							</c>
							<c ca="left">
								<p>2,444 (14.2%)</p>
							</c>
							<c ca="left">
								<p>609 (3.5%)</p>
							</c>
							<c ca="left">
								<p>1,333 (7.8%)</p>
							</c>
							<c ca="left">
								<p>1,780 (10.4%)</p>
							</c>
							<c ca="right">
								<p>17,198</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p><sup>1</sup>Distal: distance from nearest annotated exon &gt; 10 kb</p>
						<p><sup>2</sup>10 kb: distance from nearest annotated exon is less than 10 kb but longer than 1 kb</p>
						<p><sup>3</sup>1 kb: distance from nearest annotated exon is less than 1 kb but do not overlap with exons</p>
						<p><sup>4 </sup>Genome annotation was based on NCBI assembly 34 downloaded from Ensembl.</p>
					</tblfn>
				</tbl>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Distribution of human genomic regions, categorized by annotations on both strands (Mb = megabases)</p>
					</caption>
					<tblbdy cols="8">
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7" ca="center">
								<p>Annotation on the antisense strand (Mb)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Distal</p>
							</c>
							<c ca="left">
								<p>10 kb</p>
							</c>
							<c ca="left">
								<p>1 kb</p>
							</c>
							<c ca="left">
								<p>exon</p>
							</c>
							<c ca="left">
								<p>intron</p>
							</c>
							<c ca="right">
								<p>Total</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Annotation on the sense strand <sup>1</sup></p>
							</c>
							<c ca="left">
								<p>Distal</p>
							</c>
							<c ca="left">
								<p>1,655 (54.%)</p>
							</c>
							<c ca="left">
								<p>130 (4.2%)</p>
							</c>
							<c ca="left">
								<p>17.6 (0.6%)</p>
							</c>
							<c ca="left">
								<p>37 (1.2%)</p>
							</c>
							<c ca="left">
								<p>438 (14.2%)</p>
							</c>
							<c ca="right">
								<p>2,279 (74%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>10 kb</p>
							</c>
							<c ca="left">
								<p>133 (4.3%)</p>
							</c>
							<c ca="left">
								<p>26 (0.9%)</p>
							</c>
							<c ca="left">
								<p>49 (0.1%)</p>
							</c>
							<c ca="left">
								<p>7 (0.2%)</p>
							</c>
							<c ca="left">
								<p>34 (1.1%)</p>
							</c>
							<c ca="right">
								<p>206 (7%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>1 kb</p>
							</c>
							<c ca="left">
								<p>17 (0.6%)</p>
							</c>
							<c ca="left">
								<p>4 (0.1%)</p>
							</c>
							<c ca="left">
								<p>1 (0.03%)</p>
							</c>
							<c ca="left">
								<p>1 (0.04%)</p>
							</c>
							<c ca="left">
								<p>2.8 (0.1%)</p>
							</c>
							<c ca="right">
								<p>27 (0.9%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>exon</p>
							</c>
							<c ca="left">
								<p>44 (1.4%)</p>
							</c>
							<c ca="left">
								<p>7.6 (0.25%)</p>
							</c>
							<c ca="left">
								<p>1 (0.04%)</p>
							</c>
							<c ca="left">
								<p>1.1 (0.04%)</p>
							</c>
							<c ca="left">
								<p>3.8 (0.12%)</p>
							</c>
							<c ca="right">
								<p>58 (1.9%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>intron</p>
							</c>
							<c ca="left">
								<p>450 (14.6%)</p>
							</c>
							<c ca="left">
								<p>35 (1.2 %)</p>
							</c>
							<c ca="left">
								<p>3.0 (0.1%)</p>
							</c>
							<c ca="left">
								<p>4 (0.1%)</p>
							</c>
							<c ca="left">
								<p>12 (0.4%)</p>
							</c>
							<c ca="right">
								<p>505 (16%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Total:</p>
							</c>
							<c ca="left">
								<p>2301 (75%)</p>
							</c>
							<c ca="left">
								<p>204 (6.7%)</p>
							</c>
							<c ca="left">
								<p>27 (0.9%)</p>
							</c>
							<c ca="left">
								<p>50.7 (1.6%)</p>
							</c>
							<c ca="left">
								<p>491 (16%)</p>
							</c>
							<c ca="right">
								<p>3,076</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p><sup>1</sup>Annotation is the same as in Table 1.</p>
					</tblfn>
				</tbl>
				<tbl id="T3">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>Total length and density of TARs in different types of genomic regions<sup>1</sup></p>
					</caption>
					<tblbdy cols="8">
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7" ca="center">
								<p>Annotation on the antisense strand</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Distal</p>
							</c>
							<c ca="left">
								<p>10 kb</p>
							</c>
							<c ca="left">
								<p>1 kb</p>
							</c>
							<c ca="left">
								<p>exon</p>
							</c>
							<c ca="left">
								<p>intron</p>
							</c>
							<c ca="right">
								<p>Total</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Annotation on the sense strand<sup>2</sup></p>
							</c>
							<c ca="left">
								<p>Distal</p>
							</c>
							<c ca="left">
								<p>950,963 (574)</p>
							</c>
							<c ca="left">
								<p>137,168 (1,048)</p>
							</c>
							<c ca="left">
								<p>38,856 (2,202)</p>
							</c>
							<c ca="left">
								<p>223,699 (6,012)</p>
							</c>
							<c ca="left">
								<p>364,308 (831)</p>
							</c>
							<c ca="right">
								<p>1,715,000 (753)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>10 kb</p>
							</c>
							<c ca="left">
								<p>212,945 (1,596)</p>
							</c>
							<c ca="left">
								<p>61,991 (2,334)</p>
							</c>
							<c ca="left">
								<p>15,186 (3,584)</p>
							</c>
							<c ca="left">
								<p>69,413 (9,683)</p>
							</c>
							<c ca="left">
								<p>37,434 (1,077)</p>
							</c>
							<c ca="right">
								<p>396,969 (1,930)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>1 kb</p>
							</c>
							<c ca="left">
								<p>97,974 (5,454)</p>
							</c>
							<c ca="left">
								<p>24,062 (5631)</p>
							</c>
							<c ca="left">
								<p>8,793 (8,647)</p>
							</c>
							<c ca="left">
								<p>14,475 (13,391)</p>
							</c>
							<c ca="left">
								<p>4,960 (1,751)</p>
							</c>
							<c ca="right">
								<p>150,264 (5,570)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>exon</p>
							</c>
							<c ca="left">
								<p>1,370,991 (30,952)</p>
							</c>
							<c ca="left">
								<p>422,364 (55,666)</p>
							</c>
							<c ca="left">
								<p>103,263 (90,951)</p>
							</c>
							<c ca="left">
								<p>33,079 (29,331)</p>
							</c>
							<c ca="left">
								<p>53,984 (1,4043)</p>
							</c>
							<c ca="right">
								<p>1,983,681 (34,200)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>intron</p>
							</c>
							<c ca="left">
								<p>398,063 (884)</p>
							</c>
							<c ca="left">
								<p>34,368 (970)</p>
							</c>
							<c ca="left">
								<p>6,230 (2,104)</p>
							</c>
							<c ca="left">
								<p>8,752 (2,107)</p>
							</c>
							<c ca="left">
								<p>7,059 (586)</p>
							</c>
							<c ca="right">
								<p>454,472 (900)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Total:</p>
							</c>
							<c ca="left">
								<p>3,030,936 (1,320)</p>
							</c>
							<c ca="left">
								<p>679,953 (3,330)</p>
							</c>
							<c ca="left">
								<p>172,328 (6,380)</p>
							</c>
							<c ca="left">
								<p>349,418 (6,890)</p>
							</c>
							<c ca="left">
								<p>467,745 (953)</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p><sup>1</sup>Numbers in the brackets are the normalized densities of the TARs, i.e. number of transcribed nucleotides per megabase of genomic DNA</p>
						<p><sup>2</sup>Annotation is the same as in Table 1(A)</p>
					</tblfn>
				</tbl>
				<p>Such widespread transcription activities have also been observed in other human tiling array experiments as well <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. There has not been a consensus opinion on the exact nature and origin of these TARs (or called "transfrags" as in <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>), however, it has been pointed out that many of these novel transcripts are not likely to code for proteins as they do not have open reading frames of longer than 300 nucleotides.</p>
				<p>In addition to these microarray studies, widespread transcription activities outside of known human genes have also been observed in other types of experiments. Long serial analysis of gene expression experiments (LongSAGE) suggest that over 15,000 additional new exons exist in the human genome, and over half of these may be from new genes <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Ota et al. analyzed full-length human cDNA library and discovered ~5,000 novel non-coding cDNA transcripts <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. A large number of noncoding transcripts has also been reported to exist in the mouse genome <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. In addition to the mammalian genomes, large number of intergenic transcripts were also observed in plants and fruit fly <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. Taking all these pieces of evidences together, it was estimated that over half of the human genome could potentially be transcribed <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, or at least 90% of the transcription activity in the genome is outside of well-characterized protein-coding exons <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Possible functional roles of the novel transcripts</p>
				</st>
				<p>There have been many theories proposed to account for the origin, property, and possible functions of these novel transcripts. It was suggested that some of these TARs may be novel protein coding genes, novel RNA genes, anti-sense transcripts, alternative transcripts or just simply biological artefacts (please see <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> for a detailed discussion on the possible hypotheses). Because of the prevalence of such intergenic transcription activity (see above), it is unlikely that these novel transcripts are experimental artefact or false positives. It is also unlikely that any single mechanism can fully explain the observed novel transcripts, but perhaps the combination of explanations can account for the bulk of the observed novel transcripts. For example, it is possible that those TARs that are near the known genes could represent previously unidentified exons in the same gene structure, or represent alternative transcripts caused by alternative promoters. Depending on their degree of sequence conservation, some of the "distal" transcripts, i.e. those that are far away from known genes, are likely to be candidates for novel protein coding genes or noncoding RNAs. This notion has been suggested by many, including the researchers who conducted the tiling array experiments, as the most likely explanation for the bulk of the novel transcripts <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B11">11</abbr><abbr bid="B16">16</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Some of the TARs may be functional noncoding RNAs</p>
				</st>
				<p>Mammalian genomes contain many RNA genes that do not code for proteins; these are collectively called noncoding RNAs or ncRNAs <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. The most well known noncoding RNAs include rRNA, tRNA, snoRNA, Xist and microRNAs (miRNAs). Table <tblr tid="T4">4</tblr> lists some ncRNAs that were recently discovered and also implicated in human disorders. Some of these longer ncRNAs are sometimes referred to as mRNA-like ncRNAs (mlncRNAs) because they share properties with mRNAs such as splicing <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. With the accumulating evidence on their prevalence and importance, ncRNAs have become increasingly appreciated as crucial components of cellular and organismal complexity, which prompts some to ponder whether we still live in an RNA world <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
				<tbl id="T4">
					<title>
						<p>Table 4</p>
					</title>
					<caption>
						<p>Noncoding RNAs that are known to have medical implications.</p>
					</caption>
					<tblbdy cols="3">
						<r>
							<c ca="left">
								<p>
									<b>RNA name</b>
								</p>
							</c>
							<c ca="left">
								<p>
									<b>disorder</b>
								</p>
							</c>
							<c ca="left">
								<p>
									<b>Reference</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="3">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>H19 RNA</p>
							</c>
							<c ca="left">
								<p>Tumor suppressor</p>
							</c>
							<c ca="left">
								<p>[51]</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>BIC</p>
							</c>
							<c ca="left">
								<p>Hodgkin lymphoma</p>
							</c>
							<c ca="left">
								<p>[52, 53]</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>DLEU2,</p>
							</c>
							<c ca="left">
								<p>lymphocytic leukaemia</p>
							</c>
							<c ca="left">
								<p>[54];</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>NCRMS</p>
							</c>
							<c ca="left">
								<p>rhabdomyosarcoma</p>
							</c>
							<c ca="left">
								<p>[55]</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>7H4</p>
							</c>
							<c ca="left">
								<p>postnatal development</p>
							</c>
							<c ca="left">
								<p>[56],</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>NRSE/REST</p>
							</c>
							<c ca="left">
								<p>found in adult neural stem cells</p>
							</c>
							<c ca="left">
								<p>[57].</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>Many of the known ncRNAs in human and mouse were discovered accidentally or from large-scale cloning experiments. The novel transcripts found in the tiling array experiments offer a new resource to identify novel non-coding RNA transcripts. Kampa and colleagues have screened the "transfrags" from Chromosomes 21 and 22 and identified 193 novel ncRNA candidates; they were able to verify 126 or 65% of these ncRNAs by RT-PCR. Remarkably, this extrapolates to ~4200 ncRNAs in the entire human genome <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Several software tools have been developed to predict ncRNAs by computational approaches, especially on predicting miRNA, which have defined secondary structures. These programs mostly work by searching for conserved motifs, existence of secondary structure, cross-species conservation, or combination of above methods.</p>
				<p>In this paper, we describe our bioinformatics analysis of these novel transcripts that were identified in the tiling array experiment <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. We will investigate their sequence conservation in other species, their potential of forming stable secondary structures and ultimately the possibility that they could code for functional noncoding RNAs. Figure <figr fid="F1">1</figr> is a flowchart outlining the basic analysis steps.</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>A flow chart of the analysis pipeline</p>
					</caption>
					<text>
						<p>A flow chart of the analysis pipeline.</p>
					</text>
					<graphic file="1471-2148-7-S1-S14-1"/>
				</fig>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>Large numbers of novel transcripts are conserved in other vertebrates</p>
				</st>
				<p>We have BLASTed the TAR sequences against the genomic sequences of a number of fully or partially sequenced vertebrates, including mouse, rat, chimpanzee, chicken, dog, sea squirt, frog, and two kinds of pufferfish (Table <tblr tid="T5">5</tblr>). All these sequences were downloaded from Ensembl website, repetitive elements were removed by RepeatMasker <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
				<tbl id="T5">
					<title>
						<p>Table 5</p>
					</title>
					<caption>
						<p>BLAST matches in the other genomes and databases (E-value &lt; 0.01)</p>
					</caption>
					<tblbdy cols="3">
						<r>
							<c ca="left">
								<p>
									<b>Genome or database</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>Number of hits</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>fraction</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="3">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human EST library</p>
							</c>
							<c ca="right">
								<p>12,021</p>
							</c>
							<c ca="right">
								<p>70%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human cDNA library</p>
							</c>
							<c ca="right">
								<p>7,546</p>
							</c>
							<c ca="right">
								<p>44%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Chimpanzee (<it>Pan troglodytes</it>)</p>
							</c>
							<c ca="right">
								<p>15,757</p>
							</c>
							<c ca="right">
								<p>92%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Macaque cDNA library</p>
							</c>
							<c ca="right">
								<p>5,955</p>
							</c>
							<c ca="right">
								<p>35%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Mouse genome</p>
							</c>
							<c ca="right">
								<p>8,011</p>
							</c>
							<c ca="right">
								<p>47%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Mouse EST library</p>
							</c>
							<c ca="right">
								<p>7,546</p>
							</c>
							<c ca="right">
								<p>44%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Rat genome</p>
							</c>
							<c ca="right">
								<p>7,691</p>
							</c>
							<c ca="right">
								<p>45%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>* Rodents: mouse &#8746; rat</p>
							</c>
							<c ca="right">
								<p>9,184</p>
							</c>
							<c ca="right">
								<p>53%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Dog (<it>Canis familiaris</it>,) partial genome</p>
							</c>
							<c ca="right">
								<p>7,924</p>
							</c>
							<c ca="right">
								<p>46%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Chicken (<it>Gallus gallus</it>)</p>
							</c>
							<c ca="right">
								<p>3,600</p>
							</c>
							<c ca="right">
								<p>21%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Frog (<it>Xenopus tropicalis</it>)</p>
							</c>
							<c ca="right">
								<p>2,279</p>
							</c>
							<c ca="right">
								<p>13%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Japanese pufferfish (<it>Takifugu rubripes</it>)</p>
							</c>
							<c ca="right">
								<p>2,254</p>
							</c>
							<c ca="right">
								<p>13%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Green spotted pufferfish (<it>Tetraodon nigroviridis</it>)</p>
							</c>
							<c ca="right">
								<p>2,069</p>
							</c>
							<c ca="right">
								<p>12%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>** Pufferfish: <it>Takifugu rubripes &#8746; Tetraodon nigroviridis</it></p>
							</c>
							<c ca="right">
								<p>2867</p>
							</c>
							<c ca="right">
								<p>17%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Sea squirt (<it>Ciona intestinalis)</it></p>
							</c>
							<c ca="right">
								<p>607</p>
							</c>
							<c ca="right">
								<p>4%</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<b>Noncoding RNA Databases</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Rfam</p>
							</c>
							<c ca="right">
								<p>252</p>
							</c>
							<c ca="right">
								<p>1%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RNAdb</p>
							</c>
							<c ca="right">
								<p>1,995</p>
							</c>
							<c ca="right">
								<p>12%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>mirBase</p>
							</c>
							<c ca="right">
								<p>138</p>
							</c>
							<c ca="right">
								<p>1%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>NONCODE</p>
							</c>
							<c ca="right">
								<p>379</p>
							</c>
							<c ca="right">
								<p>2%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>*** Rfam &#8746; RNAdb &#8746; mirBase &#8746; NONCODE</p>
							</c>
							<c ca="right">
								<p>2,637</p>
							</c>
							<c ca="right">
								<p>15%</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<b>Total number of TARs reported by Bertone et al.</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>17,198</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>* number of TARs that have hits in either mouse or rat genome, The symbol &#8746; represents the union of two sets</p>
						<p>** number of TARs that have hits in either species of pufferfish</p>
						<p>*** number of TARs that have hits in either of the noncoding RNA database</p>
					</tblfn>
				</tbl>
				<p>Sequencing projects of two primates, Macaque (<it>Macaca mulatta</it>) and orangutan (<it>Pongo pygmaeus</it>), are currently under way. We downloaded the trace sequence files of these two primates and included them in the homology search as well. Because of the incompleteness of these genomes, the existence or absence of homologs in these libraries does not reflect the true level of conservation for each TAR. We also searched for TAR homologs in mammalian cDNA and EST libraries, including H-Invitational Database (H-InvDB) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>), which contains 21,037 validated human full-length cDNA: mouse full-length cDNA library (FANTOM) <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B23">23</abbr></abbrgrp>); human and mouse EST libraries from NCBI; and a macaque cDNA library from <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
				<p>Table <tblr tid="T5">5</tblr> shows that 69% of the TARs have EST matches, and 43% of the TARs have matches in the human cDNA library, which further validated that the bulk of the novel transcripts identified from tiling arrays are real transcripts instead of experimental artefacts. As expected, more TARs are conserved in the chimpanzee genome than in the rodent genomes (90% vs. 50%), which is in line with what would be expected for random genomic regions. This is obviously the result of closer evolutionary relationship between the two primate species, but it also implies that there must be many primate-specific transcripts that are shared between human and chimpanzee but not between human and rodents. A significant number of TARs are also conserved in chicken (21%) and pufferfish (16%).</p>
			</sec>
			<sec>
				<st>
					<p>TARs in noncoding RNA databases</p>
				</st>
				<p>We also searched for homologs of the TARs in several sequence databases that contain known noncoding RNAs (Table <tblr tid="T5">5</tblr>, bottom). We found that there are ~2,637 non-protein-coding RNAs among the TARs, or about 15% of the entire novel transcripts, which also include 138 miRNAs. Note that some of these databases such as RNAdb also include hypothetical ncRNAs that are predicted from cDNA libraries.</p>
				<p>In addition to number of homologs in single organisms, Table <tblr tid="T6">6</tblr> further lists the number of TARs that are conserved in more than one species, i.e. with different conservation profiles. There are 4,806 transcripts (27%) that are only present in human and chimpanzee but not in any other genomes. Among these, 4,574 are not found in any ncRNA databases.</p>
				<tbl id="T6">
					<title>
						<p>Table 6</p>
					</title>
					<caption>
						<p>Conservation profiles of TARs among vertebrates</p>
					</caption>
					<tblbdy cols="3">
						<r>
							<c ca="left">
								<p>
									<b>Conservation profile *</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b># of hits</b>
								</p>
							</c>
							<c ca="right">
								<p>
									<b>fraction</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="3">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Chimp</p>
							</c>
							<c ca="center">
								<p>15,757</p>
							</c>
							<c ca="right">
								<p>92%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Chimp &#8745; rodents</p>
							</c>
							<c ca="center">
								<p>8,828</p>
							</c>
							<c ca="right">
								<p>51%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Chimp &#8745; rodents &#8745; dog</p>
							</c>
							<c ca="center">
								<p>5,988</p>
							</c>
							<c ca="right">
								<p>35%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Chimp &#8745; rodents &#8745; dog &#8745; chicken</p>
							</c>
							<c ca="center">
								<p>2,435</p>
							</c>
							<c ca="right">
								<p>14%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Chimp &#8745; rodents &#8745; dog &#8745; chicken &#8745; pufferfish</p>
							</c>
							<c ca="center">
								<p>1,244</p>
							</c>
							<c ca="right">
								<p>7%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Chimp &#8745; rodents &#8745; dog &#8745; chicken &#8745; pufferfish &#8745; sea squirt</p>
							</c>
							<c ca="center">
								<p>276</p>
							</c>
							<c ca="right">
								<p>2%</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; macaque</p>
							</c>
							<c ca="center">
								<p>5,955</p>
							</c>
							<c ca="right">
								<p>35%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; macaque &#8745; chimp</p>
							</c>
							<c ca="center">
								<p>5,815</p>
							</c>
							<c ca="right">
								<p>34%</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Rodent</p>
							</c>
							<c ca="center">
								<p>8,776</p>
							</c>
							<c ca="right">
								<p>51%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Rodent &#8745; dog</p>
							</c>
							<c ca="center">
								<p>6,166</p>
							</c>
							<c ca="right">
								<p>36%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Rodents &#8745; dog &#8745; chicken</p>
							</c>
							<c ca="center">
								<p>2,493</p>
							</c>
							<c ca="right">
								<p>14%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Rodent &#8745; dog &#8745; chicken &#8745; pufferfish</p>
							</c>
							<c ca="center">
								<p>1,270</p>
							</c>
							<c ca="right">
								<p>7%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Human &#8745; Rodent &#8745; dog &#8745; chicken &#8745; pufferfish &#8745; sea squirt</p>
							</c>
							<c ca="center">
								<p>276</p>
							</c>
							<c ca="right">
								<p>2%</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Num. of TARs in human AND chimp, but NOT in rodents</p>
							</c>
							<c ca="center">
								<p>6,929</p>
							</c>
							<c ca="right">
								<p>40%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Num. of TARs in human AND chimp, but NOT in any other vertebrates</p>
							</c>
							<c ca="center">
								<p>4,806</p>
							</c>
							<c ca="right">
								<p>28%</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Num. of TARs in human and chimp, but NOT in rodents, and NOT in databases (Rfam, RNAdb, mirBase, NONCODE)</p>
							</c>
							<c ca="center">
								<p>6,132</p>
							</c>
							<c ca="right">
								<p>36%</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Num. of TARs in human and chimp, but NOT in any other genomes, and NOT in databases (Rfam, RNAdb, mirBase, NONCODE)</p>
							</c>
							<c ca="center">
								<p>4,574</p>
							</c>
							<c ca="right">
								<p>27%</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>
									<b>Total # of TARs</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>17,198</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>* &#8746; represents the union of two or more sets</p>
					</tblfn>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Distal versus proximal TARs</p>
				</st>
				<p>Table <tblr tid="T7">7</tblr> further divides these TARs according to their distance to the nearest annotated genes. It is intriguing to note that the "Distal/Distal" category has more TARs than any other category (highlighted by big bold fonts), even for those TARs that are only found in chimpanzee. It is likely that these are candidates for potential new protein genes or noncoding RNAs. It is also clear that the TARs that are far away from known genes tend to contain more primate-specific transcripts than TARs near genes (2036 versus 1209 or 27.8% versus 14.3%). This may be because the intergenic regions are less conserved between primate and rodents, which consequently could be the places where primate-specific transcripts are born.</p>
				<tbl id="T7">
					<title>
						<p>Table 7</p>
					</title>
					<caption>
						<p>Conservation of TARs in chimpanzee and rodents, categorized by their distance to known genes on both strands</p>
					</caption>
					<tblbdy cols="7">
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Same strand</b>
								</p>
							</c>
							<c cspan="5" ca="center">
								<p>
									<b>Opposite strand</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>Distal</p>
							</c>
							<c ca="center">
								<p>10 kb</p>
							</c>
							<c ca="center">
								<p>1 kb</p>
							</c>
							<c ca="center">
								<p>exon</p>
							</c>
							<c ca="center">
								<p>intron</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Found in human AND chimp (15,757)</p>
							</c>
							<c ca="center">
								<p>Distal</p>
							</c>
							<c ca="center">
								<p>
									<b>3245 (20.6%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>469 (3.0%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>130 (0.8%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>826 (5.2%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>1233 (7.8%)</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>10 kb</p>
							</c>
							<c ca="center">
								<p>724 (4.6%)</p>
							</c>
							<c ca="center">
								<p>197 (1.3%)</p>
							</c>
							<c ca="center">
								<p>56 (0.4%)</p>
							</c>
							<c ca="center">
								<p>232 (1.5%)</p>
							</c>
							<c ca="center">
								<p>123 (0.8%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>1 kb</p>
							</c>
							<c ca="center">
								<p>327 (2.1%)</p>
							</c>
							<c ca="center">
								<p>87 (0.6%)</p>
							</c>
							<c ca="center">
								<p>30 (0.2%)</p>
							</c>
							<c ca="center">
								<p>46 (0.3%)</p>
							</c>
							<c ca="center">
								<p>18 (0.1%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>exon</p>
							</c>
							<c ca="center">
								<p>4492 (28.5%)</p>
							</c>
							<c ca="center">
								<p>1368 (8.7%)</p>
							</c>
							<c ca="center">
								<p>323 (2.0%)</p>
							</c>
							<c ca="center">
								<p>116 (0.7%)</p>
							</c>
							<c ca="center">
								<p>165 (1.0%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>intron</p>
							</c>
							<c ca="center">
								<p>1358 (8.6%)</p>
							</c>
							<c ca="center">
								<p>115 (0.7%)</p>
							</c>
							<c ca="center">
								<p>21 (0.1%)</p>
							</c>
							<c ca="center">
								<p>31 (0.2%)</p>
							</c>
							<c ca="center">
								<p>25 (0.2%)</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>Distal</p>
							</c>
							<c ca="center">
								<p>10 kb</p>
							</c>
							<c ca="center">
								<p>1 kb</p>
							</c>
							<c ca="center">
								<p>Exon</p>
							</c>
							<c ca="center">
								<p>Intron</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Found in human AND mouse (8,776)</p>
							</c>
							<c ca="center">
								<p>Distal</p>
							</c>
							<c ca="center">
								<p>
									<b>1256 (14.3%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>155 (1.8%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>51 (0.6%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>668 (7.6%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>402 (4.6%)</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>10 kb</p>
							</c>
							<c ca="center">
								<p>340 (3.9%)</p>
							</c>
							<c ca="center">
								<p>79 (0.9%)</p>
							</c>
							<c ca="center">
								<p>20 (0.2%)</p>
							</c>
							<c ca="center">
								<p>198 (2.3%)</p>
							</c>
							<c ca="center">
								<p>33 (0.4%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>1 kb</p>
							</c>
							<c ca="center">
								<p>136 (1.5%)</p>
							</c>
							<c ca="center">
								<p>42 (0.5%)</p>
							</c>
							<c ca="center">
								<p>9 (0.1%)</p>
							</c>
							<c ca="center">
								<p>25 (0.3%)</p>
							</c>
							<c ca="center">
								<p>10 (0.1%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>exon</p>
							</c>
							<c ca="center">
								<p>3398 (38.7%)</p>
							</c>
							<c ca="center">
								<p>1028 (11.7%)</p>
							</c>
							<c ca="center">
								<p>217 (2.5%)</p>
							</c>
							<c ca="center">
								<p>96 (1.1%)</p>
							</c>
							<c ca="center">
								<p>125 (1.4%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>intron</p>
							</c>
							<c ca="center">
								<p>419 (4.8%)</p>
							</c>
							<c ca="center">
								<p>29 (0.3%)</p>
							</c>
							<c ca="center">
								<p>6 (0.1%)</p>
							</c>
							<c ca="center">
								<p>24 (0.3%)</p>
							</c>
							<c ca="center">
								<p>10 (0.1%)</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>Distal</p>
							</c>
							<c ca="center">
								<p>10 kb</p>
							</c>
							<c ca="center">
								<p>1 kb</p>
							</c>
							<c ca="center">
								<p>Exon</p>
							</c>
							<c ca="center">
								<p>Intron</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Found in human AND chimp AND rodents (<b>8,446</b>)</p>
							</c>
							<c ca="center">
								<p>Distal</p>
							</c>
							<c ca="center">
								<p>
									<b>1209 (14.3%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>146 (1.7%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>46 (0.5%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>649 (7.7%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>382 (4.5%)</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>10 kb</p>
							</c>
							<c ca="center">
								<p>330 (3.9%)</p>
							</c>
							<c ca="center">
								<p>75 (0.9%)</p>
							</c>
							<c ca="center">
								<p>19 (0.2%)</p>
							</c>
							<c ca="center">
								<p>180 (2.1%)</p>
							</c>
							<c ca="center">
								<p>30 (0.4%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>1 kb</p>
							</c>
							<c ca="center">
								<p>133 (1.6%)</p>
							</c>
							<c ca="center">
								<p>42 (0.5%)</p>
							</c>
							<c ca="center">
								<p>8 (0.1%)</p>
							</c>
							<c ca="center">
								<p>21 (0.2%)</p>
							</c>
							<c ca="center">
								<p>10 (0.1%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>exon</p>
							</c>
							<c ca="center">
								<p>3306 (39.1%)</p>
							</c>
							<c ca="center">
								<p>983 (11.6%)</p>
							</c>
							<c ca="center">
								<p>206 (2.4%)</p>
							</c>
							<c ca="center">
								<p>93 (1.1%)</p>
							</c>
							<c ca="center">
								<p>121 (1.4%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>intron</p>
							</c>
							<c ca="center">
								<p>393 (4.7%)</p>
							</c>
							<c ca="center">
								<p>26 (0.3%)</p>
							</c>
							<c ca="center">
								<p>6 (0.1%)</p>
							</c>
							<c ca="center">
								<p>22 (0.3%)</p>
							</c>
							<c ca="center">
								<p>10 (0.1%)</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>Distal</p>
							</c>
							<c ca="center">
								<p>10 kb</p>
							</c>
							<c ca="center">
								<p>1 kb</p>
							</c>
							<c ca="center">
								<p>Exon</p>
							</c>
							<c ca="center">
								<p>Intron</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>Found in human AND in chimp NOT in rodents (<b>7,311</b>)</p>
							</c>
							<c ca="center">
								<p>Distal</p>
							</c>
							<c ca="center">
								<p>
									<b>2036 (27.8%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>323 (4.4%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>84 (1.1%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>177 (2.4%)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>851 (11.6%)</b>
								</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>10 kb</p>
							</c>
							<c ca="center">
								<p>394 (5.4%)</p>
							</c>
							<c ca="center">
								<p>122 (1.7%)</p>
							</c>
							<c ca="center">
								<p>37 (0.5%)</p>
							</c>
							<c ca="center">
								<p>52 (0.7%)</p>
							</c>
							<c ca="center">
								<p>93 (1.3%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>1 kb</p>
							</c>
							<c ca="center">
								<p>194 (2.7%)</p>
							</c>
							<c ca="center">
								<p>45 (0.6%)</p>
							</c>
							<c ca="center">
								<p>22 (0.3%)</p>
							</c>
							<c ca="center">
								<p>25 (0.3%)</p>
							</c>
							<c ca="center">
								<p>8 (0.1%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>exon</p>
							</c>
							<c ca="center">
								<p>1186 (16.2%)</p>
							</c>
							<c ca="center">
								<p>385 (5.3%)</p>
							</c>
							<c ca="center">
								<p>117 (1.6%)</p>
							</c>
							<c ca="center">
								<p>23 (0.3%)</p>
							</c>
							<c ca="center">
								<p>44 (0.6%)</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>intron</p>
							</c>
							<c ca="center">
								<p>965 (13.2%)</p>
							</c>
							<c ca="center">
								<p>89 (1.2%)</p>
							</c>
							<c ca="center">
								<p>15 (0.2%)</p>
							</c>
							<c ca="center">
								<p>9 (0.1%)</p>
							</c>
							<c ca="center">
								<p>15 (0.2%)</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Many TARs are predicted to have stable secondary structures</p>
				</st>
				<p>It has been proposed that many of the transcripts identified in the tiling array or cloning experiments are novel non-protein-coding RNAs that have potential regulatory or catalytic functions. Okazaki and colleagues analyzed the mouse full-length cDNA library, and estimated 15,000 or about half of the library are non-protein-coding and functional RNA genes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, but this number has been debated <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. A more thorough computational study of the mouse transcriptom by Numata et al revealed a set of ~4,200 functional non-protein-coding RNA candidates <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Kampa and colleagues analyzed the tiling array data of human chromosome 21 and 22 <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. They identified 193 novel RNA candidates and experimentally verified 126 of them. These researchers only used evidence of transcription in their predictions, which is powerful as demonstrated by the respectable 65% verification rate, but the false-positive rates can be further reduced if additional lines of evidence are incorporated. In this study, we utilize 2 lines of evidence: sequence conservation and RNA secondary structure, to make prediction on conserved novel RNA transcripts hidden in the human genome.</p>
				<sec>
					<st>
						<p>Sequence conservation</p>
					</st>
					<p>Functional elements in the genomes are presumably under selective pressure to maintain their sequence, therefore sequence conservation in other organisms are generally a good indication of functionality. However, we have to be cautious when applying such principle onto RNA sequences. It has been observed that except for structural RNAs such as rRNAs, noncoding RNA genes are in general less conserved than protein coding genes. This is particular true for regulatory RNAs <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, as some of the non-protein-coding RNAs, which have identical functions in human and mouse, do not show obvious sequence homologies. In addition, as pointed out earlier, too much reliance on conservation can overlook lineage-specific transcripts. Given the limitations of using sequence conservation alone in detecting ncRNAs, it is obvious that additional approaches are needed to address these concerns, such as the probability of forming stable secondary structure.</p>
				</sec>
				<sec>
					<st>
						<p>RNA secondary structure and thermodynamic stability</p>
					</st>
					<p>Functional ncRNAs usually form stable secondary structures, thus the potential of forming stable secondary structure are often considered as an indicator of functional RNAs <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. We evaluated a number of existing tools and elected to use RNAZ in the analysis <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. For every category of TARs that are listed in Table <tblr tid="T7">7</tblr>, we further filtered them by running RNA structure prediction program (RNAZ) on their sequences, the results are listed in Table <tblr tid="T8">8</tblr>. This filtering step reduced the number of ncRNA candidates by 6&#8211;7 folds; for example, only 1202 primate-specific TARs are predicted to have stable secondary structures, and only 1073 of them are novel sequences that do not have similar sequences in existing databases. We are more interested in the "Distal" TARs since they are at least 10 kb away from known genes and likely represent new ncRNA transcripts. There are 353 of these novel ncRNA candidates in the final set of candidates. Chromosomal coordinates and DNA sequences for these ncRNA candidates can be found in Additional Files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr>. Figure <figr fid="F2">2</figr> shows the predicted structures of three possible ncRNAs. Please see Additional Files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr> and <supplr sid="S3">3</supplr> for more details.</p>
					<tbl id="T8">
						<title>
							<p>Table 8</p>
						</title>
						<caption>
							<p>Predictions of RNA secondary structures by RNAZ (p-value &gt; 90%)</p>
						</caption>
						<tblbdy cols="7">
							<r>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>exon</p>
								</c>
								<c ca="center">
									<p>intron</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c cspan="6">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>Found in human AND chimp (1436)</p>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>418 (29.1%)</p>
								</c>
								<c ca="center">
									<p>64(4.5%)</p>
								</c>
								<c ca="center">
									<p>17(1.2%)</p>
								</c>
								<c ca="center">
									<p>62(4.3%)</p>
								</c>
								<c ca="center">
									<p>184 (12.8%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>70 (4.9%)</p>
								</c>
								<c ca="center">
									<p>19(1.3%)</p>
								</c>
								<c ca="center">
									<p>3(0.2%)</p>
								</c>
								<c ca="center">
									<p>19(1.3%)</p>
								</c>
								<c ca="center">
									<p>24(1.7%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>30 (2.1%)</p>
								</c>
								<c ca="center">
									<p>8(0.6%)</p>
								</c>
								<c ca="center">
									<p>4(0.3%)</p>
								</c>
								<c ca="center">
									<p>3(0.2%)</p>
								</c>
								<c ca="center">
									<p>1(0.1%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>exon</p>
								</c>
								<c ca="center">
									<p>164 (11.4%)</p>
								</c>
								<c ca="center">
									<p>61(4.2%)</p>
								</c>
								<c ca="center">
									<p>15(1.0%)</p>
								</c>
								<c ca="center">
									<p>6(0.4%)</p>
								</c>
								<c ca="center">
									<p>6(0.4%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>intron</p>
								</c>
								<c ca="center">
									<p>226(15.7%)</p>
								</c>
								<c ca="center">
									<p>20(1.4%)</p>
								</c>
								<c ca="center">
									<p>3(0.2%)</p>
								</c>
								<c ca="center">
									<p>5(0.3%)</p>
								</c>
								<c ca="center">
									<p>4(0.3%)</p>
								</c>
							</r>
							<r>
								<c cspan="7">
									<hr/>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>Exon</p>
								</c>
								<c ca="center">
									<p>Intron</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c cspan="6">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>Found in human AND mouse (241)</p>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>32(13.3%)</p>
								</c>
								<c ca="center">
									<p>2(0.8%)</p>
								</c>
								<c ca="center">
									<p>2(0.8%)</p>
								</c>
								<c ca="center">
									<p>39(16.2%)</p>
								</c>
								<c ca="center">
									<p>11(4.6%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>4(1.7%)</p>
								</c>
								<c ca="center">
									<p>0(0.0%)</p>
								</c>
								<c ca="center">
									<p>0(0.0%)</p>
								</c>
								<c ca="center">
									<p>15(6.2%)</p>
								</c>
								<c ca="center">
									<p>0(0.0%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>2(0.8%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
								<c ca="center">
									<p>2(0.8%)</p>
								</c>
								<c ca="center">
									<p>0(0.0%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>exon</p>
								</c>
								<c ca="center">
									<p>71(29.5%)</p>
								</c>
								<c ca="center">
									<p>32(13.3%)</p>
								</c>
								<c ca="center">
									<p>5(2.1%)</p>
								</c>
								<c ca="center">
									<p>2(0.8%)</p>
								</c>
								<c ca="center">
									<p>3(1.2%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>intron</p>
								</c>
								<c ca="center">
									<p>9(3.7%)</p>
								</c>
								<c ca="center">
									<p>3(1.2%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
								<c ca="center">
									<p>3(1.2%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
							</r>
							<r>
								<c cspan="7">
									<hr/>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>Exon</p>
								</c>
								<c ca="center">
									<p>Intron</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c cspan="6">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>Found in human AND chimp AND rodents (234)</p>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>31(13.2%)</p>
								</c>
								<c ca="center">
									<p>2(0.9%)</p>
								</c>
								<c ca="center">
									<p>2(0.9%)</p>
								</c>
								<c ca="center">
									<p>38(16.2%)</p>
								</c>
								<c ca="center">
									<p>10(4.3%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>4(1.7%)</p>
								</c>
								<c ca="center">
									<p>0(0.0%)</p>
								</c>
								<c ca="center">
									<p>0(0.0%)</p>
								</c>
								<c ca="center">
									<p>14(6.0%)</p>
								</c>
								<c ca="center">
									<p>0(0.0%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>2(0.9%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
								<c ca="center">
									<p>0(0.0%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>exon</p>
								</c>
								<c ca="center">
									<p>70(29.9%)</p>
								</c>
								<c ca="center">
									<p>32(13.7%)</p>
								</c>
								<c ca="center">
									<p>5(2.1%)</p>
								</c>
								<c ca="center">
									<p>2(0.9%)</p>
								</c>
								<c ca="center">
									<p>3(1.3%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>intron</p>
								</c>
								<c ca="center">
									<p>9(3.8%)</p>
								</c>
								<c ca="center">
									<p>2(0.9%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
								<c ca="center">
									<p>3(1.3%)</p>
								</c>
								<c ca="center">
									<p>1(0.4%)</p>
								</c>
							</r>
							<r>
								<c cspan="7">
									<hr/>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>exon</p>
								</c>
								<c ca="center">
									<p>Intron</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c cspan="6">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>Found in human AND in chimp NOT in rodents (1202)</p>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>387(32.2%)</p>
								</c>
								<c ca="center">
									<p>62(5.2%)</p>
								</c>
								<c ca="center">
									<p>15(1.2%)</p>
								</c>
								<c ca="center">
									<p>24(2.0%)</p>
								</c>
								<c ca="center">
									<p>174 (14.5%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>66(5.5%)</p>
								</c>
								<c ca="center">
									<p>19(1.6%)</p>
								</c>
								<c ca="center">
									<p>3(0.2%)</p>
								</c>
								<c ca="center">
									<p>5(0.4%)</p>
								</c>
								<c ca="center">
									<p>24(2.0%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>28(2.3%)</p>
								</c>
								<c ca="center">
									<p>7(0.6%)</p>
								</c>
								<c ca="center">
									<p>3(0.2%)</p>
								</c>
								<c ca="center">
									<p>2(0.2%)</p>
								</c>
								<c ca="center">
									<p>1(0.1%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>exon</p>
								</c>
								<c ca="center">
									<p>94(7.8%)</p>
								</c>
								<c ca="center">
									<p>29(2.4%)</p>
								</c>
								<c ca="center">
									<p>10(0.8%)</p>
								</c>
								<c ca="center">
									<p>4(0.3%)</p>
								</c>
								<c ca="center">
									<p>3(0.2%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>intron</p>
								</c>
								<c ca="center">
									<p>217(18.1%)</p>
								</c>
								<c ca="center">
									<p>18(1.5%)</p>
								</c>
								<c ca="center">
									<p>2(0.2%)</p>
								</c>
								<c ca="center">
									<p>2(0.2%)</p>
								</c>
								<c ca="center">
									<p>3(0.2%)</p>
								</c>
							</r>
							<r>
								<c cspan="7">
									<hr/>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>exon</p>
								</c>
								<c ca="center">
									<p>Intron</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c cspan="6">
									<hr/>
								</c>
							</r>
							<r>
								<c ca="center">
									<p>Found in human AND in chimp NOT in rodents, and NOT present in databases (1073)</p>
								</c>
								<c ca="center">
									<p>Distal</p>
								</c>
								<c ca="center">
									<p>
										<b>353(33.1%)</b>
									</p>
								</c>
								<c ca="center">
									<p>56(5.2%)</p>
								</c>
								<c ca="center">
									<p>13(1.2%)</p>
								</c>
								<c ca="center">
									<p>22(2.1%)</p>
								</c>
								<c ca="center">
									<p>151 (14.1%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>10 kb</p>
								</c>
								<c ca="center">
									<p>57(5.3%)</p>
								</c>
								<c ca="center">
									<p>15(1.4%)</p>
								</c>
								<c ca="center">
									<p>2(0.2%)</p>
								</c>
								<c ca="center">
									<p>5(0.5%)</p>
								</c>
								<c ca="center">
									<p>20(1.9%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>1 kb</p>
								</c>
								<c ca="center">
									<p>24(2.2%)</p>
								</c>
								<c ca="center">
									<p>7(0.7%)</p>
								</c>
								<c ca="center">
									<p>3(0.3%)</p>
								</c>
								<c ca="center">
									<p>1(0.1%)</p>
								</c>
								<c ca="center">
									<p>1(0.1%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>exon</p>
								</c>
								<c ca="center">
									<p>83(7.7%)</p>
								</c>
								<c ca="center">
									<p>24(2.2%)</p>
								</c>
								<c ca="center">
									<p>9(0.8%)</p>
								</c>
								<c ca="center">
									<p>2(0.2%)</p>
								</c>
								<c ca="center">
									<p>3(0.3%)</p>
								</c>
							</r>
							<r>
								<c>
									<p/>
								</c>
								<c ca="center">
									<p>intron</p>
								</c>
								<c ca="center">
									<p>196 (18.3%)</p>
								</c>
								<c ca="center">
									<p>17(1.6%)</p>
								</c>
								<c ca="center">
									<p>2(0.2%)</p>
								</c>
								<c ca="center">
									<p>2(0.2%)</p>
								</c>
								<c ca="center">
									<p>3(0.3%)</p>
								</c>
							</r>
						</tblbdy>
					</tbl>
					<fig id="F2">
						<title>
							<p>Figure 2</p>
						</title>
						<caption>
							<p>Example of secondary structures of three candidates for novel primate-specific non-protein-coding RNAs as predicted by program RNAZ</p>
						</caption>
						<text>
							<p><b>Example of secondary structures of three candidates for novel primate-specific non-protein-coding RNAs as predicted by program RNAZ</b>. The genomic coordinates (NCBI build 34) of the three TARs are chromosome 2:10,415,689&#8211;10,415,908; chromosomes 11: 11,3423,510&#8211;113,423,729; and chromosome 7:56,614,161&#8211;56,614,382.</p>
						</text>
						<graphic file="1471-2148-7-S1-S14-2"/>
					</fig>
					<suppl id="S1">
						<title>
							<p>Additional file 1</p>
						</title>
						<text>
							<p>chromosomal coordinates of the final 353 candidate TAR sequences based on NCBI build 34</p>
						</text>
						<file name="1471-2148-7-S1-S14-S1.doc">
							<p>Click here for file</p>
						</file>
					</suppl>
					<suppl id="S2">
						<title>
							<p>Additional file 2</p>
						</title>
						<text>
							<p>DNA sequence of these candidate sequences</p>
						</text>
						<file name="1471-2148-7-S1-S14-S2.doc">
							<p>Click here for file</p>
						</file>
					</suppl>
					<suppl id="S3">
						<title>
							<p>Additional file 3</p>
						</title>
						<text>
							<p>UCSC genome browser screen shots of these 3 candidate sequences as shown in Figure <figr fid="F2">2</figr>. These screenshots show that these sequences are either absent or less conserved in the mouse and rat genomes, thus are primate-specific.</p>
						</text>
						<file name="1471-2148-7-S1-S14-S3.pdf">
							<p>Click here for file</p>
						</file>
					</suppl>
					<p>It is interesting and encouraging that our analysis has discovered a large number of potential noncoding RNAs that only exist in the primates. Conventional genome annotation efforts often limit the cross-species comparison to human and mouse, such strategy likely have overlooked many lineage-specific protein or RNA genes. As we discuss below, special strategies are needed to uncover these lineage-specific sequences.</p>
				</sec>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<sec>
				<st>
					<p>Primate-specific noncoding RNAs in the human genome</p>
				</st>
				<p>It is important and fascinating to identify and characterize the genes that are responsible for the primate or human distinctiveness. In this paper, we discussed a bioinformatics analysis on the novel RNA transcripts discovered in our previous tiling array work <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. We are interested to identify those functional novel transcripts that are primate-specific, i.e. they emerged only recently in the primate lineage and thus have no obvious sequence homologs in other mammalian genomes. This is a novel research area that has been largely overlooked, and it potentially will have great impact in the field of non-protein-coding RNAs, comparative genomics, and also medicine.</p>
				<p>Most of the current efforts in detecting novel coding or noncoding transcripts require the transcript to be conserved in at least another mammalian genome, mostly in rodent genomes since they were the only available mammalian genomes until the chimpanzee draft genome was finished in 2005. Rodents and human last shared common ancestor at about 75&#8211;80 million years ago; their evolutionary distance from human is considered sufficiently distant to be able to separate conserved functional sequence that are under selective (purifying) pressure from those background neutral DNA <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. A potential limitation of only using rodents as the yardstick in such comparative studies is that it overlooks those genes that have only emerged recently in the primate line-age, which likely determine primate-specific traits. Three-way comparisons between human-mouse-rat genomes have revealed 2302 rodent-specific exons, and similar number of human-specific genes <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. These new genes were believed to have arisen through the following processes: (i) accelerated evolution in one lineage, (ii) arisen de novo from non-coding DNA, and (iii) derived from retroposition or recombinations <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Similarly, lineage-specific ncRNAs must also be present in either rodents or primates, which remain to be discovered.</p>
			</sec>
			<sec>
				<st>
					<p>Comparison with other predictions</p>
				</st>
				<p>Pedersen and colleagues recently developed a computational method called EvoFold, which utilizes the algorithm of phylogenetic stochastic context-free grammar (Phylo-SCFG) to detect conserved structured RNAs in the genome <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. These researchers first aligned the whole genome sequences of eight vertebrates (human, chimpanzee, mouse, rat, dog, chicken, zebra-fish, and puffer-fish), and applied the EvoFold program to derive 48,479 sequences in the human genome that are predicted to have secondary structures. These predicted sequences can be accessed at the UCSC genome browser.</p>
				<p>We are interested to analyze the overlap between the Evofold predictions and the TARs. For each TAR, we identified the closest ncRNA candidate as predicted by Evofold. Surprisingly these two datasets have very little overlap: 548 TARs overlap with an Evofold prediction, and 624 TARs (including the overlapping ones) are within 100 bp of a nearest Evofold prediction. Among the 548 TARs that overlap with Evofold, only 16 were predicted by RNAZ to be noncoding RNAs with P-value greater than 0.5. The lack of overlap between TARs and the Evolfold predictions is not really surprising, as the latter only looked at the genomic regions that are conserved in eight vertebrate species.</p>
			</sec>
			<sec>
				<st>
					<p>Phylogenetic Shadowing</p>
				</st>
				<p>As discussed above, the conventional phylogenetic and comparative methods have their limitations in identification of lineage-specific transcripts. An alternative approach, "phylogenetic shadowing", has been recently used in a number of studies and is likely to be very useful in this area. Phylogenetic shadowing is an alternative method to phylogenetic footprinting, which is a more commonly used comparative technique <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Both methods use comparative approach to identify functional elements hidden in a group of orthologous sequences, but they work in different ways and are most suited in different situations. Phylogenetic footprinting is most useful in searching for conserved elements that are present in organisms that are very distantly related. As at such great evolutionary distance, any conserved sequence would have been the result of selective pressure, therefore must be functionally important. In contrast, phylogenetic shadowing is best suited to study sequences from a group of closely related species. It analyzes patterns of sequence variations and mutations in a multiple sequence alignment, and separates the slowly evolving sites from the fast evolving sites. The sites that evolve slower are inferred as being under stronger selective pressure thus functionally important. In order to rigorously calculate the sequence variations without bias, phylogenetic relationships among the species is usually required. For closely related species, such phylogeny information is normally easy to obtain. Boffelli and Rubin were the first to employ phylogenetic shadowing, who used it on sequences from primates to discover regulatory elements and exon/intron boundaries <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Phylogenetic shadowing was recently used among a group of 9 primate species to identify conserved miRNA sequences <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Future directions</p>
				</st>
				<p>We have initiated a sequencing project to obtain orthologous sequences for the 353 candidate transcripts from several related primate species. Experimental details and analysis will be reported in the future.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<sec>
				<st>
					<p>Searching for homologs in other genomes</p>
				</st>
				<p>For each TAR, we used Blastn to search for homologous sequences in another fully sequenced genome or sequence library. It is important to select the most optimal Blastn e-value threshold, so that we will not miss any real homologs, and also avoid too many false positive hits. To select the e-value cut-off, we did the following control experiments. We selected the experimentally verified human miRNA hairpin sequences from the mirRegistry database <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> as query sequences, and BLASTed them against the mouse genome. We also included some negative sequences into the query set. All of these known human miRNAs have homologs in the mouse genome, so the resulted e-values from this Blastn search should be the optimal cutoff for selecting homologs in a different genome. The results confirmed that e-value = 0.01 is sufficient to identify the homologs and separate the real homologs from negative controls.</p>
			</sec>
			<sec>
				<st>
					<p>Predicting secondary structure in RNAs</p>
				</st>
				<p>Programs such as MiRscan, miRseeker and ProMiR are dedicated to search for miRNAs <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp> and programs such as RNAZ, RNAFOLD, Mfold, ddbRNA, RANDFOLD, MSARI, QRNA, FOLDALIGN were written to detect stable RNA secondary structures <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>. A track (EvoFold) was also implemented into the UCSC Genome Browser, which indicates the potential of forming secondary structures for any give genomics locus. A number of databases have been created to collect and categorize these ncRNA sequences, which include Rfam <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, NONCODE <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, microRNA Register <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, and RNAdb <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>.</p>
				<p>Among these RNA structure prediction tools, the RNAZ program has been evaluated as the most effective <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, and was used as the primary prediction tool <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The effectiveness of the RNAZ program comes from its unique approach in combining the predicted thermodynamic stability with the structure and sequence conservation index <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The program has been tested on positive and negative control sequences. At the P value cutoff at 0.9, the program has the sensitivity of 75% and specificity of 98% <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. We are currently also testing other prediction software such as QRNA, we will compare these two prediction results and investigate the possibility of using the intersection of the two predictions.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Authors' contributions</p>
			</st>
			<p>ZZ conducted most of the computational analysis; AP is responsible for evaluating and running the RNAZ program. MBG was responsible for the initiation of the original tiling array experiment and providing the data.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>ZZ thanks Steve Scherer, Benjamin Blencowe, Timothy Hughes and Matthew Fagnani for helpful discussions. This work is partially supported by the Start-up fund from University of Toronto Faculty of Medicine and by a grant from Canadian Institutes of Health Research (CIHR) to ZZ. AP was partially funded by a summer research fellowship from Ontario Genomics Institute (OGI).</p>
				<p>This article has been published as part of <it>BMC Evolutionary Biology </it>Volume 7 Supplement 1, 2007: First International Conference on Phylogenomics. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/bmcevolbiol/7?issue=S1</url>.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Applications of DNA tiling arrays to experimental genome annotation and regulatory pathway discovery</p>
				</title>
				<aug>
					<au>
						<snm>Bertone</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Gerstein</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Snyder</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Chromosome Res</source>
				<pubdate>2005</pubdate>
				<volume>13</volume>
				<issue>3</issue>
				<fpage>259</fpage>
				<lpage>274</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s10577-005-2165-0</pubid>
						<pubid idtype="pmpid" link="fulltext">15868420</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Dark matter in the genome: evidence of widespread transcription detected by microarray tiling experiments</p>
				</title>
				<aug>
					<au>
						<snm>Johnson</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Edwards</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Shoemaker</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Schadt</snm>
						<fnm>EE</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>2</issue>
				<fpage>93</fpage>
				<lpage>102</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tig.2004.12.009</pubid>
						<pubid idtype="pmpid" link="fulltext">15661355</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Large-scale transcriptional activity in chromosomes 21 and 22</p>
				</title>
				<aug>
					<au>
						<snm>Kapranov</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Cawley</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Drenkow</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bekiranov</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Strausberg</snm>
						<fnm>RL</fnm>
					</au>
					<au>
						<snm>Fodor</snm>
						<fnm>SP</fnm>
					</au>
					<au>
						<snm>Gingeras</snm>
						<fnm>TR</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2002</pubdate>
				<volume>296</volume>
				<issue>5569</issue>
				<fpage>916</fpage>
				<lpage>919</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1068597</pubid>
						<pubid idtype="pmpid" link="fulltext">11988577</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Global identification of human transcribed sequences with genome tiling arrays</p>
				</title>
				<aug>
					<au>
						<snm>Bertone</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Stolc</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Royce</snm>
						<fnm>TE</fnm>
					</au>
					<au>
						<snm>Rozowsky</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Urban</snm>
						<fnm>AE</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Rinn</snm>
						<fnm>JL</fnm>
					</au>
					<au>
						<snm>Tongprasit</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Samanta</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Weissman</snm>
						<fnm>S</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>306</volume>
				<issue>5705</issue>
				<fpage>2242</fpage>
				<lpage>2246</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1103388</pubid>
						<pubid idtype="pmpid" link="fulltext">15539566</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>A comprehensive transcript index of the human genome generated using microarrays and computational approaches</p>
				</title>
				<aug>
					<au>
						<snm>Schadt</snm>
						<fnm>EE</fnm>
					</au>
					<au>
						<snm>Edwards</snm>
						<fnm>SW</fnm>
					</au>
					<au>
						<snm>GuhaThakurta</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Holder</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Ying</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Svetnik</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Leonardson</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hart</snm>
						<fnm>KW</fnm>
					</au>
					<au>
						<snm>Russell</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>G</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2004</pubdate>
				<volume>5</volume>
				<issue>10</issue>
				<fpage>R73</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">545593</pubid>
						<pubid idtype="pmpid" link="fulltext">15461792</pubid>
						<pubid idtype="doi">10.1186/gb-2004-5-10-r73</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution</p>
				</title>
				<aug>
					<au>
						<snm>Cheng</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Kapranov</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Drenkow</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Dike</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Brubaker</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Patel</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Long</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Stern</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Tammana</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Helt</snm>
						<fnm>G</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2005</pubdate>
				<volume>308</volume>
				<issue>5725</issue>
				<fpage>1149</fpage>
				<lpage>1154</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1108625</pubid>
						<pubid idtype="pmpid" link="fulltext">15790807</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Identifying novel transcripts and novel genes in the human genome by using novel SAGE tags</p>
				</title>
				<aug>
					<au>
						<snm>Chen</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Sun</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Zhou</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Rowley</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>SM</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<issue>19</issue>
				<fpage>12257</fpage>
				<lpage>12262</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">129432</pubid>
						<pubid idtype="pmpid" link="fulltext">12213963</pubid>
						<pubid idtype="doi">10.1073/pnas.192436499</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Using the transcriptome to annotate the genome</p>
				</title>
				<aug>
					<au>
						<snm>Saha</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Sparks</snm>
						<fnm>AB</fnm>
					</au>
					<au>
						<snm>Rago</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Akmaev</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Vogelstein</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Kinzler</snm>
						<fnm>KW</fnm>
					</au>
					<au>
						<snm>Velculescu</snm>
						<fnm>VE</fnm>
					</au>
				</aug>
				<source>Nat Biotechnol</source>
				<pubdate>2002</pubdate>
				<volume>20</volume>
				<issue>5</issue>
				<fpage>508</fpage>
				<lpage>512</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nbt0502-508</pubid>
						<pubid idtype="pmpid" link="fulltext">11981567</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Complete sequencing and characterization of 21,243 full-length human cDNAs</p>
				</title>
				<aug>
					<au>
						<snm>Ota</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Suzuki</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Nishikawa</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Otsuki</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Sugiyama</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Irie</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Wakamatsu</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Hayashi</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Sato</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Nagai</snm>
						<fnm>K</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nat Genet</source>
				<pubdate>2004</pubdate>
				<volume>36</volume>
				<issue>1</issue>
				<fpage>40</fpage>
				<lpage>45</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/ng1285</pubid>
						<pubid idtype="pmpid" link="fulltext">14702039</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs</p>
				</title>
				<aug>
					<au>
						<snm>Okazaki</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Furuno</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kasukawa</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Adachi</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Bono</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Kondo</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Nikaido</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Osato</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Saito</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Suzuki</snm>
						<fnm>H</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>420</volume>
				<issue>6915</issue>
				<fpage>563</fpage>
				<lpage>573</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature01266</pubid>
						<pubid idtype="pmpid" link="fulltext">12466851</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection</p>
				</title>
				<aug>
					<au>
						<snm>Numata</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Kanai</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Saito</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Kondo</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Adachi</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Wilming</snm>
						<fnm>LG</fnm>
					</au>
					<au>
						<snm>Hume</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Hayashizaki</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Tomita</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<issue>6B</issue>
				<fpage>1301</fpage>
				<lpage>1306</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">403720</pubid>
						<pubid idtype="pmpid" link="fulltext">12819127</pubid>
						<pubid idtype="doi">10.1101/gr.1011603</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Empirical analysis of transcriptional activity in the Arabidopsis genome</p>
				</title>
				<aug>
					<au>
						<snm>Yamada</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Lim</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Dale</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Shinn</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Palm</snm>
						<fnm>CJ</fnm>
					</au>
					<au>
						<snm>Southwick</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Wu</snm>
						<fnm>HC</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Nguyen</snm>
						<fnm>M</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>302</volume>
				<issue>5646</issue>
				<fpage>842</fpage>
				<lpage>846</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1088305</pubid>
						<pubid idtype="pmpid" link="fulltext">14593172</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>A gene expression map for the euchromatic genome of Drosophila melanogaster</p>
				</title>
				<aug>
					<au>
						<snm>Stolc</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Gauhar</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Mason</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Halasz</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>van Batenburg</snm>
						<fnm>MF</fnm>
					</au>
					<au>
						<snm>Rifkin</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Hua</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Herreman</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Tongprasit</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Barbano</snm>
						<fnm>PE</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>306</volume>
				<issue>5696</issue>
				<fpage>655</fpage>
				<lpage>660</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1101312</pubid>
						<pubid idtype="pmpid" link="fulltext">15499012</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Detecting novel low-abundant transcripts in Drosophila</p>
				</title>
				<aug>
					<au>
						<snm>Lee</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Bao</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Zhou</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Shapiro</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Xu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Shi</snm>
						<fnm>RZ</fnm>
					</au>
					<au>
						<snm>Lu</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Clark</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Johnson</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>YC</fnm>
					</au>
					<etal/>
				</aug>
				<source>Rna</source>
				<pubdate>2005</pubdate>
				<volume>11</volume>
				<issue>6</issue>
				<fpage>939</fpage>
				<lpage>946</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1370778</pubid>
						<pubid idtype="pmpid" link="fulltext">15923377</pubid>
						<pubid idtype="doi">10.1261/rna.7239605</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Evidence that functional transcription units cover at least half of the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Semon</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Duret</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>5</issue>
				<fpage>229</fpage>
				<lpage>232</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tig.2004.03.001</pubid>
						<pubid idtype="pmpid" link="fulltext">15109775</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22</p>
				</title>
				<aug>
					<au>
						<snm>Kampa</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Cheng</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Kapranov</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Yamanaka</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Brubaker</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Cawley</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Drenkow</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Piccolboni</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Bekiranov</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Helt</snm>
						<fnm>G</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Res</source>
				<pubdate>2004</pubdate>
				<volume>14</volume>
				<issue>3</issue>
				<fpage>331</fpage>
				<lpage>342</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">353210</pubid>
						<pubid idtype="pmpid" link="fulltext">14993201</pubid>
						<pubid idtype="doi">10.1101/gr.2094104</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms</p>
				</title>
				<aug>
					<au>
						<snm>Mattick</snm>
						<fnm>JS</fnm>
					</au>
				</aug>
				<source>Bioessays</source>
				<pubdate>2003</pubdate>
				<volume>25</volume>
				<issue>10</issue>
				<fpage>930</fpage>
				<lpage>939</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/bies.10332</pubid>
						<pubid idtype="pmpid" link="fulltext">14505360</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Non-coding RNAs: hope or hype?</p>
				</title>
				<aug>
					<au>
						<snm>Huttenhofer</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Schattner</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Polacek</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>5</issue>
				<fpage>289</fpage>
				<lpage>297</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tig.2005.03.007</pubid>
						<pubid idtype="pmpid" link="fulltext">15851066</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Non-coding, mRNA-like RNAs database Y2K</p>
				</title>
				<aug>
					<au>
						<snm>Erdmann</snm>
						<fnm>VA</fnm>
					</au>
					<au>
						<snm>Szymanski</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hochberg</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Groot</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Barciszewski</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2000</pubdate>
				<volume>28</volume>
				<issue>1</issue>
				<fpage>197</fpage>
				<lpage>200</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">102406</pubid>
						<pubid idtype="pmpid" link="fulltext">10592224</pubid>
						<pubid idtype="doi">10.1093/nar/28.1.197</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Waste not, want not &#8211; transcript excess in multicellular eukaryotes</p>
				</title>
				<aug>
					<au>
						<snm>Brosius</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2005</pubdate>
				<volume>21</volume>
				<issue>5</issue>
				<fpage>287</fpage>
				<lpage>288</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.tig.2005.02.014</pubid>
						<pubid idtype="pmpid" link="fulltext">15851065</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>unpublished data</p>
				</title>
				<aug>
					<au>
						<snm>Smit</snm>
						<fnm>AF</fnm>
					</au>
					<au>
						<snm>Green</snm>
						<fnm>P</fnm>
					</au>
				</aug>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Integrative annotation of 21,037 human genes validated by full-length cDNA clones</p>
				</title>
				<aug>
					<au>
						<snm>Imanishi</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Itoh</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Suzuki</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>O'Donovan</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Fukuchi</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Koyanagi</snm>
						<fnm>KO</fnm>
					</au>
					<au>
						<snm>Barrero</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Tamura</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Yamaguchi-Kabata</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Tanino</snm>
						<fnm>M</fnm>
					</au>
					<etal/>
				</aug>
				<source>PLoS Biol</source>
				<pubdate>2004</pubdate>
				<volume>2</volume>
				<issue>6</issue>
				<fpage>e162</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">393292</pubid>
						<pubid idtype="pmpid" link="fulltext">15103394</pubid>
						<pubid idtype="doi">10.1371/journal.pbio.0020162</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Functional annotation of a full-length mouse cDNA collection</p>
				</title>
				<aug>
					<au>
						<snm>Kawai</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Shinagawa</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Shibata</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Yoshino</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Itoh</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Ishii</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Arakawa</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Hara</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Fukunishi</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Konno</snm>
						<fnm>H</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2001</pubdate>
				<volume>409</volume>
				<issue>6821</issue>
				<fpage>685</fpage>
				<lpage>690</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35055500</pubid>
						<pubid idtype="pmpid" link="fulltext">11217851</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<url>http://www.macaque.org/</url>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Mouse transcriptome: neutral evolution of 'non-coding' complementary DNAs</p>
				</title>
				<aug>
					<au>
						<snm>Wang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Zheng</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Samudrala</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Yu</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Wong</snm>
						<fnm>GK</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>431</volume>
				<issue>7010</issue>
				<note>1 p following 757; discussion following 757</note>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Response: Mouse transcriptome: neutral evolution of 'non-coding' complementary DNAs</p>
				</title>
				<aug>
					<au>
						<snm>Hyashizaki</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>431</volume>
				<issue>7010</issue>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Tracking down noncoding RNAs</p>
				</title>
				<aug>
					<au>
						<snm>Moulton</snm>
						<fnm>V</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2005</pubdate>
				<volume>102</volume>
				<issue>7</issue>
				<fpage>2269</fpage>
				<lpage>2270</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">549017</pubid>
						<pubid idtype="pmpid" link="fulltext">15703286</pubid>
						<pubid idtype="doi">10.1073/pnas.0500129102</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Fast and reliable prediction of noncoding RNAs</p>
				</title>
				<aug>
					<au>
						<snm>Washietl</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Hofacker</snm>
						<fnm>IL</fnm>
					</au>
					<au>
						<snm>Stadler</snm>
						<fnm>PF</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2005</pubdate>
				<volume>102</volume>
				<issue>7</issue>
				<fpage>2454</fpage>
				<lpage>2459</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">548974</pubid>
						<pubid idtype="pmpid" link="fulltext">15665081</pubid>
						<pubid idtype="doi">10.1073/pnas.0409169102</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Initial sequencing and comparative analysis of the mouse genome</p>
				</title>
				<aug>
					<au>
						<snm>Waterston</snm>
						<fnm>RH</fnm>
					</au>
					<au>
						<snm>Lindblad-Toh</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Birney</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Rogers</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Abril</snm>
						<fnm>JF</fnm>
					</au>
					<au>
						<snm>Agarwal</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Agarwala</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Ainscough</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Alexandersson</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>An</snm>
						<fnm>P</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>420</volume>
				<issue>6915</issue>
				<fpage>520</fpage>
				<lpage>562</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature01262</pubid>
						<pubid idtype="pmpid" link="fulltext">12466850</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Genome sequence of the Brown Norway rat yields insights into mammalian evolution</p>
				</title>
				<aug>
					<au>
						<snm>Gibbs</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Weinstock</snm>
						<fnm>GM</fnm>
					</au>
					<au>
						<snm>Metzker</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>Muzny</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Sodergren</snm>
						<fnm>EJ</fnm>
					</au>
					<au>
						<snm>Scherer</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Scott</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Steffen</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Worley</snm>
						<fnm>KC</fnm>
					</au>
					<au>
						<snm>Burch</snm>
						<fnm>PE</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>428</volume>
				<issue>6982</issue>
				<fpage>493</fpage>
				<lpage>521</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature02426</pubid>
						<pubid idtype="pmpid" link="fulltext">15057822</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>On the sequencing of the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Waterston</snm>
						<fnm>RH</fnm>
					</au>
					<au>
						<snm>Lander</snm>
						<fnm>ES</fnm>
					</au>
					<au>
						<snm>Sulston</snm>
						<fnm>JE</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<issue>6</issue>
				<fpage>3712</fpage>
				<lpage>3716</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">122589</pubid>
						<pubid idtype="pmpid" link="fulltext">11880605</pubid>
						<pubid idtype="doi">10.1073/pnas.042692499</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Complex genomic rearrangements lead to novel primate gene function</p>
				</title>
				<aug>
					<au>
						<snm>Ciccarelli</snm>
						<fnm>FD</fnm>
					</au>
					<au>
						<snm>von Mering</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Suyama</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Harrington</snm>
						<fnm>ED</fnm>
					</au>
					<au>
						<snm>Izaurralde</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Bork</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2005</pubdate>
				<volume>15</volume>
				<issue>3</issue>
				<fpage>343</fpage>
				<lpage>351</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">551560</pubid>
						<pubid idtype="pmpid" link="fulltext">15710750</pubid>
						<pubid idtype="doi">10.1101/gr.3266405</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>The origin of new genes: glimpses from the young and old</p>
				</title>
				<aug>
					<au>
						<snm>Long</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Betran</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Thornton</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Nat Rev Genet</source>
				<pubdate>2003</pubdate>
				<volume>4</volume>
				<issue>11</issue>
				<fpage>865</fpage>
				<lpage>875</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nrg1204</pubid>
						<pubid idtype="pmpid" link="fulltext">14634634</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Identification and classification of conserved RNA secondary structures in the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Pedersen</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Bejerano</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Siepel</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Rosenbloom</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Lindblad-Toh</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Lander</snm>
						<fnm>ES</fnm>
					</au>
					<au>
						<snm>Kent</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Haussler</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>PLoS Comput Biol</source>
				<pubdate>2006</pubdate>
				<volume>2</volume>
				<issue>4</issue>
				<fpage>e33</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1440920</pubid>
						<pubid idtype="pmpid" link="fulltext">16628248</pubid>
						<pubid idtype="doi">10.1371/journal.pcbi.0020033</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Phylogenetic shadowing of primate sequences to find functional regions of the human genome</p>
				</title>
				<aug>
					<au>
						<snm>Boffelli</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>McAuliffe</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Ovcharenko</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Lewis</snm>
						<fnm>KD</fnm>
					</au>
					<au>
						<snm>Ovcharenko</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Pachter</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>EM</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>299</volume>
				<issue>5611</issue>
				<fpage>1391</fpage>
				<lpage>1394</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1081331</pubid>
						<pubid idtype="pmpid" link="fulltext">12610304</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Phylogenetic shadowing and computational identification of human microRNA genes</p>
				</title>
				<aug>
					<au>
						<snm>Berezikov</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Guryev</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>van de Belt</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Wienholds</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Plasterk</snm>
						<fnm>RH</fnm>
					</au>
					<au>
						<snm>Cuppen</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>2005</pubdate>
				<volume>120</volume>
				<issue>1</issue>
				<fpage>21</fpage>
				<lpage>24</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.cell.2004.12.031</pubid>
						<pubid idtype="pmpid" link="fulltext">15652478</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>The microRNA Registry</p>
				</title>
				<aug>
					<au>
						<snm>Griffiths-Jones</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2004</pubdate>
				<issue>32 Database</issue>
				<fpage>D109</fpage>
				<lpage>111</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">308757</pubid>
						<pubid idtype="pmpid" link="fulltext">14681370</pubid>
						<pubid idtype="doi">10.1093/nar/gkh023</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Computational identification of Drosophila microRNA genes</p>
				</title>
				<aug>
					<au>
						<snm>Lai</snm>
						<fnm>EC</fnm>
					</au>
					<au>
						<snm>Tomancak</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Williams</snm>
						<fnm>RW</fnm>
					</au>
					<au>
						<snm>Rubin</snm>
						<fnm>GM</fnm>
					</au>
				</aug>
				<source>Genome Biol</source>
				<pubdate>2003</pubdate>
				<volume>4</volume>
				<issue>7</issue>
				<fpage>R42</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">193629</pubid>
						<pubid idtype="pmpid" link="fulltext">12844358</pubid>
						<pubid idtype="doi">10.1186/gb-2003-4-7-r42</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Human microRNA prediction through a probabilistic co-learning model of sequence and structure</p>
				</title>
				<aug>
					<au>
						<snm>Nam</snm>
						<fnm>JW</fnm>
					</au>
					<au>
						<snm>Shin</snm>
						<fnm>KR</fnm>
					</au>
					<au>
						<snm>Han</snm>
						<fnm>JJ</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Kim</snm>
						<fnm>VN</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>BT</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<volume>33</volume>
				<issue>11</issue>
				<fpage>3570</fpage>
				<lpage>3581</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1159118</pubid>
						<pubid idtype="pmpid" link="fulltext">15987789</pubid>
						<pubid idtype="doi">10.1093/nar/gki668</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification</p>
				</title>
				<aug>
					<au>
						<snm>Ohler</snm>
						<fnm>U</fnm>
					</au>
					<au>
						<snm>Yekta</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Lim</snm>
						<fnm>LP</fnm>
					</au>
					<au>
						<snm>Bartel</snm>
						<fnm>DP</fnm>
					</au>
					<au>
						<snm>Burge</snm>
						<fnm>CB</fnm>
					</au>
				</aug>
				<source>Rna</source>
				<pubdate>2004</pubdate>
				<volume>10</volume>
				<issue>9</issue>
				<fpage>1309</fpage>
				<lpage>1322</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1370619</pubid>
						<pubid idtype="pmpid" link="fulltext">15317971</pubid>
						<pubid idtype="doi">10.1261/rna.5206304</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B41">
				<title>
					<p>Evidence that microRNA precursors, unlike other non-coding RNAs, have lower folding free energies than random sequences</p>
				</title>
				<aug>
					<au>
						<snm>Bonnet</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Wuyts</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Rouze</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Van de Peer</snm>
						<fnm>Y</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2004</pubdate>
				<volume>20</volume>
				<issue>17</issue>
				<fpage>2911</fpage>
				<lpage>2917</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/bth374</pubid>
						<pubid idtype="pmpid" link="fulltext">15217813</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>MSARI: multiple sequence alignments for statistical detection of RNA secondary structure</p>
				</title>
				<aug>
					<au>
						<snm>Coventry</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Kleitman</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Berger</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2004</pubdate>
				<volume>101</volume>
				<issue>33</issue>
				<fpage>12102</fpage>
				<lpage>12107</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">514400</pubid>
						<pubid idtype="pmpid" link="fulltext">15304649</pubid>
						<pubid idtype="doi">10.1073/pnas.0404193101</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>ddbRNA: detection of conserved secondary structures in multiple alignments</p>
				</title>
				<aug>
					<au>
						<snm>di Bernardo</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Down</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Hubbard</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<issue>13</issue>
				<fpage>1606</fpage>
				<lpage>1611</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/btg229</pubid>
						<pubid idtype="pmpid" link="fulltext">12967955</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>The FOLDALIGN web server for pairwise structural RNA alignment and mutual motif search</p>
				</title>
				<aug>
					<au>
						<snm>Havgaard</snm>
						<fnm>JH</fnm>
					</au>
					<au>
						<snm>Lyngso</snm>
						<fnm>RB</fnm>
					</au>
					<au>
						<snm>Gorodkin</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<issue>33 Web Server</issue>
				<fpage>W650</fpage>
				<lpage>653</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1160234</pubid>
						<pubid idtype="pmpid" link="fulltext">15980555</pubid>
						<pubid idtype="doi">10.1093/nar/gki473</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Vienna RNA secondary structure server</p>
				</title>
				<aug>
					<au>
						<snm>Hofacker</snm>
						<fnm>IL</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<issue>13</issue>
				<fpage>3429</fpage>
				<lpage>3431</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">169005</pubid>
						<pubid idtype="pmpid" link="fulltext">12824340</pubid>
						<pubid idtype="doi">10.1093/nar/gkg599</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B46">
				<title>
					<p>Noncoding RNA gene detection using comparative sequence analysis</p>
				</title>
				<aug>
					<au>
						<snm>Rivas</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Eddy</snm>
						<fnm>SR</fnm>
					</au>
				</aug>
				<source>BMC Bioinformatics</source>
				<pubdate>2001</pubdate>
				<volume>2</volume>
				<issue>1</issue>
				<fpage>8</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">64605</pubid>
						<pubid idtype="pmpid" link="fulltext">11801179</pubid>
						<pubid idtype="doi">10.1186/1471-2105-2-8</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B47">
				<title>
					<p>Rfam: annotating non-coding RNAs in complete genomes</p>
				</title>
				<aug>
					<au>
						<snm>Griffiths-Jones</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Moxon</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Marshall</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Khanna</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Eddy</snm>
						<fnm>SR</fnm>
					</au>
					<au>
						<snm>Bateman</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<issue>33 Database</issue>
				<fpage>D121</fpage>
				<lpage>124</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">540035</pubid>
						<pubid idtype="pmpid" link="fulltext">15608160</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>NONCODE: an integrated knowledge database of non-coding RNAs</p>
				</title>
				<aug>
					<au>
						<snm>Liu</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Bai</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Skogerbo</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Cai</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Deng</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Bu</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Zhao</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<issue>33 Database</issue>
				<fpage>D112</fpage>
				<lpage>115</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">539995</pubid>
						<pubid idtype="pmpid" link="fulltext">15608158</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B49">
				<title>
					<p>RNAdb &#8211; a comprehensive mammalian noncoding RNA database</p>
				</title>
				<aug>
					<au>
						<snm>Pang</snm>
						<fnm>KC</fnm>
					</au>
					<au>
						<snm>Stephen</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Engstrom</snm>
						<fnm>PG</fnm>
					</au>
					<au>
						<snm>Tajul-Arifin</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Chen</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Wahlestedt</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Lenhard</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Hayashizaki</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Mattick</snm>
						<fnm>JS</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2005</pubdate>
				<issue>33 Database</issue>
				<fpage>D125</fpage>
				<lpage>130</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">540043</pubid>
						<pubid idtype="pmpid" link="fulltext">15608161</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B50">
				<title>
					<p>How to detect non-coding RNAs?</p>
				</title>
				<aug>
					<au>
						<snm>Fontaine</snm>
						<fnm>a</fnm>
					</au>
					<au>
						<snm>Touzet</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>JOBIM: 2005</source>
				<pubdate>2005</pubdate>
			</bibl>
			<bibl id="B51">
				<title>
					<p>Tumour-suppressor activity of H19 RNA</p>
				</title>
				<aug>
					<au>
						<snm>Hao</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Crenshaw</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Moulton</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Newcomb</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Tycko</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>1993</pubdate>
				<volume>365</volume>
				<issue>6448</issue>
				<fpage>764</fpage>
				<lpage>767</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/365764a0</pubid>
						<pubid idtype="pmpid" link="fulltext">7692308</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<title>
					<p>Accumulation of miR-155 and BIC RNA in human B cell lymphomas</p>
				</title>
				<aug>
					<au>
						<snm>Eis</snm>
						<fnm>PS</fnm>
					</au>
					<au>
						<snm>Tam</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Sun</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Chadburn</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Gomez</snm>
						<fnm>MF</fnm>
					</au>
					<au>
						<snm>Lund</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Dahlberg</snm>
						<fnm>JE</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2005</pubdate>
				<volume>102</volume>
				<issue>10</issue>
				<fpage>3627</fpage>
				<lpage>3632</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">552785</pubid>
						<pubid idtype="pmpid" link="fulltext">15738415</pubid>
						<pubid idtype="doi">10.1073/pnas.0500613102</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B53">
				<title>
					<p>Identification and characterization of human BIC, a gene on chromosome 21 that encodes a noncoding RNA</p>
				</title>
				<aug>
					<au>
						<snm>Tam</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>2001</pubdate>
				<volume>274</volume>
				<issue>1&#8211;2</issue>
				<fpage>157</fpage>
				<lpage>167</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0378-1119(01)00612-6</pubid>
						<pubid idtype="pmpid" link="fulltext">11675008</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Nucleotide sequence, transcription map, and mutation analysis of the 13q14 chromosomal region deleted in B-cell chronic lymphocytic leukemia</p>
				</title>
				<aug>
					<au>
						<snm>Migliazza</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Bosch</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Komatsu</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Cayanis</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Martinotti</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Toniato</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Guccione</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Qu</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Chien</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Murty</snm>
						<fnm>VV</fnm>
					</au>
					<etal/>
				</aug>
				<source>Blood</source>
				<pubdate>2001</pubdate>
				<volume>97</volume>
				<issue>7</issue>
				<fpage>2098</fpage>
				<lpage>2104</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1182/blood.V97.7.2098</pubid>
						<pubid idtype="pmpid" link="fulltext">11264177</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>2005 #16: Identification of a novel gene NCRMS on chromosome 12q21 with differential expression between rhabdomyosarcoma subtypes</p>
				</title>
				<aug>
					<au>
						<snm>Chan</snm>
						<fnm>AS</fnm>
					</au>
					<au>
						<snm>Thorner</snm>
						<fnm>PS</fnm>
					</au>
					<au>
						<snm>Squire</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Zielenska</snm>
						<fnm>MB</fnm>
					</au>
				</aug>
				<source>Oncogene</source>
				<pubdate>2002</pubdate>
				<volume>21</volume>
				<issue>19</issue>
				<fpage>3029</fpage>
				<lpage>3037</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/sj.onc.1205460</pubid>
						<pubid idtype="pmpid" link="fulltext">12082533</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B56">
				<title>
					<p>A novel synapse-associated noncoding RNA</p>
				</title>
				<aug>
					<au>
						<snm>Velleca</snm>
						<fnm>MA</fnm>
					</au>
					<au>
						<snm>Wallace</snm>
						<fnm>MC</fnm>
					</au>
					<au>
						<snm>Merlie</snm>
						<fnm>JP</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>1994</pubdate>
				<volume>14</volume>
				<issue>11</issue>
				<fpage>7095</fpage>
				<lpage>7104</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">359243</pubid>
						<pubid idtype="pmpid">7523860</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B57">
				<title>
					<p>A small modulatory dsRNA specifies the fate of adult neural stem cells</p>
				</title>
				<aug>
					<au>
						<snm>Kuwabara</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Hsieh</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Nakashima</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Taira</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Gage</snm>
						<fnm>FH</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>2004</pubdate>
				<volume>116</volume>
				<issue>6</issue>
				<fpage>779</fpage>
				<lpage>793</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0092-8674(04)00248-X</pubid>
						<pubid idtype="pmpid" link="fulltext">15035981</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
