<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1471-2164-13-729</ui>
	<ji>1471-2164</ji>
	<fm>
		<dochead>Research article</dochead>
		<bibl>
			<title>
				<p>The prediction of the porcine pre-microRNAs in genome-wide based on support vector machine (SVM) and homology searching</p>
			</title>
			<aug>
				<au id="A1"><snm>Wang</snm><fnm>Zhen</fnm><insr iid="I1"/><insr iid="I2"/><email>wangzhen054130124@163.com</email></au>
				<au id="A2"><snm>He</snm><fnm>Kan</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I3"/><email>hekan@sjtu.edu.cn</email></au>
				<au id="A3"><snm>Wang</snm><fnm>Qishan</fnm><insr iid="I1"/><insr iid="I2"/><email>wangqishan@sjtu.edu.cn</email></au>
				<au id="A4"><snm>Yang</snm><fnm>Yumei</fnm><insr iid="I1"/><insr iid="I2"/><email>yangyumei2818@sina.com</email></au>
				<au id="A5" ca="yes"><snm>Pan</snm><fnm>Yuchun</fnm><insr iid="I1"/><insr iid="I2"/><email>panyuchun1963@yahoo.com.cn</email></au>
			</aug>
			<insg>
				<ins id="I1"><p>School of Agriculture and Biology, Department of Animal Science, Shanghai Jiao Tong University, Shanghai, 200240, PR China</p></ins>
				<ins id="I2"><p>Shanghai Key Laboratory of Veterinary Biotechnology, Shanghai, 200240, PR China</p></ins>
				<ins id="I3"><p>Department of Biology, Faculty of Science, Hong Kong Baptist University, Hong Kong, China</p></ins>
			</insg>
			<source>BMC Genomics</source>
			<section><title><p>Non-human and non-rodent vertebrate genomics</p></title></section><issn>1471-2164</issn>
			<pubdate>2012</pubdate>
			<volume>13</volume>
			<issue>1</issue>
			<fpage>729</fpage>
			<url>http://www.biomedcentral.com/1471-2164/13/729</url>
			<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-13-729</pubid><pubid idtype="pmpid">23268561</pubid></pubidlist></xrefbib>
		</bibl>
		<history><rec><date><day>26</day><month>12</month><year>2011</year></date></rec><acc><date><day>22</day><month>12</month><year>2012</year></date></acc><pub><date><day>27</day><month>12</month><year>2012</year></date></pub></history>
		<cpyrt><year>2012</year><collab>Wang et al.; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
		<kwdg>
			<kwd>Porcine</kwd>
			<kwd>Pre-miRNA</kwd>
			<kwd>SVM</kwd>
			<kwd>Homology searching</kwd>
		</kwdg>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st><p>MicroRNAs (miRNAs) are a class of small non-coding RNAs that regulate gene expression by targeting mRNAs for translation repression or mRNA degradation. Although many miRNAs have been discovered and studied in human and mouse, few studies focused on porcine miRNAs, especially in genome wide.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st><p>Here, we adopted computational approaches including support vector machine (SVM) and homology searching to make a global scanning on the pre-miRNAs of pigs. In our study, we built the SVM-based porcine pre-miRNAs classifier with a sensitivity of 100%, a specificity of 91.2% and a total prediction accuracy of 95.6%, respectively. Moreover, 2204 novel porcine pre-miRNA candidates were found by using SVM-based pre-miRNAs classifier. Besides, 116 porcine pre-miRNA candidates were detected by homology searching.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st><p>We identified the porcine pre-miRNA in genome-wide through computational approaches by utilizing the data sets of pigs and set up the porcine pre-miRNAs library which may provide us a global scanning on the pre-miRNAs of pigs in genome level and would benefit subsequent experimental research on porcine miRNA functional and expression analysis.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st><p>MicroRNAs (miRNAs) are a family of ~22nt endogenous non-coding RNAs <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
				</abbrgrp>. Mature miRNAs are usually cleaved from ~90nt miRNA precursors (pre-miRNAs) which are derived from processing of a long primary miRNA (pri-miRNA) by a ribonucluease <abbrgrp>
					<abbr bid="B3">3</abbr>
				</abbrgrp>. Increasing evidences have shown that miRNAs play fundamentally important roles in various biological processes, including cell proliferation <abbrgrp>
					<abbr bid="B4">4</abbr>
					<abbr bid="B5">5</abbr>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
				</abbrgrp>, development timing <abbrgrp>
					<abbr bid="B8">8</abbr>
					<abbr bid="B9">9</abbr>
				</abbrgrp>, apoptosis <abbrgrp>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
				</abbrgrp>, carcinogenesis <abbrgrp>
					<abbr bid="B12">12</abbr>
					<abbr bid="B13">13</abbr>
					<abbr bid="B14">14</abbr>
				</abbrgrp>, and response to different environmental stresses containing disease <abbrgrp>
					<abbr bid="B15">15</abbr>
					<abbr bid="B16">16</abbr>
					<abbr bid="B17">17</abbr>
				</abbrgrp>.</p><p>Since the first lin-4 miRNA of <it>C. elegans</it> was discovered in 1992 <abbrgrp>
					<abbr bid="B18">18</abbr>
				</abbrgrp>, more than 19000 miRNAs have been found in animals and plants. Currently, the miRNA Registry Database (Release 17, April 2011; <url>http://mirbase.org</url>), a comprehensive and searchable database of published miRNA sequences, contains 16772 entries representing hairpin pre-miRNAs, expressing 19724 mature miRNA products, in 153 species <abbrgrp>
					<abbr bid="B19">19</abbr>
				</abbrgrp>. However, only 228 pre-miRNAs of pigs are included in this database, the number is far less than it really has.</p><p>Pre-miRNAs have similar hairpin-shaped stem loop structure, high minimal folding free energy index, and high evolutionary conservation. They become the important features which could be used in the computational identification of pre-miRNA <abbrgrp>
					<abbr bid="B20">20</abbr>
					<abbr bid="B21">21</abbr>
					<abbr bid="B22">22</abbr>
				</abbrgrp>. To date, computational prediction has been broadly used to identify potential pre-miRNAs in animals and plants <abbrgrp>
					<abbr bid="B23">23</abbr>
					<abbr bid="B24">24</abbr>
					<abbr bid="B25">25</abbr>
				</abbrgrp>, because it is not limited by tissue specificity and time of miRNA expression. Especially, machine learning approaches such as random forest (RF) <abbrgrp>
					<abbr bid="B26">26</abbr>
				</abbrgrp>, na&#239;ve Bayes classifier <abbrgrp>
					<abbr bid="B27">27</abbr>
				</abbrgrp>, hidden Markov model <abbrgrp>
					<abbr bid="B28">28</abbr>
					<abbr bid="B29">29</abbr>
				</abbrgrp> and SVM <abbrgrp>
					<abbr bid="B30">30</abbr>
					<abbr bid="B31">31</abbr>
					<abbr bid="B32">32</abbr>
				</abbrgrp> have been adopted.</p><p>Although previous studies have identified a certain number of porcine pre-miRNAs, few researches in computational identification of pre-miRNAs based on the whole genome sequences are being done. Furthermore, most of the machine learning approaches are based on the data sets of human, while the features of the pre-miRNAs also exhibit the species-specificity. Therefore, we are aimed to identify the porcine pre-miRNA in genome-wide through computational approaches by utilizing the data sets of pigs in our study, which may provide us a global scanning on the pre-miRNAs of pigs in genome level. In our study, we built the SVM-based porcine pre-miRNAs classifier with a sensitivity of 100%, a specificity of 91.2% and a total prediction accuracy of 95.6%, respectively. As a result, 2204 and 116 porcine pre-miRNA candidates were separately detected by using SVM-based pre-miRNAs classifier and homology searching.</p>
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st>
			<sec>
				<st>
					<p>Performance of the SVM-based pre-miRNAs classifier</p>
				</st><p>SVM-based porcine pre-miRNAs classifier was built by using the data sets of pigs. Interestingly, all of porcine pre-miRNAs of the test set were correctly detected by our classifier, which achieved a sensitivity (SE) of 100%, a specificity (SP) of 91.2% and a total prediction accuracy (ACC) of 95.6%, respectively. The power of the pre-miRNAs classifier was given in Table <tblr tid="T1">1</tblr>. Moreover, the performance of the classifier was also tested by a ROC curve. As shown in the Figure <figr fid="F1">1</figr>, the classifier achieved a five-fold cross-validation rate of 99.54%. In a word, it indicated that our classifier was available for the prediction of porcine pre-miRNAs. Additionally, it also demonstrated that the comprehensive use of the pre-miRNAs features of the secondary structure and sequence information was an effective strategy in pre-miRNAs prediction.
				</p>
				<table id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>
							<b>Performance of the pre</b>-<b>miRNAs classifier on test sets.</b>
						</p>
					</caption>
					<tgroup align="left" cols="4">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="left" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<b>Test set</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>
										<b>Type</b>
									</p>
								</entry>
								<entry colname="c3">
									<p>
										<b>Size</b>
									</p>
								</entry>
								<entry colname="c4">
									<p>
										<b>Accuracy (%)</b>
									</p>
								</entry>
							</row>
						</thead>
						<tfoot>
							<p>Test set represents positive and negative set used to test the power of the pre-miRNAs classifier. Type represents the classification of the test set. Size is the number of the real or pseudo pre-miRNAs contained in test set. Accuracy is the percentage of the real or pseudo correctly recognized by pre-miRNAs classifier.</p>
						</tfoot>
						<tbody valign="top">
							<row>
								<entry colname="c1">
									<p>
										<b>TE</b>-<b>S1</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>Real</p>
								</entry>
								<entry colname="c3">
									<p>40</p>
								</entry>
								<entry colname="c4">
									<p>100%</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<b>TE</b>-<b>S2</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>Pseudo</p>
								</entry>
								<entry colname="c3">
									<p>1000</p>
								</entry>
								<entry colname="c4">
									<p>91.20%</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
				<fig id="F1"><title><p>Figure 1</p></title><caption><p>ROC curve for the pre-miRNAs classifier on the test set</p></caption><text>
   <p><b>ROC curve for the pre</b>-<b>miRNAs classifier on the test set.</b> The curve with more areas has better performance of the classifier. It showed the classifier reached a well performance.</p>
</text><graphic file="1471-2164-13-729-1"/></fig><p>Xue et al. obtained an accuracy of 90% by using a set of features combining the local contiguous structures with sequence information to distinct the pre-miRNAs with that of pseudo pre-miRNAs <abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp>, and those features have been used by several other pre-miRNA predicting methods <abbrgrp>
						<abbr bid="B26">26</abbr>
						<abbr bid="B31">31</abbr>
						<abbr bid="B33">33</abbr>
					</abbrgrp>. Their studies demonstrated that those features were effective in pre-miRNA prediction. Thus, we also adopted those features in our study. Later, Jiang et al. found that the predicting performance significantly increased by combining the minimum of free energy (MFE) of the secondary structure or p-value feature with the local contiguous triplet structure composition feature. Their results indicated that a comprehensive feature vector was able to extract more information of a primary sequence and reach a better prediction performance <abbrgrp>
						<abbr bid="B26">26</abbr>
					</abbrgrp>. Our classifier was capable of achieving a well prediction performance with an accuracy of 95.6% may be due to the using of a combined feature vector, because additional seven features used in our study have been proved to be one part of the optimized features subset in pre-miRNAs prediction by Wang et al. <abbrgrp>
						<abbr bid="B3">3</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Identification of pre-miRNAs candidates on pig genome using the SVM-based classifier</p>
				</st><p>Since the genome sequences contain the full information of a species and the database of non-coding RNA of pigs is quite incompletely, thus we used whole genome sequences to construct the prediction set (PR-S). After splitting the pig genome, we obtained more than 222 million short sequences. The PR-S constructed by short sequences passed by pre-filter was further distinguished by our SVM-based pre-miRNAs classifier. As pre-filter parameters would be very useful in filtering the pseudo pre-miRNAs from huge number of similar pre-miRNA sequences, those pre-filters were incorporated into the SVM-based classifier to predict novel pre-miRNAs. Except for the redundancy and the known pre-miRNAs, we finally got 2204 pre-miRNA candidates with the probability more than 0.99995 in the pig genome. They were formed into 1849 clusters according to their locations in genome wide (inter-distance &lt;=50kb <abbrgrp>
						<abbr bid="B34">34</abbr>
					</abbrgrp>). Those pre-miRNA candidates were blasted with porcine CDS and other non-coding RNA (NONCODE v3.0, <url>http://www.noncode.org/NONCODERv3/</url>). The result shown that 6 novel pre-miRNAs (coverage &gt;90%, identities =100% with CDS) overlap with coding region. Namely, 2198 out of 2204 new pre-miRNAs are in the non-coding region. And none of pre-miRNAs (coverage &gt;90%, identities &gt;90% with non-coding RNAs) were found that overlap with other non-coding RNAs. The procedure for predicting porcine pre-miRNAs was given as Figure <figr fid="F2">2</figr>.
				</p>
				<fig id="F2"><title><p>Figure 2</p></title><caption><p>Flowchart of the porcine pre-miRNA prediction procedure</p></caption><text>
   <p><b>Flowchart of the porcine pre</b>-<b>miRNA prediction procedure.</b></p>
</text><graphic file="1471-2164-13-729-2"/></fig><p>The large number of the novel pre-miRNA candidates indicated that there were still many unidentified pre-miRNAs in pigs. Previous studies estimated that the number of miRNAs have taken up to approximately 2-3% of the total number of genes in animal genomes <abbrgrp>
						<abbr bid="B20">20</abbr>
					</abbrgrp>. According to our study, the number of the pre-miRNAs would be more than previous estimate. Expression profiling studies showed that most miRNAs were under the control of tissue-specific and development signaling, or both <abbrgrp>
						<abbr bid="B35">35</abbr>
						<abbr bid="B36">36</abbr>
						<abbr bid="B37">37</abbr>
					</abbrgrp>. As a result, it may lead to a less number of miRNA identified by experimental methods and a low evaluate of pre-miRNAs&#8217; number. Indeed, in our studies, we regarded those pre-miRNA candidates as the real porcine pre-miRNAs in the view of bioinformatics. Meanwhile, those pre-miRNA candidates were set up to the porcine pre-miRNA library, the detail information of which was given in Additional file <supplr sid="S1">1</supplr>.</p>
				<suppl id="S1">
					<title>
						<p>Additional file 1</p>
					</title>
					<text>
						<p>
							<b>The list of porcine pre</b>-<b>miRNA candidates predicted by SVM</b>-<b>based classifier.</b> The data provided represent the list of porcine pre-miRNA candidates predicted by SVM-based classifier in the whole genome of the pigs, and containing the information of their length, location in chromosome and genome location clusters.</p>
					</text>
					<file name="1471-2164-13-729-S1.xlsx">
   <p>Click here for file</p>
</file>
				</suppl><p>To explore the location distribution of all the pre-miRNA candidates, we calculated the number of pre-miRNA candidates in each chromosome. And the chromosome 1 covered the maximum number of pre-miRNAs candidates, while the chromosome 18 included the minimum. To a large extent, the number was consistent with the length of chromosome, namely the bigger of the chromosome the more number of pre-miRNA candidates it contained. The density analysis of pre-miRNA in chromosome showed that chromosome X, 8 and 16 maintained the highest density of pre-miRNA. The chromosome 8 was also found that it had a high density of quantitative trait locus (QTL) (<url>http://www.animalgenome.org/cgi-bin/QTLdb/SS/index</url>). Thus, the result suggested other researchers should pay more attention to study the chromosome 8 of pigs in the future. The result of density analysis of pre-miRNA and QTL in chromosome was given in Additional file <supplr sid="S2">2</supplr>.</p>
				<suppl id="S2">
					<title>
						<p>Additional file 2</p>
					</title>
					<text>
						<p>
							<b>The result of density analysis of pre</b>-<b>miRNA and QTL in chromosome.</b> The data provided the information of the number of pre-miRNA and QTL and their density in each chromosome.</p>
					</text>
					<file name="1471-2164-13-729-S2.xlsx">
   <p>Click here for file</p>
</file>
				</suppl><p>At the same time, 215 unique pre-miRNAs were identified in pigs by Solexa sequencing in another published study <abbrgrp>
						<abbr bid="B38">38</abbr>
					</abbrgrp>. Based on the comparison this data with ours, we found that 49 (coverage &gt;90%, identities &gt;90% with predicting pre-miRNAs ) of above 215 unique pre-miRNAs were included in our study. In Chen et al.&#8217;s study, it mainly focused on identifying miRNAs in porcine backfat tissues. Tissues-specificity may lead to a bias on much more number of miRNAs identified in backfat tissues in their study, meanwhile some of their candidate miRNAs were unidentified by our method due to a limited length of 90-nt changed their features in our study. These may count for the low overlap rate. However, the result of Chen et al.&#8217;s study may still provide a piece of experimental evidence for our study. After the step of pre-filtering, a total of 160 known pre-miRNAs were retained in PR-S. 181 sequence fragments (coverage &gt;90%, identities =100% with known pre-miRNAs) represented 115 known pre-miRNAs were detected by classifier. Namely, the sequence fragments of the known pre-miRNAs in the PR-S could be detected with the coverage of 72% (115 out of 160). The details those known pre-miRNAs sequence fragments were given in Additional file <supplr sid="S3">3</supplr>. There are several possible reasons accounting for that not all the reported porcine pre-miRNAs in miRNA Registry Database were covered in our studies. Firstly, not all the pre-miRNA sequences are expressed in the order of the genome sequence due to the RNA editing <abbrgrp>
						<abbr bid="B39">39</abbr>
						<abbr bid="B40">40</abbr>
					</abbrgrp> , such as mir-381,mir-1271. According to our observation, 184 out of known 224 pre-miRNAs are completely identical to the sequence of the genome, thus 40 known pre-miRNA sequences unmapped to the genomic sequence data were filtered. Secondly, in order to reduce the pseudo pre-miRNAs as more as possible, the pre-filter parameters setting is up to some reported pre-miRNAs, such as the value of the minimal folding free energy index (MFEI). 160 out of 184 known pre-miRNAs were retained (20 known pre-miRNA were missed) after this step. Thirdly, the length of the short sequence is limited to 90-nt, while some features of pre-miRNAs (such as adjusted minimal folding free energy (N(AMFE)) and the adjust number of paired nucleotides (N(ANNB)) have connection with the sequence length <abbrgrp>
						<abbr bid="B32">32</abbr>
						<abbr bid="B41">41</abbr>
					</abbrgrp>, which may influence the features of 45 reported pre-miRNAs and lead them to be undetected.</p>
				<suppl id="S3">
					<title>
						<p>Additional file 3</p>
					</title>
					<text>
						<p>
							<b>The list of porcine known pre</b>-<b>miRNA fragments of 90nt detected by SVM</b>-<b>based classifier.</b> The data provided represents the list of porcine known pre-miRNA detected by SVM-based classifier in the whole genome of the pigs, and containing the information of their length, location in chromosome and the name of the represented known pre-miRNA.</p>
					</text>
					<file name="1471-2164-13-729-S3.xlsx">
   <p>Click here for file</p>
</file>
				</suppl><p>Although the classifier produced a specificity of 91.2%, the candidate hairpins could be lead to a certain number of false positives in genome-wide prediction. Thus, the next problem removing those pseudo pre-miRNAs in the library is needed to be considered deeply.</p>
			</sec>
			<sec>
				<st>
					<p>Identification of the pre-miRNAs candidates using the homologous searching</p>
				</st><p>Since the pre-miRNA candidate sequences were split from genome with a specified length of 90-nt which may lead some of them undetected by our SVM classifier and the coverage of some model species with our SVM-based classifier result (coverage &gt;85%, identities &gt;85% with model species known pre-miRNAs) were 8% (human), 12% (mouse), 22% (rat), 16% (cow) and 31% (dog), which was not so high. The SVM-based classifier&#8217;s training set was composed by the porcine known pre-miRNAs to predict the novel pre-miRNAs of pigs. The feature of pre-miRNAs exhibits the species-specificity. It may cause our SVM-based classifier have some biases to detect more pre-miRNA possessed only by pigs. The species-specificity and homologous porcine pre-miRNAs unidentified in model species may contribute to the low overlap rate. It was necessary to make it up by some other computational methods. At present, besides the SVM classifier the homologous searching is also a widely used method for identifying the pre-miRNAs, because the pre-miRNAs have a highly conservation among the different species <abbrgrp>
						<abbr bid="B20">20</abbr>
					</abbrgrp>. What&#8217;s more, in recent years, a large number of new pre-miRNAs were identified in some model species, such as Mouse, Human. Up to now, according to the records of miRNA Registry Database (Release 17, April 2011; <url>http://mirbase.org</url>), it contains human (1424), mouse (720), rat (408), cow (662) and dog (323). While, there are only 228 pre-miRNAs in pig. Therefore, it is quite necessary for us to do a homologous searching once again to find the new pre-miRNAs of porcine by using the identified pre-miRNAs in the other species.</p><p>According to the criteria mentioned in homologous searching method, we found 116 new pre-miRNAs candidates, and the detail information of which was given in Additional file <supplr sid="S4">4</supplr>. Interestingly, some pre-miRNAs candidates were mapped to more than one location of chromosomes. Guo et al. thought that cross-mapping events in pre-miRNAs revealed potential miRNA-mimics and evolutionary implications <abbrgrp>
						<abbr bid="B42">42</abbr>
					</abbrgrp>. The newly identified porcine pre-miRNAs candidates belong to different miRNA families, such as miR-1282, miR-3059, miR-3120, miR-3618. Among them, miR-3120 initially identified from melanoma <abbrgrp>
						<abbr bid="B43">43</abbr>
					</abbrgrp> and miR-3618 from human cervical cancer and normal cervices <abbrgrp>
						<abbr bid="B44">44</abbr>
					</abbrgrp> have a highly conservation with pigs. We have also compared this result with the SVM-based and found no overlap between them. Actually, there were some of them passing SVM model before filtering in our study. However, when the prediction probability was set as more than 0.99995 to reduce false positive, they were filtered out with a result of no overlap between homology search and SVM-model candidates. There is no doubt that the high conservation of pre-miRNAs among the species also provides us a rapid way to identify the pig pre-miRNAs. This would be helpful to further enrich the resource of pre-miRNAs databases.</p>
				<suppl id="S4">
					<title>
						<p>Additional file 4</p>
					</title>
					<text>
						<p>
							<b>The list of porcine pre</b>-<b>miRNA candidates predicted by homology searching.</b> The data provided represent the list of porcine pre-miRNA candidates predicted by homology searching, and containing the information of their length and location in chromosome.</p>
					</text>
					<file name="1471-2164-13-729-S4.xlsx">
   <p>Click here for file</p>
</file>
				</suppl>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st><p>In conclusion, we built the SVM-based pre-miRNAs classifier using the known pre-miRNAs and CDS sets of the pigs. From the porcine genome, we discovered 2204 new pre-miRNAs candidates by our SVM-based classifier and 116 pre-miRNAs candidates by homology searching. Our study would provide guidance on further experimentally verifying swine pre-miRNA in the future and offer the opportunity to research gene function and the genetic mechanism of complex traits in genome level.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<sec>
				<st>
					<p>Sequence data collection</p>
				</st><p>The porcine genomic sequences were available from UCSC database (Mar 2010, <url>http://hgdownload.cse.ucsc.edu/goldenPath/susScr2/bigZips/</url>). The precursor sequences of known miRNAs of <it>Homo sapiens</it> (human), <it>Mus musculus</it> (mouse), <it>Rattus norvegicus</it> (rat), <it>Bos Taurus</it> (cow), <it>Canis familiaris</it> (dog) and <it>Sus scrofa</it> (pig) were obtained from miRNA Registry Database (Release 17, April 2011; <url>http://mirbase.org</url>) <abbrgrp>
						<abbr bid="B19">19</abbr>
					</abbrgrp>. The porcine protein coding regions sequences (CDS) were downloaded from NCBI (<url>ftp://ftp.ncbi.nih.gov/genomes/Sus_scrofa/RNA/</url>), which were used as the pseudo pre-miRNA data.</p>
			</sec>
			<sec>
				<st>
					<p>The length of the pre-miRNAs sequences (LS)</p>
				</st><p>The statistical length distribution of porcine pre-miRNA from miRNA Registry Database is that 86% of them within 75~105 nt. In our study, both the porcine genome sequences and CDS were divided into short sequences using a 90-nt sliding window with 9-nt increments at one time <abbrgrp>
						<abbr bid="B3">3</abbr>
						<abbr bid="B33">33</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>The complexity of the sequences</p>
				</st><p>Low-complexity of the sequences, such as those with single nucleotide repeated &gt; 8 times (for example, AAAAAAAA), dinucleotides repeated &gt; 7 times (for example, AGAGAGAGAGAGAG), trinucleotides repeated &gt; 4 times (for example, ATGATGATGATG), were removed for further analysis, since we observed few known pre-miRNA possessed such sequences. Additionally, the sequences with the region of gap were removed.</p>
			</sec>
			<sec>
				<st>
					<p>MFE feature</p>
				</st><p>MFE of the secondary structure was predicted by the Vienna RNA software package (RNAfold) (Version 1.8.5; <url>http://www.tbi.univie</url>. ac.at/~ivo/ RNA/) <abbrgrp>
						<abbr bid="B45">45</abbr>
						<abbr bid="B46">46</abbr>
					</abbrgrp>. Previous studies indicated that pre-miRNAs have a high negative MFE and MFEI, which is a useful criterion to distinguish pre-miRNAs from all coding or non-coding RNAs <abbrgrp>
						<abbr bid="B41">41</abbr>
					</abbrgrp>. The MFEI was calculated by the equation: MFEI&#8201;=&#8201;(-&#8201;100&#8201;&#215;&#8201;MFE/LS)/(G&#8201;+&#8201;C).</p><p>The three characteristics related to MFE were used as the feature vectors in SVM, and they were defined as follows:</p><p>
					<display-formula id="M1">
						<m:math name="1471-2164-13-729-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi mathvariant="normal">N</m:mi>
   <m:mfenced open="(" close=")">
      <m:mi mathvariant="normal">MFE</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mfenced open="(" close=")">
      <m:mrow>
         <m:mo>-</m:mo>
         <m:mi mathvariant="normal">MFE</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo stretchy="true">/</m:mo>
   <m:mn>1000</m:mn>
</m:mrow>
</m:math>
					</display-formula>
				</p><p>
					<display-formula id="M2">
						<m:math name="1471-2164-13-729-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi mathvariant="normal">N</m:mi>
   <m:mfenced open="(" close=")">
      <m:mi mathvariant="normal">MFE</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mi mathvariant="normal">MFEI</m:mi>
   <m:mo stretchy="true">/</m:mo>
   <m:mn>10</m:mn>
</m:mrow>
</m:math>
					</display-formula>
				</p><p>
					<display-formula id="M3">
						<m:math name="1471-2164-13-729-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi mathvariant="normal">N</m:mi>
   <m:mfenced open="(" close=")">
      <m:mi mathvariant="normal">AMFE</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mfenced open="(" close=")">
      <m:mrow>
         <m:mo>-</m:mo>
         <m:mi mathvariant="normal">MFE</m:mi>
      </m:mrow>
   </m:mfenced>
   <m:mo stretchy="true">/</m:mo>
   <m:mfenced open="(" close=")">
      <m:mrow>
         <m:mn>10</m:mn>
         <m:mo>&#215;</m:mo>
         <m:mi mathvariant="normal">LS</m:mi>
      </m:mrow>
   </m:mfenced>
</m:mrow>
</m:math>
					</display-formula>
				</p>
			</sec>
			<sec>
				<st>
					<p>Base-pairings and the secondary structure features</p>
				</st><p>Because nucleic acid G can be paired with C or U, the base-pairings on the stem of the hairpin structure included the GU wobble pairs. And the threshold of the minimum base-parings of real pre-miRNA was 18. Indeed, the stem of the hairpin structure is highly conserved in pre-miRNAs, so we still only considered the stem regions of the pre-miRNA. The number of paired nucleotides (NNB), the adjust number of paired nucleotides (ANNB) and the number of nucleotides of the stem parts (NNS) were utilized as three feature vectors, defined as follows:</p><p>
					<display-formula id="M4">
						<m:math name="1471-2164-13-729-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi mathvariant="normal">N</m:mi>
   <m:mfenced open="(" close=")">
      <m:mi mathvariant="normal">NNB</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mi mathvariant="normal">NNB</m:mi>
   <m:mo stretchy="true">/</m:mo>
   <m:mn>1000</m:mn>
</m:mrow>
</m:math>
					</display-formula>
				</p><p>
					<display-formula id="M5">
						<m:math name="1471-2164-13-729-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi mathvariant="normal">N</m:mi>
   <m:mfenced open="(" close=")">
      <m:mi mathvariant="normal">ANNB</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mi mathvariant="normal">NNB</m:mi>
   <m:mo stretchy="true">/</m:mo>
   <m:mi mathvariant="normal">LS</m:mi>
</m:mrow>
</m:math>
					</display-formula>
				</p><p>
					<display-formula id="M6">
						<m:math name="1471-2164-13-729-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi mathvariant="normal">N</m:mi>
   <m:mfenced open="(" close=")">
      <m:mi mathvariant="normal">NNS</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mi mathvariant="normal">NNB</m:mi>
   <m:mo stretchy="true">/</m:mo>
   <m:mi mathvariant="normal">NNS</m:mi>
</m:mrow>
</m:math>
					</display-formula>
				</p><p>Meanwhile, we denoted the contents of GC as follows:</p><p>
					<display-formula id="M7">
						<m:math name="1471-2164-13-729-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:mrow>
   <m:mi mathvariant="normal">N</m:mi>
   <m:mfenced open="(" close=")">
      <m:mi mathvariant="normal">GC</m:mi>
   </m:mfenced>
   <m:mo>=</m:mo>
   <m:mi mathvariant="normal">GC</m:mi>
   <m:mo stretchy="true">/</m:mo>
   <m:mn>1000</m:mn>
</m:mrow>
</m:math>
					</display-formula>
				</p><p>Besides, seven other features, including the structural diversity (N(Diversity)) (8), the frequency of the MFE structure (N(Freq/100)) (9) <abbrgrp>
						<abbr bid="B46">46</abbr>
					</abbrgrp>, adjusted base pair distance (N(dD)) (10) <abbrgrp>
						<abbr bid="B47">47</abbr>
					</abbrgrp>, average distance between internal loops (N(D_interlp/1000)) (11), the ratio of |A-U| to the length of sequence (N(|A-U|/LS)) (12), the length of the longest relaxed symmetry region (N(l_rsym_rgn/100)) (13) and the length of the longest symmetry region (N(l_sym_rgn/100)) (14), which were found as the optimized features for pre-miRNAs prediction according to the studies of Wang et.al <abbrgrp>
						<abbr bid="B3">3</abbr>
					</abbrgrp>, were also adopted.</p>
			</sec>
			<sec>
				<st>
					<p>The local adjacent sequence-structure features</p>
				</st><p>Previous studies have shown that local sequence features play a crucial role in pre-miRNAs <abbrgrp>
						<abbr bid="B48">48</abbr>
					</abbrgrp>. Additionally, Xue et al. found that the distributions of local contiguous sub-structures of pre-miRNAs are significantly distinguished with that of pseudo pre-miRNAs <abbrgrp>
						<abbr bid="B30">30</abbr>
					</abbrgrp>. Therefore, in our study, we also characterized the secondary structure of pre-miRNAs by combining of the sequence information with the local contiguous structures.</p><p>There are only two conditions for each nucleotide in the predicted secondary structure by RNAfold <abbrgrp>
						<abbr bid="B45">45</abbr>
					</abbrgrp>, paired or unpaired, denoted by brackets &#8220;(&#8221; and dots &#8220;.&#8221;, respectively. The left bracket &#8220;(&#8221; represents that paired nucleotide located near 5&#8242;-end which can be paired with another nucleotide at the 3&#8242;-end indicated by a right bracket &#8220;)&#8221;. We used &#8220;(&#8221; for both situations without differentiating &#8220;(&#8221; or &#8220;)&#8221;, because no evidence has indicated that mature miRNAs have a preference of the 3&#8242; or 5&#8242; arms of their hairpin precursors. Obviously, for any 3 adjacent nucleotides, there are eight possible structure units: &#8220;(((&#8221;, &#8220;((.&#8221;, &#8220;(.(&#8221;, &#8220;.((&#8221;, &#8220;(.&#8221;, &#8220;.(. &#8221;, &#8220;.(&#8221;, &#8220;&#8230;&#8221;. Furthermore, by considering the left nucleotide among the three <abbrgrp>
						<abbr bid="B31">31</abbr>
					</abbrgrp>, there are 32 possible sequence-structure units, left-triplet coding ,denoted as &#8220;A(((&#8221;, &#8220;U(((&#8221;, &#8220;A((.&#8221;, etc. as shown in Additional file <supplr sid="S5">5</supplr>
					<abbrgrp>
						<abbr bid="B31">31</abbr>
					</abbrgrp>. Here, we only considered the stem regions of a pre-miRNA by excluding the external single-stranded parts and the terminal loop. Similar features have been adopt by pioneer work, e.g. that Zhao et al. <abbrgrp>
						<abbr bid="B31">31</abbr>
					</abbrgrp>. The frequency of each left-triplet coding of pre-miRNA was counted to create the 32 feature vectors. After normalizing, the frequency was used as input features for SVM. Combining with 14 features above, in all 46 feature vectors (summarized in Additional file <supplr sid="S6">6</supplr>) were taken as the input of SVM.</p>
				<suppl id="S5">
					<title>
						<p>Additional file 5</p>
					</title>
					<text>
						<p>
							<b>Local sequence</b>-<b>structure features of a hairpin were denoted by the left</b>-<b>triplet coding.</b> Left-triplet elements are used to represent the local structure sequence features of a hairpin. The nucleotide type at the left and three local continuous substructures compose the left-triplet element. The appearances of all 32 possible triplet elements are counted along a hairpin segment to form a 32-dimensional vector, which is normalized to be the input vector for SVM.</p>
					</text>
					<file name="1471-2164-13-729-S5.jpeg">
   <p>Click here for file</p>
</file>
				</suppl>
				<suppl id="S6">
					<title>
						<p>Additional file 6</p>
					</title>
					<text>
						<p>
							<b>The 46 features used by SVM-based porcine pre-miRNAs classifier.</b>
						</p>
					</text>
					<file name="1471-2164-13-729-S6.docx">
   <p>Click here for file</p>
</file>
				</suppl>
			</sec>
			<sec>
				<st>
					<p>The pre-filter parameters of secondary structure features</p>
				</st><p>Each sequence secondary structure, predicted by the Vienna RNAfold, was passed through a set of filter parameters. The filtering parameters <abbrgrp>
						<abbr bid="B33">33</abbr>
						<abbr bid="B49">49</abbr>
						<abbr bid="B50">50</abbr>
					</abbrgrp>related to some terms of secondary structures were given as Additional file <supplr sid="S7">7</supplr>
					<abbrgrp>
						<abbr bid="B3">3</abbr>
					</abbrgrp>, which were shown below.</p><p indent="1">(a) The number of hairpin loops = 1;</p><p indent="1">(b) The number of symmetrical loops &lt; 6.</p><p indent="1">(c) The number of asymmetrical loops &lt; 4.</p><p indent="1">(d) The number of bulges &lt; 5.</p><p indent="1">(e) The total number of symmetrical and asymmetrical loops &lt; 8.</p><p indent="1">(f) The total number of symmetrical, asymmetrical loops and bulges &lt;10.</p><p indent="1">(g) The number of the base pairing &gt;17.</p><p indent="1">(h) The value of ANNB is between 0.3~0.43.</p><p indent="1">(i) The length of symmetrical loops &lt; 5.</p><p indent="1">(j) The length of asymmetrical loops &lt;6.</p><p indent="1">(k) The length of bulges &lt; 6.</p><p indent="1">(l) The MFE &lt; &#8722;15kal/mol.</p><p indent="1">(m) The MFEI &gt;0.7.</p><p indent="1">(n) The percentage of the GC contents is between 30-70%.</p>
				<suppl id="S7">
					<title>
						<p>Additional file 7</p>
					</title>
					<text>
						<p>
							<b>The primary sequence of the has</b>-<b>let</b>-<b>7e precursor and the locations of some terms in the secondary structure.</b> The upper part gives the primary structure of has-let-7e and the lower one shows the secondary structure and the correlative terms with varied colors.</p>
					</text>
					<file name="1471-2164-13-729-S7.jpeg">
   <p>Click here for file</p>
</file>
				</suppl>
			</sec>
			<sec>
				<st>
					<p>SVM data set</p>
				</st><p>Among the 228 known porcine pre-miRNAs, whose secondary structures with no multiple loops were considered. 224 pre-miRNAs, covering more than 98% of all the reported porcine pre-miRNAs, were retained. We randomly extracted 184 pre-miRNAs from them as one part of training set (TR-S) and the remaining 40 pre-miRNAs formed into the test set 1 (TE-S1).</p><p>A pseudo pre-miRNAs set was collected from the porcine CDS and 5677 pseudo pre-miRNAs were selected due to their similar stem-loop structures to real pre-miRNAs. The criteria for extracting the pseudo pre-miNRAs from CDS segment was complied with the pre-filter parameters of the secondary structure features above. 184 pseudo pre-miRNAs selected randomly from the pseudo pre-miRNAs set composed another part of TR-S. Furthermore, we randomly took out 1000 pseudo pre-miRNA from the remaining pseudo pre-miRNAs set as test set 2 (TE-S2).</p><p>In addition, the porcine genome sequence fragments split from genome using a 90-nt sliding window with 9-nt increments at one time, passed the pre-filter parameters of secondary structure features (including (a),(g),(l) and (MFEI&gt;0.6)), were collected for further identifying by SVM classifier and constructed the PR-S. The composition of each set was shown in Figure <figr fid="F3">3</figr>.
				</p>
				<fig id="F3"><title><p>Figure 3</p></title><caption><p>The composition of each set including the training set (TR-S), testing set (TE-S1 and TE-S2) and predictive set (PR-S)</p></caption><text>
   <p><b>The composition of each set including the training set (TR</b>-<b>S), testing set (TE</b>-<b>S1 and TE</b>-<b>S2) and predictive set (PR</b>-<b>S).</b> 184 real and pseudo porcine pre-miRNAs are randomly extracted from positive set (224 known real porcine pre-miRNAs) and negative set (5677 porcine CDS), respectively, and then they form into the training set. The remaining 40 real porcine pre-miRNAs compose the test set 1 (TE-S1). 1000 pseudo pre-miRNAs from the remaining negative set are randomly selected as test set 2 (TE-S2). Both TE-S1 and TE-S2 are used to test the performance of the SVM-based pre-miRNAs classifier. The predicting set (PR-S) is constructed by the porcine genome sequence fragments passed the pre-filter parameters of secondary structure features.</p>
</text><graphic file="1471-2164-13-729-3"/></fig>
			</sec>
			<sec>
				<st>
					<p>SVM</p>
				</st><p>SVM, based on statistical theory <abbrgrp>
						<abbr bid="B51">51</abbr>
					</abbrgrp>, has a good generalization ability <abbrgrp>
						<abbr bid="B52">52</abbr>
					</abbrgrp>. Therefore, in our study, SVM was adopted as a classifier to identify the real and pseudo pre-miRNAs. It was trained by the TR-S with the performance estimated by TE-S and applied to the PR-S. A 46-dimension feature vector referred to the above was taken as the input of SVM and the output was the number value &#8220;1&#8221;, which means the true, or &#8220;-1&#8221; indicating the false.</p><p>In our study, we downloaded a widely used software package Libsvm (Version 3.1, April 2011; <url>http://www.csie.ntu.edu.tw/~cjlin/libsvm/)</url>
					<abbrgrp>
						<abbr bid="B53">53</abbr>
					</abbrgrp> to carry out our work. In order to acquire SVM classifier with optimal performance, we applied five cross-validation in model training, which could obtain the optimal penalty parameter C and the RBF kernel parameter g. Meanwhile, the performance of the SVM classifier was evaluated by following the assessment system used in RF <abbrgrp>
						<abbr bid="B26">26</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Homologous searching</p>
				</st><p>We chose pre-miRNAs of five other mammalian species (including human, mouse, rat, cow and dog), which have a highly homology with pigs. Firstly, we removed the pre-miRNAs which have a highly homologous with 228 known porcine pre-miRNAs from the total pre-miRNAs of five species by utilizing the software of BLAST (ncbi-blast-2.2.25+; <url>ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/)</url>
					<abbrgrp>
						<abbr bid="B54">54</abbr>
					</abbrgrp>. Next, the remaining pre-miRNAs were blasted with the genome sequence of pigs and the sequence fragments (coverage &gt;85%, identities &gt;85% with pre-miRNAs) were retrieved from genome. Lastly, after discarding the redundant sequences, the sequences were regarded as pre-miRNA candidates if they accorded with the following criteria <abbrgrp>
						<abbr bid="B55">55</abbr>
						<abbr bid="B56">56</abbr>
					</abbrgrp>:(i) an RNA sequence can fold into an stem-loop hairpin structure;(ii) predicted secondary structures had MFE less than -15kcal/mol;(iii) minimum base pairings on the stem of the hairpin structure is18;(iv) no multiple loops; (v) the GC contents is between 30~70%.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st><p>The authors declare that they had no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Authors&#8217; contributions</p>
			</st><p>YP,QW and ZW designed the study. ZW collected the datasets from databases and analyzed the data, then prepared the original draft the manuscript. KH and YY guided the SVM analysis and the interpretation of the results. YP and QW reviewed the manuscript. All authors read and approved the final manuscript.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st><p>This work was funded by National Natural Science Foundation of China (grant No. 31272414, 31072003 and 31000992), 2012 Animal Germplasm Resources Protection Project and Agriculture Development through Science and Technology Key Project of Shanghai (grant No. 2010 (1&#8211;3)).</p>
			</sec>
		</ack>
		<refgrp><bibl id="B1"><title><p>The functions of animal microRNAs</p></title><aug><au><snm>Ambros</snm><fnm>V</fnm></au></aug><source>Nature</source><pubdate>2004</pubdate><volume>431</volume><issue>7006</issue><fpage>350</fpage><lpage>355</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature02871</pubid><pubid idtype="pmpid" link="fulltext">15372042</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>MicroRNAs: genomics, biogenesis, mechanism, and function</p></title><aug><au><snm>Bartel</snm><fnm>DP</fnm></au></aug><source>Cell</source><pubdate>2004</pubdate><volume>116</volume><issue>2</issue><fpage>281</fpage><lpage>297</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(04)00045-5</pubid><pubid idtype="pmpid" link="fulltext">14744438</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Predicting human microRNA precursors based on an optimized feature subset generated by GA-SVM</p></title><aug><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Chen</snm><fnm>X</fnm></au><au><snm>Jiang</snm><fnm>W</fnm></au><au><snm>Li</snm><fnm>L</fnm></au><au><snm>Li</snm><fnm>W</fnm></au><au><snm>Yang</snm><fnm>L</fnm></au><au><snm>Liao</snm><fnm>M</fnm></au><au><snm>Lian</snm><fnm>B</fnm></au><au><snm>Lv</snm><fnm>Y</fnm></au><au><snm>Wang</snm><fnm>S</fnm></au><etal/></aug><source>Genomics</source><pubdate>2011</pubdate><volume>98</volume><issue>2</issue><fpage>73</fpage><lpage>78</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ygeno.2011.04.011</pubid><pubid idtype="pmpid" link="fulltext">21586321</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Down-regulation of cyclin E1 expression by microrna-195 accounts for interferon-beta-induced inhibition of hepatic stellate cell proliferation</p></title><aug><au><snm>Sekiya</snm><fnm>Y</fnm></au><au><snm>Ogawa</snm><fnm>T</fnm></au><au><snm>Iizuka</snm><fnm>M</fnm></au><au><snm>Yoshizato</snm><fnm>K</fnm></au><au><snm>Ikeda</snm><fnm>K</fnm></au><au><snm>Kawada</snm><fnm>N</fnm></au></aug><source>J Cell Physiol</source><pubdate>2011</pubdate><volume>226</volume><issue>10</issue><fpage>2535</fpage><lpage>2542</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/jcp.22598</pubid><pubid idtype="pmpid" link="fulltext">21792910</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Insulin promotes vascular smooth muscle cell proliferation via microRNA-208-mediated downregulation of p21</p></title><aug><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Wang</snm><fnm>X</fnm></au><au><snm>Eisner</snm><fnm>GM</fnm></au><au><snm>Asico</snm><fnm>LD</fnm></au><au><snm>Jose</snm><fnm>PA</fnm></au><au><snm>Zeng</snm><fnm>C</fnm></au></aug><source>J Hypertens</source><pubdate>2011</pubdate><volume>29</volume><issue>8</issue><fpage>1560</fpage><lpage>1568</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1097/HJH.0b013e328348ef8e</pubid><pubid idtype="pmpid" link="fulltext">21720271</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>bantam encodes a developmentally regulated microRNA that controls cell proliferation and regulates the proapoptotic gene hid in Drosophila</p></title><aug><au><snm>Brennecke</snm><fnm>J</fnm></au><au><snm>Hipfner</snm><fnm>DR</fnm></au><au><snm>Stark</snm><fnm>A</fnm></au><au><snm>Russell</snm><fnm>RB</fnm></au><au><snm>Cohen</snm><fnm>SM</fnm></au></aug><source>Cell</source><pubdate>2003</pubdate><volume>113</volume><issue>1</issue><fpage>25</fpage><lpage>36</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(03)00231-9</pubid><pubid idtype="pmpid" link="fulltext">12679032</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Peroxisome proliferator-activated receptor alpha regulates a microRNA-mediated signaling cascade responsible for hepatocellular proliferation</p></title><aug><au><snm>Shah</snm><fnm>YM</fnm></au><au><snm>Morimura</snm><fnm>K</fnm></au><au><snm>Yang</snm><fnm>Q</fnm></au><au><snm>Tanabe</snm><fnm>T</fnm></au><au><snm>Takagi</snm><fnm>M</fnm></au><au><snm>Gonzalez</snm><fnm>FJ</fnm></au></aug><source>Mol Cell Biol</source><pubdate>2007</pubdate><volume>27</volume><issue>12</issue><fpage>4238</fpage><lpage>4247</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/MCB.00317-07</pubid><pubid idtype="pmcid">1900062</pubid><pubid idtype="pmpid" link="fulltext">17438130</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>The sequential action of miR156 and miR172 regulates developmental timing in Arabidopsis</p></title><aug><au><snm>Wu</snm><fnm>G</fnm></au><au><snm>Park</snm><fnm>MY</fnm></au><au><snm>Conway</snm><fnm>SR</fnm></au><au><snm>Wang</snm><fnm>JW</fnm></au><au><snm>Weigel</snm><fnm>D</fnm></au><au><snm>Poethig</snm><fnm>RS</fnm></au></aug><source>Cell</source><pubdate>2009</pubdate><volume>138</volume><issue>4</issue><fpage>750</fpage><lpage>759</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2009.06.031</pubid><pubid idtype="pmcid">2732587</pubid><pubid idtype="pmpid" link="fulltext">19703400</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>MicroRNA pathways in flies and worms: growth, death, fat, stress, and timing</p></title><aug><au><snm>Ambros</snm><fnm>V</fnm></au></aug><source>Cell</source><pubdate>2003</pubdate><volume>113</volume><issue>6</issue><fpage>673</fpage><lpage>676</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(03)00428-8</pubid><pubid idtype="pmpid" link="fulltext">12809598</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>ES cells overexpressing microRNA-1 attenuate apoptosis in the injured myocardium</p></title><aug><au><snm>Glass</snm><fnm>C</fnm></au><au><snm>Singla</snm><fnm>DK</fnm></au></aug><source>Mol Cell Biochem</source><pubdate>2011</pubdate><volume>357</volume><issue>1-2</issue><fpage>135</fpage><lpage>141</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s11010-011-0883-5</pubid><pubid idtype="pmcid">3336362</pubid><pubid idtype="pmpid" link="fulltext">21671035</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Antisense inhibition of human miRNAs and indications for an involvement of miRNA in cell growth and apoptosis</p></title><aug><au><snm>Cheng</snm><fnm>AM</fnm></au><au><snm>Byrom</snm><fnm>MW</fnm></au><au><snm>Shelton</snm><fnm>J</fnm></au><au><snm>Ford</snm><fnm>LP</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2005</pubdate><volume>33</volume><issue>4</issue><fpage>1290</fpage><lpage>1297</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gki200</pubid><pubid idtype="pmcid">552951</pubid><pubid idtype="pmpid" link="fulltext">15741182</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Expression of members of the miRNA17-92 cluster during development and in carcinogenesis</p></title><aug><au><snm>Jevnaker</snm><fnm>AM</fnm></au><au><snm>Khuu</snm><fnm>C</fnm></au><au><snm>Kjole</snm><fnm>E</fnm></au><au><snm>Bryne</snm><fnm>M</fnm></au><au><snm>Osmundsen</snm><fnm>H</fnm></au></aug><source>J Cell Physiol</source><pubdate>2011</pubdate><volume>226</volume><issue>9</issue><fpage>2257</fpage><lpage>2266</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/jcp.22562</pubid><pubid idtype="pmpid" link="fulltext">21660949</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>MicroRNAs in biological processes and carcinogenesis</p></title><aug><au><snm>Osada</snm><fnm>H</fnm></au><au><snm>Takahashi</snm><fnm>T</fnm></au></aug><source>Carcinogenesis</source><pubdate>2007</pubdate><volume>28</volume><issue>1</issue><fpage>2</fpage><lpage>12</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/carcin/bgl185</pubid><pubid idtype="pmpid" link="fulltext">17028302</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>MicroRNAs in carcinogenesis</p></title><aug><au><snm>Hagan</snm><fnm>JP</fnm></au><au><snm>Croce</snm><fnm>CM</fnm></au></aug><source>Cytogenet Genome Res</source><pubdate>2007</pubdate><volume>118</volume><issue>2&#8211;4</issue><fpage>252</fpage><lpage>259</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">18000378</pubid></xrefbib></bibl><bibl id="B15"><title><p>MicroRNA functions in animal development and human disease</p></title><aug><au><snm>Alvarez-Garcia</snm><fnm>I</fnm></au><au><snm>Miska</snm><fnm>EA</fnm></au></aug><source>Development</source><pubdate>2005</pubdate><volume>132</volume><issue>21</issue><fpage>4653</fpage><lpage>4662</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1242/dev.02073</pubid><pubid idtype="pmpid" link="fulltext">16224045</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>MicroRNAs in Development and Disease</p></title><aug><au><snm>Sayed</snm><fnm>D</fnm></au><au><snm>Abdellatif</snm><fnm>M</fnm></au></aug><source>Physiol Rev</source><pubdate>2011</pubdate><volume>91</volume><issue>3</issue><fpage>827</fpage><lpage>887</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1152/physrev.00006.2010</pubid><pubid idtype="pmpid" link="fulltext">21742789</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>MicroRNAs in diseases and drug response</p></title><aug><au><snm>Garofalo</snm><fnm>M</fnm></au><au><snm>Condorelli</snm><fnm>G</fnm></au><au><snm>Croce</snm><fnm>CM</fnm></au></aug><source>Curr Opin Pharmacol</source><pubdate>2008</pubdate><volume>8</volume><issue>5</issue><fpage>661</fpage><lpage>667</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.coph.2008.06.005</pubid><pubid idtype="pmpid" link="fulltext">18619557</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14</p></title><aug><au><snm>Lee</snm><fnm>RC</fnm></au><au><snm>Feinbaum</snm><fnm>RL</fnm></au><au><snm>Ambros</snm><fnm>V</fnm></au></aug><source>Cell</source><pubdate>1993</pubdate><volume>75</volume><issue>5</issue><fpage>843</fpage><lpage>854</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0092-8674(93)90529-Y</pubid><pubid idtype="pmpid" link="fulltext">8252621</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>The microRNA Registry</p></title><aug><au><snm>Griffiths-Jones</snm><fnm>S</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2004</pubdate><volume>32</volume><issue>Database issue</issue><fpage>D109</fpage><lpage>111</lpage><xrefbib><pubidlist><pubid idtype="pmcid">308757</pubid><pubid idtype="pmpid" link="fulltext">14681370</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Genomics of microRNA</p></title><aug><au><snm>Kim</snm><fnm>VN</fnm></au><au><snm>Nam</snm><fnm>JW</fnm></au></aug><source>Trends Genet</source><pubdate>2006</pubdate><volume>22</volume><issue>3</issue><fpage>165</fpage><lpage>173</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2006.01.003</pubid><pubid idtype="pmpid" link="fulltext">16446010</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Computational approaches for microRNA studies: a review</p></title><aug><au><snm>Li</snm><fnm>L</fnm></au><au><snm>Xu</snm><fnm>J</fnm></au><au><snm>Yang</snm><fnm>D</fnm></au><au><snm>Tan</snm><fnm>X</fnm></au><au><snm>Wang</snm><fnm>H</fnm></au></aug><source>Mamm Genome</source><pubdate>2010</pubdate><volume>21</volume><issue>1&#8211;2</issue><fpage>1</fpage><lpage>12</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">20012966</pubid></xrefbib></bibl><bibl id="B22"><title><p>Mammalian microRNA prediction through a support vector machine model of sequence and structure</p></title><aug><au><snm>Sheng</snm><fnm>Y</fnm></au><au><snm>Engstrom</snm><fnm>PG</fnm></au><au><snm>Lenhard</snm><fnm>B</fnm></au></aug><source>PLoS One</source><pubdate>2007</pubdate><volume>2</volume><issue>9</issue><fpage>e946</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0000946</pubid><pubid idtype="pmcid">1978525</pubid><pubid idtype="pmpid" link="fulltext">17895987</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Computational identification of microRNAs in peach expressed sequence tags and validation of their precise sequences by miR-RACE</p></title><aug><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Yu</snm><fnm>M</fnm></au><au><snm>Yu</snm><fnm>H</fnm></au><au><snm>Han</snm><fnm>J</fnm></au><au><snm>Song</snm><fnm>C</fnm></au><au><snm>Ma</snm><fnm>R</fnm></au><au><snm>Fang</snm><fnm>J</fnm></au></aug><source>Mol Biol Rep</source><pubdate>2011</pubdate><volume>39</volume><issue>2</issue><fpage>1975</fpage><lpage>1987</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">21667243</pubid></xrefbib></bibl><bibl id="B24"><title><p>Computational identification of microRNAs and their targets from the expressed sequence tags of horsegram (Macrotyloma uniflorum (Lam.) Verdc.)</p></title><aug><au><snm>Bhardwaj</snm><fnm>J</fnm></au><au><snm>Mohammad</snm><fnm>H</fnm></au><au><snm>Yadav</snm><fnm>SK</fnm></au></aug><source>J Struct Funct Genomics</source><pubdate>2010</pubdate><volume>11</volume><issue>4</issue><fpage>233</fpage><lpage>240</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s10969-010-9098-3</pubid><pubid idtype="pmpid" link="fulltext">20978860</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Identification of novel homologous microRNA genes in the rhesus macaque genome</p></title><aug><au><snm>Yue</snm><fnm>J</fnm></au><au><snm>Sheng</snm><fnm>Y</fnm></au><au><snm>Orwig</snm><fnm>KE</fnm></au></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume><fpage>8</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-9-8</pubid><pubid idtype="pmcid">2254598</pubid><pubid idtype="pmpid" link="fulltext">18186931</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features</p></title><aug><au><snm>Jiang</snm><fnm>P</fnm></au><au><snm>Wu</snm><fnm>H</fnm></au><au><snm>Wang</snm><fnm>W</fnm></au><au><snm>Ma</snm><fnm>W</fnm></au><au><snm>Sun</snm><fnm>X</fnm></au><au><snm>Lu</snm><fnm>Z</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2007</pubdate><volume>35</volume><issue>Web Server issue</issue><fpage>W339</fpage><lpage>344</lpage><xrefbib><pubidlist><pubid idtype="pmcid">1933124</pubid><pubid idtype="pmpid" link="fulltext">17553836</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Combining multi-species genomic data for microRNA identification using a Naive Bayes classifier</p></title><aug><au><snm>Yousef</snm><fnm>M</fnm></au><au><snm>Nebozhyn</snm><fnm>M</fnm></au><au><snm>Shatkay</snm><fnm>H</fnm></au><au><snm>Kanterakis</snm><fnm>S</fnm></au><au><snm>Showe</snm><fnm>LC</fnm></au><au><snm>Showe</snm><fnm>MK</fnm></au></aug><source>Bioinformatics</source><pubdate>2006</pubdate><volume>22</volume><issue>11</issue><fpage>1325</fpage><lpage>1334</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btl094</pubid><pubid idtype="pmpid" link="fulltext">16543277</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>HHMMiR: efficient de novo prediction of microRNAs using hierarchical hidden Markov models</p></title><aug><au><snm>Kadri</snm><fnm>S</fnm></au><au><snm>Hinman</snm><fnm>V</fnm></au><au><snm>Benos</snm><fnm>PV</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2009</pubdate><volume>10 Suppl 1</volume><fpage>S35</fpage></bibl><bibl id="B29"><title><p>Predicting microRNA precursors with a generalized Gaussian components based density estimation algorithm</p></title><aug><au><snm>Hsieh</snm><fnm>CH</fnm></au><au><snm>Chang</snm><fnm>DT</fnm></au><au><snm>Hsueh</snm><fnm>CH</fnm></au><au><snm>Wu</snm><fnm>CY</fnm></au><au><snm>Oyang</snm><fnm>YJ</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2010</pubdate><volume>11 Suppl 1</volume><fpage>S52</fpage></bibl><bibl id="B30"><title><p>Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine</p></title><aug><au><snm>Xue</snm><fnm>C</fnm></au><au><snm>Li</snm><fnm>F</fnm></au><au><snm>He</snm><fnm>T</fnm></au><au><snm>Liu</snm><fnm>GP</fnm></au><au><snm>Li</snm><fnm>Y</fnm></au><au><snm>Zhang</snm><fnm>X</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2005</pubdate><volume>6</volume><fpage>310</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-6-310</pubid></xrefbib></bibl><bibl id="B31"><title><p>PMirP: a pre-microRNA prediction method based on structure-sequence hybrid features</p></title><aug><au><snm>Zhao</snm><fnm>D</fnm></au><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Luo</snm><fnm>D</fnm></au><au><snm>Shi</snm><fnm>X</fnm></au><au><snm>Wang</snm><fnm>L</fnm></au><au><snm>Xu</snm><fnm>D</fnm></au><au><snm>Yu</snm><fnm>J</fnm></au><au><snm>Liang</snm><fnm>Y</fnm></au></aug><source>Artif Intell Med</source><pubdate>2010</pubdate><volume>49</volume><issue>2</issue><fpage>127</fpage><lpage>132</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.artmed.2010.03.004</pubid><pubid idtype="pmpid" link="fulltext">20399081</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>microPred: effective classification of pre-miRNAs for human miRNA gene prediction</p></title><aug><au><snm>Batuwita</snm><fnm>R</fnm></au><au><snm>Palade</snm><fnm>V</fnm></au></aug><source>Bioinformatics</source><pubdate>2009</pubdate><volume>25</volume><issue>8</issue><fpage>989</fpage><lpage>995</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btp107</pubid><pubid idtype="pmpid" link="fulltext">19233894</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>MicroRNA prediction with a novel ranking algorithm based on random walks</p></title><aug><au><snm>Xu</snm><fnm>Y</fnm></au><au><snm>Zhou</snm><fnm>X</fnm></au><au><snm>Zhang</snm><fnm>W</fnm></au></aug><source>Bioinformatics</source><pubdate>2008</pubdate><volume>24</volume><issue>13</issue><fpage>i50</fpage><lpage>58</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btn175</pubid><pubid idtype="pmcid">2718653</pubid><pubid idtype="pmpid" link="fulltext">18586744</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Repertoire of porcine microRNAs in adult ovary and testis by deep sequencing</p></title><aug><au><snm>Li</snm><fnm>M</fnm></au><au><snm>Liu</snm><fnm>Y</fnm></au><au><snm>Wang</snm><fnm>T</fnm></au><au><snm>Guan</snm><fnm>J</fnm></au><au><snm>Luo</snm><fnm>Z</fnm></au><au><snm>Chen</snm><fnm>H</fnm></au><au><snm>Wang</snm><fnm>X</fnm></au><au><snm>Chen</snm><fnm>L</fnm></au><au><snm>Ma</snm><fnm>J</fnm></au><au><snm>Mu</snm><fnm>Z</fnm></au><etal/></aug><source>Int J Biol Sci</source><pubdate>2011</pubdate><volume>7</volume><issue>7</issue><fpage>1045</fpage><lpage>1055</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3174389</pubid><pubid idtype="pmpid" link="fulltext">21927574</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Identification of tissue-specific microRNAs from mouse</p></title><aug><au><snm>Lagos-Quintana</snm><fnm>M</fnm></au><au><snm>Rauhut</snm><fnm>R</fnm></au><au><snm>Yalcin</snm><fnm>A</fnm></au><au><snm>Meyer</snm><fnm>J</fnm></au><au><snm>Lendeckel</snm><fnm>W</fnm></au><au><snm>Tuschl</snm><fnm>T</fnm></au></aug><source>Curr Biol</source><pubdate>2002</pubdate><volume>12</volume><issue>9</issue><fpage>735</fpage><lpage>739</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0960-9822(02)00809-6</pubid><pubid idtype="pmpid" link="fulltext">12007417</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Probing microRNAs with microarrays: tissue specificity and functional inference</p></title><aug><au><snm>Babak</snm><fnm>T</fnm></au><au><snm>Zhang</snm><fnm>W</fnm></au><au><snm>Morris</snm><fnm>Q</fnm></au><au><snm>Blencowe</snm><fnm>BJ</fnm></au><au><snm>Hughes</snm><fnm>TR</fnm></au></aug><source>RNA</source><pubdate>2004</pubdate><volume>10</volume><issue>11</issue><fpage>1813</fpage><lpage>1819</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.7119904</pubid><pubid idtype="pmcid">1370668</pubid><pubid idtype="pmpid" link="fulltext">15496526</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>High-resolution experimental and computational profiling of tissue-specific known and novel miRNAs in Arabidopsis</p></title><aug><au><snm>Breakfield</snm><fnm>NW</fnm></au><au><snm>Corcoran</snm><fnm>DL</fnm></au><au><snm>Petricka</snm><fnm>JJ</fnm></au><au><snm>Shen</snm><fnm>J</fnm></au><au><snm>Sae-Seaw</snm><fnm>J</fnm></au><au><snm>Rubio-Somoza</snm><fnm>I</fnm></au><au><snm>Weigel</snm><fnm>D</fnm></au><au><snm>Ohler</snm><fnm>U</fnm></au><au><snm>Benfey</snm><fnm>PN</fnm></au></aug><source>Genome Res</source><pubdate>2011</pubdate><volume>22</volume><issue>1</issue><fpage>163</fpage><lpage>176</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3246203</pubid><pubid idtype="pmpid" link="fulltext">21940835</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Solexa sequencing identification of conserved and novel microRNAs in backfat of Large White and Chinese Meishan pigs</p></title><aug><au><snm>Chen</snm><fnm>C</fnm></au><au><snm>Deng</snm><fnm>B</fnm></au><au><snm>Qiao</snm><fnm>M</fnm></au><au><snm>Zheng</snm><fnm>R</fnm></au><au><snm>Chai</snm><fnm>J</fnm></au><au><snm>Ding</snm><fnm>Y</fnm></au><au><snm>Peng</snm><fnm>J</fnm></au><au><snm>Jiang</snm><fnm>S</fnm></au></aug><source>PLoS One</source><pubdate>2012</pubdate><volume>7</volume><issue>2</issue><fpage>e31426</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0031426</pubid><pubid idtype="pmcid">3280305</pubid><pubid idtype="pmpid" link="fulltext">22355364</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Regulation of alternative splicing by RNA editing</p></title><aug><au><snm>Rueter</snm><fnm>SM</fnm></au><au><snm>Dawson</snm><fnm>TR</fnm></au><au><snm>Emeson</snm><fnm>RB</fnm></au></aug><source>Nature</source><pubdate>1999</pubdate><volume>399</volume><issue>6731</issue><fpage>75</fpage><lpage>80</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/19992</pubid><pubid idtype="pmpid" link="fulltext">10331393</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>RNA editing of a miRNA precursor</p></title><aug><au><snm>Luciano</snm><fnm>DJ</fnm></au><au><snm>Mirsky</snm><fnm>H</fnm></au><au><snm>Vendetti</snm><fnm>NJ</fnm></au><au><snm>Maas</snm><fnm>S</fnm></au></aug><source>RNA</source><pubdate>2004</pubdate><volume>10</volume><issue>8</issue><fpage>1174</fpage><lpage>1177</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.7350304</pubid><pubid idtype="pmcid">1370607</pubid><pubid idtype="pmpid" link="fulltext">15272117</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Evidence that miRNAs are different from other RNAs</p></title><aug><au><snm>Zhang</snm><fnm>BH</fnm></au><au><snm>Pan</snm><fnm>XP</fnm></au><au><snm>Cox</snm><fnm>SB</fnm></au><au><snm>Cobb</snm><fnm>GP</fnm></au><au><snm>Anderson</snm><fnm>TA</fnm></au></aug><source>Cell Mol Life Sci</source><pubdate>2006</pubdate><volume>63</volume><issue>2</issue><fpage>246</fpage><lpage>254</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00018-005-5467-7</pubid><pubid idtype="pmpid" link="fulltext">16395542</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Cross-mapping events in miRNAs reveal potential miRNA-mimics and evolutionary implications</p></title><aug><au><snm>Guo</snm><fnm>L</fnm></au><au><snm>Liang</snm><fnm>T</fnm></au><au><snm>Gu</snm><fnm>W</fnm></au><au><snm>Xu</snm><fnm>Y</fnm></au><au><snm>Bai</snm><fnm>Y</fnm></au><au><snm>Lu</snm><fnm>Z</fnm></au></aug><source>PLoS One</source><pubdate>2011</pubdate><volume>6</volume><issue>5</issue><fpage>e20517</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0020517</pubid><pubid idtype="pmcid">3102724</pubid><pubid idtype="pmpid" link="fulltext">21637827</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Characterization of the Melanoma miRNAome by Deep Sequencing</p></title><aug><au><snm>Stark</snm><fnm>MS</fnm></au><au><snm>Tyagi</snm><fnm>S</fnm></au><au><snm>Nancarrow</snm><fnm>DJ</fnm></au><au><snm>Boyle</snm><fnm>GM</fnm></au><au><snm>Cook</snm><fnm>AL</fnm></au><au><snm>Whiteman</snm><fnm>DC</fnm></au><au><snm>Parsons</snm><fnm>PG</fnm></au><au><snm>Schmidt</snm><fnm>C</fnm></au><au><snm>Sturm</snm><fnm>RA</fnm></au><au><snm>Hayward</snm><fnm>NK</fnm></au></aug><source>PLoS One</source><pubdate>2010</pubdate><volume>5</volume><issue>3</issue><fpage>e9685</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pone.0009685</pubid><pubid idtype="pmcid">2837346</pubid><pubid idtype="pmpid" link="fulltext">20300190</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Ultra-high throughput sequencing-based small RNA discovery and discrete statistical biomarker analysis in a collection of cervical tumours and matched controls</p></title><aug><au><snm>Witten</snm><fnm>D</fnm></au><au><snm>Tibshirani</snm><fnm>R</fnm></au><au><snm>Gu</snm><fnm>SG</fnm></au><au><snm>Fire</snm><fnm>A</fnm></au><au><snm>Lui</snm><fnm>WO</fnm></au></aug><source>BMC Biol</source><pubdate>2010</pubdate><volume>8</volume><fpage>58</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1741-7007-8-58</pubid><pubid idtype="pmcid">2880020</pubid><pubid idtype="pmpid" link="fulltext">20459774</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>RNA secondary structure analysis using the Vienna RNA package</p></title><aug><au><snm>Hofacker</snm><fnm>IL</fnm></au></aug><source>Curr Protoc Bioinformatics</source><pubdate>2009</pubdate><volume>Chapter 12</volume><fpage>Unit12 12</fpage></bibl><bibl id="B46"><title><p>Vienna RNA secondary structure server</p></title><aug><au><snm>Hofacker</snm><fnm>IL</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2003</pubdate><volume>31</volume><issue>13</issue><fpage>3429</fpage><lpage>3431</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkg599</pubid><pubid idtype="pmcid">169005</pubid><pubid idtype="pmpid" link="fulltext">12824340</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>A comparison of RNA folding measures</p></title><aug><au><snm>Freyhult</snm><fnm>E</fnm></au><au><snm>Gardner</snm><fnm>PP</fnm></au><au><snm>Moulton</snm><fnm>V</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2005</pubdate><volume>6</volume><fpage>241</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-6-241</pubid></xrefbib></bibl><bibl id="B48"><title><p>Evidence that microRNA precursors, unlike other non-coding RNAs, have lower folding free energies than random sequences</p></title><aug><au><snm>Bonnet</snm><fnm>E</fnm></au><au><snm>Wuyts</snm><fnm>J</fnm></au><au><snm>Rouze</snm><fnm>P</fnm></au><au><snm>Van de Peer</snm><fnm>Y</fnm></au></aug><source>Bioinformatics</source><pubdate>2004</pubdate><volume>20</volume><issue>17</issue><fpage>2911</fpage><lpage>2917</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bth374</pubid><pubid idtype="pmpid" link="fulltext">15217813</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Utilization of SSCprofiler to predict a new miRNA gene</p></title><aug><au><snm>Oulas</snm><fnm>A</fnm></au><au><snm>Poirazi</snm><fnm>P</fnm></au></aug><source>Methods Mol Biol</source><pubdate>2011</pubdate><volume>676</volume><fpage>243</fpage><lpage>252</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/978-1-60761-863-8_17</pubid><pubid idtype="pmpid" link="fulltext">20931402</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Computational identification of new porcine microRNAs and their targets</p></title><aug><au><snm>Zhou</snm><fnm>B</fnm></au><au><snm>Liu</snm><fnm>HL</fnm></au></aug><source>Anim Sci J</source><pubdate>2010</pubdate><volume>81</volume><issue>3</issue><fpage>290</fpage><lpage>296</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1111/j.1740-0929.2010.00742.x</pubid><pubid idtype="pmpid" link="fulltext">20597884</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><aug><au><snm>Vapnik</snm><fnm>V</fnm></au></aug><source>Statistical Learning Theory</source><publisher>Wiley-Interscience</publisher><pubdate>1998</pubdate></bibl><bibl id="B52"><title><p>Evaluating the Generalization Ability of Support Vector Machines through the Bootstrap</p></title><aug><au><snm>Davide</snm><fnm>A</fnm></au><au><snm>Andrea</snm><fnm>B</fnm></au><au><snm>Sandro</snm><fnm>R</fnm></au></aug><source>Neural Processing Letters</source><pubdate>2000</pubdate><volume>11</volume><fpage>51</fpage><lpage>58</lpage><xrefbib><pubid idtype="doi">10.1023/A:1009636300083</pubid></xrefbib></bibl><bibl id="B53"><title><p>LIBSVM: a library for support vector machines</p></title><aug><au><snm>Chang</snm><fnm>C-C</fnm></au><au><snm>Lin</snm><fnm>C-J</fnm></au></aug><source>ACM Transactions on Intelligent Systems and Technology</source><pubdate>2011</pubdate><volume>2</volume><issue>27</issue><fpage>21</fpage><lpage>27</lpage><note>27</note></bibl><bibl id="B54"><title><p>BLAST+: architecture and applications</p></title><aug><au><snm>Camacho</snm><fnm>C</fnm></au><au><snm>Coulouris</snm><fnm>G</fnm></au><au><snm>Avagyan</snm><fnm>V</fnm></au><au><snm>Ma</snm><fnm>N</fnm></au><au><snm>Papadopoulos</snm><fnm>J</fnm></au><au><snm>Bealer</snm><fnm>K</fnm></au><au><snm>Madden</snm><fnm>TL</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2009</pubdate><volume>10</volume><fpage>421</fpage><xrefbib><pubid idtype="doi">10.1186/1471-2105-10-421</pubid></xrefbib></bibl><bibl id="B55"><title><p>Identification of 188 conserved maize microRNAs and their targets</p></title><aug><au><snm>Zhang</snm><fnm>B</fnm></au><au><snm>Pan</snm><fnm>X</fnm></au><au><snm>Anderson</snm><fnm>TA</fnm></au></aug><source>FEBS Lett</source><pubdate>2006</pubdate><volume>580</volume><issue>15</issue><fpage>3753</fpage><lpage>3762</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.febslet.2006.05.063</pubid><pubid idtype="pmpid" link="fulltext">16780841</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Identification and characteristics of cattle microRNAs by homology searching and small RNA cloning</p></title><aug><au><snm>Long</snm><fnm>JE</fnm></au><au><snm>Chen</snm><fnm>HX</fnm></au></aug><source>Biochem Genet</source><pubdate>2009</pubdate><volume>47</volume><issue>5&#8211;6</issue><fpage>329</fpage><lpage>343</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">19267191</pubid></xrefbib></bibl></refgrp>
	</bm>
</art>