Email updates

Keep up to date with the latest news and content from BMC Evolutionary Biology and BioMed Central.

Open Access Research article

Sequence variation and selection of small RNAs in domesticated rice

Yu Wang1, Dan Shen1, Shiping Bo1, Huan Chen1, Jian Zheng1, Qian-Hao Zhu2, Daguang Cai3, Chris Helliwell2 and Longjiang Fan1*

Author Affiliations

1 Department of Agronomy, Zhejiang University, Hangzhou 310029, China

2 CSIRO Plant Industry, Canberra, ACT 2601, Australia

3 Department of Molekulare Phytopathologie, Christian-Albrechts-Universität zu Kiel, D-24118 Kiel, Germany

For all author emails, please log on.

BMC Evolutionary Biology 2010, 10:119  doi:10.1186/1471-2148-10-119


The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2148/10/119


Received:8 July 2009
Accepted:30 April 2010
Published:30 April 2010

© 2010 Wang et al; licensee BioMed Central Ltd.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Endogenous non-coding small RNAs (21-24 nt) play an important role in post-transcriptional gene regulation in plants. Domestication selection is the most important evolutionary force in shaping crop genomes. The extent of polymorphism at small RNA loci in domesticated rice and whether small RNA loci are targets of domestication selection have not yet been determined.

Results

A polymorphism survey of 94 small RNA loci (88 MIRNAs, four TAS3 loci and two miRNA-like long hairpins) was conducted in domesticated rice, generating 2 Mb of sequence data. Many mutations (substitution or insertion/deletion) were observed at small RNA loci in domesticated rice, e.g. 12 mutation sites were observed in the mature miRNA sequences of 11 MIRNAs (12.5% of the investigated MIRNAs). Several small RNA loci showed significant signals for positive selection and/or potential domestication selection.

Conclusions

Sequence variation at miRNAs and other small RNAs is higher than expected in domesticated rice. Like protein-coding genes, non-coding small RNA loci could be targets of domestication selection and play an important role in rice domestication and improvement.

Background

Endogenous non-coding small RNAs (21-24 nt) play an important role in post-transcriptional gene regulation in plants. In general, small RNAs are grouped into two major classes, microRNAs (miRNAs) and short-interfering RNAs (siRNAs) based on the mechanisms by which they are synthesized and function [1]. miRNAs are generated from a stem-loop primary transcript by the endonuclease Dicer-like 1 (DCL1) and its partners. To date, 414 miRNAs from 151 miRNA families (miRBase, http://microrna.sanger.ac.uk/sequences/ webcite, Release 14.0), including several natural antisense miRNAs (nat-miRNAs) and a mitron, have been identified in rice [2,3]. siRNAs derive from double-stranded RNA precursors that are processed mainly by the endonucleases DCL2, DCL3 and/or DCL4 [4]. Several types of siRNAs, such as trans-acting siRNAs (ta-siRNAs), natural antisense transcript-derived siRNAs (nat-siRNAs), repeat-associated siRNAs (ra-siRNAs) and miRNA-like long hairpin siRNAs, have been identified in rice and other plants [3,5-13].

mRNAs are regulated by several classes of small RNAs through base complementarity leading to cleavage or repression of translation of the mRNA. It has been observed that single nucleotide polymorphisms (SNP) within a miRNA binding site can cause significant phenotypic changes [14]. Genome-wide investigation of nucleotide variation at small RNA loci can determine the prevalence of functional variations in a species; this was first assessed in humans [15]. In plants, a large-scale survey of miRNA polymorphism has been performed in Arabidopsis [16]. This survey showed that mutations in mature miRNA sequences are rare and that the level of polymorphism in miRNAs is lower than in their flanking regions, suggesting purifying selection in the miRNA sequences. Evidence of positive selection on some miRNA loci has been found in Arabidopsis [16,17]. Our previous studies on rice miRNAs and miRNA binding sites suggested that positive selection and nucleotide mutations play an important role in co-evolution of miRNAs and their targets [18,19].

Despite the importance of small RNAs in regulation of development in crops, investigation of polymorphism of miRNAs and other small RNAs in rice and other crops is still in its infancy. The extent of sequence variation at functional regions of small RNAs and their potential contribution to phenotypic evolution during crop domestication are unclear. Unlike Arabidopsis, rice has experienced domestication and subsequent artificial genetic improvement and therefore has been shaped by an additional bottleneck effect. Rice is believed to have been domesticated approximately 10,000 years ago [20,21]. Several protein-coding genes controlling domestication traits have been identified and a notable feature of these domestication genes is that they generally encode transcription factors that regulate other protein-coding genes by direct binding to their DNA [22]. Most conserved miRNAs target transcription factors with an important role in plant development [1], but whether miRNAs and other types of non-coding small RNAs are targets of domestication selection in crops is yet to be determined.

In this study, we carried out an investigation of sequence variation and selection of small RNAs in domesticated rice by two steps (Figure 1). Firstly, we selected 94 small RNA generating loci (including MIRNAs, TAS3 and miRNA-like long hairpin loci) and analyzed their sequence variation and neutrality within a domesticated rice population of 33 accessions, and secondly, we further investigated 20 of the 94 small RNA loci in populations of 54 cultivated rice accessions and 15 wild rice accessions to determine how many of these loci might have experienced domestication selection. Our results suggest a non-neutral evolutionary pattern of small RNAs in the cultivated rice and that several of them are putative targets of selection during rice domestication.

thumbnailFigure 1. A flow chart describing the analysis steps involved in this study. At the first step, 94 small RNA generating loci were selected and their sequence variation and neutrality were estimated using a population of 33 domesticated rice accessions (see Figures 2, 3 and 4; Additional files 3, 4 and 5). At the second step, 20 small RNA loci were selected from the 94 small RNA loci and their sequence diversity and potential for having been under domestication selection (see Table 1) were further investigated using a cultivated population (54 accessions) and a wild population (15 accessions).

Results

Nucleotide divergence and neutrality of small RNAs

A total of 94 small RNA loci, including 88 MIRNAs from 40 miRNA families, four members of the TAS3 family and two miRNA-like long hairpins, were selected for a polymorphism survey of rice small RNAs (Additional file 1). With the exception of the two miRNA-like long hairpins, genomic fragments containing the intact precursor sequences of the small RNAs were amplified and sequenced from a population of 33 domesticated rice accessions (17 indica and 16 japonica cultivars), which were collected from diverse geographic locations (Additional file 2). For the two miRNA-like long hairpins which have large loop regions, the two stem regions were amplified and sequenced. Fragments from each rice accession were aligned to each other for each small RNA locus. The average fragment length of all alignments was 579.0 bp, which covered the precursor sequences of the investigated small RNAs (Additional file 3).

Additional file 1. Small RNA loci investigated in this study. 94 small RNA loci (88 MIRNAs, four TAS3 loci and two miRNA-like long hairpins) were included.

Format: DOC Size: 74KB Download file

This file can be viewed with: Microsoft Word ViewerOpen Data

Additional file 2. Accession numbers and geographic origin of the cultivated and wild rice used in this study. Fifty-four cultivated rice accessions (Oryza sativa, 29 indica and 25 japonica), 15 accessions of the wild ancestor, O. rufipogon, and two accessions of the Africa wild rice relative, O. barthii, were selected.

Format: DOC Size: 88KB Download file

This file can be viewed with: Microsoft Word ViewerOpen Data

Additional file 3. Polymorphism of small RNAs in domesticated rice. Number of accessions sequenced (N), length of the core alignments in which all sequences contain bases without gaps (L), total number of segregating sites (S) and two diversity parameters (θ and π) in the 94 small RNA loci of rice were shown.

Format: XLS Size: 84KB Download file

This file can be viewed with: Microsoft Excel ViewerOpen Data

Of the 88 investigated MIRNA genes, 11 (12.5%) showed SNPs or insertion/deletions (indels) within their mature miRNA sequences in the cultivated rice population (Additional file 4), leading to an average number of SNPs per 1000 sites of 1.52 ± 0.51 (mean ± SE of the mean) or an average pairwise nucleotide diversity (π) of 0.00169 ± 0.00067 (Figure 2A and 2B). As an example, the polymorphic mature sequence of the miR166 family is presented in Figure 3 (further details are shown in Additional file 3 and 4). A SNP was observed at the fourth position from the 5' end of the mature sequences of miR166e. Out of the 32 cultivated indica and japonica accessions investigated, 17 including indica cultivar 93-11 and wild rice had the same miR166e sequence (with G at the fourth position) as in miRBase http://microrna.sanger.ac.uk/sequences/ webcite, while 15 had an A at the fourth position (Figure 3). In all other members of the miR166 family the fourth base is G (Figure 3). These results suggest that the fourth position A could be a recent mutation. It would be interesting to determine whether this mutation has an impact on the interaction between miR166e and its targets because it results in a perfect match with the predicted targets at the fourth position. From our sequencing data for miR166e target genes and the recently released rice SNP data http://www.oryzasnp.org webcite[23], a SNP was found at the tenth position (counting from the 5' end of miR166e) of the miR166e binding site in Os03 g16320, one of the predicted miR166e targets, in 20% of accessions investigated. miRNA-mediated cleavage takes place between positions 10 and 11, a mutation from C to G at the tenth position might abolish miR166e-mediated cleavage of Os03 g16320 in these accessions, but whether this change has a consequence on rice depends on the specific interaction between miR166e and Os03 g16320 because Os03 g16320 could still be regulated by other members of the miR166 family.

Additional file 4. Mutations at the mature miRNA sequences in domesticated rice. Mutations (SNP or indel) at 11 mature miRNA sequences in domesticated rice were listed.

Format: XLS Size: 31KB Download file

This file can be viewed with: Microsoft Excel ViewerOpen Data

thumbnailFigure 2. Diversity of MIRNAs in cultivated rice. A-B: mean numbers of SNPs per site (A) and π (B) in the mature miRNA sequences and their flanking regions. C-D: mean numbers of SNPs per site in the dicot/monocot conserved (C) and rice-specific (D) miRNAs and their flanking regions. E-F: mean numbers of SNPs per site in the mature miRNA sequences and their flanking regions in indica (E) and japonica (F) subpopulations. SE bars are shown.

thumbnailFigure 3. Polymorphism of the mature sequences of the miR166 family and their targets in rice. Base pairing between miR166e and its predicted target Os03 g56920 is shown with short bars representing a Watson-Crick pair and two vertical dots representing a G-U pair. Percentages of rice accessions with different genotypes in cultivated and wild populations are shown on the right side. For further details see Additional file 3 and 4.

In addition to the mature miRNA sequences, many SNPs were also observed in the flanking regions of miRNAs. Compared to the mature miRNA sequences, significantly higher SNP densities or diversities were observed in the precursor miRNA (pre-miRNA) sequences, but the highest polymorphism was found in the upstream or downstream region of pre-miRNAs (Figure 2A and 2B). These results suggest that purifying selection was the predominant evolutionary force acting on miRNA sequences in rice as reported for Arabidopsis [16]. Some miRNAs are conserved in both dicot and monocot plants while others are species-specific. Of the 88 MIRNA genes investigated in this study, 64 are conserved and 17 are rice-specific (Additional file 1). A dramatic difference in nucleotide diversities in the mature miRNAs and their flanking regions was observed between the conserved and rice-specific MIRNA loci (Figure 2C and 2D). The nucleotide diversity of the rice-specific MIRNAs was nearly twice that of the conserved MIRNAs. However, no significant difference in nucleotide diversity could be found in the miRNAs from the two subspecies of cultivated rice although the average number of SNPs per site of the indica subgroup was slightly lower than that of the japonica subgroup in both the mature miRNA and its flanking regions (Figure 2E and 2F).

Polymorphisms were also found in the functional regions of siRNA-generating loci in rice, for example, at phases P5'_5 and P5'_9 of a miRNA-like long hairpin (AK120922), and in ta-siARF and the 3' miR390 binding site of TAS3a2 (Additional file 5).

Additional file 5. Mutations at the functional regions (phases) in TAS3 and miRNA-like long hairpin loci of domesticated rice. Mutations (SNP or indel) at the functional regions (phases) in a TAS3 and a miRNA-like long hairpin loci of domesticated rice were listed.

Format: XLS Size: 85KB Download file

This file can be viewed with: Microsoft Excel ViewerOpen Data

To examine neutrality of small RNAs in rice, Tajima's D [24], a widely used neutrality test, was employed. Tajima's D test measures the frequency distribution of polymorphisms and selection is expected to skew the population frequency of genetic variants relative to the neutral equilibrium model (NE). Under NE, the mean Tajima's D is expected to be zero and a negative value indicates an excess of rare sequence variants relative to NE expectation and a recent positive selection is thus inferred. In contrast to positive selection, balanced selection retains genetic differences and elevates the Tajima's D statistic towards a positive value [25].

For each small RNA locus, Tajima's D was calculated for the miRNA- or siRNA-containing sequence fragment. A slightly skewed distribution of Tajima's D values was observed in domesticated rice (Figure 4), which is similar to that observed in Arabidopsis [16]. Significant probabilities of non-neutral patterns of sequence variation were detected at several loci in one or both subgroups of the cultivated rice (Additional file 3). Significantly negative values were found for several small RNAs, such as miR395a/b and TAS3a2 (Tajima's D value: -2.21 and -2.01) in the indica subgroup, suggesting that these small RNA loci could have experienced positive selection during rice evolution. On the other hand, some small RNA loci possessed an extreme positive Tajima's D value (e.g. 1.94 for TAS3a1), indicating a balanced selection for these loci. The above results suggest that polymorphisms at or linked to the small RNA loci might be a result of selection.

thumbnailFigure 4. Distribution of Tajima's D values of all small RNA loci in cultivated rice.

Potential domestication selection

Indica and japonica rice were domesticated independently and their domestication genes usually show different evolutionary histories of selection between cultivated and wild rice populations [26,27]. To find candidate small RNA loci that experienced domestication selection, 20 loci were selected from the set of 94 small RNA loci for further analysis (Figure 1) using a cultivated rice population including 29 indica and 25 japonica cultivars, a wild population including 15 accessions of O. rufipogon and an outgroup (Africa wild rice, O. barthii) (Additional file 2). Of the 20 loci selected, three (miR395a/b, TAS3a2 and MIR399d) had significant negative Tajima's D values and one (miR390) had extremely low divergence according to the above neutral test results (Additional file 3); the remaining loci were randomly chosen from the small RNA list shown in Additional file 1.

As previously mentioned, the extent of nucleotide diversity of selected genes tends to be reduced. Our analyses indicated that nucleotide diversity (π) of the four neutral genes (Adh1, Waxy, ks1 and RGRC2; [28,29]) reduced less than 2.6 fold in the cultivated rice population compared to their wild progenitors. Of the 20 small RNA loci, eight (MIR164a/d/e, MIR166f, MIR390, MIR399d, MIR440 and AK120922) had a nucleotide diversity reduction greater than 2.6 fold (from 2.9 to 21.2 fold) in the domesticated rice compared to their wild progenitors (Table 1). For example, nucleotide diversity of MIR390 reduced 7.0 and 6.3 fold in the two domesticated subgroups relative to the wild population, and nucleotide diversities of the miRNA-like long hairpin (AK120922) reduced 5.7 and 11.1 fold. This result suggests that these eight small RNA loci probably underwent selection during rice evolution.

Table 1. Sequence diversity, Tajima's D and HKA test results

To further investigate positive selection in these small RNA loci, in addition to the Tajima's D test, another independent test, the Hudson-Kreitman-Aguade (HKA) test [30], was performed. The HKA test is able to measure the degree of nucleotide diversity reduced by positive selection, a feature of the sequence data that cannot be measured by the Tajima's D test. Rejection of the multi-locus HKA test that measures the difference in diversity within species relative to divergence between species is often considered as evidence of a positive selection. The multi-locus HKA test requires an outgroup and reference loci that are believed to have not been affected by selection. For this purpose, African wild rice (O. barthii) that is the nearest phylogenetic node to O. sativa and O. rufipogon [31] was used as the outgroup and the aforementioned four neutral genes were used as the reference loci. Of the 20 small RNA loci, four miRNA genes (MIR164e, MIR390, MIR395a/b and MIR399d) and two siRNA genes (TAS3a2 and AK120922) had signatures of positive selection in indica and/or japonica subgroup according to Tajima's D or HKA test (Table 1). No positive selection was detected by the two tests for other 14 MIRNA loci. In the wild rice population, MIR399d, MIR1862d and the miRNA-like long hairpin (AK120922) presented evidence of positive selection (Tajima's D value of -2.04, -1.88 and -2.05, respectively), suggesting the non-neutral evolution of these three genes in the wild rice population. A gene can be considered as a candidate domestication gene if a significantly positive selection signal is detected in the cultivated population but not in the wild population [27]. Based on this criterion, our results thus suggest that MIR164e, MIR390, MIR395a/b and TAS3a2 are potential candidates of small RNA loci that have experienced direct selection during rice domestication.

Discussion

To date, large-scale studies of nucleotide variation at miRNAs have only been carried out in human [15] and Arabidopsis [16]. Overall, our results in rice support the findings in human and Arabidopsis that purifying selection is one of the main evolutionary forces acting on rice MIRNA genes, maintaining lower levels of sequence divergence in mature miRNAs than in their flanking regions. In Arabidopsis, only a few substitution events were observed in the mature miRNA sequences (2 of the 66 investigated miRNAs) in a population including 24 diverse wild accessions. However, nucleotide mutations in the mature miRNA sequences were found in 11 of the 88 investigated rice miRNAs in this study. We also found polymorphisms in at least seven miRNAs in the wild rice population (data not shown). The higher percentage of SNPs or indels found in rice miRNAs implies that the polymorphism levels of miRNAs and other miRNA-like small RNAs may vary among species.

Our results suggest selection, and probably domestication selection, on some small RNA loci (Table 1 and Additional file 3). Small RNAs regulate genes that control a wide range of traits; the functions of these small RNA regulated genes might provide clues for the traits under selection via small RNA. One interesting result in this study is that both MIR390 and TAS3 are potential candidates of selection. miR390 acts as a guide for processing of TAS3 and biogenesis of ta-siRNAs, including the functional ta-siARF [5,10,11]. ARFs (auxin response factors) are encoded by a large gene family (25 members in rice) and are involved in regulation of a wide range of biological functions in rice [32]. ta-siARF downregulates ARF2, ARF3 and ARF4 [5,12] that affect gynoecium patterning [33], proper timing of vegetative shoot development and establishment of leaf polarity [34]. At least two ARFs have been detected as selected genes in maize [27,35]. It is unclear whether rice ARF genes have experienced selection during evolution. Our results suggest that genes involved in the same regulatory pathway could all be targeted by domestication selection for enhanced growth response and productivity. miR164e targets NAC domain transcription factors that play an important role in development, stress-tolerance and disease-resistance in rice [36,37]. miR395 targets ATP-sulfurylases that are involved in sulfate assimilation. Expression of miR395 is greatly increased under sulfate starvation conditions [38]. Sulfur is an essential macronutrient required by plants. miR395 regulated sulfate metabolism might have been selected for high sulfate usage efficiency. Our results suggest that miRNAs and other small RNA loci should not be ignored in the endeavor to identify the molecular basis of domestication and improvement of crops.

Note that there are three limitations in our effort to identify candidate small RNA loci experienced domestication selection. First, a bottleneck effect or demographic effect can also cause significant reduction of genetic diversity and it could not be excluded in our analysis. Second, the positive selection detected in this study could have arisen by selective sweep rather than direct selection on the small RNA loci themselves. Third, although we identified several mutations in the functional regions (mature miRNA, miRNA precursor or promoter) of the candidate small RNA genes in domesticated rice, such as in the mature miR395a/b sequences and in tasi-ARF and the miR390 binding sites of TAS3 (Additional file 4 and 5), and found distinct expression levels of miR164e in the cultivated and wild rice (data not shown), it does not necessarily indicate a direct consequence of the domestication event. Further experiments are needed to reveal the consequence of positive selection, such as phenotypic changes.

Our sequencing results revealed sequence polymorphism in miR166e and its target gene Os03 g16320 (Figure 3). By checking the recently released rice SNP dataset (http://www.oryzasnp.org webcite[23]), we found a similar situation with miR443 and the target gene Os01 g49940 (a C → T mutation at the fifth position from the 3' end of miR443 and a G → A mutation at the eighth position from the 3' end of the miR443 binding site in Os01 g49940). We also observed SNPs in the miRNA binding sites of targets of miR164d/e and miR530, but no SNP was observed in the mature sequences of the two miRNAs. These results suggest that evolution of miRNAs and their targets could be independent, although a co-evolution relationship between miRNAs and their target sequences does happen in cultivated rice as demonstrated previously [18].

At least one member each of four miRNA families (miR166, miR167, miR171 and miR395) were found to have experienced positive selection based on Tajima's D test in the natural Arabidopsis populations [16,17]. These four miRNA families are conserved in dicot and monocot plants. In our study, significant signals of positive selection were also detected by Tajima's D test in at least one member of the miR166, miR167 and miR395 family in indica or japonica subgroup (Additional file 3), implying a potential conservation of adaptive evolution between rice and Arabidopsis for these miRNA families. A common feature of these three families is that they have relatively large number of family members (e.g. 23 and 6 miR395 members in rice and Arabidopsis, respectively). These homologous genes might have experienced rapid functional divergence after gene expansion. It has been suggested that positive selection acts on gene copy number variations [39]. Positive selection could promote adaptive evolution but whether plants enhance their fitness in the changing environment by adding copy number of miRNAs requires further investigation.

Conclusions

Our survey of polymorphisms in 94 small RNA loci in rice, the first such effort in crops, revealed a higher level of sequence variations in miRNAs in domesticated rice than in Arabidopsis. In agreement with previous studies, our investigation suggests that purifying selection dominated the evolution of small RNAs in rice. A neutrality test revealed non-neutral evolution of small RNA loci in which both positive selection and balanced selection are involved. Our results suggest that domestication selection on small RNA loci could play an important role in rice domestication and/or improvement.

Methods

Plant Materials

Fifty-four cultivated rice accessions (Oryza sativa, 29 indica and 25 japonica) from a wide range of geographical locations and 15 accessions of the wild ancestor, O. rufipogon, were selected for investigation of nucleotide diversity in small RNA loci. Two accessions of the Africa wild rice relative, O. barthii, were used as an outgroup species in a neutrality test. Details of the 71 rice accessions are in Additional file 2.

Small RNAs selected for analyses of nucleotide diversity and domestication selection

The majority of the MIRNAs selected are conserved in other species to ensure they are bona fide MIRNAs. For all the rice-specific MIRNAs, authenticity is supported by experimental evidence and BLAST analysis has been used to remove those potentially derived from repeat regions of the rice genome. The siRNAs used in this study have been experimentally identified and confirmed by our previous study [3]

PCR and DNA Sequencing

Primers were designed based on the genomic sequence of japonica cultivar Nipponbare of the Rice Genome Annotation Project (http://rice.plantbiology.msu.edu/ webcite; [40,41] using the Primer3 program [42]http://fokker.wi.mit.edu/primer3/input.htm webcite and were compared to the rice genome sequence to ensure their specificity. Primer pairs were designed to amplify products with a length of 600 - 1300 bp, including the pre-miRNA and its flanking regions (for MIRNA loci). Details of the primers and their products were provided in Additional file 6. Polymerase chain reaction (PCR) amplification was carried out in a DNAEngine Peltier Thermal Cycler (BIO-RAD) in a total volume of 50 μl. The reaction mixture contained 10-100 ng of template DNA, 2 μl of each primer (10-20 pmol), 0.6 μl of Taq DNA Polymerase (5 U/μl), 1 μl of dNTPs (10 mM), 3 μl of MgCl2 (25 mM), 5 μl of 10× PCR buffer (Sangon, China). PCR was performed using the following conditions: 5 min at 95°C followed by 35 cycles of 30 s at 95°C, 30 s at 54°C, and 90 s at 72°C, and a final extension at 72°C for 10 minutes. When this program failed to amplify the expected products, specific conditions were experimentally determined. For O. sativa cultivars, purified PCR products were directly sequenced using the forward or reverse primer. For wild rice, in which either homozygous or heterozygous individuals might exist, PCR fragments were cloned into pGEM T-Easy vector (PMD19-T, Takara) and sequenced using the forward or reverse primer. The sequencing quality files were checked manually to avoid the sequencing errors. Sequences with poor quality were re-sequenced. All sequences have been deposited into GenBank under the accession numbers GQ418390-GQ420345 and HM138917-HM140183. In total, approximately 2.0 Mb of sequences were generated in this study.

Additional file 6. Primer pairs for amplification of small RNA loci. Ninety-six primer pairs used in this study were provided.

Format: DOC Size: 104KB Download file

This file can be viewed with: Microsoft Word ViewerOpen Data

Statistical Analysis

CLUSTALW version 1.82-UNIX was used for multiple sequence alignments [43] and segregating sites were identified manually using GeneDoc [44]. Two summary statistics parameters (θ and π) for nucleotide diversity and Tajima's D statistic [24] were calculated with a perl script. θ and π estimate the population mutation rate per locus based on the number of segregating sites and the mean value of pairwise divergence per locus, respectively [45,46]. Multi-locus HKA tests were performed using the HKA program available at http://lifesci.rutgers.edu/hey/home webcite. The entire segments that contain the small RNAs were amplified by the primers shown in Additional file 6 and used for a neutrality test. For MIRNA loci, nucleotide diversities were separately estimated for the mature miRNAs, the pre-miRNAs, and the upstream and downstream regions of the mature miRNA (the miR395 family that is located at four clusters in which most members are less than 100 bp apart from each other was not included in this analysis). Sequences of the mature miRNAs and the pre-miRNAs were retrieved from miRBase http://microrna.sanger.ac.uk/sequences/ webcite. Conservation of miRNAs in dicots and monocots is based on Sunkar and Jagadeeswaran [47]. Targets of miRNA were predicted using Patscan [48] by searching the full length cDNA sequence dataset (release 5) of the Rice Genome Annotation http://rice.plantbiology.msu.edu webcite with a penalty score up to four.

Abbreviations

miRNA: microRNA; siRNA: short inferring RNA; ta-siRNA: trans-acting siRNA; TAS3: trans-acting siRNA locus 3; HKA: the Hudson-Kreitman-Aguade test; indel: insertion and deletion; SNP: single nucleotide polymorphism.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

LF designed the experiments. DS, HC and JZ performed the experiments. YW and SB analyzed the data. LF, QZ, CH, YW and DC wrote the paper. All authors read and approved the final manuscript.

Acknowledgements

This work was supported by the National Basic Research Program of China (2006CB101700/2008CB117002), National High Technology Research and Development Program of China (2006AA10A102) and the Program of Scientific Methodology by Minister of Science and Technology (2008IM020800).

References

  1. Bartel DP: MicroRNAs: genomics, biogenesis, mechanism, and function.

    Cell 2004, 116(2):281-297. PubMed Abstract | Publisher Full Text OpenURL

  2. Lu C, Jeong DH, Kulkarni K, Pillay M, Nobuta K, German R, Thatcher SR, Maher C, Zhang L, Ware D, et al.: Genome-wide analysis for discovery of rice microRNAs reveals natural antisense microRNAs (nat-miRNAs).

    Proc Natl Acad Sci USA 2008, 105(12):4951-4956. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  3. Zhu QH, Spriggs A, Matthew L, Fan L, Kennedy G, Gubler F, Helliwell C: A diverse set of microRNAs and microRNA-like small RNAs in developing rice grains.

    Genome Res 2008, 18(9):1456-1465. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  4. Vazquez F: Arabidopsis endogenous small RNAs: highways and byways.

    Trends Plant Sci 2006, 11(9):460-468. PubMed Abstract | Publisher Full Text OpenURL

  5. Allen E, Xie Z, Gustafson AM, Carrington JC: microRNA-directed phasing during trans-acting siRNA biogenesis in plants.

    Cell 2005, 121(2):207-221. PubMed Abstract | Publisher Full Text OpenURL

  6. Axtell MJ, Jan C, Rajagopalan R, Bartel DP: A two-hit trigger for siRNA biogenesis in plants.

    Cell 2006, 127(3):565-577. PubMed Abstract | Publisher Full Text OpenURL

  7. Borsani O, Zhu J, Verslues PE, Sunkar R, Zhu JK: Endogenous siRNAs derived from a pair of natural cis-antisense transcripts regulate salt tolerance in Arabidopsis.

    Cell 2005, 123(7):1279-1291. PubMed Abstract | Publisher Full Text OpenURL

  8. Heisel SE, Zhang Y, Allen E, Guo L, Reynolds TL, Yang X, Kovalic D, Roberts JK: Characterization of unique small RNA populations from rice grain.

    PLoS One 2008, 3(8):e2871. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  9. Liu B, Chen Z, Song X, Liu C, Cui X, Zhao X, Fang J, Xu W, Zhang H, Wang X, et al.: Oryza sativa dicer-like4 reveals a key role for small interfering RNA silencing in plant development.

    Plant Cell 2007, 19(9):2705-2718. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  10. Peragine A, Yoshikawa M, Wu G, Albrecht HL, Poethig RS: SGS3 and SGS2/SDE1/RDR6 are required for juvenile development and the production of trans-acting siRNAs in Arabidopsis.

    Genes Dev 2004, 18(9):2368-2379. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  11. Vazquez F, Vaucheret H, Rajagopalan R, Lepers C, Gasciolli V, Mallory AC, Hilbert JL, Bartel DP, Crete P: Endogenous trans-acting siRNAs regulate the accumulation of Arabidopsis mRNAs.

    Mol Cell 2004, 16(1):69-79. PubMed Abstract | Publisher Full Text OpenURL

  12. Williams L, Carles CC, Osmont KS, Fletcher JC: A database analysis method identifies an endogenous trans-acting short-interfering RNA that targets the Arabidopsis ARF2, ARF3, and ARF4 genes.

    Proc Natl Acad Sci USA 2005, 102(27):9703-9708. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  13. Zhou X, Sunkar R, Jin H, Zhu JK, Zhang W: Genome-wide identification and analysis of small RNAs originated from natural antisense transcripts in Oryza sativa.

    Genome Res 2009, 19(1):70-78. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  14. Clop A, Marcq F, Takeda H, Pirottin D, Tordoir X, Bibe B, Bouix J, Caiment F, Elsen JM, Eychenne F, et al.: A mutation creating a potential illegitimate microRNA target site in the myostatin gene affects muscularity in sheep.

    Nat Genet 2006, 38(7):813-818. PubMed Abstract | Publisher Full Text OpenURL

  15. Saunders MA, Liang H, Li WH: Human polymorphism at microRNAs and microRNA target sites.

    Proc Natl Acad Sci USA 2007, 104(9):3300-3305. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  16. Ehrenreich IM, Purugganan MD: Sequence variation of MicroRNAs and their binding sites in Arabidopsis.

    Plant Physiol 2008, 146(4):1974-1982. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  17. de Meaux J, Hu JY, Tartler U, Goebel U: Structurally different alleles of the ath-MIR824 microRNA precursor are maintained at high frequency in Arabidopsis thaliana.

    Proc Natl Acad Sci USA 2008, 105(26):8994-8999. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  18. Guo X, Gui Y, Wang Y, Zhu QH, Helliwell C, Fan L: Selection and mutation on microRNA target sequences during rice evolution.

    BMC Genomics 2008, 9:454. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  19. Wang S, Zhu QH, Guo X, Gui Y, Bao J, Helliwell C, Fan L: Molecular evolution and selection of a gene encoding two tandem microRNAs in rice.

    FEBS Lett 2007, 581(24):4789-4793. PubMed Abstract | Publisher Full Text OpenURL

  20. Kovach MJ, Sweeney MT, McCouch SR: New insights into the history of rice domestication.

    Trends Genet 2007, 23(1):578-587. PubMed Abstract | Publisher Full Text OpenURL

  21. Vaughan DA, Lu B, Tomooka N: The evolving story of rice evolution.

    Plant science 2008, 174:394-408. OpenURL

  22. Doebley JF, Gaut BS, Smith BD: The molecular genetics of crop domestication.

    Cell 2006, 127(7):1309-1321. PubMed Abstract | Publisher Full Text OpenURL

  23. McNally KL, Childs KL, Bohnert R, Davidson RM, Zhao K, Ulat VJ, Zeller G, Clark RM, Hoen DR, Bureau TE, et al.: Genomewide SNP variation reveals relationships among landraces and modern varieties of rice.

    P Natl Acad Sci USA 2009, 106(30):12273-12278. Publisher Full Text OpenURL

  24. Tajima F: Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

    Genetics 1989, 123(3):585-595. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  25. Wright SI, Gaut BS: Molecular population genetics and the search for adaptive evolution in plants.

    Mol Biol Evol 2005, 22(3):506-519. PubMed Abstract | Publisher Full Text OpenURL

  26. Londo JP, Chiang YC, Hung KH, Chiang TY, Schaal BA: Phylogeography of Asian wild rice, Oryza rufipogon, reveals multiple independent domestications of cultivated rice, Oryza sativa.

    Proc Natl Acad Sci USA 2006, 103(25):9578-9583. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  27. Yamasaki M, Tenaillon MI, Bi IV, Schroeder SG, Sanchez-Villeda H, Doebley JF, Gaut BS, McMullen MD: A large-scale screen for artificial selection in maize identifies candidate agronomic loci for domestication and crop improvement.

    Plant Cell 2005, 17(1):2859-2872. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  28. Olsen KM, Purugganan MD: Molecular evidence on the origin and evolution of glutinous rice.

    Genetics 2002, 162(2):941-950. PubMed Abstract | PubMed Central Full Text OpenURL

  29. Zhu Q, Zheng X, Luo J, Gaut BS, Ge S: Multilocus analysis of nucleotide variation of Oryza sativa and its wild relatives: severe bottleneck during domestication of rice.

    Mol Biol Evol 2007, 24(3):875-888. PubMed Abstract | Publisher Full Text OpenURL

  30. Hudson RR, Kreitman M, Aguade M: A test of neutral molecular evolution based on nucleotide data.

    Genetics 1987, 116(1):153-159. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  31. Zou XH, Zhang FM, Zhang JG, Zang LL, Tang L, Wang J, Sang T, Ge S: Analysis of 142 genes resolves the rapid diversification of the rice genus.

    Genome Biol 2008, 9(3):R49. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  32. Wang D, Pei K, Fu Y, Sun Z, Li S, Liu H, Tang K, Han B, Tao Y: Genome-wide analysis of the auxin response factors (ARF) gene family in rice (Oryza sativa).

    Gene 2007, 394(1-2):13-24. PubMed Abstract | Publisher Full Text OpenURL

  33. Nemhauser JL, Feldman LJ, Zambryski PC: Auxin and ETTIN in Arabidopsis gynoecium morphogenesis.

    Development 2000, 127(8):3877-3888. PubMed Abstract | Publisher Full Text OpenURL

  34. Hunter C, Willmann MR, Wu G, Yoshikawa M, de la Luz Gutierrez-Nava M, Poethig SR: Trans-acting siRNA-mediated repression of ETTIN and ARF4 regulates heteroblasty in Arabidopsis.

    Development 2006, 133(5):2973-2981. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  35. Wright SI, Bi IV, Schroeder SG, Yamasaki M, Doebley JF, McMullen MD, Gaut BS: The effects of artificial selection on the maize genome.

    Science 2005, 308(5726):1310-1314. PubMed Abstract | Publisher Full Text OpenURL

  36. Lin RM, Zhao WS, Meng XB, Wang M, Peng YL: Rice gene OsNAC19 encodes a novel NAC-domain transcription factor and responds to infection by Magnaporthe grisea.

    Plant science 2007, 172(1):120-130. Publisher Full Text OpenURL

  37. Zheng X, Chen B, Lu G, Han B: Overexpression of a NAC transcription factor enhances rice drought and salt tolerance.

    Biochem Biophys Res Commun 2009, 379(4):985-989. PubMed Abstract | Publisher Full Text OpenURL

  38. Jones-Rhoades MW, Bartel DP: Computational identification of plant microRNAs and their targets, including a stress-induced miRNA.

    Mol Cell 2004, 14(6):787-799. PubMed Abstract | Publisher Full Text OpenURL

  39. Kelley JL, Swanson WJ: Positive selection in the human genome: from genome scans to biological significance.

    Annu Rev Genomics Hum Genet 2008, 9:143-160. PubMed Abstract | Publisher Full Text OpenURL

  40. Ouyang S, Zhu W, Hamilton J, Lin H, Campbell M, Childs K, Thibaud-Nissen F, Malek RL, Lee Y, Zheng L, et al.: The TIGR Rice Genome Annotation Resource: improvements and new features.

    Nucleic Acids Res 2007, (35 Database):D883-887. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  41. Yuan Q, Ouyang S, Wang A, Zhu W, Maiti R, Lin H, Hamilton J, Haas B, Sultana R, Cheung F, et al.: The institute for genomic research Osa1 rice genome annotation database.

    Plant Physiol 2005, 138(1):18-26. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  42. Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers.

    Methods Mol Biol 2000, 132:365-386. PubMed Abstract OpenURL

  43. Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

    Nucleic Acids Res 1994, 22(22):4673-4680. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  44. Nicholas KB, Nicholas JHB: GeneDoc: a tool for editing and annotating multiple sequence alignments. [http://www.psc.edu/biomed/genedoc] webcite

    Distributed by the authors 1997. OpenURL

  45. Tajima F: Evolutionary relationship of DNA sequences in finite populations.

    Genetics 1983, 105(2):437-460. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  46. Watterson GA: On the number of segregating sites in genetical models without recombination.

    Theor Popul Biol 1975, 7(2):256-276. PubMed Abstract | Publisher Full Text OpenURL

  47. Sunkar R, Jagadeeswaran G: In silico identification of conserved microRNAs in large number of diverse plant species.

    BMC Plant Biol 2008, 8:37. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  48. Dsouza M, Larsen N, Overbeek R: Searching for patterns in genomic data.

    Trends Genet 1997, 13(2):497-498. PubMed Abstract | Publisher Full Text OpenURL