Original Paper

Pdf file on Synergy

omments

Acta Biochim Biophys Sin 2007, 39: 829�834

doi:10.1111/j.1745-7270.2007.00346.x

Cloning of novel repeat-associated small RNAs derived from hairpin precursors in Oryza sativa

 

Chengguo Yao, Botao Zhao, Wei Li, Yang Li, Wenming Qin, Bing Huang, and Youxin Jin*

 

State Key Laboratory of Molecular Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences; Graduate School of Chinese Academy of Sciences, Shanghai 200031, China

 

Received: April 4, 2007�������

Accepted: July 1, 2007

This work was supported by the National Natural Science Foundation� of China (No. 30430210), the National Key Basic Research and Development� Program (No. 2005CB724602) and Chinese Academy of Science (KSCXI-YW-R-64)

*Corresponding author: Tel, 86-21-54921222; Fax, 86-21-54921011; E-mail, [email protected]

 

Abstract������� Plant small non-coding RNAs including microRNAs (miRNAs), small interfering RNAs (siRNAs) and trans-acting siRNAs, play important roles in modulating gene expression in cells. Here we isolated 21 novel endogenous small RNA molecules, ranging from 18 to 24 nucleotides, in Oryza sativa that can be mapped to 111 hairpin precursors. Further analysis indicated that most of these hairpin sequences originated from putative miniature inverted-repeat transposable elements, a major type of DNA transposon. Considering that miRNA is characteristic of hairpin-like precursor and plant endogenous siRNAs are often located at transposon regions, we hypothesized that our cloned small RNAs might represent the intermediate product in the evolutionary process between siRNAs and miRNAs. Northern blot analysis indicated that five of them were much more abundantly expressed in flower compared to other tissues, implying their potential function in inflorescence. In conclusion, our results enrich rice small RNA data and provide a meaningful perspective for small RNA annotation in plants.

 

Keywords������� small RNA; microRNA; small interfering RNA; hairpin; Oryza sativa

 

MicroRNAs (miRNAs), small interfering RNAs (siRNAs), and trans-acting siRNAs have emerged in recent� years as small but mighty gene attenuators in plants [1]. As small non-coding RNAs, they are almost indistinguishable� in chemical composition and length, and their classification mainly depends on their biogenesis and mode of action [2]. miRNA is processed to a ~22 nucleotide� (nt) mature sequence from an incompletely base-paired hairpin-like RNA transcript and then acts in trans on target messenger RNA [3]. siRNAs are primarily processed from long double-stranded RNAs, but trans-acting siRNAs act in trans just like miRNAs, whereas other siRNAs mainly target the transcripts� that originate from siRNAs themselves [4].

Given their relatively detailed characterization, a number� of open questions remain concerning the differentiation of these small RNAs. For example, siRNAs are well known to be related to transposable elements (TEs), as opposed to them, miRNAs are thought to derive from loci different from other genes or TEs [5]. However, several examples of miRNA genes have been identified to be located at TEs in animals [6], and recent bioinformatics analysis shows that 55 experimentally characterized human miRNA genes are derived from TEs, and these TE-derived miRNAs have the potential to regulate thousands of genes [7]. Miniature inverted-repeat transposable elements (MITEs) are notable types of TE associated with miRNA, as they have palindromic� structures with terminal inverted repeats (TIRs) that flank short internal regions. Their expression as RNA results in the formation of the kinds of hairpins seen for pre-miRNAs [8]. Indeed, MITEs have previously been shown to contribute miRNA genes in human and Arabidopsis genomes [8,9].

The rice genome contains approximately 90,000 MITEs, constituting approximately 26% of the genome sequence and highest-copy-number TEs in rice [10]. Previous sequence� analyses of cloned rice small RNAs ignored the correlation between hairpin-derived small RNA and TEs [11-13]. We constructed several rice small RNA libraries from typical organs at different development stages, and were able to obtain 3003 sequences. Subsequent analysis allowed us to pick out 21 novel small RNA sequences mapped to 111 hairpin precursors that are encoded by putative transposons. These small RNAs share characteristics� with both miRNA and siRNA, and might represent the evolutionary link between both.

 

 

Materials and methods

 

Cloning of small RNA sequences from rice

 

Whole rice plants (Oryza sativa L. ssp. japonica) were grown under natural conditions. Different plant tissues at various development stages (root, leaf, flower, and stem) were collected, washed with double distilled H2O, and frozen� in liquid nitrogen. Total RNAs from O. sativa were prepared using the Trizol method (Invitrogen, Carlsbad, USA) in which the isopropanol precipitation was replaced by ethanol precipitation. In brief, small RNAs from 18 to 28 nt were size-fractionated, purified, and ligated sequentially to the 5' DNA adapter (ACCGAATTC�AC�AG�TCA-GACC, EcoRI site) and 3' adapter (GCAGATCGTCAGA-ATTCCAG, EcoRI site) with T4 RNA ligase from New England Biolabs (Beverly, USA). The 3' adapter was blocked by ddA with terminal transferase (NEB) at its 3' terminus and phosphorylated by T4 polynucleotide kinase (NEB) at the 5' terminus. The ligated RNA was reverse transcribed into cDNA by the Access Quick reverse transcription�-polymerase chain reaction (RT-PCR) system (Promega, Madison, USA) with the 5' adapter sequence and another primer complementary to the 3' adapter. The RT-PCR product was amplified by PCR with the same primers. After purification, the PCR product was digested with EcoRI (NEB) and concatemerized with T4 DNA ligase� (NEB) followed by purification with the kit (Bioasia, Shanghai, China). PCR was carried out with the previous primers and the DNA of approximately 800 bp was cloned into pGEM-T vector (Promega) for sequencing.

To avoid losing the cDNAs containing the EcoRI site, the adaptors with a SalI site but not an EcoRI site were also used to generate some cDNA libraries.

 

Sequence analysis and prediction of fold-back structures

 

Using BLASTN on the National Center for Biotechnology� Information (NCBI) website (http:www.ncbi.nlm.nih.gov/blast), cloned sequences shorter than 16 nt and possible ribosomal RNA, messenger RNA, transfer RNA, and small nucleolar RNA fragments were discarded. This step allowed� the removal of most non-siRNA and non-miRNA species and identification of cloned known miRNAs that could be confirmed in the miRNA registry (http://microrna.sanger.ac.uk/sequences/). After redundancy analysis and exclusion of previously reported rice small RNAs that were also cloned through construction of libraries, the remaining sequences were compared with the latest version of the rice genome at the Rice Genome Database (RGP; http://riceblast.dna.affrc.go.jp/). The surrounding sequences of 200 nt in both orientations were extracted and, together with the perfectly matched sequences in the RGP, were submitted for RNA folding using the Mfold program [14]. Folding results were inspected, and fold-back structures with small RNA in the stem were considered hairpin precursors.

Next, we ran BLAST search against the TIGR rice repeat� database (http://tigrblast.tigr.org/euk-blast/index.cgi?project=osa1) with the hairpin precursors. Sequences with at least 80% similarity, with one hit including the mature region, were regarded as candidates for rice repeat genome data [7], and some of them were perfect matches. Genomic annotation was examined using the Rice Genome Automated� Annotation System (http://ricegaas.dna.affrc.go.jp/).

 

Northern blot analysis of small RNAs

 

Approximately 10 mg total RNA was isolated from different� rice samples (embryo, seedling, leaf, stem, flower, and fruit), loaded and resolved on the denaturing 15% polyacrylamide gel, and transferred to a Hybond N+ nylon membrane (Amersham, Piscataway, USA). The membrane was ultraviolet cross-linked. DNA probes complementing miRNA sequences were end-labeled with [g-32P]ATP (3000 Ci/mM; Amersham) using T4 polynucleotide kinase (MBI, Vilnius, Lithuania). Unincorporated [g-32P]ATP was removed� using a Sephadex G-25 column (Pharmacia, Uppsala, Sweden). Methylene blue staining of membranes prior to hybridization was used to detect ribosomal RNA. The membrane was pre-hybridized and hybridized using ExpressHyb Hybridization Solution (Clontech, Palo Alto, USA) then washed according to the user manual. The membrane was briefly air-dried then exposed to a PhosphorImager (Amersham).

 

 

Results

 

Novel small RNA gene families identified in rice show features of both miRNA and siRNA

 

Construction of a small RNA library is a practical and effective way to study small RNAs. In fact, many rice miRNAs, endogenous siRNAs and other uncharacterized tiny non-coding RNAs were identified through this starting point. We constructed six small RNA libraries ranging from embryo to flower in rice plant (O. sativa L. ssp. japonica). Small RNAs of 18-26 nt in length were isolated by size fractionation and ligated to 5' and 3' adapters, then cloned and sequenced. A total of ~460 clones comprising 3003 small cDNA insertion sequences were collected. After removal of sequences shorter than 16 nt, we blasted the remained sequences against the rice genome in the NCBI database to discard the query sequences bearing no hits, and those that could be degradation products of ribosomal RNA, transfer RNA, small nucleolar RNA, or small nuclear RNA that constituted approximately half of the overall sequences. The BLAST results in the NCBI database and miRNA registry (http://www.sanger.ac.uk/Software/Rfam/mirna/search.shtml) also showed that we had cloned many known miRNAs and endogenous siRNAs. A total of 1416 small RNA sequences were matched to the rice genome (japonica genome database at RGP). One hundred and eleven loci corresponding to 21 small RNAs were predicted to hold hairpin-like fold-backs (Table 1), characteristic of miRNAs. However, sequences blasted against the rice repeat database indicated that they all showed high sequence similarity with one or more hits. Most of them are putative MITEs, and others are putative unclassified transposons or putative retrotransposons (data not shown), which are obviously features of siRNA-generating sequences.

 

Genomic annotation of new rice small RNAs

 

Non-coding small RNAs are usually located in the genomic segments that are distinct from protein-coding regions. Examination results of the corresponding loci with the Rice Genome Automated Annotation System (http://ricegaas.dna.affrc.go.jp/) conformed to the traditional knowledge, in that almost all of the loci were in the intergenic or intron, non-coding expressed sequence tag regions, except for a few in the exon of a hypothetical protein and cDNA region, which could be pseudo genes. The proteins specified by the intron region are nearly all hypothetical or putative, indicating that they might also be pseudo genes. For those small RNAs encoded by multiple loci, we could not assign their exact origin unambiguously, and some of them might not be genuine genes, as referred to above.��

 

Cloning of putative influorescence-associated small RNAs

 

To validate the cloning results, we carried out Northern blot analysis to examine the existence of these small RNAs using total RNA samples isolated from diverse organs from different development stages. Interestingly, five of them were exclusively abundant in flower compared to other tissues (Fig. 1), suggesting that they might specially function in the process of inflorescence. However, their targets and the mechanisms through which they play the role are yet to be discovered. As a control, miR159c was also examined, and showed ubiquitous expression in the samples listed in our experimental conditions. Some small RNAs were not detected by Northern blot analysis with our available rice tissues. They might be expressed at very low levels or confined to some specific tissues or cell types, therefore the amount of these miRNAs was inappreciable in the limited total RNA samples.

 

 

Discussion

 

As more and more detailed information about plant small RNAs, including miRNAs, endogenous siRNAs, and trans-acting siRNAs are being proffered [15-17], the differences between them seem to be much clearer. However, some aspects still remain controversial, such as their conservation, origin, and length [18], which convinced us of the view that the diversity of plant small RNAs is more complex than previously expected. Recent cloning and analysis of rice endogenous small non-coding RNAs corroborate this point [13]. To date, most reports have emphasized their distinction and classification, but ignored the correlation and possible evolutionary intermediates. Here we present data to provide a link between plant siRNAs and miRNAs. A full-length DNA-type transposable element contains an open reading frame flanking two TIRs. When it becomes a non-autonomous MITE, it is deprived of an open reading frame, and the hairpin structure would be formed by base-pair interaction of MITE TIRs. Previously, an inverted duplication model for miRNA gene evolution in plants was proposed [19], however, the role of MITEs was not discussed in that the researchers used known miRNA fold-back sequences. There is a postulation that miRNAs could have evolved from TE-encoded siRNAs in the way that the TIRs that are processed from longer RNAs to form siRNAs could be similarly processed to form miRNAs [8]. Thus we assumed that small RNAs that originated from MITEs were matured by miRNA biogenesis pathways (Fig. 2). However, additional analysis of these small RNAs in rice mutants relating to the biogenesis of miRNAs, such as DCL1, is necessary to validate this hypothesis [20,21]. Predictably, the evolutionary relationship and distinction between plant miRNA and siRNA will become clearer as the biogenesis knowledge about these MITE-derived small RNAs accumulates.

The average copy number of rice MITEs ranges from dozens to thousands. Here we only extracted hairpin precursors perfectly matched to the cloned sequences, thus many homologs might have been missed. In view of our limited number of small RNA libraries, predicting the average copy number of the MITE-derived small RNAs is not practical. However, taking account of the variability of plant fold-back precursors and 90,000 MITEs in rice, it is a really interesting area of study, and our cloned small RNA sequences from MITEs just show the tip of the iceberg.

Although plant small RNA target prediction is convenient for the extensive complementarity between small RNA and its target according to the accepted principles [22,23], the authentic targets of the vast majority of plant small RNAs are not characterized, and the phenotypic consequences of disrupted or altered small RNA regulation are also obscure because projects often take place over long time periods, and chance events take place, especially in the study of rice [11,24-26]. Northern blot data revealed that we have cloned five flower-exclusive small RNAs among the tissues analyzed, hinting that they might act like miRNAs to get involved in inflorescence. The molecular scenarios are waiting to be unraveled.

 

 

References

 

 1�� Johnson C, Bowman L, Adai AT, Vance V, Sundaresan V. CSRDB: a small RNA integrated database and browser resource for cereals. Nucleic Acids Res 2007, 35: D829-D833

 2�� Vazquez F. Arabidopsis endogenous small RNAs: highways and byways. Trends Plant Sci 2006, 11: 460-468

 3�� Chen X. MicroRNA biogenesis and function in plants. FEBS Lett 2005, 579: 5923-5931

 4�� Yoshikawa M, Peragine A, Park MY, Poethig RS. A pathway for the biogenesis of trans-acting siRNAs in Arabidopsis. Genes Dev 2005, 19: 2164-2175

 5�� Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell 2004, 116: 281-297

 6�� Smalheiser NR, Torvik VI. Mammalian microRNAs derived from genomic repeats. Trends Genet 2005, 21: 322-326

 7�� Piriyapongsa J, Marino-Ramirez L, Jordan IK. Origin and evolution of human microRNAs from transposable elements. Genetics 2007, 176: 1323-1337

 8�� Piriyapongsa J, Jordan IK. A family of human microRNA genes from miniature inverted-repeat transposable elements. PLoS ONE 2007, 2: e203

 9�� Mette MF, van der Winden J, Matzke M, Matzke AJ. Short RNAs can identify new candidate transposable element families in Arabidopsis. Plant Physiol 2002, 130: 6-9

10Jiang N, Feschotte C, Zhang X, Wessler SR. Using rice to understand the origin and amplification of miniature inverted repeat transposable elements (MITEs). Curr Opin Plant Biol 2004, 7: 115-119

11Sunkar R, Girke T, Jain PK, Zhu JK. Cloning and characterization of microRNAs from rice. Plant Cell 2005, 17: 1397-1411

12Sunkar R, Girke T, Zhu JK. Identification and characterization of endogenous small interfering RNAs from rice. Nucleic Acids Res 2005, 33: 4443-4454

13Chen Z, Zhang J, Kong J, Li S, Fu Y, Li S, Zhang H et al. Diversity of endogenous small non-coding RNAs in Oryza sativa. Genetica 2006, 128: 21-31

14Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 2003, 31: 3406-3415

15Allen E, Xie Z, Gustafson AM, Carrington JC. microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell 2005, 121: 207-221

16Nakano M, Nobuta K, Vemaraju K, Tej SS, Skogen JW, Meyers BC. Plant MPSS databases: signature-based transcriptional resources for analyses of mRNA and small RNA. Nucleic Acids Res 2006, 34: D731-D735

17Johnson C, Sundaresan V. Regulatory small RNAs in plants. EXS 2007, 97: 99-113

18Carrington JC. Small RNAs and Arabidopsis. A fast forward look. Plant Physiol 2005, 138: 565-566

19Allen E, Xie Z, Gustafson AM, Sung GH, Spatafora JW, Carrington JC. Evolution of microRNA genes by inverted duplication of target gene sequences in Arabidopsis thaliana. Nat Genet 2004, 36: 1282-1290

20Liu B, Li P, Li X, Liu C, Cao S, Chu C, Cao X. Loss of function of OsDCL1 affects microRNA accumulation and causes developmental defects in rice. Plant Physiol 2005, 139: 296-305

21Axtell MJ, Jan C, Rajagopalan R, Bartel DP. A two-hit trigger for siRNA biogenesis in plants. Cell 2006, 127: 565-577

22Zhang Y. miRU: an automated plant miRNA target prediction server. Nucleic Acids Res 2005, 33: W701-W704

23Li Y, LI W, Jin YX. Computational identification of novel family members of microRNA genes in Arabidopsis thaliana and Oryza sativa. Acta Biochim Biophys Sin 2005, 37: 75-87

24Luo YC, Zhou H, Li Y, Chen JY, Yang JH, Chen YQ, Qu LH. Rice embryogenic calli express a unique set of microRNAs, suggesting regulatory roles of microRNAs in plant post-embryogenic development. FEBS Lett 2006, 580: 5111-5116

25Reinhart BJ, Weinstein EG, Rhoades MW, Bartel B, Bartel DP. MicroRNAs in plants. Genes Dev 2002, 16: 1616-1626

26Jones-Rhoades MW, Bartel DP. Computational identification of plant microRNAs and their targets, including a stress-induced miRNA. Mol Cell 2004, 14: 787-799