Comparative analysis of miRNAs and their targets across four plant species
© Walther et al; licensee BioMed Central Ltd. 2010
Received: 5 July 2011
Accepted: 8 November 2011
Published: 8 November 2011
MicroRNA (miRNA) mediated regulation of gene expression has been recognized as a major posttranscriptional regulatory mechanism also in plants. We performed a comparative analysis of miRNAs and their respective gene targets across four plant species: Arabidopsis thaliana (Ath), Medicago truncatula(Mtr), Brassica napus (Bna), and Chlamydomonas reinhardtii (Cre).
miRNAs were obtained from mirBase with 218 miRNAs for Ath, 375 for Mtr, 46 for Bna, and 73 for Cre, annotated for each species respectively. miRNA targets were obtained from available database annotations, bioinformatic predictions using RNAhybrid as well as predicted from an analysis of mRNA degradation products (degradome sequencing) aimed at identifying miRNA cleavage products. On average, and considering both experimental and bioinformatic predictions together, every miRNA was associated with about 46 unique gene transcripts with considerably variation across species. We observed a positive and linear correlation between the number miRNAs and the total number of transcripts across different plant species suggesting that the repertoire of miRNAs correlates with the size of the transcriptome of an organism. Conserved miRNA-target pairs were found to be associated with developmental processes and transcriptional regulation, while species-specific (in particular, Ath) pairs are involved in signal transduction and response to stress processes. Conserved miRNAs have more targets and higher expression values than non-conserved miRNAs. We found evidence for a conservation of not only the sequence of miRNAs, but their expression levels as well.
Our results support the notion of a high birth and death rate of miRNAs and that miRNAs serve many species specific functions, while conserved miRNA are related mainly to developmental processes and transcriptional regulation with conservation operating at both the sequence and expression level.
Keywordsplants miRNA miRNA targets conservation next generation sequencing
The discovery of microRNAs (miRNAs) in different kingdoms and in many species prompted comparative analyses to identify those miRNAs that are more strongly conserved than others and to understand whether their main functional role is associated with species-specific or universally occurring processes or both. Animal miRNAs have been reported to be involved in developmental timing, cell death, cell proliferation, haematopoiesis, and patterning of the nervous system , i.e. primarily developmental processes. MicroRNAs involved in these processes were also found to be conserved across species . Genes involved in basic cellular maintenance functions are less often miRNA targets . Many miRNA families have also been conserved across different plant lineages including mosses, gymnosperms, moncotes, and dicots [4–6]. However, with modern sequencing technologies that allow miRNAs do be identified at increased breadths, it was also noted that the number of species-specific miRNAs is greater than the number of conserved miRNAs [7, 8]. Thus, a high birth and death rate of miRNAs has been postulated . The highly dynamic nature of miRNA evolution was also confirmed recently in a comparative analysis of the closely related Arabidopsis species A.thaliana and A.lyrata . A substantial number of miRNAs was found to be species-specific, despite the only recent separation of the two species. No event of miRNA conservation between plants and animals has yet been found . As also the miRNA processing and targeting differs substantially, it has been concluded that the miRNA mechanism has evolved separately in animals and plants from common ancestral siRNA machinery .
In this study, we carried out a comparative analysis of miRNAs and their targets across the four plant species Arabidopsis thaliana (Ath), Brassica napus(Bna) - both members of the brassicaceae family, Medicago truncatula (Mtr) - a legume, and Chlamydomonas reinhardtii (Cre) - a single cell alga. The choice of plant species was motivated by several research projects conducted at the Max Planck Institute for Molecular Plant Physiology. The unifying goal of these studies was to identify functional miRNAs, to profile known miRNAs with regard to their abundance and to potentially discover novel miRNAs by applying the Solexa/Illumina Next Generation Sequencing (NGS) technology to RNA extractions for the different plant systems exposed to different conditions. More specifics and results regarding these studies can be found in  and , and in the Method section. Recently, so-called degradome sequencing was established as a powerful experimental approach to detect miRNA targets  and corresponding bioinformatic data processing pipelines introduced . Here, the cleavage products generated upon miRNA induced mRNA target cleavage are specifically identified thereby allowing those miRNA-target pairs to be identified for which cleavage is the mode of action while not detecting those targets that are under translational repression. Degradome data have also been used in the current study.
miRNA, cDNA, EST sequence data
For the four investigated plant species, we obtained mature miRNA sequences and stem-loop sequences associated with miRNA precursors from miRBase release 15 http://www.mirbase.org  yielding 218 miRNAs for Ath, 375 for Mtr, 46 for Bna, and 73 for Cre, respectively. No miRNA-star sequences were considered for analysis. For Ath, cDNA sequence information was obtained from The Arabidopsis Information Resource (TAIR, http://www.arabidopsis.org), genome release 9 . For Bna, assembled contigs were retrieved from PlantGDB http://www.plantgdb.org/. For Mtr, sequence and annotation information was obtained from The Medicago Genome Sequence Consortium (MGSC, http://www.medicago.org), and here referred to as "Mt3.0". Sequences for Cre were downloaded from the DOE Joint Genome Institute using genome assembly v4.0 and Augustus v5.0 gene models http://genome.jgi-psf.org.
Conditions for experimental data; small RNA, degradome sequencing data
We used smallRNA sequencing data obtained and published for Ath under eight [11, 17] and for Mtr under two different experimental conditions . The specific conditions were for Mtr: a) treatment with the symbiotic fungus mycorrhiza ("Myc") and b) treatment without the fungus ("N-Myc"). The eight conditions for Ath were: full nutrition ("FN"), phosphate starvation ("P"), phosphate starvation after three hours phosphate re-addition ("P+3 h"), nitrogen starvation ("N"), nitrogen starvation after three hours nitrogen re-addition ("N+3 h") (all from ), and FN from root cells ("root+p"), phosphate starvation from root cells ("root-p"), and phosphate starvation from shoot cells ("shoot-p") (from ). In total, 15.8 Mill small RNA reads were sequenced for Ath and 13.6 Mill reads for Mtr (2 conditions ("Myc", "N-Myc"). Degradome data to experimentally identify miRNA targets by detecting miRNA induced cleavage products from four conditions in Ath ("FN", "P-12 h", "P-48 h" and "N-48 h") and two conditions in Mtr ("Myc", "N-Myc") were used. For experimental details see [11, 12].
Normalization of expression data
Normalization of expression values per condition was done to adjust for variable sequencing depth between samples. The sequencing reads mapping on annotated miRNA were normalized to reads per million (RPM) per experimental condition: number of reads per gene/number of total reads * 1E6.
Criteria for conserved miRNAs
For analyzing the conservation of miRNA families across species, we performed a pairwise global sequence alignment of all single mature miRNA sequences with the program Align0 . Sequence pairs were considered conserved if the sequence identity was greater than 75, if there was a perfect match of seed sequence (6 nt, positions 2-7), and the two respective identifiers of the pair were classified by miRBase to be in the same MIRNA family.
Verified miRNA-target relationships were extracted from several sources: Supplementary Data of  (500 targets), 530 targets in total from the Arabidopsis Small RNA Project, "ASRP" (http://asrp.cgrb.oregonstate.edu, ), experimental data reported in Supplementary Table two and three of , referred to here as "degradomeG" (60 targets), and degradome sequencing data for Ath and Mtr from in-house experiments, called "degradome" (1,154 targets) [11, 12]. To identify miRNA-target relationships from degradome data, the CleaveLand algorithm was used [14, 21]. miRNA-targets were further predicted using the program RNAhybrid . The mature miRNA sequence data from miRBase and, on the potential target side, the downloaded cDNAs or assembled ESTs mentioned above were used as input. We used the parameter settings described in . We required the minimum free energy of hybridization to be greater than 70% compared to perfect match hybridization; i.e. in concordance with the initial threshold used in . Note that for the final set, the authors in  used a stricter 75% mfe cutoff. In Arabidopsis and using a 75% mfe threshold level, we obtained 2,967 unique targets transcripts for 218 miRNAs. All RNAhybrid predictions with additional score and mfe information for all four plant species are provided in tabular format as supplementary material (Additional File 1).
Gene Ontology (GO) annotation files were downloaded: for Ath from TAIR , for Cre from the DOE Joint Genome Institute http://genome.jgi-psf.org/Chlre4/Chlre4.download.ftp.html, and for Mtr from http://www.medicago.org/genome/downloads/Mt3/. Annotations for Bna were assigned by copying the GO slim term from TAIR for the best hit from a BLAST run against Ath. The calculation of over-representation of GO terms was done by applying the Fisher's Exact Test for count data and the p-values for Molecular Functions and Biological Processes were adjusted for multiple testing applying the Benjamini-Hochberg method .
Overview of the miRNA and target statistics
Genome size (Mbp)
average targets per
As reported for plant miRNA target action earlier , most miRNA target sites were found to fall within the coding regions (86% in Ath), whereas the 5' and 3'UTR regions are targeted by approximately 7% (in Ath).
Conservation of miRNAs and their targets
miRNA-target relationships in Arabidopsis thaliana
Properties of conserved versus non-conserved miRNA
Conserved miRNA families were found to target on average more gene transcripts - with the average number of targets summed up across the three species Ath, Bna, and Mtr amounting to 161.4 - than their non-conserved counterparts (106.2), p = 0.073 (Mann-Whitney test). Based on the available quantitative data of miRNA expression via normalized read counts (see Methods), conserved miRNAs were found to be expressed at higher levels than non-conserved miRNAs. In Ath, the average log-2 expression value for conserved miRNAs was 9.25 and significantly higher than the corresponding value for non-conserved miRNAs (3.92, p = 2.2e-5), observed similarly in Mtr with 7.39 for conserved vs. 4.94 average log-2 expression level for non-conserved miRNAs, albeit significance could not be established (p = 0.21).
Biological process involvement of conserved/non-conserved miRNAs.
Species specific targets
GO Process Term
Unknown biological processes
Other metabolic processes
Response to stress
Other cellular processes
Electron transport or energy pathway
Conservation of miRNA expression levels
We performed a comparative analysis of miRNAs in four different plant species (Ath, Bna, Mtr, and Cre). Our results confirm previous results that miRNA evolution appears to be rapid suggesting a significant participation of miRNAs in species-specific processes [8, 9]. The observation that species-specific miRNAs and their targets appear to be involved in processes involving interactions with the environment, such as signal transduction and stress response (Table 2) supports the notion that miRNAs are an important level of regulation at the speciation level as every species will have their very own environment to cope with. The observation that genes involved in "unknown biological processes" were also found overrepresented in the set of target genes of non-conserved miRNAs may either suggest that there are still many species-specific genes not properly characterized yet, or that those miRNA-target associations are spurious in the sense that the annotation of the genes and/or the identification of the miRNA may have been incorrect.
Small RNA sequencing data was analyzed to assess conservation not only at the sequence, but also at the expression level with the conclusion that miRNA expression is conserved as well. Therefore, it may be worthwhile to compare the respective cis-regulatory regions associated miRNA genes across different species and to investigate evolutionary differences and conservation patterns.
Further improvements also seem possible on the bioinformatic target prediction side. While it is clear that in silico methods may yield more predictions than miRNA-target pairs detected experimentally - as they depend on the miRNA actually being expressed - ideally, all of the experimentally found miRNA-target pairs would also be found by in silico methods.
Gene expression regulation via miRNAs in plants appears to scale with genome size and to play a predominant role in species specific adaptation processes. In cases of miRNA conservation not only is the sequence conserved, but also their expression with targeted processes associated with general, developmental programs.
We wish to thank our colleagues from the Max Planck Institute for Molecular Plant Physiology, Potsdam-Golm, for providing the experimental data used in this study. In particular, Wolf-Rüdiger Scheible (Ath, Bna data), Franziska Krajinski (Mtr data) and their teams.
- Ambros V: The functions of animal microRNAs. Nature. 2004, 431: 350-355. 10.1038/nature02871.PubMedView ArticleGoogle Scholar
- Wienholds E, Plasterk RH: MicroRNA function in animal development. FEBS Lett. 2005, 579: 5911-5922. 10.1016/j.febslet.2005.07.070.PubMedView ArticleGoogle Scholar
- Stark A, Brennecke J, Bushati N, Russell RB, Cohen SM: Animal MicroRNAs confer robustness to gene expression and have a significant impact on 3'UTR evolution. Cell. 2005, 123: 1133-1146. 10.1016/j.cell.2005.11.023.PubMedView ArticleGoogle Scholar
- Zhang B, Pan X, Cannon CH, Cobb GP, Anderson TA: Conservation and divergence of plant microRNA genes. Plant J. 2006, 46: 243-259. 10.1111/j.1365-313X.2006.02697.x.PubMedView ArticleGoogle Scholar
- Axtell MJ, Bowman JL: Evolution of plant microRNAs and their targets. Trends Plant Sci. 2008, 13: 343-349. 10.1016/j.tplants.2008.03.009.PubMedView ArticleGoogle Scholar
- Axtell MJ, Bartel DP: Antiquity of microRNAs and their targets in land plants. Plant Cell. 2005, 17: 1658-1673. 10.1105/tpc.105.032185.PubMedPubMed CentralView ArticleGoogle Scholar
- Fahlgren N, Howell MD, Kasschau KD, Chapman EJ, Sullivan CM, Cumbie JS, Givan SA, Law TF, Grant SR, Dangl JL, Carrington JC: High-throughput sequencing of Arabidopsis microRNAs: evidence for frequent birth and death of MIRNA genes. PLoS One. 2007, 2: e219-10.1371/journal.pone.0000219.PubMedPubMed CentralView ArticleGoogle Scholar
- Rajagopalan R, Vaucheret H, Trejo J, Bartel DP: A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes Dev. 2006, 20: 3407-3425. 10.1101/gad.1476406.PubMedPubMed CentralView ArticleGoogle Scholar
- Fahlgren N, Jogdeo S, Kasschau KD, Sullivan CM, Chapman EJ, Laubinger S, Smith LM, Dasenko M, Givan SA, Weigel D, Carrington JC: MicroRNA gene evolution in Arabidopsis lyrata and Arabidopsis thaliana. Plant Cell. 2010, 22: 1074-1089. 10.1105/tpc.110.073999.PubMedPubMed CentralView ArticleGoogle Scholar
- Axtell MJ: Evolution of microRNAs and their targets: are all microRNAs biologically relevant?. Biochim Biophys Acta. 2008, 1779: 725-734.PubMedView ArticleGoogle Scholar
- Pant BD, Musialak-Lange M, Nuc P, May P, Buhtz A, Kehr J, Walther D, Scheible WR: Identification of nutrient-responsive Arabidopsis and rapeseed microRNAs by comprehensive real-time polymerase chain reaction profiling and small RNA sequencing. Plant Physiol. 2009, 150: 1541-1555. 10.1104/pp.109.139139.PubMedPubMed CentralView ArticleGoogle Scholar
- Devers EA, Branscheid A, May P, Krajinski F: Stars and symbiosis: microRNA- and microRNA*-mediated transcript cleavage involved in arbuscular mycorrhizal symbiosis. Plant Physiol. 2011Google Scholar
- German MA, Pillay M, Jeong DH, Hetawal A, Luo S, Janardhanan P, Kannan V, Rymarquis LA, Nobuta K, German R, De Paoli E, Lu C, Schroth G, Meyers BC, Green PJ: Global identification of microRNA-target RNA pairs by parallel analysis of RNA ends. Nat Biotechnol. 2008, 26: 941-946. 10.1038/nbt1417.PubMedView ArticleGoogle Scholar
- Addo-Quaye C, Miller W, Axtell MJ: CleaveLand: a pipeline for using degradome data to find cleaved small RNA targets. Bioinformatics. 2009, 25: 130-131. 10.1093/bioinformatics/btn604.PubMedPubMed CentralView ArticleGoogle Scholar
- Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ: miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008, 36: D154-158. 10.1093/nar/gkn221.PubMedPubMed CentralView ArticleGoogle Scholar
- Huala E, Dickerman AW, Garcia-Hernandez M, Weems D, Reiser L, LaFond F, Hanley D, Kiphart D, Zhuang M, Huang W, Mueller LA, Bhattacharyya D, Bhaya D, Sobral BW, Beavis W, Meinke DW, Town CD, Somerville C, Rhee SY: The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant. Nucleic Acids Res. 2001, 29: 102-105. 10.1093/nar/29.1.102.PubMedPubMed CentralView ArticleGoogle Scholar
- Hsieh LC, Lin SI, Shih AC, Chen JW, Lin WY, Tseng CY, Li WH, Chiou TJ: Uncovering small RNA-mediated responses to phosphate deficiency in Arabidopsis by deep sequencing. Plant Physiol. 2009, 151: 2120-2132. 10.1104/pp.109.147280.PubMedPubMed CentralView ArticleGoogle Scholar
- Myers EW, Miller W: Optimal alignments in linear space. Comput Appl Biosci. 1988, 4: 11-17.PubMedGoogle Scholar
- Alves L, Niemeier S, Hauenschild A, Rehmsmeier M, Merkle T: Comprehensive prediction of novel microRNA targets in Arabidopsis thaliana. Nucleic Acids Res. 2009, 37: 4010-4021. 10.1093/nar/gkp272.PubMedView ArticleGoogle Scholar
- Gustafson AM, Allen E, Givan S, Smith D, Carrington JC, Kasschau KD: ASRP: the Arabidopsis Small RNA Project Database. Nucleic Acids Res. 2005, 33: D637-640.PubMedPubMed CentralView ArticleGoogle Scholar
- Devers E, Branscheid P, May P, Kranjinski F: Stars and symbiosis: microRNA- and microRNA*-mediated transcript cleavage involved in arbuscular mycorrhizal symbiosis. Plant Physiology. 2011, 111-172627Google Scholar
- Kruger J, Rehmsmeier M: RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Res. 2006, 34: W451-454. 10.1093/nar/gkl243.PubMedPubMed CentralView ArticleGoogle Scholar
- Benjamini Y, Hochberg Y: Controlling the False Discovery Rate - a Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B-Methodological. 1995, 57: 289-300.Google Scholar
- Rhoades MW, Reinhart BJ, Lim LP, Burge CB, Bartel B, Bartel DP: Prediction of plant microRNA targets. Cell. 2002, 110: 513-520. 10.1016/S0092-8674(02)00863-2.PubMedView ArticleGoogle Scholar
- Johnston JS, Pepper AE, Hall AE, Chen ZJ, Hodnett G, Drabek J, Lopez R, Price HJ: Evolution of genome size in Brassicaceae. Ann Bot. 2005, 95: 229-235. 10.1093/aob/mci016.PubMedPubMed CentralView ArticleGoogle Scholar
- Misumi O, Yoshida Y, Nishida K, Fujiwara T, Sakajiri T, Hirooka S, Nishimura Y, Kuroiwa T: Genome analysis and its significance in four unicellular algae, Cyanidioschyzon [corrected] merolae, Ostreococcus tauri, Chlamydomonas reinhardtii, and Thalassiosira pseudonana. J Plant Res. 2008, 121: 3-17. 10.1007/s10265-007-0133-9.PubMedView ArticleGoogle Scholar
- Gutierrez MV, Vaz Patto MC, Huguet T, Cubero JI, Moreno MT, Torres AM: Cross-species amplification of Medicago truncatula microsatellites across three major pulse crops. Theor Appl Genet. 2005, 110: 1210-1217. 10.1007/s00122-005-1951-6.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.