
A deeper view into the significance of simple sequence repeats in pre-miRNAs provides clues for its possible roles in determining the function of microRNAs

Abstract

Background

The central tenet of ‘genome content’ has been that the ‘non-coding’ parts of the genome are highly enriched with ‘microsatellites’ or ‘Simple Sequence Repeats’ (SSRs). We presume that the presence of SSRs at different genomic locations, and changes in their number of repeat units (n), may or may not be beneficial, depending on the position of the SSRs in a gene. Very few studies have looked into the existence of SSRs in the hairpin precursors of miRNAs (pre-miRNAs), and the interplay between SSRs and miRNAs is not yet clearly understood.

Results

Considering the potential significance of SSRs in pre-miRNAs, we analysed the miRNA hairpin precursors of 171 organisms, which revealed a notable incidence (29.8%) of SSRs in their pre-miRNAs. The maintenance of SSRs in pre-miRNAs even in complex, highly evolved phyla such as Chordata and Magnoliophyta suggests that they serve diverse functions. Computational and experimental analyses further pointed to putative effects of SSRs on either the biogenesis or the function of miRNAs. A preliminary computational analysis of the SSRs maintained in pre-miRNA sequences detected splicing regulatory elements (SREs) either within or near the SSRs. When the SSRs were absent, correspondingly fewer SREs were detected.

Conclusion

The present study provides the first indication that SSRs may be involved in shaping SREs, and thereby the Alternative Splicing events that produce miRNA isoforms in response to different stress environments. This work demonstrates the importance of studying such consistently maintained SSRs in pre-miRNAs and should encourage further research into the exact function of SSRs.

Background

The difference in genome complexity between small worms and highly evolved humans is thought to reside in the ‘non-coding’ part of the genome, which was once dismissed as ‘dead ends’ or ‘genetic waste’. Reports point out that the number of genes does not increase in proportion to the complexity and size of the genome, suggesting that the non-coding part of the genome has evolved under positive selection pressure. New high-throughput sequencing technologies have revealed the importance of ‘non-coding transcripts’ and shifted attention away from the well-studied ‘coding transcripts’, which constitute a smaller percentage. The non-coding region includes two parts: unique elements (promoters, enhancers, repressors, boundary elements, introns, conserved regions, pseudogenes, non-coding RNAs) and repetitive elements (transposable elements, tandem repeats) [1]. ‘Microsatellites’, also called ‘Simple Sequence Repeats’ (SSRs) or ‘Simple Tandem Repeats’ (STRs), are a major class of tandem repeats. They are tandem arrays of short (1–5 bp) repeated DNA sequences [2] that are found in most genomes and have a high mutation rate of 10⁻² to 10⁻⁶ nucleotides per locus per generation [3]; hence they are widely used in fingerprinting studies. SSRs attracted more attention once it was discovered that changes in the number of repeat units of SSRs located within genes can cause phenotypic changes. The effects of SSRs have been studied best in plant species such as rice [4, 5], common bean [6], barley [7] and Arabidopsis [8], and also in humans [9].

SSRs, however, have been poorly analysed in functional small regulatory non-coding RNAs such as microRNAs (miRNAs). miRNAs (~20 nt) play a major role in many biological processes, and their biogenesis starts from primary miRNA transcripts known as pri-miRNAs. The pri-miRNAs adopt a stem-loop secondary structure, the pre-miRNA, from which a specific 21-nucleotide miRNA duplex is excised by a Dicer endonuclease [10]. Our previous transcriptome profiling experiments revealed the existence of SSRs in the non-coding transcripts of black pepper [11]. This finding led us to examine whether SSRs occur in all the available pre-miRNAs across different taxa. To date, with few exceptions [11,12,13], no study has clearly demonstrated the presence or the functions of SSRs in the hairpin precursors of miRNAs. Our objective was therefore to determine the incidence of SSRs in all the available pre-miRNAs of plants, animals and viruses by computational analysis, in order to better understand the significance of SSRs occurring in pre-miRNAs. The preliminary observations revealed a significant incidence of SSRs and indicated their possible involvement in Alternative Splicing (AS) events. AS, also known as differential splicing, is a regulated process that increases the transcriptome and proteome diversity of an organism [14]. Key regulators of AS are the cis-acting Splicing Regulatory Elements (SREs), which are categorized into four classes according to their location and their effect on splicing as enhancers or silencers: ESE (Exon Splicing Enhancer), ESS (Exon Splicing Silencer), ISE (Intron Splicing Enhancer) and ISS (Intron Splicing Silencer) [15]. Here, the detection of SREs near SSRs in pre-miRNAs suggests the possible involvement of SSRs in shaping AS events.

Results

The distribution pattern of SSRs in hair-pin precursors of miRNAs

SSRIT analysis of all the available miRNA precursors extracted from miRBase showed a significant presence of SSRs, which amounted to about 29.8%. The frequency and distribution pattern of SSRs varied extensively across the taxa analysed (see Additional file 1). SSR arrays were classified as di-, tri-, tetra-, penta- or hexanucleotide according to the length of the repeated motif. About 84.71% of the SSRs were dinucleotide repeats, 12.5% trinucleotide, 2.003% tetranucleotide, 0.544% pentanucleotide and 0.181% hexanucleotide. When the relative count of SSR-bearing pre-miRNAs (the number of SSR-bearing pre-miRNAs out of the total number of pre-miRNAs) was considered, Homo sapiens displayed the highest count, followed by Mus musculus; the lowest counts were found in Cunninghamia lanceolata, Macropus eugenii, Lemur catta, Marsupenaeus japonicus, Strigamia maritima, Glottidia pyramidata, Leucosolenia complicata, Sycon ciliatum, BK polyomavirus, Bandicoot papillomatosis carcinomatosis virus, Herpesvirus saimiri strain A11, JC polyomavirus, Merkel cell polyomavirus and Simian virus 40. The low counts may reflect the lack of extensive miRNA characterisation studies in these organisms, so they cannot be conclusively characterised. Similarly, a few organisms such as Avicennia marina, Phytophthora sojae and Terebratulina retusa had very few SSR-bearing pre-miRNAs, yet all of their pre-miRNAs contained SSRs, giving a relative abundance of 100% (Fig. 1).

For a deeper understanding of the SSR motifs in pre-miRNAs, we focused further on the model system Arabidopsis thaliana. A closer examination of the 325 reported pre-miRNAs of A. thaliana revealed a significant presence of different types of SSR motifs. About 45% of the pre-miRNAs in A. thaliana carried SSRs, of which 77% were dinucleotide, 19% trinucleotide, 3% tetranucleotide and 1% pentanucleotide SSRs. The distribution of the SSR types identified is shown in Fig. 2. Of the SSR-bearing pre-miRNAs, 7.5% had transcription factors such as SBP, MYB, NAC, HLZ, ARF, GRAS, ZF, BZIP, bHLH and WRKY as corresponding targets. A comparison between normal PCR with miRNA-specific primers and deletion PCR with primers designed to avoid the SSR regions revealed a difference in the size of the PCR products (Fig. 3). Five sets of miRNAs were then chosen on criteria such as the type of SSR motif, whether the miRNA targets transcription factors, and the length of the SSR repeat unit. The PCR profile showed either an absence of amplicons or a difference in their size, indicating possible deletion of the SSR regions in the pre-miRNAs. More focused studies of this kind could reveal significant roles for SSRs in pre-miRNAs.
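The relative count and the type proportions reported in this section are simple ratios. The following minimal Python sketch shows the arithmetic using the A. thaliana figures from this study (149 SSR-bearing pre-miRNAs out of 325 reported pre-miRNAs). The function names and the toy motif list are hypothetical and are used for illustration only.

```python
from collections import Counter

# Motif length -> SSR type label, as used in this study (di- to hexanucleotide).
TYPE_NAMES = {2: "di", 3: "tri", 4: "tetra", 5: "penta", 6: "hexa"}


def relative_count(ssr_bearing, total):
    """Percentage of pre-miRNAs that carry at least one SSR."""
    if total <= 0:
        raise ValueError("total number of pre-miRNAs must be positive")
    return 100.0 * ssr_bearing / total


def type_distribution(motifs):
    """Percentage of SSRs per type, given one motif string per detected SSR."""
    counts = Counter(TYPE_NAMES[len(m)] for m in motifs)
    n = sum(counts.values())
    if n == 0:
        return {}
    return {name: 100.0 * c / n for name, c in counts.items()}


# A. thaliana: 149 SSR-bearing pre-miRNAs out of 325 (values from the text).
print(round(relative_count(149, 325), 1))  # 45.8

# Hypothetical toy input: four detected SSRs, three di- and one trinucleotide.
print(type_distribution(["CT", "AG", "GGA", "CT"]))  # {'di': 75.0, 'tri': 25.0}
```

The first result agrees with the "about 45%" stated for A. thaliana.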

Fig. 1

Comprehensive Circos plot depicting the frequency and distribution pattern of tandem repeats occurring in the miRNA precursors across different organisms. Outermost circle (I): The names of individual organisms selected for the study, whose details are given in Additional file 1. Subsequent inner circle (II): Phyla-based categorization of organisms: Chlorophyta (Chloro), Mycetozoa (Myce), Heterokontophyta (Hete), Embryophyta (Embr), Coniferophyta (Coni), Magnoliophyta (Magn), Porifera (Pori), Cnidaria (Cnid), Platyhelminthes (Plat), Nematoda (Nema), Annelida (Anne), Mollusca (Moll), Nemertea (Neme), Brachiopoda (Brac), Arthropoda (Arth), Deuterostoma (Deut), Hemichordata (Hemi), Echinodermata (Echi) and Chordata (Chor). Subsequent inner circle (III): Kingdom-based classification of organisms: Protista (P), Plantae (P), Animalia (A) and Viruses (V). Subsequent inner circle (IV): The corresponding serial numbers of organisms, as listed in Additional file 1. Subsequent inner circle (V): The relative count of dinucleotide SSRs in miRNA precursors. Subsequent inner circle (VI): The relative count of trinucleotide SSRs in miRNA precursors. Subsequent inner circle (VII): The relative count of tetranucleotide SSRs in miRNA precursors. Subsequent inner circle (VIII): The relative count of pentanucleotide SSRs in miRNA precursors. Subsequent inner circle (IX): The relative count of hexanucleotide SSRs in miRNA precursors. Subsequent inner circle (X): The total count of miRNA precursors. Subsequent inner circle (XI): The total count of SSR-containing miRNA precursors

Fig. 2

Distribution pattern of different types of SSR motif in the pre-miRNAs of Arabidopsis thaliana. The X-axis shows the different types of SSR motifs identified in the pre-miRNAs of A. thaliana and the Y-axis shows the relative count of each SSR motif identified

Fig. 3

PCR products showing the difference in size of the amplicons observed after normal PCR with miRNA-specific forward and reverse primers and deletion PCR with forward and reverse primers designed to avoid SSR regions in pre-miRNAs. a Normal PCR with 15 sets of miRNAs (miR164b, miR408, miR2936, miR166e, miR8183, miR167d, miR5021, miR169e, miR166f, miR167c, miR863, miR5015, miR3434, miR156b and miR394a). b and c Primary and secondary deletion PCR with corresponding primer pair combinations; lanes A1, B1, C1: 100 bp ladder; lanes A17, B12, C7: 1 kb ladder; lanes A2 to A16: normal PCR products with miRNA-specific forward and reverse primers; lanes B2 to B11: primary amplicons observed; lanes C2 to C6: final deletion PCR amplicons observed (* corresponds to deletion)

Clues for SSR involvement in shaping SREs for alternative splicing events

When the 149 SSR-bearing pre-miRNAs identified in A. thaliana were subjected to RegRNA analysis, different functional RNA regulatory motifs were detected. Among these, the most interesting and recurrent functional motif was the Splicing Regulatory Element (SRE). The SREs occurred either within or near the SSR motifs in the pre-miRNA sequences. Of the four SRE classes, Intron Splicing Silencers (ISS) were notably present in most of the CT/TC-type SSR motifs. Such CT/TC motifs were well conserved in most members of conserved miRNA families such as miR156 and miR157. In miR854 family members, a trinucleotide SSR of type GGA was found to be conserved, with Exon Splicing Enhancer (ESE)-like activity. Strikingly, two different SSR motifs flanking an SRE such as an Intron Splicing Enhancer (ISE), in the arrangement [AG-ISE-CA], were also noticed among members of the miR156 family. In addition to SREs, other functional RNA regulatory motifs associated with SSRs included Transcriptional Regulatory Motifs (TRMs), untranslated region (UTR) motifs, cis-regulatory elements, non-coding RNA (ncRNA) hybridization regions and miRNA target sites. The AG motif was yet another SSR type conserved among miR8167 family members, with a potential function as a TRM (see Additional file 2). Together with the observation that SSR motifs are consistently maintained even in the highly evolved Chordata and Magnoliophyta, this increases the likelihood that such SSRs are functional. The significant overlap of SSRs with SREs in pre-miRNAs prompted us to examine the possible role of SSRs in determining the SREs required for AS. For this, a computational deletion analysis of the SSR motifs in the pre-miRNA sequences was carried out. When the tailored pre-miRNAs were subjected to SRE prediction, the corresponding SREs were no longer detected in the absence of certain SSRs (Table 1). Of the four SRE classes (ESE, ESS, ISE and ISS), the ISS and ESE elements were the most affected when the SSR motifs were deleted. Among the SSR motifs, the CT/TC type was the most prominent and consistent, and was predicted as ISS sites (Fig. 4). When the CT/TC motifs were deleted, the ISS sites were not detected in the corresponding pre-miRNA sequences. This initial result strengthens the possibility that SSR motifs play a major role in shaping the SREs involved in AS.
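The logic of the computational deletion analysis can be sketched in Python as follows. This is only an illustration under stated assumptions: in the study the deletion was done manually and SRE prediction was done with the RegRNA web server, whereas here a plain regular-expression search stands in for the predictor. The function names, the toy sequence, the toy element patterns and the minimum of three repeat units are all hypothetical.

```python
import re


def delete_ssr(seq, motif, min_repeats=3):
    """Remove every tandem run of `motif` with at least `min_repeats` units."""
    pattern = re.compile("(?:%s){%d,}" % (re.escape(motif), min_repeats))
    return pattern.sub("", seq)


def lost_elements(seq, motif, element_patterns, min_repeats=3):
    """Names of elements found in `seq` but no longer found after SSR deletion.

    `element_patterns` maps a name to a regular expression. It is a stand-in
    for an external SRE predictor (assumption for this sketch).
    """
    trimmed = delete_ssr(seq, motif, min_repeats)
    return [
        name
        for name, pat in element_patterns.items()
        if re.search(pat, seq) and not re.search(pat, trimmed)
    ]


# Hypothetical toy sequence carrying a (CT)4 repeat.
toy_seq = "GGACTCTCTCTGA"
toy_elements = {"toy_ISS": "CTCTCT", "toy_other": "GGA"}

print(delete_ssr(toy_seq, "CT"))                   # GGAGA
print(lost_elements(toy_seq, "CT", toy_elements))  # ['toy_ISS']
```

In this toy case the element that overlaps the CT repeat disappears after deletion while the unrelated element remains, which mirrors the kind of comparison summarised in Table 1.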

Table 1 Computational deletion analysis of SSR motifs in pre-miRNAs of A. thaliana identifies a potential role for SSRs in shaping SREs

Fig. 4

The CT dinucleotide SSR bearing pre-miRNAs in Arabidopsis thaliana with potential ISS activity. List of CT dinucleotide SSR bearing pre-miRNAs showing ISS activity. The blue underline indicates the position of ISS emphasizing the presence of CT motifs in such regions

Discussion

SSRs can be viewed from several perspectives: (1) as hypervariable molecular markers, a role well demonstrated by their use in fingerprinting studies; (2) in terms of the biological effects of SSRs in genes; and (3) in terms of the interplay between SSRs and miRNAs. The distribution of SSRs in genes is non-random, presumably because they have a variety of putative functions. Reports suggest that SSRs present in both coding and non-coding regions can affect gene expression [1]. SSRs in the 5’UTR can serve as protein binding sites, thereby regulating translation [16]. SSR expansion in the 3’UTR can cause transcriptional slippage and produce expanded mRNA, which can accumulate as nuclear foci and disrupt splicing and other cellular functions. Intronic SSRs can affect gene transcription, mRNA splicing or export to the cytoplasm. SSRs located in ESTs are associated with genes of diverse function, such as metabolic enzymes, structural and storage proteins, disease signaling and transcription factors. Positive selection has been reported for SSRs that occur near transposons [17]. In Drosophila, a change in the 17-copy Thr-Gly-encoding SSR of the period gene was found to affect the maintenance of circadian rhythm [18]. Reporter assays suggested a putative enhancer-like function for TG repeats [19]. In rodents, the expression pattern of the vasopressin 1a receptor (V1aR) is regulated by differences in an SSR in the 5′ regulatory region, which in turn affect social behavior [20]. In reporter assays, luciferase expression was directly proportional to the length of GA repeats, indicating that such repeats can enhance the transcriptional output of a gene [21,22,23]. Repeats residing in intronic regions can also enhance gene expression, as with the TCAT repeat in the intron of the Tyrosine Hydroxylase gene [24]. A significant enrichment of SSRs near transcriptional start sites (TSS) was observed in humans (60% and 20% of CCG and ACG repeats, respectively, were found within 1 kb of the TSS). These examples indicate the importance of studying SSRs and underline the fact that SSRs can be functional entities in the genome.

Of the three perspectives, the third, concerning the correlation between SSRs and miRNAs, is still debated. Very few reports suggest that SSRs are an important component of pre-miRNAs [10,11,12, 25, 26]. As SSRs present in genes have been shown to have regulatory effects when associated with new miRNA candidates [27] as well as phenotypic effects [18, 20,21,22,23,24], we presume that the SSRs identified in pre-miRNAs in this study may have similar biological effects. SSRs may thus be involved either in the biological function of miRNAs or in their biogenesis. Here, the presence of SSR-bearing or SSR-related miRNAs across different taxa was demonstrated and the incidence in each organism was reported. The study thus provides a compact overview of SSRs in pre-miRNAs, including their extent of occurrence and the types of motif. Repeat-related miRNAs (RrmiRs) have been defined as miRNA genes in which repetitive elements make up at least 50% of the sequence, or 100% of one of the associated mature miRNA sequences [28]. Recently identified RrmiRs in fungi and animals include the Piwi-associated repeat-associated small interfering RNAs (rasiRNAs) and heterochromatic small RNAs (hcRNAs), which are processed from long double-stranded RNA precursors [29, 30].

A comparative study of the SSR types across the taxa showed that dinucleotide SSRs were the predominant type (84.71%), whereas hexanucleotide SSRs (0.181%) were the least frequent. Moreover, the CT and AG dinucleotide SSR motifs were found to be consistent among the members of highly conserved miRNA families such as miR854 and miR156. This preference for dinucleotide SSRs in pre-miRNAs may be related to their probable functions. The frequencies of different repeats can vary considerably between organisms. In humans, A/T repeats are more frequent, whereas in A. thaliana GA/CT repeats predominate [31]. The 5’UTRs of Arabidopsis are reported to have relatively more AG/CT repeats, whereas the 3’UTRs of humans and catfish possess more AC/GT repeats [32]. The frequency of (A/T)n was high in the intronic regions of different species; (AC/GT)n was high in primates, Rodentia, Mammalia, Vertebrata, Arthropoda, fungi and others; and CG/GC repeats were more frequent in C. elegans, yeast and Embryophyta. Among the dinucleotides identified in the current study, the CT motif was the most frequent SSR, which may be related to its function. Earlier reports suggest an increased probability that CT repeats play a major role in the transcription of miRNA genes. CT repeats are reported to form non-B DNA, which may play important roles in the activation of gene transcription [33, 34]. A similar abundance of dinucleotide simple repeats such as (CA)n and (TG)n was reported in the largest family in mouse, mir-467 [27]. Each SSR might be the product of repeated mutations and cross-overs that occurred during the course of evolution. The SSR type observed in a particular pre-miRNA may be what that pre-miRNA requires to carry out its specific function properly. We believe that ‘demand tunes the function by changing the sequence preference’. In addition, the number of times (n) that a particular type of SSR is repeated may or may not affect its putative function. One of the best reported examples is fragile X syndrome (FXS), a triplet expansion disease (TRED), which is the most common neuropsychiatric and intellectual disability disorder in humans [35]. When the trinucleotide CGG repeat located in the 5’UTR of the FMR1 gene expands to over 200 copies, it results in a deficiency of the FMRP protein, which is required for normal neuronal development and plasticity.

The amplicon profile observed after deletion PCR is a promising starting point for studying the effects of deleting SSRs in pre-miRNAs. Together with the computational identification of SREs within or near SSRs, it led us to presume that SSRs are involved in shaping AS to generate variant miRNAs under stress conditions. SREs are sequences in exons and introns that are important for constitutive as well as alternative splicing. They function either as splicing enhancers or as suppressors and affect splice site choice [15]. Our preliminary identification of SSRs that lie adjacent to SREs, or that themselves act as SREs with splicing activity, is the first indication of a likely involvement of SSRs in AS. A possible explanation for the presence of SSRs in pre-miRNAs is that they fine-tune the AS events in pre-miRNAs, which contributes to different isoforms of miRNAs. As miRNAs are tissue specific and developmental stage specific, each miRNA formed in response to a stress factor has a specific role. About 61% and 95% of intron-containing protein-coding genes in A. thaliana and humans, respectively, are reported to undergo AS [36, 37], and the stress-responsive miRNAs in A. thaliana were reported to be G/C rich [38]. miRNAs with UGUGU sequences are said to activate targets associated with carcinogenesis in humans [39]. The correlation between AS and miRNAs has been demonstrated [40], with competition between the splicing machinery and the miRNA processing machinery coming into play. It is assumed that when the splicing machinery does not recognise the internal exon, the miRNA processing components bind to the pre-miRNA splice junction, leading to the formation of a pre-miRNA and a skipped isoform. When the splicing components do recognise the internal exon, the pre-miRNA is not formed; instead, an isoform bearing the internal alternative exon is produced. It is known that miRNAs are generated from either intergenic regions or intronic regions of coding or non-coding genes [41], and that the splicing and processing of intronic miRNAs may affect each other [34, 42]. Canonical U2-type introns are characterised by a GU dinucleotide at the 5’ end and AG at the 3’ end, whereas U12-type introns show AU and AC dinucleotides at the 5′ and 3’ ends. This supports the idea that intron retention may occur during AS events, which may be another reason for the occurrence of ‘tandem repeats’ in such hairpin precursor sequences.
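The terminal dinucleotide rule described above for U2-type and U12-type introns can be written as a small Python check. This is a minimal sketch that uses only the two terminal patterns mentioned in the text. The function name is hypothetical, and in practice terminal dinucleotides alone are not sufficient to assign an intron type.

```python
def classify_intron_by_termini(intron):
    """Label an intron from its first and last two nucleotides only.

    Accepts DNA or RNA letters; T is treated as U. Returns "U2-type" for
    GU...AG, "U12-type" for AU...AC and "unclassified" otherwise.
    """
    rna = intron.upper().replace("T", "U")
    if len(rna) < 4:
        return "unclassified"
    termini = (rna[:2], rna[-2:])
    if termini == ("GU", "AG"):
        return "U2-type"
    if termini == ("AU", "AC"):
        return "U12-type"
    return "unclassified"


# Hypothetical toy introns (not taken from the study).
print(classify_intron_by_termini("GUAAGUCCUUAG"))  # U2-type
print(classify_intron_by_termini("ATATCCTTTTAC"))  # U12-type
print(classify_intron_by_termini("CCAAGG"))        # unclassified
```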

Conclusions

The high mutation rate of SSRs during recombination, polymerase slippage, DNA replication or repair, unequal crossing over and similar processes ultimately results in a change in the number of repeat units. This change may or may not be beneficial, depending on the incidence or position of the SSRs in a gene. The fact that SSRs at different locations have an impact on the genome strongly suggests that they should be considered significant and not discarded as ‘nonsense’. The functional aspects of SSRs remain under debate. SSRs associated with miRNAs are speculated to have potential functions beyond their conventional use in marker-based assays. We speculate that there can be a tug of war between AS and miRNA biogenesis, which may in turn be affected by a change in the number of repeat units (n) present in pre-miRNAs. AS, SSRs and MIR genes form a complex interconnected network in which AS may be one of the crucial steps in miRNA biogenesis, determining the formation of miRNAs in accordance with external stress factors, whereas both AS and miRNA formation can be affected if a change in (n) occurs.

Methods

All the available miRNA precursors of different taxa (Chromalveolata, Metazoa, Mycetozoa, Viridiplantae and Viruses; see Additional file 1) were extracted for the current study from the public database miRBase v.21 (www.mirbase.org). Where miRNA precursors were available for several species of the same genus, the species with the larger number of miRNAs was included. The Simple Sequence Repeat Identification Tool (SSRIT) [43] (http://archive.gramene.org/db/markers/ssrtool) was used to study the frequency, type and distribution pattern of SSRs in each individual sequence.
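To illustrate the kind of search performed in this step, the following minimal Python sketch scans a sequence for perfect tandem repeats. It is not the SSRIT implementation. The function names, the motif length range (2 to 6 nt, matching the di- to hexanucleotide classes reported in this study) and the minimum of three repeat units are assumptions made for this example only, since the thresholds used with SSRIT are not stated here.

```python
import re


def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit."""
    n = len(motif)
    return not any(
        n % d == 0 and motif == motif[:d] * (n // d) for d in range(1, n)
    )


def find_ssrs(seq, min_len=2, max_len=6, min_repeats=3):
    """Return (start, motif, repeat_units) for perfect tandem repeats.

    RNA input is accepted; U is converted to T before searching.
    """
    seq = seq.upper().replace("U", "T")
    hits = []
    for k in range(min_len, max_len + 1):
        # A k-mer followed by at least (min_repeats - 1) copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_primitive(motif):  # skip e.g. "AA" or "ATAT" as motifs
                hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


# Hypothetical toy sequences (not taken from miRBase).
print(find_ssrs("GGACUCUCUCUGA"))  # [(3, 'CT', 4)]
print(find_ssrs("ATGATGATGCC"))    # [(0, 'ATG', 3)]
```

Running such a function over every precursor sequence and counting the sequences with at least one hit would give the SSR-bearing count used for the relative counts in the Results.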

To identify different functional RNA motifs, including Splicing Regulatory Elements (SREs), the SSR-bearing pre-miRNA transcripts were analysed with the integrated web server RegRNA 2.0 [44]. The pre-miRNAs bearing SSRs with SRE activity were selected for further analysis. To find out whether an SSR motif had any effect in determining SREs, each SSR sequence motif was deleted manually from its corresponding pre-miRNA. The trimmed pre-miRNAs (after deletion of the SSR motifs) were then again subjected to RegRNA analysis to assess the effects of the SSRs.

An experimental approach to generate deletion constructs for SSRs in pre-miRNAs was carried out using normal PCR with miRNA-specific forward and reverse primers and deletion PCR with primers designed to avoid the SSR region in the pre-miRNAs. Out of 15 sets of miRNAs, five were chosen for the study: miR156b, miR164b, miR166f, miR167c and miR2936. The primer sequences are given in Additional file 3. Total RNA was isolated from in vitro seedlings of wild-type Arabidopsis thaliana (Col-0) using the mirVana™ miRNA Isolation Kit (Ambion) according to the manufacturer’s instructions. About 50 ng of total RNA was subjected to reverse transcription (RT) using the TaqMan MicroRNA Reverse Transcription Kit (Applied Biosystems™) in the presence of 0.15 μL of 100 mM dNTPs, 1.00 μL of MultiScribe Reverse Transcriptase (50 U/μL), 1.5 μL of 10X reverse transcription buffer, 0.19 μL of RNase inhibitor (20 U/μL) and 1.0 μL of reverse primer in a total reaction volume of 15 μL, under the following cycling conditions: 16 °C for 30 min, 42 °C for 30 min, 85 °C for 5 min and a final hold at 4 °C. The first-strand cDNA was converted into double-stranded cDNA (dscDNA) by a secondary PCR with 1.0 μL of template from the first-strand cDNA synthesis reaction, 1.0 μL of 10X Advantage 2 PCR buffer (Clontech), 200 μM of each dNTP (50X dNTP mix), 0.5 μL of 10 μM forward primer, 0.5 μL of 10 μM reverse primer and 0.5 μL of 50X Advantage 2 Polymerase Mix in a total reaction volume of 10.0 μL. The reaction was subjected to the following PCR conditions: 95 °C for 7 min; 35 cycles of 95 °C for 30 s, 60 °C for 60 s and 72 °C for 1 min; and a final extension at 72 °C for 10 min. Separate reactions were carried out for normal PCR and deletion PCR, with all constituents the same except for the primers. The first-strand and second-strand cDNA PCR products were checked on a 1.2% agarose gel to study the effects of deleting SSRs in the selected pre-miRNAs.
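The reagent volumes given above can be converted into final concentrations and enzyme units with the usual dilution relation (final concentration = stock concentration × stock volume / total volume). The short Python sketch below works through a few of the stated values. The function names are hypothetical, and the numbers are taken directly from the reaction set-up described in this section.

```python
def final_concentration(stock_conc, stock_volume_ul, total_volume_ul):
    """Dilution relation C_final = C_stock * V_stock / V_total.

    The result is in the same unit as stock_conc (mM, uM or X).
    """
    return stock_conc * stock_volume_ul / total_volume_ul


def total_units(units_per_ul, volume_ul):
    """Enzyme units added to the reaction."""
    return units_per_ul * volume_ul


# Reverse transcription reaction (15 uL total volume).
rt_dntp_mM = final_concentration(100, 0.15, 15)  # 100 mM dNTP stock
rt_buffer_x = final_concentration(10, 1.5, 15)   # 10X RT buffer
rt_enzyme_u = total_units(50, 1.00)              # MultiScribe, 50 U/uL
rnase_inhibitor_u = total_units(20, 0.19)        # RNase inhibitor, 20 U/uL

# Secondary PCR (10 uL total volume).
pcr_primer_uM = final_concentration(10, 0.5, 10)  # each 10 uM primer
pcr_buffer_x = final_concentration(10, 1.0, 10)   # 10X Advantage 2 buffer

print(f"RT dNTPs: {rt_dntp_mM:.2f} mM, RT buffer: {rt_buffer_x:.2f}X")
print(f"Reverse transcriptase: {rt_enzyme_u:.2f} U, "
      f"RNase inhibitor: {rnase_inhibitor_u:.2f} U")
print(f"PCR primers: {pcr_primer_uM:.2f} uM each, "
      f"PCR buffer: {pcr_buffer_x:.2f}X")
```

Both buffers come out at 1X in their respective reactions, which is a quick consistency check on the stated volumes.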

Abbreviations

ARF: Auxin response factor
bHLH: basic/helix-loop-helix
BZIP: Basic region/leucine zipper motif
GRAS: GRAS transcription factor
HLZ: Helix-leucine zipper motif
MYB: MYB transcription factor
NAC: No Apical Meristem domain
SBP: Squamosa promoter-binding protein domain
WRKY: WRKY transcription factor
ZF: Zinc finger

References

  1. Krishnan J, Mishra RK. Code in the non-coding. Proc Indian Nat Sci. 2015;81(3):609–28.

  2. Litt M, Luty JA. A hypervariable microsatellite revealed by in vitro amplification of a dinucleotide repeat within the cardiac muscle actin gene. Am J Human Gen. 1989;44:397–401.

  3. Kim PM, et al. Analysis of copy number variants and segmental duplications in the human genome: evidence for a change in the process of formation in recent evolutionary history. Gen Res. 2008;18:1865–74.

  4. Ayers NM, McClung AM, Larkin PD, Bligh HFJ, Jones CA, Park WD. Microsatellites and a single nucleotide polymorphism differentiate apparent amylose classes in an extended pedigree of US rice germplasm. Theor Appl Genet. 1997;94:773–81.

  5. Bao JS, et al. QTL for rice grain quality based on a DH population derived from parents with similar apparent amylose content. Euphytica. 2002;128:317–24.

  6. Yaish MWF, Pérez De La Vega M. Isolation of (GA) n microsatellite sequences and description of a predicted MADS-box sequence isolated from common bean (Phaseolus vulgaris L.). Gen Mol Biol. 2003;26:337–42.

  7. Li CD, Zang XQ, Eckstein P, Rossnagel BG, Scoles GJ. A polymorphic microsatellite in the limit dextrinase gene of barley (Hordeum vulgare L.). Mol Breed. 2000;5:569–77.

  8. Fujimori S, et al. A novel feature of microsatellites in plants: a distribution gradient along the direction of transcription. FEBS Lett. 2003;554:17–22.

  9. Grabczyk E, Kumari D, Usdin K. Fragile X syndrome and Friedreich's ataxia: two different paradigms for repeat induced transcript insufficiency. Brain Res Bull. 2001;56:367–73.

  10. Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004;116:281–97.

  11. Joy N, Asha S, Mallika V, Soniya EV. De novo transcriptome sequencing reveals a considerable bias in the incidence of simple sequence repeats towards the downstream of ‘pre-miRNAs’ of black pepper. PLoS One. 2013;8:56694. https://doi.org/10.1371/journal.pone.0056694.

  12. Chen M, Tan Z, Zeng G, Peng J. Comprehensive analysis of simple sequence repeats in pre-miRNAs. Mol Biol Evol. 2010;27:2227–32.

  13. Nithin C, Patwa N, Thomas A, Bahadur RP, Basak J. Computational prediction of miRNAs and their targets in Phaseolus vulgaris using simple sequence repeat signatures. BMC Plant Biol. 2015;15:140.

  14. Shang X, Cao Y, Ma L. Alternative splicing in plant genes: a means of regulating the environmental fitness of plants. Int J Mol Gen. 2017;18:432.

  15. Reddy AS, Rogers MF, Richardson DN, Hamilton M, Ben-Hur A. Deciphering the plant splicing code: experimental and computational approaches for predicting alternative splicing and splicing regulatory elements. Front Plant Sci. 2012;3:18. https://doi.org/10.3389/fpls.2012.00018.

  16. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921.

  17. Sawyer LA, Hennessy JM, Peixoto AA, Rosato E, Parkinson H, Costa R, Kyriacou CP. Natural variation in a Drosophila clock gene and temperature compensation. Science. 1997;278:2117–20.

  18. Hamada H, Seidman M, Howard BH, Gorman CM. Enhanced gene expression by the poly (dT-dG) poly (dC-dA) sequence. Mol Cell Biol. 1984;4:2622–30.

  19. Young LJ, Winslow JT, Nilsen R, Insel TR. Species differences in V1a receptor gene expression in monogamous and nonmonogamous voles: behavioral consequences. Behav Neurosci. 1997;111:599–605.

  20. Mahmoudi T, Katsani KR, Verrijzer CP. GAGA can mediate enhancer function in trans by linking two separate DNA molecules. EMBO J. 2002;21:1775–81.

  21. Srivastava S, Puri D, Garapati HS, Dhawan J, Mishra RK. Vertebrate GAGA factor associated insulator elements demarcate homeotic genes in the HOX clusters. Epigen Chromatin. 2013;6:8. https://doi.org/10.1186/1756-8935-6-8.

  22. Van Steensel B, Delrow J, Bussemaker HJ. Genomewide analysis of Drosophila GAGA factor target genes reveals context-dependent DNA binding. Proc Natl Acad Sci. 2003;100:2580–5.

  23. Meloni R, Albanèse V, Ravassard P, Treilhou F, Mallet J. A tetranucleotide polymorphic microsatellite, located in the first intron of the tyrosine hydroxylase gene, acts as a transcription regulatory element in vitro. Hum Mol Genet. 1998;7:423–8.

  24. Lin SL, Miller JD, Ying SY. Intronic microRNA (miRNA). Biomed Res Int. 2006;4:26818. https://doi.org/10.1155/JBB/2006/26818.

  25. Lindow M, Jacobsen A, Nygaard S, Mang Y, Krogh A. Intragenomic matching reveals a huge potential for miRNA-mediated regulation in plants. PLoS Comput Biol. 2007;3:e238. https://doi.org/10.1371/journal.pcbi.0030238.

  26. Olivero M, et al. Amplification of repeat-containing transcribed sequences (ARTS): a transcriptome fingerprinting strategy to detect functionally relevant microsatellite mutations in cancer. Nucleic Acids Res. 2003;31:e33. https://doi.org/10.1093/nar/gng033.

  27. Yuan Z, Sun X, Liu H, Xie J. MicroRNA genes derived from repetitive elements and expanded by segmental duplication events in mammalian genomes. PLoS One. 2011;6:e17666. https://doi.org/10.1371/journal.pone.0017666.

  28. SanMiguel P, et al. Nested retrotransposons in the intergenic regions of the maize genome. Science. 1996;274:765–8.

  29. Farazi TA, Juranek SA, Tuschl T. The growing catalog of small RNAs and their association with distinct Argonaute/Piwi family members. Development. 2008;135:1201–14.

  30. Borchert GM, Lanier W, Davidson BL. RNA polymerase III transcribes human microRNAs. Nat Struct Mol Biol. 2006;13:1097–101.

  31. Morgante M, Hanafey M, Powell W. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat Genet. 2002;30:194–200.

  32. Li YC, Korol AB, Fahima T, Nevo E. Microsatellites within genes: structure, function, and evolution. Mol Biol Evol. 2004;21:991–1007.

  33. Yaish MWF, dela Vega MP. Isolation of (GA)n microsatellite sequences and description of a predicted MADS-box sequence isolated from common bean (Phaseolus vulgaris L.). Genet Mol Biol. 2003;26(3):337–42.

  34. Yan K, et al. Stress-induced alternative splicing provides a mechanism for the regulation of microRNA processing in Arabidopsis thaliana. Mol Cell. 2012;48:521–31.

  35. Kelley K, Chang SJE, Lin SL. Mechanism of repeat-associated microRNAs in fragile X syndrome. Neural Plasticity. 2012;2012:104796. https://doi.org/10.1155/2012/104796.

  36. Marquez Y, Brown JW, Simpson C, Barta A, Kalyna M. Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis. Genome Res. 2012;22:1184–95.

  37. Pan Q, et al. Quantitative microarray profiling provides evidence against widespread coupling of alternative splicing with nonsense-mediated mRNA decay to control gene expression. Genes Dev. 2006;20:153–8.

  38. Mishra AK, Agarwal S, Jain CK, Rani V. High GC content: critical parameter for predicting stress regulated miRNAs in Arabidopsis thaliana. Bioinformation. 2009;4:151–4.

  39. Rolle K. The sequence and structure determine the function of mature human miRNAs. PLoS One. 2016;11:e0151246. https://doi.org/10.1371/journal.pone.0151246.

  40. Melamed ZE, et al. Alternative splicing regulates biogenesis of miRNAs located across exon-intron junctions. Mol Cell. 2013;50:869–81.

  41. Saini HK, Griffiths-Jones S, Enright AJ. Genomic analysis of human microRNA transcripts. Proc Natl Acad Sci. 2007;104:17719–24.

  42. Janas MM, et al. Feed-forward microprocessing and splicing activities at a microRNA-containing intron. PLoS Genet. 2011;7:e1002330. https://doi.org/10.1371/journal.pgen.1002330.

  43. Temnykh S, DeClerck G, Lukashova A, Lipovich L, Cartinhour S, et al. Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genome Res. 2001;11:1441–52.

  44. Chang TH, Huang HY, Hsu JB, Weng SL, Horng JT, Huang HD. An enhanced computational platform for investigating the roles of regulatory RNA and for identifying functional RNA motifs. BMC Bioinform. 2013;14(Suppl 2):S4.

Acknowledgements

We thank Dr. Martin Krzywinski, Staff Scientist, Canada’s Michael Smith Genome Sciences Centre, for the Circos plot, and the Department of Science and Technology and the Department of Biotechnology, Govt. of India, for financial support.

Funding

NJ is supported by the Woman Scientist Scheme (WOS-A) of the Department of Science and Technology, Government of India, and MBYP is supported by the Council of Scientific and Industrial Research (CSIR), Government of India.

Availability of data and materials

The pre-miRNAs analysed in this work were extracted from miRBase (www.mirbase.org), a publicly available database of miRNAs.

Author information

Contributions

NJ and EVS conceived the experiments; NJ and MBYP conducted the experiments and analysed the results. All authors reviewed, read, and approved the final manuscript.

Corresponding authors

Correspondence to Nisha Joy or E. V. Soniya.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

The frequency and distribution pattern of SSRs in the pre-miRNAs across different taxa. (PDF 162 kb)

Additional file 2:

RegRNA analysis of SSR-bearing pre-miRNAs occurring in A. thaliana identified different functional RNA motifs. (PDF 149 kb)

Additional file 3:

List of primer sequences used to carry out Normal and Deletion PCR for five sets of miRNAs selected. (PDF 96 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Joy, N., Maimoonath Beevi, Y.P. & Soniya, E.V. A deeper view into the significance of simple sequence repeats in pre-miRNAs provides clues for its possible roles in determining the function of microRNAs. BMC Genet 19, 29 (2018). https://doi.org/10.1186/s12863-018-0615-x

  • DOI: https://doi.org/10.1186/s12863-018-0615-x

Keywords