Microsatellites grant more stable flanking genes
BMC Research Notesvolume 5, Article number: 556 (2012)
Microsatellites, or simple sequence repeats (SSRs), are DNA sequences that include tandem copies of specific sequences no longer than six bases. SSRs are ubiquitous in all genomes and highly mutable.
Presentation of the hypothesis
Results from previous studies suggest that flanking regions of SSR are exhibit high stability in a wide range of organisms. We hypothesized that the SSRs ability to discard weak DNA polymerases could be responsible for this unusual stability. . When the weak polymerases are being decayed over SSRs, the flanking sequences would have higher opportunity to be replicated by more stable DNA polymerases. We present evidence of the molecular basis of our hypothesis.
Testing the hypothesis
The hypothesis could be tested by examining the activity of DNA polymerase during and after a number of PCRs. The PCR reactions should be run with the same SSR locus possessing differences in the SSR length. The hypothesis could also be tested by comparing the mutational rate of a transferred gene between two transformations. The first one has a naked T-DNA (transferred DNA), while the second one has the same T-DNA flanked with two SSRs.
Implications of the hypothesis
In any transformation experiment, flanking the T-DNA fragment with SSR sequences would result in more stably transferred genes. This process would decrease the unpredictable risks that may occur because of the mutational pressure on this foreign segment.
Microsatellite or simple sequence repeats (SSRs) are tandem repeated DNA sequences including tandem copies of specific sequences no longer than six bases. SSRs are ubiquitously distributed almost in all eukaryotic and prokaryotic genomes. Their abundance and the tandem repeated nature of SSRs make the loci highly mutable loci. Two mechanisms have been attributed to this phenomenon namely unequal crossing over and slipping of the DNA polymerase when it run over the SSR template strand allowing one strand to hybridize with one of the multi-complimentary tandem sequences on the other strand. These mechanisms results in SSR loci being highly mutable; this unique mutational ratio creates a need to align these loci in a special way.
There are a lot of factors that affect the mutational ratio. The longer SSRs are more variable than shorter ones; however there is no threshold for slippage occurrence. It appears that SSR mutations are dependent on motif size and nucleotide content. Besides, position in genome (coding or non-coding sequences), presence on leading or lagging strand and the distance from origin of replication also affect SSR mutations. Further, fidelity of replication and repair mechanisms as well as epigenetic factors may influence repeat stability.
Despite the high mutation rate of SSRs, their flanking regions exhibit high stability even among different taxa; there is a negative correlation between SSR length and substitution rate in nearby flanking sequence[10, 11]. SSRs are increasingly being used as genetic markers for a wide range of applications such as evolution and diversity studies, genetic mapping and forensic studies.
Presentation of the hypothesis
Our new hypothesis assumes that SSRs may have the power to discard weak DNA polymerases and keep the more robust ones. This selective capacity of SSRs can lead to more accurate amplification of the microsatellite flanking regions. Further, SSR slippage may occur during the decaying and the replacment of weak polymerases. The following paragraphs would present a review on the stability of SSR flanking regions in many organisms and some supportive evidence- related to the conditions of the polymerization as well as our new molecular view of how the selection is done.
The stability of SSR flanking regions
The stability of the SSR flanking regions was investigated in different organisms; three Phytophthora species (algae); rice and bamboo; cotton; Brassica rapa and Arabidopsis thaliana; cowpea, mung bean and adzuki bean; Castanea spp.; Carya and wheat. Similar studies have also been carried out in different animals such as fish; Acropora; cattle, sheep, yak, buffalo and goat; Probarbus jullienii and gerbils; and in human genome studies where different SSR's flanking regions associated with genes expressed in the developing nervous system were compared. In addition, flanking regions of EST-SSR loci demonstrated a common evolutionary origin of grass fungal endophytes taxa.
Recently results with pepper chloroplast genome showed that there are two sites called inverted repeat (IRa and IRb) containing high frequency of tandem repeats. In this study, most of the hotspot regions in pepper chloroplast genome seem to be relayed at the middle of the biggest fragment LSC (large single copy) –about 87 kb– which is far from the IR sites.
These studies provide indication that SSRs may help to protect their flanking regions from different kinds of mutations.
Evidences related to polymerization conditions
Viguera et al. reported that the DNA polymerase dissociates from the synthesize strand when the slippage occurs. We think that this dissociation support our assumption when SSRs replace weak DNA polymerases.
It has been reported that SSRs found on the lagging strand shows higher instability than those on the leading strand without any explanation. This lends credence to our assumption because of the presence of Okazaki fragments when lagging strand is being replicated. Replicating those fragments would allow more DNA polymerases to act on them leading to a higher opportunity for weaker DNA polymerases to replicate on the lagging strand.
Further, a significant increase in DNA polymerase slippage rates was reported as a result of inefficient concentrations of ions (particularly Mg ions) during DNA replication[29, 31]. It has been reported that magnesium-induced the assembly of a complete DNA polymerase catalytic complex.
Kumar et al. reported that the misbalancing of dNTPs increases the mutagenesis ratio. Consistent with our hypothesis, we suggest that the long motifs with one or two types of nucleotides could cause more decayed DNA polymerases because they may create a local misbalancing of dNTPs by using limited types of nucleotides which may enhance mutation occurrences.
Figure1 illustrates a ribbon diagram of one of the DNA polymerases (PDB code - 1BPX). As observed, both the template (blue) and the synthetic strands (yellow) are joined with bracket-like proteomic subunit colored with red and pink. This subunit forms a pipe that fit the 3D structure of the double stranded DNA[35, 36]. The structure of this subunit prevents the DNA from formation of possible secondary structures (like the loop that occurs when the DNA polymerase slips)[36, 37]. While this subunit is stable, the whole protein would be stable and the synthetic strand would have less errors. After building a huge number of bases, the instability of the DNA polymerase would increase as well as the errors rate. Therefore, when the weak polymerase gets over an SSR locus, it would create a suitable environment to generate such loop because of the multi-complementary sequences. In vivo, the DNA polymerase would stop releasing double stranded DNA. Further, the helicase enzyme would still be pushing unwinded strand toward the DNA polymerase[39–41]. This would bring heavy pressure on the weak polymerase. We assumed that when polymerase slips, its domains would separate and the free end would split from the other domain causing a decayed protein and the polymerase would lose its polymerization activity. It's also observed that this slippage happens in vitro during PCR reaction[5, 31] and it has previously been reported that repeats undergo deletion if replication temperature is high. Figure2 demonstrates a diagram describing the whole process.
Testing the hypothesis and implications of the hypothesis
When trying to improve genetic modified organisms, it would be helpful to add an SSR region to the T-DNA. Flanking the transferred gene with SSRs on both sides would decrease the unpredictable risks that may occur if the foreign T-DNA fragment mutates. Microsatellites have been used in the T-DNA as a molecular marker in order to select the transgenic cucumber lines. Microsatellites of T-DNA showed high differences in their stability. For instance, the CAG repeat of the Huntington disease exposes high stability in transgenic mice. Another study showed an instable transferred microsatellite. However, none of these studies or other publications that used transgenic organisms focused their studies on the stability of microsatellite flanking regions.
Our assumption could be tested by comparing the mutation rate of a transferred gene between two cases - the control and testing cases. The first is the control case, which has the T-DNA, while the second, the testing case, has the same T-DNA flanked by two SSRs. If there were significant differences between both cases, the hypothesis might be applicable. The hypothesis could also be tested by checking the DNA polymerase activities during and after many PCRs. All reactions should be carried out for the same SSR locus with differences in the SSR length and another reaction should be done without the SSR itself.
Tautz D: Hypervariability of simple sequences as a general source for polymorphic DNA markers. Nucl Acids Res. 1989, 17: 6563-6571.
Merkel A, Gemmell N: Detecting short tandem repeats from genome data: opening the software black box. Brief Bioinform. 2008, 9 (5): 355-366. 10.1093/bib/bbn028.
Chistiakov DA, Hellemans B, Haley CS, Law AS, Tsigenopoulos CS, Kotoulas G, Bertotto D, Libertini A, Volckaert FA: A microsatellite linkage map of the European sea bass Dicentrarchus labrax L. Genetics. 2006, 170: 1821-1826.
Kelkar YD, Tyekucheva S, Chiaromonte F, Makova KDL: The genome-wide determinants of human and chimpanzee microsatellite evolution. Genome Res. 2008, 18 (1): 30-38.
Ellegren H: Microsatellites: Simple sequences with complex evolution. Nat Rev Genet. 2004, 5: 435-445.
Gow C, Noble JL, Rollinson D, Jones C: A High incidence of clustered microsatellite mutations revealed by parent-offpring analysis in an African freshwater snail, Bulinus forskalii (Gastropoda, Pulmonata). Genetica. 2005, 124: 77-83. 10.1007/s10709-005-0204-6.
Jighly A, Hamwieh A, Ogbonnaya FC: Optimization of sequence alignment for SSR (Simple Sequence Repeat) regions. BMC Res Notes. 2011, 4: 239-10.1186/1756-0500-4-239.
Leclercq S, Rivals E, Jarne P: DNA slippage occurs at Microsatellite loci without minimal threshold length in Humans: a comparative genomic approach. Genome Biol Evol. 2010, 2: 325-335. 10.1093/gbe/evq023.
Choudhary OP, Trivedi S: Microsatellite or simple sequence repeat (SSR) instability depends on repeat characteristics during replication and repair. J Cell and Molecular Biol. 2010, 8 (2): 21-34.
Santibanez-Koref MF, Gangeswaran R, Hancock JM: A relationship between lengths of Microsatellites and nearby substitution rates in Mammalian Genomes. Mol Biol Evol. 2001, 18 (11): 2119-2123. 10.1093/oxfordjournals.molbev.a003753.
Sekar M, Suresh E, Kumar NS, Mayak SK, Balakrishna C: Microsatellite DNA markers, a fisheries perspective. Aquaculture Asia Magazine. 2009, 14 (2): 27-29.
Schlotterer C: The evolution of molecular markers just a matter of fashion. Nat Rev Genet. 2004, 5: 63-69. 10.1038/nrg1249.
Schena L, Cardle L, Elcooke D: Use of genome sequence data in the design and testing of SSR markers for Phytophthora species. BMC Genomics. 2008, 9: 620-10.1186/1471-2164-9-620.
Chen S, Lin Y, Lin C, Chen W, Yang CH, Ku H: Transferability of rice SSR markers to bamboo. Euphytica. 2010, 175: 23-33. 10.1007/s10681-010-0159-2.
Altaf-Khan M, Qureshi SN, Saha S, Jenkins JN, Brubaker CL, Reddy OU: Usefulness of SSR Derived from Tetraploid Gossypium spp. for Analyses of Diploid Gossypium spp. J Crop Improvement. 2006, 16: 1-20. 10.1300/J411v16n01_01.
Suwabe K, Tsukazaki H, Iketani H, Hatakeyama K, Kondo M, Fujimura M, Nunome T, Fukuoka H, Hirai M, Matsumoto S: Simple sequence repeat-based comparative genomics between Brassica rapa and Arabidopsis thaliana The genetic origin of Clubroot resistance. Genetics. 2006, 173: 309-319. 10.1534/genetics.104.038968.
Vir R, Bhat KV, Lakhanpaul S: Transferability of sequence tagged microsatellite sites (STMS) primers to pulse yielding taxa belonging to Phaseolae. IJIB. 2009, 5: 62-66.
Akkak A, Boccacci P, Marinoni DT: Cross-Species Amplification of Microsatellite Markers in Castanea spp And Other Related Species. Acta Hort. (ISHS). 2010, 866: 195-201.
Herrera AM, Grauke LJ, Klein P: Validation Of 14 Nuclear SSR Markers Among Different Species Of The Genus Carya. Plant & Animal Genomes XIX Conference, Town & Country Convention Center. 2011, P153-
Zhang L, Sun G, Yan Z, Chen Q, Yuan Z, Lan X, Zheng Y, Liu D: Comparison of Newly Synthetic Hexaploid Wheat with Its Donors on SSR Products. J Genet Genomics. 2007, 34 (10): 939-946. 10.1016/S1673-8527(07)60105-2.
Zardoya R, Vollmer DM, Craddock C, Streelman JT, Karl S, Meyer A: Evolutionary conservation of microsatellite flanking regions and their use in resolving the phylogeny of cichlid fished (Pisces:Perciformes). Proc R Soc Lond B. 1996, 263: 1589-1598. 10.1098/rspb.1996.0233.
Tang P, Wei NV, Chen C, Wallace CC, Allen C: Comparative Study of Genetic Variability of AAT and CTGT Microsatellites in Staghorn Coral, Acropora (Scleractinia Acroporidae). Zoological Studies. 2010, 49 (5): 657-668.
Shakyawar SK, Joshi BK, Kumar D: SSR repeat dynamics in mitochondrial genomes of five domestic animal species. Bioinformation. 2009, 4 (4): 158-163. 10.6026/97320630004158.
Ghiasi N, Rashid ZA, Hooshmand S, Yusoff K, Tan SG, Bhassu S: The use of locus specific microsatellite markers for detecting genetic variation in hatchery bred Probarbus jullienii. Biotechnology. 2009, 8: 166-170. 10.3923/biotech.2009.166.170.
Du X, Chen Z, Li W, Tan Y, Lu J, Zhu X, Zhao T, Dong G, Zeng L: Development of Novel Microsatellite DNA Markers by Cross-Amplification and Analysis of Genetic Variation in Gerbils. J Hered. 2010, 101 (6): 710-716. 10.1093/jhered/esq066.
Riley DE, Krieger JN: Embryonic nervous system genes predominate in searches for dinucleotide simple sequence repeats flanked by conserved sequences. Gene. 2009, 429: 74-79. 10.1016/j.gene.2008.09.025.
De Jong E, Guthridge KM, Spangenberg GC, Forster GW: Sequence Analysis of SSR-Flanking Regions Identifies Genome Affinities between Pasture Grass Fungal Endophyte Taxa. Int J Evol Biol. 2011, 10: 1-11.
Jo YD, Park J, Kim J, Song W, Hur C, Lee Y, Kang B: Complete sequencing and comparative analyses of the pepper (Capsicum annuum L.) plastome revealed high frequency of tandem repeats and large insertiondeletions on pepper plastome. Plant Cell Rep. 2011, 30: 217-229. 10.1007/s00299-010-0929-2.
Viguera E, Canceill D, Ehrlich SD: Replication slippage involves DNA polymerase pausing and dissociation. J Mol Biol. 2001, 20 (10): 2587-2595.
Wang G, Vasquez KM: Models for chromosomal replication-independent non-B DNA structure-induced genetic instability. Mol Carcinog. 2009, 48 (4): 286-298. 10.1002/mc.20508.
Viguera E, Canceill D, Ehrlich SD: In vitro Replication Slippage by DNA Polymerases from Thermophilic Organisms. J Mol Biol. 2001, 312 (2): 323-333. 10.1006/jmbi.2001.4943.
Batra VK, Beard WA, Shock DD, Krahn JM, Pedersen LC, Wilson SH: Magnesium-induced assembly of catalytic complex. Structure. 2006, 14: 757-766. 10.1016/j.str.2006.01.011.
Kumar D, Abdulovic AL, Viberg J, Nilsson AK, Kunkel TA, Chabes A: Mechanisms of mutagenesis in vivo due to imbalanced dNTP pools. Nucl Acids Res. 2010, 39: 1-12.
Sawaya MR, Prasad R, Wilson SH, Kraut J, Pelletier H: Crystal structures of human DNA polymerase beta complexed with gapped and nicked DNA: evidence for an induced fit mechanism. Biochemistry. 1997, 36: 11205-11215. 10.1021/bi9703812.
Ollis DL, Brick P, Hamlin R, Xuong NG, Steitz TA: Structure of large fragment of Escherichia coli DNA polymerase I complexed with dTMP. Nature. 1985, 313: 762-766. 10.1038/313762a0.
Steitz TA: A mechanism for all polymerases. Nature. 1998, 391: 231-232. 10.1038/34542.
Hubscher U, Spadari S, Villani G, Giovanni M: DNA Polymerases, Discovery. 2010, Ltd: Characterization and Functions in Cellular DNA Transactions. World Scientific Publishing Co. Pte
Nyrén P: Enzymatic method for continuous monitoring of DNA polymerase activity. Anal Biochem. 1987, 167 (2): 235-238. 10.1016/0003-2697(87)90158-8.
Lopes M, Foiani M, Sogo JM: Multiple mechanisms control chromosome integrity after replication fork uncoupling and restart at irreparable UV lesions. Mol Cell. 2006, 21: 15-27. 10.1016/j.molcel.2005.11.015.
Takahashi TS, Wigley DB, Walter JC: Pumps, paradoxes and ploughshares: mechanism of the MCM2-7 DNA helicase. Trends Biochem Sci. 2005, 30: 437-444. 10.1016/j.tibs.2005.06.007.
Bochman ML, Schwacha A: The Mcm2-7 complex has in vitro helicase activity. Mol Cell. 2008, 31: 287-293. 10.1016/j.molcel.2008.05.020.
Sarkar PS, Chang HC, Boudi FB, Reddy S: CTG repeats show bimodal amplification in E. coli. Cell. 1998, 95 (4): 531-540. 10.1016/S0092-8674(00)81620-7.
Zhao L, Liao F, Wang CY, Miao C, Min WA, Li DS, Cheng MAB, Yu HX, Qiang WZ, Ju WR, Gang LM: Generation of transgenic cucumbers with expression of a ten-tandem repeat long acting GLP-1 analogue and their biological function on diabetic rats. Chinese Sci Bull. 2009, 54: 4658-4663. 10.1007/s11434-009-0699-9.
Goldberg YP, Kalchman MA, Metzler M, Nasir J, Zeisler J, Graham R, Koide HB, Kusky JO, Sharp AH, Ross CA, Jirik F, Hayden MR: Absence of disease phenotype and intergenerational stability of the CAG repeat in transgenic mice expressing the human Huntington disease transcript. Hum Mol Genet. 1996, 5 (2): 177-185. 10.1093/hmg/5.2.177.
Adachi H, Kume A, Li M, Nakagomi Y, Niwa H, Do J, Sang C, Kobayashi Y, Doyu M, Sobue G: Transgenic mice with an expanded CAG repeat controlled by the human AR promoter show polyglutamine nuclear inclusions and neuronal dysfunction without neuronal cell death. Hum Mol Genet. 2001, 10 (10): 1039-1048. 10.1093/hmg/10.10.1039.
The authors would like to thank Dr. Francis C. Ogbonnaya for his valuable comments and for reviewing the final draft.
The authors declare that they have no competing interests.
AJ and RJ discussed the hypothesis, drafted and approved the final manuscript.