Skip to main content

Development of microsatellite markers for the Japanese endemic conifer Thuja standishii and transfer to other East Asian species



Design polymorphic microsatellite loci that will be useful for studies of the genetic diversity, gene-flow and reproduction in the Japanese endemic conifer Thuja standishii and test the transferability of these loci to the two other East Asian species, T. sutchuenensis and T. koraiensis.


Fifteen loci were developed which displayed 3 to 21 alleles per locus (average = 9.2) among 97 samples from three populations of T. standishii. Observed heterozygosity for all samples varied between 0.33 and 0.75 (average = 0.54) while expected heterozygosity values were higher with an average over the 15 loci of 0.62 (0.37–0.91). Low multi-locus probability of identity values (< 0.00002) indicate that these markers will be effective for identifying individuals derived from clonal reproduction. All 15 loci amplified in 13 samples of T. sutchuenensis, the sister species of T. standishii, with 1 to 11 alleles per locus (average = 4.33) while 13 loci amplified in four samples of the more distantly related T. koraiensis with 1 to 5 alleles per locus (average = 2.15).


Thuja is a small Cupressaceae genus consisting of five extant species with three species in East Asia and two in North America [6]. Thuja standishii (Gordon) Carriere (kurobe or nezuko in Japanese and Japanese arbor-vitae in English) is endemic to Japan where it has a scattered distribution from 40.67˚N to 33.49˚N on the islands of Honshu, Shikoku and Dogo (Shimane Prefecture). The species is most commonly a component of subalpine forests but also occurs in a variety of habitats including warm temperate forest (rarely), cool temperate forest, moorland and near the alpine zone. The species can grow as a single-stemmed tree up to 35 m tall but occurs as a multi-stemmed shrub under 1 m tall at the maximum limit of its elevational range [15]. It can purportedly reach a great age with trunks up to 3.5 m in diameter (Giant Tree Database, Biodiversity Center of Japan ( and individuals over 1000 years old.

Unlike most other Cupressaceae conifers of Japan, T. standishii has received little research attention with basic information on its conservation (including the impact of past logging), reproductive biology and genetic diversity, lacking. The rarity of forest dominated by T. standishii and its current insignificant role in forestry probably underlies this lack of research. However, the species is undoubtedly an important part of Japan’s biodiversity and cultural heritage. For example, it is one of Japan’s five precious timber trees (Kiso go-boku) that from the early 18th century were strictly protected from cutting in the Kiso region of central Honshu [8] and, in some parts of Japan, forests containing T. standishii are considered to represent the most untouched forests remaining in the landscape (e.g. [13]. Small population size and geographic isolation, its vulnerability to ring barking by deer and the impacts of past logging [15] has resulted in some populations being of conservation concern, especially in western Japan where the species is very rare [15]. One key aspect of the species biology that is poorly understood is the role of asexual reproduction in its regeneration. However, similarly to two other Japanese Cupressaceae conifers, Thujopsis dolobrata and C. pisifera that have been proven to regenerate clonally [3, 4], T. standishii also forms dense understory banks of juveniles (Worth personal observation) which may be clonally derived.

In the current age of genomics, microsatellites retain a vital role in biology due to their high information content, utility in a wide range of genetic applications and cost-efficient nature [5]. This study describes the development of Expressed Sequence Tagged (EST) nuclear microsatellite markers for T. standishii using next generation sequencing. These markers will be useful molecular tools to inform the conservation of this species via studies of its range-wide genetic diversity, gene flow and reproductive biology. In addition, the transferability of the markers was tested in the two other East Asian species for which no microsatellite markers have yet been developed including the Chinese endemic Thuja sutchuenensis, the sister species of T. standishii [7, 10], and T. koraiensis.

Main text

Materials and methods

Total RNA was extracted from an individual of T. standishii collected from the Forestry and Forest Products Research Institute Arboretum using a plant RNA isolation mini kit (Agilent Technologies, USA). An RNA-seq data set was constructed by the Beijing Genomics Institute on an Illumina HiSeq4000 platform. The T. standishii RNA-seq data consisted of 38,076,160 paired-end reads of 100 bp length. De novo assembly was undertaken in CLC Genomics Workbench 8.5.1 and the 53,614 resultant contigs (N50 = 1503 bp) were mined for microsatellite regions. Primers were developed bordering these regions with default settings using PrimerPro ( Microsatellites were selected if the number of tandem repeat units was greater than eight and if the microsatellite was located less than 25 bp from the beginning or end of the contig. These criteria resulted in 64 microsatellite primer pairs which were trialled for amplification in four samples. A total of 36 primer pairs successfully amplified and were subsequently tested for size heterogeneity in eight samples representative of the species range. For all loci, the forward primer was synthesized with one of three different M13 sequences (5′-GCCTCCCTCGCGCCA-3′, 5′-GCCTTGCCAGCCCGC-3′, and 5′-CAGGACCAGGCTACCGTG-3′), and the reverse was tagged with a pig-tail (5′-GTTTCTT-3′; [2]). The PCR reactions were performed following the standard protocol of the Qiagen Multiplex PCR Kit (Qiagen, Hilden, Germany), and consisted of a 10 uL reaction volume, containing approximately 5 ng of DNA, 5 uL of 2× Multiplex PCR Master Mix, and 0.06 uM of forward primer, 0.1 uM of reverse primer, and 0.08 uM of fluorescently labelled M13 primer. The PCR thermocycle consisted of an initial denaturation at 95 °C for 3 min; followed by 35 cycles of 95 °C for 30 s, 60 °C for 3 min, 68 °C for 1 min; and a 20 min extension at 68 °C. The PCR products were separated by capillary electrophoresis on an ABI3130 Genetic Analyzer (Life Technologies, Waltham, MA, USA) with the GeneScan 600 LIZ Size Standard (Life Technologies) and genotyping was done in GeneMarker (SoftGenetics, LLC, PA, USA). Overall, 15 loci were found to amplify reliably, display polymorphism and were readily scorable. The genetic variability of these 15 markers were tested in three populations from Atebi Daira Small Bird Forest, Mt Chausu Nature Park, in Nagano Prefecture (35.2286°N, 137.6673°E), Mt Torigata in Kouchi Prefecture (33.4936° N, 133.0638° E) and Mt Yamizo in Fukushima Prefecture (36.9343°N, 140.2679°E). The 15 primer pairs were also tested in 13 samples of T. sutchuenensis and four of T. koraeinsis (Additional file 1: Table S1). Genetic analyses were undertaken in GenAlEx 6.5 [9] and Genepop 4.2 [11]. In addition, a similarity search of the contigs containing the 15 loci was conducted by the BLASTX algorithm [1] against the National Center for Biotechnology Information (NCBI) non-redundant protein sequences (nr) database.

The multi locus probability of identity (PID) for the 15 markers, that is, the probability that two individuals drawn at random from a population will have the same genotype [14], was calculated in Gimlet version 1.3.3 [12] using all 97 samples from the three population of T. standishii. Three PID estimates outlined by Waits et al. [14] were estimated: biased PID, which assumes individuals mate randomly; unbiased PID, which corrects for sampling a small number of individuals and, sibs PID, which assumes the population is composed of siblings.


In T. standishii, the 15 loci (Table 1) displayed 3 to 21 alleles over the three populations with an average of 9.2 alleles per locus (Table 2). Overall observed heterozygosity varied between 0.33 and 0.75 (average = 0.54) while expected heterozygosity values were generally higher (0.37–0.91 with an average of 0.62). At the population level, the number of alleles observed per locus varied from 2 to 15 (average = 5.43) with six loci showing more than four alleles in each of the three populations. No significant deviations from Hardy–Weinberg equilibrium expectations were detected for any loci except for Kurobe_4219 in the Atebi Daira and Mt Yamizo populations (P ≤ 0.0004). Additionally, allele frequencies appeared independent among loci with no significant linkage disequilibrium detected after Bonferroni correction.

Table 1 Characteristics of the 15 microsatellite markers developed for Thuja standishii
Table 2 Genetic diversity of the 15 polymorphic nuclear microsatellites assessed across the three populations of Thuja standishii

Multi-locus probability of identity values were below the threshold value of 0.01 considered by Waits et al. [14] to be required to reliably distinguish between individual genotypes, even under the sibs PID (Additional file 1: Table S2). This indicates that our markers will be effective for both identifying individuals derived from clonal reproduction and sexually derived individuals in populations even where inbreeding is prevalent.

All 15 loci amplified in T. sutchuenensis (two loci being monomorphic) with 1 to 11 alleles per locus (average = 4.33) while average observed and expected heterozygosity were 0.43 and 0.48, respectively (Table 3). On the other hand, only 13 loci amplified in T. koraiensis with three loci being monomorphic. In this species, 1 to 5 alleles per locus (average = 2.15) were found with average observed and expected heterozygosity of 0.44 and 0.31, respectively (Table 3).

Table 3 Genetic diversity of the 15 microsatellite loci in T. sutchuenensis and T. koraiensis


The development of EST microsatellites for Thuja standishii will enable new genetic research into this important Japanese endemic conifer including studies of range-wide level genetic diversity and gene flow and also stand-level processes such as inbreeding and clonality. The development of molecular markers may help to foster research into this species, which because of its wide ecological range, from warm temperate forests to near the alpine zone, is an ideal species to investigate ecological and genetic processes under strongly contrasting climates.

The transferability of the 15 loci was consistent with the phylogenetic relationships of the East Asian Thuja [7, 10]. Thus, all 15 loci successfully amplified in the sister species of T. standishii, T. sutchuenensis, and displayed considerable allelic diversity with up to 11 alleles per locus. These loci, therefore, may be particularly applicable for use in genetic studies of this geographically restricted endangered species [16]. In contrast, two of the fifteen loci did not amplify in the more distantly related T. koraiensis and the number of alleles per locus (ranging from 1 to 5 alleles) was low although this low allelic diversity is likely to also be due to the low number of samples tested.


  • The number of published microsatellite markers may be too low for optimal performance of some genetic analyses.

  • These microsatellite loci have not been tested in the two North American species, T. plicata and T. occidentalis.

  • We did not afford much time optimizing loci, therefore some polymorphic loci that may have worked with further effort may have been excluded.

Availability of data and materials

The 38,076,160 paired-end reads are deposited in the NCBI BioProject Database, Accession number: PRJNA554973. The genotype data for every individual are available on demand to the corresponding author.



base pair


expressed sequence tag


ribonucleic acid


  1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.

    Article  CAS  Google Scholar 

  2. Brownstein MJ, Carpten JD, Smith JR. Modulation of non-templated nucleotide addition by Taq DNA polymerase: primer modifications that facilitate genotyping. Biotechniques. 1996;20:1004–6.

    Article  CAS  Google Scholar 

  3. Hasegawa Y, Takata K, Tsutomu Y, Gaku H, Saitoh T. Clone identification among saplings in a natural forest of Thujopsis dolabrata var. hondae using multiplex EST-SSR analysis. J Jpn For Soc. 2015;97:261–5 (in Japanese).

    Article  CAS  Google Scholar 

  4. Hayakawa T, Tomaru N, Yamamoto S. Stem distribution and clonal structure of Chamaecyparis pisifera growing in an old-growth beech-conifer forest. Ecol Res. 2004;19:411–20.

    Article  Google Scholar 

  5. Hodel RGJ, et al. The report of my death was an exaggeration: a review for researchers using microsatellites in the 21st century. Appl Plant Sci. 2016;4:1600025.

    Article  Google Scholar 

  6. LePage BA. A new species of Thuja (Cupressaceae) from the Late Cretaceous of Alaska: implications of being evergreen in a polar environment. Am J Bot. 2003;90:167–74.

    Article  Google Scholar 

  7. Li JH, Xiang QP. Phylogeny and biogeography of Thuja L. (Cupressaceae), an eastern Asian and North American disjunct genus. J Integr Plant Biol. 2005;47:651–9.

    Article  CAS  Google Scholar 

  8. Mertz M. Wood and traditional woodworking in Japan. 2nd ed. Otsu City: Kaiseisha Press; 2011.

    Google Scholar 

  9. Peakall ROD, Smouse PE. Genalex 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes. 2006;6:288–95.

    Article  Google Scholar 

  10. Peng D, Wang XQ. Reticulate evolution in Thuja inferred from multiple gene sequences: implications for the study of biogeographical disjunction between eastern Asia and North America. Mol Phylogenet Evol. 2008;47:1190–202.

    Article  CAS  Google Scholar 

  11. Raymond M, Rousset F. GENEPOP (version 1.2): population genetics software for exact tests and ecumenicism. J Hered. 1995;86:248–9.

    Article  Google Scholar 

  12. Valiere N. GIMLET: a computer program for analysing genetic individual identification data. Mol Ecol Notes. 2002;2:377–9.

    CAS  Google Scholar 

  13. Wada K. The vegetation of Mt Nagisodake at the south-western part of Nagano Prefecture. Bull Inst Nat Educ Shiga Heights Shinshu Univ. 1994;31:21–6.

    Google Scholar 

  14. Waits LP, Luikart G, Taberlet P. Estimating the probability of identity among genotypes in natural populations: cautions and guidelines. Mol Ecol. 2001;10:249–56.

    Article  CAS  Google Scholar 

  15. Worth JRP. Current distribution and climatic range of the Japanese endemic conifer Thuja standishii (Cupressaceae). Bull FFPRI. 2019;18:275–88.

    Google Scholar 

  16. Yang Y, Li N, Christian T, Luscombe D. Thuja sutchuenensis. The IUCN red list of threatened species. 2013; 3.T32378A2816862. Accessed 22 July 2019.

Download references


We thank H. Kanehara for her assistance with DNA extractions and S. Kikuchi, I. Tamaki and I. Tsuyama for help in the field.


This work was supported by the Japanese Society for the Promotion of Science Grant-in-Aid for Young Scientists A (Grant number 16748931) and the Chinese Special fund for basic scientific research business of central public research institutes (Grant number CAFRIFEEP201401). These funding bodies had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations



JRPW, KC, YH and AQ devised the study and did the sample collection. JRPW designed the primers, undertook fragment analysis and genotyping. JRPW did the data analysis and wrote the manuscript. All authors participated in the draft the final manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to James R. P. Worth.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Sampling information for T. sutchuenensis and T. koraiensis. Apart from two samples from Halla Arboretum on Jeju Island, all samples were derived from natural populations. Table S2. The probability of identity (PID) values for each of the 15 polymorphic loci per locus and the multi-locus values.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Worth, J.R.P., Chang, K.S., Ha, YH. et al. Development of microsatellite markers for the Japanese endemic conifer Thuja standishii and transfer to other East Asian species. BMC Res Notes 12, 694 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: