Chloroplast markers for the Malvaceae and the plastome of Henderson’s checkermallow (Sidalcea hendersonii S.Wats.), a rare plant from the Pacific Northwest
BMC Research Notes volume 16, Article number: 87 (2023)
Sidalcea is a genus of flowering plants restricted to the west coast of North America, commonly known as checkermallows. Remarkably, of the ~ 30 recognized species, 16 are of conservation concern (vulnerable, imperilled or critically imperilled). To facilitate biological studies in this genus, and in the wider Malvaceae, we have sequenced the whole plastid genome of Sidalcea hendersonii. This will allow us both to check those regions already developed as general Malvaceae markers in a previous study, and to search for new regions.
By comparing the Sidalcea genome to that of Althaea, we have identified a hypervariable circa 1 kb region in the short single copy region. This region shows promise for examining phylogeographic pattern, hybridization and haplotype diversity. Remarkably, considering the conservation of plastome architecture between Sidalcea and Althaea, the former has a 237 bp deletion in the otherwise highly conserved inverted repeat region. Newly designed primers provide a PCR assay to determine presence of this indel across the Malvaceae. Screening of previously designed chloroplast microsatellite markers indicates two markers with variation within S. hendersonii that would be useful in future population conservation genetics.
Sidalcea (Malvaceae: Malveae) is a genus of ~ 30 species restricted to western North America, commonly called checkermallows. The Malvaceae is a large family containing such diverse plants as cacao (Theobroma cacao L.), cotton (Gossypium spp.), linden trees (Tilia species) and kapok (Ceiba pentandra (L.) Gaertn.). Sidalcea is a member of the tribe Malveae which mainly comprises herbaceous plants and shrubs. A recent study of Malveae phylogeny  divides the Malveae into two major clades and places the Sidalcea alliance (Sidalcea along with the related Callirhoe) in clade B which also contains the Malva alliance (Malva, Alcea, Lavatera, Althaea, etc.). Sidalcea itself has been the subject of detailed phylogenetic and evolutionary study [2,3,4,5], with particular interest focused on the evolution and maintenance of the gynodioecious mating system found in at least nine Sidalcea species [6,7,8].
Many of the Sidalcea species are narrow endemics, or locally rare, and several are of conservation concern (Table 1). One of these is Sidalcea hendersonii S.Wats. (Henderson’s checkermallow). Although this is the most widespread of the checkermallows, occurring from Oregon to Alaska, it is locally restricted wherever it occurs . It is commonest in Washington State, where it was first collected by Louis Henderson in 1887. It is extremely rare in Oregon, where it has declined sharply since 1950 and is now only known from the Siuslaw River estuary [9, 10]. In 2003, it was discovered in Alaska, where it is only known from a single very small population at one locality [9, 11, 12]. In British Columbia, it is only known from the extreme south east of the Province , the main centre being the estuary of the Fraser River, but it is also found in southern Vancouver Island and the Southern Gulf Islands. It is a wetland species of estuarine swamps or coastal marshes and is vulnerable to environmental change, such as coastal land use alterations, river channel embankment and the spread of alien plants such as Lythrum salicaria . In consequence, in many regions it appears to be declining.
A previous study that used plastome sequencing of cacao (Theobroma cacao L.) to develop plastid microsatellite markers for cacao and other Malvaceae also tested these primers on four species of Sidalcea . Four of these cacao markers amplified successfully in the Sidalcea species, and three were polymorphic. This suggests that chloroplast markers can be of wide cross-species utility in assaying haplotype variation in the Malvaceae as a whole and in Sidalcea species in particular. Such markers could be useful for phylogeographic studies, as well as for studying haplotype diversity in populations and in detecting interspecific hybridization. To extend the range of resources available for Sidalcea, a genus presenting numerous species of conservation concern, we have therefore sequenced the whole plastome of an exemplar, Sidalcea hendersonii, and screened individuals of this species with select plastid microsatellite markers.
DNA extraction and illumina sequencing, assembly and annotation
Leaf samples of two individuals (one female and one hermaphrodite) from a population of Sidalcea hendersonii near Vancouver, British Columbia were collected into silica gel. Additional sampling for microsatellite marker screening was done from leaf material taken from herbarium specimens in the UBC Beaty Museum (Table 2). All specimens have vouchers deposited at the UBC Beaty Museum herbarium (UBC) with collection and identifier information available through the UBC herbarium Beaty Museum database (see data availability statement below). Total DNA was extracted using a modified version of the CTAB method from . Illumina sequencing followed methods described previously . Briefly, NEXTflex™ (Bioo Scientific Crop, Austen, TX, USA) was used for library construction. Fragment selection (400 bp) used Agencourt AMPure Xp™ magnetic beads (Beckman Coulter Genomics, Danvers, MA, USA). The two libraries were each sequenced on a single lane using 0.2 flow cells on an Illumina HiSeq-2000 sequencer generating 100 bp paired-end reads, giving a raw yield of 4–5 Gb per library and a chloroplast coverage per library of > x1000. The reads were assembled into complete plastomes, separately for each library, using CLC Genomic Workbench v.7.0.2 (CLC Bio). The sequences were annotated using cpGAVAS  with Althaea officinalis (GenBank: NC034701) as reference. The two annotations were then cross-checked using CHLOROBOX tools GeSeq  and GB2sequin , and graphically visualized with OGDRAW .
Sequence analysis and marker testing
To search for regions of the plastid genome that are variable in the Malveae, the sequence was compared to the Althaea plastome sequence using zPicture . Based on observed length variation, we designed new primers using Primer-BLAST  to test for the presence of a 237 bp indel [DP-SID2-indelF: 5’ TCCCGATTCATGGATCTCTCG 3’, DP-SID2-indelR: 5’ TGCCTTTTCTATTGATTCCTACGG 3’] across three subfamilies of Malvaceae.
Putative polymorphic microsatellite regions were tested using four of the primer pairs given in  (Table 2). Forward primers were synthesized with the universal M13 sequence TGTAAAACGACGGCCAGT. PCR amplifications for the microsatellites were performed in a final volume of 15 µl, containing 25 ng of DNA, 1× reaction PCR buffer (10 mM Tris-HCl pH 8.3, 50 mM KCl), 2.5 mM of each dNTP, 1.5 mM MgCl2, 1U of Taq DNA polymerase (Fermentas Canada, Burlington, Ontario, Canada), 0.1 µM forward primer, 0.5 µM reverse primer, and 0.5 µM fluorescently labeled M13 primer. Products were visualized using an ABI 3730 automated DNA Sequencer (Applied Biosystems). Primers were tested on seven individuals of Sidalcea hendersonii from four populations, and three individuals of S. oregana (Nutt. ex Torr. & A.Gray) A.Gray from two populations as well as on three other species (S. campestris Greene, S. glaucescens Greene, and S. nelsoniana Piper) (Table 2).
Plastome features and comparison with Althaea
On assembly, the chloroplast genome was retrieved as a single contig. The two assemblies (female and hermaphrodite) were identical. Although no haplotype polymorphism was discovered, the two independently sequenced and assembled samples being identical gives us strong confidence in the assembly. The female Sidalcea hendersonii plastome sequence has been deposited in GenBank (OP780018).
The Sidalcea hendersonii plastome is 159,663 bp long (Fig. 1a), comparable to the 159,987 bp Althaea officinalis plastid genome. The structure and gene content are identical and there are no major rearrangements (Fig. 1b). This is despite the two species being in different clades of the tribe Malveae of the Malvaceae: the Sidalcea alliance vs. the Malva alliance. This suggests that members of the tribe Malveae have conserved plastid organization. Small indels are found, scattered through the genome, accounting for the 324 bp difference in overall length between the two species. The largest indel is a 237 bp deletion (Appendix 1) in the usually conserved inverted repeat (IR) in Sidalcea (around position 155,430 within ycf2). This deletion appears to be unique to Sidalcea (Fig. 1c), although we have not tested for it within the Sidalcea alliance clade. From sequences on GenBank, it is clear that other members of the Malvaceae subfamily Malvoideae (Hibisceae: Hibiscus, Abelmoschus, Malveae: Althaea) do not have this sequence deletion, neither do members of other subfamilies (Bombacoideae: Bombax; Tilioideae: Tilia; Byttnerioideae: Theobroma; Helicteroideae: Durio; Sterculioideae: Firmiana). Neither does this deletion appear to be present generally in rosid eudicots. The distribution of the deletion in the Sidalcea alliance remains to be more widely tested. We were able, with the newly designed primers [DP-SID2-indelF, DP-SID2-indelR], to confirm presence of the deletion in all four populations of S. hendersonii sampled (Table 2), as well as absence of the deletion in subfamily Dombeyoideae: Trochetiopsis erythroxylon (G.Forst.) Marais, (St Helena redwood), and subfamily Byttnerioideae: Herrania balaensis P. Preuss and Theobroma cacao (Fig. 1c). In addition, a few hotspots of variation were observed, with an especially hypervariable region at around 134,000 bp (c. 133,400–134,400 in Sidalcea; c. 133,600–134,600 in Althaea) in the short single copy (SSC) region (Fig. 1d). The alignment of this region is shown in Appendix 2. We consider this hypervariable region as a promising region in which to look for infraspecific and interspecific haplotype variation. Two of the four plastid microsatellites (CaCrSSR2 and CaCrSSR8) tested showed intraspecific variation within Sidalcea hendersonii (Table 2).
In making the plastome of Sidalcea hendersonii available we hope to stimulate further interest in this fascinating genus, containing as it does many species of critical conservation concern . As has previously been pointed out , the lack of variation in many traditionally sequenced chloroplast regions has held back phylogeography in plants relative to animals. This has been somewhat alleviated by the ability to sequence multiple plastid genomes within a species [25, 26], but this is expensive. An alternative route, and the one used here, is to sequence one or a few plastomes per species and identify evolutionary hotspots, regions of elevated SNP variation of active homopolymer repeat stretches (“chloroplast microsatellites” ). This sort of variation is then amenable to standard, and cheaper laboratory methods. Such tools can produce conservation-relevant insights. For instance, the recent discovery of S. hendersonii disjunct in Alaska (noted above) raises the question of whether this is a result of long distance dispersal from southern populations or the survival of a population that may represent a divergent infraspecific evolutionary lineage and therefore merit particular conservation attention. The identification of haplotype variation as detailed here (e.g., the two polymorphic chloroplast microsatellites), could potentially help in answering this question, and others like it.
Current rarity and threatened status of Sidalcea hendersonii and other Sidalcea species presents difficulties in obtaining sufficient sampling for population analysis; additionally obtaining reasonable quality DNAs from historical samples in herbaria is also challenging.
The annotated Sidalcea hendersonii plastid genome is publicly available from GenBank (OP780018). Additional sample information for herbarium specimens sampled for the chloroplast microsatellites is available from the UBC herbarium Beaty Museum database (https://databases.beatymuseum.ubc.ca/). All other data generated or analysed during this study are included in this published article [and its supplementary information files].
Tate JA, Aguilar JF, Wagstaff SJ, La Duke JC, Slotta TAB, Simpson BB. Phylogenetic relationships within the tribe Malveae (Malvaceae, subfamily Malvoideae) as inferred from ITS sequence data. Am J Bot. 2005;92(4):584–602. https://doi.org/10.3732/ajb.92.4.584.
Andreasen K, Baldwin BG. Unequal evolutionary rates between annual and perennial lineages of checker mallows (Sidalcea, Malvaceae): evidence from 18S–26S rDNA internal and external transcribed spacers. Mol Biol Evol. 2001;18(6):936–44. https://doi.org/10.1093/oxfordjournals.molbev.a003894.
Andreasen K, Baldwin BG. Re-examination of relationships, habital evolution, and phylogeography of checker mallows (Sidalcea; Malvaceae) based on molecular phylogenetic data. Am J Bot. 2003a;90(3):436–44. https://www.jstor.org/stable/4124162.
Andreasen K, Baldwin BG. Nuclear ribosomal DNA sequence polymorphism and hybridization in checker mallows (Sidalcea, Malvaceae). Mol Phylogenet Evol. 2003b;29(3):563–81. https://doi.org/10.1016/S1055-7903(03)00136-2.
Whittall J, Liston A, Gisler S, Meinke RJ. Detecting nucleotide additivity from direct sequences is a SNAP: an example from Sidalcea (Malvaceae). Plant Biol (Stuttg). 2000;2(2):211–7. https://doi.org/10.1055/s-2000-9106.
Ashman TL. Indirect costs of seed production within and between seasons in a gynodioecious species. Oecologia. 1992;92:266–72. https://doi.org/10.1007/BF00317374.
Marshall M, Ganders FR. Sex-biased seed predation and the maintenance of females in a gynodioecious plant. Am J Bot. 2001;88(8):1437–43. https://doi.org/10.2307/3558451.
Schultz ST. Sexual dimorphism in gynodioecious Sidalcea hirtipes (Malvaceae). I. seed, fruit, and ecophysiology. Int J Plant Sci. 2003;164(1):165–73. https://doi.org/10.1086/344550.
Kelly L. Endangered and threatened Wildlife and plants; 90-Day finding on a petition to list Sidalcea hendersonii (Henderson’s checkermallow) as threatened or endangered. Fed Reg. 2006;71(32):8252–7.
Gisler MM, Love RM. Henderson’s checkermallow: the natural, botanical and conservation history of a rare estuarine species. Kalmiopsis. 2005;12:1–8. https://www.npsoregon.org/kalmiopsis/kalmiopsis12/checkermallow.pdf.
Stensvold MC, Anderson E. Henderson’s checkerbloom (Sidalcea hendersonii) newly discovered in Alaska. Alaska Region: Published online report for the USDA Forest Service; 2005.
Goldstein MI, Martin D, Stensvold MC. Forest Service Alaska Region Sensitive Species List. Assessment and proposed revisions to the 2002 list. Alaska Region: Published online report for the USDA Forest Service; 2009.
Denoth M, Myers JH. Competition between Lythrum salicaria and a rare species: combining evidence from experiments and long-term monitoring. Plant Ecol. 2007;191(2):153–61. https://www.jstor.org/stable/40212932.
Yang JY, Motilal LA, Dempewolf H, Maharaj K, Cronk QCB. Chloroplast microsatellite primers for cacao (Theobroma cacao) and other Malvaceae. Am J Bot. 2011;98(12):e372–4. https://doi.org/10.3732/ajb.1100306.
Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 1987;19(1):11–5.
Yang JY, Lambdon P, Williams V, Cronk QC. Plastid markers from whole plastome sequencing for gene-flow studies in the endangered endemic bellflowers of St Helena, Wahlenbergia angustifolia and Wahlenbergia linifolia (Campanulaceae). Conserv Genet Resour. 2017;9(1):131–7. https://doi.org/10.1007/s12686-016-0612-1.
Liu C, Shi L, Zhu Y, Chen H, Zhang J, Lin X, Guan X. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences. BMC Genom. 2012;13:715. https://doi.org/10.1186/1471-2164-13-715.
Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, Greiner S. GeSeq – versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017;45(W1):W6–W11. https://doi.org/10.1093/nar/gkx391.
Lehwark P, Greiner S. GB2sequin - a file converter preparing custom GenBank files for database submission. Genomics. 2019;111:759–61. https://doi.org/10.1016/j.ygeno.2018.05.003.
Greiner S, Lehwark P, Bock R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019;47(W1):W59–W64. https://doi.org/10.1093/nar/gkz238.
Ovcharenko I, Loots GG, Hardison RC, Miller W, Stubbs L. zPicture: dynamic alignment and visualization tool for analyzing conservation profiles. Genome Res. 2004;14(3):472–7. https://doi.org/10.1101/gr.2129504.
Ye J, Coulouris G, Zaretskaya I, Cutcutache I, Rozen S, Madden T. Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinform. 2012;13:134. https://doi.org/10.1186/1471-2105-13-134.
Natureserve. 2022. NatureServe Web Service. Arlington, VA. U.S.A. http://explorer.natureserve.org. (Accessed: 07 December 2022).
Schaal BA, Hayworth DA, Olsen KM, Rauscher JT, Smith WA. Phylogeographic studies in plants: problems and prospects. Mol Ecol. 1998;7(4):465–74. https://doi.org/10.1046/j.1365-294x.1998.00318.x.
Kane N, Sveinsson S, Dempewolf H, Yang JY, Zhang D, Engels JM, Cronk Q. Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA. Am J Bot. 2012;99(2):320–9. https://doi.org/10.3732/ajb.1100570.
Huang DI, Hefer CA, Kolosova N, Douglas CJ, Cronk QC. Whole plastome sequencing reveals deep plastid divergence and cytonuclear discordance between closely related balsam poplars, Populus balsamifera and P. trichocarpa (Salicaceae). New Phytol. 2014;204(3):693–703. https://doi.org/10.1111/nph.12956.
Provan J, Powell W, Hollingsworth PM. Chloroplast microsatellites: new tools for studies in plant ecology and evolution. Trends Ecol Evol. 2001;16(3):142–7. https://doi.org/10.1016/S0169-5347(00)02097-8.
We thank Frank Lomer (Honorary Research Associate, UBC Herbarium) for information on Sidalcea hendersonii sites in British Columbia and for kindly collecting samples. We would also like to thank Andrea Stevenson for assisting with DNA extraction. Information on the conservation status of Sidalcea species was obtained from NatureServe (www.natureserve.org) and its network of Natural Heritage Program members through its NatureServe Explorer web portal. We acknowledge NatureServe as a leading source of information about rare and endangered species and threatened ecosystems. We thank one anonymous reviewer for helpful comments.
We acknowledge the support of the Natural Sciences and Engineering Research Council of Canada (NSERC).
The authors declare no competing interests.
Ethics approval and consent to participate
Consent for publication
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
About this article
Cite this article
Percy, D.M., Sveinsson, S., Ponomarev, A. et al. Chloroplast markers for the Malvaceae and the plastome of Henderson’s checkermallow (Sidalcea hendersonii S.Wats.), a rare plant from the Pacific Northwest. BMC Res Notes 16, 87 (2023). https://doi.org/10.1186/s13104-023-06357-4