Expression level of a flavonoid 3′-hydroxylase gene determines pathogen-induced color variation in sorghum
BMC Research Notes volume 7, Article number: 761 (2014)
Sorghum (Sorghum bicolor L. Moench) accumulates 3-deoxyanthocyanidins and exhibits orange to purple coloration on parts of the leaf in response to infection with the fungus Bipolaris sorghicola. We aimed to identify the key genes determining this color variation.
Sorghum populations derived from Nakei-MS3B and M36001 accumulated apigeninidin, or both apigeninidin and luteolinidin, in different proportions in lesions caused by B. sorghicola infection, suggesting that the relative proportions of the two 3-deoxyanthocyanidins determine color variation. QTL analysis and genomic sequencing indicated that two closely linked loci on chromosome 4, containing the flavonoid 3′-hydroxylase (F3′H) and Tannin1 (Tan1) genes, were responsible for the lesion color variation. The F3′H locus in Nakei-MS3B had a genomic deletion resulting in the fusion of two tandemly arrayed F3′H genes. The recessive allele at the Tan1 locus derived from M36001 had a genomic insertion and encoded a non-functional WD40 repeat transcription factor. Whole-mRNA sequencing revealed that expression of the fused F3′H gene was conspicuously induced in purple sorghum lines. The levels of expression of F3′H matched the relative proportions of apigeninidin and luteolinidin.
Expression of F3′H is responsible for the synthesis of luteolinidin; the expression level of this gene is therefore critical in determining color variation in sorghum leaves infected with B. sorghicola.
Sorghum (Sorghum bicolor L. Moench) is a rich source of phytochemicals, including certain 3-deoxyanthocyanidins, dhurrin, and sorgoleone. 3-deoxyanthocyanidins are not commonly found in higher plants, but sorghum accumulates them in response to pathogen infection[1, 5–7]. One 3-deoxyanthocyanidin, luteolinidin, is toxic to fungi and accumulates at increased levels in sorghum lines resistant to the anthracnose fungus[5, 8]. Sorghum that accumulates 3-deoxyanthocyanidins exhibits various changes in coloration after infection with B. sorghicola. The sorghum REDforGREEN mutant accumulates >1000-fold higher amounts of the 3-deoxyanthocyanidins luteolinidin and apigeninidin (and variants) than the wild type and exhibits intense red–purple coloration of the leaves[6, 9]. However, the enzymes required for 3-deoxyanthocyanidin synthesis have not been fully identified, and the key genes determining color variation remain to be elucidated.
Functional genomic studies of sorghum began after its genome sequencing was completed in 2009[10, 11]. Whole-genome sequencing of sorghum BTx623 has revealed that many genes are duplicated and tandemly arrayed. Each gene may have developed different functions related to a particular biochemical reaction. The sequence similarity of these duplicated genes makes it difficult to distinguish the expression of gene members of this family by using polymerase chain reaction (PCR)- or oligonucleotide array-based technology. Given the rapid progress of next-generation sequencing technology, shotgun sequencing of whole transcripts—so called RNA-seq—has been used for the profiling of gene expression in sorghum in response to infection with the fungus Bipolaris sorghicola, the cause of target leaf spot[12, 13]. 3-deoxyanthocyanidin biosynthesis after infection with B. sorghicola occurs through the coordinated expression of genes encoding the catalysts of sequential reactions; these catalysts include phenylalanine ammonia lyase, trans-cinnamate 4-monooxygenase, 4-coumarate:CoA ligase, chalcone synthase (CHS), chalcone isomerase (CHI), dihydroflavonol 4-reductase (DFR), and putative anthocyanidin reductase. De novo transcriptome assembly has revealed that transcripts derived from B. sorghicola induce a defense response in sorghum. Transcriptome analysis is a powerful tool for identifying the key genes expressed among family members.
Here, we aimed to identify the key genes determining color variation in sorghum. For this purpose, we used sorghum populations derived from Nakei-MS3B (which has purple B. sorghicola lesions) × M36001 (which shows no color change with B. sorghicola infection); this population shows a gradation of different colors. We performed a metabolic analysis to identify accumulated pigments, a quantitative trait locus (QTL) analysis to map candidate genes, and whole mRNA sequencing to comprehensively identify the genes expressed. We found that the expression levels of a particular flavonoid 3′-hydroxylase (F3′H) gene on chromosome 4 matched the relative proportions of the 3-deoxyanthocyanidins apigeninidin and luteolinidin, and this gene was thus responsible for the gradual variation of colors in sorghum leaves infected with B. sorghicola.
Plant materials and phenotyping
The sorghum cultivars Nakei-MS3B and M36001 were used as parents. A mapping population was established from a cross between these cultivars. For the plant color test, the F2 population was grown at Shinshu University in Nagano, Japan, in 2011 and inoculated with barley seeds colonized by Bipolaris sorghicola. At Tsukuba, Ibaraki, Japan, in 2012, the F3 populations were subjected to high-density genetic mapping and mRNA-seq analysis. Accumulated pigments were quantified by using LC-MS/MS as described previously.
Marker development and genetic mapping
A mapping population was established from a cross between the sorghum cultivars Nakei-MS3B and M36001. We used 150 F2 progeny; the 122 progeny with color changes in their lesions were used for bulk mapping of purple or orange leaf color, with an analysis of 172 sorghum SSR markers as described previously. The major SSR markers used for the genetic mapping of plant color are shown in Additional file1: Table S1. QTL analysis was performed for the entire population by using Windows QTL Cartographer ver. 2.5 (http://statgen.ncsu.edu/qtlcart/WQTLCart.htm). The F2 intercross algorithm and default linkage criteria [LOD (logarithm (base 10) of odds) 3.0 and 50 cM maximum distance] were applied. The Kosambi function was used to establish genetic distances.
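As an illustration of the distance calculation mentioned above, the following minimal sketch implements the standard Kosambi mapping function, which converts a recombination fraction into a map distance in centimorgans. The function names are hypothetical and are not taken from Windows QTL Cartographer; the sketch only shows the formula, not how the software applies it internally.

```python
import math


def kosambi_cm(r: float) -> float:
    """Kosambi map distance (cM) for a recombination fraction r in [0, 0.5)."""
    if not 0.0 <= r < 0.5:
        raise ValueError("recombination fraction must be in [0, 0.5)")
    # d (Morgans) = 1/4 * ln((1 + 2r) / (1 - 2r)); multiply by 100 for cM.
    return 25.0 * math.log((1.0 + 2.0 * r) / (1.0 - 2.0 * r))


def kosambi_r(d_cm: float) -> float:
    """Inverse Kosambi function: map distance (cM) to recombination fraction."""
    # r = 1/2 * tanh(2d), with d expressed in Morgans.
    return 0.5 * math.tanh(2.0 * d_cm / 100.0)
```

For small recombination fractions the distance is close to 100 × r; the correction for crossover interference grows as r approaches 0.5.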
Construction and screening of a sorghum BAC library
A BAC (bacterial artificial chromosome) library was constructed from young leaves of Nakei-MS3B; it contained 39,267 clones (average insert size 134 kb). We used conventional methods, namely partial DNA digestion with HindIII, size fractionation of high-molecular-weight DNA by pulsed-field gel electrophoresis (CHEF; Bio-Rad Laboratories, USA), vector ligation (pIndigoBAC-5; Epicentre Biotechnologies, Madison, WI, USA), and transformation into E. coli (DH10B strain). Positive BAC clones covering the region of the F3′H gene were screened from the library by PCR amplification with tightly linked DNA markers and subjected to shotgun sequencing to give approximately 10-fold sequence coverage using a previously described method. A BAC clone from Nakei-MS3B containing an insert from the F3′H region, MS3B_108E24 (183 kb), was identified by PCR analysis using SB20978 and SB20980 (Additional file1: Table S1). The BAC sequences were produced by Sanger shotgun sequencing of subclones followed by assembly of the shotgun sequences. The sequences of candidate genes were obtained from the sorghum genome database (http://www.plantgdb.org) and used for gene expression analysis.
To extract RNA from each plant tissue, five biological replicates were collected, immediately frozen in liquid nitrogen, and mixed to minimize the effect of transcriptome unevenness among plants. Total RNA was extracted by using an RNeasy Plant kit (Qiagen, Hilden, Germany). RNA quality was calculated with a Bioanalyzer 2100 algorithm (Agilent Technologies, Palo Alto, CA, USA); high-quality (RNA Integrity Number >8) RNA was used. Total RNA samples (10 μg) were subjected to cDNA construction for Illumina sequencing, in accordance with the protocol for the mRNA-Seq sample preparation kit (Illumina, San Diego, CA, USA). Oligo (dT) magnetic beads were used to isolate poly (A) RNA from the total RNA samples. The mRNA was fragmented by heating at 94°C for 5 min. First-strand cDNA was synthesized by using random hexamer primers at 25°C for 10 min, 42°C for 50 min, and 70°C for 15 min. After the first strand had been synthesized, dNTPs, RNaseH, and DNA polymerase I were added to synthesize second-strand DNA for 2.5 h at 16°C. The ends of double-stranded cDNA were repaired by using T4 DNA polymerase and Klenow DNA polymerase and phosphorylated by using T4 polynucleotide kinase. A single “A” base was added to the cDNA molecules by using Klenow exonuclease, and the fragments were ligated to the paired end (PE) adapters from the Illumina mRNA-Seq kit. cDNA with 200 ± 25-bp fragments was collected. The purified cDNA was amplified by using 15 cycles of PCR at 98°C for 10 s, 65°C for 30 s, and 72°C for 30 s using PE1.0 and PE2.0 primers.
We used an in-house program to trim out low-quality nucleotides (<Q15) from both the 5′- and the 3′-ends of the reads until a stretch of 3 bp or more of high-quality (≥Q15) nucleotides appeared. Adaptors were also trimmed out by using Cutadapt version 1.0 (http://code.google.com/p/cutadapt/). We used Bowtie 2 version 2.0.0 beta6 to align the reads against sorghum rRNA gene sequences downloaded from the Plant Repeat Database; aligned reads were removed. The reads were deposited in the DDBJ (DNA Data Bank of Japan) Sequence Read Archive (Accession No. DRA001265).
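The in-house trimming program is not described beyond the rule stated above, so the following is only a sketch of one plausible reading of that rule: bases are removed from each end until a run of at least three consecutive bases with quality ≥Q15 is reached. The function name and the handling of reads with no qualifying run (returned empty) are assumptions.

```python
def trim_read(seq, quals, min_q=15, run=3):
    """Trim low-quality bases from both ends of a read.

    Assumed rule: from each end, drop bases until a stretch of `run`
    consecutive bases with quality >= `min_q` begins.
    Returns the trimmed (sequence, qualities) pair.
    """
    def first_good(qs):
        # Index of the first window of `run` consecutive high-quality bases.
        for i in range(len(qs) - run + 1):
            if all(q >= min_q for q in qs[i:i + run]):
                return i
        return None

    start = first_good(quals)
    if start is None:
        # Assumption: a read with no high-quality stretch is discarded.
        return "", []
    # Scan the reversed qualities to locate the 3' boundary.
    end = len(quals) - first_good(quals[::-1])
    return seq[start:end], quals[start:end]
```

Note that isolated high-quality bases near an end (for example a single Q20 base flanked by low-quality bases) are still trimmed, because they do not begin a run of three.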
The reads were aligned to the sorghum reference genome of BTx623 by using Bowtie 2, SAMtools version 0.1.18, and TopHat version 2.0.4. RPKM (Reads Per Kilobase of exon model per Million mapped reads) values were calculated for each transcript annotated in Phytozome or assembled by using Cufflinks version 2.0.0. Transcripts that were differentially expressed between cutting-stress samples and control samples were detected by using a G-test with a false discovery rate threshold of 0.1%.
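For reference, the RPKM normalization named above can be written as a one-line calculation. This is a minimal sketch of the standard definition; the function and parameter names are illustrative, and the example numbers in the usage note are made up rather than taken from this study.

```python
def rpkm(read_count, exon_length_bp, total_mapped_reads):
    """Reads Per Kilobase of exon model per Million mapped reads.

    RPKM = reads / (exon length in kb * total mapped reads in millions)
         = reads * 1e9 / (exon length in bp * total mapped reads)
    """
    if exon_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and library size must be positive")
    return read_count * 1e9 / (exon_length_bp * total_mapped_reads)
```

As a hypothetical example, 500 reads on a 2,000-bp transcript in a library of 10 million mapped reads give an RPKM of 25.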
The Sb04g024710 gene sequence was aligned to genes from Phytozome and Cufflinks by using BLAST+ version 2.2.26, and the top 50 hits were considered to be Sb04g024710 paralogs. A heatmap of Sb04g024710 paralogs was generated by using R (http://www.R-project.org/) package gplots version 2.10.1 (http://cran.r-project.org/web/packages/gplots/index.html) with log2 values of the RPKM fold changes. Reads mapped to genes in the F3′H region were visualized by using Integrative Genomics Viewer.
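The heatmap values described above are log2-transformed fold changes of RPKM. A minimal sketch of that transformation follows. The pseudocount is an assumption added here to avoid division by zero for unexpressed genes; the text does not say whether or how zero values were handled.

```python
import math


def log2_fold_change(treated_rpkm, control_rpkm, pseudocount=1.0):
    """log2 ratio of treated to control RPKM.

    The pseudocount is an assumption of this sketch, not a documented
    step of the original analysis; it keeps the ratio finite when
    either RPKM value is zero.
    """
    return math.log2((treated_rpkm + pseudocount) / (control_rpkm + pseudocount))
```

With the default pseudocount, a gene at RPKM 7 after treatment and RPKM 1 in the control has a log2 fold change of 2 (a fourfold increase of the shifted values).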
Identification of pigments in sorghum exhibiting different-colored lesions
Sorghum leaves exhibit various colors upon infection with Bipolaris sorghicola. Sorghum populations derived from a cross between Nakei-MS3B and M36001 had spots of purple (Nakei-MS3B, #96), red (#62), or orange (#3, #127), or no color change (M36001) (Figure 1A). Sorghum plants produce the 3-deoxyanthocyanidins apigeninidin and luteolinidin in these lesions. Accumulation of these pigments was confirmed by using thin layer chromatography and high performance liquid chromatography (data not shown). In each line, the color pigments in the lesions were further analyzed by using liquid chromatography – mass spectrometry/mass spectrometry (LC-MS/MS) (Figure 1B), which was confirmed by the retention time and MS and MS/MS of authentic compounds. Apigeninidin and luteolinidin were barely detected before infection but were clearly produced after infection. The purple lesions on Nakei-MS3B and line #96 contained luteolinidin and a small amount of apigeninidin, whereas the orange lesions on line #3 contained only apigeninidin. The red lesions on line #62 contained both luteolinidin and apigeninidin in relatively small proportions. Therefore, the relative proportions of luteolinidin and apigeninidin determined lesion color upon infection with Bipolaris sorghicola.
Mapping of QTLs responsible for color variation in B. sorghicola lesions on sorghum
We identified QTLs determining color variation in the sorghum lesions by using a population derived from Nakei-MS3B × M36001. The 150 F2 individuals segregated for the presence or absence of lesion color change at a ratio of 122:28 (χ2 = 3.20; P = 0.073 for a 3:1 segregation ratio, chi-squared test). Because our aim was to elucidate the genes responsible for color variation, we subjected the 122 colored-lesion F2 progeny to further analysis. The ratio of lesion colors was purple to red to orange = 24:67:31 (χ2 = 1.262; P = 0.532 for 1:2:1 segregation ratio, chi-squared test), suggesting that color variation was controlled by a single semi-dominant locus. Bulk mapping revealed a clear bias toward purple or orange lesion color between simple sequence repeat (SSR) markers SB2623 (44.80 Mb) and SB2925 (66.54 Mb) on chromosome 4, indicating that color variation–related genes were present at a single locus (Figure 2, upper panel). We then subjected 150 F2 plants to genetic mapping, which revealed that the predicted region was split into two regions, between SB2685 (53.07 Mb) and SB2734 (56.42 Mb) for purple and between SB2760 (57.96 Mb) and SB2836 (62.14 Mb) for orange (Figure 2). Further mapping showed that the candidate genes responsible for purple were located in an 880-kb region between SB2703 (54.15 Mb) and SB2710 (55.03 Mb); those for orange were located in a 2.09-Mb region between SB2792 (60.05 Mb) and SB2836 (62.14 Mb).
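The goodness-of-fit test for the 3:1 segregation of colored versus uncolored lesions can be sketched as follows, using only the Python standard library. The function names are illustrative. The P value helper is valid only for one degree of freedom (two phenotype classes), which is why the sketch is applied here to the 122:28 counts only.

```python
import math


def chi_square_stat(observed, ratio):
    """Pearson chi-squared statistic for observed counts against an expected ratio."""
    total = sum(observed)
    expected = [total * r / sum(ratio) for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def p_value_df1(chi2):
    """Upper-tail P value of the chi-squared distribution with 1 degree of freedom."""
    # For df = 1, P(X > x) = erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(chi2 / 2.0))


# Colored vs. uncolored lesions in the 150 F2 plants, tested against 3:1.
chi2 = chi_square_stat([122, 28], [3, 1])
p = p_value_df1(chi2)
```

With these counts the statistic comes out near 3.2 and the P value near 0.073, so the 3:1 ratio is not rejected at the 5% level.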
At the first locus, which was responsible for the purple and was located between the SSR markers SB2703 and SB2710, 85 genes were annotated in Phytozome. At this locus, three F3′H genes (Sb04g024710, Sb04g024730, Sb04g024750) are tandemly arrayed on the BTx623 genome in the Phytozome annotation. F3′Hs are enzymes that introduce a hydroxyl group at the 3′ position of ring B of the flavonoid. We did not perform a complementation experiment on the F3′H proteins encoded by these genes in sorghum, but the proteins encoded by Sb04g024710 and Sb04g024750 (previously named SbF3′H2 and SbF3′H1, respectively) have F3′H activity in the tt7 mutant of Arabidopsis to produce 3′-hydroxylated flavonoids. As the other genes located at the locus were not likely to be involved in flavonoid synthesis, we focused only on these F3′H genes.
The genomic sequences of the F3′H loci were compared in BTx623 (orange) and Nakei-MS3B (purple). BTx623 had three tandemly arrayed F3′H genes (Sb04g024710/SbF3′H2, Sb04g024730/SbF3′H3, and Sb04g024750/SbF3′H1), whereas Nakei-MS3B had a genomic deletion flanked by the 5′-region of Sb04g024710/SbF3′H2 and the 3′-region of Sb04g024730/SbF3′H3, resulting in only two F3′H genes (an Sb04g024710–30 fused gene named Sb04g024710N and Sb04g024750/SbF3′H1) at the locus (Figure 3A). The fused F3′H protein in Nakei-MS3B had two amino acid substitutions in the C-terminal region, namely K503M and A507T (Additional file2: Figure S1). The deletion was detected in lines exhibiting purple lesions (Nakei-MS3B, #96), but not in lines exhibiting orange ones or no color change (#3, #127, M36001; Figure 3B), suggesting that the deletion was inherited from Nakei-MS3B. A heterozygous line (#62) with red lesions contained both alleles (Figure 3B). Genomic PCR analysis confirmed that accessions exhibiting purple lesions after Bipolaris sorghicola infection (i.e. JN43 and Nakei-MS3B) had the deletion, but those exhibiting no color change or orange lesions (i.e. BTx623, bmr-6, and M36001; Figure 3C) did not. This result suggested that there was an association between genomic deletion at the F3′H locus and lesion color.
We compared the genomic sequences of the region upstream of Sb04g024710 between BTx623 and Nakei-MS3B. Two nucleotide substitutions (at positions -144 and -664), and one insertion/deletion (at position -661) were found in the 1000-bp region upstream of the transcription start site of the Sb04g024710 gene (Additional file3: Figure S2). We searched for candidate cis-regulatory elements by using the PLACE (Plant cis-regulatory DNA Elements) program. CTCTT (found only in Nakei-MS3B at position -664) is one of the consensus sequence motifs in promoters activated in the infected cells of root nodules in Vicia faba, Glycine max[28–30]. CACT (found only in BTx623 at position -144) is a key component of Mem1 (mesophyll expression module 1), which is found in the cis-regulatory element of phosphoenolpyruvate carboxylase (ppcA1) in the C4 dicot Flaveria trinervia. GCCAC (found only in Nakei-MS3B at position -644 antisense) is a promoter motif involved in light-induced gene expression in Arabidopsis and rice[32, 33].
At the second locus, which was responsible for the orange and was located between the SSR markers SB2792 and SB2836, we found 243 genes annotated in Phytozome. Among these was the Tannin1 (Tan1) gene (Sb04g031730), which encodes a WD40 repeat transcription factor. Tan1 controls tannin biosynthesis in sorghum, and transforming the sorghum Tan1 open reading frame into a nontannin Arabidopsis mutant restores the tannin phenotype. Tan1 derived from M36001 (which has no color change upon fungus infection) had a 10-bp insertion (CGGGCAGCGG) in the exon region that caused a frame shift at position 921 nt (307aa) (Additional file4: Figure S3), suggesting that this allele encoded a non-functional transcription factor. M36001, #127, and #3 did not accumulate tannin in the seeds (Table 1). Tan1 is similar to PAC1 (Pale aleurone color1), which encodes a regulator of the maize anthocyanin pathway.
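The reasoning behind the frame shift can be checked with simple arithmetic: an insertion whose length is not a multiple of three shifts the reading frame of everything downstream. The sketch below assumes that the reported position (921 nt) is a 1-based coordinate within the coding sequence; the helper names are hypothetical.

```python
# 10-bp insertion reported in the tan1-b allele.
INSERTION = "CGGGCAGCGG"


def causes_frameshift(indel_length):
    """An indel shifts the reading frame unless its length is a multiple of 3."""
    return indel_length % 3 != 0


def codon_number(nt_position):
    """1-based codon index containing a 1-based coding-sequence position.

    Assumes the position is counted from the first base of the start codon.
    """
    return (nt_position + 2) // 3
```

Under that assumption, nucleotide 921 is the last base of codon 307, which agrees with the amino acid position given in the text, and a 10-bp insertion leaves one extra base so the downstream frame is shifted.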
As other candidate genes at the locus, we located genes encoding putative MYB transcription factors (Sb04g031030 and Sb04g031820) or MYB-related proteins (Sb04g030510 and Sb04g031110). The putative MYB transcription factor gene (Sb04g031820) was highly expressed, but its expression level did not change after wounding stress (Additional file5: Table S2). Another putative MYB transcription factor gene (Sb04g031030) and MYB-related protein genes (Sb04g030510 and Sb04g031110) were barely expressed (Additional file5: Table S2). Therefore, we focused on the Tan1 (Sb04g031730) gene.
Transcriptome analysis of sorghum exhibiting different-colored lesions
To determine which F3′H was responsible for the pigmentation, we used RNA-seq to identify genes that were differentially expressed after cutting stress of sorghum leaves. F3′H (Sb04g024710N) was strongly induced in sorghum lines with purple lesions and intermediately in those with red lesions, but it was barely expressed in those with orange lesions (Figure 4). The high level of expression was consistent with the accumulation of luteolinidin (Figure 1B).
We then used RNA-seq to compare the relative expression levels of F3′H genes among 50 family genes with high levels of identity to Sb04g024710. Expression of F3′H (Sb04g024710N) was exclusively induced in sorghum with purple lesions (Figure 5). We therefore considered that expression of an F3′H gene (Sb04g024710N) is responsible for the synthesis of luteolinidin and thus plays a critical role in color variation in B. sorghicola spots on sorghum leaves.
We aimed to elucidate the key genes determining color variation in sorghum infected with B. sorghicola. We used sorghum populations derived from Nakei-MS3B (purple lesions) × M36001 (no color change in lesions), which showed graduated changes in lesion color. Metabolic analysis suggested that the relative proportions of apigeninidin and luteolinidin determined color variation (Figure 1). QTL analysis (Figure 2) and genomic sequencing (Figure 3, Additional file3: Figure S2, and Additional file4: Figure S3) suggested that two loci, containing the F3′H gene and the Tan1 transcription factor gene, were responsible for the color variation (Table 1). Finally, mRNA-seq suggested that the expression of one F3′H gene (Sb04g024710N) was particularly induced in sorghum lines with purple lesions (Figures 4 and 5). We therefore concluded that F3′H is responsible for synthesis of luteolinidin, and that its expression level is a critical determinant of color variation in sorghum.
F3′H in the 3-deoxyanthocyanidin pathway
The difference between the chemical formulae of apigeninidin (4′-hydroxylated) and luteolinidin (3′,4′-hydroxylated) is the hydroxylation at the 3′ position of ring B in luteolinidin (Figures 1 and 6). Our QTL (Figure 2) and RNA-seq (Figures 4 and 5) analyses suggested that F3′H was responsible for the color variation. The F3′H enzyme hydroxylates the 3′ position of the B-ring of naringenin to produce eriodictyol[36–38]. Sorghum F3′H (Sb04g024710, previously named SbF3′H2)-encoded proteins have F3′H activity in vivo to produce 3′-hydroxylated flavonoids[8, 26]. Therefore, we consider that expression of F3′H added this step of hydroxylation at the 3′-position of ring B of naringenin; consequently, an additional step led to the production of luteolinidin in the 3-deoxyanthocyanidin pathway (Figure 6).
The 3-deoxyanthocyanidins luteolinidin and apigeninidin are unique flavonoids that are not commonly found in higher plants. Sorghum accumulates 3-deoxyanthocyanidins synthesized from phenylalanine through naringenin as a common intermediate of anthocyanidins (Figure 6). Anthocyanidins are synthesized by the action of flavanone 3-hydroxylase (F3H), which in maize hydroxylates the 3 position of ring C of naringenin (Figure 6). Sorghum F3H1 (Sb06g031790.1) was not expressed in Nakei-MS3B or M36001 (Additional file5: Table S2), as was found in a previous study of BTx623[12, 39]. This lack of F3H activity is the critical determinant of the pathway to production of the unique 3-deoxyanthocyanidin flavonoids, instead of anthocyanidins, in sorghum (Figure 6). We therefore consider that naringenin is the branching point of the metabolic pathway to apigeninidin, luteolinidin, or the anthocyanidins.
What determines the activity of F3′H in sorghum tissues?
(1) F3′H locus
F3′H (Sb04g024710N) was highly expressed in Nakei-MS3B exhibiting purple lesions (Figures 4 and 5). F3′H (Sb04g024710) in M36001 and F3′H (Sb04g024750) in both lines were also expressed, but the expression levels were not as high as that of Sb04g024710N in Nakei-MS3B (Figure 4). This suggests that the expression level of the F3′H genes determines the activity of F3′H in the lesions. We considered that high-level expression was related mainly to the genomic deletion (Figure 3A), as the deletion was commonly inherited from Nakei-MS3B (Figure 3B) and was found in sorghum cultivars with purple lesions (Figure 3C). In the deleted region between Sb04g024710 and Sb04g024730 in lines #3, #62, and M36001, RNA-seq analysis revealed transcription from two unannotated regions (Figure 3B). The upstream transcript had 90% identity with that encoding the DNA-binding protein of Zea mays LOC100281685, and the downstream transcript encoded a heparan-α-glucosaminide N-acetyltransferase-like protein similar to that of Setaria italica (LOC101768299). This transcription might inhibit proximal F3′H expression in lines #3, #62 (which had heterozygous alleles), and M36001, and the inhibition might be released by the genomic deletion in Nakei-MS3B and lines #96 and #62. Nucleotide substitutions or insertion/deletion, or both, in the region upstream of F3′H (Additional file3: Figure S2) might affect the binding affinity of transcription factors to the promoter of F3′H, thus also changing the expression level of F3′H. In addition to these changes in expression level, amino acid substitutions in the C-terminal region of the F3′H protein (Figure 3A and Additional file2: Figure S1) might change the enzymatic activity of F3′H. These factors may synergistically affect total F3′H activity (Sb04g024710N in Nakei-MS3B or Sb04g024710 in M36001, and Sb04g024750 in both lines) in sorghum tissues and thus determine the relative proportions of apigeninidin and luteolinidin.
(2) Tan1 transcription factor
Our QTL analysis suggested that the locus containing Tan1 was responsible for color variation (Figure 2). Tan1 regulates the expression of genes encoding enzymes in the tannin or anthocyanin pathway, or both pathways, in the sorghum seed coat; these enzymes include CHS, CHI, F3H, DFR, ANS (anthocyanin synthase), and LAR (leucoanthocyanidin reductase). We hypothesized that Tan1 also controls the expression of F3′H (Figure 6). Expression of F3′H (Sb04g024750; this gene is common to all the sorghum lines used in this study) was higher in Nakei-MS3B (Tan1/Tan1) and #96 (Tan1/tan1-b) than in #3 and #127 (both of which had the tan1-b/tan1-b allele; Table 1, Additional file5: Table S2); several lines (f3′h/ f3′h, Tan1/Tan1; data not shown) had reddish lesions, unlike the orange lesions in #3 and #127 (f3′h/ f3′h, tan1-b/tan1-b). This suggests that Tan1 enhances the expression of F3′H in the leaf. Tan1 (Sb04g031730) was expressed in all sorghum lines used in this study (RPKM: 3.0–10.6; Additional file5: Table S2), but the 10-bp insertion (CGGGCAGCGG) in the exon region of the tan1-b allele caused a frame shift in the encoded protein (Additional file4: Figure S3). BTx623, M36001, #127, and #3 (all of which had the tan1-b/tan1-b allele) did not accumulate tannin in their seeds (Table 1), suggesting that this insertion is a common feature of the alleles encoding non-functional Tan1 transcription factors. As F3′H (Sb04g024710) was slightly expressed in M36001, #127, and #3 (Figure 4), Tan1 is not essential for F3′H expression in the leaf. We therefore consider that although the Tan1 allele is not essential for F3′H expression, Tan1 enhances F3′H expression and thus contributes to the generation of color variation in sorghum leaves.
Tan1 is a WD40-repeat protein. As the expression of anthocyanin biosynthetic genes is regulated through a complex of WD40-repeat proteins, MYB transcription factors (TFs), and basic helix-loop-helix (bHLH) TFs, Tan1 may form a complex with MYB TFs and bHLH TFs. Sorghum F3′H is regulated by a MYB protein encoded by yellow seed1 (y1) in the seed coat, an ortholog of maize pericarp color1 (p1)[42, 43]. P1 protein binds the cis-regulatory elements CCTACC (-614 to -553) and CCAACC (-83 to -78) and controls F3′H expression in maize. However, in sorghum the promoter of F3′H (Sb04g024710N in Nakei-MS3B or Sb04g024710 in M36001) does not contain the consensus sequence (Additional file3: Figure S2), suggesting that sorghum requires other MYB transcription factors for the color variation of leaf caused by infection with B. sorghicola.
Diversity of F3′H genes
Cytochrome P450 participates in metabolic networks such as those involving anthocyanins, tannins, flavones, and isoflavonoids[44, 45]. Cytochrome P450 domain–containing genes are abundant in sorghum: 326 genes encoding cytochrome P450 enzymes are annotated in sorghum BTx623, including the longest tandem gene array (15 genes). Our combination of QTL analysis (Figure 2) and transcriptome analysis (Figures 4 and 5) was a powerful tool for identifying the key genes expressed among family members, particularly an F3′H gene (Sb04g024710N) expressed among P450 family members (Figure 5). Sb04g024710 (SbF3′H2) expression is also involved in pathogen-specific 3-deoxyanthocyanidin synthesis in sorghum mesocotyls. Even though the downstream homologous gene Sb04g024750 (SbF3′H1) was also expressed in our study, its expression level was not as high as that of Sb04g024710N (Figure 4). Sb04g024750 (SbF3′H1) is expressed during light-specific anthocyanin accumulation. Sb09g022480.1 had 72.3% amino acid identity to F3′H (Sb04g024710.1), but its expression pattern was different from that of F3′H (Sb04g024710.1) (Figure 5). In sorghum, these duplications have resulted in diversity of both genomic sequences and gene expression; homologous genes have thereby developed different functions on an evolutionary time scale.
In other plants, mutants in which coloration is affected are also deficient in F3′H. The tt7 mutation in Arabidopsis, which makes the seeds pale brown, is caused by a single base transition generating a stop codon. The t mutant in soybean, which affects pigmentation in the seed coat and trichome hairs, is caused by a frameshift mutation[47, 48]. Of three spontaneous mutations in morning glory species, that on the magenta allele is a nonsense mutation generating a stop codon; pink mutants carry an insertion of the Ac/Ds superfamily transposable element; and the fuchsia allele is a single T insertion generating a stop codon. All of the F3′H genes on mutated alleles in Arabidopsis, soybean, and morning glory encode non-functional proteins. We consider that both of our F3′H alleles (Sb04g024710N and Sb04g024710) were functional, as the encoded proteins fully complement the tt7 mutation in Arabidopsis. Thus, the expression levels of F3′H genes are important for the gradual variation in color in disease-affected sorghum leaves. Coloration by flavonoids protects leaf cells from photooxidative damage, thus enhancing the efficiency of nutrient retrieval during senescence, and is responsible for a visual signal that attracts pollinators[37, 51]. Even though luteolinidin is toxic towards fungi and sorghum lines resistant to the fungus accumulate luteolinidin at higher levels than apigeninidin[8, 52], the biological importance of the coloration itself to the defense response against fungi remains to be elucidated.
Expression of F3′H is responsible for the synthesis of luteolinidin; the level of expression of F3′H is thus a critical determinant of color variation in sorghum leaves infected with B. sorghicola.
Snyder BA, Nicholson RL: Synthesis of phytoalexins in sorghum as a site-specific response to fungal ingress. Science. 1990, 248 (4963): 1637-1639. 10.1126/science.248.4963.1637.
Busk PK, Moller BL: Dhurrin synthesis in sorghum is regulated at the transcriptional level and induced by nitrogen fertilization in older plants. Plant Physiol. 2002, 129 (3): 1222-1231. 10.1104/pp.000687.
Dayan FE, Howell J, Weidenhamer JD: Dynamic root exudation of sorgoleone and its in planta mechanism of action. J Exp Bot. 2009, 60 (7): 2107-2117. 10.1093/jxb/erp082.
Clifford MN: Anthocyanins - nature, occurrence and dietary burden. J Sci Food Agric. 2000, 80 (7): 1063-1072. 10.1002/(SICI)1097-0010(20000515)80:7<1063::AID-JSFA605>3.0.CO;2-Q.
Lo SCC, De Verdier K, Nicholson RL: Accumulation of 3-deoxyanthocyanidin phytoalexins and resistance to Colletotrichum sublineolum in sorghum. Physiol Mol Plant P. 1999, 55 (5): 263-273. 10.1006/pmpp.1999.0231.
Kawahigashi H, Kasuga S, Ando T, Kanamori H, Wu J, Yonemaru J, Sazuka T, Matsumoto T: Positional cloning of ds1, the target leaf spot resistance gene against Bipolaris sorghicola in sorghum. Theor Appl Genet. 2011, 123 (1): 131-142. 10.1007/s00122-011-1572-1.
Hipskind JD, Hanau R, Leite B, Nicholson RL: Phytoalexin accumulation in sorghum - identification of an apigeninidin acyl ester. Physiol Mol Plant P. 1990, 36 (5): 381-396. 10.1016/0885-5765(90)90067-8.
Boddu J, Svabek C, Sekhon R, Gevens A, Nicholson RL, Jones AD, Pedersen JF, Gustine DL, Chopra S: Expression of a putative flavonoid 3 ’-hydroxylase in sorghum mesocotyls synthesizing 3-deoxyanthocyanidin phytoalexins. Physiol Mol Plant P. 2004, 65 (2): 101-113. 10.1016/j.pmpp.2004.11.007.
Petti C, Harman-Ware AE, Tateno M, Kushwaha R, Shearer A, Downie AB, Crocker M, DeBolt S: Sorghum mutant RG displays antithetic leaf shoot lignin accumulation resulting in improved stem saccharification properties. Biotechnol Biofuels. 2013, 6: 146-10.1186/1754-6834-6-146.
Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, Haberer G, Hellsten U, Mitros T, Poliakov A, Schmutz J, Spannagl M, Tang H, Wang X, Wicker T, Bharti AK, Chapman J, Feltus FA, Gowik U, Grigoriev IV, Lyons E, Maher CA, Martis M, Narechania A, Otillar RP, Penning BW, Salamov AA, Wang Y, Zhang L, Carpita NC: The Sorghum bicolor genome and the diversification of grasses. Nature. 2009, 457 (7229): 551-556. 10.1038/nature07723.
Mace ES, Tai S, Gilding EK, Li Y, Prentis PJ, Bian L, Campbell BC, Hu W, Innes DJ, Han X, Cruickshank A, Dai C, Frère C, Zhang H, Hunt CH, Wang X, Shatte T, Wang M, Su Z, Li J, Lin X, Godwin ID, Jordan DR, Wang J: Whole-genome sequencing reveals untapped genetic potential in Africa’s indigenous cereal crop sorghum. Nat Commun. 2013, 4: 2320-
Mizuno H, Kawahigashi H, Kawahara Y, Kanamori H, Ogata J, Minami H, Itoh T, Matsumoto T: Global transcriptome analysis reveals distinct expression among duplicated genes during sorghum-Bipolaris sorghicola interaction. BMC Plant Biol. 2012, 12: 121. 10.1186/1471-2229-12-121.
Yazawa T, Kawahigashi H, Matsumoto T, Mizuno H: Simultaneous transcriptome analysis of sorghum and Bipolaris sorghicola by using RNA-seq in combination with de novo transcriptome assembly. PLoS One. 2013, 8 (4): e62460. 10.1371/journal.pone.0062460.
Sawada Y, Akiyama K, Sakata A, Kuwahara A, Otsuki H, Sakurai T, Saito K, Hirai MY: Widely targeted metabolomics based on large-scale MS/MS data for elucidating metabolite accumulation patterns in plants. Plant Cell Physiol. 2009, 50 (1): 37-47. 10.1093/pcp/pcn183.
Yonemaru J, Ando T, Mizubayashi T, Kasuga S, Matsumoto T, Yano M: Development of genome-wide simple sequence repeat markers using whole-genome shotgun sequences of sorghum (Sorghum bicolor (L.) Moench). DNA Res. 2009, 16 (3): 187-193. 10.1093/dnares/dsp005.
Wu J, Mizuno H, Hayashi-Tsugane M, Ito Y, Chiden Y, Fujisawa M, Katagiri S, Saji S, Yoshiki S, Karasawa W, Yoshihara R, Hayashi A, Kobayashi H, Ito K, Hamada M, Okamoto M, Ikeno M, Ichikawa Y, Katayose Y, Yano M, Matsumoto T, Sasaki T: Physical maps and recombination frequency of six rice chromosomes. Plant J. 2003, 36 (5): 720-730. 10.1046/j.1365-313X.2003.01903.x.
Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012, 9 (4): 357-359. 10.1038/nmeth.1923.
Ouyang S, Buell CR: The TIGR plant repeat databases: a collective resource for the identification of repetitive sequences in plants. Nucleic Acids Res. 2004, 32 (Database issue): D360-D363.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup: The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009, 25 (16): 2078-2079. 10.1093/bioinformatics/btp352.
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL: TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013, 14 (4): R36. 10.1186/gb-2013-14-4-r36.
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5 (7): 621-628. 10.1038/nmeth.1226.
Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, Mitros T, Dirks W, Hellsten U, Putnam N, Rokhsar DS: Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012, 40 (D1): D1178-D1186. 10.1093/nar/gkr944.
Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, Salzberg SL, Wold BJ, Pachter L: Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010, 28 (5): 511-515. 10.1038/nbt.1621.
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL: BLAST+: architecture and applications. BMC Bioinformatics. 2009, 10: 421. 10.1186/1471-2105-10-421.
Thorvaldsdottir H, Robinson JT, Mesirov JP: Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform. 2013, 14 (2): 178-192. 10.1093/bib/bbs017.
Shih CH, Chu IK, Yip WK, Lo C: Differential expression of two flavonoid 3′-hydroxylase cDNAs involved in biosynthesis of anthocyanin pigments and 3-deoxyanthocyanidin phytoalexins in sorghum. Plant Cell Physiol. 2006, 47 (10): 1412-1419. 10.1093/pcp/pcl003.
Higo K, Ugawa Y, Iwamoto M, Korenaga T: Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res. 1999, 27 (1): 297-300. 10.1093/nar/27.1.297.
Vieweg MF, Fruhling M, Quandt HJ, Heim U, Baumlein H, Puhler A, Kuster H, Perlick AM: The promoter of the Vicia faba L. leghemoglobin gene VfLb29 is specifically activated in the infected cells of root nodules and in the arbuscule-containing cells of mycorrhizal roots from different legume and nonlegume plants. Mol Plant Microbe Interact. 2004, 17 (1): 62-69. 10.1094/MPMI.2004.17.1.62.
Fehlberg V, Vieweg MF, Dohmann EMN, Hohnjec N, Puhler A, Perlick AM, Kuster H: The promoter of the leghaemoglobin gene VfLb29: functional analysis and identification of modules necessary for its activation in the infected cells of root nodules and in the arbuscule-containing cells of mycorrhizal roots. J Exp Bot. 2005, 56 (413): 799-806. 10.1093/jxb/eri074.
Stougaard J, Jorgensen JE, Christensen T, Kuhle A, Marcker KA: Interdependence and nodule specificity of Cis-acting regulatory elements in the soybean leghemoglobin-Lbc3 and N23 gene promoters. Mol Gen Genet. 1990, 220 (3): 353-360. 10.1007/BF00391738.
Gowik U, Burscheidt J, Akyildiz M, Schlue U, Koczor M, Streubel M, Westhoff P: cis-regulatory elements for mesophyll-specific gene expression in the C4 plant Flaveria trinervia, the promoter of the C4 phosphoenolpyruvate carboxylase gene. Plant Cell. 2004, 16 (5): 1077-1090. 10.1105/tpc.019729.
Hudson ME, Quail PH: Identification of promoter motifs involved in the network of phytochrome A-regulated gene expression by combined analysis of genomic sequence and microarray data. Plant Physiol. 2003, 133 (4): 1605-1616. 10.1104/pp.103.030437.
Jiao Y, Ma L, Strickland E, Deng XW: Conservation and divergence of light-regulated genome expression patterns during seedling development in rice and Arabidopsis. Plant Cell. 2005, 17 (12): 3239-3256. 10.1105/tpc.105.035840.
Wu Y, Li X, Xiang W, Zhu C, Lin Z, Wu Y, Li J, Pandravada S, Ridder DD, Bai G, Wang ML, Trick HN, Bean SR, Tuinstra MR, Tesso TT, Yu J: Presence of tannins in sorghum grains is conditioned by different natural alleles of Tannin1. Proc Natl Acad Sci U S A. 2012, 109 (26): 10281-10286. 10.1073/pnas.1201700109.
Selinger DA, Chandler VL: A mutation in the pale aleurone color1 gene identifies a novel regulator of the maize anthocyanin pathway. Plant Cell. 1999, 11 (1): 5-14. 10.1105/tpc.11.1.5.
Holton TA, Cornish EC: Genetics and biochemistry of anthocyanin biosynthesis. Plant Cell. 1995, 7 (7): 1071-1083. 10.1105/tpc.7.7.1071.
Mol J, Grotewold E, Koes R: How genes paint flowers and seeds. Trends Plant Sci. 1998, 3 (6): 212-217. 10.1016/S1360-1385(98)01242-4.
Winkel-Shirley B: Flavonoid biosynthesis. A colorful model for genetics, biochemistry, cell biology, and biotechnology. Plant Physiol. 2001, 126 (2): 485-493. 10.1104/pp.126.2.485.
Liu H, Du Y, Chu H, Shih CH, Wong YW, Wang M, Chu IK, Tao Y, Lo C: Molecular dissection of the pathogen-inducible 3-deoxyanthocyanidin biosynthesis pathway in sorghum. Plant Cell Physiol. 2010, 51 (7): 1173-1185. 10.1093/pcp/pcq080.
Baudry A, Heim MA, Dubreucq B, Caboche M, Weisshaar B, Lepiniec L: TT2, TT8, and TTG1 synergistically specify the expression of BANYULS and proanthocyanidin biosynthesis in Arabidopsis thaliana. Plant J. 2004, 39 (3): 366-380. 10.1111/j.1365-313X.2004.02138.x.
Boddu J, Svabek C, Ibraheem F, Jones AD, Chopra S: Characterization of a deletion allele of a sorghum Myb gene, yellow seed1 showing loss of 3-deoxyflavonoids. Plant Sci. 2005, 169 (3): 542-552. 10.1016/j.plantsci.2005.05.007.
Sharma M, Chai C, Morohashi K, Grotewold E, Snook ME, Chopra S: Expression of flavonoid 3′-hydroxylase is controlled by P1, the regulator of 3-deoxyflavonoid biosynthesis in maize. BMC Plant Biol. 2012, 12: 196. 10.1186/1471-2229-12-196.
Grotewold E, Drummond BJ, Bowen B, Peterson T: The Myb-homologous P-gene controls phlobaphene pigmentation in maize floral organs by directly activating a flavonoid biosynthetic gene subset. Cell. 1994, 76 (3): 543-553. 10.1016/0092-8674(94)90117-1.
Mizutani M, Ohta D: Diversification of P450 genes during land plant evolution. Annu Rev Plant Biol. 2010, 61: 291-315. 10.1146/annurev-arplant-042809-112305.
Powles SB, Yu Q: Evolution in action: plants resistant to herbicides. Annu Rev Plant Biol. 2010, 61: 317-347. 10.1146/annurev-arplant-042809-112119.
Schoenbohm C, Martens S, Eder C, Forkmann G, Weisshaar B: Identification of the Arabidopsis thaliana flavonoid 3′-hydroxylase gene and functional expression of the encoded P450 enzyme. Biol Chem. 2000, 381 (8): 749-753.
Toda K, Yang D, Yamanaka N, Watanabe S, Harada K, Takahashi R: A single-base deletion in soybean flavonoid 3′-hydroxylase gene is associated with gray pubescence color. Plant Mol Biol. 2002, 50 (2): 187-196. 10.1023/A:1016087221334.
Zabala G, Vodkin L: Cloning of the pleiotropic T locus in soybean and two recessive alleles that differentially affect structure and expression of the encoded flavonoid 3′ hydroxylase. Genetics. 2003, 163 (1): 295-309.
Hoshino A, Morita Y, Choi JD, Saito N, Toki K, Tanaka Y, Iida S: Spontaneous mutations of the flavonoid 3′-hydroxylase gene conferring reddish flowers in the three morning glory species. Plant Cell Physiol. 2003, 44 (10): 990-1001. 10.1093/pcp/pcg143.
Feild TS, Lee DW, Holbrook NM: Why leaves turn red in autumn. The role of anthocyanins in senescing leaves of red-osier dogwood. Plant Physiol. 2001, 127 (2): 566-574. 10.1104/pp.010063.
Bradshaw HD, Schemske DW: Allele substitution at a flower colour locus produces a pollinator shift in monkeyflowers. Nature. 2003, 426 (6963): 176-178. 10.1038/nature02106.
Nicholson RL, Kollipara SS, Vincent JR, Lyons PC, Cadena-Gomez G: Phytoalexin synthesis by the sorghum mesocotyl in response to infection by pathogenic and nonpathogenic fungi. Proc Natl Acad Sci U S A. 1987, 84 (16): 5520-5524. 10.1073/pnas.84.16.5520.
The authors thank Ms. Kazuko Ohtsu for technical assistance in sample preparation, and Mr. Muneo Sato, Mr. Yutaka Yamada, and Ms. Akane Sakata for the metabolic analysis. This work was supported by a grant from the Ministry of Agriculture, Forestry, and Fisheries of Japan (Genomics for Agricultural Innovation, QTL-5502 and QTL-5506).
The authors declare that they have no competing interests.
H Kaw, SK, and JO prepared plant materials and performed cDNA synthesis; JY and TA performed QTL analysis; YS and MYH performed metabolic analysis; H Kan, JW, and TM performed sequencing experiments; TY performed the data analysis; HM and H Kaw designed the study; and HM wrote the manuscript. All authors read and approved the final manuscript.
Hiroshi Mizuno and Hiroyuki Kawahigashi contributed equally to this work.
Electronic supplementary material
Additional file 2: Figure S1: Comparison of amino acid sequences of F3′H from Nakei-MS3B and BTx623. The F3′H gene of Nakei-MS3B (Sb04g04710N) is the fused gene of Sb04g04710 and Sb04g04730 shown in Figure 3A; that of BTx623 is Sb04g024710, which is annotated in Phytozome. Two amino acids (red) are substituted in Nakei-MS3B. (PDF 39 KB)
Additional file 3: Figure S2: Comparison of upstream regions of F3′H. Upstream regions (1000 bp) of F3′H in BTx623 (Sb04g04710, upper) and Nakei-MS3B (Sb04g04710N, lower) are compared. Two nucleotide substitutions (at positions -144 and -664), and one insertion/deletion (at position -661) are shown in red. Nucleotide positions are counted from the transcription start site (position 1). (PDF 44 KB)
Additional file 4: Figure S3: Nucleotide polymorphisms in Tan1 (Sb04g031730), and the deduced amino acid sequences. Tan1 of Shan Qui Red sorghum encodes a functional WD40 protein. A 10-bp insertion in the exon causes a frame shift in M36001 and BTx623 sorghum. Nucleotide positions are based on the Shan Qui Red tan1 gene (accession number JX122967). (PDF 49 KB)
Additional file 5: Table S2: Expression ratios and description of transcripts. Transcript ID (Transcript), gene ID (Gene), chromosome number (Chromosome), start position (Start), end position (End), strand direction (Strand), description in Phytozome (Description), pfam ID (Pfam), reads per kilobase of exon model per million mapped reads (RPKM) before (before) or after (after) cutting stress, and calculated ratio of RPKM (Fold change) in each line are listed. (XLS 15 MB)
Cite this article
Mizuno, H., Yazawa, T., Kasuga, S. et al. Expression level of a flavonoid 3′-hydroxylase gene determines pathogen-induced color variation in sorghum. BMC Res Notes 7, 761 (2014). https://doi.org/10.1186/1756-0500-7-761