The mQTL hotspot on linkage group 16 for phenolic compounds in apple fruits is probably the result of a leucoanthocyanidin reductase gene at that locus
© Khan et al; licensee BioMed Central Ltd. 2012
Received: 16 September 2012
Accepted: 29 October 2012
Published: 2 November 2012
Our previous study on ripe apples from a progeny of a cross between the apple cultivars ‘Prima’ and ‘Fiesta’ showed a hotspot of mQTLs for phenolic compounds at the top of LG16, both in peel and in flesh tissues. In order to find the underlying gene(s) of this mQTL hotspot, we investigated the expression profiles of structural and putative transcription factor genes of the phenylpropanoid and flavonoid pathways during different stages of fruit development in progeny genotypes.
Only the structural gene leucoanthocyanidin reductase (MdLAR1) showed a significant correlation between transcript abundance and content of metabolites that mapped on the mQTL hotspot. This gene is located on LG16 in the mQTL hotspot. Progeny that had inherited one or two copies of the dominant MdLAR1 alleles (Mm, MM) showed a 4.4- and 11.8-fold higher expression level of MdLAR1 respectively, compared to the progeny that had inherited the recessive alleles (mm). This higher expression was associated with a four-fold increase of procyanidin dimer II as one representative metabolite that mapped in the mQTL hotspot. Although expression level of several structural genes were correlated with expression of other structural genes and with some MYB and bHLH transcription factor genes, only expression of MdLAR1 was correlated with metabolites that mapped at the mQTL hotspot. MdLAR1 is the only candidate gene that can explain the mQTL for procyanidins and flavan-3-ols. However, mQTLs for other phenylpropanoids such as phenolic esters, dihydrochalcones and flavonols, that appear to map at the same locus, have so far not been considered to be dependent on LAR, as their biosynthesis does not involve LAR activity. An explanation for this phenomenon is discussed.
Transcript abundances and genomic positions indicate that the mQTL hotspot for phenolic compounds at the top of LG16 is controlled by the MdLAR1 gene. The dominant allele of the MdLAR1 gene, causing increased content of metabolites that are potentially health beneficial, could be used in marker assisted selection of current apple breeding programs and for cisgenesis.
KeywordsPhenylpropanoid pathway Flavonoid pathway Transcript abundance Apple fruits Phenolic compounds Leucoanthocyanidin reductase gene
Apple (Malus × domestica Borkh) is an important source of many secondary metabolites known as phenolic compounds [1, 2]. These phenolic compounds have various functions in the plant such as protection against ultra violet light . The phenolic compounds such as procyanidins are polymers of flavan-3-ols. In plants they often function to prevent herbivory. They provide an astringent taste to foodstuffs and, at longer chain length, form complexes with proteins. Procyanidins are increasingly recognized for their beneficial effects on human health .
One of the important benefits of these compounds to consumers is their potential role against various human diseases such as cancer, coronary heart diseases, cardiovascular diseases, and diabetes [5, 6].
Phenolic compounds are synthesised through the phenylpropanoid and flavonoid pathways. For procyanidins, the biosynthetic pathway largely overlaps with that of anthocyanins. These complex biochemical pathways involve a series of enzymes. Many of these enzymes, as well as the encoding genes have been functionally characterized [7–10]. The first committed step to procyanidins has been postulated to be carried out by leucocyanidin reductase (LAR) .
In our previous study  we genetically mapped phenolic compounds that were detected in peel and in flesh of ripe apple fruits. We detected a hotspot of QTLs of metabolites (mQTLs) at the top of LG16. The metabolites that mapped at this locus were procyanidins (flavan-3-ols and their polymers), and other phenolic compounds such as phenolic esters and flavonol- and dihydrochalcone derivatives. All these compounds belong to the phenylpropanoids, and one could therefore speculate that the mQTL is controlled by a biosynthetic gene from the phenylpropanoid pathway, or by a transcription factor controlling this pathway.
The aim of the present study was to unravel which gene controlled the phenylpropanoid mQTL hotspot in apple. The approach used involved an expression analysis of structural and transcription factor genes of the phenylpropanoid and flavonoid pathway. By looking closer at the draft sequence of the whole genome of the apple cultivar ‘Golden Delicious’ , the structural gene leucoanthocyanidin reductase (MdLAR1) and seven transcription factor genes were detected in the genetic window of the mQTL hotspot. Therefore the transcript abundances of these genes were investigated. In addition, expression profiles of the structural genes of the phenylpropanoid and flavonoid pathways outside of the mQTL hotspot were studied. A strong positive correlation between the expression level of the MdLAR1 gene and the level of metabolites that mapped at LG16 was observed. This was not found for any of the other genes studied. This indicates that the MdLAR1 gene is the major candidate gene controlling the mQTL hotspot on LG16. Further evidence is provided by the fact that the MdLAR1 gene is the only structural gene of the phenylpropanoid and flavonoid pathways that resides in the mQTL hotspot.
In this study, fruits from the segregating F1 population derived from the cross between the cultivars ‘Prima’ and ‘Fiesta’ were used. This population was used in our previous study too, in which the mQTL hotspot and other mQTLs were detected .
Selection of genotypes and harvesting of fruits for gene expression studies
There were three classes of genotypes based on co-segregating genetic markers: MM was the homozygous dominant class. These progeny inherited from each parent one dominant allele for increased content of the procyanidin dimer II. Mm was the heterozygous class, which has one dominant allele from one parent and one recessive allele from the other parent. The heterozygous progeny had high content of the metabolite too. The third class is the homozygous recessive class mm. This class received both recessive alleles from the two parents, and showed a low content of the metabolite.
RNA isolation from apple fruits
Total RNA was isolated from peel and flesh of apple fruits separately according to the CTAB method described by Asif et al. . The RNA quantity was measured on NanoDrop® spectrophotometer model ND-1000 from isogen lifescience scientific company as explained by Khan et al.  and the RNA quality and quantity were measured by running 2 μl of the RNA sample on a 1.5% agarose gel. First single-strand complementary DNA (cDNA) was synthesized using iScript™ cDNA Synthesis Kit (Bio-Rad) according to the manufacturer’s manual.
Selection of genes for qRT-PCR studies and primer design
Genes that were included in the expression analysis
Forward primer (5’→ 3’)
Reverse primer (5’→ 3’)
LG on “Golden Delicious”
Gene position on LG (kbp)
TF genes at mQTL hotspot
Transcription factor genes outside the mQTL hotspot
In view of the importance of MdLAR1, two primer pairs were designed for this gene in two different non-overlapping regions. By means of these primer pairs, the two fragments were amplified and sequenced for each genotype class. This was done for verification of the gene specificity of the primers, since MdLAR1 on LG16 and MdLAR2 on LG13 (Table 1) show 62% similarity at the nucleotide level. The sequences of MdLAR1 and MdLAR2 in ‘Prima’ × ‘Fiesta’ showed good alignment with sequences from cv. ‘Golden Delicious’ on which the primers for qRT-PCR were designed.
Performing q RT-PCR and data analysis
Gene expression was measured using Fluidigm Dynamic Array integrated fluidic circuits for cDNA samples from peel and flesh for the genotypes and development stages mentioned in Additional file 1. Fluidigm used the BioMark™ System and Evagreen DNA binding dye (http://www.fluidigm.com). Three 96×96 Dynamic Arrays of Integrated Fluidic Circuits, comprising 48 primer pairs in two replicates were used. The q RT-PCR set up for the reference gene and other control samples, and data analysis was performed as described by Khan et al. .
Correlation network analysis
The correlation coefficients were calculated between the contents of seven metabolites representative for different branches of the phenylpropanoid and flavonoid pathway, and for the expression of 18 structural genes and 18 transcription factors possibly involved in these pathways. Before calculation of the correlation coefficients, the data were 10log transformed for normalisation purposes. Scatter plots were made between the different 10log transformed variables, in order to make sure that outliers did not bias correlation values, and to check the distributions.
Visualization of the correlation network was performed by the Pajek software package (http://pajek.imfm.si/doku.php). Besides a biological quantitative pattern which is observed in a set of samples as the result of physiological processes, data may have a particular embedded ‘experimental pattern’ which is due to the experiment performance, such as extraction errors and measurement or calibration errors. So, different analytical methods run on the same set of samples may give different experimental patterns. Therefore, correlations between variables observed within particular experiments may be stronger than correlations between variables from different experiments. Here we have a correlation matrix of two different experiments and, therefore, three types of correlations (sub-matrices) are present: gene-to-gene correlations, metabolite-to-metabolite correlations and gene-to-metabolite correlations. Lower correlation coefficients might be expected in the third sub-matrix due to interference of different experimental patterns. To compensate for this effect and to obtain a balanced correlation network we standardized correlation coefficients separately for each of the three sub-matrices. For these a maximum positive and negative correlation coefficients r were found in each sub-matrix and then were set to 1.0 and −1.0, respectively. Other correlation coefficients of each sub-matrix were expressed relative to their maximum ones. The standardized correlation coefficients are further denoted as rs.
Association between expression of structural genes of the phenylpropanoid/flavonoid pathways and concentrations of metabolites that mapped at the mQTL hotspot
The progeny that had inherited the recessive alleles for low procyanidin dimer II content (mm), showed a low expression of MdLAR1 throughout fruit development, both in peel and flesh (Figure 2). However, the heterozygous group (Mm) showed a higher expression, compared to the homozygous recessive (mm) group, whereas the homozygous dominant progeny (MM) with high content of procyanidin dimer II showed the highest expression of MdLAR1 (Figure 2). The expression level of MdLAR1 was highly significantly, positively correlated with procyanidin dimer II content, according to Student’s t-test (P < 0.1%). On the average, the MM genotypes had a four times higher content of this metabolite at the ripe stage compared to the mm genotypes (Figure 1), both in peel and flesh.
No significant correlation was detected between transcript abundance of the other evaluated genes at the one hand with the concentration of procyanidin dimer II at the other hand (Figure 3).
The transcript abundance of MdLAR1 was also significantly correlated with the other metabolites that mapped at the LG16 hotspot (Figure 3). However, the other studied genes did not show this high correlation with any of the metabolites at the mQTL hotspot. This suggests that the MdLAR1 gene is the gene controlling the mQTL of procyanidin dimer II, and of all other phenolic compounds that mapped at this hotspot on LG16.
Association between expression of transcription factor genes and concentrations of metabolites that mapped at the mQTL hotspot
The finding that compounds from different locations in the pathway mapped at the same mQTL hotspot  could suggest that a transcription factor was involved in the mQTL. At the mQTL locus, seven candidate transcription factor genes were identified (Table 1). However, there was no clear correlation for any of these candidate transcription factor genes with the procyanidin dimer II content in peel and flesh (Figure 3). This indicates that the evaluated transcription factor genes at the mQTL hotspot were not responsible for this hotspot.
In addition, 11 more candidate transcription factor genes were identified throughout the genome, or on homology to known transcription factors involved in the phenylpropanoid and flavonoid pathway (Table 1). No clear correlation was found between the expression of any of these putative transcription factor genes and the metabolites that mapped at the hotspot (Figure 3). This indicates that transcription factor genes outside the mQTL hotspot were not controlling this hotspot either.
Associations between expression of structural genes and transcription factor genes
The correlation matrix (Figure 3) shows that the expression levels of many genes were correlated to one another. As an example, the expression of the structural genes MdPAL, MdC4H, MdUFGT, MdCHS, MdCHI, MdF3H, MdDFR, MdLAR2 and MdANS were positively correlated to one another. This cluster of structural genes showed also a positive correlation with the expression of the three transcription factor genes b-HLH1543, MdMyb11.A and MdMyb9. This suggests that these three transcription factor genes may regulate this cluster of structural genes, but did not control the mQTL hotspot on LG16.
In the second cluster of the network, many structural genes and transcription factors appear to be connected to one another (Figure 4). Several genes in the network are important nodes, and are connected to many other genes. This is especially the case for MYB transcription factors, such as MdMYB9, MdMYB11_A, and MdMYB5a_A, and for b-HLH transcription factors, such as b-HLH1881, b-HLH1967, and MdbHLH33 (Figure 4). Probably these transcription factors regulate many structural genes in the phenylpropanoid pathway. However, none of these transcription factor genes is directly connected to metabolites in the first cluster. In spite of the important regulatory roles of the mentioned MYB and b-HLH transcription factor genes in the phenylpropanoid and flavonoid pathway, they were not responsible for the mQTL hotspot.
Aim of the study
In our previous study  we mapped phenolic compounds in ripe fruits of a segregating F1 population derived from the cross between cultivars ‘Prima’ and ‘Fiesta’. There appeared to be a strong hotspot of mQTLs at the top of LG16. Annotation of the metabolites showed that the compounds that mapped on the LG16 hotspot belong to the phenylpropanoid and flavonoid pathways (Figure 5).
We wanted to discover which gene(s) controlled this mQTL hotspot. Therefore, in the present research, transcript abundances for the candidate genes in the mQTL region were measured in progeny genotypes that segregated for these mQTLs. In addition, structural genes of the phenylpropanoid and flavonoid pathways and putative transcription factor genes that are candidates for regulating these pathways and located elsewhere were evaluated as mentioned in the Methods section in detail.
MdLAR1 seems to be the only gene that can explain the mQTL hotspot on LG16
The procyanidin content was higher in the flesh compared to the peel (Figure 1). However, the expression of MdLAR1 was lower in the flesh compared to the peel (Figure 6). A possible explanation is the fact that flavonols and anthocyanins are produced in the peel only. These may compete for the pool of available substrates, leading to relatively lower procyanidins level.
How can MdLAR1 explain the observed mQTLs?
The MdLAR1 gene clearly explains the mQTL for procyanidin content, as LAR from leguminosal species has been implicated in the synthesis of catechin, a building block for procyanidins . Remarkably, we found several mQTLs in the same hotspot on LG16 for metabolites (kaempferol glycosides, phloridzin, phenolic esters) that are synthesized by different branches from the phenylpropanoid pathway  (Figure 5). Since LAR is not known to be involved in the biosynthesis of these other metabolites, the observed differential LAR expression does not provide a straightforward explanation for the presence of the mQTLs of these more upstream metabolites.
One could speculate about the effect that LAR overexpression may have effect on the total flux through the phenylpropanoid pathway. We note that the positively associated mQTLs (procyanidins, dihydrochalcones, phenolic esters and kaempferol glycosides) all map downstream of coumaroyl-CoA ligase (4CL) in the pathway (Figure 5). A metabolite that maps upstream of 4CL is coumaroyl hexoside, for which the level was negatively correlated with e.g. procyanidins. This appears also from Figure 3.
In apple, no 4CL-like gene is located at the mQTL hotspot . Moreover, the expression of the tested 4CL gene did not correlate with the metabolites that mapped at the hotspot. One explanation may be that MdLAR1 overexpression relieves a feedback mechanism on the enzymatic activity of 4CL. 4CL is known to be feedback inhibited by metabolites from the phenylpropanoid pathway, such as naringenin . Possi bly, the enhanced MdLAR1 activity will lead to depletion of pathway intermediates such as naringenin, which may thus activate 4CL activity and lead to a higher general flux, from coumaroyl glycoside towards the downstream metabolites. The support for such a mechanism needs extensive experimentation, which is outside the scope of this article.
An unlikely, but still possible alternative explanation for the mQTL hotspot could be that a transcription factor at the mQTL hotspot regulated the expression of MdLAR1. As we did not see any differencial expression of the transcription factor genes at the mQTL hotspot, the different alleles of that transcription factor gene would not differ in expression levels, but theoretically could differ in effect of the protein. Further, that transcription factor might have influenced 4CL paralogous that were not covered by the used primer pair. We do not regard this as a likely explanation, but it cannot be completely excluded.
Transcript abundances of several structural genes and transcription factor genes were correlated
MdANR also contributes to the synthesis of procyanidins (Figure 5). The expression level of this gene significantly correlated with expression of several structural genes such as PAL, CHS, DFR, and ANS (Figures 3 and 4). Moreover, there was a clear correlation between the expression of these structural genes, and the expression of the transcription factor genes MYB9 and MYB11 (Figures 3 and 4). Possibly, these transcription factors regulated the mentioned structural genes. However, the transcript abundances of none of these structural or transcription factor genes did correlate significantly with the metabolite abundances that mapped at the mQTL hotspot on LG16 (Figure 3). This indicates that these structural genes were not the bottleneck for the pathway, whereas probably MdLAR1 was the limiting factor in the progeny that had inherited both lowly expressed alleles of this gene (mm). Presumably, the bottleneck was (partly) removed in case of presence of one or two higher expressed alleles of MdLAR1 (MM, Mm).
The dominant allele of the MdLAR1 gene, causing increased content of metabolites that are potentially health beneficial, could be used in marker assisted selection of current apple breeding programs. This selection could be made at seedling stage. This would reduce the production costs for the breeders by discarding the undesired seedlings at earlier stage of growth, whereas in classical breeding only after six years, when trees start to bear fruits, selection on fruit content is possible. Another possibility is to clone the dominant allele or alleles for engineering increased content of metabolite(s) into existing apple cultivars by different transformation technologies including cisgenesis [17, 18].
Our results indicate that MdLAR1 is the most likely candidate gene responsible for the mQTL hotspot for phenolic compounds on LG16 of apple, both in peel and flesh. Increased levels of metabolites downstream of MdLAR1, such as the flavan-3-ols epicatechin and procyanidin dimer II may be directly caused by increased transcript abundance of MdLAR1, as this gene is known to participate in procyanidin biosynthesis.
We are thankful to Higher Education Commission (HEC) of Pakistan for the fellowship funding and INOVA fruit B.V. for the financial support during this research.
- Khan SA, Chibon PY, de Vos RCH, Schipper BA, Walraven E, Beekwilder J, van Dijk T, Finkers R, Visser RG, van de Weg EW, et al: Genetic analysis of metabolites in apple fruits indicates an mQTL hotspot for phenolic compounds on linkage group 16. J Exp Bot. 2012, 63 (8): 2895-2908. 10.1093/jxb/err464.PubMedPubMed CentralView ArticleGoogle Scholar
- Lu Y, Foo LY: Identification and quantification of major polyphenols in apple pomace. Food Chem. 1997, 59 (2): 187-194. 10.1016/S0308-8146(96)00287-7.View ArticleGoogle Scholar
- Robberecht R, Caldwell MM: Leaf epidermal transmittance of ultraviolet radiation and its implications for plant sensitivity to ulraviolet-radiation induced injury. Oecologia. 1978, 32 (3): 277-287. 10.1007/BF00345107.View ArticleGoogle Scholar
- Dixon RA, Xie DY, Sharma SB: Proanthocyanidins - a final frontier in flavonoid research?. New Phytol. 2005, 165 (1): 9-28.PubMedView ArticleGoogle Scholar
- Eberhardt MV, Lee CY, Liu RH: Antioxidant activity of fresh apples. Nature. 2000, 405 (6789): 903-904.PubMedGoogle Scholar
- Mcghie TK, Hunt M, Barnett LE: Cultivar and growing region determine the antioxidant polyphenolic concentration and composition of apples grown in New Zealand. J Agric Food Chem. 2005, 53 (8): 3065-3070. 10.1021/jf047832r.PubMedView ArticleGoogle Scholar
- Han Y, Korban SS: Genes encoding flavonoid 3'-hydroxylase in apple and their tagged molecular markers. Acta Horticulturae. 2009, 839: 409-414.View ArticleGoogle Scholar
- Jugde H, Nguy D, Moller I, Cooney JM, Atkinson RG: Isolation and characterization of a novel glycosyltransferase that converts phloretin to phlorizin, a potent antioxidant in apple. FEBS J. 2008, 275 (15): 3804-3814. 10.1111/j.1742-4658.2008.06526.x.PubMedView ArticleGoogle Scholar
- Takos AM, Ubi BE, Robinson SP, Walker AR: Condensed tannin biosynthesis genes are regulated separately from other flavonoid biosynthesis genes in apple fruit skin. Plant Sci. 2006, 170 (3): 487-499. 10.1016/j.plantsci.2005.10.001.View ArticleGoogle Scholar
- Kim SH, Lee JR, Hong ST, Yoo YK, An G, Kim SR: Molecular cloning and analysis of anthocyanin biosynthesis genes preferentially expressed in apple skin. Plant Sci. 2003, 165 (2): 403-413. 10.1016/S0168-9452(03)00201-2.View ArticleGoogle Scholar
- Tanner GJ, Francki KT, Abrahams S, Watson JM, Larkin PJ, Ashton AR: Proanthocyanidin biosynthesis in plants - Purification of legume leucoanthocyanidin reductase and molecular cloning of its cDNA. J Biol Chem. 2003, 278 (34): 31647-31656. 10.1074/jbc.M302783200.PubMedView ArticleGoogle Scholar
- Velasco R, Zharkikh A, Affourtit J, Dhingra A, Cestaro A, Kalyanaraman A, Fontana P, Bhatnagar SK, Troggio M, Pruss D, et al: The genome of the domesticated apple (Malus x domestica Borkh). Nat Genet. 2010, 42 (10): 833-10.1038/ng.654.PubMedView ArticleGoogle Scholar
- Asif MH, Dhawan P, Nath P: A simple procedure for the isolation of high quality RNA from ripening banana fruit. Plant Molecular Biology Reporter. 2000, 18 (2): 109-115. 10.1007/BF02824018.View ArticleGoogle Scholar
- Khan SA, Beekwilder J, Schaart JG, Mumm R, Soriano JM, Jacobsen E, Schouten HJ: Differences in acidity of apples are probably mainly caused by a malic acid transporter gene on LG16. Tree genetics and genomes. 2012, 10.1007/s11295-012-0571-y.Google Scholar
- Huang YF, Doligez A, Fournier-Level A, Le Cunff L, Bertrand Y, Canaguier A, Morel C, Miralles V, Veran F, Souquet JM, et al: Dissecting genetic architecture of grape proanthocyanidin composition through quantitative trait locus mapping. BMC Plant Biol. 2012, 12: 30-10.1186/1471-2229-12-30.PubMedPubMed CentralView ArticleGoogle Scholar
- Voo Kui S, Whetten RW, O'Malley DM, Sederoff RR: 4-Coumarate: Coenzyme A ligase from loblolly pine xylem. Isolation, characterization, and complementary DNA cloning. Plant Physiol. 1995, 108 (1): 85-97. 10.1104/pp.108.1.85.View ArticleGoogle Scholar
- Schouten HJ, Krens FA, Jacobsen E: Cisgenic plants are similar to traditionally bred plants: International regulations for genetically modified organisms should be altered to exempt cisgenesis. EMBO Rep. 2006, 7 (8): 750-753. 10.1038/sj.embor.7400769.PubMedPubMed CentralView ArticleGoogle Scholar
- Schouten HJ, Krens FA, Jacobsen E: Do cisgenic plants warrant less stringent oversight?. Nat Biotechnol. 2006, 24 (7): 753-10.1038/nbt0706-753.PubMedView ArticleGoogle Scholar