- Research article
- Open Access
Structural insights into the mechanism defining substrate affinity in Arabidopsis thaliana dUTPase: the role of tryptophan 93 in ligand orientation
BMC Research Notesvolume 8, Article number: 784 (2015)
Deoxyuridine triphosphate nucleotidohydrolase (dUTPase) hydrolyzes dUTP to dUMP and pyrophosphate to maintain the cellular thymine-uracil ratio. dUTPase is also a target for cancer chemotherapy. However, the mechanism defining its substrate affinity remains unclear. Sequence comparisons of various dUTPases revealed that Arabidopsis thaliana dUTPase has a unique tryptophan at position 93, which potentially contributes to its degree of substrate affinity. To better understand the roles of tryptophan 93, A. thaliana dUTPase was studied.
Enzyme assays showed that A. thaliana dUTPase belongs to a high-affinity group of isozymes, which also includes the enzymes from Escherichia coli and Mycobacterium tuberculosis. Enzymes from Homo sapiens and Saccharomyces cerevisiae are grouped as low-affinity dUTPases. The structure of the homo-trimeric A. thaliana dUTPase showed three active sites, each with a different set of ligand interactions between the amino acids and water molecules. On an α-helix, tryptophan 93 appears to keep serine 89 in place via a water molecule and to specifically direct the ligand. Upon being oriented in the active site, the C-terminal residues close the active site to promote the reaction.
In the high-affinity group, the prefixed direction of the serine residues was oriented by a positively charged residue located four amino acids away, while low-affinity enzymes possess small hydrophobic residues at the corresponding sites.
Deoxyuridine triphosphate nucleotidohydrolase (dUTPase; EC 220.127.116.11) is an important enzyme that prevents uracil misincorporation during de novo DNA synthesis . It catalyzes the hydrolysis of dUTP to deoxyuridine monophosphate (dUMP) and inorganic pyrophosphate [2, 3], thereby maintaining an appropriate level of dUTP with respect to deoxythymidine triphosphate (dTTP) levels . Compromising dUTPase activity in fast-growing cells causes an imbalance in the dUTP–dTTP ratio that can cause uracil misincorporation into DNA . Due to its role in fast-growth-specific cell death, dUTPase has been a target for cancer chemotherapy [5, 6].
Homo-trimeric dUTPases have three active sites, each of which consists of five conserved motifs  (Fig. 1a–c). An aspartate in motif 1 interacts with active site water molecules to stabilize the divalent cation cofactor Mg2+, which is important for enzymatic activity [7–9]. A serine in motif 2 interacts with the oxygen atom between α, β-phosphate to induce a reaction-favorable orientation  (Ser 89 in Arabidopsis dUTPase). An aspartate in motif 3 activates catalytic water ; a glutamine in motif 4 also interacts with the catalytic water (Wcat in Fig. 1a) and the ligand . Interactions between ligands and residues in motif 5 help orient the ligand so that the α phosphate locates close to the catalytic water . The homo-trimeric dUTPase kinetic mechanism has mainly been studied by multidimensional nuclear magnetic resonance (NMR) [13, 14], quench-flow experiments , and the mixed quantum mechanics/molecular mechanics (QM/MM) calculations . These studies have revealed at least four distinct enzymatic steps including substrate binding, isomerization, hydrolysis, and release. However, the mechanism for defining the substrate affinity remains unclear.
To address the basis for substrate affinity differences among dUTPases, we chose those five dUTPases using two criteria, including (i) consistent measurement conditions of Km accompanied by (ii) reported X-ray structure of both apo and holo formats. We compared amino acid sequences as a function of substrate affinity, including in the analysis high-affinity isozymes (low Km) from Escherichia coli , Mycobacterium tuberculosis , and Arabidopsis thaliana (thale cress; this study)  and low-affinity isozymes (high Km) from Saccharomyces cerevisiae (yeast)  and Homo sapiens (human)  (Fig. 1c; Table 1).
We found that Arabidopsis dUTPase contains a unique tryptophan at the 93rd position, which is located between motifs 2 and 3 (Fig. 1c; Table 1). To identify the role of the 93rd tryptophan, we solved the structure of Arabidopsis dUTPase in its holo form. This homo-trimeric enzyme shows a unique set of interactions with ligands, amino acids, and water molecules at each active site. A comparison of the active sites reveals that tryptophan 93 seems to play a key role in guiding serine reorientation via a water molecule to orient the incoming ligand. In high-affinity dUTPases, the serine residue can be held in place in similar manner by a positively charged residue located four amino acids away. In contrast, low-affinity enzymes lack charges at the corresponding sites.
Arabidopsis dUTPase was prepared as described previously [18, 20, 21]. Briefly, His-tagged Arabidopsis dUTPase was purified via Ni–NTA chromatography from the cleared lysate of E. coli JM103 (DE3) cells. The tag was removed by thrombin cleavage. The resulting dUTPase includes three extra amino acids, Gly–Ser–His, at the amino-terminus (Fig. 1c).
dUTPase activity assay
The enzymatic activity assay was performed using cresol red, and the Km values were calculated using the integrated Michaelis–Menten method [22, 23] with a stopped-flow instrument (Hi-Tech SF-61DX2, TgK Scientific, Bradford-on-Avon, UK) equipped with a photodiode array detector. The assay solution contained 100 mM KCl, 5 mM MgCl2, and 0.25 mM bicine at pH 7.6. dUTPase, to a final concentration of 50 nM, was rapidly mixed with 1–5 μM dUTP solutions in the stopped-flow system, and absorbance was monitored at 573 nm (Fig. 1d).
Crystals of holo Arabidopsis dUTPase were grown by vapor diffusion with the hanging drop method using 2 M ammonium sulfate as a precipitant in 50 mM Tris–HCl at pH 7.4 and 5 mM of the non-hydrolyzable ligand analog 2′-deoxyuridine 5′-[(α,β)-imido]-triphosphate (dUpNHpp; Jena Bioscience, Jena, Germany) along with 5 mM MgSO4 (Table 2) [18, 20, 21].
X-ray diffraction data collection, structural analysis, and structural mining
Diffraction data were collected from a single holo dUTPase crystal at the Advanced Photon Source (Argonne, IL, USA) sector 14-BM-C. The data collection and refinement statistics are shown in Table 2. While the holo crystal diffracted beyond 1.2 Å resolution, we only used reflections up to 1.5 Å resolution to keep the redundancy more than five and the linear R factor less than 0.6. To increase the resolution of the structure in apo formats, we further refined the apo structure (PDB ID, 2P9O) with previously collected data by PHENIX ; the updated apo structure was used as the starting model for the structure in the holo format. The structure of the holo enzyme was also refined by PHENIX using the same Rfree flag assignment used in apo structure refinement. We deposited the apo and holo structures in PDB with the IDs 4OOQ and 4OOP, respectively. The COOT , PISA , and PyMOL (Schrödinger, San Diego, CA) software packages were used for structural mining and graphical presentation.
Results and discussion
Enzymatic activity and crystallization
The Arabidopsis dUTPase showed enzymatic activity, with estimated Km and Vmax values of 0.4 ± 0.1 μM and 1.2 ± 0.05 µM s−1, respectively, at pH 7.6 and 25 ℃ (Fig. 1d). Therefore, the enzyme belongs to the high-substrate-affinity group, which also includes the E. coli and M. tuberculosis proteins (Table 1).
Crystallization of the holo Arabidopsis dUTPase was performed using the non-hydrolysable dUTP analog dUpNHpp and ammonium sulfate as the precipitant. In the apo Arabidopsis dUTPase, taurine was an indispensable additive for the growth of single crystals . In the holo format, the growth of single crystals was not dependent on the presence of taurine. The protein formed needle clusters in the absence of dUpNHpp and taurine.
Arabidopsis dUTPase structure
The refined Arabidopsis dUTPase structure showed a trimeric structure (Figs. 1a, 2a; Table 2). The first 24 N-terminal residues of all of the subunits were equally disordered. However, there were notable differences in the interpretable C-terminal domains when all of the active sites had a bound ligand (Fig. 2b). Next, we used PISA to assess the effects of crystal packing on the different C-terminal lengths . Chains A and C interacted with neighboring subunits (Additional file 1: Figure S1), while chain B did not. Since α carbon positions between chain B and C are identical, crystal packing apparently contributes to the variety in the chain A C-terminal length.
Three apo–holo common waters accommodate the substrate
Because the C-terminal residues are involved in ligand interactions, the environment of the ligand-binding site was analyzed (Additional file 2: Table S1 and Additional file 3: Figure S2). Aside from the three ordered water molecules that interact with the magnesium ion, there were three additional ordered water molecules (Fig. 2c) that interact with the ligand at the nitrogenous base oxygen atom O4, the α-phosphate oxygen atom, and the β-phosphate oxygen atom. When the active sites of both the apo and holo forms are superimposed, all of the apo active sites have three water molecules at similar locations as those commonly found in the holo form. These data suggest that the coordination of these water molecules is necessary for initial ligand binding at the active site.
Replacement of ligand-associated water with C-terminal residues reorients the ligand
Ligand 1 bound to active site 1 involves the shortest interpretable C-terminus (Fig. 2d, active site 1) and has the most interactions with water molecules. Ligand 2 in active site 2 involves a medium-length interpretable C-terminus, and has the fewest interactions with water molecules and amino acid residues. Ligand 2 has the highest average B-factor among the 3 ligands, namely 42.8 Å2 as calculated by COOT. A likely explanation for this result is that the C-terminal residues involved in this active site have the most interactions with neighboring subunits due to crystal packing, as discussed above (Additional file 1: Figure S1). Superimposition of the ligand 1 and 2 binding sites (Fig. 2d, active sites 1 and 2) shows that the ligand 1-interacting water molecule (indicated by a green arrow in Fig. 2d) is located at a coordinate similar to that of the Arg156 amino group in the ligand 2 binding site. The electron density map does not support the presence of the corresponding Arg156 coordinate in the ligand 1 binding site. We interpret these data as ligand 1 being in a pre-ordered state, such that the bound ligand is reoriented by the replacement of the ligand–water interaction with the ligand–Arg156 interaction.
This “interaction–replacement” phenomenon is also observed in the ligand 3 binding site (red and black arrows in Fig. 2d, active sites 2 and 3). This binding site involves the completed C-terminus and has the largest number of ligand-amino acid interactions. A superimposition of all of the ligand-binding sites shows that Arg156 undergoes a structural change to interact with the water molecule ordered by the magnesium ion in the holo form. Some ligand–water molecule interactions found in both the ligand 1 and 2 binding sites are replaced by ligand–Ser163 or ligand–Thr164 interactions in the ligand 3 binding site.
We compared ligand coordinates focusing between ligand 1 and 3 binding sites. It was because the C-terminal residues involved in the ligand 2 binding site (chain A residues from 145 to 152) had interactions with four neighboring residues, and resulted the structural difference in the C-termini residues comparing from other subunits, chain B and C (Additional file 1: Figure S1). It appears that the ligand coordinate in the ligand 3 binding site appears to be engaged in the more stable orientation in terms of increased ligand–amino acid interactions. Additionally, the ligand coordinate is in the most favorable position for nucleophilic attack. The γ-phosphate group of ligand 3 occupies a position that is different from those of the other two ligands. A comparison of the ligand 3 and 1 coordinates relative to the catalytic water shows that the ligand 3 α-phosphate is closer to the catalytic water by approximately 0.3 Å (Wcat in Fig. 1a, inhibitor-bound active site).
Roles of Trp93 in Arabidopsis dUTPase ligand binding
Serine 89 of motif 2 in Arabidopsis dUTPase is an important residue for maintaining a reaction-favorable ligand orientation at the active site (Fig. 2c). It undergoes a conformational change between the apo and holo forms to interact with the oxygen atom between α,β-phosphate . This serine side chain flipping is observed in the obtained Arabidopsis dUTPase structure (Figs. 2c, 3a). This rearrangement is commonly observed in all active sites of the compared dUTPases except for one of the active sites in yeast dUTPase.
Tryptophan 93 of Arabidopsis dUTPase may play an important role in orienting serine 89 in the holo form (Fig. 2c). In the apo form, all of the tryptophan 93 side-chain coordinates were oriented upwards; thus, all of the tryptophan 93 Nε1 atoms were located away from the serine 89 Oγ. These are likely due to crystal-packing induced hydrophobic interactions with proline 46 in neighboring subunits. In contrast, the tryptophan 93 side chain in the holo-form chain B had different coordinates compared with those in other two subunits. It is located such that the tryptophan 93 Nε1 was closer to the serine 89 Oγ. Although tryptophan 93 in the holo form may make the same hydrophobic interactions with proline 46 in neighboring subunit, this particular orientation was likely due to the presence of a nearby water molecule, which bridges its interaction with serine 89. This serine 89–water–tryptophan 93 interaction was only found in chain B. This is part of the ligand 1 binding site, which involves the shortest interpretable C-terminus.
Together with the finding that ligand 1 is likely in a pre-ordered state for the reaction, we assume that tryptophan 93 acts as a key residue for initial ligand orientation at the active site by promoting serine 89 side-chain coordinate changes by interacting with the ordered water molecule.
Molecular mechanism for the differences in ligand affinity
Arabidopsis dUTPase belongs to the high-affinity group, along with the E. coli and Mycobacterium dUTPases (Table 1; Fig. 2c). Tryptophan 93 of Arabidopsis dUTPase holds serine 89 in place via a water molecule and forms a favorable conformation for substrate binding. It appears that the high-affinity dUTPases from species such as E. coli and Mycobacterium have charged or polar amino-acid substitutions corresponding to tryptophan 93 in Arabidopsis dUTPase. In contrast, the low-affinity dUTPases from humans and yeast have non-polar amino-acid substitutions. Additionally, chain C of the yeast dUTPase has the motif 2 serine residue whose side-chain oxygen atom is located away from the nitrogen atom between α,β-phosphate, and yeast dUTPase has the highest Km value among the five dUTPases compared in this study. These data suggest that the amino-acid substitution affects the hydration state at the active site and may influence ligand-binding affinity.
The structure of Arabidopsis dUTPase has been analyzed. Interestingly, this homotrimeric enzyme shows varying binding site environments with respect to their types of ligand interactions. Additionally, the tryptophan 93 substitution seems to use ordered water molecules to aid in coordinating Ser89 for initial ligand binding.
Hogrefe HH, Hansen CJ, Scott BR, Nielson KB. Archaeal dUTPase enhances PCR amplifications with archaeal DNA polymerases by preventing dUTP incorporation. Proc Natl Acad Sci USA. 2002;99(2):596–601. doi:10.1073/pnas.012372799.
Mol CD, Harris JM, McIntosh EM, Tainer JA. Human dUTP pyrophosphatase: uracil recognition by a beta hairpin and active sites formed by three separate subunits. Structure. 1996;4(9):1077–92.
Vertessy BG, Toth J. Keeping uracil out of DNA: physiological role, structure and catalytic mechanism of dUTPases. Acc Chem Res. 2009;42(1):97–106. doi:10.1021/ar800114w.
Greenberg GR, Somerville RL. Deoxyuridylate kinase activity and deoxyuridinetriphosphatase in Escherichia coli. Proc Natl Acad Sci USA. 1962;48:247–57.
Ladner RD. The role of dUTPase and uracil-DNA repair in cancer chemotherapy. Curr Protein Pept Sci. 2001;2(4):361–70.
Hassan M, Watari H, AbuAlmaaty A, Ohba Y, Sakuragi N. Apoptosis and molecular targeting therapy in cancer. Biomed Res Int. 2014;2014:150845. doi:10.1155/2014/150845.
Oliveros M, Garcia-Escudero R, Alejo A, Vinuela E, Salas ML, Salas J. African swine fever virus dUTPase is a highly specific enzyme required for efficient replication in swine macrophages. J Virol. 1999;73(11):8934–43.
Zhang Y, Moriyama H, Homma K, Van Etten JL. Chlorella virus-encoded deoxyuridine triphosphatases exhibit different temperature optima. J Virol. 2005;79(15):9945–53.
Takacs E, Nagy G, Leveles I, Harmat V, Lopata A, Toth J, et al. Direct contacts between conserved motifs of different subunits provide major contribution to active site organization in human and mycobacterial dUTPases. FEBS Lett. 2010;584(14):3047–54. doi:10.1016/j.febslet.2010.05.018.
Palmen LG, Becker K, Bulow L, Kvassman JO. A double role for a strictly conserved serine: further insights into the dUTPase catalytic mechanism. Biochemistry. 2008;47(30):7863–74. doi:10.1021/bi800325j.
Barabas O, Pongracz V, Kovari J, Wilmanns M, Vertessy BG. Structural insights into the catalytic mechanism of phosphate ester hydrolysis by dUTPase. J Biol Chem. 2004;279(41):42907–15. doi:10.1074/jbc.M406135200.
Varga B, Barabas O, Kovari J, Toth J, Hunyadi-Gulyas E, Klement E, et al. Active site closure facilitates juxtaposition of reactant atoms for initiation of catalysis by human dUTPase. FEBS Lett. 2007;581(24):4783–8. doi:10.1016/j.febslet.2007.09.005.
Dubrovay Z, Gaspari Z, Hunyadi-Gulyas E, Medzihradszky KF, Perczel A, Vertessy BG. Multidimensional NMR identifies the conformational shift essential for catalytic competence in the 60-kDa Drosophila melanogaster dUTPase trimer. J Biol Chem. 2004;279(17):17945–50. doi:10.1074/jbc.M313644200.
Barabas O, Nemeth V, Bodor A, Perczel A, Rosta E, Kele Z, et al. Catalytic mechanism of alpha-phosphate attack in dUTPase is revealed by X-ray crystallographic snapshots of distinct intermediates, 31P-NMR spectroscopy and reaction path modelling. Nucleic Acids Res. 2013;41(22):10542–55. doi:10.1093/nar/gkt756.
Toth J, Varga B, Kovacs M, Malnasi-Csizmadia A, Vertessy BG. Kinetic mechanism of human dUTPase, an essential nucleotide pyrophosphatase enzyme. J Biol Chem. 2007;282(46):33572–82. doi:10.1074/jbc.M706230200.
Lopata A, Jambrina PG, Sharma PK, Brooks BR, Toth J, Vertessy BG, Rosta E. Mutations decouple proton transfer from phosphate cleavage in the dutpase catalytic reaction. ACS Catal. 2015;5:3225–37.
Pecsi I, Leveles I, Harmat V, Vertessy BG, Toth J. Aromatic stacking between nucleobase and enzyme promotes phosphate ester hydrolysis in dUTPase. Nucleic Acids Res. 2010;38(20):7179–86. doi:10.1093/nar/gkq584.
Bajaj M, Moriyama H. Purification, crystallization and preliminary crystallographic analysis of deoxyuridine triphosphate nucleotidohydrolase from Arabidopsis thaliana. Acta Crystallogr, Sect F: Struct Biol Cryst Commun. 2007;63(Pt 5):409–11. doi:10.1107/S1744309107016004.
Tchigvintsev A, Singer AU, Flick R, Petit P, Brown G, Evdokimova E, et al. Structure and activity of the Saccharomyces cerevisiae dUTP pyrophosphatase DUT1, an essential housekeeping enzyme. Biochem J. 2011;437(2):243–53. doi:10.1042/BJ20110304.
Homma K, Moriyama H. Crystallization and crystal-packing studies of Chlorella virus deoxyuridine triphosphatase. Acta Crystallogr, Sect F: Struct Biol Cryst Commun. 2009;65(Pt 10):1030–4. doi:10.1107/S1744309109034459.
Badalucco L, Poudel I, Yamanishi M, Natarajan C, Moriyama H. Crystallization of Chlorella deoxyuridine triphosphatase. Acta Crystallogr Sect F Struct Biol Cryst Commun. 2011;67(Pt 12):1599–602. doi:10.1107/S1744309111038097.
Larsson G, Nyman PO, Kvassman JO. Kinetic characterization of dUTPase from Escherichia coli. J Biol Chem. 1996;271(39):24010–6.
Vertessy BG. Flexible glycine rich motif of Escherichia coli deoxyuridine triphosphate nucleotidohydrolase is important for functional but not for structural integrity of the enzyme. Proteins. 1997;28(4):568–79.
Adams PD, Afonine PV, Bunkoczi G, Chen VB, Davis IW, Echols N, et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr Sect D Biol Crystallogr. 2010;66(Pt 2):213–21. doi:10.1107/S0907444909052925.
Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallogr Sect D Biol Crystallogr. 2004;60(Pt 12 Pt 1):2126–32. doi:10.1107/S0907444904019158.
Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J Mol Biol. 2007;372(3):774–97. doi:10.1016/j.jmb.2007.05.022.
Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22(12):2577–637. doi:10.1002/bip.360221211.
Huang X, Miller W. A time-efficient, linear-space local similarity algorithm. Advances in Applied Mathematics. 1991;12:337–357. doi:10.1016/0196-8858(91)90017-D.
NI and MB carried out the protein crystallography. NI, KC, and MY carried out the kinetic analysis. NI and YJ carried out the molecular dynamics and structural mining. HM, DB, CPC, MKK conceived the study, designed the experiments, analyzed the data and wrote the manuscript. All authors read and approved the final manuscript.
We thank the Arabidopsis Biological Resource Center at the Ohio State University for providing us with the dUTPase clone. We also thank Dr. Javier Seravalli in the Dept. of Biochemistry at UNL for his contributions to the stopped-flow activity assays. This research used resources from the Advanced Photon Source, a US Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02-06CH11357.
The authors declare that they have no competing interests.