Expression, purification and structural analysis of the Pyrococcus abyssi RNA binding protein PAB1135
BMC Research Notes volume 3, Article number: 97 (2010)
The gene coding for the uncharacterized protein PAB1135 in the archaeon Pyrococcus abyssi is in the same operon as the ribonuclease P (RNase P) subunit Rpp30.
Here we report the expression, purification and structural analysis of PAB1135. We analyzed the interaction of PAB1135 with RNA and show that it binds efficiently double-stranded RNAs in a non-sequence specific manner. We also performed molecular modeling of the PAB1135 structure using the crystal structure of the protein Af2318 from Archaeoglobus fulgidus (2OGK) as the template.
Comparison of this model has lead to the identification of a region in PAB1135 that could be involved in recognizing double-stranded RNA.
Despite the recent progress in various genome analysis projects, about a quarter of the archaeal genomes encode functionally uncharacterized proteins, which are almost all only common to other archaeal species [1–4]. Pyrococcus abyssi PAB1135 protein function has not yet been characterized, and it is classified in the family domain of unknown function 54 (DUF54) and in the uncharacterized protein family 0201 (UPF0201). This group's members have been annotated as conserved hypothetical proteins in 46 archaeal species. Some of these proteins were annotated as possible exosome subunits (TK1451 - GI: 57641386, Thermococcus kodakarensis; MK0388 - GI: 20093826, Methanopyrus kandleri; Msp1244 - GI: 84490032, Methanosphaera stadtmanae) [4–6], but analysis of completely sequenced Pyrococcus abyssi genome revealed the presence ofPAB1135 gene in the same operon as Pa1136, the ribonuclease P (RNase P) subunit Rpp30 .
RNase P is an endoribonuclease responsible for maturation of tRNAs in all domains of life. RNase P is a ribonucleoprotein (RNP) complex, formed by one RNA molecule and a variable number of protein subunits, depending on the organism. Bacterial RNase P contains one protein, whereas the archaeal relative contains at least four proteins, and in humans, it contains at least 10 protein subunits [8, 9]. Pyrococcus horikoshii RNase P has been shown to be formed by one catalytic RNA and the proteins Ph1481, Ph1601, Ph1771, and Ph1877, which show homology to the human RNase P subunits hPop5, Rpp21, Rpp29, and Rpp30, respectively . The structures of these P. horikoshii proteins have been solved and a possible arrangement of the protein complex has been proposed .
Although the function of Pyrococcus abyssi PAB1135 has not been characterized, nor its association with RNase P complex, the genome location of the gene suggests that PAB1135 is involved in RNA metabolism. Here we show that PAB1135 binds RNA in vitro, showing higher affinity for double-stranded RNAs. In addition, structural analysis of PAB1135 by molecular modeling indicates a possible region for protein-RNA interaction.
Cloning of PAB1135 sequence
The Escherichia coli strains used in this study were DH5α and BL21-CodonPlus (DE3)-RIL (Stratagene). Plasmid DNA was extracted using Qiagen plasmid purification systems. Restriction enzymes and other DNA-modifying enzymes were used as recommended by the manufacturer (New England Biolabs). PAB1135 coding sequence was PCR-amplified from P. abyssi GE5 genomic DNA Genomic DNA (kindly provided by Dr. Patrick Forterre from Institut de Génétique et Microbiologie, Université Paris Sud, France) using primers PAB1135for (5'-GTTAGGGGGGATCC ATG GCAG-3') and PAB1135rev (5'-CGGCCTCGA GTCAAT CCTCCC-3'). The restriction sites used are underlined in the primers' sequences, and the start and stop codons are in bold. A DNA fragment of 462 bp was obtained from the PCR reaction and inserted into vector pET28a (Novagen), digested with BamH I-Xho I. A 21 kDa tagged protein His-PAB1135 was produced from this plasmid
Expression and purification of the recombinant protein
The pET28a-PAB1135 was transformed into the E. coli BL21-CodonPlus (DE3)-RIL strain. The transformed cells were grown at 37°C in 2xTY medium supplemented with 20 mg/L kanamycin and 17 mg/L chloramphenicol. The expression of His-PAB1135 was induced for two hours with 0.5 mM IPTG. Cells were harvested by centrifugation, suspended in buffer A (30 mM Tris-HCl, pH 8.0, 500 mM NaCl, 5 mM imidazole) and lysed in a French press. The lysate was heated at 85°C for 30 min and cooled on ice for 15 min. After centrifugation at 20,000 × g for 30 min, the supernatant was fractionated by affinity chromatography in Ni-NTA-agarose (Qiagen). The purified fractions were analyzed by SDS-PAGE.
Thrombin and Trypsin digestion and analysis
To remove the His-tag from PAB1135, tagged protein was incubated with thrombin for 8 h at room temperature, as recommended by the manufacturer (GE Healthcare). Trypsin treatment was performed by incubating His-PAB1135 with 0.1% trypsin solution (0.1 mg/ml) for 2 hours at room temperature, followed by the addition of 1 mM PMSF. His-PAB1135 and its tryptic cleavage products were subjected to SDS-PAGE and transferred to PVDF membranes (BioRad), which were incubated with an anti-poly-histidine antibody (GE Healthcare). The immunoblots were developed using the ECL system (GE Healthcare).
Gel filtration and circular dichroism
Gel filtration assays of trypsin-treated PAB1135 were carried out in a superdex 75 XK 16/60 column (GE Healthcare) in the presence of 50 mM Tris-HCl pH 8.0, 150 mM NaCl, 0.5 mM EDTA. Apparent molecular masses were assessed based on the retention time of the molecular mass markers (low molecular mass gel filtration calibration kits, GE Healthcare: bovine serum albumin, 67 kDa; ovalbumin, 43 kDa; chymotrypsinogen, 25 kDa; ribonuclease A, 13.7 kDa).
For the circular dichroism analysis, PAB1135 was dialyzed against buffer B (50 mM Tris-HCl pH 8.0, 200 mM NaCl, 5 mM MgCl2, 1% glycerol, 0.02% tween 20, 1 mM EDTA) and concentrated to 2 mg/ml using Amicom ultra (Millipore). The circular dichroism experiments were conducted on a JASCO 810 spectropolarimeter using a 1 mm path length cuvette. The K2d program  was used for the estimation of the percentages of protein secondary structure from circular dichroism data. Several CD curves were generated using the algorithm proposed by  to obtain best fit with the experimental data. The CD deconvolution and generation of CD curves were performed using the K2d program at http://www.embl.de/~andrade/k2d/.
RNA binding assays
RNA binding assays were carried out with 1 pmol 32P 5'-labeled oligoribonucleotides. The oligos used were: U8C5A8 (5'UUUUUUUUCCCCCAAAAAAAA3'), C8U5G8 (5'CCCCCCCCUUUUUGGGGGGGG3'), and UUA/C (5'UUAUUAUUCAUUCAUUAUUCA3'). The assays were performed as described previously [14, 15], in Tris-HCl pH 8.0, 20 mM KCl, 5 mM MnCl2, 1 mM DTT, 100 ug/ml BSA, 0.8 U Rnasin. Different amounts of trypsin-treated PAB1135 were incubated with the RNA oligos in 20 μl at 37°C for 30 minutes. The samples were resolved on 8% native polyacrylamide gels and visualized on a Phosphorimager (MolecularDynamics).
Molecular modeling and structural analysis
MODELLER [, version 9v1] was used to produce a homology molecular model of PAB1135. The structure of the conserved hypothetical protein Af2318 from Archaeoglobus fulgidus (PDB code 2OGK, ) was used as a template. This protein shares 40% identity with the PAB1135 sequence, being the top hit in a search through the PDB  using BLAST . The parameters used during the modeling exercise were the default of the programs. The alignment used in MODELLER was produced with CLUSTALX . The alignments of the N- and C-terminal regions were cut at the corresponding ends of the crystallographic model.
The homology models were validated with the VERIFY-3D  and PROCHECK  softwares. The analysis was made through visualization of the superimposed structures using PYMOL  and various alignments produced with CLUSTALX. COOT  was used for the superposition  of the atomic coordinates of the models and PDB files: 2OGK, 1JJ2, and 1MJI. DALI  was also used for analysis of correlated structures.
Purification of PAB1135
The localization of the Pyrococcus abyssi PAB1135 gene in the same operon as the RNase P subunit Rpp30  suggested the involvement of the uncharacterized PAB1135 protein in RNA metabolism. To analyze PAB1135 structure and its association with RNA, PAB1135 gene was cloned and the recombinant protein His-PAB1135 was purified by affinity chromatography. The SDS-PAGE showed protein bands with the expected molecular weight of 21 kDa (Figure 1A,B). The His-tag was released from PAB1135 after cleavage with thrombin (Figure 1C). Limited proteolysis of His-PAB1135 with trypsin resulted in the detection of a very stable protein, corresponding to PAB1135, with a molecular weight close to that expected for the native protein 17.9 kDa (Figure 1D). To ascertain the identity of the peptide released after trypsin cleavage, immunoblot was performed with anti-His antibody and the results show that anti-His is only able to detect the protein before trypsin treatment (Figure 1E,F).
Circular dichroism analysis of PAB1135 show a spectrum with double minimum at 208 and 222 nm, and positive peak near 190 nm, as expected for a protein with α/β content, indicating that PAB 1135 has a well defined structure (Figure 2A). The deconvolution of CD spectra using K2d algorithm (Figure 2A; blue line in box) indicates a >27% and >25% content for alpha and beta structures, respectively. A CD curve that better fits the experimental data corresponds to a protein containing 30% and 25% of alpha and beta structures, respectively (Figure 2A; red line in box). These estimations are in accordance with the obtained molecular model for Pa1135 protein.
Interestingly, results from gel filtration assays suggest that PAB1135 forms a homodimer in solution, with an apparent molecular weight of about 30 kDa (Figure 2B), whereas a PAB1135 monomer runs as an approximately 20 kDa protein on SDS-PAGE. This is very similar to the calculated 35.8 kDa molecular weight of a homodimer. In the conditions tested, PAB1135 appears monodisperse in solution since 100% of the mass was accounted for by a single peak at 2.7 nm. The temperature variation between 25°C and 50°C did not affect the results.
We had previously analyzed the RNA binding ability of PAB1135 and observed that it does not bind single-stranded RNA oligos, but binds efficiently RNAs that can form low stability hydrogen bonds (J.S. Luz and C.C. Oliveira, unpublished results). Here, we extended these analyses by electrophoresis mobility shift assays. Different amounts of purified PAB1135 were incubated with 32P-labeled RNA oligos. Samples were separated by electrophoresis on native polyacrylamide gels and visualized by phosphorimaging (Figure 3). Very weak shifted RNA bands can be visualized when PAB1135 is incubated with the RNA oligos U8C5A8 (low stability secondary structure; ΔG = -1.9 kcal/mol) and UUA/C (single strand) (Figure 3; lanes 1-5 and 11-15, respectively). When incubated with an RNA that forms higher stability secondary structure (oligo C8U5G8, ΔG = -15.9 kcal/mol), however, PAB1135 binds it much more efficiently, and stronger RNA shifted bands can be visualized in the presence of the protein (Figure 3; lanes 6-10). These results indicate that PAB1135 binds RNAs in a non-sequence specific manner, with higher affinity for double-stranded RNAs.
A BLAST search with the PAB1135 sequence against the PDB shows the Archaeoglobus fulgidus2OGK as the closest sequence to PAB1135, with 40% identity and 63% similarity, followed by Sulfolobus solfataricus SSO0741  with 27% identity and 48% similarity (Figure 4A). Interestingly, and similar to Pyrococcus abyssi, the gene encoding the Archaeoglobus fulgidus Af2318 protein is present in the same operon as RNase P subunit Rpp30 . Sequence analysis has led to the clustering of these proteins in the UPF0201 family . The structures of members of this protein family show a single α/β domain characterized by a twisted five-stranded anti parallel β-sheet with five α-helices on one side and an unprotected concave surface of the sheet on the other side . A homology model of the PAB1135 based on 2OGK presents good stereochemistry as judged by PROCHECK and VERIFY-3D. The model was used in a DALI search for structural neighbors, yielding a list that was topped by four UPF0201 Archaea structures followed by 1IQ4 and 1JJ2, all with Z score above 7. After these structures the Z score dropped significantly to 3.5 and sequence identity was below 19%. The first three hits were expected and did not yield new information, but the latter two structures are from the ribosomal protein L5 from Bacillus stearothermophilus and the large ribosomal subunit from Haloarcula marismortui, respectively. In the case of the whole ribosomal subunit, the similarity was also with the L5 protein (Figure 4B).
In light of the functional data showing higher affinity of PAB1135 for double-stranded RNA oligonucleotides, we superposed the structures of the PAB1135, 2OGK, 1JJ2 (only Cα atoms) and 1MJI. The latter structure, from the L5 ribosomal protein of Thermus thermophilus, has the complete protein bound to RNA. The superposition of the PAB1135 model onto the 1JJ2 structure is seen in figure 5. The concave surface of the β-sheet of 1JJ2 is responsible for binding to a double-stranded RNA (dsRNA) region of the 23S rRNA. The size and shape are similar to the surface of PAB1135 although the nature of the residues does not have a clear equivalence (Figure 5). Analysis of the concave surface in the model shows a patch of basic residues formed by Arg11, Arg13, Arg106, His127, Arg129 and Lys131 on the exposed surface of the β-sheet and Lys65 and Lys133 on nearby loops (Figure 5B). Based on the presence of these residues we raised the hypothesis that this region could complex with and stabilize other negatively charged molecules such as the outside of a double-stranded RNA. These residues are not all present in other members of the UPF0201 family which might not bind to RNA. In addition to binding the 23S rRNA, the Thermus thermophilus L5 ribosomal protein also interacts with the 5S rRNA through a portion of the protein that is not present in the PAB1135 model and therefore was not considered a good RNA binding site.
We show that PAB1135 is highly conserved and its structure can be inferred by molecular modeling based on the crystal structures of archaeal UPF0201 proteins. Furthermore, as shown here, PAB1135 binds RNA in vitro, with higher affinity for structured RNAs, in accordance with the model's suggestion for the presence of an RNA binding scaffold in the protein. It is possible that PAB1135 binds structured RNAs in vivo, such as RNase P RNA and tRNAs.
Cohen GN, Barbe V, Flament D, Galperin M, Heilig R, Lecompte O, Poch O, Prieur D, Quérellou J, Ripp R, Thierry JC, Oost Van der J, Weissenbach J, Zivanovic Y, Forterre P: An integrated analysis of the genome of the hyperthermophilic archaeon Pyrococcus abyssi. Mol Microbiol. 2003, 47: 1495-1512. 10.1046/j.1365-2958.2003.03381.x.
Bult CJ, White O, Olsen GJ, Zhou L, Fleischmann RD, Sutton GG, Blake JA, FitzGerald LM, Clayton RA, Gocayne JD, Kerlavage AR, Dougherty BA, Tomb JF, Adams MD, Reich CI, Overbeek R, Kirkness EF, Weinstock KG, Merrick JM, Glodek A, Scott JL, Geoghagen NS, Venter JC: Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii. Science. 1996, 273: 1058-1073. 10.1126/science.273.5278.1058.
She Q, Singh RK, Confalonieri F, Zivanovic Y, Allard G, Awayez MJ, Chan-Weiher CC, Clausen IG, Curtis BA, De Moors A, Erauso G, Fletcher C, Gordon PM, Heikamp-de Jong I, Jeffries AC, Kozera CJ, Medina N, Peng X, Thi-Ngoc HP, Redder P, Schenk ME, Theriault C, Tolstrup N, Charlebois RL, Doolittle WF, Duguet M, Gaasterland T, Garrett RA, Ragan MA, Sensen CW, Oost Van der J: The complete genome of the crenarchaeon Sulfolobus solfataricus P2. PNAS. 1996, 98: 7835-7840. 10.1073/pnas.141222098.
Fukui T, Atomi H, Kanai T, Matsumi R, Fujiwara S, Imanaka T: Complete genome sequence of the hyperthermophilic archaeon Thermococcus kodakaraensis KOD1 and comparison with Pyrococcus genomes. Genome Res. 2005, 15: 352-363. 10.1101/gr.3003105.
Slesarev AI, Mezhevaya KV, Makarova KS, Polushin NN, Shcherbinina OV, Shakhova VV, Belova GI, Aravind L, Natale DA, Rogozin IB, Tatusov RL, Wolf YI, Stetter KO, Malykh AG, Koonin EV, Kozyavkin SA: The complete genome of hyperthermophile Methanopyrus kandleri AV19 and monophyly of archaeal methanogens. Proc Natl Acad Sci. 2002, 99: 4644-4649. 10.1073/pnas.032671499.
Fricke WF, Seedorf H, Henne A, Kruer M, Liesegang H, Hedderich R, Gottschalk G, Thauer RK: The Genome Sequence of Methanosphaera stadtmanae Reveals Why This Human Intestinal Archaeon Is Restricted to Methanol and H2 for Methane Formation and ATP Synthesis. J Bacteriol. 2006, 188: 642-658. 10.1128/JB.188.2.642-658.2006.
Koonin EV, Wolf YI, Aravind L: Prediction of the archaeal exosome and its connections with the proteasome and the translation and transcription machineries by a comparative-genomic approach. Genome Res. 2001, 11: 240-252. 10.1101/gr.162001.
Evans D, Marquez SM, Pace NR: RNase P: interface or the RNA and protein worlds. TIBS. 2006, 31: 333-341.
Altman S: A view of RNase P. Mol Biosyst. 2007, 3: 604-607. 10.1039/b707850c.
Kouzuma Y, Mizguchi M, Takagi H, Fukuhara H, Tsukamoto M, Numata T, Kimura M: Reconstitution of archaeal ribonuclease P from RNA and four protein components. Bichem Bioph Res Commun. 2003, 306: 666-673. 10.1016/S0006-291X(03)01034-9.
Kawano S, Nakeshima T, Kakuta Y, Tanaka I, Kimura M: Crystal structure of protein Ph1481p in complex with protein Ph1877p of archaeal RNase P from Pyrococcus horikoshii OT3: implication of dimer formation of the holoenzyme. J Mol Biol. 2006, 257: 583-591. 10.1016/j.jmb.2005.12.086.
Andrade MA, Chacón P, Merelo JJ, Morán F: Evaluation of secondary structure of proteins from UV circular dichroism spectra using an unsupervised learning neural network. Protein Eng. 1993, 6: 383-390. 10.1093/protein/6.4.383.
Chang CT, Wu CS, Yang JT: Circular dichroic analysis of protein conformation: inclusion of the beta-turns. Anal Biochem. 1978, 91: 13-31. 10.1016/0003-2697(78)90812-6.
Luz JS, Tavares JR, Gonzales FA, Santos MCT, Oliveira CC: Analysis of the Saccharomyces cerevisiae exosome architecture and of the RNA binding activity of Rrp40p. Biochimie. 2007, 89: 686-691. 10.1016/j.biochi.2007.01.011.
Ramos CRR, Oliveira CLP, Torriani IL, Oliveira CC: The Pyrococcus exosome complex: Structural and functional characterization. J Biol Chem. 2006, 281: 6751-6759. 10.1074/jbc.M512495200.
Sali A, Blundell TL: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993, 234: 779-815. 10.1006/jmbi.1993.1626.
Rao KN, Burley SK, Swaminathan S: UPF201 archaeal specific family members reveal structural similarity to RNA-binding proteins but low likelihood for RNA-binding function. PLoS One. 2008, 3: e3903-10.1371/journal.pone.0003903.
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Research. 2000, 28: 235-242. 10.1093/nar/28.1.235.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Research. 1997, 24: 4876-4882. 10.1093/nar/25.24.4876.
Bowie JU, Luthy R, Eisenberg D: A method to identify protein sequences that fold into a known three-dimensional structure. Science. 1991, 253: 164-170. 10.1126/science.1853201.
Laskowski RA, MacArthur MW, Moss DS, Thornton JM: PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Cryst. 1993, 26: 283-291. 10.1107/S0021889892009944.
DeLano WL: The PyMOL Molecular Graphics System (2002) on World Wide Web. [http://www.pymol.org]
Emsley P, Cowtan K: Coot: model-building tools for molecular graphics. Acta Crystallogr. 2004, 60: 2126-2132.
Krissinel E, Henrick K: Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr. 2004, 60: 2256-2268.
Ban N, Nissen P, Hansen J, Moore PB, Steitz TA: The complete atomic structure of the large ribosomal subunit at 2.4 a resolution. Science. 2000, 289: 905-920. 10.1126/science.289.5481.905.
Perederina A, Nevskaya N, Nikonov O, Nikulin A, Dumas P, Yao M, Tanaka I, Garber M, Gongadze G, Nikonov S: Detailed analysis of RNA-protein interactions within the bacterial ribosomal protein L5/5S rRNA complex. RNA. 2002, 8: 1548-1557.
Holm L, Sander C: Searching protein structure databases has come of age. Proteins. 1994, 19: 165-173. 10.1002/prot.340190302.
This work was supported by a FAPESP grant (07/57096-9 to C.C.O.). J.S.L. was recipient of a FAPESP fellowship.
The authors declare that they have no competing interests.
JSL purified the recombinant proteins and carried out the activity assays, protein interaction assays and drafted portions of the manuscript. CRRR, stablished the protein purification protocols, helped with the CD analysis, and drafted portions of the manuscript. JARGB coordinated parts of the work, performed the molecular modeling and structural analysis, participated in the interpretation of data and drafted portions of the manuscript. CCO designed, organized and coordinated the experiments, drafted the manuscript and edited the final text. All authors read and approved the final manuscript.
About this article
Cite this article
Luz, J.S., Barbosa, J.A., Ramos, C.R. et al. Expression, purification and structural analysis of the Pyrococcus abyssi RNA binding protein PAB1135. BMC Res Notes 3, 97 (2010). https://doi.org/10.1186/1756-0500-3-97