PolyCTLDesigner: a computational tool for constructing polyepitope T-cell antigens
© Antonets and Bazhan; licensee BioMed Central Ltd. 2013
Received: 14 February 2013
Accepted: 24 September 2013
Published: 10 October 2013
Construction of artificial polyepitope antigens is one of the most promising strategies for developing more efficient and safer vaccines evoking T-cell immune responses. Epitope rearrangements and utilization of certain spacer sequences have been proven to greatly influence the immunogenicity of polyepitope constructs. However, despite numerous efforts towards constructing and evaluating artificial polyepitope immunogens as well as despite numerous computational methods elaborated to date for predicting T-cell epitopes, peptides binding to TAP and for antigen processing prediction, only a few computational tools were currently developed for rational design of polyepitope antigens.
Here we present a PolyCTLDesigner program that is intended for constructing polyepitope immunogens. Given a set of either known or predicted T-cell epitopes the program selects N-terminal flanking sequences for each epitope to optimize its binding to TAP (if necessary) and joins resulting oligopeptides into a polyepitope in a way providing efficient liberation of potential epitopes by proteasomal and/or immunoproteasomal processing. And it also tries to minimize the number of non-target junctional epitopes resulting from artificial juxtaposition of target epitopes within the polyepitope. For constructing polyepitopes, PolyCTLDesigner utilizes known amino acid patterns of TAP-binding and proteasomal/immunoproteasomal cleavage specificity together with genetic algorithm and graph theory approaches. The program was implemented using Python programming language and it can be used either interactively or through scripting, which allows users familiar with Python to create custom pipelines.
The developed software realizes a rational approach to designing poly-CTL-epitope antigens and can be used to develop new candidate polyepitope vaccines. The current version of PolyCTLDesigner is integrated with our TEpredict program for predicting T-cell epitopes, and thus it can be used not only for constructing the polyepitope antigens based on preselected sets of T-cell epitopes, but also for predicting cytotoxic and helper T-cell epitopes within selected protein antigens. PolyCTLDesigner is freely available from the project’s web site: http://tepredict.sourceforge.net/PolyCTLDesigner.html.
One of the most promising approaches to designing more efficient and safer vaccines is construction of artificial polyepitope antigens [1–6]. Their advantages over conventional vaccines include reduced risk of developing autoimmunity and other pathological conditions since such constructions doesn’t contain whole microbial molecular structures and epitopes, sharing profound similarity to human proteins. Besides, polyepitopes may contain both cytotoxic (CTL) and T-helper epitopes belonging to different antigens, including those of distinct pathogenic microorganisms, thus making possible to induce immune responses with a wider specificity. In addition, polyepitopes may be designed taking into account the prevalences of certain HLA class I molecules allomorphs within the target human population or even within an individual patient. Polyepitopes may also be constructed in a way maximizing efficiency of processing and presentation of the majority of included epitopes [4, 7–9]. Additional signal sequences (for example, N-terminal ubiquitin, N-terminal leader peptide, and C-terminal fragment of human LAMP-1 protein) could be introduced into target polyepitopes to increase their efficiency of stimulating either CD8+ and/or CD4+ T-cell response [10–14]. However, despite numerous efforts towards constructing artificial polyepitope immunogens and evaluating their immunogenicity and protectivity [2, 4–9, 15, 16] and despite numerous computational methods developed to date for predicting T-cell epitopes [17–20], proteasomal cleavage sites [21–25], and peptide binding to TAP (transporters associated with antigen processing) [24, 26–28], only a few computational tools intended for rational design of polyepitope T-cell immunogens were developed to date [29, 30].
Both interaction of peptides with MHC molecules and peptides binding to TAP complex are sufficiently specific for certain amino acid patterns. Proteasomal and immunoproteasomal cleavage sites are also to a considerable degree determined by the amino acid sequences of antigens and degenerate amino acid motifs that determine the efficiency of proteasomal cleavage sites are currently known [22, 31] as well as the motifs determining the affinities of oligopeptides binding to TAP [26–28]. It has been shown that a concurrent prediction of peptide binding affinities for MHC molecules and TAP decreases false positive rate when predicting T-cell epitopes [26, 27]. As it has been experimentally demonstrated, that introduction of spacer sequences to between individual epitopes could considerably increase the ability of such polyepitope constructs to induce cytotoxic T-cell immune responses [4, 7, 9]. Epitope rearrangements within the polyepitopic constructs were also found to significantly influence their immunogenicity and it was also hypothesized that while constructing the polyepitope antigen one should minimize the number of nontarget junctional epitopes .
It has been shown that longer peptides transported into the endoplasmic reticulum (ER) undergo N-terminal trimming by ER aminopeptidases (ERAPs) to allow them to bind MHC class I molecules , and that C-termini of the epitopes are preferentially generated by proteasomal cleavage [33–35]. Thus individual epitopes within a polyepitope should be arranged in a way providing sufficiently efficient proteasomal cleavage sites at their C-termini while, if necessary, their N-termini might be flanked with certain amino acid residues to optimize their binding to TAP.
The goal of this work was to develop a program intended for rational design of polyepitope T-cell antigens with a special focus on optimizing their immunogenicity via selecting amino acid spacer sequences for each pair of epitopes and choosing the optimal ordering of the epitopes within the polyepitope.
A program named PolyCTLDesigner was developed. Given a set of either known or predicted CTL epitopes, PolyCTLDesigner predicts affinity of their binding to TAP and then N-termini of inefficient binders are extended with certain flanking residues using the model created by Peters et al. , implying that the first three N-terminal amino acid residues of the peptide and the last C-terminal one are the major contributors to its binding to TAP.
where W is the weight (rank) of spacer sequence ss between the epitopes pep1 and pep2; rank HLA is the rank of non-target junctional epitope predicted to be the most efficient binder for HLA class I allele HLA; freq HLA is the genotypic frequency of that allele within the population of interest (HLA alleles genotypic frequencies were taken from dbMHC ); len(ss) is the length of spacer ss; rank pr corresponds to the rank of proteasomal cleavage site predicted at the pep1 C-terminus (this value ranges from 1 to 11 with 1 and 11 corresponding to the most and the least efficient proteasomal cleavage, respectively); rank impr is the rank of immunoproteasomal cleavage site; designates the mean rank HLA value; N eps is the number of predicted junctional epitopes and N HLA is the number of HLA alleles predicted to bind non-target epitopes with sufficient affinity (currently PolyCTLDesigner predicts T-cell epitopes with our program TEpredict , that was recently updated); rank HLA value of 1 corresponds to moderate binding affinity (6.3 ≤ pIC50 < 7.3), the value of 2 corresponds to high affinity (7.3 ≤ pIC50 < 8.3) and 3 corresponds to the highest affinity (with predicted pIC50 value ≥ 8.3). Thus the optimal spacer sequence should have the least weight.
Building the graph is the most time consuming step and for a set of 40–50 peptides it can take about 6–8 hours to be completed on typical desktop personal computer. However in future we plan to implement parallelized algorithm to reduce time consumption.
To illustrate the importance of poly-CTL-epitope optimization we performed a small theoretical analysis: six well-defined HLA-A*02:01-restricted HIV-1 CTL epitopes were used to produce artificial polyepitopes using our PolyCTLDesigner program. It was found that probability of selecting an optimal epitopes permutation at random was less than 0.00139 and only 17% of all possible polyepitope constructs did not contain inefficient proteasomal cleavage sites between target CTL epitopes. Besides, the choice of spacer sequences can have a great impact on proteasomal cleavage efficiency, especially for certain peptides. The detailed description of the study and its results can be found in Additional file 1.
In addition, PolyCTLDesigner is also able to assist in constructing polyepitope fragments containing T-helper epitopes. Currently T-helper epitopes can be predicted with ProPred models  (based on TEpitope models ). From a set of proposed antigens the program selects peptide fragments having 20–40 amino acid residues in length which contain the maximal number of overlapping T-helper epitopes restricted by maximal repertoire of HLA class II allomorphs. Then, each fragment is extended by five amino acid residues at both C- and N-terminals, since residues flanking the core epitope can play an important role in binding to T-cell receptors of CD4+ T lymphocytes . The fragments containing T-helper epitopes can be joined through [KR][KR] motif, which form cleavage sites for several lysosomal cathepsins involved in antigen processing. It has been shown that such motifs can increase the immunogenicity of individual T-cell epitopes [42, 43].
The developed software realizes a rational approach to designing highly immunogenic poly-CTL-epitope antigens and can be used to develop new candidate polyepitope vaccines. The current version of PolyCTLDesigner is integrated with our TEpredict program for predicting T-cell epitopes, and thus it can be used not only for constructing the polyepitope antigens based on preselected sets of T-cell epitopes, but also for predicting cytotoxic and helper T-cell epitopes within selected protein antigens. In addition, PolyCTLDesigner allows the user to select a minimal set of epitopes covering a specified repertoire of allelic variants of HLA molecules with a desired level of redundancy. The program was implemented using Python programming language and can be used either interactively or through scripting, which allows the users familiar with Python to create custom pipelines. PolyCTLDesigner source code can be found in Additional file 2 and it is also freely available at the project’s web site http://tepredict.sourceforge.net/PolyCTLDesigner.html.
PolyCTLDesigner was implemented using the Python programming language. The affinity of peptides binding to TAP was predicted and the flanking sequences were selected using the model developed by Peters et al. . The models by Toes et al.  were used to predict proteasomal and/or immunoproteasomal processing and to select spacer sequences to optimize polyepitope processing. The procedures involving graphs were realized using Python graph library . The poly-CTL-epitope fragment was constructed with the help of genetic algorithm using the PyEvolve library . Biopython library was used to read amino acid sequences written in Fasta or GenBank format . T-helper epitope predictions were based on ProPred models . CTL epitopes predictions were made using our program TEpredict .
Availability and requirements
Project name: PolyCTLDesigner
Project home page: http://polyctldesigner.sourceforge.net
Operating system(s): platform-independent
Programming language: Python
Other requirements: Python 2.7, TEpredict (provided with the PolyCTLDesigner), Biopython, NumPy, PyEvolve, and Python graph
License: Creative Commons Attribution Non-Commercial License V2.0 (CC BY-NC 2.0)
Any restrictions to use by non-academics: see CC BY-NC 2.0 license
This work was supported by the Ministry of Education and Science of the Russian Federation (contracts #16.512.11.2186 and #8289) and by Russian Foundation for Basic Research (grant #12-04-31746 mol_a).
- Berzofsky JA, Berkower IJ: Novel approaches to peptide and engineered protein vaccines for HIV using defined epitopes: advances in 1994–1995. AIDS. 1995, 9 (A): S143-S157.PubMedGoogle Scholar
- Woodberry T, Gardner J, Mateo L, Eisen D, Medveczky J, Ramshaw IA, Thomson SA, Ffrench RA, Elliott SL, Firat H, Lemonnier FA, Suhrbier A: Immunogenicity of a human immunodeficiency virus (HIV) polytope vaccine containing multiple HLA A2 HIV CD8(+) cytotoxic T-cell epitopes. J Virol. 1999, 73: 5320-5325.PubMedPubMed CentralGoogle Scholar
- Bazhan SI, Belavin PA, Seregin SV, Danilyuk NK, Babkina IN, Karpenko LI, Nekrasova NA, Lebedev LR, Ignatyev GM, Agafonov AP, Poryvaeva VA, Aborneva IV, Ilyichev AA: Designing and engineering of DNA-vaccine construction encoding multiple CTL-epitopes of major HIV-1 antigens. Vaccine. 2004, 22: 1672-1682. 10.1016/j.vaccine.2003.09.048.PubMedView ArticleGoogle Scholar
- Cardinaud S, Bouziat R, Rohrlich P-S, Tourdot S, Weiss L, Langlade-Demoyen P, Burgevin A, Fiorentino S, Van Endert P, Lemonnier FA: Design of a HIV-1-derived HLA-B07.02-restricted polyepitope construct. AIDS. 2009, 23: 1945-1954. 10.1097/QAD.0b013e32832fae88.PubMedView ArticleGoogle Scholar
- Iglesias MC, Mollier K, Beignon A-S, Souque P, Adotevi O, Lemonnier F, Charneau P: Lentiviral vectors encoding HIV-1 polyepitopes induce broad CTL responses in vivo. Molec Ther. 2007, 15: 1203-1210.Google Scholar
- Tine JA, Firat H, Payne A, Russo G, Davis SW, Tartaglia J, Lemonnier FA, Demoyen PL, Moingeon P: Enhanced multiepitope-based vaccines elicit CD8+ cytotoxic T cells against both immunodominant and cryptic epitopes. Vaccine. 2005, 23: 1085-1091. 10.1016/j.vaccine.2003.01.001.PubMedView ArticleGoogle Scholar
- Bazhan SI, Karpenko LI, Ilyicheva TN, Belavin PA, Seregin SV, Danilyuk NK, Antonets DV, Ilyichev AA: Rational design based synthetic polyepitope DNA vaccine for eliciting HIV-specific CD8+ T cell responses. Molec Immunol. 2010, 47: 1507-1515. 10.1016/j.molimm.2010.01.020.View ArticleGoogle Scholar
- Ishioka GY, Fikes J, Hermanson G, Livingston B, Crimi C, Qin M, Del Guercio MF, Oseroff C, Dahlberg C, Alexander J, Chesnut RW, Sette A: Utilization of MHC class I transgenic mice for development of minigene DNA vaccines encoding multiple HLA-restricted CTL epitopes. J Immunol. 1999, 162: 3915-3925.PubMedGoogle Scholar
- Livingston BD, Newman M, Crimi C, McKinney D, Chesnut R, Sette A: Optimization of epitope processing enhances immunogenicity of multiepitope DNA vaccines. Vaccine. 2001, 19: 4652-4660. 10.1016/S0264-410X(01)00233-X.PubMedView ArticleGoogle Scholar
- Bonehill A, Heirman C, Tuyaerts S, Michiels A, Breckpot K, Brasseur F, Zhang Y, Van Der Bruggen P, Thielemans K: Messenger RNA-electroporated dendritic cells presenting MAGE-A3 simultaneously in HLA class I and class II molecules. J Immunol. 2004, 172: 6649-6657.PubMedView ArticleGoogle Scholar
- Bonini C, Lee SP, Riddell SR, Greenberg PD: Targeting antigen in mature dendritic cells for simultaneous stimulation of CD4+ and CD8+ T cells. J Immunol. 2001, 166: 5250-5257.PubMedView ArticleGoogle Scholar
- Su Z, Vieweg J, Weizer AZ, Dahm P, Yancey D, Turaga V, Higgins J, Boczkowski D, Gilboa E, Dannull J: Enhanced induction of telomerase-specific CD4(+) T cells using dendritic cells transfected with RNA encoding a chimeric gene product. Cancer Res. 2002, 62: 5041-5048.PubMedGoogle Scholar
- Kim TW, Hung C-F, Boyd D, Juang J, He L, Kim JW, Hardwick JM, Wu T-C: Enhancing DNA vaccine potency by combining a strategy to prolong dendritic cell life with intracellular targeting strategies. J Immunol. 2003, 171: 2970-2976.PubMedView ArticleGoogle Scholar
- Fassnacht M, Lee J, Milazzo C, Boczkowski D, Su Z, Nair S, Gilboa E: Induction of CD4(+) and CD8(+) T-cell responses to the human stromal antigen, fibroblast activation protein: implication for cancer immunotherapy. Clini Cancer Res. 2005, 11: 5566-5571. 10.1158/1078-0432.CCR-05-0699.View ArticleGoogle Scholar
- Mateo L, Gardner J, Chen Q, Schmidt C, Down M, Elliott SL, Pye SJ, Firat H, Lemonnier FA, Cebon J, Suhrbier A: An HLA-A2 polyepitope vaccine for melanoma immunotherapy. Journal of immunology. 1999, 163: 4058-4063.Google Scholar
- Smith SG, Patel PM, Porte J, Selby PJ, Jackson AM: Human dendritic cells genetically engineered to express a melanoma polyepitope DNA vaccine induce multiple cytotoxic T-cell responses. Clin Cancer Res. 2001, 7: 4253-4261.PubMedGoogle Scholar
- Singh H, Raghava GPS: ProPred1: prediction of promiscuous MHC Class-I binding sites. Bioinformatics. 2003, 19: 1009-1014. 10.1093/bioinformatics/btg108.PubMedView ArticleGoogle Scholar
- Lundegaard C, Lamberth K, Harndahl M, Buus S, Lund O, Nielsen M: NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8–11. Nuc Acids Res. 2008, 36: W509-W512. 10.1093/nar/gkn202.View ArticleGoogle Scholar
- Zhang GL, Ansari HR, Bradley P, Cawley GC, Hertz T, Hu X, Jojic N, Kim Y, Kohlbacher O, Lund O, Lundegaard C, Magaret CA, Nielsen M, Papadopoulos H, Raghava GPS, Tal V-S, Xue LC, Yanover C, Zhu S, Rock MT, Crowe JE, Panayiotou C, Polycarpou MM, Duch W, Brusic V: Machine learning competition in immunology - prediction of HLA class I binding peptides. J Immunol Methods. 2011, 374: 1-4. 10.1016/j.jim.2011.09.010.PubMedView ArticleGoogle Scholar
- Bhasin M, Raghava GPS: A hybrid approach for predicting promiscuous MHC class I restricted T cell epitopes. J Biosci. 2007, 32: 31-42. 10.1007/s12038-007-0004-5.PubMedView ArticleGoogle Scholar
- Kuttler C, Nussbaum AK, Dick TP, Rammensee H, Hadeler K: An algorithm for the prediction of proteasomal cleavages. J Mol Biol. 2000, 301: 229-View ArticleGoogle Scholar
- Toes RE, Nussbaum AK, Degermann S, Schirle M, Emmerich NP, Kraft M, Laplace C, Zwinderman A, Dick TP, Müller J, Schönfisch B, Schmid C, Fehling HJ, Stevanovic S, Rammensee HG, Schild H: Discrete cleavage motifs of constitutive and immunoproteasomes revealed by quantitative analysis of cleavage products. J Exp Med. 2001, 194: 1-12. 10.1084/jem.194.1.1.PubMedPubMed CentralView ArticleGoogle Scholar
- Nussbaum AK, Kuttler C, Hadeler KP, Rammensee HG, Schild H: PAProC: a prediction algorithm for proteasomal cleavages available on the WWW. Immunogenetics. 2001, 53: 87-94. 10.1007/s002510100300.PubMedView ArticleGoogle Scholar
- Larsen MV, Lundegaard C, Lamberth K, Buus S, Brunak S, Lund O, Nielsen M: An integrative approach to CTL epitope prediction: a combined algorithm integrating MHC class I binding, TAP transport efficiency, and proteasomal cleavage predictions. Eur J Immunol. 2005, 35: 2295-2303. 10.1002/eji.200425811.PubMedView ArticleGoogle Scholar
- Nielsen M, Lundegaard C, Lund O, Keşmir C: The role of the proteasome in generating cytotoxic T-cell epitopes: insights obtained from improved predictions of proteasomal cleavage. Immunogenetics. 2005, 57: 33-41. 10.1007/s00251-005-0781-7.PubMedView ArticleGoogle Scholar
- Peters B, Bulik S, Tampe R, Van Endert PM, Holzhütter H-G: Identifying MHC class I epitopes by predicting the TAP transport efficiency of epitope precursors. J Immunol. 2003, 171: 1741-1749.PubMedView ArticleGoogle Scholar
- Doytchinova I, Hemsley S, Flower DR: Transporter associated with antigen processing preselection of peptides binding to the MHC: a bioinformatic evaluation. J Immunol. 2004, 173: 6813-6819.PubMedView ArticleGoogle Scholar
- Ren Y, Wu B, Pan Y, Lv F, Kong X, Luo X, Li Y, Yang Q: Characterization of the binding profile of peptide to transporter associated with antigen processing (TAP) using Gaussian process regression. Computers Biol Med. 2011, 41: 865-870. 10.1016/j.compbiomed.2011.07.004.View ArticleGoogle Scholar
- He Y, Rappuoli R, De Groot AS, Chen RT: Emerging vaccine informatics. J Biomed Biotechnol. 2010, 2010: 218590-PubMedPubMed CentralGoogle Scholar
- Lee Y, Ferrari G, Lee SC: Estimating design space available for polyepitopes through consideration of major histocompatibility complex binding motifs. Biomed Microdevices. 2010, 12: 207-222. 10.1007/s10544-009-9376-7.PubMedView ArticleGoogle Scholar
- Altuvia Y, Margalit H: Sequence signals for generation of antigenic peptides by the proteasome: implications for proteasomal cleavage mechanism. J Mol Biol. 2000, 295: 879-890. 10.1006/jmbi.1999.3392.PubMedView ArticleGoogle Scholar
- Serwold T, Gonzalez F, Kim J, Jacob R, Shastri N: ERAAP customizes peptides for MHC class I molecules in the endoplasmic reticulum. Nature. 2002, 419: 480-483. 10.1038/nature01074.PubMedView ArticleGoogle Scholar
- Neefjes J, Jongsma MLM, Paul P, Bakke O: Towards a systems understanding of MHC class I and MHC class II antigen presentation. Nat Rev Immunol. 2011, 11: 823-836.PubMedGoogle Scholar
- Mo XY, Cascio P, Lemerise K, Goldberg AL, Rock K: Distinct proteolytic processes generate the C and N termini of MHC class I-binding peptides. J Immunol. 1999, 163: 5851-5859.PubMedGoogle Scholar
- Craiu A, Akopian T, Goldberg A, Rock KL: Two distinct proteolytic processes in the generation of a major histocompatibility complex class I-presented peptide. Proc Natl Acad Sci U S A. 1997, 94: 10850-10855. 10.1073/pnas.94.20.10850.PubMedPubMed CentralView ArticleGoogle Scholar
- Meyer D, Singe RM, Mack SJ, Lancaster A, Nelson MP, Erlich H, Fernandez-Vina M, Thomson G: Single Locus Polymorphism of Classical HLA Genes. Immunobiology of the Human MHC: Proceedings of the 13th International Histocompatibility Workshop and Conference, Volume I. Edited by: Hansen JA. 2007, Seattle, WA: IHWG Press, 653-704.Google Scholar
- Antonets DV, Maksyutov AZ: TEpredict: Software for T-Cell epitope prediction. Mol Biol. 2010, 44: 119-127. 10.1134/S0026893310010152.View ArticleGoogle Scholar
- Pyevolve: Genetic Algorithm framework written in pure python.http://pyevolve.sourceforge.net,
- Singh H, Raghava GP: ProPred: prediction of HLA-DR binding sites. Bioinformatics. 2001, 17: 1236-1237. 10.1093/bioinformatics/17.12.1236.PubMedView ArticleGoogle Scholar
- Sturniolo T, Bono E, Ding J, Raddrizzani L, Tuereci O, Sahin U, Braxenthaler M, Gallazzi F, Protti MP, Sinigaglia F, Hammer J: Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices. Nature Biotechnol. 1999, 17: 555-561. 10.1038/9858.View ArticleGoogle Scholar
- Rudolph MG, Stanfield RL, Wilson IA: How TCRs bind MHCs, peptides, and coreceptors. Ann Rev Immunol. 2006, 24: 419-466. 10.1146/annurev.immunol.23.021704.115658.View ArticleGoogle Scholar
- Schneider SC, Ohmen J, Fosdick L, Gladstone B, Guo J, Ametani A, Sercarz EE, Deng H: Cutting edge: introduction of an endopeptidase cleavage motif into a determinant flanking region of hen egg lysozyme results in enhanced T cell determinant display. J Immunol. 2000, 165: 20-23.PubMedView ArticleGoogle Scholar
- Zhu H, Liu K, Cerny J, Imoto T, Moudgil KD: Insertion of the dibasic motif in the flanking region of a cryptic self-determinant leads to activation of the epitope-specific T cells. J Immunol. 2005, 175: 2252-2260.PubMedView ArticleGoogle Scholar
- python-graph - a library for working with graphs in Python.http://code.google.com/p/python-graph/,
- Cock PJA, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, Friedberg I, Hamelryck T, Kauff F, Wilczynski B, De Hoon MJL: Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009, 25: 1422-1423. 10.1093/bioinformatics/btp163.PubMedPubMed CentralView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.