- Research Note
- Open access
- Published:
A small stretch of poor codon usage at the beginning of dengue virus open reading frame may act as a translational checkpoint
BMC Research Notes volume 16, Article number: 359 (2023)
Abstract
Objective
Rare codons were previously shown to be enriched at the beginning of the dengue virus (DENV) open reading frame. However, the role of rare codons in regulating translation efficiency and replication of DENV remains unclear. The present study aims to clarify the significance of rare codon usage at the beginning of DENV transcripts using the codon adaptation index (CAI).
Methodology
CAIs of the whole starting regions of DENV transcripts as well as 18-codon sliding windows of the regions were analyzed.
Results
One of the intriguing findings is that those rare codons do not typically result in uniformly low CAI in the starting region with rare codons. However, it shows a notable local drop in CAI around the 50th codon in all dengue serotypes. This suggests that there may be a translational checkpoint at this site and that the rare codon usage upstream to this checkpoint may not be related to translational control.
Introduction
In the standard genetic code table, 61 codons correspond to the 20 amino acids. Even though synonymous codons encode the same amino acid, their distribution in the genomes of all organisms is not random causing codon usage bias [1].
Codon usage bias differs among genomes of species including viruses [1, 2], and can differ even among gene groups within the same genome [3, 4]. However, some genes may show little or no codon usage bias [5]. Most viruses have codon usage pattern unrelated to their host [6, 7]. Viral codon usage may depend on their genome composition and the abundance of tRNA [8]. Codon usage may affect gene expression at both the transcriptional and translational levels [2]. Optimal codons match with abundant tRNAs and properly interact with anticodons leading to efficient translation [9]. In contrast, the use of rare codons results in a decrease in translation rate [10].
Rare codons can be found in the genomes of a wide range of organisms [11] and are also common in viruses. Previous studies showed that rare codons are over-represented in the initiation site of DENV and West Nile virus mRNA [12, 13], but not in the hepatitis C virus [14]. It was proposed that these rare codons may provide a translational regulation [12, 13].
Tuller et al. [15]. previously showed that the first 30–50 codons of genes are translated with a low efficiency corresponding to low abundant tRNAs by measuring the tRNA adaptation index (tAI). They proposed the “ribosomal traffic rules” so that the rare codons serve as a ramp, in which ribosomes pause and form a queue thereby preventing ribosome collisions. Here, we attempt to understand the role of rare codons at the beginning of the DENV open reading frame by investigating their CAI.
Materials and methods
DENV genome sequence data
A total of 160 strains of the four DENV serotypes (DENV1-4) were used in this study (Additional file 1: Table S1). We randomly selected these strains from geographically isolated regions of the world, such as Africa, Asia, Europe, North America, South America, the Caribbean, and the Pacific. The datasets for the strain, isolated region, year of isolation, genome length, and GenBank accession number are also shown in this table. The complete DENV genome sequences were obtained from the National Center for Biotechnology Information (NCBI) GenBank database.
Relative synonymous codon usage analysis
The relative synonymous codon usage (RSCU) analysis is used to measure the degree of codon usage bias for each codon of each amino acid in a coding sequence. To characterize the synonymous codon usage bias in the various initiation sites of the four DENV serotype coding sequences, the first 25, 50, 75, and 100 codon sites, as well as the entire DENV genome, were prepared as follows: A total of 40 strains of each serotype were excised at the indicated individual codon positions from the start codon, followed by assembly of each serotype strain using BioEdit version 7.2.5. The RSCU values of each codon were calculated using the automated codon usage analysis software (ACUA version 1.0), which can be downloaded from http://www.bioinsilico.com/acua [16]. The synonymous codons with RSCU values > 1.6 and < 0.6 were over- and under-represented codons, respectively, while codons with RSCU values between 0.6 and 1.6 are considered unbiased or randomly used [3, 14, 17,18,19]. The RSCU values of human highly-expressed genes were calculated using the following formula:
where Xi is the number of occurrences of synonymous codon i, and n is the number of synonymous codons for that amino acid [9].
Codon adaptation index analysis
The codon adaptation index (CAI) analysis was used to measure synonymous codon usage bias in a coding sequence, gene expression level, and translation efficiency [20, 21]. The CAI value of a coding sequence was calculated using the CAIcal (http://ppuigbo.me/programs/CAIcal/), which required a reference set of known highly expressed genes [22]. A CAI value ranges from 0 to 1.0, with a higher value indicating a stronger codon usage bias as well as a higher degree of translation efficiency. In the present study, CAI analysis of the initiation site along the four DENV serotype coding sequences was calculated using codon usage reference tables for Homo sapiens and Aedes aegypti (Additional file 2: Table S2) from the codon usage database (http://www.kazusa.or.jp/codon/). A sliding window of CAI output along the gene sequence was assigned using a window size of 18 codons and a window step of 5 codons.
Results
Rare codon usage at the 5’ region of DENV mRNA
We calculated the RSCU values to determine the synonymous codon usage pattern in the various initiation sites of the four DENV serotypes coding sequence, the first 25, 50, 75, and 100 codon sites, as well as the entire DENV genome, as shown in Fig. 1 (graphical-based representation) and Additional file 3: Table S3 (calculated RSCU data). The zero RSCU values were caused by unused or absent synonymous codons in those sequences. Among the 59 synonymous codons in the DENV1-4 coding sequence, six were observed to be over-represented: GGA for Gly, CCA for Pro, AGA for Arg, UCA for Ser, ACA for Thr, and GUG for Val, and nine were found to be under-represented: GCG for Ala, GGU for Gly, CCG for Pro, CGA, CGC, CGG, and CGU for Arg, UCG for Serine, and ACG for Thr. The most frequently used codon in DENV1-4 genomes was AGA for Arg. The codon usage of the 5’regions appeared to be different from the usage pattern of the whole genome. The CUG for Leu, AGA for Arg, UCA for Ser, and GUG for Val codons were persistently over-represented in all the 5’ regions with various lengths, whereas many under-represented codons in the whole genomes such as CGC for Arg were more frequently used in the 5’ regions. Some of the over-represented codons in the 5’ regions such as GCG for Ala were rare codons in human codon usage. It was previously proposed that this different codon usage in the initiation region of DENV mRNA may affect translational efficiency and provide some translational control [12, 13].
To understand the role of this codon usage, we calculated CAIs of the whole genomes and the 5’regions. In accordance with previous reports [23], CAIs of DENV full length sequences were found to be relatively low (Additional file 4: Table S4). However, except for slightly lower CAIs at the first 25–75 codons in DENV1 and the first 75 codons of DENV2, the CAIs of the 5’regions were not significantly lower than those of the entire genome. Similarly, when compared to Ades mosquito vector codon usage, the CAIs of the 5’regions were not lower than those of the entire genome (Additional file 5: Table S5). This was unexpected and could be explained by the presence of some optimal codons such as CUG for Leu and GUG for Val in this region, making the CAI no lower than the average of the genome despite the overrepresentation of rare codons.
A putative translational checkpoint was found around the 50th codon
To further investigate how the presence of rare codons in the initiation site of mRNA affects local translation efficiency of DENV in human and Aedes mosquito cells, CAI values of sliding 18-codon fragments were calculated in the first 200 codons across the codon sequence of four DENV serotypes using a reference set of highly expressed genes for Homo sapiens and Aedes aegypti (Additional file 6: Table S6 and Additional file 7: Table S7). As shown in Fig. 2, we found that the local CAI profiles gradually decreased from the start site, then uniformly dropped to the lowest CAI at the 50th -codon position. These results showed that the presence of rare codons at the beginning of DENV1-4 transcripts resulted in low CAI values at the 50th -codon specific position with a boundary of 38 codons, indicating a low level of mRNA translation and thereafter a slowing down of translation speed. The region with a local decline in CAI is proposed as a translational checkpoint with a local slow translation rate. Although there are some other local dips in the CAIs at some other positions, for example a dip at 100th codon for DENV3, the local drop at the 50th is the lowest and the only uniform one among all DENVs.
Synonymous codon usage bias in the translational checkpoint of DENV
To analyze the synonymous codon usage bias pattern at the translational checkpoint of the four DENV serotypes, we calculated the RSCU values for the codon sequences at the translational checkpoint in DENV1-4 (Fig. 2a) as follows: checkpoint at 35–55 codons for DENV1, checkpoint at 40–60 codons for DENV2, checkpoint at 35–55 codons for DENV3, and checkpoint at 30–50 codons for DENV4. As indicated in Fig. 3 and Additional file 8: Table S8 (calculated RSCU data), each serotype of DENV showed the synonymous codon usage bias pattern in the translational checkpoint with notable over-represented codons (RSCU values > 1.6). These included AGA (Arg), UCA (Ser), ACA (Thr), and GUG (Val) for DENV1; UCA (Ser), ACA (Thr), and GUG (Val) for DENV2; AGA (Arg) for DENV3; and UCC (Ser) for DENV4. These codons are most frequently used in their codon usage tables, with the exception of UCC (Ser) for DENV4. However, most of these codons were not found in the codon usage pattern of the first 25 or 50 codon sites (Fig. 1), excluding UCA (Ser) and GUG (Val) for DENV1 and DENV2.
Discussion
The over- and under-represented codons are consistent with the earlier studies [12, 24]. In their genome, all four DENV serotypes prefer A-ending codons, with the exception of GUG for Val. The codon AGA for Arg is the most preferred codon by DENV and other Flaviviridae family viruses, excluding hepatitis C virus [25]. In addition, codons containing CpG dinucleotides are under-represented in agreement with previous reports [24, 26].
The hypothesis proposed that the presence of rare codons at the beginning of mRNA transcripts may contribute to the slowing down of translation in order to maintain optimal protein expression levels, thereby increasing translation efficiency. Our result, on the other hand, suggests that the translation slowing down may not start from the beginning but only involve a narrow checkpoint around the 50th codon, and that the rare codons upstream to this checkpoint may be there for other reasons.
The codon adaptation index (CAI) is the primary usage for determining the efficiency of translation elongation rate in the coding region of all species [20, 27]. The results of CAI analysis along the first 200 codons of DENV coding sequences, given in Fig. 2 shows the local delay of the translational checkpoints in the 50th -codon region with a boundary of 38 codons in all four DENV serotypes. Our findings are supported by the mechanism of intragenic pattern of codon usage of Tuller et al [15].: the slow “ramp” of the presence of rare codons in the first 30–50 codons of the genes is translated with low efficiency and presumably reduces the ribosomal traffic jam, thereby preventing ribosome collisions during translation elongation, which can lead to mRNA degradation [28], and thus improving translation efficiency. Interestingly, the local CAI does not tend to decrease throughout the 50 codons at the 5’end of the transcript, implying that the translational checkpoint might not have to be 50 codons long starting at the 5’ end. This could be due to the difference between viral mechanisms and common host systems. It might be because the virus needs to eliminate constraints in the translation initiation region to enhance the efficiency of protein synthesis. As viruses require efficient viral protein expression, the translational checkpoint may be essential for viral replication and pathogenesis. Interestingly, the checkpoint was found at similar position in both the context of human and mosquito codon usage. This suggests that the checkpoint evolved for the adaptation to both human and mosquito hosts simultaneously. Elimination of the checkpoint may provide a new approach for viral attenuation. As rare codons were also found at the 5’ region of other flaviviruses, the checkpoint mechanism may also be present in other flaviviruses. Whether similar checkpoints are also present in other types of viruses requires further studies.
Altogether, our results suggest that the presence of local low CAI checkpoints around the 50th codon region of DENV sequences may provide a translational regulation. This supports the notion that the translation elongation speed may be regulated by the codon usage pattern.
Limitations
We analyzed a limited number of DENV strains. However, these strains were randomly selected from geographically isolated regions of the world, such as Africa, Asia, Europe, North America, South America, the Caribbean, and the Pacific. Moreover, the complete genome sequences of DENV strains revealed little variation in codon usage and nucleotide composition.
Data Availability
The datasets generated in this study are available in the manuscript and additional files.
Abbreviations
- ACUA:
-
Automated Codon Usage Analysis
- CAI:
-
Codon Adaptation Index
- CDS:
-
Coding sequence
- DENV:
-
Dengue virus
- mRNA:
-
Messenger RNA
- NCBI:
-
Center for Biotechnology Information
- RSCU:
-
Relative Synonymous Codon Usage
- tAI:
-
tRNA Adaptation Index
- tRNA:
-
Transfer RNA
References
Mitra S, Ray SK, Banerjee R. Synonymous codons influencing gene expression in organisms. Rese Rep Biochem. 2016;6:57–65.
Zhou Z, Dang Y, Zhou M, Li L, Yu C-h, Fu J, et al. Codon usage is an important determinant of gene expression levels largely through its effects on transcription. Proc Natl Acad Sci. 2016;113(41):E6117–E25.
Rahman SU, Yao X, Li X, Chen D, Tao S. Analysis of codon usage bias of Crimean-Congo hemorrhagic Fever virus and its adaptation to hosts. Infect Genet Evol. 2018;58:1–16.
Hershberg R, Petrov DA. Selection on codon bias. Annu Rev Genet. 2008;42:287–99.
Yu C-H, Dang Y, Zhou Z, Wu C, Zhao F, Sachs MS, et al. Codon usage influences the local rate of translation elongation to regulate co-translational protein folding. Mol Cell. 2015;59(5):744–54.
Félez-Sánchez M, Trösemeier J-H, Bedhomme S, González-Bravo MI, Kamp C, Bravo IG. Cancer, warts, or asymptomatic Infections: clinical presentation matches codon usage preferences in human papillomaviruses. Genome Biol Evol. 2015;7(8):2117–35.
Schubert AM, Putonti C. Evolution of the sequence composition of Flaviviruses. Infect Genet Evol. 2010;10(1):129–36.
Belalov IS, Lukashev AN. Causes and implications of codon usage bias in RNA viruses. PLoS ONE. 2013;8(2):e56642.
Moriyama E. Codon usage. Papers in Genetics. 2003:4.
Varenne S, Buc J, Lloubes R, Lazdunski C. Translation is a non-uniform process: effect of tRNA availability on the rate of elongation of nascent polypeptide chains. J Mol Biol. 1984;180(3):549–76.
Clarke IVTF, Clark PL. Rare codons cluster. PLoS ONE. 2008;3(10):e3412.
Zhou J-h, Zhang J, Sun D-j, Ma Q, Chen H-t, Ma L-n, et al. The distribution of synonymous codon choice in the translation initiation region of dengue virus. PLoS ONE. 2013;8(10):e77239.
Ma X, Feng Y, Liu J, Chen L, Zhao Y, Guo P, et al. Characteristics of synonymous codon usage bias in the beginning region of West Nile virus. Genet Mol Res. 2014;13(3):7347–55.
Zhou J-h, Su J-h, Chen H-t, Zhang J, Ma L-n, Ding Y-z, et al. Clustering of low usage codons in the translation initiation region of Hepatitis C virus. Infect Genet Evol. 2013;18:8–12.
Tuller T, Carmi A, Vestsigian K, Navon S, Dorfan Y, Zaborske J, et al. An evolutionarily conserved mechanism for controlling the efficiency of protein translation. Cell. 2010;141(2):344–54.
Vetrivel U, Arunkumar V, Dorairaj S. ACUA: a software tool for automated codon usage analysis. Bioinformation. 2007;2(2):62–3.
Butt AM, Nasrullah I, Tong Y. Genome-wide analysis of codon usage and influencing factors in chikungunya viruses. PLoS ONE. 2014;9(3):e90905.
Wong EH, Smith DK, Rabadan R, Peiris M, Poon LL. Codon usage bias and the evolution of Influenza A viruses. Codon usage biases of Influenza Virus. BMC Evol Biol. 2010;10(1):1–14.
Khandia R, Singhal S, Kumar U, Ansari A, Tiwari R, Dhama K, et al. Analysis of Nipah virus codon usage and adaptation to hosts. Front Microbiol. 2019;10:886.
Xia X. An improved implementation of codon adaptation index. Evolutionary Bioinf. 2007;3:53–8.
Gingold H, Pilpel Y. Determinants of translation efficiency and accuracy. Mol Syst Biol. 2011;7(1):481.
Puigbò P, Bravo IG, Garcia-Vallve S. CAIcal: a combined set of tools to assess codon usage adaptation. Biol Direct. 2008;3(1):1–8.
dos Passos Cunha M, Ortiz-Baez AS, de Melo Freire CC, de Andrade Zanotto PM. Codon adaptation biases among sylvatic and urban genotypes of Dengue virus type 2. Infect Genet Evol. 2018;64:207–11.
Ma J, Feng F, Zhang J, Zhou JH, Ma LN, Ding Y-z, et al. Analysis of synonymous Codon usage in Dengue viruses. J Anim Vet Adv. 2013;12:88–98.
Moosavi F, Mohabatkar H, Mohsenzadeh S. Analysis of synonymous codon usage bias and nucleotide and amino acid composition in 13 species of Flaviviridae. J Cell Mol Res. 2011;3(1):1–11.
Simmonds P, Xia W, Baillie JK, McKinnon K. Modelling mutational and selection pressures on dinucleotides in eukaryotic phyla–selection against CpG and UpA in cytoplasmically expressed RNA and in RNA viruses. BMC Genomics. 2013;14:1–16.
Plotkin JB, Kudla G. Synonymous but not the same: the causes and consequences of codon bias. Nat Rev Genet. 2011;12(1):32–42.
Simms CL, Yan LL, Zaher HS. Ribosome collision is critical for Quality Control during No-Go Decay. Mol Cell. 2017;68(2):361–373e5.
Acknowledgements
This study was supported by the Chair Professor Program (P-20-52262), the National Science and Technology Development Agency (NSTDA), Thailand, and partly supported by the Faculty of Medicine Siriraj Hospital, Mahidol University, Thailand.
Funding
PA was supported by the Chair Professor Program (P-20-52262), the National Science and Technology Development Agency (NSTDA), Thailand (https://www.nstda.or.th/en/). The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
13104_2023_6615_MOESM3_ESM.xlsx
Additional file 3: Table S3. RSCU values of DENV1-4 in the CDS of the entire genome and their various initiation sites; the first 25, 50, 75, and 100 codons and the host
13104_2023_6615_MOESM4_ESM.xlsx
Additional file 4: Table S4. The CAI data of DENV1-4 in the CDS of the entire genome and their various initiation sites; the first 25, 50, 75, and 100 codons using codon usage table of Homo sapiens as a reference set
13104_2023_6615_MOESM5_ESM.pdf
Additional file 5: Table S5. The CAI data of DENV1-4 in the CDS of the entire genome and their various initiation sites; the first 25, 50, 75, and 100 codons using codon usage table of Aedes aegypti as a reference set
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Yimyaem, M., Jitobaom, K. & Auewarakul, P. A small stretch of poor codon usage at the beginning of dengue virus open reading frame may act as a translational checkpoint. BMC Res Notes 16, 359 (2023). https://doi.org/10.1186/s13104-023-06615-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13104-023-06615-5