- Short Report
- Open Access
A unique genetic code change in the mitochondrial genome of the parasitic nematode Radopholus similis
BMC Research Notesvolume 2, Article number: 192 (2009)
Mitochondria (mt) contain their own autonomously replicating DNA, constituted as a small circular genome encoding essential subunits of the respiratory chain. Mt DNA is characterized by a genetic code which differs from the standard one. Interestingly, the mt genome of nematodes share some peculiar features, such as small transfer RNAs, truncated ribosomal RNAs and - in the class of Chromadorean nematodes - unidirectional transcription.
We present the complete mt genomic sequence (16,791 bp) of the plant-parasitic nematode Radopholus similis (class Chromadorea). Although it has a gene content similar to most other nematodes, many idiosyncrasies characterize the extremely AT-rich mt genome of R. similis (85.4% AT). The secondary structure of the large (16S) rRNA is further reduced, the gene order is unique, the large non-coding region contains two large repeats, and most interestingly, the UAA codon is reassigned from translation termination to tyrosine. In addition, 7 out of 12 protein-coding genes lack a canonical stop codon and analysis of transcriptional data showed the absence of polyadenylation. Northern blot analysis confirmed that only one strand is transcribed and processed. Furthermore, using nucleotide content bias methods, regions for the origin of replication are suggested.
The extraordinary mt genome of R. similis with its unique genetic code appears to contain exceptional features correlated to DNA decoding. Therefore the genome may provide an incentive to further elucidate these barely understood processes in nematodes. This comprehension may eventually lead to parasitic nematode-specific control targets as healthy mitochondria are imperative for organism survival. In addition, the presented genome is an interesting exceptional event in genetic code evolution.
Nematodes are one of the largest phyla of multicellular animals on earth with over 20,000 described species. The burrowing nematode Radopholus similis infects numerous (sub)tropical crops and is considered as one of the most damaging pests on banana. Recently transcriptomic sequence data of this parasite were studied and several 'expressed sequence tags' (ESTs) originated from genes of the mitochondrial (mt) genome . Mitochondria are found in all eukaryotic cells and provide the cell with energy through the process of oxidative phosphorylation. Originating from an ancestral endosymbiotic α-proteobacterial species , they still contain a haploid, autonomously replicating genome of relatively short length, in nematodes ranging from 12.5 kb to 26 kb . To date, complete mt genomes of 31 nematode species are available in GenBank: 9 of the Enoplean class and 22 (mostly comprising animal-parasitic nematodes) of the Chromadorean class. The mt gene products are usually 2 ribosomal RNAs, 22 transfer RNAs and 12-13 intronless protein-coding genes which encode crucial subunits in respiratory complexes I, III, IV and V. Mt genomes of nematodes differ in some aspects from other metazoan mt genomes. The compact nematode mt genomes usually lack ATPase subunit 8, and contain shortened rRNA molecules and truncated tRNAs. Nearly all nematode mt tRNAs lack one arm, either the TΨC arm which is replaced by the 'TV-replacement loop', or the DHU arm which is replaced by D-replacement loops [3, 4]. In addition, in nematodes of the class Chromadorea (containing most important parasitic nematodes), all mt genes are unidirectionally transcribed from one strand.
Results and discussion
The complete mt genome of R. similis was amplified in three overlapping fragments, with the complete assembly being 16,791 bp [EMBL:FN313571] (figure 1), which was confirmed by southern blot (additional file 1). With an AT-content of 85.4%, it is the most AT-rich nematode mt genome sequenced to date, and the first complete mt genome of a Chromadorean plant-parasite (hereby disregarding the highly atypical multipartite mitochondrial genome of Globodera sp. ). All genes are unidirectionally transcribed from the coding strand, which has an asymmetrical nucleotide composition of 52.9% T, 32.5% A, 10.2% G and 4.4% C and is also referred to as the heavy strand. A quarter of the genome is non-coding, comprising two large repeat regions. Similar to other described nematode mt genomes, 22 tRNAs have been predicted ranging from 51 nt to 59 nt in length (additional file 2). All anticodons are conserved in nematodes, except for the UCG anticodon of tRNAArg, which is common in other metazoans, but deviates from the ACG anticodon used by most Chromadorean nematode mt genomes (table 1). Contrary to other nematodes, 2 nucleotides occur instead of one between the DHU and the anticodon arm in some R. similis tRNAs. In those tRNA species, the DHU arm is on both sides bordered by uracil (occurring in 8 tRNAs; additional file 2).
Another characteristic feature is the occurrence of three instead of two nucleotides between the amino-acyl acceptor stem and the DHU arm (occurring in 10 tRNAs). Two rRNA genes (12S and 16S) were identified on the mt genome. The 12S rRNA gene (or rrnS) is 692 bp long, comparable to other nematode mitochondrial 12S rRNA genes (698 ± 33 bp) (additional file 3). The boundaries of this gene were validated through a CT-RT-PCR technique (additional file 4). The secondary structure of this rRNA revealed similar topology as other published nematode 12S rRNA structures [6–9]. In contrast, the large subunit rRNA (16S rRNA or rrnL) is 840 bp in length, which is considerably smaller than the average of 943 ± 57 bp for other nematodes. The reductions in the overall comparable structure compared to other nematodes are indicated by arrows in figure 2. Most conserved nucleotides in nematode mt rRNAs are generally found in loops and not in stems (figure 2 and additional file 3), which is in agreement with observations that interacting proteins often bind to loops . Many nucleotides involved in the P site and A site of the ribosome are conserved; however, conserved residues in the G3 stem, which has been implicated as part of the exit site, are absent .
The mt genome contains 12 protein-coding genes (PCGs), lacking - as the majority of nematode mt genomes - the ATPase subunit 8 gene. To our surprise, we discovered that the UAA codon, which encodes translation termination according to the standard invertebrate mitochondrial genetic code, was abundant in all open reading frames of the 12 PCGs. Based on alignments with other nematode mt proteins, UAA encodes for tyrosine in the R. similis mt genome . The UAAStop to UAATyr reassignment was confirmed for another Radopholus species (R. arabocoffeae) based on a sequenced mtDNA fragment (additional file 5). This genetic code change has never been confirmed in any other animal genus. Though a similar codon reassignment was once reported for the planarian Dugesia japonica , subsequent studies could not confirm this finding . Thus until now, the UAAStop to UAATyr reassignment is unique for the mt genome of Radopholus. The sole Stop codon in use, UAG, could only be inferred for 5 PCGs: nad4L, cox3, nad1, atp6 and nad5. Of the remaining genes, nad6 is separated from the subsequent nad4L by a single thymine. This is often called a 'truncated stop', as it can form a complete canonical UAA stop codon after polyadenylation . Intriguingly, this could not be the case for Radopholus due to the codon reassignment. Absence of stop codons is a very uncommon feature in mitochondrial genes. However, it has been shown that mitochondria of humans, plants and dinoflagellates are capable of translating templates lacking Stop codons [15–17]. The inferred start codons are UUG (2), AUA (5), AUG, AUU (2) and UUA (2), all of which have been reported before in nematode mt genomes. Northern blot using 3 different probes specifically hybridizing to cytB (no stop codon), nad5 (UAG stop codon) and cox1 (no stop codon) revealed that the transcripts of these genes are processed into monocistronic transcripts, although the length appears larger than expected (1.4 kb instead of 1.0, 2.2 instead of 1.5, 2.2 instead of 1.6, for cytB, nad5 and cox1, respectively; see figure 3). This observation can not be explained easily, but it may point to a modification of mRNA molecules that affects their migration in denaturing agarose gels. Probing with the corresponding sense probes revealed no signals (data not shown). We conclude that exclusively the heavy strand is transcribed, eventually producing monocistronic transcripts lacking stop codons as a template for protein synthesis. Investigating EST data, 623 ESTs were aligned to the mt genome. The ESTs cover 7,971 bp (47.5%) of the mt genome. Remarkably, the majority (n = 581) is derived from 12S rRNA, pointing to a higher expression level and/or higher stability compared to other mt genes. None of the mt ESTs contain a poly(A) tail and - in remarkable contrast to the northern blot results - large parts of the mt genome are contiguously covered by EST data, crossing gene borders (figure 1). Only one EST cluster aligned to a gene boundary by starting exactly at the 5' end of tRNAGlu (additional file 6). Attempts to ascertain PCG boundaries using CT-RT-PCR failed. Besides some indels in poly(T) tracts, barely any sequence difference between the mt DNA and ESTs were present - except for 12S rRNA. Three different EST clusters align to this gene, containing 549, 18 and 2 ESTs respectively, with differences of 0, 2.4 and 7.5% respectively. Some transcriptomic data match parts of the non-coding region, often called the control or AT-region in nematode mt genomes. Based on sequence features, the control region in the mt genome of R. similis was artificially divided into five sub-regions (figure 1 and 4). Characteristic for the first part are poly-thymine and poly-adenine stretches, resulting in a high AT content of 87.6%. Some of the poly-T stretches are separated by a single G nucleotide, a motif which is found in several other nematode control regions [4, 18, 19]. A remarkably large and nearly perfect stem loop of 126 bp ends the first region. The second region of 460 bp runs until the small repeat region. This region is enriched in GC nucleotides (81.2% AT), which are concentrated in a 225 bp region and can adapt a stable secondary structure (figure 4A). The third region (84.0% AT) contains a repeat region of 806 bp in which a 26 bp motif is repeated 31 times. The fourth region is an AT-rich sequence part (87.5% AT) of 944 bp with unique features such as a roughly equal G- and C-content and a remarkable stretch of 6 C nucleotides. The sequence surrounding this sextuple C-motif can fold into stable secondary structures (figure 4C). The control region ends with another repeat region of 1,208 bp, containing a 302 bp sequence directly repeated 4 times. This repeat region partly overlaps with the adjacent tRNASerAGN. In contrast to Caenorhabditis elegans and Ascaris suum, no runs of AT dinucleotides are found in the control region . Fourteen other smaller non-coding regions were found interspersed in between the genes. Although most are small (ranging from 1 to 15 bp), one is 65 bp in length and located between tRNAMet and tRNAThr. This region is solely composed of AT nucleotides and can form a secondary structure (figure 4D).
Mt DNA replication is an asymmetrical process, in which the heavy strand replication initiation precedes that of the light strand. Hence some sequence parts of mt genomes spend more time single stranded, a state in which the DNA is more prone to deaminations of cytosine to uracil, and to a lesser extent of adenine to hypoxanthine . The subsequent accumulation of T and G (at the expense of C and A, respectively) in these regions can provide information about the origins of replication . In this way, the light strand origin of replication is predicted to be located in secondary structures surrounding the sextuple C motif (figure 1 and 5). Analysis of the relative amount of T and G versus A and C, on complete sequence as well as on nucleotides of the third codon positions, showed a pattern that could only be explained by assuming two different origins of replication . Based on these observations, we hypothesize that the origin of the heavy strand is located in the cox2 gene (figure 5). Although for nematodes nothing is known on this issue, our data suggest a similar mode of action for mt replication as other Metazoa, with origins of replications correlated to secondary structures in the DNA. The predicted OriL is located at the region harboring the sextuple C motif and surrounding secondary structures, and the predicted OriH lies in the vicinity of two tRNA genes, which could function as replication origins as observed in Vertebrates . However, no clear similarity between the secondary structures reported for other nematodes (see for example ) and those of R. similis was found.
DNA extraction and LD-PCR
R. similis was cultured as decribed in . On 500 ng of total phenol/chloroform extracted DNA from approximately 10,000 nematodes, LD-PCR was performed using a combination of Expand Long Range dNTP Pack (Roche, Mannheim, Germany) and Phusion High-Fidelity DNA Polymerase (Finnzymes, Espoo, Finland) with gene specific primers based on EST sequences  (see additional file 6). PCRs were done following the manufacturer's instructions in 50 μL reaction mixtures, containing 6% DMSO. All fragments were directly sequenced.
Extraction of mitochondria
Approximately 400 μL packed nematodes (approximately 200,000 nematodes) were washed with sterile demineralized water, and brought into 1 mL sucrose buffer (300 mM sucrose, 30 mM Tris, and 10 mM EDTA, pH7.9). With the aid of a Teflon pestle and some sand, a nematode homogenate was obtained and centrifuged for 5 min at 500 g. The supernatant was further purified by additional centrifugation for 5 min at 1000 g. A pellet of mitochondria was obtained by centrifuging the supernatant at 15,000 g for 45 min. Obtained mitochondria were used immediately.
Mitochondrial RNA extraction
The mitochondria were dissolved in 200 μL Trireagent (Sigma, St. Louis, MO, USA), and incubated for 1 h at RT. The solution was three times sonicated at the lowest setting for 2 sec while keeping on ice (Branson Sonifier 250, Danbury, USA). After incubation for 15 min at room temperature, 40 μL chloroform was added and the tube was vigorously shaken by hand. Further RNA extraction steps were performed following the manufacturer's instructions.
Mt RNA was circularized using T4 RNA ligase (Fermentas, St. Leon-Rot, Germany), using 1 μg mt RNA in a total volume of 20 μL, following the manufacturer's instructions. After incubation for 1 h at 37°C, the volume was adjusted to 200 μL with demineralized water, and RNA purified by standard phenol/chloroform extraction. After resuspension in 20 μL demineralized water, the circularized RNA was used for the reverse transcriptase reaction using SuperScript II (Invitrogen, Carlsbad, CA, USA) following manufacturer's instructions and a reverse-oriented gene-specific primer located at the 5' end of the gene (see additional file 4 and 7). The resulting cDNA was subjected to nested PCR using gene-specific primers. PCR products were ligated into the pGEM-T vector (Promega, Wisconsin, USA) and transformed into E. coli DH5α cells. Positive clones were selected on LB plates supplemented with carbenicilin (100 μg/ml) and inserts were sequenced.
Mt RNA was separated on a 1× MOPS 1% formaldehyde agarose gel and blotted (downward capillary) onto Hybond-N+ membrane (Amersham, Uppsala, Sweden). RNA probes were generated from plasmids containing fragments of cox1, cytB and nad5 as template (see additional file 7 for primers) using Riboprobe kit (Promega) in presence of 32P-labeled nucleotides. Hybridization was done overnight at 56°C, and after washing, signals were visualized using a fluor imager FLA-5100 (Fujifilm, Tokyo, Japan).
1.2 μg total DNA was digested with SpeI and XmnI restriction enzymes, and separated by inversed field electrophoresis in 0.5 × TBE 1% agarose gel. After capillary blot, hybridization was performed using the High Prime DIG labeling Starter Kit (Roche) following manufacturer's instructions with a probe constructed using primers RsNADHD_2 and RsCOXI_7 (see additional file 7) covering 3,504 bp of the mitochondrial genome. The chemiluminescent signal was detected on a fluor imager FLA-5100 using standard settings (Fujifilm).
Annotation and nucleotide composition analysis
Combinations of BLAST searches and ClustalW alignments to other nematode mt genes (collected using MitoBank2.1 ) were used to detect protein gene and rRNA sequences. Folding of rRNA sequences based on published structures was done with the aid of Mfold , and structures were edited with RNAviz . Predictions of the tRNA genes by tRNAScan-SE , set to detect mt nematode tRNAs, were confirmed by aligning to other nematode mt tRNAs using LARA . Analysis of nucleotide composition was performed with in-house perl programs. Based on Grigoriev to estimate the OriL , GC-skew values (G-C/G+C) were calculated for each 5th position in the genome using a window of 300 nt and the cumulative values plotted. The graph followed a second order polynomial function (R2 > 0.999), except for an offset starting around position 11,100, which points to OriL. Due to deaminations occurring during on single-stranded states caused by replication, T/C and G/A ratios increase along mt genomes. Based on a sliding window of 244 nt (depending on C content), the T/C, G/A and T+G/C+A was calculated for every 10th position. The cumulative values followed a linear function (R2>0.999). The relative difference with this function is plotted, where 1 corresponds to the mean increase over the genome. Values higher than one represent a higher than average amount of T, G, or T+G compared to C, A, or C+A respectively. In addition, for each gene, the different ratios were determined based on nucleotides on the third codon positions. In this way, T/C could not be calculated for 7 genes (cox2, cytB, nad2, nad4, nad4L and nad6) due to the lack of C. Therefore, only T+G/C+A and G/A is plotted in figure 5D.
Jacob J, Mitreva M, Vanholme B, Gheysen G: Exploring the transcriptome of the burrowing nematode Radopholus similis. Molecular Genetics and Genomics. 2008, 280: 1-17. 10.1007/s00438-008-0340-7.
Cavalier-Smith T: Origin of mitochondria by intracellular enslavement of a photosynthetic purple bacterium. Proceedings of the Royal Society B. 2006, 273: 1943-1952. 10.1098/rspb.2006.3531.
Hu M, Gasser RB: Mitochondrial genomes of parasitic nematodes - progress and perspectives. Trends in Parasitology. 2006, 22: 78-84. 10.1016/j.pt.2005.12.003.
Montiel R, Lucena M, Medeiros J, Simões N: The complete mitochondrial genome of the entomopathogenic nematode Steinernema carpocapsae: insights into nematode mitochondrial DNA evolution and phylogeny. Journal of Molecular Evolution. 2006, 62: 211-225. 10.1007/s00239-005-0072-9.
Gibson T, Blok V, Dowton M: Sequence and characterization of six mitochondrial subgenomes from Globodera rostochiensis: multipartite structure is conserved among close Nematode relatives. Journal of Molecular Evolution. 2007, 65: 308-315. 10.1007/s00239-007-9007-y.
Hu M, Chilton NB, Gasser RB: The mitochondrial genomes of the human hookworms, Ancylostoma duodenale and Necator americanus (Nematoda: Secernentea). International Journal for Parasitology. 2002, 32: 145-158. 10.1016/S0020-7519(01)00316-2.
Jex AR, Hu M, Littlewood DT, Waeschenbach A, Gasser RB: Using 454 technology for long-PCR based sequencing of the complete mitochondrial genome from single Haemonchus contortus (Nematoda). BMC Genomics. 2008, 9: 11-10.1186/1471-2164-9-11.
Keddie EM, Higazi T, Unnasch TR: The mitochondrial genome of Onchocerca volvulus: Sequence, structure and phylogenetic analysis. Mol Biochem Parasitol. 1998, 95: 111-127. 10.1016/S0166-6851(98)00102-9.
Lavrov DV, Brown WM: Trichinella spiralis mtDNA: a nematode mitochondrial genome that encodes a putative ATP8 and normally structured tRNAs and has a gene arrangement relatable to those of Coelomate metazoans. Genetics. 2001, 157: 621-637.
Wimberly BT, Brodersen DE, Clemons WM, Morgan-Warren RJ, Carter AP, Vonrhein C, Hartsch T, Ramakrishnan V: Structure of the 30S ribosomal subunit. Nature. 2000, 407: 327-339. 10.1038/35030006.
Abascal F, Zardoya R, Posada D: GenDecoder: genetic code prediction for metazoan mitochondria. Nucl Acids Res. 2006, 34: W389-W393. 10.1093/nar/gkl044.
Bessho Y, Ohama T, Osawa S: Planarian mitochondria II. The unique genetic code as deduced from cytochrome c oxidase subunit I gene sequences. Journal of Molecular Evolution. 1992, 34: 331-335. 10.1007/BF00160240.
Jeon HK, Kim KH, Eom KS: Complete sequence of the mitochondrial genome of Taenia saginata: Comparison with T. solium and T. asiatica. Parasitology International. 2007, 56: 243-246. 10.1016/j.parint.2007.04.001.
Bobrowicz AJ, Lightowlers RN, Chrzanowska-Lightowlers Z: Polyadenylation and degradation of mRNA in mammalian mitochondria: a missing link?. Biochem Soc Trans. 2008, 36: 519-10.1042/BST0360517.
Chrzanowska-Lightowlers ZMA, Temperley RJ, Smith PM, Seneca SH, Lightowlers RN: Functional polypeptides can be synthesized from human mitochondrial transcripts lacking termination codons. Biochem J. 2004, 377: 725-731. 10.1042/BJ20031556.
Jackson C, Norman J, Schnare M, Gray M, Keeling P, Waller R: Broad genomic and transcriptional analysis reveals a highly derived genome in dinoflagellate mitochondria. BMC Biology. 2007, 5: 41-10.1186/1741-7007-5-41.
Raczynska KD, Le Ret M, Rurek M, Bonnard G, Augustyniak H, Gualberto J: Plant mitochondrial genes can be expressed from mRNAs lacking stop codons. FEBS Letters. 2006, 580: 5641-5646. 10.1016/j.febslet.2006.09.010.
Li MW, Lin RQ, Song HQ, Wu XY, Zhu XQ: The complete mitochondrial genomes for three Toxocara species of human and animal health significance. BMC Genomics. 2008, 9: 224-10.1186/1471-2164-9-224.
Okimoto R, Chamberlin HM, Macfarlane JL, Wolstenholme DR: Repeated sequence sets in mitochondrial DNA molecules of root knot nematodes (Meloidogyne): nucleotide sequences, genome location and potential for host-race identification. Nucleic Acids Res. 1991, 19: 1619-1626. 10.1093/nar/19.7.1619.
Okimoto R, Macfarlane JL, Wolstenholme DR: The mitochondrial ribosomal RNA genes of the nematodes Caenorhabditis elegans and Ascaris suum: consensus secondary-structure models and conserved nucleotide sets for phylogenetic analysis. Journal of Molecular Evolution. 1994, 39: 598-613. 10.1007/BF00160405.
Krishnan NM, Seligmann H, Raina SZ, Pollock DD: Detecting gradients of asymmetry in site-specific substitutions in mitochondrial genomes. DNA and Cell Biology. 2004, 23: 707-714. 10.1089/dna.2004.23.707.
Grigoriev A: Analyzing genomes with cumulative skew diagrams. Nucleic Acids Res. 1998, 26: 2286-2290. 10.1093/nar/26.10.2286.
Scarpulla RC: Transcriptional paradigms in mammalian mitochondrial biogenesis and function. Physiol Rev. 2008, 88: 611-638. 10.1152/physrev.00025.2007.
Abascal F, Posada D, Zardoya R: MtArt: a new model of amino acid replacement for Arthropoda. Mol Biol Evol. 2007, 24: 1-5. 10.1093/molbev/msl136.
Zuker M: Mfold web server for nucleic acid folding and hybridization prediction. Nucl Acids Res. 2003, 31: 3406-3415. 10.1093/nar/gkg595.
De Rijk P, De Wachter R: RnaViz, a program for the visualisation of RNA secondary structure. Nucleic Acids Res. 1997, 25: 4679-4684. 10.1093/nar/25.22.4679.
Lowe TM, Eddy SR: tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucl Acids Res. 1997, 25: 955-964. 10.1093/nar/25.5.955.
Bauer M, Klau G, Reinert K: Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization. BMC Bioinformatics. 2007, 8: 271-10.1186/1471-2105-8-271.
De Rijk P, Robbrecht E, de Hoog S, Caers A, Peer Van de Y, De Wachter R: Database on the structure of large subunit ribosomal RNA. Nucleic Acids Res. 1999, 27: 174-178. 10.1093/nar/27.1.174.
Noller H, Asire M, Barta A, Douthwaite S, Goldstein T, Gutell RR, Moazed D, Normanly J, Prince JB, Stern S, et al: Studies on the structure and function of ribosomal RNA. Structure, function and Genetics of Ribosomes. 1986, New York: Springer-Verlag, 143-163.
JJ has a grant of Ghent University (BOF).
The authors declare that they have no competing interests.
JJ carried out the experiments and drafted the manuscript. BV, TVL and LG participated in discussion of the results, design and drafting of the manuscript. All authors read and approved the final manuscript.