- Research note
- Open Access
Mitochondrial genome variation of Atlantic cod
BMC Research Notesvolume 11, Article number: 397 (2018)
The objective of this study was to analyse intraspecific sequence variation of Atlantic cod mitochondrial DNA, based on a comprehensive collection of completely sequenced mitochondrial genomes.
We determined the complete mitochondrial DNA sequence of 124 cod specimens from the eastern and western part of the species’ distribution range in the North Atlantic Ocean. All specimens harboured a unique mitochondrial DNA haplotype. Nine hundred and fifty-two polymorphic sites were identified, including 109 non-synonymous sites within protein coding regions. Eighteen variable sites were identified as indels, exclusively distributed in structural RNA genes and non-coding regions. Phylogeographic analyses based on 156 available cod mitochondrial genomes did not reveal a clear structure. There was a lack of mitochondrial genetic differentiation between two ecotypes of cod in the eastern North Atlantic, but eastern and western cod were differentiated and mitochondrial genome diversity was higher in the eastern than the western Atlantic, suggesting deviating population histories. The geographic distribution of mitochondrial genome variation seems to be governed by demographic processes and gene flow among ecotypes that are otherwise characterized by localized genomic divergence associated with chromosomal inversions.
The Atlantic cod (Gadus morhua) is one of the most important species for fisheries in the North Atlantic Ocean , and recently the nuclear genome was reported [2, 3]. The mitochondrial genome (mitogenome) is considered the second genome of the cell, and its gene content is conserved among most vertebrates . The 16.7 kb Atlantic cod mitogenome encodes the standard set of 13 hydrophobic membrane proteins, 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs), as well as peptides and long non-coding RNAs, and is organized similarly to that of humans [5,6,7].
On average, the Atlantic cod mitogenome evolves about 14 times more rapidly at the nucleotide level than the nuclear genome , and mitochondrial sequence variation in cod was previously used to trace population structures and patterns of mitogenome evolution [8,9,10,11,12]. Árnason  investigated sequence variants of a 250-bp cytochrome b (CytB) gene fragment in 1278 Atlantic cod specimens throughout the distribution range, and identified trans-Atlantic haplotype clines with more diversity in northeastern and mid-Atlantic cod as compared to northwestern cod. Carr et al. [10, 12] reported on mitogenome variation based on 32 cod specimens and identified 298 single nucleotide polymorphic (SNP) sites. They found similar diversities in northwest and northeast Atlantic cod, but their sample from the northeast Atlantic consisted of six specimens only . In the present study, we sequenced the complete mitogenome of 124 individuals, generating a mitochondrial sequence resource for future studies of Atlantic cod. We analysed relationships among the 156 cod mitogenomes currently available and compared mitogenome variation of the offshore migratory and stationary coastal cod ecotypes , both from the northeast Atlantic, and cod from the northwest Atlantic.
Tissue samples, nucleic acid extraction, PCR amplification, and plasmid cloning
Atlantic cod tissue samples were collected from the western (off Nova Scotia and Newfoundland) and eastern parts (off the British Isles, in the Baltic Sea, Irish Sea, North Sea, along the Norwegian coast and fjords, and in the Barents Sea) of the North Atlantic Ocean (Additional file 1: Table S1). DNA was extracted from fresh muscle tissue or ethanol preserved tissue (stored at − 20 °C) using the High Pure PCR Template Preparation kit (Roche) or the MasterPure™ Complete DNA and RNA Purification Kit (Epicentre®) according to the manufacturer’s protocols. Complete mitogenomes were PCR amplified in five overlapping fragments of approximately 4–4.5 kb in size using LaTaq polymerase (TAKARA BIO INC). The PCR products were purified using USB® ExoSAP-IT® reagent (Affymetrix). Agarose gel electrophoresis and gel extraction using Invitrogen™ PureLink® Quick Gel Extraction Kit or Invitrogen™ PureLink® PCR Purification Kit were performed according to the manufacturer’s protocols. PCR and sequencing primers used in this study have been described previously . Plasmid cloning of the control region (CR) was performed by using Invitrogen™ TOPO® TA Cloning® Kit with One Shot® TOP10 E. coli competent cells. Positive clones were cultivated and plasmid DNA was purified using Invitrogen™ PureLink® Quick Plasmid Miniprep Kit.
Mitogenome sequencing and data analysis
The complete mitogenome sequences of 124 Atlantic cod specimens were determined, using Sanger, Illumina, and Roche 454 technologies (117, six, and one specimen, respectively). The latter, based on pyrosequencing, was reported previously . The Illumina GAII sequencing was performed according to protocols in  using 2 × 108 bp paired end reads, library inserts of 550–575 bp, and 3.1–6.6 times (average 4.8 times) whole genome coverage (Norwegian Sequencing Centre—Oslo, Norway). Ninety-five mitogenomes were determined by Sanger sequencing provided by Eurofins MWG Operon (Germany). Two Sanger sequenced mitogenomes (NF1 and NC3) have been reported previously [8, 15]. The 20 remaining mitogenomes were sequenced in-house by Sanger technology directly on purified PCR products or plasmid DNA using the BigDye kit (Applied Biosystems). The complete 16,696-bp NC3 Atlantic cod mitogenome (HG514359) was used as a reference for assembly and mapping of mitogenome sequences and reads. Computer analyses of Sanger-generated mitogenomes were performed using DNASTAR® Lasergene software. For mitogenome sequences generated by Roche 454 and Illumina platforms, reference mappings were performed on the CLC Genomics Workbench (QIAGEN®).
A total of 156 available mitogenomes were used to calculate population genetic parameters and reconstruct molecular relationships among Atlantic cod specimens. The CR, tRNA-Phe, and half of the tRNA-Pro sequence were excluded from these comparisons, as these sequences were not available for the 32 specimens previously reported . Population genetic statistics and measures of genetic differentiation were estimated for the following three subsets of specimens, defined by their geographic origin and ecotype: cod from the northwest Atlantic (NW; N = 32), cod from the north east Atlantic of the coastal stationary ecotype (NC; N = 25), and Arctic cod from the Barents Sea of the migratory ecotype (NA; N = 97) (Additional file 1: Table S1). Two specimens from the Baltic Sea were excluded from these analyses, since differentiation from NC due to vicariance is likely. Nucleotide sequence alignments were generated using T-coffee v/9 software  with manual refinements. The tree-building method of maximum likelihood (ML) in MEGA version 6  was used to reconstruct molecular relationships. The ML trees were built from best-fit models of nucleotide evolution generated by MEGA6 [Bayesian information criterion calculations resulted in TN93+I+G as best-fit model]. The topologies of the ML trees were evaluated by bootstrap analyses (2000 replications). We analysed nucleotide diversity indices, Tajima’s D statistic, and genetic differentiation indices FST and Da (the average number of net nucleotide substitutions), as implemented in the DnaSP version 6 software .
Sequence variation among Atlantic cod mitogenomes
Complete mitogenome sequences of approximately 16.7 kb were obtained for 124 cod specimens sampled in the western and eastern parts of the North Atlantic Ocean. Mitochondrial sequence variation was initially assessed by considering nucleotide variants of the CytB gene fragment (250 bp) previously reported for 1278 Atlantic cod specimens throughout the species’ range . Eleven haplotypes from that study, including all main haplotypes (A, C, D, E and G) were identified among the 124 specimens, as well as 12 other singleton haplotypes (Additional file 2: Table S2).
The 124 mitogenome sequences were unambiguously aligned using the Norwegian costal NC3 (HG514359)  as an Atlantic cod reference, resulting in an alignment of 16,551 positions. The total number of polymorphic sites was 952 (5.7% of mitogenome positions), and these were distributed across the two rRNA genes, all 13 protein coding genes, 18 of the 22 tRNA genes, and major non-coding regions (TP-spacer and CR) (Fig. 1). Only 18 variable sites (1.9%) were identified as indels (6 in structural RNA genes and 12 in non-coding regions). Protein coding genes contained 756 (79.4%) substitutions, of which 109 were non-synonymous (14.4%), resulting in amino acid changes in all 13 proteins (Additional file 3: Table S3).
Key features of mitogenome sequence variation are summarized in Additional file 4: Table S4, and several features are noted. Protein coding genes have 2.3 times more SNPs per nucleotide position than RNA genes. This reflect high sequence conservation in RNA genes at the species level. An interesting observation is that variable sites in the mitochondrial small subunit rRNA gene are mainly clustered within the 3′M structural domain (Additional file 5: Figure S1). A similar feature was not noted in the mitochondrial large subunit rRNA (Additional file 6: Figure S2). The cytochrome oxidase (CO) genes and proteins (Complex IV) are generally more conserved than the NADH dehydrogenase (ND) genes and proteins (Complex I). Here, the ND2 gene contains 7.2 times more polymporphic sites per position than, e.g. COII or the RNA genes. The ND2, ND4, ND5, and CytB genes contain the highest density of polymorphic sites.
Mitogenome diversity, intraspecific relationships and genetic differentiation
The alignment of 156 available cod mitogenomes resulted in 15,592 common sites following the exclusion of alignment gaps. There were 1002 polymorphic sites, and 1034 substitutions in total among the mitogenome sequences. The nucleotide diversity index was slightly lower for cod in the northwest Atlantic (0.203%) compared to stationary and migratory cod in the northeast Atlantic (0.291 and 0.285%, respectively; Table 1). Tajima’s D statistic was significantly negative for all population subsets, rejecting the null hypothesis of stable populations with no selection. A representative maximum likelihood (ML) tree is shown in Fig. 2. Twelve clades were supported in > 80% of bootstrap replications, but there was not a clear geographic structuring of clades. However, while some clades were dominated by NA and NC cod individuals, others harboured mainly NW cod (Fig. 2). Measures of pairwise genetic differentiation were negative (interpreted as nil) between NA and NC cod in the northeast Atlantic, while NW cod were differentiated from both NC and NA cod (FST of approximately 0.06 and 0.09, respectively; Additional file 7: Table S5).
Here we provide a comprehensive SNP map of the Atlantic cod mitochondrial genome based on 124 completely sequenced mtDNAs. The 952 variable sites identified among 124 specimens were not evenly distributed throughout the mitogenome. Structural RNA genes have a significantly lower density of overall SNPs per site and variable sites per position compared to protein coding genes. Furthermore, the ND2 gene and the COII gene were the least and most conserved, respectively, among the protein coding genes. This feature was also observed at the protein level. Thus, the Atlantic cod mitogenome follow a similar pattern of conservation as seen for, e.g. zebrafish  or humans [20, 21].
One hundred and twenty-four cod individuals harboured substantial sequence variation in their mitogenomes, including 349 phylogenetically informative parsimony sites. Phylogenetic analysis of 156 available mitogenomes identified ten haplotype clusters supported by high bootstrap values, but little phylogeographic structuring. Mitogenome evolution in cod seems to be nearly neutral [8, 12], suggesting that the significantly negative Tajima’s D statistic mainly signifies recent demographic change, rather than selection. The differentiation of certain cod populations into so-called ecotypes defined by migratory and stationary behaviour, most notably NC and NA cod in the northeastern Atlantic, has long been a conundrum . Recently, it was shown that these ecotypes are associated with genomic islands of differentiation, inferred to reside within chromosomal inversions in at least four linkage groups [22, 23]. It is conceivable that such genomic regions could preclude recombination and break-up of co-adapted genes within them, and thus make it possible for locally adapted ecotypes to persist in the face of continued gene flow. Similar chromosomal inversions, suggesting a common ancestry, were subsequently found to contribute to ecotype divergence in the western Atlantic as well . The mitogenome data indicate some differentiation between western and eastern cod, but a lack of differentiation between NC and NA cod. This would be consistent with isolation by distance and some gene flow between ecotypes in their mitochondrial genes and neutrally evolving parts of the nuclear genome. Thus, the geographic structuring of mitogenome variation in cod seems to be governed mainly by demographic and stochastic processes in a species with high fecundity and variance in offspring number, much in line with Árnason’s conclusions based on CytB sequences .
Our study provides a mitochondrial genome resource obtained from Atlantic cod tissue samples collected at site of fisheries in the North Atlantic Ocean. Phylogeographic analyses based on 156 mitochondrial genomes did not reveal a clear structure, but eastern and western cod were differentiated. Mitochondrial genome diversity was higher in the eastern than the western Atlantic, suggesting deviating population histories.
The SNP map of the Atlantic cod mitochondrial genome consisted of 952 polymorphic sites among the 124 specimens studied here, and 1002 polymorphic sites among 156 available mitogenomes from the western and eastern parts of the Atlantic Ocean. A more exhaustive SNP map of the cod mitogenome would most certainly require a substantial increase in the number of mitogenomes collected from the vast distribution range of the species.
single nucleotide polymorphism
Johansen SD, Coucheron DH, Andreassen M, Karlsen BO, Furmanek T, Jørgensen TE, Emblem Å, Breines R, Nordeide JT, Moum T, Nederbragt AJ, Stenseth NC, Jakobsen KS. Large-scale sequence analyses of Atlantic cod. N Biotechnol. 2009;25:263–71.
Star B, Nederbragt AJ, Jentoft S, Grimholt U, Malmstrøm M, Gregers TF, Rounge TB, Paulsen J, Solbakken MH, Sharma A, Wetten OF, Lanzén A, Winer R, Knight J, Vogel JH, Aken B, Andersen O, Lagesen K, Tooming-Klunderud A, Edvardsen RB, Tina KG, Espelund M, Nepal C, Previti C, Karlsen BO, Moum T, Skage M, Berg PR, Gjøen T, Kuhl H, Thorsen J, Malde K, Reinhardt R, Du L, Johansen SD, Searle S, Lien S, Nilsen F, Jonassen I, Omholt SW, Stenseth NC, Jakobsen KS. The genome sequence of Atlantic cod reveals a unique immune system. Nature. 2011;477:207–10.
Tørresen OK, Star B, Jentoft S, Reinar WB, Grove H, Miller JR, Walenz BP, Knight J, Ekholm JM, Peluso P, Edvardsen RB, Tooming-Klunderus A, Skage M, Lien S, Jakobsen KS, Nederbragt AJ. An improved genome assembly uncovers prolific tandem repeats in Atlantic cod. BMC Genomics. 2017;18:95.
Boore JL. Animal mitochondrial genomes. Nucleic Acids Res. 1999;27:1767–80.
Johansen S, Bakke I. The complete mitochondrial DNA sequence of Atlantic cod (Gadus morhua): relevance to taxonomic studies among codfishes. Mol Mar Biol Biotechnol. 1996;5:203–14.
Jørgensen TE, Johansen SD. Expanding the coding potential of vertebrate mitochondrial genomes: lesson learned from the Atlantic cod. In: Mattila M, editor. Mitochondrial DNA—New Insights. New York: InTech Open; 2018.
Jørgensen TE, Bakke I, Ursvik A, Andreassen M, Moum T, Johansen SD. An evolutionary preserved intergenic spacer in gadiform mitogenomes generates a long noncoding RNA. BMC Evol Biol. 2014;14:182.
Karlsen BO, Emblem Å, Jørgensen TE, Klingan KA, Nordeide JT, Moum T, Johansen SD. Mitogenome sequence variation in migratory and stationary ecotypes of North-east Atlantic cod. Mar Genomics. 2014;15:103–8.
Árnason E. Mitochondrial cytochrome b DNA variation in the high-fecundity Atlantic cod: trans-Atlantic clines and shallow gene genealogy. Genetics. 2004;166:1871–85.
Carr SM, Marshall HD. Intraspecific phylogeographic genomics from multiple complete mtDNA genomes in Atlantic cod (Gadus morhua): origins of the ‘codmother’, transatlantic vicariance and midglacial population expansion. Genetics. 2008;180:381–9.
Nordeide JT, Johansen SD, Jørgensen TE, Karlsen BO, Moum T. Population connectivity among migratory and stationary cod Gadus morhua in the Northeast Atlantic—a review of 80 years of study. Mar Ecol Prog Ser. 2011;435:269–83.
Marshall HD, Coulson MW, Carr SM. Near neutrality, rate heterogeneity, and linkage govern mitochondrial genome evolution in Atlantic cod (Gadus morhua) and other gadine fish. Mol Biol Evol. 2009;26:579–89.
Breines R, Ursvik A, Nymark M, Johansen SD, Coucheron DH. Complete mitochondrial genome sequences of the Arctic Ocean codfishes Arctogadus glacialis and Boreogadus saida reveal oriL and tRNA gene duplications. Polar Biol. 2008;31:1245–52.
Kirubakaran TG, Grove H, Kent MP, Sandve SR, Baranski M, Nome T, De Rosa MC, Righino B, Johansen T, Otterå H, Sonesson A, Lien S, Andersen Ø. Two adjacent inversions maintain genomic differentiation between migratory and stationary ecotypes of Atlantic cod. Mol Ecol. 2016;25:2130–43.
Ursvik A, Breines R, Christiansen JS, Fevolden S-E, Coucheron DH, Johansen SD. A mitogenomic approach to the taxonomy of pollocks: Theragra chalcogramma and T. finnmarchica represent one single species. BMC Evol Biol. 2007;7:87.
Notredame C, Higgins DG, Heringa J. T-Coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2001;302:205–17.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.
Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, Sánchez-Gracia A. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017;34:3299–302.
Flynn T, Signal B, Johnson SL, Gemmell NJ. Mitochondrial genome diversity among six laboratory zebrafish (Danio rerio) strains. Mitochondrial DNA Part A. 2016;27:4364–71.
Ingman M, Gyllenstein U. Analysis of the complete human mtDNA genome: methodology and inferences for human evolution. J Heredity. 2001;92:454–61.
Diroma MA, Calabrese C, Simone D, Santorsola M, Calabrese FM, Gasparre G, Attimonelli M. Extraction and annotation of human mitochondrial genomes from 1000 genomes whole exome sequencing data. BMC Genomics. 2014;15(Suppl 3):S2.
Karlsen BO, Klingan K, Emblem Å, Jørgensen TE, Jüterbock A, Furmanek T, Hoarau G, Johansen SD, Nordeide JT, Moum T. Genomic divergence between migratory and stationary Atlantic cod. Mol Ecol. 2013;22:5098–111.
Berg PR, Star B, Pampoulie C, Sodeland M, Barth JMI, Knutsen H, Jakobsen KS, Jentoft S. Three chromosomal rearrangements promote genomic divergence between migratory and stationary ecotypes of Atlantic cod. Scient Rep. 2016;6:23246.
Berg PR, Star B, Pampoulie C, Bradbury IR, Bentzen P, Hutchings JA, Jentoft S, Jakobsen KS. Trans-oceanic genomic divergence of Atlantic cod ecotypes is associated with large inversions. Heredity. 2017;119:418–28.
TEJ, BOK, ÅE, RB, MA, MN, AU, DHC, TM and SDJ organized the sequencing of the mitochondrial genomes. TBR, AJN and KSJ provided mitogenomes sequenced by Illumina and 454 sequencing. TEJ, BOK, LMJ, JTN, TM, SDJ contributed to mtDNA sequence analyses. SDJ directed the research in collaboration with JTN and TM. SDJ wrote the paper in collaboration with all authors. All authors read and approved the final manuscript.
We thank Espen Olsen, Morten Krogstad, Espen Søreng, Ove Nicolaisen, Agnieszka Kijewska, Bill Hutchinson, Sherrylynn Rowe, Asgeir Aglen, and Francois Botha for the contribution of Atlantic cod specimen samples.
The authors declare that they have no competing interests.
Availability of data
Accession numbers of mitogenomes are available at ENA under the study Accession Number PRJEB23234/ERP104973.
Consent for publication
Ethics approval and consent to participate
Fish tissue samples were obtained at site of fisheries, and do not involve research on animals. In general, this study was carried out in accordance with ethical guidelines stated by the Norwegian Ministry of Agriculture and Food through the Animal Welfare Act.
This work was supported by grants from the Norwegian Research Council (GenoFisk), Nord University, and the Arctic University of Norway-UiT.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.