Skip to main content

Genotyping data of French wild boar populations using porcine genome-wide genotyping array



The admixture of domestic pig into wild boar populations is controlled until now, by cytogenetic analysis. Even if a first-generation hybrid animal is discernable because of its 37-chromosome karyotype, the cytogenetic method is not applicable in the case of advanced intercrosses. The aim of this study is therefore to evaluate the use of SNP (Single Nucleotide Polymorphism) markers as an alternative technology to characterize recent or past hybridization between the two sub-species. The final goal would be to develop a molecular diagnostic tool.

Data description

The Geneseek Genomic Profiler High-Density porcine beadchip (GGP70KHD, Illumina, USA), comprising 68,516 porcine SNPs, was used on a set of 362 wild boars with diverse chromosomal statuses collected from different areas and breeding environments in France. We generated approximately 62,192–64,046 genotypes per wild boar. The present dataset might be useful for the community (i) for developing molecular tools to evaluate the admixture of domestic pig into wild boar populations, and (ii) for genetic diversity studies including wild boar species or phylogeny analyses of Suidae populations.

Raw data files and a processed matrix data file were deposited in the ArrayExpress at European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI) data portal under accession number E-MTAB-10591.


Different studies have reported that wild boar populations have significantly increased in France, like in other parts of the world [1]. Among the different hypothesis, the mating of domestic pigs with wild boars can partly explain this expansion [2]. The growth of the wild boar population today is associated with many problems such as crop damage, increased health risks for livestock and decreased biodiversity. Consequently, the genetic status of the wild boar populations has been monitored by cytogenetic analysis since the 1980s in France [3]. Indeed, the karyotype of the wild boar (2n = 36) differs from that of the domestic pig (2n = 38) due to a Robertsonian translocation between chromosomes 15 and 17. However, this method is not relevant enough to guarantee the “wild boar status” of an animal with 36 chromosomes resulting, for example, from the mating of two parents with 37 chromosomes. Therefore, the aim of this analysis was to evaluate the use of molecular markers as an alternative technology. A genome-wide high-throughput genotyping approach had not yet been performed on a large number of French wild boars; only a small-scale molecular approach has recently been evaluated [4, 5]. In addition, chromosomal control requires metaphase spreads prepared from cultures of peripheral blood lymphocytes obtained in fresh whole heparin blood samples, which can be very difficult to collect on wild animals. The final aim would be to develop a robust and practical molecular diagnostic tool to replace cytogenetic analyses.

The data set is still confidential but a scientific publication is in progress and initial analyses have already been presented in an oral communication [6]. In the future, the use of molecular markers for wild boar genotyping should make it possible to refine our knowledge about the consequences of hybridization phenomena on the phenotypic traits of wild animals and to evaluate if the evolutions observed in wild populations can result from the genetic introgression of alleles from the domestic pig.

The present data set might be useful for genetic diversity studies to improve wild species preservation measures in our country. In addition, these all-genome genotyping data from the French wild boar populations could be useful for phylogeny studies of Suidae populations.

Data description


The experiment included 362 French wild boars from different French regions and environments. Most of the animals (n = 252) were bred in enclosed wild boar farms spread out over 38 French administrative subdivisions. The other animals were bred in free-range conditions, either in a protected area (French administrative subdivision, “Deux-Sèvres”; n = 31), or in an unprotected area (French administrative subdivision, “Ardèche”; n = 79). There were 223 females and 139 males. According to the cytogenetic analysis, there were 203 animals with chromosome numbers equal to 2n = 36, 70 animals with 2n = 37, 10 animals with 2n = 38, and 79 animals with unknown chromosomal status. No animals in our study were bred/killed/taken specifically for the needs of our project, which therefore did not require explicit authorization (in accordance with the 2010/63/EU European Directive).

Sample collection

A total of 362 biological samples were collected. The biological samples of animals from free ranging “Ardèche” were ear biopsies (n = 79) collected between the years 2014–2016 for an edema disease project [7]. The other samples were blood samples (n = 283) collected between the years 2017–2019, analyzed on the chromosomal control platform [3] and selected to be representative of the farm diversity.

DNA extraction

DNA samples were extracted for the 362 biological samples. Genomic DNA of 283 heparin blood samples were extracted using the Blood DNA Isolation (Norgen) kit. Genomic DNA of 79 ear biopsies were extracted with an in-house protocol (proteinase K lysis followed by salt-based DNA extraction and ethanol precipitation). Total genomic DNA quality was determined using the Nanodrop 8000 spectrophotometer (ND8000LAPTOP, Thermo Fisher Scientific, USA). Total genomic DNA concentration was determined using the Quant-iT picogreen dsDNA Assay Broad Range kit (Invitrogen, Q33130, Thermo Fisher Scientific, USA) with the QuantStudio6 instrument at the Genomic and Transcriptomic (GeT) platform [8]. All the samples were managed with the GeT Barcode database.

DNA genotyping experiment and raw data

A total of 362 genomic DNA samples were genotyped in collaboration with the Cancer Research Center of Toulouse (CRCT) core facility platform [9]. Genotypes were performed using the GGP70KHD porcine array comprising 68,516 porcine SNPs. The Infinium High Density Ultra Assay protocol (Illumina, USA) was performed according to the manufacturer’s recommendations. The whole data set was produced on 16 beadchips. Raw data were processed with an Array Scanner iScan System (Illumina, USA) instrument, and 724 raw data files (Table 1 Data set 1 and Data set 2) were deposited in ArrayExpress at the EMBL-EBI data portal under accession number E-MTAB-10591 [10].

Table 1 Overview of data set

DNA genotyping analysis

Genotypes were inferred from the raw fluorescence intensity data using the Genotyping Analysis Module included in GenomeStudio (version 2.0.5) software (Illumina, USA). We used a custom cluster from the commercial cluster file and Illumina guidelines [11]. The call rate of wild boar samples is approximately 0.91–0.93, and we generated approximately 62,192–64,046 genotypes per DNA sample. The genotyping report of all samples was exported in a single file with top format alleles, including B Allele Freq and Log R Ratio for potential CNV (Copy Number Variation) analysis. The processed matrix data file (Table 1 Data set 3) was deposited in ArrayExpress at the EMBL-EBI data portal under accession number E-MTAB-10591 [10].


The call rate of wild boar samples is approximately 0.91–0.93. Quality of wild boar genotyping using porcine array depends on the quality of heterologous hybridization.

Availability of data and materials

The data presented in this Data Note can be freely and openly accessed on the EMBL-EBI ArrayExpress portal under accession number E-MTAB-10591, [10].



Single nucleotide polymorphism


Geneseek genomic profiler 70 K high density


European molecular biology laboratory-european bioinformatics institute


Copy number variation


Gestion integree de la sante animaux


Emergence of a pig disease in wild boars in a context of increasing interspecific interactions


Genomic and transcriptomic


Cancer research center of toulouse


  1. Massei G, Kindberg J, Licoppe A, Gačić D, Šprem N, Kamler J, et al. Wild boar populations up, numbers of hunters down? a review of trends and implications for Europe: wild boar and hunter trends in Europe. Pest Manag Sci. 2015;71(4):492–500.

    CAS  Article  Google Scholar 

  2. Khederzadeh S, Kusza S, Huang C, Markov N, Scandura M, Babaev E, et al. Maternal genomic variability of the wild boar ( Sus scrofa ) reveals the uniqueness of East-Caucasian and central Italian populations. Ecol Evol. 2019;9(17):9467–78.

    Article  Google Scholar 

  3. Ducos A, Revay T, Kovacs A, Hidas A, Pinton A, Bonnet-Garnier A, et al. Cytogenetic screening of livestock populations in Europe: an overview. Cytogenet Genome Res. 2008;120(1–2):26–41.

    CAS  Article  Google Scholar 

  4. Beugin M-P, Baubet E, Dufaure De Citres C, Kaerle C, Muselet L, Klein F, et al. A set of 20 multiplexed single nucleotide polymorphism (SNP) markers specifically selected for the identification of the wild boar (Sus scrofa scrofa) and the domestic pig (Sus scrofa domesticus). Conserv Genet Resour. 2017;9(4):671–5.

    Article  Google Scholar 

  5. Lorenzini R, Fanelli R, Tancredi F, Siclari A, Garofalo L. Matching STR and SNP genotyping to discriminate between wild boar, domestic pigs and their recent hybrids for forensic purposes. Sci Rep. 2020;10(1):3188.

    CAS  Article  Google Scholar 

  6. Mary N, Iannuccelli N, Petit G, Bonnet N, Pinton A, Grosbois V, et al. Analyse de l’hybridation dans les populations françaises de sangliers à l’aide de données de génotypage pangénomique. J Recherche Porcine. 2021;1:53–6.

    Google Scholar 

  7. Petit G, Grosbois V, Chalvet-Monfray K, Ducos A, Desmecht D, Martineau G-P, et al. Polymorphism of the alpha-1-fucosyltransferase (FUT1) gene in several wild boar (Sus scrofa) populations in France and link to edema disease. Res Vet Sci. 2020;29(131):78–86.

    Article  Google Scholar 

  8. GeT—Génomique & transcriptomique. 2021

  9. Pôle technologique du CRCT—Site présentant les travaux et technologiques du pôle technologique du CRCT de Toulouse. 2021

  10. ArrayExpress < EMBL-EBI.

  11. A guide for analysis Infinium genotyping data using the Illumina GenomeStudio Genotyping Module 2010 Infinium Genotyping Data Analysis https://

Download references


The authors would like to thank the people who work on the INRAE GenPhySE chromosomic platform. The authors are also grateful to the ONCFS (Office National Chasse Faune Sauvage).


The authors are thankful for the research grant given by the INRAE-GISA (Gestion Intégrée de la Santé des Animaux) metaprogram project EPIDEWILD-3i (Emergence of a Pig DisEase in WILD boars in a context of Increasing Interspecific Interactions).

Author information




NI carried out DNA isolation, performed the experiment, managed the data and wrote the manuscript. NM carried out DNA isolation and analyzed the genotypes. GP and NB collected the wild boar biological samples. CV managed the genotyping service on the platform. AD and JR designed the study and secured funding. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Nathalie Iannuccelli.

Ethics declarations

Ethics approval and consent to participate

No animals in our study were bred/killed/taken specifically for the needs of our project, which therefore did not require explicit authorization (in accordance with the 2010/63/EU European Directive).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Iannuccelli, N., Mary, N., Bonnet, N. et al. Genotyping data of French wild boar populations using porcine genome-wide genotyping array. BMC Res Notes 15, 170 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Wild boar
  • Genome-wide DNA genotyping
  • SNP
  • Porcine microarray