Genome resequencing data for Iranian local dogs and wolves
BMC Research Notes volume 13, Article number: 436 (2020)
The data provided herein represent the whole-genome resequencing data related to three wolves and three Iranian local dogs. The understanding of genome evolution during animal domestication is an interesting subject in genome biology. Dog is an excellent model for understanding of domestication due to its considerable variety of behavioral and physical traits. The Zagros area of current day Iran has been identified as one of the initial centers of animal domestication. The availability of the complete genome sequences of Iranian local canids can be a valuable resource for researchers to address questions and testing hypotheses on the dog domestication process.
We collected blood samples from six Iranian local canids including two hunting dogs (Saluki breed), a mastiff dog (Qahderijani ecotype) and three wolves. We extracted genomic DNA from blood samples. Sequence data were produced using the Illumina HiSeq 2500 system. All sequence data are available in the National Genomics Data Center (NGDC), Genome Sequence Archive (GSA) database under the accession of CRA001324 and the National Center for Biotechnology Information (NCBI) under the accession of PRJNA639312. The short-read sequences with the mean depth of 16X were aligned to the dog reference genome (CanFam3.1) and achieved 99% coverage of the reference assembly. The obtained information from this experiment will be useful in evolutionary biology.
Dogs (Canis familiaris) were probably the earliest domesticated animals and one of the human companions in ancient times [1, 2]. Archaeological findings and genetic research indicated that the dog breeds have derived from wild wolves [3,4,5]. In the Southwest Asia, major–scale farming extended within the so-named Fertile Crescent (FC), where the independent domestication of plants and animals occurred [6, 7]. Extensively, cultural advances occurred in the Zagros area of current day Iraq and Iran, connecting Iranian plateau and Mesopotamia . Dogs had been pictured frequently in Southwest Asia [1, 9]. Consequently, one of the notable viewpoints on the primary location of the dog domestication has been the Southwest Asia, likely the Middle East . Moreover, the Middle East has been included in the considerable allelic distribution between dog breeds and wolf ; however, this presumption has been queried because of dog-wolf hybridization as stated in previous studies [11,12,13]. The dog is a considerable example of phenotypic variation under artificial selection and demographic forces, but genetic basis of this diversity is not yet completely clear. Therefore, the availability of complete whole-genome resequencing data of Iranian local canids will provide an opportunity for researchers to trace the origin of dog domestication. We firstly carried out genome sequencing of six Iranian local canids including two hunting dogs (Saluki breed), a mastiff dog (Qahderijani ecotype) and three wolves (Table 1). We used these data for identifying effective genomic variants in dogs and wolves .
We collected blood samples from three Iranian local dogs and three Iranian local wolves with the approval of the owners from six various sites in Iran. Sampling of Saluki dogs was done on Jamil Tavanaei’s personal farms in Kurdistan zone (Sanandaj and Bijar) and sampling of a Qahderijani dog was conducted on Alireza Hoseini private farm in Isfahan zone. One of the wolf samples was collected from Kerman zoological garden in Kerman zone and the others were collected from Eram zoological garden in Tehran zone. DNA was extracted with phenol/chloroform method. For sequencing library preparation, the genomic DNA was sheared to fragments of 300–500 bp, which were then end-repaired, “A”-tailed, and ligated to Illumina sequencing adapters. The ligated products with sizes of 400–500 bp were selected on 2% agarose gels and then amplified by LM-PCR. Illumina paired-end whole-genome resequencing for six individuals was done with Hiseq2500 Illumina system) http://www.berrygenomics.com). Both nuclear and mitochondrial genomes were sequenced. We created 287.5 Gb data with a uniform read length of 150 bp. A total of 1,884,054,828 short reads were generated for all of the six individuals. After filtering, the range of total high-quality sequence data was from 42.1 Gb to 51 Gb and the coverage varied from 14.51X to 17.15X. The range of the mean insert sizes and their standard deviations in the sequenced data for all samples was from 280.06 to 331.86 and from 27.12 to 33.94, respectively.
The quality assessment of raw sequence reads was done with FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). We used BWA (v.0.7.15)  program to compare sequence data with the reference genome (CanFam3.1) downloaded from the Ensembl (http://asia.ensembl.org/Canis_lupus_familiaris/Info/Index). The alignment quality was assessed with SAMtools v.1.9 using flagstat and depth commands . The short-read sequences with the mean depth of 16X were mapped to the dog reference genome (CanFam3.1) and achieved 99% coverage of the reference assembly. The mapping output files were preprocessed using SAMtools , the Picard tools (http://broadinstitute.github.io/picard/) and GATK tools . We used variome detection pipeline for this data using CNVnator , BreakDancer , DELLY  and Bedtools  programs . Finally, we compared the effect of variome between the dog and wolf genomes using Sorting Intolerant from Tolerant (SIFT) algorithm , Ensembl annotation  and DAVID  tool . The data presented herein together with our previously mitochondrial DNA sequence on Iranian dogs  will provide useful resources to understand genetic structure of the Iranian dogs and testing hypotheses on the dog origin and domestication issues.
Sample size for the dog and wolf populations is a limitation of our work. We could create genome sequence data from only three wolves and three dogs. In addition, we produced the short-reads with a mean depth of 16X which is a medium depth and it might not be suitable for some genomic analyses.
Availability of data and materials
The raw data reported here are available in the NGDC, GSA database (https://bigd.big.ac.cn/gsa/) under the accession number of CRA001324 and NCBI under the accession of PRJNA639312. Please see the data files 1 to 6 in Table 1 for more details on the raw sequence data [24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45].
Genome sequence archive
National Center for Biotechnology Information
National Genomics Data Center
Sorting intolerant from tolerant
Clutton-Brock J. Domesticated animals: from early times, Heinemann in assoc. with British Museum. London: Natural history; 1981.
Wang GD, Zhai W, Yang HC, Wang L, Zhong L, Liu YH, et al. Out of southern East Asia: the natural history of domestic dogs across the world. Cell Res. 2016;26:21–33.
Clutton-Brock J. Origins of the dog: domestication and early history. In: Serpell J, editor. The domestic dog: its evolution, behaviour, and interactions with people. New York: Cambridge University Press; 1995. p. 7–20.
Vilà C, Savolainen P, Maldonado JE, Amorim IR, Rice JE, Honeycutt RL, et al. Multiple and ancient origins of the domestic dog. Science. 1997;276:1687–9.
Wayne RK. Molecular evolution of the dog family. Trends Genet. 1993;9:218–24.
Colledge S, Conolly J, Shennan S, Bellwood P, Bouby L, Hansen J, et al. Archaeobotanical evidence for the spread of farming in the Eastern Mediterranean 1. Curr Anthropol. 2004;45:S35–58.
Zeder MA. Domestication and early agriculture in the Mediterranean Basin: origins, diffusion, and impact. Proc Natl Acad Sci U S A. 2008;105:11597–604.
Alizadeh A. The rise of the highland Elamite state in southwestern Iran. Curr Anthropol. 2010;51:353–83.
Przezdziecki XJB, Paris G. Our levriers: the past, present and future of all sighthounds. France: Les Amis de Xavier Przezdziecki, La Colle-sur-Loup; 2001.
VonHoldt BM, et al. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication. Nature. 2010;464:898–902.
Amiri Ghanatsaman Z, Adeola AC, Asadi Fozi M, Ma YP, Peng MS, Wang GD, et al. Mitochondrial DNA sequence variation in Iranian native dogs. Mitochondrial DNA A DNA Mapp Seq Anal. 2017;17:1–9.
Ardalan A, Kluetsch CF, Zhang AB, Erdogan M, Uhlén M, Houshmand M, et al. Comprehensive study of mtDNA among Southwest Asian dogs contradicts independent domestication of wolf, but implies dog–wolf hybridization. Ecol Evol. 2011;1:373–85.
Freedman AH, Gronau I, Schweizer RM, Ortega-Del Vecchyo D, Han E, Silva PM, et al. Genome sequencing highlights the dynamic early history of dogs. PLoS Genet. 2014;10:e1004016.
Amiri Ghanatsaman Z, Wang GD, Asadollahpour Nanaei H, Asadi Fozi M, Peng MS, Esmailizadeh A, et al. Whole genome resequencing of the Iranian native dogs and wolves to unravel variome during dog domestication. BMC Genomics. 2020;21:1–11.
Li H, Durbin R. Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics. 2009;25:1754–60.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The genome analysis toolkit: a MapReduce frame work for analyzing nextgeneration DNA sequencing data. Genome Res. 2010;20:1297–303.
Abyzov A, Urban AE, Snyder M, Gerstein M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011;21:974–84.
Chen K, Wallis J, McLellan M, et al. BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods. 2009;6:677–81.
Rausch T, Zichner T, Schlattl A, Stütz AM, Benes V, Korbel JO. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. 2012;28:i333–9.
Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–2.
Flicek P, Amode MR, Barrell D, Beal K, Brent S, Carvalho-Silva D, et al. Ensembl 2012. Nucleic Acids Res. 2011;40:D84–90.
Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/bioproject/browse/PRJCA001183.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042720.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042721.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042722.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042723.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042724.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042725.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042726.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042727.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042728.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042729.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042730.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042731.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042732.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042733.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042734.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042735.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042736.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042737.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042738.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042739.
BIGD Genome Warehouse; 2020. https://bigd.big.ac.cn/gsa/browse/CRA001324/CRR042740.
We are thankful for helping personnel from Department of Environmental Protection in Iran, office of natural resources in Kerman and Tehran, Shiraz, Tehran Eram and Kerman zoological gardens and the dog owners. Also, we greatly appreciate Dr. Iman Memarian and Dr. Hosein Rashidi for sampling from wolf in Tehran Eram and Kerman zoological gardens.
The funds for conducting this experiment were provided by the Youth Innovation Promotion Association, Chinese Academy of Sciences, the Chinese Academy of Sciences President’s International Fellowship Initiative (No. 2016VBA050), the National Natural Science Foundation of China (No. 91531303), the International Cooperation Program of Bureau of International Cooperation of Chinese Academy of Sciences (No.GJHZ1559) and the Animal Branch of the Germplasm Bank of Wild Species, Chinese Academy of Sciences (the Large Research Infrastructure Funding).
Ethics approval and consent to participate
This work had Institutional Animal Care and Use Committee (Kunming Institute of Zoology, approval ID: SYDW-2013021) approval. We collected peripheral blood samples from 3 Iranian dogs with the consent of owners and 3 gray wolves after obtaining consent for research from the Department of Environmental Protection in Iran (No. 93/34089, dated 14 October 2014).
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Amiri Ghanatsaman, Z., Wang, GD., Asadi Fozi, M. et al. Genome resequencing data for Iranian local dogs and wolves. BMC Res Notes 13, 436 (2020). https://doi.org/10.1186/s13104-020-05271-3
- Whole-genome resequencing