- Research note
Reduction of the contaminant fraction of DNA obtained from an ancient giant panda bone
BMC Research Notesvolume 10, Article number: 754 (2017)
A key challenge in ancient DNA research is massive microbial DNA contamination from the deposition site which accumulates post mortem in the study organism’s remains. Two simple and cost-effective methods to enrich the relative endogenous fraction of DNA in ancient samples involve treatment of sample powder with either bleach or Proteinase K pre-digestion prior to DNA extraction. Both approaches have yielded promising but varying results in other studies. Here, we contribute data on the performance of these methods using a comprehensive and systematic series of experiments applied to a single ancient bone fragment from a giant panda (Ailuropoda melanoleuca).
Bleach and pre-digestion treatments increased the endogenous DNA content up to ninefold. However, the absolute amount of DNA retrieved was dramatically reduced by all treatments. We also observed reduced DNA damage patterns in pre-treated libraries compared to untreated ones, resulting in longer mean fragment lengths and reduced thymine over-representation at fragment ends. Guanine–cytosine (GC) contents of both mapped and total reads are consistent between treatments and conform to general expectations, indicating no obvious biasing effect of the applied methods. Our results therefore confirm the value of bleach and pre-digestion as tools in palaeogenomic studies, providing sufficient material is available.
Ancient DNA (aDNA) research contributes a wide range of applications and prospects to the field of evolutionary biology . As a result of post mortem microbial colonisation, the endogenous fraction of DNA in ancient samples typically makes up less than 1% of the total retrieved DNA (e.g. ). The financial costs required to sequence such a sample may therefore be prohibitive. Multiple approaches exist to reduce the contaminant fraction of DNA [3, 4]. Particularly appealing for their simplicity and cost-effectiveness are the exposure of powder from bones or teeth to bleach (sodium hypochlorite solution) , a pre-digestion buffer , or a combination of both . However, the precise mechanisms underlying these methods remain uncertain, and their exact effect on a particular sample is difficult to accurately predict . The data presented here contribute to a better understanding of these pre-treatment methods.
In this study, three different concentrations of bleach as well as a pre-digestion buffer were applied to the powder of a single ancient bone fragment from a giant panda (Ailuropoda melanoleuca). The effects of these different applications on endogenous DNA content, complexity of DNA sequencing libraries, and on characteristic aDNA damage patterns were evaluated.
Ancient bone sample
The investigated bone fragment was found in the sinkhole of Xiaoshuijing, Jiangdong Hill, Tengchong County, Yunnan, China. Its age is 8470 ± 45 years based on radio-carbon dating .
Established procedures to avoid modern contamination were followed during DNA extraction and library preparation . Appropriate blank controls were included during all procedures, and consisted of: extraction buffer without bone powder for extraction; nuclease free water instead of DNA extract for library preparation; and Tris–EDTA–Tween (TET) buffer containing 10 mM Tris–HCl (Thermo Fisher 15568-025), 1 mM EDTA (VWR E177-500MLDB) and 0.05% (v/v) Tween-20 (A. Hartenstein CT20), instead of DNA template for all PCRs. No investigator blinding or treatment randomisation was carried out.
Using a mixer mill (Retsch MM 400), the 1.55 g bone fragment was ground into homogenous powder and then divided into 61 portions of ~ 25 mg and stored at − 20 °C. Eleven portions of 25.1 ± 0.6 mg were used in this study.
Two replicates were carried out for each pre-treatment method. Bleach pre-treatments comprised v/v-dilutions of 0.1, 0.5 and 1.0% bleach (laboratory grade sodium hypochlorite, Sigma Aldrich 425044, 10–15% available chlorine), resulting in ~ 0.015, ~ 0.075 and ~ 0.150% available chlorine, respectively. Following Korlević et al. , 1 mL of bleach solution was added to the bone powder, and rotated for 15 min at room temperature. The sample was then pelleted by centrifugation at 16,300×g, and the resulting supernatant discarded. Three wash steps were then carried out involving rotation for 3 min in 1 mL water, and centrifugation at 16,300×g. For pre-digestion treatment, a pre-digestion buffer containing 0.5% (w/v) N-lauroylsarcosine (Sigma Aldrich L9150-50G), 0.5 M EDTA (VWR E177-500MLDB) and 0.25 mg/mL Proteinase K (Promega V3021) was applied to bone powder samples as described by Damgaard et al. . The buffer volume was adjusted for the lower amount of bone powder used here, compared with Damgaard et al. , by using 312.5 µL per application instead of 5 mL. Samples were incubated at 37 °C (rather than 50 °C as used in ), to match final extraction conditions, and for 45 min (rather than 30 min as used in ), as recommended for lower incubation temperatures . After incubation, the tubes were centrifuged for 2 min at 16,300×g, and the supernatant discarded. All pre-treatments were immediately followed by DNA extraction. In order to gain comparative values, three bone powder portions were additionally processed without any pre-treatment.
DNA extraction was performed according to Dabney et al.  with reduced bone powder input mass, and reduced centrifugation speed of the binding apparatus at approximately 450×g. The lower centrifugation speed was chosen based on our previous experience that the binding apparatus can break during centrifugation at higher speeds. To examine the influence of a reduced bone powder input, we performed 12 independent extractions on six cave hyena samples using both 25 and 50 mg of bone powder. Results from this comparison (Additional file 1: Figure S1) showed no consistent evidence of a reduction in DNA yield greater than would be expected based on input bone powder amounts (i.e. DNA yield from 25 mg bone powder being half that obtained from 50 mg bone powder). We interpret this result as indication that no obvious negative influence on extraction efficiency is caused by using this reduced input bone powder amount.
Sequencing libraries were generated from each DNA extract following a single-stranded library preparation protocol , which included treatment with uracil-DNA glycosylase (New England Biolabs M0279) and endonuclease VIII (New England Biolabs M0299). The Klenow Fragment of DNA polymerase I (Thermo Fisher Scientific EP0051) was used for the fill-in reaction . 2.5 U/μL of Circligase II (Biozym 131406) was used and the ligation reaction carried out overnight. A quantitative PCR (qPCR) experiment was carried out using 0.2% of the unamplified library to estimate relative library complexities (Additional file 2: Table S1), and to determine the optimal number of cycles for subsequent indexing PCR, representing the inflection point of the respective library amplification curves, corrected for reaction volume and template amount. qPCR was performed on a PikoReal 96 Real-Time PCR machine (Thermo Fisher Scientific TCR0096) with 3 replicates for each library, involving an initial 10 min denaturation at 95 °C, followed by 40 cycles of: 15 s at 95 °C, 30 s at 60 °C, and 1 min at 72 °C. The 10 μL qPCR reaction mix contained 1 μL of diluted library and final concentrations of 1 × SYBR Green qPCR Master Mix (Applied Biosystems 4309155) and 0.2 μM of each primer IS7 and IS8 . The indexing PCR was then performed for the appropriate number of cycles, introducing unique 8 bp indices to both 5′ and 3′ adapters. Final concentrations and PCR were as described by Gansauge and Meyer , but using 20 μL of template DNA in a total reaction volume of 80 μL.
DNA sequencing was performed on an Illumina NextSeq 500 sequencing platform, using 500/550 Mid Output v2 (150 cycles, Illumina FC-404-2001) and 500/550 High Output v2 (75 cycles, Illumina FC-404-2005) kits, with a custom read-1  and a custom index-2  sequencing primer. Although paired-end data was acquired for some libraries (GP1-01, GP1-02, GP1-03), only their first reads were used in data analysis, effectively unifying all sequence reads to single-end data of 76 bp length.
Sequence data analysis
Sequencing reads were trimmed using the software cutadapt (version 1.4.2) , requiring a minimum 4 bp overlap for adapter trimming. Duplicates were removed from the trimmed reads using Tally  (version 14-020), and 1,500,000 reads subsampled in order to estimate the fragment length distribution of the total DNA (both endogenous and contaminant) recovered using each treatment (Additional file 3: Table S2, Additional file 4: Table S3).
For comparison of endogenous DNA, 1,500,000 trimmed reads ≥ 30 bp were randomly subsampled (Additional file 3: Table S2) and mapped to the reference genome assembly of the giant panda  using the “aln” algorithm of BWA  (version 0.7.8-r455), with default parameters, and converted to bam format using BWA’s “samse” utility. Using SAMtools  (version 0.1.19-44428cd), reads mapping with a phred quality score below 30 were removed (samtools view). The alignment was sorted by 5′ read position (samtools sort), and duplicate reads were collapsed (samtools rmdup). Thymine over-representation at 5′ ends of endogenous DNA fragments was assessed using mapDamage2.0  (version 2.0.2-8-gaeeeffc-dirty).
For the total DNA, guanine–cytosine (GC) contents were obtained directly from trimmed FASTQ files (Additional file 3: Table S2, Additional file 4: Table S3). For endogenous DNA, mapped reads were converted into the FASTQ format using BEDtools  (version v2.25.0) for their GC content to be assessed.
Endogenous content and total DNA recovery
All pre-treated libraries showed higher endogenous contents than untreated ones (Table 1). Considering mean values for each pre-treatment method, the highest increase in endogenous content was observed for 0.5% bleach (ninefold, Fig. 1), which is consistent with the results of previous studies . 0.1 and 1.0% bleach concentrations resulted in eightfold and fivefold mean-increase in endogenous content, respectively. The effect of pre-digestion was less pronounced, providing a twofold increase in endogenous content. Overall DNA retrieval was drastically reduced by all pre-treatment methods, scaling in magnitude with bleach concentration, with the effect of pre-digestion again being less pronounced (Fig. 1). It should be noted that the amounts of retrieved DNA vary within most treatments (by up to 61%, Table 1), which appears to be a common phenomenon in aDNA studies (e.g. ).
The up to ninefold increase of endogenous content in pre-treated libraries implies an equivalent cost reduction for further sequencing attempts of the sample investigated here. However, the large reduction in DNA retrieval rates will be associated with reduced library complexity (i.e. the number of distinct DNA molecules it contains), which may counter any increases in endogenous content by increasing sequence duplication rates. This effect can be mitigated by processing an increased amount of bone powder, provided sufficient material is available.
Fragment lengths, thymine over-representation and GC content
Mean fragment lengths were generally higher for pre-treated libraries than for untreated ones (Fig. 2a–e). For bleach treated libraries, the length increase appears to be positively correlated with bleach concentration. The mean length of pre-digested libraries was intermediate between that of the 0.1 and 0.5% bleach treated libraries. However, the variation of mean fragment length within replicates was often larger than the difference between treatments. Because of the small number of replicates carried out, any conclusions based on mean values are therefore tentative, and more replicates would be needed in order to robustly test these hypotheses. We also observed consistently reduced levels of thymine overrepresentation in pre-treated libraries compared to untreated ones, albeit with some variation in exact frequencies between replicates (Fig. 2f).
DNA fragmentation and cytosine deamination are typical forms of damage occurring to ancient DNA molecules . The observation that libraries pre-treated with bleach exhibit larger average fragment lengths and reduced cytosine deamination (inferred from thymine over-representation), despite the DNA degrading properties of bleach [22, 23], seems counter-intuitive. We hypothesise that DNA from osteocytes is more protected from both damage and contamination due to its location within the bone’s lacunae [24, 25]. Sample pre-treatment thus enriches for this osteocyte DNA providing both an increase in the endogenous fraction and a reduction in damage rates, but this hypothesis is currently untested.
The GC content of the total recovered DNA was consistently lower in bleached libraries in comparison to untreated ones (Table 1). Smaller reductions in GC content were observed with pre-digestion. For endogenous DNA (mapped reads), GC contents were around 38%, regardless of the pre-treatment method (Table 1), as expected for a mammalian genome , indicating no obvious GC content bias introduced by the pre-treatments used in this study.
Our results add to a growing body of research confirming that bleach and pre-digestion are valuable tools for the study of both palaeogenomes [5,6,7] and forensics [20, 27]. The majority of published studies have applied these methods to bone and/or tooth samples, however bleach has also been used successfully to remove modern human DNA contamination from hairs (e.g. ). To our knowledge, only mitochondrial sequences have been successfully retrieved from ancient hair specimens to date, rendering them potentially less useful than bone samples for studies of ancient nuclear genomes. The wider potential for these pre-treatment methods in the retrieval of genetic data from other ancient or degraded tissues appears largely unexplored, but may represent a beneficial area for future research.
The increases in endogenous content associated with the pre-treatment methods applied here could provide a direct and equivalent reduction in sequencing costs. However, the inevitable reduction of library complexity may counter such gains and necessitate the processing of more sample material. Pre-treatments may be further improved by fine tuning concentrations and incubation times, as well as by comparing their effect on samples from different species, time periods and deposition environments. Even now, samples with very low endogenous DNA contents may become viable for whole-genome sequencing if pre-treatment is applied, greatly increasing the number of potential study subjects.
Finally, the high experimental noise observed in the rates of DNA retrieval (Table 1) and mean fragment lengths (Fig. 2a–e), appear to be common in aDNA research (e.g. ). A large number of replicates may therefore be needed for statistical confirmation of the observed trends, particularly when effect sizes are small.
Only one bone was investigated, precluding broad generalisations as bleach and pre-digestion treatments are known to have sample-specific effects.
Due to limited amount of material, not enough replicates could be prepared to statistically confirm the results.
- GC content:
next generation sequencing
Hofreiter M, Paijmans JLA, Goodchild H, Speller CF, Barlow A, Fortes GG, et al. The future of ancient DNA: technical advances and conceptual shifts. BioEssays. 2015;37:284–93.
Carpenter ML, Buenrostro JD, Valdiosera C, Schroeder H, Allentoft ME, Sikora M, et al. Pulling out the 1%: whole-genome capture for the targeted enrichment of ancient DNA sequencing libraries. Am J Hum Genet. 2013;93:852–64.
Mamanova L, Coffey AJ, Scott CE, Kozarewa I, Turner EH, Kumar A, et al. Target-enrichment strategies for next-generation sequencing. Nat Methods. 2010;7:111–8.
Briggs AW, Good JM, Green RE, Krause J, Maricic T, Stenzel U, et al. Targeted retrieval and analysis of five Neandertal mtDNA genomes. Science. 2009;325:318–21.
Korlević P, Gerber T, Gansauge M-T, Hajdinjak M, Nagel S, Aximu-Petri A, et al. Reducing microbial and human contamination in DNA extractions from ancient bones and teeth. Biotechniques. 2015;59:87–93.
Damgaard PB, Margaryan A, Schroeder H, Orlando L, Willerslev E, Allentoft ME. Improving access to endogenous DNA in ancient bones and teeth. Sci Rep. 2015;5:11184.
Boessenkool S, Hanghøj K, Nistelberger HM, Der Sarkissian C, Gondek A, Orlando L, et al. Combining bleach and mild pre-digestion improves ancient DNA recovery from bones. Mol Ecol Resour. 2016. https://doi.org/10.1111/1755-0998.12623/full.
Jablonski NG, Xueping J, Hong L, Zheng L, Flynn LJ, Zhicai L. Remains of Holocene giant pandas from Jiangdong Mountain (Yunnan, China) and their relevance to the evolution of quaternary environments in south-western China. Hist Biol. 2012;24:527–36.
Fulton TL. Setting up an ancient DNA laboratory. Methods Mol Biol. 2012;840:1–11.
Dabney J, Knapp M, Glocke I, Gansauge M-T, Weihmann A, Nickel B, et al. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc Natl Acad Sci USA. 2013;110:15758–63.
Gansauge M-T, Meyer M. Single-stranded DNA library preparation for the sequencing of ancient or damaged DNA. Nat Protoc. 2013;8:737–48.
Paijmans JLA, Baleka S, Henneberger K, Barlow A. Sequencing single-stranded libraries on the Illumina NextSeq 500 platform. 2017. arXiv:1711.11004v1.
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10.
Davis MPA, van Dongen S, Abreu-Goodger C, Bartonicek N, Enright AJ. Kraken: a set of tools for quality control and analysis of high-throughput sequence data. Methods. 2013;63:41–9.
Li R, Fan W, Tian G, Zhu H, He L, Cai J, et al. The sequence and de novo assembly of the giant panda genome. Nature. 2010;463:311–7.
Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–60.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.
Jónsson H, Ginolhac A, Schubert M, Johnson PLF, Orlando L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics. 2013;29:1682–4.
Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–2.
Barta JL, Monroe C, Kemp BM. Further evaluation of the efficacy of contamination removal from bone surfaces. Forensic Sci Int. 2013;231:340–8.
Dabney J, Meyer M, Pääbo S. Ancient DNA damage. Cold Spring Harb Perspect Biol. 2013;5:a012567.
Hayatsu H, Pan S-K, Ukita T. Reaction of sodium hypochlorite with nucleic acids and their constituents. Chemistry Pharmacy Bulletin. 1971.
Ohnishi S, Murata M, Kawanishi S. DNA damage induced by hypochlorite and hypobromite with reference to inflammation-associated carcinogenesis. Cancer Lett. 2002;178:37–42.
Bell LS, Kayser M, Jones C. The mineralized osteocyte: a living fossil. Am J Phys Anthropol. 2008;137:449–56.
Salamon M, Tuross N, Arensburg B, Weiner S. Relatively well preserved DNA is present in the crystal aggregates of fossil bones. Proc Natl Acad Sci USA. 2005;102:13783–8.
Li XQ, Du D. Variation, evolution, and correlation analysis of C+G content and genome or chromosome size in different kingdoms and phyla. PLoS ONE. 2014;9:e88339.
Kemp BM, Smith DG. Use of bleach to eliminate contaminating DNA from the surface of bones and teeth. Forensic Sci Int. 2005;154:53–61.
Gilbert MTP, Menez L, Janaway RC, Tobin DJ, Cooper A, Wilson AS. Resistance of degraded hair shafts to contaminant DNA. Forensic Sci Int. 2006;156:208–12.
NB, GX, MVW and GS carried out the lab work. NB carried out the data analysis. LS processed the bone sample. NB, GX and AB designed the experiments. AB supervised the research. GS obtained the funding. NB and AB wrote the manuscript. All authors read and gave comments on the draft version of the manuscript. All authors read and approved the final manuscript.
First author NB performed the work presented in this study in the context of his bachelor thesis in Biosciences at the University of Potsdam. He is currently following the master programme in Biochemistry and Molecular Biology at the University of Potsdam and aims for a scientific career in the fields of evolutionary and molecular biology.
We thank Professor Xueping Ji of the Yunnan Cultural Relics and Archaeology Institute for providing the specimen and sharing the radiocarbon date. We thank Professor Xulong Lai of the State Key Laboratory of Biogeology and Environmental Geology (BGEG) at the China University of Geosciences, Wuhan, for collecting the specimen from the Institute. We thank Professor Michael Hofreiter for his helpful advice on the interpretation of the results and for providing useful comments on the manuscript.
The authors declare that they have no competing interests.
Availability of data and materials
Consent for publication
Ethics approval and consent to participate
The authors obtained permission from Professor Xulong Lai of the State Key Laboratory of Biogeology and Environmental Geology (BGEG) at the China University of Geosciences, Wuhan, to use the palaeontological sample for research purposes.
This research was funded by the National Natural Science Foundation of China (No. 41672017).
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1: Figure S1. DNA mass concentrations of cave hyena extracts. Comparison of DNA yield in 25 mg extractions and 50 mg extractions in cave hyena samples.
Additional file 2: Table S1. Results of qPCR experiments. Estimated Number of qPCR cycles to 80 Relative Fluorescence Units for three replicates of each processed library.
Additional file 3: Table S2. Terminal commands. One-line UNIX terminal commands used for random subsampling and recording read lengths and GC content.
Additional file 4: Table S3. Raw read data. Raw read lengths and GC contents for all libraries.
About this article
- Ancient DNA (aDNA)
- Endogenous content
- Next generation sequencing (NGS)
- Giant panda
- Ailuropoda melanoleuca