- Data note
- Open Access
Draft genome sequence of Paenibacillus sp. EZ-K15 isolated from wastewater systems
© The Author(s) 2017
- Received: 20 October 2017
- Accepted: 5 December 2017
- Published: 12 December 2017
Paenibacillus species, belonging to the family Paenibacillaceae, are able to survive for long periods under adverse environmental conditions. Several Paenibacillus species produce antimicrobial compounds and are capable of biodegradation of various contaminants; therefore, more investigations at the genomic level are necessary to improve our understanding of their ecology, genetics, as well as potential biotechnological applications.
In the present study, we describe the draft genome sequence of Paenibacillus sp. EZ-K15 that was isolated from nitrocellulose-contaminated wastewater samples. The genome comprises 7,258,662 bp, with a G+C content of 48.6%. This whole genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession PDHM00000000. Data demonstrated here can be used by other researchers working or studying in the field of whole genome analysis and application of Paenibacillus species in biotechnological processes.
- Paenibacillus sp.
- Genome sequencing
- Draft genome assembly and annotation
Paenibacillus species, belonging to the family Paenibacillaceae, are rod-shaped Gram-positive or Gram-variable endospore forming aerobic or facultatively anaerobic bacteria, which are able to survive for long periods under adverse environmental conditions. Bacteria belonging to the genus Paenibacillus can be isolated from a wide range of environments including humans, animals, plants and the environment [1, 2]. Many species of Paenibacillus genus synthesize antimicrobial compounds that can be used as pesticides as well as in medicine, and many species produce enzymes important in bioremediation related technologies. Paenibacillus strains can be successfully applied for contaminants removal from a variety of wastewater systems. Also, several Paenibacillus strains can be involved in hydrolysis of cellulose and hemicellulose, lignin depolymerization and degradation of various textile dyes, polyvinyl alcohol, diesel fuel, bitumen, polycyclic aromatic hydrocarbons, benzene and other compounds . Hence, more studies at the genomic level are important to clarify our understanding of their ecology, genetics, as well as potential biotechnological applications. Thus, Paenibacillus sp. EZ-K15 was isolated from nitrocellulose-contaminated wastewater systems, Kazan, Republic of Tatarstan, Russia . These industrial wastes produce high levels of wastewaters polluted with multifarious dissolved chemical compounds and nitrocellulose particles. Therefore, isolation of bacteria which are able to transform various adverse pollutants and their genome analysis are of high importance for the creation of effective bioremediation strategies [4, 5].
Overview of data files/data sets
The draft genome sequence of Paenibacillus sp. EZ-K15 composed of 36 contigs ranging from 512 to 911,265 bp with a total size of 7,258,662 bp, a G+C content of 48.6% and N50 of 242,001 bp. The Rapid Annotation System Technology server predicted 6682 coding sequences where 2551 coding sequences (39%) were annotated as seed subsystem features and 4131 coding sequences (61%) were annotated as outside of the seed subsystem. In total 4560 and 2122 coding sequences were assigned as non-hypothetical and hypothetical, accordingly. The genome was shown to encode at least 3 rRNAs and 66 tRNAs. The strain Paenibacillus sp. EZ-K15 possesses a substantial number of genes responsible for denitrification and nitrate/nitrite ammonification (e.g., for nitrate released during nitrocellulose denitration) as well as for metabolism of aromatic compounds, including genes involved in benzoate, gentisate and some other compounds biodegradation. Numerous genes responsible for resistance to toxic compounds, including arsenic, mercury, cadmium, as well as chromium compounds, were additionally detected. This resistant strain may have future usefulness in bioremediation of various sites.
Current data is based on the draft level genome sequence, due to which exact length of the genome, synteny, number of rRNA and repetitive elements cannot be reported. In addition, whether the genome consists of any plasmid/s or extra-chromosomal DNA cannot be certainly predicted.
WSM and EEZ conducted experiments, performed genome analysis, interpretation of the data and drafted the manuscript. EIS and NEG carried out Illumina sequencing and revised the manuscript. AMZ supervised the project, designed the study, performed genome analysis and professionally revised the manuscript. All authors read and approved the final manuscript.
We thank Dr. Rushan Agzamov for support during the sampling.
The authors declare that they have no competing interests.
Availability of data materials
The data described in this Data note can be freely and openly accessed at DDBJ/ENA/GenBank. Please see Table 1 for details and links to the data.
Consent for publication
Ethics approval and consent to participate
The reported study was funded by the Russian Foundation for Basic Research [Grant No. 16-34-60093 mol_a_dk].
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Sáez-Nieto JA, Medina-Pascual MJ, Carrasco G, Garrido N, Fernandez-Torres MA, Villalón P, Valdezate S. Paenibacillus spp. isolated from human and environmental samples in Spain: detection of 11 new species. New Microbes New Infect. 2017;19:19–27.View ArticlePubMedPubMed CentralGoogle Scholar
- Grady EN, MacDonald J, Liu L, Richman A, Yuan ZC. Current knowledge and perspectives of Paenibacillus: a review. Microb Cell Fact. 2016;15:203.View ArticlePubMedPubMed CentralGoogle Scholar
- Ziganshina EE, Ibragimov EM, Ilinskaya ON, Ziganshin AM. Bacterial communities inhabiting toxic industrial wastewater generated during nitrocellulose production. Biologia. 2016;71:70–8.View ArticleGoogle Scholar
- Khilyas IV, Ziganshin AM, Pannier AJ, Gerlach R. Effect of ferrihydrite on 2,4,6-trinitrotoluene biotransformation by an aerobic yeast. Biodegradation. 2013;24:631–44.View ArticlePubMedGoogle Scholar
- Ziganshin AM, Ziganshina EE, Byrne J, Gerlach R, Struve E, Biktagirov T, Rodionov A, Kappler A. Fe(III) mineral reduction followed by partial dissolution and reactive oxygen species generation during 2,4,6-trinitrotoluene transformation by the aerobic yeast Yarrowia lipolytica. AMB Express. 2015;5:8.View ArticlePubMedPubMed CentralGoogle Scholar
- Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27:863–4.View ArticlePubMedPubMed CentralGoogle Scholar
- Zerbino DR. Using the Velvet de novo assembler for short-read sequencing technologies. Curr Protoc Bioinform. 2010;11(11):5.Google Scholar
- Rissman AI, Mau B, Biehl BS, Darling AE, Glasner JD, Perna NT. Reordering contigs of draft genomes using the Mauve aligner. Bioinformatics. 2009;25:2071–3.View ArticlePubMedPubMed CentralGoogle Scholar
- Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75.View ArticlePubMedPubMed CentralGoogle Scholar
- Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35:3100–8.View ArticlePubMedPubMed CentralGoogle Scholar
- Lowe TM, Eddy SR. tRNA scan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–64.View ArticlePubMedPubMed CentralGoogle Scholar
- Yoon SH, Ha SM, Kwon S, Lim J, Kim Y, Seo H, Chun J. Introducing EzBioCloud: a taxonomically united database of 16S rRNA and whole genome assemblies. Int J Syst Evol Microbiol. 2017;67:1613–7.View ArticlePubMedGoogle Scholar