Draft genome sequence of Bacillus pumilus strain EZ-C07 isolated from digested agricultural wastes

Objectives Bacillus species, belonging to the family Bacillaceae, are rod-shaped aerobic or facultative anaerobic Gram-positive bacteria that can be isolated from various environmental niches. Bacillus pumilus strains are resistant to unfavorable conditions such as UV, H2O2 and chemical disinfection. Furthermore, B. pumilus strains synthesize multifarious important enzymes and can be used in the production of some fermented foods, bioremediation of wastewater systems and biodegradation of environmental contaminants. Hence, investigation at the genomic level is required to understand their ecology, genetics and potential applications. Data description In this research, we provide the genomic insights into one Bacillus species (EZ-C07) isolated from digested agricultural waste materials. The draft genome of the strain EZ-C07 consists of 3,724,869 bp with 3890 coding sequences and 41.5% G + C content. Based on 16S rRNA gene sequence analysis followed by in silico DNA–DNA hybridization studies, the strain EZ-C07 was identified as Bacillus pumilus belonging to the family Bacillaceae within the phylum Firmicutes. The whole genome shotgun project of B. pumilus strain EZ-C07 can be accessed at DDBJ/ENA/GenBank under the Accession QLVI00000000.


Objective
Bacillus species, belonging to the family Bacillaceae, are rod-shaped aerobic or facultative anaerobic Gram-positive bacteria that can be isolated from various environments. One of the features of this genus is the ability to form endospores in response to various environmental and nutritional stresses [1,2]. 16S rRNA gene sequence analysis showed a high level of phylogenetic heterogeneity in the genus Bacillus. Members of this genus can produce a wide range of useful pharmaceutical, agricultural and industrial products (such as antibiotics, enzymes, amino acids, sugars) [1][2][3]. Also, several Bacillus strains with strong proteolytic activities displayed the robust survival in the protein-fed anaerobic biogas reactors and finally improved the biogas productivity [4]. The anaerobic digestion of biomass belongs to a more suitable method to utilize various agricultural waste materials [5,6].
From the array of various Bacillus species, Bacillus pumilus strains are resistant to unfavorable conditions, including UV, H 2 O 2 and chemical disinfection [7,8]. In addition, based on their abilities to synthesize multifarious enzymes and several other bioactive compounds [9,10], B. pumilus strains can be used in fermented foods production [11], biofertilizers synthesis [12], bioremediation of wastewater systems [13] and biodegradation of environmental contaminants [14]. Moreover, some B. pumilus strains were previously isolated from a biogas reactor utilizing abattoir waste [15]. Hence, more investigations at the genomic level are required to understand ecology, genetics and potential applications of B. pumilus

Data description
Bacillus pumilus strain EZ-C07 was isolated from the effluent of a biogas reactor fed with agricultural wastes, including chicken manure, and operated at high ammonia levels (> 4.0 NH 4 -N g L −1 ). The chicken manure was obtained from a poultry farm located in the Zelenodolsky district, the Republic of Tatarstan (Russian Federation). The bacterial strain B. pumilus EZ-C07 was cultured on Luria-Bertani agar at + 37 °C for 24-48 h under microaerophilic conditions. Genomic DNA extraction and preparation for further sequencing were performed as described previously [16]. Briefly, genomic DNA of the strain EZ-C07 was isolated by using a FastDNA spin kit (#116540600; MP Biomedicals, USA) and a FastPrep-24 homogenizer (#116004500; MP Biomedicals, USA). Extracted DNA quality was estimated by agarose gel electrophoresis, while concentration and purity of the received DNA were estimated with a NanoDrop 2000 spectrophotometer (#ND-2000; Thermo Fisher Scientific, USA). Extracted DNA sample of the strain was stored at − 20 °C until further processing. The identification of the strain B. pumilus EZ-C07 was based on morphological characteristics and biochemical tests and then confirmed by sequencing of its 16S rRNA gene (Accession Number MH510687) on an ABI PRISM 3130xl Genetic Analyzer (#4359571; Thermo Fisher Scientific, USA) (  [17], and the received contigs were ordered with Mauve version 2.4.0 [18] with default parameters. The genome sequence of B. pumilus EZ-C07 was annotated with the RAST automatic annotation server [19]. The rRNA and tRNA genes were determined with the RNAmmer 1.2 server [20] and tRNA scan-SE 1.23 search server [21], accordingly. Finally, based on 16S rRNA gene sequence analysis followed by in silico DNA-DNA hybridization studies (against type strain B. pumilus ATCC 7061), the strain was identified as Bacillus pumilus belonging to family Bacillaceae within the phylum Firmicutes.
The obtained genome sequence of B. pumilus EZ-C07 included 31 contigs (> 500 bp in size) with a calculated size of 3,724,869 bp in length and N50 of 183,828 bp. The G + C content for the draft genome is 41.5%. A total of 3890 coding sequences (CDS) were predicted, where 1818 CDS (47%) were annotated as seed subsystem features and 2072 CDS (53%) were annotated as outside of the seed subsystem. In total, 2675 and 1215 proteins were identified as non-hypothetical and hypothetical, respectively. It was demonstrated that genome encoded at least 3 rRNAs and 50 tRNAs. The strain B. pumilus EZ-C07 possesses numerous genes involved in monosaccharides and proteins metabolism, fermentation, as well as biphenyl and gentisate biodegradation. Many genes responsible for the bacterial strain resistance to several antibiotics and various toxic compounds, such as arsenic, cobalt, zinc and cadmium, were also identified. This resistant bacterial strain can be used in different biotechnologies.

Limitations
The obtained results are based on the draft genome assembly; therefore, the exact genome's length, number of rRNAs, repetitive elements cannot be certainly reported. Also, the presence of any plasmids cannot be clearly predicted.
Authors' contributions EEZ and WSM carried out experiments, performed interpretation of the received data and drafted the manuscript. EIS and LHS carried out Illumina HiSeq sequencing and revised the manuscript. AMZ supervised the project, performed genome analysis and professionally revised the manuscript. All authors read and approved the final manuscript.