Draft genome sequence of two “Candidatus Intestinicoccus colisanans” strains isolated from faeces of healthy humans



In order to provide a better insight into the functional capacity of the human gut microbiome, we isolated a novel bacterium, “Candidatus Intestinicoccus colisanans” gen. nov. sp. nov., and performed whole genome sequencing. This study will provide new insights into the functional potential of this bacterium and its role in modulating host health and well-being. We expect that this data resource will be useful in providing additional insight into the diversity and functional potential of the human microbiome.

Data description

Here, we report the first draft genome sequences of “Candidatus Intestinicoccus colisanans” strains MH27-1 and MH27-2, recovered from faeces collected from healthy human donors. The genomes were sequenced using short-read Illumina technology, and whole-genome-based comparisons and phylogenomic reconstruction indicate that “Candidatus Intestinicoccus colisanans” represents a novel genus and species within the family Acutalibacteraceae. Both genomes were estimated to be > 98% complete and range in size from 2.9 to 3.3 Mb with a G + C content of approximately 51%. The gene repertoire of “Candidatus Intestinicoccus colisanans” indicates it is likely a saccharolytic gut bacterium.

The healthy human gut is colonized by a diverse microbial community that provides a suite of functionalities relevant to host health. It is estimated that over 70% of the human microbiome remains uncultured, and this remains a key challenge to better understanding the ecological and functional role of individual microbial species [1]. To address this challenge, we applied a genome-directed isolation approach [2] to isolate a novel uncultured bacterium that is both numerically abundant and prevalent in the healthy human gut. To isolate “Candidatus Intestinicoccus colisanans” MH27-1, a dilution-to-extinction enrichment culture series was generated from a faecal sample collected from a healthy human donor and incubated at 37 °C. Following metagenomic sequencing, a low diversity enrichment culture dominated by “Candidatus Intestinicoccus colisanans” MH27-1 was identified. “Candidatus Intestinicoccus colisanans” MH27-1 was isolated on YCFA medium supplemented with 5% v/v of an aqueous faecal extract and 1.5% w/v agar following incubation at 37 °C. As “Candidatus Intestinicoccus colisanans” was uncultured, we hypothesized that the aqueous faecal extract was necessary for growth. However, “Candidatus Intestinicoccus colisanans” MH27-1 grew after 72 h of culture in PYG broth medium at 37 °C, thereby revealing that the aqueous faecal extract was dispensable for growth. To isolate “Candidatus Intestinicoccus colisanans” MH27-2, a dilution-to-extinction enrichment culture series was generated from a faecal sample collected from an independent healthy human donor and incubated at 37 °C. A low diversity enrichment culture dominated by “Candidatus Intestinicoccus colisanans” MH27-2 was identified following metagenomic sequencing. Following purification on PYG medium supplemented with 1.5% w/v agar at 37 °C, an axenic isolate was produced that grew after 72 h of culture in PYG broth medium at 37 °C.
Both isolates formed raised creamy white/milky colonies with an entire margin on agar and were typically observed as Gram-variable coccoid/ovoid cells that were often present as pairs or short chains (see Supplementary Information Figures S1 and S2).

Data description

We performed whole-genome sequencing to assess the functional potential of “Candidatus Intestinicoccus colisanans” and better understand its interactions with the host. Both strains were grown in PYG-based medium and DNA was extracted using the Nextera DNA Flex Microbial Extraction protocol [3]. DNA libraries were prepared using the Illumina DNA Prep Library Preparation Kit as per the manufacturer’s instructions, with unique dual indexes (IDT for Illumina DNA/RNA UD Index set A-D 20027213-6) and a PhiX spike-in at 2%, and sequenced on the NovaSeq 6000 in 2 × 150 bp format. The libraries produced 5,719,975 and 3,565,993 150 bp paired-end reads for “Candidatus Intestinicoccus colisanans” MH27-1 and MH27-2, respectively. For QC, assembly and functional annotation, default software parameters were used except where otherwise noted. Illumina reads were trimmed and quality controlled using Trimmomatic v0.36 (ILLUMINACLIP:adapters_NexteraPE-PE_TruSeq3-PE.fa:2:30:10, LEADING:3, TRAILING:3, SLIDINGWINDOW:4:15, CROP:150, HEADCROP:0, MINLEN:100) [4], and PhiX reads were removed using bbduk from bbmap v38.68 [5]. Reads were merged using bbmerge from bbmap and assembled using SPAdes v3.13.0 [6]. Assembly of the quality-controlled, merged reads produced 31 contigs for “Candidatus Intestinicoccus colisanans” MH27-1 (N50 = 308,050) and 25 contigs for MH27-2 (N50 = 454,909).
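For readers unfamiliar with the N50 values quoted for the two assemblies, the following is a minimal Python sketch of how the statistic is commonly computed: contigs are sorted from longest to shortest, and N50 is the length of the contig at which the running total first reaches half of the total assembly length. The function name n50 and the toy contig lengths are illustrative assumptions; they are not the MH27-1 or MH27-2 contigs, and this is not necessarily how the authors' tools derived the reported figures.

```python
def n50(contig_lengths):
    """Return the N50 (in bp) of a collection of contig lengths."""
    if not contig_lengths:
        raise ValueError("at least one contig length is required")
    # Longest contigs first, so the running total grows as fast as possible.
    ordered = sorted(contig_lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first contig that takes the running total to at least half
        # of the assembly length defines the N50.
        if running >= half_total:
            return length


# Hypothetical toy assembly of five contigs (lengths in bp).
toy_contigs = [400_000, 300_000, 150_000, 100_000, 50_000]
print(n50(toy_contigs))  # prints 300000
```

A higher N50 for a similar genome size generally indicates a less fragmented assembly, which is consistent with MH27-2 having fewer contigs than MH27-1.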

Table 1 Overview of data files/data sets

The genome size of “Candidatus Intestinicoccus colisanans” MH27-1 was 3,304,406 bp (GC = 51.2%) and that of “Candidatus Intestinicoccus colisanans” MH27-2 was 2,969,717 bp (GC = 50.9%). Both genomes were estimated to be 98.66% complete and 0% contaminated by CheckM v1.0.18 [7]. NCBI designated the new isolates as Oscillospiraceae sp.; however, standardised genome-based taxonomy using the Genome Taxonomy Database r89 [8] assigned both isolates to the uncultured bacterial species UBA1417 sp003531055 (GTDB taxonomy: d__Bacteria; p__Bacillota_A; c__Clostridia; o__Oscillospirales; f__Acutalibacteraceae; g__UBA1417; s__UBA1417 sp003531055). Analysis with Prokka v1.14.6 [9] revealed that “Candidatus Intestinicoccus colisanans” MH27-1 and MH27-2 encoded 3151 and 2789 protein coding genes, respectively. plasmidSPAdes (v3.15.3) analyses, followed by manual curation, revealed a small plasmid in both MH27-1 (5.3 kb; 56.8% GC; 6 proteins) and MH27-2 (6.1 kb; 50.1% GC; 8 proteins). Analysis with PHASTER [10] identified a 31 kb putative prophage (GC 52.2%) containing 44 proteins in “Candidatus Intestinicoccus colisanans” MH27-1.
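The GTDB taxonomy string quoted above follows a rank-prefixed, semicolon-separated layout (d__ for domain through s__ for species). As an illustration only, the short Python sketch below splits such a string into named ranks; the helper name parse_gtdb_lineage and the RANK_PREFIXES mapping are assumptions introduced here and are not part of any GTDB tool.

```python
# Mapping from the one-letter rank prefixes used in the lineage string
# to readable rank names (an illustrative helper, not a GTDB API).
RANK_PREFIXES = {
    "d": "domain",
    "p": "phylum",
    "c": "class",
    "o": "order",
    "f": "family",
    "g": "genus",
    "s": "species",
}


def parse_gtdb_lineage(lineage):
    """Split a 'd__...; p__...; ...' lineage string into a rank -> name dict."""
    ranks = {}
    for field in lineage.split(";"):
        field = field.strip()
        if not field:
            continue
        prefix, sep, name = field.partition("__")
        if not sep or prefix not in RANK_PREFIXES:
            raise ValueError(f"unrecognised rank field: {field!r}")
        ranks[RANK_PREFIXES[prefix]] = name
    return ranks


lineage = (
    "d__Bacteria; p__Bacillota_A; c__Clostridia; o__Oscillospirales; "
    "f__Acutalibacteraceae; g__UBA1417; s__UBA1417 sp003531055"
)
ranks = parse_gtdb_lineage(lineage)
print(ranks["family"], "/", ranks["genus"])  # prints Acutalibacteraceae / UBA1417
```

Keeping the ranks in a dictionary makes it straightforward to compare the family-level GTDB placement with the NCBI designation mentioned in the text.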

GapMind [11] analysis revealed that both strains encode complete pathways for the biosynthesis of 7 amino acids (arg, asp, cys, gly, glu, lys and val). Analysis with dbCAN2 revealed that MH27-1 and MH27-2 encode 44 and 45 carbohydrate active enzymes, respectively, including four copies of GH5 (cellulase) and GH29 (fucosidase) enzymes each. Analysis with antiSMASH v5.1.2 [12] revealed that “Candidatus Intestinicoccus colisanans” MH27-1 and MH27-2 encode one and two cryptic RiPP biosynthetic gene clusters, respectively.

In summary, there is a renewed interest in applying improved culture-based approaches to isolate novel gut microbes (reviewed by [18]). The isolation of “Candidatus Intestinicoccus colisanans” will enable a more thorough evaluation of its role in health and disease, and a mechanistic dissection of its functional capacities.


The genome sequences of “Candidatus Intestinicoccus colisanans” MH27-1 and MH27-2 were produced from short read data and remain incomplete. The closure of these genomes coupled with the isolation and sequencing of additional strains will provide a greater insight into the gene repertoire and functional capacity of this taxon.

Data availability

The data described in this Data note can be freely and openly accessed on GenBank under the accession numbers GCA_021029585.1 and GCA_021029595.1 (BioSample numbers SAMN23040991 and SAMN23040992). Please see Table 1 for details and links to the data. The isolates were deposited as Intestinicoccus colisanans MH27-1 and MH27-2 with the National Measurement Institute (Australia) culture collection under accession numbers V21/015887 and V21/015888, respectively.



GC: Guanine–Cytosine content

GH: Glycoside hydrolase

RiPP: Ribosomally synthesized and post-translationally modified peptide


  1. Almeida A, Nayfach S, Boland M, Strozzi F, Beracochea M, Shi ZJ, Pollard KS, Sakharova E, Parks DH, Hugenholtz P, et al. A unified catalog of 204,938 reference genomes from the human gut microbiome. Nat Biotechnol. 2021;39(1):105–14.


  2. Tyson GW, Lo I, Baker BJ, Allen EE, Hugenholtz P, Banfield JF. Genome-directed isolation of the key nitrogen fixer Leptospirillum ferrodiazotrophum sp. nov. from an acidophilic microbial community. Appl Environ Microbiol. 2005;71(10):6319–24.


  3. Illumina. Nextera DNA Flex Microbial Colony Extraction Protocol.

  4. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.

  5. Bushnell B. BBMap: A Fast, Accurate, Splice-Aware Aligner. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); 2014.

  6. Prjibelski A, Antipov D, Meleshko D, Lapidus A, Korobeynikov A. Using SPAdes De Novo Assembler. Curr Protoc Bioinformatics. 2020;70(1):e102.

  7. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25(7):1043–55.

  8. Parks DH, Chuvochina M, Chaumeil PA, Rinke C, Mussig AJ, Hugenholtz P. A complete domain-to-species taxonomy for Bacteria and Archaea. Nat Biotechnol. 2020;38(9):1079–86.


  9. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9.

  10. Arndt D, Grant JR, Marcu A, Sajed T, Pon A, Liang Y, Wishart DS. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016;44(W1):W16–W21.


  11. Price MN, Deutschbauer AM, Arkin AP. GapMind: automated annotation of amino acid biosynthesis. mSystems. 2020;5(3).

  12. Blin K, Shaw S, Steinke K, Villebro R, Ziemert N, Lee SY, Medema MH, Weber T. antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res. 2019;47(W1):W81–W87.

  13. Zhou J, Nyeverecz ABJ, Angel B, Wood N, Hugenholtz DLA, Tyson P, K GWL. Ó Cuív P: NCBI BioProject information for MH27-1 and MH27-2. In:; 2022.

  14. Zhou J, Nyeverecz ABJ, Angel B, Wood N, Hugenholtz DLA, Tyson P, K GWL. Ó Cuív P: Illumina raw sequences for MH27-1. In:; 2022.

  15. Zhou J, Nyeverecz ABJ, Angel B, Wood N, Hugenholtz DLA, Tyson P, K GWL. Ó Cuív P: Genome sequence of MH27-1. In:; 2022.

  16. Zhou J, Nyeverecz ABJ, Angel B, Wood N, Hugenholtz DLA, Tyson P, K GWL. Ó Cuív P: Illumina raw sequences for MH27-2. In:; 2022.

  17. Zhou J, Nyeverecz ABJ, Angel B, Wood N, Hugenholtz DLA, Tyson P, K GWL. Ó Cuív P: genome sequence of MH27-2. In:; 2022.

  18. Ó Cuív P. A question of culture: bringing the gut microbiome to life in the -omics era. In: Improving Rumen Function Edited by McSweeney CS, Mackie RI, 1st edn. Cambridge, U.K.: Burleigh Dodds Science Publishing Limited; 2020: 29–54.


We acknowledge Maria Chuvochina for assistance with analysis of the GTDB taxonomy.


This research was funded by Microba Life Sciences.

Author information

Authors and Affiliations



JZ, BN and PÓC produced the enrichments and isolates; CV performed the microscopy; NA and DW performed the sequencing; JB performed the genomic and phylogenetic analyses; JZ, JB, BN, CV, PH, GT, LK and PÓC analysed the data; PÓC and JB wrote the manuscript with JZ, BN, CV, NA, DW, PH, GT and LK.

Corresponding authors

Correspondence to Lutz Krause or Páraic Ó Cuív.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Bellberry ethics committee (Adelaide, Australia) under number HREC2018-05-400. All participants provided informed consent for their de-identified samples to be used for research purposes.

Experimental methods

All experiments were performed in accordance with relevant Queensland and Australian governmental guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

All authors are current or former employees of Microba Life Sciences and have stock and/or equity interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Zhou, J., Boyd, J.A., Nyeverecz, B. et al. Draft genome sequence of two “Candidatus Intestinicoccus colisanans” strains isolated from faeces of healthy humans. BMC Res Notes 16, 174 (2023).
