Label | Name of data file/data set | File types (file extension) | Data repository and identifier (DOI or accession number) |
---|---|---|---|
Data file 1 | Whole genome sequence | FASTA | DDBJ (Accession numbers: BCFP01000001–BCFP01060602) (http://getentry.ddbj.nig.ac.jp/) |
Data file 2 | Gene clusters | MS Excel file (.xlsx) | Mendeley Data (https://doi.org/10.17632/yjyzx5gk7s.1) (https://data.mendeley.com/datasets/yjyzx5gk7s/1) |
Data file 3 | Clusters of Orthologous Groups of Proteins (COGs) | MS Excel file (.xlsx) | Mendeley Data (https://doi.org/10.17632/5rhfd4n37k.1) (https://data.mendeley.com/datasets/5rhfd4n37k/1) |
Data file 4 | Sequence variants | MS Excel file (.xlsx) | Mendeley Data (https://doi.org/10.17632/4y8hdw7tb7.1) (https://data.mendeley.com/datasets/4y8hdw7tb7/1) |
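
The whole genome sequence in Data file 1 is deposited as a range of accession numbers rather than a single record. The following minimal Python sketch shows one way to expand such a range into individual accession strings, for example to build a list for batch retrieval. The function name `expand_accession_range` is hypothetical, and the sketch assumes that every accession in the range shares the same letter prefix and the same zero-padded digit width, as the two endpoints in the table do. It does not contact DDBJ or assume anything about the retrieval interface.

```python
import re

# Hypothetical helper: expands "PREFIX + zero-padded number" accession ranges.
# Assumption: all accessions in the range share the prefix and digit width.
ACCESSION_RE = re.compile(r"^([A-Z]+)(\d+)$")


def expand_accession_range(first, last):
    """Return every accession from `first` to `last`, inclusive."""
    m_first = ACCESSION_RE.match(first)
    m_last = ACCESSION_RE.match(last)
    if m_first is None or m_last is None:
        raise ValueError("accessions must be uppercase letters followed by digits")

    prefix_first, digits_first = m_first.groups()
    prefix_last, digits_last = m_last.groups()
    if prefix_first != prefix_last or len(digits_first) != len(digits_last):
        raise ValueError("endpoints must share prefix and digit width")

    start, stop = int(digits_first), int(digits_last)
    if start > stop:
        raise ValueError("first accession must not come after the last")

    width = len(digits_first)
    # Re-apply zero padding so each string has the same form as the endpoints.
    return [f"{prefix_first}{n:0{width}d}" for n in range(start, stop + 1)]


if __name__ == "__main__":
    accessions = expand_accession_range("BCFP01000001", "BCFP01060602")
    print(len(accessions), accessions[0], accessions[-1])
```

Applied to the endpoints listed for Data file 1, the range covers 60,602 accession strings.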