The genome of the Lactobacillus sanfranciscensis temperate phage EV3
© Ehrmann et al.; licensee BioMed Central Ltd. 2013
Received: 2 September 2013
Accepted: 29 November 2013
Published: 5 December 2013
Bacteriophage infection modulates microbial consortia, and transduction is one of the most important mechanisms involved in bacterial evolution. However, phage contamination can bring food fermentations to a halt, causing economic setbacks. The number of available phage genome sequences from lactic acid bacteria, especially lactobacilli, is still limited. We analysed the genome of a temperate phage active on Lactobacillus sanfranciscensis, the predominant species in type I sourdough fermentations.
Sequencing of EV3 phage DNA revealed a genome of 34,834 bp with a G + C content of 36.45%. Of the 43 open reading frames (ORFs) identified, all but eight shared homology with sequences from other phages of lactobacilli. A similar genomic organization and mosaic pattern of identities align EV3 with the closely related Lactobacillus vaginalis ATCC 49540 prophage. Four ORFs had neither homologues in the databases nor predicted functions. Notably, EV3 encodes a putative dextranase.
EV3 is the first L. sanfranciscensis phage to be completely sequenced.
In many large-scale food fermentations carried out with lactobacilli, bacteriophage contamination is a serious threat. Phage infections are detrimental in industrial dairy or acetic acid fermentations [1–3], where the liquid state of the medium allows the rapid dissemination of viral particles. Although the spread of phage within a sourdough is hindered, probably as a consequence of the semifluid physical state of the matrix, phages of lactobacilli have already been isolated from sourdough samples [4, 5], and it has been shown that viral infection can be transmitted from one dough to another. Interestingly, phage spread in sourdough neither adversely affected acidification and volume increase of the dough nor reduced lactobacilli cell counts.
In a previous work, phage EV3 was isolated and phenotypically characterized, and was shown to be active on five different strains of L. sanfranciscensis. The viral particle was assigned to the Siphoviridae family, morphotype B1. Its lytic cycle at 25°C lasted 3 h, with a burst size of about 30 viral particles per infected cell. The genome size estimated by digestion with different restriction enzymes was 31.8 ± 1.5 kbp, and the genome was a linear double-stranded DNA molecule with a pac-type packaging system. Phage EV3 behaves as a temperate phage that can either multiply via the lytic cycle or enter a dormant state by integrating into the host chromosome as a prophage.
Phages may be the most abundant life forms on Earth, with a global population on the order of 10^31. A significant amount of sequencing data is generated by phage genome projects and by sampling of environmental DNA. Indeed, since phages are the main vectors of gene exchange, they are considered the most important factors driving evolution in prokaryotes.
To date, validated genome sequences of 16 Lactobacillus bacteriophages (including prophages) are available from the National Center for Biotechnology Information (NCBI) reference sequence database (RefSeq). The availability of those data allows for comparison of viral genomes in order to understand the genetic relationships among different phages and the function of putative genes. Whereas knowledge on phages and genomes thereof derived from lactic acid bacteria of the dairy environment is increasing, reports on phages coming from cereal fermentations are still rare.
This is the first report of the genome analysis of an L. sanfranciscensis phage.
Results & discussion
Open reading frames and genetic features of L. sanfranciscensis EV3 phage
Putative RBS* and start codon‡
orf length (aa)
Best hit EMBL protein name
Phage terminase, small subunit, L. vaginalis ATCC 49540
Phage terminase, large subunit, L. vaginalis ATCC 49540
Phage portal protein L. vaginalis ATCC 49540
phage capsid protein L. vaginalis ATCC 49540
major tail protein Staph. pseudintermedius HKU10-03
Putative uncharacterized protein L. vaginalis ATCC 49540
Head-tail joining protein L. vaginalis ATCC 49540
Head-tail joining protein L. vaginalis ATCC 49540
Phage tail protein L. vaginalis ATCC 49540
Phage major tail protein L. vaginalis ATCC 49540
Putative uncharacterized protein L. vaginalis ATCC 49540
Putative uncharacterized protein L. vaginalis ATCC 49540
Phage minor tail protein L. vaginalis ATCC 49540
Putative uncharacterized protein L. salivarius (strain CECT 5713)
Glycosylhydrolase L. plantarum
Put. Minor structural protein Leu. kimchii IMSNU 11154
Dextranase L. fermentum phage phiPYB5
Predicted protein P. acidilactici
Holin lp0683, L. plantarum WCFS1 phage P1
putative endolysin L. vaginalis ATCC 49540
phage integrase L. salivarius ACS-116-V-Col5a
Putative uncharacterized protein L. pentosus MP-10
XRE family transcriptional regulator L. pentosus
Phage antirepressor L. ruminis ATCC 25644
Phage protein, helix-turn-helix protein, Listeria monocytogenes FSL N1-017
hydrolase NUDIX family L. delbrueckii
Putative uncharacterized protein Mahella australiensis DSM 15567
Putative uncharacterized protein lp_0862 L. pentosus IG1
NTP-binding protein L. paracasei subsp. paracasei ATCC 25302
Putative helicase Lactobacillus phage A2
Single stranded binding protein L. hilgardii ATCC 8290
Phage primase, P4 family L. buchneri NRRL B-30929
VRR-NUC domain Phage protein L. plantarum JDM1
Ribonucleoside-diphosphate reductase 2, Ent. faecalis
Phage transcriptional regulator Lactobacillus phage phig1e
Putative transporter protein L. reuteri ATCC 53608
EV3 DNA packaging
The predicted protein products of ORF EV3_001 and EV3_002 were similar to the putative small and large terminase subunits of the L. vaginalis ATCC 49540 phage. In tailed phages, the terminase consists of a large subunit, which carries the ATPase activity that drives DNA translocation together with an endonuclease activity that cuts concatemeric DNA into genome-length units, and a small subunit responsible for specific DNA binding. These two EV3 proteins are therefore probably involved in DNA packaging. Previous work had already shown that EV3 has no cos site, so it is likely to package its DNA through a pac system. The protein encoded by ORF EV3_035 had high similarity to the putative DNA binding protein of L. hilgardii ATCC 8290. Its gene lies quite close to the terminase genes, suggesting that the product of ORF EV3_035 could also be involved in DNA packaging.
ORF EV3_003 and EV3_004 constituted the putative head module, since they were similar to the portal protein and the capsid protein of the L. vaginalis ATCC 49540 phage, respectively. The portal complex forms a channel through which the viral DNA is packaged into the capsid and through which it exits during infection. The portal protein is thought to rotate during DNA packaging. It also forms the junction between the phage head (capsid) and the tail proteins. The putative gene products of ORF EV3_007 and EV3_008 are likely to connect the head and tail structures. The overlap of the two genes suggests translational coupling. The putative tail module was positioned downstream of the predicted head-tail joining genes and was composed of ORF EV3_009 and EV3_010. The product encoded by EV3_013 was similar to various tail component and tape measure proteins (TMP) from phages of L. vaginalis and L. fermentum. A TMP generally works as a template that determines length during tail assembly; it is therefore reasonable to ascribe this function to the protein.
The predicted protein product of ORF EV3_021 had 44% overall identity with the holin of L. plantarum WCFS1 phage P1. Holins are a diverse family of proteins that cause bacterial membrane lysis during late-protein synthesis. ORF EV3_022 encodes a putative endolysin that is quite similar (54% identity) to the endolysin of L. vaginalis ATCC 49540. The C terminus of the predicted protein contains two lysin motif (LysM) domains that are likely to be implicated in bacterial cell wall degradation, while the N terminus contains a Cpl-1 lysin domain (also known as Cpl-9 lysozyme/muramidase), a bacterial cell wall endolysin. A signal peptide with a predicted cleavage site (probability of 0.750) between positions 26 and 27 of the amino acid sequence was identified. An analogous signal peptide has already been reported for other phages of lactic acid bacteria and was demonstrated to be active [10, 11].
Integrase module and attachment site
ORF EV3_023 has an amino acid sequence comparable to the phage integrase of the L. salivarius ACS-116-V-Col5a phage. To identify the attP site, the 429 bp non-coding sequence between the lys (ORF EV3_022) and int (ORF EV3_023) genes of phage EV3 was searched by BLAST against the whole genome sequence of L. sanfranciscensis TMW 1.1304, the only strain whose genome sequence is available. We found only one significant hit, of 16 nucleotides, matching the 3′ end of a tRNA-Leu gene. Since host attachment sites are commonly located near tRNA genes, we assumed this sequence to be the putative attB site in L. sanfranciscensis H2A.
Most probably the sequence 5′ GCCGAGAGCGGG 3′, found in the L. sanfranciscensis genome, is the region recognised by the bacteriophage (attB site), since a homologous region was also found in the EV3 genome (attP site). The attB region, located between the lysin and integrase genes in lactobacilli containing the phage, corresponds to a gene encoding a tRNA, confirming that some phages integrate their genome directly into tRNA genes.
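The search for a core sequence shared by the phage intergenic region and the host genome can be illustrated with a minimal sketch. It is not the BLAST search used in this study; it simply looks for the longest exact run shared by two sequences on either strand. The function names and the two short toy sequences are hypothetical; only the 12-nt core (GCCGAGAGCGGG) is taken from the text, and the flanking bases are invented for illustration.

```python
from typing import Tuple

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def longest_common_substring(a: str, b: str) -> Tuple[str, int, int]:
    """Return (match, start_in_a, start_in_b) for the longest exact shared run."""
    a, b = a.upper(), b.upper()
    best_len, best_a, best_b = 0, 0, 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the run that ended at the previous base of both sequences.
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len = curr[j]
                    best_a, best_b = i - best_len, j - best_len
        prev = curr
    return a[best_a:best_a + best_len], best_a, best_b


def find_att_core(phage_region: str, host_region: str) -> Tuple[str, int, int, str]:
    """Search both strands of the phage region; return (core, phage_pos, host_pos, strand).

    For strand "-", phage_pos refers to the reverse-complemented phage region.
    """
    fwd = longest_common_substring(phage_region, host_region)
    rev = longest_common_substring(reverse_complement(phage_region), host_region)
    if len(rev[0]) > len(fwd[0]):
        return rev[0], rev[1], rev[2], "-"
    return fwd[0], fwd[1], fwd[2], "+"


# Toy sequences (hypothetical flanks around the core reported in the text).
CORE = "GCCGAGAGCGGG"
phage_toy = "ATATTTAAGT" + CORE + "TTAACCAGAT"
host_toy = "CCTTGGACTC" + CORE + "AAGGTCTTCA"

if __name__ == "__main__":
    print(find_att_core(phage_toy, host_toy))
```

An exact-match search of this kind only works for short regions; for a 429 bp query against a whole bacterial genome, a heuristic aligner such as BLAST, as used here, is the practical choice because it also tolerates mismatches.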
The protein encoded by ORF EV3_025 is similar to the XRE family transcriptional regulator of L. pentosus MP-10. This large family of DNA binding helix-turn-helix proteins includes Cro and CI. The product encoded by EV3_026 shows identity with the phage antirepressor of the L. vaginalis ATCC 49540 phage. This protein is thought to promote transcription of genes required for phage production.
Phage replication module
ORF EV3_027 had a DNA binding domain in the N-terminal region with identity to an excisionase (Xis) protein and a helix-turn-helix (HTH) DNA binding domain. The predicted proteins from EV3_028 to EV3_032 have unknown functions or are uncharacterized. ORF EV3_033 and EV3_034 encode a phage DNA binding protein and a helicase, respectively.
The dextranase gene is active, since it was shown experimentally that clones of strain H2A hosting phage EV3 become dextranase positive. To our knowledge, this is the second time that a gene encoding this enzyme has been found in the genome of a Lactobacillus phage. Given the position of the dextranase gene in the viral genome, we speculate that this enzymatic activity helps the viral particle to break through dextran-producing strains after cell lysis occurs.
Temperate phages are known to carry virulence genes that contribute to the “success” of pathogenic bacteria, and a substantial literature explains this phenomenon by evolutionary arguments. It could well be that temperate phages play this role only for pathogenic bacteria. However, theoretical reasoning suggests that prophages from non-pathogenic bacteria should encode more general fitness genes that are of selective benefit to the lysogen and/or the host, although up to now there is no direct evidence of prophage-encoded fitness factors in bacterial commensals or food microbes. The present work may provide one of the best hints so far of such a fitness factor in lactic acid bacteria (LAB), which comprise many industrially important food bacteria. The demonstration of such a fitness factor could thus have an important impact on theoretical reasoning about the role of phages in the evolution of bacteria in general.
The phylogenetic position of EV3 was evaluated using the gene for the large terminase subunit as well as the portal protein gene. These genes have previously been established as valuable markers for phage phylogeny [17, 18]. Both marker genes placed EV3 in a monophyletic group together with phages identified in the genomes of L. vaginalis, L. fermentum, L. jensenii, L. rhamnosus and L. casei. These species are neither the closest relatives of L. sanfranciscensis nor typically isolated from the sourdough ecosystem. This result may reflect the current lack of genome sequences of lactobacilli adapted to sourdough fermentations.
To our knowledge, this study represents the first complete genome sequence and genetic characterization of an L. sanfranciscensis phage. Bioinformatic analysis revealed that phage EV3 is a unique temperate phage compared to phages infecting related species of LAB. The endolysin gene was preceded by a holin gene. The tail morphogenesis module is interspersed with cell lysis genes. The overall amino acid sequences of the phage proteins had little similarity to other sequenced phages. The phage carries a dextranase gene whose function in establishing a stable relationship with its host (lysogen) and in influencing the host's lifestyle and fitness in sourdough fermentations remains to be elucidated. The results of this study may provide new insights that deepen our understanding of phage genetics and phage-host interactions in dynamic ecosystems such as cereal fermentations.
Availability of supporting data
The phage EV3 genome sequence is deposited at EMBL under accession number PRJEB61 (http://www.ebi.ac.uk/ena/data/view/display=html&PRJEB61).
Isolation of phage DNA
L. sanfranciscensis strain H2A was used as the host culture for viral multiplication. Phage DNA was isolated from a high-titer phage lysate purified on a cesium chloride gradient according to Sambrook et al.
For full sequencing, purified phage DNA was fragmented by ultrasonication and ligated into the plasmids pBluescriptKSII and pSmart. Escherichia coli DH5α cells were transformed, and colonies were selected by blue/white screening.
Sanger sequencing was performed on 3 × 96 shotgun clones, resulting in sixfold genome coverage.
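The relation between clone number, read length and coverage can be made explicit with a short calculation. The sketch below uses the genome size, clone number and sixfold coverage reported here. The number of reads per clone and the read length are not stated, so the "one read per clone" case is an assumption made purely for illustration. The uncovered-fraction estimate follows the idealised Lander-Waterman model (uniformly random reads), which real shotgun libraries only approximate.

```python
import math

GENOME_LENGTH_BP = 34_834   # EV3 genome size reported in the text
N_CLONES = 3 * 96           # shotgun clones reported in the text
REPORTED_COVERAGE = 6.0     # "sixfold" coverage reported in the text


def expected_coverage(n_reads: int, mean_read_length_bp: float, genome_length_bp: int) -> float:
    """Average coverage = total sequenced bases / genome length."""
    return n_reads * mean_read_length_bp / genome_length_bp


def implied_mean_read_length(coverage: float, n_reads: int, genome_length_bp: int) -> float:
    """Mean read length needed to reach a given coverage with n_reads reads."""
    return coverage * genome_length_bp / n_reads


def expected_uncovered_fraction(coverage: float) -> float:
    """Idealised Lander-Waterman estimate: P(base not covered) = exp(-coverage)."""
    return math.exp(-coverage)


if __name__ == "__main__":
    # Assumption: one read per clone (the text does not say how many reads per clone).
    read_len = implied_mean_read_length(REPORTED_COVERAGE, N_CLONES, GENOME_LENGTH_BP)
    print(f"Implied mean read length (1 read/clone): {read_len:.0f} bp")
    # Assumption: two reads per clone (both insert ends sequenced).
    read_len_2 = implied_mean_read_length(REPORTED_COVERAGE, 2 * N_CLONES, GENOME_LENGTH_BP)
    print(f"Implied mean read length (2 reads/clone): {read_len_2:.0f} bp")
    gap_bp = expected_uncovered_fraction(REPORTED_COVERAGE) * GENOME_LENGTH_BP
    print(f"Expected uncovered bases at 6x (idealised): {gap_bp:.0f} bp")
```

Even under the idealised model, a small fraction of bases is expected to remain uncovered at sixfold coverage, which is consistent with the need for a gap-closing step.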
Remaining gaps were closed by a two-step gene walking technique based on randomly primed polymerase chain reaction (PCR), as previously described by Pilhofer et al. The technique has a simple workflow comprising only two major steps: a walking PCR with a single specific outward-pointing primer (step 1) and direct sequencing of its product using a nested specific primer (step 2). Amplifications were performed with Kapa2G-Robust Polymerase (Kapa Biosystems, Inc.). Open reading frames (ORFs) were predicted with GeneMark.hmm for Prokaryotes, version 2.4. All ORF predictions were verified and modified by BLAST searches of the ORFs against the NCBI nr database. Additionally, the predicted start codons of all ORFs were inspected manually using the Artemis program. This genome project has been deposited in the European Molecular Biology Laboratory (EMBL)/GenBank database under accession number PRJEB61. The presence of signal peptides was analysed with SignalP (http://www.cbs.dtu.dk/services/SignalP/).
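To make the notion of an ORF call and its start codon concrete, a naive six-frame scan is sketched below. It is not the GeneMark.hmm algorithm used in this study, which relies on hidden Markov models of coding potential; it only reports stretches from the first start codon to the next in-frame stop codon. The start codon set (ATG, GTG, TTG), the minimum length and the toy sequence are assumptions chosen for illustration.

```python
from typing import List, NamedTuple

STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial start codons
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


class Orf(NamedTuple):
    strand: str      # "+" or "-"
    start: int       # 0-based start on the scanned strand
    end: int         # exclusive end on the scanned strand, stop codon included
    length_aa: int   # number of codons before the stop codon


def reverse_complement(seq: str) -> str:
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def scan_strand(seq: str, strand: str, min_aa: int) -> List[Orf]:
    """Scan the three frames of one strand for start-to-stop ORFs."""
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon in START_CODONS:
                start = i  # keep the first (most upstream) start in this ORF
            elif start is not None and codon in STOP_CODONS:
                length_aa = (i - start) // 3
                if length_aa >= min_aa:
                    orfs.append(Orf(strand, start, i + 3, length_aa))
                start = None
    return orfs


def find_orfs(seq: str, min_aa: int = 30) -> List[Orf]:
    """Return ORFs on both strands; '-' coordinates refer to the reverse complement."""
    seq = seq.upper()
    return scan_strand(seq, "+", min_aa) + scan_strand(reverse_complement(seq), "-", min_aa)


# Toy sequence (hypothetical): one 6-codon ORF followed by a stop codon.
toy = "CC" + "ATG" + "GCT" * 5 + "TAA" + "G"

if __name__ == "__main__":
    for orf in find_orfs(toy, min_aa=5):
        print(orf)
```

A scan of this kind always picks the most upstream start codon, which is often not the true translation start; this is one reason why predicted starts are worth checking manually against ribosome binding sites and database hits, as done here with Artemis and BLAST.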
Determination of the attachment site on the host genome
To identify the attP site, we searched the 429 bp sequence between the lys and int genes of phage EV3 by BLAST against the whole genome sequence of L. sanfranciscensis TMW 1.1304.
Using primers P_08960 (5′-ATGGAAAAATCGATGTATG) and P_leu (5′-GCCGAGATGGCGGAATTG), placed in the bacterial genes flanking the prophage (ORF LSA_08960 and Leu-tRNA), and primers P4 (5′-CGTCGATATTTATATCATTAG) and P1 (5′-GATACCTTAACCAGATTAAG), running out of the int and lys genes, we amplified DNA fragments 658 bp and 370 bp long, respectively (Figure 2).
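Basic properties of the four primers can be checked with a few lines of code. The sketch below computes length, G + C content and a rough melting temperature using the Wallace rule, Tm = 2(A + T) + 4(G + C) °C. The primer sequences are those given above; the function names are hypothetical, and the Wallace rule is only a coarse approximation for short oligonucleotides, not the annealing temperature used in this study, which is not stated.

```python
PRIMERS = {
    "P_08960": "ATGGAAAAATCGATGTATG",
    "P_leu": "GCCGAGATGGCGGAATTG",
    "P4": "CGTCGATATTTATATCATTAG",
    "P1": "GATACCTTAACCAGATTAAG",
}


def _validated(seq: str) -> str:
    """Upper-case the sequence and reject anything other than A, C, G, T."""
    seq = seq.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError(f"not a plain DNA sequence: {seq!r}")
    return seq


def gc_count(seq: str) -> int:
    """Number of G and C bases."""
    return sum(1 for base in _validated(seq) if base in "GC")


def gc_percent(seq: str) -> float:
    """G + C content in percent."""
    return 100.0 * gc_count(seq) / len(_validated(seq))


def wallace_tm(seq: str) -> int:
    """Wallace rule: Tm = 2 * (A + T) + 4 * (G + C), in degrees Celsius."""
    gc = gc_count(seq)
    at = len(_validated(seq)) - gc
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {len(seq)} nt, GC {gc_percent(seq):.1f}%, Tm ~{wallace_tm(seq)} °C")
```

More accurate melting temperatures require nearest-neighbour thermodynamics and the salt and primer concentrations of the actual reaction.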
We thank Monika Hadek for her technical assistance in laboratory work.
- Brussow H, Desiere F: Phages of dairy bacteria. Annu Rev Microbiol. 2001, 55: 283-303. 10.1146/annurev.micro.55.1.283.
- Emond E, Moineau S: Bacteriophage Genetics and Molecular Biology. 2007, Norfolk, UK: Caister Academic Press
- Garneau JE, Moineau S: Bacteriophages of lactic acid bacteria and their impact on milk fermentations. Microb Cell Fact. 2011, 10: 1-20. 10.1186/1475-2859-10-1.
- Foschino R, Perrone F, Galli A: Characterization of two virulent Lactobacillus fermentum bacteriophages isolated from sour dough. J Appl Bact. 1995, 79: 677-683. 10.1111/j.1365-2672.1995.tb00954.x.
- Foschino R, Venturelli E, Picozzi C: Isolation and characterization of a virulent Lactobacillus sanfranciscensis bacteriophage and its impact on microbial population in sourdough. Curr Microbiol. 2005, 51: 413-418. 10.1007/s00284-005-0122-y.
- Foschino R, Galli A, Pagani A, Ottogalli G: Isolation and characterization of bacteriophages active on heterofermentative lactobacilli in sourdoughs. Microbiol Alim Nutr. 1996, 14: 15-22.
- Hendrix RH: Bacteriophages: evolution of the majority. Theor Popul Biol. 2002, 61: 471-480. 10.1006/tpbi.2002.1590.
- Weinbauer MG, Agis M, Bonilla-Findji O, Malits A, Winter C: Bacteriophages in the environment. 2007, Norfolk, UK: Caister Academic Press
- Pouwels PH, Leer RJ: Genetics of lactobacilli: plasmids and gene expression. Antonie Van Leeuwenhoek. 1993, 64: 85-107.
- Durmaz E, Miller MJ, Azcarate-Peril MA, Toon SP, Klaenhammer TR: Genome sequence and characteristics of Lrm1, a prophage from industrial Lactobacillus rhamnosus strain M1. Appl Environm Microbiol. 2008, 74: 4601-4609. 10.1128/AEM.00010-08.
- Sao-Jose C, Parreira R, Vieira G, Santos MA: The N-terminal region of the Oenococcus oeni bacteriophage fOg44 lysin behaves as a bona fide signal peptide in Escherichia coli and as a cis-inhibitory element, preventing lytic activity on oenococcal cells. J Bacteriol. 2000, 182: 5823-5831. 10.1128/JB.182.20.5823-5831.2000.
- Vogel R, Pavlovic M, Ehrmann M, Wiezer A, Liesegang H, Offschanka S, Voget S, Angelov A, Böcker G, Liebl W: Genomic analysis reveals Lactobacillus sanfranciscensis as stable element in traditional sourdoughs. Microb Cell Fact. 2011, 10 (Suppl 1): S6. 10.1186/1475-2859-10-S1-S6.
- Williams KP: Integration sites for genetic elements in prokaryotic tRNA and tmRNA genes: sublocation preference of integrase subfamilies. Nucleic Acid Res. 2002, 30: 866-875. 10.1093/nar/30.4.866.
- Picozzi C, Meißner D, Foschino R, Vogel RF: Dextranase gene transferred by a Lactobacillus sanfranciscensis phage. Book of Abstracts of VI Symposium on Sourdough 14–17 October 2009. Edited by: Vogel RF. 2009, Freising: Vogel RF, 22.
- Zhang X, Wang S, Guo T, Kong J: Genome analysis of Lactobacillus fermentum temperate bacteriophage φPYB5. Int J Food Microbiol. 2011, 144: 400-405. 10.1016/j.ijfoodmicro.2010.10.026.
- Fortier L-C, Sekulovic O: Importance of prophages to evolution and virulence of bacterial pathogens. Virulence. 2013, 4: 354-365. 10.4161/viru.24498.
- Sullivan MB, Coleman ML, Quinlivan V, Rosenkrantz JE, Defrancesco AS: Portal protein diversity and phage ecology. Environ Microbiol. 2008, 10: 2810-2823. 10.1111/j.1462-2920.2008.01702.x.
- Rao VB, Feiss M: The bacteriophage DNA packaging motor. Ann Rev Genet. 2008, 42: 647-681. 10.1146/annurev.genet.42.110807.091545.
- Sambrook J, Fritsch EF, Maniatis T: Molecular Cloning: A Laboratory Manual. 1989, Cold Spring Harbor: Cold Spring Harbor Press
- Pilhofer M, Bauer AP, Schrallhammer M, Richter L, Ludwig W, Schleifer KH, Petroni G: Characterization of bacterial operons consisting of two tubulins and a kinesin-like gene by the novel two-step gene walking method. Nucleic Acids Res. 2007, 35: e135. 10.1093/nar/gkm836.
- Borodovsky M, Mills R, Besemer J, Lomsadze A: Prokaryotic gene prediction using GeneMark and GeneMark.hmm. Curr Protoc Bioinformatics. 2003, Chapter 4 (Unit4): 5.
- Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream MA, Barrell B: Artemis: sequence visualization and annotation. Bioinformatics. 2000, 16: 944-945. 10.1093/bioinformatics/16.10.944.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.