Genomic Characterisation of UFJF_PfDIW6: A Novel Lytic Pseudomonas fluorescens-Phage with Potential for Biocontrol in the Dairy Industry

In this study, we have presented the genomic characterisation of UFJF_PfDIW6, a novel lytic Pseudomonas fluorescens-phage with potential for biocontrol in the dairy industry. This phage showed a short linear double-stranded DNA genome (~42 kb) with a GC content of 58.3% and more than 50% of the genes encoding proteins with unknown functions. Nevertheless, UFJF_PfDIW6’s genome was organised into five functional modules: DNA packaging, structural proteins, DNA metabolism, lysogenic, and host lysis. Comparative genome analysis revealed that the UFJF_PfDIW6’s genome is distinct from other viral genomes available at NCBI databases, displaying maximum coverages of 5% among all alignments. Curiously, this phage showed higher sequence coverages (38–49%) when aligned with uncharacterised prophages integrated into Pseudomonas genomes. Phages compared in this study share conserved locally collinear blocks comprising genes of the modules’ DNA packing and structural proteins but were primarily differentiated by the composition of the DNA metabolism and lysogeny modules. Strategies for taxonomy assignment showed that UFJF_PfDIW6 was clustered into an unclassified genus in the Podoviridae clade. Therefore, our findings indicate that this phage could represent a novel genus belonging to the Podoviridae family.


Introduction
Pseudomonas is an important group of bacteria widely distributed in several ecological niches, including plants, soil, water, animals and humans [1]. This genus includes species saprophytes, commensals, and pathogens to plants, insects, animals, and humans, with which some have the potential for biotechnological applications [2]. For example, in the dairy industry, Pseudomonas spp., mainly P. fluorescens, are common contaminants associated with the spoilage of milk and dairy products [3,4]. The growth of Pseudomonas in milk has been long associated with the production of peptidases and lipases, leading to rancid, bitter flavour, off-flavour, milk heat-stability loss, gelation of UHT milk, and reduction of yield in cheese manufacturing [5][6][7].
Bacteriophages have emerged as a promising alternative to bacterial control in the food chain [8]. Although bacteriophages have long traditionally been considered enemies for the (0.25 mg/mL RNAse A, 0.25 mg/mL DNAse I, 150 mM NaCl, 50% v/v glycerol) at 37 • C for 30 min followed by 25 • C for 1 h to remove bacterial DNA and RNA. Then, the phage particles were precipitated using precipitant suspension (20% w/v PEG 8000, 1.76 M NaCl), centrifuged (20 min, 12,000× g, 4 • C) and resuspended in SM buffer (50 mM Tris-HCl (pH 7.5), 0.1 M NaCl, 8 mM MgSO 4 × 7H 2 O). After that, phage DNA was extracted using the Wizard DNA Clean-Up kit with modifications as proposed by Summer [38]. The degradation, purity, and concentration of DNA extracted were evaluated by 1% agarose gel electrophoresis, spectrophotometry (OD260/OD280) using a NanodropTM (Thermo Fisher Scientific, Waltham, MA, USA), and fluorimetry using a Qubit 2.0 fluorometer (Life Technologies (Carlsbad, CA, USA), Invitrogen (Waltham, MA, USA), respectively.
UFJF_PfDIW6 genome library construction was performed using the NEBNext Ultra II DNA Library Prep Kit for Illumina (New England Biolabs, Ipswich, MA, USA) according to the manufacturer's instructions and sequenced using the Illumina NovaSeq 6000 platform with a read length of 2 × 150 bp by GenOne Biotech (Rio de Janeiro, RJ, Brazil).

Genome Assembly, Gene Prediction and Protein Annotation
The ab initio assembly of the phage genome was performed using the "careful" option of SPAdes version 3.14.1 [41] and all odd k-mers between 21 and 127. Then, paired reads were mapped to the assembly using the BWA-MEM algorithm of BWA version 0.7.17 [42] and insert size metrics were collected using Picard toolkit version 2.26.2 (https://github. com/broadinstitute/picard) to perform an additional step of scaffolding using SSPACE version 3.0 [43]. The final assembly was evaluated using assembly-stats version 1.0.1 (https://github.com/sanger-pathogens/assembly-stats), and the coverage of scaffolds was calculated using BBMap version 38.76 (https://sourceforge.net/projects/bbmap).
UFJF_PfDIW6 genome and predicted proteins were submitted to sequence similarity searches using BLAST [50], considering an E-value threshold ≤1e −10 for selecting significant alignments. The alignments were further evaluated by query coverage, identity and or similarity.
The genome sequence was aligned to viruses genomes (taxonomy ID 10239) available at the NCBI Nucleotide collection (nt) database using the BLASTn tool of the web BLAST server (https://blast.ncbi.nlm.nih.gov/). A search for related prophages was also performed by aligning the phage genome to bacteria genomes (taxonomy ID 2). PHASTER (PHAge Search Tool Enhanced Release) webserver (https://phaster.ca/) [51,52] was used to screen the prophages in the bacteria genomes that showed significant alignments with the UFJF_PfDIW6 genome.
The protein sequences were aligned to viral proteins available at the NCBI Virus database (26 October 2021; Taxonomy ID 10239) using the BLASTp tool of BLAST version 2.12.0. Only significant alignments with overall similarity (similarity × coverage) ≥ 40% were considered for annotation of UFJF_PfDIW6 proteins.

Comparative Genomics
The UFJF_PfDIW6's genome was compared to other genomes of phages and prophages that showed significant alignments in the similarity searches. The selected genomes were aligned using the progressive Mauve algorithm [63] of Mauve version 2.4.0 (http: //darlinglab.org/mauve/mauve.htm) to identify conserved locally collinear blocks (LCBs). The synteny and genes' modular organisation of phages and prophages genomes were evaluated using Clinker version 0.0.23 [64].

Taxonomy Assignment and Phylogenetic Analysis
UFJF_PfDIW6's taxonomy assignment was evaluated using the VIRIDIC web server (http://rhea.icbm.uni-oldenburg.de/VIRIDIC/) [65] and were compared to its genome sequence with the genomes of 130 reference species of the Podoviridae family which are recognised by the International Committee on Taxonomy of Viruses (ICTV) (https://talk. ictvonline.org/taxonomy/, release 2020). UFJF_PfDIW6's genome was also compared to the genomes that showed significant results in similarity searches. We considered a genomic similarity threshold of 95% for species assignment and 70% for the genus.
Phylogenetic analysis was performed by selecting terminase large subunit (TerL) amino acid sequences from UFJF_PfDIW6 and other TerL sequences of Pseudomonas phages available at the NCBI RefSeq database (https://www.ncbi.nlm.nih.gov/refseq/, accessed on 26 October 2021). TerL sequences were aligned by MAFFT version 7.487 [66] using a local pairwise alignment with 100,000 iterations of parameter refinements to improve the alignment. The alignment was trimmed using the "automated1" heuristic method of Trimal version 1.4.1 [67], and the best-fit amino-acid substitution model was selected according to Bayesian Information Criterion (BIC) using Modeltest-NG version 0.1.6 [68]. The Phylogenetic tree was inferred by Maximum Likelihood (ML) using RAxML-NG version 1.0.3 [69]. The Majority Rule (MR) consensus tree was calculated with 1000 bootstrap replicates and then visualised using Figtree version 1.4.4 (https://github.com/rambaut/figtree).

Results and Discussion
The UFJF_PfDIW6 phage was previously isolated from dairy industry wastewater and described as showing great potential to be applied for Pseudomonas biocontrol and prevent spoilage of raw milk stored under refrigeration [27]. The plaque morphology of UFJF_PfDIW6 on its propagating host P. fluorescens UFV 041, virion morphology, and a summary of lifecycle characteristics are shown in Figure S1.

UFJF_PfDIW6's Genome Features
The genome of the UFJF_PfDIW6 phage was assembled as a circular sequence without terminal repeats, which represents its replicative intermediate. This genome was sequenced with 35,802 fold coverage and is available at the GenBank database (https://www.ncbi. nlm.nih.gov/genbank/) under accession OM418631.
The UFJF_PfDIW6 genome comprises 42,322 bp of linear double-stranded DNA with a GC content of 58.3%. Its sequence has a bidirectional organisation with a gene density of 1.38 gene/Kb, including 58 coding DNA sequences (CDSs), and was opened to the small terminase subunit gene (TerS; g01) ( Figure 1 and Table S4). Thirty-eight CDSs encode proteins similar to other phage proteins, while nineteen CDSs encode proteins similar to bacteria's proteins that possibly represent prophages. Only one CDS encodes a protein that did not significantly align to any sequence. Thirty-one proteins had a predicted function, while twenty-seven were annotated as hypothetical proteins. Twenty-five Rhoindependent terminator sequences (Table S6), four promoter sequences recognised by host's RNA polymerase (RNAP) and one by phage's RNAP (Table S7) were also found in the UFJF_PfDIW6 genome. In addition, no evidence of RNAP was found among predicted proteins, even among profiles of the Virus Orthologous Groups database (Table S3), and no tRNA genes were found, which suggest that UFJF_PfDIW6 phage could be highly dependent on host translation machinery. Figure 1. Genomic features of Pseudomonas phage UFJF_PfDIW6. UFJF_PfDIW6's genome is shown above all others. Its sequence has a bidirectional organisation with 58 genes (represented by arrows) organised into five functional modules, represented by the coloured bars and indicated in the genes' annotation list. The numbers above the arrows indicate the genes with predicted functions, shown in the annotation list. The arrow's colours represent the gene clusters identified by Clinker, which encode similar proteins. The lines connecting the arrows represent gene-encoding proteins that share sequence identity more significant than 40%.
Despite that more than 50% of proteins had no predicted functions, UFJF_PfDIW6's genome can be organised into five functional modules: (A) DNA packaging; (B) structural proteins; (C) DNA metabolism (replication, processing and modification); (D) lysogenic; and (E) host lysis (Figure 1).
Three major genes encode small and large subunits of terminase, and portal proteins are generally identified in the DNA packaging module of phages' genomes. Terminase proteins are responsible for recognition and cleavage of viral DNA and its translocation into a preformed empty phage capsid by ATP hydrolysis [70,71]. Portal proteins also play a crucial role in virion assembly, DNA packaging, and DNA delivery in phages of the Caudovirales order. They serve as a portal for phage DNA passage during DNA packaging and ejection [72]. UFJF_PfDIW6's genome is opened by two genes that encode the terminase small subunit (TerS; g01) and large terminal subunit (TerL; g02). The third gene (g03) encodes a 117-aa protein that did not significantly align to any sequence available in the selected databases, but we speculate that it encodes a portal protein.
The structural module contains nine genes that encode virion's structural proteins such as head-to-tail-connector (g04), stabilisation protein (g06), capsid protein (g07), tail tubular proteins A (g10), tail tubular proteins B (g11 and g12), internal virion proteins (IVPs) A and B (g13 and g14) and tail spike protein (g17). This module also has two proteins, a structural protein (g08), and a structural lysozyme (g15). Three genes (g05, g09 and g16) that encode proteins with unknown functions are located among the structural genes and are possibly virion structural proteins.
Capsid proteins are essential structural components of the virion and protect the viral genome from the harsh outer environment. The tail structures form a complex macromolecular machine responsible for recognition and penetration of the host cell and channel formation for phage genome injection [73]. Two-tail tubular proteins (A and B) are frequently identified in the genome of Podoviridae phages. Tail tubular A forms the attachment for the virion's side fibres and mediates the initiation of the infection process working similar to a conformational switch. Side fibre engagement causes the release of capsid contents. Tail tubular B associates directly below the protein A in a nozzle-like structure, extending the tube through which the DNA will pass [74].
Other structural proteins vital in the viral infection process are the Internal Virion Protein (IVPs), which form a transient tube crossing the cell wall to the cellular membrane and protect the viral genome from periplasmic nucleases [75]. IVPs are sometimes associated with lysozymes [74], which have lytic transglycosylase activity and break the peptidoglycan layer of the host cell during the phage's infection process.
DNA topoisomerases are enzymes that play a crucial role in DNA metabolism, as they can solve DNA topological problems and enable processes such as DNA replication, transcription, and recombination [76]. DnaC proteins are known to recruit the helicase protein to the initiator complexes into replication origins in both phages and bacteria [77]. HNH endonucleases are large group proteins that have been identified from various organisms, including bacteriophages and bacteria. They have been related to DNA packaging and described as a cofactor associated with terminases for many phages [78,79].
Our previous study assessed eight strains of five Pseudomonas species, and the UFJF_PfDIW6 phage was host specific [27]. UFJF_PfDIW6 had a typical lytic phage behaviour during all experimental assays by showing clear lysis plaques on the lawn of P. fluorescens UFV 041 host since its purification ( Figure S1), high titres after successive rounds of propagation, and infectivity after storage under both refrigeration and freezing [27]. However, UFJF_PfDIW6 was predicted as having a temperate lifestyle (p = 0.90) by the BACPHLIP algorithm. A detailed analysis of its genome revealed a typical lysogeny module containing genes that encode two Arc family DNA-binding proteins (g19 and g20), antirepressor (g21), integrase (g22), TolA protein (g31), Cro/CI family transcriptional regulator (g41) and CII regulatory (g43) proteins (Figure 1). There is no evidence of gene encoding excisionase in UFJF_PfDIW6's genome, an essential enzyme in the excision of prophage from the bacterial chromosome to establish the phage's lytic cycle. However, some authors reported that, eventually, integrase could catalyse both integration and excision reactions, regardless of the presence of an excisionase [80,81].
During lysogeny, the phage genome is integrated into the bacterial chromosome as a prophage, which will be replicated through new host generations [82,83]. The switch between the lytic and lysogenic cycle depends on the presence and regulation of some genes of the phages' genome, such as genes that encode CII regulatory protein and Cro/CI-like repressor. CII regulatory protein activates the transcription of both CI and integrase leading to lysogeny [84]. Conversely, a Cro-like repressor is a repressor of CI, which leads the phage to the lytic cycle [85].
Many genes encoding hypothetical proteins were annotated among genes of the lysogeny module, and some critical genes for the complete functioning of this module were not found, which suggests that the UFJF_PfDIW6 genome has an incomplete module, or an environmental factor could explain its lytic behaviour. In addition to crucial genes of the lysogeny module, both host incubation temperature and multiplicity of infection (MOI) can affect the lysis-lysogeny decision in the phage infection. For example, in the Enterobacteria Phage Lambda, the lysis-lysogeny decision occurs by cII protein accumulation, which activates transcription of the cI gene and the integrase gene, culminating in lysogen formation [86]. The accumulation of the CII protein depends on no activation of a protease encoded by the host hfl gene, which is inhibited by poor host growth conditions. Similarly, high MOI increases CIII protein, a proteolysis inhibitor of cII protein. Changes in the size and appearance of lysis plaques were observed when host infections by UFJF_PfDIW6 occurred at 30 or 37 • C, following the same conditions previously described [27].
Two genes that encode phage holin (g55) and endolysin (g56) were classified as being components of the host lysis module (Figure 1). UFJF_PfDIW6 was highly specific regarding the host range in previous tests, infecting only the host bacterium among 23 bacterial strains belonging to nine genera [27]. However, this phage reduced the bacterial count of naturally contaminated raw milk, indicating the infection ability of other bacterial species.
The host range of UFJF_PfDIW6 was predicted using its genome to complement previously reported experimental results. Computational tools based on whole phage genome sequences analysis have been frequently used to predict host range [87]. HostPhinder compares k-mers between the query phage genome and the genomes of reference phages with known hosts [48]. No bacterial species from the HostPhinder's database were predicted as a target to be infected by UFJF_PfDIW6 (Table S5), contrasting to the experimental evidence that confirmed its effectiveness in reducing raw milk microbiota. This divergence could be explained by limitations of HostPhinder's database regarding its completeness, with a limited number of phage reference genomes and bacterial hosts [48].
The endolysin belongs to the phage lysozyme family (Glycoside hydrolase family 24; PF00959; E-value: 3.7e −4 ). Among the muramidase superfamily of endolysins, PF00959 is the most common Pfam domain in Pseudomonas phages endolysins [88]. In addition, the endolysin contains a signal peptide with a cleavage site at residue 26 (likelihood: 0.9005), and no transmembrane helices were found on its sequence. These results suggest that predicted endolysin contains a canonical signal peptide compatible with the Sec-mediated transport and no SAR (signal-arrest-release) domain.
The canonical lysis module of bacteriophages includes endolysins and holins [89]. Endolysins are responsible for peptidoglycan (PG) disruption after holins form pores in the cytoplasmatic membrane. These pores provide PG access to endolysins [90]. In the case of SP-endolysins, holins also control the depolarization of the cell membrane and trigger the activation of secreted endolysins [90,91]. This is the case for the UFJF_PfDIW6 phage. Some bacteriophages also encode pinholins and, in the case of bacteriophages that infect Gram-negative bacteria, spanins [89]. Spanins form junctions between the inner and the outer membrane of Gram-negative bacteria to promote the release of the new viral particles after viral infection [92]. No spanin homolog was found in the genome of the UFJF_PfDIW6 phage.

Comparative Genomics
Similarity searches revealed that the UFJF_PfDIW6's genome is significantly different from other viral genomes available at the NCBI GenBank database, displaying a maximum coverage of 5% among all alignments (Table S1). This intriguing result led to an investigation into whether its genome could be similar to putative prophages integrated into bacterial genomes once bacteria can harbour several phages that are still uncharacterised [93]. UFJF_PfDIW6's genome showed higher sequence coverages when aligned with genomes from Pseudomonas species, ranging from 38% to 49% (Table S1). In addition, all bacteria's genomic regions that were targets of the alignments contain putative prophages classified as intact by the PHASTER.
Five prophage genomes and two genomes of Pseudomonas phages (AF and HU1) were selected and compared to UFJF_PfDIW6's genome (Figure 1 and Figure S2). The selected genomes were re-opened to the gene that encodes terminase small subunit (TerS). Comparative genomic analysis revealed that they share conserved locally collinear blocks (LCBs) comprising genes of DNA packing and structural proteins modules. UFJF_PfDIW6's genome shares Pseudomonas prophage's one LCB, which includes two genes of the host lysis module. The genes included in these LCBs are syntenic and encode proteins related to virion assembly and its interaction with host bacteria. Based on these genomic features, some insights about the host spectrum of UFJF_PfDIW6 phage can be considered, such as its ability to infect other Pseudomonas species.
Among structural proteins, tail spike protein was suggested as being the only virion component of Pseudomonas phage AF that determines its host specificity [94]. The protein encoded by gene g17 of UFJF_PfDIW6's genome only showed significant similarity with a tail spike protein from a metagenome-assembled genome (MAG) sequence classified as Siphoviridae sp. (Table S2). However, UFJF_PfDIW6's tail spike protein also showed a high similarity with hypothetical proteins of putative prophages of Pseudomonas sp. HN8-3 (NCBI accession WP_228308670.1; 95% of similarity), P. fragi (WP_169907497.1; 81%), P. lundensis (WP_070413385.1; 81%) and P. lactis (WP_057976804.1; 82). These findings suggest a broader host range spectrum for UFJF_PfDIW6 phage and corroborate the experimental evidence about its ability in reducing raw milk microbiota [27].
In contrast to the structural and host lysis modules, the genomic region, including the modules of DNA metabolism and lysogeny, is the primary point of differentiation among all analysed genomes. This region corresponds to 44% of UFJF_PfDIW6's genome and is delimited by genes g18 (position 22,019) and g54 (position 40,821), comprising most of the genes that encode proteins with unknown function (Table S4).

Taxonomic Assignment
Because comparative genomics revealed conserved regions among UFJF_PfDIW6's genome and other phages' genomes, the question is its taxonomic classification.
The genomic similarity matrix calculated by VIRIDIC showed that UFJF_PfDIW6's genome is different from all reference genomes of species classified in the Podoviridae family listed in the current release of the International Committee on Taxonomy of Viruses (ICTV). UFJF_PfDIW6's genome was placed in different species and genus clusters from these genomes (Table S8). Among RefSeq genomes, the genome of Salmonella virus Epsilon 15, from the Uetakevirus genus, showed the highest similarity with 5.54%, far from the genomic similarity threshold of 95% for species assignment and 70% for the genus. The genome of prophage identified in Pseudomonas sp. HN8-3 with 47.81% was the most similar among the genomes presented in Figure 1. In addition, the genomes of Pseudomonas phage AF and Pseudomonas phage HU1 showed 6.83% and 10.22% of genomic similarity, respectively (Table S9).
A phylogenetic hypothesis was inferred using the terminase large subunits (TerL) protein sequences from UFJF_PfDIW6 and other Pseudomonas phages as evolutionary markers to solve its taxonomic assignment ( Figure 2). All genera were included in monophyletic clades in the phylogenetic tree, supported by high bootstrap values and reinforcing the TerL as an excellent phylogenetic marker for phages' genus delimitation. The midpoint rooted maximum likelihood (ML) tree was obtained by analysing terminase large subunit (TerL) amino acid sequences from UFJF_PfDIW6 and other Pseudomonas phages available at NCBI RefSeq database. The bootstrap values (expressed as percentages) calculated from 1000 replicates are shown beside each node. The monophyletic clades corresponding to each genus are shown in different colours. The genomes of Pseudomonas phages were clustered by VIRIDIC using the genomic similarity threshold of 70% for genus assignment (Genus' cluster) and 95% for species assignment (Species' cluster).
The UFJF_PfDIW6 phage was clustered with Pseudomonas phage AF and Pseudomonas phage HU1, which displayed conserved regions in comparative genomic analysis representing a monophyletic cluster of the unclassified genus in the Podoviridae clade. A genomic similarity matrix was also calculated for these Pseudomonas phages and revealed that even phages previously classified in the same genus could be placed in different genus clusters by VIRIDIC (Figure 2). In addition, UFJF_PfDIW6 phage, Pseudomonas phage AF and Pseudomonas phage HU1 were also placed in different genus and species clusters.
The viral proteomic tree (ViPTree), including the reference genomes of the Podoviridae family, also clustered the genomes of UFJF_PfDIW6 phage, Pseudomonas phage AF and Pseudomonas phage HU1 in a distinct monophyletic cluster of unclassified viruses (Figure 3), such as that observed in the phylogenetic tree based on TerL proteins of Pseudomonas phages. Taken together, all evidence presented here reinforces that the UFJF_PfDIW6 phage is representative of a novel genus of the Podoviridae family, such that Pseudomonas phage AF and Pseudomonas phage HU1 are also single species of novel genera. These findings are in line with the need for taxonomic revision of the order Caudovirales through genome-based approaches [36]. . Viral proteomic tree of reference genomes for Podoviridae family. The proteomic tree was obtained from UFJF_PfDIW6 and 130 reference genomes for the genus of Podoviridae phages. The genomes were aligned all-against-all using ViPTree, and the genomic similarity scores (SG) were computed. The cluster, including UFJF_PfDIW6, is highlighted in red. The genomes of Podoviridae phages were also clustered by VIRIDIC using the genomic similarity threshold of 70% for genus assignment (Genus' cluster) and 95% for species assignment (Species' cluster).

Conclusions
In this study, we report a taxonomic proposal for Pseudomonas phage UFJF_PfDIW6 to represent a novel genus of the Podoviridae family. The UFJF_PfDIW6's genome comprises 42,322 bp of linear double-stranded DNA containing 58 genes organised into five functional modules. Despite UFJF_PfDIW6 showing a lytic behaviour in the in vitro assays on the host lawn, a set of genes related to lysogeny was annotated on its genome. Comparative genomic analysis showed that UFJF_PfDIW6 shares, with other Pseudomonas phages, conserved locally collinear blocks (LCBs) comprising genes of DNA packing and structural protein modules related to virion assembly and host recognition. However, UFJF_PfDIW6's sequence is distinct from other genomes of virulent phages and prophages, having genomic similarities below the recommended thresholds for genus and species assignments. Phy-logenetic analysis, using the terminase large subunit (TerL) protein sequences and viral proteomic tree, clustered UFJF_PfDIW6 into a monophyletic cluster of unclassified viruses, being a single representative of a novel genus in the Podoviridae clade.
Supplementary Materials: The following supporting information can be downloaded at: https:// www.mdpi.com/article/10.3390/v14030629/s1, Figure S1. Phenotypic features of Pseudomonas phage UFJF_PfDIW6. (A) Lysis plaques formed by UFJF_PfDIW6 on the lawn of P. fluorescens UFV 041. Incubation conditions: BHI agar at 30 • C for 24 h. (B) Virion morphology. Transmission electron micrograph (TEM) negatively stained with 2% uranyl acetate. Bar magnification: 140,000X. * Tests performed in TSB medium at 30 • C; ** Test performed against 23 bacterial strains belonging to nine genera. The methodology used for phenotypic characterization of UFJF_PfDIW6 was described in detail by Nascimento et. [27]. Figure S2: Conserved locally collinear blocks (LCBs) shared among Pseudomonas UFJF_PfDIW6 phage, Pseudomonas sp. HN8-3 prophage, P. fluorescens str PF08 prophage, P. chlororaphis str PCLAR04 prophage, P. chlororaphis subsp. aurantiaca str CW2 prophage, P. yamanorum str GZD14026 pro-phage, Pseudomonas phage AF, and Pseudomonas phage HU1 genomes. The alignment was performed using progressive Mauve with default parameters. The coloured blocks correspond to the LCBs predicted by Mauve. Horizontal white bars correspond to the Coding DNA sequences (CDSs) annotated in each genome. Table S1: Similarity searches between UFJF_PfDIW6 genome sequence and genomes available at NCBI database. Sequences were aligned using the BLASTn tool BLAST. An E-value threshold ≤ 1e −10 was considered for selecting significant alignments. Table S2: Similarity searches among the proteins annotated in UFJF_PfDIW6 genome sequence and sequences available at NCBI Protein Database (nr). Sequences were aligned using the BLASTp tool of BLAST. An E-value threshold ≤ 1e −10 was considered for selecting significant alignments. Table S3: Functional classification of UFJF_PfDIW6 proteins in Virus Orthologous Groups (VOG). Table S4: Genome annotation of Phage UFJF_PfDIW6. Table S5: Host prediction of phage UFJF_PfDIW6. Table S6: Rho-independent terminators of Phage UFJF_PfDIW6 genome. Table S7: Promoters of Phage UFJF_PfDIW6 genome. Table S8: Intergenomic similarities between UFJF_PfDIW6 genome and genomes of Podoviridae species recognised by the International Committee on Taxonomy of Viruses (ICTV). Genome sequences were compared by VIRIDIC and clustered, considering a genomic similarity threshold of 95% for species assignment and 70% for the genus. Table S9: Intergenomic similarities between UFJF_PfDIW6 genome and genomes presented in Figure 1. Genome sequences were compared by VIRIDIC and clustered, considering a genomic similarity threshold of 95% for species assignment and 70% for the genus.

Data Availability Statement:
The data presented in this study are openly available at the NCBI (https://www.ncbi.nlm.nih.gov/genbank/). The complete genome sequence of phage UFJF_PfDIW6 is available at GenBank database, accession number OM418631.