Next Article in Journal
Persistent Type I Interferon Signaling Impairs Innate Lymphoid Cells During HIV-1 Infection Under Suppressive ART
Previous Article in Journal
Homologous and Heterologous Vaccination Regimens with mRNA and rVSV Platforms Induce Potent Immune Responses Against SFTSV Glycoprotein
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

New Isolates of Betachloroviruses Shed Light on the Diversity and Biological Complexity of an Unexplored Group of Giant Algal Viruses

by
Júlia W. Souza
1,
Lethícia R. Henriques
1,2,
Roger M. Carlson
3,4,
Bruna B. F. Botelho
1,
João Victor R. P. Carvalho
1,
João Pedro N. Santos
5,
Eric R. G. R. Aguiar
5,
Irina V. Agarkova
3,4,
James L. Van Etten
3,4,
David D. Dunigan
3,4,* and
Rodrigo A. L. Rodrigues
1,*
1
Laboratório de Vírus, Departamento de Microbiologia, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, MG, Brazil
2
Núcleo de Apoio Técnico ao Ensino, Pesquisa e Extensão, Instituto de Ciências Ambientais, Químicas e Farmacêuticas, Universidade Federal de São Paulo, Diadema 09913-030, SP, Brazil
3
Nebraska Center for Virology, University of Nebraska, Lincoln, NE 68583-0900, USA
4
Department of Plant Pathology, University of Nebraska, Lincoln, NE 68583-0722, USA
5
Laboratório de Bioinformática de Vírus, Universidade Estadual de Santa Cruz, Ilhéus 45662-900, BA, Brazil
*
Authors to whom correspondence should be addressed.
Viruses 2025, 17(8), 1096; https://doi.org/10.3390/v17081096
Submission received: 18 June 2025 / Revised: 22 July 2025 / Accepted: 6 August 2025 / Published: 8 August 2025
(This article belongs to the Special Issue Cyanophage and Algal Virus)

Abstract

The majority of giant algal viruses belong to the family Phycodnaviridae, class Algavirales, phylum Nucleocytoviricota. Among them, the genus Chlorovirus is the most studied, with three recognized groups based on genomics and host range, although many fundamental questions remain to be elucidated, particularly regarding their diversity. In this study, we focus on betachloroviruses, a poorly explored subgroup that infects the alga Micractinium conductrix Pbi. Here, we describe the isolation and genomic analysis of 11 new betachloroviruses from water samples collected in Nebraska, USA. With 25 fully sequenced genomes now available, we assessed the genomic diversity of these viruses. They have double-stranded DNA genomes ranging from 295 to 374 kbp, encoding hundreds of ORFs, of which a large number (~40%) lack known function. Comparative genomics and phylogenetic analyses revealed three species of betachlorovirus, each with high intra-species genomic identity. Notably, some isolates with over 99.5% genomic identity display markedly different plaque phenotypes, which led us to propose the use of the term genomovar among giant algal viruses, a concept potentially applicable to other giant viral groups yet to be explored. Altogether, this work advances our understanding of betachloroviruses and highlights the importance of linking viral genotype to phenotype, opening new avenues for exploring the diversity of giant algal viruses.

1. Introduction

Aquatic environments are rich in biodiversity and harbor an astronomical number of viruses, with estimates indicating around 1.8 × 1027 viral particles in freshwater environments [1]. Most of these viruses infect prokaryotes, but a significant portion infect diverse eukaryotes, from large vertebrates to the smallest unicellular organisms, such as amoebas and algae. These viruses include the so-called giant viruses, officially classified in the phylum Nucleocytoviricota [2,3]. This group comprises large nucleocytoplasmic DNA viruses, with particle and/or genome sizes above the average in the virosphere, which have their genetic material replicated in the nucleus or cytoplasm of host cells. According to the International Committee on Taxonomy of Viruses (ICTV), this group is currently divided into three classes, five orders, and 15 families, with the family Phycodnaviridae (class Megaviricetes, order Algavirales) being the one that includes the majority of the known giant algal viruses [4,5].
The family Phycodnaviridae is composed of viruses that infect different types of algae. The family consists of six genera, historically based on the host range of these viruses, that was later corroborated by genomic data [5,6]. Among the phycodnaviruses, the genus Chlorovirus is the most studied and was the first to be established due to the isolation and characterization of Paramecium bursaria Chlorella virus 1 (PBCV-1), a giant algal virus described as capable of infecting chlorella-like cells and forming lysis plaques under laboratory conditions [7,8]. Chloroviruses are found in inland waters and can infect different species of algae in the family Chlorellaceae that are endosymbionts of protozoa and animals, as far as it is known [9]. There are three well-defined clades of chloroviruses, which we assigned to subgenera and different species based on multiple lines of evidence, including host range, phylogeny, and several genomic analyses [9,10]. The “alphachloroviruses” infect strains of Chlorella variabilis; there are currently 68 isolates with sequenced genomes available in the databases [10]. An important feature of these viruses is an apparent host-driven specialization, considering that several species of “alphachloroviruses” can only infect the Syngen 2–3 strain of C. variabilis [10]. The “gammachloroviruses” infect Chlorella heliozoae. In our analysis of 37 isolates, we demonstrated a great diversity in this group, with the definition of 10 viral species, an open pan-genome, and a curious absence of ecological barriers among these viruses [11]. The “betachloroviruses” are the least studied chlorovirus subgenus, with only 14 isolates described to date. This group has historically been called Pbi-viruses, due to their plaque-forming infection of Micractinium conductrix Pbi [12,13].
The genomes of the first betachloroviruses were sequenced almost two decades ago, revealing linear DNA genomes of approximately 320 kbp, more than 300 protein-coding sequences, and up to 10 genes for tRNAs [12]. A few years later, more genomes were sequenced from isolates obtained from different locations, such as the USA, Canada, France, Germany, Norway, and the Czech Republic, revealing a group of viruses with apparently lower genetic diversity when compared to other groups of chloroviruses [13]. An exception was an isolate obtained from samples from Nebraska/USA (NE-JV-1) that had genomic characteristics that distinguished it from other chloroviruses (e.g., absence of genomic collinearity and a more basal phylogenetic position compared to other Pbi-viruses) [13,14]. Given this scenario, isolating and characterizing new betachloroviruses is necessary to improve our understanding of the diversity and biology of these viruses.
In this work, we report the isolation and genomic characterization of 11 new betachloroviruses obtained from one lake and two rivers in Nebraska/USA. Together with the genomic data of the other betachloroviruses, we performed a series of genomic and phylogenetic analyses that led us to propose the existence of 3 species of betachloroviruses and to evaluate the intraspecific diversity of this group. One of the species contains only two isolates, NE-JV-1 and a new, closely related isolate, suggesting that the genetic diversity of this group of viruses is still far from being completely elucidated. From genotypic and phenotypic analyses, we suggest the existence of chlorovirus genomovars, something that can be explored for other giant viruses in the future, opening doors to new discoveries related to the biology of viruses that have been explored limitedly until now.

2. Materials and Methods

2.1. Virus Isolation and Purification

Water samples were collected from distinct lakes at the Crescent Lake National Wildlife Refuge in Nebraska (USA), as well as nearby rivers, to investigate the presence of chloroviruses between 2017 and 2020 under US Fish and Wildlife Service Approved Permit # CRL-8_002. The samples were filtered through a 0.45 µm membrane (PES filters, Sartorius, Göttingen, Germany) and plaque-assayed using Micractinium conductrix Pbi cells grown on FES agar plates [15], incubated for up to two weeks at 25 °C under constant light. Plaques were picked and plaque assayed serially to achieve pure, single plaque isolates. Viral production and purification on sucrose density gradients were performed as previously described [16,17].

2.2. Plaque Phenotype Assays

To investigate the plaque phenotypes of different isolates, we used purified viral isolates in plaque assays [7,16]. Each viral isolate that was selected for this project went through at least three rounds of single plaque isolation in series (independent infections). Briefly, 0.1 mL of virus lysate, diluted 10−2, 10−4, and 10−6 in virus stabilization buffer, was mixed with 0.3 mL of concentrated cells (2.0 × 108 cells/mL) and added to 2.5 mL of 0.75% FES-agar medium, which had been diluted by the addition of 0.9 mL of virus stabilization buffer. The top agar mixture was poured onto 1.5% FES-agar plates and allowed to solidify. The infections were all done on the same day with the same cell density, agar thickness and incubation conditions (medium, light, temperature) to allow proper comparison. Plates were incubated at 25 °C in continuous light and evaluated at 5, 7, 12, and 14 days of incubation. Photographs were also taken at these time points. Photographs were made using a Nikon D5200 DSLR camera with a Micro-Nikkor 55 mm 1:2.8 manual focus lens (Nikon Corporation, Tokyo, Japan). Camera settings: shutter speed 1/125, ISO 100, f5.6, custom white balance set to light source. A fluorescent light box with black paper mask was used to backlight plates with the camera mounted to a fixed arm above the light box to provide a consistent scale for comparison of plaques.

2.3. Genome Sequencing and Assembly

This study utilized genome sequences from 25 Micractinium conductrix-infecting viruses. The genome sequences of 14 betachloroviruses isolated before 2008 were downloaded from public databases [13]. The viral genomes of the other 11 isolates were sequenced and assembled as described by Carvalho et al. [10]. Briefly, new isolates were sequenced using PacBio Sequel II platform (Pacific Biosciences of California, Inc., Menlo Park, CA, USA. Raw long-read data were assembled de novo using Canu version 2.2 [18]. All genomes were assembled into a single scaffold with >40× coverage. The new viral genomes were deposited in the NCBI GenBank database with accession numbers PV288763 to PV288773.

2.4. Gene Prediction and Annotation

Coding sequences (CDS) were predicted using GeneMarkS version 4.28 [19] online software using default parameters. tRNA predictions were made using ARAGORN version 1.2.41 [20] and tRNAscan-SE [21,22]. Results were processed by removing predicted CDSs shorter than 40 amino acids or fewer than 150 nucleotides from strand ends. CDSs overlapping tRNA regions or containing stop codons were also excluded. Ambiguous tRNAs predicted by both software were removed from the final dataset, as well as pseudo tRNAs predicted by either software. The CDSs were annotated using a combined strategy of software to improve the quality of gene annotation. The NCBI nr database was queried using the Diamond algorithm [23], considering only results with an E-value lower than 10−5. Protein domains were identified using HHpred online [24], with results accepted only if the probability was ≥80%. Both results were compared to determine the best final hit. In cases of discordance, InterProScan 5 software [25] was used. For functional characterization, final hits were classified based on the Nucleocytoviricota orthologous groups of genes (NCVOG) database [26].

2.5. Phylogenetic Analyses

Alignments of amino acid sequences of the NCLDV hallmark genes [3], including DNA polymerase family B, DNA topoisomerase II, SNF2-like helicase, packing ATPase A32, transcription initiation factor IIB, and poxvirus late transcription factor VLTF3, were concatenated to construct a phylogenetic tree of M. conductrix-infecting viruses. In addition to the 25 betachloroviruses (Table S1), Acanthocystis turfacea Chlorella virus 1 (ATCV-1) (GenBank accession: NC_008724.1) was included as the outgroup. The six datasets were aligned using Muscle version 5 [27] and concatenated in MEGA version 11 [28]. The alignments were analyzed using the maximum likelihood method and best evolutionary model defined using ModelFinder implemented in IQ-TREE 2 [29,30], with 1000 bootstrap replicates for statistical support of the nodes. The concatenated and individual trees were visualized using iToL version 7 [31].

2.6. Genomic Collinearity Analysis and Species Demarcation Criteria

In order to evaluate the synteny and the alignment of genomic elements, the Mauve version 2.4.0 [32] and the Dynamic Genomic Alignment server (DiGAlign) version 2.0 were employed [33]. The average nucleotide identity (ANI) of the 25 whole genomes was calculated using FastANI version 1.3 [34], hosted on the European Galaxy server (usegalaxy.eu). The average amino acid identity (AAI) was calculated using EzAAI [35]. The results were organized into a similarity matrix that served as input for constructing a heatmap using in-house scripts described previously [10]. Based on the heatmap, the species of betachloroviruses were demarcated using a cutoff of 94% identity, a value consistent with our previous work on alphachloroviruses to define a chlorovirus species [10]. Hierarchical clustering was conducted using Pearson correlation above 0.7 using the hclust package version 3.6.2 in R [36]. For comparison with species from other groups, such as alphachloroviruses and gammachloroviruses, we included ANI values comparing isolates from the species Chlorovirus vanettense and Chlorovirus heliozoae. The isolates belonging to each chlorovirus species were defined in previous studies, describing the type isolates from the alpha- and gammachloroviruses [10,11]. Hierarchical clustering was conducted as described above for betachloroviruses.

2.7. Ortholog Clustering and Pan-Genome Construction

Predicted CDSs were clustered into orthologous groups (COGs) using the OrthoFinder tool version 2.5.5 [37], with an MCL inflation parameter of 4. Each OrthoFinder run included sequential groups of multi-fasta files as input, containing amino acid sequences. In constructing the pan-genome of Micractinium conductrix-infecting viruses, both the number of core genes (those shared by all isolates) and total COGs were considered. Results from each run were organized and plotted using GraphPad Prism 9, generating a pan-genome evolution graph. To visualize the overall COG-sharing among the M. conductrix-infecting viruses, we utilized Gephi version 0.9 [38] and employed the Force-Atlas 2 algorithm to generate the network layout, with minimal manual adjustments to the nodes to highlight the singletons. To compare the COG-sharing between virus species, we used COG data shared between isolates of each of the three species to build an upset plot using UpSetR version 1.4.0 in conjunction with the ComplexHeatmap package version 2.22.0 in R [39,40].

3. Results

3.1. Phylogenetics and Genomic Features of New Betachloroviruses

Eleven new chloroviruses were isolated and sequenced in this study, increasing the total number of viruses associated with Micractinium conductrix Pbi to 25 isolates (Table S1). These viruses have genome sizes ranging from 295 kbp (NE-P-2-m) to 374 kbp (NE-P-6-s), with an average GC content of 45%, varying between 44.1% and 46.8%. Betachloroviruses encode hundreds of CDSs, with an average of 377 CDSs per genome, ranging from 337 (NE-P-2-m) to 421 (NE-P-6-s). There is a positive correlation between genome size and the number of CDSs. However, this relationship does not extend to tRNA-coding genes. On average, betachloroviruses encode eight tRNA genes per genome, with observed values ranging from five (NE-JV-1) to twelve (several isolates). Interestingly, the isolate with the smallest genome (NE-P-2-m) is among those with the highest number of tRNAs, while the isolate with the fewest tRNAs (NE-JV-1) has a genome size close to the average for betachloroviruses, approximately 326 kbp (Figure 1).
To reconstruct the phylogeny of betachloroviruses, we used six of the most conserved genes among giant viruses [3], employing isolate ATCV-1 (Gammachlorovirus) as an outgroup. The resulting phylogeny reveals two well-supported clades within betachloroviruses. One of these clades forms a long branch composed of isolates NE-JV-1 and NE-P-6-s (Figure 1 and Figure S1), suggesting substantial genomic divergence. The other clade features much shorter branches, indicative of a greater genetic conservation among its members. We identified a distinct subclade comprising 11 isolates, eight of which were newly obtained from Island Lake in the Crescent Lake National Wildlife Refuge in Nebraska, and from the North and South Platte rivers, also in Nebraska. Remarkably, the remaining three isolates in this group were collected from geographically distant locations: one from Oregon, USA (OR0704.2.2), one from France (Fr5L), and one from the Czech Republic (CZ-2) (Figure 1; Table S1). This observation suggests the existence of a genetically cohesive group of betachloroviruses with a broad geographic distribution.
In all isolates, we identified conserved genes typical of the phylum Nucleocytoviricota, including DNA polymerase family B (PolB), Major Capsid Protein (MCP), Transcription Factor IIB (TFIIIB), DNA Topoisomerase II (TopoII), DNA packaging ATPase (A32), Poxvirus Late Transcription Factor 3 (VLTF3), and DEAD/SFN2-like helicase (SFII). As previously reported for chloroviruses [9,13,41], no homologs of RNA polymerase were found (Figure 2A). Betachloroviruses possess several genes associated with DNA replication, recombination, and repair, such as methyltransferases (specific for adenine and cytosine), primase, and endonucleases. Genes involved in carbohydrate metabolism are also present, including those encoding alginate lyase, chitinase, β-glucanase, and multiple glycosyltransferases. However, the absence of some of these enzymes in certain isolates suggests variability in glycosylation patterns among betachloroviruses (Figure 2A). In addition, a variety of genes involved in other metabolic functions were identified, such as those related to nucleotide metabolism (e.g., dCMP deaminase), membrane transport (e.g., potassium channel protein Kcv), gene transcription (e.g., mRNA capping enzyme), and protein metabolism (e.g., polypeptide chain elongation factor 3) (Figure 2A). We also observed several homologs of capsid proteins previously described in the PBCV-1 genome [42].
One of the most interesting genes found in 23 of the 25 betachloroviruses codes for a homolog of heliorhodopsin proteins (Figure 2A). The two betachloroviruses (NE-JV-1 and NE-P-6-s) that lack this gene are those distantly related to the others in phylogenetic reconstructions. None of the alpha- and gammachlorovirus members have this gene. Some of the betachlorovirus heliorhodopsin-like proteins are about 120 amino acids longer than authentic heliorhodopsin proteins, although they have internal Met residues that could be the translation starting point for the proteins, which would make them similar in size. Two of the 23 betachloroviruses encoding heliorhodopsin-like proteins have truncated versions of the protein (P-NE-12 and NE-P-8-f), with only 109 amino acid residues. All three types of light using rhodopsins have lysine in the seventh transmembrane helix which is essential for the protein to be functional [43]. However, the 23 betachloroviruses have an asparagine in this position.
Chloroviruses are known to encode tRNAs, typically clustered in a central region of the genome, which may supplement the host’s tRNA pool during infection [44]. In betachloroviruses, we similarly observed that tRNA genes are predominantly located near the center of the genome. These viruses encode tRNAs for 10 of the 20 canonical amino acids. Among these, tRNA-Arg, tRNA-Leu, and tRNA-Thr are present in all 25 isolates (Figure 2B). For tRNA-Leu and tRNA-Thr, two anticodons were detected in different isolates, but unevenly distributed. In contrast, tRNA-Asp was found exclusively in the isolate NE-P-6-s. Some tRNAs, such as tRNA-Asn, tRNA-Lys and tRNA-Gly, were present in almost all isolates, with the exception of NE-JV-1, the isolate with the lowest number of tRNAs among the genus Chlorovirus (Figure 2B). Interestingly, only tRNA-Asn has duplications in different isolates, with the isolate CZ-2 having four copies of this gene. tRNA-Tyr is also present in most isolates, except for AP110A and the two most divergent isolates, NE-JV-1 and NE-P-6-s. tRNA-Ile and tRNA-Phe show a more heterogeneous distribution, with no clear pattern of conservation among isolates (Figure 2B).

3.2. Genomic Organization of Different Species of Betachloroviruses

Chloroviruses have been divided into three main groups based on the host they infect [9,13]. This classification is supported by phylogenetic and genomic data (e.g., genome synteny and gene identity), which reveal significant divergence among the clades [13,41]. When comparing the genomic structure of the newly isolated betachloroviruses with previously described isolates, we observed a high degree of genomic collinearity, indicating strong synteny among these viral genomes (Figure 3 and Figure S2). In some cases, genomic inversions were observed in the first half of the genome (e.g., NE-P-1-L vs. NE-P-3-s). Synteny analysis revealed that the isolates NE-JV-1 and NE-P-6-s have a high degree of conservation in their genomes when compared to each other, yet exhibit a completely different gene organization compared to the other betachloroviruses. This observation supports the distinct long branch these isolates occupy in the phylogeny (Figure 1 and Figure 3). In these two isolates, a genomic inversion was also detected, occurring in the central region of the genome, between 150 and 250 kbp. When comparing the synteny of the new betachlorovirus isolates with representatives from other chlorovirus groups, we observed a complete lack of synteny, clearly underscoring the genomic distinction between the three clades within the Chlorovirus genus (Figure S3).
Average Nucleotide Identity (ANI) analysis, combined with hierarchical clustering using Pearson correlation, revealed a clear separation of betachloroviruses into three distinct clusters, corroborating both the phylogenetic and synteny data (Figure 4). Each cluster was defined by isolates sharing at least 94% nucleotide identity (Table S2). Between clusters, identity values dropped to as low as 82.97%. Based on the >94% ANI threshold, along with supporting evidence from phylogeny and genomic organization, it is possible to delineate three distinct species within the group of betachloroviruses. Furthermore, the ANI values were highly consistent with the results from the Average Amino Acid Identity (AAI) analysis, which also revealed the separation of isolates into three clusters, with intraspecific similarity values ranging from 93.24% to 99.97% (Figure S4).
Among the three clusters—hereafter defined as species I (12 isolates), species II (11 isolates), and species III (2 isolates)—the latter is clearly the most divergent, sharing approximately 83% nucleotide identity with isolates from the other two species. This divergence is supported by the long branch in the phylogenetic analysis. Species I exhibits the highest intraspecific genetic variation, with ANI values ranging from approximately 94% to almost 100%. In contrast, species II shows a narrower range of genetic variability, with ANI values between 96.31% and 99.99%. Species III, composed of only two isolates, displays a genetic difference of only 1.65% between them (Table S2). Some isolates are nearly identical, showing minimal differences in genetic content and forming tight subclusters with over 99% ANI (Figure 4; Table S2). This pattern is observed in a subcluster within species I (isolates CVR-1 and CVA-1) and in two subclusters within species II: subcluster 1 (isolates NE-P-8-f, NE-P-3-s, NE-P-4-fs, and P-NE-12) and subcluster 2 (isolates P-NE-13, P-NE-11, NE-P-1-L, and NE-P-1-m) (Figure S5).
To date, betachloroviruses exhibit the lowest genetic diversity within the Chlorovirus genus, being composed of only three species with markedly lower intraspecific variation when compared to alphachloroviruses and gammachloroviruses. It is noteworthy that, across all groups, the use of ANI values > 94%, in combination with phylogenomic analyses, has proven to be a robust criterion for species delineation within the Chlorovirus genus [10]. For comparative purposes, we analyzed the genetic variation within representative species of the other Chlorovirus clades. In Chlorovirus vanettense (22 isolates infecting Chlorella variabilis NC64A), ANI values range from 94% to 99.98%, with four subclusters identified in which isolates share at least 99% ANI. The most prominent subcluster includes isolates PBCV-1, XZ-4A, XZ-3A, 40-NE-4, 40-NE-3, and 41-NE-5 (Figure S6A). For Chlorovirus heliozoae (18 isolates infecting Chlorella heliozoae SAG 3.83), ANI values range from 94% to 99.16%, with only three subclusters, each composed of two isolates sharing over 99% ANI (Figure S6B). These findings reinforce that gammachloroviruses exhibit the highest genetic diversity known among chloroviruses to date [11].

3.3. Pan-Genome Evolution and COG-Sharing Pattern of Betachloroviruses

Although betachloroviruses exhibit lower genetic diversity compared to other chlorovirus groups, we still observed an upward trend in their pan-genome, indicating an open pan-genome (Figure 5A). Among the 25 isolates analyzed, a total of 9431 genes were grouped into 678 clusters of orthologous genes (COGs), of which 223 were shared by all isolates, constituting the core genome of betachloroviruses (Figure 5B,C). A total of 334 COGs were shared by 2 to 24 isolates, forming the satellite genome of the group, while 121 COGs were classified as singletons, being present in only one isolate (Figure 5B,C). Among these, isolate NE-P-6-s stands out for containing the highest number of singletons among betachloroviruses, with 13 unique COGs. In contrast, some isolates, such as NE-P-3-s and NE-P-4-fs, have only one singleton each. This observation corroborates the ANI data, as these isolates present ANI values above 99%, and thus, a large number of unique genes would not be expected. Nonetheless, it is noteworthy that even nearly identical genomes harbor exclusive genes, which may reflect biological differences between the isolates.
Species II harbors the largest number of COGs, with a total of 527 shared among the 11 isolates comprising the species, followed by species I with 458 COGs and species III with 382 COGs (Figure 5B,C). When analyzing COG sharing across species, we observed that 285 COGs (42.2%) are shared by at least one isolate from each species, while 262 COGs (38.9%) are exclusive to one or more species. Species III has the fewest exclusive COGs (64; 9.5%), but the highest number of singletons by isolate (Figure 5B,C). Species I and II share 92 COGs, the highest number among interspecies comparisons. In contrast, species III shares the fewest COGs with the others—only 11 with species I and 22 with species II—highlighting the high genetic divergence of species III relative to the others (Figure 5B,C). Among the COGs exclusive to species III, isolate NE-JV-1 contains 8 singletons, and NE-P-6-s has 13. Most of these encode hypothetical proteins, and a substantial portion are classified as ORFans (n = 10/13; 76.9%).

3.4. Quasi-Identical Isolates Exhibit Different Plaque Phenotypes

Among the 25 betachlorovirus isolates, we observed some isolates that have nearly identical genomes, i.e., ANI values above 99.5%, some sharing up to 99.99% average nucleotide identity (Figure 6A). In genomes with an average size of 330 kbp, as is the case with these betachloroviruses, this represents a genetic difference of less than 200 nucleotides between some isolates (e.g., CVA-1 and CVR-1; ANI = 99.95%). In species I, there are two nearly identical genomes, which we can consider as genetic variants, or genomovars [45], which share 99.95% of nucleotides. In species II, we observed two groups of isolates that form genomovars, whose isolates share average ANI values of 99.94% (genomovar II.1; isolates P-NE-11, P-NE-13, NE-P-1-L, and NE-P-2-m) and 99.79% (genomovar II.2; isolates P-NE-12, NE-P-3-s, NE-P-8-f, and NE-P-4-fs) (Figure 6A). Interestingly, although some isolates have high genomic identity, they exhibit considerably different plaque morphologies.
Viral isolates of genomovar I.1 (isolates CVA-1 and CVR-1) share 99.95% average identity and the plaque phenotype is highly similar, exhibiting large plaques (>4 mm) with regular edges and clear plaque interiors (Figure 6B,C). It is possible that these isolates exhibit very similar characteristics in general, being almost clones, especially considering that they were isolated from the same region (Table S1). However, when we analyzed the genomovars of species II, we observed a curious variety of plaque forms in isolates that share more than 99.50% nucleotide identity. The genomovar II.1 isolates exhibit similar plaque phenotypes, sharing ANI > 99.79%. All four isolates form large plaques with regular edges, with NE-P-2-m forming slightly smaller plaques compared to the others. Curiously, isolate P-NE-11 forms large plaques with more diffuse boundaries than the other three isolates in its genomovar; Some plaques of P-NE-11 even display a cloudy ring near the edge that is not seen in others (Figure 6B,C). However, these isolates exhibit only 0.01% genomic divergence from each other, corresponding to a difference of less than 50 nucleotides on average. It is even more intriguing when we look at the genomovar II.2 isolates. These isolates share less than 0.05% genomic divergence, but have highly different plaque phenotypes. Isolates P-NE-12, NE-P-3-s, and NE-P-4-fs form small plaques (<2 mm) with regular edges, while isolate NE-P-8-f forms medium-size plaques (~2.5 mm), with cloudy interior and irregular boundaries. These isolates have ANI = 99.55% (Figure 6). One feature that should not be overlooked is that all isolates show some degree of variability in plaque appearance, with some being more consistent (e.g., CVA-1) and some showing different plaque types (e.g., NE-P-8-f); the plaque representations in Figure 6B are the average plaque appearance observed for each isolate.
It is still unclear what is behind this morphotypic difference in nearly genetically identical viruses. All isolates in Species II genomovar 1 are from Island Lake. Isolates in Species II genomovar 2 are from Island Lake and the North Platte River (NE-P-8-f). Most are from the same sample year. These data suggest that there is a rich biological diversity to be explored in these locations, even considering viruses with very similar genetic characteristics.

4. Discussion

Chloroviruses were discovered in the early 1980s and expanded the frontiers of virology by having large icosahedral particles (~190 nm) and large genomes encoding hundreds of proteins, resulting in the proposal of the term “giant virus” [7,8,46,47]. Since then, much has been discovered, especially in relation to the prototype chlorovirus PBCV-1, now classified as part of the species Chlorovirus vanettense. Based on many studies involving PBCV-1, we now know that chloroviruses are much more complex than initially imagined. Chloroviruses are pseudoicosahedral particles with capsids composed of many proteins, including at least 5 MCP variants and more than 25 minor capsid proteins [40], and a unique and autonomous glycosylation system [48,49,50,51,52]. Also, they are capable of manipulating the nucleotide metabolism of the host to better suit their molecular requirements during replication [53]. With the discovery of new isolates genetically close to PBCV-1, we observed great diversity and the existence of two main groups of these viruses, with one of them being capable of infecting only one type of host, the so-called OSy-virus, which apparently is derived from a subpopulation of the now-called alphachloroviruses [10,16]. In addition to these, other viruses related to another model virus, ATCV-1, were isolated and characterized, revealing great genetic diversity within the gammachloroviruses [11,54]. However, a group that has not been explored until now is the betachloroviruses (traditionally known as Pbi-viruses). In this study, we sought to fill some gaps in our understanding of their diversity and genomics.
The first betachlorovirus genomes were published in 2007, belonging to isolates FR483 (site of collection: France) and MT325 (site of collection: USA), revealing genomes of 321 and 314 kbp, respectively, and low collinearity compared to PBCV-1 and ATCV-1 [12]. At the time, they were the smallest chlorovirus genomes known. Subsequently, other chloroviruses associated with Micractinium conductrix were isolated and sequenced, most obtained from European samples, revealing genomes ranging from 303 to 330 kbp; an analysis comparing these with other isolates’ genomes demonstrated an open pan-genome for the genus Chlorovirus [13,41]. In this work, we describe 11 new isolates, increasing the total number of sequenced betachloroviruses to 25, with genomes ranging from 295 to 374 kbp, expanding the limits of genome size for these viruses. These viruses have an average of 370 CDSs, of which a large number (~40%) encode proteins with no known function. This pattern is similar to that observed for other giant viruses, not only chloroviruses, indicating that there is still much to be explored by applying the omics sciences to the study of Nucleocytoviricota [2]. Curiously, betachloroviruses are the only chloroviruses that harbor a heliorhodopsin-like homolog, except for two isolates (species III). These genes have been described in coccolithoviruses and are characterized as a light-activated proton transporter that has been hypothesized to be involved in overcoming host defense mechanisms [55]. The role of heliorhodopsin-like proteins in betachloroviruses is still unclear. We speculate that they may be involved in response to light availability during infection. Bornemann and Follman (1997) reported the appearance of concentric rings within viral plaques when betachlorovirus-infected cells were incubated with a circadian change in light intensity, but plaques remained uniformly clear under constant light [56]. Assessing the impact of heliorhodopsins under different light conditions may provide valuable insights into the role of these proteins in chloroviruses.
Synteny analyses revealed that betachloroviruses generally exhibit strong genomic collinearity among themselves, with a few inverted regions in some isolates. Notably, two viruses stand out for having a gene organization markedly distinct from the others. These results reinforce findings from previous studies, which have highlighted the extensive genomic diversity among the three chlorovirus groups [41]. Using ANI and AAI analyses and applying a 94% identity threshold among isolates, we were able to define three distinct species of betachloroviruses. This threshold aligns well with observations in alphachloroviruses and gammachloroviruses [10,11], supporting its robustness as a criterion for species delineation within the Chlorovirus genus. When integrated with additional evidence such as phylogenetic relationships and host range, this approach strengthens taxonomic proposals in line with ICTV guidelines [57].
Although betachloroviruses display lower genetic variability compared to other chloroviruses, they still exhibit an expanding pan-genome. This suggests that the continued isolation and characterization of new strains may uncover genetic novelties and further increase the slope of the pan-genome curve. As observed in alphachloroviruses and gammachloroviruses [10,11], the core-genome curve reaches a plateau, indicating a probable set of essential genes that define members of this viral group. It is important to highlight that one of the identified betachlorovirus species is currently represented by only two isolates. The recent discovery of NE-P-6-s is particularly significant, demonstrating the profound impact of virus prospecting efforts on our understanding of viral diversity. Until now, isolate NE-JV-1 had been regarded as a phylogenetic outlier due to its considerable divergence from other betachloroviruses [9,13,14]. Depending on the molecular marker used for phylogenetic reconstructions, NE-JV-1 occasionally appeared more closely related to gammachloroviruses [14,41], underscoring how limited sampling can skew evolutionary interpretations.
Among the betachlorovirus isolates, we found no clear relationship between geographic origin and genetic diversity, unlike what has been considered for other giant virus groups [58]. Viruses isolated from distant countries such as France, Norway, and Canada exhibit high genomic similarity and belong to the same species. Conversely, isolates from a single water sample in the United States show greater genetic divergence and are classified into different species (e.g., both P-NE-9 and NE-P-8-f are from the same North Platte River sample collected in October, 2017). It is noteworthy that all currently known betachlorovirus isolates originate from samples collected in North America and Europe. Expanding virus isolation efforts to underrepresented regions of the world will be essential to gain a more complete understanding of the true diversity and global distribution of these viruses.
When analyzing intraspecific diversity, we observed only slight genomic variation among some isolates, with several pairs displaying ANI values above 99%. A comprehensive study involving over 1000 viral genomes, predominantly bacteriophages but also including eukaryotic viruses, proposed the use of ANI to define intraspecific viral units, particularly when identity exceeds 99% [45]. The authors introduced the term “genomovar” to describe each of these subunits within a viral species. Our data support the existence of genomovars in chloroviruses. Remarkably, we observed that some of these genomovars exhibit distinct phenotypic traits despite having nearly identical genomes. One of the most striking phenotypic differences is the variation in plaque morphology, a characteristic often used to infer viral virulence and to screen for genetically modified clones [59,60,61]. Curiously, even in highly purified isolates, we can still observe slightly different plaque phenotypes, indicating a considerable heterogeneity in biological traits of chloroviruses. It is important to note that plaque assays for comparison purposes should be performed simultaneously to control for potential variability that may arise between experiments. Plaque assays performed at different times with different cell cultures may result in a difference in plaque appearance, making data interpretation difficult. Despite our efforts to purify viral plaques, we still observed variations in plaque sizes. When analyzing the raw genome sequences, we found no evidence of different viruses in the samples. We still do not have an explanation for the observed variation in plaque size among chlorovirus isolates, and this can be better evaluated in future studies.
It is possible that subtle genetic differences, such as small deletions or point mutations, underlie these morphotypic variations. Similar phenomena have been documented in other giant viruses, such as poxviruses. For instance, vaccinia virus has been shown to display at least two distinct plaque morphologies, associated with minor variations in specific genes, although a definitive genotype-to-phenotype link has yet to be established [62,63]. This pattern is also observed in RNA viruses like enteroviruses and flaviviruses, where single-nucleotide polymorphisms may influence plaque morphotype and pathogenic potential [64,65]. Further investigation into betachlorovirus genomovars, especially through integrative genomic and biological analyses, will be key to uncovering the molecular basis of intraspecific variation and advancing our understanding of the complex biology of these giant algal viruses.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v17081096/s1, Figure S1: Phylogenetic trees of betachloroviruses; Figure S2: Synteny of all 25 betachloroviruses; Figure S3: Comparative synteny among selected members of the 3 chlorovirus subgenera; Figure S4: Amino Acid Identity (AAI) analysis of betachloroviruses; Figure S5: Average Nucleotide Identity of each betachlorovirus species; Figure S6: Average Nucleotide Identity of alphachlorovirus and gammachlorovirus; Table S1: General genomic features and isolation information of betachloroviruses; Table S2: Average Nucleotide Identity values of betachloroviruses.

Author Contributions

Conceptualization, R.A.L.R.; formal analysis, J.W.S., L.R.H., B.B.F.B., J.V.R.P.C., R.M.C. and J.P.N.S.; methodology, J.V.R.P.C., L.R.H., B.B.F.B., R.M.C. and I.V.A.; investigation, J.W.S. and R.A.L.R.; data curation, J.W.S., L.R.H., B.B.F.B. and R.A.L.R.; writing—original draft preparation, R.A.L.R.; writing—review and editing, J.W.S., L.R.H., E.R.G.R.A., R.M.C., I.V.A., J.L.V.E., D.D.D. and R.A.L.R.; supervision, R.A.L.R., D.D.D. and J.L.V.E.; project administration, R.A.L.R., D.D.D. and J.L.V.E.; funding acquisition, R.A.L.R., E.R.G.R.A., D.D.D. and J.L.V.E. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially funded by the National Council for Scientific and Technological Development (CNPq), Grant No. 402038/2023-1; Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG), Grant No. APQ-01057–23; Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)—Finance Code 001; National Science Foundation under grant 1736030 (D.D.D. and J.L.V.E.), the University of Nebraska-Lincoln Agricultural Research Division (J.L.V.E.), the University of Nebraska-Lincoln Office of Research and Economic Development (D.D.D.), University of Nebraska-Collaboration Initiative (D.D.D.), and Algal Virus Research Funds from the University of Nebraska Foundation (J.L.V.E.). R.A.L.R. and E.R.G.R.A. are CNPq researchers. The funders played no role in study design, data collection, analysis and interpretation of data, or the writing of this manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All genomic data used in this work is publicly available at the GenBank database. The new viral genomes were deposited in the NCBI GenBank database with accession numbers PV288763 to PV288773. All genomic data used in this research is publicly available at the BioProject PRJNA1154233 “Chlorovirus Diversity”. This site contains links to all chlorovirus genomes in GenBank as well as the raw sequence data in the SRA, and information about the isolates.

Acknowledgments

We thank all the members and colleagues of the Virus Laboratory of the Federal University of Minas Gerais, Brazil, for all the scientific and technical support during the preparation of this work. We also thank Glória Franco for providing us access to the computational cluster at the Laboratory of Genetics Biochemistry of UFMG for large computational analysis included in this study. The text has been grammatically reviewed using the AI-based software, ChatGPT 4.0. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
A32Packaging ATPase
AAIAverage Amino acid Identity
ANIAverage Nucleotide Identity
ATCV-1Acanthocystis turfacea Chlorella virus 1
CDSCoding Sequence
COGsCluster of Orthologous Groups of genes
dsDouble-stranded
ICTVInternational Committee on Taxonomy of Viruses
kbpKilobase pair
MCPMajor Capsid Protein
NCBINational Center for Biotechnology Information
NCVOGNucleocytoviricota Orthologous Groups of genes
ORFsOpen Reading Frames
PBCV-1Paramecium bursaria Chlorella virus 1
PolBDNA polymerase B family
TFIIBTranscription Factor IIB
TopoIIDNA Topoisomerase II
VLTF3Poxvirus Late Transcription Factor 3

References

  1. Güemes, A.G.C.; Youle, M.; Cantú, V.A.; Felts, B.; Nulton, J.; Rohwer, F. Viruses as Winners in the Game of Life. Annu. Rev. Virol. 2016, 3, 197–214. [Google Scholar] [CrossRef]
  2. Schulz, F.; Abergel, C.; Woyke, T. Giant Virus Biology and Diversity in the Era of Genome-Resolved Metagenomics. Nat. Rev. Microbiol. 2022, 20, 721–736. [Google Scholar] [CrossRef] [PubMed]
  3. Aylward, F.O.; Moniruzzaman, M.; Ha, A.D.; Koonin, E.V. A Phylogenomic Framework for Charting the Diversity and Evolution of Giant Viruses. PLoS Biol. 2021, 19, e3001430. [Google Scholar] [CrossRef] [PubMed]
  4. Current ICTV Taxonomy Release | ICTV. Available online: https://ictv.global/taxonomy (accessed on 17 April 2025).
  5. Wilson, W.H.; Van Etten, J.L.; Allen, M.J. The Phycodnaviridae: The Story of How Tiny Giants Rule the World. In Lesser Known Large dsDNA Viruses; Van Etten, J.L., Ed.; Springer: Berlin, Heidelberg, 2009; pp. 1–42. ISBN 978-3-540-68618-7. [Google Scholar]
  6. Van Etten, J.L.; Dunigan, D.D.; Nagasaki, K.; Schroeder, D.C.; Grimsley, N.; Brussaard, C.P.D.; Nissimov, J.I. Phycodnaviruses (Phycodnaviridae). In Encyclopedia of Virology, 4th ed.; Bamford, D.H., Zuckerman, M., Eds.; Academic Press: Oxford, UK, 2021; pp. 687–695. ISBN 978-0-12-814516-6. [Google Scholar]
  7. Van Etten, J.L.; Burbank, D.E.; Kuczmarski, D.; Meints, R.H. Virus Infection of Culturable Chlorella-Like Algae and Development of a Plaque Assay. Science 1983, 219, 994–996. [Google Scholar] [CrossRef] [PubMed]
  8. Van Etten, J.L.; Meints, R.H. Giant Viruses Infecting Algae. Annu. Rev. Microbiol. 1999, 53, 447–494. [Google Scholar] [CrossRef]
  9. Van Etten, J.L.; Agarkova, I.V.; Dunigan, D.D. Chloroviruses. Viruses 2020, 12, 20. [Google Scholar] [CrossRef]
  10. Carvalho, J.V.R.P.; Carlson, R.M.; Ghosh, J.; Queiroz, V.F.; de Oliveira, E.G.; Botelho, B.B.; Filho, C.A.C.; Agarkova, I.V.; McClung, O.W.; Van Etten, J.L.; et al. Genomics and Evolutionary Analysis of Chlorella Variabilis-Infecting Viruses Demarcate Criteria for Defining Species of Giant Viruses. J. Virol. 2024, 98, e0036124. [Google Scholar] [CrossRef]
  11. Henriques, L.R.; Botelho, B.B.F.; Carlson, R.M.; Carvalho, J.V.R.P.; Oliveira, E.G.; Agarkova, I.V.; Van Etten, J.L.; Dunigan, D.D.; Rodrigues, R.A.L. Revealing the Hidden Diversity of Chlorella Heliozoae-Infecting Giant Viruses. npj Viruses 2025, 3, 12. [Google Scholar] [CrossRef]
  12. Fitzgerald, L.A.; Graves, M.V.; Li, X.; Feldblyum, T.; Hartigan, J.; Van Etten, J.L. Sequence and Annotation of the 314-Kb MT325 and the 321-Kb FR483 Viruses That Infect Chlorella Pbi. Virology 2007, 358, 459–471. [Google Scholar] [CrossRef]
  13. Jeanniard, A.; Dunigan, D.D.; Gurnon, J.R.; Agarkova, I.V.; Kang, M.; Vitek, J.; Duncan, G.; McClung, O.W.; Larsen, M.; Claverie, J.-M.; et al. Towards Defining the Chloroviruses: A Genomic Journey through a Genus of Large DNA Viruses. BMC Genom. 2013, 14, 158. [Google Scholar] [CrossRef]
  14. Speciale, I.; Agarkova, I.; Duncan, G.A.; Van Etten, J.L.; De Castro, C. Structure of the N-Glycans from the Chlorovirus NE-JV-1. Antonie van Leeuwenhoek 2017, 110, 1391–1399. [Google Scholar] [CrossRef]
  15. Reisser, W.; Becket, B.; Klein, T. Studies on ultrastructure and host range of a Chlorella attacking virus. Protoplasma 1986, 135, 162–165. [Google Scholar] [CrossRef]
  16. Quispe, C.F.; Esmael, A.; Sonderman, O.; McQuinn, M.; Agarkova, I.; Battah, M.; Duncan, G.A.; Dunigan, D.D.; Smith, T.P.L.; De Castro, C.; et al. Characterization of a New Chlorovirus Type with Permissive and Non-Permissive Features on Phylogenetically Related Algal Strains. Virology 2017, 500, 103–113. [Google Scholar] [CrossRef] [PubMed]
  17. Agarkova, I.V.; Lane, L.C.; Dunigan, D.D.; Quispe, C.F.; Duncan, G.A.; Milrot, E.; Minsky, A.; Esmael, A.; Ghosh, J.S.; Van Etten, J.L. Identification of a Chlorovirus PBCV-1 Protein Involved in Degrading the Host Cell Wall during Virus Infection. Viruses 2021, 13, 782. [Google Scholar] [CrossRef] [PubMed]
  18. Koren, S.; Walenz, B.P.; Berlin, K.; Miller, J.R.; Bergman, N.H.; Phillippy, A.M. Canu: Scalable and Accurate Long-Read Assembly via Adaptive k-Mer Weighting and Repeat Separation. Genome Res. 2017, 27, 722–736. [Google Scholar] [CrossRef] [PubMed]
  19. Besemer, J.; Lomsadze, A.; Borodovsky, M. GeneMarkS: A Self-Training Method for Prediction of Gene Starts in Microbial Genomes. Implications for Finding Sequence Motifs in Regulatory Regions. Nucleic Acids Res. 2001, 29, 2607–2618. [Google Scholar] [CrossRef]
  20. Laslett, D.; Canback, B. ARAGORN, a Program to Detect tRNA Genes and tmRNA Genes in Nucleotide Sequences. Nucleic Acids Res. 2004, 32, 11–16. [Google Scholar] [CrossRef] [PubMed]
  21. Lowe, T.M.; Chan, P.P. tRNAscan-SE On-Line: Integrating Search and Context for Analysis of Transfer RNA Genes. Nucleic Acids Res. 2016, 44, W54–W57. [Google Scholar] [CrossRef] [PubMed]
  22. Chan, P.P.; Lowe, T.M. tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences. In Gene Prediction: Methods and Protocols; Kollmar, M., Ed.; Methods in Molecular Biology; Springer: New York, NY, USA, 2019; pp. 1–14. ISBN 978-1-4939-9173-0. [Google Scholar]
  23. Buchfink, B.; Reuter, K.; Drost, H.-G. Sensitive Protein Alignments at Tree-of-Life Scale Using DIAMOND. Nat. Methods 2021, 18, 366–368. [Google Scholar] [CrossRef]
  24. Zimmermann, L.; Stephens, A.; Nam, S.-Z.; Rau, D.; Kübler, J.; Lozajic, M.; Gabler, F.; Söding, J.; Lupas, A.N.; Alva, V. A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at Its Core. J. Mol. Biol. 2018, 430, 2237–2243. [Google Scholar] [CrossRef]
  25. InterProScan 5: Genome-Scale Protein Function Classification | Bioinformatics | Oxford Academic. Available online: https://academic.oup.com/bioinformatics/article/30/9/1236/237988 (accessed on 16 April 2025).
  26. Yutin, N.; Wolf, Y.I.; Raoult, D.; Koonin, E.V. Eukaryotic Large Nucleo-Cytoplasmic DNA Viruses: Clusters of Orthologous Genes and Reconstruction of Viral Genome Evolution. Virol. J. 2009, 6, 223. [Google Scholar] [CrossRef]
  27. Edgar, R.C. MUSCLE: Multiple Sequence Alignment with High Accuracy and High Throughput. Nucleic Acids Res. 2004, 32, 1792–1797. [Google Scholar] [CrossRef] [PubMed]
  28. Tamura, K.; Stecher, G.; Kumar, S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol. Biol. Evol. 2021, 38, 3022–3027. [Google Scholar] [CrossRef] [PubMed]
  29. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast Model Selection for Accurate Phylogenetic Estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef]
  30. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
  31. Letunic, I.; Bork, P. Interactive Tree of Life (iTOL) v6: Recent Updates to the Phylogenetic Tree Display and Annotation Tool. Nucleic Acids Res. 2024, 52, W78–W82. [Google Scholar] [CrossRef] [PubMed]
  32. Darling, A.C.E.; Mau, B.; Blattner, F.R.; Perna, N.T. Mauve: Multiple Alignment of Conserved Genomic Sequence With Rearrangements. Genome Res. 2004, 14, 1394–1403. [Google Scholar] [CrossRef] [PubMed]
  33. Nishimura, Y.; Yamada, K.; Okazaki, Y.; Ogata, H. DiGAlign: Versatile and Interactive Visualization of Sequence Alignment for Comparative Genomics. Microbes Environ. 2024, 39, ME2306. [Google Scholar] [CrossRef]
  34. Jain, C.; Rodriguez-R, L.M.; Phillippy, A.M.; Konstantinidis, K.T.; Aluru, S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat. Commun. 2018, 9, 5114. [Google Scholar] [CrossRef]
  35. Kim, D.; Park, S.; Chun, J. Introducing EzAAI: A pipeline for high throughput calculations of prokaryotic average amino acid identity. J. Microbiol. 2021, 59, 476–480, Erratum in: J. Microbiol. 2023, 61, 879. [Google Scholar] [CrossRef]
  36. R: The R Project for Statistical Computing. Available online: https://www.r-project.org/ (accessed on 13 November 2014).
  37. Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic Orthology Inference for Comparative Genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef]
  38. Bastian, M.; Heymann, S.; Jacomy, M. Gephi: An Open Source Software for Exploring and Manipulating Networks. Proc. Int. AAAI Conf. Web Soc. Media 2009, 3, 361–362. [Google Scholar] [CrossRef]
  39. Conway, J.R.; Lex, A.; Gehlenborg, N. UpSetR: An R Package for the Visualization of Intersecting Sets and Their Properties. Bioinformatics 2017, 33, 2938–2940. [Google Scholar] [CrossRef] [PubMed]
  40. Gu, Z.; Eils, R.; Schlesner, M. Complex Heatmaps Reveal Patterns and Correlations in Multidimensional Genomic Data. Bioinformatics 2016, 32, 2847–2849. [Google Scholar] [CrossRef]
  41. Rodrigues, R.A.L.; Queiroz, V.F.; Ghosh, J.; Dunigan, D.D.; Van Etten, J.L. Functional Genomic Analyses Reveal an Open Pan-Genome for the Chloroviruses and a Potential for Genetic Innovation in New Isolates. J. Virol. 2022, 96, e0136721. [Google Scholar] [CrossRef] [PubMed]
  42. Shao, Q.; Agarkova, I.V.; Noel, E.A.; Dunigan, D.D.; Liu, Y.; Wang, A.; Guo, M.; Xie, L.; Zhao, X.; Rossmann, M.G.; et al. Near-Atomic, Non-Icosahedrally Averaged Structure of Giant Virus Paramecium Bursaria Chlorella Virus 1. Nat. Commun. 2022, 13, 6476. [Google Scholar] [CrossRef]
  43. Pushkarev, A.; Inoue, K.; Larom, S.; Flores-Uribe, J.; Singh, M.; Konno, M.; Tomida, S.; Ito, S.; Nakamura, R.; Tsunoda, S.P.; et al. A distinct abundant group of microbial rhodopsins discovered using functional metagenomics. Nature 2018, 558, 595–599. [Google Scholar] [CrossRef]
  44. Duncan, G.A.; Dunigan, D.D.; Van Etten, J.L. Diversity of tRNA Clusters in the Chloroviruses. Viruses 2020, 12, 1173. [Google Scholar] [CrossRef]
  45. Aldeguer-Riquelme, B.; Conrad, R.E.; Antón, J.; Rossello-Mora, R.; Konstantinidis, K.T. A Natural ANI Gap That Can Define Intra-Species Units of Bacteriophages and Other Viruses. mBio 2024, 15, e0153624. [Google Scholar] [CrossRef]
  46. Meints, R.H.; Van Etten, J.L.; Kuczmarski, D.; Lee, K.; Ang, B. Viral Infection of the Symbiotic Chlorella-like Alga Present in Hydra Viridis. Virology 1981, 113, 698–703. [Google Scholar] [CrossRef]
  47. Meints, R.H.; Lee, K.; Burbank, D.E.; Van Etten, J.L. Infection of a Chlorella-like Alga with the Virus, PBCV-1: Ultrastructural Studies. Virology 1984, 138, 341–346. [Google Scholar] [CrossRef] [PubMed]
  48. Van Etten, J.L.; Gurnon, J.R.; Yanai-Balser, G.M.; Dunigan, D.D.; Graves, M.V. Chlorella Viruses Encode Most, If Not All, of the Machinery to Glycosylate Their Glycoproteins Independent of the Endoplasmic Reticulum and Golgi. Biochim. Biophys. Acta (BBA)-Gen. Subj. 2010, 1800, 152–159. [Google Scholar] [CrossRef]
  49. De Castro, C.; Molinaro, A.; Piacente, F.; Gurnon, J.R.; Sturiale, L.; Palmigiano, A.; Lanzetta, R.; Parrilli, M.; Garozzo, D.; Tonetti, M.G.; et al. Structure of N-Linked Oligosaccharides Attached to Chlorovirus PBCV-1 Major Capsid Protein Reveals Unusual Class of Complex N-Glycans. Proc. Natl. Acad. Sci. USA 2013, 110, 13956–13960. [Google Scholar] [CrossRef] [PubMed]
  50. De Castro, C.; Klose, T.; Speciale, I.; Lanzetta, R.; Molinaro, A.; Van Etten, J.L.; Rossmann, M.G. Structure of the Chlorovirus PBCV-1 Major Capsid Glycoprotein Determined by Combining Crystallographic and Carbohydrate Molecular Modeling Approaches. Proc. Natl. Acad. Sci. USA 2018, 115, E44–E52. [Google Scholar] [CrossRef] [PubMed]
  51. Speciale, I.; Laugieri, M.E.; Noel, E.; Lin, S.; Lowary, T.L.; Molinaro, A.; Duncan, G.A.; Agarkova, I.V.; Garozzo, D.; Tonetti, M.G.; et al. Chlorovirus PBCV-1 Protein A064R Has Three of the Transferase Activities Necessary to Synthesize Its Capsid Protein N-Linked Glycans. Proc. Natl. Acad. Sci. USA 2020, 117, 28735–28742. [Google Scholar] [CrossRef]
  52. Speciale, I.; Notaro, A.; Abergel, C.; Lanzetta, R.; Lowary, T.L.; Molinaro, A.; Tonetti, M.; Van Etten, J.L.; De Castro, C. The Astounding World of Glycans from Giant Viruses. Chem. Rev. 2022, 122, 15717–15766. [Google Scholar] [CrossRef]
  53. Dunigan, D.D.; Agarkova, I.V.; Esmael, A.; Alvarez, S.; Van Etten, J.L. Early-Phase Drive to the Precursor Pool: Chloroviruses Dive into the Deep End of Nucleotide Metabolism. Viruses 2023, 15, 911. [Google Scholar] [CrossRef]
  54. Bubeck, J.A.; Pfitzner, A.J.P. Isolation and Characterization of a New Type of Chlorovirus That Infects an Endosymbiotic Chlorella Strain of the Heliozoon Acanthocystis Turfacea. J. Gen. Virol. 2005, 86, 2871–2877. [Google Scholar] [CrossRef]
  55. Hososhima, S.; Mizutori, R.; Abe-Yoshizumi, R.; Rozenberg, A.; Shigemura, S.; Pushkarev, A.; Konno, M.; Katayama, K.; Inoue, K.; Tsunoda, S.P.; et al. Proton-transporting heliorhodopsins from marine giant viruses. eLife 2022, 6, e78416. [Google Scholar] [CrossRef]
  56. Bornemann, C.; Follmann, H. Large-scale production and plaque titration of European Chlorella viruses. J. Virol. Methods 1997, 67, 119–125. [Google Scholar] [CrossRef]
  57. Code | ICTV. Available online: https://ictv.global/about/code (accessed on 9 June 2025).
  58. Meng, L.; Delmont, T.O.; Gaïa, M.; Pelletier, E.; Fernàndez-Guerra, A.; Chaffron, S.; Neches, R.Y.; Wu, J.; Kaneko, H.; Endo, H.; et al. Genomic Adaptation of Giant Viruses in Polar Oceans. Nat. Commun. 2023, 14, 6233. [Google Scholar] [CrossRef] [PubMed]
  59. Rodriguez, J.F.; Esteban, M. Plaque Size Phenotype as a Selectable Marker to Generate Vaccinia Virus Recombinants. J. Virol. 1989, 63, 997–1001. [Google Scholar] [CrossRef] [PubMed]
  60. Kanno, T.; Mackay, D.; Inoue, T.; Wilsden, G.; Yamakawa, M.; Yamazoe, R.; Yamaguchi, S.; Shirai, J.; Kitching, P.; Murakami, Y. Mapping the Genetic Determinants of Pathogenicity and Plaque Phenotype in Swine Vesicular Disease Virus. J. Virol. 1999, 73, 2710–2716. [Google Scholar] [CrossRef] [PubMed]
  61. Olson, V.A.; Karem, K.L.; Smith, S.K.; Hughes, C.M.; Damon, I.K. Smallpox Virus Plaque Phenotypes: Genetic, Geographical and Case Fatality Relationships. J. Gen. Virol. 2009, 90, 792–798. [Google Scholar] [CrossRef]
  62. Medaglia, M.L.G.; Moussatché, N.; Nitsche, A.; Dabrowski, P.W.; Li, Y.; Damon, I.K.; Lucas, C.G.O.; Arruda, L.B.; Damaso, C.R. Genomic Analysis, Phenotype, and Virulence of the Historical Brazilian Smallpox Vaccine Strain IOC: Implications for the Origins and Evolutionary Relationships of Vaccinia Virus. J. Virol. 2015, 89, 11909–11925. [Google Scholar] [CrossRef] [PubMed]
  63. Trindade, G.S.; Emerson, G.L.; Carroll, D.S.; Kroon, E.G.; Damon, I.K. Brazilian Vaccinia Viruses and Their Origins. Emerg. Infect. Dis. 2007, 13, 965–972. [Google Scholar] [CrossRef] [PubMed]
  64. Mandary, M.B.; Masomian, M.; Ong, S.-K.; Poh, C.L. Characterization of Plaque Variants and the Involvement of Quasi-Species in a Population of EV-A71. Viruses 2020, 12, 651. [Google Scholar] [CrossRef]
  65. Furtado, N.D.; Gómez, M.M.; de Mello, I.S.; Fernandes, D.R.; Bonaldo, M.C. Phenotypic and Genetic Studies of the Viral Lineage Associated with the Recent Yellow Fever Outbreak in Brazil. Viruses 2022, 14, 1818. [Google Scholar] [CrossRef]
Figure 1. Phylogenetics and genomic characteristics of betachloroviruses. Phylogenetic reconstruction of betachloroviruses. The phylogeny was made based on six concatenated genes (A32, PolB, SFII, TFIIB, Topo, VLTF3) considered the best hallmark genes for the phylum Nucleocytoviricota. New isolates are represented in pink labels, while already known isolates are in black. ATCV-1, a gammachlorovirus, was used as an outgroup. Only bootstrap values above 80 are represented. The tree scale refers to the substitution rate of amino acids. Colored bars indicate the different genomic elements found in the virus isolates.
Figure 1. Phylogenetics and genomic characteristics of betachloroviruses. Phylogenetic reconstruction of betachloroviruses. The phylogeny was made based on six concatenated genes (A32, PolB, SFII, TFIIB, Topo, VLTF3) considered the best hallmark genes for the phylum Nucleocytoviricota. New isolates are represented in pink labels, while already known isolates are in black. ATCV-1, a gammachlorovirus, was used as an outgroup. Only bootstrap values above 80 are represented. The tree scale refers to the substitution rate of amino acids. Colored bars indicate the different genomic elements found in the virus isolates.
Viruses 17 01096 g001
Figure 2. Genomic content of betachloroviruses. (A) Genes commonly found in betachloroviruses, separated based on functional categories. Only five of the 24 known minor capsid proteins of chloroviruses (P1 to P5) are represented in the graph, although homologs of other genes have been found in different isolates. Three different glycosyltransferase homologs were included to highlight possible distinct glycosylation machinery/pattern expressed by each isolate; (B) tRNA diversity in betachloroviruses. The number of anticodons is indicated in the respective square of each isolate. The scale bar indicates the range of tRNA counts in the isolate.
Figure 2. Genomic content of betachloroviruses. (A) Genes commonly found in betachloroviruses, separated based on functional categories. Only five of the 24 known minor capsid proteins of chloroviruses (P1 to P5) are represented in the graph, although homologs of other genes have been found in different isolates. Three different glycosyltransferase homologs were included to highlight possible distinct glycosylation machinery/pattern expressed by each isolate; (B) tRNA diversity in betachloroviruses. The number of anticodons is indicated in the respective square of each isolate. The scale bar indicates the range of tRNA counts in the isolate.
Viruses 17 01096 g002
Figure 3. Genome synteny of betachloroviruses. New and old isolates are included in the analyses, indicating conserved collinearity among the genomes, except the NE-JV-1 and NE-P-6-s. The isolates of each cluster were chosen randomly for this analysis. Inversions in some genomic regions are observed. The color grade represents BLASTn identity percentage.
Figure 3. Genome synteny of betachloroviruses. New and old isolates are included in the analyses, indicating conserved collinearity among the genomes, except the NE-JV-1 and NE-P-6-s. The isolates of each cluster were chosen randomly for this analysis. Inversions in some genomic regions are observed. The color grade represents BLASTn identity percentage.
Viruses 17 01096 g003
Figure 4. Betachloroviruses are clustered into three species. Hierarchical cluster analysis reveals three distinctive groups in the ‘Betachlorovirus’ subgenus. The grouping reflects a 94% cutoff value for species designation. We refer to these groups as species I, II and III. The scale bar indicates the range of ANI values.
Figure 4. Betachloroviruses are clustered into three species. Hierarchical cluster analysis reveals three distinctive groups in the ‘Betachlorovirus’ subgenus. The grouping reflects a 94% cutoff value for species designation. We refer to these groups as species I, II and III. The scale bar indicates the range of ANI values.
Viruses 17 01096 g004
Figure 5. Pan-genome and COG-sharing pattern in betachloroviruses. (A) Pan-genome evolution of betachloroviruses. Pan-genome and core-genome are represented by circles and triangles, respectively; (B) Upset plot of COG-sharing considering the three betachloroviruses species. The number of COGs found per species is indicated in the horizontal bars. Vertical bars indicate the number of COGs shared between the species; (C) Bipartite network graph representing the COG-sharing pattern of betachloroviruses. Colored nodes represent the isolates, while nodes in the grayscale represent the COGs. White nodes are the core genome, gray nodes represent the satellite genome, and dark gray nodes stand for singletons. In all images, each species is represented by a different color: species I, purple; species II, yellow; species III, blue.
Figure 5. Pan-genome and COG-sharing pattern in betachloroviruses. (A) Pan-genome evolution of betachloroviruses. Pan-genome and core-genome are represented by circles and triangles, respectively; (B) Upset plot of COG-sharing considering the three betachloroviruses species. The number of COGs found per species is indicated in the horizontal bars. Vertical bars indicate the number of COGs shared between the species; (C) Bipartite network graph representing the COG-sharing pattern of betachloroviruses. Colored nodes represent the isolates, while nodes in the grayscale represent the COGs. White nodes are the core genome, gray nodes represent the satellite genome, and dark gray nodes stand for singletons. In all images, each species is represented by a different color: species I, purple; species II, yellow; species III, blue.
Viruses 17 01096 g005
Figure 6. Betachloroviruses genomovars exhibit different plaque phenotypes. (A) ANI matrix heatmap of betachloroviruses sharing ANI >99%, grouped by species and genomovars; (B) Schematic representation of different plaque phenotypes of betachloroviruses. Plaques can be classified by size as small (<2 mm), medium (2–4 mm), and large (>4 mm), with regular (continuous line) or irregular (discontinuous line) edges, and the clarity of plaque morphology, being regular (white plaque) or cloudy (green plaque). Plaques also exhibit varying levels in the clarity of the plaque field, being clear (white plaque) or cloudy (green plaque), as well as differences in the definition at the plaque boundary, having a sharp plaque boundary (white inside border) or a soft plaque boundary (green inside boarder); (C) Plaque phenotypes of different isolates of betachloroviruses. All photos were taken after 12 days of infection. Black arrows indicate soft plaque boundaries in isolate P-NE-11.
Figure 6. Betachloroviruses genomovars exhibit different plaque phenotypes. (A) ANI matrix heatmap of betachloroviruses sharing ANI >99%, grouped by species and genomovars; (B) Schematic representation of different plaque phenotypes of betachloroviruses. Plaques can be classified by size as small (<2 mm), medium (2–4 mm), and large (>4 mm), with regular (continuous line) or irregular (discontinuous line) edges, and the clarity of plaque morphology, being regular (white plaque) or cloudy (green plaque). Plaques also exhibit varying levels in the clarity of the plaque field, being clear (white plaque) or cloudy (green plaque), as well as differences in the definition at the plaque boundary, having a sharp plaque boundary (white inside border) or a soft plaque boundary (green inside boarder); (C) Plaque phenotypes of different isolates of betachloroviruses. All photos were taken after 12 days of infection. Black arrows indicate soft plaque boundaries in isolate P-NE-11.
Viruses 17 01096 g006
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Souza, J.W.; Henriques, L.R.; Carlson, R.M.; Botelho, B.B.F.; Carvalho, J.V.R.P.; Santos, J.P.N.; Aguiar, E.R.G.R.; Agarkova, I.V.; Van Etten, J.L.; Dunigan, D.D.; et al. New Isolates of Betachloroviruses Shed Light on the Diversity and Biological Complexity of an Unexplored Group of Giant Algal Viruses. Viruses 2025, 17, 1096. https://doi.org/10.3390/v17081096

AMA Style

Souza JW, Henriques LR, Carlson RM, Botelho BBF, Carvalho JVRP, Santos JPN, Aguiar ERGR, Agarkova IV, Van Etten JL, Dunigan DD, et al. New Isolates of Betachloroviruses Shed Light on the Diversity and Biological Complexity of an Unexplored Group of Giant Algal Viruses. Viruses. 2025; 17(8):1096. https://doi.org/10.3390/v17081096

Chicago/Turabian Style

Souza, Júlia W., Lethícia R. Henriques, Roger M. Carlson, Bruna B. F. Botelho, João Victor R. P. Carvalho, João Pedro N. Santos, Eric R. G. R. Aguiar, Irina V. Agarkova, James L. Van Etten, David D. Dunigan, and et al. 2025. "New Isolates of Betachloroviruses Shed Light on the Diversity and Biological Complexity of an Unexplored Group of Giant Algal Viruses" Viruses 17, no. 8: 1096. https://doi.org/10.3390/v17081096

APA Style

Souza, J. W., Henriques, L. R., Carlson, R. M., Botelho, B. B. F., Carvalho, J. V. R. P., Santos, J. P. N., Aguiar, E. R. G. R., Agarkova, I. V., Van Etten, J. L., Dunigan, D. D., & Rodrigues, R. A. L. (2025). New Isolates of Betachloroviruses Shed Light on the Diversity and Biological Complexity of an Unexplored Group of Giant Algal Viruses. Viruses, 17(8), 1096. https://doi.org/10.3390/v17081096

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop