Genomic Characterisation of an Isolate of Brassica Yellows Virus Associated with Brassica Weed in Tasmania

Brassica yellows virus (BrYV), a tentative species in the genus Polerovirus, of the Solemoviridae family, is a phloem-restricted and aphid-transmitted virus with at least three genotypes (A, B, and C). It has been found across mainland China, South Korea, and Japan. BrYV was previously undescribed in Tasmania, and its genetic variability in the state remains unknown. Here, we describe a near-complete genome sequence of BrYV (genotype A) isolated from Raphanus raphanistrum in Tasmania using next-generation sequencing and sanger sequencing of RT-PCR products. BrYV-Tas (GenBank Accession no. OM469309) possesses a genome of 5516 nucleotides (nt) and shares higher sequence identity (about 90%) with other BrYV isolates. Phylogenetic analyses showed variability in the clustering patterns of the individual genes of BrYV-Tas. Recombination analysis revealed beginning and ending breakpoints at nucleotide positions 1922 to 5234 nt, with the BrYV isolate LC428359 and BrYV isolate KY310572 identified as major and minor parents, respectively. Results of the evolutionary analysis showed that the majority of the codons for each gene are evolving under purifying selection, though a few codons were also detected to have positive selection pressure. Taken together, our findings will facilitate an understanding of the evolutionary dynamics and genetic diversity of BrYV.

The full-length genome of BrYV consists of a 5.6-5.7 kb long positive-sense singlestranded RNA (+ssRNA) which contains six open reading frames (ORFs), a short untranslated region (UTR) at 5', an intergenic non-coding region (NCR) spanning over ORF2 and ORF3, and a UTR at the 3' end that lacks a poly (A) tail or tRNA-like structure. The genomic RNA is packed into a spherical-shaped virion of 25-30 nm diameter. Three unique genotypes, BrYV-A, BrYV-B, and BrYV-C, with genome sizes of 5666, 5666, and 5678 nt, respectively, have been described [2,6]. Additionally, a recombinant strain (BrYV-CS) has been reported in Chinese cabbage in South Korea [3]. Like typical poleroviruses, ORFs 0-2 are translated from genomic RNA, while ORFs 3-5 are translated from sub-genomic RNA ( Figure 1). ORF0 expresses the P0 protein that functions as a determinant of host range and symptoms and acts as a silencing suppressor. The fusion of ORF1 and ORF2 encodes the P1-2 fusion protein or replication-associated protein (RepA). ORF3 encodes the major P3 or coat protein (CP) which sits in-frame with ORF5. The P3-5 overlapped translation results in the expression of a read-through protein (P5) which resides on the external surface of the mature virion and is associated with the long-distance movement and aphid transmission of the virus [7]. The ORF4 encodes the P4 protein, which plays an important role in virus systemic movement within infected plants ( Figure 1).
Plants 2022, 10, x FOR PEER REVIEW 2 of 15 untranslated region (UTR) at 5', an intergenic non-coding region (NCR) spanning over ORF2 and ORF3, and a UTR at the 3' end that lacks a poly (A) tail or tRNA-like structure. The genomic RNA is packed into a spherical-shaped virion of 25-30 nm diameter. Three unique genotypes, BrYV-A, BrYV-B, and BrYV-C, with genome sizes of 5666, 5666, and 5678 nt, respectively, have been described [2,6]. Additionally, a recombinant strain (BrYV-CS) has been reported in Chinese cabbage in South Korea [3]. Like typical poleroviruses, ORFs 0-2 are translated from genomic RNA, while ORFs 3-5 are translated from subgenomic RNA (Figure 1). ORF0 expresses the P0 protein that functions as a determinant of host range and symptoms and acts as a silencing suppressor. The fusion of ORF1 and ORF2 encodes the P1-2 fusion protein or replication-associated protein (RepA). ORF3 encodes the major P3 or coat protein (CP) which sits in-frame with ORF5. The P3-5 overlapped translation results in the expression of a read-through protein (P5) which resides on the external surface of the mature virion and is associated with the long-distance movement and aphid transmission of the virus [7]. The ORF4 encodes the P4 protein, which plays an important role in virus systemic movement within infected plants ( Figure 1). A recent study in Australia demonstrated that the highest genomic variability in BrYV, and its closely relative Turnip yellows virus (TuYV), occured in ORF5 [5]. Based on phylogenetic and recombination analyses, this study proposed that BrYV and TuYV could be regarded as variants of the same species. However, a number of other studies have examined the genetic variation across other ORFs which differentiate BrYV from TuYV [2,3,6].
RNA recombination predominantly drives genetic diversity and plays a crucial role in the emergence, evolution, and epidemiology of the plant-infecting RNA viruses. It appears to be the main mechanism behind the formation of new and more pathogenic viral strains/species [8,9]. The genera Polerovirus and Enamovirus of the Solemoviridae family (previously in the family Luteoviridae) [10] are known to be commonly associated with recombination events [9]. Based on the phylogenetic analysis, several studies have demonstrated that self-recombination among poleroviruses or recombination with other luteoviruses has led to the observed evolvution of poleroviruses [11][12][13].
Recently, complete genome sequencing of different plant viruses has expanded our understanding of their molecular characterisation, genetic diversity, and evolutionary relationships [14][15][16]. Here, we took advantage of next-generation sequencing (NGS) to obtain a near-complete genome sequence of a BrYV isolate infecting R. raphanistrum in Tasmania. We employed RT-PCR and multiplex RT-PCR to fill sequence gaps and confirm the BrYV genotype, respectively. Furthermore, we performed phylogenetic and recombination detection analyses to improve our understanding of the existing genetic variability and evolutionary relatedness between the newly described isolate and other globally A recent study in Australia demonstrated that the highest genomic variability in BrYV, and its closely relative Turnip yellows virus (TuYV), occured in ORF5 [5]. Based on phylogenetic and recombination analyses, this study proposed that BrYV and TuYV could be regarded as variants of the same species. However, a number of other studies have examined the genetic variation across other ORFs which differentiate BrYV from TuYV [2,3,6].
RNA recombination predominantly drives genetic diversity and plays a crucial role in the emergence, evolution, and epidemiology of the plant-infecting RNA viruses. It appears to be the main mechanism behind the formation of new and more pathogenic viral strains/species [8,9]. The genera Polerovirus and Enamovirus of the Solemoviridae family (previously in the family Luteoviridae) [10] are known to be commonly associated with recombination events [9]. Based on the phylogenetic analysis, several studies have demonstrated that self-recombination among poleroviruses or recombination with other luteoviruses has led to the observed evolvution of poleroviruses [11][12][13].
Recently, complete genome sequencing of different plant viruses has expanded our understanding of their molecular characterisation, genetic diversity, and evolutionary relationships [14][15][16]. Here, we took advantage of next-generation sequencing (NGS) to obtain a near-complete genome sequence of a BrYV isolate infecting R. raphanistrum in Tasmania. We employed RT-PCR and multiplex RT-PCR to fill sequence gaps and confirm the BrYV genotype, respectively. Furthermore, we performed phylogenetic and recombination detection analyses to improve our understanding of the existing genetic variability and evolutionary relatedness between the newly described isolate and other globally reported BrYV isolates. The findings of our study will provide valuable information regarding

NGS Analysis
In total, 22,351,000 raw reads were obtained with Illumina sequencing for the tested sample. After quality trimming using BBDuK, these numbers were reduced to 22,228,228. De novo assembly generated sixteen contigs (ranged 3995-121 bp with N50 = 511 bp), produced by 670 viral reads. A BLASTn/BLASTx search of the GenBank database found a viral contig of~3995 bp in sizes corresponding to the BrYV isolate "TO3" (Accession no. LC428360), sharing 96.68% nucleotide similarity.

Validation of NGS Results and Amplification of Missing Genome Sequence
Since the tested sample was a composite from five different plants, this necessitated further testing of individual leaves to determine the identification of the virus source. BrYV infection was confirmed in one R. raphanistrum sample by multiplex RT-PCR. Multiplex RT-PCR gave a amplicon of 277 bp which also revealed that the BrYV-Tas isolate belongs to genotype-A. To amplify the missing parts of the BrYV-Tas genome (i.e., 5 UTR, P0, and part of P1-P2 towards 5 end), RT-PCR assays were conducted with primers BrY-Ntab001F (+)/TuYVOrf0R and primer BrY-Ntab724F(+)/BrY-Ntab2064R(-) ( Table 1), using cycling conditions, as previously described [17] with products sequenced. Consequently, the entire genome sequence of the BrYV Tasmanian isolate was determined, except for 138 bp at the termini of the 3 UTR (GenBank Accession no. OM469309).

Multiple Sequence Alignment and Phylogenetic Analysis
The genomic sequence obtained for BrYV-Tas was 5516 nt in length and had a typical BrYV genomic organisation, with six ORFs (ORF0−ORF5), a 31 nt 5 -UTR, and an incomplete 47 nt at 3 -UTR. ORF0 spans nt 32 to 781 and is predicted to encode the P0 suppressor. ORF1 stretches between nt 174 to nt 1967 and encodes the P1 protein. ORF2 (nt 1853-3268) is assumed to be translated into a P1-P2 fusion protein (RdRp) via a −1 ribosomal frameshift. GGGAAAG at nt 1617 to 1623 is the shifty sequence. ORF3 starts at nt 3460 to 4068, encoding a putative coat protein (CP). ORF4 starts at nt 3491 to 4018 and codes for a putative P4 protein by a leaky scanning. ORF5 (from nt 4255 to 5469) is translated into the P3-P5 fusion protein via a read-through strategy, which is thought to play a key role in virus transmission ( Figure 1).
Sequence homology and phylogenetic analysis based on the full genome indicated that the BrYV-Tas shares higher sequence identities (94.4%) with two Japanese BrYV isolates LC428360 and LC428361 and they cluster together in the same clade (II) (Figure 2A (Table S1). These Japanese isolates were found from B. rapa and B. napus hosts, respectively. The lowest homology percentage (81.3%) was observed for an isolate from China (MK057527) infecting tobacco. BrYV-Tas shared only 78.2% of nucleotide similarity with the closest TuYV isolate genomic sequence available on GenBank (NC_003743) (Table S1).

Nucleotide Diversity and Haplotype Variability Indices
We analysed all datasets comprising BrYV full genomes and individual genes to compare the standing molecular diversity among viral populations arising from different geographical locations. The diversity determined for the whole genome analysis of BrYV was S = 1477, Eta = 1684, π = 0.06883, Hd = 0.9950, θw = 0.07683, Tajima's D = −0.89807 ( Figure 4A and Table 3).
Genetic diversity analysis for BrYV individual genes also revealed that all genes were variable with high numbers of polymorphisms and polymorphic sites, very high haplotype diversity, but low nucleotide diversity ( Figure 4A and Table 3). Aside from P3 (S = 87, Eta = 94, π = 0.03297, h = 0.989) and P4 (S = 72, Eta = 78, π = 0.03147, Hd = 0.932), each gene indicated high diversity and had a high proportion of haplotypes. P5 with Eta = 578, π = 0.07522, and Hd = 0.995 appeared to be the most genetically diverse gene that is three times more diverse than that of the conserved P3/P4 region of the BrYV genome (Table 3). Following P5, P1 (S = 451, Eta = 528, π = 0.08736, Hd = 0.995) appeared to be the gene with next greatest diversity (possibly linked to the genetically diverse region within P1) surpassing P0 (S = 159, Eta = 171, π = 0.06490, Hd = 0.995), previously regarded as the next most variable gene. Despite the fact that the polymorphic sites varied between 14 and 38% for each gene sequence, haplotype diversity was high, ranging from 93 to 100%, indicating that the number of unique sequences with different combinations of these polymorphic sites was high, resulting in greater diversity within the BrYV population (Table 3). Likewise, we also compared the genetic diversity between two genes encoding fusion proteins i.e., P1-P2 and P3-P5. Despite the existence of discrepancies between gene sizes, we were able to calculate average pairwise nucleotide diversities (π) for the aforementioned datasets ( Figure 4A and Table 3).

Nucleotide Diversity and Haplotype Variability Indices
We analysed all datasets comprising BrYV full genomes and individual genes to compare the standing molecular diversity among viral populations arising from different geographical locations. The diversity determined for the whole genome analysis of BrYV was S = 1477, Eta = 1684, π = 0.06883, H d = 0.9950, θw = 0.07683, Tajima's D = −0.89807 ( Figure 4A and Table 3). 3). The segregation rate exhibited by the overall BrYV population was θw = 0.0768. We also determined Tajima's D values in the BrYV populations and individual genes which appeared to be negative, ranging from −2.075 to −0.251, indicating the presence of excessive polymorphic sites among these isolates ( Figure 4C and Table 3). Although the P0 and P1 displayed slightly positive Tajima's D values (0.0408 and 0.2512, respectively), this observation remained statistically non-significant.    Genetic diversity analysis for BrYV individual genes also revealed that all genes were variable with high numbers of polymorphisms and polymorphic sites, very high haplotype diversity, but low nucleotide diversity ( Figure 4A and Table 3). Aside from P3 (S = 87, Eta = 94, π = 0.03297, h = 0.989) and P4 (S = 72, Eta = 78, π = 0.03147, H d = 0.932), each gene indicated high diversity and had a high proportion of haplotypes. P5 with Eta = 578, π = 0.07522, and H d = 0.995 appeared to be the most genetically diverse gene that is three times more diverse than that of the conserved P3/P4 region of the BrYV genome (Table 3). Following P5, P1 (S = 451, Eta = 528, π = 0.08736, H d = 0.995) appeared to be the gene with next greatest diversity (possibly linked to the genetically diverse region within P1) surpassing P0 (S = 159, Eta = 171, π = 0.06490, H d = 0.995), previously regarded as the next most variable gene. Despite the fact that the polymorphic sites varied between 14 and 38% for each gene sequence, haplotype diversity was high, ranging from 93 to 100%, indicating that the number of unique sequences with different combinations of these polymorphic sites was high, resulting in greater diversity within the BrYV population (Table 3). Likewise, we also compared the genetic diversity between two genes encoding fusion proteins i.e., P1-P2 and P3-P5. Despite the existence of discrepancies between gene sizes, we were able to calculate average pairwise nucleotide diversities (π) for the aforementioned datasets ( Figure 4A and Table 3).

Detection of Positive and Negative Selection Sites
We compared the non-synonymous to synonymous substitutions (d N /d S ) for each gene to better understand the role of selection pressure on the genomic variation observed between the analysed datasets. DataMonkey-based analysis indicated that the genome of the BrYV population is being driven by purifying (negative) selection as the average dN/dS ratio of the BrYV genome remained <1 i.e., d N /d S = 0.902 ( Figure 5 and Table 4).
positive selection value of 1.548, the minimum value remained 1.063 and the maximum value was 4.401 (Table 4). P0 exhibited only nine positively selected sites with a value of 2.040, while this gene showed 201/249 negatively selected sites with 0.054 and 0.935 being the minimum and maximum values, respectively. P3 and P4 exhibited 11 positively selected sites, each with the ratio of the negatively selected sites being 122/202 and 127/176 for the P3 and P4 genes, respectively ( Figure 6 and Table 4).     Overall, 1167/1848 sites were detected with negative selection, with values ranging between 0.051 and 0.949. Meanwhile, the proportion of positively selected sites remained lower (501/1848) than the negatively selected sites. Likewise, all genes displayed a variable frequency of negative selection with the lowest average d N /d S value for P5 (0.371) and the highest average value for P1 (0.414) ( Figure 5 and Table 4). A total of 353/425 sites were found under negative selection in the P5 gene sequence with values ranging from 0.050 to 0.949. Positive selection was detected at a few sites (25/425) with values ranging between 1.058 and 2.778; however, the proportion remained much lower than for negatively selected sites ( Figure 6 and Table 4). highest average value for P1 (0.414) ( Figure 5 and Table 4). A total of 353/425 sites were found under negative selection in the P5 gene sequence with values ranging from 0.050 to 0.949. Positive selection was detected at a few sites (25/425) with values ranging between 1.058 and 2.778; however, the proportion remained much lower than for negatively selected sites ( Figure 6 and Table 4). P1, with 282/607 negatively selected sites, displayed a similar pattern, having the average negative selection value of 0.319, with the lowest value of 0.050, and the highest of 0.942. Meanwhile, 22 sites were found to be positively selected in P1. With an average positive selection value of 1.548, the minimum value remained 1.063 and the maximum value was 4.401 (Table 4). P0 exhibited only nine positively selected sites with a value of 2.040, while this gene showed 201/249 negatively selected sites with 0.054 and 0.935 being the minimum and maximum values, respectively. P3 and P4 exhibited 11 positively selected sites, each with the ratio of the negatively selected sites being 122/202 and 127/176 for the P3 and P4 genes, respectively ( Figure 6 and Table 4).     (Table 4). P0 exhibited only nine positively selected sites with a value of 2.040, while this gene showed 201/249 negatively selected sites with 0.054 and 0.935 being the minimum and maximum values, respectively. P3 and P4 exhibited 11 positively selected sites, each with the ratio of the negatively selected sites being 122/202 and 127/176 for the P3 and P4 genes, respectively ( Figure 6 and Table 4).

Discussion
The present study identified and genetically characterised an isolate of BrYV found infecting a R. raphanistrum host plant in Tasmania. Subsequent NGS data analysis revealed that this isolate of BrYV displayed a high percentage (96.68%) of nucleotide similarity with a BrYV isolate reported from Japan (LC428360). The NGS contig covered almost 70% of the genome of the BrYV reference sequence (NC_016038). A reconstructed genome, generated using NGS and RT-PCR, was 5516 bp long and had the typical organisation of BrYV. However, it was not possible to obtain the~138 bp of the 3 end of the genome, which includes the latter half of 3 UTR. BrYV isolates from China, Korea, and Japan [2,3,6,17,21] have shown genetic variation in their ORFs and have been subdivided further into three genotypes (BrYV-A, B, and C) according to sequence comparisons and phylogenetic analysis [2,6]. This suggests that BrYV genomes, like those of other poleroviruses, are more diverse within different regions of the genome than previously thought, and that more diversity can be discovered after analysing whole-genome sequences. Our findings corroborate these previous studies [19], and multiplex RT-PCR analysis revealed that the BrYV-Tas isolate belongs to the BrYV genotype-A. Further analyses involving full genome and individual genes sequences of BrYV-Tas and other BrYV and TuYV isolates revealed that the presence of variation and clustering patterns were variable among different genes.
It has recently been proposed that TuYV and BrYV should be considered members of the same species if they share at least 83% nucleotide sequence identity [5]. The authors suggested a species demarcation criteria for new poleroviruses should include less than 83% nt sequence identity across all entire coding regions of the genome, as well as any significant differences in the host range, vector, and serology. The RdRp and CP regions were proposed as important for species demarcation with an 89% aa sequence identity in both regions for the members of the same polerovirus species [22], and CP is the most conserved region within poleroviruses [23,24]. However, currently, the working criterion for the demarcation of polerovirus species remains unchanged in the most recent report of ICTV on virus taxonomy [10], which is based on differences in host range, failure to crossprotect, differences in serological reactions, and differences in the amino acid sequence of any gene product greater than 10% [10,25]. Our findings are consistent with the recent species demarcation standards by ICTV [10] as well as with the previous studies related to characterisation of BrYV [2,3,6], as all genes of BrYV-Tas isolates shared relatively low (less than 90%) amino acid similarity with TuYV (NC_003743) with the exception of P3 and P4 genes which shared over 90% amino acid similarity with both BrYV and TuYV (Tables S12 and S14). Notably, for BrYV-Tas the cistrons at the 3 end shared the least sequence identity (60.9% for P3-P5, and 43.2% for P5) with TuYV. Such difference in the P5 sequence between the BrYV and the TuYV isolates could be linked to variability in aphid transmission specificity or efficiency [26]. Based on the ICTV report, nucleotide sequence analysis, and phylogeny results our findings provide compelling evidence that BrYV-Tas is distinct from TuYV.
We have shown evidence of recombination in the genome of BrYV-Tas, with the analysis suggesting that the recombinant consisted of genomic parts from distinct BrYV isolates. Recombination events in RNA viruses have been well known [27][28][29], and both intraspecific, homologous and interspecific, non-homologous recombination are thought to be common and important in the evolution of poleroviruses. A few examples include BrYV [3], FBPV-1 [30], Cucurbit aphid-borne yellows virus (CABYV) [27,31], Sugarcane yellow leaf virus (ScYLV) [12,32], and Cotton leafroll dwarf virus (CLRDV) [33]. Our findings also suggest that recombination could play a role in the evolution of BrYV populations, as some of these genes, which are associated with compatible interactions, might be evolving under selection pressure from their hosts and/or vectors and tend to accumulate changes faster than other parts of the genome. Notably, mutations or recombination in genes can affect the biological functions of the viral proteins. However, how recombination could affect the biological functions associated with these genes remains a question worth studying in the future.
Our findings revealed that BrYV populations are evolving mainly under purifying selection pressure (Figures 5 and 6 and Table 4). To gain an in-depth understanding of this selection factor at the gene level, we opted to estimate d N /d S for P0, P1, P1-P2, P3, P3-P5, P4, and P5 genes of BrYV. While our results showed that the majority of the codons remained under negative selection for each gene with an average d N /d S ratio of <1 ( Figure 5 and Table 4), the overall contribution of negatively selected sites remained more than the positively selected sites ( Figure 6). Since all the sites had both positive and negative selection pressures, this provided strong evidence of heterogeneous selection pressures among codon sites. This is in accordance with a previous study that concluded that the genes of the TuYV population in the UK were evolving under negative selection pressure [34].
Future studies focusing on the patterns of BrYV geographical distribution, evolutionary dynamics, and functional analysis of the BrYV genes arising from genetic variations will help us to understand the host (plant/insect)-BrYV relationships and viral pathogenicity in the context of developing strategies for sustainable virus disease management.

Sample Collection and Nucleic Acid Extraction
In November 2019, ten samples of wild radish (R. raphanistrum) were randomly collected at a commercial vegetable farm in Deloraine as part of an extensive survey across Tasmania to determine the population diversity of Polerovirus species in pea crops and the weed species present in the vicinity of the pea crops. All leaf samples were grouped into lots of five and preserved (at −80 • C) for subsequent testing. Extraction of total RNA was carried out using PowerLyzer 24 homogeniser (Qiagen, Chadstone, VIC, Australia) and RNeasy Plant Mini Kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. Samples of 100 mg/lot were subject to RNA extraction to obtain sufficient yields of RNA for further processing and subsequent detection of viruses with low titer.

Next-Generation Sequencing (NGS) and Sequence Reads Analysis
Total RNA of the grouped sample 148 W was then sent to the Department of Jobs, Precincts and Regions, AgriBio, Bundoora, Victoria, for NGS. Libraries for RNA-Seq were prepared with a TruSeq Stranded Total RNA Sample Preparation Kit with Ribozero following the manufacturer's instructions. To determine the size distribution and concentration of the RNA-Seq libraries, 2200 TapeStation ® system (Agilent Technologies, Santa Clara, CA, USA) and Qubit ® Fluorometer 2.0 (Invitrogen) were used, respectively. Afterwards, the Illumina NovaSeq was employed to sequence these libraries with a paired read length of 2 × 151 bp.
Trimming of the primer and adaptor sequences from raw reads was carried out using Geneious Prime 2021.2 (Biomatters Ltd., Auckland, New Zealand). For this purpose, the BBDuK plugin (provided in the BBTools package) was employed [35]. After that, the Velvet plugin (in Geneious Prime software) with default settings was used to perform de novo assembly. Then, a length-based sorting of the obtained contigs was carried out and, subsequently, blast (BLASTn and BLASTx) analysed against the GenBank database. Simultaneously, the mapping of contigs and reads to a collection of polerovirus reference genomes (available in GenBank) was carried out and results were also analysed in Geneious Prime.

Validation of NGS Results and Amplification of BrYV Genome
To synthesize the cDNA, an iScript Reverse Transcription Supermix Kit (BioRad Bio-Rad, Hercules, CA, USA) was used according to the provided instructions. The reaction mixture of 20 µL contained total RNA (2.5 µL), 5 × RT Supermix (4 µL), diethyl pyrocarbonate (DEPC), and treated water to make up the volume. The reaction was then incubated for priming at 22 • C (5 min), reverse transcription at 46 • C (20 min), and finally enzyme inactivation at 95 • C (1 min).
Several overlapping primer sets [17,18] were used to amplify the missing part of the virus genome, with reference to previously reported sequences of BrYV and TuYV (Table 1). This was done by RT-PCR-based amplification of the target sequence. The details, including primer sequences, amplicon size, target, and thermocycling conditions, are provided in Table 1. The assays were performed as follows: 1 × HotStarTaq Master Mix (Qiagen), 10 µM of each primer pair, 1 µL cDNA template, and made up to a total volume of 20 µL with sterile distilled water. Amplicon of appropriate sizes were then excised and purified using a QIAquick Gel Extraction Kit Protocol (Qiagen). Elution was carried out using a 30 µL elution buffer (10 mM Tris HCl, pH 8.5) and sent for bidirectional (sanger) sequencing by the Central Science Laboratories (CSL), University of Tasmania, Australia. The near full-length genomic sequence of BrYV Tasmanian isolates (BrYV-Tas) was then deposited in the NCBI GenBank under accession number OM469309. In addition, multiplex RT-PCR with specific primer sets (Table 1) was also performed to determine the genotype of BrYV, as described previously [19].

Multiple Sequence Alignment and Phylogenetic Analysis
To prepare multiple sequence alignments, the near-full-length genome sequence of BrYV-Tas was aligned with the full genome sequences of all reported BrYV isolates and one TuYV isolate (NC_003743), available in the GenBank (Table 2), using the MUSCLE option in Geneious Prime 2021.2. Similarly, all genes including P0, P1, P1-P2, P3, P3-P5, P4, and P5 were also aligned with the corresponding genes of the reported isolates, as mentioned above. Maximum likelihood phylogenetic trees were constructed in MEGAX with 1000 bootstrap replicates [36].

Recombination Analysis
The aligned genome of the BrYV isolate from this study and other BrYV isolates available in GenBank were tested for potential recombination events. The recombination analysis was carried out using RDP [37], GENECONV [38], BOOTSCAN [39], MAXCHI [40], CHIMAERA [41], SISCAN [42], and 3SEQ [43] methods implemented in the recombination detection program v.4 [44]. Aligned sequences were subjected to the above-mentioned methods using default parameters. Potential recombination events detected were considered statistically significant if supported by at least four programs with p-values lower than a Bonferroni-corrected cutoff of 0.05.

Nucleotide Diversity and Haplotype Variability Indices
The average pairwise number of nucleotide differences per site (nucleotide diversity, π) was detected using DnaSP v.5 [45]. The statistically significant differences among the mean nucleotide diversity from the datasets were determined by calculating their 95% bootstrap confidence intervals. In addition, the nucleotide diversity was calculated using a 100-nucleotide sliding window, with a 10-nucleotide step size. The number of segregating sites (S), the number of haplotypes (H), and haplotype diversity (H d ) were also calculated for all datasets using DnaSP v.5 [45].

Detection of Positive and Negative Selection Sites
Using four different approaches, i.e., single-likelihood ancestor counting (SLAC), fixed-effects likelihood, random-effects likelihood, and partitioning for robust inference of selection [46], potential negatively and positively selected sites in the coding regions of P0, P1, P1-P2, P3, P3-P5, P4, and P5 were identified. All the analyses were performed in the Datamonkey web server (www.datamonkey.org accessed on 22 October 2021) [47]. Genetic Algorithm Recombination Detection (GARD) [48] was implemented to look for recombination breakpoints in the datasets in order to avoid ambiguous results. The dN/dS ratios were estimated using the SLAC method based on inferred GARD-corrected phylogenetic trees to compare the selection pressures acting on all genes of BrYV.
Supplementary Materials: The following supporting information can be downloaded at: https: //www.mdpi.com/article/10.3390/plants11070884/s1, Table S1: Percentage nucleotide sequence identities of whole genome of BrYV-Tas with other isolates reported worldwide; Table S2: Percentage nucleotide sequence identities of P0 gene of BrYV-Tas with other isolates reported worldwide; Table S3: Percentage nucleotide sequence identities of P1 gene of BrYV-Tas with other isolates reported worldwide; Table S4: Percentage nucleotide sequence identities of P1-P2 gene of BrYV-Tas with other isolates reported worldwide; Table S5: Percentage nucleotide sequence identities of P3 gene of BrYV-Tas with other isolates reported worldwide; Table S6: Percentage nucleotide sequence identities of P3-P5 gene of BrYV-Tas with other isolates reported worldwide; Table S7: Percentage nucleotide sequence identities of P4 gene of BrYV-Tas with other isolates reported worldwide; Table S8: Percentage nucleotide sequence identities of P5 gene of BrYV-Tas with other isolates reported worldwide; Table S9: Percentage amino acid sequence identities of P0 gene of BrYV-Tas with other isolates reported worldwide; Table S10: Percentage amino acid sequence identities of P1 gene of BrYV-Tas with other isolates reported worldwide; Table S11: Percentage amino acid sequence identities of P1-P2 gene of BrYV-Tas with other isolates reported worldwide; Table S12: Percentage amino acid sequence identities of P3 gene of BrYV-Tas with other isolates reported worldwide; Table S13: Percentage amino acid sequence identities of P3-P5 gene of BrYV-Tas with other isolates reported worldwide; Table S14: Percentage amino acid sequence identities of P4 gene of BrYV-Tas with other isolates reported worldwide; Table S15: Percentage amino acid sequence identities of P5 gene of BrYV-Tas with other isolates reported worldwide. Funding: This research was funded by Hort Innovation, using the Vegetable Industry research and development levy and contributions from the Australian Government (project VG16086: Area wide management of vegetable diseases: virus and bacteria). Hort Innovation is the grower-owned, not-for-profit research and development corporation for Australian horticulture.

Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.

Data Availability Statement:
The data is contained within the article or supplementary materials. The genome sequence of BrYV-Tas has been deposited in the NCBI database with Accession no: OM469309.