Deciphering the Genomic Landscape and Virulence Mechanisms of the Wheat Powdery Mildew Pathogen Blumeria graminis f. sp. tritici Wtn1: Insights from Integrated Genome Assembly and Conidial Transcriptomics

A high-quality genome sequence from an Indian isolate of Blumeria graminis f. sp. tritici Wtn1, a persistent threat in wheat farming, was obtained using a hybrid method. The assembly of over 9.24 million DNA-sequence reads resulted in 93 contigs, totaling a 140.61 Mb genome size, potentially encoding 8480 genes. Notably, more than 73.80% of the genome, spanning approximately 102.14 Mb, comprises retro-elements, LTR elements, and P elements, influencing evolution and adaptation significantly. The phylogenomic analysis placed B. graminis f. sp. tritici Wtn1 in a distinct monocot-infecting clade. A total of 583 tRNA anticodon sequences were identified from the whole genome of the native virulent strain B. graminis f. sp. tritici, which comprises distinct genome features with high counts of tRNA anticodons for leucine (70), cysteine (61), alanine (58), and arginine (45), with only two stop codons (Opal and Ochre) present and the absence of the Amber stop codon. Comparative InterProScan analysis unveiled “shared and unique” proteins in B. graminis f. sp. tritici Wtn1. Identified were 7707 protein-encoding genes, annotated to different categories such as 805 effectors, 156 CAZymes, 6102 orthologous proteins, and 3180 distinct protein families (PFAMs). Among the effectors, genes like Avra10, Avrk1, Bcg-7, BEC1005, CSEP0105, CSEP0162, BEC1016, BEC1040, and HopI1 closely linked to pathogenesis and virulence were recognized. Transcriptome analysis highlighted abundant proteins associated with RNA processing and modification, post-translational modification, protein turnover, chaperones, and signal transduction. Examining the Environmental Information Processing Pathways in B. graminis f. sp. tritici Wtn1 revealed 393 genes across 33 signal transduction pathways. The key pathways included yeast MAPK signaling (53 genes), mTOR signaling (38 genes), PI3K-Akt signaling (23 genes), and AMPK signaling (21 genes). Additionally, pathways like FoxO, Phosphatidylinositol, the two-component system, and Ras signaling showed significant gene representation, each with 15–16 genes, key SNPs, and Indels in specific chromosomes highlighting their relevance to environmental responses and pathotype evolution. The SNP and InDel analysis resulted in about 3.56 million variants, including 3.45 million SNPs, 5050 insertions, and 5651 deletions within the whole genome of B. graminis f. sp. tritici Wtn1. These comprehensive genome and transcriptome datasets serve as crucial resources for understanding the pathogenicity, virulence effectors, retro-elements, and evolutionary origins of B. graminis f. sp. tritici Wtn1, aiding in developing robust strategies for the effective management of wheat powdery mildew.


Introduction
Wheat is one of the most extensively farmed and consumed cereal grains globally, serving as a crucial dietary cornerstone for millions.Its adaptability to diverse climates renders it a dependable food staple across numerous nations, contributing significantly to global food security [1].Yet, innumerable diseases, including rusts, blights, and smuts, threaten wheat crops capable of diminishing yield and quality, thereby causing substantial economic setbacks for farmers.Among these, rust and smuts have been familiar to humanity since ancient times.However, another disease, powdery mildew, ranks among the top ten diseases that severely curtail wheat productivity [2].The resurgence of this powdery mildew caused by B. graminis f. sp.tritici is becoming evident across numerous wheat-growing nations, resulting in a yield loss ranging from 10 to 62% in countries such as Russia, South America, the United Kingdom, Europe, Brazil, Canada, and China [3].Within India, the projected yield reduction ranges from 13-34% for moderate infections to a staggering 50-100% during severe outbreaks.A sole infection by a solitary conidium, leading to the formation of a mildew colony, releases around two million conidia capable of traveling up to 650 km that exhibit resistance not only to cold conditions but also drought environments [4,5].
Repetitive infections inevitably result in new pathotypes capable of breaking the host resistance [6,7].B. graminis f. sp.tritici also forms sexual ascospores within the chasmothecia on mature wheat leaves, aiding the fungus in surviving summer temperatures during the offseason [8].Upon invading the wheat host, the fungus develops lobed haustoria, either to absorb nutrients or trigger effector-mediated immune signaling if the functional immune receptor is present [9][10][11][12].The haustorium additionally provides an expansive surface for exchanging proteins like Candidate Secreted Effector Protein (CSEP) and metabolites between the pathogen and the host [12,13].These proteins are capable of manipulating the host cell machinery to trigger susceptibility by suppressing basal immune responses, aiding fungal growth and reproduction [14,15].
Additionally, genomic data can assist in the discovery of novel powdery mildew resistance (Pm) genes that could be incorporated into hexaploid wheat via breeding initiatives.However, sequencing the B. graminis f. sp.tritici genome has consistently posed a formidable challenge [24].Its rapid mutation rates, intricate population structure, and varied physiological specialization contribute to the complexity of genome sequencing endeavors [25].Prior to our study, no documented genome sequences of wheat powdery mildew were published from India.Hence, we aimed to establish genomic resources by sequencing its genome.Employing a hybrid approach, we assembled short reads from Illumina and long genomic reads from Nanopore Technology (ONT), Oxford, UK.Our study had the following primary objectives: (i) isolation of high-quality, high-molecular-weight genomic DNA for library preparation, (ii) whole-genome sequencing using a hybrid assembly to predict genes and gene families, (iii) Gene Ontology analysis and categorization of identified genes, and (iv) unveiling the transcriptome architecture of a virulent type of B. graminis f. sp.tritici found in India.The genome and transcriptome data generated in our work will support ongoing research endeavors aimed at detecting races/pathotypes across diverse agroclimatic zones and facilitate the implementation of resistance genes tailored to specific zones, enabling effective and sustainable management of wheat powdery mildew.
Wheat seedlings (cv.WL711) were initially used to isolate and purify B. graminis f. sp.tritici Wtn1.Susceptible seedlings of Triticum aestivum (cv.WL711) were employed for this purpose.Additionally, fifty wheat lines, including near-isogenic lines (NILs) with distinct Pm (~Powdery Mildew) genes, were inoculated with seven-day-old conidia from fresh colonies.The virulence of the B. graminis f. sp.tritici Wtn1 isolates was evaluated by observing susceptible reactions on a scale of 0-9 after 7-10 days of powdery mildew inoculations under controlled conditions.A B. graminis f. sp.tritici isolate was classified as highly virulent if it exhibited a susceptible reaction rating between 7 and 9 on the 0-9 scale.The differential reactions of the B. graminis f. sp.tritici Wtn1 isolates were identified based on their average scoring values following the approach suggested by Namuco et al. (1987).The total number of virulent isolates and their virulence percentages, varietal efficacies, and Pm gene efficacies were computed using resistant and susceptible reactions of Indian wheat varieties and NILs.The calculation of these parameters was conducted according to the formula given by Green (1966).
Additionally, seven-day-old conidia from fresh colonies were introduced into five near-isogenic lines of Triticum aestivum, each carrying distinct Pm genes like Amigo (Pm17), Chul Bidai (Pm3b), and Timgalen (Pm6).Symptoms and their severity in the seedlings were assessed to determine the virulence (severity and frequencies), varietal resistance performance, and Pm gene effectiveness.Virulent reactions were quantified after 7-10 days of conidial inoculation using the 0-9 scale proposed by Namuco et al. (1987).Selection of the virulent strain, B. graminis f. sp.tritici Wtn1, was based on the frequency of virulence (Supplementary Figure S2).

Virulence (%) =
No. of virulent isolates Total number of isolates × 100 Varietal resistance efficacy (%) = Number of times the variety is resistant Total number varieties tested × 100 Principal Coordinate Analysis (PCA) and neighbor joining clustering, employing correlation as the similarity index were performed to identify the most aggressive and distinctive B. graminis f. sp.tritici isolates across three representative agro-climatic zones of Indian wheat cultivation.

Selection of the Powdery Mildew Pathogen for WGS
The highly virulent fungal strain, B. graminis f. sp.tritici Wtn1, used in this study was obtained from a susceptible bread wheat variety (cv.WL 711) collected from the experimental field at ICAR-Indian Agricultural Research Institute, Regional Station, Wellington, situated in the Western Ghats of India (11 • 21 ′ 50.00 ′′ N 76 • 47 ′ 6.90 ′′ E; 1815 Mean Sea Level).Following the validation of its identity and mycological characteristics, pure cultures were maintained on the seedlings using an efficient mass culturing technique.The isolation involved placing the conidia on surface-sterilized leaf segments on a water agar medium supplemented with benzimidazole (60 µg/mL) for genome sequencing (Figure 1).
The highly virulent fungal strain, B. graminis f. sp.tritici Wtn1, used in this study was obtained from a susceptible bread wheat variety (cv.WL 711) collected from the experimental field at ICAR-Indian Agricultural Research Institute, Regional Station, Wellington, situated in the Western Ghats of India (11°21′50.00″N 76°47′6.90″E; 1815 Mean Sea Level).Following the validation of its identity and mycological characteristics, pure cultures were maintained on the seedlings using an efficient mass culturing technique.The isolation involved placing the conidia on surface-sterilized leaf segments on a water agar medium supplemented with benzimidazole (60 µg/mL) for genome sequencing (Figure 1).

The Wheat Cultivar and Axenic Culturing of the Powdery Mildew Pathogen
Healthy seedlings emerged from surface-disinfected seeds of the susceptible cv.WL711 were grown for 7-10 days in garden soil (three parts) mixed with decomposed farmyard manure (one part).The autoclaved soil was then placed into sterilized paper cups.Surface-sterilized WL711 seeds were planted in these cups, watered every 48 h with sterile water, and maintained in controlled conditions within a polyhouse (25 ± 1 °C, 85-

The Wheat Cultivar and Axenic Culturing of the Powdery Mildew Pathogen
Healthy seedlings emerged from surface-disinfected seeds of the susceptible cv.WL711 were grown for 7-10 days in garden soil (three parts) mixed with decomposed farmyard manure (one part).The autoclaved soil was then placed into sterilized paper cups.Surfacesterilized WL711 seeds were planted in these cups, watered every 48 h with sterile water, and maintained in controlled conditions within a polyhouse (25 ± 1 • C, 85-90% humidity, 16 h light/8 h dark) (Figure 2).For axenic culturing of the B. graminis f. sp.tritici Wtn1 isolate, primary leaves from the same wheat cultivar were sterilized with ethyl alcohol (70%), cut into 25 mm lengths, and placed on a water agar medium.Fresh conidia of the pathogen were spread-inoculated into these leaves and incubated at 25 ± 2 • C.After seven days, mildew colonies comprising epiphytic conidia were collected under aseptic conditions using a sterilized lancet needle within a laminar flow chamber.Additional leaves were concurrently inoculated to enhance the fungal conidial biomass production (Figure 2).
J. Fungi 2024, 10, x FOR PEER REVIEW 5 of 28 90% humidity, 16 h light/8 h dark) (Figure 2).For axenic culturing of the B. graminis f. sp.tritici Wtn1 isolate, primary leaves from the same wheat cultivar were sterilized with ethyl alcohol (70%), cut into 25 mm lengths, and placed on a water agar medium.Fresh conidia of the pathogen were spread-inoculated into these leaves and incubated at 25 ± 2 °C.After seven days, mildew colonies comprising epiphytic conidia were collected under aseptic conditions using a sterilized lancet needle within a laminar flow chamber.Additional leaves were concurrently inoculated to enhance the fungal conidial biomass production (Figure 2).

Mass Production of Conidia and Microscopy
Leaf segments with abundant conidiogenesis of B. graminis f. sp tritici Wtn1 were picked from a water agar medium, and the conidia on a mildew colony (3 mm 2 ) were counted using a hemocytometer.Approximately 25 × 10 3 conidia per ml were recorded in a suspension prepared from the mildew-infected leaf area (~3.0 mm 2 ).For inoculation, 95-100 conidia were placed onto a primary leaf (Length 70-75 mm × Width 4-5 mm) of the susceptible wheat cultivar WL711 to ensure uniform infection.Following inoculation, the seedlings were maintained under aseptic conditions inside incubation chambers made of 2 mm thick translucent polycarbonate sheets.In a controlled playhouse environment, these pots were placed in inoculation cabins (Length 6′× Width 6′× Height 8′).Irrigation (50 mL per pot) was carried out every alternate day via sprinkling.The cabins were illuminated at 100 ± 5 µmol m −2 s −1 for 16 h of light and 8 h of darkness.After seven days, the

Mass Production of Conidia and Microscopy
Leaf segments with abundant conidiogenesis of B. graminis f. sp tritici Wtn1 were picked from a water agar medium, and the conidia on a mildew colony (3 mm 2 ) were counted using a hemocytometer.Approximately 25 × 10 3 conidia per ml were recorded in a suspension prepared from the mildew-infected leaf area (~3.0 mm 2 ).For inoculation, 95-100 conidia were placed onto a primary leaf (Length 70-75 mm × Width 4-5 mm) of the susceptible wheat cultivar WL711 to ensure uniform infection.Following inoculation, the seedlings were maintained under aseptic conditions inside incubation chambers made of 2 mm thick translucent polycarbonate sheets.In a controlled playhouse environment, these pots were placed in inoculation cabins (Length 6 ′ × Width 6 ′ × Height 8 ′ ).Irrigation (50 mL per pot) was carried out every alternate day via sprinkling.The cabins were illuminated at 100 ± 5 µmol m −2 s −1 for 16 h of light and 8 h of darkness.After seven days, the B. graminis f. sp.tritici Wtn1 conidial masses were collected from well-established individual colonies, pooled in sterilized microcentrifuge tubes (2.0 mL each), and preserved at −20 • C for genomic DNA isolation.For mycological observations, fresh conidia were retrieved using a stainless steel nichrome loop (Metaloop TM with a 2 mm diameter), spread on microscopic slides, and examined at different magnifications under a light microscope (Nikon, E600 Eclipse model) equipped with a photomicrography system.The images were visualized at 100× magnification using specific eyepieces (10×/22 Nikon) and objective lenses (10×/0.25 WD 6.1).

Genomic DNA Isolation and Strain Identification
Taxonomic classification at the intra-forma specialis level was performed before the genome sequencing.Initially, efforts were made to purify genetically pure mildew colonies from the infected leaves for genomic DNA extraction (Figure 1), and seven methods using CTAB, SDS, or acombination of both were attempted (Supplementary Table S1).Ultimately, a method involving conidia lysis in liquid nitrogen followed by maceration using glass beads in 300 µL of preheated 5% Sarcosyl in CTAB Buffer at 65 • C for 30 min yielded high-molecular-weight genomic DNA.This purified genomic DNA was used as a template for amplifying the fungal species barcodes via a PCR reaction with the ITS1 and ITS4 primers.The resulting PCR amplicons covering the entire barcode length were sequenced bidirectionally using the dideoxy chain termination method.The obtained sequences were end-trimmed, curated, and analyzed using the Basic Local Alignment Search Tool-Nucleotide (BLASTN) to confirm the species identity [26].After verifying the sequence, it was submitted to the NCBI database and the accession number (MN872365.1)was received.

Amplification of the Pathogenesis-Related Genes
Initially, the PCR primers recommended [27] for Protein Kinase C genes (PKC1 and PKC-like; 714 bp), the Catalase (Cat1; 637 bp) gene, the Alternative Oxidase Gene (159 bp), and the Integral Membrane Protein gene(pth; 735 bp) were used.The PCR mixture at a final volume of 50 µL consisted of 25 µL of 2X DreamTaq PCR Master Mix (Thermo Scientific, Waltham, MA, USA), 1 µL each of the forward and reverse primers (10 pmol/reaction), 4 µL of 25 ng of DNA, and 19 µL of nuclease-free water was subjected to initial denaturation at 95 • C for 3 min; 35 cycles of 30 s at 95 • C, 30 s at 55 • C, and 1 min at 72 • C; followed by a final extension of 12 min at 72 • C. The PCR-amplified products were visualized in 1.0% (w/v) agarose gel using a 100 bp DNA ladder.The amplified products were sequenced.The sequences were analyzed and accessioned using the NCBI database.

Genomic Libraries and Whole-Genome Sequencing
Genomic libraries for the short and long nucleotide reads were prepared following the manufacturer protocols for genome sequencing (Illumina HiSeq 2500 and PromethION Flow Cell, R9.4.1, Nanopore).The Illumina HiSeq 2500 method employed the NEBNext Ultra DNA Library Preparation Kit, with the subsequent quality assessment conducted using the Agilent TapeStation.In the Nanopore approach, native barcoding of the genomic DNA (EXP-NBD114 and SQK-LSk109) was explicitly executed for the PromethION system.Post-barcoding, stringent quality checks were performed using Qubit and the Agilent TapeStation to ensure the reliability and quality of the prepared libraries.

Sequence Curation and De Novo Genome Assembly
The raw fastQ files obtained using Illumina and ONT were subject to comprehensive quality checks, evaluating parameters including the base quality score distribution, average base content per read, % GC distribution in the reads, and mean read length for Illumina.For the ONT reads, metrics such as the bases in GB, N50, and median read length were assessed.These evaluations were performed before commencing the hybrid WGS processing (Figure 1).Subsequently, these fastQ files underwent pre-processing using AdapterRemovalv2 [28] before performing the assembly.Initially, the process began with adapter and low-quality sequence removal, maintaining an average quality score threshold of 30 across paired-end reads.Next, duplicate reads were removed using Sequence Alignment/Map (SAMtools)'s default settings.The mapping phase yielded a total of 9.24 million reads.A sliding window of 1 kb, shifting by 100 bp at each step, was used to identify zero-coverage regions and compute the bases covered by the mapped reads (Table 1).Subsequently, de novo genome assembly was executed utilizing the Maryland Super Read Cabog Assembler (MaSuRCA) hybrid assembler v4.0.8 [29].The obtained contigs underwent polishing using POLishing by Calling Alternatives (POLCA).Further gap filling and scaffolding was done through RagTag (https: //genomebiology.biomedcentral.com/articles/10.1186/s13059-022-02823-7,12 September 2023) with using reference B. graminis f. sp.tritici 96224 V3.16.(GCA_900519115.1).Later, the final assembled genome quality metrics -GC%, L50, L75, N50, and N75-were calculated using Quality Assessment Tool (QUAST) v4.6 [30] (Supplementary Figure S3).Additionally, the average nucleotide identity (ANI) was determined using OrthoANI.The list of tools used for the genome assembly and analysis is furnished in Supplementary Table S2.Furthermore, the genome assembly was deposited into the National Center for Biotechnology Information (NCBI)/GenBank under accession number GCA_024363405.1/JALMLO000000000.We also analyzed the comparative genome assembly of B. graminis represented in Supplementary Table S3.

Gene Prediction and Annotation
The assembled genome underwent several analyses.Repeat masking was performed using RepeatMasker v4.0.7 and Repeat Modeler v1.0.11, setting the GC content parameters to 46-48% for repeats soft masking.The AUGUSTUS v3.3.3 gene predictor was initially trained using the closest reference genome via B. graminis f. sp.tritici 96224 V3.16.Later, gene prediction was conducted using Augustus v3.3.3, which involved predicting genes, CDS and proteins independently on both DNA strands, along with the utilization of soft masking [31].Subsequently, Benchmarking Universal Single-Copy Orthologue (BUSCO) v4.1.4assessed the completeness of the resulting protein sequences.
KofamKOALA KEGG (Kyoto Encyclopedia of Genes and Genomes) Orthology Search via HMMER KofamScanv2023-10-02, release 108.0 [32], identified the biochemical pathways, while the hierarchical functional annotations were conducted using eggNOG (evolutionary genealogy of genes: Non-supervised Orthologous Groups)-mapperv2.1.8,database v5.0.2, for further analysis [33].The pathogen-host interaction (PHI) sequences were retrieved from the PHI database [34] using double index alignment of next-generation sequencing data (DIAMOND) v2.0.15 and a protein aligner search [35], limiting the target sequences to 1 and employing an e-value of 0.0001.The effector proteins were identified by examining the predicted total proteins for the signal peptides using the Phobius and SignalP_Euk tools available in InterProScan v5.59-91.0[36], and subsequent analysis of the signal peptide-containing sequences was performed using Effector P v3.0, following the established methodologies [37].Lastly, the presence of tRNA and rRNA was determined using tRNAscan-SE v0.4 and barrnap v 0.9, respectively, employing specific settings for the length threshold and e-value criteria, integrated in Galaxy platform [38].
2.12.Transcriptomics of B. graminis f. sp.tritici Wtn1 B. graminis f. sp.tritici Wtn1 conidial biomass was used for the total RNA extraction and subsequent sequencing via a standard transcriptomics workflow.Enriched mRNA was used to construct a paired-end (PE) library and sequenced using the Illumina HiSeq 2500 platform, generating 4.63 GB (R1+R2) of data with a 44% GC content.Quality assessment with FastQC preceded the removal of low-quality bases and ambiguous datasets using Trimmomatic v0.35 to obtain high-quality reads [41,42].Read mapping to the B. graminis f. sp.tritici 96224 V3.16 reference genome from the Ensembl Fungi database was performed using Bowtie2 (v2.5.0) and Cufflinks (v2.2.1.3)[43,44], considering genes with an FPKM value of ≥10 for functional annotation.Genes with fragments per kilobase of transcript per million mapped reads (FPKM) ≤ 9.0 were excluded from this annotation process.The expressed genes underwent Gene Ontology (GO) annotation via the Database for Annotation, Visualization, and Integrated Discovery (DAVID) [45,46] v2023q4.Additionally, these genes were assessed for gene enrichment using the functional annotation tool against a background reference, B. graminis f. sp.hordei DH14n [45].Furthermore, the raw reads from the conidial transcriptome datasets are deposited to NCBI-SRA with accession number PRJNA1086707.

Genome Analysis for Nucleotide Variation
Initially, to determine Single-Nucleotide Polymorphisms (SNPs) and Indels, the highquality reads were aligned with the B. graminis f. sp.tritici 96224 V3.16.(GCA_900519115.1)genome using the maximal exact match (MEM) algorithm in Burrows-Wheeler Alignment (BWA) tool v0.7.17 [47].PCR duplicate reads from the aligned data were removed using the Mark duplicates tool within Picard (https://broadinstitute.github.io/picard/accessed on 12 September 2023).Subsequently, variant calling was carried out employing bcftoolsmpileup from SAMtoolsv1.15.1 [48], applying filters for a minimum variant depth (DP) of ≥20, a Phred quality score (Q) of ≥25, and a mapping quality of ≥30.The identified SNPs were annotated for different genomic regions, as were their various effects and impact types using SnpEffv4.5[49].For SNP density plot, the variant positions were submitted to SNP density window of SR plot [50].

Results
The study investigated the virulence patterns of B. graminis f. sp.tritici isolates on diverse wheat varieties and the consequent efficacy of the Pm gene in conferring mildew resistance (Figure 2).To identify B. graminis f. sp.tritici Wtn1, we sequenced four pathogenesis-related genes (PKC1 and PKC-like; Cat1; AOX; and pth).The sequences obtained were accessioned in the NCBI database with the accession numbers OP271713, OP087312, OP354504, and OP271716, respectively.Among the 15 assessed B. graminis f. sp.tritici isolates, B. graminis f. sp.tritici 29 (renamed B. graminis f. sp.tritici Wtn1) exhibited infectivity across various wheat varieties and near-isogenic lines like Amigo (Pm17), Chul-Bidai (Pm3b), and Kavakaz (Pm8).However, it failed to infect Timgalen, carrying the Pm6 gene, displaying high virulence on ChulBidai (Pm3b) seedlings and moderate susceptibility on the other Pm genes.Due to its unique virulence profile, B. graminis f. sp.tritici Wtn1 was chosen for the whole-genome analysis, corroborated by PCA analysis.Molecular phylogenetic analysis based on the ITS sequence (518-bp; GenBank Acc.No. MN872365.1)confirmed a genetic similarity of 99.6-100.0%with AB462308.1 and MN919383.1,representative of mildew isolates from different regions/countries, affirming the close genetic relationship among mildew isolates impacting wheat crops.

Sequence Read Quality, Genome Assembly and Evaluation
We produced 13.97 Gb of data with 92.50 million reads; over 92.7-92.0%exceeded Q30.Additionally, ONT long-read sequencing generated 4.94 Gb from 3.23 million reads.The curated raw data for the genome hybrid assembly comprised 92.41 million reads (13.34 Gb; GC-45.4%)(Table 1).Our draft-assembled genome spans 93 scaffolds with a CG content of 43.72%.It is 140.61 Mb with an N50 of 16.02 Mb.The longest contig in the hybrid assembly reached around 19.53 Mb.Genome completeness assessment revealed 95.4% complete and 1.6% fragmented BUSCOs.Comparative analysis showed a high similarity (99.26% to 99.69%) within the B. graminis f. sp.tritici genomes, contrasting with a 65% to 73% reduction in dicotyledon-host-infecting species like E. necator, E. pisi, and G. cichoracearum.Molecular phylogeny based on whole-genome alignment highlighted alignments among various powdery mildew fungi, showing similarities between the monocot-infecting and dicot-infecting species (Figure 3).Furthermore, BLASTP analysis indicated significant matches with B. graminis f. sp.triticale, B. graminis f. sp.tritici, B. graminis f. sp.tritici 96224, and B. hordei.The organism chart revealed the highest hit for the test isolate, followed by B. graminis f. sp.hordei and B. graminis f. sp.hordei strain DH14 (Supplementary Table S4).

Identification of Repeats and Transposable Elements
The assembled genome showed repetitive sequences, encompassing 73.8% of its entirety (Table 2).Among these, retro-elements predominated, with 95,626 copies covering 62.24% of the genome, notably LINEs accounting for 42.92 Mb.LTR elements were the second most abundant, featuring 59,884 copies across 43.122 Mb (31.16% of the genome).Gypsy/DIRS1 and Copia elements were prominent within this category, comprising 15.29% and 14.43% of the genome, respectively.DNA transposons and Tc1-IS630-Pogo elements had almost equal proportions, each occupying 1.13 Mb and constituting 0.82% of the genome.Interspersed repeats represented 73.80% of the sequences, spanning a length of 102 Mb (102,143,698) (Table 2).

Gene Prediction, Annotation Validation, and Comparative Protein Coding Genes
InterProScan Analysis.The gene and protein prediction analysis revealed 8480 proteinencoding genes (PEGs) and 190 rRNA and 583 tRNA sequences (Figure 4).Within the Wtn1 isolate, the protein-coding density (PCD) covered a total length of 11,135,169 coding bases (CDS) occupying a 12,710,518 gene length; encoded 3,704,012 protein amino acids, averaging a length of 1649 genes; and had a 1445 CDS length translating to the 481 protein-encoding genes (PEGs).Determination of the gene density found it to be unevenly distributed in each chromosome (Chr).We also observed that the Chr 1, 2, 7, 8, and 10 end positions/regions had a high gene density compared to the other chromosomes.Further, Chr1 to 10 have a medium gene density, whereas the Chr 3 terminal region has the lowest gene density (Supplementary Figure S4).InterProScan analysis unveiled diverse functional features, identifying 3180 distinct protein families (PFAMs), detecting 7845 PFAMs in the genome (Figure 5).Additionally, the analysis recognized 774 distinct superfamilies with 5699 copies, 357 PIRSF with 468 copies, and 2179 FunFam features comprising 3218 copies within the B. graminis f. sp.tritici Wtn1 genome.Comparative proteome analysis showed a substantial overlap, with 86%, 90%, 71%, and 76% of the protein families, superfamilies, PIRSF, and FunFam features, respectively, shared among the three studied genomes (Figure 5).Notably, there was a significant sharing of PFAMs, superfamilies, and FunFam features between the B. graminis f. sp.tritici Wtn1 and B. graminis f. sp.triticale THUN-12 genomes.The expanded PFAMs in these genomes included fungal protein kinases; the WD domain, G-beta repeat; the protein kinase domain; RNA recognition motifs (RRM, RBD, and RNP domain); mitochondrial carrier proteins; and helicase conserved C-terminal domains.

Gene Ontology Annotation
In the Gene Ontology (GO) mapping and annotation, the genes were diversified across biological processes, molecular functions, and cellular components (Supplementary Tables S5-S7).At level 2 of the biological processes, the gene classifications ranged from 13 to 2856, while the molecular function annotations varied from 5 to 2316.For the cellular components, the gene classifications spanned from 1013 to 3004 terms.Among the biological processes, cellular processes (~2865 genes) were prominent, followed by metabolic processes (~2368 genes), localization (663 genes), and response to stimuli (384 genes).In molecular functions, "catalytic activity" had the highest representation, involving around 2316 genes, followed by binding (2204 genes), ATP-dependent activity (261 genes), transporter activity (241 genes), and molecular function regulator activity (128 genes).Considering the cellular components, genes associated with cellular anatomy activities were prevalent, followed by intracellular components and protein-containing complexes in the B. graminis f. sp.tritici Wtn1 genome.

Gene Ontology Annotation
In the Gene Ontology (GO) mapping and annotation, the genes were diversified across biological processes, molecular functions, and cellular components

KOG Classification of the Genome
Using euk-KOG classification, the genome was categorized into various functional groups.The prominent categories comprised carbohydrate transport and metabolism (G); replication, recombination, and repair (L); post-translational modification, protein turnover, and chaperones (O); intracellular trafficking, secretion, and vesicular transport (U); RNA processing and modification (A); and translation, ribosomal structure, and biogenesis (J).These prevalent features signify vital roles in the organism's biological processes and metabolic dynamics (Figure 9).Additionally, the overall functional categories encompassed transcription (K), amino acid transport and metabolism (E), lipid transport and metabolism (I), and the cytoskeleton (Z), indicating their involvement in fundamental cellular mechanisms and developmental processes.

KEGG Pathways
In the B. graminis f. sp.tritici Wtn1 genome, diverse functional processes were observed in the analysis of the biochemical pathways.Under carbohydrate metabolism (KO:09101) within the broader metabolic pathways (KO:09100), 213 enzymes were identified across fifteen pathways, including Pyruvate metabolism, Glycolysis/Gluconeogenesis, starch and sucrose metabolism, Inositol phosphate metabolism, and Glyoxylate and dicarboxylate metabolism (Table 4).Additionally, the investigation highlighted 16 genes/enzymes involved in the Terpenoid backbone biosynthesis pathway within the metabolism of terpenoids and polyketides (Table 5).In the Environmental Information Processing pathways, 393 genes/enzymes were found in 33 signal transduction pathways, notably in the MAPK signaling pathway-yeast (53 genes), the mTOR signaling pathway (38 genes), PI3K-Akt signaling (23 genes), and AMPK signaling (21 genes) (Table 6).
RNA processing and modification (A); and translation, ribosomal structure, and biogenesis (J).These prevalent features signify vital roles in the organism s biological processes and metabolic dynamics (Figure 9).Additionally, the overall functional categories encompassed transcription (K), amino acid transport and metabolism (E), lipid transport and metabolism (I), and the cytoskeleton (Z), indicating their involvement in fundamental cellular mechanisms and developmental processes.
3.13.SNPs and Indels in B. graminis f. sp.tritici The SNP and Indel analysis revealed 356,076 variants within this genome, including 345,375 SNPs, 5050 insertions, and 5651 deletions.When categorized by chromosome, the highest SNP count was observed on chromosome 5, followed by chromosomes 9, 7, and 2 (Figures 10 and 11a).The SNP density was found asymmetric or unevenly distributed in each chromosome.We also observed that the Chr 5, 6, 7, 9 and 10 start and end positions/coordinates have a high SNP density compared to the other Chrs.Further, Chrs 1, 2, 4, and 11 have a moderate SNP density (Figure 10).The average SNP density per chromosome ranges from 2782 to 49,913 (Figure 10).The transition (Ts) and transversion (Tv) counts were determined to be 377,264 and 232,101, respectively, resulting in a Ts/Tv ratio of 1.63.The annotation of these variants involved classification based on region, impact, and functional class (Figure 11).Region-wise classification of the identified variants indicated that the majority were Intergenic (43.17%), followed by the downstream (26.69%) and upstream (26.46%) regions (Figure 11b).Functional class analysis revealed that 52.03% were missense variants, and 46.87% were silent (Figure 11c).Impact-wise classification showcased that the majority of variants were labeled as modifiers (98.6%), with fewer categorized as having a moderate or low impact (Figure 11d).

SNPs and Indels in B. graminis f. sp. tritici
The SNP and Indel analysis revealed 356,076 variants within this genome, including 345,375 SNPs, 5050 insertions, and 5651 deletions.When categorized by chromosome, the highest SNP count was observed on chromosome 5, followed by chromosomes 9, 7, and 2 (Figures 10 and 11a).The SNP density was found asymmetric or unevenly distributed in each chromosome.We also observed that the Chr 5, 6, 7, 9 and 10 start and end positions/co-ordinates have a high SNP density compared to the other Chrs.Further, Chrs 1, 2, 4, and 11 have a moderate SNP density (Figure 10).The average SNP density per chromosome ranges from 2782 to 49,913 (Figure 10).The transition (Ts) and transversion (Tv) counts were determined to be 377,264 and 232,101, respectively, resulting in a Ts/Tv ratio of 1.63.The annotation of these variants involved classification based on region, impact, and functional class (Figure 11).Region-wise classification of the identified variants indicated that the majority were Intergenic (43.17%), followed by the downstream (26.69%) and upstream (26.46%) regions (Figure 11b).Functional class analysis revealed that 52.03% were missense variants, and 46.87% were silent (Figure11c).Impact-wise classification showcased that the majority of variants were labeled as modifiers (98.6%), with fewer categorized as having a moderate or low impact (Figure 11d).

Discussion
Globally, wheat cultivation is threatened not only by rusts and blast but also by th powdery mildew caused by B. graminis f. sp.tritici [24,51], disseminated by airborne co nidia [19].Mildew management relies heavily on the application of fungicides and th

Discussion
Globally, wheat cultivation is threatened not only by rusts and blast but also by the powdery mildew caused by B. graminis f. sp.tritici [24,51], disseminated by airborne conidia [19].Mildew management relies heavily on the application of fungicides and the cultivation of resistant wheat varieties [52,53].However, the emergence of new pathotypes and races renders these strategies ineffective and short-lived.Recent genomic technologies and approaches offer new opportunities for durable crop disease management in agriculture [54][55][56].In India, powdery mildew in wheat is an emerging disease, especially in hilly terrains.Despite its importance, and economic significance, so far, no attempts have been made to generate genomic data and genome resources for the Indian strain of B. graminis f. sp.tritici.Challenges such as high conidial moisture and difficulties in obtaining contamination-free high-quality genomic DNA hinder comprehensive genome analysis.In the past, conidial biomass has been used for genomic DNA isolation from related mildew strains affecting barley, wild grass, rye, and Lolium [57].Environmental DNA samples are expected to carry unintended microorganisms and host plant DNA [58].
Initially, we ensured the genetic purity of B. graminis f. sp.tritici through in vitro culturing on detached and surface-disinfected wheat leaves, following the established methods [17,59,60].Molecular-taxonomic analysis using ITS sequencing and four other gene sequences confirmed its identity as B. graminis f. sp.tritici [61] (Ac.No. MN872365.1).We used fungal biomass generated from monoconidial cultures for the genome sequencing following several methods [55].Employing strategies to minimize DNA degradation, protein and carbohydrate contamination, and deaminating the nucleotides [24,[62][63][64][65][66], we successfully isolated high-molecular-weight genomic DNA.Notably, Sarcosyl proved effective due to its ability to dissociate the nucleosomes and ribosomes while inhibiting nucleases.Utilizing a hybrid approach, we assembled short and long reads, resulting in 93 contigs totaling 140.61 Mb with a GC content of 43.7%.In recent times, integrating genome sequencing advancements and hybrid assembly algorithms has significantly accelerated genome analysis, functional predictions, and characterization [67].
The nucleotide sequence hits aligned with B. graminis f. sp.tritici identified wellassembled genome and annotated genes [68].The use of the average nucleotide identity(ANI) revealed two distinct Blumeria clades, revealing genetic variability without differences in virulence across various hosts [17].The genome showed 95,626 retro-elements, covering 86.14 Mb (62.24%), indicating a proliferation of transposable elements (TEs) contributing to virulence and ecological and host adaptations [18,69].TEs contribute to virulence evolution by introducing genetic variations through insertions and strand invasions [70].In particular, their mobility within genomes leads to gene duplication, horizontal gene transfer, and gene loss, influencing the evolution of virulence factors [13].In powdery mildew, the TE proliferation results in the absence of the RIP pathway, a rarity among related ascomycetes [18].Around 74% of the B. graminis f. sp.tritici genome consists of TE-derived sequences distributed across the chromosome arms alongside genes [22].Pathogens with larger genomes often harbor increased repetitive DNA, particularly retrotransposons [71,72], aligning with the extensive TE coverage observed in the B. graminis f. sp.tritici Wtn1 genome.Previous studies have linked larger genomes to a higher abundance of repeated elements [73,74].
The annotated B. graminis f. sp.tritici genome is predicted to possess ~8000 proteincoding genes, which is relatively low for filamentous fungi, potentially due to the conversion to pseudogenes and the prevalence of repeat elements.This genome lacks the Amber stop codon, containing only the Opal and Ochre tRNA anticodons.Previous studies have identified 6540 genes, with 5258 shared between B. graminis f. sp.tritici and B graminis f. sp.hordei [19,75].The Indian isolate B. graminisWtn1 exhibited higher gene content than these previous findings, with 8480 genes.Fungal pathogens leverage diverse effectors to expand their host range fitness [76].Analysis of 36 B. graminis f. sp.tritici isolates suggests that effector genes are vulnerable to duplication and deletion, possibly due to the high presence of repeat elements [22].Our data revealed 805 predicted effector genes in the B. graminis f. sp.tritici genome, including established ones like B. graminis f. sp.tritici_Bcg-7, BEC1005, CSEP0162, BEC1016, and BEC1040.Several of these effectors are reported to enter into wheat cells via specialized feeding structures called haustoria [77].
Our study further detected 8 Avr genes among detected pathogenesis-related genes in the B. graminis f. sp.tritici Wtn1 genome, aligning with prior findings [78,79].The Avr genes identified in earlier studies within B. graminis might serve as effectors, potentially contributing to its aggressiveness [79][80][81].Understanding these genes can offer insights into the functions of R-genes, akin to AVRa10 and AVRk1 in barley, influencing infection in susceptible varieties and potentially sharing similar functions in Triticum species [82].Genomic effector identification is important for unravelling host-pathogen interactions [83,84].Virulence genes can provide insights into how wheat genotypes respond to powdery mildew infections, aiding the development of resilient wheat cultivars.Powdery mildews like B. graminis f. sp.tritici, Erysiphe pisi, and Golovinomyces cichoracearum have undergone specific gene loss, suggesting host-specific adaptations [18].The mildews encode diverse secreted proteins, potentially vital to species-specific adaptation, associated with biological processes like secondary metabolite biosynthesis, transport, signal transduction, and DNA functions, contributing to their adaptability.Understanding these interactions is vital to breeding resilient wheat varieties against powdery mildew epidemics.
RNA sequencing of the conidia of B. graminis f. sp.tritici Wtn1 uncovered 30 highly expressed genes among the 4234 expressed genes.Notably, genes related to carbon and nitrogen metabolism, such as glucose-repressible protein, Oleate-induced peroxisomal protein POX8, Superoxide dismutase (SOD), Glycosidase, F-box domain-containing protein, and Catalase-peroxidase, exhibited substantial expression levels [18,[20][21][22]85,86].Regarding the Carbohydrate-Active Enzyme (CAZyme) families, the glycoside hydrolase (GH) families predominated, followed by the glycosyl transferase (GT) and auxiliary activity (AA) families in the B. graminis f. sp.tritici Wtn1 genome.These enzymes are essential in breaking down plant cell walls, aiding pathogen entry and colonization, particularly in degrading the complex polysaccharides found abundantly in monocots.The genome of B. graminis f. sp.tritici Wtn1 also revealed peptidase families such as M28, carboxypeptidase Y, subtilisin, and signal peptidase I, important to various physiological processes and consequent fungal nutrition, development, and pathogenesis [87,88].Additionally, chaperones, folding catalysts, and protein kinases suggest their involvement in stress adaptation, temperature resilience, and enzymatic regulation through phosphorylation for activation or deactivation [89].These diverse enzyme families and regulatory elements significantly contribute to fungal adaptation, pathogenicity, and the intricate interactions between the pathogen and its host [85,86,88].
InterProScan analysis of both the genome and transcriptome revealed an expansion of diverse protein families, showcasing functional diversity critical for the fungus to adapt within the host environment [69].The genome also highlighted transport families like ABC transporters, sugar transporters, and Major Facilitator Superfamily (MFS) proteins, which are essential for nutrient and supplement movement within the organism.The ABC transporter R-genes in Fusarium graminearum are associated with azole tolerance and defense against environmental toxic compounds [90].In barley powdery mildew, the MAPK and cAMP signaling pathways regulate various crucial aspects, including appressorium development, growth, disease progression, chemotaxis, virulence, and secondary metabolite production [91].Similarly, in B. graminis f. sp.tritici, diverse biochemical pathways and orthologous functional features contribute to essential metabolites and intermediate compounds necessary for cellular development and infection proliferation.
Multiple studies emphasize the significance of signaling pathways like MAP kinase and cAMP in pathogen growth and disease development [90,92].Combined with the twocomponent system, these pathways play a key role in disease progression and the formation of infectious structures within the host [93].cAMP signaling influences fungal development and infection, while Ras signaling governs pathogenicity and morphogenesis [93][94][95][96].These pathways regulate chemotaxis, phosphorylation-mediated signaling, virulence, and secondary metabolite production [97].Protein domains forming multidomain assemblies contribute significantly to functional diversity [98,99], crucial to the pathogen's virulence, adaptation to the host environment, and disease-causing abilities [100].
The abundance and distribution of variants, particularly the higher occurrence of SNPs on specific chromosomes like 5, 9, and 2 within the Blumeria genome, signify a mosaic of genetic diversity and potentially distinct evolutionary pressures.The observed bias for transitions over transversions (Ts/Tv ratio of 1.63) suggests a preference for specific nucleotide changes, possibly reflecting mutational patterns or selective constraints in the evolutionary trajectory of Blumeria.The prevalence of intergenic variants implies potential regulatory alterations that might influence gene expression, adaptation, or responses to selective forces.The dominance of missense variants indicates potential changes in the protein-coding sequences, possibly contributing to pathotype evolution in Blumeria.Moreover, the predominance of modifier impact variants suggests nuanced alterations in molecular functions, possibly contributing to the fine-tuning of evolutionary strategies related to wheat host interactions.

Conclusions
The genome and transcriptome data of B. graminis f. sp.tritici uncover critical insights into its diverse metabolic pathways, pathogenicity & effector genes, and virulence mechanisms, showcasing potential threats to wheat cultivation.Retro-elements, LTR elements, P elements, diverse protein families, and signaling factors, coupled with abundant tRNA anticodons, likely drive its evolution.Phylogenomic analysis places this isolate in a distinct monocot-infecting clade.Its genetic makeup with SNP clusters, a transition bias, and diverse variants, including impactful missense and modifier variants, reflects its adaptive evolution, influenced by unique pressures and wheat interactions.The abundance of proteins related to RNA processing, post-translational modification, protein turnover, chaperones, and signal transduction pathways presents opportunities for developing effective, long-lasting strategies for managing the powdery mildew pathogen and its threats to the production and quality of wheat in India.

Figure 2 .
Figure 2. (A) (a,b) depict the symptoms and mycological characteristics of wheat powdery mildew pathogen B. graminis f. sp.tritici Wtn1 at hot-spot location (Wellington) in Nilgiris hills of India.(B) (a).Conidiogenesis with airborne conidia of wheat powdery mildew pathogen expressed on leaves, stem, and awns of susceptible cultivar WL 711.(b).Formation of black and round-shaped chasmothecia on leaf surface.(c) Chasmothecium with paraphyses under SEM analysis.

Figure 2 .
Figure 2. (A) (a,b) depict the symptoms and mycological characteristics of wheat powdery mildew pathogen B. graminis f. sp.tritici Wtn1 at hot-spot location (Wellington) in Nilgiris hills of India.(B) (a).Conidiogenesis with airborne conidia of wheat powdery mildew pathogen expressed on leaves, stem, and awns of susceptible cultivar WL 711.(b).Formation of black and round-shaped chasmothecia on leaf surface.(c) Chasmothecium with paraphyses under SEM analysis.

Figure 3 .
Figure 3. (A)Average nucleotide identity (ANI) based on WGS alignment of powdery mildew fungal genera of different crop plants; (B) phylogenetic tree constructed based on WGS alignment of powdery mildew fungal genera of different crop plants.

Figure 4 .
Figure 4. Overview of different components that make up the B. graminis f. sp.tritici Wtn1 ge Majority of the genome composed of non-coding sequences, such as introns and transposab ments, including long interspersed nuclear elements (LINEs), short interspersed nuclear ele (SINEs), and long terminal repeat (LTR) retrotransposons.Only 8.05% of the genome actuall sists of protein-coding regions.

Figure 4 .
Figure 4. Overview of different components that make up the B. graminis f. sp.tritici Wtn1 genome.Majority of the genome composed of non-coding sequences, such as introns and transposable elements, including long interspersed nuclear elements (LINEs), short interspersed nuclear elements (SINEs), and long terminal repeat (LTR) retrotransposons.Only 8.05% of the genome actually consists of protein-coding regions.

Figure 9 .
Figure 9. Eukaryotic orthologous groups of proteins in B. graminis f. sp.tritici Wtn1 genome.egg-NOG was used for functional feature determination.

Figure 9 .
Figure 9. Eukaryotic orthologous groups of proteins in B. graminis f. sp.tritici Wtn1 genome.eggNOG was used for functional feature determination.

Figure 10 .
Figure 10.The identified SNP density plot with respect to chromosomes representing number of SNPs within 0.1 Mb window size in B. graminis f. sp.tritici Wtn1 genome.

Figure 10 .
Figure 10.The identified SNP density plot with respect to chromosomes representing number of SNPs within 0.1 Mb window size in B. graminis f. sp.tritici Wtn1 genome.

Figure 11 .
Figure 11.Identified variants and effect annotation B. graminis f. sp.tritici Wtn1 genome: (a) varia distribution by chromosome, (b) effect by different region, (c) effect by functional class, and (d) effe by impact.

Figure 11 .
Figure 11.Identified variants and effect annotation B. graminis f. sp.tritici Wtn1 genome: (a) variant distribution by chromosome, (b) effect by different region, (c) effect by functional class, and (d) effect by impact.

Table 1 .
Statistics of the assembled whole genome of B. graminis f. sp.tritici Wtn1.

Table 2 .
Statistics of the repeat elements in assembled whole genome of B. graminis f. sp.tritici Wtn1.
features between the B. graminis f. sp.tritici Wtn1 and B. graminis f. sp.triticale THU genomes.The expanded PFAMs in these genomes included fungal protein kinase WD domain, G-beta repeat; the protein kinase domain; RNA recognition motifs (R RBD, and RNP domain); mitochondrial carrier proteins; and helicase conserved C-t nal domains.

Table 3 .
Comparative analysis of CAZyme family profile in B. graminis.

Table 3 .
Comparative analysis of CAZyme family profile in B. graminis.
KEGG: Kyoto Encyclopedia of Genes and Genomes.
KEGG: Kyoto Encyclopedia of Genes and Genomes.
KEGG: Kyoto Encyclopedia of Genes and Genomes.