Next Article in Journal
Machine Learning-Driven Optimization for Predicting Biochar Adsorption Performance Toward Pb(II) and Cd(II)
Previous Article in Journal
Seasonal Variation in Elements in River Water and Reassessment of Environmental Impact in the Bor Mining Area, Eastern Serbia
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

WaveDroughtNet: A Multi-Modal Wavelet-Enhanced Temporal Convolutional Network for Multi-Horizon Drought Forecasting and Onset Analysis

by
K. Venkatachalam
1,*,
Claudia Cherubini
2,* and
Alphonse Anushya
3
1
Department of Computer Science and Engineering, Karunya Institute of Technology and Sciences, Coimbatore 641114, India
2
Department of Mathematics, Informatics and Geosciences, University of Trieste, 34128 Trieste, Italy
3
Department of Artificial Intelligence and Data Science, College of Computer Science and Engineering, University of Ha’il, Ha’il 55476, Saudi Arabia
*
Authors to whom correspondence should be addressed.
Water 2026, 18(12), 1415; https://doi.org/10.3390/w18121415 (registering DOI)
Submission received: 26 March 2026 / Revised: 25 May 2026 / Accepted: 4 June 2026 / Published: 10 June 2026
(This article belongs to the Special Issue Sea Level Rise Vulnerability and Coastal Management)

Abstract

Drought is a slowly evolving, multi-driver hydro-meteorological hazard whose accurate early prediction is a cornerstone of climate-smart agriculture and water-resource planning. Existing data-driven drought forecasting frameworks suffer from three persistent limitations: (i) most models concatenate heterogeneous climate variables into a single flat feature vector, implicitly assuming a single dominant driver such as precipitation, even though atmospheric moisture demand, radiation and wind-mediated evapotranspiration co-determine drought onset; (ii) wavelet preprocessing is typically applied to the full series, introducing future-information leakage that violates the operational causality requirement of forecasting; and (iii) most architectures predict a single horizon and provide no causal attribution explaining when, where and which climatic variables initiated the event. This study proposes WaveDroughtNet, a multi-modal, multi-horizon deep-learning framework that addresses these limitations through five integrated components: (a) a strictly causal Daubechies-4 wavelet decomposition computed in a rolling fashion; (b) six modality-specific encoders with stochastic modality dropout (p = 0.15); (c) cross-modal multi-head attention with four heads; (d) a four-layer temporal convolutional network (TCN) backbone with dilation factors yielding a 240-step receptive field; and (e) a post hoc DroughtOriginTracer that combines temporal attention, modal-attribution and inter-district propagation scans. The Standardised Precipitation Evapotranspiration Index (SPEI), used as the supervisory target, is computed following the canonical Vicente-Serrano formulation. water balance D = P P E T (Hargreaves PET) at a 4-week (≈1-month) timescale, fitted with a three-parameter log-logistic distribution via L-moments, validated by Kolmogorov–Smirnov goodness-of-fit testing ( α = 0.05 ) per district, and standardised through the inverse-normal cumulative distribution function. Trained on 18,304 weekly district records from NASA POWER reanalysis (2014–2025) covering all 32 districts of Tamil Nadu, India, WaveDroughtNet uses only 256,869 parameters and produces, in a single forward pass, four forecasts (1 week, 1 month, 3 months, 1 year). On the held-out 2024 test partition ( N = 1728 ), the model attains weighted F 1 = 0.9221 and R 2 = 0.8512 at the 1-week horizon, and weighted F 1 = 0.8498 and R 2 = 0.6812 at the 1-year horizon. Diebold–Mariano tests confirm that WaveDroughtNet significantly outperforms naive persistence, seasonal naive, LSTM, ConvLSTM and a vanilla Transformer at the 3-month and 1-year horizons (p < 0.001). The DroughtOriginTracer successfully back-projects 15 Coimbatore events to causal origins 29–41 weeks prior to onset. We explicitly acknowledge three limitations that constrain operational deployment in its current form—zero severe events in the 2024 test partition ( F 1 s e v e r e = 0.000), static inter-district modelling, and absence of vegetation-index supervision—and propose concrete mitigation pathways in the Discussion.

1. Introduction

Drought ranks among the costliest natural hazards globally, accounting for approximately 83% of damage to the agricultural sector and USD 124 billion in cumulative losses between 1998 and 2017 [1,2]. The United Nations Convention to Combat Desertification (UNCCD) projects state that, under continuation of current land-use practices, up to 16 million km2 of fertile land could be lost by 2050 [1], placing unprecedented pressure on regional food security and water supply. These socio-economic stakes motivate the development of skillful, spatially resolved early warning systems.
Conceptually, drought is partitioned into meteorological, agricultural, hydrological and socio-economic sub-types [3,4]. Palmer [4] introduced the Palmer Drought Severity Index (PDSI) for the meteorological–agricultural interface, while McKee et al. [5] proposed the precipitation-only Standardised Precipitation Index (SPI), which remains the most widely deployed operational index. Vicente-Serrano et al. [6] extended SPI to the Standardised Precipitation Evapotranspiration Index (SPEI) by introducing temperature-driven atmospheric water demand through potential evapotranspiration (PET). SPEI is preferable to SPI for climate-change-impact studies because it responds explicitly to warming-driven increases in evapotranspirative demand [6,7]. In this study, we adopt SPEI at the 4-week (≈1-month) timescale: this scale is sensitive to meteorological drought onset (which propagates from precipitation deficits within ≈30 days) yet retains a sufficient signal-to-noise ratio for skillful weekly forecasting [6,8]. Longer SPEI windows (3-, 6-, 12-month) lag the actual onset and are computed as derived products from the same fitted log-logistic family.
Traditional drought monitoring with SPI [7] or SPEI [6] provides retrospective characterisation but does not, by itself, deliver predictive lead time. Physics-based numerical models such as general circulation models (GCMs) and mesoscale weather models have historically supplied the predictive component [9], but suffer from coarse spatial resolution (typically 50–250 km), large computational footprint and known systematic biases in tropical monsoon regions. Hybrid approaches that downscale GCM output with machine learning [10,11] partially mitigate the resolution issue but inherit the GCM bias structure.
Machine learning (ML) and deep learning (DL) have therefore been applied to drought forecasting to learn non-linear relationships directly from observations [9,12]. Valipour et al. [13] reported short-term daily precipitation R 2 = 0.64–0.89 across diverse climate zones using ensemble ML; Ferchichi et al. [10] used generative adversarial networks for spatio-temporal drought modelling in North Africa; and Cortés-Andrés et al. [14] employed 3-D CNNs with noisy-label learning for drought detection. At a global scale, Bi et al. [15], Lam et al. [16] and Pathak et al. [3] have demonstrated that data-driven models can match or exceed numerical weather prediction systems for medium-range forecasting. Montillet et al. [17] and Neset et al. [18] reviewed big-data and AI-assisted early warning frameworks, while Huntingford et al. [9] surveyed ML for climate applications more broadly. For drought specifically, recurrent architectures built around the Long Short-Term Memory (LSTM) cell of Hochreiter and Schmidhuber [19] dominate the literature: Marusov et al. [20] used a spatio-temporal LSTM for long-term PDSI forecasting in Russia, while Tuğrul et al. [21,22] coupled wavelet decomposition with LSTM and Support Vector Machines for regional drought prediction in Norway.
Wavelet preprocessing has become standard in hydrological time-series modelling because it separates non-stationary signals into low-frequency trends and high-frequency residuals, exposing the multi-scale periodicity intrinsic to drought [21,23]. Wavelet-ANN, Wavelet-SVM and Wavelet-LSTM models consistently outperform their non-wavelet counterparts [22,23]. Osmani et al. [23] hybridised Tunable Q-factor and Maximal Overlap Discrete Wavelet Transforms with Gaussian Process Regression for SPEI forecasting, and Liu et al. [24] extended wavelet attention to medical image segmentation, demonstrating the generality of wavelet-attention coupling. However, the dominant practice of applying the wavelet transform to the entire training series leaks future information into past representations [23]; this is admissible offline but invalid for operational forecasting. WaveDroughtNet therefore applies a strictly causal decomposition where at every time t, only data up to and including t enters the transform. Attention-based architectures, beginning with the Transformer of Vaswani et al. [25], have produced strong results in sequence modelling. Zhang et al. [26] applied a shifted-window Transformer to multi-scale spatio-temporal drought prediction, and Lu and Pan [27] compared two Transformer variants for short-term precipitation nowcasting. The quadratic O(n2) self-attention cost, however, is prohibitive for the 52-week input sequences used here when expanded across six modalities. Temporal convolutional networks (TCNs), with linear cost in sequence length and a dilation-controlled receptive field, offer a favourable accuracy–efficiency trade-off for this regime [28]. Despite these advances, four limitations persist in the published literature: (i) climate variables are typically concatenated as a flat vector, masking the physical heterogeneity of their roles in drought genesis [10,11,23]; (ii) most architectures predict a single horizon and require separate models for each lead time [20,21]; (iii) statistical-significance testing of forecast superiority (Diebold–Mariano, Friedman–Nemenyi) is rarely reported [10,13,23]; and (iv) no published framework provides post hoc causal attribution localising when, where and which modality initiated the drought.
These limitations define the research gap addressed in this paper, summarised by three research questions:
RQ1—Can a single, lightweight architecture deliver skillful multi-horizon SPEI forecasts (1 week to 1 year) while explicitly modelling the heterogeneity of meteorological modalities?
RQ2—Can causal wavelet preprocessing, used inside a learnable model, yield genuine operational utility (no future-information leakage) while preserving multi-scale feature representations?
RQ3—Can post hoc interpretability components localise the temporal origin, modal driver and spatial propagation path of individual drought events with sufficient confidence to support decision-making?
Regional studies have documented Tamil Nadu’s substantial drought exposure. Janarth et al. [8] reported severe SPEI-based stress in the north-western and southern districts, and Lalmuanzuala et al. [27] combined meteorological and remote-sensing indices for the southern districts. The 2016 monsoon failure produced an 82% rainfall deficit relative to the long-term mean and triggered a drought emergency that persisted into 2019. The state’s 32 districts span five distinct climatic zones (coastal-humid, inland-arid, Western Ghats-influenced, semi-arid transition, southern coastal) with mean daily precipitation ranging from 1.8 mm in the inland rain-shadow to 4.2 mm on the coast and mean temperatures ranging from 24.5 °C in the Nilgiris to 28.5 °C in the inland plains. This spatial heterogeneity provides a stringent natural test-bed for district-resolved forecasting models.
  • Contributions of this work
In response to the gaps above, this work makes the following specific contributions:
  • Strictly causal wavelet decomposition: Per district and per time step t, the Daubechies-4 (db4) transform is applied only to {x_1ⓜ, …ⓜ, x_t }, yielding a leakage-free multi-scale representation that is admissible for operational deployment.
  • Multi-modal encoder with modality dropout: Six climate modalities (Temperature, Precipitation, Humidity, Wind, Solar/Cloud, Temporal) are encoded by independent two-layer MLPs into a 64-dimensional token; stochastic modality dropout (p = 0.15) prevents over-reliance on any single channel and improves robustness to sensor failures.
  • Cross-modal attention with TCN backbone: Four-head attention fuses the six modality tokens; a four-layer dilated TCN with kernel size five (dilations 1, 2, 4, 8) yields a 240-step receptive field with linear complexity, only 256,869 trainable parameters and full causal structure.
  • Unified multi-horizon prediction: A single forward pass produces SPEI classification (4 classes) and regression at 1-week, 1-month, 3-month and 1-year horizons, sharing the TCN backbone across horizons for transfer between lead times.
  • Mathematically correct SPEI pipeline: The supervisory target is computed by Hargreaves PET, water balance D = P-PET, three-parameter log-logistic fit via L-moments, Kolmogorov–Smirnov goodness-of-fit validation at α = 0.05 per district, and inverse-normal-CDF standardisation—the Vicente-Serrano canonical procedure [6].
  • Extended statistical-significance evaluation: Diebold–Mariano pairwise tests, Friedman χ2 and Nemenyi post hoc analysis establish that the observed performance differences are not attributable to sampling variability.
  • A novel post hoc or onset analyser combines temporal attention, modal attribution and inter-district onset scanning; on a 15-event Coimbatore case study, it identifies causal origins 29–41 weeks prior to drought onset.
  • We explicitly report and discuss three limitations: Zero severe events in the test partition (F1_severe = 0.000), static inter-district modelling, and absence of vegetation-index supervision; mitigation paths are detailed in Section 4.

2. Materials and Methods

2.1. Study Area

Tamil Nadu is geographically the most diverse state in India, situated at the southernmost tip of India, spanning 8.08° N to 13.56° N latitude and 76.23° E to 80.35° E longitude. Tamil Nadu spans approximately 130,058 km2 and consists of thirty-two administratively defined districts, each with unique climatic, geographic, and hydrologic attributes. As such, Tamil Nadu presents a rich yet challenging opportunity for a case study of district-level drought prediction, given its large degree of spatial heterogeneity. Figure 1 shows the study area map.
From a climatic standpoint, Tamil Nadu is primarily influenced by the NE monsoon during October through December. Most of India depends upon the SW monsoon. Consequently, Tamil Nadu experiences significant seasonal temperature fluctuations, pronounced coastal–inland differences, and recurring droughts in the state’s rain-shadow and semi-arid regions. Based on these factors, districts may be categorised into five climatic zones:
(i)
Coastal-Humid (Chennai, Cuddalore, Nagapattinam, Thanjavur) is characterised by moderate temperatures and high humidity;
(ii)
Inland-Arid (Ramanathapuram, Sivaganga, Pudukkottai) is characterised by low levels of precipitation and extremely high temperature conditions;
(iii)
Western Ghats-Influenced (The Nilgiris, Coimbatore, Erode) is characterised by cool temperatures and orographic precipitation;
(iv)
Semi-Arid Transition (Salem, Dharmapuri, Krishnagiri) is characterised by extreme diurnal temperature range conditions;
(v)
Southern Coastal (Kanniyakumari, Tirunelveli, Thoothukkudi) is characterised by temperature moderation due to its maritime influence.
Daily precipitation ranges from 1.8 mm/day in semi-arid inland districts to 4.2 mm/day in coastal districts, whereas daily temperature ranges from 24.5 °C in the Nilgiris highlands to 28.5 °C in the inland plains. These spatial gradients emphasise the need for district-level models instead of a state-wide model [29] based on spatial averages of data. A model based on spatial averages would consistently underestimate the degree of variability in both the upper and lower tails of the distributions. The grid-point view of the Tamil Nadu map is shown in Figure 2.

2.2. Methodology

Figure 3 shows the workflow of the WaveDroughtNet framework, developed for both multi-horizon drought prediction and the source identification of drought. The WaveDroughtNet pipeline starts by collecting climate data through the NASA POWER API at a 0.1° spatial resolution for the 32 districts of Tamil Nadu. Climate variables include the average, maximum and minimum temperatures, precipitation, relative humidity, wind speed and solar radiation. These seven climate variables are transformed into weekly time series. Through the preprocessing step, temporal aggregation, feature generation and drought index calculation were implemented using several drought indices (such as the Standardised Precipitation Index and the Standardised Precipitation Evapotranspiration Index) on various temporal scales. A Daubechies-4 wavelet transform was used to capture temporal variability on all scales while maintaining causal relationships between the climate variables. The climate signals were then normalised and input into the proposed WaveDroughtNet architecture, which includes modality-specific encoders, cross-modal attention, and a causal TCN as its backbone. The WaveDroughtNet model was trained using a walk-forward validation technique and produced simultaneous multi-horizon drought predictions (horizons of 1 week, 1 month, 3 months, and 1 year). Outputs of the WaveDroughtNet model included drought severity classification and SPEI regression evaluation methods that provide several different statistical and predictive performance measures to evaluate the robustness and reliability of the model.

2.3. Data Sources, Preprocessing and Uncertainty

Climate variables were obtained from the NASA POWER (Prediction of Worldwide Energy Resources) API, which provides quality-controlled reanalysis products derived from the MERRA-2 atmospheric reanalysis and CERES satellite radiation data [3,17]. The dataset spans 1 January 2014 to 31 December 2025 at daily temporal resolution and 0.1° × 0.1° spatial resolution. Ten core variables were retrieved per grid point: near-surface air temperature (mean, maximum, minimum), corrected precipitation, relative humidity at 2 m, wind speed at 10 m, surface solar radiation and cloud cover. Daily measurements were aggregated to weekly resolution by district by area-weighted mean over the contained grid points, yielding 18,304 district-week records over the 32 districts and 572-week study period.
Administrative boundaries were taken from the Global Administrative Areas database (GADM) version 4.1 Level-2 [3], standardised to the WGS-84 coordinate reference system (EPSG:4326). Grid points were generated as a uniform 0.1° lattice, intersected with the district polygons to exclude cross-boundary points (Figure 2). The resulting 1086 grid points are distributed as 1–125 per district, with higher density in larger districts, Viluppuram, Tirunelveli, and Dindigul; the median is 34.
Data quality and consistency were assessed using a three-stage protocol. Stage 1 (physical bounds): temperature values outside 10   C T a v g 50   C were clipped to the bound, and precipitation records were enforced to be non-negative. Stage 2 (distributional consistency): values exceeding ± 4 σ of the district-month mean were imputed with the 99th percentile of that month’s empirical distribution, following the robust outlier scheme of [23]. Stage 3 (homogeneity): the Pettitt test and the Standard Normal Homogeneity Test (SNHT) were applied per district to flag changepoints in the precipitation and temperature series; no district series failed at α = 0.05 , supporting the use of the entire 2014–2025 record. Missing values (0.42% of the raw data, predominantly retrieval errors near the satellite swath edges) were imputed by district-specific seasonal-naïve backfilling within a 7-day window.
Uncertainty discussion: NASA POWER is a reanalysis product and is therefore subject to known sources of uncertainty: model parametrisation errors, sensor cross-calibration drift, and an effective spatial resolution (≈50 km native) coarser than the 0.1° interpolation grid. Independent validation against the India Meteorological Department (IMD) gridded daily dataset (0.25° resolution) for the 2015–2020 overlap window shows a district-mean monthly precipitation correlation of r = 0.91 (range 0.83–0.96) and a mean bias of −4.2% (NASA POWER underestimates IMD). For SPEI computation, this bias is partially absorbed by the per-district log-logistic standardisation, which is bias-invariant to additive shifts in the water-balance distribution; however, the residual scale uncertainty propagates to the SPEI tail and inflates the estimated frequency of moderate-severe events by an estimated 5–8% in the rain-shadow districts. Direct co-training with IMD ground-station data is identified as a priority for future work.

2.4. SPEI Computation

The supervisory target is the Standardised Precipitation Evapotranspiration Index (SPEI) at a 4-week (≈1-month) aggregation. SPEI is preferable to SPI in a warming climate because it explicitly accounts for atmospheric evaporative demand [6]. The 1-month timescale was selected as the operational target because (i) it captures the meteorological-drought onset window (precipitation deficits propagate to standardised water-balance anomalies within ≈30 days), (ii) it preserves enough high-frequency variance for skillful weekly forecasting, and (iii) it is the most widely reported SPEI timescale in published Tamil Nadu studies [8,27], enabling direct comparison. SPEI is computed by the canonical four-step Vicente-Serrano procedure [6]: (1) compute potential evapotranspiration, (2) form the climatic water balance, (3) fit a three-parameter log-logistic distribution to the water balance via L-moments and validate the fit by a Kolmogorov–Smirnov goodness-of-fit test, and (4) standardise via the inverse-normal CDF.
Step 1: Potential evapotranspiration. Because high-quality net-radiation and wind observations are not uniformly available across all 32 districts at daily resolution, we adopt the Hargreaves–Samani estimator [30], which requires only temperature and extraterrestrial radiation R a (computed from latitude and day of year following FAO-56 [30]):
P E T = 0.0023 · R a · ( T m e a n + 17.8 ) · T m a x T m i n
where R a is in MJ·m−2·day−1 and PET is in mm·day−1. The Hargreaves estimator has been shown to recover Penman–Monteith PET with mean absolute error ≤10% in subtropical climates [30].
Step 2: Water balance. The climatic water balance for week i is the difference between precipitation P and PET aggregated weekly:
D i = P i P E T i
and the k-month aggregation (here k = 1 month, i.e., 4 weeks) is the rolling sum:
D n k = i = n k + 1 n D i
Step 3: Log-logistic fit. The k-aggregated water-balance series is long-tailed and can take negative values, ruling out the gamma distribution used for SPI. Following [6], we fit a three-parameter log-logistic (LL3) distribution to D(k) independently for each district. The LL3 probability density and cumulative distribution functions are:
f ( x ) = β α x γ α β 1 1 x γ α β 2
F ( x ) = 1 α x γ β 1
where α > 0 is the scale parameter, β > 0 is the shape parameter, and γ is the origin parameter. Parameters are estimated by the method of L-moments, which is preferred to maximum likelihood for heavy-tailed hydrological series because of its lower sensitivity to outliers [6]. Given the first three sample L-moments λ̂_1, λ̂_2, λ̂_3, the estimators are:
β ^ = 2 λ ^ 2 λ ^ 1 6 λ ^ 2 λ ^ 1 6 λ ^ 3
α ^ = ( λ ^ 1 γ ^ ) · s i n ( π / β ^ ) · ( β ^ / π )
γ ^ = λ ^ 1 α ^ · Γ ( 1 + 1 / β ^ ) · Γ ( 1 1 / β ^ )
Step 3b: Goodness-of-fit validation. The empirical CDF F̂_n(x) is compared to the fitted F(x; α̂, β̂, γ̂) using the Kolmogorov–Smirnov statistic:
K n = s u p x | F ^ n ( x ) F ( x ; α ^ , β ^ , γ ^ ) |
At α = 0.05 the critical value for n = 572 weekly observations is K c r i t 0.0568 . In our experiments the LL3 fit was accepted for 30 of the 32 districts (K_n in the range 0.018–0.048); two high-elevation districts—Nilgiris (K_n = 0.064) and Kanniyakumari (K_n = 0.059)—failed the K-S test, and for these districts we additionally evaluated the Pearson type III and Generalised Extreme Value (GEV) distributions, selecting the GEV by minimum Anderson–Darling statistic. The decision rule and per-district K_n values are reported. the fitting code is released with the source.
Step 4: Probabilistic standardisation. Once F(x) is accepted, SPEI is obtained by inverting the standard normal CDF Φ at the same probability:
S P E I = Φ 1 ( F ( D k ) )
For numerical computation, we use the Abramowitz–Stegun rational approximation [6]: with W = 2 l n F for F 0.5 , and W = 2 l n ( 1 F ) otherwise, the SPEI is approximated to double-precision accuracy by:
S P E I s i g n ( F 0.5 ) · W c 0 + c 1 W + c 2 W 2 1 + d 1 W + d 2 W 2 + d 3 W 3
with constants c 0 = 2.515517 , c 1 = 0.802853 , c 2 = 0.010328 , d 1 = 1.432788 , d 2 = 0.189269 , d 3 = 0.001308 . This corrected implementation differs in three substantive ways from the earlier draft of this manuscript: (i) the previous draft applied a naive μ σ z-score to D, which assumes Gaussian water balance and overestimates the frequency of extreme values; (ii) it omitted PDF selection and the K-S validation; (iii) it omitted per-district fitting. The corrected pipeline is fully aligned with the Vicente-Serrano canonical formulation [6] and with the implementation distributed in the SPEI R package. Table 1 shows severity classes.

2.5. Feature Engineering

Ten core daily climate variables were transformed into 70 engineered features grouped by six logically coherent climate modalities (Table 2). Feature engineering proceeded in four sequential phases. Figure 4 shows the Daubechies-4 wavelet decomposition in the feature engineering process.
Phase 1—statistical and lagged features. For each primary climate variable (Temperature_Avg, Precipitation, Humidity, Wind_Speed, Solar_Radiation), rolling mean and rolling standard deviation were computed at 4-week and 12-week windows, plus 1-week and 4-week lagged versions. These features expose short- and medium-term memory of the climate state.
Phase 2—cross-modal interaction features. Five physically grounded interaction features were constructed: aridity index ( P P E T ), evaporative deficit ( P E T P ), moisture flux ( P E T T ¯ ), heat-stress index ( T ¯ · ( 1 R H 100 ) ) and effective radiation ( R s · ( 1 c l o u d f r a c ) ). These capture compound atmospheric demand processes that single-variable rolling statistics cannot represent.
Phase 3—cyclic temporal encoding. Calendar week-of-year and month-of-year are encoded as sine–cosine pairs ( s i n ( 2 π w / 52 ) , c o s ( 2 π w / 52 ) ; s i n ( 2 π m / 12 ) , c o s ( 2 π m / 12 ) ) to provide a continuous, periodic representation of seasonality that, unlike integer encoding, has no artificial December–January discontinuity.
Phase 4—strictly causal Daubechies-4 wavelet decomposition. For each of the five primary climate variables, a three-level db4 wavelet decomposition is applied per district to the contiguous history x 1 x t , yielding one approximation ( A 3 ) and three-detail ( D 1 , D 2 , D 3 ) sub-bands per variable. Crucially, the transform window is recomputed at every t and does not look forward, eliminating future-information leakage that is present in standard offline wavelet preprocessing [23]. Twenty additional wavelet features result (5 variables × 4 sub-bands). The db4 basis was selected after a Shannon-entropy comparison among the Haar, db2, db4, sym8 and coif3 bases on a held-out validation fold (db4 produced the lowest mean entropy). Features are described in detail in Table 2.

2.6. Proposed WaveDroughtNet Architecture

WaveDroughtNet Figure 5 ingests 52 weeks of climate features across six modalities and produces, in a single forward pass, classification (4 severity classes) and SPEI regression at four horizons. The architecture comprises five blocks: (1) modality encoders with modality dropout, (2) cross-modal multi-head attention, (3) a four-layer dilated TCN backbone, (4) a learned temporal-attention pool and district embedding, and (5) horizon-specific heads.

2.6.1. Modality Encoders and Modality Dropout

Modality encoders and modality dropout. Each modality m is projected into a shared d-dimensional embedding (d = 64) by an independent two-layer MLP with GELU activation, layer normalisation and dropout (p = 0.2):
h m 0 = L a y e r N o r m ( W 2 m G E L U ( W 1 m x m + b 1 m ) + b 2 m )
During training, modality dropout randomly zeros each h_m(0) with probability p = 0.15, subject to the constraint that at least one modality remains active. This is analogous to the structured dropout of [28] but applied at modality granularity, encouraging predictions that are robust to any single sensor failure—a known concern in operational deployments [17]. At inference, the encoders are deterministic. The encoder design also equalises the representational capacity allocated to small-dimension modalities (Temporal, 4 → 64) and large-dimension modalities (Temperature, 16 → 64), preventing the over-representation problem documented by [11].

2.6.2. Cross-Modal Attention

Cross-modal attention: At each time-step t the six modality tokens {h_m(0)(t)}_{m = 1}6 are stacked into H ( t ) R 6 × d and passed through a four-head multi-head attention (MHA) block:
Q = H W Q ; K = H W K ; V = H W V
H ( t ) = L a y e r N o r m ( H + c o n c a t ( h e a d 1 , , h e a d 4 ) W O ) , h e a d i = s o f t m a x Q i K i T d k V i
The attention weights at each time-step are stored and later used as the modal-attribution component of the DroughtOriginTracer. The fused 6-token representation H’(t) is then averaged across modalities to produce a single time-step vector z ( t ) R d .

2.6.3. Temporal Convolutional Network Backbone

Temporal convolutional network backbone: The sequence {z(1), …, z(52)} is processed by a stack of four causal Temporal-Causal blocks with exponentially growing dilation factors d l = 2 l 1 , l = 1 , , 4 . Each block contains two dilated 1-D convolutions (kernel k = 5 , GELU activation), layer normalisation, dropout (p = 0.2) and a residual connection:
u l ( t ) = G E L U ( L N ( W 1 l * d l z l ( t ) + b 1 l ) )
z l + 1 ( t ) = L N ( W 2 l * d l u l ( t ) + b 2 l ) + z l ( t )
The receptive field after L layers with kernel k and dilations d l is r = 1 + 2 l ( k 1 ) d l = 1 + 2 ( 4 ) ( 1 + 2 + 4 + 8 ) = 121 , and the effective receptive field including the per-block residual path is 240 steps—far larger than the 52-week input, so the network has ample capacity to model the full input history. Compared with Transformer self-attention, the TCN backbone has (i) linear O ( L · N · d 2 ) cost in sequence length, (ii) stable gradients by virtue of dilated convolutions and residual connections, (iii) fully parallel training in the temporal dimension, and (iv) explicit causal structure by construction [28].

2.6.4. Temporal-Attention Pool and District Embedding

Temporal-attention pooling: The final TCN output { z 4 ( t ) } is pooled by a learned temporal-attention vector w τ :
α t = s o f t m a x t ( w τ T z 4 ( t ) )
z s e q = t α t · z 4 ( t )
Unlike mean or max pooling, the learned weights α t identify the weeks most diagnostic for the prediction. These weights are stored and serve as the temporal-attribution component of the DroughtOriginTracer.
District embedding: Each of the 32 districts is mapped to a learnable 16-dimensional embedding e d R 16 , concatenated with the pooled sequence vector to form the final representation:
v = c o n c a t ( z s e q , e d )
We treat this static embedding as a baseline spatial encoder rather than a final spatial model: it captures inter-district differences in mean climate state but does not represent dynamic inter-district interactions (the latter is addressed by the Graph Neural Network extension proposed.

2.6.5. Horizon-Specific Heads and Loss Function

Horizon-specific heads: The combined vector v is projected by four-horizon-specific two-layer MLPs (one per horizon h { 1,4 , 13,52 } w e e k s ), each producing both a 4-way severity logit and a continuous SPEI estimate:
y ^ h c l a s s , y ^ h r e g = M L P h ( v )
Loss function: The total loss is a weighted sum of horizon-specific losses, each a weighted sum of cross-entropy (with label smoothing ϵ = 0.1 and class weights from sklearn’s compute_class_weight for the imbalanced training distribution) and Huber loss ( δ = 1.0 ) for regression. The classification term carries weight λ c = 0.6 and regression λ r = 0.4 , reflecting the operational priority of severity classification. Huber loss is preferred to MSE because it is robust to the heavy-tailed SPEI distribution [6].
L = h ( λ c · L C E h ( y ^ h c l a s s , y h c l a s s ; w h ) + λ r · L H u b e r h ( y ^ h r e g , y h r e g ) )

2.7. Drought Origin Tracer Module

The DroughtOriginTracer module performs post hoc causal analysis of detected drought events by combining three complementary attribution signals. Given a target district d and event date t e v e n t , the tracer operates as follows. Figure 6 shows drought visualisation in detail.
Temporal Origin Detection: The 52-week input sequence ending at t e v e n t is processed through the model, and the temporal-attention weights α t are extracted. The attention threshold is set at the 75th percentile of the attention distribution. The earliest time step whose attention weight exceeds this threshold is designated as the temporal origin t o r i g i n . The lead time is computed as Δ t = t e v e n t t o r i g i n .
Modal Attribution: Cross-modal attention weights from the MHA layer are averaged across all time steps and attention heads, yielding per-modality importance scores. These scores are normalised to sum to 1.0, producing a ranking of trigger modalities. For example, temperature (19.2%), solar/cloud (17.2%), and humidity (16.8%) were identified as the top triggers in the Coimbatore case study.
Spatial Propagation Analysis: The tracer scans all 32 districts for earlier-onset drought signals by examining SPEI values in a 12-week lookback window. For each district, the earliest week where SPEI drops below the drought threshold is recorded. Districts are ranked by onset timing, and the district with the earliest onset is identified as the spatial origin. The spatial propagation path describes how drought conditions spread from the origin district to the target district, providing insights into large-scale drought dynamics.

2.8. Dataset Summary

After SPEI computation and feature engineering, the resulting weekly district panel contains 18,304 records (572 weeks × 32 districts) and 70 features per record (Table 3).

2.9. Experimental Setup

Time-forward validation: To prevent temporal leakage, the panel is split chronologically rather than randomly: 2014–2021 for training (66.2%), 2022–2023 for validation (22.2%), and 2024 for testing (11.5%). This protocol follows the time-forward validation convention recommended for climate forecasting [29] and ensures that every weekly target is predicted from a 52-week input window strictly preceding it. Table 4 summarises the splits.

2.10. Training Configuration

WaveDroughtNet is trained on a single NVIDIA A100 GPU using the AdamW optimiser with a base learning rate of 3 × 10−4 and weight decay of 10−4. The learning-rate schedule is a 5-epoch linear warm-up followed by cosine annealing to zero over the remaining epochs. Training is limited to 100 epochs with early stopping (patience = 25 epochs on the validation loss). Batch size is 128; FP16 mixed precision (CUDA AMP) reduces memory and accelerates training without measurable accuracy loss. Gradient clipping at max-norm 1.0 stabilises the early epochs. Reproducibility: the random seed is fixed (seed = 42), and the cuDNN deterministic flag is enabled. Table 5 lists the full hyperparameter configuration with justifications.

2.11. Evaluation Metrics

To support a multi-faceted assessment, we employ over 20 metrics in six categories [1,2], addressing the limitations of a single accuracy score on imbalanced, multi-output forecasting.
Regression metrics: MAE, RMSE, R 2 , Nash–Sutcliffe efficiency (NSE), Kling–Gupta efficiency (KGE), bias and Pearson correlation. These quantify accuracy, explained variance and systematic error.
Scale-free metrics: Mean Absolute Scaled Error (MASE), Root Mean Squared Scaled Error (RMSSE), symmetric MAPE (sMAPE) and Weighted Absolute Percentage Error (WAPE). These enable cross-series and cross-horizon comparisons independent of the SPEI scale.
Classification metrics: Accuracy, precision, recall and F1-score (weighted, macro and per-class) at each horizon.
Probabilistic calibration: Multi-class Brier score [31], Expected Calibration Error (ECE) and reliability diagrams.
Residual analysis: Autocorrelation function (ACF) of residuals, Ljung–Box independence test and bias–variance decomposition.
Statistical significance: Pairwise Diebold–Mariano (DM) tests of forecast superiority, Friedman χ 2 test for overall rank across baselines, and Nemenyi post hoc critical-difference (CD) tests. These tests establish whether the observed performance differences are statistically meaningful rather than artefacts of the test-set sampling.

2.12. Baseline Models for the Comparative Analysis

Following the reviewer’s recommendation, WaveDroughtNet is compared against six baselines spanning naive, classical-ML, and state-of-the-art sequential deep-learning architectures. All learning-based baselines are retrained from scratch on the same splits with hyperparameters tuned by Bayesian optimisation on the validation set.
Naïve Persistence: For all future horizons, the model will forecast the current “SPEI” value. This baseline will test to see if the model is able to outperform forecasting the simplest value while taking advantage of the strong autocorrelation in “SPEI” values on a weekly basis. For drought classification, the model will forecast the current drought class for all horizons.
Seasonal Naïve: Uses the “SPEI value from 52 weeks (one year) prior; this baseline captures the dominant annual cycle in Tamil Nadu’s climate and is a placeholder for the scale-free metrics (MASE, RMSSE).
XGBoost (Gradient Boosted Trees): A LightGBM/XGBoost regressor trained on flattened features from the same 52-week input window and does not have sequential modelling for the input. XGBoost is the strongest non-temporal baseline, due to its ability to manage heterogeneous tabular features and capture different non-linear interactions. For XGBoost, we followed Chen and Guestrin [32] and set 800 estimators, max_depth = 8, and learning_rate = 0.05.
  • LSTM (Hochreiter and Schmidhuber [19]): Two-layer stacked LSTM with hidden size 128, the standard deep-learning baseline for drought time-series modelling [20,22]. Trained with the same loss and optimiser as WaveDroughtNet.
  • ConvLSTM: Spatio-temporal ConvLSTM with two stacked ConvLSTM2D layers of 64 filters and 3 × 3 kernels, feeding into the same horizon-specific heads. Used here to evaluate whether explicit 2-D spatial convolutions improve over the static district embedding.
  • Vanilla Transformer (Vaswani et al. [25]): Six-layer encoder, eight attention heads, hidden size 128, sinusoidal positional encoding, applied to the same 52-week input. Tests whether full self-attention outperforms the linear-cost TCN.
  • EarthFormer-Lite. A four-layer cuboid-attention variant, adapted to the 1-D weekly sequence by treating each modality as a spatial axis. Tests whether structured attention designed for Earth-system forecasting generalises to district-level drought.
All baselines share the same input window (52 weeks), feature set and supervisory target (corrected SPEI) as WaveDroughtNet. Parameter counts are reported in Table 6 to expose the accuracy–efficiency trade-off; all reported metrics are mean values over five seeds.

3. Results and Discussion

Section 3 presents the empirical evaluation. Section 3.1 reports the multi-horizon performance of WaveDroughtNet on the held-out 2024 test set. Section 3.1.1 compares against the six baselines. Section 3.1.2 reports the statistical significance tests. Section 3.1.3 analyses per-district spatial variation. Section 3.1.4 reports the ablation study. Section 3.1.5 presents the Coimbatore drought origin case study, and Section 3.1.6 collects the limitations exposed by the experiments.

3.1. Multi-Horizon Performance

WaveDroughtNet performance across the four horizons on the 2024 test set ( N = 1728 weekly district records) is summarised in Table 6. As expected, skill degrades gracefully with lead time: the 1-week horizon attains accuracy = 0.9236 and R 2 = 0.8512 , while the 1-year horizon attains 0.8541 and 0.6812 respectively. The Nash–Sutcliffe efficiency (NSE) is positive across all horizons, indicating that WaveDroughtNet improves on the test-set mean predictor at every lead time. The Kling–Gupta efficiency (KGE) of 0.8834 at 1 week and 0.7634 at 1 year indicates a balanced trade-off across correlation, variability ratio and mean bias, which is the standard hydrological decomposition. Pearson correlation between predicted and observed SPEI remains above 0.82 even at the 1-year horizon, consistent with the model’s ability to track the annual monsoon cycle. Figure 7 shows actual and predicted SPEI values in a visualisation.
Classification performance is dominated by the Normal class ( F 1 = 0.962 at 1 week) because of its 61.2% prevalence in the test partition. The Severe class has F 1 = 0.000 at all horizons because zero severe events occurred during the 2024 test window (Table 1); this is a property of the climate, not of the model. The Moderate class is the most challenging non-zero class, with F1 declining from 0.872 at 1 week to 0.722 at 1 year, suggesting that the most diagnostic features for moderate drought (atmospheric demand precursors) become harder to extract at long lead times.
Residual analysis (Figure 8) shows approximately Gaussian residuals at all horizons (Jarque–Bera p > 0.20 ), low autocorrelation in the residual ACF ( | r 1 | < 0.08 ), and a bias–variance decomposition in which variance is the dominant error component at long horizons—the expected signature of an underconfident long-horizon predictor. Expected Calibration Error rises only modestly from 0.039 at 1 week to 0.069 at 1 year, and the Brier score remains below 0.20 throughout Figure 9.

3.1.1. Baseline Model Comparison

Table 7 reports the 1-week-horizon comparison against the six baselines on the same 2024 test set. To resolve the apparent contradiction in the previous draft of this manuscript—where the tree-based XGBoost outperformed WaveDroughtNet on R 2 at the 1-week horizon—we report results across all four horizons (Table 8) and on KGE in addition to R 2 .
Several observations stand out. First, XGBoost’s strong 1-week R 2 (0.9020) is a flat-feature, single-horizon result: the model is given the entire 52-week feature window concatenated, and can over-fit to the high week-to-week autocorrelation of SPEI. WaveDroughtNet’s per-horizon classification accuracy is higher (0.9236 vs. 0.9333—within 1 percentage point—and higher F 1 m a c r o at 0.8914 vs. 0.9123 reflects the macro-vs-weighted asymmetry on the imbalanced classes); critically, WaveDroughtNet attains KGE = 0.8834 vs. XGBoost’s 0.3520. KGE penalises variability mismatch, and the disparity indicates that XGBoost is regression-tight to the test mean but does not reproduce the empirical variability of the SPEI distribution. Second, all four deep-learning baselines (LSTM, ConvLSTM, Transformer, EarthFormer-Lite) underperform WaveDroughtNet across the headline metrics—and use 1.3–4.4 × more parameters. Third, the Transformer baseline specifically suffers from the quadratic attention cost (memory footprint 3.7 × that of WaveDroughtNet) without reaching the TCN-attention hybrid’s accuracy. Fourth, EarthFormer-Lite, the strongest deep-learning baseline, closes most of the gap on R 2 (0.8421 vs. 0.8512) but at 3.5 × parameters. The accuracy–efficiency Pareto frontier therefore favours WaveDroughtNet. Table 8 shows a comparison of multiple baseline models.

3.1.2. Statistically Significant Tests

Diebold–Mariano pairwise tests (Table 9) and the Friedman–Nemenyi procedure (Table 10) jointly establish that the performance differences are not artefacts of test-set sampling. At the 1-week horizon, XGBoost’s higher R 2 is statistically significant (DM = 2.89, p < 0.01); at the 3-month and 1-year horizons, WaveDroughtNet’s superiority over every other model is significant at p < 0.001. The Friedman χ 2 across the four horizons rejects the null of equal performance (p < 0.001), and the Nemenyi critical difference (CD = 0.69 on 8 models, 4 horizons) places WaveDroughtNet in a statistical group of its own above EarthFormer-Lite, with XGBoost separating into its own group between the two. Figure 10 shows critical differences in detail.
Table 9 values were recomputed from the raw forecast-error series after the SPEI correction; they replace the earlier draft’s Table 8, in which the unreliable XGBoost-vs-WaveDroughtNet comparison reflected an underfit WaveDroughtNet trained on the miscomputed SPEI target.

3.1.3. Per-District Spatial Analysis Report

Table 10 shows the per-district evaluation on the 1-week horizon (N = 54 test samples per district). The performance spread across the districts exhibits a clear spatial pattern influenced by climatic conditions. The coastal districts with strong monsoon signals exhibit the highest accuracy: Chennai (92.59%, R2 = 0.7194), Kancheepuram (92.59%, R2 = 0.7032), and Nagapattinam (90.74%, R2 = 0.7271) and are favoured Chennai (92.59%, R2 = 0.7194), Kancheepuram (92.59%, R2 = 0.7032), and Nagapattinam (90.74%, R2 = 0.7271) and are favoured by more predictable and definable water falls. In prediction, The Nilgiris (R2 = 0.2624), Kanniyakumari (R2 = −0.1109), and Tirunelveli (R2 = −0.2015) exhibit low or negative R2 because of the increased stochastic component of the water fall, with a water fall at and across the Oro-cills and north coastal transitional zone.
Looking at the performance of the other districts, Thiruvarur (R2 = 0.7721) stands out as the best. In the delta districts, there are continuous monsoon rains. Tirunelveli (R2 = −0.2015) is the most difficult district, as it is a large southern district with multiple microclimatic areas that extend from the Western Ghats to the coast. The negative R2 shows that the model predictions are worse than the mean of the test set. This is a strong indicator that fine-tuning the model to the district or incorporating more local attributes is required to improve the performance in these difficult areas. Table 11 shows the Evaluation metrics for each district (1-week horizon, N = 54 per district.

3.1.4. Ablation Experiments

To comprehensively evaluate the contribution of each climate modality, we performed a systematic ablation study in which each modality was omitted, and the full model was retrained under the exact same experimental conditions. Table 11 presents the results and reveals some significant findings concerning the information structure of drought predictions.
Initially, the removal of humidity features produced a surprising accuracy improvement (from 71.11% to 78.51%). This suggests that features related to humidity may cause noise or collinearity that degrades classification performance in certain contexts. Also, the corresponding score improves (from 0.3157 to 0.4481), which suggests a genuine information redundancy between humidity and other modalities rather than simply noise. Secondly, the removal of wind features leads to the greatest deterioration (from 0.3157 to 0.0373, 98.8%), which quantifies so-called ‘SPEI’ regression, suggesting that wind features are disproportionately important for ‘SPEI’ regression, despite their classification contribution being fairly minimal. Third, the impact of the removal of temporal features is the smallest (accuracy drops by only 0.06%). This confirms that the TCN backbone effectively learns the temporal patterns from the raw signal. Finally, the full model with all modalities recorded the lowest baseline accuracy (71.11%), confirming that the model benefited from the regularising effect of modality dropout and that it faces a greater challenge when all 70 features are used in conjunction without strong regularisation. Table 12 presents the proposed model in various configurations.
The depth of the TCN backbone is proportional to model capacity and the temporal receptive field. For 1 TCN layer (111,451 parameters), the temporal receptive field is 60. The 4-layer configuration selected (total parameters = 256,869) has an effective temporal receptive field of 240 time steps, which is greater than the 52-week input and provides complete coverage. Achieving this with only 256,869 parameters is impressive compared to Transformers, as 256,869 parameters only provide a fraction of the capacity of non-parameter-efficient Transformers, which require more than 500,000 parameters.
The multi-task loss weighting was found using an initial grid search. Greater weight in the classification portion improves the accuracy of predicting drought severity, although this may slightly impact the precision of the “SPEI” regression, resulting from the operational need to correctly classify the severity of the drought. The Huber loss () was used to reduce the influence of outliers because it is less sensitive to the large “SPEI” outliers, which would otherwise overwhelm the update of the gradient due to the MSE. VII-C. Coimbatore Case Study: Drought Origination Tracing.

3.1.5. Drought Onset Case Study

Fifteen distinct drought events were categorised in Coimbatore district between 2016 and 2024. Coimbatore was chosen because it sits in the Western Ghats-influenced transitional zone where the orographic precipitation regime produces complex, hard-to-forecast drought sequences. Event #7 (onset week 13 April 2020; S P E I = 1.124 ; Moderate; duration 16 weeks) is used as a representative example.
The DroughtOriginTracer identifies a temporal origin at 1 July 2019—41 weeks before onset, peak temporal-attention weight α t o r i g = 0.0847 , threshold 0.0531 (75th percentile). Modal-attribution ranks: Temperature (19.4%) > Solar/Cloud (17.3%) > Humidity (17.1%) > Wind (16.6%) > Precipitation (15.7%) > Temporal (13.8%). The dominance of Temperature, Solar/Cloud and Humidity over Precipitation in this specific event is physically interpretable: 2019 was a pre-monsoon heating anomaly with reduced cloud cover and anomalously dry air; the precipitation deficit followed only later in the year. Spatial propagation analysis identifies Erode and Tiruppur as having entered drought ≈ 4 weeks before Coimbatore, consistent with the prevailing inter-district drift in the Western Tamil Nadu rain-shadow.
Across the 15 events, temporal origins lie 29–41 weeks prior to onset; mean lead time is 34.6 weeks (≈8 months). The top-three modal triggers (Temperature, Solar/Cloud, Humidity) are consistent across 12 of 15 events, with Precipitation and Wind dominant in the remaining three (post-monsoon onset cases). These patterns suggest that, for the Western Ghats transitional zone, long-lead drought signals are dominated by atmospheric evaporative demand rather than precipitation alone—a finding broadly consistent with [8] for North-western Tamil Nadu. Figure 11 shows drought onset analysis and its continuous screening. Table 13 presents the Coimbatore based drought onset analysis.

3.1.6. Inference Results Analysis

WaveDroughtNet is deployed as an interactive command-line application supporting four operational modes (Figure 12): (1) state-wide forecasting with interactive choropleth maps; (2) district-specific forecasting with historical SPEI overlay and confidence intervals; (3) drought origin traceback; and (4) custom-scenario evaluation. The dashboard pipeline employs incremental Plotly 6.8.0. figure serialisation and aggressive garbage collection so that the full state-wide dashboard runs within 8 GB of RAM.
For the August 2021–July 2022 forecast window, the model predicts the following severity distribution across the 32 districts: 1-week horizon—19 Mild, 13 Moderate; 1-month—19 Mild, 13 Moderate; 3-month—27 Mild, 5 Moderate; 1-year—13 Moderate, 19 Normal. The decreasing Moderate count at the longest horizon reflects the regression-to-the-mean behaviour expected of any well-calibrated probabilistic model over long lead times. Forecast skill scores from independent NASA POWER ground-truth retrieval over the same window confirm operational utility ( R 2 = 0.8421 at 1 week, R 2 = 0.7124 at 1 month) consistent with the held-out test results in Table 6.
The previous draft of this manuscript contained an inscrutable, density-only dashboard for Coimbatore that Reviewer 2 correctly flagged. The revised Figure 13 is a three-panel summary: (a) the historical Coimbatore SPEI series (2014–2025) with each forecast horizon overlaid as a coloured ribbon (lower–upper bound = ± 1.96 σ of the bootstrapped forecast distribution); (b) the predicted class-probability vector at each horizon as a stacked bar chart; (c) the worst-case (5th-percentile) predicted SPEI as a separate line.
The Coimbatore four-horizon forecast for the operational window centred on March 2025 is: 1 week S P E I = 1.090 (Moderate, 86% confidence); 1 month S P E I = 1.130 (Moderate, 94%); 3 months S P E I = 0.829 (Mild, 50%); 1 year S P E I = 0.902 (Mild, 75%). The decrease in classification confidence at the 3-month horizon reflects an inflexion from Moderate toward Mild in the predicted central tendency; the lower confidence is the correct, calibrated response of the model to genuine ambiguity near a decision boundary.
The backtracking dashboard for a Coimbatore drought event in Figure 14 shows: (a) the SPEI drought event timeline, (b) drought duration shading, (c) time-attentive weight of drought event and drought duration, (d) contribution of modalities within the lookback. This helps decision-makers forecast the developing drought and understand the climatic drivers of the event. Model evaluation was shown in Figure 15.

4. Discussion

This section interprets the experimental results, situates them against the published literature on drought forecasting, articulates the limitations of the proposed framework, and outlines a concrete roadmap for closing those limitations.

4.1. Positioning Against Prior Work

Wavelet-based hybrid forecasting: Osmani et al. [23] reported a wavelet-Gaussian-Process SPEI forecaster with R 2 = 0.81 at the 1-month horizon for Iran. WaveDroughtNet attains R 2 = 0.8294 at the equivalent horizon on Tamil Nadu data, while additionally (i) eliminating future-information leakage through the causal transform, (ii) extending the maximum horizon from 1 month to 1 year, and (iii) providing post hoc origin attribution. The wavelet-LSTM comparison of Tuğrul et al. [22] reported R 2 ≈ 0.78 on a Norwegian dataset at the 1-month horizon; the TCN + wavelet combination used here exceeds that figure while using approximately 30% fewer parameters.
Deep multi-horizon drought modelling: Marusov et al. [20] used spatiotemporal LSTMs for long-term PDSI forecasting and reported skillful predictions out to 18 months; their architecture, however, predicts a single horizon per trained model. WaveDroughtNet’s single-pass multi-horizon design reduces both training and inference cost by a factor of four for the four horizons reported here. Shifted-window Transformers [26] have been used for multi-scale spatio-temporal drought prediction; we show in Table 7 that a vanilla Transformer baseline does not match WaveDroughtNet on Tamil Nadu data, and the cuboid-attention EarthFormer-Lite variant closes most but not all of the gap at 3.5 × parameter cost.
Regional Tamil Nadu studies: Janarth et al. [8] used SPEI to monitor multi-year drought across Tamil Nadu and identified the north-western and southern districts as the most vulnerable. Our spatial analysis (Table 11) is consistent: the lowest predictive R 2 is observed in Tirunelveli and Tenkasi (the south), and the Nilgiris (the north-west), where the orographic precipitation regime increases the local variability of SPEI. Lalmuanzuala et al. [27] used the conventional SPEI with no deep-learning forecaster; the proposed framework therefore advances the regional state of the art by providing a predictive forecasting capability that is mathematically aligned with the established SPEI standard.
Statistical-significance reporting: A recurring shortcoming of the wavelet-ML drought literature [10,13,23] is the absence of paired statistical-significance tests of forecast superiority. We report Diebold–Mariano and Friedman–Nemenyi tests (Table 9 and Table 10) so that the magnitude of WaveDroughtNet’s advantage at long horizons can be evaluated against test-set sampling variability rather than read from point estimates alone.

4.2. Limitations

We articulate four limitations that constrain operational deployment of WaveDroughtNet in its current form. Each is paired with the corresponding mitigation strategy in Section 4.3.
L1—Zero severe events in the test partition. The 2024 Tamil Nadu climate was relatively mild and contained zero S P E I < 1.5 events (Table 1), so the test-time F 1 s e v e r e is necessarily zero (Table 6). The training partition also contains zero severe events, which we expose explicitly here rather than relying on the previous draft’s interpretation that the model ‘has been trained to detect all four categories’. A model that has not been observed performing on a class for which it has no training data cannot be claimed to be validated for extreme-event prediction. This is the most consequential limitation of the present study; mitigation requires synthetic extreme-event augmentation (e.g., by importing severe events from neighbouring states with similar climatology) and a longer operational test window that captures the next major drought year (likely 2026–2027 per IMD’s regional outlooks).
L2—Static inter-district modelling. The 16-dimensional learnable district embedding captures inter-district differences in mean climate state but does not represent dynamic inter-district interactions. Drought is a spatially propagating hazard driven by large-scale atmospheric forcing (the Indian Summer Monsoon and the North-East Monsoon for Tamil Nadu) and by regional hydrological connectivity (the Cauvery, Vaigai and Tamiraparani river basins span multiple administrative districts). Ignoring this connectivity limits the model’s physical realism and is the most likely source of the predictive performance gap between coastal districts (Chennai 0.8922) and Western Ghats-influenced districts (Nilgiris 0.7925, Coimbatore 0.8377). Mitigation: a Graph Neural Network (GNN) extension with nodes = districts and edges = (i) Queen-style geographic adjacency, (ii) river-basin co-membership, and (iii) shared climatic-zone membership, replacing the static embedding with a graph attention layer.
L3—No vegetation-index supervision. The current feature space relies entirely on meteorological variables from NASA POWER. Agricultural drought is defined by soil-moisture deficit and vegetation stress, which are most directly captured by satellite-derived indices such as NDVI, EVI and the Vegetation Condition Index (VCI) from MODIS and Sentinel-2/3. The current scope of WaveDroughtNet is therefore strictly meteorological drought (SPEI), and the manuscript’s earlier framing as broadly ‘climate-smart agriculture’ has been narrowed in the Introduction to avoid over-claiming. A planned MODIS/Sentinel-3 OLCI extension will add NDVI, EVI and VCI as a seventh modality and a vegetation-stress regression head.
L4—Reanalysis-only ground truth. NASA POWER is itself a reanalysis product (MERRA-2 + CERES) with uncertainty (Section 2.3, paragraph on uncertainty). Independent comparison against IMD ground-station data shows a 0.91 correlation and a −4.2% mean bias; the residual scale uncertainty inflates estimated moderate-to-severe event frequencies by an estimated 5–8%. Mitigation: co-training with IMD’s gridded 0.25° daily product (1980–present) and a multi-task auxiliary loss that penalises NASA-POWER–IMD disagreement is on our roadmap for the next manuscript.

4.3. Recommendations and Future Work

The limitations above translate to four concrete extensions of WaveDroughtNet that we are pursuing in subsequent work:
  • R1—Synthetic extreme-event augmentation. Importing severe-event windows from climatologically similar districts in Andhra Pradesh, Karnataka and Kerala (matched by 30-year SPEI percentiles) and using time-warping data augmentation should deliver a training set with non-trivial severe-class support without leaking spatial test information. We expect F 1 s e v e r e in the range 0.4–0.6 on a synthetic test partition based on the analogous severe-event augmentation in [10].
  • R2—Graph-augmented spatial modelling. Replacing the static district embedding with a two-layer Graph Attention Network (GAT) over a multi-relational district graph (geographic + hydrological + climatic edges) is expected to raise per-district R 2 in the Nilgiris and Tirunelveli by ≥ 0.05 and to enable principled inference of cross-district drought propagation. Implementation is straightforward because the existing TCN output already serves as a node representation.
  • R3—Vegetation-index multi-modal extension. Adding NDVI, EVI and VCI from the MODIS MOD13Q1 (16-day, 250 m) and Sentinel-3 OLCI products as a seventh modality and a dedicated vegetation-stress regression head transitions the model from meteorological to combined meteorological–agricultural drought. The current architecture trivially accommodates a seventh modality because of the modality-encoder design.
  • R4—Multi-source ground-truth co-training. A multi-task loss that penalises NASA-POWER–IMD precipitation disagreement, together with optional supervision from in situ Tamil Nadu State Land Use Research Board (TNSLURB) soil-moisture sensors, will reduce the reanalysis-only uncertainty discussed in L4. We are currently negotiating a research data-sharing agreement with IMD’s Regional Meteorological Centre, Chennai.
  • R5—Operational deployment study. Beyond the technical extensions above, we plan a 12-month operational pilot in co-operation with the Tamil Nadu Department of Agriculture, measuring forecast utility against grower decision-making in two case-study districts (Coimbatore and Cuddalore). The pilot will deliver the kind of independent operational evaluation that closes the loop between forecasting accuracy and real-world value.

5. Conclusions

This study presents WaveDroughtNet, a multi-modal, wavelet-enhanced temporal convolutional network for multi-horizon meteorological drought forecasting across the 32 districts of Tamil Nadu, India. The framework integrates five components—strictly causal Daubechies-4 wavelet decomposition, modality-specific encoders with stochastic modality dropout, cross-modal multi-head attention, a four-layer dilated TCN backbone, and a post hoc DroughtOriginTracer—into a single 256,869-parameter model that produces, in a single forward pass, classification and regression outputs at four horizons (1 week, 1 month, 3 months, 1 year). The supervisory SPEI target is computed by the canonical Vicente-Serrano procedure—Hargreaves PET, water balance, three-parameter log-logistic fit via L-moments, Kolmogorov–Smirnov goodness-of-fit validation at α = 0.05 per district, inverse-normal-CDF standardisation.
On the held-out 2024 test set ( N = 1728 ), WaveDroughtNet attains weighted F 1 = 0.9221 and R 2 = 0.8512 at the 1-week horizon and weighted F 1 = 0.8498 and R 2 = 0.6812 at the 1-year horizon, with Diebold–Mariano and Friedman–Nemenyi tests establishing statistically significant superiority over six baselines (naive persistence, seasonal naive, XGBoost, LSTM, ConvLSTM, vanilla Transformer, EarthFormer-Lite) at the 3-month and 1-year horizons. The DroughtOriginTracer successfully back-projects 15 Coimbatore drought events to causal origins 29–41 weeks prior to onset, identifying Temperature, Solar/Cloud and Humidity as the dominant atmospheric-demand triggers for the Western Ghats transitional zone.
We have additionally been explicit about four operational limitations—zero severe events in the test partition ( F 1 s e v e r e = 0.000), static inter-district modelling, absence of vegetation-index supervision and reanalysis-only ground truth—and have paired each with a concrete mitigation strategy in Section 4.3. Future work will integrate synthetic extreme-event augmentation, a graph attention layer over a multi-relational district graph, a MODIS/Sentinel vegetation-index modality, IMD ground-station co-training, and a 12-month operational pilot with the Tamil Nadu Department of Agriculture. The combination of parameter efficiency, post hoc interpretability and statistically validated multi-horizon skill, together with the corrected SPEI pipeline, positions WaveDroughtNet as a transparent and reproducible step toward operational climate-resilience decision support.

Author Contributions

Conceptualization, K.V. and A.A.; methodology, K.V., C.C. and A.A.; software, K.V.; validation, K.V., C.C. and A.A.; formal analysis, C.C.; investigation, K.V.; resources, A.A.; data curation, K.V.; writing—original draft preparation, K.V. and A.A.; writing—review and editing, C.C.; supervision, C.C.; project administration, C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The datasets generated during and/or analysed during the current study are available in the repository [ERA 5: https://cds.climate.copernicus.eu/, NASA Power: https://power.larc.nasa.gov/, Google Earth Engine: https://earthengine.google.com/].

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Ahmed, Z.; Gui, D.; Abd-Elmabod, S.K.; Murtaza, G.; Ali, S. An overview of global desertification control efforts: Key challenges and overarching solutions. Soil. Use Manag. 2024, 40, e13154. [Google Scholar] [CrossRef]
  2. Majdoubi, J.; Jamel, L.; Mansouri, W.; Alattas, K.A.; Al-Sharafi, A.M.; Alshammari, A. An AI-driven data approach to enhancing climate resilience through climate change temperature prediction and early warning systems. Expert Syst. Appl. 2026, 305, 130891. [Google Scholar] [CrossRef]
  3. Pathak, J.; Subramanian, J.; Harrington, S.; Raja, P.; Chattopadhyay, S.; Mardani, A.; Anandkumar, M. Fourcastnet: A global data-driven high-resolution weather model using adaptive fourier neural operators. arXiv 2022, arXiv:2202.11214. [Google Scholar]
  4. Tian, Y.; Si, L.; Zhang, X.; Cheng, R.; He, C.; Tan, K.C.; Jin, Y. Evolutionary large-scale multi-objective optimization: A survey. ACM Comput. Surv. 2021, 54, 174. [Google Scholar] [CrossRef]
  5. Sun, H.; Zhang, Z.; Li, Z.; Liu, J. Large-capacity and robust video watermarking via DWT coefficient separation/reconstruction and multi-scale spatiotemporal fusion. Neurocomputing 2025, 654, 131275. [Google Scholar] [CrossRef]
  6. Beguería, S.; Vicente-Serrano, S.M.; Reig, F.; Latorre, B. Standardized precipitation evapotranspiration index (SPEI) revisited: Parameter fitting, evapotranspiration models, tools, datasets and drought monitoring. Int. J. Climatol. 2014, 34, 3001–3023. [Google Scholar] [CrossRef]
  7. Liang, Z.; Bao, S.; Zhang, W.; Yan, H.; Duan, B.; Wang, H. Super-Resolution Reconstruction of SMOS Sea Surface Salinity from Multivariate Satellite Observations Based on Deep Learning. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 24251–24266. [Google Scholar] [CrossRef]
  8. Janarth, S.; Jagadeeswaran, R.; Pazhanivelan, S.; Ragunath, K.P.; Sathiyamoorthy, N.K. Drought monitoring over the Indian state of Tamil Nadu using multitudinous standardized precipitation evapotranspiration index. Plant Sci. Today 2024, 11, 106–115. [Google Scholar] [CrossRef]
  9. Guo, D.; Zhang, Z.; Liu, J.; Zhang, J.; Lin, Y. Multi-horizon flight trajectory prediction enabled by time-frequency wavelet transform. Nat. Commun. 2025, 17, 633. [Google Scholar] [CrossRef] [PubMed]
  10. Ferchichi, A.; Chihaoui, M.; Ferchichi, A. Spatio-temporal modeling of climate change impacts on drought forecast using Generative Adversarial Network: A case study in Africa. Expert Syst. With Appl. 2024, 238, 122211. [Google Scholar] [CrossRef]
  11. Bai, J.; Zhu, W.; Liu, S.; Ye, C.; Zheng, P.; Wang, X. A Temporal Convolutional Network–Bidirectional Long Short-Term Memory (TCN-BiLSTM) Prediction Model for Temporal Faults in Industrial Equipment. Appl. Sci. 2025, 15, 1702. [Google Scholar] [CrossRef]
  12. Hukkeri, G.S.; Naganna, S.R.; Pruthviraja, D.; Bhat, N.; Goudar, R.H. Drought Forecasting: Application of Ensemble and Advanced Machine Learning Approaches. IEEE Access 2023, 11, 141375–141393. [Google Scholar] [CrossRef]
  13. Valipour, M.; Khoshkam, H.; Bateni, S.M.; Jun, C. Machine-learning-based short-term forecasting of daily precipitation in different climate regions across the contiguous United States. Expert Syst. With Appl. 2024, 238, 121907. [Google Scholar] [CrossRef]
  14. Cortes-Andres, J.; Fernandez-Torres, M.A.; Camps-Valls, G. Deep learning with noisy labels for spatiotemporal drought detection. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4709513. [Google Scholar] [CrossRef]
  15. Bi, K.; Xie, L.; Zhang, H.; Chen, X.; Gu, X.; Tian, Q. Accurate medium-range global weather forecasting with 3D neural networks. Nature 2023, 619, 533–538. [Google Scholar] [CrossRef]
  16. Lam, R.; Sanchez-Gonzalez, A.; Willson, M.; Wirnsberger, P.; Fortunato, M.; Alet, F.; Battaglia, P. Learning skillful medium-range global weather forecasting. Science 2023, 382, 1416–1421. [Google Scholar] [CrossRef]
  17. Yu, M.; Huang, H.; Hou, R.; Ma, X.; Yuan, S. A deep graph kernel-based time series classification algorithm. Pattern Anal. Applic. 2024, 27, 73. [Google Scholar] [CrossRef]
  18. Neset, T.S.; Vrotsou, K.; Andersson, L.; Navarra, C.; Schück, F.; Edström, M.M.; Linnér, B.O. Artificial intelligence in support of weather warnings and climate adaptation. Clim. Risk Manag. 2024, 46, 100673. [Google Scholar] [CrossRef]
  19. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  20. Marusov, A.; Grabar, V.; Maximov, Y.; Sotiriadi, N.; Bulkin, A.; Zaytsev, A. Long-term drought prediction using deep neural networks based on geospatial weather data. Environ. Model. Softw. 2024, 179, 106127. [Google Scholar] [CrossRef]
  21. Tuğrul, T.; Oruç, S.; Hall, J.L.; Şenocak, A.U.G.; Hınıs, M.A. Hybrid Wavelet–ML models for regional drought forecasting in Norway. Sci. Rep. 2025, 15, 38573. [Google Scholar] [CrossRef]
  22. Tuğrul, T.; Hınıs, M.A.; Oruç, S. Comparison of LSTM and SVM methods through wavelet decomposition in drought forecasting. Earth Sci. Inform. 2025, 18, 139. [Google Scholar] [CrossRef]
  23. Osmani, S.A.; Jun, C.; Baik, J.; Lee, J.; Narimani, R. Wavelet-based precipitation preprocessing for improved drought forecasting: A machine learning approach using tunable Q-factor wavelet transform and maximal overlap discrete wavelet transform. Expert Syst. With Appl. 2024, 257, 124962. [Google Scholar] [CrossRef]
  24. Liu, H.; Sun, W.; Fu, Y.; Zhang, S.; Jin, J.; Fang, J.; Wang, B. Lifting wavelet transform-guided network with histogram attention for liver segmentation in CT scans. Inf. Fusion 2026, 131, 104153. [Google Scholar] [CrossRef]
  25. Lu, C.; Pan, Q. Application and Comparison of Two Transformer-Based Deep Learning Models in Short-Term Precipitation Nowcasting. Water 2026, 18, 757. [Google Scholar] [CrossRef]
  26. Zhang, J.L.; Huang, X.M.; Sun, Y.Z. Multiscale spatiotemporal meteorological drought prediction: A deep learning approach. Adv. Clim. Change Res. 2024, 15, 211–221. [Google Scholar] [CrossRef]
  27. Lalmuanzuala, B.; Sathyamoorthy, N.; Kokilavani, S.; Jagadeeswaran, R.; Kannan, B. Drought analysis in southern region of Tamil Nadu using meteorological and remote sensing indices. Mausam 2023, 74, 973–988. [Google Scholar] [CrossRef]
  28. Qin, S.; Sun, C.; Jin, Y.; Tan, Y.; Fieldsend, J. Large-scale evolutionary multiobjective optimization assisted by directed sampling. IEEE Trans. Evol. Comput. 2021, 25, 724–738. [Google Scholar] [CrossRef]
  29. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  30. Hargreaves, G.H.; Samani, Z.A. Reference Crop Evapotranspiration from Temperature. Appl. Eng. Agric. 1985, 1, 96–99. [Google Scholar] [CrossRef]
  31. Hewamalage, H.; Ackermann, K.; Bergmeir, C. Forecast evaluation for data scientists: Common pitfalls and best practices. Data Min. Knowl. Discov. 2023, 37, 788–832. [Google Scholar] [CrossRef]
  32. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; Association for Computing Machinery: New York, NY, USA, 2016; pp. 785–794. [Google Scholar]
Figure 1. Study area: 32 administrative districts of Tamil Nadu, India, with five colour-coded climatic zones (coastal-humid, inland-arid, Western Ghats-influenced, semi-arid transition, southern coastal). Boundaries: GADM v4.1 Level-2. Inset shows elevation (SRTM 30 m) overlaid with the 1086 0.1° × 0.1° grid points used for NASA POWER queries.
Figure 1. Study area: 32 administrative districts of Tamil Nadu, India, with five colour-coded climatic zones (coastal-humid, inland-arid, Western Ghats-influenced, semi-arid transition, southern coastal). Boundaries: GADM v4.1 Level-2. Inset shows elevation (SRTM 30 m) overlaid with the 1086 0.1° × 0.1° grid points used for NASA POWER queries.
Water 18 01415 g001
Figure 2. Spatial grid used for climate data acquisition. Each district contains 1–125 grid points at 0.1° (≈11 km) resolution, totalling 1086 grid points. Grid points are strictly contained within district polygons (GADM v4.1 Level-2) to avoid cross-boundary leakage during spatial aggregation.
Figure 2. Spatial grid used for climate data acquisition. Each district contains 1–125 grid points at 0.1° (≈11 km) resolution, totalling 1086 grid points. Grid points are strictly contained within district polygons (GADM v4.1 Level-2) to avoid cross-boundary leakage during spatial aggregation.
Water 18 01415 g002
Figure 3. End-to-end WaveDroughtNet workflow: data acquisition (NASA POWER), preprocessing and quality control, feature engineering across six climate modalities, strictly causal Daubechies-4 wavelet decomposition, multi-modal encoding with cross-modal attention, four-layer TCN backbone, four-horizon classification and regression heads, and the post hoc DroughtOriginTracer module.
Figure 3. End-to-end WaveDroughtNet workflow: data acquisition (NASA POWER), preprocessing and quality control, feature engineering across six climate modalities, strictly causal Daubechies-4 wavelet decomposition, multi-modal encoding with cross-modal attention, four-layer TCN backbone, four-horizon classification and regression heads, and the post hoc DroughtOriginTracer module.
Water 18 01415 g003
Figure 4. Strictly causal Daubechies-4 wavelet decomposition for precipitation in Coimbatore (2019). (level 1) Raw weekly precipitation signal. (level 2) Approximation coefficients A_3, capturing the seasonal-monsoon trend. Detail coefficients D_1, D_2, D_3, capturing successively longer oscillations (≈2-week, 1-month, 2-month fluctuations). For every time t in the operational pipeline, the transform is recomputed on {x_1ⓜ, …ⓜ, x_t }, so no post-t information enters the decomposition.
Figure 4. Strictly causal Daubechies-4 wavelet decomposition for precipitation in Coimbatore (2019). (level 1) Raw weekly precipitation signal. (level 2) Approximation coefficients A_3, capturing the seasonal-monsoon trend. Detail coefficients D_1, D_2, D_3, capturing successively longer oscillations (≈2-week, 1-month, 2-month fluctuations). For every time t in the operational pipeline, the transform is recomputed on {x_1ⓜ, …ⓜ, x_t }, so no post-t information enters the decomposition.
Water 18 01415 g004
Figure 5. Architecture of the WaveDroughtNet model.
Figure 5. Architecture of the WaveDroughtNet model.
Water 18 01415 g005
Figure 6. Three-panel drought origin trace visualisation showing: (a) temporal-attention profile with identified origin week, (b) modality importance ranking (horizontal bar chart), (c) spatial propagation analysis showing onset timing across affected districts.
Figure 6. Three-panel drought origin trace visualisation showing: (a) temporal-attention profile with identified origin week, (b) modality importance ranking (horizontal bar chart), (c) spatial propagation analysis showing onset timing across affected districts.
Water 18 01415 g006
Figure 7. Actual SPEI values vs predicted SPEI values for the forecast horizon with annotations and prediction skill. The blue color gradient reflects the distribution of data points, with lighter shades representing fewer observations and darker shades representing areas where more observations are concentrated.
Figure 7. Actual SPEI values vs predicted SPEI values for the forecast horizon with annotations and prediction skill. The blue color gradient reflects the distribution of data points, with lighter shades representing fewer observations and darker shades representing areas where more observations are concentrated.
Water 18 01415 g007
Figure 8. Residual analysis panels: (a) residual distribution histograms showing approximately Gaussian residuals, (b) ACF plots of residuals confirming low autocorrelation, (c) bias–variance decomposition showing increasing variance error at longer horizons, (d) residual distribution for 1 year forecasting.
Figure 8. Residual analysis panels: (a) residual distribution histograms showing approximately Gaussian residuals, (b) ACF plots of residuals confirming low autocorrelation, (c) bias–variance decomposition showing increasing variance error at longer horizons, (d) residual distribution for 1 year forecasting.
Water 18 01415 g008
Figure 9. Reliability diagrams and calibration curves for probabilistic drought class predictions across all horizons. The 3-month horizon shows the best calibration ( ECE   =   0 .068), while 1-year is the least calibrated ( ECE   =   0 .131).
Figure 9. Reliability diagrams and calibration curves for probabilistic drought class predictions across all horizons. The 3-month horizon shows the best calibration ( ECE   =   0 .068), while 1-year is the least calibrated ( ECE   =   0 .131).
Water 18 01415 g009
Figure 10. Critical-difference (CD) diagram from Nemenyi post hoc analysis across the four horizons. Horizontal bars connect models whose mean ranks are not significantly different at α = 0.05 ; CD = 0.69. WaveDroughtNet and EarthFormer-Lite occupy the leading statistical group.
Figure 10. Critical-difference (CD) diagram from Nemenyi post hoc analysis across the four horizons. Horizontal bars connect models whose mean ranks are not significantly different at α = 0.05 ; CD = 0.69. WaveDroughtNet and EarthFormer-Lite occupy the leading statistical group.
Water 18 01415 g010
Figure 11. Three-panel drought origin trace for Coimbatore Event #7: (A) temporal-attention map with 75th-percentile threshold and identified origin on 8 July 2019, (B) modality importance ranking showing temperature as primary trigger, (C) spatial propagation showing earlier onset in Erode and Tiruppur.
Figure 11. Three-panel drought origin trace for Coimbatore Event #7: (A) temporal-attention map with 75th-percentile threshold and identified origin on 8 July 2019, (B) modality importance ranking showing temperature as primary trigger, (C) spatial propagation showing earlier onset in Erode and Tiruppur.
Water 18 01415 g011
Figure 12. State-wide forecast dashboard: (a) state-average SPEI timeline with multi-horizon forecast projections, (b) choropleth maps of predicted drought severity, 1 week (on the left) and 1 month.
Figure 12. State-wide forecast dashboard: (a) state-average SPEI timeline with multi-horizon forecast projections, (b) choropleth maps of predicted drought severity, 1 week (on the left) and 1 month.
Water 18 01415 g012
Figure 13. Coimbatore-specific forecast dashboard. (A) Historical SPEI series with the four forecast horizons overlaid as confidence ribbons. (B) Predicted class-probability vectors at each horizon. (C) Worst-case (5th-percentile) predicted SPEI.
Figure 13. Coimbatore-specific forecast dashboard. (A) Historical SPEI series with the four forecast horizons overlaid as confidence ribbons. (B) Predicted class-probability vectors at each horizon. (C) Worst-case (5th-percentile) predicted SPEI.
Water 18 01415 g013
Figure 14. Backtracking dashboard for SPEI and drought events in Coimbatore displays drought event shading, time-attentive drought origin, modality contribution, and drought trigger rankings.
Figure 14. Backtracking dashboard for SPEI and drought events in Coimbatore displays drought event shading, time-attentive drought origin, modality contribution, and drought trigger rankings.
Water 18 01415 g014
Figure 15. Model evaluation dashboard with: (A) true vs. predicted scatter plot for “SPEI” (N = 1728), (B) confusion matrix, (C) histogram for residual distribution, (D) metrics summary table.
Figure 15. Model evaluation dashboard with: (A) true vs. predicted scatter plot for “SPEI” (N = 1728), (B) confusion matrix, (C) histogram for residual distribution, (D) metrics summary table.
Water 18 01415 g015
Table 1. SPEI-based drought-severity categories used in this study, with class counts in the training (2014–2023) and test (2024) partitions. Thresholds follow [6].
Table 1. SPEI-based drought-severity categories used in this study, with class counts in the training (2014–2023) and test (2024) partitions. Thresholds follow [6].
ClassLabel SPEI RangeTrain CountTrain %Test CountTest %
0Normal SPEI   −0.5606961.2%105861.2%
1Mild−1.0   SPEI < −0.5278428.1%48628.1%
2Moderate−1.5   SPEI < −1.0106710.8%18410.6%
3Severe SPEI < −1.500.0%00.0%
Notes: The absence of severe-class observations in the test partition is a property of the 2024 Tamil Nadu climate rather than of the model; it imposes a hard limit on test-time severe-class F1 (Section 4.2). The training partition also contains zero severe events, which is itself a limitation we address through synthetic extreme-event augmentation in our roadmap (Section 4.3).
Table 2. Feature organisation and dimensionality description.
Table 2. Feature organisation and dimensionality description.
ModalityFeaturesDimDescription
TemperatureAvg, Max, Min, Range, Anomaly, heat \ _ stress , rolling/lag, wavelets16Thermal dynamics and evaporative demand
PrecipitationPrecip, PET, aridity \ _ index , evap \ _ deficit , rolling/lag, wavelets14Moisture supply and atmospheric water balance
HumidityHumidity, moisture \ _ flux , rolling/lag, wavelets12Atmospheric moisture and vegetation stress
Wind Wind \ _ Speed , rolling/lag, wavelets11Advective transport and evaporation rate
Solar/Cloud Solar \ _ Rad , Cloud \ _ Cover , solar \ _ temp \ _ ratio , rolling/lag, wavelets13Radiative forcing and energy balance
Temporal week s i n , week c o s , month s i n , month c o s 4Seasonal cycle encoding
Total 70Complete multi-modal feature space
Table 3. Dataset summary.
Table 3. Dataset summary.
PropertyValue
Temporal coverage1 January 2014–31 December 2025 (12 years)
Temporal resolutionWeekly (aggregated from daily)
Total weekly records18,304
Total districts32
State extent76.23° E–80.35° E, 8.08° N–13.56° N
State area (approx.)130,058 km2
Primary data sourceNASA POWER (reanalysis + satellite)
Boundary sourceGADM v4.1 Level-2
Core climate variables10
Engineered features70 (6 modalities)
Wavelet features (subset of 70)20 (5 variables × 4 subbands)
Table 4. Time-forward train/validation/test splits.
Table 4. Time-forward train/validation/test splits.
SplitTarget YearsSequencesPercentage
Training2014–2021992066.2%
Validation2022–2023332822.2%
Test2024172811.5%
Total2014–202514,976100.0%
Table 5. Hyperparameter settings of the proposed model.
Table 5. Hyperparameter settings of the proposed model.
HyperparameterValueJustification
OptimizerAdamW (wd = 1 × 10−4)Decoupled weight decay for regularisation
Learning Rate3 × 10−4 cosine decayWarmup (5 ep) + cosine annealing
Batch Size128Balanced GPU memory/gradient noise
Max Epochs100With early stopping (patience = 25)
Loss (Classification)Cross-EntropyLabel smoothing = 0.1, balanced weights
Loss (Regression)HuberLoss ( δ = 0.5 )Robust to SPEI outliers
Loss Weights α c l s = 0.55 , α r e g = 0.45 Emphasise primary classification task
Gradient ClippingMax norm = 1.0Prevent gradient explosion
Mixed PrecisionFP16 (CUDA AMP)2× memory efficiency, faster training
d m o d e l 64Per-modality encoding dimension
TCN Layers4Dilation [1,2,4,12]
TCN Kernel Size5Receptive field = 240 steps
Attention Heads4Cross-modal attention
Modality Dropoutp = 0.15Balanced modality learning
General Dropoutp = 0.2Regularisation
District Embedding Dim16Spatial context encoding
Sequence Length52 weeksFull annual cycle
Trainable Parameters256,869Lightweight architecture
Best Validation Loss0.6715Early stopping criterion
Table 6. Comprehensive evaluation results of the proposed WaveDroughtNet.
Table 6. Comprehensive evaluation results of the proposed WaveDroughtNet.
Metric1 Week1 Month3 Months1 Year
Accuracy0.92360.91480.89720.8541
F1 (weighted)0.92210.91300.89480.8498
F1 (macro)0.89140.88010.86120.8024
Precision0.92290.91380.89610.8517
Recall0.92360.91480.89720.8541
R20.85120.82940.78410.6812
RMSE0.32410.36120.41270.4987
MAE0.21870.24890.29430.3672
NSE0.85120.82940.78410.6812
KGE0.88340.85410.81120.7634
MASE1.22611.37291.56411.8932
RMSSE0.74690.83270.95141.1497
sMAPE (%)42.1445.6252.3163.84
WAPE0.28140.31270.37120.4614
Bias−0.00210.00930.02140.0418
Correlation0.92280.91140.88570.8264
Brier Score0.11270.12410.15480.1974
ECE0.03910.04280.05190.0687
F1 (Normal)0.9620.9570.9430.915
F1 (Mild)0.9140.9020.8810.842
F1 (Moderate)0.8720.8530.8100.722
F1 (Severe)0.0000.0000.0000.000
Table 7. State-of-the-art baseline comparison at the 1-week horizon ( N = 1728 ). Best value per row in bold; second-best italic. WaveDroughtNet achieves the best KGE and best absolute bias and is competitive on accuracy; XGBoost is best on R 2 and RMSE for the 1-week single-horizon case (see Table 8 for the multi-horizon picture).
Table 7. State-of-the-art baseline comparison at the 1-week horizon ( N = 1728 ). Best value per row in bold; second-best italic. WaveDroughtNet achieves the best KGE and best absolute bias and is competitive on accuracy; XGBoost is best on R 2 and RMSE for the 1-week single-horizon case (see Table 8 for the multi-horizon picture).
MetricNaiveSeasonalXGBoostLSTMConvLSTMTransformerEarthFormer-LiteWaveDroughtNet
Params (k)3386121124892256.9
Accuracy0.84520.78210.93330.88410.89270.90870.91520.9236
F1 (weighted)0.84480.77870.93340.88210.89020.90710.91340.9221
F1 (macro)0.81610.74040.91230.84040.85210.87170.88210.8914
R 2 0.74020.13340.90200.78640.80870.83270.84210.8512
RMSE0.48370.88340.29700.43920.41730.38840.37560.3241
MAE0.31120.65320.18390.28430.26910.24970.24030.2187
NSE0.74020.13340.90200.78640.80870.83270.84210.8512
KGE0.57230.21740.35200.76210.80140.84470.86120.8834
MASE1.17812.47320.69641.08430.99210.89270.85211.2261
RMSSE0.92941.69700.57070.84420.80210.74640.72180.7469
Bias0.01160.01880.01850.00940.00610.00420.0033−0.0021
Correlation r0.87030.57490.95090.89070.90540.91630.91970.9228
Brier score0.09920.14210.13570.12480.11840.1127
ECE0.01790.05240.04730.04280.04110.0391
Table 8. Multi-horizon comparison ( R 2 ) of the eight models. WaveDroughtNet is the only model to remain above R 2 = 0.65 at every horizon; XGBoost, although best at 1 week, degrades sharply from R 2 = 0.9020 (1 week) to R 2 = 0.2871 (1 year) because it lacks explicit sequence modelling and cannot exploit the annual cycle.
Table 8. Multi-horizon comparison ( R 2 ) of the eight models. WaveDroughtNet is the only model to remain above R 2 = 0.65 at every horizon; XGBoost, although best at 1 week, degrades sharply from R 2 = 0.9020 (1 week) to R 2 = 0.2871 (1 year) because it lacks explicit sequence modelling and cannot exploit the annual cycle.
Model1 Week1 Month3 Months1 Year
Naive0.74020.41870.21270.0814
Seasonal naive0.13340.24870.38140.5127
XGBoost0.90200.68240.45210.2871
LSTM0.78640.72340.64270.5188
ConvLSTM0.80870.75910.68840.5644
Transformer0.83270.78940.72510.6121
EarthFormer-Lite0.84210.80270.74720.6394
WaveDroughtNet0.85120.82940.78410.6812
Table 9. Pairwise Diebold–Mariano tests against WaveDroughtNet across the four horizons. DM > 0 indicates the row model has higher RMSE; WaveDroughtNet is statistically superior to all but XGBoost at the 1-week horizon (where XGBoost has lower RMSE) and statistically superior to every baseline at horizons ≥3 months.
Table 9. Pairwise Diebold–Mariano tests against WaveDroughtNet across the four horizons. DM > 0 indicates the row model has higher RMSE; WaveDroughtNet is statistically superior to all but XGBoost at the 1-week horizon (where XGBoost has lower RMSE) and statistically superior to every baseline at horizons ≥3 months.
Baseline1 wk DM1 wk p3 mo DM3 mo p1 yr DM1 yr p
Naive+13.42<0.001+19.71<0.001+24.83<0.001
Seasonal naive+22.17<0.001+18.04<0.001+12.41<0.001
XGBoost−2.89<0.01+4.27<0.001+8.93<0.001
LSTM+5.62<0.001+6.81<0.001+9.42<0.001
ConvLSTM+4.31<0.001+5.62<0.001+7.84<0.001
Transformer+2.94<0.01+3.81<0.001+6.27<0.001
EarthFormer-Lite+2.12<0.05+2.78<0.01+4.91<0.001
Table 10. Friedman mean ranks across the four horizons (lower = better), with Nemenyi statistical groups at α = 0.05 .
Table 10. Friedman mean ranks across the four horizons (lower = better), with Nemenyi statistical groups at α = 0.05 .
ModelMean RankStatistical Group
WaveDroughtNet1.412 (best multi-horizon)A
EarthFormer-Lite2.187A
Transformer2.972B
ConvLSTM3.844B
LSTM4.812C
XGBoost5.421C
Naïve6.671D
Seasonal naive7.681D
Table 11. Evaluation metrics for each district (1-week horizon, N = 54 per district).
Table 11. Evaluation metrics for each district (1-week horizon, N = 54 per district).
District R 2 RMSEAccuracyF1 (Weighted)
Ariyalur0.83760.46300.89330.8900
Chengalpattu0.86040.41150.93570.9332
Chennai0.89220.36210.94810.9431
Coimbatore0.83770.46840.89860.8934
Cuddalore0.86060.41250.93330.9313
Dharmapuri0.81140.47640.88730.8844
Dindigul0.84590.42400.91100.9084
Erode0.82520.47530.89830.8920
Kallakurichi0.82670.43560.91090.9056
Kancheepuram0.86440.40620.93790.9324
Kanniyakumari0.81850.49430.87490.8712
Karur0.82790.46220.88990.8862
Krishnagiri0.81730.48490.88330.8800
Madurai0.83130.44000.91440.9088
Mayiladuthurai0.84520.40670.92210.9178
Nagapattinam0.87750.36910.95170.9456
Namakkal0.81990.45240.89820.8960
Nilgiris0.79250.52840.85560.8479
Perambalur0.84460.45000.90900.9047
Pudukkottai0.85120.41470.91990.9165
Ramanathapuram0.81860.49670.88150.8741
Ranipet0.85600.44820.91760.9125
Salem0.83380.46210.89020.8844
Sivaganga0.82930.44790.89700.8927
Tenkasi0.81100.51140.88190.8747
Thanjavur0.84170.43440.91830.9131
Thoothukkudi0.83020.43650.90440.8998
Tiruchirappalli0.83530.44100.91890.9153
Tirunelveli0.80970.53050.87000.8628
Tirupathur0.83100.45510.90520.9022
Tiruppur0.83510.45980.90630.9011
Tiruvallur0.85330.40840.91800.9104
Table 12. The proposed model in various combinations.
Table 12. The proposed model in various combinations.
ConfigurationAccuracy R 2 F1 (Weighted) Δ A c c Δ R 2
Full model (all modalities)0.92360.85120.9221
w/o Temperature0.89720.82040.8941−0.0264−0.0308
w/o Precipitation0.88410.80170.8812−0.0395−0.0495
w/o Humidity0.90120.78190.8997−0.0224−0.0693
w/o Wind0.89120.51240.8887−0.0324−0.3388
w/o Solar/Cloud0.91080.83410.9082−0.0128−0.0171
w/o Temporal0.91870.84810.9170−0.0049−0.0031
w/o Modality dropout0.90720.83240.9057−0.0164−0.0188
w/o Wavelet sub-bands0.76540.75660.7682−0.1582−0.0946
w/o Cross-modal attention0.88940.81270.8869−0.0342−0.0385
w/o TCN (replace with bi-LSTM)0.87820.79840.8742−0.0454−0.0528
Table 13. Coimbatore case study—drought onset analysis.
Table 13. Coimbatore case study—drought onset analysis.
PropertyValueInterpretation
DistrictCoimbatoreWestern Ghats transitional zone
Event date13 April 2020Pre-monsoon period
SPEI at event−1.124Moderate drought
Duration16 weeksSustained event
Temporal origin1 July 201941 weeks before the event
Origin attention peak α_{ t o r i g }0.0847Highest temporal-attention weight
75th-percentile threshold0.0531α threshold for origin
Primary triggerTemperature (19.4%)Pre-monsoon heating
Secondary triggerSolar/Cloud (17.3%)Reduced cloud cover
Tertiary triggerHumidity (17.1%)Atmospheric drying
Wind contribution16.6%Enhanced evaporation
Precipitation contribution15.7%Rainfall deficit
Temporal contribution13.8%Seasonal timing
Spatial origin (earliest)Erode (−4 weeks)Western rain-shadow drift
Spatial pathErode Tiruppur CoimbatoreWesterly inter-district propagation
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Venkatachalam, K.; Cherubini, C.; Anushya, A. WaveDroughtNet: A Multi-Modal Wavelet-Enhanced Temporal Convolutional Network for Multi-Horizon Drought Forecasting and Onset Analysis. Water 2026, 18, 1415. https://doi.org/10.3390/w18121415

AMA Style

Venkatachalam K, Cherubini C, Anushya A. WaveDroughtNet: A Multi-Modal Wavelet-Enhanced Temporal Convolutional Network for Multi-Horizon Drought Forecasting and Onset Analysis. Water. 2026; 18(12):1415. https://doi.org/10.3390/w18121415

Chicago/Turabian Style

Venkatachalam, K., Claudia Cherubini, and Alphonse Anushya. 2026. "WaveDroughtNet: A Multi-Modal Wavelet-Enhanced Temporal Convolutional Network for Multi-Horizon Drought Forecasting and Onset Analysis" Water 18, no. 12: 1415. https://doi.org/10.3390/w18121415

APA Style

Venkatachalam, K., Cherubini, C., & Anushya, A. (2026). WaveDroughtNet: A Multi-Modal Wavelet-Enhanced Temporal Convolutional Network for Multi-Horizon Drought Forecasting and Onset Analysis. Water, 18(12), 1415. https://doi.org/10.3390/w18121415

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop