1. Introduction
Heatwaves, defined as prolonged periods of excessively high temperatures, represent one of the most severe climate-induced hazards of the 21st century [
1,
2]. Their frequency, duration, and spatial extent have increased markedly in recent decades, particularly in regions already experiencing arid and semi-arid conditions [
3]. Heatwaves have well-established and significant effects on public health, playing a major role in increasing heat-related deaths and illnesses across diverse regions such as Europe, the United States, Russia, and Korea [
4,
5]. Between 2000 and 2016, heatwaves affected approximately 125 million additional people worldwide and contributed to over 166,000 heat-related deaths in the preceding decade, with low-income countries facing the greatest risks due to limited capacity for adaptation and response [
6]. Heatwaves are driving considerable economic and ecological losses globally by disrupting agriculture, reducing labor productivity, straining infrastructure, and damaging ecosystems. In Europe, heat-related damages have already reached up to 0.5% of GDP, with projections suggesting a fivefold increase by 2060 if climate action remains insufficient [
7,
8]. Agriculture is particularly affected, as extreme heat and drought conditions cause significant reductions in crop yields—especially in wheat and maize [
9,
10,
11,
12]. Labor productivity also declines during heatwaves, especially in low- and middle-income countries, slowing economic growth [
13,
14,
15,
16]. Moreover, heatwaves inflict lasting damage on ecosystems, triggering mass tree die-offs, coral bleaching, and wildlife losses, as seen in a Western Australian event that affected over 300,000 km
2 of land and sea [
11,
17]. The broader financial toll includes heightened wildfire threats, increased power outages, and stressed water supplies, as demonstrated by the 2010 Russian heatwave, which resulted in US
$15 billion in damages and extensive land loss [
18,
19,
20,
21,
22].
Central Asia, characterized by its continental climate, complex topography, and socio-economic vulnerabilities, has emerged as a hotspot for heatwave intensification. The region’s exposure is exacerbated by rapid urban expansion, land use changes, climate change, and limited adaptive capacity, all of which amplify the risks associated with extreme heat events [
5,
6,
23]. From declining crop yields and water shortages to elevated mortality and morbidity rates [
4], the cascading impacts of heatwaves in Central Asia are multifaceted, urgent, and insufficiently mapped. Despite the rising threat, heatwave susceptibility assessments in Central Asia remain sparse, fragmented, and often constrained by the lack of comprehensive and high-resolution spatial data. Conventional studies have largely relied on meteorological station data, static indices, and machine learning models [
24,
25,
26,
27,
28], which are limited in spatial coverage and unable to capture the dynamic, multi-dimensional nature of heatwave causality. This presents a substantial gap in hazard science, where a robust, spatially explicit, and predictive approach is critical for proactive risk management and mitigation strategies. Recent scholarship has broadened the scope of heatwave research to encompass vulnerability assessment, climate risk modeling, and the operationalization of adaptation strategies in diverse contexts. Case studies highlight adaptation planning in European cities [
29,
30], and the role of socio-economic capacity in shaping heatwave resilience [
31]. Advances in modeling approaches combine machine learning and remote sensing for urban heat vulnerability mapping [
32], and employ climate modeling to project future heatwave risks at national and regional scales [
33,
34,
35,
36,
37]. Together, these works underline the imperative of integrating physical hazard mapping with socio-economic vulnerability metrics to support climate adaptation. A key challenge lies in integrating diverse environmental, climatic, and anthropogenic variables into a unified framework that can identify areas most prone to heatwave occurrence and intensity. Remote sensing technologies offer an effective solution to this challenge [
38]. By capturing consistent, repeatable, and wide-area observations, satellite-based data enable the extraction of essential biophysical indicators that drive heatwave dynamics. Variables such as land surface temperature (LST), vegetation indices (e.g., NDVI), soil moisture, built-up density, albedo, and elevation can serve as either evidence layers or causative factors in susceptibility mapping [
39,
40,
41,
42,
43]. These data sources are particularly valuable in Central Asia, where ground-based observations are often sparse or inaccessible [
44]. The ability of remote sensing to characterize environmental gradients at fine resolutions provides a powerful foundation for developing sophisticated and regionally adapted models of heatwave susceptibility [
45].
Parallel to the advancement of remote sensing is the rapid evolution of artificial intelligence, particularly deep learning, which has transformed the modeling landscape in geospatial sciences. Deep learning models are uniquely capable of handling complex, nonlinear interactions and high-dimensional datasets that characterize environmental systems [
43,
46]. Among these, the TabTransformer model has recently gained attention for its superior performance in tabular data analysis, combining the interpretability of decision trees with the deep representational power of transformer-based architectures [
47,
48]. Its capacity to handle mixed-type inputs (continuous and categorical), learn contextual relationships, and generalize well with limited labeled data makes it an ideal candidate for susceptibility modeling where data heterogeneity is high [
49,
50]. However, deep learning models are notoriously sensitive to hyperparameter settings. Suboptimal configurations can significantly degrade model performance, interpretability, and computational efficiency. To address this, metaheuristic optimization algorithms, particularly Particle Swarm Optimization (PSO), provide an efficient mechanism for global hyperparameter tuning. Inspired by the social behavior of bird flocking, PSO balances exploration and exploitation to converge on optimal solutions in complex search spaces. When integrated with deep learning, PSO can significantly enhance model convergence, generalizability, and classification accuracy in predictive mapping tasks [
51,
52].
This study proposes a novel framework that combines the TabTransformer deep learning architecture with PSO-based optimization to develop a high-resolution, remote sensing-driven heatwave susceptibility map for Central Asia. All input data layers, whether evidence or causative, are derived exclusively from satellite observations and processed to maintain spatial consistency and thematic relevance. The hybrid model not only leverages the spatiotemporal strengths of remote sensing but also harnesses the computational intelligence of deep learning and evolutionary optimization [
52,
53,
54,
55]. By targeting an underexplored yet highly vulnerable geographic region, this research offers methodological innovation and regional insight that can inform climate adaptation policies, emergency planning, and sustainable land management strategies. It fills a critical gap in hazard assessment by providing an interpretable, scalable, and data-rich approach to understanding and mitigating heatwave risks under accelerating climate change. Previous efforts to map heatwave hazard and susceptibility have primarily relied on station-based indices and gridded meteorological datasets [
24,
25,
26,
27,
28], often applying empirical or statistical thresholds for extreme heat events [
2,
4,
5]. For instance, Perkins and Alexander [
56] provide a comprehensive review of heatwave measurement methodologies and station-based index applications across diverse climatic settings. Classic machine learning algorithms such as Random Forests, Support Vector Machines, and logistic regression have been utilized to improve predictive mapping, yet they remain limited in spatial coverage, scalability, or in capturing non-linear interactions and high-dimensional data [
24,
25,
26,
27,
28,
43]. In response to such limitations, researchers have increasingly explored novel combinations of data sources and modelling strategies. Beyond heatwave-specific research, recent studies have demonstrated the potential of integrating machine learning with high-resolution reanalysis datasets for environmental and atmospheric applications. For example, Shikhovtsev et al. [
57] employed neural networks trained on meteorological parameters from ERA5 to model atmospheric optical turbulence (“seeing”), achieving improved prediction skill over purely physical parameterizations. Similarly, Sun et al. [
58] applied a random-forest-based correction of ERA5 precipitation estimates using dense gauge networks across the Third Pole region, producing a 70-year, 10-km gridded dataset that substantially improved hydrological model performance. These studies illustrate how coupling reanalysis data with machine learning algorithms can enhance predictive accuracy in complex environmental systems—an approach that directly supports the methodology adopted in the present work. Recent studies have also highlighted the intensification and impacts of severe heatwaves, with Russo et al. [
59] characterizing the top European events since 1950, and Murari et al. [
60] examining future risks and mortality outcomes in India using statistical and ML-based approaches. More recent advances have explored the use of remote sensing products to overcome data sparsity and provide spatially consistent, wide-area biophysical variables—including LST, NDVI, soil moisture, and urban/built-up indices—as key inputs for hazard assessment [
38,
39,
40,
41,
42,
43,
44]. Tomlinson et al. [
61] demonstrate the use of MODIS-based LST data for detailed urban heat island mapping, while Ahmadalipour and Moradkhani [
62] leverage satellite observations to quantify escalating heat-stress mortality risk in the Middle East and North Africa. Nevertheless, most approaches remain constrained by either static indices or regionally fragmented data [
44,
45]. Smid et al. [
63] employ both remote sensing and city-level observational data to rank European capitals by their exposure to heatwaves, highlighting methodological diversity but also fragmentation. The integration of deep learning in environmental hazard mapping is an emerging direction [
43,
46], with transformer-based models such as TabTransformer recently showing strong potential for handling diverse tabular geospatial data [
47,
48,
49,
50]. However, the application of such advanced models—especially in combination with metaheuristic optimization techniques like PSO—for large-scale, remote sensing-based heatwave susceptibility mapping is exceedingly rare or absent from the literature [
51,
52,
53,
54,
55]. Consequently, our study addresses these gaps by proposing a fully remote sensing-driven and metaheuristically optimized TabTransformer framework, designed for high-resolution and transferable susceptibility mapping across the trans-boundary and data-scarce environment of Central Asia.
While various machine learning models have been applied to heatwave risk mapping, most rely on conventional algorithms such as RF, SVM, or logistic regression using mixed meteorological and static variables, often with limited scalability to heterogeneous, data-scarce regions. The TabTransformer offers a unique capacity to model complex, non-linear interactions in mixed-type, RS-derived tabular datasets by encoding contextual relationships between features—an aspect underexplored in climate hazard mapping. Its integration with PSO enables efficient, wide-range hyperparameter search, improving convergence stability and generalization in high-dimensional geospatial contexts. Despite the maturity of transformers and PSO separately, no prior work combines them for fully remote sensing–based heatwave susceptibility mapping, nor applies them in the trans-boundary, data-limited landscapes of Central Asia. This coupling not only enhances methodological robustness but also supports location-specific and country-level interpretations, enabling targeted adaptation strategies at sub-national scales (
Section 4.6 and
Section 4.7) and informing management and policy pathways for heatwave preparedness in arid and semi-arid environments (
Section 4.8). The relevance of this framework extends beyond methodological innovation, as
Section 4.1 provides a detailed examination of the underlying hazard dynamics, identifying key atmospheric and surface drivers of heatwave susceptibility in Central Asia and comparing their roles with findings from other climatic contexts.
The central research question guiding this study is whether a fully remote sensing–driven, PSO-optimized TabTransformer framework can reliably produce high-resolution, spatially transferable heatwave susceptibility maps in a data-scarce, trans-boundary region such as Central Asia. Accordingly, the primary objectives of this study are fourfold: (1) to develop a comprehensive heatwave susceptibility mapping framework for Central Asia using entirely remote sensing-derived indicators that reflect climatic, environmental, and anthropogenic drivers; (2) to employ the TabTransformer deep learning model for robust classification and pattern recognition in high-dimensional tabular geospatial datasets; (3) to enhance model performance and generalizability through Particle Swarm Optimization (PSO)-based hyperparameter tuning; and (4) to identify and spatially delineate regions of varying heatwave susceptibility and their contributing factors, thereby providing critical insights for targeted risk reduction, policy-making, and climate adaptation planning. By integrating cutting-edge deep learning with satellite-based environmental monitoring, this research contributes a novel, scalable, and transferable methodology to the growing field of climate hazard assessment.
4. Discussion
The following discussion addresses the central research question—namely, whether a PSO-optimized TabTransformer, powered entirely by remote sensing data, can generate accurate, high-resolution, and transferable heatwave susceptibility maps across Central Asia’s data-scarce, trans-boundary landscapes.
Section 4.1 examines model behavior and variable influence to elucidate the environmental drivers captured by the framework.
Section 4.2,
Section 4.3 and
Section 4.4 evaluate the role of PSO in hyperparameter tuning, its effect on learning dynamics, and the resulting gains in efficiency and generalization.
Section 4.5 situates performance metrics in the context of real-world applicability, while
Section 4.6 and
Section 4.7 interpret spatial outputs quantitatively and geographically at national scales. Finally,
Section 4.8 and
Section 4.9 discuss broader implications, limitations, and directions for future research, placing the findings within the broader hazard mapping and climate adaptation literature.
4.1. Model Behavior and Variable Influence
The feature importance results from the TabTransformer model show a clear ranking of the 13 factors used to predict heatwave susceptibility in Central Asia. Thermal and atmospheric indicators dominate, with maximum temperature as the most important predictor—an expected result, since areas with frequent high temperatures are naturally more prone to heat stress, especially when cooling is limited. Following closely, Number of Hot Days (Tmax > 30 °C) scored an importance of 0.245, indicating that not only extreme maximum temperatures but also the frequency of moderately hot days substantially contribute to heatwave vulnerability. This factor reflects the persistence and accumulation of heat exposure, which intensifies physiological and environmental stress over time. The high importance of this metric suggests that areas with many such days are primed for transitioning into full heatwave conditions under favorable synoptic circumstances. Rainfall (0.190) ranked third in importance, highlighting its inverse relationship with heatwave susceptibility. Reduced precipitation limits surface moisture and evaporative cooling, increases soil drying and vegetation stress, and raises surface albedo, thereby promoting conditions favorable for intense heating in Central Asia’s arid regions.
Surprisingly, LST, while conceptually a direct indicator of surface heat stress, held the fourth rank with an importance score of 0.140. Although LST represents the current thermal state of the land and is affected by diurnal and seasonal variability, its moderate importance suggests it is less predictive than Tmax or the frequency of hot days for assessing long-term or large-scale heatwave susceptibility. Among the topographic variables, Aspect (0.136) demonstrated a greater influence than Slope (0.100) and Elevation (0.095). In Central Asia, aspect is critical for localized heating, as south-facing slopes receive greater solar radiation and are consequently more susceptible to elevated daytime temperatures. Meanwhile, elevation inversely correlates with temperature through lapse rate dynamics, with higher altitudes experiencing cooler conditions, thus reducing susceptibility. Humidity (0.114) had moderate importance, reflecting its role in altering perceived temperature through the heat index, increasing the probability of dry heat in arid regions and intensifying thermal stress in humid conditions. Both albedo (0.098) and population density (0.096) showed similar contributions. Lower albedo surfaces increase heat absorption and local warming, while higher population density is linked to anthropogenic heat and Urban Heat Island (UHI) effect. While urbanization is less extensive in Central Asia than in densely populated regions, major cities such as Tashkent, Almaty, and Bishkek still ex-hibit clear heat island signatures, which the model effectively captures.
On the lower end of the importance spectrum, NDVI scored 0.072, indicating a relatively minor yet still relevant role in determining heatwave susceptibility. While vegetation cover cools locally via transpiration and shading, its predictive power is likely weakened by NDVI’s seasonal variability and Central Asia’s widespread low natural vegetation cover. Furthermore, NDVI is more impactful on microclimatic scales, and its effect may be overshadowed by stronger thermal and atmospheric drivers in a macro-scale susceptibility model. Land Cover (0.047) ranked second-lowest, likely because it is relatively static in the region and overlaps with other variables (e.g., NDVI, LST) that indirectly reflect land surface properties. In this modeling framework, land cover may have provided redundant or overlapping information, thereby reducing its standalone contribution. Finally, Number of Days with Heat Index >35 °C was assigned the lowest importance score of 0.028. Although the heat index is widely used in human comfort studies, its limited relevance here likely stems from two factors: (1) it requires high humidity to register significant values, and many regions in Central Asia experience dry heat, making this metric less sensitive; and (2) it overlaps with other thermal indicators like Tmax and LST, which capture similar phenomena with higher temporal resolution. Thus, the model appears to downweight this variable in favor of more directly informative predictors. In sum, thermal metrics (Tmax, heat day frequency) are the top predictors, followed by climate modulators (rainfall, humidity), surface energy balance (LST, albedo), topography, and anthropogenic factors. Variables such as NDVI, land cover, and heat index, while theoretically relevant, contribute less in this specific model due to regional climatic characteristics, redundancy with other inputs, or lower temporal stability. This layered understanding enhances both the interpretability of the model and its value as a decision-support tool.
Relevant literature review reveals that in process-based/regional studies, the role of vegetation—often captured by NDVI or phenology metrics—is frequently linked to microclimatic cooling [
83,
84]. Greener areas or increased blue-green infrastructure consistently relate to lower modeled or observed heat risk. However, in large-scale, ML-driven or purely meteorological prediction frameworks [
25], NDVI is often excluded or found unimportant—likely due to predictor selection (preference for atmospheric/synoptic variables) or coarse data, and limited landscape variation within the model domain. Land cover type, especially the distinction between vegetation (forest, grassland) and built-up areas, repeatedly emerges as a primary heatwave risk modulator in urban and peri-urban hazard models [
83]. Increased construction land is associated with elevated UTCI/heat stress, while higher proportions of greenspace and water bodies buffer extreme heat exposure. In climate/impact modeling [
84], explicit “what-if” land cover change (LCC) scenarios demonstrate that increased tree cover can meaningfully reduce peak heatwave temperatures, especially under extreme soil moisture/heat stress conditions. Additionally, Heat Index/Number of Days Variables serves as either a threshold for defining heatwave events or as a primary output variable (e.g., annual # HWD in Asadollah et al. [
25]). In most complex ML modeling frameworks, predictors based on atmospheric states (e.g., humidity, wind, synoptic pressure) outweigh NDVI or land cover in importance when predicting the number of heatwave days. Nevertheless, in risk mapping oriented around human experience (e.g., UTCI, LST models), the number/duration of heat events is used to partition risk, with land cover/NDVI acting as spatial moderators rather than core predictors. In sum, in regions where high-resolution landscape data are available and the focus is on urban areas, NDVI and land cover are frequently highly influential—because local micro-climate/greenness are decisive for heat exposure [
83,
84]. In large-scale, synoptic, or ML-intensive models (especially in arid/semi-arid zones or where predictor selection omits local land metrics), atmospheric and meteorological variables dominate variable importance rankings; NDVI/land cover play lesser roles or are omitted (Asadollah et al. [
25]; also echoed in spatial patterns in Dong et al. [
83] for non-urban zones). Studies emphasizing landscape heterogeneity and operational urban planning needs almost always find land cover/NDVI meaningful, while purely predictive climatological models may mask or minimize their effect. Thus, our finding of low importance for NDVI, land cover, and number of hot days in Central Asia is fully consistent with other scale-matched, climatic-contextualized studies, and the divergence from some urban-focused findings is a consequence of well-documented methodological and environmental distinctions.
4.2. Benefits and Limitations of PSO for Model Tuning
Particle Swarm Optimization (PSO) applied to hyperparameter tuning significantly enhanced the TabTransformer’s predictive capabilities by optimizing architecture parameters that govern learning dynamics, model capacity, and regularization. The PSO framework operates by simulating a population (swarm) of candidate solutions that collectively explore the multidimensional hyperparameter space. Each particle adjusts its trajectory based on its own experience (personal best) and the swarm’s collective knowledge (global best), enabling an adaptive balance between exploration of new configurations and exploitation of promising regions. This metaheuristic strategy is effective for deep learning models, where hyperparameters (e.g., embedding size, layer count, attention heads, dropout) interact complexly to shape convergence, generalization, and overfitting. The optimization resulted in a notably smaller embedding dimension (8 vs. default 32), fewer transformer layers (1 vs. 4), and fewer attention heads (3 vs. 8), accompanied by adjusted dropout rates. This reduction in model complexity suggests that the dataset and problem structure favor a more compact representation, which likely prevents overfitting while retaining sufficient expressiveness. The elevated dropout values (attention dropout 0.2 and feedforward dropout 0.17) imply stronger regularization, helping to mitigate the risk of memorizing noise in the training data. These hyperparameter shifts demonstrate how PSO navigates trade-offs inherent in deep learning: increasing complexity can model richer patterns but risks overfitting, while simpler architectures may generalize better but potentially underfit. PSO’s global search capability effectively balances this trade-off without exhaustive brute-force searches that are computationally prohibitive in such high-dimensional spaces. Despite these advantages, PSO’s stochastic nature and requirement for multiple iterations can incur considerable computational cost, especially for transformer models with long training times. Moreover, the optimal hyperparameters identified are contingent on the specific dataset, model initialization, and PSO parameters (e.g., population size, coefficients), necessitating replication to ensure robustness. Nevertheless, the demonstrated improvements justify its integration as a routine tuning step in transformer-based environmental modeling.
4.3. Metaheuristic Learning Behavior and Efficiency of Convergence
The convergence pattern exhibited by PSO during hyperparameter tuning provides critical insight into the optimization landscape of transformer-based models when applied to complex geospatial classification tasks such as heatwave susceptibility modeling. Unlike traditional exhaustive search methods such as grid search or even stochastic approaches like random search, PSO leverages swarm intelligence to balance exploration and exploitation—and this dynamic is clearly evident in the optimization trace. The flat R2 in early iterations (1–25) suggests broad exploration, as the swarm navigated a homogeneous solution space. This phase is vital in ensuring the algorithm does not prematurely commit to a suboptimal basin of attraction. Once more promising regions are located—evidenced by the sharp upturn beginning around iteration 30—the swarm collectively shifts toward higher-performing configurations, exploiting known good regions while still maintaining some exploratory behavior through inertia. The rapid improvement and early stabilization at R2 = 0.96 by iteration 63 demonstrate that PSO was highly effective in identifying near-optimal or optimal configurations well before exhausting the iteration budget. This suggests the underlying loss surface of the TabTransformer architecture, in this context, is sufficiently smooth for global optimization algorithms to operate efficiently—an important insight for computational resource allocation in future model development. Additionally, the lack of oscillations or regressions in the convergence curve points to algorithmic stability. This outcome affirms that the balance between global and local coefficients (both set to 2 in this case), combined with an appropriate inertia weight (0.4), created a well-calibrated swarm dynamic capable of sustained convergence. From an operational standpoint, this behavior is highly desirable: it implies that PSO can reduce the number of training iterations and thus computational cost without sacrificing performance. More importantly, the convergence trajectory validates the decision to pair PSO with TabTransformer in a remote sensing context, particularly where high-dimensional inputs and nonlinear dependencies exist—conditions that commonly arise in environmental modeling. The PSO convergence behavior not only confirms the algorithm’s tuning efficacy but also highlights its suitability as a robust hyperparameter optimization method in data-scarce, remote sensing–driven climate applications.
While direct, in-study benchmarking against alternative optimizers such as random search or Bayesian optimization was beyond the scope of this work, the choice of PSO was informed by its strong theoretical and empirical record in the literature. Numerous studies report that PSO variants achieve faster convergence, lower computational cost, and higher accuracy than other stochastic metaheuristics, owing to their ability to track the global best solution while adaptively refining search around promising regions [
85,
86,
87]. Enhanced PSO versions have also outperformed established methods such as COBYLA and Differential Evolution in tasks with smooth optimization landscapes [
88]. In specific applied domains, PSO-based methods have achieved performance levels exceeding 85%, consistently surpassing competing algorithms [
89]. Nonetheless, existing literature seldom includes head-to-head evaluations against random search or Bayesian optimization on identical, domain-specific tasks, and results can vary substantially depending on algorithm settings, data structure, and search space complexity [
90]. In the present study, coupling PSO with TabTransformer produced measurable improvements over untuned baselines across all reported performance metrics, validating its suitability for large-scale, high-dimensional hyperparameter optimization in heatwave susceptibility mapping.
4.4. Comparative Analysis of Model Learning Behavior and Optimization Impact
A detailed examination of the loss and accuracy trajectories reveals profound distinctions in the learning behavior and generalization capability between the baseline TabTransformer and the PSO-enhanced TabTransformer. The baseline model presents a pattern of reliable and incremental improvement, with the gradual convergence of loss and accuracy suggesting that the transformer architecture, when configured with default hyperparameters, is inherently suited to structured geospatial data, offering effective regularization and steady learning. The close association between training and validation performance indicates a well-balanced model free from substantial overfitting or underfitting. However, this conventional setup, while robust, manifests a relatively slow ascent to peak performance and terminates with slightly elevated loss values, hinting at possible inefficiencies in resource use and representational capacity when faced with complex, high-dimensional environmental inputs. PSO hyperparameter tuning significantly alters the model’s training dynamics. The TabTransformer–PSO exhibits not only a sharper and earlier reduction in loss for both training and validation data but also a more pronounced and sustained plateau at minimal loss values, highlighting the efficacy of the discovered hyperparameter configuration. Importantly, the validation loss remains as low as or marginally lower than the training loss throughout most of the process, a sign of well-matched regularization and model complexity—a delicate equilibrium rarely achieved by manual tuning. On the accuracy front, the PSO-tuned model achieves a remarkable leap in early-stage performance, with validation accuracy exceeding 90% almost immediately and stabilizing near the theoretical maximum as training progresses. The consistent parity between training and validation accuracy, and at times the slightly higher validation accuracy, strongly implies that the network is not merely memorizing training patterns but has internalized the essential data-generating processes key to generalization.
These observed differences are attributable to the tailored adjustment of architectural parameters—embedding dimension, layer depth, attention heads, and dropout rates—facilitated by PSO’s global search strategy. PSO dynamically navigates the hyperparameter landscape, striking an optimal balance between model representational power and regularization, thereby enhancing both learning efficiency and predictive robustness. The resource savings from faster convergence and the reduction in risk of overfitting are particularly valuable for large-scale, operational geospatial modeling. Furthermore, the refined decision boundaries and enhanced calibration resulting from this process translate directly into more reliable susceptibility delineation on the ground, which is critical for environmental risk mapping where actionable decisions depend on the model’s accuracy and stability. The epoch-wise training and validation curves exemplify the transformative impact of PSO-based hyperparameter optimization, confirming its suitability and substantial benefit in state-of-the-art, transformer-based remote sensing applications for climate hazard assessment.
4.5. Model Performance in Context
The quantitative comparison between the baseline TabTransformer and the PSO-tuned model revealed consistent improvements across training and test datasets. Key regression metrics—RMSE, MAE, and R2—indicate both better predictive accuracy and stronger explanatory power following PSO. On the training set, the tuned model achieved an RMSE reduction from 0.082 to 0.062 and MAE reduction from 0.027 to 0.019, with R2 increasing from 0.97 to 0.98. The test set exhibited similar trends: RMSE declined from 0.132 to 0.123, MAE from 0.038 to 0.034, and R2 improved from 0.93 to 0.938. These differences, while seemingly modest, are meaningful in spatial environmental modeling where incremental gains can translate into substantially improved risk delineation over heterogeneous landscapes. The reduction in error metrics implies that the PSO-tuned model’s predictions more closely approximate observed heatwave occurrences, reducing uncertainty in spatial risk assessments. Higher R2 values indicate better model fit, capturing more variance and underlying environmental drivers. The substantial discrepancy between country-specific and regional aggregate performance improvements reveals a critical geographic dimension in heatwave susceptibility modeling that warrants careful consideration. While the TabTransformer–PSO model consistently outperformed its baseline counterpart across all Central Asian nations, the magnitude of improvement varied dramatically between individual countries and their collective regional representation. This phenomenon suggests the presence of distinct climatic, topographic, and environmental characteristics within each country that respond differently to the PSO algorithm, indicating that heatwave susceptibility patterns are inherently heterogeneous across the Central Asian landscape. The exceptional performance observed in Uzbekistan, which achieved nearly 58% improvement in both MAE and RMSE metrics, can be attributed to several interconnected factors that distinguish this nation within the regional context. Uzbekistan’s geographic position in the heart of Central Asia, characterized by diverse topographic features ranging from the Kyzylkum Desert to mountainous regions, creates complex microclimatic conditions that benefit significantly from the enhanced feature selection and hyperparameter optimization provided by the PSO algorithm.
The geographic clustering evident in the heatwave susceptibility maps provides additional context for understanding these performance variations. The southwestern regions, encompassing much of Uzbekistan and Turkmenistan, display predominantly very high heatwave susceptibility, creating distinct patterns that the PSO-optimized model can more effectively differentiate and predict. This spatial coherence in high-risk areas allows the enhanced model to better delineate the boundaries between susceptibility classes, resulting in more accurate classifications and consequently lower error metrics. In contrast, countries like Kazakhstan, show more heterogeneous susceptibility patterns across their vast territory, potentially explaining why their improvement percentages, while substantial, do not reach the levels observed in Uzbekistan. The aggregation effect that reduces apparent model improvements from country-specific levels to regional levels represents a classic example of Simpson’s paradox in geospatial analysis. When individual country datasets are pooled to create a regional model, the unique characteristics and optimization benefits specific to each geographic domain become diluted within the larger, more generalized dataset. This phenomenon has profound implications for practical implementation of heatwave early warning systems, suggesting that country-specific model deployment would yield significantly better prediction accuracy than a single regional model approach. The weighted average effect of pooling diverse geographic conditions, climate regimes, and topographic features across the entire Central Asian region creates a modeling environment where the PSO algorithm cannot fully exploit the localized patterns that drive its exceptional performance at individual country scales. The implications of these findings extend beyond academic interest to practical climate adaptation strategies across Central Asia. The demonstrated country-specific benefits of PSO suggest that national meteorological services would achieve superior heatwave prediction accuracy by implementing individualized models rather than relying on regional approaches. This localized modeling strategy could significantly enhance early warning system effectiveness, potentially saving lives and reducing economic impacts associated with extreme heat events. Moreover, the geographic heterogeneity in model improvements indicates that countries experiencing the highest susceptibility to heatwaves, such as those in the southwestern portions of the region, also benefit most from advanced modeling techniques, creating a synergistic relationship between vulnerability and predictive capability enhancement.
Furthermore, the AUROC analysis confirmed a strengthened ability to discriminate between heatwave and non-heatwave classes. High AUROC values (>0.9) across both models denote strong classification performance, with PSO tuning yielding consistent gains. This reinforces the tuned model’s potential utility in operational hazard mapping where precise identification of high-risk zones is critical. Importantly, the congruence of training and test performances indicates good generalization, signifying that regularization and model complexity control mechanisms embedded within the transformer and optimized via PSO effectively prevent overfitting. Given the spatial autocorrelation and temporal variability inherent in environmental data, this robust performance highlights the framework’s capacity to handle real-world complexities. The comparative 10-fold cross-validation results indicate that both the baseline TabTransformer and the PSO-optimized variant delivered consistently strong classification performance, achieving mean AUC values above 0.93 across all folds. The TabTransformer–PSO model not only attained a slightly higher mean AUC (0.94 vs. 0.93) but also recorded a lower standard deviation (0.06 vs. 0.07), suggesting a more stable performance profile across the folds. This modest yet consistent improvement supports the effectiveness of PSO in fine-tuning hyperparameters to enhance both predictive accuracy and model robustness. The relatively narrow variance in fold-wise results for both models underscores their generalizability, while PSO appears to further reduce susceptibility to performance fluctuations caused by variations in the training–validation split. In the context of heatwave susceptibility mapping, such stability is crucial for ensuring reliable spatial predictions under varying input conditions.
It is noteworthy that our goal was to develop a truly remote sensing–based framework, given the unique characteristics of heatwaves and the geographic realities of Central Asia. Unlike sudden-onset hazards such as landslides, where precise event localization is possible through on-site observation, heatwaves are dynamic, spatially extensive, “creeping” phenomena. Their occurrence and intensity fluctuate gradually across regions, often without a singular, pinpointable ground locus. Remote sensing–derived indices—such as the Heat Wave Index (HWI)—enable the systematic detection of widespread anomalies over large areas, which is essential for regions of this scale. Moreover, Central Asia’s vast territorial expanse presents practical challenges for in situ data collection, even where ground-based records exist. Ensuring well-distributed, spatially representative heatwave evidence is considerably more difficult than simply maximizing the number of samples. Our approach intentionally balanced abundance with representativeness, selecting 200 georeferenced points from remote sensing products to reflect the diversity of climatological zones and land covers across all five countries. A larger sample size (e.g., thousands of densely clustered points) could risk overfitting and degrade generalizability. This rationale is consistent with similar environmental susceptibility studies—for example, Liu et al. [
91] employed a composite Standardized Drought Condition Index (SDCI) derived entirely from remote sensing as an evidence layer for drought modeling when harmonized, large-scale ground truth was unavailable. Even where heatwave impact records exist, they are typically sparse, irregularly distributed, or influenced by differences in national reporting standards. The rarity of large-scale heatwave susceptibility mapping reflects not only methodological novelty but also the constraints of assembling suitable ground evidence across international, multi-ecotope domains.
Finally, it should be noted that both remote sensing indices and ground-based datasets have inherent sources of uncertainty. While RS-based indices may be limited by coarse spatial or temporal resolution, ground-truth heatwave data is itself complicated by subjective thresholds (e.g., population vulnerability, adaptation, and perception), making the construction of a comprehensive and standardized ground-evidence database inherently challenging.
4.6. Agreement and Divergence in Susceptibility Outputs
The areal comparisons of heatwave susceptibility maps indicate that PSO led to subtle but meaningful shifts in the spatial pattern of susceptibility classes. Specifically, the TabTransformer–PSO model produced a larger extent of very-high-susceptibility zones (increase of ≈ 56,000 km2) coupled with reductions in the low (decrease of ≈ 71,200 km2) and very low categories. Although the relative percentage changes may appear small, the absolute differences translate to tens of thousands of square kilometers—an extent large enough to encompass multiple provinces or major metropolitan regions in Central Asia. If we treat the PSO-optimized model as the more reliable baseline, then the weaker-performing TabTransformer effectively misclassified this ≈ 56,000 km2 as low or very-low risk when it should have been flagged as very high susceptibility. Such false-negative errors carry significant management and socio-economic consequences: regions wrongly assumed to be safe may become targets for urban expansion, infrastructure investment, or agricultural intensification, placing populations, economic assets, and critical systems squarely in harm’s way during future heatwaves. This creates a risk of avoidable losses in human health, agricultural productivity, and infrastructure resilience, along with increased emergency response burdens. Conversely, the contraction of low-risk areas in the PSO-optimized output signals that safety margins in certain locations may be narrower than previously assumed, highlighting the need for adaptive planning even in zones historically considered resilient. While the shifts in moderate- and high-susceptibility classes were relatively minor, the consistent coverage patterns across models still reinforce their agreement on large-scale susceptibility gradients.
The statistical evaluation of susceptibility maps generated by the two models sheds light on spatial consistency and differences resulting from hyperparameter tuning. The Chi-squared test revealed highly significant differences (p < 0.0001) in class frequency distributions, indicating that the models allocate areas differently across susceptibility categories. This suggests that PSO tuning modifies the sensitivity and thresholds applied to raw predictive outputs, refining the spatial classification of heatwave hazard. The Friedman test (p < 0.0001) confirmed systematic differences in spatial susceptibility rankings. This test’s sensitivity to ordinal differences highlights how PSO tuning affects not only class frequencies but the relative ordering of areas by vulnerability. Finally, the Wilcoxon signed-rank test comparing paired spatial predictions confirmed that differences at individual locations are significant, reinforcing that tuning alters local susceptibility estimates rather than merely shifting global statistics. This is critical for practical applications, where local accuracy determines the effectiveness of targeted interventions. Together, these statistical tests validate that hyperparameter tuning materially changes susceptibility outputs. The changes likely arise from enhanced model generalization, improved feature weighting, and better calibration of decision boundaries within the transformer architecture. While the broad spatial patterns of susceptibility remain coherent, PSO tuning provides a more nuanced delineation of hotspots and transitional zones, potentially enabling more precise resource allocation and hazard mitigation.
4.7. Geographic Interpretation: Country-Wise Discussion of Heatwave Susceptibility
A country-wise examination of the susceptibility outputs, contextualized using the overly on country borders, reveals substantive patterns that underscore both regional climatic realities and the advantages conferred by PSO-based model optimization. Kazakhstan, the largest country in the study area, displays striking north–south contrasts. In both models, the northern and eastern regions—covering the Kazakh Uplands and the Altai foothills—are predominantly classified as low to very low susceptibility, corresponding to their cooler, higher-elevation, and more temperate climate regimes. Moving southward, particularly below approximately 45° N latitude, susceptibility intensifies markedly, with the strongest effects observed in the dry, expansive lowlands bordering Uzbekistan and around the Aral Sea basin. The TabTransformer–PSO output offers a distinctly sharper separation of these zones compared to the baseline model, narrowing the bands of high susceptibility to those areas that are physiographically most exposed to extreme heat. This refinement aligns well with observed climatic trends, lending support to the model’s improved realism. Uzbekistan emerges as one of the most heatwave-prone countries in the study, a finding consistent across both susceptibility maps. Most of the republic falls into the high or very-high-susceptibility categories—particularly the Kyzylkum Desert, the lower Amudarya basin, and urban centers such as Tashkent and Samarkand. The PSO-enhanced map delineates urban and irrigated oases with better granularity, differentiating moderate susceptibility zones in the Fergana Valley and river corridors from the contiguous high-susceptibility deserts. Notably, the cities and agricultural heartlands are more tightly circumscribed by zones of extreme exposure in the PSO-optimized model, emphasizing elevated vulnerability in densely populated areas.
Turkmenistan consistently shows widespread high to very high susceptibility throughout its interior, especially across the Karakum Desert and surrounding lowland plains. The area of maximal susceptibility is slightly reduced and more precisely defined in the TabTransformer–PSO result, with clearer boundaries between high-risk desert interiors and the relatively less-susceptible borderlands near the Kopet Dag foothills along the Iranian frontier. This improved spatial resolution is critical for targeting interventions in this country, where marginal climatic and water resource conditions prevail. Kyrgyzstan is characterized by markedly lower susceptibility in its mountainous east and central regions, as both models capture the climate-mitigating influence of high elevations in the Tien Shan. Lower-lying areas in the southwest, including the Fergana Valley and the Chuy Valley near Bishkek, transition into higher susceptibility categories, though the PSO-based model portrays these at slightly greater spatial fidelity, avoiding overgeneralization into neighboring highlands. The result is a risk landscape that mirrors actual climatic and topographical gradients, crucial for a predominantly mountainous nation. Tajikistan, dominated by the Pamir and Alai mountains, overwhelmingly falls into the very low to moderate susceptibility classes, especially in the east and central massif. However, both models identify the southwestern and western lowland areas—especially the Vakhsh and Panj Valley corridors—as moderate to high-risk zones. The optimized TabTransformer–PSO provides a finer discrimination of these river-basin risk areas, crucial for hazard mapping in Tajikistan’s densely settled, agriculturally intensive valleys.
In a nutshell, while broad patterns are preserved, PSO ensures greater spatial precision and ecological plausibility across all countries. Susceptibility classes are more tightly correlated with topo-climatic and environmental characteristics—such as temperature and rainfall, aspect and elevation, and known heatwave hotspots—under the TabTransformer–PSO model. This sharpening of boundaries and improved mapping of transitional zones increases the operational value of susceptibility outputs, supporting targeted policy and intervention efforts in the region’s varied socio-environmental contexts. Ultimately, the integration of advanced deep learning and metaheuristic optimization yields susceptibility maps that are not only statistically robust but also geographically meaningful—an advance of direct relevance to heatwave adaptation planning in Central Asia.
The country-wise susceptibility patterns produced by our TabTransformer–PSO framework are strongly supported by independent climatological and impact-focused studies. In Kazakhstan, our very high-risk zones in the southern lowlands and around the Aral Sea closely correspond to regions identified by Wang et al. [
42] as experiencing the most frequent and intense heatwaves, driven primarily by rapid soil moisture decline from reduced precipitation and elevated net radiation. The March 2025 West Asia heatwave attribution study further highlights southern and eastern Kazakhstan as recording unprecedented early-season heat, coinciding with sensitive agricultural phenophases [
66]; this aligns with the agricultural–public health vulnerability focus of Broomandi et al. [
64]. In Uzbekistan, our concentration of very high susceptibility in the Kyzylkum Desert, the lower Amu Darya basin, and urban Tashkent mirrors WWA [
66] findings, which attribute these hotspots to combined soil moisture scarcity, UHI intensification, and dependence on glacier-fed irrigation. For Turkmenistan, we map pervasive high susceptibility across the Karakum Desert core, with moderating effects along Caspian coastal regions; this is consistent with Wang et al. [
42], who observed chronic aridity and radiation load in the desert interior, and with WWA [
66], which documented severe March 2025 anomalies in the country’s central and southern lowlands. In Kyrgyzstan, our model emphasizes elevated susceptibility in the low-lying Chuy and Fergana valleys, matching the hotspot valleys recognized by Fallah et al. [
36] in multi-decadal heat extremes, and corroborating WWA [
66] observations of UHI amplification in Bishkek and other expanding urban centers. Similarly, in Tajikistan, our identification of the Vakhsh and Panj valleys as higher susceptibility zones is supported by Fallah et al. [
36], who cite intense pre-monsoon radiation and low evaporative cooling as key drivers, and WWA [
66], which notes early-season extremes affecting these irrigated lowlands. Across all countries, the dominant environmental factors emphasized in the previous literature—notably soil moisture depletion, precipitation deficits, high solar radiation, UHI effects, and elevation—are explicitly represented among our predictors, reinforcing the physical credibility of the susceptibility patterns. Where differences arise—such as our sharper discrimination of Caspian coastal moderation in Turkmenistan—they largely reflect the finer 30 m RS-driven resolution and heatwave-specific focus of our approach, compared with broader temperature-extreme assessments in some studies. These comparisons are provided in detail in
Table 11, summarizing country-specific hotspots, associated environmental drivers, and the level of agreement between our high-resolution heatwave-focused mapping and broader temperature-extreme or multi-hazard assessments from the literature.
4.8. Broader Implications and Applications
The integration of advanced transformer-based deep learning with PSO hyperparameter tuning in this study offers a powerful, data-driven methodology for heatwave susceptibility mapping in Central Asia. This region’s wide-ranging climatic zones, complex terrain, and rapid socioeconomic changes pose significant challenges to traditional statistical or physically based models, which often rely on sparse observations and linear assumptions. By leveraging high-resolution satellite data and flexible neural architectures, this approach captures the nonlinear, multivariate relationships driving heatwave dynamics, supporting scalable and transferable modeling. PSO’s improvements support its broader use in geospatial deep learning to boost performance and reliability. The practical applications of these findings are manifold. Urban planners and disaster management agencies can use fine-grained susceptibility maps to identify vulnerable neighborhoods and prioritize mitigation actions such as increasing urban green spaces, improving building materials, and implementing early warning systems. Policymakers can integrate these spatial risk assessments into climate adaptation strategies, directing investments to areas with greatest need. Furthermore, the methodological framework’s reliance on publicly available data and open-source algorithms enhances reproducibility and facilitates adaptation to other regions facing heatwave risks. Future work can build on this foundation by incorporating additional socio-economic variables, climate projections, or multimodal datasets to enrich susceptibility characterization.
From a climate risk governance perspective, mapping accuracy gains that translate into tens of thousands of square kilometers’ reclassification are not trivial—they can reshape priorities for early-warning systems, heat-health action plans, agricultural advisories, and infrastructure cooling strategies, preventing long-term socio-economic setbacks in one of the world’s most heatwave-vulnerable regions. This case highlights that even small numerical differences in performance metrics should not be overlooked; a model with seemingly high accuracy, such as the baseline TabTransformer, may still lead to substantial misclassification when applied across large geographic extents, resulting in far-reaching and potentially detrimental consequences. It should be emphasized that susceptibility maps—particularly those generated at continental-to-regional scales—are intended to guide preliminary susceptibility assessments and policy prioritization. They identify relative hotspots that warrant further, site-specific evaluation, rather than providing definitive, actionable sites at high spatial precision. For management application, top-down analytical frameworks such as ours should be succeeded by detailed, local validation and customized interventions.
4.9. Uncertainty Considerations in Heatwave Susceptibility Mapping
Uncertainty is an inherent component of environmental risk mapping, arising from multiple sources within both the input data and the modeling process. In the present study, all predictors and evidence layers were derived from remote sensing and global reanalysis products, which, although spatially continuous, differ in spatial resolution, temporal frequency, and retrieval algorithms. For example, ERA5-derived heatwave evidence is provided at a coarse grid (0.25°) and must be downscaled to align with finer-resolution environmental predictors such as MODIS-based NDVI or land surface temperature. This multi-scale integration can introduce spatial misalignment and propagate errors through the modeling pipeline. Additionally, variable retrievals—such as vegetation indices or land cover classifications—are sensitive to atmospheric correction quality, sensor calibration, and seasonal acquisition timing, contributing to measurement uncertainty.
Model-driven uncertainty is also relevant. While the TabTransformer architecture can capture complex, non-linear relationships across mixed-type variables, its training outcomes are sensitive to hyperparameter selection, initialization, and stochastic learning dynamics. Although PSO-based optimization helped converge toward a performant solution, slight variations in training data splits or optimizer hyperparameters could yield differences in predicted susceptibility surfaces. Furthermore, the absence of in situ ground-truth data for direct validation introduces epistemic uncertainty: while the susceptibility maps display strong internal performance metrics, they do not quantify how residual errors vary across different environmental contexts or administrative regions.
From a management perspective, such uncertainties influence the confidence with which susceptibility classes can be used for early warning, land-use planning, or infrastructure prioritization. Overestimation could lead to inefficient allocation of resources, whereas underestimation might leave vulnerable areas unprepared. Mitigation of these uncertainties could be pursued through improved spatial harmonization of input datasets, incorporation of independent ground observations where feasible, and ensemble modeling to average over variable model configurations and data perturbations. Future work should extend the present framework with a formal spatial uncertainty analysis—such as Monte Carlo simulations, prediction interval mapping, or pixel-level variance decomposition—to generate both susceptibility and confidence surfaces. Such paired outputs would allow stakeholders to integrate uncertainty directly into heatwave susceptibility-to-risk assessment and decision-making processes, ensuring more robust and transparent hazard management.
4.10. Limitations and Future Research Directions
While remote sensing-based approaches offer powerful advantages for transboundary, data-scarce susceptibility mapping, they also introduce limitations. The absence of ground-based meteorological and impact data in this study both reflects regional data scarcity and necessarily restricts some aspects of model calibration and validation. Although the workflow leverages ERA5 and satellite products for maximally consistent and reproducible mapping, ground observations could provide additional insights for model adjustment, especially regarding localized or microclimatic extreme events. Differences in remote sensing product resolution, revisit intervals, and potential data artifacts introduce further uncertainty. Additionally, the dynamic and distributed nature of heatwaves means that associations with on-the-ground impacts are not always direct; population- and context-dependent vulnerabilities further complicate the establishment of hard ground “truth”. Nevertheless, as more in situ meteorological and health-impact datasets become available in Central Asia, future work should pursue integrated, hybrid approaches that fuse remote sensing and ground-based evidence. This could improve both the generality and precision of susceptibility mapping, as well as contribute to operational heatwave risk management. Finally, susceptibility maps should be understood as a screening tool for broad hazard assessment and policy prioritization. While our results identify generalized hazard hotspots, localized on-site validation and more granular investigations are needed before implementing targeted adaptation or management actions. Future work could therefore combine detailed field studies or citizen-reported impact records with RS-derived susceptibility to refine and localize intervention strategies.
In addition, while this study adopted a two-way (training/validation) partitioning strategy supplemented with 10-fold cross-validation and internal validation within the TabTransformer architecture, we acknowledge that a three-way split (training/validation/testing) can be especially valuable for smaller, noisier, or highly imbalanced datasets. Such partitioning allows for clearer separation between hyperparameter tuning and final performance evaluation. Although the abundance and spatial representativeness of our dataset, coupled with rigorous cross-validation, rendered an explicit validation set unnecessary for the present work, future studies in data-limited settings—particularly those relying on shorter climatic records or rare event samples—could benefit from incorporating a dedicated validation subset to further safeguard against overfitting and ensure maximal generalizability.
Another methodological limitation relates to the absence of direct benchmarking between PSO and alternative hyperparameter optimization strategies such as random search, Bayesian optimization, or other metaheuristics. While the present study adopted PSO based on established literature highlighting its convergence efficiency and suitability for high-dimensional search spaces, the relative performance of different optimizers was not empirically assessed for this specific application. Future work should therefore incorporate systematic head-to-head evaluations under comparable experimental conditions to quantify trade-offs in accuracy, convergence speed, and computational cost, and to better characterize optimizer–model interactions in the context of environmental hazard mapping.
5. Conclusions
This study demonstrated the integration of the TabTransformer deep learning architecture with Particle Swarm Optimization (PSO) metaheuristics for remote sensing-based heatwave susceptibility mapping in Central Asia. Multiple independent evaluation approaches—including 10-fold cross-validation, statistical significance testing (Chi-squared, Friedman, and Wilcoxon, all p < 0.0001), and areal extent analysis—consistently showed that the PSO-optimized model delivered higher predictive accuracy, greater stability across folds, and improved spatial delineation of susceptibility zones compared with the baseline. These findings attest to the robustness of the framework under different data partitions and confirm its generalizability across varying geographic subsets within the study area. The model’s applicability is reinforced by the fact that even marginal gains in AUC and classification precision translated into reclassifications of up to tens of thousands of square kilometers, enough to substantially alter the designation of priority areas for adaptation.
From a practical standpoint, the areal comparison between models underscores the tangible management consequences of enhanced mapping accuracy: if the weaker-performing baseline had been used as the sole decision support, approximately 56,000 km2 of land—previously labeled as low or very low risk—would not have been recognized as very high susceptibility. Such false-negative misclassifications could inadvertently channel urban expansion projects, infrastructure investment, or agricultural intensification into areas facing elevated heatwave risk, with long-term socio-economic and public health consequences. Conversely, the reduction in low-risk zones in the PSO-optimized output warns of narrower safety margins, prompting more cautious land-use and climate adaptation planning even in places traditionally seen as resilient. This case demonstrates that even modest shifts in headline performance metrics can trigger large-scale changes in operational decision landscapes when applied to continental-scale hazard mapping, yet the consistent broad-pattern agreement between models still affirms their convergence on the dominant regional susceptibility gradients.
The produced susceptibility maps highlight pronounced spatial patterns across the study area, identifying southern and southwestern subregions—such as Turkmenistan, Uzbekistan, and southern Kazakhstan—as most prone to heatwave hazards, in line with climatic and physiographic realities. The PSO-optimized model particularly excelled at sharpening spatial transitions and reducing the overextension of high-susceptibility zones, enhancing practical interpretability and value for targeting interventions. Feature importance analyses highlighted the primacy of thermal metrics—maximum temperature and frequency of hot days—while also affirming supporting roles for rainfall, land surface temperature, and topography. From a policy and planning perspective, the susceptibility maps generated through this approach can serve as actionable tools for prioritizing communities and infrastructure for heat adaptation interventions. Decision-makers can use these spatial outputs to identify high-risk hotspots, optimize early warning systems, and guide targeted allocation of resources such as cooling centers, green infrastructure, and public awareness campaigns.
However, we acknowledge that the spatial resolution of satellite-derived datasets imposes certain limitations, particularly in heterogeneous or urbanized landscapes where local-scale variations may be underrepresented. While in situ impact data were not uniformly available for our study region, we sought out comparable and relevant studies reporting historical heatwave impacts in analogous climatic contexts to provide a fair, real-world basis for interpreting our validation results. Integrating such datasets in future applications will strengthen context-specific calibration, enhance operational confidence in the model’s outputs, and improve the granularity and accuracy of subsequent susceptibility assessments. Of particular note, the proposed framework is highly scalable and transferable to other data-scarce regions that experience heatwave hazards, provided that comparable remote sensing and ancillary datasets are available. This scalability arises from the exclusive use of open-access, globally consistent satellite-derived indicators, the domain-agnostic structure of the TabTransformer model—which flexibly accommodates diverse geospatial and environmental variables—and the PSO-based automatic hyperparameter optimization, which minimizes manual adjustment and the need for expert regional intervention. Adaptation to diverse climatic and landscape contexts can thus be achieved with only minor modifications to input features or thresholds, enabling rapid and reproducible deployment at broader geographic scales.
Looking forward, expanding the framework to incorporate multi-year temporal validation, finer-resolution predictors in urbanized or heterogeneous areas, and integration with region-specific impact datasets will further enhance confidence in operational deployment. These steps, combined with its demonstrated robustness, scalability, and consistency with regional heatwave trends, position the approach as an immediately valuable and continually improvable tool for climate adaptation strategy in Central Asia and beyond.