1. Introduction
The precise forecasting of severe weather is crucial for protecting vital infrastructure and developing contingency strategies for catastrophe prevention and mitigation. Heatwaves, a typical manifestation of extreme weather, present substantial hazards to energy and power systems by challenging electricity grid load management and dispatch operations. An illustrative instance occurred in the summer of 2019 in Sichuan Province, a significant hydroelectric base in China, where enduring heatwaves induced a marked increase in electricity demand while concurrently diminishing hydropower production due to falling reservoir levels, ultimately resulting in regional power shortages [
1]. This incident highlights the critical necessity of accurate heatwave forecasts for safe grid dispatch and power supply management.
The Sichuan region features a complex topography of alternating basins and plateaus (
Figure 1). The formation and persistence of surface meteorological elements, especially extreme high temperatures characteristic of heatwaves, are governed by intricate multi-scale atmospheric interactions. Such prolonged heat events are generally governed by quasi-stationary synoptic-scale systems, such as the subtropical high, which establish a favorable background through large-scale subsidence and enhanced solar radiation [
2,
3]. Under such a synoptic-scale forcing background, local complex terrain further modulates the surface energy budget and boundary-layer structure through thermal and dynamic processes, thereby altering the intensity and spatial distribution of high temperatures [
4]. Therefore, the skill of a numerical weather prediction (NWP) model in accurately simulating the coupling between synoptic-scale forcing and terrain-induced local processes is key to achieving precise forecasts of such extreme heat events in this region.
Operationally, regional NWP systems based on the Weather Research and Forecasting (WRF) model [
5] play an important role in such forecasting tasks. However, in extended-range forecasts spanning a week or longer, regional models exhibit a tendency for systematic large-scale circulation drift, causing simulated weather systems to progressively diverge from their actual evolution in terms of both position and intensity [
6]. This systematic error arises in part from the declining constraining effect of lateral boundary conditions over time, which undermines the coherence between the interior model solution and the global driving field, ultimately degrading forecasts of weather occurrence location and magnitude [
7].
A conventional remedy for such large-scale drift is to enlarge the simulation domain, though this incurs substantially higher computational cost. Research demonstrates that lateral boundary effects often persist for about three days of integration, with this duration potentially being shorter for smaller domains, indicating that merely expanding the domain is not a fundamental remedy [
8]. An alternative and widely adopted strategy employs relaxation nudging techniques, which dynamically adjust the large-scale circulation of the regional model to align more closely with a dependable global analysis or forecast field. The WRF model offers two principal nudging methods: grid (analysis) nudging and spectral nudging [
5]. Unlike grid nudging, which uniformly adjusts all scales in physical space, spectral nudging applies scale separation in wavenumber space, imposing dynamic constraints only on components larger than a predefined cutoff wavenumber [
9,
10,
11]. Theoretically, this approach better balances the guidance from the global background field with the regional model’s inherent capacity to develop locally consistent, physically realistic features, thus utilizing the advantages of both global and regional models [
12,
13]. This is especially beneficial for maintaining terrain-induced mesoscale circulations in areas of complex topography.
The effectiveness of spectral nudging, however, is highly affected by the cutoff wavenumber, a key parameter whose represented physical scale is intrinsically dependent on the dimensions of the simulation domain. Because the technique relies on a two-dimensional Fourier transform, the physical wavelength corresponding to a given wavenumber is determined by the domain’s dimensions. Consequently, an identical cutoff wavenumber imposes constraints on different physical scales over domains of differing sizes. Suboptimal parameter configuration, such as an excessively small domain or an inappropriate wavenumber selection, may therefore fail to effectively constrain key synoptic systems or, conversely, may over-suppress the model’s simulation of local processes [
14]. Traditional parameter selection often relies on subjective experience, lacking an objective, weather-type-specific methodology. Thus, for forecasting heatwave events over Sichuan’s complex terrain, establishing the optimal spectral nudging configuration, which involves identifying the appropriate physical scale to constrain for a specific domain, remains a major issue for enhancing forecast accuracy using this method.
To address this gap, this study investigates a significant heatwave event that impacted the Sichuan electricity grid. We formulate and evaluate an objective methodology for spectral-nudging parameterization, primarily aimed at identifying the best cutoff wavenumber through power-spectrum energy diagnostics of the background field. This method seeks to objectively identify and constrain the dominant large-scale waves, thus alleviating the systematic drift in large-scale circulation that typically undermines extended-range regional forecasts as lateral boundary constraints weaken. Such drift manifests as progressive deviations in the position and intensity of steering circulation features, such as the subtropical high, directly leading to errors in forecasting the spatial extent and intensity of heatwave episodes. We comprehensively assess the impact of various spectral nudging configurations on correcting drift and enhancing surface temperature forecasts using a series of sensitivity tests, focusing on the essential relationship between domain size and cutoff wavenumber. A central innovation of this work is the development of an objective “energy-threshold” method to determine the spectral nudging cutoff wavenumber. This method moves beyond the subjective selection common in prior studies by quantitatively identifying the dominant large-scale waves from the background field’s energy spectrum. To rigorously evaluate this approach, our experimental design incorporates configurations that represent a spectrum of constraint strengths—from no nudging, to optimally selective nudging (using our objective method), to nudging that imposes broader, less selective constraints on the circulation. This framework allows us to explicitly assess whether and when scale-selective spectral nudging provides forecast benefits over both unconstrained simulations and simulations with more pervasive dynamic constraints.
The remainder of this paper is structured as follows.
Section 2 describes the data and details the proposed objective method for determining the spectral nudging cutoff wavenumber, focusing on a heatwave event in 2019.
Section 3 presents the design of a suite of WRF sensitivity experiments, which contrast spectral nudging forecasts with various combinations of domain size and cutoff wavenumber.
Section 4 examines the outcomes of these comparative experiments. Finally,
Section 5 summarizes the main conclusions.
4. Results
4.1. Evaluation of Simulation Performance Under Different Configurations
4.1.1. Compensatory Effect of Spectral Nudging in a Smaller Domain
The control experiment in Group G1 (LT-SR-no) used a relatively small simulation domain, which limited the captured large-scale circulation and allowed error accumulation (i.e., large-scale drift) over long integration times. Theoretically, this could lead to poor forecasting of the heatwave.
Figure 3 shows that the daily maximum 2 m temperature during this heatwave event occurred around 1700 LST; therefore, the 2 m temperature at 1700 LST was used to characterize the spatial extent and intensity of the heatwave. The spatiotemporal evolution of the heatwave simulated by experiment LT-SR-no is presented in
Figure 8. Compared with observations (
Figure 4), the simulated heatwave extent is notably smaller and its intensity is weaker. In particular, the experiment failed to reproduce the high temperatures over the eastern Sichuan Basin (east of 105 °E) on 17 August (cf.
Figure 4c and
Figure 8c).
The other two sensitivity experiments in Group G1 (LT-SR-E1 and LT-SR-E2) employed spectral nudging, yielding simulated heatwave evolutions that differed markedly from the control run (LT-SR-no). These differences are illustrated in
Figure 9, which shows how spectral nudging modifies the simulated heatwave pattern. Crucially, the spatial changes reflect a correction of the underlying dynamical bias, not just a quantitative adjustment.
For LT-SR-E1 (
Figure 9a–c), the heatwave intensity was significantly stronger than in LT-SR-no, with temperatures 2–3 °C higher over the eastern Sichuan Basin (east of 105 °E) on 17 August. This spatially concentrated warming is the direct surface manifestation of the compensatory effect: by constraining the dominant large-scale waves, spectral nudging systematically intensified the synoptic-scale high-pressure system aloft. The enhanced subsidence associated with a stronger high suppresses cloud formation and promotes adiabatic warming, leading to the more accurate and intense heatwave forecast shown here. The confinement of the strongest warming to the basin’s eastern sector underscores that the improvement stems from a better-represented circulation pattern.
In contrast, LT-SR-E2 (
Figure 9d–f), which applied a broader spectral constraint (higher cutoff wavenumbers), produced a more mixed result. While it also showed warming over the eastern basin on 17 August (
Figure 9f), its simulated heatwave intensity was generally weaker than that of the control run (LT-SR-no) on 15–16 August (
Figure 9d–e). This suggests that the less selective constraint of LT-SR-E2 was less effective in compensating for the large-scale drift during the earlier phase of the forecast, and may have begun to suppress beneficial mesoscale development, a precursor to the over-constraint effect discussed later for larger domains.
To quantitatively evaluate the heatwave forecast performance of Group G1 experiments, the simulated 2 m temperature at 1700 LST was bilinearly interpolated to CDMet stations (with daily maximum temperature ≥ 35 °C) near the electricity grid. The mean error (ME) and root mean square error (RMSE) were then calculated; the results are listed in
Table 2. All experiments in Group G1 showed a negative ME, indicating a systematic underestimation of heatwave intensity. The underestimation was most pronounced in the control experiment LT-SR-no (ME and RMSE of −2.41 °C and 2.78 °C, respectively). The application of spectral nudging reduced these errors. For example, compared with LT-SR-no, experiment LT-SR-E2 improved the ME and RMSE by 15% and 15%, respectively, while LT-SR-E1 showed even greater improvements of 25% and 21%.
The cutoff wavenumber pair for experiment LT-SR-E2 was (8, 6), whereas for LT-SR-E1 it was (2, 2). This indicates that the former nudged not only large-scale circulation but also relatively smaller-scale systems. The comparative results from the three experiments in Group G1 (see
Figure 9 and
Table 2) demonstrate that within a relatively small domain, better simulation is achieved by nudging only the dominant large-scale circulation (as in LT-SR-E1). Forecast errors arising from system drift due to the limited domain can be partially corrected by spectral nudging, an effect we term the compensatory effect of spectral nudging. Importantly, the performance of LT-SR-E2, which nudges a broader spectrum of waves (including synoptic scales) and thus approximates the effect of a strong-constraint grid nudging, lies between that of the control run and the optimal spectral nudging (LT-SR-E1). This quantitative comparison underscores a key advantage of properly configured spectral nudging: by selectively constraining only the most critical large-scale waves, it achieves superior forecast skill compared to both no nudging and a nudging strategy that imposes broader, less selective constraints.
4.1.2. Over-Constraint Effect of Spectral Nudging in a Larger Domain
Theoretically, a larger simulation domain can better resolve the spatiotemporal evolution of large-scale circulation, alleviate system drift caused by boundary conditions, and thus improve forecast performance. This theoretical expectation is supported by the results of the control experiment LT-LR-no in Group G2 (
Figure 10). The heatwave intensity simulated by LT-LR-no was stronger than that in LT-SR-no and was nearly comparable to that of experiment LT-SR-E1, which used spectral nudging (cf.
Figure 9a–c and
Figure 10a–c).
However, the two sensitivity experiments in Group G2 that employed spectral nudging (LT-LR-E1 and LT-LR-E2) performed worse than the control experiment LT-LR-no (
Figure 11). Contrary to the performance ranking in Group G1 (where LT-SR-E1 outperformed LT-SR-E2), LT-LR-E1 performed worse than LT-LR-E2. Quantitative statistics (table omitted) also confirmed this finding. This indicates that when the simulation domain is sufficiently large to adequately resolve the evolution of large-scale circulation, imposing spectral nudging to constrain either the large-scale circulation (LT-LR-E1) or relatively smaller-scale systems (LT-LR-E2) degrades the heatwave forecast. We term this the over-constraint effect of spectral nudging.
4.1.3. Sensitivity of the Nudging Effect to the Length of Lead Time
The forecast lead time for Groups G1 and G2 was 144 h. If the lead time is shortened, the drift of large-scale circulation systems would be relatively smaller, potentially leading to better forecast performance than in the longer-lead experiments. Groups G3 and G4 correspond to G1 and G2, respectively, but with a shortened lead time of 96 h. The forecast performance of their control experiments (ST-SR-no and ST-LR-no) for the heatwave showed clear improvement compared to the control experiments in Groups G1 and G2 (LT-SR-no and LT-LR-no).
Figure 12 illustrates these improvements in the spatiotemporal evolution of the heatwave. The improvement was more pronounced for the smaller domain (
Figure 12a–c) than for the larger domain (
Figure 12d–f), particularly for the forecast on 17 August (cf.
Figure 12c,f).
Figure 13 shows the impact of the sensitivity experiments in Groups G3 and G4 on heatwave forecasts relative to their respective control experiments. It reveals the differential effects of spectral nudging on forecast performance for smaller and larger domains under a shorter lead time. For the smaller domain, both ST-SR-E1 (top row of
Figure 13) and ST-SR-E2 (second row) significantly improved the heatwave forecast on 16 and 17 August. In contrast, for the larger domain, ST-LR-E1 (third row) and ST-LR-E2 (bottom row) did not yield clear improvements. This indicates that with a shorter lead time, applying spectral nudging in a smaller domain can effectively improve heatwave forecasts, whereas its benefit is minimal for a larger domain.
A quantitative assessment of the heatwave forecast performance for Groups G3 and G4 is summarized in
Table 3, supporting the qualitative analysis from
Figure 12 and
Figure 13. Control experiments with shorter lead times consistently showed better ME and RMSE than their longer-lead counterparts (e.g., compare ST-SR-no in
Table 3 with LT-SR-no in
Table 2). Notably, ST-LR-no achieved the best performance among all experiments without spectral nudging, with a three-day average RMSE of 2.03 °C and a ME of −1.65 °C.
Among experiments using spectral nudging, the impact of a shorter lead time depended on the domain size. For the smaller domain, experiments ST-SR-E1 and ST-SR-E2 further improved the forecasts relative to the control (ST-SR-no), with average ME improvement rates of 20% and 18%, and RMSE improvement rates of 15% and 14%, respectively. However, for the larger domain, applying spectral nudging led to a slight degradation in performance. For instance, ST-LR-E1 and ST-LR-E2 showed increased ME by 18% and 10%, and increased RMSE by 14% and 9%, respectively. This reaffirms that even with a shorter lead time, spectral nudging applied to a large domain that already resolves large-scale circulation well may suppress the free development of weather systems within the domain, consequently degrading the heatwave forecast. Across all experiments, the configurations using our objectively determined optimal cutoff wavenumber (E1) consistently outperformed or matched their E2 counterparts, which acted as proxies for less scale-selective nudging. This reinforces the conceptual superiority of an objective, scale-selective approach over a one-size-fits-all constraint strategy.
The analysis in
Section 4.1 demonstrates that the impact of spectral nudging on the forecast of persistent high temperatures over the Sichuan Basin is highly conditional. For long-term (LT) simulations in a smaller domain (SR), spectral nudging effectively improves the simulation by elevating temperatures and reducing errors. In contrast, for a larger domain (LR), its over-constraining effect degrades performance. Shortening the forecast lead time (ST) offers an alternative pathway to slightly improve accuracy. Within the short-term forecast situation, the magnitude of improvement from spectral nudging is comparable to that from shortening the lead time. However, its effectiveness remains strictly modulated by the domain size: it is beneficial only for small domains and still produces negative effects in large domains.
4.2. Underlying Mechanism: Modulation of Background Circulation by Spectral Nudging
To investigate the mechanism behind the impact of spectral nudging on forecast performance, we examined the difference in the 500 hPa geopotential height field between experiments LT-SR-no and LT-SR-E1 as an example. The choice of this level is based on the fact that the cutoff wavenumber pair (
xwavenum,
ywavenum) controlling the spectral nudging was derived from the 500 hPa height field (see
Section 2.3.2).
Figure 14 compares the 500 hPa geopotential height fields from ERA5 and experiment LT-SR-no at 0800 LST on 15–17 August 2019. The 5880 gpm contour present in the CMA weather analysis charts (
Figure 5d–f) is not seen in the ERA5 reanalysis (blue solid lines in
Figure 14) and was also not captured by experiment LT-SR-no (red dashed lines in
Figure 14). This indicates that ERA5, which provided the background fields for the WRF simulation, systematically underestimated the high-pressure system, leading to a persistently weaker simulated circulation in LT-SR-no compared to the observations.
The forecast performance of experiment LT-SR-E1 improved significantly with the application of spectral nudging (
Table 2). Comparing the 500 hPa height fields of LT-SR-E1 and LT-SR-no (
Figure 15a–c) reveals a notable increase in geopotential height over the Sichuan region in LT-SR-E1, bringing it closer to the observations in
Figure 5. This suggests that spectral nudging, by constraining the large-scale circulation over Sichuan while allowing meso- and small-scale systems to develop freely, compensated for the underestimation of the high-pressure system inherent in the background field. Consequently, it more accurately reproduced the high-pressure system dominating the Sichuan extreme heat weather, leading to a significant improvement in the heatwave forecast.
For the larger domain, experiment LT-LR-E1 performed worse than LT-LR-no (
Figure 11). The 500 hPa geopotential height differences in
Figure 15 directly link the spectral nudging constraint to the forecast skill documented in
Section 4.1. In
Figure 15a–c (LT-SR-E1 minus LT-SR-no), the broad area of positive height anomalies over Sichuan signifies a strengthening of the upper-level ridge. This correction towards a more realistic high-pressure system (cf.
Figure 5) is the key dynamical driver behind the compensatory effect: it enhances large-scale subsidence, stabilizes the atmospheric column, and thereby amplifies the surface heating captured in
Figure 9a–c. Conversely,
Figure 15d–f (LT-LR-E1 minus LT-LR-no) show negative height anomalies, indicating that nudging weakened the ridge in the large-domain experiment. This dynamical degradation explains the corresponding deterioration in surface heatwave forecasts (
Figure 11a–c), exemplifying the over-constraint effect where nudging disrupts an already well-evolving circulation. Thus, these height difference maps are not just auxiliary diagnostics; they provide the causal physical explanation for the performance changes shown in the temperature maps. Further analysis of the 850 hPa temperature field (figure omitted) revealed that when the 500 hPa height field was stronger, the 850 hPa level over Sichuan was noticeably warmer. This suggests that a stronger 500 hPa height field intensifies subsidence, favoring the enhancement of heatwave intensity.
4.3. A Fourier-Based Interpretation of Spectral Nudging Effects
Based on the diagnostic results from the specific case study presented above, we can formulate a more general physical interpretation of the compensatory and over-constraint effects, which carries implications for the broader application of spectral nudging.
The contrasting effects of spectral nudging—beneficial compensation in small domains and detrimental over-constraint in large ones—originate from fundamental principles of wave interaction in limited domains, suggesting a potentially generalizable framework for its application.
From a spectral perspective, when a domain is dominated by a single coherent system (e.g., a subtropical high), its structure is captured by specific low-wavenumber components. Spectral nudging “anchors” these key components, but more importantly, it fixes the phase reference for the entire wave packet representing that system. This phase-locking enables constructive superposition between the constrained large-scale waves and the model’s internally generated smaller-scale features, systematically reinforcing the system’s core intensity—manifesting as the compensatory effect. This mechanism is not specific to heatwaves or the Sichuan region; it should apply wherever a well-defined, dominant large-scale circulation (e.g., a blocking high, persistent trough, or monsoon system) controls the weather phenomenon of interest, provided the domain scale appropriately captures that system.
Conversely, in domains containing multiple interacting systems, the Fourier spectrum becomes populated by competing wavenumber sets representing different systems. Constraining this broader spectrum does not reinforce a single coherent entity. Instead, it rigidly prescribes phase relationships between multiple wave packets, which can destructively interfere with the regional model’s internal multi-scale dynamics. This suppresses physically meaningful mesoscale development—the over-constraint effect. This risk is likewise general: it will emerge whenever the domain is sufficiently large that the nudging must constrain a complex background flow rather than a single coherent wave packet, such as in continental-scale simulations or climate downscaling.
Thus, the method’s efficacy depends crucially on whether the spectral constraint is applied to a coherent wave packet (enhancing it through phase-locking) or to a complex wave background (disrupting scale interactions). This principle suggests compensatory effects should be most pronounced when a single dominant system exists within the domain, while over-constraint risks increase with domain size and circulation complexity. The proposed energy-threshold method operationalizes this principle by objectively identifying the coherent wave packet(s) to constrain. While the specific energy thresholds (E1, E2) may require calibration for different regions or flow regimes, the underlying concept of aligning the nudging scale with the dominant spectral signature of the governing circulation is universally applicable.
In summary, the conceptual framework developed here—linking domain size, circulation complexity, and wave interaction outcomes—provides a guiding principle for configuring spectral nudging. Future work should test this framework across diverse weather types and geographical settings to fully establish its generality and refine its operational implementation.
5. Conclusions
Aimed at supporting reliable electricity grid operations through better heatwave prediction over Sichuan’s complex topography, this study specifically targets the optimization of the key spectral nudging parameter, the cutoff wavenumber. Traditional selection methods, which are often subjective, lack objective criteria tailored to specific weather events and simulation domains. We developed an objective energy-threshold method, grounded in spectral energy analysis of the background 500 hPa geopotential height field, to determine a physically consistent cutoff wavenumber. Using the August 2019 Sichuan heatwave event as a case, comprehensive sensitivity experiments were conducted to assess the effects of domain size, forecast lead time, and spectral nudging configuration on forecast performance. The underlying mechanisms were further explored through circulation diagnostics.
The efficacy of spectral nudging is strongly modulated by the simulation domain scale. Over smaller domains and longer lead times, spectral nudging successfully compensates for large-scale circulation drift arising from insufficient lateral boundary constraints, yielding a marked improvement in heatwave forecasts—a compensatory effect. Conversely, in larger domains that inherently support more realistic large-scale evolution, applying spectral nudging suppresses beneficial mesoscale development and degrades forecast skill, revealing an over-constraint effect.
Forecast lead time offers an alternative pathway to enhance accuracy. Although shortening the lead time itself improves forecasts, the benefit of spectral nudging within short-term situations remains strictly scale-dependent. It provides significant gains only in smaller domains, whereas in larger domains, it offers no improvement and may even degrade performance.
The energy threshold method introduced here effectively isolates the dominant large-scale circulation patterns associated with the heatwave event from the background field, enabling an objective and physics-based determination of the spectral nudging cutoff wavenumber. This approach provides a viable alternative to subjective parameter selection for operational implementation. Crucially, the experimental results demonstrate that spectral nudging configured with our objective method (E1) can yield better forecasts than both no nudging and a strong-constraint approach resembling traditional analysis nudging (E2), highlighting the value of scale-selective guidance.
These conclusions are derived from an in-depth analysis of a single, albeit significant, case study. Their generality, particularly the applicability and boundaries of the proposed “compensatory” and “over-constraint” conceptual framework, should be tested across a broader range of heatwave events with varying synoptic backgrounds and seasonal timing. Furthermore, while this study primarily evaluated forecast skill for near-surface temperature, future work should extend the assessment to other critical variables such as circulation patterns, boundary-layer structure, and humidity. Although the energy thresholds (E1, E2) were derived from a historical event sample, their optimal values may exhibit regional and seasonal dependencies, warranting further sensitivity investigations.
Future efforts should aim to implement this objective parameterization approach within operational regional forecasting systems, enabling dynamic optimization. Furthermore, this study confirms that the application of energy-threshold-selected spectral nudging can effectively enhance heatwave forecast skill over a smaller inner domain. This finding carries significant practical implications, as it suggests the potential for operational forecasting systems to significantly reduce computational costs by employing smaller domains while maintaining, or even improving, forecast quality. It is acknowledged that this conclusion is drawn from an in-depth analysis of a single representative case; its generality and the precise computational cost benefits need to be further verified and quantitatively assessed across a wider variety of weather events and different geographical regions. Furthermore, testing the applicability of this conceptual framework across diverse geographical regions could substantially advance extreme weather prediction.