1. Introduction
As the electricity market gradually opens up, the increasingly complex environment has heightened market uncertainty, making electricity prices more susceptible to changes in supply and demand and other external factors. In countries that have already introduced electricity futures trading, electricity retailers can use electricity futures to hedge against risks arising from electricity price fluctuations, based on the characteristics of electricity pricing [
1]. However, in many countries, electricity futures have not yet been launched, and spot electricity markets are not fully developed. As a result, alternative methods for risk hedging must be considered. In this context, fossil fuel futures, which are closely linked to electricity prices, are considered potential tools for risk management.
Contemporary research has progressively addressed the critical challenge of risk mitigation in deregulated electricity markets. As demonstrated in [
2], the dual uncertainties in price and trading volume pose substantial threats to corporate financial stability, with conventional hedging instruments like electricity futures proving inadequate due to market illiquidity. This investigation pioneers an innovative risk management framework through the synergistic combination of energy derivatives and weather derivatives. Complementary analysis in [
3] employs variance minimization criteria to assess weekly versus monthly hedging efficacy, revealing electricity futures’ underperformance relative to other energy commodities.
Multidimensional investigations have elucidated the intricate price transmission mechanisms between energy commodities and electricity markets. Regarding fossil fuel impacts, empirical evidence from the Guangdong regional market [
4] establishes significant positive correlation between monthly electricity and coal prices. The VAR-MGARCH analysis in [
5] systematically decouples these dynamic interactions: coal prices exert immediate short-term effects while natural gas predominantly influences price volatility, contrasting with crude oil’s statistically insignificant role. Time-frequency decomposition in [
6] further identifies delayed responses in electricity market volatility to fossil fuel market fluctuations, distinguishing renewable-driven spot price dynamics from futures market behavior shaped by natural gas pricing.
Advanced methodologies have enhanced understanding of interconnected energy market risks. Through wavelet coherence analysis, [
7] quantifies event-driven spillover effects across temporal scales. A breakthrough in [
8] applies dynamic factor modeling to disentangle the co-volatility structure among electricity, fossil fuel, and carbon markets. Critical pathway analysis in [
9] maps risk transmission channels, confirming electricity market-mediated impacts of fossil fuels on carbon markets, thereby informing carbon risk mitigation strategies. The natural gas-electricity nexus study [
10] reveals futures market leadership effects, substantiating cross-commodity hedging viability through Granger causality analysis.
Cutting-edge forecasting methodologies increasingly integrate cross-domain insights. The hybrid model in [
11] synergizes Fourier decomposition with multi-energy price drivers, while comparative analysis in [
12] validates accuracy improvements through fossil-renewable energy indicator fusion. References [
13,
14] both adopt a hybrid framework that integrates signal decomposition with predictive modeling. Reference [
15] (SSA-LSTM) demonstrates a streamlined and efficient structure, effectively balancing high predictive accuracy with manageable model complexity. In contrast, [
16] (VMD-PSO-LSTM-RF) employs a more sophisticated architecture designed for peak accuracy, utilizing adaptive decomposition, seasonal hyperparameter optimization, and ensemble forecasting to capture the intricate, multi-scale fluctuations in short-term load data within electricity markets. Reference [
17] (ARIMA & CNN-Bi-LSTM) follows a distinct comparative validation approach. It provides a robust and practical empirical assessment of traditional versus deep learning models for mid-term electricity consumption forecasting, prioritizing the verification of a feasible methodological framework over the pursuit of ultimate predictive precision. Explainable AI applications in [
18,
19,
20] employ SHAP value analysis to decode complex relationships, including lagged feature impacts, renewable penetration effects on price distributions, and anomaly driving factors. This interpretability framework extends to grid congestion analysis in [
21], successfully identifying critical nodal influences and causal pathways. Previous studies have noted the limitations of SHAP-based interpretation in time-series models with multiple lagged variables. Ref. [
22] shows that due to the strong correlations among lagged features, SHAP values primarily capture the model’s predictive dependence on historical information rather than the independent effects of individual lags, and therefore should be interpreted with caution under multicollinearity.
Despite these advancements, several critical research gaps remain unaddressed, which this paper aims to fill:
- (1)
Lack of Integrated Forecasting-Hedging Frameworks: Existing studies often treat price forecasting and hedging strategy design as separate tasks. There is a need for a cohesive model that directly uses a high-accuracy price forecast to inform and trigger specific, actionable hedging decisions in related energy futures markets.
- (2)
Insufficient Exploration of Lagged Price Transmission for Hedging: While the literature acknowledges lead-lag relationships between energy markets [
6,
10], few studies have quantitatively and systematically mapped these lag cycles with the explicit goal of identifying optimal futures entry points for electricity retailers. The understanding of how the transmission time varies between different fossil fuel futures is limited.
- (3)
Absence of Practical, Risk-Adjusted Portfolio Allocation for Electricity Retailers: Previous research on hedging often focuses on minimizing variance or calculating CVaR under standard distributional assumptions. There is a gap in providing electricity retailers with a practical portfolio weight allocation that explicitly balances risk and return, accounting for the heavy-tailed characteristics of energy futures returns, and is validated with real market trading data.
In response to these gaps, this article proposes an integrated futures risk hedging model based on monthly electricity price forecasting. The main contributions are as follows:
- (1)
An Integrated Forecasting-Hedging Framework: By developing a hybrid SSA-LSTM model that incorporates energy futures prices, this study not only improves monthly electricity price forecast accuracy but also directly uses the forecasted trend to generate clear signals for futures market operations, creating a closed-loop from prediction to strategy execution.
- (2)
Quantification of Lagged Transmission for Strategy Timing: Using SHAP interpretable machine learning on a constructed lag model, this research explicitly uncovers the distinct lag cycles in the price signal transmission from JM (coking coal) and LPG futures to monthly electricity prices. This analysis directly informs the precise timing (specific trading days) for initiating futures positions, a critical component often overlooked in prior work.
- (3)
A Practical Risk-Optimized Portfolio Allocation Model: We establish a quantitative risk model using Monte Carlo simulation to calculate CVaR under a t-GARCH assumption, which better captures the fat-tailed nature of energy returns. This model provides electricity retailers with a specific, optimal allocation weight between futures that balances risk control and return stability, moving beyond theoretical hedging to offer actionable decision support.
In summary, the integrated risk management framework developed in this study follows a clear logical chain: First, the SSA-LSTM model generates high-precision monthly electricity price forecast paths, which provide the core input and directional guidance for subsequent risk analysis. Next, SHAP interpretability analysis is used to delve into the drivers of price fluctuations, explicitly identifying the intensity and direction of the influence of JM and LPG futures on electricity prices at specific lag periods. This directly determines which futures contracts should be selected for hedging and the optimal entry timing. Finally, based on the forecasted electricity price scenarios and the selected futures contracts, a t-GARCH-based Monte Carlo simulation is employed to calculate the portfolio CVaR, thereby quantifying the results of the first two steps into specific, understandable risk exposure numbers and optimal asset allocation weights for electricity retailers. These three components connect sequentially, collectively forming a complete decision-support system that moves from forecasting and explanation to risk quantification.
The rest of this article is structured as follows:
Section 2 develops the monthly electricity price forecasting model using the SSA-LSTM approach.
Section 3 analyzes the lagged impact of energy futures on electricity prices using the SHAP model to determine optimal entry timing.
Section 4 determines the optimal futures allocation weights by computing the portfolio CVaR through the Monte Carlo method. In
Section 5, real market data is used to validate the effectiveness of the proposed method, followed by an in-depth discussion.
Section 6 concludes this article. The overall research framework is illustrated in
Figure 1.
3. Electricity Price Influencing Factor Analysis Based on Random Forest and SHAP Models
Significant transmission effects occur across different markets, mainly reflected in two aspects: time-lagged transmission of price fluctuations and amplitude response. The time-lagged transmission effect refers to the phenomenon in which price changes in one market usually take some time to influence related markets. The amplitude response, in contrast, results from the nonlinear relationships among market price fluctuations. When volatility occurs in one market, the magnitude of price changes in related markets tends to vary accordingly.
Electricity, as a unique commodity, cannot be stored directly. However, through intermediate carriers such as coking coal and liquefied petroleum gas—both forms of primary energy—an indirect energy storage mechanism is effectively established, with fuel inventories serving as the medium [
24]. This particular transmission pathway is constrained by factors such as transportation cycles and storage durations, which inevitably create a time-lag effect on electricity prices. At the level of electricity market design, a structural mismatch exists between the monthly clearing mechanism used to determine electricity prices and the continuous trading mechanism of the energy futures market. This institutional difference systematically delays the transmission of futures price signals to medium- and long-term electricity trading prices. Nevertheless, existing research has yet to systematically examine the relationship between monthly electricity trading prices and energy futures prices over different time horizons.
3.1. SHAP Value Analysis Based on Random Forest Model
This study employs the Random Forest model for modeling and analysis. As a machine learning algorithm, it combines multiple decision trees through an ensemble method and introduces a principle of randomness. Compared to traditional linear models, it more effectively captures and explains the nonlinear relationships between variables.
The model employs the Bootstrap resampling method, randomly selecting M training samples with replacements from the training set to generate K sub-models. The average of the calculated results from these sub-models is then used as the regression prediction value. The model can be expressed as follows:
denotes the ensemble classification model, represents the classification model of an individual decision tree, and denotes the target variable.
The input of this study consists of multi-dimensional time series data. Among them, JM and LPG futures are traded only on working days during market hours, while the monthly electricity trading price is published at the end of each month. This study adopts the bilinear interpolation method to address the missing values in monthly futures data to ensure time series continuity. This interpolation is adopted to construct a contiguous time series for model training. It should be noted that this process may smooth genuine intra-month volatility. Consequently, the identified lagged days should be interpreted as approximate temporal markers relative to the monthly price release within the model’s framework, rather than as exact estimates of daily structural transmission lags. The interpolation formula is as follows:
The monthly electricity trading price is treated as the target variable in the modeling process and standardized to be released on the last day of each month. Considering the varying number of days in different months, this study constructs 26-day lag terms for JM and LPG futures prices, aiming to examine the influence of these lagged variables on monthly electricity prices. The corresponding lag model can be expressed as follows:
denote the monthly transaction electricity price, , represent the futures prices of coking coal and liquefied petroleum gas (LPG), respectively. The corresponding regression coefficients with a lag of k days are denoted by and .
The core idea of SHAP values is derived from the concept of Shapley values in game theory. It explains the importance of each feature by computing its marginal contribution to the model’s output. The SHAP value is defined as follows:
denotes the Shapley value of feature p; is a subset of features included in the model, represents the full set of all features, is the value of feature in the set , denotes the feature values in subset excluding feature p, represents the model output based on the feature values in without feature p, is the output based on the full feature set .
The greater the absolute value of a Shapley value, the greater the corresponding feature’s contribution to the target variable. Moreover, the sign of the value indicates the direction of the effect: a positive value suggests a positive influence on the target variable. In contrast, a negative value indicates a negative influence.
3.2. Interpretability Analysis Framework Using SHAP Modeling
This study applies SHAP value analysis to investigate the lagged influence mechanism of energy futures prices on medium- and long-term electricity prices, based on data from 2023.
Figure 8 and
Figure 9 display SHAP bar plots illustrating the effects of LPG and JM futures on monthly electricity trading prices, while
Figure 10 and
Figure 11 offer a more detailed visualization of these effects. In these figures, variables are ranked on the vertical axis from highest to lowest according to their influence on electricity prices. Each point represents an individual observation, with its color and size corresponding to the feature values shown in the legend. The sign of each SHAP value indicates whether the feature has a positive or negative effect on electricity prices. Lag1 through Lag26 denote the influence strength of futures prices from 1 to 26 days earlier on the monthly centralized electricity trading price.
Figure 8.
Feature importance analysis of LPG futures.
Figure 8.
Feature importance analysis of LPG futures.
Figure 9.
Feature importance analysis of JM futures.
Figure 9.
Feature importance analysis of JM futures.
Figure 10.
Explainability analysis of LPG futures.
Figure 10.
Explainability analysis of LPG futures.
Figure 11.
Explainability analysis of JM futures.
Figure 11.
Explainability analysis of JM futures.
The SHAP value analysis of LPG futures prices reveals a distinct concentration trend characterized by a unimodal decay pattern. The most prominent feature is the futures price from six days before the release of the electricity price, which exhibits significantly greater importance than all other features. This finding suggests that fluctuations in LPG futures prices are strongly transmitted to electricity prices with an approximate six-day lag. Additionally, prices from one to five days earlier also exert a notable influence on electricity price prediction—an effect weaker than that of the sixth day but still considerably stronger than those from other periods. Furthermore, secondary SHAP value peaks appear on the 12th, 17th, and 22nd days before the release date, indicating possible periodic transmission effects. Therefore, it can be inferred that LPG futures prices exert a clear lagged influence on electricity prices, with the strongest impact occurring at approximately a six-day lag.
Further analysis of the decision plot shows that the high-impact red price points are concentrated almost entirely in the negative SHAP value region. This pattern is consistent with the hypothesis that rising natural gas prices may prompt power plants to reduce the share of gas-fired generation and shift to alternative energy sources, thereby temporarily curbing the upward trend in electricity prices. However, this interpretation remains a plausible theoretical explanation rather than an empirically validated conclusion, as actual generation mix data were not incorporated into the model.
The SHAP value analysis of JM futures prices indicates that, compared with LPG futures, JM futures exert a lower overall influence on electricity prices. The SHAP values across different lag periods are more evenly distributed but still display a distinct cyclical pattern.
In the short term, JM futures prices from one to three days before the release of electricity prices show a relatively strong influence, suggesting that short-term fluctuations in coal prices are rapidly transmitted to the electricity market. These price signals directly shape market participants’ expectations and are quickly reflected in electricity price movements. In the medium term, the influence of JM futures prices peaks between days ten and fourteen, exhibiting a distinct cyclical pattern. This suggests that electricity prices are significantly influenced by factors such as power plant inventory management, coal procurement strategies, and transportation cycles. In the long term, coking coal prices from days twenty-one to twenty-six continue to exert a measurable influence on electricity prices, further confirming the presence of long-term cyclical effects.
Further analysis of the SHAP decision plot reveals that the relationship between JM futures prices and electricity prices varies across time scales. In the short term, this relationship is negative, suggesting that rising coking coal prices may suppress electricity demand and thus place downward pressure on short-term electricity prices. However, in the medium and long term, the relationship turns consistently positive. This pattern aligns with the cost transmission mechanism hypothesis of coking coal as a primary fuel for power generation, where higher coking coal prices raise fuel costs and, in turn, drive up electricity prices. It should be noted that the strength and immediacy of this transmission may be moderated by factors such as power plant inventories and long-term contracts, which were not directly modeled in this study.
The above analysis demonstrates that the relationship between JM futures prices and electricity prices varies markedly across time scales, showing a distinct shift from negative to positive in the medium and long term. Among these, the SHAP values in the medium term are more concentrated and display stronger and more stable correlations. Therefore, the medium-term time frame should be regarded as the primary focus for developing coking coal futures trading strategies.
These observed correlations can be interpreted in light of energy structure theory. For coking coal, as a primary baseload energy source, the positive correlation with electricity prices may stem from the fact that increases in its price directly raise fuel costs and, consequently, drive up electricity prices. In contrast, natural gas primarily serves as a supplementary energy source for peak shaving [
25]. A plausible speculation is that when natural gas prices rise, power generation companies may reduce its usage, leading to a substitution effect that could suppress marginal demand and limit electricity price increases. This might explain the observed negative correlation between natural gas prices and electricity prices. It should be emphasized that the above mechanisms are inferences based on economic theory and market, and future empirical validation using micro-level data on unit output and fuel switching is warranted.
3.3. Robustness and Quantitative Analysis of SHAP Results
To comprehensively assess the robustness of the SHAP analysis results and gain an in-depth understanding of the interaction mechanisms among lagged variables, this section conducts supplementary analyses in three dimensions: first, verifying the stability of SHAP values for key lagged terms via Bootstrap resampling; second, calculating the cumulative SHAP contributions across different time scales using a window aggregation approach; and finally, discussing the impact of collinearity among lagged variables on SHAP interpretation and the corresponding mitigation strategies.
Given the potential high correlation among lagged variables, which may affect the stability of SHAP value estimation, this study adopts the Bootstrap resampling method to perform 50 rounds of sampling with replacement on the training dataset. For each resampling iteration, the random forest model is retrained and SHAP values are recalculated. Particular attention is paid to two key lagged terms: LPG futures lagged by 6 days and JM futures lagged by 12 days. To balance computational efficiency and statistical reliability, each Bootstrap iteration uses 50 randomly selected samples for SHAP value calculation.
The stability test results are shown in
Figure 12, with key statistics summarized in
Table 2. For LPG_lag_6, the distribution of the mean absolute SHAP values across 50 Bootstrap iterations ranges from [3.21, 4.81], with a mean of 4.18, a standard deviation of 0.39, and a coefficient of variation (CV) of 0.092. For JM_lag_12, the mean SHAP value distribution ranges from [1.00, 1.84], with a mean of 1.32, a standard deviation of 0.17, and a coefficient of variation of 0.131. The coefficients of variation for both key lagged terms are below 0.15, indicating good stability of the SHAP values across Bootstrap resampling. Notably, the signs of the SHAP values remain consistent across all Bootstrap samples, further confirming the reliability of the direction of influence.
To overcome the limitations that single lag-point estimates may be affected by random fluctuations and to systematically evaluate the cumulative impact of futures prices across different time scales, this study further calculates the sum of absolute SHAP values for all lagged terms within short-, medium-, and long-term time windows. The time windows are divided as follows: short-term (lagged 1–7 days), medium-term (lagged 8–14 days), and long-term (lagged 15–26 days). The window aggregation approach effectively smooths out random fluctuations of individual lagged terms and provides a more robust assessment of impact across time scales.
To address the limitations of single-lag point estimates, which may be influenced by random fluctuations, and to systematically assess the cumulative impact of futures prices across different time scales, this study further calculates the sum of the absolute SHAP values of all lag terms within short-, medium-, and long-term time windows. The time windows are divided as follows: short-term (lags 1–7 days), medium-term (lags 8–14 days), and long-term (lags 15–26 days). The window aggregation method effectively smooths random fluctuations in individual lag terms, providing a more robust evaluation of impacts across time scales.
Table 3 presents the cumulative SHAP contributions of LPG and JM futures across different time windows. The results show that the impact of LPG futures is highly concentrated in the short-term window, with its short-term cumulative SHAP contribution accounting for 64.2% of the total. The peak occurs at lag 6 days, which aligns with the rapid price signal transmission characteristics of LPG as a peaking energy source. In contrast, the impact of JM futures is more evenly distributed across all windows, with the medium-term window contributing 40.5%, making it the most influential period. The peak occurs at lag 12 days, reflecting the longer inventory and procurement cycles required for the transmission of basic fuel costs. This differentiated impact pattern across time scales provides a quantitative basis for designing differentiated hedging timing strategies.
It should be emphasized that the objective of the SHAP analysis in this study is not to precisely estimate the independent coefficients of each lag term, but rather to identify the historical time windows that have the most significant influence on future electricity price predictions from the perspective of the forecasting model. Therefore, key lag points such as “Day 6” and “Day 12” revealed by SHAP values should be understood as the historical information windows that the model most relies on to make reliable predictions. The window aggregation analysis further validates the robustness of these key windows. This shift in methodological perspective allows the study to extract robust and actionable hedging signals from highly collinear lag variables, perfectly aligning with the closed-loop design objectives from prediction to strategy execution.
In summary, through Bootstrap stability testing, quantitative window aggregation, and an in-depth discussion of collinearity issues, the supplementary analysis in this section systematically verifies the robustness of the SHAP results and provides a cross-temporal quantitative impact assessment. This establishes a reliable methodological foundation for the subsequent differentiated design of hedging strategies.
3.4. Futures Market Hedging Strategy Design
SHAP value analysis reveals a significant correlation between LPG and JM futures prices and monthly electricity trading prices. Specifically, when LPG futures prices decline, monthly electricity trading prices typically increase after an approximately six-day transmission lag. In contrast, when JM futures prices rise, monthly electricity trading prices generally increase after around a twelve-day transmission lag, and vice versa.
These findings further suggest that fluctuations in LPG and JM futures prices can serve as effective predictors of electricity price changes. In particular, during periods of pronounced price volatility, trends in the futures market can anticipate movements in electricity prices.
Based on these market observations, a position-opening strategy can be designed. The strategy leverages anticipated electricity price fluctuations to guide trading operations in futures commodities, thereby hedging against risks arising from electricity price volatility in the futures market. Specifically, when electricity prices are expected to rise, a long position is taken in LPG futures due to their negative correlation with electricity prices. In contrast, a short position is initiated in JM futures, reflecting their positive correlation. Conversely, when electricity prices are expected to decline, short positions are taken in LPG futures, while long positions are adopted in coking coal futures. Lag analysis identifies the optimal entry points as the sixth trading day before month-end for LPG futures and the twelfth trading day before month-end for JM futures. All positions are systematically closed on the last trading day of the month, coinciding with the official publication of monthly electricity trading prices.
4. Optimal Portfolio Risk Optimization Using t-GARCH and Monte Carlo CVaR
4.1. CVaR Calculation Methodology
Value at Risk (VaR) is a quantitative measure used to assess the risk exposure of a commodity portfolio [
25]. VaR represents the maximum potential loss of a commodity portfolio at a given confidence level. Specifically, it corresponds to the 1 −
α quantile in the lower tail of the loss distribution. For example, if a portfolio is held for one day and the confidence level is 95%, a VaR value of 100 means there is only a 5% chance that the portfolio will lose more than 100 in one day. A higher VaR value indicates a greater level of risk faced by the portfolio. The main drawback of VaR lies in its failure to measure the severity of tail losses: VaR only provides a loss threshold corresponding to the probability of 1 −
α, but does not reveal the average magnitude of losses when they exceed this threshold (i.e., fall into the extreme tail of the distribution). It cannot distinguish between thin-tailed and fat-tailed distributions. Therefore, in this study, to comprehensively assess risk, we not only calculate VaR but also compute its complementary metric—Conditional Value at Risk (CVaR). CVaR measures the average level of losses when they exceed the VaR threshold, effectively capturing tail risk. It is primarily determined by market volatility and a given confidence level.
In Equation (15); denotes the maximum potential loss of the portfolio; indicates the given confidence level.
In this study, the Monte Carlo simulation method is used to generate a large number of random sample paths to simulate the price movements over the next 30 days. For each path, the corresponding loss value is calculated. Based on the distribution of losses across all simulated paths, the CVaR at a specified confidence level is computed. This result is then used to determine the optimal weight allocation between the two futures.
4.2. Comparing Normal and t-GARCH CVaR Models for Optimal Allocation
In financial derivatives risk management, the distribution characteristics and volatility structure of return series directly determine a model’s ability to capture extreme risks. Traditional approaches assume that residuals are independent and identically distributed with constant variance, which contradicts common features of financial time series such as volatility clustering and heavy tails. To more thoroughly characterize the differences in return properties and tail risks among energy commodities, this study conducts statistical feature analysis, distribution fitting tests, and GARCH model estimation for the return series of JM futures and LPG futures. Furthermore, volatility models under normal and t-distribution assumptions are used to calculate CVaR at different confidence levels. All analyses are based on 731 daily return samples, with risk values computed using Monte Carlo simulation with 5000 iterations, a 30-day time step, and 1000 weight search points, ensuring numerical stability and statistical significance. The historical returns of the two commodities are illustrated in
Figure 13.
As shown in
Figure 14, the returns of JM futures have a mean of −0.000692 and a standard deviation of 0.026268, exhibiting slight negative skewness (−0.5065) and a kurtosis of 4.6160, which exceeds the normal distribution benchmark of 3, indicating a heavy-tailed distribution and significant deviation from normality. In contrast, the returns of LPG futures have a mean of 0.000121 and a standard deviation of 0.022541, but display pronounced positive skewness (2.9631) and extreme kurtosis (27.9319), reflecting strong right-skewness and leptokurtic behavior, likewise significantly rejecting the normality assumption.
The probability density fitting of the sample returns indicates that the normal distribution, due to its lower kurtosis and rapidly decaying tails, fails to adequately capture the occurrence probability of extreme returns. In contrast, the t-distribution exhibits higher probability density in the tail regions, reflecting pronounced heavy-tailed behavior. These results suggest that the returns of both energy futures exhibit asymmetry and fat tails, with LPG futures showing more pronounced volatility and a significantly higher frequency of extreme fluctuations compared to JM futures.
To further characterize the tail structure, the returns were fitted using the t-distribution. The estimated degrees of freedom are 5.7499 for JM futures and 2.8583 for LPG futures, indicating that LPG futures have heavier tails and a higher likelihood of extreme price movements during periods of market turbulence, resulting in greater risk exposure compared to JM futures. This finding is consistent with the probability density fitting curves in
Figure 14, where the t-distribution provides a superior fit to extreme returns in the tail regions compared to the normal distribution. The Q-Q plots further confirm these differences: under the normality assumption, sample points, particularly in the left tail, deviate noticeably below the diagonal, indicating that the actual probability of extreme losses exceeds the prediction of the normal distribution. In contrast, the Q-Q plot for the t-distribution closely aligns with the theoretical quantile line, with only minor deviations at the extreme tails, demonstrating its superior suitability for capturing tail risks.
This distributional difference arises from the degrees of freedom parameter
introduced in the t-distribution, which allows flexible characterization of extreme risks by adjusting tail thickness. When
, the t-distribution approaches the normal distribution; conversely, when
are small, the tails become heavier, effectively capturing atypical fluctuations commonly observed in financial data. For a return series
under a GARCH conditional heteroskedasticity framework, the model can be expressed as:
Here, follows different distributional assumptions depending on the model: if , it corresponds to a normal GARCH model; if , it corresponds to a t-GARCH model.
Based on
Table 4, a GARCH (1,1) model with a t-distribution was employed to model the conditional volatility of the two futures contracts. This specification is particularly well-suited to accommodate the pronounced leptokurtosis and heavy tails observed in the data, which deviate significantly from normality as illustrated in
Figure 14. For JM futures, the estimated results yield a mean equation constant of −0.00083109 and a variance constant of 0.00003095. The ARCH parameter
α is 0.045099, while the GARCH parameter
β reaches 0.907946, with degrees of freedom approximately 6.0138. The
β value close to 1 indicates substantial volatility persistence, implying that current market volatility is strongly conditioned by its past values.
In contrast, the estimation for LPG futures reveals a distinct volatility structure where the ARCH coefficient becomes statistically insignificant (near zero), while the GARCH parameter reaches 0.995647. This suggests that LPG futures’ volatility responds weakly to transitory short-term shocks, which are instead rapidly absorbed into the long-term volatility component. Such behavior is consistent with markets characterized by infrequent but substantial price jumps, where volatility responds more through persistent structural components rather than immediate noise. Furthermore, the sum of ARCH and GARCH coefficients for LPG futures is nearly unity, suggesting near-integrated volatility dynamics. This reflects the strong volatility clustering and long-memory effects commonly observed in energy commodity markets, where the impact of market shocks tends to persist over extended horizons.
The use of the t-distribution substantially improves the overall fit by capturing the extreme tail behavior of LPG returns, which exhibit a low degree of freedom at 3.1580, indicating the presence of rare but large price movements. Although more flexible specifications such as EGARCH and GJR-GARCH are capable of capturing potential asymmetry, the standard GARCH model with Student’s t-innovations is maintained as the baseline to ensure interpretability and direct comparability between the JM and LPG contracts.
4.3. Determination of Commodity Weights
As shown in
Figure 15, the CVaR comparison curves indicate that CVaR increases with higher confidence levels under different distributional assumptions, reflecting that stricter risk control requires the model to allocate larger buffers for potential extreme losses. Under the t-GARCH model, the portfolio CVaR at the 90%, 95%, and 99% confidence levels are 0.17797, 0.20524, and 0.25869, respectively, with corresponding weights for JM and LPG futures of (0.449, 0.551), (0.416, 0.584), and (0.387, 0.613). This suggests that at low to medium confidence levels, the model favors a higher allocation to LPG to mitigate the high volatility risk of JM futures, whereas at high confidence levels, the weight of JM futures increases, indicating its relatively lower risk contribution under extreme scenarios.
Compared to the normality assumption, the t-GARCH model generally produces higher risk estimates, particularly at high confidence levels, indicating that the t-distribution assumption better captures the risk of extreme events. Regarding portfolio weights, the allocation to JM futures under the t-distribution model is noticeably higher than under the normal model, suggesting that the model recognizes the right-skewed and extreme volatility characteristics of LPG returns and consequently reduces its weight in the optimization to mitigate potential risk.
Based on the results above, the normal distribution assumption provides relatively smooth risk estimates under moderate risk conditions but has limited capability to capture tail risks, often underestimating the probability of extreme losses. In contrast, the t-distribution model, through its degrees of freedom parameter, relaxes tail assumptions and more accurately characterizes abnormal market fluctuations at high confidence levels. In the energy futures market, prices are influenced by multiple factors such as supply and demand, seasonality, and geopolitical events, resulting in highly asymmetric and abrupt volatility. Therefore, employing a GARCH model under the t-distribution assumption significantly enhances the reliability of risk assessment. Considering both return and risk, this study adopts the portfolio weights calculated under the 95% confidence level t-GARCH model, with JM and LPG futures allocated at 0.416 and 0.584, respectively.
This result indicates that the diversified portfolio offers effective risk hedging. With this asset allocation, risk can be effectively diversified, reducing potential losses and providing strong data support for optimizing asset allocation and developing safer hedging strategies.
5. Case Study Analysis
5.1. Experimental Environment and Parameter Settings
All experiments in this study were conducted on a high-performance mobile workstation equipped with an AMD Ryzen 9 7945HX processor (16 cores, 32 threads, base frequency 2.50 GHz) and an NVIDIA GeForce RTX 4060 Laptop GPU (8 GB GDDR6 memory). The system was configured with 16 GB DDR5 RAM and ran a 64-bit Windows operating system. All model development and experiments were performed in a Python 3.11.3 environment, using TensorFlow and Keras as the primary frameworks for building and training the LSTM model. Data processing was implemented with Pandas and NumPy, normalization was performed using the MinMaxScaler module from scikit-learn, and visualization was achieved through Matplotlib 3.8.3.
In the SSA-LSTM hybrid electricity price forecasting model of this study, the first two SSA components were recombined into a low-frequency trend component based on singular value contribution analysis, while the remaining six components were recombined into high-frequency fluctuation components. The LSTM network was configured as a two-layer architecture: the first layer contains 100 neurons to capture long-term dependencies, and the second layer contains 50 neurons to learn local features. A dropout rate of 0.3 was applied to prevent overfitting. The model was trained to predict future electricity prices using historical data with a time step of 3. Data were normalized to the [0, 1] range using MinMaxScaler. For optimization, RMSprop was used for the low-frequency component to ensure stable convergence, while Adam was applied to the high-frequency components to quickly adapt to fluctuations. The training process used a batch size of 20 and 20 epochs, with the mean squared error (MSE) as the loss function. The dataset was split chronologically, reserving the last 12 samples as the test set.
5.2. Analysis of Forecast Results of Electricity Prices
This study performs a case analysis from the perspective of the electricity retailer in Guangdong’s electricity market, using the SSA-LSTM model to predict monthly trading electricity prices in the Guangdong electricity market. The dataset from January 2018 to August 2023 is used for training, with a one-step prediction model and rolling training method applied. Monthly trading electricity prices from September 2023 to June 2024 are forecasted, and an energy futures strategy will be developed in the first half of 2024 to hedge against fluctuations in electricity procurement costs. The electricity price prediction results are shown in
Table 5, and the evaluation results are presented in
Table 6. The specific trend is shown in
Figure 16.
To better evaluate the prediction results and make further adjustments to the model parameters, the Root Mean Squared Error (
RMSE), Mean Absolute Percentage Error (
MAPE) and Mean Absolute Error (
MAE) are selected as evaluation metrics, with their mathematical expressions given by:
In Equations (17)–(19), n denotes the sample size; represents the predicted electricity price for the i-th sample, denotes the true electricity price for the i-th sample.
Table 6 shows that when electricity prices exhibit significant volatility, the predictions of the XGBOOST, and basic LSTM models generally deviate substantially from the actual values, indicating that they fail to adequately capture and respond to price trends. In reality, electricity prices are influenced by multiple complex factors interacting nonlinearly, and the time series contains abundant nonlinear characteristics. Linear models cannot capture such complex dynamics, resulting in inherently insufficient predictive performance. Moreover, prediction errors tend to accumulate rapidly as the forecasting horizon extends. While the XGBOOST model exhibits relatively small errors at the beginning of the prediction period, the discrepancy between predicted and actual values increases over time, reaching an absolute deviation of 71.6 by June 2024.
In contrast, the superior performance of the LSTM-SSA model stems from its successful mitigation of the aforementioned limitations through Singular Spectrum Analysis (SSA). The LSTM model learns from a series that has been purified by SSA, in which trend and major periodic components are more pronounced. This allows the LSTM to focus on robust long-term dependencies and core dynamics without being confounded by noise and irrelevant fluctuations. Consequently, the LSTM-SSA predictive trajectory closely tracks the actual price decline, exhibiting minimal systematic bias. Furthermore, all evaluation metrics, including RMSE (28.24), MAPE (5.00%), and MAE (21.55), demonstrate significantly better performance than those of the comparative models.
In summary, the LSTM-SSA model, employing its unique “decompose-learn” framework, effectively overcomes the limitations of traditional models—such as the linearity assumption and rigid handling of seasonality—while enhancing LSTM’s ability to identify true trends amid noise. This framework enables the model to deliver more accurate and reliable electricity price forecasts, providing electricity retailers with greater confidence in decision-making and enhanced risk management capabilities when formulating procurement strategies, estimating expected costs, and designing hedging schemes, compared with other models.
5.3. Trading Strategy
Based on the forecasting results indicating a consistent downtrend in electricity prices from January to June 2024, this study designs a position-opening strategy involving long positions in JM futures and short positions in LPG futures throughout this period.
Table 7 shows the futures prices and
Table 8 shows their volatility.
According to the analysis in
Table 8, the futures trading strategy closely aligns with market price fluctuations. The correlation coefficient between JM futures price volatility and electricity price volatility is 0.82, while the correlation coefficient between LPG futures price volatility and electricity price volatility is −0.79. The correlation between futures price volatility and electricity price volatility when combined by weights is 0.89. In the Monte Carlo simulation, the correlation between the return series of JM futures and LPG futures is set based on the sample historical correlation coefficient and is assumed to remain constant during the simulation period. While this approach captures the static linear dependence between the two assets, it does not reflect potentially time-varying correlation structures. Future research could consider introducing dynamic conditional correlation models (e.g., DCC-GARCH) to further investigate the impact of changing correlation structures on portfolio risk.
5.4. Hedging and Return Analysis
This study uses the Guangdong electricity market as a case for validation, with the actual electricity purchase cost calculated based on the market average. Futures market operations follow the strategy outlined in
Table 9, with investment amounts for LPG and JM futures allocated according to the results of the CVaR calculation.
Based on the electricity purchase cost, different proportions of capital are invested in the futures market for hedging, results are shown in
Figure 17. Within the hedging ratio range of 0% to 200%, it can be observed that as the hedging ratio increases, the total return of the strategy gradually declines.
Risk indicator analysis shows that, at low hedging ratios, both volatility and drawdown are relatively high. As the hedging ratio increases, risk gradually decreases until reaching a minimum, after which it begins to rise again.
The Sharpe ratio exhibits a “mountain-shaped” curve, peaking at a hedging ratio of 127%, indicating that the strategy achieves an optimal allocation by balancing return and risk. This represents the most efficient configuration in terms of risk-adjusted performance.
Table 9 shows the electricity purchase cost and the monthly investment in the futures market. In the ‘JM Contracts’ and ‘LPG Contracts’ columns, ‘+’ indicates long positions, while ‘−’ denotes short positions. Since futures contracts can only be traded in whole lots, which are 60 tons per lot for JM futures and 20 tons per lot for LPG futures, the nearest rounding method is applied during the calculation.
The rise in monthly electricity trading prices leads to an increase in electricity purchase costs, thereby reducing the profit. Based on the purchase amounts in
Table 6, the corresponding returns are shown in
Table 10 and
Table 11.
Based on the data analysis for the first half of 2024, the portfolio hedging strategy demonstrates multidimensional advantages in electricity cost risk management:
- (1)
Regarding return performance, the LPG hedging strategy achieves the highest total return (352,200 CNY) and average return (58,700 CNY), indicating strong profitability. However, these returns are accompanied by relatively high volatility (7.62) and a maximum drawdown of −49,800 CNY, suggesting that although the returns are substantial, the associated risks remain considerable.
- (2)
The JM hedging strategy performed the poorest, exhibiting both the lowest total return (99,000 CNY) and the largest maximum drawdown (−94,500 CNY), making it the riskiest strategy. Moreover, its Sharpe ratio is only 0.85, indicating relatively low return efficiency per unit of risk.
- (3)
The combined hedging strategy exhibits a clear overall advantage across multiple performance indicators. Its total return of 234,100 CNY, though lower than that of the LPG strategy, is accompanied by the lowest volatility (4.73) and a maximum drawdown of only −10,200 CNY, resulting in minimal significant losses. Furthermore, its Sharpe ratio of 2.86—the highest among the three strategies—indicates that the combined strategy excels in stability and risk control, offering strong risk-adjusted return potential.
- (4)
Based on
Table 12, the LSTM-SSA model demonstrates the best performance. It achieves the highest Sharpe Ratio (2.86) while maintaining the lowest volatility (4.73) and the smallest maximum drawdown (1.02), reflecting exceptional risk control capabilities and superior risk-adjusted returns. In comparison, XGBOOST carries excessive risk with the lowest return efficiency, and although LSTM outperforms XGBOOST, it is significantly inferior to LSTM-SSA across all metrics.
Empirical analysis indicates that the portfolio hedging strategy preserves most returns during months of positive cost fluctuations while effectively mitigating risks during months of negative fluctuations (e.g., February 2024). This dynamic balancing mechanism renders it an optimized approach for electricity cost management.