1. Introduction
Natural gas, a relatively clean fossil fuel with lower carbon emissions than coal and oil, has emerged as a pivotal cornerstone in the global energy transition toward low-carbon sustainability [
1]. Its extensive applications include residential heating, industrial production, and power generation, making it indispensable for balancing energy security, economic development, and environmental protection. Against the backdrop of accelerating global decarbonization endeavors, the demand for natural gas has witnessed a substantial surge. Globally, natural gas consumption accounts for approximately 24% of primary energy consumption, while in China, its proportion has escalated from a negligible level to 8.4% in recent years, with projections indicating further growth [
2,
3,
4,
5,
6]. Nevertheless, this growing dependence is accompanied by significant price volatility, which poses formidable challenges to energy market stability, industrial investment planning, and household energy expenditures [
7,
8,
9,
10].
The fluctuation in natural gas prices stems from the complex interaction of multiscale and multidimensional factors, thereby positioning it as a focal point of research in the field of energy economics. Essentially, the dynamics of supply and demand, such as fluctuations in natural gas production, consumption, and inventory levels, as endogenous factors (i.e., the standard dynamics between supply and demand), exert a direct impact on short-term price oscillations [
11]. Geopolitical events and unforeseen incidents further amplify the price uncertainty. For example, the Russia-Ukraine conflict disrupted Europe’s conventional natural gas supply chains, precipitating a surge in imports of U.S. liquefied natural gas and restructuring of global maritime transportation networks, which in turn induced sharp fluctuations in regional market prices [
12].
Therefore, the accurate forecasting of natural gas prices is of paramount importance to multiple stakeholders. For policymakers, reliable forecasts can serve as a foundation for formulating energy security strategies such as adjusting import structures or optimizing reserve capacities. Market participants (e.g., producers and traders) facilitate risk management and investment decision making, mitigating losses arising from price volatility. For households and industries, stable price expectations are conducive to cost planning and adjustments to energy consumption patterns [
13].
Currently, there has been extensive exploration of natural gas price forecasting. The well-established classical statistical methods in this field include the Autoregressive Moving Average (ARMA) model [
14,
15], Autoregressive Integrated Moving Average (ARIMA) model [
16,
17,
18], Principal Component Analysis (PCA) [
19], and Generalized Autoregressive Conditional Heteroskedasticity (GARCH) model [
20]. For instance, Ruslan et al. analyzed prices using a series of univariate GARCH models, which provided a reference for regulators and investors in predicting market volatility and formulating relevant strategies. However, natural gas prices are affected by multiple complex factors, such as geopolitics, supply-demand relationships, and weather changes, and their fluctuations often exhibit strong nonlinear and nonstationary characteristics. This makes it difficult for classical statistical methods to fully capture the complex dynamics of price changes, leading to significant limitations in forecasting accuracy.
To overcome the constraints of traditional statistical techniques, scholars have introduced artificial intelligence-based methods, including the Structural Heterogeneous Autoregressive Vector Autoregression (SHVAR) model [
21], Support Vector Machines (SVM) [
22], and boosting algorithms [
23]. For instance, Su et al. employed an advanced least-squares regression boosting algorithm to predict natural gas prices. By optimizing the regression model, this algorithm remarkably enhanced the predicted R2 value, which is the coefficient of determination, indicating that the model fits the data extremely well. Simultaneously, the algorithm also decreased the Mean Absolute Error (MAE), indicating that the gap between the projected and factual values was effectively narrowed.
However, in practical forecasting, a single model can only capture specific data patterns. When dealing with complex datasets, especially natural gas price data, which are influenced by multiple factors and exhibit highly nonlinear and dynamically volatile characteristics, they are highly prone to overfitting the details in the training data (including random noise and outliers). By contrast, hybrid models that integrate multiple methods can effectively compensate for the limitations of a single approach. Consequently, model combinations have gained increasing popularity and are widely applied to address various challenges in natural gas price forecasting. For natural gas price forecasting, a hybrid approach was utilized by Wang et al., which fuses the complete ensemble empirical mode decomposition with adaptive noise-sample entropy (CEEMDAN–SE) and a Gated Recurrent Unit (GRU) network optimized through a Particle Swarm Optimization algorithm incorporating an Adaptive Learning Strategy (PSO–ALS–GRU). This model exhibits superior performance for analyzing long-term dependencies and addressing complex nonlinear problems [
24]. But lacks a component to extract local subtle features (e.g., short-term price correlations with inventory data), as GRU networks focus primarily on global temporal trends rather than local fine-grained patterns [
25]. Wang and Wang combined Long Short-Term Memory (LSTM), Wavelet Packet Decomposition (WPD), and stochastic time-effective weights (SW) to construct a new hybrid model (WPD-SW-LSTM). They used single models such as SVM, Back Propagation Neural Network (BPNN), LSTM, and their hybrid models for comparison and improved a new error measurement method to evaluate prediction results, achieving high-precision forecasting of oil futures prices [
26]. While the stochastic weights improved robustness, the unidirectional LSTM cannot capture backward temporal dependencies (e.g., how future price reversals affect the interpretation of current trends), limiting its understanding of sequential context [
25]. Jiang et al. proposed a hybrid prediction model based on Fuzzy Entropy Variational Mode Decomposition (VMD), WPD, and LSTM, targeting the complexity and nonlinear characteristics of natural gas production and consumption data, and demonstrated that its performance is significantly superior to other comparable models with certain practical value [
27]. Although multi-step decomposition enhanced multiscale feature extraction, the model lacks a dynamic weighting mechanism for key features—e.g., it assigns equal importance to trivial signals (e.g., minor daily demand fluctuations) and critical signals (e.g., geopolitical event shocks), leading to suboptimal focus on core influencing factors [
28].
Beyond these specific gaps, two broader limitations persist in existing hybrid models: (1) Most decomposition methods (e.g., CEEMDAN, PSO) suffer from mode mixing (i.e., overlapping frequency components in decomposed modes), which distorts feature extraction [
29]; (2) Few models integrate both local feature capture and bidirectional dependency mining—two capabilities critical for accurately modeling daily gas prices, which are influenced by both short-term local events and long-term global trends.
To fill the aforementioned gaps, this study proposes a novel hybrid model—VMD-CNN-BiLSTM-Attention—integrating Variational Mode Decomposition (VMD), Convolutional Neural Network (CNN), Bidirectional LSTM (BiLSTM), and an attention mechanism. Its key innovations (directly addressing the gaps in existing literature) are:
- (1)
Adaptive decomposition with VMD: Unlike CEEMDAN or PSO, VMD decomposes price series into intrinsic mode functions (IMFs) with minimal mode mixing [
29], enabling more accurate multiscale feature extraction and laying a solid foundation for subsequent modeling.
- (2)
Local feature capture with CNN: CNN is introduced to extract local correlations (e.g., short-term price-volume relationships, weekly demand cycles) that are overlooked by GRU/LSTM-based models—compensating for the local feature gap in Wang et al.’s [
24] CEEMDAN–SE–PSO–GRU.
- (3)
Bidirectional dependency mining with BiLSTM: Replacing unidirectional LSTM/GRU with BiLSTM allows the model to mine both forward (past→present) and backward (future→present) temporal dependencies—addressing the sequential context limitation in Wang and Wang’s [
26] WPD-SW-LSTM.
- (4)
Dynamic feature weighting with attention mechanism: The attention mechanism assigns higher weights to critical features (e.g., geopolitical shocks, inventory shortages) and lower weights to trivial signals—solving the equal-weighting problem in Jiang et al.’s [
27] VMD-WPD-LSTM.
In addition to methodological innovations, this study aims to enhance the practical value of gas price forecasting: by improving accuracy and robustness, the model can provide more reliable decision support for policymakers (e.g., refining emergency reserve policies), market participants (e.g., optimizing hedging strategies), and households (e.g., rationalizing energy consumption plans)—ultimately contributing to the stability of the global natural gas market and the advancement of low-carbon energy transitions.
4. Results and Discussion
The VMD algorithm cannot be directly applied to the decomposition of daily natural gas price sequences. This is because its input requires not only the original daily natural gas price data, but also the number of modes (K), whose value must be determined beforehand. The existing literature has proposed various methods for determining the value of K [
45,
46]. In research on the VMD of energy prices (e.g., natural gas, crude oil) and financial time series, the selection of K must balance the effectiveness of feature decomposition and model efficiency. In academic and engineering practice, K is usually restricted to a reasonable range of 3–7.
For daily frequency data, K = 5 is a commonly adopted intermediate value that has been validated in numerous empirical studies [
47,
48]. To further justify this choice, a sensitivity analysis was conducted with different mode numbers (K = 3, 5, and 7). As summarized in
Table 2, increasing K led to slightly improved forecasting performance, with the average RMSE decreasing from 0.1965 (K = 3) to 0.1288 (K = 7), and
increasing from 98.50% to 99.26%. However, when K exceeds 5, the number of decomposed Intrinsic Mode Function (IMF) components increases considerably, which substantially raises model training costs and complexity, while the incremental improvement in accuracy becomes marginal.
Therefore, K = 5 provides an appropriate balance between feature extraction integrity and computational feasibility. It allows the model to capture multi-scale characteristics of natural gas price dynamics—from short-term high-frequency fluctuations (e.g., intraday supply–demand disturbances) to medium- and long-term variations (e.g., seasonal cycles and macroeconomic influences)—without introducing redundant components. Accordingly, this study adopted K = 5 as the mode number for VMD decomposition. The decomposition results for daily NYMEX prices are shown in
Figure 5.
As shown in
Table 3, this decomposition process eliminates redundant information between components, while preserving all fluctuation scales of practical significance, confirming that K = 5 can optimally decompose the original price sequence. This preprocessing step not only mitigates the non-stationarity issue by converting complex raw data into a set of stationary and interpretable IMF components but also lays a foundation for targeted feature extraction in the subsequent VMD-CNN-BiLSTM-attention prediction model.
4.1. Evaluations of Various Models
As shown in
Table 4, from the one-step to four-step-ahead forecasting results, the VMD-CNN-BiLSTM-attention model consistently outperforms the CNN-BiLSTM-attention model across all evaluation metrics, confirming the effectiveness of VMD-based signal preprocessing in improving short-term forecasting accuracy. In the one-step-ahead forecasting task, the VMD-CNN-BiLSTM-attention model achieves an MSE of 0.0169, RMSE of 0.1301, MAE of 0.0823, MAPE of 2.45%, and
of 99.35%. In contrast, the CNN-BiLSTM-attention model exhibits higher error levels (MSE = 0.0325, RMSE = 0.1802, MAE = 0.1152, MAPE = 3.13%) and a lower fitting degree (
= 98.75%).
When the forecasting horizon is extended to two-, three-, and four-step-ahead, both models show a trend of gradually increasing errors and a slight decrease in , which is consistent with the inherent challenge of uncertainty accumulation in multistep time series forecasting. However, the VMD-CNN-BiLSTM-attention model still maintains superior stability and accuracy. Specifically, in the two-step-ahead task, it achieves MSE = 0.0213, RMSE = 0.1460, MAE = 0.0941, MAPE = 2.79%, and = 99.18%. In the three-step-ahead task, it records MSE = 0.0247, RMSE = 0.1572, MAE = 0.1013, MAPE = 2.99%, and = 99.05%. Even in the four-step-ahead task, it remains at a low MAPE of 3.46% and a high of 98.81%, whereas the CNN-BiLSTM-attention model’s MAPE increases to 6.21% and its drops to 95.74%.
This performance gap between the proposed hybrid model and the benchmarks suggests that VMD effectively decomposes the original price sequence into IMFs with distinct frequency characteristics. This enables the subsequent deep learning framework to more accurately capture frequency-specific fluctuation patterns (e.g., ultra-short-term volatility and short-term supply–demand disturbances) while mitigating high-frequency noise interference in long-term trend learning. This finding is consistent with the concept proposed by Zhang et al. (2025) [
49], which demonstrated that integrating multiple feature representations enhances model robustness and forecasting stability. Consequently, it offers a more reliable solution for multistep short-term price forecasting.
To clarify the multi-step forecasting protocol, we adopt a “separate model for each prediction horizon” strategy. For forecasting horizons ranging from one-step to four-step ahead, we do not use a recursive approach (i.e., predicting the next step first and then feeding that prediction back to predict subsequent steps). Instead, we train an independent model for each specific forecasting horizon. Specifically, when generating data sequences for training and testing, given an input sequence length of 14 days (as in our case), we construct training samples aimed at predicting each future day n (where n ranges from 1 to 4). Then, distinct CNN-BiLSTM-attention models (with or without VMD pre-processing) are trained for each n. Each of these models takes a historical sequence of the same length as input but is optimized to directly output the prediction for its designated future day. This ensures that each model focuses on learning the patterns specific to its respective forecasting horizon, rather than relying on potentially error—prone recursive predictions.
4.2. Discussion
As shown in
Figure 6, the prediction curve of the VMD-CNN-BiLSTM-attention model exhibited the highest degree of fit with the actual price curve. This visualization result strongly corresponds to the conclusion of “this model, demonstrating the optimal prediction performance” derived from the quantitative analysis of error metrics in the early stage, further confirming its reliability and superiority in the short-term prediction of NYMEX natural gas futures prices.
In the comparative experiment on 1–4-day short-term price prediction, an in-depth analysis was conducted on the performance differences between the VMD-CNN-BiLSTM-attention and CNN-BiLSTM-attention models. The results indicate that in the 1-th Day Ahead prediction scenario, although both models can initially capture the temporal evolution trend of prices, the hybrid model integrated with VMD performs better. Its core advantage lies in the implementation of multiscale mode decomposition on the nonstationary futures price sequence through VMD technology, which effectively separates and mitigates noise interference and fluctuation coupling in the original data. This provides purer and more representative input features for the subsequent local feature extraction by the CNN module, long-term dependency capture by the BiLSTM module, and key information focusing by the attention mechanism, ultimately achieving a higher degree of fitting between the prediction curve and actual price.
When the prediction horizon is extended to 2–4 days, the prediction errors of both models show an increasing trend, which conforms to the general rule of “positive correlation between prediction horizon and error” in the field of time series prediction. However, the comparison reveals that the error growth rate of the VMD-CNN-BiLSTM-Attention model is significantly lower. Particularly in the price periodic fluctuation scenarios covered in the dataset (such as medium-term fluctuations driven by seasonal supply-demand imbalances and short-term pulse fluctuations caused by unexpected event shocks), its advantage in predicting price inflection points is more prominent.
6. Conclusions
This study focuses on accurately predicting the daily prices of NYMEX natural gas futures. Addressing the limitation that traditional single models struggle to handle the nonlinearity, high volatility, and multi-scale characteristics of price series, this study proposes a hybrid prediction model, VMD-CNN-BiLSTM-attention, which integrates Variational Mode Decomposition (VMD), Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (BiLSTM), and an attention mechanism. Using daily closing price data from 3 April 1990 to 27 June 2025, comparative experiments for one-step to four-step forecasting were conducted between the proposed model and two benchmark models (CNN-BiLSTM-Attention and ARIMA). The key conclusions are as follows.
The empirical results demonstrate that the proposed VMD-CNN-BiLSTM-Attention model consistently outperforms the benchmark CNN-BiLSTM-Attention model across all key performance indicators, including MSE, RMSE, MAE, and MAPE. In the one-step to four-step forecasting horizons, the proposed model maintains remarkably low prediction errors, with MAPE values ranging from 2.45% to 3.46% and forecasting accuracy remaining above 98.81%. Moreover, the model exhibits the smallest performance degradation as the forecasting horizon increases: its MAPE increases by only 1.01 percentage points (from 2.45% in one-step forecasting to 3.46% in four-step forecasting), which is significantly lower than that of the CNN-BiLSTM-Attention model (3.08 percentage points, from 3.13% to 6.21%). These findings confirm that the proposed hybrid model possesses a stronger capability to capture the complex nonlinear dynamics of natural gas prices and provides more stable and reliable short- to medium-term forecasting performance. The synergy of multi-technical integration is the core source of the model’s advantages.
These results imply that incorporating signal decomposition and feature-fusion mechanisms can substantially enhance the adaptability of forecasting systems to nonstationary energy markets. This has important implications for energy trading strategies, risk management, and operational planning, where timely and precise short-term forecasts can reduce financial uncertainty and support data-driven decision-making.
The functional complementarity of the components of the model significantly enhances the prediction effect. VMD decomposes the original price series into multiple stable Intrinsic Mode Functions (IMFs), effectively separating the interference of fluctuations at different frequencies and reducing the impact of data nonlinearity on the model; CNN accurately extracts local features of each IMF (e.g., short-term price mutation signals); BiLSTM captures long-term temporal dependencies of the series (e.g., medium-term price trends); The attention mechanism assigns higher weights to key time steps, further strengthening the model’s ability to focus on core driving information of price fluctuations.
This “decomposition–feature extraction–temporal modeling–key information enhancement” pipeline demonstrates a systematic framework that can be generalized to other financial or commodity markets exhibiting multi-frequency volatility. In particular, it provides methodological insights for integrating data-driven deep learning with signal-processing theory to handle multi-scale time series.
This integrated design addresses the shortcoming of traditional models—“easily missing key patterns when dealing with complex series alone”—and is the fundamental reason for its leading performance.
- 2.
Research limitations and future optimization directions
This study constructs a model based only on single-series price data and does not incorporate external factors that may affect natural gas prices (e.g., U.S. EIA inventory data, weather indices, and the linkage effect of crude oil prices). In the future, multisource features can be introduced to further improve the generalization ability of the model. Meanwhile, the number of VMD modes (set to five in this study) is determined empirically; future research can optimize mode division through adaptive algorithms or combine the attention mechanism to weighted fuse the prediction results of different IMFs, thereby further exploring the value of multimodal data.
Expanding the model to include cross-market and exogenous indicators will also enhance its applicability to dynamic energy systems forecasting, contributing to more resilient and data-informed policy and investment decisions.
The relatively high
values reported in
Table 2 can be explained by the characteristics of the dataset and the model design. Specifically, the model uses the original price series as input rather than log returns or differenced values. Natural gas prices exhibit strong autocorrelation and trend persistence; therefore, the high
primarily reflects the model’s ability to track the level persistence of prices rather than to capture short-term volatility. The VMD component effectively removes high-frequency noise and preserves the main trend, while the CNN layer extracts local features and filters out short-term fluctuations. As a result, the predicted series is a smoothed curve that closely overlaps with the actual price level, which naturally increases the
metric—although the RMSE remains a more reliable indicator of predictive accuracy. Furthermore, the data normalization to the [0, 1] range compresses the variance, which causes the ratio of residual to total variation in
computation to approach unity
In conclusion, the VMD-CNN-BiLSTM-attention model provides an efficient and feasible technical solution for the short-to-medium-term forecasting of natural gas futures prices. The design concept of multi-technical integration also offers a reference for forecasting research on other highly nonlinear time series (e.g., crude oil and electricity prices), demonstrating strong promotional significance.