1. Introduction
Addressing climate change is a crucial issue concerning the survival and development of humanity, and actions to reduce greenhouse gas emissions have become a focus for all countries. Under a carbon market that imposes total quantity control on emission allowances, enterprises can profit by reducing emissions and selling surplus allowances as commodities, or purchase allowances to offset excess emissions. Meanwhile, this mechanism promotes the low-carbon transformation of enterprises and enables effective control of carbon emissions. The carbon price is a core element of the carbon market, directly reflecting the scarcity of carbon resources and enterprises’ emission-reduction costs. It is an important basis for enterprises to make production decisions and to allocate and trade carbon resources, and it serves as a key reference for the government to evaluate the operation of the carbon market and flexibly adjust emission-reduction policies.
Since the launch of pilot carbon emission trading in China in 2011, a unified national carbon emission trading market has taken shape, becoming an important measure for advancing the “dual carbon” goals. These pilot policies have driven regional emission reductions [1], strengthened enterprises’ environmental and social responsibility performance and self-governance [2], and accelerated technological innovation and green development [3]. However, the carbon price is affected by multiple complex factors, such as quota policies, industrial production, energy prices, and macroeconomic conditions, giving rise to non-stationary, high-noise, and nonlinear characteristics. Consequently, developing accurate prediction models for carbon prices is worth researching.
There has been considerable research on predicting carbon emission trading prices. One strand applies traditional time-series regression methods. Some studies have established combined MIDAS regression models for prediction, but these require manual determination of feature correlations between sequences and manual setting of weights and parameters, and they struggle to capture the extreme fluctuations that may occur in carbon price sequences. Other studies have constructed hybrid ARIMA-based regression models, but their parameter selection relies heavily on manual choice and empirical screening. Meanwhile, operations such as differencing in the ARIMA method may discard key information in the original sequence, which can degrade prediction results.
Another strand uses AI methods, such as machine learning, for prediction. Some studies have constructed CNN-LSTM models, but carbon price sequences usually mix linear and nonlinear features with high noise, making it difficult for such models to capture the features completely. Moreover, performance on test sets degrades markedly compared with training sets. The complex characteristics of the sequence arise from the superposition of signals at different frequencies, each carrying its own trend, periodic, and noise information, which calls for targeted decomposition and fitting to accurately extract the underlying patterns.
The ICEEMDAN method can automatically decompose data into several relatively stationary subsequences based on the characteristics of the signal itself, controlling the influence of noise while retaining information across different dimensions. The CNN-LSTM method can effectively extract local features from each subsequence and capture its long- and short-term dependencies. Consequently, this research proposes a hybrid prediction model integrating the ICEEMDAN decomposition method and the CNN-LSTM fitting method: it decomposes and denoises the original carbon price sequence, then extracts features and fits the components to improve prediction accuracy.
This research proceeds from two core expectations. First, integrating ICEEMDAN signal decomposition into a neural network pipeline should improve the accuracy of prediction by producing more reliable IMF components for subsequent modeling. Second, the combined CNN-LSTM method should better extract local features and dependencies across the components.
The core work of this research focuses on the proposed ICEEMDAN-CNN-LSTM prediction model. Firstly, to address the pain points in predicting carbon prices, a prediction framework comprising decomposition, fitting, and recombination is designed by leveraging the advantages of ICEEMDAN for signal decomposition and the feature-extraction capabilities of CNN-LSTM. Secondly, the closing price of the Hubei carbon emission trading market is selected as the research object, and three indicators, namely carbon emission futures [4], the S&P 500 index [5], and the US dollar index [6], are selected as influencing factors and used as input data after preprocessing such as interpolation and normalization. Thirdly, the ICEEMDAN-CNN-LSTM model is applied to obtain prediction results, which are then compared with those of the benchmark models LSTM and CNN-LSTM. In addition, this research selects four evaluation indicators, namely MAE, MSE, RMSE, and MAPE, and comprehensively verifies the superiority of the model through robustness tests under different division ratios of training and test sets.
From the experimental results, compared with two benchmark models, the proposed model reduces MAE, MSE, RMSE, and MAPE by more than 47%, 78%, 53%, and 42%, respectively, in the training set, and by more than 59%, 81%, 57%, and 57%, respectively, in the test set. This shows that the constructed ICEEMDAN-CNN-LSTM model can effectively handle the frequency, trend, and noise in carbon prices and significantly improve feature capture, resulting in higher prediction accuracy.
The rest of the paper consists of four sections. Section 2 systematically reviews research progress on carbon price prediction through a literature review. Section 3 introduces the model framework and the research methods adopted in this study in detail. Section 4 presents an empirical analysis, covering basic information on the selected data as well as the prediction results and indicator performance of the proposed model. Finally, Section 5 presents conclusions and prospects.
2. Literature Review
In the early years, carbon price prediction mostly drew on traditional methods for electricity price prediction, such as regression and econometric models. For example, Contreras et al. [7] used the ARIMA model to predict next-day electricity prices in Spain and other regions. Zareipour et al. [8] used transfer function and dynamic regression models, incorporating relevant market data to improve the accuracy of energy price predictions in Ontario, Canada. Similarly, some studies treat carbon prices as time series and use traditional regression methods to predict them [9]. Byun & Cho [10] used GARCH-family models, implied volatility (IV), and other methods for prediction. However, such traditional methods usually require linear and stationary sequences, while actual carbon prices are affected by multiple factors such as policies, markets, and energy prices, often showing significant nonlinear, non-stationary, and high-noise characteristics, making it difficult for traditional models to fully exploit the effective information in the sequences [11].
With the development of artificial intelligence, machine learning and deep learning methods have been increasingly applied to carbon price prediction in recent years, thanks to their strong nonlinear fitting and feature-learning capabilities. Compared with single models, hybrid models that integrate multiple methods usually achieve better predictive performance. For example, Dong et al. [12] designed an Lp-CNN-LSTM model to effectively deal with high- and low-frequency components of carbon prices. Wang & Zhuang [13] combined XGBoost with BiLSTM and BiGRU methods to improve prediction accuracy. In addition, some studies focus on selecting influencing factors. For example, Yao et al. [14] used the Pearson correlation coefficient method to screen variables and constructed a BP-LSTM hybrid model. Wei et al. [15] selected multi-level indicators, such as the CSI 300 index, to establish a Transformer-LSTM model. Wang & He [4] adopted the APVMD-LightGBM-TCN method to construct a complete framework for feature extraction, factor screening, and prediction. The above studies provide useful references for feature selection and model construction, but problems remain, such as a rough depiction of extreme situations and rise-and-fall trends, and the low representativeness of selected features.
Considering the non-stationary and high-noise characteristics of carbon prices, introducing signal decomposition methods into the prediction model can decompose the original sequence into several relatively stationary subsequences, effectively reducing noise interference and capturing hidden information, and thus improving prediction accuracy. For example, He et al. [16] used the VMD-SWD secondary decomposition method to process the carbon price sequence and extract effective information. Nadirgil [17] used CEEMDAN-VMD dual decomposition combined with a neural network for prediction, thereby verifying the decomposition method’s effectiveness. Wang et al. [18] constructed a CEEMDAN-SE-LSTM-RF model and made innovations in feature extraction. Duan et al. [5] used the CEEMDAN method for multiple decompositions, further improving prediction accuracy. The experimental results of these studies show that signal decomposition methods can considerably improve a prediction model’s ability to capture the trends and details of the sequence.
Some studies focus on feature selection and model parameter optimization. For feature selection, Li & Ren [6] effectively improved the model’s predictive performance through feature selection algorithms, while Liu et al. [19] screened out key factors through regularization models. Wei & Ouyang [20] used the s-PCA method to reduce the dimensionality of influencing factors and improve prediction accuracy. For model parameter optimization, Shi et al. [21] constructed a CNN-LSTM model and selected the optimal parameter combination. Yu & Shi [22] introduced an attention mechanism into CNN-LSTM to enable the model to focus on key features. However, existing methods still rely heavily on personal experience in parameter setting; in addition, there is still room for improvement in the handling of noise interference.
In addition, some studies focus on the explainability and comparative analysis of models. For model explainability, Sayed et al. [23] applied interpretable artificial intelligence techniques to analyze prediction results. For comparative analysis, Hong et al. [24] evaluated the performance of various classification methods for predicting carbon price trends, and Kumar et al. [25] systematically compared the predictive performance of traditional time-series models and machine learning models. These studies provide a diversified perspective for model construction and evaluation. From the existing research, it can be concluded that various hybrid models have made progress in carbon price prediction, but deficiencies remain, including insufficient handling of high-noise and non-stationary features in the sequence, excessive dependence on individual experience in setting complex model parameters, and room for improvement in feature extraction and data fitting.
Among the various signal decomposition methods used in carbon price prediction, ICEEMDAN offers several advantages that distinguish it from alternatives. Unlike traditional EMD or EEMD, which suffer from mode aliasing or residual noise, ICEEMDAN employs an adaptive noise control mechanism that dynamically adjusts the noise intensity based on the local characteristics of the residual signal. This dynamic noise elimination strategy is particularly well suited to the carbon price series, which exhibits time-varying volatility due to periodic policy adjustments and external shocks. Compared with VMD, which requires pre-specification of the number of modes and a penalty factor, ICEEMDAN is fully data-driven and can autonomously determine the appropriate decomposition level. These features make ICEEMDAN a more robust and adaptive decomposition tool for non-stationary and nonlinear carbon price sequences, providing higher-quality IMF components for subsequent CNN-LSTM modeling.
In view of these considerations, an ICEEMDAN-CNN-LSTM hybrid prediction model is proposed with the following core advantages. Firstly, the ICEEMDAN method adaptively decomposes the original sequence, efficiently suppressing mode aliasing, and determines the number of decomposition modes based on the sequence’s own characteristics, thereby avoiding deviations caused by manual settings and improving the stability and reliability of subsequent feature extraction. Secondly, the CNN method can effectively capture local key features of the subsequences, and the LSTM method can model long-term dependencies within them. By integrating the CNN and LSTM methods, the model’s ability to capture the complicated dynamics of the sequences is significantly improved. Finally, by applying the ICEEMDAN method, the model performs multi-scale decomposition and reduces the noise interference of the original sequence, enabling the CNN-LSTM method to focus on subsequences with clearer features. This effectively addresses the pain points of insufficient noise suppression and incomplete feature capture in existing prediction methods, reduces prediction errors, and yields stronger robustness.
3. Model Construction
3.1. Methods
Carbon emission trading prices exhibit significant non-stationarity, nonlinearity, and high noise, making it difficult for a single model to simultaneously filter noise, extract multi-scale features, and capture long- and short-term dependencies. Based on the core logic of “Decomposition–Extraction–Fitting”, a hybrid model framework of “ICEEMDAN signal decomposition–CNN feature extraction–LSTM time-series fitting” is used to predict carbon prices. The ICEEMDAN method adaptively decomposes the original sequence, effectively handles noise in carbon prices, and yields relatively stationary subsequences. The CNN method extracts local key features from nonlinear sequences and effectively mines their hidden information. The LSTM method captures long-term dependencies in sequences and effectively models time-series features. By organically combining these methods, the model is able to reduce noise, capture features, and fit the time series, finally realizing accurate prediction of carbon prices. The model’s framework is shown in Figure 1, and a detailed introduction to each method adopted in the model is provided below.
3.1.1. ICEEMDAN
Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (ICEEMDAN) [26] is a signal decomposition method based on Empirical Mode Decomposition (EMD) and Ensemble Empirical Mode Decomposition (EEMD). By introducing adaptive noise and a complete ensemble strategy, it can effectively suppress the mode aliasing that may occur during signal decomposition and control noise to a certain extent, thereby yielding more reliable decomposition results. ICEEMDAN further optimizes the CEEMDAN method: it automatically determines the most appropriate number of decomposition modes according to the characteristics of the input signal itself, rather than relying on manual settings that may introduce errors. At the same time, by constructing multiple groups of white noise and performing ensemble averaging, ICEEMDAN suppresses noise interference in the original sequence, effectively preserves signal information, and improves the stability of the decomposition results. Its main steps are as follows:
Firstly, generate $M$ groups of independent Gaussian white noise $n^{(m)}(t)$ ($m = 1, 2, \dots, M$) with mean 0 and variance 1, and set the initial residual component $r_0(t)$ equal to the original signal $x(t)$.
Then, add noise to the original signal to obtain a series of noisy signals $r_0(t) + \varepsilon_0 n^{(m)}(t)$, and decompose each of them using the EMD method to obtain the first intrinsic mode components $\mathrm{IMF}_1^{(m)}(t)$. Perform ensemble averaging over the $M$ groups to obtain the first IMF of the ICEEMDAN method, namely

$$\mathrm{IMF}_1(t) = \frac{1}{M} \sum_{m=1}^{M} \mathrm{IMF}_1^{(m)}(t),$$

and calculate the first residual component $r_1(t) = r_0(t) - \mathrm{IMF}_1(t)$.
Next, calculate the standard deviation of the first residual component $r_1(t)$:

$$\sigma_1 = \operatorname{std}\big(r_1(t)\big),$$

and adaptively determine the corresponding noise intensity $\varepsilon_1 = \varepsilon_0 \cdot \sigma_1$. Construct the noisy residual signals $r_1(t) + \varepsilon_1 n^{(m)}(t)$, decompose them with the EMD method to obtain the second intrinsic mode components $\mathrm{IMF}_2^{(m)}(t)$, obtain $\mathrm{IMF}_2(t)$ by ensemble averaging, and calculate the second residual component $r_2(t) = r_1(t) - \mathrm{IMF}_2(t)$.
After that, repeat the above steps until the residual component is monotonic or has only one extreme point, at which point the decomposition terminates. At this time, the $k$-th IMF is

$$\mathrm{IMF}_k(t) = \frac{1}{M} \sum_{m=1}^{M} \mathrm{IMF}_k^{(m)}(t),$$

and the final residual component $r_k(t)$ is obtained.
Finally, the original signal can be reconstructed from all IMFs and the final residual component $r_k(t)$, namely

$$x(t) = \sum_{i=1}^{k} \mathrm{IMF}_i(t) + r_k(t).$$
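To make the procedure concrete, the following is a minimal Python sketch of this decomposition loop, built on the EMD implementation from the PyEMD package (pip name EMD-signal). It is an illustration of the steps above under simplified assumptions (extracting only the first EMD mode of each noisy copy and using a plain monotonicity stopping rule), not the exact implementation used in this research; the function and parameter names are ours.

```python
import numpy as np
from PyEMD import EMD  # pip install EMD-signal

def iceemdan_sketch(x, n_ensembles=100, eps0=0.4, max_imfs=9, seed=0):
    """Simplified ICEEMDAN-style loop: ensemble-average the first EMD mode of
    noise-assisted residuals, adapting the noise intensity to each residual."""
    rng = np.random.default_rng(seed)
    emd = EMD()
    residual = np.asarray(x, dtype=float).copy()
    eps = eps0                                   # initial noise intensity (epsilon_0)
    imfs = []
    for _ in range(max_imfs):
        mode = np.zeros_like(residual)
        for _ in range(n_ensembles):
            noise = rng.standard_normal(residual.size)
            mode += emd(residual + eps * noise, max_imf=1)[0]  # first EMD mode
        mode /= n_ensembles                      # ensemble average -> current IMF
        imfs.append(mode)
        residual = residual - mode
        eps = eps0 * np.std(residual)            # epsilon_k = epsilon_0 * sigma_k
        # stop once the residual is monotonic (no interior extrema left)
        if np.all(np.diff(residual) >= 0) or np.all(np.diff(residual) <= 0):
            break
    return np.array(imfs), residual              # x ~ imfs.sum(axis=0) + residual
```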
3.1.2. CNN
The Convolutional Neural Network (CNN) is a classic deep learning method. It was initially used in signal processing and in image recognition and classification, and was later extended to other tasks such as feature extraction. Through its local receptive field and weight-sharing mechanism, a CNN can efficiently capture local correlation features in data. It generally includes an input layer, convolutional layers, pooling layers, and fully connected layers. The input layer receives the original data and converts it into a feature matrix with a specified batch size, step size, and feature dimension. The convolutional layer performs cross-correlation operations on the feature matrix by sliding convolution kernels to extract local key features. The pooling layer adopts a specific pooling strategy to compress feature dimensions, retaining core features while reducing computation and improving the generalization ability of the model. The fully connected layer maps the pooled feature vectors to the output space. The CNN process is shown in Figure 2.
Firstly, capture local features of the data through the convolution kernel:

$$Y = X \star W + b,$$

where $X$ is the input feature map, $W$ is the convolution kernel, $b$ is the bias, and “$\star$” denotes the cross-correlation operation, that is, sliding the convolution kernel as a window over the input feature map and multiplying and summing the corresponding entries. Here $k$ is the side length of the convolution kernel, $c_{\mathrm{in}}$ is the number of channels of the input feature map, and $c_{\mathrm{out}}$ is the number of feature types to be detected, that is, the number of channels of the output feature map, so that $W \in \mathbb{R}^{k \times k \times c_{\mathrm{in}} \times c_{\mathrm{out}}}$.
Then, compress the data via a pooling operation to retain its features while reducing the output dimension and the computation. For max pooling:

$$P_{i,j,c} = \max_{0 \le u, v < k} X_{i \cdot s + u,\, j \cdot s + v,\, c},$$

where $P$ is the output feature map, $i$ and $j$ are the row and column coordinates of the output feature map, $c$ is the channel index, $X$ is the input feature map, $k$ is the side length of the pooling window, $u$ and $v$ are the local offsets within the window, and $s$ is the step size.
Finally, integrate the features through the fully connected layer and output the final result:

$$z = W x + b,$$

where $z$ is the output vector, $W$ is the weight matrix, $x$ is the input vector, and $b$ is the bias.
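To ground these formulas, the following NumPy sketch implements their one-dimensional, single-channel analogues, which is the form in which convolution and pooling act on time-series subsequences (the function names and the 1-D simplification are ours):

```python
import numpy as np

def conv1d_valid(x, w, b):
    """1-D cross-correlation: slide kernel w over x, multiply and sum, add bias."""
    k = len(w)
    return np.array([np.dot(x[i:i + k], w) + b for i in range(len(x) - k + 1)])

def max_pool1d(x, k, s):
    """Max pooling with window side length k and step size s."""
    return np.array([x[i:i + k].max() for i in range(0, len(x) - k + 1, s)])

x = np.array([1.0, 3.0, 2.0, 5.0, 4.0])
y = conv1d_valid(x, np.array([0.5, 0.5]), 0.0)  # pairwise averages: [2. 2.5 3.5 4.5]
p = max_pool1d(y, k=2, s=2)                     # [2.5 4.5]
```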
3.1.3. LSTM Network
The Long Short-Term Memory (LSTM) network is a neural network architecture based on the Recurrent Neural Network (RNN). Using three gating structures (input gate, forget gate, and output gate) together with a cell state, it addresses the gradient explosion and gradient vanishing problems that occur in traditional RNNs during training. The cell state can store time-series information over longer horizons, enabling better capture of long- and short-term dependencies. The gating structures flexibly adjust the forgetting and updating of information to screen out the core time-series features.
As shown in Figure 3, an LSTM structure usually consists of an input layer, one or more hidden layers, and an output layer. Data enter the model through the input layer and are transformed in the hidden layers to extract information; finally, the output layer generates the processed result. An LSTM structure can have one or more hidden layers, depending on specific task requirements, to effectively extract information from the data.
The hidden layer of the LSTM consists of several hidden units, and the internal structure of a single hidden unit is shown in Figure 4. In addition to receiving the input at the current time step, the hidden unit also receives the output from the previous time step, and it produces the output at the current time step after processing through the input gate, forget gate, output gate, and other structures.
Let the current time step be $t$, the input be $x_t$, the hidden state at the previous time step be $h_{t-1}$, and the cell state be $C_{t-1}$. In the LSTM hidden unit:
Firstly, the forget gate determines which information in the previous cell state $C_{t-1}$ is retained or discarded:

$$f_t = \sigma\big(W_f \cdot [h_{t-1}, x_t] + b_f\big),$$

where $\sigma$ is the Sigmoid activation function, $W_f$ is the weight matrix, and $b_f$ is the bias.
Then, the input gate determines the new information to be stored at the current time step and generates the candidate cell state:

$$i_t = \sigma\big(W_i \cdot [h_{t-1}, x_t] + b_i\big), \qquad \tilde{C}_t = \tanh\big(W_C \cdot [h_{t-1}, x_t] + b_C\big),$$

where $W_i$ and $W_C$ are weight matrices, and $b_i$ and $b_C$ are biases.
Next, update the cell state at the current time step according to the results of the forget gate and input gate:

$$C_t = f_t \odot C_{t-1} + i_t \odot \tilde{C}_t.$$
Finally, the output gate determines which information in the cell state is output to the hidden state, and generates the hidden state at the current time step:

$$o_t = \sigma\big(W_o \cdot [h_{t-1}, x_t] + b_o\big), \qquad h_t = o_t \odot \tanh(C_t),$$

where $W_o$ is the weight matrix, and $b_o$ is the bias.
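Collecting the gate equations, one hidden-unit update can be written as a single function; the NumPy sketch below mirrors the formulas directly (a didactic re-implementation with our own naming, not the library code used in the experiments):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, C_prev, W, b):
    """One LSTM update; W['f'], W['i'], W['C'], W['o'] act on [h_{t-1}, x_t]."""
    z = np.concatenate([h_prev, x_t])
    f = sigmoid(W["f"] @ z + b["f"])            # forget gate f_t
    i = sigmoid(W["i"] @ z + b["i"])            # input gate i_t
    C_tilde = np.tanh(W["C"] @ z + b["C"])      # candidate cell state
    C_t = f * C_prev + i * C_tilde              # cell-state update
    o = sigmoid(W["o"] @ z + b["o"])            # output gate o_t
    h_t = o * np.tanh(C_t)                      # hidden state
    return h_t, C_t
```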
3.2. Evaluation Indicators
In addition, to comprehensively evaluate the model’s predictive performance, Mean Absolute Error (MAE), Mean Square Error (MSE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE) are selected as performance metrics. The calculation formulas are as follows:

$$\mathrm{MAE} = \frac{1}{n} \sum_{i=1}^{n} \lvert y_i - \hat{y}_i \rvert, \qquad \mathrm{MSE} = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2,$$

$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2}, \qquad \mathrm{MAPE} = \frac{100\%}{n} \sum_{i=1}^{n} \left\lvert \frac{y_i - \hat{y}_i}{y_i} \right\rvert,$$

where $y_i$ is the actual value, $\hat{y}_i$ is the predicted value, and $n$ is the number of samples.
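In code, the four indicators are one-liners; a minimal NumPy helper (ours):

```python
import numpy as np

def evaluate(y_true, y_pred):
    """Return MAE, MSE, RMSE, and MAPE (in %) for a pair of series."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    err = y_true - y_pred
    mae = np.mean(np.abs(err))
    mse = np.mean(err ** 2)
    rmse = np.sqrt(mse)
    mape = 100.0 * np.mean(np.abs(err / y_true))
    return {"MAE": mae, "MSE": mse, "RMSE": rmse, "MAPE": mape}
```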
4. Empirical Analysis
4.1. Data Collection and Processing
This research collects carbon emission trading price data from the historical records of the Hubei Carbon Emission Trading Center from 24 April 2017 to 20 February 2025. Referring to the studies of Wang & He [4], Duan et al. [5], and Li & Ren [6], and considering the economic representativeness of the indicators, this research selects three indicators, namely carbon emission futures, the S&P 500 index, and the US dollar index, as influencing factors; all indicator data are from the Investing.com database. The indicators are shown in Table 1.
The Pearson correlation matrix of the carbon price and the selected additional features is presented in Figure 5. Each selected feature exhibits a statistically significant correlation with the carbon price, indicating that these features may have predictive power for it.
To ensure the continuity of each time series and eliminate the impact of missing data on the model, this research uses linear interpolation to fill in missing values in each dataset, resulting in 1837 valid data points. Linear interpolation reasonably estimates intermediate missing values: unlike forward or backward filling, which can introduce artificial plateaus, it preserves the continuity of price dynamics and reflects the gradual price change between adjacent trading days. The descriptive statistics of the processed carbon price and each indicator are shown in Table 2.
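This gap-filling rule is, for instance, what pandas’ interpolate performs; a toy check (the use of pandas here is our illustration, not a detail reported in the study):

```python
import numpy as np
import pandas as pd

s = pd.Series([34.0, np.nan, np.nan, 35.5])     # toy price series with a two-day gap
print(s.interpolate(method="linear").tolist())  # [34.0, 34.5, 35.0, 35.5]
```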
Table 2 shows that the maximum closing price of Hubei carbon emission trading is 61.48 yuan, the minimum is 11.56 yuan, and the average is 34.00 yuan; the overall price level is within a reasonable range compared to China’s carbon emission trading pilots and the national market. At the same time, the difference between the maximum and minimum is about 50 yuan and the standard deviation is 11.29, reflecting large fluctuations in carbon prices. This indicates the non-stationary, high-noise character of carbon prices under multiple influencing factors, and underscores the need for accurate prediction. Among the indicators, the average carbon emission futures price is 47.11 yuan with a standard deviation of 28.52, notably higher than that of the carbon price, indicating that the futures market is more sensitive to changes in policy, demand, and other factors. The S&P 500 index is larger in scale than the carbon price, reflects stock market fluctuations, and may be correlated with carbon prices. The standard deviation of the US dollar index is 5.53, showing relatively stable changes; its short-term impact on carbon prices is small, but it may affect them indirectly through other factors.
Next, this research uses the first 80% of the data in the sequence as the training set and the last 20% as the test set. The data and division of carbon prices and the various indicators are shown in Figure 6 and Figure 7.
To eliminate the impact of differences in indicator dimensions on model construction and prediction, this research uses min-max normalization to scale the data to the [0, 1] interval. The formula is as follows:

$$x' = \frac{x - x_{\min}}{x_{\max} - x_{\min}},$$

where $x_{\min}$ and $x_{\max}$ are the minimum and maximum of the corresponding series.
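One common implementation fits the scaling bounds on the training portion only, so that no test-set information leaks into preprocessing (the paper does not state where its bounds are fitted, so this is our assumption):

```python
import numpy as np

def min_max_scale(train, test):
    """Scale both splits to [0, 1] using bounds fitted on the training data only."""
    x_min, x_max = train.min(axis=0), train.max(axis=0)
    span = x_max - x_min
    return (train - x_min) / span, (test - x_min) / span
```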
4.2. Model Training and Parameter Setting
The research uses a rolling window with a length of 20 days to perform one-step rolling prediction: the carbon prices and the other indicators over the previous 20 days serve as input to predict the carbon price at the next time step, after which the window rolls forward one time step and the process repeats.
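A minimal sketch of this windowing step, assuming the carbon price occupies column 0 of a (days × features) array (the layout is our convention):

```python
import numpy as np

def make_windows(data, window=20):
    """Build (samples, window, features) inputs and next-day price targets."""
    X, y = [], []
    for t in range(window, len(data)):
        X.append(data[t - window:t])  # features for days t-20 .. t-1
        y.append(data[t, 0])          # next day's closing carbon price
    return np.array(X), np.array(y)
```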
The model’s parameter settings mainly concern the CNN-LSTM part: the time window length is set to 20, the number of hidden state units is 128, the dropout ratio is 0.001, the activation function is ReLU, and the Adam optimizer is used. In addition, low-frequency signal components achieve a good fit with relatively few iterations, whereas high-frequency components require more. Therefore, the number of iterations is set to 200 for high-frequency signals and 60 for low-frequency signals. For ICEEMDAN, this research sets the number of ensemble members to 100 and the initial noise level to 0.4. Detailed parameter settings are shown in Table 3.
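Under these settings, a per-component CNN-LSTM might be assembled as below. The framework (tensorflow.keras) and the convolution hyperparameters (64 filters, kernel size 3) are our assumptions; the paper fixes only the window length, hidden units, dropout ratio, activation, and optimizer:

```python
import tensorflow as tf
from tensorflow.keras import layers

def build_cnn_lstm(window=20, n_features=4):
    """CNN-LSTM for one IMF component, following the Table 3-style settings."""
    model = tf.keras.Sequential([
        layers.Input(shape=(window, n_features)),
        layers.Conv1D(64, kernel_size=3, padding="same", activation="relu"),  # assumed
        layers.MaxPooling1D(pool_size=2),
        layers.LSTM(128),                 # hidden state units
        layers.Dropout(0.001),            # dropout ratio
        layers.Dense(1),                  # next-step price of this component
    ])
    model.compile(optimizer="adam", loss="mse")
    return model

# High-frequency components: model.fit(X, y, epochs=200); low-frequency: epochs=60.
```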
4.3. Experimental Result Analysis
4.3.1. Signal Decomposition Results
The decomposition results from the ICEEMDAN method are shown in Figure 8.
Figure 8 shows that ICEEMDAN automatically decomposes the original carbon price sequence into several components, effectively separating the noise and trend information. The subsequences are more stationary than the original sequence, and mode aliasing is avoided. Among them, IMF1~IMF3 exhibit the most violent fluctuations and the shortest cycles, reflecting short-term random noise and small fluctuations in the original sequence. IMF4~IMF6 exhibit relatively small fluctuations and longer cycles, which may reflect the impact of medium-term factors, such as short-term changes in energy prices. IMF7~IMF8 exhibit gentler fluctuations and still longer cycles, which may correspond to long-term trend changes, such as policy modifications. IMF9 shows a stable trend with almost no high-frequency fluctuations, representing the long-term core trend of the original sequence.
4.3.2. Benchmark Model Comparison
To verify the effectiveness of the ICEEMDAN-CNN-LSTM model, the LSTM and CNN-LSTM models are selected as benchmarks for comparing predictive performance. The proposed model is constructed, and actual data are used to complete the training and prediction steps described above. The final experimental results of each model are shown in Figure 9.
The experimental results show that during training, the ICEEMDAN-CNN-LSTM and CNN-LSTM models generally outperform the LSTM model. Given the characteristics of the methods used in each model, this can be explained by the CNN layers effectively extracting feature information from each influencing factor and from the carbon price itself during training, thereby achieving better training results. Toward the end of the training set, roughly from March 2022 to July 2023, the performance of the CNN-LSTM model declines. A possible reason is that the data at the end of the training set contain patterns different from those in earlier periods, and the LSTM component fails to capture them in time, instead continuing to apply the earlier pattern, resulting in deviations from the real values. In contrast, the ICEEMDAN-CNN-LSTM model provides the best fit on the training set and effectively captures the rise-and-fall trend of the original sequence.
In the test part, the LSTM and CNN-LSTM models can only roughly depict the trend and fail to capture detailed rises and falls. In comparison, the ICEEMDAN-CNN-LSTM model is more accurate and better reproduces detailed changes in the data. This benefit comes from the ICEEMDAN decomposition, which makes the features in the component signals easier for the model to learn and fit, enabling more accurate results when the CNN-LSTM method completes the prediction and recombination in the subsequent steps.
The performance comparison of each model under the various evaluation indicators is shown in Table 4.
Table 4 shows that the ICEEMDAN-CNN-LSTM model proposed in this research achieves better performance in predicting carbon prices than other models. On the test set, it achieves an MAE of 1.140 yuan, which is 59.1% lower than LSTM and 65.2% lower than CNN-LSTM. At the same time, the RMSE of the proposed model is reduced by 57.2% and 62.9% compared to LSTM and CNN-LSTM, respectively. The decomposition of data using ICEEMDAN significantly improves prediction accuracy, thereby verifying the effectiveness of the proposed model. Also, by comparing performance between the training and test sets of each model, the proposed model maintains consistently lower errors than benchmark models, demonstrating strong generalization ability without significant overfitting. In terms of MAPE, the proposed model achieves 2.469% on the test set, compared to 6.245% for LSTM and 7.173% for CNN-LSTM, representing reductions of 57.6% and 63.1%, respectively. These improvements confirm that the “decomposition–extraction–fitting” framework effectively handles the non-stationary and high-noise characteristics of carbon price series.
4.3.3. Robustness Analysis Under Different Data Splits
In addition, the model’s robustness is verified by adjusting the splitting ratios of the training and test sets. The comparison of different splitting ratios and the corresponding evaluation indicators is shown in Table 5.
Table 5 shows that the proposed model performs significantly better than the benchmark models across different training/test splitting ratios, and the error values remain within an acceptable range, indicating that the ICEEMDAN-CNN-LSTM model is robust. When the training proportion decreases from 80% to 50%, all models exhibit increasing test-set errors, with the LSTM showing the most dramatic deterioration, followed by the CNN-LSTM. The proposed model also shows error growth, yet its absolute error under the most challenging 50%/50% split remains lower than that of the LSTM under the favorable 80%/20% split.
4.3.4. Statistical Significance Test
To further verify the statistical significance of the proposed model’s prediction accuracy, the Diebold–Mariano (DM) test is used to compare the test-set prediction results in pairs, with squared error as the loss function and the prediction step set to 1. The Diebold–Mariano test results are shown in Table 6.
Table 6 shows that the DM statistics for ICEEMDAN-CNN-LSTM against LSTM and CNN-LSTM are 11.454 and 15.892, respectively (both p < 0.001), rejecting the null hypothesis at the 1% level. This confirms that the predictive advantage of the proposed model is statistically significant.
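For reference, with squared-error loss and one-step forecasts the DM statistic reduces to a t-type test on the mean loss differential; a compact sketch (our implementation):

```python
import numpy as np
from scipy import stats

def diebold_mariano(e1, e2, h=1):
    """DM test on forecast errors e1, e2 under squared-error loss."""
    d = np.asarray(e1) ** 2 - np.asarray(e2) ** 2   # loss differential
    n = len(d)
    dbar = d.mean()
    var = np.sum((d - dbar) ** 2) / n               # lag-0 autocovariance
    for k in range(1, h):                           # HAC terms for h-step forecasts
        var += 2 * np.sum((d[k:] - dbar) * (d[:-k] - dbar)) / n
    dm = dbar / np.sqrt(var / n)
    p_value = 2 * (1 - stats.norm.cdf(abs(dm)))
    return dm, p_value
```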
4.3.5. Ablation Experiment for IMF Component Contributions
To further validate the contribution of each IMF component of the ICEEMDAN decomposition to carbon price prediction, this research conducts an ablation experiment. Specifically, each of the nine IMF components is predicted independently, and the impact of sequentially removing each component on overall prediction accuracy is analyzed. The prediction errors of the individual IMF components and the results of the ablation experiment are shown in Table 7 and Table 8.
Table 7 shows that the prediction errors of individual IMF components exhibit significant variation. IMF8 achieves the largest prediction error, with an MAE of 1.054 yuan, contributing 37.19% of the total prediction error. Meanwhile, IMF1 ranks second, with an MAE of 0.548 yuan, accounting for 19.34% of the total error. IMF2, IMF3, and IMF9 have MAE values between 0.22 and 0.25 yuan, with a combined contribution of approximately 24.69%. In contrast, IMF4 through IMF7 exhibit relatively small prediction errors, with MAE values ranging from 0.12 to 0.18 yuan, collectively contributing only 18.78% to the total error.
This distribution reflects the differing contributions of the frequency components to the carbon price series. The mid-frequency components IMF4–IMF7 demonstrate good predictability with relatively small prediction errors. In contrast, IMF8, a large-amplitude low-frequency component close to the trend, and IMF1, the highest-frequency component with strong randomness, both exhibit larger prediction errors.
Table 8 shows that after removing the trend component IMF9, the MAE greatly increases from the baseline value of 1.247 yuan to 45.080 yuan, representing an increase of 3515.56%, making the prediction almost completely ineffective, while removing IMF8 also increases the MAE by 248.06% to 4.340 yuan. The results demonstrate that the low-frequency trend components, particularly IMF9 and IMF8, are the core drivers of carbon price prediction, and the long-term trend information they contain plays an important role in prediction accuracy.
At the same time, removing IMF1 reduces the MAE by 20.56%, indicating that IMF1 contributes negatively to the prediction and that the current equal-weighted linear combination may not be optimal. The underlying reason is that high-frequency components are much more susceptible to short-term random noise, which makes them less predictable, so their prediction errors propagate into the final result when included with equal weight. Future research could explore adaptive weighting strategies or selective combination to address this limitation. Removing IMF2 or IMF4 leads to slight MAE increases, and removing the remaining IMFs likewise produces moderate MAE increases.
4.3.6. Comparison with an Alternative Decomposition Method
In addition, the research verifies the improvement brought by ICEEMDAN decomposition by replacing ICEEMDAN with VMD, turning the model into VMD-CNN-LSTM. The performance of this model under the various evaluation indicators is shown in Table 9.
Table 9 shows that the performance of the VMD-based model under the indicators is weaker than that of the ICEEMDAN-based model, suggesting that using ICEEMDAN as the decomposition method contributes to improving the performance of the model.
4.3.7. Comparison with a Traditional Statistical Method
To highlight the advantages of deep learning approaches over traditional statistical methods, the SARIMAX method is used to complete the same prediction task and is judged under the same evaluation indicators, as shown in Table 10.
Table 10 shows that SARIMAX has limited prediction performance and generalization ability, confirming the advantage of deep learning approaches over traditional statistical ones.
5. Conclusions
Aiming at the research topic of carbon emission trading price prediction, this research proposed the ICEEMDAN-CNN-LSTM model. After the necessary data preprocessing, the original data were decomposed into signal components of different frequencies using the ICEEMDAN method, which retains the characteristic information of the original data. Then, the CNN-LSTM method was used to capture the long- and short-term dependencies of each component and the impact of relevant factors on fluctuations in the research subject’s data, and to obtain predictions for each component. Finally, the prediction results for the original data were obtained by linear addition. Empirical analysis of the closing price of Hubei carbon emission trading compared the performance of multiple prediction methods across a variety of evaluation indicators, and the effectiveness of the proposed model was verified.
At the same time, the robustness of the proposed model was verified by re-dividing the training and test sets. In addition, the proposed model achieves significantly lower prediction errors than the benchmark models, and the improvements are confirmed by the Diebold–Mariano test. Furthermore, the ablation experiment reveals that the low-frequency trend components are the core drivers of carbon price prediction. The advantages over the alternative decomposition method VMD and the traditional statistical method SARIMAX are likewise demonstrated by the corresponding experiments, confirming the effectiveness of the proposed model in capturing the nonlinear dynamics of carbon prices. Through the “decomposition–extraction–fitting” framework, the proposed model successfully separates noise from trends and captures multi-scale features more effectively, thereby predicting carbon prices more accurately and providing a new research path for the field of carbon price prediction.
The proposed model offers several practical implications for carbon market participants and policymakers. First, from the perspective of trading practice, the prediction results of the proposed model can serve as a reference for enterprises and investors to formulate carbon asset allocation strategies. However, it is crucial to acknowledge that the prediction errors, even when minimized, may lead to missing the timing of trades or misjudgments of price trends. Therefore, market participants should regard the outputs of the model as one of multiple decision-making tools with risk management mechanisms. Second, regarding policy dynamics, carbon prices could greatly change when policy is adjusted. Therefore, real-time policy monitoring should be considered to maintain predictive validity. Finally, concerning computational feasibility, the proposed hybrid framework has higher computational cost requirements compared to simpler or single-model approaches. Quantitatively, based on the actual experimental environment, training the full ICEEMDAN-CNN-LSTM model with nine IMF components requires about 55 min on a laptop equipped with an Intel Core i5-10210U CPU (1.60 GHz, 8 Cores) and 16 GB RAM, with no GPU acceleration. The decomposition and multi-component training processes increase both computing time and memory requirements. For practical deployment, trade-offs between prediction accuracy and computational efficiency should be evaluated, and model compression or parallel computing techniques may be explored.
Based on this research, future work can further explore issues related to the carbon emission trading price. First, extending the analysis to other carbon markets, such as the EU ETS, China’s national carbon market, and other regional pilots, would enhance generalizability. Second, the mechanism underlying carbon price formation could be examined, quantifying the impact of policy-making, market supply and demand, energy structure, and other factors on carbon prices. Third, the behavior patterns of the parties involved in carbon emission trading deserve attention, analyzing the mechanisms by which micro-entities, such as enterprises and investors, respond to carbon price fluctuations. Finally, developing reasonable participation strategies based on carbon price prediction results is also worthy of consideration.