4.2. Baseline Data and Influencing Factors
In the analysis of factors influencing carbon allowance transaction price, significant disparities exist among China’s regional carbon markets in aspects such as economic development levels. Current research predominantly focuses on the relatively mature EU Emissions Trading System (EU ETS), while comparative studies on the characteristics of different domestic carbon markets remain relatively scarce. Consequently, it is necessary to first conduct a systematic summary of the influencing factors.
In the academic research field of carbon price-influencing factors, numerous scholars have conducted in-depth explorations from diverse dimensions. Among them, Zhang Yun (2018) [
24] employed an unbalanced panel data model to analyze China’s carbon prices and found that macroeconomic indicators exert a positive impact on carbon emission rights prices. For instance, fluctuations in indicators such as Gross Domestic Product (GDP) [
25] and Industrial Production Index (IPI) [
26] can drive changes in carbon prices to a certain extent. Meanwhile, as fossil energy combustion constitutes the primary source of carbon emissions, the impact of energy market price volatility on carbon trading markets has attracted considerable attention, a viewpoint that has gained widespread consensus in academic circles [
27,
28]. Building on this, Li Feifei et al. [
29] further delved into the correlation between the prices of specific energy types and carbon trading prices, focusing on the mechanism through which price fluctuations of common fossil energies such as crude oil and coal affect carbon prices. Beyond economic and energy factors, the significant impact of climate and environmental factors on carbon trading prices has also been verified by many scholars. Wang Zhonghua’s [
30] research indicated that two climate indicators—temperature and precipitation—exhibit a positive impact on carbon prices, while air humidity shows a significant negative impact, clearly revealing the association between conventional climate factors and carbon prices. Expanding on this research, Xia Ruitong [
31] found that extreme weather events can indirectly affect carbon prices by altering the energy demand structure, enriching the research perspective on the impact of climate and environmental factors on carbon prices. In addition, air quality, as an important component of climate and environmental factors, is also closely linked to carbon prices and has become an indispensable aspect of this research field [
32,
33]. With the continuous development of global carbon markets, the connections between international carbon markets have become increasingly close. This has prompted some scholars to shift their research focus to the international level, investigating the impact of international carbon prices on domestic carbon emission rights trading prices [
34,
35]. Simultaneously, other scholars have taken exchange rates as an entry point, conducting comparative studies between the EU ETS and domestic carbon markets to further explore the similarities and differences in carbon price-influencing factors between international and domestic markets, providing multi-dimensional references for a comprehensive understanding of carbon price fluctuation laws [
36,
37].
The dependent variable of this study is the average transaction price of carbon emission rights, defined as the ratio of the total transaction volume to the total transaction quantity of carbon emission allowances traded monthly in the five major carbon markets (Shenzhen, Guangdong, Hubei, Beijing, and Shanghai). Building on existing research, the indicators influencing this average price are categorized into four dimensions: macroeconomy, energy prices, climate and environment, and international markets. Details are presented in
Table 1:
4.3. Data Preprocessing and Influencing Factor Indicator Selection
This study selects relevant data from China’s five major carbon markets (Shenzhen, Guangdong, Hubei, Beijing, and Shanghai) spanning the period 2018–2024 for empirical analysis to verify the accuracy of the proposed models. It aims to provide a novel method for predicting the average transaction price of carbon trading, enhance prediction precision, and furnish forward-looking price forecasting information for market participants and policymakers. This support facilitates the formulation of scientific decisions and mitigates market risks.
To screen the factors influencing the average transaction price of carbon emission rights, the LASSO algorithm is employed for feature selection. First, the original data is divided into a training set and a test set, and all subsequent preprocessing and feature selection steps are performed only on the training set. Due to significant differences in the dimensions and orders of magnitude among the original data indicators, the data are first subjected to standardization processing: Normal distribution data are standardized using Z-score normalization; non-normal distribution data are normalized using min-max normalization. The normality of the data is assessed using the Shapiro–Wilk test (with
p > 0.05 indicating a normal distribution), supplemented by Q-Q plots for verification [
37]. Subsequently, 10-fold cross-validation is used to determine the λ value, with the goal of minimizing the cross-validation mean squared error (MSE) to find the λ value that achieves a bias-variance tradeoff for the model. Finally, the determined λ value is input into the LASSO model to select the key features.
Based on the LASSO regression, the insignificant influencing factors for each carbon market (i.e., those with a LASSO coefficient of 0) are screened out. The indicator selection results of the five carbon markets (Shenzhen, Guangdong, Hubei, Beijing, and Shanghai) are summarized, and the findings are presented in
Table 2.
Through LASSO regression, the optimal lag orders of exogenous variables affecting carbon prices are identified. The analysis shows that the significant exogenous variables have lagged effects on carbon prices. Therefore, when predicting carbon prices for 2025, only historical data of exogenous variables (already observed values) are needed, without relying on unknown exogenous variable data for 2025. This fully utilizes the delayed response characteristics of carbon prices to exogenous variables, making multi-step forecasting feasible in practical applications.
4.4. Comparison and Selection of Carbon Trading Price Forecasting Models
To ensure the objectivity and scientific validity of model comparisons, this study establishes unified experimental conditions. All experiments were conducted using Python 3.13.5 on Windows 11 (64-bit, AMD64). Monthly average carbon trading price data from five major carbon markets spanning January 2018 to December 2024 are selected as the sample. The Expanding Window evaluation strategy is adopted, with the initial training set comprising 50% of the total sample. Subsequently, the window expands by one time point at each iteration, and the model is retrained using all available historical data to predict the next time point, resulting in a total of N rolling predictions (where N is the sample size of the test set). Three core evaluation metrics are employed: Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE).
Additionally, prediction trend plots and 95% confidence intervals are provided. The confidence intervals are estimated using the Bootstrap method with 1000 resampling iterations, taking the 2.5th and 97.5th percentiles as the lower and upper bounds, respectively. Based on these settings, the adaptability of each model across different markets is comprehensively evaluated. The specific implementation and comparison results of each model are presented as follows.
- (1)
Prediction Accuracy of the ARIMAX-Based Carbon Allowance Transaction Price Model
Taking the Hubei carbon market as an example, in the analysis of time series data, the Augmented Dickey–Fuller (ADF) test is first performed for stationarity verification. The results indicate that all original sequences are non-stationary except for the “Air Quality Index (AQI)” sequence, and all become stationary after first-order differencing (d = 1). Subsequently, combined with the autocorrelation and partial autocorrelation plots of ∇yt, the ARIMAX model is established using Python with repeated order determination and verification. The optimal model is identified as ARIMAX(2, 0, 2) based on the Bayesian Information Criterion (BIC) minimization principle. Finally, validation is conducted through model residual plots and the Ljung–Box test (statistic = 9.767124, p-value = 0.461157), leading to the conclusion that the residuals exhibit no autocorrelation, the model can fully extract time series information, and both the fitting and prediction performances are favorable.
The ARIMAX model specifications for the five markets show distinct differences. Shenzhen and Beijing adopt orders of (0, 0, 2) and (0, 0, 1), respectively, requiring no differencing and relying on moving average terms and residual corrections. Hubei adopts an order of (2, 0, 2), requiring no differencing while depending on both autoregressive and moving average terms. Guangdong is the only market requiring first-order differencing (d = 1), adopting an order of (0, 1, 1) and primarily relying on residual correction. Shanghai adopts an order of (1, 0, 0), requiring no differencing and depending solely on the autoregressive term. Regarding exogenous variables, the USD exchange rate has significant positive effects on Shenzhen and Beijing; CPI has significant negative effects on Shenzhen and Hubei; PPI and WTI crude oil price have significant effects only on Hubei; while all exogenous variables in Guangdong and Shanghai are not significant.
Based on the aforementioned methodology, the ARIMAX model fitting is performed separately for each of the five carbon markets, and the specific models corresponding to each market are presented in
Table 3.
denotes the first-order lag variable of the average carbon trading price; sigma 2 typically represents the variance of the error term; is the white noise error term; and denotes the first-order lag of the white noise error term.
The prediction graphs and 95% confidence intervals for the average transaction price of carbon trading in each market over the next 12 months, generated by the constructed ARIMAX model, are illustrated in
Figure 2.
As illustrated in
Figure 2, carbon prices across markets within the sample period exhibit synchronized cyclical fluctuation characteristics, characterized by low-level volatility in the early stage, a sustained upward surge following 2022, and a subsequent pullback from elevated levels in the later stage. The predicted curves generated by the model are able to closely align with the overall operational trends and short-term fluctuation rhythms of carbon prices across markets, with only minor deviations observed at extreme market turning points and peak prices during sharp rallies. The model demonstrates robust fitting performance and favorable cross-market adaptability with respect to carbon price trajectories across different regions.
- (2)
Carbon Allowance Transaction Price Prediction Based on the CNN Model
The parameters of the CNN model in this study were set as follows:
Table 4 presents the parameter settings of the CNN model. The convolutional layer has 8 input channels (corresponding to the 8 feature variables) and 64 output channels with a kernel size of 1, capturing contemporaneous correlations among features. The first fully connected layer compresses the 64-dimensional features to 32 dimensions to learn high-order nonlinear combinations; the second fully connected layer maps the 32-dimensional features to a 1-dimensional predicted value.
As illustrated in
Figure 3, carbon price trajectories across markets within the sample period exhibit a high degree of synchronization, each undergoing a complete cycle characterized by low-level volatility in the early stage, a sustained upward surge following 2022, and a subsequent pullback from elevated levels commencing at the end of 2023. Both types of window models are able to closely follow the operational trends and fluctuation rhythms of carbon prices across markets, with only minor prediction deviations observed at the peak positions of extreme price spikes. The models demonstrate a pronounced capability for identifying market turning points. Overall, the fitting performance is robust, and the models exhibit favorable adaptability across diverse carbon market environments.
- (3)
Carbon Allowance Transaction Price Prediction Based on the GRU Model
The parameters of the GRU model in this study were set as follows:
Table 5 presents the parameter settings of the GRU model. The input size is 50, indicating that the model processes 50 time steps of historical sequence data at each iteration. The output size is 1, meaning the model predicts the carbon price at the next time point. The timestep is 1, indicating that the model considers only the input features of a single time point for prediction at each step. The batch size is 32, meaning the model processes 32 samples simultaneously in each training iteration.
As illustrated in
Figure 4, carbon prices across markets exhibit a synchronized cyclical trajectory characterized by low-level volatility in the early stage, a sustained upward surge following 2022, and a subsequent pullback from elevated levels commencing at the end of 2023. The predicted curves generated by the model are able to closely align with the overall operational trends and fluctuation rhythms of carbon prices in each market, with only a slight overestimation bias observed at the peak nodes of extreme price spikes. The model demonstrates a pronounced capability for identifying market turning points. Overall, the fitting performance is robust, and the model exhibits favorable adaptability across diverse carbon markets.
- (4)
Carbon Allowance Transaction Price Prediction Based on the Transformer Model
The parameters of the Transformer model were configured as follows:
Table 6 presents the parameter configuration of the Transformer model for carbon allowance transaction price prediction. The input dimension is the number of features in the training data (8 features). The feature dimension is 64, used for embedding mapping. The number of attention heads is 4, enabling parallel capture of feature relationships. The number of encoder layers is 2, enhancing feature extraction capability. The output dimension is 1, producing the predicted carbon price.
As illustrated in
Figure 5, carbon price trajectories across markets exhibit a generally convergent pattern over the sample period, each undergoing a complete cycle characterized by low-level volatility in the early stage, a sustained upward surge following 2022, and a subsequent pullback from elevated levels after late 2023. The predicted curves generated by the model are able to closely follow the fluctuation patterns and overall trends of carbon prices in each market, with only a slight overestimation bias observed at the peak nodes of extreme price spikes. Overall, the model demonstrates robust fitting performance and favorable cross-market adaptability.
- (5)
Comparative Evaluation of Testing Accuracy for Carbon Allowance Transaction Price Prediction
This study selects the Root Mean Square Error (RMSE) as the primary evaluation metric based on valid theoretical and methodological grounds. To provide a more comprehensive assessment of model performance, the Mean Absolute Error (MAE) and the Mean Absolute Percentage Error (MAPE) are also adopted. The test precision of each model with respect to these three metrics is presented in
Table 7.
According to the RMSE, MAE, and MAPE results across the five markets, ARIMAX and GRU are the top two models. ARIMAX performs best in Shenzhen, Guangdong, and Shanghai with minimal errors, while GRU excels in Hubei and Beijing, significantly outperforming CNN and Transformer in error control. The fundamental reason for this performance gap lies in the difference in temporal dependency modeling capabilities. ARIMAX and GRU possess recursive states or autoregressive components, making them naturally suitable for capturing the strong autocorrelation of carbon price series. In contrast, CNN and Transformer are “feed-forward” models that lack the ability to model temporal dependencies. In the small-sample, one-step-ahead rolling window forecasting scenario, recursive models (ARIMAX/GRU) have higher parameter efficiency and better match the data characteristics, thus achieving superior performance.
4.5. Carbon Allowance Price Prediction Based on Optimized ARIMAX and GRU Models
Multiple optimization algorithms were tested, and the following two were found to be the most appropriate for this study.
- (1)
Improved ARIMAX Model and Its Prediction
We apply the SVR-ARIMAX model to the carbon markets, and the obtained prediction results are presented in
Figure 6:
Based on a comparative analysis of the RMSE, MAE, and MAPE metrics across the five markets—Shenzhen, Guangdong, Hubei, Beijing, and Shanghai—the SVR-ARIMAX model achieves significant improvements over the baseline ARIMAX in the majority of markets (
Table 8). Specifically, SVR-ARIMAX yields lower error values across all three metrics in Guangdong, Hubei, and Beijing, with the most pronounced enhancement observed in the Beijing market (RMSE decreasing from 20.8844 to 16.1214, MAPE decreasing from 15.35% to 11.60%) and substantial improvements also evident in the Hubei market (RMSE decreasing from 2.9634 to 2.0948). However, in the Shenzhen and Shanghai markets, SVR-ARIMAX exhibits higher error metrics compared to ARIMAX, with particularly inferior performance relative to the benchmark model in the Shanghai market.
According to the long-term price forecasting results of five Chinese carbon emission trading pilots (Shenzhen, Guangdong, Hubei, Beijing and Shanghai) based on the SVR-ARIMAX model, the historical carbon prices of all markets have undergone a complete operational cycle consisting of low-level accumulation, medium-term sharp rally, and subsequent corrective decline. The in-sample backtesting curve of the model achieves favorable fitting performance in capturing the historical trend and volatility characteristics of carbon prices. In view of the green long-term forecasting curves from 2025 to 2026 and their corresponding 95% confidence intervals, distinct clustered operational patterns are identified among the five markets. Specifically, the two South China markets, Shenzhen and Guangdong, exhibit highly consistent future trajectories: following prior adjustments, Shenzhen will complete a short-term bottom probing and subsequently enter an oscillatory recovery channel, and its moderately wide confidence interval indicates controllable overall uncertainty of future price movements. After profound prior corrections, Guangdong will step into a long-term horizontal consolidation phase; its stable green forecasting curve and moderately broad confidence interval demonstrate weak future price volatility and predominant range-bound operation. The two core markets of Beijing and Hubei share analogous future trend characteristics. Supported by their historically high price benchmarks, both markets present trajectories of narrow oscillatory decline and gradual bottom consolidation within the forecasting horizon, with medium-width confidence intervals signifying mild long-term price adjustment and manageable overall volatility risks. As for the Shanghai market, it features a relatively independent evolution trend. After peaking in the early stage, it will maintain a slow downward trajectory in the future. Meanwhile, its confidence interval is notably wider than that of other pilots, which reflects remarkable long-term downward pressure and relatively higher uncertainty in future price fluctuations.
- (2)
Improved GRU Model and Its Prediction
We apply the GA-GRU model to the carbon markets, and the corresponding prediction results are presented in
Figure 7:
As shown in
Table 9, based on a comparative analysis of the RMSE, MAE, and MAPE metrics across the five markets—Shenzhen, Guangdong, Hubei, Shanghai, and Beijing—the GA-GRU model demonstrates substantial improvements over the standard GRU in the majority of markets. Specifically, GA-GRU achieves lower error values across all three metrics in Shenzhen, Guangdong, Hubei, and Shanghai, with the most pronounced enhancement observed in the Guangdong market, where RMSE decreases from 7.7673 to 4.1427 and MAPE declines from 10.79% to 5.77%. Steady improvements are also evident in the Hubei market. In contrast, within the Beijing market, GA-GRU yields marginally higher RMSE and MAPE values relative to GRU, suggesting that the optimization algorithm’s adaptability to extremely volatile market conditions remains subject to further refinement. Overall, the genetic algorithm-based parameter optimization of GRU effectively reduces prediction errors across most regional carbon markets, thereby enhancing model fitting accuracy and robustness.
According to the long-term price forecasting results of five domestic carbon emission trading pilots (Shenzhen, Guangdong, Hubei, Beijing and Shanghai) based on the GA-GRU model, all markets have undergone a complete operational cycle consisting of low-level consolidation, mid-term volatile rally, and subsequent corrective adjustment in historical carbon price movements. The in-sample backtesting results of the model demonstrate favorable fitting performance, with strong capability in capturing historical price trends and volatility inflection points. In light of the 2025–2026 green long-term forecasting curves and their corresponding 95% confidence intervals, distinct clustered operational patterns are identified across the five markets. The two South China markets, Shenzhen and Guangdong, exhibit highly consistent future trajectories: both will achieve bottom stabilization following prior corrections, followed by moderate oscillatory upward movements. Their moderately narrow confidence intervals indicate low future price volatility risks, signaling a steady recovery trajectory ahead. The two core markets of Beijing and Hubei share analogous trend characteristics; resting on their historically high price benchmarks, they will maintain slow incremental growth within narrow volatility bands. Their compact confidence intervals reflect the strongest price stability among all pilots, with robust long-term price support fundamentals. As for the Shanghai market, it features a relatively independent dynamic, presenting a typical pattern of initial mild downside correction succeeded by volatile rebound and upward recovery. Its comparatively wider confidence interval than other pilots denotes explicit upward recovery potential alongside relatively elevated short-term uncertainty in price fluctuations. On the whole, all five pilot carbon markets will terminate their prior unilateral downward adjustment after 2025, collectively stepping into a new phase of stabilization, recovery and gradual upward elevation of price benchmarks. Heterogeneities in recovery pace and volatility amplitude merely arise from inherent regional market differentiation, while the reasonable coverage of overall forecasting intervals verifies the high reliability of the model’s projections on future carbon price dynamics.