Carbon Price Forecasting and Market Characteristics Analysis in China: An Integrated Approach Using Overall and Market-Specific Models

Sun, Weibao; Gao, Yafang; Yang, Xuemei; Zhang, Yalong; Hu, Haolin

doi:10.3390/su17125407

Open AccessArticle

Carbon Price Forecasting and Market Characteristics Analysis in China: An Integrated Approach Using Overall and Market-Specific Models

by

Weibao Sun

¹

,

Yafang Gao

^1,2,3,*,

Xuemei Yang

^2,3,4,

Yalong Zhang

⁵

and

Haolin Hu

¹

College of Tourism, Northwest Normal University, Lanzhou 730070, China

²

School of Tourism, Lanzhou University of Arts and Science, Lanzhou 730030, China

³

Gansu Cultural and Tourism Industry Research Institute, Lanzhou 730030, China

⁴

Observation Station of Subalpine Ecology Systems in the Middle Qilian Mountains, Xining 810000, China

⁵

School of Geographic Sciences, Qinghai Normal University, Xining 810008, China

^*

Author to whom correspondence should be addressed.

Sustainability 2025, 17(12), 5407; https://doi.org/10.3390/su17125407

Submission received: 29 April 2025 / Revised: 25 May 2025 / Accepted: 7 June 2025 / Published: 11 June 2025

Download

Browse Figures

Versions Notes

Abstract

Carbon markets play a pivotal role in achieving carbon peaking targets, with accurate price forecasting being essential for effective policymaking and corporate decision making. This study develops an integrated forecasting framework, combining an overall market model and a market-specific model, to predict carbon price trends in China from 2025 to 2026, while examining inter-market heterogeneity across eight regional markets. The overall market forecast reveals a fluctuating upward trend in the national carbon price over the next two years. Market-specific forecasts highlight significant disparities in price trends, as follows: the Shanghai and Guangzhou markets are projected to experience faster growth and the Beijing market to maintain stable prices, while the Tianjin and Chongqing markets exhibit more moderate increases. These disparities reflect the profound influence of regional economic levels, policy enforcement, and market maturity on carbon market development. By incorporating seasonal fluctuations and stochastic disturbances, we construct a forecasting model aligned with historical data dynamics and achieve differentiated forecasts through the analysis of historical price levels across markets, addressing the limitations of uniform target pricing in prior studies. These findings offer actionable insights for carbon market participants and policymakers, providing a robust foundation for designing differentiated carbon pricing policies to support China’s carbon peaking objectives.

Keywords:

carbon price forecasting; China’s carbon market; market heterogeneity; overall market model; market-specific model

1. Introduction

Global climate change has emerged as one of the most pressing challenges facing humanity in the 21st century, with extreme weather events occurring more frequently and carbon emissions drawing significant attention [1]. Carbon markets, as market-based mechanisms for emissions reduction, incentivize firms to lower greenhouse gas emissions through emissions trading, establishing themselves as a critical global strategy for addressing climate change [2]. Since its inception in 2005, the European Union Emissions Trading System (EU ETS) has become the world’s largest carbon market, offering valuable insights into price volatility and market mechanisms for other regions [3]. As the world’s largest carbon emitter, China initiated regional carbon trading pilots in 2011 and officially launched its national carbon market in 2021, marking a new phase in the development of its carbon market [4]. By December 2024, the Shanghai Environment and Energy Exchange recorded a cumulative trading volume of 419 million tons, making it the largest pilot market in China by transaction scale [5].

Carbon prices serve as the cornerstone signal of carbon markets, reflecting supply–demand dynamics while providing critical guidance for governments in designing emissions reduction policies and for firms in making investment decisions [6]. Accurate carbon price forecasting enables governments to formulate evidence-based carbon pricing policies, mitigate market volatility risks, and supports firms in optimizing investment strategies and asset allocation [7]. However, carbon prices are influenced by a multitude of factors, including energy prices, financial market fluctuations, policy shifts, climate variables, and market sentiment, exhibiting non-linear, non-stationary, and highly volatile characteristics [8]. Furthermore, carbon prices are shaped by macroeconomic cycles, international trade patterns, and geopolitical events, adding further complexity to forecasting efforts [9]. For instance, Creti et al. found that EU ETS carbon prices experienced significant shocks during the financial crisis, demonstrating a strong correlation with energy markets [10].

Research on carbon price forecasting has an extensive history, primarily grounded in traditional time series methodologies. Models like the Generalized Autoregressive Conditional Heteroskedasticity (GARCH) and the Autoregressive Integrated Moving Average (ARIMA) were often held back by their reliance on manual feature engineering and their struggle to pick up on non-linear patterns, which kept their forecasting results from being as good as they could be [11]. Lately, though, things have shifted—methods from machine learning and deep learning have really taken off in this field. Tools like Support Vector Machines (SVMs) and Random Forests (RFs) have shown they can handle a bunch of different variables pretty well, offering a lot of reliability in the process [12], Meanwhile, Long Short-Term Memory (LSTM) networks have turned out to be particularly good at spotting long-term connections in time series data [13]. On top of this, some researchers have managed to make their predictions even sharper by bringing in outside factors—like energy prices, financial market signals, or even less structured data like online news sentiment [14]. A good example here is Byun and Cho [14], who put together an indicator based on news sentiment and ended up with much better forecasting results for EU ETS carbon prices [15].

Even so, there are still a few shortcomings in the current body of research that need addressing. Many studies overlook the multi-scale features and structural breaks in carbon price series, hindering models’ ability to capture the complex patterns of price volatility [16]. The majority of research has focused on the EU ETS [17], with relatively limited attention to forecasting China’s carbon market, particularly the heterogeneity across its national and regional pilot markets [18]. China’s carbon market exhibits pronounced regional disparities; for instance, the Beijing market experiences significant price volatility driven by policy interventions, whereas the Tianjin market remains relatively stable. Furthermore, existing studies often focus solely on point forecasts, neglecting the quantification of price volatility ranges and uncertainty, which limits their practical utility in decision making [19]. Additionally, some studies fail to adequately account for the uncertainty of policy scenarios, such as the implementation of carbon peaking and neutrality targets, which could profoundly influence carbon prices [20].

Building on the aforementioned research gaps, this study proposes a novel carbon-price forecasting framework that integrates an overall market model and a market-specific model to predict price trends in China’s carbon market from 2025 to 2030. The overall market model focuses on the national carbon market’s general trajectory, while the market-specific model targets price forecasts for eight pilot markets (Beijing, Chongqing, Fujian, Guangzhou, Hubei, Shanghai, Shenzhen, and Tianjin), accounting for inter-market heterogeneity. The primary innovations of this study are threefold, as follows: (1) it constructs a forecasting model aligned with historical data dynamics by incorporating seasonal fluctuations and stochastic disturbances; (2) it sets differentiated target prices based on historical price levels of individual markets, analyzing the impact of market heterogeneity on price trends; and (3) it combines point forecasts with volatility analysis to provide more comprehensive price forecasting insights. This research aims to offer a scientific foundation for carbon market participants and policymakers, contributing to the development of China’s carbon market system and the realization of its dual-carbon goals.

2. Data and Methods

2.1. Data Sources

This study utilizes daily historical trading data from China’s eight major carbon markets, spanning the period from 25 June 2021, to 31 December 2024. The data were sourced from the publicly accessible the Greenhouse Gas Voluntary Emissions Reduction Trading Platform (Greenhouse Gas Voluntary Emission Reduction Trading Platform (Source of Original Transaction Data): https://ets.sceex.com.cn/internal.htm?orderby=tradeTime%20desc&pageSize=14&k=guo_nei_xing_qing&url=mrhq_gn&pageIndex=1, accessed on 30 January 2025), encompassing records from the Shanghai Environment and Energy Exchange, Beijing Green Exchange, Tianjin Emission Rights Exchange, Guangzhou Carbon Emission Rights Exchange, Hubei Carbon Emission Trading Center, Fujian Haixin Trading Center, Shenzhen Green Exchange, and Chongqing Carbon Emission Rights Trading Center. The raw dataset includes fields such as trading date, trading institution, trading product, opening price, highest price, lowest price, average transaction price, closing price, trading volume, and transaction value, comprising a total of 8309 records.

2.2. Data Cleaning and Preprocessing

To ensure data accuracy and consistency, this study applied the following cleaning and preprocessing steps to the raw dataset:

(1): Handling Missing Values

Missing values, originally denoted by “-” in the raw data, were replaced with NaN. For price-related fields (opening price, highest price, lowest price, average transaction price, and closing price), a forward fill method was employed, grouped by market, to impute missing values. This method fills missing entries by propagating the most recent non-missing value forward in chronological order (from earlier to later dates) within each market’s time series, ensuring that all subsequent missing values are replaced with the same value until a new non-missing value is encountered. This approach preserves the temporal continuity of the price data, which is essential for maintaining the integrity of time series analysis in carbon-emission trading markets. Grouping by market ensures that the imputation is performed independently within each market, preserving market-specific price trends. For trading volume and transaction value fields, missing values were filled with 0, indicating no trading activity.

(2): Standardization of Institution Names

Due to variations in market names across different time periods (e.g., “Beijing Environment Exchange” versus “Beijing Green Exchange”), a mapping table was used to standardize these names (e.g., unified as “Beijing Green Exchange”). Data from the eight major pilot markets were retained, while records from irrelevant exchanges (e.g., “European Energy Exchange”) were excluded.

(3): Merging Market Data

The Shenzhen market includes multiple trading products (e.g., SZA2013 to SZA2020). To facilitate a unified analysis, Shenzhen market data within the same trading dates were consolidated into a single product (SZEA) using a volume-weighted average. The formula for calculating the weighted average price is as follows:

P_{S Z E A} = \frac{\sum_{i = 1}^{n} {(P}_{i} {\cdot V}_{i})}{\sum_{i = 1}^{n} V_{i}}

where

P_{S Z E A}

represents the weighted average price after consolidation;

P_{i}

denotes the average transaction price of the i-th trading product;

V_{i}

indicates the trading volume of the ith trading product; and n is the total number of trading products. The trading volume and transaction value were calculated as the cumulative sums of the respective values across all products.

(4): Outlier Removal

To eliminate outliers in the dataset, the following two methods were applied:

Threshold Method: To accurately reflect the data distribution, a detailed statistical analysis of the raw data was conducted, and the price threshold was set at 148.3 CNY/ton. Figure 1 presents a boxplot of the transaction prices, where the Interquartile Range (IQR) is calculated as the difference between the third quartile (Q3 = 77.95 CNY/ton) and the first quartile (Q1 = 31.04 CNY/ton), resulting in an IQR of 46.91 CNY/ton. Using the standard outlier detection method (Q3 + 1.5 × IQR), the upper threshold was determined to be 148.3 CNY/ton.

Three Standard Deviation Method: For each market, the mean (

μ

) and standard deviation (

σ

) of each price field were calculated. Data points falling outside the range (

μ \pm 3 σ

) were identified as outliers, replaced with NaN and subsequently imputed using the forward fill method to address missing values. The calculation formula is as follows:

Outlier Range = [μ - 3 σ, μ + 3]

(5): Data Validation

Theoretical transaction values were calculated using the average transaction price and trading volume (

A_{c a l c}

= P·V), and compared with the actual transaction values. If the actual value was missing or the discrepancy exceeded 1 CNY, the theoretical value was used as a replacement to ensure data consistency. The cleaned dataset was saved in CSV format, encompassing trading records from the eight major markets, with fields including trading date, market name, trading product, opening price, highest price, lowest price, average transaction price, closing price, trading volume, and transaction value.

2.3. Feature Analysis Methods

To comprehensively examine the trading characteristics of China’s eight major carbon markets, this study conducts Exploratory Data Analysis (EDA) from the following four perspectives: price trends, trading volume distribution, price volatility, and market correlations. The specific methods are as follows:

(1): Price Trend Analysis

The average transaction price trends over time were plotted for each market, reflecting price levels and volatility patterns. Price trend graphs were constructed as line charts, with the x-axis representing trading date, and the y-axis representing the average transaction price (in CNY/ton). The y-axis was scaled from 0 up to 160 CNY/ton, making sure it captures the full span of price variations you would expect across all of the markets.

(2): Trading Volume Distribution Analysis

To get a sense of how active and liquid each market is, we worked out the total trading volume for each one. This total volume was figured out by adding up the daily trading amounts, and the formula for that is laid out below:

V_{t o t a l, m} = \sum_{t = 1}^{T} V_{m, t}

where

V_{t o t a l, m}

stands for the overall trading volume in market m;

V_{m, t}

refers to the amount traded in market m on the t-th day; and T marks the total count of days that trading took place.

(3): Price Volatility Analysis

To gauge how much prices fluctuate, we used the standard deviation as our metric, figuring out the standard deviation of the average transaction price for each market. The formula for this is laid out below:

σ_{m} = \sqrt{\frac{1}{T_{m}} \sum_{t = 1}^{T_{m}} {(p}_{m, t} - {\bar{p}}_{m})^{2}}

where

σ_{m}

shows how much prices in market m tend to vary;

p_{m, t}

refers to the average price of transactions in market m on the t-th day;

\bar{p}

_m stands for the overall average price in market m; and

T_{m}

marks the total number of days that trading happened in market m.

(4): Inter-Market Price Correlation Analysis

To see how closely prices move together across different markets, we worked out the Pearson correlation coefficient for the average prices among them, with the formula for this given below:

ρ_{m, n} = \frac{\sum_{t = 1}^{T} {(p}_{m, t} - {\bar{p}}_{m}) {(p}_{n, t} - {\bar{p}}_{n})}{\sqrt{\sum_{t = 1}^{T} {(p}_{m, t -} {\bar{p}}_{m})^{2} \cdot \sqrt{\sum_{t = 1}^{T} (p_{n, t} - {\bar{p}}_{n})^{2}}}}

where

p_{m, n}

represents the correlation coefficient between markets m and n;

p_{m, n} a n d p_{n, t}

denote the average transaction prices of markets m and n on day t, respectively;

\bar{p}

_m and

\bar{p}

_n indicate the mean prices of markets m and n, respectively; and T is the number of common trading days. The correlation coefficient ranges from −1 to 1.

Data cleaning and feature analysis were implemented using the Python programming language, primarily relying on the following libraries: pandas for data processing and cleaning and matplotlib and seaborn for data visualization. The data cleaning and analysis code was executed in a Python 3.11 environment, ensuring the reproducibility and reliability of the results.

2.4. Feature Engineering

To enhance the performance of subsequent forecasting models, this study conducted feature engineering on the cleaned dataset, extracting the following two categories of features:

(1): Internal Features

Lagged Features: The average transaction prices from the previous 1 day and 7 days (denoted as

P_{t - 1} a n d P_{t - 7}

, respectively) were extracted to capture the temporal dependency of prices. Lagged features were computed by applying a time shift to the price series after grouping by market, with the formula as follows:

P_{t - k, m} {= P}_{t - k, m}, k \in [1, 7]

where

P_{t - k, m}

represents the average transaction price of market m on day t − k.

Moving Average Features: The 7-day moving average price (denoted as M

A_{7, t, m}

) was calculated to smooth price fluctuations and capture short-term trends. The formula for the moving average price is as follows:

{M A}_{7, t, m} = \frac{1}{7} \sum_{i = 0}^{6} P_{t = i, m}

where

P_{t - i, m}

represents the average transaction price of market m on day t − i. If fewer than 7 days of data were available, the calculation was performed using the available data.

Temporal Features: The month of the trading date (denoted as M_t, ranging from 1 to 12) was extracted to capture the seasonal variations in prices. Additionally, the compliance cycle was identified (denoted as C_t), with June and July defined as the compliance period (C_t = 1), and other months as the non-compliance period (C_t = 0), to reflect the potential impact of compliance cycles on prices.

Trading Volume Features: The daily trading volume (denoted as V_t,m), was directly used as an indicator of market activity.

(2): Market Features

Market names (Market) were transformed into dummy variables through one-hot encoding, generating eight binary features (denoted as D_m,i, i

\in {1,2, \cdot \cdot \cdot, 8}

), indicating whether a trading record pertains to a specific market. For instance, if a record belongs to the Shanghai market, then D_Shanghai = 1, while the features for other markets are set to 0. The formula for generating dummy variables is as follows:

D_{m, i} = \{\begin{matrix} 1, i f a r e c o r d p e r t a i n s t o m a r k e t i \\ 0, o t h e r w i s e \end{matrix}

2.5. XGBoost Model

To forecast carbon prices, this study employs XGBoost (Extreme Gradient Boosting) as the primary machine learning model. XGBoost, a gradient boosting algorithm based on decision trees, is widely adopted for regression tasks due to its efficiency, flexibility, and robust capacity to model multi-feature data. By integrating multiple decision trees, XGBoost effectively captures non-linear relationships and interaction effects among features, making it well-suited for handling the complex time series data inherent in carbon price forecasting.

3. Results Analysis

3.1. Carbon Market Price Trend Analysis

Between June 2021 and December 2024, the average transaction prices (in CNY/ton) for China’s eight main carbon markets are shown over time in Figure 2, highlighting just how different these markets can be when it comes to price levels, how much prices swing, and what might be driving those changes. The Beijing market recorded the highest price levels among the eight markets, with prices ranging from 80 to 120 CNY/ton and occasionally approaching 140 CNY/ton. These elevated prices are likely attributable to Beijing’s role as China’s political and economic center, reinforced by stringent carbon emission regulations. However, the pronounced price fluctuations reflect a heightened sensitivity to external factors, such as policy interventions or compliance deadlines. On the other hand, the Shanghai market kept things much steadier, with prices staying within a tighter range of 60 to 80 CNY/ton and following a fairly smooth path overall, which really shows what a well-established carbon market looks like. Shanghai’s solid trading systems and high liquidity did a good job of keeping price swings in check, likely thanks to its role as a financial hub and its early start in carbon trading. Meanwhile, the Guangzhou market saw some pretty noticeable ups and downs, with prices ranging from 30 to 70 CNY/ton. After we took out some outliers—prices that went above 200 CNY/ton—there was a clear downward trend in 2023, dropping from 50 CNY/ton to around 35 CNY/ton. Those swings might come from how sensitive the market is to local economic conditions and policy shifts, plus the fact that it is a smaller market overall. The Hubei market demonstrated the lowest volatility, consistently ranging between 35 and 45 CNY/ton, a stability consistent with its high trading volume, indicating robust liquidity and broad participant engagement that effectively buffered price fluctuations. The Tianjin, Fujian, Shenzhen, and Chongqing markets recorded lower price levels, with fluctuation ranges between 20 and 60 CNY/ton. Among these, Tianjin exhibited the greatest stability (around 30 CNY/ton), possibly due to lenient emission targets or limited market activity, while Shenzhen showed moderate volatility (potentially due to diverse trading products like SZEA). Fujian and Chongqing displayed a gradual upward trend over time, reflecting increasing market maturity. These regional disparities in price dynamics may present opportunities for cross-market arbitrage, underscoring the need for coordinated national policies to promote price convergence and enhance market efficiency. Additionally, the removal of outliers ensured the reliability of the analysis, mitigating distortions from data entry errors or non-representative transactions.

3.2. Trading-Volume Distribution Analysis

The total trading volume (in tons) of China’s eight major carbon markets from 2021 to 2024, as shown in Figure 3, was calculated by aggregating the daily trading volumes, reflecting the trading activity and liquidity levels of each market. Specific values annotated on the bar chart enhance the visual clarity of the data. The Shanghai market led with a total trading volume of 419 million tons, far surpassing other markets and accounting for a substantial share of the overall trading activity. This dominant position aligns with Shanghai’s role as a financial hub and a pioneering carbon trading pilot, where robust trading infrastructure and supportive policies have attracted significant participation, markedly enhancing market liquidity. The Guangzhou and Hubei markets followed, with total trading volumes of 41.64 million tons and 36.54 million tons, respectively, ranking second and third. This indicates strong trading activity in these regional markets, likely driven by local industrial activity and policy incentives. Hubei’s high trading volume complements its low price volatility, reflecting a market characterized by high liquidity and stability. The Fujian, Chongqing, Tianjin, and Shenzhen markets had trading volumes that sat in the middle of the pack, going from 15.34 million tons in Shenzhen all the way up to 25.82 million tons in Fujian. Fujian’s higher numbers might have something to do with its focus on certain areas—like forestry carbon sink trading—while the smaller volumes in Chongqing and Tianjin point to less action in those markets, probably because they are smaller in scale or do not have as many players involved. Conversely, the Beijing market registered the lowest total trading volume at 11.60 million tons, despite its elevated prices. This constrained trading volume likely contributes to its pronounced price fluctuations, as reduced liquidity heightens vulnerability to supply–demand imbalances or external disruptions. These differences in trading volumes across regions really drive home how important liquidity is in carbon trading. If we could boost the trading volumes in markets that are not as active, it might help make things run more smoothly and keep prices steadier, and the way Shanghai and Hubei have managed things could offer some useful ideas for others to follow.

3.3. Price Volatility Characteristics Analysis

Figure 4 shows how much prices in China’s eight main carbon markets fluctuated between 2021 and 2024, using the standard deviation of the average transaction prices (in CNY/ton) as the yardstick, with the exact numbers marked on the bar chart to make the data easier to read and understand. The Beijing market displayed the highest price volatility, with a standard deviation of 26.93 CNY/ton, consistent with its limited trading volume. This tells us that markets with less trading activity tend to get hit harder by things like supply–demand imbalances or outside shocks—say, policy changes—leading to some pretty big price jumps. The Shenzhen market came in second for price ups and downs, with a standard deviation of 19.57 CNY/ton, putting it at a middle-of-the-road level, likely because of its mix of trading products (like SZEA) and the different dynamics at play in that market. Guangzhou’s market had a standard deviation of 15.85 CNY/ton, which matches the noticeable swings in its price trends, probably due to how much it is affected by local economic conditions, policy shifts, and its smaller overall size. On the other hand, the Shanghai market kept its prices much steadier, with a standard deviation of just 11.85 CNY/ton, matching its stable price patterns and high trading volume. Its strong liquidity really helped keep price swings in check, making it a market you can predict more easily. The Fujian, Chongqing, Hubei, and Tianjin markets did not see much price movement, with standard deviations ranging from 3.70 CNY/ton in Tianjin to 7.02 CNY/ton in Fujian. Hubei’s low number (4.86 CNY/ton) goes hand in hand with its high trading volume, showing that its strong liquidity and large number of participants do a good job of smoothing out price changes, while Tianjin’s tiny fluctuations might come from its low trading activity, meaning prices just do not move much. The link between price swings and trading volume here suggests that getting more activity and liquidity into quieter markets could help tone down price fluctuations, making the market work better and easier to predict. The way Hubei and Shanghai have kept their prices steady offers some useful ideas for other regional markets to think about.

3.4. Inter-Market Price Correlation Analysis

Figure 5 lays out a correlation matrix for the average transaction prices across China’s eight main carbon markets from 2021 to 2024, with correlation coefficients going from −1 to 1. The heatmap uses a color scale from cool to warm tones—red for positive correlations and blue for negative ones—to show just how varied the price connections among markets. The Shenzhen market exhibits strong positive correlations with Beijing (0.64), Fujian (0.68), and Shanghai (0.61), suggesting highly synchronized price movements, potentially driven by shared economic factors, similar policy frameworks, or inter-market arbitrage activities. Similarly, the Fujian market shows notable correlations with Tianjin (0.60) and Beijing (0.55), indicating a degree of price linkage among these markets. The Shanghai market has a middling level of correlation with Beijing (0.55), Fujian (0.46), and Tianjin (0.46), which points to some degree of price connection among them, likely shaped by things like national policies or efforts to tie markets together more closely. Meanwhile, the Guangzhou and Hubei markets show a correlation of 0.30, hinting that they might be influenced by similar local factors—think industrial activity or emission reduction goals. On the other hand, the Hubei market does not seem to move much in sync with most other markets, with correlation numbers running from just 0.07 (Tianjin) to 0.30 (Shenzhen). That independence probably has a lot to do with its high trading volume and steady pricing, which makes it less affected by price ups and downs in other markets. The Chongqing market demonstrates negative correlations with Hubei (−0.30) and Guangzhou (−0.30), indicating that its price movements often diverge from these markets, possibly reflecting differences in regional economic conditions or policy priorities. The diverse patterns of inter-market price correlations suggest a degree of market integration between Shenzhen, Beijing, Fujian, and Shanghai. However, the independence of the Hubei market and the negative correlations of the Chongqing market highlight regional disparities in market dynamics, underscoring the need for coordinated policies to promote price convergence, reduce regional disparities, and enhance the overall efficiency of China’s carbon trading system.

3.5. Feature Engineering Results Analysis

Feature engineering was performed on the cleaned dataset, extracting internal and market features to provide multidimensional input data for subsequent carbon price forecasting models. Internal features include lagged features (average transaction prices from the previous 1 and 7 days), 7-day moving average prices, temporal features (month and compliance cycle), and daily trading volume. Market features were generated by transforming market names into dummy variables, covering the eight major markets. Following feature generation, the dataset was expanded with 13 additional feature fields, resulting in a total of 23 fields. Lagged features effectively capture the temporal dependency of prices, temporal features reflect the potential influence of compliance cycles on prices, and market features highlight inter-market disparities, providing robust data support for modeling price dynamics and market heterogeneity in subsequent analyses.

Exploratory the data analysis of China’s eight major carbon markets from 2021 to 2024 reveals significant heterogeneity in price dynamics, trading activity, and market integration. The Beijing market exhibits the highest price levels and volatility but the lowest trading volume at 11.60 million tons, indicating limited market activity and high sensitivity to external shocks. In contrast, the Shanghai and Hubei markets demonstrate advantages in liquidity and market maturity, with high trading volumes (419 million tons and 36.54 million tons, respectively) and low volatility (standard deviations of 11.85 CNY/ton and 4.86 CNY/ton, respectively). Price correlation analysis indicates strong correlations between the Shenzhen market and Beijing, Fujian, and Shanghai (correlation coefficients of 0.64, 0.68, and 0.61, respectively), while the Hubei market remains relatively independent, and the Chongqing market shows negative correlations (coefficients of −0.30 with both Hubei and Guangzhou). These findings suggest that increasing participation and liquidity in less active markets, such as Beijing and Tianjin, could enhance price stability and market efficiency. Moreover, coordinated policies are needed to reduce regional disparities, promote price convergence across markets, and support the development of a unified national carbon market in China.

4. Modeling and Simulation Validation

4.1. Modeling Approach

In this study, the hyperparameters of the XGBoost model were set as follows: the number of trees (n_estimators) at 100, the learning rate (learning_rate) at 0.1, the maximum tree depth (max_depth) at 5, and the random seed (random_state) at 42. These parameters were determined through preliminary experiments to balance predictive accuracy and computational efficiency.

4.2. Data Partitioning and Feature Selection

This study utilized historical data from China’s eight major carbon exchanges (Beijing, Chongqing, Fujian, Guangzhou, Hubei, Shanghai, Shenzhen, and Tianjin) spanning 25 June 2021, to 31 December 2024. The dataset includes carbon price (Average Price)alongside a range of feature variables.

To evaluate the model’s predictive performance, the data were chronologically split into a training set and a test set, with the training set comprising 80% of the data (25 June 2021 to 31 March 2024) and the test set comprising 20% (1 April 2024 to 31 December 2024). The carbon price range in the training set was 3.45 to 149.64 CNY/ton, while the test set ranged from 25.5 to 67.06 CNY/ton. This partitioning ensures that the model is trained on earlier data and tested on later data, aligning with the practical requirements of time series forecasting.

For feature selection, a total of 18 features were employed, including lagged features (lag1_pricelag7_price and ma7_price), temporal features (month and is_compliance_period), market features (market_Beijing), and external variables (industrial growth rate and coal price). These features were generated through feature engineering to capture short-term price trends, seasonal effects, market disparities, and the influence of external economic factors.

4.3. Model Training and Performance Evaluation

Figure 6 presents the overall carbon price forecasting performance of China’s carbon market from 2021 to 2024, encompassing both the training and testing phases. The training set spans 25 June 2021 to 31 March 2024, while the testing set covers 1 April 2024 to 31 December 2024. Each market is represented by the following two lines: a solid line indicating the actual carbon price (Train/Test Actual) and a dashed line representing the predicted carbon price. To facilitate differentiation, each market is assigned a distinct color. During the training phase, the model demonstrates strong fitting capability, achieving an overall R² value of 0.83, indicating its ability to effectively capture historical price patterns. In the testing phase, the model exhibits robust predictive performance, with an overall R² value of 0.89, reflecting strong generalization ability. This indicates that the model achieves high accuracy and stability in predicting carbon prices.

Figure 7a–h shows how well the market-specific models predicted carbon prices for China’s eight carbon markets—Beijing, Chongqing, Fujian, Guangzhou, Hubei, Shanghai, Shenzhen, and Tianjin—from 2021 to 2024. The figure is split into eight smaller plots, labeled (a) to (h), each one matching up with a market and showing the actual and predicted prices for both the training and testing phases. In these plots, blue and green solid lines mark the real prices for the training and testing sets (labeled Train Actual and Test Actual), while the red and orange solid lines show what the model predicted (Train Predicted and Test Predicted). Looking at the figure, you can see that during the training phase, the model’s predictions matched up really closely with the actual prices across all of the markets, which suggests it is pretty good at picking up on historical price patterns. But in the testing phase, the results differ quite a bit depending on the market. For example, in the Chongqing market (plot b) and Fujian market (plot c), the predicted prices are pretty far off from the actual ones, while in the Shenzhen market (plot g) and Tianjin market (plot h), the predictions stay much closer to reality. This points to the idea that the market-specific model struggles a bit when dealing with markets where prices swing a lot, so there is definitely some room to tweak it to get better at predicting down the line.

This study first trained an overall model by aggregating historical data from the eight major markets to construct a single XGBoost model, capturing the common patterns across all markets. To further examine inter-market differences, separate XGBoost models (market-specific models) were trained for each market, enabling a comparison of the predictive performances between the overall and market-specific models. The model performance was evaluated using the following three metrics: the Root Mean Square Error (RMSE), which measures the average error between predicted and actual values; the Mean Absolute Error (MAE), which quantifies the absolute deviation between predicted and actual values; and the coefficient of determination (R²), which assesses the model’s explanatory power, with values closer to 1 indicating a better model fit. The performance results of the overall model are presented in Table 1.

As shown in Table 1, the overall model performed robustly on both the training and testing sets, achieving an R² of 0.89 on the testing set, which indicates strong generalization capability. The RMSE and MAE values for the training and testing sets are closely aligned (7.14 and 5.07 for RMSE; 5.44 and 4.06 for MAE, respectively), suggesting that the model exhibits no significant overfitting issues.

As shown in Table 2, the performance of the market-specific models varies across markets. The Shanghai market demonstrates the highest forecasting accuracy, with a testing set R² of 0.92, an RMSE of 3.12 CNY/ton, and an MAE of 2.41 CNY/ton, indicating excellent predictive capability. The Guangzhou and Tianjin markets also exhibit strong performance, with testing set R² values of 0.89 (RMSE: 5.19 CNY/ton, MAE: 4.17 CNY/ton) and 0.86 (RMSE: 1.46 CNY/ton, MAE: 1.17 CNY/ton), respectively, with Tianjin showing the smallest prediction error among all markets. In contrast, the Chongqing market yields the lowest forecasting performance, with a testing set R² of 0.71, an RMSE of 3.57 CNY/ton, and an MAE of 2.54 CNY/ton, suggesting that the model struggles to capture price patterns effectively in this market. Additionally, while the Beijing and Shenzhen markets achieve reasonable testing set R² values of 0.83 and 0.80, their higher RMSE values of 11.02 CNY/ton (MAE: 8.45 CNY/ton) and 8.33 CNY/ton (MAE: 5.93 CNY/ton), respectively, indicate substantial prediction errors, particularly in Beijing, where the error is the largest across all markets. These performance differences likely arise from variations in market characteristics, such as greater price volatility or smaller data volumes in Beijing and Shenzhen, and potentially more complex price dynamics in Chongqing, which may challenge the model’s ability to generalize effectively.

To clearly show how well the models performed in their predictions, this study put together scatter plots of the results for both the overall model and the market-specific ones, as shown in Figure 8a–i. Subplots (a) to (h) depict the forecasting performance of the market-specific models for the eight carbon exchanges, while subplot (i) illustrates the performance of the overall model. In the plots, blue dots stand for the training set, while red dots mark the testing set. Plot (a) gives the prediction results for the Beijing market, where the model did not do so well, probably because there is not much data to work with and prices in that market swing a lot, which might have caused the model to overfit. Plot (b) presents the results for the Chongqing market, for which the model did an okay job, likely since trading activity in Chongqing is on the lower side and the data patterns are a bit tricky to pin down. Plot (c) shows how the model did for the Fujian market, and it was not great there either—the predicted curve does not match the actual prices in the testing set, likely because Fujian’s market does not have a lot of data to go on. Subplot (d) illustrates the results for the Guangzhou market, for which the model performed well, with the predicted curve closely aligning with part of the price trend in the testing set, though some deviations occur during abrupt price changes, likely influenced by the market’s high sensitivity to policy factors. Subplot (e) shows the forecasting performance for the Hubei market, for which the model performed well, capturing the main price trends in the testing set, but with some errors during large fluctuations, likely due to the complex data patterns in this market. Subplot (f) presents the results for the Shanghai market, for which the model performed well, with the predicted curve fitting most price trends in the testing set, though deviations occur at certain abrupt change points, potentially due to the limited data volume in this market. Subplot (g) displays the forecasting performance for the Shenzhen market, for which the model performed well, effectively capturing the main price trends in the testing set, particularly during stable periods, where the model demonstrated consistent performance. Subplot (h) illustrates the results for the Tianjin market, for which it exhibited the best forecasting performance, with the predicted curve in the testing set closely matching the actual values, indicating strong predictive capability in this market, likely due to the relatively stable data patterns in Tianjin. Subplot (i) presents the forecasting performance of the overall model, demonstrating that the model effectively fits the actual prices in both the training and testing sets, with the predicted values closely aligning with actual values. Notably, in the testing set, the predicted curve accurately captures the price fluctuation trends, validating the model’s generalization capability.

4.4. Comparative Analysis of Predictive Models

The performance of the XGBoost model in forecasting carbon prices across the overall carbon market was validated through comparative experiments to ensure the scientific rigor and reliability of the model selection. The comparison included a baseline naive model, as well as Random Forest and ARIMA models, representing typical applications of simple forecasting methods, machine learning models, and traditional time series approaches, respectively.

The comparative experiment was based on the overall carbon market price series described earlier, with the training and testing sets split in an 80:20 ratio. The feature set includes lagged price features (1-day, 7-day, and 14-day lagged prices and 7-day moving average), external factors (industrial growth rates and coal prices along with their lagged values and moving averages), and additional indicators such as time trends and volatility metrics. To ensure fairness, all models were trained and tested on the same data, and their performance was evaluated using the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and the coefficient of determination (R²).

Table 3 presents the performance comparison of each model on the training and testing sets. The XGBoost model demonstrated the best performance on the testing set, achieving an R2 value of 0.89, with RMSE and MAE values of 5.07 and 4.06, respectively, significantly outperforming other models. In contrast, the naive model yielded a testing set R2 of only 0.17, while the Random Forest and ARIMA models achieved testing set R2 values of 0.55 and 0.33, respectively. These results highlight XGBoost’s superior predictive accuracy and stability, effectively capturing the dynamic changes in carbon prices.

4.5. Feature Importance Analysis

To identify the key drivers influencing carbon prices, this study leveraged the feature importance functionality of the XGBoost model to analyze the contribution of features in both the overall model and the eight market-specific models. The feature importance results for the overall model are presented in Figure 9.

As shown in Figure 9, the feature ma7_price (7-day moving average price) emerged as the most significant predictor in the overall carbon price forecasting model, with an importance score of 0.48155, indicating that short-term price trends exert a substantial influence on carbon prices. The feature lag1_price (previous day’s price) follows with an importance score of 0.44677, suggesting a strong short-term autocorrelation in carbon prices. The external variables cumulative industrial growth and industrial growth rate have importance scores of 0.01470 and 0.00613, respectively, indicating a relatively minor direct impact of industrial value-added on carbon prices. The feature coal price exhibits an importance score of 0.00520, suggesting a limited role of coal prices in driving carbon price dynamics.

The feature importance analysis for the market-specific models of China’s eight carbon markets (Beijing, Chongqing, Fujian, Guangzhou, Hubei, Shanghai, Shenzhen, and Tianjin) from 2021 to 2024 is presented in Figure 10a–h. The eight subplots (a–h) in Figure 10 illustrate the feature importance for each market, revealing significant variations in the models’ reliance on different features across markets. The Shanghai market model (subplot f) exhibited the strongest dependence on the Ma7_price (7-day moving average price) feature, with an importance score of 0.89327, while the Beijing market model (subplot a) relies most heavily on the Lag1_price (previous day’s price) feature, with an importance score of 0.58769. In contrast, external factors such as industrial growth rate and coal price generally exhibited low importance across all market models; for instance, in the Tianjin market (subplot h), the importance of the industrial growth rate was only 0.02689. These findings indicate that the models predominantly rely on price-related features, underscoring the dominant role of price lagged features in carbon price forecasting. This also suggests that future research could further explore the potential influence of external economic factors on enhancing model predictive performance.

5. Short-Term Carbon Price Forecasting: A Case Study for 2025–2026

5.1. Overall Model Future Carbon Price Forecast

Figure 11 presents the carbon price forecast for China’s overall carbon market from 2025 to 2026. The results indicate a fluctuating upward trend in the national carbon price over the next two years. This trend is primarily driven by policy influences under China’s carbon peaking targets, as follows: as the carbon market mechanism matures rapidly and policy enforcement intensifies, market supply–demand dynamics tighten, promoting a steady price increase. Additionally, the broader macroeconomic environment and the acceleration of energy transition further support price growth, providing critical insights for carbon market participants and policymakers.

5.2. Market-Specific Model Future Carbon Price Forecast

Figure 12a–h illustrate the market-specific carbon price forecasts for China’s eight carbon markets (Beijing, Chongqing, Fujian, Guangzhou, Hubei, Shanghai, Shenzhen, and Tianjin) from 2025 to 2026. The forecasts reveal a general fluctuating upward trend in prices across all markets, with significant disparities in growth rates and magnitudes among markets. For instance, the Shanghai and Guangzhou markets are expected to exhibit more pronounced growth trends over the next two years, primarily driven by their roles as economic hubs with higher market maturity and robust policy support. The continuous efforts of local governments to meet carbon reduction targets, coupled with strong corporate engagement in low-carbon transitions, collectively contribute to a steady price increase. Furthermore, the industrial structure in these regions, predominantly composed of high-tech and financial services sectors, intensifies market supply–demand tensions, will facilitate faster carbon price growth in the coming years. The Beijing market, heavily influenced by policy, maintains relatively stable and high carbon prices. In contrast, markets such as Tianjin and Fujian experience greater price volatility. Tianjin’s market is constrained by its smaller scale and lower trading activity, with slower policy advancement in carbon market development. The regional economy, largely reliant on traditional manufacturing, has not yet fully embraced green transformation, resulting in less stable carbon prices. Similarly, Chongqing’s market faces challenges due to its remote geographical location, which reduces the regional economy’s sensitivity to carbon market dynamics. The lack of diversity and activity among market participants, combined with relatively limited local policy enforcement, leads to less stable price growth. These inter-market disparities underscore the profound influence of regional economic levels, policy enforcement, and societal contexts on carbon market development, while also highlighting the impact of market volatility and asynchronous regional development stages, offering more targeted insights for carbon market participants and policymakers.

6. Conclusions and Discussion

6.1. Conclusions

This study employs an overall market model and market-specific models to forecast carbon price trends in China’s carbon market from 2025 to 2026, aiming to provide a reference for carbon market participants and policymakers.

The findings indicate a fluctuating upward trend in the national carbon price over the next two years, primarily driven by policy influences under China’s carbon peaking targets. The tightening of market supply–demand dynamics coupled with the accelerated energy transition collectively contribute to price growth. Market-specific forecasts reveal significant disparities in price trends across regions, as follows: Shanghai and Guangzhou markets are projected to experience faster growth, while Tianjin and Chongqing markets to exhibit more moderate increases, and the Beijing market will maintain relatively stable and high prices. These disparities underscore the profound impact of regional economic levels, policy enforcement, and societal contexts on carbon market development. The primary contribution of this study lies in developing a forecasting model that aligns with historical data dynamics by incorporating policy factors and market characteristics, thereby elucidating the price evolution trends of China’s carbon market under its dual-carbon goals. Furthermore, by differentiating forecasts across markets, the study highlights the influence of inter-market heterogeneity on price trends, offering more targeted insights for carbon market participants and policymakers.

6.2. Discussion

This study employed an overall market model and market-specific models to forecast carbon price trends in China’s carbon market from 2025 to 2026, elucidating a fluctuating upward trajectory and inter-market heterogeneity. The overall market forecast indicates a fluctuating upward trend in the national carbon price over the next two years. This trend aligns with the findings of Zhang and Wen [17], who, utilizing deep learning methods, identified a fluctuating upward characteristic in carbon prices [17]. By incorporating policy factors and market characteristics, this study further validates the policy-driven influence on carbon price trends. The forecast results suggest that the policy-driven upward trend is supported by the tightening of market supply–demand dynamics under China’s carbon peaking targets [21], consistent with Chevallier’s research on the volatility characteristics of carbon prices in the EU Emissions Trading System (EU ETS), highlighting the pervasive nature of volatility in carbon markets [22].

Market-specific forecasts reveal significant disparities in price trends across regions. Shanghai and Guangzhou markets exhibited faster growth, the Beijing market maintained stable prices, while Tianjin and Chongqing markets showed more moderate increases. These findings align with Fan and Todorova’s study on China’s pilot carbon markets, which observed higher price volatility in the Shanghai market and relative price stability in the Tianjin market [23]. This study further analyzed the inter-market heterogeneity, noting that regional economic levels, policy enforcement, and market maturity play significant roles in these disparities [24]. For instance, the rapid growth in the Shanghai and Guangzhou markets is primarily driven by robust policy support and high market participation, whereas the slower growth in the Tianjin market is associated with its smaller market scale and lower trading activity [25]. Additionally, the results of this study are consistent with Cong and Wei’s findings, which indicate that carbon market prices in China are significantly influenced by regional policy disparities [26].

From a theoretical perspective, the forecasts in this study support the “policy-driven hypothesis” for carbon prices, suggesting that the upward trend in carbon prices is influenced by policy intensity and market maturity [27]. Under China’s carbon peaking targets, stricter regulations have driven price increases, a pattern consistent with the findings of Alberola et al. on the EU Emissions Trading System carbon market [28]. Furthermore, by incorporating seasonal fluctuations and stochastic disturbances, this study captures the non-linear and non-stationary nature of carbon prices, validating Benz and Trück’s proposition that carbon price volatility exhibits multi-scale characteristics [29]. From a policy perspective, the forecast results provide a foundation for designing differentiated carbon pricing policies. For instance, the rapid growth in the Shanghai and Guangzhou markets suggests the need for stronger price stabilization mechanisms, while the moderate growth in the Tianjin and Chongqing markets indicates a need to further enhance market activity. Meanwhile, the stability and high price levels in the Beijing market highlight the importance of sustained policy support to maintain market equilibrium [30].

From a methodological perspective, the forecasting model developed in this study demonstrates robust capability in capturing the dynamic characteristics of carbon prices, aligning with the findings of Koop and Tole [31]. Furthermore, by conducting differentiated forecasts based on the historical price levels of individual markets, this study effectively addresses the limitation of uniform target pricing prevalent in prior research [18]. Compared to studies that solely focus on point forecasts [12], this study offers a more comprehensive perspective on price forecasting. The forecast results are consistent with Paolella and Taschini’s study on the volatility characteristics of the EU Emissions Trading System carbon market, indicating a certain degree of generalizability in the model’s applicability [32].

This study was subject to certain limitations during the forecasting process. The forecasting model primarily relies on historical data and current policy targets, and future policy changes or variations in policy enforcement may impact the forecast results. Koch et al. note that carbon prices are influenced by both energy and financial markets, with policy uncertainty potentially exerting a significant effect on price trajectories [33]. Additionally, unstructured data (e.g., news sentiment and global economic signals) may further influence carbon price volatility by shaping market participants’ expectations. Future research can enhance the model’s predictive capability by incorporating a broader range of external factors (e.g., global economic signals) [34].

Future research can advance in the following directions: incorporating more dynamic policy scenarios and market supply–demand factors to develop more refined forecasting models, thereby enhancing prediction accuracy and applicability [35]; integrating unstructured data (e.g., news sentiment and social media data) to strengthen the model’s responsiveness to external factors [36]; and expanding the forecasting scope to analyze the long-term impacts of carbon prices on corporate investment decisions and green technology innovation, providing more comprehensive support for achieving carbon neutrality goals [37].

Author Contributions

Conceptualization, Y.G.; Methodology, X.Y.; Data curation, H.H.; Writing—original draft, W.S.; Writing—review & editing, Y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the following projects: 1. Open Fund of the Field Scientific Observation and Research Station of the Subalpine Ecosystem in the Central Qilian Mountains in 2024 (grant no. QLSKFJJ-[2024] D0006); 2. Gansu Provincial Department of Education: Innovation Project for University Teachers (grant no. 2025B-254); 3. Lanzhou College of Arts and Science Teaching Reform Project (grant no. 2024-ZL-jxgg-04).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data originates from a public website, and the website address has been included in the main text of the paper under Section 2. Data and Methods, Section 2.1. Data Sources, fourth line: (Greenhouse Gas Voluntary Emission Reduction Trading Platform (Source of Original Transaction Data): https://ets.sceex.com.cn/internal.htm?orderby=tradeTime%20desc&pageSize=14&k=guo_nei_xing_qing&url=mrhq_gn&pageIndex=1) (accessed on 30 January 2025).

Conflicts of Interest

The authors declare no conflict of interest.

References

IPCC. Intergovernmental Panel on Climate Change. Available online: https://www.ipcc.ch/ (accessed on 30 March 2025).
Arouri, M.E.H.; Jawadi, F.; Nguyen, D.K. Nonlinearities in carbon spot-futures price relationships during Phase II of the EU ETS. Econ. Model. 2012, 29, 884–892. [Google Scholar] [CrossRef]
Ellerman, A.D.; Buchner, B.K. The European Union Emissions Trading Scheme: Origins, Allocation, and Early Results. Rev. Environ. Econ. Policy 2007, 1, 66–87. [Google Scholar] [CrossRef]
National Development and Reform Commission. Notice on Launching Pilot Work for Carbon Emissions Trading (Development and Reform Office Climate [2011] No. 2601). Government Information Disclosure. Available online: https://zfxxgk.ndrc.gov.cn/web/iteminfo.jsp?id=1349 (accessed on 30 March 2025).
Wei, X.; Ouyang, H. Carbon price prediction based on a scaled PCA approach. PLoS ONE 2024, 19, e0296105. [Google Scholar] [CrossRef]
Wang, D.; Sun, M. The impact of the carbon trading market on urban coordinated development in China. Environ. Sci. Pollut. Res. Int. 2024, 31, 20093–20116. [Google Scholar] [CrossRef] [PubMed]
Zhou, J.; Wang, S. A Carbon Price Prediction Model Based on the Secondary Decomposition Algorithm and Influencing Factors. Energies 2021, 14, 1328. [Google Scholar] [CrossRef]
Seifert, J.; Uhrig-Homburg, M.; Wagner, M. Dynamic behavior of CO2 spot prices. J. Environ. Econ. Manag. 2008, 56, 180–194. [Google Scholar] [CrossRef]
Hintermann, B. Allowance price drivers in the first phase of the EU ETS. J. Environ. Econ. Manag. 2010, 59, 43–56. [Google Scholar] [CrossRef]
Creti, A.; Jouvet, P.-A.; Mignon, V. Carbon price drivers: Phase I versus Phase II equilibrium? Energy Econ. 2012, 34, 327–334. [Google Scholar] [CrossRef]
Ballings, M.; Van den Poel, D.; Hespeels, N.; Gryp, R. Evaluating multiple classifiers for stock price direction prediction. Expert Syst. Appl. 2015, 42, 7046–7056. [Google Scholar] [CrossRef]
Lu, H.; Ma, X.; Huang, K.; Azimi, M. Carbon trading volume and price forecasting in China using multiple machine learning models. J. Clean. Prod. 2020, 249, 119386. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
Byun, S.J.; Cho, H. Forecasting carbon futures volatility using GARCH models with energy volatilities. Energy Econ. 2013, 40, 207–221. [Google Scholar] [CrossRef]
Zhu, B.; Ye, S.; Han, D.; Wang, P.; He, K.; Wei, Y.-M.; Xie, R. A multiscale analysis for carbon price drivers. Energy Econ. 2019, 78, 202–216. [Google Scholar] [CrossRef]
Huang, Y.; Dai, X.; Wang, Q.; Zhou, D. A hybrid model for carbon price forecasting using GARCH and long short-term memory network. Appl. Energy 2021, 285, 116485. [Google Scholar] [CrossRef]
Zhang, F.; Wen, N. Carbon price forecasting: A novel deep learning approach. Environ. Sci. Pollut. Res. 2022, 29, 54782–54795. [Google Scholar] [CrossRef]
Dong, X.; Zhang, J.F. Heterogeneity of regional carbon emission markets in China: Evidence from multidimensional determinants. Energy Econ. 2024, 138, 107835. [Google Scholar] [CrossRef]
Keppler, J.H.; Mansanet-Bataller, M. Causalities between CO2, electricity, and other energy variables during phase I and phase II of the EU ETS. Energy Policy 2010, 38, 3329–3341. [Google Scholar] [CrossRef]
Mengesha, I.; Roy, D. Carbon pricing drives critical transition to green growth. Nat. Commun. 2025, 16, 1321. [Google Scholar] [CrossRef]
He, J.; Li, Z.; Zhang, X.; Wang, H.; Dong, W.; Du, E.; Chang, S.; Ou, X.; Guo, S.; Tian, Z.; et al. Towards carbon neutrality: A study on China’s long-term low-carbon transition pathways and strategies. Environ. Sci. Ecotechnol. 2022, 9, 100134. [Google Scholar] [CrossRef]
Chevallier, J. A model of carbon price interactions with macroeconomic and energy dynamics. Energy Econ. 2011, 33, 1295–1312. [Google Scholar] [CrossRef]
Fan, J.H.; Todorova, N. Dynamics of China’s carbon prices in the pilot trading phase. Appl. Energy 2017, 208, 1452–1467. [Google Scholar] [CrossRef]
Zhang, Y.-J.; Sun, Y.-F. The dynamic volatility spillover between European carbon trading market and fossil energy market. J. Clean. Prod. 2016, 112, 2654–2663. [Google Scholar] [CrossRef]
Tan, X.; Wang, X. The market performance of carbon trading in China: A theoretical framework of structure-conduct-performance. J. Clean. Prod. 2017, 159, 410–424. [Google Scholar] [CrossRef]
Cong, R.-G.; Wei, Y.-M. Potential impact of (CET) carbon emissions trading on China’s power sector: A perspective from different allowance allocation options. Energy 2010, 35, 3921–3931. [Google Scholar] [CrossRef]
Alberola, E.; Chevallier, J.; Chèze, B. Price drivers and structural breaks in European carbon prices 2005–2007. Energy Policy 2008, 36, 787–797. [Google Scholar] [CrossRef]
Alberola, E.; Chevallier, J.; Chèze, B. The EU Emissions Trading Scheme: Disentangling the Effects of Industrial Production and CO₂ Emissions on Carbon Prices. Soc. Sci. Res. Netw. 2008, 4, 93–125. [Google Scholar] [CrossRef]
Benz, E.; Trück, S. Modeling the price dynamics of CO₂ emission allowances. Energy Econ. 2009, 31, 4–15. [Google Scholar] [CrossRef]
Zhang, Y.-J.; Wei, Y.-M. An overview of current research on EU ETS: Evidence from its operating mechanism and economic effect. Appl. Energy 2010, 87, 1804–1814. [Google Scholar] [CrossRef]
Koop, G.; Tole, L. Forecasting the European Carbon Market. J. R. Stat. Soc. Ser. A Stat. Soc. 2013, 176, 723–741. [Google Scholar] [CrossRef]
Paolella, M.S.; Taschini, L. An econometric analysis of emission allowance prices. J. Bank. Financ. 2008, 32, 2022–2032. [Google Scholar] [CrossRef]
Koch, N.; Fuss, S.; Grosjean, G.; Edenhofer, O. Causes of the EU ETS price drop: Recession, CDM, renewable policies or a bit of everything?-New evidence. Energy Policy 2014, 73, 676–685. [Google Scholar] [CrossRef]
Liu, M.; Ying, Q. The role of online news sentiment in carbon price prediction of China’s carbon markets. Environ. Sci. Pollut. Res. 2023, 30, 41379–41387. [Google Scholar] [CrossRef] [PubMed]
Zhang, J.; Li, D.; Hao, Y.; Tan, Z. A hybrid model using signal processing technology, econometric models and neural network for carbon spot price forecasting. J. Clean. Prod. 2018, 204, 958–964. [Google Scholar] [CrossRef]
Xie, Q.; Hao, J.; Li, J.; Zheng, X. Carbon price prediction considering climate change: A text-based framework. Econ. Anal. Policy 2022, 74, 382–401. [Google Scholar] [CrossRef]
Kong, S.; Li, H.; Tan, S. Carbon markets, energy transition, and green development: A moderated dual-mediation model. Front. Environ. Sci. 2023, 11, 1257449. [Google Scholar] [CrossRef]

Figure 1. Boxplot of carbon-emission trading prices.

Figure 2. Price trends of China’s eight carbon markets (2021–2024).

Figure 3. Total trading volume by market (2021–2024).

Figure 4. Price volatility by market (2021–2024).

Figure 5. Correlation of carbon prices across markets (2021–2024).

Figure 6. Carbon-price forecasting performance of China’s overall market:: training and testing phases (2021–2024).

Figure 7. Carbon price prediction performance across eight Chinese carbon markets (2021–2024).

Figure 8. Scatter plot comparison of predicted and actual values for eight Chinese carbon markets and the overall model (2021–2024).

Figure 9. Feature importance analysis of the overall carbon price model across eight Chinese carbon markets (2021–2024).

Figure 10. Feature importance analysis of market-specific carbon price models across eight Chinese carbon markets (2021–2024).

Figure 11. Overall price forecast of China’s carbon market from 2025 to 2026.

Figure 12. Market-specific price forecast of China’s eight carbon markets from 2025 to 2026.

Table 1. Performance metrics of the overall model.

Dataset	RMSE	MAE	R²
Training Set	7.14	5.44	0.83
Testing Set	5.07	4.06	0.89

Table 2. Performance metrics of the market-specific models.

Markets	Dataset	RMSE	MAE	R²
Beijing	Training Set	12.29	8.88	0.80
Beijing	Testing Set	11.02	8.45	0.83
Chongqing	Training Set	3.17	2.31	0.73
Chongqing	Testing Set	3.57	2.54	0.71
Fujian	Training Set	2.95	2.24	0.83
Fujian	Testing Set	2.94	2.31	0.84
Guangzhou	Training Set	5.64	4.22	0.87
Guangzhou	Testing Set	5.19	4.17	0.89
Hubei	Training Set	2.26	1.55	0.78
Hubei	Testing Set	1.88	1.35	0.82
Shanghai	Training Set	4.27	3.02	0.87
Shanghai	Testing Set	3.12	2.41	0.92
Shenzhen	Training Set	7.07	5.64	0.87
Shenzhen	Testing Set	8.33	5.93	0.80
Tianjin	Training Set	1.44	1.07	0.85
Tianjin	Testing Set	1.46	1.17	0.86

Table 3. Model performance comparison.

Model	Dataset	RMSE	MAE	R²
XGBoost	Training Set	7.14	5.44	0.83
XGBoost	Testing Set	5.07	4.06	0.89
Naive	Training Set	14.69	11.32	0.26
Naive	Testing Set	13.21	9.53	0.17
Random Forest	Training Set	8.14	6.20	0.78
Random Forest	Testing Set	11.55	9.05	0.55
ARIMA	Training Set	13.91	10.56	0.35
ARIMA	Testing Set	14.09	10.66	0.33

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sun, W.; Gao, Y.; Yang, X.; Zhang, Y.; Hu, H. Carbon Price Forecasting and Market Characteristics Analysis in China: An Integrated Approach Using Overall and Market-Specific Models. Sustainability 2025, 17, 5407. https://doi.org/10.3390/su17125407

AMA Style

Sun W, Gao Y, Yang X, Zhang Y, Hu H. Carbon Price Forecasting and Market Characteristics Analysis in China: An Integrated Approach Using Overall and Market-Specific Models. Sustainability. 2025; 17(12):5407. https://doi.org/10.3390/su17125407

Chicago/Turabian Style

Sun, Weibao, Yafang Gao, Xuemei Yang, Yalong Zhang, and Haolin Hu. 2025. "Carbon Price Forecasting and Market Characteristics Analysis in China: An Integrated Approach Using Overall and Market-Specific Models" Sustainability 17, no. 12: 5407. https://doi.org/10.3390/su17125407

APA Style

Sun, W., Gao, Y., Yang, X., Zhang, Y., & Hu, H. (2025). Carbon Price Forecasting and Market Characteristics Analysis in China: An Integrated Approach Using Overall and Market-Specific Models. Sustainability, 17(12), 5407. https://doi.org/10.3390/su17125407

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Carbon Price Forecasting and Market Characteristics Analysis in China: An Integrated Approach Using Overall and Market-Specific Models

Abstract

1. Introduction

2. Data and Methods

2.1. Data Sources

2.2. Data Cleaning and Preprocessing

2.3. Feature Analysis Methods

2.4. Feature Engineering

2.5. XGBoost Model

3. Results Analysis

3.1. Carbon Market Price Trend Analysis

3.2. Trading-Volume Distribution Analysis

3.3. Price Volatility Characteristics Analysis

3.4. Inter-Market Price Correlation Analysis

3.5. Feature Engineering Results Analysis

4. Modeling and Simulation Validation

4.1. Modeling Approach

4.2. Data Partitioning and Feature Selection

4.3. Model Training and Performance Evaluation

4.4. Comparative Analysis of Predictive Models

4.5. Feature Importance Analysis

5. Short-Term Carbon Price Forecasting: A Case Study for 2025–2026

5.1. Overall Model Future Carbon Price Forecast

5.2. Market-Specific Model Future Carbon Price Forecast

6. Conclusions and Discussion

6.1. Conclusions

6.2. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI