Next Article in Journal
Assessment of Lightning Activity and Early Warning Capability Using Near-Real-Time Monitoring Data in Hanoi, Vietnam
Previous Article in Journal
Hydro-Climatic and Multi-Temporal Remote Analysis of Glacier and Moraine Lake Changes in the Ile-Alatau Mountains (1955–2024), Northern Tien Shan
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Air Quality Index Forecasting Based on Quadratic Decomposition and Transformer-BiLSTM—A Case Study of Beijing

1
School of Mathematics and Statistics, North China University of Water Resources and Electric Power, Zhengzhou 450046, China
2
School of Electrical Engineering, North China University of Water Resources and Electric Power, Zhengzhou 450045, China
3
China Renewable Energy Engineering Institute, Beijing 100120, China
*
Author to whom correspondence should be addressed.
Atmosphere 2025, 16(12), 1334; https://doi.org/10.3390/atmos16121334
Submission received: 3 November 2025 / Revised: 21 November 2025 / Accepted: 24 November 2025 / Published: 25 November 2025
(This article belongs to the Section Air Quality)

Abstract

Accurate Air Quality Index (AQI) forecasting is crucial for environmental pollution control. However, the strong nonlinearity and pronounced non-stationarity of AQI time series limit the precision of single-model predictions. This paper therefore proposes an efficient new AQI forecasting model. First, the raw AQI sequence is decomposed using Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN). This is combined with Sample Entropy (SE) and K-means clustering to reconstruct high-, medium-, and low-frequency sub-sequences. For the high-frequency component, a second decomposition is performed using Variational Mode Decomposition (VMD) optimised by the Crested Porcupine Optimizer (CPO). This forms the basis for constructing a hybrid forecasting model: the CEEMDAN–SE–CPO–VMD–Transformer-BiLSTM model. Finally, the prediction error is corrected via Least Squares Support Vector Machine (LSSVM). Empirical analysis based on AQI data of Beijing in summer 2023 demonstrates that this model significantly outperforms traditional models and single-decomposition models in terms of MAE, RMSE, MAPE, and R2 metrics. Cross-seasonal experiments further confirm its excellent predictive performance and robustness across the spring, autumn, and winter. This model provides a new, efficient, and reliable approach for AQI forecasting.

Graphical Abstract

1. Introduction

In recent years, with the rapid advancement of China’s industrialisation and the continuous expansion of its population, air pollution has become an increasingly severe issue, exerting significant negative impacts on public health and socio-economic development [1,2]. Accurate air quality forecasting facilitates effective prevention, control, and mitigation of atmospheric pollution [3]. While individual pollutant concentrations can be used to assess air quality, the public and policymakers struggle to interpret these data to determine current air quality conditions. The Air Quality Index (AQI), however, simplifies pollutant concentrations—including PM2.5, PM10, CO, SO2, O3, and NO2—into a single numerical value, providing a more intuitive reflection of overall air quality.
The atmospheric system is inherently dynamic, non-linear, and non-stationary, significantly increasing the complexity of AQI forecasting. Consequently, developing an appropriate forecasting framework is crucial [4]. Existing AQI forecasting techniques are primarily categorised into four types [5]: physical methods, statistical methods, artificial intelligence methods, and hybrid models. Physical models, based on physicochemical principles, are commonly used to simulate the dispersion and transport of atmospheric pollutants. However, the primary drawbacks of this approach lie in their high computational complexity, substantial time requirements, and limited predictive accuracy [6]. Traditional statistical methods, such as the Autoregressive Integrated Moving Average (ARIMA) model [7] and Grey Model [8], can infer short-term trends using historical sequences but struggle to capture the pronounced non-linear and non-stationary characteristics of AQI data [9]. Artificial intelligence methods, particularly deep learning approaches, such as convolutional neural networks (CNN), recurrent neural networks (RNN), long short-term memory networks (LSTM), gated recurrent units (GRU) [10], and the recently emerging Transformer model, have been widely applied in air quality forecasting due to their robust non-linear fitting capabilities and ability to learn temporal features [11]. For instance, Ansari A et al. [12] conducted a study in Azamgarh, India, using 8760 hourly air quality measurements (PM2.5, PM10, NO2, SO2) and meteorological data from July 2022 to June 2023, employing multiple statistical methods to demonstrate significant pollutant variations across different timescales. By comparing six deep learning models including Transformers and LSTMs, they that found feedforward neural networks (FNNs) achieved the optimal hourly AQI prediction performance (MAE 2.89, RMSE 4.99, R2 = 0.9971), with PM2.5, NO2, and SO2 exerting the strongest influence on their predictions. However, single-model approaches remain significantly constrained in handling the multi-scale characteristics and complex spatiotemporal dependencies inherent in air quality time series [13].
Presently, to overcome the limitations of single models in terms of accuracy and adaptability, deep learning-based hybrid models have emerged as the mainstream approach for AQI forecasting [14]. Nguyen A T et al. [15] proposed a hybrid deep learning model that integrates Attention Convolutional Neural Networks, ARIMA, LSTM enhanced by Quantum Particle Swarm Optimisation, and XGBoost. Utilising Seoul air quality data from 2021–2022, they achieved AQI prediction through a two-stage process (ARIMA fitting for linear components + hybrid model processing for non-linear components). Results demonstrated significant improvements over traditional models in terms of metrics such as MSE, MAE, and R2, with superior performance at both city and site levels. Qian S et al. [16] proposed a hybrid deep learning model integrating XGBoost feature selection, Gaussian data augmentation, an improved manta ray foraging optimisation algorithm, and TCN-GRU. By synergistically enhancing spatio-temporal feature extraction and model robustness across multiple components, it significantly reduced prediction errors compared to baseline models in AQI forecasting for four cities, providing reliable support for air quality early warning systems.
With technological advancements, numerous scholars have incorporated signal decomposition techniques such as the Extended Empirical Mode Decomposition (EEMD) into AQI forecasting. By decomposing AQI data into frequency-based components through signal decomposition, these methods better capture multi-scale characteristics and enhance prediction accuracy [17]. Wang K et al. [18] developed an integrated model (IAMSSA-VMD-SSA-LSTM) for AQI forecasting. This combines an improved adaptive variant sparrow search algorithm (IAMSSA) to optimised variational mode decomposition (VMD) parameters with a sparrow search algorithm (SSA) to optimise LSTM. After decomposing the non-linear, non-stationary AQI sequence into multiple intrinsic modal functions (IMFs) and residuals (RES), they modelled these components separately. Across data from Chengdu, Guangzhou, and Shenyang, the model achieved MAE, RMSE, MAPE, and R2 values of 3.692, 4.909, 6.241, and 0.981, respectively, outperforming comparison models such as LSTM and SSA-LSTM. Qian S et al. [19] proposed the STL–Metis–MHBA–TC hybrid model for AQI forecasting. This approach processes raw sequences via seasonal-trend decomposition (STL), employs the Metis algorithm to optimise hyperparameters of TimesNet and Crossformer for component prediction, and then fuses results using an enhanced honey badger algorithm (MHBA). This significantly enhances forecasting accuracy compared to Transformer-based models. Owing to AQI’s complex characteristics, including strong non-stationarity, some single-stage decomposition methods fail to effectively address high-frequency disturbances, and often retain noise interference. Others may yield suboptimal decomposition outcomes due to heavy reliance on parameters [20]. Consequently, researchers have explored two-stage decomposition strategies, combining the strengths of both approaches: initial decomposition followed by refined processing of complex components to enhance feature extraction quality [21]. Tang C et al. [22] utilised AQI data from the Beijing–Tianjin–Hebei region to propose two foundational AQI prediction models: one based on Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) with GRU, and another hybrid model incorporating VMD and Sample Entropy (SE) optimisation. The improved model achieved a maximum R2 of 0.984 and a mean absolute error (MAE) of 2.476 across multiple regions, validating its reliability and applicability.
Consequently, this paper proposes a hybrid prediction model integrating CEEMDAN, SE, VMD, and Transformer–BiLSTM. The Crested Porcupine Optimizer (CPO) algorithm is employed to optimised VMD parameters, enhancing decomposition accuracy. Its performance is validated using AQI data of Beijing. Experimental results demonstrate that the CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM framework significantly outperforms traditional models in both prediction accuracy and efficiency. To further enhance prediction precision, LSSVM is introduced for error correction, and this error correction step provides crucial reference for environmental management.
The innovations and primary contributions of this study are as follows:
(1)
Proposal of a novel hybrid prediction framework: We designed the CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM hybrid prediction model. CEEMDAN decomposition enhances data utilisation, SE optimises signal reconstruction, the CPO algorithm adaptively optimises VMD parameters to enhance decomposition accuracy, Transformer–BiLSTM collaboratively captures long- and short-term temporal features, and LSSVM error correction further improves prediction accuracy. Empirical analysis demonstrates the framework’s superior performance in AQI forecasting, significantly outperforming traditional and single-decomposition models.
(2)
Validating of the hybrid framework’s superiority and robustness: Cross-season comparison experiments using AQI data of Beijing across spring, summer, autumn, and winter 2023 data confirmed the proposed model’s superiority across metrics including MAE, RMSE, MAPE, and R2, alongside its universality and stability across seasonal scenarios. LSSVM error correction substantially optimised prediction outcomes.
(3)
Optimisation of VMD parameter selection: Addressing the subjective nature of selecting VMD parameters (mode number K and penalty factor α ), we introduce CPO for adaptive optimisation. This enhanced the accuracy of quadratic decomposition, providing high-quality inputs for Transformer–BiLSTM forecasting and LSSVM error correction, and thus ensuring model robustness and prediction reliability.

2. Materials and Methods

2.1. Complete Ensemble Empirical Mode Decomposition with Adaptive Noise

Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) [23] represents an enhanced signal decomposition method derived from Empirical Mode Decomposition (EMD). By repeatedly introducing adaptive white noise during decomposition, it significantly alleviates the modal aliasing inherent in EMD. Meanwhile, its decomposition process exhibits excellent completeness, enabling precise reconstruction of the original signal. The specific steps of this method are as follows:
First, the AQI signal is decomposed into K zero-mean white noise components:
x i ( t ) = x ( t ) + ε δ i ( t )
where ε represents a Gaussian white noise weighting factor; δ i ( t ) denotes the i -th generated white noise component
Subsequently, perform EMD is performed on x i ( t ) , treating the first decomposed modal component as the first modal component obtained from CEEMDAN decomposition:
I M F 1 ( t ) = 1 K i = 1 K I M F 1 i ( t )
r 1 ( t ) = x ( t ) I M F 1 ( t )
Among these, I M F 1 ( t ) represents the first modal component of this decomposition; r 1 ( t ) denotes the residual.
After adding specific adaptive noise to the residuals obtained at the j -th stage of the decomposition, the EMD continues:
I M F j ( t ) = 1 K i = 1 K E 1 r j 1 ( t ) + ε j 1 E j 1 [ δ i ( t ) ]
r j ( t ) = r j 1 ( t ) I M F j ( t )
where I M F j ( t ) denotes the j -th modal component of this decomposition.
Finally, if the residual signal obtained from the K -th decomposition is a monotonic signal, the iteration stops and the CEEMDAN algorithm concludes.

2.2. Sample Entropy

To address the issue of the generation of numerous components after decomposition whose practical significance is difficult to determine, Richman and Moorman proposed Sample Entropy (SE) [24]. SE is a non-linear dynamic method for assessing data complexity: the simpler the time series, the smaller the SE value; conversely, the larger the SE value. SE computation consists of three steps: first, determining the subsequence length and similarity threshold; second, constructing the subsequences; and finally, calculating similarity and probability. In our experiments, we employed the SE algorithm to compute the entropy for each intrinsic mode function (IMF). Based on these entropy values, we assessed the randomness of each IMF and them as the basis for merging and reconstructing components, to generate three types of components: low-frequency, high-frequency, and trend components. To further optimise component classification, we employed the K-means clustering algorithm to categorise components into high-frequency, medium-frequency, and low-frequency groups. This approach reduces the number of components and improves computational efficiency.

2.3. Crested Porcupine Optimizer

The Crested Porcupine Optimizer (CPO) [25] is a novel bio-inspired meta-heuristic optimisation algorithm. Its core lies in constructing an optimisation framework by simulating four typical defence behaviours of crested porcupines: visual, acoustic defence, olfactory defence, and physical attack. These behaviours are hierarchically designed to balance exploration and exploitation: the first two strategies (visual and acoustic) facilitate global search, while the latter two (olfactory and physical) enhance local refinement. This design endows CPO with three critical advantages:
Strong global exploration: Effectively avoids local optima by mimicking the porcupine’s threat assessment mechanisms;
Rapid convergence: Achieves fast optimisation through cyclic population reduction, which dynamically adjusts search intensity;
Low computational cost: Reduces traversal time in complex parameter spaces via adaptive defence strategy switching.
The detailed mathematical formulations of CPO, including its position update rules and cyclic population reduction technique, are given in Reference [26]. This algorithm has been validated in various engineering applications for its robustness and efficiency in solving high-dimensional optimisation problems. The specific workflow is illustrated in Figure 1.

2.4. Variational Mode Decomposition

Variational Mode Decomposition (VMD) represents a novel approach to decomposing complex signals [27]. Unlike EMD and its variants, the VMD algorithm has rigorous mathematical derivation and strong noise robustness, thereby ensuring decomposition accuracy and stability. Consequently, VMD effectively resolves the modal aliasing issue inherent in EMD [28]. It can adaptively optimise the decomposition of non-stationary and highly complex signals into several relatively stable sub-modes. The specific steps are detailed in the Reference [29].

2.5. Transformer–BiLSTM

AQI forecasting necessitates processing multidimensional time-series data. Traditional RNNs struggle to model long-term sequence dependencies, while Transformers excel at capturing global correlations but lack sufficient short-term dynamic capture capability. Hence, this paper proposes the Transformer–BiLSTM model, which synergistically learns to extract long- and short-term features through self-attention mechanisms and BiLSTMs, thereby enhancing the precision and robustness of AQI predictions. The model structure and principles are as follows:
(1)
Position Encoding: Imbuing temporal information into the model
The Transformer first applies positional encoding to the input AQI time-series features, embedding temporal sequence information into the model. The formula are:
P E p o s , 2 i = sin p o s 10000 2 i d
P E p o s , 2 i + 1 = cos p o s 10000 2 i d
where p o s denotes the temporal step position and d represents the embedding dimension. Position encoding enables the model to capture the temporal sequence of AQI features, preventing the loss of temporal information inherent in the Transformer’s “unordered list” processing approach.
(2)
Transformer Self-Attention: Capturing Global Dependencies
Self-attention is the core of the Transformer. The input to self-attention is represented by matrix X. Via linear transformations of the query matrix (Q), key matrix (K), and value matrix (V), it calculates the correlation weights between AQI features across different time steps (e.g., the correlation between pollutant concentrations at the current time and those at historical times, or with historical AQI values).
Q = X W Q K = X W K V = X W V
A Q , K , V = S o f t   max Q K T d k V
where A Q , K , V represents the attention score, X denotes the input matrix, and W Q ,   W K ,   W V are the trainable matrices within the model; Q K T characterises the similarity between queries and keys (i.e., feature relevance); d k is a scaling factor to prevent excessive inner products; Multi-Head Attention further maps features to multiple subspaces, capturing multi-scale global dependencies. The formula is:
o n Q , K , V = c o n c a t h e a d 1 , h e a d 2 , , h e a d h W e i
where h e a d h denotes the output of the h -th head, and W e i represents the linear transformation matrix. Through multi-head attention, the model can simultaneously learn long-range correlations within AQI sequences.
(3)
BiLSTM Decoder: Learning Local Temporal Dependencies
The global feature vector processed by the Transformer serves as input to the BiLSTM, which comprises a forward LSTM and a backward LSTM: the forward LSTM processes the sequence chronologically to learn forward temporal relationships; the backward LSTM processes the sequence in reverse order to learn backward temporal relationships.
The core of LSTM lies in its gating mechanism ( f t ,   i t ,   o t ). The forward computation process of LSTM is as follows:
f t = σ U f h t 1 + W f x t + b f i t = σ U i h t 1 + W i x t + b i o t = σ U o h t 1 + W o x t + b o c ˜ t = t a n h U c h t 1 + W c x t + b c c t = f t c t 1 + i t c ˜ t h t = o t t a n h c t
where x t represents the AQI feature sequence (incorporating contextual information from pollutant concentrations, meteorological factors, and historical AQI data). The hidden state c t captures short-term AQI dynamics. Through bidirectional processing, BiLSTM can simultaneously learn “past → present” and “present → past” relationships, enhancing its modelling capability for short-term fluctuations.
(4)
Output Layer: AQI Prediction
The BiLSTM output is mapped to a single-dimensional prediction value via a fully connected layer, and model training is completed using regression loss. Ultimately, the model outputs the predicted AQI value for the next time step.
The Transformer–BiLSTM model provides a more accurate and robust approach for AQI forecasting through synergistic learning of “global correlations + local dynamics”.

2.6. Prediction Framework

To address the non-stationarity and complex temporal characteristics in AQI forecasting, this paper proposes an integrated AQI prediction workflow combining a two-stage modal decomposition method based on CEEMDAN-SE-CPO-VMD with an ensemble Transformer–BiLSTM model. This workflow comprises three stages: preprocessing of raw AQI data, signal processing via secondary modal decomposition based on CEEMDAN-SE-CPO-VMD, and training and testing of the integrated Transformer–BiLSTM model. The prediction steps are as follows:
(1)
Data Preprocessing
Collect historical AQI data and feature vectors (pollutant concentrations such as PM2.5, PM10, NO2, SO2, CO, O3, and meteorological factors including wind speed, wind direction, temperature, and humidity). Preprocess the raw data to construct standardised input sequences, providing a high-quality data foundation for subsequent decomposition and modelling.
(2)
CEEMDAN-SE-CPO-VMD Second-Order Modal Decomposition
The CEEMDAN algorithm to decompose the original non-stationary AQI time series into multiple IMF components. Calculate the SE for each IMF component to quantify sequence complexity. Using the SE values of each IMF as feature vectors, apply the K-means clustering algorithm to partition the IMF components into three clusters (high-, medium-, and low-frequency Co-IMF). The cluster with the highest complexity (Co-IMF1) is selected and further decomposed into finer-grained sub-components (VMD-IMF) using VMD to uncover micro-oscillation patterns. To avoid arbitrary parameter selection in VMD, CPO is employed to optimise the modal number K and penalty factor α .
(3)
Ensemble Transformer–BiLSTM Prediction and Result Reconstruction
The sub-components obtained from VMD and the remaining clusters from CEEMDAN decomposition are fed into a Transformer–BiLSTM hybrid prediction model for training and forecasting. The Transformer captures global correlations within the AQI sequence (e.g., pollutant concentration trends across days) via self-attention mechanisms, while the BiLSTM models local temporal dependencies (e.g., short-term pollutant fluctuations). Finally, the predicted values of all subcomponents (VMD-decomposed subcomponents and remaining Co-IMFs) are linearly superimposed to reconstruct the final prediction of the original AQI time series.
This approach effectively captures both short-term and long-term dynamic characteristics of AQI sequences through integrated modelling using secondary modal decomposition and Transformer–BiLSTM, thereby enhancing prediction accuracy and robustness. The prediction model workflow is illustrated in Figure 2.

3. Date Sources and Processing

3.1. Data Sources and Preprocessing

Beijing, situated in the northern part of the North China Plain, experiences a temperate monsoon climate with distinct seasonal characteristics. Summers are hot and rainy, while winters are cold and dry. Daily temperature variations are moderate, and the region enjoys abundant sunshine. Precipitation is predominantly concentrated in summer, and spring is prone to sandstorms. Beijing’s industrial structure centres on the tertiary sector, prioritising modern services, high-tech industries, and cultural and creative industries. Heavy industry accounts for a relatively low proportion, though certain areas still host energy processing and high-end manufacturing sectors. Significant disparities exist in pollutant emissions between urban and suburban zones. Regarding pollution sources, Beijing’s atmospheric environment is primarily influenced by two factors: natural factors, notably spring sandstorms and surface dust pollution; and anthropogenic factors, chiefly motor vehicle exhaust emissions, industrial pollutant discharges, and energy consumption during the winter heating season.
This study uses hourly AQI, pollutant, and meteorological data from Beijing spanning 1 June to 31 August 2023, comprising 2208 data points. Pollutants include PM2.5, PM10, SO2, NO2, O3, and CO, while meteorological factors encompass air temperature, humidity, atmospheric pressure, and wind speed. AQI and pollutant data were sourced from the National Urban Air Quality Real-time Publishing Platform of the China National Environmental Monitoring Centre (CNEMC) [Online]. Available: https://air.cnemc.cn:18007/ (accessed on 15 October 2025), while meteorological data were sourced from the National Meteorological Information Centre (NMIC) [Online]. Available: http://data.cma.cn/ (accessed on 15 October 2025).
AQI data are presented in Figure 3.
The descriptive statistics presented in Table 1 offer a comprehensive overview of the dataset and reveal the dominant influence of specific environmental and meteorological parameters on the AQI during the summer of 2023 in Beijing.
The average AQI was only 48.84, with a maximum of 150, indicating air quality was predominantly excellent or good and no heavy pollution episodes occurred. Among the six criteria pollutants, ozone (O3) exhibited the highest mean concentration (101.90 μg/m3) and by far the greatest variability (Std. Dev = 54.90 μg/m3), while primary pollutants such as PM2.5 (mean: 20.84 μg/m3), PM10 (mean: 39.86 μg/m3), NO2 (mean: 16.57 μg/m3), and CO (mean: 0.45 mg/m3) remained at remarkably low levels. This pattern suggests that secondary photochemical processes, rather than direct emissions, were the primary drivers of AQI variations during this period.
Temperature emerged as a key meteorological driver, with a mean of 28.17 °C and moderate variability (Std. Dev = 4.82 °C), creating favourable conditions for intense solar radiation and the rapid photochemical formation of ozone from precursor pollutants. Relative humidity averaged 60.44% with marked dispersion (Std. Dev = 25.08%), reflecting frequent convective rainfall events, which effectively scavenged particulate matter through wet deposition, thereby suppressing PM concentrations. Wind speed was relatively low (mean: 2.24 m/s, Std. Dev = 1.24 m/s), indicating generally calm conditions, which limited long-range pollutant transport and favoured the local accumulation of photochemically produced pollutants. Atmospheric pressure showed minor fluctuations (Std. Dev = 4.62 hPa), consistent with the stable dominance of the East Asian summer monsoon and the absence of strong subsidence inversions characteristic of colder seasons.
Taken together, the interplay of high temperatures, moderate-to-high humidity coupled with frequent precipitation, low wind speeds, and stable pressure patterns created a meteorological regime in which temperature-driven ozone production overwhelmingly dominated AQI variability, while particulate matter was efficiently removed by wet deposition. This contextual understanding of the dataset, alongside the specific roles played by these environmental and meteorological parameters, lays a solid foundation for subsequent correlation analysis and modelling stages.
During data preprocessing, missing data caused by equipment failure were first addressed using mean imputation to ensure data integrity and minimise its impact on the model. Subsequently, outliers arising from equipment malfunctions, transmission errors, or extreme weather were detected using the Z-score method. Given the limited sample size, outliers with |Z| > 3 were corrected via threshold-based correction. Next, to eliminate the impact of differing variable dimensions on analysis and accelerate the convergence of machine learning algorithms, Min-Max normalisation was applied to scale the data to the [0, 1] range. Finally, the 2208 empirical data points were divided into a training set (1766 records) and a test set (442 records) in an 8:2 ratio in chronological order.

3.2. Feature Selection

AQI is influenced by multiple factors. with influencing factors as model inputs risks increasing computational workload and data redundancy, making feature selection crucial. Correlation analysis among features enables the identification of optimal subsets from numerous features, eliminating non-critical factors and significantly enhancing model training efficiency. Given the pronounced influence of weather conditions on air quality, Pearson’s correlation coefficient was employed to investigate the relationship between concentrations of six primary pollutants and meteorological variables. This coefficient measures the strength of data association, calculated as the covariance divided by the product of their respective standard deviations. Results range from −1 to 1, with values closer to 1 indicating a stronger positive correlation between the factor and AQI. Using the concentrations of six AQI-influencing pollutants as baseline data, combined with four categories of meteorological data, the aforementioned method was applied the aforementioned method after screening and preprocessing. This ultimately yielded a Pearson correlation coefficient heatmap (Figure 4), revealing the interaction mechanisms between pollutants and meteorological factors while enabling the selection of relevant characteristics.
As shown in Figure 4, the correlation coefficient between SO2 and AQI is merely 0.117, while that for NO2 is 0.065, indicating extremely weak linear correlations with AQI. The correlation coefficient between humidity and AQI was −0.361, between atmospheric pressure and AQI was −0.100, and between wind speed and AQI was 0.149. These meteorological variables also exhibited relatively weak correlations with AQI, indicating limited direct explanatory power for AQI. Conversely, PM2.5 exhibits a moderate positive correlation with AQI (correlation coefficient 0.565), while PM10 shows a strong positive correlation (0.662). O3 demonstrates the strongest positive correlation with AQI (0.751), and CO exhibits a moderate positive correlation (0.380). Air temperature also exhibits a moderate positive correlation with AQI (0.585). Notably, while PM2.5 and PM10 demonstrate extremely strong collinearity (correlation coefficient 0.907), their influence on AQI differs due to their distinct particle size dimensions (as pollutants of different diameters). O3 and CO show weaker collinearity with other pollutants, providing independent types of pollution information. Although air temperature exhibits strong collinearity with O3 (correlation coefficient 0.778), as a meteorological variable it maintains a close association with AQI through unique mechanisms. Consequently, PM2.5, PM10, O3, CO, and air temperature were selected as predictive features for AQI.

4. Results and Discussion

The experiments in this paper were conducted in MATLAB 2023b. The computer environment configuration is provided in Table 2.

4.1. Evaluation Metrics

This paper employs Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R2) as evaluation metrics for assessing the performance of each model in predicting AQI. Their respective formulas are as follows:
M A E = 1 n i = 1 n | y i y ^ i |
R M S E = 1 n i = 1 n ( y i y ^ i ) 2
M A P E = 1 n i = 1 n y i y ^ i y i × 100 %
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ ) 2
where n denotes the number of samples; y i represents the true value of the i -th sample; y ^ i signifies the predicted value of the i -th sample; and y ¯ indicates the mean of the true values.

4.2. Parameter Configuration

The parameter configurations were determined based on multiple experimental validations and reviews of relevant literature to ensure the rationality and effectiveness of model performance (Table 3).

4.3. CEEMDAN-SE-CPO-VMD Data Decomposition

First, the AQI dataset was decomposed using the CEEMDAN algorithm. The decomposition results of the AQI dataset following CEEMDAN are shown in Figure 5.
As shown in Figure 5 following CEEMDAN decomposition, the original AQI time series is decomposed into 11 IMF components sorted by frequency from high to low. Among these, the high-frequency components exhibit higher noise content and more complex time-domain structures, making it challenging for models to adequately capture fluctuation characteristics across various time scales. In light of this, we first calculate the SE for each IMF component to quantify its complexity. The SE results for each IMF are presented in Table 4.
Based on the SE calculation results in Table 4, the 11 IMF components were fed into the K-means algorithm with the number of clusters set to 3. The three modal clusters obtained from clustering were designated as Co-IMF1, Co-IMF2, and Co-IMF3: the high-frequency group Co-IMF1 comprises IMF1, IMF2, and IMF3; the mid-frequency group Co-IMF2 comprises IMF4, IMF5, and IMF6; and the low-frequency group Co-IMF3 consists of IMF7–IMF11. After completing K-means clustering, for the high-frequency component Co-IMF1 obtained from clustering, the VMD algorithm was further employed to perform in-depth secondary modal decomposition on it. The CPO algorithm was used to optimise the parameters of VMD (K and α ), yielding K = 6 and α = 110.08 after CPO optimisation. The secondary decomposition via CEEMDAN-SE-CPO-VMD is shown in Figure 6.
The secondary decomposition via CEEMDAN-SE-CPO-VMD not only reduced data complexity but also extracted key features, effectively capturing both short-term fluctuations and long-term trends in AQI. The data after secondary decomposition was then fed into the Transformer–BiLSTM model for prediction.

4.4. CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM Prediction Results

4.4.1. Comparison of Different Models

To validate the superiority of the CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM model, four reference models (BiLSTM, Transformer, Transformer–BiLSTM, and CEEMDAN–SE–Transformer–BiLSTM) are selected for the comparison of experimental results, as follows:
A comprehensive analysis of Figure 7 and Figure 8 reveals that the proposed CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM model demonstrates outstanding performance in AQI forecasting. The prediction fit plot demonstrates that the model’s forecast curve aligns most closely with the true value curve, accurately tracking AQI trends across varying fluctuation ranges. The percentage error plot further indicates that this model exhibits the narrowest error band with the most concentrated distribution, significantly outperforming other comparison models. Combined with the quantitative metrics in Table 5, this model achieved the lowest values for MAE (0.7963), RMSE (1.0544), and MAPE (2.0808%), alongside the highest R2 (0.9958). This indicates not only high fitting accuracy but also robust predictive stability.
Figure 9 presents the Taylor diagram of the prediction results of all models, intuitively illustrating the agreement between the model predictions and the actual data. The CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM model exhibits the smallest angle with the observation point and the highest correlation coefficient with the observed values, while its standard deviation is close to that of the observed values—indicating that the output range of this model is in good agreement with the observed data. Although the standard deviations of other models are also relatively close to those of the observed values, their correlation coefficients with the observed values are lower than that of the CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM model.
The results demonstrate that incorporating preprocessing methods—including CEEMDAN decomposition, SE-based clustering, and CPO-optimised VMD—effectively enhances the model’s ability to capture complex temporal features and non-linear patterns within AQI sequences, thereby significantly improving predictive performance.

4.4.2. Error Correction Strategy

Existing research confirms that error correction strategies markedly enhance model predictive performance [30,31]. Inspired by this body of research, this study employs the Least Squares Support Vector Machine (LSSVM) [32]—a model characterised by its simple structure and compact parameter set, to predict the error sequence generated from the combined model’s prediction results, thereby achieving error correction. The final corrected prediction is obtained by integrating the error prediction results with the initial prediction results of the combined model. The specific experimental steps are illustrated in Figure 10 (see also Table 6).
Following the incorporation of the LSSVM error correction strategy into the CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM model, all prediction accuracy metrics were further enhanced. For the error-corrected model, MAE was reduced from 0.7963 to 0.7102, RMSE from 1.0544 to 0.9154, and MAPE from 2.0808% to 1.8543%, while R2 was increased from 0.9958 to 0.9968. This indicates that the LSSVM-based error correction method effectively identified and rectified the systematic bias in the original combined model. By predicting and integrating the error sequence, the final output results were further optimised, validating the effectiveness of the error correction strategy in enhancing AQI prediction accuracy.

4.4.3. Seasonal Validation

AQI values across different seasons within the same city exhibit significant spatio-temporal variations influenced by meteorological conditions and emission source intensities, with their fluctuation characteristics directly challenging the model’s environmental adaptability. Having previously validated the baseline model performance using Beijing’s summer 2023 data (June–August), we supplemented the validation with seasonal data from Beijing’s spring 2023 (March–May), autumn (September–November), and winter (December 2023–February 2024). The supplementary data included AQI values alongside corresponding pollutant concentrations and meteorological factors. The cross-seasonal validation results are presented in Table 7:
Based on validation results across different seasons in Beijing during 2023, the proposed improved model CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM–LSSVM consistently maintained low values for MAE, RMSE, and MAPE, with R2 values approaching 1. It demonstrated stable and outstanding predictive performance even amid spatiotemporal variations in meteorological conditions across seasons, fully reflecting its exceptional robustness in cross-seasonal contexts. By contrast, reference models such as BiLSTM and Transformer exhibited greater performance variability, further highlighting the robust adaptability of the proposed model across diverse seasonal conditions.

5. Conclusions

This paper addresses the challenges of strong nonlinearity and non-stationarity in AQI time series, alongside the limitations of single models in prediction accuracy. We constructed a hybrid forecasting model integrating CEEMDAN-SE-CPO-VMD secondary decomposition and Transformer–BiLSTM, and incorporated LSSVM for error correction. Experimental validation using hourly AQI data from Beijing in summer 2023 led to the following conclusions:
  • Secondary decomposition effectively enhances feature quality: The CEEMDAN algorithm decomposed the raw AQI sequence into 11 IMF components. By quantifying their complexity using SE and applying K-means clustering, these components were grouped into high-, medium-, and low-frequency subsequences. We further used CPO to optimise VMD parameters (K = 6, α = 110.08) for secondary de-composition of high-frequency components, which substantially reduced data complexity and provided high-quality inputs for subsequent prediction.
  • Hybrid deep learning model enhances temporal modelling capability: For the Transformer–BiLSTM architecture, the Transformer effectively captures global long-term dependencies, while the BiLSTM focuses on extracting local short-term dynamics. Their synergy between the two substantially improves feature learning capacity. This model outperforms standalone BiLSTM and Transformer models in MAE, MAPE and RMSE, while achieving a higher R2, demonstrating superior predictive performance.
  • Error correction further optimises prediction accuracy: After introducing LSSVM to correct prediction errors, all model metrics improved. MAE decreased from 0.7963 to 0.7102, RMSE decreased from 1.0544 to 0.9154, MAPE decreased from 2.0808% to 1.8543% and R2 increased from 0.9958 to 0.9968. These results were significantly superior to those of the reference models without error correction, demonstrating enhanced stability and robustness.
  • The model demonstrates robust cross-seasonal adaptability: In extended experiments across spring, autumn, and winter, it maintained consistently superior performance across key metrics including MAE, RMSE, MAPE, and R2, significantly outperforming reference models. This confirms its strong universality and stability in addressing variations in meteorological conditions and pollution characteristics across different seasons.
This model provides an effective technical solution for high-precision AQI forecasting, providing support for air quality early warning systems and pollution control decision-making. Future research may focus on validating the model’s generalisation capability using multi-regional data, or incorporating more advanced dynamic feature extraction mechanisms to enhance predictive performance during sudden pollution events.

Author Contributions

Conceptualization, P.C.; Methodology, C.W.; Software, C.W.; Validation, P.C.; Formal analysis, H.W.; Investigation, J.Z.; Resources, J.Z.; Writing—original draft, C.W.; Visualisation, P.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Enterprise Science and Technology Tackling Project of China Power Limited (Grant No. DJ-HXGG-2024-03).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare that this study received funding from The Enterprise Science and Technology Tackling Project of China Power Limited (Grant No. DJ-HXGG-2024-03). The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

Abbreviations

The following abbreviations are used in this manuscript:
AQIAir Quality Index
EMDEmpirical Mode Decomposition
EEMDExtended Empirical Mode Decomposition
CEEMDANComplete Ensemble Empirical Mode Decomposition with Adaptive Noise
SESample Entropy
VMDVariational Mode Decomposition
CPOCrested Porcupine Optimizer
BiLSTMBidirectional Long Short-Term Memory
LSSVMLeast Squares Support Vector Machine
ARIMAAutoregressive Integrated Moving Average
CNNConvolutional Neural Networks
RNNRecurrent Neural Networks
LSTMLong Short-Term Memory
GRUGated Recurrent Units
PSOParticle Swarm Optimisation
SSASparrow Search Algorithm

References

  1. Vilcassim, R.; Thurston, G.D. Gaps and future directions in research on health effects of air pollution. EBioMedicine 2023, 93, 104668. [Google Scholar] [CrossRef]
  2. Cheng, L. The potential synergy between air pollution control and employment growth. J. Environ. Manag. 2024, 369, 122358. [Google Scholar] [CrossRef]
  3. Yang, H.; Gao, Y.; Zhao, F.; Wang, J. An intelligent interval forecasting system based on fuzzy time series and error distribution characteristics for air quality index. Environ. Res. 2024, 251, 118577. [Google Scholar] [CrossRef]
  4. Aggarwal, A.; Toshniwal, D. A hybrid deep learning framework for urban air quality forecasting. J. Clean. Prod. 2021, 329, 129660. [Google Scholar] [CrossRef]
  5. Liu, H.; Yan, G.; Duan, Z.; Chen, C. Intelligent modeling strategies for forecasting air quality time series: A review. Appl. Soft Comput. 2021, 102, 106957. [Google Scholar] [CrossRef]
  6. Luecken, D.J.; Yarwood, G.; Hutzell, W.T. Multipollutant modeling of ozone, reactive nitrogen and HAPs across the continental US with CMAQ-CB6. Atmos. Environ. 2019, 201, 62–72. [Google Scholar] [CrossRef] [PubMed]
  7. Gao, W.; Xiao, T.; Zou, L.; Li, H.; Gu, S. Analysis and Prediction of Atmospheric Environmental Quality Based on the Autoregressive Integrated Moving Average Model (ARIMA Model) in Hunan Province, China. Sustainability 2024, 16, 8471. [Google Scholar] [CrossRef]
  8. Guo, J.; Tu, L.; Qiao, Z.; Wu, L. Forecasting the air quality in 18 cities of Henan Province by the compound accumulative grey model. J. Clean. Prod. 2021, 310, 127582. [Google Scholar] [CrossRef]
  9. Yang, W.; Wang, J.; Lu, H.; Niu, T.; Du, P. Hybrid wind energy forecasting and analysis system based on divide and conquer scheme: A case study in China. J. Clean. Prod. 2019, 222, 942–959. [Google Scholar] [CrossRef]
  10. Pande, C.B.; Kushwaha, N.L.; Alawi, O.A.; Sammen, S.S.; Sidek, L.M.; Yaseen, Z.M.; Pal, S.C.; Katipoglu, O.M. Daily scale air quality index forecasting using bidirectional recurrent neural networks: Case study of Delhi, India. Environ. Pollut. 2024, 351, 124040. [Google Scholar] [CrossRef]
  11. Khatibi, V.; Nikpour, P. Advancing multi-pollutant air quality forecasting using Transformer-based Informer architecture. Earth Sci. Inform. 2025, 18, 287. [Google Scholar] [CrossRef]
  12. Ansari, A.; Quaff, A.R. Data-driven analysis and predictive modelling of hourly Air Quality Index (AQI) using deep learning techniques: A case study of Azamgarh, India. Theor. Appl. Climatol. 2025, 156, 74. [Google Scholar] [CrossRef]
  13. Sarkar, N.; Keserwani, P.K.; Govil, M.C. A modified PSO based hybrid deep learning approach to predict AQI of urban metropolis. Urban Clim. 2024, 58, 102212. [Google Scholar] [CrossRef]
  14. Sarkar, N.; Gupta, R.; Keserwani, P.K.; Govil, M.C. Air Quality Index prediction using an effective hybrid deep learning model. Environ. Pollut. 2022, 315, 120404. [Google Scholar] [CrossRef] [PubMed]
  15. Nguyen, A.T.; Pham, D.H.; Oo, B.L.; Ahn, Y.; Lim, B.T.H. Predicting air quality index using attention hybrid deep learning and quantum-inspired particle swarm optimization. J. Big Data 2024, 11, 71. [Google Scholar] [CrossRef]
  16. Qian, S.; Peng, T.; Tao, Z.; Li, X.; Nazir, M.S.; Zhang, C. An evolutionary deep learning model based on XGBoost feature selection and Gaussian data augmentation for AQI prediction. Process Saf. Environ. Prot. 2024, 191, 836–851. [Google Scholar] [CrossRef]
  17. Dong, J.; Zhang, Y.; Hu, J. Short-term air quality prediction based on EMD-transformer-BiLSTM. Sci. Rep. 2024, 14, 20513. [Google Scholar] [CrossRef]
  18. Wang, K.; Fan, X.; Yang, X.; Zhou, Z. An AQI decomposition ensemble model based on SSA-LSTM using improved AMSSA-VMD decomposition reconstruction technique. Environ. Res. 2023, 232, 116365. [Google Scholar] [CrossRef]
  19. Qian, S.; Peng, T.; He, R.; Chen, J.; Zhang, X.; Nazir, M.S.; Zhang, C. A novel ensemble framework based on intelligent weight optimization and multi-model fusion for air quality index prediction. Urban Clim. 2025, 59, 102233. [Google Scholar] [CrossRef]
  20. Wu, J.; Chen, X.; Li, R.; Wang, A.; Huang, S.; Li, H.; Liu, H.; Qi, H.; Liu, M.; Cheng, H.; et al. A novel framework for high resolution air quality index prediction with interpretable artificial intelligence and uncertainties estimation. J. Environ. Manag. 2024, 357, 120785. [Google Scholar] [CrossRef]
  21. Zhou, H.; Yan, Y. Research on a hybrid deep learning model based on two-stage decomposition and an improved whale optimization algorithm for air quality index prediction. Eng. Appl. Comput. Fluid Mech. 2025, 19. [Google Scholar] [CrossRef]
  22. Tang, C.; Wang, Z.; Wei, Y.; Zhao, Z.; Li, W. A novel hybrid prediction model of air quality index based on variational modal decomposition and CEEMDAN-SE-GRU. Process Saf. Environ. Prot. 2024, 191, 2572–2588. [Google Scholar] [CrossRef]
  23. Torres, M.E.; Colominas, M.A.; Schlotthauer, G.; Flandrin, P. A complete ensemble empirical mode decomposition with adaptive noise. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011; pp. 4144–4147. [Google Scholar]
  24. Richman, J.S.; Moorman, J.R. Physiological Time-Series Analysis Using Approximate Entropy and Sample Entropy. Am. J. Physiol.—Heart Circ. Physiol. 2000, 278, H2039–H2049. [Google Scholar] [CrossRef]
  25. Abdel-Basset, M.; Mohamed, R.; Abouhawwash, M. Crested Porcupine Optimizer: A new nature-inspired metaheuristic. Knowl.-Based Syst. 2024, 284, 111257. [Google Scholar] [CrossRef]
  26. Yang, Z.; Xu, K.; Zhao, H.; Su, B. Joint application of Crested Porcupine Optimizer and hybrid models in short-term wind power load forecasting. Electr. Power Syst. Res. 2025, 247, 111814. [Google Scholar] [CrossRef]
  27. Dragomiretskiy, K.; Zosso, D. Variational Mode Decomposition. IEEE Trans. Signal Process 2014, 62, 531–544. [Google Scholar] [CrossRef]
  28. Katipoglu, O.M. Integration of extreme learning machines with CEEMDAN and VMD techniques in the prediction of the multiscalar standardized runoff index and standardized precipitation evapotranspiration index. Nat. Hazards 2024, 120, 825–849. [Google Scholar] [CrossRef]
  29. Zhang, Y.; Pan, G.; Chen, B.; Han, J.; Zhao, Y.; Zhang, C. Short-term wind speed prediction model based on GA-ANN improved by VMD. Renew. Energy 2020, 156, 1373–1388. [Google Scholar] [CrossRef]
  30. Wang, L.; Li, X.; Bai, Y. Short-term wind speed prediction using an extreme learning machine model with error correction. Energy Convers. Manag. 2018, 162, 239–250. [Google Scholar] [CrossRef]
  31. Huo, M.; Yan, W.; Ren, G.; Li, R. A novel hybrid model based on modal decomposition and error correction for building energy consumption prediction. Energy 2024, 294, 130811. [Google Scholar] [CrossRef]
  32. Li, G.; Wei, X.; Yang, H. A method for accurate prediction of photovoltaic power based on multi-objective optimization and data integration strategy. Appl. Math. Model. 2024, 136, 115643. [Google Scholar] [CrossRef]
Figure 1. CPO Algorithm Flowchart.
Figure 1. CPO Algorithm Flowchart.
Atmosphere 16 01334 g001
Figure 2. CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM Flowchart.
Figure 2. CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM Flowchart.
Atmosphere 16 01334 g002
Figure 3. AQI Time Series Plot.
Figure 3. AQI Time Series Plot.
Atmosphere 16 01334 g003
Figure 4. Pearson Correlation Coefficient Heatmap.
Figure 4. Pearson Correlation Coefficient Heatmap.
Atmosphere 16 01334 g004
Figure 5. CEEMDAN Decomposition Results Diagram.
Figure 5. CEEMDAN Decomposition Results Diagram.
Atmosphere 16 01334 g005
Figure 6. CEEMDAN-SE-CPO-VMD Secondary Decomposition Diagram.
Figure 6. CEEMDAN-SE-CPO-VMD Secondary Decomposition Diagram.
Atmosphere 16 01334 g006
Figure 7. Prediction Fitting Plot.
Figure 7. Prediction Fitting Plot.
Atmosphere 16 01334 g007
Figure 8. Error Percentage Plot.
Figure 8. Error Percentage Plot.
Atmosphere 16 01334 g008
Figure 9. Model Evaluation Based on Taylor Diagram.
Figure 9. Model Evaluation Based on Taylor Diagram.
Atmosphere 16 01334 g009
Figure 10. LSSVM Error Correction Process.
Figure 10. LSSVM Error Correction Process.
Atmosphere 16 01334 g010
Table 1. Descriptive Statistics of AQI, Pollutants, and Meteorological Parameters.
Table 1. Descriptive Statistics of AQI, Pollutants, and Meteorological Parameters.
VariableMeanStd. Dev.Min25%50%75%MaxSkewness
AQI48.8425.44731.043.059.01501.299
PM2.5 ( μ g / m 3 )20.8413.84111.018.028.0831.269
PM10 ( μ g / m 3 )39.8622.66322.037.053.01400.901
SO2 ( μ g / m 3 )2.780.7322.03.03.0185.287
NO2 ( μ g / m 3 )16.578.27410.015.021.8531.007
O3 ( μ g / m 3 )101.9054.90861.093.0131.82990.836
CO ( mg / m 3 )0.450.1700.30.40.610.577
Temperature (°C)28.174.821724.827.631.2420.425
Humidity (%)60.4425.08939.063.083.0100−0.257
Pressure ( hPa )1001.844.62986999.01002.11005.31011−0.707
Wind Speed ( m / s )2.241.2401.42.02.871.028
Table 2. Computer Environment Configuration.
Table 2. Computer Environment Configuration.
Experimental EnvironmentSpecific Settings
GPUNVIDIA GeForce RTX 4060 Laptop GPU/Memory: 8 GB
CPUAMD Ryzen 7 7735H with Radeon Graphics
Memory6 GB 4800 MT/s
Default hard disc512 GB SSD
Operating SystemWindows 11
Table 3. Model Parameter Settings.
Table 3. Model Parameter Settings.
Model NameParameter NameParameter Settings
CEEMDAN-SE-CPO-VMDCEEMDAN Standard Deviation0.2
CEEMDAN Number of Ensembles500
CEEMDAN Maximum Number of Iteration5000
SE dim2
SE r0.2
CPO α Optimisation Range[100, 2000]
CPO k Optimisation Range[2, 10]
CPO Initial Population Size30
Maximum Number of Iterations50
Convergence Rate0.2
Transformer–BiLSTMNumber of Multi-Head Attention Heads4
Key/Value Number of Channels128
Attention MaskCausal
Number of Hidden Units in BiLSTM Layer50
Positional Encoding Embedding Dimension256
Table 4. SE Values for Each IMF Component.
Table 4. SE Values for Each IMF Component.
IMF ComponentSEIMF ComponentSE
IMF11.06112IMF70.20416
IMF21.07190IMF80.10191
IMF30.84298IMF90.04570
IMF40.51092IMF100.01462
IMF50.51110IMF110.00360
IMF60.44369
Table 5. Evaluation Metrics for Each Model.
Table 5. Evaluation Metrics for Each Model.
ModelMAEMAPERMSER2
BiLSTM2.68756.8627%3.88920.9427
Transformer2.64636.8830%3.70460.9480
Transformer–BiLSTM2.44456.2439%3.58670.9512
CEEMDAN–SE–Transformer–BiLSTM1.37373.5598%2.03780.9843
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM0.79632.0808%1.05440.9958
Table 6. Evaluation Results After Error Correction.
Table 6. Evaluation Results After Error Correction.
ModelMAEMAPERMSER2
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM0.79632.0808%1.05440.9958
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM–LSSVM0.71021.8543%0.91540.9968
Table 7. Evaluation Metrics for Each Model Across Different Seasons.
Table 7. Evaluation Metrics for Each Model Across Different Seasons.
SeasonModelMAEMAPERMSER2
SpringBiLSTM6.42087.7047%14.35500.8951
Transformer5.73446.8074%13.45530.9079
Transformer–BiLSTM5.12826.0585%11.87900.9282
CEEMDAN–SE–Transformer–BiLSTM4.02055.1519%9.22800.9567
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM2.78293.9951%4.83750.9881
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM–LSSVM2.58573.7771%4.59780.9892
AutumnBiLSTM5.537912.8260%7.25120.9691
Transformer2.50006.0889%4.23000.9895
Transformer–BiLSTM2.33965.8509%3.91830.9910
CEEMDAN–SE–Transformer–BiLSTM1.46123.9717%2.21650.9971
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM1.00432.6157%1.40950.9988
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM–LSSVM0.99912.5125%1.35990.9995
WinterBiLSTM4.38468.5386%6.49840.9705
Transformer2.84726.4984%4.38090.9866
Transformer–BiLSTM2.37535.0917%3.86210.9896
CEEMDAN–SE–Transformer–BiLSTM1.47433.1120%2.22010.9966
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM1.13292.3427%1.59600.9978
CEEMDAN–SE–CPO–VMD–Transformer–BiLSTM–LSSVM1.05772.3090%1.46630.9985
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cheng, P.; Wei, C.; Zhang, J.; Wang, H. Air Quality Index Forecasting Based on Quadratic Decomposition and Transformer-BiLSTM—A Case Study of Beijing. Atmosphere 2025, 16, 1334. https://doi.org/10.3390/atmos16121334

AMA Style

Cheng P, Wei C, Zhang J, Wang H. Air Quality Index Forecasting Based on Quadratic Decomposition and Transformer-BiLSTM—A Case Study of Beijing. Atmosphere. 2025; 16(12):1334. https://doi.org/10.3390/atmos16121334

Chicago/Turabian Style

Cheng, Peng, Chuanning Wei, Jinhua Zhang, and Haizheng Wang. 2025. "Air Quality Index Forecasting Based on Quadratic Decomposition and Transformer-BiLSTM—A Case Study of Beijing" Atmosphere 16, no. 12: 1334. https://doi.org/10.3390/atmos16121334

APA Style

Cheng, P., Wei, C., Zhang, J., & Wang, H. (2025). Air Quality Index Forecasting Based on Quadratic Decomposition and Transformer-BiLSTM—A Case Study of Beijing. Atmosphere, 16(12), 1334. https://doi.org/10.3390/atmos16121334

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop