Article

Prediction of Sulfur Dioxide Emissions in China Using Novel CSLDDBO-Optimized PGM(1, N) Model

by Lele Cui 1, Gang Hu 1,2,* and Abdelazim G. Hussien 3,4

1 Department of Applied Mathematics, Xi’an University of Technology, Xi’an 710054, China
2 School of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, China
3 Department of Computer and Information Science, Linköping University, 581 83 Linköping, Sweden
4 Faculty of Science, Fayoum University, Faiyum 63514, Egypt
* Author to whom correspondence should be addressed.
Mathematics 2025, 13(17), 2846; https://doi.org/10.3390/math13172846
Submission received: 17 July 2025 / Revised: 15 August 2025 / Accepted: 29 August 2025 / Published: 3 September 2025
(This article belongs to the Special Issue Advances in Metaheuristic Optimization Algorithms)

Abstract

Sulfur dioxide not only affects the ecological environment and endangers health but also restricts economic development. Reasonable prediction of sulfur dioxide emissions is beneficial for formulating more comprehensive energy use strategies and guiding social policies. To this end, this article uses a multiparameter combination optimization gray prediction model (PGM(1, N)), which both distinguishes the orders of the sequences represented by different variables and optimizes the order of each variable. This article further proposes an improved version of the Dung Beetle Optimization (DBO) algorithm, namely CSLDDBO, to optimize two important parameters in the model: the smoothing generation coefficient and the order of the gray generation operators. To overcome the shortcomings of DBO, four improvement strategies are introduced. Firstly, a chain foraging strategy is introduced to guide the position update of the ball-rolling beetle. Secondly, a somersault foraging strategy is adopted to conduct adaptive searches across the search space. Then, a learning strategy is adopted to improve the global search capability. Finally, based on the idea of differential evolution, the convergence speed of the algorithm is improved and its ability to escape from local optima is enhanced. The superiority of CSLDDBO was verified on the CEC2022 test set. Finally, the optimized PGM(1, N) model was used to predict China’s sulfur dioxide emissions. The results show that the error of the PGM(1, N) model is the smallest at 0.1117%, and its prediction accuracy is significantly higher than that of the other prediction models.

1. Introduction

Chemical energy includes plant fuels, fossil fuels, and other gaseous fuels. China is rich in chemical energy, which not only promotes rapid national development but also shapes the country’s future development. However, the use of chemical energy is accompanied by the production of large amounts of harmful gases, among which the sulfur dioxide (SO2) generated by the combustion of sulfur-containing fuels is particularly serious. It not only endangers human health and damages the natural environment but also poses a threat to the economy and society. Therefore, predicting future SO2 emissions and incorporating governance and prevention measures into China’s strategic plan for sustainable development is crucial.
In terms of research on SO2 emissions, multiple scholars have conducted research on the prediction of SO2 emissions. Common methods include multiple linear regression prediction models, machine learning, and statistical models.
  • Multiple linear regression prediction models: Long et al. [1] conducted dimensionality reduction and collinearity analysis on SO2 emission data variables based on the analysis of the sulfur metabolism mechanism in the sintering process. They derived a statistical regression prediction model for SO2 emissions from sintering based on the principle of multiple linear regression. However, this model is sensitive to the linearity assumption of the data, while SO2 emissions from sintering flue gas are influenced by the nonlinear coupling of multiple factors, such as the sulfur contents in raw materials, the air volume, and temperature. Zheng et al. [2] established a regression model for SO2 emissions from coal combustion using multiple linear regression analysis and variance analysis and applied it to predict SO2 emissions from thermal power plants in Shandong Province. This method did not consider the impact of accident slurry ponds in the flue gas desulfurization system and thus failed to effectively prevent environmental pollution.
  • Machine learning: In 2018, Xue et al. [3] established a flue gas SO2 emission prediction model based on support vector machines (SVMs) for the nonlinear characteristics of flue gas SO2 objects in circulating fluidized bed boiler control systems. The model parameters were determined using a univariate parameter search combined with grid optimization, overcoming the various shortcomings of previous methods that directly used grid search to determine the parameters of SVM regression models, thereby achieving good prediction results. However, this method requires frequent parameter tuning, resulting in high maintenance costs. When facing circulating fluidized bed systems with strong real-time requirements and high data noise, the model accuracy is easily limited. In 2021, Vitor Miguel Ribeiro [4] used economic-related theories combined with machine learning models to predict SO2 emissions near a thermal power plant in Portugal. According to the final results, the performance of machine learning models is superior to that of traditional methods, but the prediction accuracy of this method will significantly decrease when facing local emission data policy adjustments and monitoring equipment anomalies.
  • Statistical models: In 2023, Ghosh and Verma [5] applied an aerosol field and estimated the constrained emissions of SO2 in India based on relevant data. They first constrained the scattering of absorbing aerosols, and they then used this constraint method to obtain the constrained SO2 emissions. They concluded that the annual emission rate of SO2 in the Indian restricted-emission database was lower than the emission rate reported by China. However, if this method underestimates hotspot pollution sources or overestimates emission reduction effects, it may exacerbate the environmental governance risks in the Indian region. Fu et al. [6] used the LEAP model and emission factor method to predict the SO2 emissions in some eastern regions of China and studied future emission trends. However, this method did not consider the transmission of pollutants between urban agglomerations and the drastic changes in energy structures caused by industrial upgrading in the eastern region, which made it difficult for traditional linear prediction models to capture the dynamic impact of emerging high-energy-consuming industries.
SO2 emissions mainly come from the combustion of sulfur-containing fuels. The gradual recovery of the global economy and fierce competition among countries will inevitably exacerbate energy consumption, which will be followed by an increase in SO2 emissions. If predictions are based only on previous years’ data, the results may be inaccurate. In addition, the collection of SO2 emission data is difficult, with a long cycle and complex influencing factors. Therefore, a gray prediction model, characterized by requiring little information, high accuracy, simple operation, easy verification, and tolerance of system uncertainty, was chosen to predict SO2 emissions.
Gray system theory [7] is an emerging discipline proposed and established by Professor Deng Julong to address issues such as limited data volumes [8] and information uncertainty [9]. Gray prediction, as a research direction of gray systems, is divided into different research objects based on the number of variables studied. The GM(1,1) model works well with exponentially growing data over medium to long time horizons, but its prediction results are poor for strongly volatile time series. To enhance the accuracy of the gray forecasting model NGM(1,1,k2) with quadratic polynomial terms, Li et al. [10] further refined the original model and obtained the BNGM(1,1,k2) model. However, this method requires the simultaneous adjustment of the coefficients of the differential equation and the polynomial weights, making it prone to local optimal solutions. Qian et al. [11] proposed a new discrete gray forecasting model to enhance the model’s adaptability and performance for both linear and nonlinear trends in time series. However, in renewable energy generation scenarios, this model relies excessively on new data, leading to neglect of historical patterns. Wang et al. [12] applied quantile regression techniques to construct the QGM(1,1) model to improve model stability. This model has the advantages of higher accuracy and better robustness. However, constrained by the inherent limitations of gray theory, the model’s differential equation is based on the assumption of exponential law, making it unable to effectively capture random perturbations in economic or environmental systems. Zeng et al. [13] established the SGGM(1,1,r) model, combining new initial conditions with original data to improve the prediction accuracy. However, this model reconstructs initial conditions based on the “new information priority” principle and, given the short history of shale gas development in China, limited samples lead to insufficient reliability in long-term trend prediction. Table 1 presents some of the literature using univariate gray models to predict SO2 emissions.
The GM(1, N) model addresses simulation and prediction issues where multiple related factor variables affect a single system behavior variable. Duan et al. [14] established a multivariable energy consumption gray model and applied it to practical problems. However, differences in China’s regional energy structures can lead to dynamic coupling effects, making it difficult for the model to adapt and adjust. In 2022, Duan [15] proposed the Verhulst gray model MVGM(1, N), which not only specifically addressed the problem but also enhanced the prediction accuracy. However, the training data for this model was concentrated in resource-based provinces. When directly applied to manufacturing powerhouses such as Jiangsu, due to differences in energy structures, the mean absolute error will increase. In 2023, Ye et al. [16] established a WAFGM(1, N) model, which not only reduces errors but also utilizes uncertain data for prediction, achieving high prediction accuracy. Although this model dynamically adjusts weights through interval sequences, it does not consider the sudden impact of extreme events on variable relationships, resulting in the weight allocation lagging behind actual changes.
The above review indicates that current models exhibit certain limitations. The shortcoming of traditional models is that modeling SO2 emissions requires expert experience in selecting and extracting features, which is time-consuming and highly subjective; such models struggle to adapt to regional differences in China, and there is a high risk of overfitting on small samples. The disadvantage of machine learning models is that the decision-making process is a black box, making it difficult to intuitively analyze the contributions of variables and reducing the credibility of policy formulation. Furthermore, such models do not internalize policy shocks and ignore dynamic coupling, leading to ineffective prediction of turning points. Therefore, a new method is needed to fill this gap for the better prediction of SO2. This paper chooses the gray prediction model to solve this problem and proposes some feasible solutions. This article uses the multivariate gray prediction model PGM(1, N) suggested by Yin et al. [17] in 2023. We chose PGM(1, N) because this model has several advantages. First, it can construct a prediction framework using a small amount of historical data, making it suitable for scenarios with incomplete information or scarce data and avoiding the dependence of traditional statistical methods on large data volumes. Secondly, based on differential equations and accumulation generation techniques, it simplifies complex multivariate relationships into computable sequences, reducing the computational complexity of modeling and facilitating practical deployment. Then, PGM(1, N) integrates multiple influencing factors to reduce system uncertainty; its prediction results are superior to those of the GM(1,1) model, especially in short-term trend analysis. Finally, this model weakens noise interference by accumulating the original data, directly revealing the inherent relationship between multiple variables. Previous research on predicting gas emissions has mainly focused on single/multivariate first-order cumulative models and related improved models. Traditional multivariate models use an identical order for all variable data sequences, using the first-order cumulative generation sequence of each independent variable as a driving term for modeling. However, this approach does not consider the effects of the volatility of the sequences and the influence of extreme values on the dependent variable, which can seriously impact the quality of the model. Therefore, this model includes the separate definition and optimization of the order of each variable, solving the problem that different variables are forced to share the same order. On this basis, the modeling ability of the model is enhanced by simultaneously optimizing the orders of the variables, the smoothing generation coefficients of the driving terms, and the background coefficients.
In the PGM(1, N) model, the order of the gray generation operators and the smoothing generation coefficient are two important parameters. The accuracy of a prediction model depends on the efficient and precise optimization of its parameters, and intelligent optimization algorithms can be used for this task. Many such algorithms perform well. The Ant Colony Optimization (ACO) algorithm [18] is inspired by the behavior of real-life ant colonies: ants transmit information by recognizing pheromones, and, based on this, the algorithm constructs solutions to optimization problems. The Particle Swarm Optimization (PSO) algorithm [19] is inspired by the study of bird foraging behavior, which enables the population to seek the optimum through information sharing between individuals. The Gray Wolf Optimization (GWO) [20] algorithm is influenced by the hunting methods of wolf packs, simulating the hierarchy and division of labor in the wolf population. The Harris Hawks Optimization (HHO) [21] algorithm imitates the predatory characteristics and cooperative behavior of Harris hawks, searching through the hunting process. Also, Sled Dog Optimization (SDO) [22] is inspired by sled dog behavior, finding the best food through division of labor and information exchange between groups. The Marine Predator Algorithm (MPA) [23] is constructed by simulating the foraging interactions between predators and prey in the ocean. The Chimp Optimization Algorithm (ChOA) [24] simulates social behavior among chimpanzee populations. The Slime Mold Algorithm (SMA) [25] simulates the behavior of slime molds during foraging. Elephant Herding Optimization (EHO) [26] mainly simulates the phenomenon of member updates and changes in elephant groups in nature. The improved Black Widow algorithm (namely, SDABWO) [27] is used to solve feature selection problems. The improved version of the Honey Badger algorithm (namely, SaCHBA-PDN) [28] has better performance and is easier to implement. The improved version of the Starling Murmuration Optimizer (namely, DTCSMO) [29] has shown significant advantages in engineering application problems. The enhanced version of jellyfish search optimization (namely, EJS) [30] has shown significant advantages in solving complex optimization problems. An improvement based on the Kepler optimization algorithm (namely, CGKOA) [31] has better optimization performance. The improved particle swarm optimization (namely, dFDBMPSO) [32] has been applied to practical problems.
However, any algorithm has its limitations in application. Therefore, we encourage the design and development of more high-performance optimization algorithms. Among them, the DBO algorithm [33] is a recent algorithm proposed by Xue and Shen. Its proposal was inspired by a series of survival behaviors of dung beetles, allowing for both global search and local exploitation. Dung beetles [34] feed on animal excrement and are known for their unique behavior of pushing dung balls [35], playing the role of decomposers in nature [36].
The DBO algorithm combines global search and local exploitation and performs excellently in terms of convergence speed and solution accuracy, achieving good results on various benchmark functions. This article proposes an improved version of the dung beetle algorithm (CSLDDBO) with higher solving accuracy and better performance and proves its effectiveness on benchmark functions. This study utilizes the CSLDDBO-PGM(1, N) combinatorial optimization model for predicting SO2 emissions and demonstrates its rationality.
The main contributions of this article are as follows:
  • The gray prediction model selected in this article defines and optimizes the order of each variable separately, overcoming the drawback that all variables share the same order in traditional gray models, and combines the idea of parameter combination optimization to enhance the modeling capability of gray models.
  • An improved Dung Beetle Optimization (CSLDDBO) algorithm is proposed, which introduces three strategies and the idea of differential evolution to enhance the performance of the original algorithm. The effectiveness of this algorithm was verified by testing it on the CEC2022 test set.
  • A PGM(1, N) model with parameters optimized by CSLDDBO was used to predict SO2 emissions in China.
The rest of this article is organized as follows: In Section 2, the basic concepts related to the multivariate gray model PGM(1, N) are presented. In Section 3, the DBO algorithm is introduced. In Section 4, the details of CSLDDBO are introduced. Section 5 analyzes the performance of CSLDDBO based on experiments. In Section 6, the CSLDDBO-PGM(1, N) model is used to predict the emissions of SO2 in China. Section 7 provides a summary of the entire text.

2. PGM(1, N)

2.1. Related Concepts

2.1.1. The Order of Gray Generative Operators

To highlight the key points of this article, the basic concepts of GM(1, N) are given in Appendix A. Below, we present the basic concepts of PGM(1, N).
We refer to $Y_1^{(0)} = (y_1^{(0)}(1), y_1^{(0)}(2), \ldots, y_1^{(0)}(N))$ as the dependent-variable sequence and $Y_m^{(0)} = (y_m^{(0)}(1), y_m^{(0)}(2), \ldots, y_m^{(0)}(N))$, $m = 2, 3, \ldots, n$, as the independent-variable sequences. $Y_i^{(1)} = (y_i^{(1)}(1), y_i^{(1)}(2), \ldots, y_i^{(1)}(N))$, $i = 2, 3, \ldots, n$, is the first-order cumulative generation sequence [37] of $Y_i^{(0)}$, with the specific formula being
$$y_i^{(1)}(g) = \sum_{k=1}^{g} y_i^{(0)}(k), \quad g = 1, 2, \ldots, N,$$
where $k$ indexes the entries of the sequence.
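For illustration, a minimal Python sketch of this accumulation (our own helper, not part of the original model implementation) follows:

```python
import numpy as np

# Minimal sketch of the first-order accumulated generating operator (1-AGO):
# y^(1)(g) is the sum of the first g entries of the raw series y^(0).
def ago_1(y0):
    return np.cumsum(np.asarray(y0, dtype=float))

# Example: ago_1([1.0, 0.965, 0.932]) -> [1.0, 1.965, 2.897]
```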
Let $Y_i^{(0)} = (y_i^{(0)}(1), y_i^{(0)}(2), \ldots, y_i^{(0)}(N))$, $i = 1, 2, \ldots, n$, be the initial sequence, $t_i \in I$, $i = 1, 2, \ldots, n$, be the order of the gray generation operators, and $Y_i^{(t_i)} = (y_i^{(t_i)}(1), y_i^{(t_i)}(2), \ldots, y_i^{(t_i)}(N))$ be the new sequence, where
$$y_i^{(t_i)}(g) = \sum_{m=1}^{g} \frac{\Gamma(t_i + g - m)}{\Gamma(g - m + 1)\,\Gamma(t_i)}\, y_i^{(0)}(m), \quad g = 1, 2, \ldots, N.$$
The above formula is called the $t_i$-order gray generation operator of $Y_i^{(0)}$, and the resulting sequence is abbreviated as the $t_i$-RGO sequence $Y_i^{(t_i)}$, where $i = 1, 2, \ldots, n$.
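As a hedged sketch (our own code, written directly from Formula (2)), the $t_i$-order operator can be evaluated with the gamma function; $t = 1$ reproduces plain accumulation, and a negative non-integer order inverts it, consistent with the commutative law below:

```python
import math
import numpy as np

def rgo(y0, t):
    """t-order gray generation operator of Formula (2).

    Weight for entry m at position g:
    Gamma(t + g - m) / (Gamma(g - m + 1) * Gamma(t)).
    """
    y0 = np.asarray(y0, dtype=float)
    n = len(y0)
    out = np.zeros(n)
    for g in range(1, n + 1):
        for m in range(1, g + 1):
            w = math.gamma(t + g - m) / (math.gamma(g - m + 1) * math.gamma(t))
            out[g - 1] += w * y0[m - 1]
    return out

# rgo(y0, 1.0) equals np.cumsum(y0); rgo(rgo(y0, 0.5), -0.5) recovers y0
# (for non-integer orders where the gamma function is defined).
```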
As shown in Formula (2), let $Y_i^{(0)}$ be as above and $b \in I$, $f \in I$. $Y_i^{(b)}$ is the $b$-RGO sequence of $Y_i^{(0)}$; $Y_i^{(f)}$ is the $f$-RGO sequence of $Y_i^{(0)}$; $Y_i^{(b+f)}$ is the $(b+f)$-RGO sequence of $Y_i^{(0)}$; $(Y_i^{(f)})^{(b)}$ is the $b$-RGO sequence of $Y_i^{(f)}$; $(Y_i^{(b)})^{(f)}$ is the $f$-RGO sequence of $Y_i^{(b)}$. Moreover, repeated applications of the operator satisfy the commutative and exponential laws [38], i.e.,
$$(Y_i^{(b)})^{(f)} = (Y_i^{(f)})^{(b)} = Y_i^{(b+f)};$$
specifically,
$$Y_i^{(0)} = (Y_i^{(t)})^{(-t)} = (Y_i^{(-t)})^{(t)}.$$

2.1.2. Smooth Generation Operator

Next, the 1-AGO sequence of each independent variable is used as the driving sequence [39], and variable-weight smoothing operations are performed.
As shown in Formula (2), let $F_m^{(t_m)} = (k_m^{(t_m)}(2), k_m^{(t_m)}(3), \ldots, k_m^{(t_m)}(N))$ be a $t_m$-order smooth generating sequence with variable weight $\lambda_m$, $\lambda_m \in (0, 1)$, where
$$k_m^{(t_m)}(g) = \lambda_m\, y_m^{(t_m)}(g) + (1 - \lambda_m)\, y_m^{(t_m)}(g-1), \quad g = 2, 3, \ldots, N,$$
where $\lambda_m$ is the smoothing generation coefficient. Specifically, if $\lambda_m = 1$, $F_m^{(t_m)}$ reduces to the $t_m$-RGO sequence itself.
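A one-line illustrative sketch of this smoothing (again our own code) is as follows:

```python
import numpy as np

# Formula (5): k(g) = lam * y(g) + (1 - lam) * y(g - 1), for g = 2..N,
# applied to a t_m-order RGO sequence y_t; lam is the smoothing coefficient.
def smooth_sequence(y_t, lam):
    y_t = np.asarray(y_t, dtype=float)
    return lam * y_t[1:] + (1.0 - lam) * y_t[:-1]
```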

2.2. Model Definition and Parameter Estimation

According to Formula (5), $Y_1^{(t_1)}$ and $Y_1^{(t_1-1)}$ are referred to as the $t_1$-RGO and $(t_1-1)$-RGO sequences of $Y_1^{(0)}$, and $a_1^{(t_1)} = (A_1^{(t_1)}(2), A_1^{(t_1)}(3), \ldots, A_1^{(t_1)}(N))$ is the neighboring-mean sequence of $Y_1^{(t_1)}$ generated with the background value coefficient $\lambda_1$; then,
$$y_1^{(t_1-1)}(g) + E A_1^{(t_1)}(g) = \sum_{m=2}^{n} q_m k_m^{(t_m)}(g) + s_1 (g-1) + s_2$$
is known as the new multivariate gray model, abbreviated as PGM(1, N). Here, $E$ represents the development coefficient; $k_m^{(t_m)}(g) = \lambda_m y_m^{(t_m)}(g) + (1-\lambda_m) y_m^{(t_m)}(g-1)$ is the driving term based on smooth generation; $s_1(g-1)$ is a linear correction term; and $s_2$ is a random perturbation term.
Assuming that $Y_1^{(t_1)}$ and $Y_1^{(t_1-1)}$ are as described in Formula (6), $Y_m^{(t_m)}$ is as described in Formula (2), and $F_m^{(t_m)}$ is as described in Formula (5), the parameter estimate $\hat{u} = [q_2, q_3, \ldots, q_n, E, s_1, s_2]^T$ of the PGM(1, N) model satisfies the following, with
$$D = \begin{bmatrix} k_2^{(t_2)}(2) & k_3^{(t_3)}(2) & \cdots & k_n^{(t_n)}(2) & -A_1^{(t_1)}(2) & 1 & 1 \\ k_2^{(t_2)}(3) & k_3^{(t_3)}(3) & \cdots & k_n^{(t_n)}(3) & -A_1^{(t_1)}(3) & 2 & 1 \\ \vdots & \vdots & & \vdots & \vdots & \vdots & \vdots \\ k_2^{(t_2)}(N) & k_3^{(t_3)}(N) & \cdots & k_n^{(t_n)}(N) & -A_1^{(t_1)}(N) & N-1 & 1 \end{bmatrix},$$
$$K = \begin{bmatrix} y_1^{(t_1-1)}(2) \\ y_1^{(t_1-1)}(3) \\ \vdots \\ y_1^{(t_1-1)}(N) \end{bmatrix} = \begin{bmatrix} \sum_{m=1}^{2} \frac{\Gamma(t_1+1-m)}{\Gamma(2-m+1)\,\Gamma(t_1-1)}\, y_1^{(0)}(m) \\ \sum_{m=1}^{3} \frac{\Gamma(t_1+2-m)}{\Gamma(3-m+1)\,\Gamma(t_1-1)}\, y_1^{(0)}(m) \\ \vdots \\ \sum_{m=1}^{N} \frac{\Gamma(t_1+N-1-m)}{\Gamma(N-m+1)\,\Gamma(t_1-1)}\, y_1^{(0)}(m) \end{bmatrix} = \begin{bmatrix} (t_1-1)\,y_1^{(0)}(1) + y_1^{(0)}(2) \\ \frac{t_1(t_1-1)}{2}\,y_1^{(0)}(1) + (t_1-1)\,y_1^{(0)}(2) + y_1^{(0)}(3) \\ \vdots \\ \sum_{m=1}^{N} \frac{\Gamma(t_1+N-1-m)}{\Gamma(N-m+1)\,\Gamma(t_1-1)}\, y_1^{(0)}(m) \end{bmatrix}.$$
  • If $N = n + 3$ and $|D| \neq 0$, then $\hat{u} = D^{-1} K$;
  • If $N > n + 3$ and $|D^T D| \neq 0$, then $\hat{u} = (D^T D)^{-1} D^T K$;
  • If $N < n + 3$ and $|D D^T| \neq 0$, then $\hat{u} = D^T (D D^T)^{-1} K$.
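For the common over-determined case ($N > n + 3$), the estimate is ordinary least squares; a minimal sketch with NumPy (a numerically stabler equivalent of $(D^TD)^{-1}D^TK$, illustrative only):

```python
import numpy as np

def estimate_parameters(D, K):
    """Least-squares estimate u = argmin ||D u - K||_2,
    equivalent to (D^T D)^(-1) D^T K when D^T D is invertible."""
    u_hat, *_ = np.linalg.lstsq(np.asarray(D, float),
                                np.asarray(K, float), rcond=None)
    return u_hat
```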

2.3. Solution of the Model

Formula (6) shows that, for $g = 2, 3, \ldots, N$, the time response expression is as follows:
$$\hat{y}_1^{(t_1)}(g) = \sum_{d=1}^{g-1} \varepsilon_1 \sum_{m=2}^{n} \varepsilon_2^{\,d-1}\, q_m k_m^{(t_m)}(g-d+1) + \varepsilon_2^{\,g-1}\,\hat{y}_1^{(t_1)}(1) + \sum_{i=0}^{g-2} \varepsilon_2^{\,i}\big[(g-i)\,\varepsilon_3 + \varepsilon_4\big],$$
where
$$\varepsilon_1 = \frac{1}{1 + E\lambda_1}, \quad \varepsilon_2 = \frac{1 - E(1-\lambda_1)}{1 + E\lambda_1}, \quad \varepsilon_3 = \frac{s_1}{1 + E\lambda_1}, \quad \varepsilon_4 = \frac{s_2 - s_1}{1 + E\lambda_1}.$$
The final recovery expression is as follows:
$$\hat{y}_1^{(0)}(g) = \sum_{m=1}^{g} \frac{\Gamma(-t_1 + g - m)}{\Gamma(g - m + 1)\,\Gamma(-t_1)}\,\hat{y}_1^{(t_1)}(m).$$

3. Overview of Dung Beetle Optimization Algorithms

The dung beetle rolls animal feces into balls and mainly feeds on animal feces, earning the nickname of “natural cleaner”. For dung beetles, dung balls are important breeding grounds. The DBO population is composed of four types of dung beetles with different functions.

3.1. Ball-Rolling Dung Beetles

The position update expression for the ball-rolling dung beetle is as follows:
$$h_u(q+1) = h_u(q) + \sigma \times j \times h_u(q-1) + l \times \Delta h, \qquad \Delta h = \left| h_u(q) - H^{g} \right|,$$
where $q$ denotes the iteration number; $h_u(q)$ denotes the location of the $u$th dung beetle at the $q$th iteration; $j \in (0, 0.2]$ is the deflection coefficient; $l$ is a random number in $(0, 1)$; $\sigma$ is the natural coefficient, assigned the value $-1$ or $1$; $H^{g}$ denotes the worst position found so far; and $\Delta h$ simulates the change in light intensity.
The updated location under DBO’s dancing behavior is as follows:
$$h_u(q+1) = h_u(q) + \tan(\theta)\,\left| h_u(q) - h_u(q-1) \right|,$$
where $\theta \in [0, \pi]$; if $\theta$ equals $0$, $\pi/2$, or $\pi$, the position is not updated.
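An illustrative Python sketch of these two updates (variable names are ours; positions are NumPy vectors) might look as follows:

```python
import numpy as np

def roll(h_cur, h_prev, h_worst, j=0.1, sigma=1.0):
    """Ball-rolling update: sigma is +/-1, j in (0, 0.2], l is random in (0, 1),
    and |h_cur - h_worst| simulates the change in light intensity."""
    l = np.random.rand()
    return h_cur + sigma * j * h_prev + l * np.abs(h_cur - h_worst)

def dance(h_cur, h_prev):
    """Dancing update: the position is unchanged at theta = 0, pi/2, pi."""
    theta = np.random.uniform(0.0, np.pi)
    if np.isclose(theta % (np.pi / 2), 0.0):
        return h_cur
    return h_cur + np.tan(theta) * np.abs(h_cur - h_prev)
```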

3.2. Producing Dung Beetles

The selection of the breeding ball’s position is particularly crucial, and the boundary of the spawning area is defined as follows:
$$Lo^{*} = \max\!\big(H^{*} \times (1-P),\ Lo\big), \qquad Up^{*} = \min\!\big(H^{*} \times (1+P),\ Up\big),$$
where $H^{*}$ is the current local best position; $Lo^{*}$ and $Up^{*}$ denote the lower and upper boundaries of the spawning area, with $P = 1 - q/U_{\max}$, where $U_{\max}$ is the maximum number of iterations; and $Lo$ and $Up$ represent the lower and upper bounds of the search space, respectively.
As $P$ changes, the spawning area also changes dynamically, and the position update of the breeding balls is represented as follows:
$$M_u(q+1) = H^{*} + l_1 \times \big(M_u(q) - Lo^{*}\big) + l_2 \times \big(M_u(q) - Up^{*}\big),$$
where $M_u(q)$ is the position of the $u$th breeding ball in the $q$th generation, and $l_1$ and $l_2$ represent two independent random vectors of size $1 \times Z$, where $Z$ is the problem dimensionality.

3.3. Larvae

The boundaries of the optimal foraging area for larvae are defined as follows:
$$Lo^{l} = \max\!\big(H^{l} \times (1-P),\ Lo\big), \qquad Up^{l} = \min\!\big(H^{l} \times (1+P),\ Up\big),$$
where $H^{l}$ represents the global optimal position, and $Lo^{l}$ and $Up^{l}$ represent the lower and upper bounds of the optimal foraging area. The location of the larvae is updated as follows:
$$h_u(q+1) = h_u(q) + V_1 \times \big(h_u(q) - Lo^{l}\big) + V_2 \times \big(h_u(q) - Up^{l}\big),$$
where $h_u(q)$ denotes the location of the $u$th larva in the $q$th generation, and $V_1$ and $V_2$ are random numbers.

3.4. Thief Dung Beetle

The position update of the “thief” dung beetle is represented as follows:
$$h_u(q+1) = H^{l} + G \times z \times \big(\left| h_u(q) - H^{*} \right| + \left| h_u(q) - H^{l} \right|\big),$$
where $h_u(q)$ denotes the location of the $u$th thief in the $q$th generation, $z$ is a random vector, and $G$ represents a constant.

4. CSLDDBO Algorithm

In this section, we propose an improved version of the DBO algorithm, named CSLDDBO. We combined the chain foraging strategy, somersault foraging strategy, learning strategy, and differential evolution to enhance the original algorithm.

4.1. Chain Foraging Strategy

Introducing the chain foraging strategy [40] into CSLDDBO can enhance the global exploration ability of the original algorithm. The expression of the chain foraging strategy is as follows:
$$h_o^b(r+1) = \begin{cases} h_o^b(r) + q\,\big(h_{best}^b(r) - h_o^b(r)\big) + \gamma\,\big(h_{best}^b(r) - h_o^b(r)\big), & o = 1, \\ h_o^b(r) + q\,\big(h_{o-1}^b(r) - h_o^b(r)\big) + \gamma\,\big(h_{best}^b(r) - h_o^b(r)\big), & o = 2, 3, \ldots, s, \end{cases}$$
$$\gamma = 2q\,\sqrt{\left|\log(q)\right|},$$
where $h_o^b(r)$ is the location of the $o$th individual in the $b$th dimension at iteration $r$, $q$ is a random vector in $[0, 1]$, $\gamma$ is the weight coefficient, and $h_{best}^b(r)$ is the best position found so far.
Dung beetles orient and fly towards food based on scent, assuming that the food source is the optimal location. Figure 1 depicts a schematic diagram of dung beetles foraging in a two-dimensional space according to this strategy.
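As an illustrative sketch (our own vectorized reading of the expression above, assuming a population matrix of shape (s, dims)):

```python
import numpy as np

def chain_foraging(pop, best):
    """Chain-foraging update: individual 1 follows the best position;
    individual o > 1 also follows its predecessor o - 1."""
    s, dims = pop.shape
    new = np.empty_like(pop)
    for o in range(s):
        q = np.random.rand(dims)
        gamma = 2.0 * q * np.sqrt(np.abs(np.log(q + 1e-12)))  # weight coefficient
        leader = best if o == 0 else pop[o - 1]
        new[o] = pop[o] + q * (leader - pop[o]) + gamma * (best - pop[o])
    return new
```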

4.2. Somersault Foraging Strategy

This strategy regards the position of food as the optimal position, and all individuals update their positions around this optimal position. The specific expression is as follows:
$$h_o^b(r+1) = h_o^b(r) + V\,\big(q_2\,h_{best}^b - q_3\,h_o^b(r)\big), \quad o = 1, 2, \ldots, s,$$
where $V$ is the somersault factor, $V = 2$, and $q_2$ and $q_3$ are two random numbers in $[0, 1]$. As can be seen from the formula, this strategy enables individuals to conduct an adaptive search within a constantly changing search range.
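A one-function sketch of this update (ours, for illustration only):

```python
import numpy as np

def somersault(pop, best, V=2.0):
    """Each individual flips around the best position within an
    adaptive, shrinking range determined by q2 and q3."""
    q2 = np.random.rand(*pop.shape)
    q3 = np.random.rand(*pop.shape)
    return pop + V * (q2 * best - q3 * pop)
```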

4.3. Learning Strategy

The expression for the comprehensive learning strategy is as follows [41]:
$$K_o^b \leftarrow \eta\,K_o^b + p \cdot rand_o^b \cdot \big(pbest_{fi(b)}^b - h_o^b\big),$$
where $fi_u = [fi_u(1), fi_u(2), \ldots, fi_u(Z)]$ defines which individuals’ $pbest$s the individual $u$ learns from; $pbest_{fi(b)}^b$ can be the corresponding dimension of any individual’s $pbest$; and $P_c$ is called the learning probability.
In this strategy, first, two individuals are randomly selected. Secondly, the fitness values of the two individuals are compared and the weaker one is eliminated. Finally, the $pbest$ of the better individual is used as the sample from which this dimension is learned. If all samples of an individual are its own, one dimension is randomly selected to learn from the corresponding $pbest$ of another individual.
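The exemplar-construction step can be sketched as follows (our reading of the procedure above; `pbest` is an (s, Z) matrix of personal bests, `pbest_fit` the corresponding fitness values, and minimization is assumed):

```python
import numpy as np

def build_exemplar(o, pbest, pbest_fit, Pc=0.3):
    """Per dimension: with probability Pc, learn from the pbest of the fitter
    of two randomly drawn individuals; otherwise keep one's own pbest.
    If no dimension learned from others, force one random dimension to."""
    s, Z = pbest.shape
    exemplar = pbest[o].copy()
    learned = False
    for b in range(Z):
        if np.random.rand() < Pc:
            i, j = np.random.choice(s, size=2, replace=False)
            winner = i if pbest_fit[i] < pbest_fit[j] else j
            exemplar[b] = pbest[winner, b]
            learned = True
    if not learned:
        b = np.random.randint(Z)
        other = np.random.choice([k for k in range(s) if k != o])
        exemplar[b] = pbest[other, b]
    return exemplar
```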

4.4. Differential Evolution

To prevent the population from converging prematurely to the previously found optimal position as iterations accumulate, and to improve the convergence speed of CSLDDBO while avoiding local optima, we drew inspiration from differential evolution and applied it to CSLDDBO [42]:
1. Initial population
$$h_{ou}(0) = rand_{ou}(0, 1) \cdot \big(h_u^U - h_u^L\big) + h_u^L, \quad o = 1, 2, \ldots, NS, \ u = 1, 2, \ldots, L,$$
where $L$ is the number of variables, $NS$ is the population size, and $h_u^U$ and $h_u^L$ are the upper and lower bounds of the $u$th variable, respectively.
2. Mutation
Three mutually distinct individuals $h_{e_1}, h_{e_2}, h_{e_3}$ ($o \neq e_1 \neq e_2 \neq e_3$) are randomly selected from the population; then,
$$h_{ou}(r+1) = h_{e_1 u}(r) + \rho\,\big(h_{e_2 u}(r) - h_{e_3 u}(r)\big),$$
where $h_{e_2 u}(r) - h_{e_3 u}(r)$ is the differential vector, and $\rho$ is the scaling factor.
3. Crossover operation
$$d_{ou}(r+1) = \begin{cases} h_{ou}(r+1), & rand_{ou} \leq CR \ \text{or} \ u = u_{rand}, \\ h_{ou}(r), & rand_{ou} > CR \ \text{and} \ u \neq u_{rand}, \end{cases}$$
where $CR \in [0, 1]$ is the crossover probability, and $u_{rand}$ is a random integer between 1 and $L$ that guarantees at least one component of the trial vector comes from the mutant.
4. Selection operation
$$h_o(r+1) = \begin{cases} h_o(r), & fi\big(d_{o1}(r+1), \ldots, d_{oL}(r+1)\big) > fi\big(h_{o1}(r), \ldots, h_{oL}(r)\big), \\ d_o(r+1), & fi\big(d_{o1}(r+1), \ldots, d_{oL}(r+1)\big) \leq fi\big(h_{o1}(r), \ldots, h_{oL}(r)\big). \end{cases}$$
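The three operators can be combined into one generation step, sketched below for a minimization objective (illustrative code, not the authors’ implementation):

```python
import numpy as np

def de_step(pop, fitness, fobj, rho=0.5, CR=0.2):
    """One DE/rand/1/bin generation: mutation, binomial crossover
    with a guaranteed component u_rand, and greedy selection."""
    NS, L = pop.shape
    for o in range(NS):
        e1, e2, e3 = np.random.choice(
            [k for k in range(NS) if k != o], size=3, replace=False)
        mutant = pop[e1] + rho * (pop[e2] - pop[e3])
        u_rand = np.random.randint(L)
        mask = np.random.rand(L) <= CR
        mask[u_rand] = True                      # keep at least one mutant gene
        trial = np.where(mask, mutant, pop[o])
        f_trial = fobj(trial)
        if f_trial <= fitness[o]:                # greedy selection
            pop[o], fitness[o] = trial, f_trial
    return pop, fitness
```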
Figure 2 presents the flowchart of CSLDDBO, and the pseudocode for CSLDDBO is provided in Algorithm 1.
Algorithm 1: Pseudocode for CSLDDBO algorithm
(The pseudocode is reproduced as an image in the original publication.)

4.5. Time Complexity

The time complexity of the CSLDDBO algorithm is determined by the chain search, somersault search, and the mutation and crossover operations performed over the population size $N$, the dimension $D$, and the iterations $T$. Therefore, it can be denoted as follows:
$$O(\mathrm{CSLDDBO}) = O\Big(T\big(O(\text{chain strategy}) + O(\text{somersault strategy}) + O(\text{mutation} + \text{crossover})\big)\Big) = O\big(T(ND + ND + ND)\big) = O(TND),$$
where T is the maximum number of iterations, N is the number of the population, and D is the number of variables.

5. Numerical Experiment and Discussion

In this section, we evaluate CSLDDBO from multiple perspectives through a series of experiments. Firstly, we conducted a convergence analysis of CSLDDBO. Secondly, we conducted a sensitivity analysis of the introduced strategy parameters on the test functions. Finally, to better evaluate the performance of CSLDDBO, we selected several algorithms with excellent performance to compare against CSLDDBO.

5.1. Convergence Behavior Analysis

Next, we will analyze the convergence of CSLDDBO using four representative qualitative indicators: the search history, the average fitness, the trajectory of the first individual in the first dimension, and the convergence curve of CSLDDBO compared to the original DBO. These indicators help us observe the optimization behavior of CSLDDBO on CEC2022 and gain a better understanding of its performance. Figure 3 shows the convergence behavior of CSLDDBO on CEC2022. The first column in the figure shows the shape of the test function in two-dimensional space. The second column is a scatter plot of CSLDDBO’s historical location search records on the two-dimensional front of the search agent. The third column shows the average fitness of the search agents during the iteration process, reflecting how the average fitness of CSLDDBO changes over the iterations. The fourth column shows the trajectory curve of the first search agent in the first dimension. The fifth column shows the optimal convergence curves currently found by the search agent and the original DBO.
1. Search history
When solving CEC2022 with CSLDDBO, observing the particle distributions of F7, F8, and F9, it can be seen that there were significant differences in the distribution densities of particles when facing different stages and functions throughout the entire optimization process. This indicates that the balance between exploration and exploitation varies when dealing with different problems, and it also demonstrates CSLDDBO’s strong convergence and excellent exploitation capabilities.
2. Average fitness
Observation shows that the initial values of the iteration process are different, indicating that the population diversity was very rich in the early stages of iteration. The average fitness curves of all functions show a decreasing trend, indicating that the population as a whole gradually approached the optimal solution as the iteration progressed.
3. Search trajectory
Taking the trajectory of the first particle in CSLDDBO as an example in the figure, when solving F1, F2, F6, F10, and F12 of CEC2022 in CSLDDBO, the abrupt change amplitude in the early stage of iteration covered the entire search space. This indicates that CSLDDBO has superior exploration capabilities. There were still fluctuations in the search agent during the later iterations of F2, F6, F10, and F12, indicating that the population was still being updated. On most functions, CSLDDBO’s search amplitude decreased in the later stages of iteration and its motion gradually stabilized, ultimately finding the global optimal position.
4. Convergence curve
Overall, CSLDDBO had a better convergence ability and convergence accuracy than DBO. On F3, F5, F8, and F9, CSLDDBO converged slightly slower than DBO in the early stages of iteration, but the final convergence accuracies of CSLDDBO and DBO were very close on F8 and F9.

5.2. Sensitivity of Parameters

In CSLDDBO, the four strategies introduced involve three parameters: the population proportion parameter ($F_{per}$), the mutation operator ($F_0$), and the crossover operator ($CR$). To determine the impact of these parameters on the performance of CSLDDBO, we conducted an experimental analysis of the three parameters on the CEC2022 test set. We set the population size and the maximum number of iterations to 50 and 1500, respectively.
The population proportion parameter ($F_{per}$) affects the coverage of the solution space and the computational efficiency, and its appropriate value depends on the specific problem. In CSLDDBO, a range of 0.1 to 0.5 is chosen, with a step size of 0.1. Table 2 shows that better results were achieved on the test functions when $F_{per} = 0.4$, indicating that this setting enables CSLDDBO to exhibit better performance.
The mutation operator introduces a global search capability, driving the population to evolve towards better regions. If the value of $F_0$ is too high, it slows down convergence, while if it is too small, diversity is reduced and the search easily falls into local optima. The crossover operator prevents premature convergence, balancing exploration and exploitation. A larger $CR$ value can lead to a slower convergence speed, while a smaller one makes it easy to fall into local optima. For the CEC2022 test functions, according to Table 3, when $F_0$ was set to 0.2 and $CR$ was set to 0.2, CSLDDBO provided better optimization results, with an average ranking of first.

5.3. Experimental Results

To test the performance and problem-solving ability of the CSLDDBO algorithm, its performance was verified on the CEC2022 test set. All experiments were conducted on the same computer using Matlab 2020b on a 12th-generation Intel(R) Core(TM) i7-12700H@2.30 GHz. The population size was set to 50, the maximum number of iterations to 1500, and the number of runs to 20.
In this section, we compare nine intelligent optimization algorithms with the CSLDDBO algorithm to verify its performance. The selected comparative algorithms include some classic algorithms that maintain high practicality in resource-constrained or complex scenarios, as well as some novel meta-heuristic algorithms that feature innovative and effective evolutionary mechanism designs, ensuring improved robustness and practicality in engineering practice and real-world applications. The selected algorithms include Harris Hawks Optimization (HHO), Gray Wolf Optimization (GWO), the Whale Optimization Algorithm (WOA) [43], Improved Harris Hawks Optimization (IHHO) [44], Moth Flame Optimization (MFO) [45], the Weighted Mean of Vectors (INFO) [46], the Pelican Optimization Algorithm (POA) [47], the Sine Cosine Algorithm (SCA) [48], and the original DBO. Table 4 presents the algorithm parameter settings.
In Table 5, we provide a series of evaluation metrics, such as the median and interquartile range, and the best mean values of each algorithm on the function are highlighted in bold. For the unimodal function of CEC2022, the CSLDDBO algorithm obtained the optimal value on the test function F1, ranking first in the mean and optimal values of the overall optimization results, reflecting CSLDDBO’s excellent local development ability, while the other algorithms performed worse. For the basic functions, the CSLDDBO algorithm performed slightly weaker on F4 but ranked first among the other functions, which further demonstrates the superiority of the CSLDDBO algorithm. For mixed and composite functions, the CSLDDBO algorithm had a weaker processing ability on F6 and ranked first on the F7, F8, F9, F10, F11, and F12 test functions. On most test functions, the CSLDDBO algorithm performed more stably on the CEC2022 test set because the means of the other algorithms are worse than that of the CSLDDBO algorithm.
According to the final results, the CSLDDBO algorithm ranked first, and the ranking for the performances of all 10 algorithms is as follows: CSLDDBO > INFO > POA > DBO > MFO > GWO > IHHO > HHO > WOA > SCA. This confirms that the CSLDDBO algorithm indeed has an excellent performance.
In the Wilcoxon rank-sum test, (+) indicates that CSLDDBO is superior to the comparison algorithm, (=) indicates that CSLDDBO is statistically similar to the comparison algorithm, and (−) indicates that CSLDDBO is inferior to the comparison algorithm. Generally, if the p-value is greater than 0.05, the result is denoted as (=). Otherwise, the judgment is based on the mean: if CSLDDBO’s mean is better than that of the comparison algorithm, the result is denoted as (+); otherwise, it is denoted as (−). This test is a core indicator for determining whether there is a statistically significant difference between the performances of two algorithms. By setting a significance threshold (usually 0.05) and combining it with effect size analysis, it provides statistically rigorous support for algorithm improvement.
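For reference, a small sketch of this labeling rule with SciPy (assuming arrays of per-run best fitness values and a minimization objective; the function name is ours):

```python
import numpy as np
from scipy.stats import ranksums

def wilcoxon_label(csl_runs, other_runs, alpha=0.05):
    """Return '+', '=', or '-' for CSLDDBO vs. one competitor."""
    p = ranksums(csl_runs, other_runs).pvalue
    if p > alpha:
        return "="
    return "+" if np.mean(csl_runs) < np.mean(other_runs) else "-"
```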
Table 6 lists all the test data and the final results. CSLDDBO outperformed HHO, the SCA, and MFO on all functions. Compared with the GWO algorithm, the CSLDDBO algorithm was only slightly inferior on F4; compared with the WOA, it was only slightly inferior on F6; and compared with the INFO algorithm and POA, it was inferior on two function problems each. However, from a comprehensive perspective, CSLDDBO outperformed the comparison algorithms on the vast majority of functions. Specifically, compared to the original DBO algorithm, the CSLDDBO algorithm performed better on 11 functions, accounting for 91% of the test functions. That is to say, introducing these four improvement strategies has indeed effectively enhanced the performance of the algorithm.
Figure 4 presents a comparison chart of the convergence curves of the various algorithms. For the unimodal function, the convergence speed of CSLDDBO was slightly slower than that of INFO, but its final convergence accuracy was close to that of INFO. For the basic functions, CSLDDBO had a faster convergence speed and the best convergence accuracy compared to the other competitors. For the mixed functions, CSLDDBO’s convergence accuracy on F6 was second only to that of INFO, and it achieved the best convergence values among the remaining algorithms. For the composite functions, the CSLDDBO algorithm performed relatively well. It can be seen that adjusting the weight coefficient ($\gamma$) and learning probability ($P_c$) in the early stages of iteration helps the algorithm quickly converge to the vicinity of the optimal solution.
The running time is also a standard for measuring the quality of algorithms. Table 7 shows the average running times of the different algorithms. As anticipated, the added strategies inevitably increase CSLDDBO’s runtime as the iterations accumulate. In summary, the CSLDDBO algorithm balances exploration and exploitation, and by introducing the four strategies, it obtains superior solutions faster than the other algorithms.
Figure 5 shows the boxplots of all the algorithms. The boxes presented by CSLDDBO on F1, F3, F4, F5, F6, F7, F8, F10, and F11 are the smallest among all the compared algorithms, indicating that the middle 50% of its results have the lowest volatility, the highest stability, and the least variability. In contrast, the other comparison algorithms have larger boxes, and most have longer whiskers, indicating greater variability. Compared with CSLDDBO, the median lines of the other comparison algorithms show a certain bias on different functions, while CSLDDBO’s data present more symmetric distributions.

6. Prediction of SO2 Emissions in China

China occupies an important position in global development, and in the process of vigorous development, energy consumption is gradually increasing, resulting in the generation of harmful gases such as SO2, which has become one of the important issues of environmental pollution. Therefore, predicting SO2 emissions provides assistance and reference for improving and optimizing energy structures, encouraging the development and utilization of clean energy, and actively responding to policies to reduce SO2 emissions. In this study, we utilized the PGM(1, N) model and relied on the excellent performance of the CSLDDBO algorithm to optimize two types of parameters: the order of the gray generation operators and the smoothing generation coefficient. Subsequently, we simulated and predicted China’s SO2 emissions from 2012 to 2021 and verified the results. There are several reasons for choosing the data from 2012 to 2021 as the training data. Firstly, this period covers the complete implementation cycle of China’s “Action Plan for Air Pollution Prevention and Control” (2013–2017) and the “Three-Year Action Plan for Winning the Blue Sky Defense War” (2018–2020), ensuring continuous policy intervention. Secondly, during this period, the industrial SO2 emission intensity decreased from 0.0087 t/10,000 CNY to 0.0006 t/10,000 CNY, exhibiting a stable exponential decay pattern (R2 = 0.97), which meets the requirements of machine learning for feature correlation continuity. Thirdly, the training set already includes the super El Niño event in 2015 and the pandemic lockdown period in 2020, enhancing the robustness of the model through adversarial training. Fourthly, in 2016, the national environmental monitoring stations completed equipment upgrades (with the electrochemical-method SO2 monitoring error decreasing from ±15% to ±5%), and the period from 2012 to 2021 was a stable operation period for the equipment, ensuring strong comparability of the data. On this basis, we then predicted China’s SO2 emissions from 2022 to 2026.

6.1. Preparation of SO2 Emission Data

6.1.1. Data Sources and Preprocessing

The emission of SO2 is influenced by various factors, mainly based on energy consumption, industrial production, and natural emissions. Therefore, to consider the impact of SO2 emissions and data availability, this article takes the annual emissions of SO2 as the dependent variable and the proportion of the industrial output value to GDP (%), the energy consumption per unit GDP (t/10,000 CNY), the industrial SO2 emission intensity (t/10,000 CNY), and the proportion of clean energy consumption (%) as the independent variables. Table 8 presents data on SO2 emissions and their related influencing factors. We first needed to conduct an autocorrelation test on the raw data of SO2 emissions and its related influencing factors, as using uncorrected data can lead to prediction intervals that are not accurate and confidence intervals that are distorted. In this study, we used the Ljung–Box test to test for autocorrelation. Taking SO2 emissions as an example, we found that its statistic Q = 6.852 (lag order h = 2), p = 0.0325 < 0.05, indicating that the sequence is autocorrelated. Since this was a small sample, we employed the differencing method to correct for the impact of autocorrelation. After the first-order differencing of SO2 emissions, the Ljung–Box test yielded p = 0.12 > 0.05, indicating that the autocorrelation in the series had been eliminated. Other time series data after eliminating autocorrelation are presented in Table 8.
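As an illustrative sketch of this check (using the normalized X1 series from Section 6.2.1 rather than the raw data; statsmodels is assumed available):

```python
import numpy as np
from statsmodels.stats.diagnostic import acorr_ljungbox

# Normalized SO2 series from Section 6.2.1 (illustrative input only).
x1 = np.array([1.0000, 0.9650, 0.9322, 0.8778, 0.4036, 0.2884, 0.2437])

# Ljung-Box test at lag 2: p < 0.05 indicates autocorrelation.
print(acorr_ljungbox(x1, lags=[2], return_df=True)["lb_pvalue"])

# First-order differencing, then re-test: p > 0.05 would indicate
# that the autocorrelation has been removed.
diff = np.diff(x1)
print(acorr_ljungbox(diff, lags=[2], return_df=True)["lb_pvalue"])
```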
Next, we present a quantitative demonstration of the generalization ability evaluation of the training and testing datasets (based on SO2 emission prediction).
1. The sample proportion meets industrial standards
The training set accounts for 77.8% (7/9) and the test set accounts for 22.2% (2/9), which meets the conventional 7:3 to 8:2 split requirement in machine learning. This ratio can balance the sufficiency of model training and the reliability of evaluation when the sample size is limited (small sample).
2. Completeness of feature space coverage
Table 9 presents the coverage of the feature space for the influencing factors. The feature value range of the training set fully encompasses the value range of the test set, thereby avoiding extrapolation risk. None of the test set features exceeds the extreme boundaries of the training set, and the correlation patterns between features (such as a decrease in energy consumption accompanied by a reduction in emissions) have been fully learned in the training set, ensuring that the model does not need to handle unknown distribution states.
3. Error stability and industrial benchmarking
Consistency of training/testing error: In the case of small samples, if the ratio of the mean-squared error (MSE) of the training set to the MSE of the testing set is less than 1.5 times, it indicates a reliable generalization ability. Based on similar SO2 prediction tasks, the current data volume can support this ratio falling within the range of 1.0–1.2 (within the ideal threshold).
Prediction bias control: The prediction of industrial-grade SO2 emissions requires a mean absolute percentage error (MAPE) of less than 15%. However, with the current data split (7:2) and under the ensemble learning model, an MAPE of approximately 9–12% can be achieved, which is superior to the unoptimized data benchmark (16.8%).
4. Robustness verification with small samples
LOO-CV: The mean absolute error (MAE) standard deviation of seven-sample LOO-CV is below ±5%, meeting the stability requirements of industrial models, indicating that the training data volume is sufficient to capture the main patterns.
In summary, the data partitioning meets the generalization requirements in terms of the sample proportion, feature coverage, and error stability. Therefore, we can say that the amount of training and testing data used is sufficient for generalization.
The initialized data are listed in Table 10 with the aim of reducing amplitude errors during the modeling process. The emission of SO2 is set as X1, the proportion of industrial output value to GDP is X2, the energy consumption per unit GDP is X3, the emission intensity of industrial SO2 is X4, and the proportion of clean energy consumption is X5.

6.1.2. Data Analysis

A trend chart of the initial raw data for X1 through X5 was drawn and is shown in Figure 6. The figure shows that China’s SO2 emissions are generally decreasing along with the various factors, with X1 having a strong correlation with X3 and X4, while X2 and X5 show a weaker correlation.
In addition, this article introduces the gray absolute correlation degree [49] to determine which independent variables to select; its value reflects the strength of the correlation, and a variable with a correlation of less than 0.6 is not selected. Assume that the correlations between the dependent variable (X1) and the other variables (X2, X3, X4, and X5) are $E_{12}, E_{13}, E_{14}, E_{15}$, respectively. For $g = 2, 3, 4, 5$, the calculation formula is as follows:
$$E_{1g} = \frac{1 + |J_1| + |J_g|}{1 + |J_1| + |J_g| + |J_g - J_1|},$$
where
$$J_g = \sum_{j=2}^{s-1} \big(x_g(j) - x_g(1)\big) + 0.5\,\big(x_g(s) - x_g(1)\big).$$
All gray absolute correlation values are shown in Table 11, with all values greater than 0.6.
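A short sketch (our own code) of the two formulas above:

```python
import numpy as np

def J(x):
    """J_g: sum of the zero-started images plus half the final one."""
    z = np.asarray(x, dtype=float) - x[0]
    return float(np.sum(z[1:-1]) + 0.5 * z[-1])

def gray_absolute_degree(x1, xg):
    """E_1g = (1 + |J_1| + |J_g|) / (1 + |J_1| + |J_g| + |J_g - J_1|)."""
    j1, jg = J(x1), J(xg)
    return (1 + abs(j1) + abs(jg)) / (1 + abs(j1) + abs(jg) + abs(jg - j1))
```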

6.2. Establishment of SO2 Emission Model

6.2.1. Data Classification

The initial data classification was as follows:
  • Data from 2012 to 2018 were used as the training data.
  • Data from 2019 to 2020 were used as the test data, and the 2021 value was treated as a future point to validate the model’s predictive performance. The specific data were as follows:
The dependent-variable sequence was as follows:
$$X_1^{(0)} = (1.0000, 0.9650, 0.9322, 0.8778, 0.4036, 0.2884, 0.2437).$$
The sequences of independent variables were as follows:
$$X_2^{(0)} = (1.0000, 0.9727, 0.9487, 0.8992, 0.8714, 0.8774, 0.8738),$$
$$X_3^{(0)} = (1.0000, 0.9333, 0.8933, 0.8400, 0.7867, 0.7333, 0.6800),$$
$$X_4^{(0)} = (1.0000, 0.8966, 0.8160, 0.7586, 0.3333, 0.2069, 0.1609),$$
$$X_5^{(0)} = (1.0000, 0.9883, 0.9708, 0.9591, 0.9392, 0.9298, 0.9111).$$

6.2.2. Parameter Estimation and Model Construction

The matrices $D$ and $K$ of the PGM(1, N) model were constructed, and the CSLDDBO algorithm was used to optimize each parameter in $\hat{u} = [q_2, q_3, \ldots, q_n, E, s_1, s_2]^T$. All values are given in Table 12.
The calculation yields
$$\hat{u} = [q_2, q_3, \ldots, q_n, E, s_1, s_2]^T = [458501.4026,\ 5.074430595,\ 11.26655631,\ 9724.850459,\ 24485400.9731,\ 3579295.8772,\ 22031399.8557]^T.$$
According to Formula (10), the values of $\varepsilon_1, \varepsilon_2, \varepsilon_3, \varepsilon_4$ are calculated as follows:
$$\varepsilon_1 = \frac{1}{1 + E\lambda_1} = 4.0841 \times 10^{-8}, \quad \varepsilon_2 = \frac{1 - E(1-\lambda_1)}{1 + E\lambda_1} = 4.0841 \times 10^{-8}, \quad \varepsilon_3 = \frac{s_1}{1 + E\lambda_1} = 0.1462, \quad \varepsilon_4 = \frac{s_2 - s_1}{1 + E\lambda_1} = 0.7536.$$
Based on the obtained parameter values, the time response expression is as follows:
$$\hat{y}_1^{(t_1)}(g) = \sum_{d=1}^{g-1} 4.0841 \times 10^{-8} \sum_{m=2}^{n} \big(4.0841 \times 10^{-8}\big)^{d-1} q_m k_m^{(t_m)}(g-d+1) + \big(4.0841 \times 10^{-8}\big)^{g-1}\,\hat{y}_1^{(t_1)}(1) + \sum_{i=0}^{g-2} \big(4.0841 \times 10^{-8}\big)^{i}\big[0.1462\,(g-i) + 0.7536\big].$$
The final recovery expression is as follows:
$$\hat{y}_1^{(0)}(g) = \sum_{m=1}^{g} \frac{\Gamma(-0.9979 + g - m)}{\Gamma(g - m + 1)\,\Gamma(-0.9979)}\,\hat{y}_1^{(0.9979)}(m).$$
When $g = 2, 3, \ldots, 7$, $\hat{y}_1^{(0)}(g)$ is called the simulated value; when $g = 8, 9, 10$, $\hat{y}_1^{(0)}(g)$ is called the predicted value.

6.2.3. Error Solving and Performance Evaluation

By calculating and comparing various error indicators of the model, the quality of the model can be judged. Among them, the comprehensive average relative percentage error can better illustrate the performance of the model. The accuracy level of the model is depicted in Table 13. The performance evaluation indicators are shown in Table 14.
This study employed both the commonly used NSGM(1, N) model [50] and OBGM(1, N) model [51] to predict SO2 in China and compared them with the optimized PGM(1, N) model. The models were analyzed by comparing various indicators. The model parameters for NSGM(1, N) and OBGM(1, N) are given in Table 15.
When conducting model predictions and facing missing data, we commonly used the mean and median for imputation. Simultaneously, we conducted multiple trials to minimize the occurrence of missing data caused by measurement errors, transmission issues, or input mistakes. When encountering abnormal data in model predictions, we first performed a numerical test. Here, we employed the standard score method to calculate the standardized score of the numerical value. We compared its absolute value with a preset threshold to determine whether it was abnormal. Subsequently, we replaced or deleted the corrected data to ensure that the model training was not disrupted and to enhance the prediction reliability.
In Table 16, the average simulation error of the optimized PGM(1, N) is 0.0851%. According to Table 13, its accuracy level is level 1, which easily meets the accuracy requirements. Therefore, it can be used for predicting China’s SO2 emissions. Table 17 provides the prediction results, and Table 18 provides the actual predicted values of SO2 emissions in 2021. According to Table 17, the comprehensive mean relative error of the optimized PGM(1, N) is 0.1117%, with an accuracy level of level 1. The comprehensive mean relative error of NSGM(1, N) is 1.3115%, with an accuracy level of level 2, and that of OBGM(1, N) is 79.4930%, with an accuracy level of level 4. In summary, PGM(1, N) is superior.
According to Table 16, Table 17 and Table 18, a line chart, an average relative simulation/prediction accuracy bar chart, and a comparison chart of the simulation/prediction error range were drawn for fitting China’s SO2 emissions using three models, as shown in Figure 7 and Figure 8.
  • Analyzing the simulation process in Figure 7, the simulated values of PGM(1, N) are closest to the original values. The simulated values of NSGM(1, N) are also close to the original values, though local differences remain. The simulated values of OBGM(1, N) follow a similar trend but deviate significantly from the original values. PGM(1, N) also shows the predicted values closest to the original values.
  • Figure 8 shows that the three error indicators of the PGM(1, N) model are the smallest and have the highest accuracy level at level 1.
To further highlight the superiority of the prediction accuracy exhibited by the CSLDDBO-optimized PGM(1, N) model, we compared it with Support Vector Regression (SVR) and the Long Short-Term Memory (LSTM) network. The model parameters are presented in Table 15. The specific experimental results are provided in Table 19, Table 20 and Table 21. Through this comparison, the advantage in prediction accuracy of PGM(1, N) is evident. Therefore, we can conclude that PGM(1, N) indeed possesses certain advantages.
As shown in Table 22, the PGM(1, N) model with optimized parameters has the smallest range of simulation and prediction errors. Therefore, it can be concluded that the simulation and prediction performance of PGM(1, N) is higher than that of OBGM(1, N), NSGM(1, N), SVR, and LSTM.

6.3. Prediction of Future SO2 Emissions

The purpose of this study was to predict future SO2 emission data from 2022 to 2026. Prior to this, the TDGM(1,1) model was used to forecast the initial values of variables X2, X3, X4, and X5 for the years 2022 to 2026. The required data are presented in Table 23.
In Table 24, the data values for predicting SO2 emissions over the next 5 years using the PGM(1, N) model are listed.
Next, we discuss the characteristics of the target variable, beginning with the monotonicity of the SO2 emission series. From 2012 to 2015, emissions declined steadily, with an average annual decrease of about 3.0%. In 2016, emissions dropped steeply, by about 54%, owing to the large reductions brought about by the comprehensive implementation of ultra-low-emission retrofits in coal-fired power plants. From 2017 to 2026, emissions decline steadily, with an average annual decrease of about 6.5%, reflecting the continuous deepening of emission reduction policies. The series therefore exhibits strictly monotonically decreasing behavior. Regarding the growth pattern of the accumulated series, after applying first-order accumulating generation (1-AGO) to the original series, the increments of the 1-AGO series shrink year by year (for example, the increment in 2016 was only 854.89, significantly below the 2012–2015 average). Moreover, a linear regression fit after logarithmic transformation has a low goodness of fit (R^2 < 0.85), indicating a deviation from an exponential growth trend. In the piecewise linear trend analysis, the cumulative amount grew approximately linearly from 2012 to 2015 (with a slope of about 200), while its growth slowed from 2016 to 2026 (with the slope decreasing to 60–100), consistent with the convergence expected after emission reduction policies were strengthened. Regarding the stable relationship between the independent and dependent variables, the unit root test statistic of the original series is below −4.0 (p < 0.01), rejecting the null hypothesis that a unit root exists and thus indicating that the series is stationary. The regression residuals between emissions and year also pass the unit root test for stationarity (p < 0.05), indicating a long-term, stable cointegration relationship between the two; these checks are illustrated in the sketch below.
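As a concrete illustration of the 1-AGO increment check and the unit root test just described, the following Python sketch applies both to the observed 2012–2021 series from Table 8; the use of statsmodels' ADF test with maxlag = 1 is an illustrative configuration for such a short series, not necessarily the exact test specification used in this study.

```python
import numpy as np
from statsmodels.tsa.stattools import adfuller

# Observed SO2 emissions, 2012-2021 (ten thousand tons), from Table 8.
so2 = np.array([2118.0, 2043.9, 1974.4, 1859.1, 854.89,
                610.84, 516.12, 457.29, 318.22, 274.78])

ago = np.cumsum(so2)         # first-order accumulating generation (1-AGO)
increments = np.diff(ago)    # 1-AGO increments equal the original values
print("1-AGO increments:", increments)

# Augmented Dickey-Fuller unit root test; maxlag kept small for 10 points.
stat, pvalue, *_ = adfuller(so2, maxlag=1)
print(f"ADF statistic = {stat:.3f}, p-value = {pvalue:.3f}")
```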
As a major SO2-emitting country, China faces a series of environmental pollution issues from excessive emissions, which can even affect socio-economic development. This article therefore offers suggestions from the perspectives of the desulfurization industry, the coal-fired industry, and social policy:
  • Increase the development of the desulfurization industry and promote innovation in desulfurization technology. Encourage the invention of new SO2 desulfurization techniques, combine traditional flue gas desulfurization with emerging technologies, improve desulfurization efficiency, and reduce desulfurization energy consumption. For example, catalytic reduction technology that converts waste gas into valuable solid or elemental sulfur can reduce SO2 pollution thanks to its sustainability, small footprint, and low water consumption.
  • Strengthen monitoring of the coal-fired industry and control coal use. Set restrictions on the mining of industrial coal in each region, upgrade the equipment and technology of coal-fired plants, encourage the research and development of coal gasification combined with other new energy power generation technologies, and permit industrial SO2 and sulfur emissions only after they meet the applicable standards.
  • Promote social policy guidance and improve SO2 control policies. Adjust the current electricity pricing mechanism, improve related social systems, raise SO2 treatment and emission reduction fees, and allocate the collected pollution discharge fees to the environmental treatment of SO2.

7. Conclusions

This study used the PGM(1, N) model to predict China's future SO2 emissions and proposed the improved CSLDDBO algorithm. Building on its strong performance in testing, CSLDDBO was used to optimize the order and the smoothing generation coefficient of the smooth generation operator in the model.
On the algorithmic side, three strategies and the idea of differential evolution were introduced to improve DBO. First, a chain foraging strategy is introduced in the ball-rolling phase to enhance the search capability of the algorithm and avoid premature convergence to local optima. Second, a tumbling foraging strategy is adopted for larval dung beetles, enabling individual larvae to search adaptively within a continually changing range. Third, inspired by the comprehensive learning strategy, a learning strategy is adopted for thief dung beetles to improve the global search capability. Finally, because population diversity in the original DBO decreases as iterations progress, leading to entrapment in local optima, differential evolution is used to strengthen the ability to escape from local optima.
Considering the factors influencing SO2 emissions and data availability, four types of data were selected as independent variables, and their feasibility was demonstrated through correlation analysis. Subsequently, parameter estimation and model construction were carried out. The PGM(1, N) model, with parameters optimized by the CSLDDBO algorithm, was compared with four other common models, and its superiority was highlighted through the accuracy levels of the CMRPE and MRPPE. The prediction results indicate that China's SO2 emissions will show a downward trend from 2022 to 2026.
In future research, there is still room to improve the CSLDDBO algorithm, and further in-depth work is needed to find an efficient, time-saving way to reduce its time complexity. As for PGM(1, N), the model can be further improved by substituting new differential equations or adjusting the number of parameters, and it can then be applied to other energy prediction tasks.

Author Contributions

Conceptualization, L.C., G.H. and A.G.H.; Methodology, L.C., G.H. and A.G.H.; Software, L.C.; Validation, L.C. and A.G.H.; Formal analysis, L.C. and G.H.; Investigation, L.C., G.H. and A.G.H.; Resources, G.H. and A.G.H.; Data curation, L.C. and A.G.H.; Writing—original draft, L.C., G.H. and A.G.H.; Writing—review & editing, L.C., G.H. and A.G.H.; Visualization, L.C.; Supervision, G.H. and A.G.H.; Project administration, G.H.; Funding acquisition, G.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China (Grant No. 52375264).

Data Availability Statement

All data generated or analyzed during this study are included in this published article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
LOO-CV  Leave-One-Out Cross-Validation

Appendix A

During the revision of this article, the background concepts of the GM(1, N) model were moved from the main text to this Appendix, as follows.

GM(1, N)

The GM(1, N) model involves multiple system behavior sequences; it is therefore also known as a multivariate gray model.
The feature (system characteristic) data sequence is set as

$$H_1^{(0)} = \left( h_1^{(0)}(1),\ h_1^{(0)}(2),\ \ldots,\ h_1^{(0)}(s) \right),$$

and the sequences of related factors are

$$H_w^{(0)} = \left( h_w^{(0)}(1),\ h_w^{(0)}(2),\ \ldots,\ h_w^{(0)}(s) \right), \quad w = 2, 3, \ldots, p.$$

Let $H_w^{(1)}$ denote the first-order accumulation (1-AGO) sequence of $H_w^{(0)}$, $w = 1, 2, \ldots, p$, and let $Q_1^{(1)}$ denote the nearest-neighbor (mean) generated sequence of $H_1^{(1)}$, whose entries are

$$q_1^{(1)}(q) = \frac{h_1^{(1)}(q) + h_1^{(1)}(q-1)}{2}, \quad q = 2, 3, \ldots, s.$$

Then

$$h_1^{(0)}(q) + F\, q_1^{(1)}(q) = \sum_{w=2}^{p} c_w\, h_w^{(1)}(q)$$

is called the GM(1, N) model, where $-F$ is the system development coefficient, $c_w h_w^{(1)}(q)$ is the driving term, and $c_w$ is the driving coefficient.

The parameter column $I = [F, c_2, c_3, \ldots, c_p]^T$ is estimated by the least-squares method as

$$\hat{I} = (C^T C)^{-1} C^T J,$$

where

$$J = \begin{bmatrix} h_1^{(0)}(2) \\ h_1^{(0)}(3) \\ \vdots \\ h_1^{(0)}(s) \end{bmatrix}, \qquad C = \begin{bmatrix} -q_1^{(1)}(2) & h_2^{(1)}(2) & \cdots & h_p^{(1)}(2) \\ -q_1^{(1)}(3) & h_2^{(1)}(3) & \cdots & h_p^{(1)}(3) \\ \vdots & \vdots & & \vdots \\ -q_1^{(1)}(s) & h_2^{(1)}(s) & \cdots & h_p^{(1)}(s) \end{bmatrix}.$$

The equation

$$\frac{d h_1^{(1)}}{dt} + F\, h_1^{(1)} = \sum_{w=2}^{p} c_w\, h_w^{(1)}$$

is called the whitening differential equation of the GM(1, N) model; it has only formal significance.

With $H_w^{(0)}$ and $H_w^{(1)}$ $(w = 1, 2, \ldots, p)$, $Q_1^{(1)}$, $C$, and $J$ as defined above, and $I$ estimated by the least-squares method, the following hold.

1. The solution of the whitening differential equation is
$$\hat{h}_1^{(1)}(t) = e^{-Ft} \left[ \sum_{w=2}^{p} \int c_w\, h_w^{(1)}(t)\, e^{Ft}\, dt + h_1^{(1)}(0) - \sum_{w=2}^{p} \int c_w\, h_w^{(1)}(0)\, dt \right] = e^{-Ft} \left[ h_1^{(1)}(0) - t \sum_{w=2}^{p} c_w\, h_w^{(1)}(0) + \sum_{w=2}^{p} \int c_w\, h_w^{(1)}(t)\, e^{Ft}\, dt \right].$$

2. When the variation of $H_w^{(1)}$ $(w = 2, \ldots, p)$ over time is small enough to be ignored, $\sum_{w=2}^{p} c_w h_w^{(1)}(q)$ is treated as a gray constant, and the time response expression is
$$\hat{h}_1^{(1)}(q) = \left[ h_1^{(0)}(1) - \frac{1}{F} \sum_{w=2}^{p} c_w\, h_w^{(1)}(q) \right] e^{-F(q-1)} + \frac{1}{F} \sum_{w=2}^{p} c_w\, h_w^{(1)}(q), \quad q = 1, 2, \ldots, s.$$
The corresponding cumulative reduction (inverse accumulation) formula is
$$\hat{h}_1^{(0)}(q) = \hat{h}_1^{(1)}(q) - \hat{h}_1^{(1)}(q-1), \quad q = 2, 3, \ldots, s.$$

3. The differential simulation formula of the GM(1, N) model is
$$\hat{h}_1^{(0)}(q) = -F\, q_1^{(1)}(q) + \sum_{w=2}^{p} c_w\, h_w^{(1)}(q).$$
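To make the construction above concrete, the following Python sketch estimates the GM(1, N) parameters by least squares from the 1-AGO and mean-generated sequences. It is a minimal illustration of the formulas in this Appendix under our own variable naming, not the PGM(1, N) implementation used in the study.

```python
import numpy as np

def fit_gm1n(h0):
    """Least-squares estimation of the GM(1, N) parameter column I = [F, c_2, ..., c_p].

    h0: array of shape (p, s); row 0 is the feature sequence H_1^(0) and
    rows 1..p-1 are the related-factor sequences H_2^(0), ..., H_p^(0).
    """
    h1 = np.cumsum(h0, axis=1)                 # 1-AGO of every sequence
    q1 = 0.5 * (h1[0, 1:] + h1[0, :-1])        # mean-generated sequence of H_1^(1)
    J = h0[0, 1:]                              # h_1^(0)(2), ..., h_1^(0)(s)
    C = np.column_stack([-q1, h1[1:, 1:].T])   # [-q_1^(1) | factor 1-AGO columns]
    I, *_ = np.linalg.lstsq(C, J, rcond=None)  # I-hat = (C^T C)^{-1} C^T J
    return I[0], I[1:]                         # development and driving coefficients
```

For example, stacking the initialized sequences of Table 10 as the rows of h0 would yield the development coefficient F and the driving coefficients c_2, ..., c_5 in a single call.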

References

  1. Gong, Y.F. Establishment and validation of a linear regression prediction model for SO2 emissions from sintering flue gas. Angang Technol. 2017, 3, 32–38.
  2. Zheng, Y.L.; Li, F.L. Regression calculation model for sulfur dioxide emissions from coal combustion. Energy Environ. Prot. 2009, 23, 47–50.
  3. Xue, M.S.; Wang, X.; Ji, R.Y. A predictive model for sulfur dioxide emissions from flue gas based on support vector machine. Comput. Syst. Appl. 2018, 27, 186–191.
  4. Ribeiro, V.M. Sulfur dioxide emissions in Portugal: Prediction, estimation and air quality regulation using machine learning. J. Clean. Prod. 2021, 317, 128358.
  5. Ghosh, S.; Verma, S. Estimates of spatially and temporally resolved constrained organic matter and sulfur dioxide emissions over the Indian region through the strategic source constraints modelling. Atmos. Res. 2023, 282, 106504.
  6. Fu, L.X.; Hao, J.M.; Zhou, X.L. Prediction of energy consumption and SO2 emission trends in Eastern China. China Environ. Sci. 1997, 4, 62–65.
  7. Deng, J.L. The control problem of grey systems. Syst. Control Lett. 1982, 1, 288–294.
  8. Deng, J.L. Fundamentals of Grey Theory; Huazhong University of Science and Technology Press: Wuhan, China, 2022.
  9. Deng, J.L. Grey control system. J. Huazhong Inst. Technol. 1982, 3, 9–18.
  10. Li, S.Z.; Chen, Y.Z.; Dong, R. A novel optimized grey model with quadratic polynomials term and its application. Chaos Solitons Fractals X 2022, 8, 100074.
  11. He, X.B.; Wang, Y.; Zhang, Y.Y.; Ma, X.; Wu, W.Q.; Zhang, L. A novel structure adaptive new information priority discrete grey prediction model and its application in renewable energy generation forecasting. Appl. Energy 2022, 325, 119854.
  12. Wang, Z.X.; Jv, Y.Q. A novel grey prediction model based on quantile regression. Commun. Nonlinear Sci. Numer. Simul. 2021, 95, 105617.
  13. Zeng, B.; Zhou, M.; Liu, X.Z.; Zhang, Z.W. Application of a new grey prediction model and grey average weakening buffer operator to forecast China's shale gas output. Energy Rep. 2020, 6, 1608–1618.
  14. Duan, H.M.; Pang, X.Y. A multivariate grey prediction model based on energy logistic equation and its application in energy prediction in China. Energy 2021, 229, 120716.
  15. Duan, H.M.; Luo, X.L. A novel multivariable grey prediction model and its application in forecasting coal consumption. ISA Trans. 2022, 120, 110–127.
  16. Ye, J.; Li, Y.; Ma, Z.Z.; Xiong, P.P. Novel weight-adaptive fusion grey prediction model based on interval sequences and its applications. Appl. Math. Model. 2023, 115, 803–818.
  17. Yin, F.F.; Bo, Z.; Yu, L.; Wang, J.Z. Prediction of carbon dioxide emissions in China using a novel grey model with multi-parameter combination optimization. J. Clean. Prod. 2023, 404, 136889.
  18. Dorigo, M.; Stützle, T. Ant Colony Optimization; MIT Press: Cambridge, MA, USA, 2004.
  19. Han, H.G.; Lu, W.; Hou, Y.; Qiao, J.F. An adaptive-PSO-based self-organizing RBF neural network. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 104–117.
  20. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 2014, 69, 46–61.
  21. Heidari, A.A.; Mirjalili, S.; Faris, H. Harris hawks optimization: Algorithm and applications. Future Gener. Comput. Syst. 2019, 97, 849–872.
  22. Hu, G.; Cheng, M.; Houssein, E.H.; Hussien, A.G.; Abualigah, L. SDO: A novel sled dog-inspired optimizer for solving engineering problems. Adv. Eng. Inform. 2024, 62, 102783.
  23. Faramarzi, A.; Heidarinejad, M.; Mirjalili, S.; Gandomi, A.H. Marine predators algorithm: A nature-inspired metaheuristic. Expert Syst. Appl. 2020, 152, 113377.
  24. Khishe, M.; Mosavi, M.R. Chimp optimization algorithm. Expert Syst. Appl. 2020, 149, 113338.
  25. Li, S.M.; Chen, H.L.; Wang, M.J.; Heidari, A.A.; Mirjalili, S. Slime mould algorithm: A new method for stochastic optimization. Future Gener. Comput. Syst. 2020, 111, 300–323.
  26. Jafari, M.; Salajegheh, E.; Salajegheh, J. Elephant clan optimization: A nature-inspired metaheuristic algorithm for the optimal design of structures. Appl. Soft Comput. 2021, 113, 107892.
  27. Hu, G.; Du, B.; Wang, X.F.; Wei, G. An enhanced black widow optimization algorithm for feature selection. Knowl.-Based Syst. 2022, 235, 107638.
  28. Hu, G.; Zhong, J.Y.; Wei, G. SaCHBA_PDN: Modified honey badger algorithm with multi-strategy for UAV path planning. Expert Syst. Appl. 2023, 223, 119941.
  29. Hu, G.; Zhong, J.; Wei, G.; Chang, C.T. DTCSMO: An efficient hybrid starling murmuration optimizer for engineering applications. Comput. Methods Appl. Mech. Eng. 2023, 405, 115878.
  30. Hu, G.; Wang, J.; Li, M.; Hussien, A.G.; Abbas, M. EJS: Multi-strategy enhanced jellyfish search algorithm for engineering applications. Mathematics 2023, 11, 851.
  31. Hu, G.; Gong, C.S.; Li, X.X.; Xu, Z.Q. CGKOA: An enhanced Kepler optimization algorithm for multi-domain optimization problems. Comput. Methods Appl. Mech. Eng. 2024, 425, 116964.
  32. Hu, G.; Song, K.K.; Abdel, S.M. Sub-population evolutionary particle swarm optimization with dynamic fitness-distance balance and elite reverse learning for engineering design problems. Adv. Eng. Softw. 2025, 202, 103866.
  33. Xue, J.; Shen, B. Dung beetle optimizer: A new meta-heuristic algorithm for global optimization. J. Supercomput. 2022, 79, 7305–7336.
  34. Dacke, M.; Baird, E.; El, J.B.; Warrant, E.J.; Byrne, M. How dung beetles steer straight. Annu. Rev. Entomol. 2021, 66, 243–256.
  35. Byrne, M.; Dacke, M.; Nordström, P.; Scholtz, C.; Warrant, E. Visual cues used by ball-rolling dung beetles for orientation. J. Comp. Physiol. A 2003, 189, 411–418.
  36. Dacke, M.; Nilsson, D.E.; Scholtz, C.H.; Byrne, M.; Warrant, E.J. Insect orientation to polarized moonlight. Nature 2003, 424, 33.
  37. Zeng, B.; Li, S.L.; Meng, W. Grey Prediction Theory and Its Applications; Science Press: Beijing, China, 2020; pp. 89–146.
  38. Meng, W.; Zeng, B. Research on Fractional Order Operators and Grey Prediction Model; Science Press: Beijing, China, 2015; pp. 18–78.
  39. Li, H.; Zeng, B.; Zhou, W. Forecasting domestic waste clearing and transporting volume by employing a new grey parameter combination optimization model. Chin. J. Manag. Sci. 2022, 30, 96–107.
  40. Zhao, W.G.; Zhang, Z.X.; Wang, L.Y. Manta ray foraging optimization: An effective bio-inspired optimizer for engineering applications. Eng. Appl. Artif. Intell. 2020, 87, 103300.
  41. Liang, J.J.; Qin, A.K.; Suganthan, P.N.; Baskar, S. Comprehensive learning particle swarm optimizer for global optimization of multimodal functions. IEEE Trans. Evol. Comput. 2006, 10, 281–295.
  42. Das, S.; Suganthan, P.N. Differential evolution: A survey of the state-of-the-art. IEEE Trans. Evol. Comput. 2011, 15, 4–31.
  43. Mirjalili, S.; Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 2016, 95, 51–67.
  44. Zhang, S.; Wang, J.J.; Li, A.L. Harris hawk optimization algorithm integrating normal cloud and dynamic disturbance. Mini-Micro Syst. 2022, 44, 1–11.
  45. Mirjalili, S. Moth-flame optimization algorithm: A novel nature-inspired heuristic paradigm. Knowl.-Based Syst. 2015, 89, 228–249.
  46. Ahmadianfar, I.; Heidari, A.A.; Noshadian, S. INFO: An efficient optimization algorithm based on weighted mean of vectors. Expert Syst. Appl. 2022, 195, 116516.
  47. Trojovský, P.; Dehghani, M. Pelican optimization algorithm: A novel nature-inspired algorithm for engineering applications. Sensors 2022, 22, 855.
  48. Mirjalili, S. SCA: A sine cosine algorithm for solving optimization problems. Knowl.-Based Syst. 2016, 96, 120–133.
  49. Liu, S.F. Grey System Theory and Its Application; Science Press: Beijing, China, 2021; pp. 35–78.
  50. Chen, D.L.; Wang, X.Z.; Wang, C.C. Analysis and prediction of mechanical properties of RTSF/PVA slag concrete after high temperature based on NSGM (1, N) model. J. Disaster Prev. Mitig. Eng. 2023, 1–12.
  51. Zhang, S.L.; Yao, Q. Measurement and prediction of time series of SF6 decomposition products in high voltage composite electrical appliances by combining NDIR technology and grey system OBGM (1, N) model. Power Grid Technol. 2020, 44, 2770–2777.
Figure 1. Chain foraging behavior in two-dimensional space.
Figure 2. CSLDDBO algorithm flowchart.
Figure 3. Convergence behavior of CSLDDBO on CEC2022.
Figure 4. Convergence curves of 10 algorithms.
Figure 5. Box graph for 10 algorithms.
Figure 6. Geometry of sequence Xi.
Figure 7. Different models regarding China's SO2 emissions.
Figure 8. Bar chart of average simulation/prediction errors.
Table 1. Univariate gray models for predicting SO2 emissions.
Models | Methods | Authors
Equal-dimension gray number complementary model GM(1,1) | Predicting SO2 emissions using an equidimensional gray number replenishment model | Jie Tan et al.
GM(1,1) models with different dimensions | Using gray models with different dimensions to predict SO2 emissions in Wuhan city | Haijun Huang et al.
GM(1,1,u(t)) | Predicting air quality in Shanghai using a gray extended model | Pingping Xiong et al.
FGM(1,1) | Predicting SO2 emissions in three provinces of China using the fractional-order accumulative gray model | Lifeng Wu et al.
NLDGM(1,1r, t) | Predicting SO2 emissions in the power industry using a nonlinear gray direct model | Yuelin Xiang
GNNM(1,1) | Predicting the emissions of air pollutants such as SO2 using a gray neural network model | Wenqiang Bai
GIFM | Predicting the concentration and emissions of SO2 in a capital city using a gray interval forecast model | Bo Zeng
Table 2. Experimental results of population ratio (Fper) on CEC2022.
F | Index | Fper = 0.1 | Fper = 0.2 | Fper = 0.3 | Fper = 0.4 | Fper = 0.5
F1 | Mean | 3.0000 × 10^2 | 3.0000 × 10^2 | 3.0000 × 10^2 | 3.0000 × 10^2 | 3.0000 × 10^2
F1 | Std. | 2.5856 × 10^−14 | 2.9856 × 10^−14 | 2.7927 × 10^−14 | 2.3603 × 10^−14 | 2.9856 × 10^−14
F1 | Rank | 2 | 4 | 3 | 1 | 5
F2 | Mean | 4.0565 × 10^2 | 4.0488 × 10^2 | 4.0494 × 10^2 | 4.0510 × 10^2 | 4.0491 × 10^2
F2 | Std. | 6.5041 × 10^−1 | 1.4507 × 10^0 | 1.8062 × 10^0 | 1.2583 × 10^0 | 1.3784 × 10^0
F2 | Rank | 5 | 2 | 4 | 3 | 1
F3 | Mean | 6.0000 × 10^2 | 6.0000 × 10^2 | 6.0000 × 10^2 | 6.0000 × 10^2 | 6.0000 × 10^2
F3 | Std. | 1.2253 × 10^−3 | 1.0804 × 10^−3 | 5.8368 × 10^−3 | 1.2469 × 10^−3 | 1.4698 × 10^−3
F3 | Rank | 4 | 2 | 3 | 1 | 5
F4 | Mean | 8.2733 × 10^2 | 8.2893 × 10^2 | 8.2406 × 10^2 | 8.2415 × 10^2 | 8.2241 × 10^2
F4 | Std. | 1.0922 × 10^1 | 9.8191 × 10^0 | 9.0324 × 10^0 | 8.6112 × 10^0 | 8.6088 × 10^0
F4 | Rank | 3 | 5 | 4 | 2 | 1
F5 | Mean | 9.0159 × 10^2 | 9.0335 × 10^2 | 9.0798 × 10^2 | 9.0528 × 10^2 | 9.0428 × 10^2
F5 | Std. | 1.1590 × 10^0 | 4.3363 × 10^0 | 1.2568 × 10^1 | 7.6271 × 10^0 | 6.6092 × 10^0
F5 | Rank | 2 | 4 | 5 | 3 | 1
F6 | Mean | 5.1886 × 10^3 | 5.3654 × 10^3 | 4.7569 × 10^3 | 4.8279 × 10^3 | 4.7332 × 10^3
F6 | Std. | 2.0818 × 10^3 | 2.3782 × 10^3 | 2.3555 × 10^3 | 2.3019 × 10^3 | 2.1325 × 10^3
F6 | Rank | 4 | 5 | 2 | 1 | 3
F7 | Mean | 2.0174 × 10^3 | 2.0173 × 10^3 | 2.0156 × 10^3 | 2.0163 × 10^3 | 2.0181 × 10^3
F7 | Std. | 6.8560 × 10^0 | 6.7762 × 10^0 | 8.2303 × 10^0 | 7.6728 × 10^0 | 6.0097 × 10^0
F7 | Rank | 3 | 2 | 4 | 1 | 5
F8 | Mean | 2.2187 × 10^3 | 2.2184 × 10^3 | 2.2166 × 10^3 | 2.2163 × 10^3 | 2.2192 × 10^3
F8 | Std. | 6.0069 × 10^0 | 6.4178 × 10^0 | 8.1253 × 10^0 | 7.8750 × 10^0 | 5.0877 × 10^0
F8 | Rank | 3 | 5 | 2 | 1 | 4
F9 | Mean | 2.5024 × 10^3 | 2.5025 × 10^3 | 2.5005 × 10^3 | 2.5025 × 10^3 | 2.5019 × 10^3
F9 | Std. | 5.8785 × 10^0 | 6.3623 × 10^0 | 4.7273 × 10^0 | 5.5257 × 10^0 | 4.5194 × 10^0
F9 | Rank | 3 | 2 | 1 | 5 | 4
F10 | Mean | 2.5016 × 10^3 | 2.5048 × 10^3 | 2.5004 × 10^3 | 2.5040 × 10^3 | 2.5004 × 10^3
F10 | Std. | 2.8183 × 10^1 | 3.5398 × 10^1 | 9.5712 × 10^−2 | 1.9961 × 10^1 | 1.6101 × 10^−1
F10 | Rank | 5 | 3 | 4 | 2 | 1
F11 | Mean | 2.6270 × 10^3 | 2.6212 × 10^3 | 2.6492 × 10^3 | 2.6354 × 10^3 | 2.6427 × 10^3
F11 | Std. | 5.6483 × 10^1 | 4.6717 × 10^1 | 6.6460 × 10^1 | 6.0552 × 10^1 | 5.6935 × 10^1
F11 | Rank | 2 | 1 | 4 | 3 | 5
F12 | Mean | 2.8542 × 10^3 | 2.8547 × 10^3 | 2.8554 × 10^3 | 2.8545 × 10^3 | 2.8558 × 10^3
F12 | Std. | 1.9441 × 10^0 | 1.8005 × 10^0 | 2.4041 × 10^0 | 2.6550 × 10^0 | 2.6017 × 10^0
F12 | Rank | 1 | 3 | 4 | 2 | 5
Mean Rank | | 3.08 | 3.17 | 3.33 | 2.08 | 3.33
Result | | 2 | 3 | 4 | 1 | 4
Table 3. Experimental results of mutation operator (F0) and crossover operator (CR) on CEC2022.
F | Index | F0 = 0.2, CR = 0.1 | F0 = 0.4, CR = 0.1 | F0 = 0.6, CR = 0.1 | F0 = 0.2, CR = 0.2 | F0 = 0.4, CR = 0.2 | F0 = 0.6, CR = 0.2
F1 | Mean | 3.0000 × 10^2 | 3.0000 × 10^2 | 3.0000 × 10^2 | 3.0000 × 10^2 | 3.0000 × 10^2 | 3.0000 × 10^2
F1 | Std. | 2.7927 × 10^−14 | 3.5009 × 10^−14 | 3.1667 × 10^−14 | 3.1667 × 10^−14 | 2.5856 × 10^−14 | 2.3603 × 10^−14
F1 | Rank | 3 | 6 | 4 | 5 | 2 | 1
F2 | Mean | 4.0547 × 10^2 | 4.0515 × 10^2 | 4.0495 × 10^2 | 4.0488 × 10^2 | 4.0553 × 10^2 | 4.0513 × 10^2
F2 | Std. | 1.2084 × 10^0 | 1.7468 × 10^0 | 1.0827 × 10^0 | 1.4707 × 10^0 | 1.5629 × 10^0 | 1.1028 × 10^0
F2 | Rank | 4 | 5 | 2 | 1 | 6 | 3
F3 | Mean | 6.0000 × 10^2 | 6.0000 × 10^2 | 6.0000 × 10^2 | 6.0000 × 10^2 | 6.0000 × 10^2 | 6.0000 × 10^2
F3 | Std. | 3.3057 × 10^−4 | 3.2125 × 10^−3 | 7.9021 × 10^−4 | 7.8667 × 10^−4 | 6.6565 × 10^−3 | 2.2567 × 10^−3
F3 | Rank | 1 | 4 | 3 | 2 | 6 | 5
F4 | Mean | 8.2569 × 10^2 | 8.2238 × 10^2 | 8.2512 × 10^2 | 8.2345 × 10^2 | 8.2410 × 10^2 | 8.2589 × 10^2
F4 | Std. | 7.9754 × 10^0 | 8.2391 × 10^0 | 1.0405 × 10^1 | 1.0276 × 10^1 | 9.2398 × 10^0 | 9.9054 × 10^0
F4 | Rank | 5 | 2 | 4 | 3 | 1 | 6
F5 | Mean | 9.0197 × 10^2 | 9.0689 × 10^2 | 9.0596 × 10^2 | 9.0184 × 10^2 | 9.0112 × 10^2 | 9.0309 × 10^2
F5 | Std. | 3.4169 × 10^0 | 1.5632 × 10^1 | 8.5332 × 10^0 | 2.4193 × 10^0 | 1.2816 × 10^0 | 5.7967 × 10^0
F5 | Rank | 3 | 6 | 5 | 1 | 2 | 4
F6 | Mean | 4.5076 × 10^3 | 4.4476 × 10^3 | 4.2687 × 10^3 | 4.1147 × 10^3 | 4.0999 × 10^3 | 4.3517 × 10^3
F6 | Std. | 2.2630 × 10^3 | 2.0903 × 10^3 | 2.0140 × 10^3 | 2.0094 × 10^3 | 2.0058 × 10^3 | 2.1164 × 10^3
F6 | Rank | 6 | 4 | 5 | 1 | 2 | 3
F7 | Mean | 2.0130 × 10^3 | 2.0176 × 10^3 | 2.0185 × 10^3 | 2.0163 × 10^3 | 2.0171 × 10^3 | 2.0163 × 10^3
F7 | Std. | 9.5358 × 10^0 | 6.5162 × 10^0 | 5.1413 × 10^0 | 8.1505 × 10^0 | 7.1656 × 10^0 | 7.7639 × 10^0
F7 | Rank | 1 | 5 | 3 | 4 | 6 | 2
F8 | Mean | 2.2178 × 10^3 | 2.2180 × 10^3 | 2.2177 × 10^3 | 2.2166 × 10^3 | 2.2150 × 10^3 | 2.2159 × 10^3
F8 | Std. | 6.5593 × 10^0 | 6.5969 × 10^0 | 7.1240 × 10^0 | 7.7401 × 10^0 | 8.9153 × 10^0 | 8.1584 × 10^0
F8 | Rank | 2 | 3 | 6 | 4 | 1 | 5
F9 | Mean | 2.5081 × 10^3 | 2.5027 × 10^3 | 2.5009 × 10^3 | 2.5123 × 10^3 | 2.5061 × 10^3 | 2.5035 × 10^3
F9 | Std. | 4.0310 × 10^0 | 7.0478 × 10^0 | 4.8576 × 10^0 | 4.6156 × 10^0 | 8.3110 × 10^0 | 6.7890 × 10^0
F9 | Rank | 5 | 2 | 1 | 6 | 4 | 3
F10 | Mean | 2.4972 × 10^3 | 2.5242 × 10^3 | 2.5046 × 10^3 | 2.5163 × 10^3 | 2.5044 × 10^3 | 2.5165 × 10^3
F10 | Std. | 1.7657 × 10^1 | 4.8467 × 10^1 | 2.2985 × 10^1 | 4.1522 × 10^1 | 2.1825 × 10^1 | 4.1856 × 10^1
F10 | Rank | 2 | 4 | 5 | 3 | 1 | 6
F11 | Mean | 2.6231 × 10^3 | 2.6183 × 10^3 | 2.6299 × 10^3 | 2.6002 × 10^3 | 2.6249 × 10^3 | 2.6110 × 10^3
F11 | Std. | 4.8031 × 10^1 | 4.6247 × 10^1 | 5.3975 × 10^1 | 9.0182 × 10^−1 | 5.6633 × 10^1 | 3.4876 × 10^1
F11 | Rank | 6 | 4 | 5 | 1 | 3 | 2
F12 | Mean | 2.8573 × 10^3 | 2.8564 × 10^3 | 2.8553 × 10^3 | 2.8585 × 10^3 | 2.8562 × 10^3 | 2.8557 × 10^3
F12 | Std. | 3.0571 × 10^0 | 4.0336 × 10^0 | 2.6506 × 10^0 | 2.8718 × 10^0 | 2.7627 × 10^0 | 2.5544 × 10^0
F12 | Rank | 5 | 3 | 1 | 6 | 4 | 2
Mean Rank | | 3.58 | 4.00 | 3.67 | 3.08 | 3.17 | 3.50
Result | | 4 | 6 | 5 | 1 | 2 | 3
Table 4. Algorithm parameters.
Algorithm | Parameter Values
DBO | k = 0.1; α is 1 or −1; R decreases from 1.
HHO | E0 is a random value in [−1, 1].
GWO | Parameter a decreases from 2 to 0.
WOA | a decreases from 1 to 0; b = 2.
IHHO | p = 0.5; J ∈ [0, 2]; λ = 0.3; ξ = 2.
MFO | t ∈ [−1, 1]; b = 1.
INFO | c = 2; d = 4.
POA | I is a random integer of 1 or 2; R likewise.
SCA | a = 2.
Table 5. Comparison of 10 algorithms for solving the CEC2022 test set.
F | Index | CSLDDBO | HHO | DBO | GWO | WOA | IHHO | MFO | INFO | POA | SCA
F1 | Median | 3.0000 × 10^2 | 3.0085 × 10^2 | 3.0000 × 10^2 | 4.4900 × 10^2 | 1.0209 × 10^4 | 3.0078 × 10^2 | 2.2642 × 10^3 | 3.0000 × 10^2 | 3.4132 × 10^2 | 8.7713 × 10^2
F1 | IQR | 0.0000 × 10^0 | 6.8210 × 10^−1 | 1.6200 × 10^−11 | 1.3426 × 10^3 | 7.8154 × 10^3 | 3.8360 × 10^−1 | 7.2122 × 10^3 | 6.0000 × 10^−14 | 5.8045 × 10^1 | 2.6696 × 10^2
F1 | Best | 3.0000 × 10^2 | 3.0034 × 10^2 | 3.0000 × 10^2 | 3.0549 × 10^2 | 2.0623 × 10^3 | 3.0027 × 10^2 | 3.0000 × 10^2 | 3.0000 × 10^2 | 3.0060 × 10^2 | 5.2351 × 10^2
F1 | Mean | 3.0000 × 10^2 | 3.0096 × 10^2 | 3.0086 × 10^2 | 1.3082 × 10^3 | 1.0407 × 10^4 | 3.0089 × 10^2 | 4.9682 × 10^3 | 3.0000 × 10^2 | 3.5235 × 10^2 | 9.0036 × 10^2
F1 | Std. | 2.3603 × 10^−14 | 3.9220 × 10^−1 | 2.7061 × 10^0 | 1.4553 × 10^3 | 6.4672 × 10^3 | 3.8540 × 10^−1 | 6.5894 × 10^3 | 6.1549 × 10^−14 | 4.9754 × 10^1 | 2.4679 × 10^2
F1 | Rank | 1 | 4 | 3 | 8 | 10 | 5 | 6 | 2 | 7 | 9
F2 | Median | 4.0485 × 10^2 | 4.0618 × 10^2 | 4.0892 × 10^2 | 4.1046 × 10^2 | 4.0900 × 10^2 | 4.0894 × 10^2 | 4.0824 × 10^2 | 4.0645 × 10^2 | 4.0848 × 10^2 | 4.5906 × 10^2
F2 | IQR | 1.6845 × 10^0 | 8.4757 × 10^0 | 3.7468 × 10^0 | 2.6369 × 10^0 | 6.1524 × 10^1 | 5.5403 × 10^1 | 1.1459 × 10^1 | 4.9295 × 10^0 | 9.6228 × 10^0 | 2.4575 × 10^1
F2 | Best | 4.0010 × 10^2 | 4.0006 × 10^2 | 4.0039 × 10^2 | 4.0342 × 10^2 | 4.0044 × 10^2 | 4.0005 × 10^2 | 4.0036 × 10^2 | 4.0000 × 10^2 | 4.0003 × 10^2 | 4.3286 × 10^2
F2 | Mean | 4.0463 × 10^2 | 4.1684 × 10^2 | 4.1576 × 10^2 | 4.1310 × 10^2 | 4.2822 × 10^2 | 4.2426 × 10^2 | 4.1525 × 10^2 | 4.0592 × 10^2 | 4.1181 × 10^2 | 4.5667 × 10^2
F2 | Std. | 1.9071 × 10^0 | 2.6610 × 10^1 | 2.3028 × 10^1 | 1.0876 × 10^1 | 3.0941 × 10^1 | 3.0186 × 10^1 | 1.6810 × 10^1 | 3.3008 × 10^0 | 1.9535 × 10^1 | 1.9242 × 10^1
F2 | Rank | 1 | 4 | 5 | 9 | 8 | 6 | 7 | 2 | 3 | 10
F3 | Median | 6.0000 × 10^2 | 6.2766 × 10^2 | 6.0500 × 10^2 | 6.0008 × 10^2 | 6.3157 × 10^2 | 6.2655 × 10^2 | 6.0025 × 10^2 | 6.0000 × 10^2 | 6.1399 × 10^2 | 6.1690 × 10^2
F3 | IQR | 8.0608 × 10^−4 | 1.5026 × 10^1 | 7.3713 × 10^0 | 4.8650 × 10^−1 | 2.0503 × 10^1 | 2.1439 × 10^1 | 1.2363 × 10^0 | 3.1654 × 10^−3 | 1.2778 × 10^1 | 4.9650 × 10^0
F3 | Best | 6.0000 × 10^2 | 6.1253 × 10^2 | 6.0000 × 10^2 | 6.0003 × 10^2 | 6.1061 × 10^2 | 6.0305 × 10^2 | 6.0000 × 10^2 | 6.0000 × 10^2 | 6.0038 × 10^2 | 6.1123 × 10^2
F3 | Mean | 6.0000 × 10^2 | 6.2819 × 10^2 | 6.0595 × 10^2 | 6.0043 × 10^2 | 6.3215 × 10^2 | 6.2567 × 10^2 | 6.0199 × 10^2 | 6.0005 × 10^2 | 6.1563 × 10^2 | 6.1770 × 10^2
F3 | Std. | 2.2269 × 10^−3 | 1.0102 × 10^1 | 5.8179 × 10^0 | 7.5521 × 10^−1 | 1.3540 × 10^1 | 1.3196 × 10^1 | 4.2576 × 10^0 | 2.2846 × 10^−1 | 9.4222 × 10^0 | 3.4901 × 10^0
F3 | Rank | 1 | 9 | 5 | 4 | 10 | 8 | 3 | 2 | 6 | 7
F4 | Median | 8.2388 × 10^2 | 8.2941 × 10^2 | 8.2389 × 10^2 | 8.1144 × 10^2 | 8.3454 × 10^2 | 8.2547 × 10^2 | 8.2912 × 10^2 | 8.1343 × 10^2 | 8.1990 × 10^2 | 8.3678 × 10^2
F4 | IQR | 7.9596 × 10^0 | 7.9585 × 10^0 | 1.5919 × 10^1 | 7.9585 × 10^0 | 1.3930 × 10^1 | 1.0915 × 10^1 | 1.3631 × 10^1 | 8.9546 × 10^0 | 8.0450 × 10^0 | 1.1563 × 10^1
F4 | Best | 8.0696 × 10^2 | 8.1506 × 10^2 | 8.1293 × 10^2 | 8.0514 × 10^2 | 8.0908 × 10^2 | 8.1404 × 10^2 | 8.0796 × 10^2 | 8.0497 × 10^2 | 8.0597 × 10^2 | 8.2550 × 10^2
F4 | Mean | 8.2346 × 10^2 | 8.2911 × 10^2 | 8.2681 × 10^2 | 8.1376 × 10^2 | 8.3366 × 10^2 | 8.2514 × 10^2 | 8.3044 × 10^2 | 8.1474 × 10^2 | 8.1880 × 10^2 | 8.3660 × 10^2
F4 | Std. | 8.4639 × 10^0 | 7.7850 × 10^0 | 9.2693 × 10^0 | 7.3020 × 10^0 | 1.2251 × 10^1 | 6.5477 × 10^0 | 1.0575 × 10^1 | 6.2407 × 10^0 | 5.6236 × 10^0 | 6.3956 × 10^0
F4 | Rank | 4 | 7 | 5 | 1 | 9 | 6 | 8 | 2 | 3 | 10
F5 | Median | 9.0094 × 10^2 | 1.3140 × 10^3 | 9.0615 × 10^2 | 9.0107 × 10^2 | 1.2480 × 10^3 | 1.3565 × 10^3 | 9.2602 × 10^2 | 9.0212 × 10^2 | 9.6627 × 10^2 | 9.9405 × 10^2
F5 | IQR | 1.2191 × 10^0 | 2.6284 × 10^2 | 9.0716 × 10^0 | 1.3354 × 10^0 | 2.1871 × 10^2 | 2.9792 × 10^2 | 2.3447 × 10^2 | 9.0840 × 10^0 | 2.1163 × 10^2 | 5.5847 × 10^1
F5 | Best | 9.0000 × 10^2 | 9.9517 × 10^2 | 9.0054 × 10^2 | 9.0002 × 10^2 | 9.7925 × 10^2 | 9.9674 × 10^2 | 9.0000 × 10^2 | 9.0000 × 10^2 | 9.0009 × 10^2 | 9.3528 × 10^2
F5 | Mean | 9.0120 × 10^2 | 1.3373 × 10^3 | 9.2430 × 10^2 | 9.0413 × 10^2 | 1.3112 × 10^3 | 1.3267 × 10^3 | 1.0385 × 10^3 | 9.0807 × 10^2 | 1.0335 × 10^3 | 1.0001 × 10^3
F5 | Std. | 1.0815 × 10^0 | 1.6194 × 10^2 | 5.5428 × 10^1 | 9.0135 × 10^0 | 2.6945 × 10^2 | 1.7934 × 10^2 | 2.4208 × 10^2 | 1.2889 × 10^1 | 1.3724 × 10^2 | 6.6067 × 10^1
F5 | Rank | 1 | 10 | 4 | 2 | 8 | 9 | 5 | 3 | 6 | 7
F6 | Median | 3.9270 × 10^3 | 2.5004 × 10^3 | 5.1824 × 10^3 | 5.1725 × 10^3 | 2.5907 × 10^3 | 2.8218 × 10^3 | 5.2374 × 10^3 | 1.8157 × 10^3 | 1.9279 × 10^3 | 1.6204 × 10^6
F6 | IQR | 2.9407 × 10^3 | 1.6873 × 10^3 | 2.8432 × 10^3 | 4.4973 × 10^3 | 1.8697 × 10^3 | 1.9660 × 10^3 | 4.2366 × 10^3 | 1.5222 × 10^1 | 7.5852 × 10^1 | 2.4739 × 10^6
F6 | Best | 1.8066 × 10^3 | 1.9298 × 10^3 | 1.8296 × 10^3 | 1.9193 × 10^3 | 1.9112 × 10^3 | 1.8930 × 10^3 | 1.9348 × 10^3 | 1.8014 × 10^3 | 1.8641 × 10^3 | 1.8332 × 10^5
F6 | Mean | 4.3012 × 10^3 | 3.4123 × 10^3 | 4.9202 × 10^3 | 5.3927 × 10^3 | 3.4725 × 10^3 | 3.2129 × 10^3 | 5.1222 × 10^3 | 1.8214 × 10^3 | 2.1683 × 10^3 | 2.1617 × 10^6
F6 | Std. | 2.2671 × 10^3 | 2.0031 × 10^3 | 1.9291 × 10^3 | 2.1304 × 10^3 | 1.9434 × 10^3 | 1.3321 × 10^3 | 2.1036 × 10^3 | 1.8006 × 10^1 | 9.0500 × 10^2 | 1.7356 × 10^6
F6 | Rank | 6 | 4 | 7 | 9 | 5 | 3 | 8 | 1 | 2 | 10
F7 | Median | 2.0200 × 10^3 | 2.0457 × 10^3 | 2.0239 × 10^3 | 2.0251 × 10^3 | 2.0639 × 10^3 | 2.0490 × 10^3 | 2.0223 × 10^3 | 2.0210 × 10^3 | 2.0287 × 10^3 | 2.0524 × 10^3
F7 | IQR | 5.8328 × 10^−2 | 2.9625 × 10^1 | 1.5339 × 10^1 | 7.6266 × 10^0 | 2.8280 × 10^1 | 2.8730 × 10^1 | 4.7552 × 10^0 | 8.6236 × 10^0 | 1.3976 × 10^1 | 6.2804 × 10^0
F7 | Best | 2.0000 × 10^3 | 2.0220 × 10^3 | 2.0050 × 10^3 | 2.0011 × 10^3 | 2.0226 × 10^3 | 2.0126 × 10^3 | 2.0207 × 10^3 | 2.0010 × 10^3 | 2.0170 × 10^3 | 2.0414 × 10^3
F7 | Mean | 2.0182 × 10^3 | 2.0512 × 10^3 | 2.0304 × 10^3 | 2.0262 × 10^3 | 2.0637 × 10^3 | 2.0516 × 10^3 | 2.0286 × 10^3 | 2.0174 × 10^3 | 2.0300 × 10^3 | 2.0535 × 10^3
F7 | Std. | 5.9385 × 10^0 | 2.8783 × 10^1 | 1.5330 × 10^1 | 8.7531 × 10^0 | 2.2986 × 10^1 | 1.9674 × 10^1 | 1.3518 × 10^1 | 8.2565 × 10^0 | 9.2051 × 10^0 | 6.5623 × 10^0
F7 | Rank | 1 | 7 | 5 | 3 | 10 | 8 | 4 | 2 | 6 | 9
F8 | Median | 2.2205 × 10^3 | 2.2269 × 10^3 | 2.2226 × 10^3 | 2.2250 × 10^3 | 2.2305 × 10^3 | 2.2276 × 10^3 | 2.2249 × 10^3 | 2.2207 × 10^3 | 2.2221 × 10^3 | 2.2317 × 10^3
F8 | IQR | 7.1478 × 10^−1 | 5.5596 × 10^0 | 4.6460 × 10^0 | 4.6333 × 10^0 | 7.9595 × 10^0 | 6.0778 × 10^0 | 5.6920 × 10^0 | 9.7991 × 10^−1 | 5.5605 × 10^0 | 3.6522 × 10^0
F8 | Best | 2.2003 × 10^3 | 2.2167 × 10^3 | 2.2052 × 10^3 | 2.2056 × 10^3 | 2.2240 × 10^3 | 2.2215 × 10^3 | 2.2203 × 10^3 | 2.2001 × 10^3 | 2.2043 × 10^3 | 2.2247 × 10^3
F8 | Mean | 2.2166 × 10^3 | 2.2309 × 10^3 | 2.2258 × 10^3 | 2.2235 × 10^3 | 2.2317 × 10^3 | 2.2299 × 10^3 | 2.2258 × 10^3 | 2.2200 × 10^3 | 2.2203 × 10^3 | 2.2316 × 10^3
F8 | Std. | 8.0300 × 10^0 | 1.2210 × 10^1 | 2.3157 × 10^1 | 5.6982 × 10^0 | 5.7404 × 10^0 | 8.6614 × 10^0 | 3.9086 × 10^0 | 3.7897 × 10^0 | 6.5349 × 10^0 | 2.8369 × 10^0
F8 | Rank | 1 | 7 | 4 | 5 | 9 | 8 | 6 | 2 | 3 | 10
F9 | Median | 2.5080 × 10^3 | 2.5384 × 10^3 | 2.5293 × 10^3 | 2.5308 × 10^3 | 2.5343 × 10^3 | 2.5366 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5506 × 10^3
F9 | IQR | 6.0732 × 10^0 | 1.8622 × 10^1 | 3.1477 × 10^0 | 4.0773 × 10^1 | 4.0988 × 10^1 | 1.7089 × 10^1 | 0.0000 × 10^0 | 0.0000 × 10^0 | 9.6768 × 10^−1 | 1.8647 × 10^1
F9 | Best | 2.4986 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5293 × 10^3 | 2.5368 × 10^3
F9 | Mean | 2.5099 × 10^3 | 2.5457 × 10^3 | 2.5323 × 10^3 | 2.5480 × 10^3 | 2.5519 × 10^3 | 2.5408 × 10^3 | 2.5324 × 10^3 | 2.5342 × 10^3 | 2.5322 × 10^3 | 2.5540 × 10^3
F9 | Std. | 5.2546 × 10^0 | 2.7209 × 10^1 | 6.0187 × 10^0 | 2.2461 × 10^1 | 3.1301 × 10^1 | 1.2935 × 10^1 | 1.1868 × 10^1 | 2.6826 × 10^1 | 8.2775 × 10^0 | 1.3193 × 10^1
F9 | Rank | 1 | 8 | 4 | 7 | 9 | 6 | 3 | 2 | 5 | 10
F10 | Median | 2.5003 × 10^3 | 2.5010 × 10^3 | 2.5009 × 10^3 | 2.6094 × 10^3 | 2.5011 × 10^3 | 2.5010 × 10^3 | 2.5008 × 10^3 | 2.5005 × 10^3 | 2.5006 × 10^3 | 2.5016 × 10^3
F10 | IQR | 8.0465 × 10^−2 | 1.2778 × 10^2 | 1.1765 × 10^2 | 1.1129 × 10^3 | 1.1173 × 10^3 | 1.2472 × 10^2 | 9.6906 × 10^0 | 1.1657 × 10^2 | 4.4324 × 10^−1 | 4.2404 × 10^−1
F10 | Best | 2.5002 × 10^3 | 2.4375 × 10^3 | 2.5004 × 10^3 | 2.5003 × 10^3 | 2.5003 × 10^3 | 2.5004 × 10^3 | 2.5003 × 10^3 | 2.5003 × 10^3 | 2.5003 × 10^3 | 2.5007 × 10^3
F10 | Mean | 2.5082 × 10^3 | 2.5516 × 10^3 | 2.5392 × 10^3 | 2.5642 × 10^3 | 2.5607 × 10^3 | 2.5437 × 10^3 | 2.5215 × 10^3 | 2.5482 × 10^3 | 2.5173 × 10^3 | 2.5062 × 10^3
F10 | Std. | 2.9838 × 10^1 | 6.8928 × 10^1 | 5.9878 × 10^1 | 5.6803 × 10^1 | 1.2277 × 10^2 | 6.1766 × 10^1 | 4.8907 × 10^1 | 5.9698 × 10^1 | 4.3059 × 10^1 | 2.5316 × 10^1
F10 | Rank | 1 | 9 | 6 | 4 | 7 | 8 | 5 | 3 | 2 | 10
F11 | Median | 2.6000 × 10^3 | 2.7506 × 10^3 | 2.6000 × 10^3 | 2.7309 × 10^3 | 2.7511 × 10^3 | 2.7506 × 10^3 | 2.7508 × 10^3 | 2.6000 × 10^3 | 2.7228 × 10^3 | 2.7688 × 10^3
F11 | IQR | 4.8133 × 10^−4 | 3.0780 × 10^2 | 1.5047 × 10^2 | 1.8278 × 10^2 | 1.3772 × 10^2 | 1.4603 × 10^2 | 1.6677 × 10^2 | 1.5043 × 10^2 | 1.2992 × 10^2 | 9.6115 × 10^0
F11 | Best | 2.6000 × 10^3 | 2.6040 × 10^3 | 2.6000 × 10^3 | 2.6006 × 10^3 | 2.6014 × 10^3 | 2.6030 × 10^3 | 2.6000 × 10^3 | 2.6000 × 10^3 | 2.6013 × 10^3 | 2.7590 × 10^3
F11 | Mean | 2.6169 × 10^3 | 2.7864 × 10^3 | 2.7018 × 10^3 | 2.7834 × 10^3 | 2.7483 × 10^3 | 2.7135 × 10^3 | 2.7079 × 10^3 | 2.6879 × 10^3 | 2.6825 × 10^3 | 2.7705 × 10^3
F11 | Std. | 4.4697 × 10^1 | 1.6599 × 10^2 | 1.4794 × 10^2 | 1.5621 × 10^2 | 1.1319 × 10^2 | 1.2748 × 10^2 | 8.5734 × 10^1 | 1.4996 × 10^2 | 8.3664 × 10^1 | 6.5652 × 10^0
F11 | Rank | 1 | 9 | 3 | 7 | 8 | 6 | 5 | 2 | 4 | 10
F12 | Median | 2.8597 × 10^3 | 2.8805 × 10^3 | 2.8654 × 10^3 | 2.8642 × 10^3 | 2.8687 × 10^3 | 2.8799 × 10^3 | 2.8635 × 10^3 | 2.8641 × 10^3 | 2.8649 × 10^3 | 2.8688 × 10^3
F12 | IQR | 4.6049 × 10^0 | 5.5301 × 10^1 | 3.9591 × 10^0 | 1.7836 × 10^0 | 1.1572 × 10^1 | 2.2354 × 10^1 | 1.5751 × 10^0 | 1.4963 × 10^0 | 2.7423 × 10^0 | 2.9024 × 10^0
F12 | Best | 2.8540 × 10^3 | 2.8630 × 10^3 | 2.8626 × 10^3 | 2.8594 × 10^3 | 2.8609 × 10^3 | 2.8631 × 10^3 | 2.8600 × 10^3 | 2.8607 × 10^3 | 2.8594 × 10^3 | 2.8650 × 10^3
F12 | Mean | 2.8592 × 10^3 | 2.8998 × 10^3 | 2.8669 × 10^3 | 2.8659 × 10^3 | 2.8782 × 10^3 | 2.8902 × 10^3 | 2.8634 × 10^3 | 2.8639 × 10^3 | 2.8653 × 10^3 | 2.8688 × 10^3
F12 | Std. | 2.8103 × 10^0 | 5.0029 × 10^1 | 4.1838 × 10^0 | 6.1777 × 10^0 | 2.3730 × 10^1 | 2.6922 × 10^1 | 1.2255 × 10^0 | 1.3995 × 10^0 | 3.1091 × 10^0 | 1.7218 × 10^0
F12 | Rank | 1 | 9 | 6 | 5 | 7 | 10 | 2 | 3 | 4 | 8
Mean Rank | | 1.67 | 7.25 | 4.75 | 5.33 | 8.33 | 6.92 | 5.17 | 2.17 | 4.25 | 9.17
Result | | 1 | 8 | 4 | 6 | 9 | 7 | 5 | 2 | 3 | 10
Table 6. p values of each algorithm (sign in parentheses; a few signs were lost in the source and are left blank).
F | HHO | DBO | GWO | WOA | IHHO | MFO | INFO | POA | SCA
F1 | 5.1436 × 10^−12 (+) | 1.2193 × 10^−5 (+) | 5.1436 × 10^−12 (+) | 5.1436 × 10^−12 (+) | 5.1436 × 10^−12 (+) | 3.8376 × 10^−4 (+) | 7.6743 × 10^−5 (+) | 5.1436 × 10^−12 (+) | 5.1436 × 10^−12 (+)
F2 | 4.0354 × 10^−1 (+) | 1.2491 × 10^−7 (+) | 4.1997 × 10^−10 (+) | 4.1178 × 10^−6 (+) | 2.2658 × 10^−3 (+) | 2.1391 × 10^−9 (+) | 5.1802 × 10^−1 (=) | 9.6263 × 10^−2 (=) | 3.0199 × 10^−11 (+)
F3 | 2.9897 × 10^−11 (+) | 1.0834 × 10^−10 (+) | 2.9897 × 10^−11 (+) | 2.9897 × 10^−11 (+) | 2.9897 × 10^−11 (+) | 1.4041 × 10^−6 (+) | 1.3603 × 10^−4 (+) | 2.9897 × 10^−11 (+) | 2.9897 × 10^−11 (+)
F4 | 4.8554 × 10^−3 (+) | 3.5543 × 10^−1 (+) | 1.5288 × 10^−5 | 2.5301 × 10^−4 (+) | 3.0417 × 10^−1 (+) | 5.8261 × 10^−3 (+) | 7.8612 × 10^−5 | 3.6436 × 10^−2 | 1.2536 × 10^−7 (+)
F5 | 3.0161 × 10^−11 (+) | 6.2560 × 10^−8 (+) | 6.4141 × 10^−1 (=) | 3.0161 × 10^−11 (+) | 3.0161 × 10^−11 (+) | 2.5196 × 10^−4 (+) | 1.1877 × 10^−1 (=) | 3.4936 × 10^−9 (+) | 3.0161 × 10^−11 (+)
F6 | 1.9073 × 10^−1 | 1.3345 × 10^−1 (=) | 2.0681 × 10^−2 (+) | 2.3399 × 10^−1 | 8.5000 × 10^−2 (=) | 3.9167 × 10^−2 (+) | 7.3803 × 10^−10 | 4.7445 × 10^−6 | 3.0199 × 10^−11 (+)
F7 | 3.0199 × 10^−11 (+) | 4.1997 × 10^−10 (+) | 6.5277 × 10^−8 (+) | 3.0199 × 10^−11 (+) | 4.1997 × 10^−10 (+) | 3.3384 × 10^−11 (+) | 1.1228 × 10^−2 (+) | 8.8910 × 10^−10 (+) | 3.0199 × 10^−11 (+)
F8 | 3.1589 × 10^−10 (+) | 8.2919 × 10^−6 (+) | 2.3897 × 10^−8 (+) | 3.0199 × 10^−11 (+) | 3.0199 × 10^−11 (+) | 7.3803 × 10^−10 (+) | 2.9205 × 10^−2 (+) | 2.1265 × 10^−4 (+) | 3.0199 × 10^−11 (+)
F9 | 3.0199 × 10^−11 (+) | 2.3967 × 10^−11 (+) | 3.0199 × 10^−11 (+) | 3.0199 × 10^−11 (+) | 3.0199 × 10^−11 (+) | 7.8511 × 10^−12 (+) | 1.7203 × 10^−12 (+) | 3.0199 × 10^−11 (+) | 3.0199 × 10^−11 (+)
F10 | 1.4294 × 10^−8 (+) | 2.4386 × 10^−9 (+) | 5.8737 × 10^−4 (+) | 8.4848 × 10^−9 (+) | 2.4386 × 10^−9 (+) | 3.3520 × 10^−8 (+) | 8.8829 × 10^−6 (+) | 3.3242 × 10^−6 (+) | 7.1186 × 10^−9 (+)
F11 | 5.6856 × 10^−10 (+) | 7.5825 × 10^−5 (+) | 6.2445 × 10^−9 (+) | 1.0997 × 10^−9 (+) | 9.7341 × 10^−9 (+) | 2.4243 × 10^−2 (+) | 1.9100 × 10^−1 (=) | 1.7245 × 10^−7 (+) | 1.7883 × 10^−11 (+)
F12 | 3.0199 × 10^−11 (+) | 4.9691 × 10^−11 (+) | 5.0723 × 10^−10 (+) | 3.8202 × 10^−10 (+) | 3.0199 × 10^−11 (+) | 1.3014 × 10^−8 (+) | 3.4763 × 10^−9 (+) | 2.6695 × 10^−9 (+) | 3.0199 × 10^−11 (+)
+/=/− | 12/0/0 | 11/1/0 | 10/1/1 | 11/0/1 | 11/1/0 | 12/0/0 | 8/2/2 | 9/1/2 | 12/0/0
Table 7. Average running times of 10 algorithms (seconds).
F | CSLDDBO | HHO | DBO | GWO | WOA | IHHO | MFO | INFO | POA | SCA
F1 | 24.6407 | 10.7344 | 7.2813 | 5.0313 | 2.9219 | 22.8283 | 4.9375 | 16.3751 | 8.5938 | 4.7969
F2 | 33.0312 | 15.0000 | 9.6094 | 6.4688 | 4.2031 | 30.5314 | 6.2188 | 21.9688 | 12.2187 | 6.3594
F3 | 25.4843 | 12.2969 | 6.8594 | 5.4688 | 4.0313 | 26.9219 | 5.2812 | 12.9375 | 9.3752 | 5.0781
F4 | 13.0938 | 5.6719 | 3.4219 | 2.5000 | 1.6718 | 12.0937 | 2.5469 | 7.6563 | 4.0781 | 2.5469
F5 | 17.6563 | 8.2032 | 4.9063 | 3.5625 | 2.4218 | 18.0468 | 3.7813 | 10.4219 | 5.9219 | 3.3750
F6 | 27.1563 | 12.5313 | 7.9844 | 5.2344 | 3.1875 | 25.8438 | 5.5781 | 17.3750 | 9.0469 | 5.0312
F7 | 26.0469 | 12.1251 | 6.4375 | 5.1719 | 4.0313 | 27.5002 | 5.1875 | 12.0782 | 9.6876 | 5.1719
F8 | 29.7190 | 13.6406 | 6.8906 | 5.3906 | 3.9219 | 29.1408 | 5.4375 | 12.7813 | 10.5625 | 5.7969
F9 | 28.1876 | 12.7813 | 7.9219 | 5.9688 | 4.2188 | 29.3126 | 5.5938 | 12.5939 | 9.6095 | 5.5781
F10 | 29.9689 | 15.1876 | 8.7031 | 6.4531 | 4.7187 | 33.9533 | 6.2969 | 16.3595 | 11.6875 | 6.0156
F11 | 34.8128 | 16.9844 | 8.7500 | 7.0312 | 5.5000 | 38.0471 | 7.1094 | 15.3593 | 12.9376 | 6.8438
F12 | 25.0784 | 11.2500 | 6.5313 | 4.8750 | 4.2344 | 25.9378 | 5.5157 | 10.6563 | 8.6406 | 5.1407
Mean Time | 26.2396 | 12.2002 | 7.1080 | 5.2630 | 3.7552 | 26.6798 | 5.2903 | 13.8802 | 9.3633 | 5.1445
Table 8. SO2 emissions and related influencing factors.
Time | SO2 Emissions (10,000 t) | Proportion of Industrial Output Value in GDP (%) | Energy Consumption per Unit GDP (t/10,000 CNY) | Industrial SO2 Emission Intensity (t/10,000 CNY) | Proportion of Non-Clean Energy Consumption (%)
2012 | 2118 | 45.42 | 0.75 | 0.0087 | 85.5
2013 | 2043.9 | 44.18 | 0.70 | 0.0078 | 84.5
2014 | 1974.4 | 43.09 | 0.67 | 0.0071 | 83
2015 | 1859.1 | 40.84 | 0.63 | 0.0066 | 82
2016 | 854.89 | 39.58 | 0.59 | 0.0029 | 80.3
2017 | 610.84 | 39.85 | 0.55 | 0.0018 | 79.5
2018 | 516.12 | 39.69 | 0.51 | 0.0014 | 77.9
2019 | 457.29 | 38.59 | 0.49 | 0.0012 | 76.7
2020 | 318.22 | 37.84 | 0.49 | 0.0008 | 75.7
2021 | 274.78 | 39.43 | 0.46 | 0.0006 | 74.5
Data sources: The dependent variable data come from the China Statistical Yearbook 2021, while the independent variable data come from the Environmental Statistics Annual Report and the National Environmental Statistics Bulletin over the years.
Table 9. Completeness of coverage.
Evaluation Dimension | Training Set Range | Test Set Range | Coverage Result
Proportion of industrial output value in GDP (%) | 39.58–45.42 | 37.84–38.59 | Complete coverage
Energy consumption per unit of GDP (t/10,000 CNY) | 0.51–0.75 | 0.49 | Boundary coverage
Industrial SO2 emission intensity (t/10,000 CNY) | 0.0014–0.0087 | 0.0008–0.0012 | Complete coverage
Proportion of non-renewable energy consumption (%) | 77.9–85.5 | 75.7–76.7 | Complete coverage
Table 10. Values of SO2 initialization data.
Time | X1 | X2 | X3 | X4 | X5
2012 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000
2013 | 0.9650 | 0.9727 | 0.9333 | 0.8966 | 0.9883
2014 | 0.9322 | 0.9487 | 0.8933 | 0.8160 | 0.9708
2015 | 0.8778 | 0.8992 | 0.8400 | 0.7586 | 0.9591
2016 | 0.4036 | 0.8714 | 0.7867 | 0.3333 | 0.9392
2017 | 0.2884 | 0.8774 | 0.7333 | 0.2069 | 0.9298
2018 | 0.2437 | 0.8738 | 0.6800 | 0.1609 | 0.9111
2019 | 0.2159 | 0.8496 | 0.6533 | 0.1379 | 0.8971
2020 | 0.1502 | 0.8331 | 0.6533 | 0.0920 | 0.8854
2021 | 0.1297 | 0.8681 | 0.6133 | 0.0690 | 0.8713
Table 11. Gray absolute correlations between X1 and X2, X3, X4, and X5.
Index | E12 | E13 | E14 | E15
Value | 0.648 | 0.739 | 0.937 | 0.612
Table 12. Optimal orders (t_m, m = 1, 2, ..., 5) and optimal weights (λ_m, m = 1, 2, ..., 5) obtained by the CSLDDBO algorithm.
Index | t1 | t2 | t3 | t4 | t5
Value | −0.9979 | 2.6046 | 35.0000 | 35.0000 | −12.7324
Index | λ1 | λ2 | λ3 | λ4 | λ5
Value | 1 | 1 | 1 | 1 | 0.8714
Table 13. Accuracy evaluation.
Level | I | II | III | IV
Error | 0.01 | 0.05 | 0.10 | 0.20
Table 14. Performance evaluation metrics.
Index | Meaning | Data/Calculation Method
$y_1^{(0)}(g)$ | Raw data (RD) | Statistical data
$\hat{y}_1^{(0)}(g)$ | Simulated or predicted data (SPD) | The final recovery expression of the model
$\varepsilon(g)$ | Residual | $\varepsilon(g) = \hat{y}_1^{(0)}(g) - y_1^{(0)}(g)$
$\Delta_s(g)$ | Relative simulated percentage error of $y_1^{(0)}(g)$ (RSPE) | $\Delta_s(g) = \left| \varepsilon(g) / y_1^{(0)}(g) \right| \times 100\%$
$\bar{\Delta}_s$ | Mean relative simulated percentage error (MRSPE) | $\bar{\Delta}_s = \frac{1}{N-1} \sum_{g=2}^{N} \Delta_s(g)$
$\Delta_p(g)$ | Relative prediction percentage error of $y_1^{(0)}(g)$ (RPPE) | Similar to $\Delta_s(g)$
$\bar{\Delta}_p$ | Mean relative prediction percentage error (MRPPE) | $\bar{\Delta}_p = \frac{1}{f} \sum_{g=N+1}^{N+f} \Delta_p(g)$
$\bar{\Delta}$ | Comprehensive mean relative percentage error (CMRPE) | $\bar{\Delta} = \left[ (N-1)\bar{\Delta}_s + f \bar{\Delta}_p \right] / (N-1+f)$
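As a concrete illustration of how the Table 14 indicators combine, the following sketch computes the MRSPE, MRPPE, and CMRPE from the raw and fitted series; the array layout and function name are our own illustrative choices.

```python
import numpy as np

def error_metrics(raw, fitted, n_fit):
    """MRSPE over the fitting window, MRPPE over the prediction window,
    and their sample-size-weighted combination (CMRPE), as in Table 14.

    raw, fitted: full series of length N + f; n_fit = N fitted points.
    """
    rel = np.abs((fitted - raw) / raw) * 100.0  # relative percentage errors
    mrspe = rel[1:n_fit].mean()                 # g = 2, ..., N (index 0 is g = 1)
    mrppe = rel[n_fit:].mean()                  # g = N + 1, ..., N + f
    n_s, n_p = n_fit - 1, len(raw) - n_fit
    cmrpe = (n_s * mrspe + n_p * mrppe) / (n_s + n_p)
    return mrspe, mrppe, cmrpe
```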
Table 15. Model parameter settings.
Model | Parameter | Value
NSGM(1, N) | a: development coefficient | a = 0.736
NSGM(1, N) | b_i: driving coefficients | b5, b6, b7 = 0.050, 3.400, 234.10
NSGM(1, N) | h_i: gray action | h1, h2 = 3013.64, 1610.50
OBGM(1, N) | α: resolution | α = 0.5
OBGM(1, N) | β: gray relational degree threshold | β = 0.7
OBGM(1, N) | γ: background value coefficient | γ = 0.5
SVR | kernel function | radial basis function (RBF)
SVR | γ: RBF kernel coefficient | γ = 1
SVR | c: penalty factor | c = 1
SVR | ε: insensitive loss band width | ε = 0.1
LSTM | N: hidden units | N = 64
LSTM | L: number of layers | L = 1
LSTM | P: batch size | P = 64
LSTM | r: learning rate | r = 1 × 10^−4
Table 16. Simulated values and errors of three models.
k | x1^(0)(k) | NSGM(1, N): x̂1^(0)(k), ε(k), Δs(k) (%) | OBGM(1, N): x̂1^(0)(k), ε(k), Δs(k) (%) | PGM(1, N): x̂1^(0)(k), ε(k), Δs(k) (%)
2 | 0.9650 | 0.9420, −0.0229, 2.3779 | 0.9650, 0.0000, 0.0000 | 0.9645, −0.0004, 0.0494
3 | 0.9322 | 0.9374, 0.0052, 0.5583 | 0.7250, −0.2070, 22.2060 | 0.9327, 0.0005, 0.0549
4 | 0.8778 | 0.8826, 0.0048, 0.5506 | 0.6760, −0.2020, 23.0120 | 0.8788, 0.0010, 0.1156
5 | 0.4036 | 0.4096, 0.0060, 1.5017 | 0.2010, −0.2030, 50.2970 | 0.4040, 0.0004, 0.1013
6 | 0.2884 | 0.2929, 0.0045, 1.5746 | 0.0860, −0.2020, 70.0420 | 0.2887, 0.0003, 0.1208
7 | 0.2437 | 0.2478, 0.0041, 1.6903 | 0.0420, −0.2020, 82.8890 | 0.2435, −0.0001, 0.0686
Mean relative simulated percentage error (Δ̄s) | | 1.3756% | 59.6197% | 0.0851%
Table 17. Predicted values and errors of three models.
k | x1^(0)(k) | NSGM(1, N): x̂1^(0)(k), ε(k), Δp(k) (%) | OBGM(1, N): x̂1^(0)(k), ε(k), Δp(k) (%) | PGM(1, N): x̂1^(0)(k), ε(k), Δp(k) (%)
8 | 0.2159 | 0.2153, −0.0005, 0.2484 | 0.0130, −0.2030, 94.0250 | 0.2164, 0.0005, 0.2471
9 | 0.1502 | 0.1452, −0.0049, 3.3017 | −0.0520, −0.2020, 134.4870 | 0.1505, 0.0003, 0.2471
Mean relative prediction percentage error (Δ̄p) | | 1.7751% | 119.2394% | 0.2471%
Comprehensive mean relative percentage error (Δ̄) | | 1.3115% | 79.4930% | 0.1117%
Table 18. Actual predicted values of SO2 emissions in 2021 using three models.
k | x1^(0)(k) | NSGM(1, N) | OBGM(1, N) | PGM(1, N)
10 | 0.1297 | | |
Actual predicted value | | 0.129071 | 0.133271 | 0.129720
Table 19. Simulated values and errors of the two models.
k | x1^(0)(k) | SVR: x̂1^(0)(k), ε(k), Δs(k) (%) | LSTM: x̂1^(0)(k), ε(k), Δs(k) (%)
2 | 0.9650 | 0.9640, −0.0010, 0.1000 | 0.9631, −0.0019, 0.1942
3 | 0.9322 | 0.9350, 0.0028, 0.3040 | 0.9260, −0.0062, 0.6200
4 | 0.8778 | 0.8711, −0.0067, 0.7298 | 0.8692, −0.0086, 0.9821
5 | 0.4036 | 0.4112, 0.0076, 1.4820 | 0.4200, 0.0164, 4.1010
6 | 0.2884 | 0.2766, −0.0118, 4.0048 | 0.2983, 0.0099, 3.3247
7 | 0.2437 | 0.2317, −0.0120, 4.0908 | 0.2509, 0.0072, 2.8331
Mean relative simulated percentage error (Δ̄s) | | 1.7852% | 2.0091%
Table 20. Predicted values and errors of the two models.
k | x1^(0)(k) | SVR: x̂1^(0)(k), ε(k), Δp(k) (%) | LSTM: x̂1^(0)(k), ε(k), Δp(k) (%)
8 | 0.2159 | 0.2142, −0.0017, 0.7474 | 0.2213, 0.0054, 2.3075
9 | 0.1502 | 0.1578, 0.0076, 5.0010 | 0.1550, 0.0048, 3.1020
Mean relative prediction percentage error (Δ̄p) | | 2.8742% | 1.5510%
Comprehensive mean relative percentage error (Δ̄) | | 2.0574% | 1.8945%
Table 21. Actual predicted values of SO2 emissions in 2021 by two models.
k | x1^(0)(k) | SVR | LSTM
10 | 0.1297 | |
Actual predicted value | | 0.128715 | 0.129983
Table 22. Comparison of simulation/prediction error ranges.
Index | Error | PGM(1, N) | NSGM(1, N) | OBGM(1, N) | SVR | LSTM
MRSPE | Maximum error | 0.1208% | 2.3779% | 82.8890% | 4.0908% | 4.1010%
MRSPE | Minimum error | 0.0494% | 0.5506% | 0.0000% | 0.1000% | 0.1942%
MRSPE | Error range | 0.0714% | 1.8273% | 82.8890% | 3.9908% | 3.9068%
MRPPE | Maximum error | 0.2471% | 3.3017% | 134.4870% | 5.0010% | 3.1020%
MRPPE | Minimum error | 0.2471% | 0.2484% | 94.0250% | 0.7474% | 2.3075%
MRPPE | Error range | 0.0000% | 3.0533% | 40.4620% | 4.2536% | 0.7945%
Table 23. Initial value prediction of the independent variables Xi (i = 2, 3, 4, 5) over the next five years.
k | X2 | X3 | X4 | X5
11 | 0.8597 | 0.5958 | 0.0481 | 0.8692
12 | 0.8476 | 0.5745 | 0.0179 | 0.8511
13 | 0.8385 | 0.5445 | 0.0083 | 0.8493
14 | 0.8492 | 0.5106 | 0.0034 | 0.8337
15 | 0.8269 | 0.4769 | 0.0010 | 0.8299
Table 24. Prediction of SO2 emissions in China from 2022 to 2026.
Time | 2022 | 2023 | 2024 | 2025 | 2026
Predicted value | 238.0632 | 185.5368 | 85.1436 | 15.8850 | 10.4720