Prediction of Transient NO x Emission from Diesel Vehicles Based on Deep-Learning Differentiation Model with Double Noise Reduction

: For diesel engines, accurate prediction of NO x (Nitrogen Oxides) emission plays an essential role in virtual NO x sensor development and engine design under situations of actual road driving. However, due to the randomness and uncertainty in the driving process of diesel vehicles, it is difﬁcult to make predictions about NO x emissions. In order to solve this problem, this paper proposes differential models for noise reductions of NO x emissions in time series. First, according to the internal ﬂuctuation of time series, use SSA (Singular Spectrum Analysis) to reduce the noises of the original time series; second, use ICEEMDAN (Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise) to decompose the noise-reducing data into several relatively stable subsequences; third, use the sample entropy to calculate the complexity of each subsequence, and divide the sequences into high-frequency ones and low-frequency ones; ﬁnally, use GRU (Gated Recurrent Unit) to complete the prediction of high-frequency sequences and SVR (Support Vector Regression) for the prediction of low-frequency sequences. To obtain the ﬁnal models, integrate the prediction results of the subsequences. Make comparisons with ﬁve single models, SSA single-processing models, and ICEEMDAN single-processing models. The experimental results show that the proposed model can predict the instantaneous NO x emissions of diesel engines better than the single model and the model processed by SSA, and the differentiated model can effectively improve the execution speed of the model.


Introduction
The diesel vehicle is the main source of NO x emission. In China, according to the 2019 China Mobile Source Environmental Management Annual Report, diesel vehicles account for 9.1% of motor vehicles in China, but the contribution rate of NO x emission reaches 70% of the total vehicle emissions [1]. At the same time, China has implemented NO x emission reduction regulations, which limit the NO x emission of motor vehicles strictly, and put forward higher requirements for automobile manufacturers to adjust the engine component parameters. Traditionally, in order to reduce emissions by adjusting engine component parameters, it is necessary to go through the engine test bench or actual driving. It is very expensive and inefficient to evaluate the success of the adjustment process by means of PEMS (Portable Emission Measurement System). Therefore, in the actual road driving process, it is very important to understand the causal relationship between vehicle operating parameters and emissions and to establish relevant reliable models.
In recent years, some scholars have used the chassis dynamometer or the engine test bench to collect the engine working data and construct the models according to the data obtained in the steady-state. Table 1 presents the results of some recent studies on NO x steady-state prediction by some authors.
These studies have achieved ideal experimental results, but the data obtained under steady-state conditions cannot capture the transient behavior of the engine from part load to full load as well as the hysteresis effect of the engine under start, stop, or cold start conditions [2], which is often quite different from the emission data generated by diesel vehicles driving on the real road. Encoder-Decoder [7] In actual road driving, the NO x emission data of the diesel engine is a highly nonlinear, complex, and changeable sequence. Coupled with the complexity of engine characteristics and the uncertainty of drivers in the actual road operation process, it is very difficult to construct a highly nonlinear equation group for NO x emission modeling. Traditional modeling methods for NO x transient emissions include physical or chemical as well as empirical or semi-empirical assumptions. For example, Maurya et al. [8] used the commercial 3D CFD (Computational Fluid Dynamics) engine simulation tool STAR-CD, combined with the mesh generator es-ice, to analyze the combustion process of a dual-fuel engine using a computational fluid dynamics (CFD) model, and derived the effect of EGR on the combustion and performance of the dual-fuel engine. Stelios A et al. [9] used a semi-empirical zero-dimensional two-zone model to predict NO x variation with engine load/speed, fuel injection timing, EGR rate, boost pressure, and fuel injection pressure at a relatively low cost. CFD models based on physical chemistry are commonly used for NO x predictions. By calculating the physical and chemical process of fuel combustion, NO x emission can be estimated accurately. However, due to the complex structure, high computational cost, and long time required, CFD models cannot be applied in real time [10,11]. Based on physical and chemical properties and empirical assumptions, these methods are very difficult to model instantaneous NO x emissions [12] and have limited performance in predicting NO x emissions during actual road driving [13].
With the gradual rise of machine learning, as it is unnecessary to understand the complex physical and chemical knowledge behind the research object, any additional complexity will be incorporated into the models when having enough data. At the same time, the method can consider the impact of environmental conditions on emissions that are difficult to analyze by physical and chemical models [14], so it is widely used by scholars. Liu et al. [15] used the integrated method of PCA (Principal Component Analysis) and genetic algorithm to search for the best super parameters of support vector machine to predict the steady-state and transient NO x emissions of diesel engines. Ideal results can be achieved by making predictions of the steady-state emission, but the results of transient emissions still have large errors at some operating points. Sáez et al. [16] used a single artificial neural network to predict the transient NO x emission of diesel engines with input variables of vehicle speed and acceleration, engine speed and torque, intake temperature as well as air mass flow, which the best result of R 2 was only 0.82. In view of the characteristics of the NO x transient emission time series of diesel vehicles, such as highly non-stationary, irregular fluctuation frequency as well as internal complexity and variability, as time goes on, the traditional machine learning algorithm may lose learning abilities [17]. Deep learning has better learning performance for the highly complex nonlinear data, showing more excellent performance, but at present, few scholars use deep-learning methods to model transient NO x emissions of diesel vehicles. Shin et al. [18] used the DNN (Deep Neural Network) and Bayesian optimization method to optimize the hyper-parameter and predict the NO x transient emission, which improved the prediction accuracy and model stability significantly. However, compared with the single deep neural network, researchers found that signal processing can extract signal features effectively [19] and machine learning algorithms have been used in the field of wind speed prediction, power load forecasting, and financial field. For instance, Ma et al. [20] predicted wind speed using double decomposition, error correction strategies, and long-and short-term memory neural networks. Tong et al. [21] proposed a deep learning-based model that refines features from historical load data and associated temperature parameters by stacking denoised self-encoders and then training a support vector regression (SVR) model to predict the total load for the coming day. Niu et al. [22] used a combination of variational modal decomposition and LSTM to predict stock price changes with good predictive performance.
In this study, the vehicle exhaust online monitoring platform is used to obtain the NO x emission data of two buses in the actual road driving process, and the in-depth learning model GRU (Gated Recurrent Unit) is used to establish the instantaneous NO x emission model. Since there are noises in the collected data, SSA (Singular Spectrum Analysis) and ICEEMDAN (Improved Complete Ensemble Empirical Mode Decomposition with Adaptive Noise) are used to filter the noise in the data. Considering that the subsequences generated by the ICEEMDAN decomposition increase the computational cost, the SVR (Support Vector Regression) models of the differentiation models are used to replace the low-frequency subsequence GRU models.

Singular Spectrum Analysis
SSA is a non-parametric method for studying nonlinear time series [23]. Based on time series, the SVD (Singular Value Decomposition) of the specific matrix is constructed. The specific algorithm process is as follows [24]: Define the NO x series data as {h i , i = 1, 2, . . . , N} and then calculate the Hankel [25] trajectory matrix: (1) L represents the sliding window length parameter and 1 < L < N, K is defined as N − L + 1. Matrix H is the trajectory matrix. The resulting trajectory matrix is then SVD (Singular Value Decomposed), where H can be rewritten as: where d = rank(H) and it is the number of intrinsic components or modes with non-zero intrinsic values. λ i is the singular value in descending order. U i and V i are the related left and right singular vectors, respectively. The set denoted by √ λ i , U i , V i is called the feature set. Then H i (i = 1, 2, 3, · · · , d) in the trace matrix is decomposed into signal and noise. However, an important step is to identify the subset of features that contain the main variation of H. This is equivalent to approximating the matrix ∼ H by the combination of the first r elementary matrices using the following equation: H is attributed to the signal, r represents the number of features selected for signal reconstruction, and the residual

Improved Adaptive Noise Fully Integrated Empirical Mode Decomposition
In order to overcome the shortcomings of EEMD and CEEMDAN algorithms [26], Colominas et al. improved the CEEMDAN algorithm [27]. Different from the traditional method of adding Gaussian white noise, ICEEMDAN extracts the k-th IMF component through EMD decomposition of Gaussian white noise [28], calculates the local mean value of signal plus noise for each modal component, and defines the IMF obtained by decomposition as the difference between the residual signal and local mean.
First, define M(·) as the local mean value of the signal, E k (·) is the k-th mode component after EMD decomposition, then we have E 1 (x) = x − M(x), and the main steps are as follows: The first IMF of the given white noise is added to the original sequence X(t) as follows: where β 0 represents the noise standard deviation. w i is the added white noise. Find the local mean M(.) of X i (t), calculate the average of the local mean and obtain the following residuals: where N is the number of times to add noise. Obtain the first IMF as follows: For k = 2, 3, · · · , K, the residual and the k-th IMF can be calculated by Formulas (6) and (7): where ε is the ratio of the added noise.

Gated Recurrent Unit
Because of the gradient vanishing problem in traditional RNN networks [29], it is difficult to solve the long-term dependence problem in practical application, which makes RNN training very difficult. To solve this problem, LSTM and GRU use different functions to calculate the state of the hidden layer by adding "memory cell" structures into the hidden layer. The "memory cell" of LSTM is composed of three gates: input gates, forgetting gates, and output gates, while the GRU "memory cell" is only composed of update gates and reset gates, with fewer training parameters and faster calculations. The structure of the GUR neural network is shown in Figure 1 [29].
The function of the update gate is to control the state information of the previous time into the current state. The larger the value of the update gate is, the more state information will be brought into the previous moment. The formula of the update gate is [30]: where h t−1 contains the information of the previous node, x t is the current input, σ is the activation function, w is the neuron connection weight, b is the neuron bias value. The function of the reset gate is to control the forgetting degree of state information from the previous moment. When the reset gate is smaller, more state information is forgotten. The calculation formula of the reset gate is [31]: Atmosphere 2021, 12, 1702

of 19
After obtaining the gating signal, the gating signal is reset to obtain the reset data, and then use tanh to activate the functions to obtain the candidate hidden state: In the update phase, when the state information is changed, the update gate is used for controlling [32], then the hidden layer state at time t can be expressed as follows:

Support Vector Regression
Support Vector Regression is a supervised learning algorithm for classification and regression, solved by feature mapping the input controls. With powerful theoretical concepts and applicable to nonlinear data fitting, SVR has been proven by a large number of scholars to outperform general neural networks [33] and machine learning algorithms [34] in prediction. The basic principle of SVR is as follows: Define the output variable as: where w is a vector of weights. φ(x) is a nonlinear function that maps x from the input space to a higher-dimensional space. b is deviation. To find w and b, create a minimization function and compute it: where δ is the minimization objective function. C is the penalty factor. ζ i , ζ * i are the relaxation variables for the upper and lower boundaries of the i-th sample.
Define ε as the insensitive loss factor. y i is the output value of the i-th sample. There are ζ i , ζ * i , y i , ε, and w i and b to be satisfied: By introducing the Lagrange multipliers α * i and α, and solving the quadratic planning problem, the regression function is obtained.
where K x i , x j is the kernel function. x i and x j represent the input vectors of the kernel function. x i is the input variable in the training set and x j is the input variable in the test set.

Deep-Learning Differentiation Models with Double Noise Reduction
Based on SSA, ICEEMDAN, SVR, and GRU neural networks, hybrid models of SSA-ICEEMDAN-SVR-GRU are proposed. The process of constructing the model is shown in Figure 2. The descriptions of the details are given below: (1) Use SSA to decompose and reconstruct the original data, extract the effective trend information, and discard the noise part. The SSA window length is set to seven, with bus 1 selecting the first five principal component reconstruction data and bus 2 selecting the first three principal component reconstruction data.
(2) Use ICEEMDAN to decompose the data after the SSA noise reduction into subsequences with different complexities to reduce the influence of noise still existing in the data. Add the standard deviation of noise of 0.45 to ICEEMDAN for 1000 times, and the maximum number of iterations is allowed to be 40,000.  In order to test the abilities of SSA-ICEEMDAN-SVR-GRU models to predict the NO x emissions of diesel vehicles in actual roads, we compare the performance of 12 different prediction models, including SVR models, LSTM models, GRU models, SSA-SVR models, SSA-LSTM models, SSA-GRU models, ICEEMDAN-SVR models, ICEEMDAN-LSTM models, ICEEMDAN-GRU models, RF (Random Forest) models, and Bayes network models.

Data Sources
The data of this study comes from the vehicle exhaust online monitoring platform, which uses the GPRS (General Packet Radio Service) to monitor the real-time exhaust emission of the vehicle when driving on the actual road. The terminal equipment includes a nitrogen oxide sensor, OBD (On-Board Diagnostic) data reader, and wireless data transmission unit. The NO x sensor is an electrochemical sensor that directly measures the amount of NO x emissions per second. Through strict management and control, collect and manage the data information of exhaust monitoring parameters and the basic information of vehicles. The data collected by the platform has the characteristics of standardization, authenticity, and accuracy. The basic engine parameters for the two buses are given in Table 2. The experimental data were selected according to the method of Günther et al. [35]. The data acquired by the monitoring platform were divided into individual micro-travels. Micro-travel is defined as the travel between two stops starting at speed zero and ending at speed zero. These stops include bus stops, traffic light sections, and stops due to heavy traffic.
For buses, the selected samples include the data for all the working conditions of idle speed, acceleration, constant speed, deceleration, and stop. The main parameters collected include vehicle speed, engine speed, torque percentage, instantaneous fuel consumption, accelerator pedal opening, and NO x concentration. By modeling the collected real-time data, the transient emission prediction of NO x concentration for diesel vehicles is realized.

Evaluation Indexes
In this paper, root mean square error (RMSE) [36], determination coefficient (R 2 ) [37], mean absolute error (MAE) [36], and normalized root mean squared error (NRMSE) [38] are used to evaluate the performance of the model. The calculation formula is as follows: here, y is the experimental value,ŷ is the predicted value of the model, y is the average value of the measured value, y max is the maximum value of NO x concentration, and y min is the minimum value of NO x concentration.

Data Processing Analysis
In this part, the main theories and algorithms are introduced in detail, including the SSA algorithm, ICEEMDAN algorithm, GRU network, and SVR machine learning algorithm. Among them, GRU and SVR models are completed by Python 3.6, and data processing algorithms, including SSA, and ICEEMDAN are completed by MATLAB 2019b.

Singular Spectrum Noise Reduction
In the process of SSA noise reduction, the window length L and the number of reconstructed principal components m are determined. Generally, let L < N 2 , where N is the length of the noise-reduction sequence, and the proportion of the sequence period [38]. Since there are no obvious periodic changes in the actual driving process of diesel buses, it is more appropriate to set the window length L as seven after the test. The number of principal components m is very important for reconstructions and noise reductions, the value of which is usually set based on the contribution rate of each principal component eigenvalue. Generally, the contribution of the noise component is very small. The component characteristics of the two buses are shown in Figure 3. It can be seen from Figure 3 that the first two components are significantly larger than the rest, indicating that they have more information content. Table 3 shows the performance results of SSA-SVR models when m takes different values. It is found from Table 3 that bus 1 has the best result when m = 5, which indicates that the data after component 5 is the noise components that should be removed, and the information component before component 5 should be retained. From Table 3, it is found that bus 2 has the best result when m = 3, which indicates that the data after component 3 contains noises that are directly so the information of the first three components is retained. Figure 4 shows the NO x emission sequence results after the SSA treatment.

ICEEMDAN Decomposition Sequence
After the noise of the original NO x concentration data is reduced by SSA, use ICEEM-DAN to decompose the data and obtain the subsequences of two buses. Then add the standard deviation of noise of 0.45 to ICEEMDAN for 1000 times, and the maximum number of iterations is allowed to be 40,000, which fully avoided the influence of noises on the results. The decomposition results of the two buses are shown in Figure 5. The fluctuation frequency of the subsequence gradually becomes stable with the increase of decomposition times until there are residual quantities of a single trend. Bus 1 is decomposed into 13 IMFs and 1 RES, bus 2 is decomposed into 12 IMFs and 1 RES.

Calculation of Sub-Sequence Complexity
In order to reduce the computational complexity of the GRU neural network and avoid overfitting situations, calculate the complexity of the NO x concentration series by the sample entropy. The sample entropy can be used to evaluate the repetition of the waveform before and after or frequencies. The higher the entropy, the greater the frequency change in the waveform, and the more repetitive the waveform; the lower the entropy, the smaller the frequency change in the waveform, and the less repetitive the waveform. According to the complexity of subsequences, divide the subsequences into high-frequency ones and low-frequency ones. Use GRU neural network to predict high frequency, and use SVR to predict low frequency. The calculation results of sample entropy of two bus NO x emission subsequences are shown in Table 4.
Looking at Figure 5 and Table 4, the greater the sample entropy, the more obvious the frequency change of subsequence, and the higher the complexity. When the sample entropy is lower than 1, the fluctuation of sequence data is more gentle. Therefore, this paper takes the sample entropy value equal to 1 as the division standard of high-frequency and low-frequency sequences. IMF1 to IMF6 of bus 1 are predicted using GRU neural network, and IMF7 to IMF13, including RES, are predicted by SVR; IMF1 to IMF5 of bus 2 are predicted using GRU neural networks, and IMF6 to IMF12, including RES, are predicted by SVR.

Analysis of the Prediction Results of Each Sub-Sequence by GRU and SVR
For the subsequences decomposed by ICEEMDAN, use GRU and SVR models to predict the high-frequency and low-frequency subsequences respectively. Observe the learning situation of the subsequences. For NO x emission of the two buses, the multi-index evaluation results of each subsequence prediction model are given in Table 5. In the experiment, the high-frequency sequence IMF1 of bus 1 and bus 2 is complex. According to the GRU model, the value for the two buses R 2 are 0.620 and 0.510 respectively, which indicates that the noise or outliers may be decomposed into IMF1 during the decomposition process, which affects the prediction effects of the models. However, with the increase of decomposition times, the noise data in the subsequence is gradually reduced, and the prediction stability of the model is significantly enhanced, with a R 2 which is close to 1, indicating that the predicted and actual values of the subsequence are very close, and the prediction effects are excellent. In the low-frequency sequence, the SVR model also shows good effects, and the value of R 2 is close to 1, which shows that in the low-frequency sequence, SVR can replace GRU, and the calculation time of the SVR model is much lower than that of GRU models. Therefore, use the SVR model to reduce the prediction time in low-frequency sequences, but the prediction performance of the model is still stable.

Results and Discussion
In order to verify the prediction performance of the multiple noise-reduction deeplearning differentiated model SSA-ICEEMDAN-SVR-GRU proposed in this paper, a comparative analysis was carried out. First, compare the model with five single models of GRU, LSTM, SVR, Bayes, and RF without noise reductions; then compare the model with GRU, LSTM, and SVR that have undergone SSA one-step noise reduction; in addition, ICEEMDAN-GRU, ICEEMDAN-LSTM, and ICEEMDAN-SVR models are added for comparison, involving a total of 12 different models for comparison. Table 6 lists the multi-index evaluation results of two buses using different models.

Comparative Analysis of Single Models
It is not possible to achieve ideal results in the transient NO x emission. Taking the transient NO x emission prediction results of bus 2 as an example, the RMSE, MAE, and R 2 of the SSA-ICEEMDAN-SVR-GRU models are increased by 52.613, 43.202, and 9.932% respectively compared to the GRU models. Compared with the random forest model, the RMSE, MAE, and R 2 of the SSA-ICEEMDAN-SVR-GRU model are improved by 50.354, 45.727, and 9.685%, respectively.
In order to further analyze and verify the predictive ability of the SSA-ICEEMDAN-SVR-GRU models on the instantaneous emission of NO x from two buses, Figure 6 shows the comparison results for the five single models of Bayes, RF, SVR, LSTM, and GRU. Combining Table 6, it can be found that the predictive ability of single models is limited and cannot accurately capture the significant characteristics of transient NO x concentration emissions, and the prediction error is relatively large. Single models can predict the overall trend of changes in NO x transient emissions, but there are huge gaps between the predicted result and the actual value. The main reason for this result is that in the actual road operation of diesel vehicles, the road conditions are complex and changeable. The driver needs to adjust the fuel supply and emergency braking constantly. The engine presents an irregular and rapid change state, which directly affects diesel combustion and NO x emissions. It is difficult for single models to capture the transient emission law of NO x , and it is impossible to model and predict accurately.
NOx emissions. It is difficult for single models to capture the transient emission law of NOx, and it is impossible to model and predict accurately.

Comparative Analysis of SSA Single Treatment Results
After the original data is processed by SSA noise reduction, the prediction accuracy is significantly higher than that of a single model. For example, in bus 2, the RMSE, MAE, and R 2 of SSA-GRU are improved by 18.122, 11.347, and 3.500%, respectively. However, compared with the SSA-ICEEMDAN-SVR-GRU model, satisfactory prediction results are still not achieved. In the transient NO x emission prediction results of bus 2, the SSA-ICEEMDAN-SVR-GRU model is better than the RMSE of the SSA-GRU model, in which the values of MAE and R 2 increased by 42.124, 35.932, and 6.181% respectively. Figure 7 shows the modeling results after processing the transient NO x emission results of two buses using singular spectrum analysis. Through comparative analysis of Figures 6 and 7, it can be found that the noise after the original sequence is reduced by SSA and the NO x sequence shows a more obvious trend. The regularity of the NO x transient sequence can be found more easily by using a machine learning model, and the stability and generalization of the model can be enhanced. By comparing the first mock exam with the SSA model, the R 2 of SVR, LSTM, and GRU increased by 2.587, 3.964, and 3.533% respectively, which means that SSA can enhance the interpretability and prediction ability of the model. Table 3 shows the influence of different values of SSA-SVR model m on the prediction results. In the process of SSA noise reduction, if the m value is too large, the noise in the NO x emission sequence may not be filtered completely, which will directly affect the basic information of the NO x emission sequence; if the value of m is too small, the correct, and useful information in the original sequence will be filtered out. Therefore, selecting an appropriate m value to filter the transient emission data of NO x can improve the data quality, reduce the influence of random interference, and enhance the prediction accuracy and stability of the model. The experimental results show that SSA noise reduction is very helpful to extract useful information from the original NO x sequence and improve the prediction accuracy of NO x transient emission.

Comparative Analysis of Single Treatment Results of ICEEMDAN
The original data is decomposed by ICEEMDAN and then modeled. The prediction accuracy and stability are significantly higher than the single model and the SSA noise reduction processing model. For example, for bus 2, the RMSE, MAE, and R 2 of the ICEEMDAN-GRU model are improved by 42.44%, 32.442, and 8.58%, respectively, compared with the single model GRU. The RMSE, MAE, and R 2 of the ICEEMDAN-GRU model are increased by 29.701, 23.795, and 4.907%, respectively, compared to the SSA-GRU noise-reduction model SSA-GRU. Through the comparison of the above two groups, it is found that the ICEEMDAN decomposition has a better noise-reduction effect on nonlinear and unstable data, and it is easier to mine the internal information and characteristics of the data. However, the dual noise-reduction SSA-ICEEMDAN-SVR-GRU model increased the RMSE, MAE, and R 2 of ICCEMDAN-GRU by 17.672, 15.927, and 1.248%, respectively. Figure 8 shows the comparison results of modeling and comparison of the original NO x emission sequences of two buses after ICEEMDAN processing. Figures 6-8 are compared and analyzed. The original sequence of NO x emissions is processed by ICEEMDAN and then modeled and predicted. The predicted result is closer to the experimental value, and more accurate prediction results can be obtained under high-load and high-speed conditions of diesel vehicles. Combining Table 6, it is found that the prediction performance of the model processed by ICEEMDAN is higher than that of SSA noise reduction. For example, in the GRU model of bus 2, the RMSE of the SSA-GRU model is 18.122% higher than that of the GRU model, and the ICEEMDAN-GRU is 42.441% higher than the GRU. This improvement is mainly due to the fact that ICEEMDAN decomposes the transient NO x emission sequence into multiple subsequences with different frequency fluctuations. From Table 4, it is found that these subsequences have lower sample entropy, that is, they have lower sample entropy. The complexity of the model can better capture the changing law of the sequence, thereby improving the predictive ability.
quences with different frequency fluctuations. From Table 4, it is found that these subsequences have lower sample entropy, that is, they have lower sample entropy. The complexity of the model can better capture the changing law of the sequence, thereby improving the predictive ability. The comparison of the dual noise-reduction differential model SSA-ICEEMDAN-SVR-GRU with the single model and single-processing model shows that the prediction accuracy of the model proposed in this paper is higher than other models. Among them, a single model has the worst performance. This is because the originally collected NO x emission sequence contains a lot of noise and a variety of fluctuating frequencies. A single model cannot learn its laws well and accurately. After the noise of the original NO x emission sequence is reduced by SSA, the noise in the sequence is significantly reduced, the model can better capture the sequence information, and the accuracy of the model is improved. The original data is directly processed by ICEEMDAN, which can obtain better prediction results. However, because ICEEMDAN directly decomposes the original sequence, the subsequence is less complex than the original sequence, and the sampling results of different frequencies are distributed in different subsequences, which is easier to find than the law but does not filter the noise, so the prediction accuracy is still insufficient. After the noise of the original sequence is reduced by SSA, ICEEMDAN is used to decompose, and the accuracy of the model is further improved. At the same time, the complexity of subsequences is analyzed through sample entropy, and SVR is used to replace low-complexity subsequences, which reduces the calculation cost and prediction time while maintaining model stability and prediction accuracy. Therefore, the SSA-ICEEMDAN-SVR-GRU model proposed in this paper is a prediction method with excellent performance. By comparing different models, using SSA to reduce the noise of the transient NO x emissions for diesel vehicles, and ICEEMDAN decomposition can effectively improve the prediction accuracy of the model.

Conclusions
This research proposes a differential model SSA-ICEEDMAN-SVR-GRU with dual noise reduction to improve the prediction of transient NO x emissions of diesel vehicles. In the proposed SSA-ICEEDMAN-SVR-GRU model, SSA is used for noise reduction, ICEEMDAN is used to decompose the data, and sample entropy is used to calculate the subseries complexity, using a differentiation model depending on the complexity. The proposed model is validated with data from two buses and compared with five single models, three SSA processing models, and three ICEEMDAN processing models. Based on the comparison of three sets of experiments, the following conclusions are drawn: (1) After noise reduction by SSA, the prediction model is established. The accuracy is higher than that of a single model. It shows that SSA can remove the outliers in the original sequence. It also shows that noise has a great influence on the prediction results of NO x transient emissions of diesel vehicles. (2) Using ICEEMDAN to process the original data and then modeling, the prediction accuracy is significantly improved, indicating that ICEEMDAN's decomposition method can effectively extract the trend law and useful information of the original sequence, which helps the model learn its internal laws and improve the performance of the model. (3) After comprehensively considering SSA noise reduction and ICEEDAN decomposition, the combined model SSA-ICEEMDAN-SVR-GRU has the best prediction performance. The results show that double noise reduction has better prediction performance than the single-processing method in improving the accuracy of transient NO x emission prediction. (4) The use of SVR in the low-frequency sequence instead of the GRU model with higher computational cost can reduce the prediction time and maintain the prediction performance of the model.
Overall, the SSA-ICEEMDAN-SVR-GRU model proposed in this paper helps to analyze the diesel vehicle's NO x emissions on actual roads, replacing physical NO x emissions sensors with virtual NO x emissions sensors, etc., to provide transient NO x emissions for diesel vehicles A new method.