Forecasting of Disassembly Waste Generation under Uncertainties Using Digital Twinning-Based Hidden Markov Model

Disassembly waste generation forecasting is the foundation for determining disassembly waste treatment and process formulation and is also an important prerequisite for optimizing waste management. The prediction of disassembly waste generation is a complex process which is affected by potential time, environment, and economy characteristic variables. Uncertainty features, such as disassembly amount, disassembly component status, and workshop scheduling, play an important role in predicting the fluctuation of disassembly waste generation. We therefore focus on revealing the trend of waste generation in disassembly remanufacturing that faces significant influences of technology and economic changes to achieve circular industry sustainable development. To dynamically predict the generation of disassembly waste under uncertainty, this work proposes a statistical method driven by a probabilistic model, which integrates the digital twinning, Gaussian mixture, and the hidden Markov model (DG-HMM). First, digital twinning technology is used for real-time data interaction between simulation prediction and decision evaluation. Then, the Gaussian mixture and HMM are used to dynamically predict the generation of disassembly waste. In order to effectively predict the amount of disassembly waste generation, real data collected from a disassembly enterprise are used to train and verify the model. Finally, the proposed model is compared with other general prediction models to illustrate the correctness and feasibility of the proposed model. The comparison results show that DG-HMM has better prediction accuracy for the actual disassembly waste generation.


Introduction
With the shortening of product life cycle and the acceleration of product upgrade, the rapid growth of obsolete products will put tremendous pressure on the ecological environment and human health [1,2]. The development of environmental awareness and the circular economy has forced manufacturers to increasingly consider the importance of environment protection in the production process [3]. Effective treatment of disassembly waste has become an important factor in the industrial circular economy and sustainable industrial development [4]. Disassembly waste is the useless solid, semi-solid and oil-water mixture produced in dismantling activities, including rust and nonferrous metals (cadmium, chromium, mercury, etc.), batteries, plastics, and various working fluids (lubricants, engine oil, etc.) [5]. Random dumping or improper disposal of disassembly waste can seriously endanger human health and even cause damage to the ecological environment. The prediction of disassembly waste generation is a complex process, which is affected by many factors, including rapid product upgrades, new technological innovations, machine Sustainability 2021, 13, 5391 2 of 15 usage conditions, component damage, and storage time. These influential factors in waste generation and their uncertainties are difficult to quantify [6]. Therefore, how to predict the generation of dismantling waste in an uncertain environment has become full of challenges. On this basis, in order to promote the sustainable development of resources, the ecosystem, and the environment [7,8], it is of great significance to study disassembly waste in the industrial system.
The existing approaches for waste generation forecasting are classified into four main categories: traditional statistical models [9,10], the gray and fuzzy models [11], simulation models [12], and nonprobabilistic statistical learning models [13]. In terms of traditional statistical models, Karpušenkaitė [14], Box [15], Giannouli [16], Chen [17], Althaf [18], and others have studied the time series prediction of uni-variate models involving environmental applications. Denafas [19] applied the municipal waste composition data to a time series prediction model, which can quantitatively estimate seasonally changing waste generation. Unlike deterministic models, stochastic models and probabilistic model-driven statistical methods have been widely used to predict waste generation under uncertain conditions [20,21], for example, by Peeters [22], Karpušenkaitė [23], Abdoli [24], and Kannangara [25]. However, the uncertainties about input data and predicted recovery potential are often overlooked. Gray and fuzzy theories can solve the problem of uncertainty [26], and a reliable model output can be obtained even with poor data. Chauhan [27] integrated interpretive structure modeling, a fuzzy analytic hierarchy process, and fuzzy technology to predict medical waste with order preference. Noori [28] combined a wavelet transform fuzzy system and a wavelet transform artificial neural network to predict solid waste generation. At the same time, with the increasing requirements for green sustainable development and cleaner production management technology [29,30], in terms of the current research situation, Tsai [31], Roubík [32], and Ahamed [33] have conducted extensive research on waste resource reuse, environmental climate change, and waste system management. Bai [34] presented an overview of the current solid waste management situation in Singapore and provided a brief discussion of the future challenges.
However, most of the mainstream studies focused on the dynamic prediction of waste generation in a certain time period or system, lacking research on the real-time dynamic prediction of waste generation under uncertainty [35]. Therefore, in this study, based on the HMM prediction model [36], we make improvements to be suitable for real-time dynamic prediction of dismantling waste. During the disassembly process, a lot of data about disassembly waste will be generated. In order to improve the accuracy of prediction, it is necessary to introduce digital twinning to dynamically predict the generation of disassembly waste. In addition, Gaussian mixture regression (GMR) constructs the joint probability density function of data and obtains a regression function from the probability model. The density estimation can reflect the trend of most data, which is an advantage for the prediction of dismantling waste. In this work, digital twinning technology, Gaussian mixture, and hidden Markov model (DG-HMM) are integrated to solve the interference factors of dynamic prediction. The proposed model is a new solution for the waste generation prediction in the disassembly process.
The primary objective of this paper is to dynamically predict in real-time the generation of disassembly waste, so as to provide necessary references for waste management, production technical improvement, and the capital budget of the disassembly system. Therefore, the contributions of this paper are mainly in three points: (1) A novel DG-HMM, which integrates digital twinning, Gaussian mixture, and HMM, was proposed to dynamically predict in real-time disassembly waste generation under uncertainty. (2) Based on the prediction model of HMM, the wavelet denoising was used for data cleaning, and a weighted average Gaussian mixture method was suggested to solve the problem of disassembly waste generation. (3) The proposed DG-HMM forecasting framework, including data pretreatment and forecasting analysis, can be expressed in an HMM to reduce prediction uncertainty. The proposed model was applied to a real case using daily Sustainability 2021, 13, 5391 3 of 15 disassembly waste generation data from 1 January 2018 to 31 December 2018, and its reliability and accuracy in forecast were verified.

DG-HMM Modeling
In this model, the uncertainty of disassembly waste was a hidden variable. It is assumed that the hidden state sequence follows discrete time and a first-order Markov chain, which means that the current state only depends on the previous state. The predictor variable is regarded as an observation of a continuous random variable. In this work, we assumed that the observation was conditionally independent of generated states. The hidden state sequence was composed of the total uncertainty of disassembly waste generation, and the observation sequence was composed of actual values. The generation of disassembly waste is a noncyclical fluctuation problem.
DG-HMM includes two main stages, namely the dynamic prediction and the decision evaluation. In the dynamic prediction stage, the wavelet denoising is a tool for removing noise in the data series, and then HMM is used for forecasting. When the prediction accuracy exceeds the given value, the prediction result will enter the decision evaluation stage. In the decision evaluation stage, the algorithm parameters are reset, the model is trained, then Gaussian mixture mutation is used, and finally training data are postprocessed.
The predicted values that satisfy the prediction accuracy requirements will be output to physical workshop. Real-time data interaction can be carried out between dynamic prediction and decision evaluation. The data of each forecast will be automatically saved in the database so that they can be used in parameter setting and decision making for the next forecast. The prediction results of the model can provide a theoretical reference for disassembly waste generation and planning implementation. The structure of the DG-HMM dynamic prediction of disassembly waste generation is shown in Figure 1. reduce prediction uncertainty. The proposed model was applied to a real case using daily disassembly waste generation data from 1 January 2018 to 31 December 2018, and its reliability and accuracy in forecast were verified.

DG-HMM Modeling
In this model, the uncertainty of disassembly waste was a hidden variable. It is assumed that the hidden state sequence follows discrete time and a first-order Markov chain, which means that the current state only depends on the previous state. The predictor variable is regarded as an observation of a continuous random variable. In this work, we assumed that the observation was conditionally independent of generated states. The hidden state sequence was composed of the total uncertainty of disassembly waste generation, and the observation sequence was composed of actual values. The generation of disassembly waste is a noncyclical fluctuation problem.
DG-HMM includes two main stages, namely the dynamic prediction and the decision evaluation. In the dynamic prediction stage, the wavelet denoising is a tool for removing noise in the data series, and then HMM is used for forecasting. When the prediction accuracy exceeds the given value, the prediction result will enter the decision evaluation stage. In the decision evaluation stage, the algorithm parameters are reset, the model is trained, then Gaussian mixture mutation is used, and finally training data are postprocessed. The predicted values that satisfy the prediction accuracy requirements will be output to physical workshop. Real-time data interaction can be carried out between dynamic prediction and decision evaluation. The data of each forecast will be automatically saved in the database so that they can be used in parameter setting and decision making for the next forecast. The prediction results of the model can provide a theoretical reference for disassembly waste generation and planning implementation. The structure of the DG-HMM dynamic prediction of disassembly waste generation is shown in Figure 1.   In HMM, z t represents the discrete time of a hidden state at time t, 0 ≤ t ≤ T, and x t represents the observation sequence. The conditional distribution of an observed state is p(x t |z t ). The probability of z t is related to the state of z t−1 . The latent variables can be expressed as k-dimensional binary variables. The transition probability matrix corresponds to a number table, which is represented by A: where a ij is the transition probability from state i to state j, a ij = p(z t = j, z t−1 = i), 0 ≤ a ij ≤ 1, ∑ j a ij = 1. Based on the above description, the observation sequence x t depends on the hidden state z t . In this work, the conditional distribution of the observed variables was defined as p(x t |z t = k, B), where B is a set of parameters for conditional distribution, and The joint probability distribution of latent and observation variables can be obtained by formula (2): where X = {x 1 , · · · , x T }, Z = {z 1 , · · · , z T };θ = {π, A, B} represents the parameter set; π k denotes the initial probability of state k; µ k and ∑ k are mean vector and covariance matrix of the kth Gaussian distribution, respectively. Next, the conditional distribution will be derived from HMM. The data x t can be separated into two subvectors, namely x t = [y t , z t ]. y t and z t comprise the predictors and the prediction values, respectively. From the previous analysis, x t = [y t , z t ] can be regarded as a joint Gaussian p(y t , z t ) ∼ N(µ k , ∑ k ) [37]. When p(y t , z t ) is a joint Gaussian distribution, GMR can be used to derive the conditional density of HMM. In addition, HMM is also called the dependent mixture model, which can be interpreted as an extension of the mixture model. The selection of the mixture composition for each observation is not independently finished but depends on the composition for the previous observation. The predictors and prediction values in each state of HMM are defined as Equation (3): To derive the M-step for the µ k and ∑ k terms, we considered the parts of log likelihood that depend on µ k and ∑ k , which is a weighted version of the standard problem for computing the maximum likelihood estimate with a multivariate Gaussian. For each Gaussian state k, the conditional expectation and the estimated conditional covariance are shown in Equations (4) and (5).
where µ k z t |y t is the conditional expectation of z t given y t ; ∑ k z|y represents the estimated conditional covariance of z t given y t .
For a given time t, the observations were modeled by a mixture of K-Gaussian distributions [38]. The conditional probability distribution function of z t |y t is shown in Equation (6): where h k (y t ) is HMM forward variable, which corresponds to a probability that the observation sequence y t in state k at time t.
In the original GMR framework, the influence of different Gaussians is represented by the weight h k , which is defined as each Gaussian's probability of the observed value. To extend GMR, this model used the possibility of recursive computing a likelihood and the probability of emission in the HMM framework. Therefore, the predictors y t and the sequential information probabilistically h k (y t−1 ) are encapsulated in the HMM: where h k (y t ) is initialized to h k (y 1 ) = π k N y t |µ k y , ∑ k yy , which corresponds to the probability of observing the partial sequence y 1 , y 2 , . . . , y t , and being in stake k at time t.
The conditional probability distribution function given in Equation (6) is the complete prediction probability density of DG-HMM, which can predict the distribution of the predictor values for a given variable.

Performance Measures
The quality of predictions can be reflected through bias and reliability. Bias is defined as the correspondence between an average predicted value and an observed value. In order to evaluate the prediction accuracy, the mean absolute error (MAE), the coefficient of determination (R 2 ), and the adjusted determination coefficient (R 2 ) are selected to evaluate the proposed model. In addition, the unified probability map of the predicted probability integral transformation (PIT) value is used to evaluate prediction reliability. MAE is defined as the average squared error. In regression analysis, MAE is also used to represent the unbiased estimate of the error variance. The calculation formulas for MAE, R 2 , andR 2 are shown in Equations (8)- (10).
where x t represents an actual observation at time t; x t is a prediction value at time t; l represents the number of explanatory variables. By obtaining a unified probability map of the PIT value, the performance of a prediction can be assessed in an intuitive way. When the forecasts are reliable, PIT values follow a uniform distribution between 0 and 1. The PIT value can be obtained from the cumulative distribution function p(t) and the observed value x t , as shown in Equation (11).

Data Pretreatment
The uncertainty of the dismantling operation will cause the dismantling waste generation to be fluctuating and nonlinear. In this work, the wavelet denoising was used to clean the original data to filter high-frequency noise signals. The wavelet denoising uses the scaling and translating version of the initial wavelet ϕ j,w (x) to construct an approximation of original function f (x) and is a practical method to eliminate data noise in time series signals [39]. Wavelet translation is shown in Equations (12) to (14): where c j,w is wavelet coefficients; δ represents a constant; w is the translation version of a wavelet; 2 j represents the scale parameter of a wavelet. Orthogonal wavelet is a tool for analyzing discrete wavelets, also known as Daubechies wavelets [40]. As a modification of Daubechies wavelet, the Symlets wavelet is almost a symmetric wavelet [41]. This work used the Daubechies wavelet and Symlets wavelet for wavelet decomposition and synthesis reconstruction. After a wavelet is decomposed, the data signal can be decomposed into a low-pass filter and a high-pass filter. The potential trends of influencing factors can be obtained by a low-pass filter, whilst the noise characteristics can be stored by a high-pass filter. For reconstructed wavelets, the new time series will be treated as updated data for training and testing.

Baum-Welch Algorithm
The Baum-Welch algorithm is a variant of the expectation maximization algorithm (EMA) [42]. This work used EMA to estimate the parameter θ = {π, A, B}. Regarding the EMA, local optimization should be avoided to the greatest extent when initializing parameters. As an iterative algorithm, EMA obtains the maximum likelihood estimate by maximizing the likelihood function in E-step and M-step. Concisely, EMA is an iterative algorithm that alternates between inferring the hidden states given the parameters (E step) and optimizing the parameters given the data (M step).
(1) The E-step is the calculation of state probability and occupancy probability. The state probability and the occupation probability can be obtained via backward and forward inductive computations [43]. The specification of the expected complete data log likelihood Q is given as follows Equations (15) to (17): where γ t (k) is the occupation probability in state k at time t; ξ t (i, j) is the probability being in state i at time t − 1 and being in state j at time t.
(2) In M-step, γ t (k) and ξ t (i, j) are used to reoptimize the parameter θ = {π, A, B}. The maximization of π and A can be achieved by Lagrange multiplication, the calculation is shown in Equations (18) and (19): To derive γ t (k) and ξ t (i, j) in M-step, this work considers a weighted version of multivariate Gaussian maximum likelihood estimate. The new parameter estimates are shown in (20) and (21): After the new estimated value is obtained, the parameter θ = {π, A, B} can be set. If the calculation result does not meet the convergence criterion, return to E-step.

Model Verification
The research object of this work was GEM Co., Ltd. in Hubei Province, which is a listed enterprise that recycles urban mine resources such as electronic waste, scrapped automobiles, and waste circuit boards. During the period from 1 January 2018 to 31 December 2018, the collected 314 daily data excluding Sunday were used as the original data set for disassembly waste generation with periodic fluctuations.

Parameter Setting
In order to better assess the performance of HMM, we selected several models for comparison. The traditional statistical models are Autoregressive-moving-average (ARMA) and exponential smoothing (ES) [44]. By performing a weighted average of all values in the time series, ES can assign different weights. The artificial neural network (ANN) and support vector machine (SVM) in nonprobabilistic statistical learning model are powerful in handling nonlinear prediction problems. In this work, historical data generated by disassembly waste were used as the metadata of the model. The ES model can only use the variables of disassembly waste generation for prediction. In order to improve the prediction accuracy, two main influential factors of disassembly waste generation were selected in the ARIMA model, namely the amount of disassembly and the degree of component damage. In the ANN and SVM models, the amount of disassembly, the degree of component damage and the level of disassembly were the main factors affecting waste generation. The models used in this work were all data twinning. When new data was input, the model automatically adjusted parameters for iterative optimization. The specific parameter settings of the models are shown in Table 1.

Comparison Model Parameters Setting
ANN Learning rate 0.1; the transfer function of hidden layer is tansig; the wrong target is 0.0001; the maximum number of iterations is 10,000; the hidden node is 3; the output node is 1.

SVM
Radial basis function is used as the best kernel function. In the coarse grid search, the parameters of the gamma function and the penalty parameters are set to the range [2 −8 , 2 8 ]. The steps of coarse grid search and fine grid search are set to 1 and 0.2, respectively.

ES
Using a multiplicative algorithm to predict the amount of waste generated during disassembly.

Computational Complexity Analysis
The disassembly waste generation data set was selected as the input data to meet the requirements of the stable time series. The selected data set was divided into training group and testing group. The single-sample Kolmogorov-Smirnov test can help to check the normality of the input data, which can initialize the state transition matrix. In this work, the penalty medium threshold function in the MATLAB wavelet toolbox was used to reduce noise, and the original signal was decomposed by Daubechies of level 2. Taking into account the periodicity of disassembly waste, the HMM data set was converted into a matrix data set. The test results are shown in Table 2. For the six sample sets, the minimum mean is 2.963 (Thursday) and the maximum mean is 4.178 (Friday). The range of standard deviation is 1.857 (Thursday) to 3.135 (Friday). The most extreme differences are very close, with values ranging from 0.087 (Thursday) to 1.121 (Monday). The value ranges of Kolmogorov-Smirnov and Asymp. Sig. are 0.817 (Wednesday) to 1.039 (Friday) and 0.238 (Friday) to 0.535 (Monday), respectively. Data in the data set meet the normal distribution with a significance level of 0.05. Therefore, there is no evidence to reject the null hypothesis that the observations are independently distributed. In order to verify the independence and uniform distribution, this paper conducts a correlation graph test (each simulation run 50 times) on the time series of disassembly waste generation. The results show that only 4 of the 52ρ k values are outside the range of ±1.97/ √ T (α = 0.05), and there is no evidence that the negative null hypothesis is independently distributed. According to the data pattern recognition in DG-HMM, two sets of relatively close data were selected for analysis, and the results are shown in Figure 2. The abscissa in Figure 2 represents the time period (Mon., Tues., Wed., Thur., Fri., and Sat.), and the ordinate represents the amount of disassembly waste. As can be seen from Figure 2a, the data pattern for the 50th week, the 23rd week, and the 7th week is similar to that of the 35th week. Figure 2b shows that the data pattern of the 42nd week, the 26th week, and the 18th week is similar to that of the 37th week.

Computational Complexity Analysis
The disassembly waste generation data set was selected as the input data to meet the requirements of the stable time series. The selected data set was divided into training group and testing group. The single-sample Kolmogorov−Smirnov test can help to check the normality of the input data, which can initialize the state transition matrix. In this work, the penalty medium threshold function in the MATLAB wavelet toolbox was used to reduce noise, and the original signal was decomposed by Daubechies of level 2. Taking into account the periodicity of disassembly waste, the HMM data set was converted into a matrix data set. The test results are shown in Table 2. For the six sample sets, the minimum mean is 2.963 (Thursday) and the maximum mean is 4.178 (Friday). The range of standard deviation is 1.857 (Thursday) to 3.135 (Friday). The most extreme differences are very close, with values ranging from 0.087 (Thursday) to 1.121 (Monday). The value ranges of Kolmogorov−Smirnov and Asymp. Sig. are 0.817 (Wednesday) to 1.039 (Friday) and 0.238 (Friday) to 0.535 (Monday), respectively. Data in the data set meet the normal distribution with a significance level of 0.05. Therefore, there is no evidence to reject the null hypothesis that the observations are independently distributed. In order to verify the independence and uniform distribution, this paper conducts a correlation graph test (each simulation run 50 times) on the time series of disassembly waste generation. The results show that only 4 of the 52 values are outside the range of ± 1.97 √ ⁄ (α = 0.05), and there is no evidence that the negative null hypothesis is independently distributed. According to the data pattern recognition in DG-HMM, two sets of relatively close data were selected for analysis, and the results are shown in Figure 2. The abscissa in Figure 2 represents the time period (Mon., Tues., Wed., Thur., Fri., and Sat.), and the ordinate represents the amount of disassembly waste. As can be seen from Figure 2a, the data pattern for the 50th week, the 23rd week, and the 7th week is similar to that of the 35th week. Figure 2b shows that the data pattern of the 42nd week, the 26th week, and the 18th week is similar to that of the 37th week. In this work, the uncertainty of disassembly waste generation was used as a hidden variable. It can be seen from Table 3 that the value of NHS is two or three, and the value In this work, the uncertainty of disassembly waste generation was used as a hidden variable. It can be seen from Table 3 that the value of NHS is two or three, and the value of NGMC ranges from one to three. The minimum and maximum values of MLE are 306 and 397, respectively. The value of AIC ranges from −185 to −49. Regarding AIC standard, the hidden state is set to three, and the mixed component of Gaussian mixture model (GMM) is set to three, as shown in Table 3. Therefore, Uncertainty can be divided into three hidden states, namely, "S", "M" and "L". The initial transition probability between different hidden states can be represented by a matrix heat map. The transition matrix (A) is shown in Figure 3, and the color bar represents the transition probability of the matrix (A). The result of the chi-square test shows χ 2 > χ 2 α (q − 1) 2 , which indicates that the hidden state sequence satisfies the Markov attribute. (A). The result of the chi-square test shows > ( − 1) , whic den state sequence satisfies the Markov attribute.

Prediction Reliability Analysis
Before determining the rationality of the DG-HMM, its relia ated. The PIT uniform probability plots illustrate the reliability o obtained in the monthly forecasting, as shown in Figure 4. It ca that the predicted values of Figure 4c tend to be above the diag values of Figure 4d are concentrated below the diagonal. Howev of Figure 4a,b basically fall on the diagonal. Although the PIT va tember deviate slightly from the diagonal, the PIT values are ve In June and July, the PIT values are within the 5% significant ba shows that the PIT value distribution is quite uniform, and the pr tribution is feasible. There is no prediction of a distribution tha Therefore, it can be explained that the predicted probability dist with appropriate spread.

Prediction Reliability Analysis
Before determining the rationality of the DG-HMM, its reliability needs to be evaluated. The PIT uniform probability plots illustrate the reliability of the observation values obtained in the monthly forecasting, as shown in Figure 4. It can be seen from Figure 4 that the predicted values of Figure 4c tend to be above the diagonal, and the predicted values of Figure 4d are concentrated below the diagonal. However, the predicted values of Figure 4a,b basically fall on the diagonal. Although the PIT values in August and September deviate slightly from the diagonal, the PIT values are very close to the diagonal. In June and July, the PIT values are within the 5% significant band of Kolmogorov. This shows that the PIT value distribution is quite uniform, and the prediction probability distribution is feasible. There is no prediction of a distribution that is too high or too low. Therefore, it can be explained that the predicted probability distribution is unbiased and with appropriate spread. Sustainability 2021, 13, x FOR PEER REVIEW 10 of 15 . Figure 4. PIT uniform probability plots.

Comparison Plots of the Forecast and Observed Value
The predicted mean values based on a single observation and the (0.1, 0.9) quantile range are compared in chronological order as shown in Figure 5. The blue square and vertical line are the predicted average and the (0.10, 0.90) quantile range, respectively. The black horizontal line and the shaded area represent the predicted average value and the (0.1, 0.9) quantile range of waste generation, respectively. The red dot represents the observed data. The graph intuitively reflects the comparison between the probability prediction and the actual observation value. As far as the predicted value and the observed value are concerned, the predicted value has no obvious bias trend over time. The black vertical line or the red dot on the shading indicates that the probability prediction or observation value is not biased. It can be seen from Figure  The predicted average values of waste generation are shown in Figure 6 in chronological order. If the prediction is too large, the PIT value is close to one; if the prediction is too small, the PIT value is close to zero. When the prediction unbiased value is 0.50, the effect of prediction distribution is the best. The ideal prediction should produce a uniform distribution of PIT values in (0, 1). As shown in Figure 6a-d, the PIT values are almost

Comparison Plots of the Forecast and Observed Value
The predicted mean values based on a single observation and the (0.1, 0.9) quantile range are compared in chronological order as shown in Figure 5. The blue square and vertical line are the predicted average and the (0.10, 0.90) quantile range, respectively. The black horizontal line and the shaded area represent the predicted average value and the (0.1, 0.9) quantile range of waste generation, respectively. The red dot represents the observed data. The graph intuitively reflects the comparison between the probability prediction and the actual observation value. As far as the predicted value and the observed value are concerned, the predicted value has no obvious bias trend over time. The black vertical line or the red dot on the shading indicates that the probability prediction or observation value is not biased. It can be seen from Figure Figure 5b,c fluctuate more greatly in the quantile range (0.1, 0.9). Based on the results of the four-months prediction, the reliability of the DG-HMM is explained.
The predicted average values of waste generation are shown in Figure 6 in chronological order. If the prediction is too large, the PIT value is close to one; if the prediction is too small, the PIT value is close to zero. When the prediction unbiased value is 0.50, the effect of prediction distribution is the best. The ideal prediction should produce a uniform distribution of PIT values in (0, 1). As shown in Figure 6a-

Comparative Analysis
In order to further analyze the performance and comprehensive prediction capabilities of the proposed DG-HMM, the prediction results of ARMA, ES, ANN, HMM and DG-HMM were compared with actual observation data. The average value of prediction in each month was used as the forecast, and the comparison results are shown in Figure 7.

Comparative Analysis
In order to further analyze the performance and comprehensive prediction capabilities of the proposed DG-HMM, the prediction results of ARMA, ES, ANN, HMM and DG-HMM were compared with actual observation data. The average value of prediction in each month was used as the forecast, and the comparison results are shown in Figure 7. As for the overall trend in Figure 7, the prediction average values of EA are undoubtedly lower than the actual observed values, but the predicted values of ANN and HMM are higher than the observed values. Obviously, the predicted values of DG-HMM and ARMA are closer to the observed values. The predicted trends of HMM and DG-HMM are very similar to actual observation, but the values of DG-HMM are closer to the observed values. Although ARMA, ES and ANN have large errors with the observed values, the overall trend is still very close. The MAE, R 2 , andR 2 of the prediction results for the five models are shown in Table 4. For MAE, the highest value is ANN (20.47), and the lowest value is DG-HMM (14.35). Regarding R 2 , DG-HMM (0.913) and ARMA (0.875) are the highest value and the lowest value, respectively. ForR 2 , ARMA has a minimum value of 0.861 and DG-HMM shows a maximum value of 0.911. For the variance improvement rate in the models, the maximum improvement rate of ANN is 1.60%, and the minimum improvement rate of DG-HMM is 0.22%, which shows that DG-HMM has the best stability among other models. The MAE, R 2 , andR 2 values of DG-HMM are significantly better than the other four models, especially for the criterion of MAE. The analysis results show that DG-HMM performs better predictability than ARMA, ES, ANN and HMM.

Conclusions and Discussion
This work proposes a real-time dynamic probability model to predict the amount of disassembly waste generation under uncertainty. We integrate digital twinning, Gaussian mixture, and a hidden Markov model for dynamic prediction of disassembly waste generation. The prediction results show that, in terms of MAE, MSE, R 2 and Ȓ 2 , the performance of the DG-HMM method is better than that of HMM, ARIMA, ES, ANN and SVM. Based on the MSE skill score, the accuracy of the point prediction obtained by DG-HMM is verified. The unified PIT diagram proves the reliability of DG-HMM. The training results of the example data set show that DG-HMM can effectively solve the problem of predicting disassembly waste generation. Through the model, the prediction will be able to eliminate the uncertainty in disassembly waste generation yield, which allows for more efficient disassembly planning decision making. The DG-HMM model expands the application scope and depth of the traditional input−output method to a certain extent. How-

Conclusions and Discussion
This work proposes a real-time dynamic probability model to predict the amount of disassembly waste generation under uncertainty. We integrate digital twinning, Gaussian mixture, and a hidden Markov model for dynamic prediction of disassembly waste generation. The prediction results show that, in terms of MAE, MSE, R 2 andR 2 , the performance of the DG-HMM method is better than that of HMM, ARIMA, ES, ANN and SVM. Based on the MSE skill score, the accuracy of the point prediction obtained by DG-HMM is verified. The unified PIT diagram proves the reliability of DG-HMM. The training results of the example data set show that DG-HMM can effectively solve the problem of predicting disassembly waste generation. Through the model, the prediction will be able to eliminate the uncertainty in disassembly waste generation yield, which allows for more efficient disassembly planning decision making. The DG-HMM model expands the application scope and depth of the traditional input-output method to a certain extent. However, the proposed model is a uniform hidden Markov model, and the waste generation process of the production system is a complex giant system with a huge amount of nonuniform data. Future research on prediction can use virtual simulations and verified methods such as regression analysis, neural networks, or machine learning [45].
With the promotion and application of digital technology, digital management of industrial production and big data simulation will be the future trend [46]. The implementation of clean production and green economy development strategy on technological upgrading will have a significant impact on waste management. Furthermore, the dynamic prediction model established in this study can provide a useful reference for relevant personnel to predict the generation of disassembly waste. Therefore, combining these findings of DG-HMM with real-time disassembly waste generation situations, three policy suggestions are provided on future disassembly management for green remanufacturing. Firstly, enterprises should strengthen the norms and standards for disassembly waste. The forecast trend of the proposed model can provide references for workshop scheduling, process improvement and product life cycle management. Secondly, the government should increase subsidies and policies for disassembly waste disposal. The government's policy support can promote enterprise to introduce advanced technology and equipment, so as to better improve the management of disassembly waste. Finally, digitizing will be the trend of disassembly waste dynamic prediction and intelligent management in the future. Meanwhile, the real-time data storage of the detection system will provide timely and effective guarantee for disassembly waste prediction.

Data Availability Statement:
The data used in this study are available from the corresponding author upon request. The data are not publicly available due to privacy restrictions.