Investigation of Machine Learning Methods for Predictive Maintenance of the Ultra-High-Pressure Reactor in a Polyethylene-Vinyl Acetate Production Process
Abstract
:1. Introduction
- ■
- This research focuses on a complex ultra-high pressure and special lubricating system with multiple product grade transitions. Moreover, inappropriate operation can lead to the run-away decomposition of the process, which may cause serious harm to people and the environment.
- ■
- ■
- ■
- Most bearing life predictions are based solely on accelerometer data. The DCS data, such as motor current, provided in real-world processes, is rarely used. This study combines datasets from different domains to produce a more holistic as well as accurate prediction.
- ■
- While traditional dataset processing time series data employs dozens of statistical features to make predictions, these features are frequently challenging to interpret and lack physical intuition. A more intuitive approach is to find and monitor characteristic frequencies or frequency bands. For example, compare the time-frequency analysis with the calculated theoretical values [35], or look into the characteristic frequency band with high monotonicity over time [36]. In this paper, we utilize machine learning approaches to select characteristic frequency bands for predicting the remaining useful life of the bearing.
- ■
- Our dataset is sparse and limited (one day, one datum, 426 days, four life cycles). Complex approaches such as CNN and LSTM are not suitable for this small set of data. In this study, we emphasize simple methods and the addition of process data (motor three-phase current).
- ■
- This is the first time this dataset has been used in a predictive maintenance analysis in terms of machine learning and AI. As a result, there is no published work on the same or similar problem or data to compare. Therefore, to build a solid foundation for future studies, we reviewed several commonly used regression methods rather than focusing on ensemble architectures, which might generate better results.
- ■
- This study also aids the industry in the implementation of a predictive maintenance plan, which increases production efficiency while preventing hazardous pollution. This is very much in line with Sustainable Development Goal (SDG) Target 8.8, which promotes a safe working environment for those in precarious employment, and SDG Target 9.4, which encourages industry to upgrade for sustainability.
2. Process Description and Methods
2.1. Process Description
2.2. Method
2.3. Data Acquisition
2.4. HI Construction
2.4.1. Welch’s Method
2.4.2. Piecewise Linear Remaining Useful Life (PL-RUL)
2.4.3. Regression
2.4.4. Reliability Regression Results Evaluation
2.5. Health Status (HS) Division
2.6. Bearing Reliability Prediction
2.7. Limitation
- ■
- This study’s sample size is small compared to other regularly used datasets. This study’s system is an ultra-high-pressure reactor containing polymers with a broad distribution of molecular weight. This characteristic shows that the bearing functions in a variety of conditions and the method of failure are likely to vary. The dataset may not account for all potential failure mechanisms. Consequently, if a new failure mode occurs, it may be necessary to retrain the machine learning model.
- ■
- The sampling interval for this study is once per day. Therefore, it is difficult for the model to respond to short-term events such as grade transitions. In addition, short-term events may contaminate the training dataset, causing the results to deviate from the correct long-term trend.
3. Results and Discussion
3.1. Training Using Acceleration and Motor Datasets
3.2. Training Using Different Regression Methods
3.3. Parameter Selection
3.4. Gini Importance
3.5. Feature Selection of Gini Importance
4. Conclusions
- ■
- The Welch’s power spectrum density periodogram level is used to stifle strongly oscillating data and reduce the training data size by 16 times.
- ■
- We have successfully predicted the PL-RUL and health status using the Welch’s power spectrum density periodogram level and ERTs, which gives RAE and Linearity of 0.307 and 15.064, respectively.
- ■
- The analysis of the optimum selection of 70 days shows that our system has an optimum value, and PL-RUL is a better fit than RUL.
- ■
- We have performed grid search for hyperparameters tuning for seven commonly used regression algorithms. We discovered that ensemble trees algorithms have a smaller RAE than linear regression.
- ■
- Many ensemble trees algorithms, except for ERTs, have non-ideal staircase-like prediction behavior. We discovered that those with staircase-like behavior relied heavily on a few characteristic frequencies after investigating the Gini importance for each method. In contrast, the ERTs method has a large number of characteristic frequencies and hence exhibits linear behavior, making it a more suitable algorithm for this task.
- ■
- Two feature selection methods were proposed to improve prediction results. Method 1 can help improve the Linearity of the results from 15.1 to 11.7, whereas Method 2 can lower RAE from 0.307 to 0.229 in the acceleration and current combined (A/C) dataset and the Linearity from 15.1 to 8.4 in the acceleration (A) dataset.
- ■
- We have compared prediction with and without processing data (motor three-phase current) and discovered little difference between the two conditions before feature selection. However, following feature selection using Method 2, the prediction based on acceleration and motor current datasets has a lower RAE than the one without processing data. Still, the Linearity is higher, likely due to a lack of training samples.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Notation
Actual target for index | |
Actual targets average | |
Window energy correction factor | |
Index of operating time | |
Frequency band index | |
Acceleration level of frequency band for sensor , dB | |
Segments index | |
Total segments | |
Segmented, constant detrended, windowed, and zero-padded time series index | |
Segment size | |
Data length | |
The predicted target for the index | |
Periodogram of the mth block for sensor | |
Inter-segment overlapping ratio | |
Sensor index | |
Welch’s power spectral density periodogram of the frequency band for the sensor | |
Sampling frequency, Hz | |
Current time, day | |
End of use time of the bearing, day | |
First predicting time, day | |
Remaining useful life period, day | |
The discrete sequence of the acceleration time-domain data for the sensor with index | |
Zero-padded time series | |
Hanning window function of length | |
The mth segments of constant detrended, windowed, and zero-padded time series with index for sensor | |
Frequency band with index , Hz |
Abbreviation
A | Acceleration dataset |
A/C | Accelerration plus motor current dataset |
atm | Atmosphere |
CNN | Convolution neural networks |
DCS | Distributed control system |
DFT | Discrete Fourier transform |
ERTs | Extremely randomized trees |
EVA | Ethylene-Vinyl Acetate |
FFT | Fast Fourier Transform |
FNN | Feed-forward neural network |
KNN | K Nearest Neighbor |
LCB | Long chain branch |
LSTM | Long short-term memory |
MI | Melting index |
MSE | Mean square errors |
MWD | Molecular weight distribution |
PL-RUL | Piecewise linear remaining useful life |
PSD | Power spectral density |
RAE | Relative absolute error |
RBF | Radial basis function |
RF | Random Forest |
RNN | Recurrent neural networks |
RUL | Remaining useful life |
SDGs | Sustainable Development Goals |
SVM | Support vector machine |
VA | Vinyl acetate |
XGboost | eXtreme Gradient Boosting |
References
- Henderson, A. Ethylene-vinyl acetate (EVA) copolymers: A general review. IEEE Electr. Insul. Mag. 1993, 9, 30–38. [Google Scholar] [CrossRef]
- Sun, F.; Wang, G. Study on the thermal risk of the ethylene-vinyl acetate bulk copolymerization. Thermochim. Acta 2019, 671, 54–59. [Google Scholar] [CrossRef]
- Albert, J.; Luft, G. Runaway phenomena in the ethylene/vinylacetate copolymerization under high pressure. Chem. Eng. Process.-Process Intensif. 1998, 37, 55–59. [Google Scholar] [CrossRef]
- Turman, E.; Strasser, P.W. CFD modeling of LDPE autoclave reactor to reduce ethylene decomposition: Part 2 identifying and reducing contiguous hot spots. Chem. Eng. Sci. 2022, 257, 117722. [Google Scholar] [CrossRef]
- Brandolin, A.; Sarmoria, C.; Lótpez-Rodríguez, A.; Whiteley, K.S.; Del Amo Fernández, B. Prediction of molecular weight distributions by probability generating functions. Application to industrial autoclave reactors for high pressure polymerization of ethylene and ethylene-vinyl acetate. Polym. Eng. Sci. 2001, 41, 1413–1426. [Google Scholar] [CrossRef]
- Lee, H.-Y.; Yang, T.-H.; Chien, I.-L.; Huang, H.-P. Grade transition using dynamic neural networks for an industrial high-pressure ethylene–vinyl acetate (EVA) copolymerization process. Comput. Chem. Eng. 2009, 33, 1371–1378. [Google Scholar] [CrossRef]
- Sharma, P.; Sahoo, B.B. An ANFIS-RSM based modeling and multi-objective optimization of syngas powered dual-fuel engine. Int. J. Hydrogen Energy 2022, 47, 19298–19318. [Google Scholar] [CrossRef]
- Arena, S.; Florian, E.; Zennaro, I.; Orrù, P.; Sgarbossa, F. A novel decision support system for managing predictive maintenance strategies based on machine learning approaches. Saf. Sci. 2022, 146, 105529. [Google Scholar] [CrossRef]
- Hung, T.N.K.; Le, N.Q.K.; Le, N.H.; Van Tuan, L.; Nguyen, T.P.; Thi, C.; Kang, J.H. An AI-based Prediction Model for Drug-drug Interactions in Osteoporosis and Paget’s Diseases from SMILES. Mol. Inform. 2022, 41, e2100264. [Google Scholar] [CrossRef]
- Le, N.Q.K.; Do, D.T.; Nguyen, T.-T.; Le, Q.A. A sequence-based prediction of Kruppel-like factors proteins using XGBoost and optimized features. Gene 2021, 787, 145643. [Google Scholar] [CrossRef]
- Jin, S.; Sui, X.; Huang, X.; Wang, S.; Teodorescu, R.; Stroe, D.-I. Overview of Machine Learning Methods for Lithium-Ion Battery Remaining Useful Lifetime Prediction. Electronics 2021, 10, 3126. [Google Scholar]
- Lei, Y.; Li, N.; Guo, L.; Li, N.; Yan, T.; Lin, J. Machinery health prognostics: A systematic review from data acquisition to RUL prediction. Mech. Syst. Signal Process. 2018, 104, 799–834. [Google Scholar] [CrossRef]
- Moosavian, A.; Ahmadi, H.; Tabatabaeefar, A.; Khazaee, M. Comparison of Two Classifiers; K-Nearest Neighbor and Artificial Neural Network, for Fault Diagnosis on a Main Engine Journal-Bearing. Shock Vib. 2013, 20, 263–272. [Google Scholar] [CrossRef]
- Welch, P.D. The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms. IEEE Trans. Audio Electroacoust. 1967, 15, 70–73. [Google Scholar] [CrossRef] [Green Version]
- Long, Z.; Jun, Z. Rolling Bearing Fault Diagnosis Based on Ensemble Learning Model with WELCH Algorithm# br. Noise Vib. Control 2022, 42, 144. [Google Scholar]
- Jin, Z.; Han, Q.; Zhang, K.; Zhang, Y. An intelligent fault diagnosis method of rolling bearings based on Welch power spectrum transformation with radial basis function neural network. J. Vib. Control 2020, 26, 629–642. [Google Scholar] [CrossRef]
- Patil, S.; Phalle, V. Fault detection of anti-friction bearing using ensemble machine learning methods. Int. J. Eng. 2018, 31, 1972–1981. [Google Scholar]
- Rathore, M.S.; Harsha, S.P. Prognostic Analysis of High-Speed Cylindrical Roller Bearing Using Weibull Distribution and k-Nearest Neighbor. J. Nondestruct. Eval. Diagn. Progn. Eng. Syst. 2022, 5, 011005. [Google Scholar] [CrossRef]
- Xu, G.; Liu, M.; Jiang, Z.; Söffker, D.; Shen, W. Bearing fault diagnosis method based on deep convolutional neural network and random forest ensemble learning. Sensors 2019, 19, 1088. [Google Scholar] [CrossRef] [Green Version]
- Yang, Y.; Yu, D.; Cheng, J. A fault diagnosis approach for roller bearing based on IMF envelope spectrum and SVM. Measurement 2007, 40, 943–950. [Google Scholar] [CrossRef]
- Guo, L.; Li, N.; Jia, F.; Lei, Y.; Lin, J. A recurrent neural network based health indicator for remaining useful life prediction of bearings. Neurocomputing 2017, 240, 98–109. [Google Scholar] [CrossRef]
- Li, X.; Ding, Q.; Sun, J.-Q. Remaining useful life estimation in prognostics using deep convolution neural networks. Reliab. Eng. Syst. Saf. 2018, 172, 1–11. [Google Scholar] [CrossRef] [Green Version]
- Chen, Z.; Li, W. Multisensor feature fusion for bearing fault diagnosis using sparse autoencoder and deep belief network. IEEE Trans. Instrum. Meas. 2017, 66, 1693–1702. [Google Scholar] [CrossRef]
- Zheng, S.; Ristovski, K.; Farahat, A.; Gupta, C. Long short-term memory network for remaining useful life estimation. In Proceedings of the 2017 IEEE International Conference on Prognostics and Health Management (ICPHM), Dallas, TX, USA, 19–21 June 2017; IEEE: Piscataway, NJ, USA, 2017. [Google Scholar]
- Hu, C.; Youn, B.D.; Wang, P.; Yoon, J.T. Ensemble of data-driven prognostic algorithms for robust prediction of remaining useful life. Reliab. Eng. Syst. Saf. 2012, 103, 120–135. [Google Scholar] [CrossRef] [Green Version]
- Zhang, S.; Li, L.; Zhou, H.; Liu, H. Ensemble Learning Based Decision-Making Models on the Aero-Engine Bearing Fault Diagnosis. In Advances in Guidance, Navigation and Control; Springer: Berlin/Heidelberg, Germany, 2022; pp. 3829–3840. [Google Scholar]
- Zhou, H.; Cheng, L.; Teng, L.; Sun, H. Bearing Fault Diagnosis Based on RF-PCA-LSTM Model. In Proceedings of the 2021 2nd Information Communication Technologies Conference (ICTC), Nanjing, China, 7–9 May 2021; IEEE: Piscataway, NJ, USA, 2021. [Google Scholar]
- Patil, S.; Patil, A.; Phalle, V.M. Life prediction of bearing by using adaboost regressor. In Proceedings of the TRIBOINDIA-2018 An International Conference on Tribology, Mumbai, India, 13–15 December 2018. [Google Scholar]
- Shi, J.; Yu, T.; Goebel, K.; Wu, D. Remaining Useful Life Prediction of Bearings Using Ensemble Learning: The Impact of Diversity in Base Learners and Features. J. Comput. Inf. Sci. Eng. 2021, 21, 021004. [Google Scholar] [CrossRef]
- El-Thalji, I.; Jantunen, E. Dynamic modelling of wear evolution in rolling bearings. Tribol. Int. 2015, 84, 90–99. [Google Scholar] [CrossRef]
- Ramasso, E. Investigating computational geometry for failure prognostics in presence of imprecise health indicator: Results and comparisons on c-mapss datasets. In Proceedings of the PHM Society European Conference, Nantes, France, 8–10 July 2014. [Google Scholar]
- Heimes, F.O. Recurrent neural networks for remaining useful life estimation. In Proceedings of the 2008 International Conference on Prognostics and Health Management, Denver, CO, USA, 6–9 October 2008; IEEE: Piscataway, NJ, USA, 2008. [Google Scholar]
- Hong, C.-C.; Huang, S.-Y.; Shieh, J.; Chen, S.-H. Enhanced Piezoelectricity of Nanoimprinted Sub-20 nm Poly(vinylidene fluoride–trifluoroethylene) Copolymer Nanograss. Macromolecules 2012, 45, 1580–1586. [Google Scholar] [CrossRef]
- Kim, T.; Park, C.; Samuel, E.P.; Kim, Y.I.; An, S.; Yoon, S.S. Wearable sensors and supercapacitors using electroplated-Ni/ZnO antibacterial fabric. J. Mater. Sci. Technol. 2022, 100, 254–264. [Google Scholar] [CrossRef]
- Zhang, X.; Liu, Z.; Wang, J.; Wang, J. Time–frequency analysis for bearing fault diagnosis using multiple Q-factor Gabor wavelets. ISA Trans. 2019, 87, 225–234. [Google Scholar] [CrossRef]
- Wu, B.; Li, W.; Qiu, M. Remaining Useful Life Prediction of Bearing with Vibration Signals Based on a Novel Indicator. Shock Vib. 2017, 2017, 8927937. [Google Scholar] [CrossRef] [Green Version]
- Rahi, P.K.; Mehra, R. Analysis of power spectrum estimation using welch method for various window techniques. Int. J. Emerg. Technol. Eng. 2014, 2, 106–109. [Google Scholar]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef] [Green Version]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
- Drucker, H. Improving regressors using boosting techniques. In Proceedings of the ICML, Nashville, TN, USA, 8–12 July 1997. [Google Scholar]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016. [Google Scholar]
Event | Operation Days | Days with Data |
---|---|---|
1 | 126 | 105 |
2 | 131 | 99 |
3 | 116 | 103 |
4 | 119 | 119 |
Algorithm | Hyper Parameter Grid Search Range | Best RAE Value | Best Linearity Value |
---|---|---|---|
Extremely randomized Trees | n_estimators = [10, 50, 100] | 10 | 100 |
max_depth = [10, 30, None] | 30 | 10 | |
min_sample_leaf = [1, 2, 4] | 4 | 2 | |
min_sample_split = [2, 5, 10] | 2 | 10 | |
Average RAE, Linearity | 0.279, 30.798 | 0.307, 15.064 | |
XGboost | min_child_weight = [1, 5, 10] | 10 | 10 |
Gamma = [0.5, 1.5, 5] | 1.5 | 5 | |
n_estimators = [10, 50, 100] | 50 | 10 | |
colsample_bytree = [0.6, 0.8, 1] | 0.6 | 0.8 | |
max_depth = [3, 4, 5] | 4 | 5 | |
Average RAE, Linearity | 0.272, 24.381 | 0.376, 16.240 | |
Random Forest | n_estimators = [10, 50, 100] | 10 | 10 |
max_depth = [10, 30, None] | 30 | 30 | |
min_samples_leaf = [1, 2, 4] | 2 | 1 | |
min_samples_split = [2, 5, 10] | 5 | 10 | |
Average RAE, Linearity | 0.334, 43.742 | 0.342, 34.016 | |
Adaboost | n_estimators = [10, 50, 100] | 50 | 10 |
learning rate = [0.1, 1, 10] | 0.1 | 10 | |
estimator = [DecisionTree(depth = 1), ExtraTree(depth = 1)] | ExtraTree | DecisionTree | |
Average RAE, Linearity | 0.409, 74.461 | 0.925, 0 | |
Gradient boost | n_estimators = [10, 50, 100] | 100 | 10 |
max_depth = [10, 30, None] | 30 | 10 | |
min_samples_leaf = [1, 2, 4] | 4 | 4 | |
min_samples_split = [2, 5, 10] | 10 | 2 | |
Average RAE, Linearity | 0.364, 29.316 | 0.577, 15.098 | |
SVM | C = [0.01, 0.1, 1, 10] | 0.01 | 0.1 |
gamma = [0.001, 0.01, 0.1, 1] | 0.001 | 0.1 | |
kernel = [rbf, linear] | linear | rbf | |
Average RAE, Linearity | 0.404, 19.179 | 0.975, 0 | |
Linear regression | Average RAE, Linearity | 0.381, 27.557 | 0.381, 27.557 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Pan, S.-J.; Tsai, M.-L.; Chen, C.-L.; Lin, P.T.; Lee, H.-Y. Investigation of Machine Learning Methods for Predictive Maintenance of the Ultra-High-Pressure Reactor in a Polyethylene-Vinyl Acetate Production Process. Electronics 2023, 12, 580. https://doi.org/10.3390/electronics12030580
Pan S-J, Tsai M-L, Chen C-L, Lin PT, Lee H-Y. Investigation of Machine Learning Methods for Predictive Maintenance of the Ultra-High-Pressure Reactor in a Polyethylene-Vinyl Acetate Production Process. Electronics. 2023; 12(3):580. https://doi.org/10.3390/electronics12030580
Chicago/Turabian StylePan, Shih-Jie, Meng-Lin Tsai, Cheng-Liang Chen, Po Ting Lin, and Hao-Yeh Lee. 2023. "Investigation of Machine Learning Methods for Predictive Maintenance of the Ultra-High-Pressure Reactor in a Polyethylene-Vinyl Acetate Production Process" Electronics 12, no. 3: 580. https://doi.org/10.3390/electronics12030580
APA StylePan, S.-J., Tsai, M.-L., Chen, C.-L., Lin, P. T., & Lee, H.-Y. (2023). Investigation of Machine Learning Methods for Predictive Maintenance of the Ultra-High-Pressure Reactor in a Polyethylene-Vinyl Acetate Production Process. Electronics, 12(3), 580. https://doi.org/10.3390/electronics12030580