Data Cleaning Model of Mine Wind Speed Sensor Based on LOF-GMM and SGAIN
Abstract
1. Introduction
2. LOF-GMM-SGAIN Model
2.1. Real-Time Anomaly Data Detection
2.2. Anomaly Data Imputation
3. Mine Intelligent Ventilation Wind Speed Sensor Cleaning Model and Evaluation Metrics
3.1. Mine Intelligent Ventilation Wind Speed Sensor Cleaning Model
3.2. Evaluation Metrics
4. Experiment on Outlier Detection and Imputation for Mining Ventilation Sensor Data
4.1. Anomaly and Fault Identification
4.1.1. Abnormal Data Identification
4.1.2. Fault Data Identification
4.2. Missing Value Imputation
4.2.1. Data Preparation
4.2.2. SGAIN Model Parameter Configuration
4.2.3. Experimental Results
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Glossary
| LOF | local outlier factor | 
| GMM | Gaussian mixture model | 
| LOF-GMM | local outlier factor and Gaussian mixture model | 
| SGAIN | self-generating adversarial network | 
| GAIN | generative adversarial imputation nets | 
| RF | random forest | 
| DAE | denoising autoencoder | 
| WT | wind turbine | 
| SCMA | submarine cable magnetic anomaly | 
| DC | direct current | 
| MCMC | Markov chain Monte Carlo | 
| CFSFDP | clustering by fast search and find of density peaks | 
| LSTM | long short-term memory network | 
| SDAE | stacked denoising autoencoder | 
| RMSE | root mean squared error | 
| MAE | mean absolute error | 
| KF | Kalman filter | 
References
- Zhou, F.; Xin, H.; Wei, L.; Shi, Q.; Xia, T. Research progress of mine intelligent ventilation theory and technology. Coal Sci. Technol. 2023, 51, 313–328. [Google Scholar]
- Liu, J. Overview on key scientific and technical issues of mine intelligent ventilation. Saf. Coal Mines 2020, 51, 108–111+117. [Google Scholar]
- Zhou, F.; Wei, L.; Xia, T.; Wang, K.; Wu, X.; Wang, Y. Principle, key technology and preliminary realization of mine intelligent ventilation. J. China Coal Soc. 2020, 45, 2225–2235. [Google Scholar]
- Wei, L.; Zhou, F.; Xia, T.; Wang, K.; Jiang, H.; Wang, J. Mine intelligent ventilation and disaster emergency decision platform. China Saf. Sci. J. 2022, 32, 158–167. [Google Scholar]
- Zhang, L.; Liu, Y. Research on technology of key steps of intelligent ventilation in mines. Coal Sci. Technol. 2024, 52, 178–195. [Google Scholar]
- Li, D. Application of wireless sensor network in mine underground environment monitoring. World Nonferrous Met. 2022, 7, 16–18. [Google Scholar]
- Liu, J.; Jiang, Q.; Liu, L.; Wang, D.; Huang, D.; Deng, L.; Zhou, Q. Resistance variant fault diagnosis of mine ventilation system and position optimization of wind speed sensor. J. China Coal Soc. 2021, 46, 1907–1914. [Google Scholar]
- Sun, P.; Li, J.; Wang, C.; Lei, X. A generalized model for wind turbine anomaly identification based on SCADA data. Appl. Energy 2016, 168, 550–567. [Google Scholar] [CrossRef]
- Liu, Y.; Wu, Y.; Yang, L.; Zhou, P.; Kuang, J.; Yu, W.; Wang, J.; Xu, Z.; Li, G. A Multi-Task Learning for Submarine Cable Magnetic Anomaly Recognition. J. Mar. Sci. Eng. 2023, 11, 900. [Google Scholar] [CrossRef]
- Chen, J.; Zhang, H.; Tang, Y. An abnormal data identification method based on improved generative adversarial network. China Electr. Power Constr. 2021, 42, 9–15. [Google Scholar]
- Kou, Z.; Lin, S.; Wang, A.; He, Y.; Chen, L. Identification of Abnormal Data for Synchronous Monitoring of Transformer DC Bias Based on Multiple Criteria. Sensors 2023, 23, 4959. [Google Scholar] [CrossRef]
- Goldstein, M.; Uchida, S. A comparative evaluation of unsupervised anomaly detection algorithms for multivariate data. PLoS ONE 2016, 11, e0152173. [Google Scholar] [CrossRef] [PubMed]
- Gong, S.; Pan, T.; Wu, D.; Ji, Z. Research on missing data imputation of Mcro-Grid PV system based on MCMC. Renew. Energy 2018, 36, 346–350. [Google Scholar]
- Ni, J.; Liu, X.; Deng, L. Method for filling missing data of mine ventilation parameters. J. China Coal Soc. 2024, 49, 2315–2323. [Google Scholar]
- Pan, J.; Li, C.; Tang, Y.; Li, W.; Li, X. Energy consumption prediction of a CNC machining process with incomplete data. IEEE/CAA J. Autom. Sin. 2021, 8, 987–1000. [Google Scholar] [CrossRef]
- Liu, J.; Qiu, M.; Li, J.; Li, Z.; Gao, Z. SGAIN fusion TCN for prediction residual life of rolling bearing with missing data. J. Ordnance Equip. Eng. 2024, 45, 240–247. [Google Scholar]
- Meng, L.; Zhang, R.; Li, X.; Xi, Z. Cleaning abnormal status data of substation equipment based on machine learning. J. Electr. Power Syst. Autom. 2021, 33, 79–86. [Google Scholar]
- Mei, Y.; Li, Y.; Zhou, W.; Guo, Y.; Deng, W.; Qiao, X. Dynamic data cleaning method of abnormal and missing data in a distribution networkbased on machine learning. Power Syst. Prot. Control. 2023, 51, 158–169. [Google Scholar]
- Zhu, D.; Zhang, S.; Ma, R.; Kang, W.; Sha, J. Cleaning method for abnormal energy big data based on sparse self-coding. Sci. Rep. 2024, 14, 24016. [Google Scholar] [CrossRef] [PubMed]
- Qu, S. Real-time date processing method of wind speed sensor in roadway. Saf. Coal Mines 2017, 48, 163–166. [Google Scholar]
- Zhao, D.; Sheng, Z.; Song, Z.; Xie, L.; Liu, B. Mine airflow speed sensor data cleaning model for intelligent ventilation. China Saf. Sci. J. 2023, 33, 56–62. [Google Scholar]
- Breunig, M.M.; Kriegel, H.P.; Ng, R.T.; Sander, J. LOF, identifying density-based local outliers. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, TX, USA, 15–18 May 2000; Volume 2, pp. 93–104. [Google Scholar]
- Neves, D.T.; Naik, M.G.; Proença, A. SGAIN, WSGAIN-CP and WSGAIN-GP, Novel GAN Methods for Missing Data Imputation. In Proceedings of the 21st International Conference on Computational Science (ICCS ’21), Krakow, Poland, 16–18 June 2021. [Google Scholar]
- Vincent, P.; Larochelle, H.; Bengio, Y.; Manzagol, P.A. Extracting and composing robust features with denoising autoencoders. In Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland, 5–9 July 2008; pp. 1096–1103. [Google Scholar]
- Bellavista-Parent, V.; Torres-Sospedra, J.; Pérez-Navarro, A. Comprehensive analysis of applied machine learning in indoor positioning based on Wi-Fi: An extended systematic review. Sensors 2022, 22, 4622. [Google Scholar] [CrossRef] [PubMed]
- Available online: https://scikit-learn.org/stable/about.html#citing-scikit-learn (accessed on 31 January 2025).












| Local Outlier Factor Threshold | Sample Size | Number of Accurately Identified Outliers | Number of False Positives | Identification Accuracy/% | Misclassification Rate/% | 
|---|---|---|---|---|---|
| 2 | 300 | 9 | 12 | 100 | 4 | 
| 3 | 300 | 9 | 2 | 100 | 0.67 | 
| 4 | 300 | 9 | 1 | 100 | 0.33 | 
| 5 | 300 | 9 | 0 | 100 | 0 | 
| 6 | 300 | 9 | 0 | 100 | 0 | 
| 7 | 300 | 9 | 0 | 100 | 0 | 
| … | … | … | … | … | … | 
| 14 | 300 | 9 | 0 | 100 | 0 | 
| 15 | 300 | 8 | 0 | 88.9 | 0 | 
| … | … | … | … | … | … | 
| 29 | 300 | 8 | 0 | 88.9 | 0 | 
| 30 | 300 | 7 | 0 | 77.8 | 0 | 
| MODEL | RMSE | MAE | TIME/ms | 
|---|---|---|---|
| SGAIN | 0.0579 | 0.0478 | 2554 | 
| GAIN | 0.0592 | 0.0527 | 3621 | 
| RF | 0.0908 | 0.0820 | 1892 | 
| DAE | 0.4731 | 0.4256 | 2159 | 
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ni, J.; Yang, S.; Liu, Y. Data Cleaning Model of Mine Wind Speed Sensor Based on LOF-GMM and SGAIN. Appl. Sci. 2025, 15, 1801. https://doi.org/10.3390/app15041801
Ni J, Yang S, Liu Y. Data Cleaning Model of Mine Wind Speed Sensor Based on LOF-GMM and SGAIN. Applied Sciences. 2025; 15(4):1801. https://doi.org/10.3390/app15041801
Chicago/Turabian StyleNi, Jingfeng, Shengya Yang, and Yujiao Liu. 2025. "Data Cleaning Model of Mine Wind Speed Sensor Based on LOF-GMM and SGAIN" Applied Sciences 15, no. 4: 1801. https://doi.org/10.3390/app15041801
APA StyleNi, J., Yang, S., & Liu, Y. (2025). Data Cleaning Model of Mine Wind Speed Sensor Based on LOF-GMM and SGAIN. Applied Sciences, 15(4), 1801. https://doi.org/10.3390/app15041801
 
        

 
       