Research on CC-SSBLS Model-Based Air Quality Index Prediction
Abstract
:1. Introduction
- (1)
- Four regional air quality datasets were established and four regional meteorological datasets were evaluated from Xuzhou, Nanjing, Beijing, and Changchun City (China);
- (2)
- The semi-supervised criteria and the correlation entropy criterion serve as the foundation for the CC-SSBLS prediction model, which is suggested with the goal of addressing the partially missing data and outlier problems in the air quality dataset. When the CC-SSBLS method is used on the air quality dataset spanning four locations, the experimental findings demonstrate that it is effective in resolving the dataset’s outlier and data-missing issues as well as improving the stability and prediction accuracy of the algorithm;
- (3)
- Air quality datasets from four different regions were subjected to CC-SSBLS application along with RF, V-SVR, BLS, SSBLS, and CART-BLS models. Stability and accuracy comparison experiments were also carried out, and the outcomes demonstrated that the CC-SSBLS algorithm performed better in terms of both stability and prediction accuracy.
2. Algorithm
2.1. BLS
2.2. SSBLS
2.3. Regularized Correntropy Criterion (CC)
2.4. CC-SSBLS
Algorithm 1: CC-SSBLS pseudo codes. |
1 Given |
2 A dataset with label , an unlabeled dataset , where ; |
3 The number of nodes per window of mapped features N1; |
4 The number of enhancement nodes N3, the activation function ; |
5 The regularization parameters and ; |
6 Bandwidth of the Gaussian kernel function ; |
7 Number of iterative windows N2;
|
Else end. |
3. Data and Experiments
3.1. Data
3.2. Experimental Analysis
3.3. Results and Analysis
4. Conclusions
- (1)
- Through validation on the four datasets gathered for this paper, it is demonstrated that the correlation entropy criteria performs better when handling dataset outliers. In terms of predictive efficacy, the CC-SSBLS model performs better than the CART-SSBLS model and a number of other comparison models that are discussed above;
- (2)
- The efficacy of the model is validated by using the four datasets gathered in this study for validation. These datasets demonstrate that the stability of the prediction results of CC-SSBLS is superior to that of BLS;
- (3)
- For the model in this paper, only the prediction effect for the data is analyzed in detail, but the time analysis of model training is ignored. At a later stage, the training time of the model can be further studied to reduce the time consumed by the training model and improve the efficiency of the model.
- (4)
- As for the prediction effect for the AQI, the broad learning system should be further studied in the future to further strengthen the prediction effect. For the broad learning system, subsequent studies can be conducted on many aspects, such as the feature layer, the enhancement layer and the pseudo-inverse solving principle, so as to further improve the prediction effect of the model, reduce the training time and improve the prediction efficiency of the model.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Habibi, R.; Alesheikh, A.A.; Mohammadinia, A.; Sharif, M. An Assessment of Spatial Pattern Characterization of Air Pollution: A Case Study of CO and PM2.5 in Tehran, Iran. ISPRS Int. J. Geo-Inf. 2017, 6, 270. [Google Scholar] [CrossRef]
- Deters, J.K.; Zalakeviciute, R.; Gonzalez, M. Modeling PM2.5 Urban Pollution Using Machine Learning and Selected Meteorological Parameters. J. Electr. Comput. Eng. 2017, 2017, 5106045. [Google Scholar]
- Huang, K.; Xiao, Q.; Meng, X. Predicting monthly high-resolution PM2.5 concentrations with random forest model in the North China Plain. Environ. Pollut. 2018, 242, 675–683. [Google Scholar] [CrossRef]
- Chen, G.; Li, S.; Knibbs, L.D.; Hamm, N.A.; Cao, W.; Li, T.; Guo, J.; Ren, H.; Abramson, M.J.; Guo, Y. A machine learning method to estimate PM2.5 concentrations across China with remote sensing, meteorological and land use information. Sci. Total Environ. 2018, 636, 52–60. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.J.; Jiang, S.; Huang, L.; Lu, G.; Kasemsan, M.; Yaluk, E.A. Differences between VOCs and NOx transport contributions, their impacts on O3, and implications for O3 pollution mitigation based on CMAQ simulation over the Yangtze River Delta, China. Sci. Total Environ. 2023, 872, 2118–2120. [Google Scholar] [CrossRef] [PubMed]
- Di, G.L.; Lapucci, M.; Schoen, F. Improved Maximum Likelihood Estimation of ARMA Models. Lobachevskii J. Math. 2022, 43, 2433–2443. [Google Scholar]
- Tarmanini, C.; Sarma, N.; Gezegin, C. Short term load forecasting based on ARIMA and ANN approaches. Energy Rep. 2023, 9, 550–557. [Google Scholar] [CrossRef]
- Kożuch, A.; Cywicka, D.; Adamowicz, K. A Comparison of Artificial Neural Network and Time Series Models for Timber Price Forecasting. Forests 2023, 14, 177. [Google Scholar] [CrossRef]
- Peng, S.J.; Shen, J.; Chao, Z.X. PM2.5 prediction based on ARIMA model. Saf. Environ. Eng. 2014, 21, 125–128. [Google Scholar]
- Zhang, L.Y.; Qiu, R.Z.; Hu, X.S. Trend analysis and forecast of PM2.5 in Fuzhou, China using the ARIMA model. Ecol. Indic. 2018, 95, 702–710. [Google Scholar] [CrossRef]
- Hu, J.L.; Chen, J.J.; Ying, Q. One-year simulation of ozone and particulate matter in China using WRF/CMAQ modeling system. Atmos. Chem. Phys. 2016, 16, 10333–10350. [Google Scholar] [CrossRef]
- Liang, Y.C.; Maimury, Y. Machine Learning-Based Prediction of Air Quality. Appl. Sci. 2020, 10, 9151. [Google Scholar] [CrossRef]
- Ghaemi, Z.; Alimohammadi, A.; Farnaghi, M. LaSVM-based big data learning system for dynamic prediction of air pollution in Tehran. Environ. Monit. Assess. Int. J. 2018, 190, 300. [Google Scholar] [CrossRef] [PubMed]
- Liu, H.; Li, Q.; Yu, D.; Gu, Y. Air Quality Index and Air Pollutant Concentration Prediction Based on Machine Learning Algorithms. Appl. Sci. 2019, 9, 4069. [Google Scholar] [CrossRef]
- Zhang, J.; Ding, W. Prediction of air pollutants concentration based on an extreme learning machine: The case of Hong Kong. Int. J. Environ. Res. Public Health 2017, 14, 114. [Google Scholar] [CrossRef] [PubMed]
- Jiang, F.; He, J.Q.; Tian, T.H. A clustering-based ensemble approach with improved pigeon-inspired optimization and extreme learning machine for air quality prediction. Appl. Soft Comput. 2019, 85, 105827. [Google Scholar] [CrossRef]
- Qiao, D.W.; Yao, J.; Zhang, J.W. Short-term air quality forecasting model based on hybrid RF-IACA-BPNN algorithm. Environ. Sci. Pollut. Res. 2022, 29, 39164–39181. [Google Scholar] [CrossRef]
- Feng, R.; Zheng, H.J.; Gao, H. Recurrent Neural Network and random forest for analysis and accurate forecast of atmospheric pollutants: A case study in Hangzhou, China. J. Clean. Prod. 2019, 231, 1005–1015. [Google Scholar] [CrossRef]
- Belavadi, S.V.; Rajagopal, S. Air Quality Forecasting using LSTM RNN and Wireless Sensor Networks. Procedia Comput. Sci. 2020, 170, 241–248. [Google Scholar] [CrossRef]
- Zhang, Z.D.; Zeng, Y.K.; Yan, K. A hybrid deep learning technology for PM2.5 air quality forecasting. Environ. Sci. Pollut. Res. 2021, 28, 39409–39422. [Google Scholar] [CrossRef]
- Qin, D.; Yu, J.; Zou, G. A novel combined prediction scheme based on CNN and LSTM for urban PM2.5 concentration. IEEE Access 2019, 7, 20050–20059. [Google Scholar] [CrossRef]
- Wu, Z.Y.; Wang, Y.; Zhang, L. MSSTN: Multi-scale spatial temporal network for air pollution prediction. IEEE Int. Conf. Big Data 2019, 12, 1547–1556. [Google Scholar]
- Han, J.D.; Liu, H.; Zhu, H.S. Joint air quality and weather prediction based on multi-adversarial spatiotemporal networks. arXiv 2021, arXiv:2012.15037. [Google Scholar] [CrossRef]
- Chen, C.; Liu, Z. Broad Learning System: An Effective and Efficient Incremental Learning System Without the Need for Deep Architecture. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 10–24. [Google Scholar] [CrossRef] [PubMed]
- Liu, Z.; Zhou, J.; Chen, C.L.P. Broad learning system: Feature extraction based on K-means clustering algorithm. Int. Conf. Inf. 2017, 4, 683–687. [Google Scholar]
- Wang, L.; Wang, Y.B.; Chen, J. A PM2.5 Concentration Prediction Model Based on CART–BLS. Atmosphere 2022, 13, 1674. [Google Scholar] [CrossRef]
- Pourmir, M.R.; Monsefi, R.; Hodtani, G.A. Signal Detection in MIMO Communications System with Non-Gaussian Noises based on Deep Learning and Maximum Correntropy Criterion. Int. J. Wirel. Mob. Netw. 2022, 14, 1–16. [Google Scholar] [CrossRef]
- Yang, J.; Cao, J.W.; Wang, T.L. Regularized correntropy criterion based semi-supervised ELM. Neural Netw. 2020, 122, 117–129. [Google Scholar] [CrossRef]
- Huang, G.; Song, S.; Wu, C. Semi-supervised and unsupervised extreme learning machines. IEEE Trans. Cubernetics 2017, 44, 2405–2417. [Google Scholar] [CrossRef]
Data | Time | T (°C) | H (°C) | WS (°) | WD (m/s) | CO (mg/m3) | NO2 (μg/m3) | SO2 (μg/m3) | O3 (μg/m3) | PM10 (μg/m3) | PM2.5 (μg/m3) | AQI |
---|---|---|---|---|---|---|---|---|---|---|---|---|
1/1/2018 | 11 | 8 | 0 | 100 | 6 | 2.4 | 95 | 21 | 33 | 373 | 288 | 338 |
1/1/2018 | 12 | 8 | −0.7 | 110 | 6 | 2.19 | 91 | 22 | 53 | 363 | 275 | 325 |
1/1/2018 | 13 | 9 | −1 | 110 | 7 | 1.79 | 77 | 20 | 71 | 331 | 248 | 298 |
1/1/2018 | 14 | 9 | −1 | 110 | 6 | 1.48 | 64 | 18 | 85 | 310 | 217 | 267 |
1/1/2018 | 15 | 7.2 | −1.7 | 110 | 4 | 1.4 | 59 | 17 | 92 | 290 | 184 | 234 |
Dataset | Model | N1 | N2 | N3 | C | λ | σ |
---|---|---|---|---|---|---|---|
XuZhou Dataset | BLS | 15 | 18 | 300 | — | — | — |
SSBLS | 13 | 31 | 300 | 101 | 10−1 | — | |
CC-SSBLS | 15 | 18 | 300 | 101 | 10−3 | 1 | |
NanJing Dataset | BLS | 10 | 17 | 300 | — | — | — |
SSBLS | 15 | 17 | 300 | 101 | 10−1 | — | |
CC-SSBLS | 5 | 18 | 300 | 101 | 10−3 | 1 | |
BeiJing Dataset | BLS | 9 | 12 | 300 | — | — | — |
SSBLS | 7 | 17 | 300 | 101 | 10−1 | — | |
CC-SSBLS | 8 | 20 | 300 | 101 | 10−3 | 1 | |
ChangChun Dataset | BLS | 5 | 17 | 300 | — | — | — |
SSBLS | 16 | 17 | 300 | 101 | 10−1 | — | |
CC-SSBLS | 12 | 35 | 300 | 101 | 10−3 | 1 |
Dataset | Model | RMSE (μg·m−3) | MAE (μg·m−3) | MAPE (%) | R2 |
---|---|---|---|---|---|
XuZhou Dataset | RF | 21.8 | 17.44 | 1.39 | 0.74 |
V-SVR | 21.03 | 16.04 | 1.34 | 0.76 | |
BLS | 20.61 | 14.04 | 0.40 | 0.77 | |
SSBLS | 15.25 | 9.19 | 0.32 | 0.90 | |
ANN | 20.03 | 14.02 | 0.31 | 0.81 | |
RNN | 20.17 | 14.06 | 0.32 | 0.81 | |
CC-SSBLS | 11.94 | 6.89 | 0.24 | 0.92 | |
NanJing Dataset | RF | 21.51 | 12.77 | 1.34 | 0.66 |
V-SVR | 20.26 | 12.22 | 1.33 | 0.70 | |
BLS | 15.76 | 9.91 | 0.52 | 0.85 | |
SSBLS | 14.53 | 9.32 | 0.48 | 0.89 | |
ANN | 15.86 | 10.78 | 0.60 | 0.79 | |
RNN | 15.80 | 10.67 | 0.57 | 0.80 | |
CC-SSBLS | 8.68 | 5.26 | 0.24 | 0.90 | |
BeiJing Dataset | RF | 17.89 | 12.67 | 1.35 | 0.65 |
V-SVR | 16.92 | 11.03 | 1.33 | 0.68 | |
BLS | 12.67 | 8.48 | 0.40 | 0.80 | |
SSBLS | 10.43 | 7.29 | 0.32 | 0.86 | |
ANN | 15.11 | 11.56 | 0.56 | 0.78 | |
RNN | 14.88 | 11.04 | 0.57 | 0.76 | |
CC-SSBLS | 8.75 | 5.66 | 0.25 | 0.94 | |
ChangChun Dataset | RF | 21.86 | 12.4 | 1.44 | 0.65 |
V-SVR | 20.57 | 12.17 | 1.43 | 0.69 | |
BLS | 15.31 | 9.12 | 0.72 | 0.80 | |
SSBLS | 13.63 | 8.41 | 0.57 | 0.86 | |
ANN | 15.41 | 9.46 | 0.76 | 0.79 | |
RNN | 15.82 | 9.57 | 0.79 | 0.79 | |
CC-SSBLS | 11.73 | 6.81 | 0.34 | 0.91 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, L.; Wang, Y.; Chen, J.; Zhang, S.; Zhang, L. Research on CC-SSBLS Model-Based Air Quality Index Prediction. Atmosphere 2024, 15, 613. https://doi.org/10.3390/atmos15050613
Wang L, Wang Y, Chen J, Zhang S, Zhang L. Research on CC-SSBLS Model-Based Air Quality Index Prediction. Atmosphere. 2024; 15(5):613. https://doi.org/10.3390/atmos15050613
Chicago/Turabian StyleWang, Lin, Yibing Wang, Jian Chen, Shuangqing Zhang, and Lanhong Zhang. 2024. "Research on CC-SSBLS Model-Based Air Quality Index Prediction" Atmosphere 15, no. 5: 613. https://doi.org/10.3390/atmos15050613
APA StyleWang, L., Wang, Y., Chen, J., Zhang, S., & Zhang, L. (2024). Research on CC-SSBLS Model-Based Air Quality Index Prediction. Atmosphere, 15(5), 613. https://doi.org/10.3390/atmos15050613