Machine Learning-Based Approach Using Open Data to Estimate PM2.5 over Europe
Abstract
:1. Introduction
2. Primary Data
2.1. PM2.5 Measurements
2.2. AOD Data
2.3. Meteorological Data
2.4. Digital Elevation Model
2.5. Normalised Difference Vegetation Index
2.6. Land Cover
2.7. Population Data
3. Methodology
3.1. Data Pre-Processing
3.2. Model Development
3.3. Model Validation
3.3.1. Sample-Based Cross Validation
3.3.2. Spatial and Temporal 10-Fold Cross Validation
4. Results
5. Creating PM2.5 Maps
6. Discussion
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Li, L.; Losser, T.; Yorke, C.; Piltner, R. Fast Inverse Distance Weighting-Based Spatiotemporal Interpolation: A Web-Based Application of Interpolating Daily Fine Particulate Matter PM2.5 in the Contiguous U.S. Using Parallel Programming and k-d Tree. Int. J. Environ. Res. Public Health 2014, 11, 9101–9141. [Google Scholar] [CrossRef] [PubMed]
- Crippa, M.; Janssens-Maenhout, G.; Guizzardi, D.; Van Dingenen, R.; Dentener, F. Contribution and uncertainty of sectorial and regional emissions to regional and global PM2.5 health impacts. Atmos. Chem. Phys. 2019, 19, 5165–5186. [Google Scholar] [CrossRef] [Green Version]
- Pascal, M.; Falq, G.; Wagner, V.; Chatignoux, E.; Corso, M.; Blanchard, M.; Host, S.; Pascal, L.; Larrieu, S. Short-term impacts of particulate matter (PM10, PM10–2.5, PM2.5) on mortality in nine French cities. Atmos. Environ. 2014, 95, 175–184. [Google Scholar] [CrossRef]
- Liu, C.; Chen, R.; Sera, F.; Vicedo-Cabrera, A.M.; Guo, Y.; Tong, S.; Coelho, M.S.Z.S.; Saldiva, P.H.N.; Lavigne, E.; Matus, P.; et al. Ambient Particulate Air Pollution and Daily Mortality in 652 Cities. N. Engl. J. Med. 2019, 381, 705–715. [Google Scholar] [CrossRef]
- Martins, N.R.; da Graça, G.C. Impact of PM2.5 in indoor urban environments: A review. Sustain. Cities Soc. 2018, 42, 259–275. [Google Scholar] [CrossRef]
- Baklanov, A.; Molina, L.T.; Gauss, M. Megacities, air quality and climate. Atmos. Environ. 2016, 126, 235–249. [Google Scholar] [CrossRef]
- Mao, X.; Wang, L.; Pan, X.; Zhang, M.; Wu, X.; Zhang, W. A study on the dynamic spatial spillover effect of urban form on PM2.5 concentration at county scale in China. Atmos. Res. 2022, 269, 106046. [Google Scholar] [CrossRef]
- Environmental Protection Agency 40 CFR Part 50 Review of the National Ambient Air Quality Standards for Particulate Matter. Available online: https://cfpub.epa.gov/ncea/ (accessed on 19 December 2021).
- Lee, H.J. Advancing Exposure Assessment of PM2.5 Using Satellite Remote Sensing: A Review. Asian J. Atmos. Environ. 2020, 14, 319–334. [Google Scholar] [CrossRef]
- Deng, L. Estimation of PM2.5 spatial distribution based on kriging interpolation. In Proceedings of the First International Conference on Information Sciences, Machinery, Materials and Energy, Chongqing, China, 11–13 April 2015; Volume 126. [Google Scholar] [CrossRef] [Green Version]
- Vienneau, D.; De Hoogh, K.; Beelen, R.; Fischer, P.; Hoek, G.; Briggs, D. Comparison of land-use regression models between Great Britain and the Netherlands. Atmos. Environ. 2010, 44, 688–696. [Google Scholar] [CrossRef]
- Briggs, D.J. The use of GIS to evaluate traffic-related pollution. Occup. Environ. Med. 2006, 64, 1–2. [Google Scholar] [CrossRef] [Green Version]
- You, W.; Zang, Z.; Pan, X.; Zhang, L.; Chen, D. Estimating PM2.5 in Xi’an, China using aerosol optical depth: A comparison between the MODIS and MISR retrieval models. Sci. Total Environ. 2015, 505, 1156–1165. [Google Scholar] [CrossRef] [PubMed]
- Yao, F.; Si, M.; Li, W.; Wu, J. A multidimensional comparison between MODIS and VIIRS AOD in estimating ground-level PM2.5 concentrations over a heavily polluted region in China. Sci. Total Environ. 2017, 618, 819–828. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Y.; Li, Z. Remote sensing of atmospheric fine particulate matter (PM2.5) mass concentration near the ground from satellite observation. Remote Sens. Environ. 2015, 160, 252–262. [Google Scholar] [CrossRef]
- Kanabkaew, T. Prediction of Hourly Particulate Matter Concentrations in Chiangmai, Thailand Using MODIS Aerosol Optical Depth and Ground-Based Meteorological Data. EnvironmentAsia 2013, 6, 65–70. [Google Scholar] [CrossRef]
- Gupta, P.; Christopher, S.A. Particulate matter air quality assessment using integrated surface, satellite, and meteorological products: Multiple regression approach. J. Geophys. Res. Earth Surf. 2009, 114. [Google Scholar] [CrossRef] [Green Version]
- Ma, Z.; Dey, S.; Christopher, S.; Liu, R.; Bi, J.; Balyan, P.; Liu, Y. A review of statistical methods used for developing large-scale and long-term PM2.5 models from satellite data. Remote Sens. Environ. 2021, 269, 112827. [Google Scholar] [CrossRef]
- Liu, Y.; Paciorek, C.J.; Koutrakis, P. Estimating Regional Spatial and Temporal Variability of PM2.5 Concentrations Using Satellite Data, Meteorology, and Land Use Information. Environ. Health Perspect. 2009, 117, 886–892. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Schneider, R.; Vicedo-Cabrera, A.M.; Sera, F.; Masselot, P.; Stafoggia, M.; de Hoogh, K.; Kloog, I.; Reis, S.; Vieno, M.; Gasparrini, A. A Satellite-Based Spatio-Temporal Machine Learning Model to Reconstruct Daily PM2.5 Concentrations across Great Britain. Remote Sens. 2020, 12, 3803. [Google Scholar] [CrossRef]
- Wei, J.; Li, Z.; Cribb, M.; Huang, W.; Xue, W.; Sun, L.; Guo, J.; Peng, Y.; Li, J.; Lyapustin, A.; et al. Improved 1 km resolution PM2.5 estimates across China using enhanced space–time extremely randomized trees. Atmos. Chem. Phys. 2020, 20, 3273–3289. [Google Scholar] [CrossRef] [Green Version]
- Chen, G.; Li, S.; Knibbs, L.D.; Hamm, N.A.S.; Cao, W.; Li, T.; Guo, J.; Ren, H.; Abramson, M.J.; Guo, Y. A machine learning method to estimate PM2.5 concentrations across China with remote sensing, meteorological and land use information. Sci. Total Environ. 2018, 636, 52–60. [Google Scholar] [CrossRef]
- Xiao, F.; Yang, M.; Fan, H.; Fan, G.; Al-Qaness, M.A.A. An improved deep learning model for predicting daily PM2.5 concentration. Sci. Rep. 2020, 10, 20988. [Google Scholar] [CrossRef] [PubMed]
- Li, L.; Girguis, M.; Lurmann, F.; Pavlovic, N.; McClure, C.; Franklin, M.; Wu, J.; Oman, L.D.; Breton, C.; Gilliland, F.; et al. Ensemble-based deep learning for estimating PM2.5 over California with multisource big data including wildfire smoke. Environ. Int. 2020, 145, 106143. [Google Scholar] [CrossRef] [PubMed]
- Van Donkelaar, A.; Martin, R.V.; Brauer, M.; Kahn, R.; Levy, R.; Verduzco, C.; Villeneuve, P.J. Global Estimates of Ambient Fine Particulate Matter Concentrations from Satellite-Based Aerosol Optical Depth: Development and Application. Environ. Health Perspect. 2010, 118, 847–855. [Google Scholar] [CrossRef] [Green Version]
- Koelemeijer, R.; Homan, C.; Matthijsen, J. Comparison of spatial and temporal variations of aerosol optical thickness and particulate matter over Europe. Atmos. Environ. 2006, 40, 5304–5315. [Google Scholar] [CrossRef]
- Bourgeois, Q.; Ekman, A.M.L.; Renard, J.-B.; Krejci, R.; Devasthale, A.; Bender, F.A.-M.; Riipinen, I.; Berthet, G.; Tackett, J.L. How much of the global aerosol optical depth is found in the boundary layer and free troposphere? Atmos. Chem. Phys. 2018, 18, 7709–7720. [Google Scholar] [CrossRef] [Green Version]
- Liu, B.; Ma, X.; Ma, Y.; Li, H.; Jin, S.; Fan, R.; Gong, W. The relationship between atmospheric boundary layer and temperature inversion layer and their aerosol capture capabilities. Atmos. Res. 2022, 271, 106121. [Google Scholar] [CrossRef]
- Li, X.; Feng, Y.J.; Liang, H.Y. The Impact of Meteorological Factors on PM2.5 Variations in Hong Kong. IOP Conf. Series Earth Environ. Sci. 2017, 78, 012003. [Google Scholar] [CrossRef]
- Wang, J.; Ogawa, S. Effects of Meteorological Conditions on PM2.5 Concentrations in Nagasaki, Japan. Int. J. Environ. Res. Public Health 2015, 12, 9089–9101. [Google Scholar] [CrossRef]
- Wang, S.; Gao, J.; Guo, L.; Nie, X.; Xiao, X. Meteorological Influences on Spatiotemporal Variation of PM2.5 Concentrations in Atmospheric Pollution Transmission Channel Cities of the Beijing–Tianjin–Hebei Region, China. Int. J. Environ. Res. Public Health 2022, 19, 1607. [Google Scholar] [CrossRef]
- Open Data Science Europe. Geo-Harmonizer Project Implementation Plan 2020–2022; Open Data Science Europe: Wageningen, The Netherlands, 2020. [Google Scholar]
- OpenAQ. Available online: https://openaq.org/ (accessed on 8 May 2022).
- Tukey, J.W. Exploratory Data Analysis; Addison-Wesley Publishing Company: Boston, MA, USA, 1977. [Google Scholar]
- Ibrahim, S.; Landa, M.; Pešek, O.; Pavelka, K.; Halounova, L. Space-Time Machine Learning Models to Analyze COVID-19 Pandemic Lockdown Effects on Aerosol Optical Depth over Europe. Remote Sens. 2021, 13, 3027. [Google Scholar] [CrossRef]
- Lyapustin, A.; Wang, Y.; Laszlo, I.; Kahn, R.; Korkin, S.; Remer, L.; Levy, R.; Reid, J.S. Multiangle implementation of atmospheric correction (MAIAC): Part 2. Aerosol algorithm. J. Geophys. Res. 2011, 116. [Google Scholar] [CrossRef]
- Inness, A.; Ades, M.; Agustí-Panareda, A.; Barré, J.; Benedictow, A.; Blechschmidt, A.-M.; Dominguez, J.J.; Engelen, R.; Eskes, H.; Flemming, J.; et al. The CAMS reanalysis of atmospheric composition. Atmos. Chem. Phys. 2019, 19, 3515–3556. [Google Scholar] [CrossRef] [Green Version]
- Muñoz-Sabater, J.; Dutra, E.; Agustí-Panareda, A.; Albergel, C.; Arduini, G.; Balsamo, G.; Boussetta, S.; Choulga, M.; Harrigan, S.; Hersbach, H.; et al. ERA5-Land: A state-of-the-art global reanalysis dataset for land applications. Earth Syst. Sci. Data 2021, 13, 4349–4383. [Google Scholar] [CrossRef]
- Tadono, T.; Ishida, H.; Oda, F.; Naito, S.; Minakawa, K.; Iwamoto, H. Precise Global DEM Generation by ALOS PRISM. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2014, II-4, 71–76. [Google Scholar] [CrossRef] [Green Version]
- Didan, K. MOD13A3 MODIS/Terra Vegetation Indices Monthly L3 Global 1 km SIN Grid V006 [Dataset]. NASA EOSDIS Land Processes DAAC. 2015. Available online: https://doi.org/10.5067/modis/mod13a3.006 (accessed on 14 March 2021).
- Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef] [Green Version]
- Rodriguez, J.D.; Perez, A.; Lozano, J.A. Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 32, 569–575. [Google Scholar] [CrossRef] [PubMed]
- Li, T.; Shen, H.; Zeng, C.; Yuan, Q.; Zhang, L. Point-surface fusion of station measurements and satellite observations for mapping PM2.5 distribution in China: Methods and assessment. Atmos. Environ. 2017, 152, 477–489. [Google Scholar] [CrossRef] [Green Version]
- He, Q.; Huang, B. Satellite-based mapping of daily high-resolution ground PM2.5 in China via space-time regression modeling. Remote Sens. Environ. 2018, 206, 72–83. [Google Scholar] [CrossRef]
- Wei, J.; Huang, W.; Li, Z.; Xue, W.; Peng, Y.; Sun, L.; Cribb, M. Estimating 1-km-resolution PM2.5 concentrations across China using the space-time random forest approach. Remote Sens. Environ. 2019, 231, 111221. [Google Scholar] [CrossRef]
- European Environment Agency. Available online: https://www.eea.europa.eu (accessed on 19 December 2021).
- Baborska-Narozny, M.; Szulgowska-Zgrzywa, M.; Mokrzecka, M.; Chmielewska, A.; Fidorow-Kaprawy, N.; Stefanowicz, E.; Piechurski, K.; Laska, M. Climate justice: Air quality and transitions from solid fuel heating. Build. Cities 2020, 1, 120–140. [Google Scholar] [CrossRef]
- Perrone, M.G.; Zhou, J.; Malandrino, M.; Sangiorgi, G.; Rizzi, C.; Ferrero, L.; Dommen, J.; Bolzacchini, E. PM chemical composition and oxidative potential of the soluble fraction of particles at two sites in the urban area of Milan, Northern Italy. Atmos. Environ. 2016, 128, 104–113. [Google Scholar] [CrossRef] [Green Version]
- Perrone, M.; Larsen, B.; Ferrero, L.; Sangiorgi, G.; De Gennaro, G.; Udisti, R.; Zangrando, R.; Gambaro, A.; Bolzacchini, E. Sources of high PM2.5 concentrations in Milan, Northern Italy: Molecular marker data and CMB modelling. Sci. Total Environ. 2012, 414, 343–355. [Google Scholar] [CrossRef] [PubMed]
- Filonchyk, M.; Hurynovich, V.; Yan, H. Impact of Covid-19 lockdown on air quality in the Poland, Eastern Europe. Environ. Res. 2020, 198, 110454. [Google Scholar] [CrossRef] [PubMed]
- Jenkins, N.; Parfitt, H.; Nicholls, M.; Beckett, P.; Wyche, K.; Smallbone, K.; Gregg, D.; Smith, M. Estimation of Changes in Air Pollution Emissions, Concentrations and Exposure during the COVID-19 Outbreak in the UK; Report for The Air Quality Expert Group, on Behalf of Defra: Analysis of Air Quality Changes Experienced in Sussex and Surrey since the COVID-19 Outbreak; UK Air, Department for Food and Rural Affairs: London, UK, 2020; 57p. [Google Scholar]
- Pala, D.; Casella, V.; Larizza, C.; Malovini, A.; Bellazzi, R. Impact of COVID-19 lockdown on PM concentrations in an Italian Northern City: A year-by-year assessment. PLoS ONE 2022, 17, e0263265. [Google Scholar] [CrossRef]
Name of the Variable | Unit | Minimum | Maximum | Mean | STD |
---|---|---|---|---|---|
PM2.5 | μg/m3 | 2 | 80 | 11.81 | 9.26 |
Aerosol optical depth | - | 0.01 | 3.12 | 0.13 | 0.08 |
PBLH | m | 73.90 | 3420.17 | 933.39 | 463.59 |
WS | m/s | 0.23 | 18.12 | 3.88 | 2.13 |
T2m | K | 249.86 | 314.15 | 287.03 | 8.17 |
Relative Humidity | % | 0.04 | 110.82 | 68.53 | 22.93 |
Total precipitation | mm | 0 | 8 | 0.1 | 0.3 |
Total Column Water Vapour | Kg/m2 | 0.95 | 50.61 | 16.76 | 7.88 |
NDVI | - | −0.3 | 0.73 | 0.25 | 0.12 |
Evaporation | mm | −0.744 | 0.065 | −0.164 | 0.109 |
Elevation | m | −3.88 | 914.26 | 151.66 | 156.01 |
10-CV | R2 | RMSE | MAE |
---|---|---|---|
Sample-based | 0.69 | 5.0 | 3.3 |
Spatial | 0.69 | 4.9 | 3.2 |
Temporal | 0.53 | 6.1 | 4.1 |
Independent Variable | North-West | North-East | South-West | South-East |
---|---|---|---|---|
AOD | 13.25 | 8.81 | 10.43 | 9.11 |
BLH | 15.89 | 15.22 | 14.98 | 10.41 |
T2m | 8.62 | 6.25 | 10.13 | 10.71 |
Rh | 6.41 | 3.99 | 5.82 | 4.71 |
E | 3.58 | 5.99 | 3.44 | 7.96 |
WS | 5.18 | 4.25 | 7.32 | 5.82 |
TCWV | 4.469 | 3.63 | 4.55 | 4.07 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ibrahim, S.; Landa, M.; Pešek, O.; Brodský, L.; Halounová, L. Machine Learning-Based Approach Using Open Data to Estimate PM2.5 over Europe. Remote Sens. 2022, 14, 3392. https://doi.org/10.3390/rs14143392
Ibrahim S, Landa M, Pešek O, Brodský L, Halounová L. Machine Learning-Based Approach Using Open Data to Estimate PM2.5 over Europe. Remote Sensing. 2022; 14(14):3392. https://doi.org/10.3390/rs14143392
Chicago/Turabian StyleIbrahim, Saleem, Martin Landa, Ondřej Pešek, Lukáš Brodský, and Lena Halounová. 2022. "Machine Learning-Based Approach Using Open Data to Estimate PM2.5 over Europe" Remote Sensing 14, no. 14: 3392. https://doi.org/10.3390/rs14143392