Next Article in Journal
RingFormer-Seg: A Scalable and Context-Preserving Vision Transformer Framework for Semantic Segmentation of Ultra-High-Resolution Remote Sensing Imagery
Previous Article in Journal
FDEN: Frequency-Band Decoupling Detail Enhancement Network for High-Fidelity Water Boundary Segmentation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimation of PM2.5 Vertical Profiles from MAX-DOAS Observations Based on Machine Learning Algorithms

1
Institutes of Physical Science and Information Technology, Anhui University, Hefei 230039, China
2
State Key Laboratory of Opto-Electronic Information Acquisition and Protection Technology, Anhui University, Hefei 230039, China
3
Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei 230601, China
4
Key Lab of Environmental Optics & Technology, Anhui Institute of Optics and Fine Mechanics, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China
5
Department of Precision Machinery and Precision Instrumentation, University of Science and Technology of China, Hefei 230026, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(17), 3063; https://doi.org/10.3390/rs17173063
Submission received: 25 July 2025 / Revised: 25 August 2025 / Accepted: 2 September 2025 / Published: 3 September 2025
(This article belongs to the Section AI Remote Sensing)

Abstract

Highlights

What are the main findings?
  • Synchronized NO2 and SO2 vertical profiles contribute to the estimation of PM2.5 vertical distribution.
  • The proposed model can be used to estimate PM2.5 concentration in typical regions in China.
What is the implication of the main finding?
  • PM2.5 mass concentration is not significantly impacted by RH in the two northern cities of Beijing and Lanzhou.
  • PM2.5 vertical profile estimation reveals high altitude air pollution transport event in Beijing.

Abstract

The vertical profile of PM2.5 is important for understanding its secondary formation, transport, and deposition at high altitudes; it also provides important data support for studying the causes and sources of PM2.5 near the ground. Based on machine learning methods, this study fully utilized simultaneous Multi-Axis Differential Optical Absorption Spectroscopy measurements of multiple air pollutants in the atmosphere and employed the measured vertical profiles of aerosol extinction—as well as the vertical profiles of precursors such as NO2 and SO2—to evaluate the vertical distribution of PM2.5 concentration. Three machine learning models (eXtreme Gradient Boosting, Random Forest, and back-propagation neural network) were evaluated using Multi-Axis Differential Optical Absorption Spectroscopy instruments in four typical cities in China: Beijing, Lanzhou, Guangzhou, and Hefei. According to the comparison between estimated PM2.5 and in situ measurements on the ground surface in the four cities, the eXtreme Gradient Boosting model has the best estimation performance, with the Pearson correlation coefficient reaching 0.91. In addition, the in situ instrument mounted on the meteorological observation tower in Beijing was used to validate the estimated PM2.5 profile, and the Pearson correlation coefficient at each height was greater than 0.7. The average PM2.5 vertical profiles in the four typical cities all show an exponential pattern. In Beijing and Guangzhou, PM2.5 can diffuse to high altitudes between 500 and 1000 m; in Lanzhou, it can diffuse to around 1500 m, while it is primarily distributed between the near surface and 500 m in Hefei. Based on the vertical distribution of PM2.5 mass concentration in Beijing, a high-altitude PM2.5 pollutant transport event was identified from January 19th to 21st, 2021, which was not detected by ground-based in situ instruments. During this process, PM2.5 was transported from the 200 to 1500 m altitude level and then sank to the near surface, causing the concentration on the ground surface to continuously increase. The sinking process contributes to approximately 7% of the ground surface PM2.5 every hour.

1. Introduction

Fine particulate matter (PM2.5), which is defined as particulate matter with an aerodynamic diameter of 2.5 um or less, has complex effects on human health and climate change [1,2,3]. Although PM2.5 pollution in China has been significantly reduced over the past decade, outdoor PM2.5 in China still causes approximately 1.2 million premature deaths each year [4]. Atmospheric aerosols are mainly distributed in the troposphere at altitudes of 0–12 km, and those in the 2–4 km altitude layer contribute to over 40% of the total aerosol variation. The long-distance transport of high-altitude aerosols and the vertical deposition of atmospheric aerosols have significant impacts on human health [5,6,7]. Measuring the vertical distribution of PM2.5 mass concentration in the atmosphere can provide a more comprehensive understanding of its dispersion, transformation, transport, and deposition at different altitudes, which can help to better comprehend the impacts of regional transport and deposition on near-surface PM2.5 [8,9,10]. The vertical PM2.5 distribution can help in formulating a target-oriented control strategy for air pollution.
PM2.5 concentration monitoring mainly relies on in situ monitoring methods, such as the Tapered Element Oscillating Microbalance (TEOM) [11], Beta-Ray Absorption Method [12], Light Scattering Method [13], and Gravimetric Method [14]. However, an in situ observation instrument can only monitor the particulate matter concentration in its vicinity. To obtain the vertical mass concentration distribution of PM2.5, an in situ instrument is usually mounted on a meteorological observation tower, a balloon, an airplane, or an unmanned aerial vehicle to collect samples at high altitudes. Sun et al. observed the vertical distribution of PM2.5 mass concentration between 1 August 2009 and 16 August 2009 by mounting three environmental particulate matter monitors (rp1400a) on meteorological observation towers at three different levels (8 m, 120 m, and 280 m) in Beijing [15]. The advantage of mounting an in situ instrument on a meteorological observation tower is the ability to perform long-term monitoring at a relatively low cost. However, the monitoring height is limited by the height of the tower. At present, most such towers in China are no more than 500 m in height and are scarce, being available only in Beijing, Shanghai, and Guangzhou. In situ observations based on tower-mounted instruments have significant gaps in terms of height and geographical coverage. In situ monitoring instruments mounted on balloons or airplanes can measure the vertical distribution of particulate matter from the near surface to the stratosphere, but they have the problem of high cost [16,17,18]. At present, the use of drones equipped with in situ monitoring instruments is an emerging method for vertical distribution monitoring. Compared with balloon and airplane platforms, drones are more flexible and less costly. However, their disadvantage is that the detection height is limited by airspace applications.
In addition to in situ instruments, remote sensing methods are widely used to obtain the vertical distribution of atmospheric aerosols [19,20]. At present, two main approaches can be used for aerosol profile monitoring: lidar and Multi-Axis Differential Optical Absorption Spectroscopy (MAX-DOAS). The satellite-based lidar CALIOP can obtain the vertical distribution of aerosol extinction through backscattering and is often used in research on regional pollutant transport [21]. In addition, ground-based lidar is widely used in the vertical monitoring of aerosol extinction [22]. As a passive remote sensing technique, MAX-DOAS retrieves the vertical distributions of aerosols and trace gases by measuring scattered sunlight at several view elevations [23,24,25]. MAX-DOAS can obtain not only the vertical distribution of aerosol extinction coefficients but also the vertical distributions of NO2 and SO2 simultaneously, which are precursors of PM2.5 [26,27,28,29,30]. In addition, AOD can be obtained using sun photometers and satellite-based imaging spectroradiometers, such as the Moderate-Resolution Imaging Spectroradiometer (MODIS) [31], Himawari-8 [32], the China Aerosol Remote Sensing Network (CARSNET) [33], and the Aerosol Robotic Network (AERONET) [34]. However, all these remote sensing methods merely measure the optical characteristic parameters of aerosols, rather than directly measuring the concentration of particulate matter.
The traditional method assumes a linear relationship between the aerosol extinction coefficient (EC) or aerosol optical depth (AOD) and PM2.5 at a certain relative humidity at a given spatial location. Given the variation in relative humidity and location, the linear relationships can change between different spatial locations. For example, a linear regression relationship between the measured aerosol extinction coefficient and PM2.5 mass concentration has been obtained. However, the reported linear relationships have certain differences; the slopes in Hefei and Wuhan are 0.0148 and 0.0067, respectively [35,36]. On the other hand, machine learning methods can be used to establish the relationship between the PM2.5 mass concentration and aerosol extinction coefficient, and they are widely used in remote sensing [37,38]. Commonly used machine learning models include eXtreme Gradient Boosting (XGBoost), Random Forest (RF), and the back-propagation neural network (BPNN). XGBoost, which is based on the gradient boosting decision tree (GBDT) algorithm, is an efficient gradient boosting decision tree algorithm [39]. The RF model, proposed by Breiman, is an ensemble algorithm based on the bagging algorithm [40,41]. The BPNN model is a multi-layer feedforward neural network trained via error back-propagation [42]. Chen et al. evaluated the performance of Extra Trees (ET), Random Forest (RF), Deep Neural Network (DNN), and Gradient Boosting Regression Tree (GBRT) when generating the vertical distribution of PM2.5 mass concentration via the aerosol extinction profile obtained using Cloud-Aerosol Lidar with Orthogonal Polarization (CALIOP) [43]. The ET model showed the best performance in terms of model effectiveness and feature interpretation. Zhu et al. used the relationship between the aerosol extinction coefficient, temperature, relative humidity, and the near-surface PM2.5 mass concentration to build linear regression, improved linear regression, and Random Forest (RF) models to retrieve the PM2.5 mass concentration profile [44].
This study fully utilized simultaneous MAX-DOAS measurements of multiple air pollutants in the atmosphere and employed the measured vertical profile of aerosol extinction (which is directly related to particulate matter concentration) as well as the vertical profiles of precursors such as NO2 and SO2 (which directly affect the secondary formation of PM2.5), which were combined with relative humidity and wind information to evaluate the vertical concentration distribution of PM2.5. The applicability of this estimation model at different geographical locations was studied. Using the estimated PM2.5 vertical distribution, we analyzed the vertical distribution of particulate matter in four representative cities across China and studied the high-altitude pollutant transport process.

2. Materials and Methods

2.1. Dataset

2.1.1. MAX-DOAS Observations

In this study, four MAX-DOAS sites in typical cities in eastern, western, southern, and northern China—Beijing (BJ, 116.32°E, 39.95°N), Lanzhou (LZ, 103.85°E, 36.14°N), Guangzhou (GZ, 113.36°E, 23.15°N), and Hefei (HF, 117.18°E, 31.78°N)—were used. Beijing, Lanzhou, Hefei, and Guangzhou are typical Chinese cities with different climates and pollution levels and significant differences in the concentrations of PM2.5 precursors and humidity. Using the data from these four cities for model training, we aimed to obtain a general model that is suitable for most areas of China. The locations of these four sites are shown in Figure 1. MAX-DOAS can provide the vertical distribution of various air pollutants, such as NO2 and SO2, and parameters such as the aerosol extinction coefficient (EC). The MAX-DOAS used in the experiment is the SkySpec-2D-200-System manufactured by the Airyx company [28], which consists of a scanning telescope and two spectrometers (Avaspec-UL2048L-USB2, Avantes, The Netherlands). These spectrometers cover the ultraviolet (303–370 nm) and visible bands (390–608 nm). Using a multi-axis scanning method, the instrument can collect scattered sunlight at any elevation and azimuth. A full scan sequence consists of 11 elevation angles (1°, 2°, 3°, 4°, 5°, 6°, 8°, 10°, 15°, 30°, and 90°) at a fixed azimuth. The QDOAS 3.6.0 software (http://uv-vis.aeronomie.be/software/QDOAS/, accessed on 22 December 2022) was used to process the scattered sunlight spectrum and obtain the differential Slant Column Density (dSCD) for O4, NO2, and SO2. The DOAS fitting setting for O4, NO2, and SO2 is as described in previous works [23,24,45]. Then, an optimization estimation (OE) solver was used to retrieve the aerosol extinction coefficient and trace gas (NO2 and SO2) profiles from the dSCD of a full scan [46,47,48,49]. Since O4 concentration is proportional to the square of the oxygen concentration and relatively constant, significant variations in O4 dSCD are mainly caused by the changes in the optical path resulting from the scattering of sunlight by aerosols. MAX-DOAS thus utilizes the optimal estimation method to invert the scattering and absorption of sunlight by aerosols; that is, optical parameters such as the aerosol extinction coefficient. Using the Levenberg–Marquardt method from OE, the cost function χ 2 is minimized. χ 2 is as follows:
χ 2 = ( y F ( x , b ) ) T S ε 1 ( y F ( x , b ) ) + ( x x a ) T S a 1 ( x x a )
When minimizing the cost function, the solution of the inversion is given by
x i + 1 = x i + [ ( 1 + γ i ) S a 1 + K i T S ε 1 K i ] 1 [ K i T S ε 1 ( y F ( x i ) ) S a 1 ( x i x a ) ]
in which x a is the prior state vector, which imposes a prior constraint on the state vector x. The matrices S ε and S a represent the observation and prior covariance matrices, respectively. F ( x , b ) is the forward result of the radiative transfer model, and x i + 1 and x i represent the state vectors of the current and previous iterations, respectively. γ i is the correction factor, and K is the weighting function matrix. y is the observed dSCD.
The profile ranges from 0 to 4.0 km height, with a layer height of 100 m, was used in the retrieval of aerosol EC, NO2, and SO2 profiles. The bottom layer of the profile is regarded as the near-surface observation. The observation time of a full elevation sequence is around 15 min; therefore, the time resolution of the profile is approximately 15 min. To match the time resolution of in situ PM2.5 measurements, hourly averages of these profiles were calculated. Generally, one hour is divided into four MAX-DOAS observation periods, each lasting 15 min. These four periods overlap with the first, second, third, and fourth quarters of an hour; that is, there will be exactly four MAX-DOAS observations within one hour. The average of these four observations was calculated as the hourly average. The aerosol EC is a physical parameter of particulate matter concentration. NO2 and SO2 are considered important precursors of particulate matter. Therefore, the aerosol EC, NO2, and SO2 profiles were used as input for the PM2.5 concentration estimation model.

2.1.2. PM2.5 In Situ Monitoring

The near-surface PM2.5 concentration in this study was obtained from the China National Environmental Monitoring Centre (CNEMC) Air Quality Real-time Publicity System “http://www.cnemc.cn/ (accessed on 20 Decemeber 2024)”, which provides air quality measurements from 3866 monitoring sites in China. Four CNEMC sites were used in this study: Beijing (116.34°E, 39.93°N), Lanzhou (103.8400°E, 36.07°N), Guangzhou (113.32°E, 23.13°N), and Hefei (117.19°E, 31.78°N). The distance from each CNEMC site to its nearby MAX-DOAS site is approximately 2.53, 8.09, 4.53, and 0.95 km, respectively. The criteria for data validity judgment specified in HJ 817–2018, the official technical specifications for operation and quality control of ambient air quality automated monitoring system for particulate matter (PM10 and PM2.5) used by the CNEMC, were strictly followed. The PM2.5 mass concentrations from CNEMC were used as labels to train the machine learning model for PM2.5 concentration estimation. Meanwhile, part of the ground-based in situ measurement dataset was used to evaluate the trained model. In addition, the estimated and the measured PM2.5 profiles were compared; the measured profiles were obtained from the 325-meter-high meteorological observation tower located at the Institute of Atmospheric Physics, Chinese Academy of Sciences (IAP, 39.98°N, 116.39°E), approximately 6 km away from the Chinese Academy of Meteorological Sciences (CAMS). The in situ instruments were placed on the 60, 160, and 280 m platforms of the tower. PM2.5 concentrations were measured using the beta-ray attenuation method [50]. The tower-based PM2.5 data were acquired from the Institute of Atmospheric Physics, Chinese Academy of Sciences (IAP/CAS), with quality control procedures independently executed by IAP/CAS [50].

2.1.3. Meteorological Data

Meteorological conditions have a direct impact on the formation, transportation, transformation, and hygroscopic growth of particulate matter in the atmosphere, and thus were incorporated into the PM2.5 concentration estimation model as important input data. The vertical profiles of meteorological data, including air temperature (T), relative humidity (RH), and wind speed (U, V), were modeled in this study using the Weather Research and Forecasting with Chemistry model version 4.3 (WRF-chem). WRF-chem is used to simulate meteorological fields and air pollutants across the country using a horizontal spatial resolution of 20 km × 20 km and 44 vertical layers [51,52]. Meteorological data with a spatial resolution of 1° × 1° and temporal resolution of 6 h, provided by the National Center for Environmental Prediction (NCEP), were used as the initial meteorological fields and boundary conditions. Furthermore, the emission inventories used in the modeling were obtained from the Multi-resolution Emission Inventory for China (MEIC, http://meicmodel.org/). In this study, the meteorological data were temporally and spatially matched with the vertical grid of the MAX-DOAS profile. In addition, meteorological data at heights less than 20 m were used to represent near-surface meteorological parameters.

2.1.4. Data Preprocessing

The MAX-DOAS profile contains some invalid data due to unavoidable issues such as instrument errors and natural factors. The preprocessing of MAX-DOAS data was performed as follows. Firstly, profiles obtained under cloudy conditions were filtered out according to the Aerosol Robotic Network (AERONET). Secondly, based on the quality control standard of the MAX-DOAS retrieval algorithm [48], invalid data were removed from the aerosol EC and trace gas profiles. Thirdly, as the effective observation time of MAX-DOAS in the daytime is usually from 8:00 to 16:00 (UTC + 8), meteorological data obtained during the daytime were chosen as the model input. Note that all times in this study are Beijing Time (UTC + 8). Finally, the aerosol EC, NO2, SO2, and meteorological data were matched with each other in terms of hourly time points. A few near-surface PM2.5 data points could not be completely matched with the independent variables due to missing data or outliers; these unmatched data were defined as invalid and were removed. After data preprocessing and matching, a total of 6423 aerosol profiles were obtained from the four sites.

2.2. Methodology

2.2.1. XGBoost Model

XGBoost is an efficient gradient boosting decision tree algorithm. The model is based on the gradient boosting decision tree (GBDT) algorithm. As an ensemble algorithm, the core of XGBoost is the boosting algorithm, which integrates multiple weak learners into a strong learner. In other words, it is an additive model that gradually adds classification and regression trees (CART) to minimize the objective function. During the training of the XGBoost model, a CART tree is first trained using the initial training dataset, and then the residual of the former CART tree is taken as the input for the next CART tree. The preceding procedure is repeated numerous times until the stop conditions are satisfied. The estimated result is determined based on the sum of results from all CART trees.

2.2.2. RF Model

The RF model is an ensemble algorithm based on the bagging algorithm. A CART tree is utilized as the base learner, and then multiple CART trees are combined in the RF model. The primary characteristic of the bagging algorithm is its use of the bootstrap resampling technique. Before building a CART tree, each time, some of the samples and features are randomly picked from the initial training dataset and used to train the CART tree. In contrast to the XGBoost model, every CART tree in the RF model is independent of the others. The approach can handle a large number of samples and features while avoiding overfitting. Therefore, the RF model is widely used to solve non-linear problems. Notably, the result for regression problems is usually given by the average of all decision trees’ estimations.

2.2.3. BPNN Model

A back-propagation neural network (BPNN) model is a multi-layer feedforward neural network trained with the error back-propagation. Typically, a BPNN model includes input, hidden, and output layers. The input and output layers each constitute a single network layer, but there may be one to several hidden layers. Each layer of the neural network consists of several neurons, and the connection mode is fully connected. The BPNN learning process consists of two phases. The first phase is forward propagation. The input data are transmitted from the input layer unit to the output layer unit via the hidden layer unit. The second one is back-propagation. An error signal is defined as the difference between the true value and the result of network estimation. During the error back-propagation, the signal is transferred layer-by-layer from the output to the input layer. Meanwhile, the network weights and thresholds are adjusted accordingly [53]. The forward and backward propagation are repeated several times until the difference between the true value and the estimation result meets the requirement.

2.2.4. Model Optimization

In this study, six parameters of the XGBoost model were adjusted: the number of trees, the maximum depth of the tree (max depth), subsample, learning rate, alpha (L1 regularization parameter), and lambda (L2 regularization parameter). For the RF model, there are three parameters to adjust: the number of trees, the maximum depth of the tree (max depth), and the criterion. As a metric for evaluating the quality of decision tree branching, the mean squared error (MSE) was employed as the criterion. Table 1 lists the parameters of the three models. For the BPNN model, five parameters were adjusted accordingly: hidden layer sizes, solver, activation, maximum iterations, and initial learning rate.

2.2.5. Model Evaluation

In this study, the ratio of the training set to the test set was 9:1. Since activation functions are typically used in the training process of BPNNs, the data in the samples are usually mapped to a smaller range (such as [0, 1] or [−1, 1]) for normalization before training. Normalization transforms the data in the dataset to a specific range of values, making it easier for data processing in the neural network. Therefore, before training the BPNN model, the training and testing datasets were normalized to the range of [0, 1] using Equation (3). However, XGBoost and RF models do not require data normalization, because decision tree algorithms are not affected by the scaling or translation of individual features.
x * = x m i n m a x m i n .
This study used the Pearson correlation coefficient (R) and root mean square error (RMSE) to evaluate the error between the measured and estimated PM2.5 concentrations. The R and RMSE values were calculated using the following formulas:
R = C o v y , y ^ σ y σ y ^ ,
R M S E = 1 n i = 1 n y i y i ^ 2 ,
where n is the number of samples; y i is the observed value and y i ^ is the estimated value; C o v ( y , y ^ ) represents the covariance between observed and estimated values; and σ y and σ y ^ are the standard deviations of the observed and estimated values, respectively. A 10-fold cross-validation (CV) was used to evaluate model performance. Therefore, ten subsets from the overall datasets were randomly and equitably generated. To evaluate the performance of the model, nine of the ten subsets were chosen as training data, while the remaining subset was used as test data. The evaluation was repeated ten times until all samples had been evaluated [54]. To assess the models’ performance at various sites, a spatial four-fold cross-validation was also applied, in which three MAX-DOAS sites were used for training the models and the remaining site for testing the models.

2.2.6. Feature Importance

The interpretability of the model can also help in understanding the relationship between aerosol extinction, NO2, SO2, meteorological conditions, and PM2.5 concentration, ultimately facilitating model adjustment for required accuracy. In this study, the SHAP (SHapley Additive exPlanations) method was utilized to interpret the contribution of each feature to the three machine learning models, and the SHAP values are provided as the feature importance. SHAP, proposed by Lundberg and Lee, is a post hoc model interpretation method based on cooperative game theory [55]. The method can assist in the interpretation of models by quantifying the importance of each feature variable in each sample. Consequently, for the entire dataset, the SHAP value can be utilized to efficiently and precisely obtain the local interpretation, i.e., the interpretation of each sample. Then, for a certain feature variable, the SHAP values of the corresponding variable in all samples are calculated. The average of the absolute SHAP values represents the feature’s importance for global interpretation. Meanwhile, it also resolves the problem of feature importance being impacted by the multicollinearity of ML model features. To evaluate the impact of each feature on the PM2.5 concentration estimation, Tree SHAP was applied to the trained XGBoost and RF models, while Kernel SHAP was applied to the trained BPNN model. Tree SHAP and Kernel SHAP were performed using Tree Explainer and Kernel Explainer, respectively [56,57].

2.2.7. Vertical Transport Flux

Transport flux is a critical indicator for evaluating the transport of pollutants [58,59]. In this study, we considered two forms of vertical transport of PM2.5—vertical transport and vertical diffusion. The first form entails calculating vertical transport using vertical wind speed and PM2.5 concentration, which is given by
W F l u x i = C i V i ,
where C i and V i represent the PM2.5 mass concentration (µg/m3) and vertical wind speed (m/s) at a specific height, respectively. The second involves calculating transport diffusion using the turbulence coefficient and PM2.5, which is given by
T F l u x i = C i C i 1 e x c h _ h h ,
where C i and C i 1 represent the PM2.5 mass concentrations in two neighboring layers i t h and ( i 1 ) t h , respectively; e x c h _ h is the turbulence coefficient; and h denotes the height difference between neighboring layers i   and   i 1 . The contribution rate of the transport flux to the near-surface concentration can then be obtained.
The ratio between accumulated vertical flux and the estimated PM2.5 concentration represents the contribution of vertical transport to PM2.5 pollution. Accumulated flux refers to the time-integrated vertical flux of PM2.5. The PM2.5 vertical flux mentioned here includes both the vertical transport caused by vertical wind and the turbulent exchange in the atmosphere.

3. Results and Discussion

3.1. Model Comparison

To evaluate the performance of the XGBoost, RF, and BPNN models, the estimated near-surface PM2.5 mass concentrations of the three models were compared. The test set, 10-fold CV, and spatial 4-fold CV validation were used to evaluate the performance of the three models, and the results are shown in Table 2. These results indicate that the performance of XGBoost and RF is similar, and both are superior to the BPNN in the test set and 10-fold CV. Overall, the Pearson correlation coefficient is the highest and the RMSE value is the lowest for XGBoost compared to RF and BPNN. XGBoost, RF, and BPNN have similar performance in the spatial 4-fold CV, which shows the similar spatial extrapolation capabilities of the models. Temperature, wind speed, and wind direction in different regions can affect the diffusion and transport of PM2.5.
Figure 2 shows the feature importance of the input variables of the three models. The feature importance ranking of six input variables is the same for the XGBoost and RF models, but it is different in the BPNN model. In this study, the correlations of PM2.5 with EC, NO2, SO2, T, RH, and wind were 0.79, 0.39, 0.21, −0.07, 0.08, and −0.25, respectively. The R between the EC and PM2.5 is the largest. Similarly, the feature importance of the aerosol EC is much larger than that of other features, which suggests that the EC is strongly related to the PM2.5 mass concentration. They are positively related, so more PM2.5 usually means a larger aerosol extinction coefficient. Additionally, it is found that the PM2.5 mass concentration is also affected by RH. This is because high relative humidity can result in the hygroscopic growth of particulate matter. When RH is high, water molecules will condense into water droplets on the surface of aerosol particles, increasing the size of the aerosols and thus increasing aerosol extinction. As the aerosol particle size increases, its settling velocity slows down, resulting in a longer residence time and propagation distance in the atmosphere, thereby exacerbating the concentration of PM2.5 [9,60,61]. The impacts of NO2, SO2, T, and wind speed on the concentration of PM2.5 slightly differed among the three models. Overall, the three models do not rely on only one feature.

3.2. Near-Surface PM2.5 Concentration Validation

According to the performance comparison results, the XGBoost model has the best performance. Therefore, the trained XGBoost model was used to estimate the hourly PM2.5 mass concentration at the four MAX-DOAS sites during 2021. The correlation between hourly observed and estimated PM2.5 mass concentrations at four MAX-DOAS sites is shown in Figure 3. The results show that the correlation coefficient of in situ measurement and estimated PM2.5 mass concentrations in Beijing is 0.98, which is better than in the other three sites. The correlation coefficients for Lanzhou, Guangzhou, and Hefei are 0.94, 0.92, and 0.93, respectively. The Lanzhou site RMSE, which is 4.24 μg/m3, is lower than those for the other three sites. The RMSE values for Beijing, Guangzhou, and Hefei are 5.65, 4.63, and 5.98 μg/m3, respectively. Figure 4 shows the correlation between the estimated hourly PM2.5 mass concentration and the aerosol EC measured by MAX-DOAS at four MAX-DOAS sites. The correlation coefficients for Beijing, Lanzhou, Guangzhou, and Hefei are 0.85, 0.58, 0.74, and 0.51, respectively. Overall, the XGBoost model has the best performance at the Beijing site, and there are two main potential reasons. Firstly, more data were obtained for the Beijing site compared to the other sites, which allows the XGBoost model to learn more information from Beijing. Secondly, affected by meteorological, geographical, and instrumental conditions, the aerosol EC has the highest correlation with the near-surface PM2.5 mass concentration measurement in Beijing. Among the four cities, Guangzhou has the highest humidity. Therefore, the aerosol particles in Guangzhou tend to increase in size due to hygroscopic growth. Then, as the aerosol particles become larger, their extinction coefficient naturally increases. As a result, as shown in Figure 4, under the same PM2.5 concentration conditions, the extinction coefficient of aerosols in Guangzhou is significantly higher than those in the other three cities. This also leads to Guangzhou having the largest slope in Figure 4 among the four cities.
Figure 5 shows the hourly variations in near-surface aerosol extinction coefficients, observed PM2.5 mass concentrations, and estimated PM2.5 mass concentrations from May to July at four MAX-DOAS sites. The results show that the estimated and observed PM2.5 concentrations exhibit similar fluctuations. It can also be observed that the estimated PM2.5 mass concentrations are closer to the observed PM2.5 mass concentrations than the aerosol extinction coefficient. In addition, the results further verify the strong correlation between the extinction coefficient and surface-level PM2.5 concentration.

3.3. PM2.5 Vertical Profile Validation

Next, the XGBoost model was used to estimate the hourly vertical distribution of PM2.5 mass concentration. The hourly observed vertical distribution of PM2.5 mass concentration is from the meteorological observation tower at the Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing [45]. The time range of the data is from March to May 2019. Considering that the vertical resolution of the estimated PM2.5 is 100 m, we used the estimated PM2.5 concentrations in the first, second, and third layers to compare with the observation results at heights of 60, 160, and 280 m.
Figure 6 shows the correlation between observed and estimated PM2.5 mass concentrations at different heights. The correlation coefficients at different heights are 0.98, 0.84, 0.76, and 0.71 at the near surface and 60, 160, and 280 m’s height, respectively. The correlation coefficient shows a decreasing trend with the increase in height. The RMSE is the highest at 280 m at 28.78 μg/m3. The RMSE values at the other heights are 9.53 μg/m3 at the near surface, 22.18 μg/m3 at 60 m, and 25.13 μg/m3 at 160 m. The slopes for 60, 160, and 280 m are 0.65, 0.8, and 0.82, respectively. This difference is mainly attributed to the vertical profiles of aerosol extinction coefficients measured by MAX-DOAS due to its different sensitivities to observations at different altitudes. Additionally, when the observed PM2.5 mass concentration exceeds approximately 100.00 μg/m3, PM2.5 is underestimated. This underestimation is mainly due to the small sample of high values, such that the models could not learn the connection between the inputs and the observed PM2.5 concentration under all conditions. In addition, the XGBoost model was trained based on near-surface datasets. When the model is extrapolated to higher heights, it introduces a certain bias. This results in a decrease in the correlation coefficient when the model is applied to higher altitudes. It is still notable that the overall correlation coefficient is larger than 0.71. In conclusion, the results support the feasibility of using the XGBoost model to estimate the vertical distribution of PM2.5 mass concentration.

3.4. Vertical Distribution of PM2.5

Figure 7 shows the vertical distribution of the annual hourly average PM2.5 mass concentrations in 2021 for the four MAX-DOAS sites, as well as the corresponding feature importance ranking for each site. Utilizing the XGBoost model, data on the PM2.5 mass concentration can be obtained in different areas and at varying altitudes. Unlike the traditional linear extinction–PM2.5 relationship, which may vary with the study area and height, the XGBoost model achieves uniformity, enabling a horizontal comparison of PM2.5 mass concentrations at different altitudes and in different study areas. As shown in Figure 7, in 2021, high-altitude PM2.5 pollution was the most severe in Lanzhou, followed by Beijing and Guangzhou, while Hefei had the least pollution. Lanzhou, Guangzhou, and Beijing exhibit significant diurnal variations in the distribution of PM2.5 mass concentrations at different altitudes. The boundary layer height is low in the morning and evening, leading to high PM2.5 mass concentrations at lower altitudes. However, at noon, as the boundary layer height increases, the PM2.5 mass tends to diffuse to higher altitudes. PM2.5 in Beijing and Guangzhou can diffuse to high altitudes between 500 and 1000 m; meanwhile, it can diffuse to around 1500 m in Lanzhou, and it is primarily distributed between the near surface and 500 m in Hefei.
Figure 7e–h describe the feature importance ranking of the estimated PM2.5 mass concentration at each of the four MAX-DOAS sites. The x-axis represents the SHAP value of each feature in each sample, which indicates its importance in the estimation process. The y-axis represents the feature names, with the importance decreasing from top to bottom. Each sample is represented by a point in the figures. The color bar represents the true value of each feature, with the color of the point becoming redder as the true value increases and bluer as the true value decreases. It is found that the aerosol EC has the highest importance in the estimation of PM2.5 mass concentration at all four MAX-DOAS sites. Furthermore, the impact of the aerosol EC on PM2.5 mass concentration increases as the value of the EC increases. At the Beijing, Lanzhou, and Guangzhou sites, the importance of nitrogen dioxide ranks second while, at the Hefei site, it ranks fourth. Moreover, the impact of NO2 on PM2.5 mass concentration also increases as NO2 concentration increases. The ranking of SO2 at all four sites is relatively low, and its impact on PM2.5 mass concentration is not significant. The effects of temperature and wind speed on PM2.5 mass concentration are complex; hence, their feature importance rankings vary across the four sites. PM2.5 mass concentration is not significantly impacted by RH in the two northern cities of Beijing and Lanzhou, while it is heavily influenced by RH in Guangzhou and Hefei. The reason for this is that Lanzhou and Beijing have low humidity and a smaller range of humidity variations, resulting in a lesser impact on PM2.5 mass concentration. On the other hand, Hefei and Guangzhou experience significant fluctuations in humidity, leading to a greater impact on PM2.5 mass concentration. In addition, since 2012, China has substantially reduced sulfur and nitrogen emissions. In particular, the reduction in SO2 has produced significant effects. Currently, the concentration of SO2 in China is very low, which led to SO2, as a precursor of PM2.5, contributing less to the formation of PM2.5. Although the concentration of NO2 has decreased since 2012, nitrate is still one of the main components of current aerosols and, so, the impact of NO2 on PM2.5 concentration is still significant.
Figure 8 shows a PM2.5 pollutant event that occurred from January 19th to 21st, 2021 in Beijing, illustrating the high-altitude transport and deposition of PM2.5. Figure 8a shows the variation in aerosol extinction, HCHO, NO2, PM2.5, and wind speed profiles during these three days. Figure 8b shows the measured and estimated ground-surface PM2.5 concentration, AERONET AOD, and MAX-DOAS aerosol extinction at the ground surface. On 19 January 2021, there was a high-altitude transport process. The pollutants were mainly concentrated at high altitudes between 8:00 and 17:00, with a height range of 0–1.5 km. However, the observed PM2.5 at the near-surface level was less than 30 μg/m3—a low PM2.5 concentration. This implies the occurrence of high-altitude transport, accompanied by NO2 and HCHO transport at heights of around 200–1500 m. On this day, the wind speed reached its highest value during the pollution event, which also contributed to the transport of PM2.5 in high-altitude areas.
The PM2.5 mass concentrations started to descend on January 20th, leading to an increase in near-surface PM2.5 mass concentration and a decrease in the aerosol extinction coefficient at high altitudes. The aerosol extinction coefficient and HCHO both showed high values from around 10:00 to 12:00 at an altitude of approximately 500 to 1000 m, which indicates the potential secondary generation of aerosols. At the same time, NO2 was also transported at the altitude range of 0.3–0.5 km, with a peak appearing at 16:00. Around 10:00, wind speed decreased, and PM2.5 mass concentration began to accumulate at high altitudes.
On January 21st, the aerosols descended to the near surface, and the wind speed decreased between 10:00 and 13:00. Following the stabilization of atmospheric conditions, significant deposition of PM2.5 on the ground surface from high altitudes began to occur. Consequently, high concentrations of PM2.5, NO2, and HCHO at the ground surface were observed during this time. The high concentrations of NO2 and HCHO promoted the secondary generation of PM2.5. After 13:00, the wind speed increased, and the concentrations of NO2, HCHO, and PM2.5 decreased.
As shown in Figure 8b, during this air pollution period, the estimated near-surface PM2.5 mass concentration was below 30.00 μg/m3 on the 19th, and then gradually increased from the 20th and reached its peak on the 21st. This observation is consistent with the measurement results. Additionally, AERONET and MAX-DOAS AOD have similar decreasing trends during the air pollution period and exhibit the total opposite phenomenon from the near-surface PM2.5 concentration. This indicates that the aerosols are at high altitudes, which is consistent with the profiles of aerosol extinction and estimated PM2.5 concentration shown in Figure 8a. As the total AOD decreases, the near-surface PM2.5 concentration increases. This indicates that the deposition of high-altitude aerosols results in an increase in near-surface PM2.5 concentration. The AERONET site used in this study is Beijing-CAMS, with a latitude and longitude of 39.9°N, 116.3°E. The Beijing-CAMS site is approximately 6 km from the MAX-DOAS Beijing site.
Figure 9 shows the vertical transport flux during this PM2.5 pollution event at each height level, calculated separately using the turbulence coefficient and vertical wind speed. The positive values represent upward transport, and negative values represent downward transport. The results show an overall transport of PM2.5 from higher altitudes towards the near surface. In addition, the vertical diffusion flux is larger than the flux driven by vertical wind. They also show significant diffusion between the near surface and 500 m. Moreover, we find that the vertical transport flux caused by vertical wind decreases rapidly over 1.4 km height.
Furthermore, we adopt the ratio between accumulated vertical flux and the estimated PM2.5 concentration to represent the contribution of vertical transport to PM2.5 pollution. The contribution rate of the transport flux at 100 m to the near-surface PM2.5 concentration is shown in Figure 10. Figure 10a shows the turbulence coefficient at 100 m and the pollutant transport contribution rate calculated using the turbulence coefficient. A positive contribution ratio means downward transport, and a negative value means upward transport. The maximum contribution rate reaches 6%. Figure 10b shows an up to 0.8% contribution rate of the vertical transport flux calculated using the vertical wind speed. The results show that diffusion contributes more than vertical wind transport. In addition, PM2.5 is mostly brought to the near surface by downward diffusion and vertical wind in the afternoon during PM2.5 pollution events.
In general, the technological advantage of this method is that it incorporates synchronous MAX-DOAS observations of PM2.5 precursors. The limitation of the proposed method is that atmospheric meteorological data are required to estimate PM2.5 concentration, which affects the time efficiency and prevents the acquisition of real-time PM2.5 concentration estimation results through observation.

4. Conclusions

In this study, we employed machine learning models and integrated the aerosol extinction coefficient, NO2, SO2, T, RH, and wind obtained using MAX-DOAS and WRF-chem to estimate the PM2.5 mass concentration. The performance of the XGBoost, RF, and BPNN models was compared based on the correlation between observed and estimated PM2.5 mass concentrations. Furthermore, the XGBoost model was chosen to estimate the vertical distribution of the PM2.5 mass concentration. The model was used to estimate the vertical distribution of the annual average PM2.5 mass concentration at four MAX-DOAS sites, as well as a specific high-altitude pollutant transport event in Beijing.
The comparison between the XGBoost, RF, and BPNN models for estimating near-surface PM2.5 mass concentration suggests that the XGBoost model has the best performance among the three models. The R and RMSE values are 0.91 and 13.37 μg/m3, respectively. For a 10-fold CV, the R is 0.98 and the RMSE is 6.52 μg/m3. The RF and XGBoost models showed similar performance. Detailed feature importance analysis suggests that aerosol EC has the greatest effect on the PM2.5 mass concentration, while relative humidity has a more significant impact on PM2.5 in Hefei and Guangzhou than in Beijing and Lanzhou. The comparison between tower-based measurements and estimates of the PM2.5 concentration shows the XGBoost model’s feasibility in estimating the vertical distribution of the PM2.5 mass concentration.
Finally, the vertical distributions of the annual average PM2.5 mass concentrations at the four sites in 2021 were compared. The daily variation in the annual average PM2.5 vertical distribution was obtained for the four MAX-DOAS sites. The PM2.5 mass concentration is not significantly impacted by RH in the two northern cities of Beijing and Lanzhou, but it is heavily influenced by RH in Guangzhou and Hefei. Lanzhou, being closer to the desert, has the highest PM2.5 concentration of the four cities. In addition, PM2.5 in Beijing and Guangzhou can diffuse to high altitudes between 500 and 1000 m; in contrast, it can diffuse to around 1500 m in Lanzhou, and it is primarily distributed between the near surface and 500 m in Hefei. We identified a high-altitude PM2.5 transport event from January 19th to 21st, 2021, in Beijing. PM2.5 was transported through the high-altitude area and subsequently descended towards the near surface, with diffusion and vertical wind contribution rates of up to 6% and 0.8%, respectively.
In summary, machine learning methods were utilized to establish a conversion relationship between aerosol extinction measured by MAX-DOAS and the PM2.5 mass concentration, and the vertical distribution of the PM2.5 concentration was obtained in four key cities across China. The importance of the influencing factors of particulate matter concentration was evaluated using interpretable machine learning networks. This work used the vertical distribution of MAX-DOAS in combination with the meteorological field to quantitatively analyze the high-altitude transport of particulate matter, as well as the diffusion and vertical transport of particles from high to low altitudes and their contributions to the near-ground concentration. This study expands the methods for monitoring the vertical profile of PM2.5 and provides data support for future studies on the vertical evolution of atmospheric particulate matter, regional transport, and vertical exchange of air pollutants. Additionally, through the estimation of PM2.5, we found that precursors, such as NO2, have significant impacts on PM2.5 generation.

Author Contributions

Conceptualization, Q.L. and Q.H.; Methodology, Q.L.; Software, J.L. and H.Q.; Investigation, S.X.; Resources, Z.Z.; Data curation, J.L., H.Q., C.X., W.T. and H.L.; Writing—original draft, Q.L.; Supervision, Q.H.; Project administration, Q.H.; Funding acquisition, Q.L. and Q.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Research and Development Project of Anhui Province (2023t07020015), National Natural Science Foundation of China (U21A2027), the Youth Innovation Promotion Association of CAS (2021443), the HFIPS Director’s Fund (BJPY2022B07 and YZJJQY202303), the open Foundation of the Key Laboratory of Environmental Optics and Technology, CAS (20050P173065-2024-02), the Hefei Comprehensive National Science Center, and the major science and technology special project of the Xinjiang Uygur Autonomous Region (2024A03012).

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The authors would like to thank the CNEMC sites for providing free ground-surface hourly PM2.5 concentration data (http://www.cnemc.cn/ (accessed on 20 December 2024)). We also acknowledge Tang Guiqian from the Institute of Atmospheric Physics, Chinese Academy of Sciences, for providing the tower-based PM2.5 concentration profiles. We would like to acknowledge the DOAS UV-VIS team at BI-RA-IASB, led by M. Van Roozendael. We performed spectral fitting based on QDOAS 3.6.0, which is a free and open-source software developed by the authors (http://uv-vis.aeronomie.be/software/QDOAS/ (accessed on 20 December 2024)). We also acknowledge the SCIATRAN development team at the Institute of Remote Sensing/Institute of Environmental Physics (IUP/IFE), University of Bremen. We calculated the radiation transfer model using SCIATRAN, a free and open-source software developed by them (https://www.iup.uni-bremen.de/sciatran/ (accessed on 20 December 2024)). We would like to thank the European Centre for Medium-Range Weather Forecasts (ECMWF) and the Copernicus Climate Change Service (C3S) for the ERA5 reanalysis dataset.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Feng, S.; Gao, D.; Liao, F.; Zhou, F.; Wang, X. The health effects of ambient PM2.5 and potential mechanisms. Ecotoxicol. Environ. Saf. 2016, 128, 67–74. [Google Scholar] [CrossRef]
  2. Huang, R.-J.; Zhang, Y.; Bozzetti, C.; Ho, K.-F.; Cao, J.-J.; Han, Y.; Daellenbach, K.R.; Slowik, J.G.; Platt, S.M.; Canonaco, F. High secondary aerosol contribution to particulate pollution during haze events in China. Nature 2014, 514, 218–222. [Google Scholar] [CrossRef] [PubMed]
  3. Li, G.; Fang, C.; Wang, S.; Sun, S. The Effect of Economic Growth, Urbanization, and Industrialization on Fine Particulate Matter (PM2.5) Concentrations in China. Environ. Sci. Technol. 2016, 50, 11452–11459. [Google Scholar] [CrossRef]
  4. Yin, P.; Brauer, M.; Cohen, A.J.; Wang, H.; Li, J.; Burnett, R.T.; Stanaway, J.D.; Causey, K.; Larson, S.; Godwin, W.; et al. The effect of air pollution on deaths, disease burden, and life expectancy across China and its provinces, 1990–2017: An analysis for the Global Burden of Disease Study 2017. Lancet Planet. Health 2020, 4, e386–e398. [Google Scholar] [CrossRef] [PubMed]
  5. Chen, B.; Dong, L.; Huang, J.; Wang, Y.; Jing, Z.; Yan, W.; Wang, X.; Song, Z.; Huang, Z.; Guan, X.; et al. Analysis of Long-Term Trends in the Vertical Distribution and Transport Paths of Atmospheric Aerosols in Typical Regions of China Using 15 Years of CALIOP Data. J. Geophys. Res. Atmos. 2023, 128, e2022JD038066. [Google Scholar] [CrossRef]
  6. Yang, T.; Li, H.; Xu, W.; Song, Y.; Xu, L.; Wang, H.; Wang, F.; Sun, Y.; Wang, Z.; Fu, P. Strong impacts of regional atmospheric transport on the vertical distribution of aerosol ammonium over Beijing. Environ. Sci. Technol. Lett. 2023, 11, 29–34. [Google Scholar] [CrossRef]
  7. Zhang, X.; Xu, W.; Song, X.; Duan, X.; Peng, X.; Lin, W.; Zhao, G.; Zhang, G.; Xu, X.; Jin, J. Cross-Regional Horizontal and Vertical Transport Pathways of Carbon Monoxide and Its Impact on Air Pollution in the Tibetan Plateau. J. Geophys. Res. Atmos. 2025, 130, e2024JD041859. [Google Scholar] [CrossRef]
  8. Li, X.-B.; Wang, D.-S.; Lu, Q.-C.; Peng, Z.-R.; Wang, Z.-Y. Investigating vertical distribution patterns of lower tropospheric PM2.5 using unmanned aerial vehicle measurements. Atmos. Environ. 2018, 173, 62–71. [Google Scholar] [CrossRef]
  9. Sun, Y.; Wang, Z.; Fu, P.; Jiang, Q.; Yang, T.; Li, J.; Ge, X. The impact of relative humidity on aerosol composition and evolution processes during wintertime in Beijing, China. Atmos. Environ. 2013, 77, 927–934. [Google Scholar] [CrossRef]
  10. Zhang, K.; Zhou, L.; Fu, Q.; Yan, L.; Morawska, L.; Jayaratne, R.; Xiu, G. Sources and vertical distribution of PM2.5 over Shanghai during the winter of 2017. Sci. Total Environ. 2020, 706, 135683. [Google Scholar] [CrossRef]
  11. Lung, S.C.C.; Wang, W.C.V.; Wen, T.Y.J.; Liu, C.H.; Hu, S.C. A versatile low-cost sensing device for assessing PM2.5 spatiotemporal variation and quantifying source contribution. Sci. Total Environ. 2020, 716, 137145. [Google Scholar] [CrossRef]
  12. Kim, B.Y.; Cha, J.W.; Lee, Y.H. Estimation of PM10 and PM2.5 using backscatter coefficient of ceilometer and machine learning. Aerosol. Air Qual. Res. 2023, 23, 230033. [Google Scholar] [CrossRef]
  13. Mehadi, A.; Moosmuller, H.; Campbell, D.E.; Ham, W.; Schweizer, D.; Tarnay, L.; Hunter, J. Laboratory and field evaluation of real-time and near real-time PM2.5 smoke monitors. J. Air Waste Manag. Assoc. 2020, 70, 158–179. [Google Scholar] [CrossRef]
  14. Cambra-Lopez, M.; Winkel, A.; Mosquera, J.; Ogink, N.W.; Aarnink, A.J. Comparison between light scattering and gravimetric devices for sampling PM10 mass concentration in livestock houses. In 2012 IX International Livestock Environment Symposium (ILES IX); American Society of Agricultural and Biological Engineers: St Joseph, MI, USA, 2012; p. 3. [Google Scholar] [CrossRef]
  15. Sun, Y.; Song, T.; Tang, G.; Wang, Y. The vertical distribution of PM2.5 and boundary-layer structure during summer haze in Beijing. Atmos. Environ. 2013, 74, 413–421. [Google Scholar] [CrossRef]
  16. Ran, L.; Deng, Z.; Wu, Y.; Li, J.; Bai, Z.; Lu, Y.; Zhuoga, D.; Bian, J. Measurement report: Vertical profiling of particle size distributions over Lhasa, Tibet: Tethered balloon-based in-situ measurements and source apportionment. Atmos. Chem. Phys. Discuss. 2021, 22, 6217–6229. [Google Scholar] [CrossRef]
  17. Kim, J.M.; Lee, H.J.; Jo, H.Y.; Jo, Y.J.; Kim, C.H. Vertical characteristics of secondary aerosols observed in the Seoul and Busan Metropolitan Areas of Korea during KORUS-AQ and associations with meteorological conditions. Atmosphere 2021, 12, 1451. [Google Scholar] [CrossRef]
  18. Hwang, H.; Lee, J.E.; Shin, S.A.; You, C.R.; Shin, S.H.; Park, J.-S.; Lee, J.Y. Vertical profiles of PM2.5 and O3 measured using an unmanned aerial vehicle (UAV) and their relationships with synoptic-and local-scale air movements. Remote Sens. 2024, 16, 1581. [Google Scholar] [CrossRef]
  19. Wenjuan, Z.; Bo, L.; Fengjuan, S. Research on inversion method of PM2.5 mass concentration by aerosol extinction coefficient. Infrared Laser Eng. 2020, 49, 20200367. [Google Scholar] [CrossRef]
  20. Zongming, T.; Huihui, S.; Xiaomin, M. Retrieval method of PM2.5 mass concentration profile in near-ground and case study. Laser Optoelectron. Prog. 2015, 52, 110102. [Google Scholar] [CrossRef]
  21. Hung, J.; Minnis, P.; Chen, B.; Huang, Z.; Liu, Z.; Zhao, Q.; Yi, Y.; Ayers, J.K. Long-range transport and vertical structure of Asian dust from CALIPSO and surface measurements during PACDEX. J. Geophys. Res. Atmos. 2008, 113, D23212. [Google Scholar] [CrossRef]
  22. Reid, J.S.; Kuehn, R.E.; Holz, R.E.; Eloranta, E.W.; Kaku, K.C.; Kuang, S.; Newchurch, M.J.; Thompson, A.M.; Trepte, C.R.; Zhang, J.; et al. Ground-based High Spectral Resolution Lidar observation of aerosol vertical distribution in the summertime Southeast United States. J. Geophys. Res. Atmos. 2017, 122, 2970–3004. [Google Scholar] [CrossRef]
  23. Javed, Z.; Liu, C.; Khokhar, M.F.; Tan, W.; Liu, H.; Xing, C.; Ji, X.; Tanvir, A.; Hong, Q.; Sandhu, O. Ground-based MAX-DOAS observations of CHOCHO and HCHO in Beijing and Baoding, China. Remote Sens. 2019, 11, 1524. [Google Scholar] [CrossRef]
  24. Liu, C.; Xing, C.; Hu, Q.; Li, Q.; Liu, H.; Hong, Q.; Tan, W.; Ji, X.; Lin, H.; Lu, C.; et al. Ground-based hyperspectral stereoscopic remote sensing network: A promising strategy to learn coordinated control of O3 and PM2.5 over China. Engineering 2021, 19, 71–83. [Google Scholar] [CrossRef]
  25. Irie, H.; Takashima, H.; Kanaya, Y.; Boersma, K.F.; Gast, L.; Wittrock, F.; Brunner, D.; Zhou, Y.; Van Roozendael, M. Eight-component retrievals from ground-based MAX-DOAS observations. Atmos. Meas. Tech. 2011, 4, 1027–1044. [Google Scholar] [CrossRef]
  26. Frieß, U.; Klein Baltink, H.; Beirle, S.; Clémer, K.; Hendrick, F.; Henzing, B.; Irie, H.; de Leeuw, G.; Li, A.; Moerman, M.M.; et al. Intercomparison of aerosol extinction profiles retrieved from MAX-DOAS measurements. Atmos. Meas. Tech. 2016, 9, 3205–3222. [Google Scholar] [CrossRef]
  27. Tan, W.; Liu, C.; Wang, S.; Liu, H.; Zhu, Y.; Su, W.; Hu, Q.; Liu, J. Long-distance mobile MAX-DOAS observations of NO2 and SO2 over the North China Plain and identification of regional transport and power plant emissions. Atmos. Res. 2020, 245, 105037. [Google Scholar] [CrossRef]
  28. Tan, W.; Liu, C.; Wang, S.; Xing, C.; Su, W.; Zhang, C.; Xia, C.; Liu, H.; Cai, Z.; Liu, J. Tropospheric NO2, SO2, and HCHO over the East China Sea, using ship-based MAX-DOAS observations and comparison with OMI and OMPS satellite data. Atmos. Chem. Phys. 2018, 18, 15387–15402. [Google Scholar] [CrossRef]
  29. Ou, J.; Hu, Q.; Liu, H.; Hong, Q.; Xing, C.; Tan, W.; Lin, H.; Wang, X.; Xu, H.; Zhu, P.; et al. Vertical characterization and potential sources of aerosols in different seasons over the Yangtze River Delta using ground-based MAX-DOAS. Environ. Pollut. 2021, 279, 116898. [Google Scholar] [CrossRef]
  30. Wang, Y.; Beirle, S.; Lampel, J.; Koukouli, M.; De Smedt, I.; Theys, N.; Li, A.; Wu, D.; Xie, P.; Liu, C.; et al. Validation of OMI, GOME-2A and GOME-2B tropospheric NO2, SO2 and HCHO products using MAX-DOAS observations from 2011 to 2014 in Wuxi, China: Investigation of the effects of priori profiles and aerosols on the satellite products. Atmos. Chem. Phys. 2017, 17, 5007–5033. [Google Scholar] [CrossRef]
  31. Chen, X.; de Leeuw, G.; Arola, A.; Liu, S.; Liu, Y.; Li, Z.; Zhang, K. Joint retrieval of the aerosol fine mode fraction and optical depth using MODIS spectral reflectance over northern and eastern China: Artificial neural network method. Remote Sens. Environ. 2020, 249, 112006. [Google Scholar] [CrossRef]
  32. Zhang, W.; Xu, H.; Zheng, F. Aerosol optical depth retrieval over East Asia using Himawari-8/AHI data. Remote Sens. 2018, 10, 137. [Google Scholar] [CrossRef]
  33. Che, H.; Qi, B.; Zhao, H.; Xia, X.; Eck, T.F.; Goloub, P.; Dubovik, O.; Estelles, V.; Cuevas-Agulló, E.; Blarel, L.; et al. Aerosol optical properties and direct radiative forcing based on measurements from the China Aerosol Remote Sensing Network (CARSNET) in eastern China. Atmos. Chem. Phys. 2018, 18, 405–425. [Google Scholar] [CrossRef]
  34. Marshak, A.; Ackerman, A.; da Silva, A.M.; Eck, T.; Holben, B.; Kahn, R.; Kleidman, R.; Knobelspiesse, K.; Levy, R.; Lyapustin, A.; et al. Aerosol Properties in Cloudy Environments from Remote Sensing Observations: A Review of the Current State of Knowledge. Bull. Am. Meteorol. Soc. 2021, 102, E2177–E2197. [Google Scholar] [CrossRef]
  35. Li, X.; Xie, P.; Xu, J.; Li, A.; Tian, X.; Ren, B.; Hu, Z.; Wu, Z. Aerosol Observation and Research in Hefei by MAX-DOAS Technology. Spectrosc. Spectr. Anal. 2020, 40, 712–719. [Google Scholar] [CrossRef]
  36. Shao, H.; Li, H.; Jin, S.; Fan, R.; Wang, W.; Liu, B.; Ma, Y.; Wei, R.; Gong, W. Exploring the Conversion Model from Aerosol Extinction Coefficient to PM1, PM2.5 and PM10 Concentrations. Remote Sens. 2023, 15, 2742. [Google Scholar] [CrossRef]
  37. Chai, J.; Song, J.; Xu, Y.; Zhang, L.; Guo, B. Enhancing the applicability of satellite remote sensing for PM2.5 estimation using machine learning models in China. J. Sens. 2022, 2022, 7148682. [Google Scholar] [CrossRef]
  38. Zhang, T.; Chen, Y.; Zhu, R.; Wilson, J.P.; Song, J.; Chen, R.; Liu, L.; Bao, L. An Intelligent Learning Reconfiguration Model Based on Optimized Transformer and Multi-Source Features (TMSF) for High-Precision InSAR DEM Void Filling. IEEE Trans. Geosci. Remote Sens. 2025, 63, 5203418. [Google Scholar] [CrossRef]
  39. Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
  40. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  41. Biau, G. Analysis of a random forests model. J. Mach. Learn. Res. 2012, 13, 1063–1095. [Google Scholar]
  42. Li, J.; Cheng, J.; Shi, J.; Huang, F. Brief introduction of back propagation (BP) neural network algorithm and its improvement. In Advances in Computer Science and Information Engineering; Springer: Berlin/Heidelberg, Germany, 2012; pp. 553–558. [Google Scholar] [CrossRef]
  43. Chen, B.; Song, Z.; Pan, F.; Huang, Y. Obtaining vertical distribution of PM2.5 from CALIOP data and machine learning algorithms. Sci. Total Environ. 2022, 805, 150338. [Google Scholar] [CrossRef]
  44. Zhu, Y.; Ma, Y.; Liu, B.; Xu, X.; Jin, S.; Gong, W. Retrieving the Vertical Distribution of PM2.5 Mass Concentration from Lidar via a Random Forest Model. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5701209. [Google Scholar] [CrossRef]
  45. Hong, Q.; Liu, C.; Hu, Q.; Xing, C.; Tan, W.; Liu, H.; Huang, Y.; Zhu, Y.; Zhang, J.; Geng, T. Evolution of the vertical structure of air pollutants during winter heavy pollution episodes: The role of regional transport and potential sources. Atmos. Res. 2019, 228, 206–222. [Google Scholar] [CrossRef]
  46. Hönninger, G.; Von Friedeburg, C.; Platt, U. Multi axis differential optical absorption spectroscopy (MAX-DOAS). Atmos. Chem. Phys. 2004, 4, 231–254. [Google Scholar] [CrossRef]
  47. Lin, H.; Liu, C.; Xing, C.; Hu, Q.; Hong, Q.; Liu, H.; Li, Q.; Tan, W.; Ji, X.; Wang, Z. Validation of water vapor vertical distributions retrieved from max-doas over Beijing, China. Remote Sens. 2020, 12, 3193. [Google Scholar] [CrossRef]
  48. Xing, C.; Liu, C.; Wang, S.; Chan, K.L.; Gao, Y.; Huang, X.; Su, W.; Zhang, C.; Dong, Y.; Fan, G. Observations of the vertical distributions of summertime atmospheric pollutants and the corresponding ozone production in Shanghai, China. Atmos. Chem. Phys. 2017, 17, 14275–14289. [Google Scholar] [CrossRef]
  49. Xing, C.; Liu, C.; Wang, S.; Hu, Q.; Liu, H.; Tan, W.; Zhang, W.; Li, B.; Liu, J. A new method to determine the aerosol optical properties from multiple-wavelength O4 absorptions by MAX-DOAS observation. Atmos. Meas. Tech. 2019, 12, 3289–3302. [Google Scholar] [CrossRef]
  50. Cheng, J.; Gui, T.; Qi, L.; Bao, L.; Meng, W.; Yue, W. Vertical Profile of Aerosol in Spring in Beijing Based on Multi-Axis Differential Optical Absorption Spectroscopy Detection. Spectrosc. Spectr. Anal. 2022, 42, 265–271. [Google Scholar] [CrossRef]
  51. Grell, G.A.; Peckham, S.E.; Schmitz, R.; McKeen, S.A.; Frost, G.; Skamarock, W.C.; Eder, B. Fully coupled “online” chemistry within the WRF model. Atmos. Environ. 2005, 39, 6957–6975. [Google Scholar] [CrossRef]
  52. Zhu, Y.; Hu, Q.; Gao, M.; Zhao, C.; Zhang, C.; Liu, T.; Tian, Y.; Yan, L.; Su, W.; Hong, X.; et al. Quantifying contributions of local emissions and regional transport to NOX in Beijing using TROPOMI constrained WRF-Chem simulation. Remote Sens. 2021, 13, 1798. [Google Scholar] [CrossRef]
  53. Chen, Y. Prediction algorithm of PM2.5 mass concentration based on adaptive BP neural network. Computing 2018, 100, 825–838. [Google Scholar] [CrossRef]
  54. Fushiki, T. Estimation of prediction error by using K-fold cross-validation. Stat. Comput. 2011, 21, 137–146. [Google Scholar] [CrossRef]
  55. Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. In Proceedings of the 31st Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar] [CrossRef]
  56. García, M.V.; Aznarte, J.L. Shapley additive explanations for NO2 forecasting. Ecol. Inform. 2020, 56, 101039. [Google Scholar] [CrossRef]
  57. Lundberg, S.M.; Erion, G.G.; Lee, S.-I. Consistent individualized feature attribution for tree ensembles. arXiv 2018, arXiv:1802.03888. [Google Scholar] [CrossRef]
  58. Sulaymon, I.D.; Zhang, Y.; Hu, J.; Hopke, P.K.; Zhang, Y.; Zhao, B.; Xing, J.; Li, L.; Mei, X. Evaluation of regional transport of PM2.5 during severe atmospheric pollution episodes in the western Yangtze River Delta, China. J. Environ. Manag. 2021, 293, 112827. [Google Scholar] [CrossRef]
  59. Zhang, H.; Xing, Y.; Cheng, S.; Wang, X.; Guan, P. Characterization of multiple atmospheric pollutants during haze and non-haze episodes in Beijing, China: Concentration, chemical components and transport flux variations. Atmos. Environ. 2021, 246, 118129. [Google Scholar] [CrossRef]
  60. Liu, P.; Zhao, C.; Göbel, T.; Hallbauer, E.; Nowak, A.; Ran, L.; Xu, W.; Deng, Z.; Ma, N.; Mildenberger, K. Hygroscopic properties of aerosol particles at high relative humidity and their diurnal variations in the North China Plain. Atmos. Chem. Phys. 2011, 11, 3479–3494. [Google Scholar] [CrossRef]
  61. Zhang, L.; Cao, X.; Bao, J.; Zhou, B.; Huang, J.; Shi, J.; Bi, J. A case study of dust aerosol radiative properties over Lanzhou, China. Atmos. Chem. Phys. 2010, 10, 4283–4293. [Google Scholar] [CrossRef]
Figure 1. The spatial distribution of the four studied MAX-DOAS sites.
Figure 1. The spatial distribution of the four studied MAX-DOAS sites.
Remotesensing 17 03063 g001
Figure 2. Ranking histograms of feature importance for (a) XGBoost, (b) RF, and (c) BPNN models.
Figure 2. Ranking histograms of feature importance for (a) XGBoost, (b) RF, and (c) BPNN models.
Remotesensing 17 03063 g002
Figure 3. Correlation coefficients between hourly observed and estimated PM2.5 mass concentrations at four MAX-DOAS sites in the test dataset. The black line and red line are the reference and regression lines, respectively.
Figure 3. Correlation coefficients between hourly observed and estimated PM2.5 mass concentrations at four MAX-DOAS sites in the test dataset. The black line and red line are the reference and regression lines, respectively.
Remotesensing 17 03063 g003
Figure 4. Correlation coefficients between hourly observed PM2.5 mass concentration and aerosol EC at four MAX-DOAS sites. The black line and red line are the reference and regression lines, respectively.
Figure 4. Correlation coefficients between hourly observed PM2.5 mass concentration and aerosol EC at four MAX-DOAS sites. The black line and red line are the reference and regression lines, respectively.
Remotesensing 17 03063 g004
Figure 5. The hourly fluctuation trends of the near-surface aerosol EC and the observed and estimated PM2.5 mass concentrations. The green, red, and blue lines represent the aerosol EC and observed and estimated PM2.5 mass concentrations, respectively.
Figure 5. The hourly fluctuation trends of the near-surface aerosol EC and the observed and estimated PM2.5 mass concentrations. The green, red, and blue lines represent the aerosol EC and observed and estimated PM2.5 mass concentrations, respectively.
Remotesensing 17 03063 g005
Figure 6. Correlation between observed and estimated PM2.5 mass concentrations at different heights based on the XGBoost model.
Figure 6. Correlation between observed and estimated PM2.5 mass concentrations at different heights based on the XGBoost model.
Remotesensing 17 03063 g006
Figure 7. Vertical distribution of annual average PM2.5 mass concentrations and feature importance ranking at four sites.
Figure 7. Vertical distribution of annual average PM2.5 mass concentrations and feature importance ranking at four sites.
Remotesensing 17 03063 g007
Figure 8. (a) Vertical profiles of aerosol EC, NO2, HCHO, and PM2.5. (b) AOD and PM2.5 variation with time.
Figure 8. (a) Vertical profiles of aerosol EC, NO2, HCHO, and PM2.5. (b) AOD and PM2.5 variation with time.
Remotesensing 17 03063 g008
Figure 9. Transport flux calculated using (a) the turbulence coefficient and (b) vertical wind speed at different height levels from 0.001 km to 2.001 km.
Figure 9. Transport flux calculated using (a) the turbulence coefficient and (b) vertical wind speed at different height levels from 0.001 km to 2.001 km.
Remotesensing 17 03063 g009
Figure 10. The contribution rate of transport flux calculated using (a) the turbulence coefficient and (b) vertical wind speed.
Figure 10. The contribution rate of transport flux calculated using (a) the turbulence coefficient and (b) vertical wind speed.
Remotesensing 17 03063 g010
Table 1. Tuning parameters of three models.
Table 1. Tuning parameters of three models.
ModelParametersValues
XGBoostnumber of trees (n_estimators)100
maximum depth of tree (max_depth)9
subsample0.7
learning rate0.1
alpha0.1
lambda3
RFnumber of trees (n_estimators)90
maximum depth of tree (max_depth)10
criterionmse
BPNNhidden layer sizes8
solveradam
activationrelu
maximum iterations1000
init learning rate0.06
Table 2. Model performance on the test set, 10-fold CV, and spatial 4-fold CV.
Table 2. Model performance on the test set, 10-fold CV, and spatial 4-fold CV.
XGBoostRFBPNN
RRMSERRMSERRMSE
Test set0.9113.370.9013.700.8717.11
10-fold CV0.986.520.959.830.8118.46
Spatial 4-fold CV0.6120.710.6421.050.6421.98
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, Q.; Luo, J.; Qin, H.; Xia, S.; Zhang, Z.; Xing, C.; Tan, W.; Liu, H.; Hu, Q. Estimation of PM2.5 Vertical Profiles from MAX-DOAS Observations Based on Machine Learning Algorithms. Remote Sens. 2025, 17, 3063. https://doi.org/10.3390/rs17173063

AMA Style

Li Q, Luo J, Qin H, Xia S, Zhang Z, Xing C, Tan W, Liu H, Hu Q. Estimation of PM2.5 Vertical Profiles from MAX-DOAS Observations Based on Machine Learning Algorithms. Remote Sensing. 2025; 17(17):3063. https://doi.org/10.3390/rs17173063

Chicago/Turabian Style

Li, Qihua, Jinyi Luo, Hanwen Qin, Shun Xia, Zhiguo Zhang, Chengzhi Xing, Wei Tan, Haoran Liu, and Qihou Hu. 2025. "Estimation of PM2.5 Vertical Profiles from MAX-DOAS Observations Based on Machine Learning Algorithms" Remote Sensing 17, no. 17: 3063. https://doi.org/10.3390/rs17173063

APA Style

Li, Q., Luo, J., Qin, H., Xia, S., Zhang, Z., Xing, C., Tan, W., Liu, H., & Hu, Q. (2025). Estimation of PM2.5 Vertical Profiles from MAX-DOAS Observations Based on Machine Learning Algorithms. Remote Sensing, 17(17), 3063. https://doi.org/10.3390/rs17173063

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop