Estimation of the PM 2.5 and PM 10 Mass Concentration over Land from FY-4A Aerosol Optical Depth Data

: The purpose of this study is to estimate the particulate matter (PM 2.5 and PM 10 ) in China using the improved geographically and temporally weighted regression (IGTWR) model and Fengyun (FY-4A) aerosol optical depth (AOD) data. Based on the IGTWR model, the boundary layer height (BLH), relative humidity (RH), AOD, time, space, and normalized difference vegetation index (NDVI) data are employed to estimate the PM 2.5 and PM 10 . The main processes of this study are as follows: ﬁrstly, the feasibility of the AOD data from FY-4A in estimating PM 2.5 and PM 10 mass concentrations were analysed and conﬁrmed by randomly selecting 5–6 and 9–10 June 2020 as an example. Secondly, hourly concentrations of PM 2.5 and PM 10 are estimated between 00:00 and 09:00 (UTC) each day. Speciﬁcally, the model estimates that the correlation coefﬁcient R 2 of PM 2.5 is 0.909 and the root mean squared error (RMSE) is 5.802 µ g/m 3 , while the estimated R 2 of PM 10 is 0.915, and the RMSE is 12.939 µ g/m 3 . Our high temporal resolution results reveal the spatial and temporal characteristics of hourly PM 2.5 and PM 10 concentrations on the day. The results indicate that the use of data from the FY-4A satellite and an improved time–geographically weighted regression model for estimating PM 2.5 and PM 10 is feasible, and replacing land use classiﬁcation data with NDVI facilitates model improvement.


Introduction
With the sustainable development and changes of the economy and society, air pollution, especially particulate matter (PM 2.5 ) pollution, has been paid more and more attention by the government and people in general. PM 2.5 is a major air pollutant that harms human health [1] and is the fifth major health risk factor of the global burden of disease [2]. At present, the most accurate means of PM 2.5 monitoring are ground-based instruments that can obtain real-time data. However, the ground detection stations only detect air quality in a point-like distribution and cannot cover large areas. Moreover, due to the high cost and maintenance requirements, the stations are mainly distributed in urban areas, making it difficult to achieve full coverage. In addition, site data are greatly affected by special circumstances in a small area near the monitoring station, which makes it difficult to take the overall situation into account.
Compared to ground-based measurements, satellite sensor systems have the advantage of regularly collecting data over large areas, and therefore are able to overcome spatial limitations to a large extent. The study of particulate matter mainly revolves around ground observations, while satellite observations do not directly provide measurements from the ground. Studies have shown that satellite-based aerosol optical depth (AOD) is an ideal substitute for PM 2.5 [3]. With the development of satellite data with increasingly  [37]. The number of different land use types refers to the land use and cover change (LUCC) classification system, as shown in Table  1. The black boundary line represents the study area.

PM2.5 and PM10 Data
The ground detection data of PM2.5 and PM10 are from the National Real-time Air Quality Publishing Platform (http://106.37.208.233:20035/ (accessed on 1 July 2020)). Monitoring sites from across the country are employed to collect data, including hourly monitoring values for PM2.5 mass concentrations and PM10 mass concentrations. These include data from over 1600 sites, and the number of sites grows over time. The measurement data for the site is collected every hour, and data with null measurements are removed. The ground monitoring value provides the basis for the establishment of the model and the verification of the results. Additionally, 6131 measurements of PM10 and 6113 measurements of PM2.5 were used for model validation.

Fengyun-4 (FY-4A) Data
The FY-4A satellite was launched on 11 December 2016. Fengyun-4 is a China′s new generation of geostationary meteorological satellite, which has greatly enhanced the capability of monitoring, warning, and forecasting of high impact weather events [38].
Its main payload is the Advanced Geosynchronous Radiation Imager (AGRI). It has 14 spectral bands that are quantised to 12 bits per pixel and sampled at 1 km at the nadir in the visible (VIS), 2 km in the near-infrared (NIR), and 4 km in the remaining IR spectral bands (compared with 1.25 km for VIS, no NIR, and 5 km for IR of FY-2). The archived L1 dataset of FY-4A is available on the National Satellite Meteorological Center (NSMC) satellite data server website (http://satellite.nsmc.org.cn (accessed on 1 July 2020 )).
Xia et al. [39] developed a grid point statistical interpolation (GSI)-based data assimilation system and successfully assimilated FY-4A aerosol optical depth (AOD) data for the first time. AOD data are obtained via the AGRI imager on the FY-4A, together with a Figure 1. The distribution of land use types over China in 2020 [37]. The number of different land use types refers to the land use and cover change (LUCC) classification system, as shown in Table 1. The black boundary line represents the study area. The ground detection data of PM 2.5 and PM 10 are from the National Real-time Air Quality Publishing Platform (http://106.37.208.233:20035/ (accessed on 1 July 2020)). Monitoring sites from across the country are employed to collect data, including hourly monitoring values for PM 2.5 mass concentrations and PM 10 mass concentrations. These include data from over 1600 sites, and the number of sites grows over time. The measurement data for the site is collected every hour, and data with null measurements are removed. The ground monitoring value provides the basis for the establishment of the model and the verification of the results. Additionally, 6131 measurements of PM 10 and 6113 measurements of PM 2.5 were used for model validation.

Fengyun-4 (FY-4A) Data
The FY-4A satellite was launched on 11 December 2016. Fengyun-4 is a China s new generation of geostationary meteorological satellite, which has greatly enhanced the capability of monitoring, warning, and forecasting of high impact weather events [38].
Its main payload is the Advanced Geosynchronous Radiation Imager (AGRI). It has 14 spectral bands that are quantised to 12 bits per pixel and sampled at 1 km at the nadir in the visible (VIS), 2 km in the near-infrared (NIR), and 4 km in the remaining IR spectral bands (compared with 1.25 km for VIS, no NIR, and 5 km for IR of FY-2). The archived L1 dataset of FY-4A is available on the National Satellite Meteorological Center (NSMC) satellite data server website (http://satellite.nsmc.org.cn (accessed on 1 July 2020)).
Xia et al. [39] developed a grid point statistical interpolation (GSI)-based data assimilation system and successfully assimilated FY-4A aerosol optical depth (AOD) data for the first time. AOD data are obtained via the AGRI imager on the FY-4A, together with a joint retrieval algorithm for surface reflectance and aerosol optical thickness using the algorithm proposed and implemented by She et al. [7]. Therefore, daily AOD data is obtained from 00:00 to 09:00 (UTC).

Relative Humidity
Relative humidity (RH) is the percentage of water vapour pressure in the air compared to the saturation water vapour pressure at the same temperature. Ground stations measure particulate concentrations by drying the sampling air stream to remove the effect of moisture on particulate mass concentrations. Obviously, atmospheric humidity also has a significant effect on pollutants in the air, so relative humidity is also used as one of the independent variables in the model. Relative humidity data are obtained from "ERA5 hourly data on single levels from 1979 to present" data (https://cds.climate.copernicus. eu/cdsapp#!/dataset/reanalysis-era5-pressure-levels?tab=overview (accessed on 1 July 2020)). The time resolution of RH data is 1 h. The data are hourly raster data at a resolution of 0.25 • × 0.25 • . Additionally, linear interpolation is used to obtain data at 1 km resolution.

(Planetary) Boundary Layer Height
The planetary boundary layer (PBL), also known as the atmospheric boundary layer or pedosphere, can be defined as the lowest part of the atmosphere, typically ranging anywhere between 100 m and 2000 m above the planetary surface, and its behaviour is directly influenced by its contact with a surface. Boundary layer height has a significant effect on atmospheric particulate matter concentrations [40]. The boundary layer height data used in this study are also from ERA5 of ECMWF, the fifth generation of ECMWF s atmospheric reanalysis of the global climate (https://cds.climate.copernicus.eu/cdsapp#!/ dataset/reanalysis-era5-single-levels?tab=overview (accessed on 1 July 2020)). The data are hourly raster data at a resolution of 0.25 • × 0.25 • . In the data pre-processing, linear interpolation was used to obtain 1 km resolution boundary layer height data for the study area.

Land Use Type
Land use classification data of China in 2020 is collected from the Resource and Environment Data Cloud Platform (http://www.resdc.cn/ (accessed on 1 July 2020)). The data are based on the Landsat TM image of the United States Landsat, which is generated by human visual interpretation with 1 km spatial resolution.
The data include six primary types of arable land, forest land, grassland, water, residential land, and unused land, and 25 secondary types. The values in the raster data are their values, which can be used as the raw data for the input, and the specific classification numbers and names are shown in Table 1.

MODIS NDVI Data
The global MOD13A2 data are available at a spatial resolution of 1 km every 16 days as a gridded level-3 of the sinusoidal projection. As vegetation indices can be used as products to show land cover and land cover change, the normalised vegetation indices from the MOD13A2 data are used in this paper as a proxy for the land use data in the IGTWR model. These products can be obtained from the website https://ladsweb.modaps. eosdis.nasa.gov/missions-and-measurements/products/MOD13A2#overview (accessed on 1 July 2020).

Methods
There have been many studies on geographic data, and the prevailing view is that geographic data are influenced by both their geographical location and time. Hence, Huang et al. [41] propose a geographic time-weighted regression model.
where i is the observation point (i = 1, 2, ..., n, n is the total number of observation points); k is the model parameter (k = 1, 2, ..., d, d is the total number of parameters); y i is the estimated parameter, and x ik represents the independent variable of k at point i; β is the coefficient; u 0 , v 0 , and t 0 represent longitude, latitude and date, respectively; and ε i represents the random error. PM 2.5 concentration levels are not only influenced by time and space but are also linked to subsurface data within the study area. In the study from Xue [36], an improved geographically and temporally weighted regression (IGTWR) model was thereby established (Equation (2)). AOD, RH, and BLH were used as parameter inputs to the model. The relationships between different sample sites are described by temporal (day and hour), spatial (longitude and latitude), and underlying surface data (land use data). In the equations, u 0 , ν 0 , d 0 , h 0 , and l 0 represent longitude, latitude, day, hour, and land use classification data, respectively. d refers to the number of independent variables (FY-4A AOD, relative humidity, and boundary layer height) in this model, and d = 3; i denotes the observation point, i = 1, 2, ..., n, where n is the total number of observation points.
The limited number of classification categories for land use data (a total of 26 land categories are in the LUCC) makes it difficult to quantify the complexity of the subsurface data. Therefore, NDVI was used instead of land use data for the modelling in this paper.
Equation (2) can be written as Equation (3), and vi 0 refers to NDVI: In the regression process, different stations have different weights, so weighting coefficients were calculated using latitude, longitude, date, hour, and NDVI, which were referenced in the model. PM 2.5 and PM 10 concentrations within the study area were fitted using AOD, RH, and BLH as independent variables, combined with the weighting coefficients, so that the objective function can be expressed as Equation (4).
The estimated PM 2.5 or PM 10 Therefore, the PM 2.5 or PM 10 The model also requires the determination of weighting factors based on spatiotemporal data and land use type data. The distance between the estimated point (u 0 , v 0 , d 0 , h 0 , vi 0 ) and any other observed sample data (u i , v i , d i , h i , vi i ) is first defined according to: where λ, µ, and ϕ represent the weighting coefficients, and the weighting function is expressed as a Gaussian function as: where h STL , h S , h T and h L represent spatial, temporal and land use bandwidths, respectively; and ω s i The cross-validation method is employed to select the best bandwidth that provides the smallest residuals.

Results
The improved IGTWR model is applied to the FY-4A data to estimate hourly PM 2.5 and PM 10 mass concentrations for 5-6 and 9-10 June 2020 in mainland China. In this section, the validation of PM measurements were focused on assessing the performance of the improved model. In addition, the daily variation of PM was analysed, and the sudden occurrence of heavy pollution in clean areas such as Xinjiang in June 2020 was explored.

Evaluation of IGTWR Model Applied to FY-4A
In this study, the IGTWR model was used to estimate hourly PM 2.5 and PM 10 mass concentrations during the daytime in China on 5-6 and 9-10 June 2020. Hourly FY-4A AOD from UTC 00:00 to UTC 09:00 (LST 08:00 to LST 17:00), relative humidity, and boundary layer height were employed as input parameters. Longitude, latitude, date, hour, and NDVI were used to calculate the regression weights. The 6049 sample points were used to construct the modelling set for the IGTWR model. The model fitting performance is shown in Figure 2 (left). A ten-fold cross-validation approach was used to assess model performance [42]. Then, 6049 sample points were divided evenly into ten parts. One part was used for validation, and the remaining nine parts were used for training; the process was repeated ten times. Figure 2 shows the cross-validation results (right). For a sample of 6049 points in ten hours, the correlation coefficient R 2 value between predicted and observed PM 2.5 was 0.909, and the root mean squared error (RMSE) of the model fitting was only 5.802 µg/m 3 . In the scatter plot of measured versus predicted values, the points were distributed around a 1:1 line. The cross-validated R 2 was 0.489, and the RMSE was 13.047 µg/m 3 , which was only 7.245 µg/m 3 higher than the RMSE of the model fitting. The slope of the fitting line of the measured-predicted PM 2.5 scatter plot was 0.493. This indicates that PM 2.5 was overestimated for concentrations below 30 µg and underestimated for concentrations above this for short periods. For a sample of 6070 points in ten hours, the R 2 value between predicted and observed PM 10 was 0.915, and the RMSE of the model fitting was only 12.939 µg/m 3 . In the scatter plot of measured versus predicted values, the points were distributed around a 1:1 line. The cross-validated correlation coefficient R 2 was 0.498, and the RMSE was 29.779 µg/m 3 , which was only 16.84 µg/m 3 higher than the RMSE of the model fitting. The slope of the fitting line of the measuredpredicted PM 10 scatter plot was 0.484. This indicates that PM 10 was overestimated for concentrations below 60 µg and underestimated for concentrations above this for short periods.
For a more intuitive comparison with the previous model, the same data are used for the calculations, except that land use data is used instead of NDVI. Figure 3 shows the scatter plots for model fitting results and tenfold cross-validation results for PM 2.5 and PM 10 . For a sample of 6149 points in ten hours, the correlation coefficient R 2 value between predicted and observed PM 2.5 was 0.857, and the root mean squared error (RMSE) of the model fitting was only 7.422 µg/m 3 . In the scatter plot of measured versus predicted values, the points were distributed around a 1:1 line. The cross-validated R 2 was 0.436, and the RMSE was 14.286 µg/m 3 . The slope of the fitting line of the measured-predicted PM 2.5 scatter plot was 0.448. For a sample of 6171 points in ten hours, the R 2 value between predicted and observed PM 10 was 0.858, and the RMSE of the model fitting was only 17.231 µg/m 3 . In the scatter plot of measured versus predicted values, the points were distributed around a 1:1 line. The cross-validated correlation coefficient R 2 was 0.433, and the RMSE was 33.308 µg/m 3 . The slope of the fitting line of the measured-predicted PM 10 scatter plot was 0.432.
The slope of the fitting line of the measured-predicted PM2.5 scatter plot was 0.493. This indicates that PM2.5 was overestimated for concentrations below 30 μg and underestimated for concentrations above this for short periods. For a sample of 6070 points in ten hours, the R 2 value between predicted and observed PM10 was 0.915, and the RMSE of the model fitting was only 12.939 μg/m 3 . In the scatter plot of measured versus predicted values, the points were distributed around a 1:1 line. The cross-validated correlation coefficient R 2 was 0.498, and the RMSE was 29.779 μg/m 3 , which was only 16.84 μg/m 3 higher than the RMSE of the model fitting. The slope of the fitting line of the measured-predicted PM10 scatter plot was 0.484. This indicates that PM10 was overestimated for concentrations below 60 μg and underestimated for concentrations above this for short periods. For a more intuitive comparison with the previous model, the same data are used for the calculations, except that land use data is used instead of NDVI. Figure 3 shows the scatter plots for model fitting results and tenfold cross-validation results for PM2.5 and PM10. For a sample of 6149 points in ten hours, the correlation coefficient R 2 value between predicted and observed PM2.5 was 0.857, and the root mean squared error (RMSE) of the model fitting was only 7.422 μg/m 3 . In the scatter plot of measured versus predicted values, the points were distributed around a 1:1 line. The cross-validated R 2 was 0.436, and the RMSE was 14.286 μg/m 3 . The slope of the fitting line of the measured-predicted PM2.5 scatter plot was 0.448. For a sample of 6171 points in ten hours, the R 2 value between predicted and observed PM10 was 0.858, and the RMSE of the model fitting was only 17.231 μg/m 3 . In the scatter plot of measured versus predicted values, the points were distributed around a 1:1 line. The cross-validated correlation coefficient R 2 was 0.433, and the RMSE was 33.308 μg/m 3 . The slope of the fitting line of the measured-predicted PM10 scatter plot was 0.432.
The difference in the validation results indicates that the model using NDVI to participate in the definition of the generalised distance has a higher R 2 than the previous model. Although this improvement is not particularly significant, it still illustrates the significance of the improved model work. Figure 4 shows the measured and estimated PM2.5/PM10 ratios. In the study area, the mass concentration of PM10 is generally higher than the mass concentration of PM2.5. In the scatter plot, the points are mostly distributed above a 1:1 line, which indicates that the The difference in the validation results indicates that the model using NDVI to participate in the definition of the generalised distance has a higher R 2 than the previous model. Although this improvement is not particularly significant, it still illustrates the significance of the improved model work. Figure 4 shows the measured and estimated PM 2.5 /PM 10 ratios. In the study area, the mass concentration of PM 10 is generally higher than the mass concentration of PM 2.5 . In the scatter plot, the points are mostly distributed above a 1:1 line, which indicates that the measured PM 2.5 /PM 10 ratio is usually somewhat higher than the estimated ratio.

Hourly PM 2.5 and PM 10 Concentrations in China
AOD, RH, BLH, and other auxiliary data and calculation methods used by the two pollutants are similar, while the ground monitoring data are different. The calculation of PM is highly dependent on the coverage of the AOD and its temporal and spatial resolution. Thus, for PM 2.5 and PM 10 estimation, hourly and 1 km resolution can be achieved. Figures 5 and 6 show the spatial distribution of model-estimated mean PM 2.5 and PM 10 concentrations in China at different times of the day at 5-6 and 9-10 June 2020, respectively. For most of the study areas, the PM 2.5 concentrations are below 90 µg/m 3 . Additionally, the PM 10  . This is similar to but not entirely consistent with previous studies [25,37]. Before sunrise, the height of the atmospheric boundary layer is relatively low, and aerosols are concentrated in the lower layers of the atmosphere. This results in relatively high PM 2.5 concentrations near the ground. The atmosphere is then heated with increasing radiation from the height of the sun, leading to expansion and thermodynamic uplift, with a corresponding rise in the boundary layer [43,44]. As a result, fine particulates in the atmosphere also move into the upper atmosphere, leading to a gradual decrease in PM 2.5 concentrations at ground level. This is due to the short time series used in this study, which is insufficient to form a more obvious statistical pattern. The variation in PM does not follow the same clear daily cycle as the BLH, because the relationship between PM concentrations and PBL varies considerably with geographical location, season, and other meteorological conditions [45]. Figures 7 and 8 show the daily daytime averages of PM 2.5 mass concentration and PM 10 mass concentration for each day and the corresponding station data, respectively. The first column in each figure shows the data estimated by the model, and the second column shows the station mean data (columns 1 and 2, from left to right). The station averages are the mean of the hourly measurements from 08:00 to 17:00 each day, where unusable data were excluded. The model-estimated PM concentrations are generally consistent with the spatial and temporal distribution of PM concentrations observed at ground level. Spatially, the higher PM 2.5 pollution concentrations are found in central China, including Shaanxi, Shanxi, Henan, and Shandong, similar to previous studies [25]. However, higher pollution was observed in Xinjiang and the Tibetan-Qinghai Plateau, which is more exceptional in June 2020. Figure 9 shows the AQI change graphs for selected cities in Tibet, Xinjiang, and Qinghai in June 2020. AQI (air quality index) describes how clean or polluted the air is, with associated health effects. The AQI is calculated by the Chinese Environmental Protection Bureau based on air quality evaluation standards by monitoring sulphur dioxide, nitrogen dioxide, PM 10 , PM 2.5 , carbon monoxide, and ozone. AQI enables air quality to be presented in a uniform evaluation standard [46]. Currently, the main influences on AQI are PM 2.5 and PM 10 . The specificity of the air quality in these regions from 5 to 10 June 2020 can be seen. Unfortunately, the raw data for the 7th and 8th are missing due to the cloud, so we have chosen 5-6 and 9-10 June 2020 for the study. The Tibetan-Qinghai Plateau and the area around the Tian Shan Mountains are clean areas with little anthropogenic pollution, and the main source of PM is soil dust. Therefore, the high concentration levels reported during the study time are due to natural causes. In central and eastern China, where air pollution is relatively high, a decreasing gradient from north to south can be seen, consistent with previous studies (e.g., [47][48][49]   The analysis in this section shows that the model is sensitive to the estimates of the ground-level hourly PM2.5 and PM10 concentrations. Additionally, it can predict the following pollution trends.

Comparison with Previous Studies
The relationship between AOD and PM2.5 concentration is known to be more complex due to the influence of multiple factors (e.g., aerosol type, meteorological variables). IGTWR is a traditional data analysis method that provides a more straightforward representation of the relationship between PM and various variables than the emerging machine learning approach. Moreover, compared to machine learning, IGTWR is less prone to overfitting. The present study improves the quality of the model by defining the generalised distance with NDVI. Additionally, it employs the improved model to estimate hourly PM2.5 and PM10 concentrations in China. The research conducted by Xu et al. showed that the quality of the model to estimate PM2.5 could be improved by introducing NDVI into the regression model [34]. The CV of the IGTWR model employed in the present study indicated that the model showed an excellent estimation ability of PM2.5 and PM10 concentrations. This advantage is reflected in statistical indicators. Specifically, the R 2 was 0.909 and 0.915, and the RMSE was 5.802 μg/m 3 and 12.939 μg/m 3 for PM2.5 and PM10, respectively. In this regard, the model employed in the present study was superior to previous models, in which FY-4A AOD data were used to estimated PM2.5 concentration. Jiang et al. [35] estimated PM2.5 concentrations using physical methods and FY-4A AOD with a maximum R 2 of 0.39, but the values were still over-or underestimated. Mao et al. [8] used the random forest algorithm and FY-4A AOD for the estimation of PM2.5 in China. Additionally, they found that the use of the FY-4A improved 17% spatial coverage compared to the Himawari-8-based PM2.5 retrievals. In addition to the satisfying performance and advantages of the method employed in the present study, the model developed for estimating hourly PM concentrations can provide detailed information on the daily cycle of PM concentrations at a good spatial resolution, which can help us to facilitate an in-depth study of PM2.5.

Potential Limitations and Room for Model Improvement
Although this model can predict PM2.5 and PM10 concentrations well, there are still several potential limitations, and there is scope for future improvements. Reanalysis data are commonly used in most studies due to the lack of high spatial and temporal resolution PBLH and RH observations. The raw resolution of the PBLH and RH product is only 0.25° The analysis in this section shows that the model is sensitive to the estimates of the ground-level hourly PM 2.5 and PM 10 concentrations. Additionally, it can predict the following pollution trends.

Comparison with Previous Studies
The relationship between AOD and PM 2.5 concentration is known to be more complex due to the influence of multiple factors (e.g., aerosol type, meteorological variables). IGTWR is a traditional data analysis method that provides a more straightforward representation of the relationship between PM and various variables than the emerging machine learning approach. Moreover, compared to machine learning, IGTWR is less prone to overfitting. The present study improves the quality of the model by defining the generalised distance with NDVI. Additionally, it employs the improved model to estimate hourly PM 2.5 and PM 10 concentrations in China. The research conducted by Xu et al. showed that the quality of the model to estimate PM 2.5 could be improved by introducing NDVI into the regression model [34]. The CV of the IGTWR model employed in the present study indicated that the model showed an excellent estimation ability of PM 2.5 and PM 10 concentrations. This advantage is reflected in statistical indicators. Specifically, the R 2 was 0.909 and 0.915, and the RMSE was 5.802 µg/m 3 and 12.939 µg/m 3 for PM 2.5 and PM 10 , respectively. In this regard, the model employed in the present study was superior to previous models, in which FY-4A AOD data were used to estimated PM 2.5 concentration. Jiang et al. [35] estimated PM 2.5 concentrations using physical methods and FY-4A AOD with a maximum R 2 of 0.39, but the values were still over-or underestimated. Mao et al. [8] used the random forest algorithm and FY-4A AOD for the estimation of PM 2.5 in China. Additionally, they found that the use of the FY-4A improved 17% spatial coverage compared to the Himawari-8-based PM 2.5 retrievals. In addition to the satisfying performance and advantages of the method employed in the present study, the model developed for estimating hourly PM concentrations can provide detailed information on the daily cycle of PM concentrations at a good spatial resolution, which can help us to facilitate an in-depth study of PM 2.5 .

Potential Limitations and Room for Model Improvement
Although this model can predict PM 2.5 and PM 10 concentrations well, there are still several potential limitations, and there is scope for future improvements. Reanalysis data are commonly used in most studies due to the lack of high spatial and temporal resolution PBLH and RH observations. The raw resolution of the PBLH and RH product is only 0.25 • × 0.25 • , which may affect the estimation accuracy of hourly PM concentrations.
Liu et al. believed that model performance would be improved if high-frequency meteorological data were available [50]. Moreover, Gui et al. constructed a virtual ground-based PM 2.5 observation network by using an extreme gradient boosting (XGBoost) model with high-density meteorological observations as the main predictor. Additionally, results showed that this network had great potential for reconstructing historical PM 2.5 concentrations [50]. This is a significant improvement to the application of the model employed in the present study to historical situations.
The present model is limited by the missing AOD and the necessity to use PM measurements. More specifically, PM concentrations cannot be estimated in the missing AOD fraction. Yin et al. proposed a method to estimate PM 2.5 distribution with the Himawari-8 top-of-atmosphere reflectance (TOAR) over China, effectively estimating PM 2.5 in areas where valid AOD observations are not available. The spatial and temporal coverage of PM 2.5 estimated by TOAR is approximately four times higher than that based on AOD data [51]. Park et al. also developed an estimation model for spatially continuous AOD and PM that can be used for all-sky conditions [5]. All these studies have significant implications for the improvement of the present model.

Conclusions
In this study, the IGTWR model is employed as the basis for model improvement by adding NDVI data to change the definition of the generalised distance. The mass concentrations of PM 2.5 and PM 10 are estimated using FY-4A data with other ancillary data. The results show that FY-4 combined with IGTWR model to estimate PM 2.5 and PM 10 concentrations in China accurately. From the cross-validation results, it appears that the estimation effect of this model on PM 2.5 is better than that on PM 10 . Further research will investigate this and improve the model to adapt it to PM 10 estimation.
The PM mass concentrations estimated throughout the study were validated against measurements from ground stations and compared qualitatively with the results of the model before the improvement. The PM 2.5 and PM 10 estimates were in good agreement with ground station measurements, with correlation coefficients of 0.909 and 0.915, respectively. The predicted and measured values of PM 2.5 and PM 10 concentrations are consistent across the study area.
Unlike the models used in previous studies, the IGTWR model redefines distance in a broad sense using land use classifications. Hu et al. [52] used spatial distance as a weighting parameter; later, Huang et al. [41] extended this distance to geographic and temporal distances by introducing days into the model. The IGTWR model is used for this study. Date, hour, and NDVI are used to construct the generalised distances. Compared to the previous study by Xue et al. [36] the model effect is reduced by extending the area from central and eastern China to the whole country, adding areas with sparser ground stations. As shown in Figures 7 and 8, the small area covered by the AOD is due to the absence of the bright surface area itself and cloud cover. The AOD data cover fewer ground stations, resulting in a training sample of just over 6000 points. One of the reasons for the overfitting is the sparse number of sample points, so the tenfold cross-validation result is worse than the model fitting result. The ability of the model to predict particulate matter has been reduced, but the overall picture remains within acceptable limits. This is due to the characteristics of the GWR model.
Future research must address several issues. Firstly, fixed bandwidth is used in this model, but it is unstable. Subsequent studies may try to use dynamic bandwidth to improve its stability. Secondly, the reason for the weaker prediction of PM is the lack of sample points. As the duration of the study increases with the quality of the FY-4A data, it can be expected that this method is able to estimate ground-level PM concentrations with higher confidence at seasonal levels or higher temporal resolutions. Thirdly, the redefinition of the generalised distance is the key point for improving the GWR in Equation (7). Therefore, the use of other types of data to improve the definition of generalised distances (e.g., DEM, etc.) and to make use of the data provided by FY-4A will be an interesting topic for future