Next Article in Journal
Prenatal Bisphenol a Exposure, DNA Methylation, and Low Birth Weight: A Pilot Study in Taiwan
Previous Article in Journal
A Novel Guided Zygomatic and Pterygoid Implant Surgery System: A Human Cadaver Study on Accuracy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

High Temporal Resolution Land Use Regression Models with POI Characteristics of the PM2.5 Distribution in Beijing, China

School of Environment, Beijing Normal University, Beijing 100875, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2021, 18(11), 6143; https://doi.org/10.3390/ijerph18116143
Submission received: 19 March 2021 / Revised: 10 May 2021 / Accepted: 11 May 2021 / Published: 7 June 2021
(This article belongs to the Section Environmental Science and Engineering)

Abstract

:
PM2.5 is one of the primary components of air pollutants, and it has wide impacts on human health. Land use regression models have the typical disadvantage of low temporal resolution. In this study, various point of interests (POIs) variables are added to the usual predictive variables of the general land use regression (LUR) model to improve the temporal resolution. Hourly PM2.5 concentration data from 35 monitoring stations in Beijing, China, were used. Twelve LUR models were developed for working days and non-working days of the heating season and non-heating season, respectively. The results showed that these models achieved good fitness in winter and summer, and the highest R2 of the winter and summer models were 0.951 and 0.628, respectively. Meteorological factors, POIs, and roads factors were the most critical predictive variables in the models. This study also showed that POIs had time characteristics, and different types of POIs showed different explanations ranging from 5.5% to 41.2% of the models on working days or non-working days, respectively. Therefore, this study confirmed that POIs can greatly improve the temporal resolution of LUR models, which is significant for high precision exposure studies.

1. Introduction

Air pollution problems are becoming increasingly serious worldwide. The primary pollutants found in cities include sulfur dioxide, ozone, nitrogen oxides, and inhalable and fine particulate matter. Fine particulate matter is an important air pollutant, which is considered the primary environmental risk factor, and it is the fourth leading risk factor for death and disability in China [1]. In particular, PM2.5 has the most significant effect. Long-term exposure to PM2.5 pollution can have a severe impact on human health. Numerous epidemiological studies have shown that PM2.5 is closely related to a variety of cerebrovascular diseases, respiratory diseases, and immune system diseases, as well as long-term or short-term mortality in hospitals [2,3,4,5]. According to a report in 2014, the PM2.5 concentration in 90% of Chinese cities had exceeded the standard of 35 μg·m−3, and the average number of exceeded days was 246. Therefore, studies that investigate PM2.5 are of great significance for local pollution prevention and targeted health prevention measures.
Various modeling methods, such as spatial interpolation [6], the atmospheric dispersion model [7], satellite inversion [8], and deep learning [9,10,11], have been used to simulate the distribution of regional pollutants. However, many methods, such as diffusion models, require a large amount of data difficult to obtain, and it is difficult to achieve high-precision under the condition of a lack of input data resources [12]. The land use regression (LUR) model is a multivariate regression modeling method based on the observation concentration of air pollutants and its surrounding geographical factors. This method is widely used to estimate the concentrations of outdoor air pollutants since it was improved by Briggs et al. [13]. Due to its high accuracy and relatively low investment [14], LUR models have a wide range of applications. Beelen et al. [15] used an LUR model in the ESCAPE (European Study of Cohorts for Air Pollution Effects) project, which explained the variability of annual NO2 and NOx concentrations. Ma et al. [16] used multi-scale LUR models to simulate the NO2 concentration in Auckland, which performed better than the universal kriging (UK) model and the inverse distance weighting (IDW) and ordinary kriging (OK) models. Yang et al. [17] developed LUR models to predict ultrafine particle concentrations in London, and the results showed that the LUR models had moderate to good performances within these areas. Saucy et al. [18] used warm and cold season LUR models for NO2 and PM2.5 concentrations in peri-urban areas in South Africa, and it was demonstrated that the models could be successfully applied in local areas. Weissert et al. [19] developed a microscale LUR model for a heavily trafficked road in Auckland, New Zealand. This research represented the expansibility of small-scale variability in pollutant concentrations. In China, LUR modeling has been shown to have methodological advantages and has become increasingly popular in air pollution studies in recent years [20,21,22].
In human exposure and health risk assessment areas, LUR models are often used to simulate exposure concentrations [23]. High temporal resolution pollution data can improve the precision of epidemiological studies and human exposure research. Although LUR is a reasonable and reliable modelling approach, to some degree it has the disadvantages of low temporal resolution and poor spatial portability [16,24]. The most common spatial predictors in LUR models include elevation, population, road network, and land use. Many studies have confirmed that these predictors can explain most of the models and have strong correlations with pollutant concentrations [20,25,26].
City pollution sources and functions vary from region to region due to the different characteristics of urban layouts [27,28]. In particular, various urban functional areas are comprised of various points of interest (POIs) [29]. Studies have confirmed that different functional areas with different types of POIs have differences in air pollution characteristics and attractiveness to populations [30,31,32]. POIs show typical functional features at different times. For example, people tend to congregate in residential areas in the evening and gather in office places for working during the daytime. Therefore, this may result in different POI predictors related to pollutants at different times. That means that if POI predictors are added into LUR models, this may improve the precision and temporal–spatial resolution. Nori-Sarma et al. [26] used LUR models to predict urban NO2 exposure in Mysore, India. The study showed industrial sites and religious POIs, and these high human activity POIs were associated with higher levels of NO2. Lu et al. [33] developed three sets of LUR models and the independent variables were common land use data, local business permit data, and Google POI data. The results showed that models that used the Google POI data performed the best.
Beijing is one of the cities with the worst air pollution measures in northern China. Although Beijing has made phased progress in reducing the air pollution problem in recent years, it still exceeds the minimum limit of the China national standard (GB 3095-2012). Unlike previous studies that have used average annual and seasonal pollutant concentrations, this study uses hourly pollution concentrations from the heating season, the non-heating season, working days, and non-working days. In these LUR models, in addition to conventional predictors, such as land use, meteorological factors, population and elevation, POIs are also included. This study aims to improve the temporal resolution and model accuracy and simulate the PM2.5 distribution in Beijing during different time circumstances, and to provide guidance and reference value for continued LUR and exposure studies.

2. Materials and Methods

2.1. Study Area

The Beijing, Tianjin, and Hebei regions are the most polluted areas in northern China. Beijing is the capital of China, and it has representativeness and data availability. It has 16 districts with a total area of 16,410.54 square kilometers at a latitude of 39°26′–41°03′ and longitude of E115°25′–117°30′. The permanent resident population is 21.536 million, and the urban population is 18.65 million, accounting for 86.6 percent of the total urban population. The terrain of Beijing primarily consists of plains and mountainous areas, and is high in the northwest and low in the southeast. Beijing is surrounded on three sides by mountains to the west, north, and northeast, and to the southeast by a plain that slopes gently toward the Bohai Sea. The terrain conditions make it easy for various pollutants to accumulate and difficult to diffuse. Beijing has a warm temperate semi-humid and semi-arid monsoon climate, with high temperatures and rain in summer and cold and dry weather in winter.

2.2. PM2.5 Data

Beijing has 35 air quality monitoring stations. There are 24 environmental assessment and control stations, six regional stations, and five traffic monitoring stations. Environmental assessment stations represent the overall level and variations in the regional environment. Six regional stations represent the level of regional concentrations, and the five traffic monitoring stations were used to measure the influence of road traffic pollution sources on the ambient air quality distributed in different ring lines and primary roads. The locations of the monitoring stations are shown in Figure 1. The monitoring stations were primarily concentrated in the central urban area and were distributed evenly in the suburbs. The hourly PM2.5 concentration data came from the Beijing Municipal Environmental Protection and Testing Center (http://www.bjmemc.com.cn (accessed on 10 December 2019)).
Hourly pollutants concentration was used to develop LUR models. Because of its high temporal resolution, PM2.5 distribution showed huge differences due to season and time characteristics. Considering high time scale POI influence and corresponding to meteorological condition variation, this study chose working days and non-working days from winter heating season and summer non-heating season which were randomly chosen at 0:00, 8:00, and 18:00. These corresponded to midnight, morning rush, and evening rush. The average PM2.5 concentration at these three times on 3 August 2019 represented a summer non-working day. 6 August 2019 represented a summer working day. In the same way, 12 and 14 January 2019 represented a winter non-working day and working day, respectively. Missing values from some stations at a particular moment were removed. Given the above, 12 LUR models in total were developed.

2.3. Predictors and Data Source

Source apportionment studies have shown that PM2.5 pollutants are primarily caused by various sources, including traffic, coal combustion, biomass burning, industrial sources, and dust [34,35]. In addition, meteorological conditions, local emissions, regional transport, and complex topography can also affect the formation of pollutants [36,37]. This study summarized the common geographic variables used in previous studies. Buffer sizes with radii of 300 m, 500 m, 600 m, 700 m, 800 m, 900 m, 1000 m, 1300 m, 1500 m, 2000 m, and 2500 m centered from the monitoring station were established. The predictors included land use area, road length, number of different POIs, industrial pollution sources, meteorological factors, elevation, and population. Specific details of these predictors can be seen in Table 1.
Land use data was obtained from the Chinese resource and environment data cloud platform (http://www.resdc.cn/ (accessed on 10 December 2019)). Beijing primarily contains the following six land types: arable land, garden land, woodland, grassland, commercial service land, industrial and mining warehousing land, and residential land. Meteorological data, including temperature, relative humidity, air pressure, and wind speed, came from the China meteorological science data center (http://data.cma.cn/en (accessed on 10 December 2019)). Meteorological data from 33 stations in Beijing (15 stations) and its surrounding areas (18 stations in Hebei and Tianjin) were utilized. The Kriging interpolation method was used to simulate the annual mean values of the 35 monitoring stations. The road network data came from the National Geomatic Center of China (http://www.ngcc.cn/ngcc/ (accessed on 10 December 2019)). These data contained the name, type, length, and other information on all the roads in Beijing. While vehicle flow information was not available, the length of roads within the buffer zone of the monitoring station is a good substitute [17,38]. By considering traffic flow and geographical location information, the roads were divided into two categories. Expressways, national roads, provincial roads, county roads, and urban expressways were merged into primary roads, township roads, and pedestrian roads, and the other roads were merged into the secondary roads. The length of the various roads in each buffer zone were obtained as road factors using the GIS spatial superposition method. The POI data was obtained using an application called the Baidu map using Python. There were 13 types of POI, including catering services, scenic spots, public facilities, companies, shopping places, transportation, financial banks, science and education places, commercial and residential housing, life services, sports and leisure, medical care, and government agencies. The Shuttle Radar Topography Mission (SRTM) digital elevation data (90 m spatial resolution) was used as digital elevation model (DEM). The population density data was based on the population distribution dataset of a 1 km grid in 2015. The industrial pollution sources data was obtained from the website of the Ministry of Environmental Protection. This contained basic information on the national key monitoring enterprises of waste gas.

2.4. Land Use Regression Modelling

The land use regression (LUR) model is based on the least square method to establish a regression equation between air pollutants at monitoring points and the characteristic predictors of the surrounding environment near these points. Then, by using the characteristic predictors of the surrounding environment at unknown points, the model algorithm of pollutant concentrations at unknown points can be estimated by the equation. The model equation is as follows:
y = β0 + β1×1 + β2×2 + … + βnxn + α,
where y represents the PM2.5 concentration value; x1, x2, …, xn represents the influence factors in the model; and α is a random variable. First, a correlation analysis was conducted between the PM2.5 data and corresponding predictors of the station. In this study, the Spearman correlation coefficient was used as the standard to select these predictors. To reduce the possibility of collinearity between variables belonging to the same category, the predictor with the highest correlation coefficient in each category of the buffers was kept, and the rest were eliminated [39,40]. A stepwise linear regression was performed on the screened predictors to obtain the multiple linear regression equations. The significance level (p-value < 0.05) and variance inflation factor (VIF < 5) of each predictor that remained in the models were used to check their significance levels and ensure no issues of multicollinearity. Finally, the models’ fitness degrees were judged using R2, and the root-mean-square error (RMSE) was used to evaluate the prediction accuracy of the models. In addition, the partial R2 of each predictor was used to judge the overall explanation of the model by this factor.

2.5. Validation and Evaluation

This study used the 10-fold cross validation method. The principle of the 10-fold cross validation is to divide the entire data set into 10 parts, of which nine were selected as the training set and one as the prediction set. The RMSE and the adjusted R2 were used to evaluate the predictive accuracy and fitness of the models through many times of repeat training. R software was used to conduct all of the statistical analyses.
When the final LUR model was obtained, the regression equation was used to simulate the concentration of pollutants at non-monitored points. This process is called regression mapping [41]. This method can better simulate the spatial distribution of air pollutants. A grid of 3 km and 3 km within the research area was generated, and the corresponding impact factor values of each grid point were obtained. These were then placed into the obtained multiple regression equation to predict the PM2.5 concentration. Then, the Kriging interpolation method was used to simulate and generate the spatial distribution of the predicted PM2.5 concentration at different times during working days and non-working days in winter and summer in the study area.

3. Results

3.1. Descriptive Statistics for the PM2.5 Data

Table 2 shows the basic descriptive statistics of the PM2.5 at different times during winter working days, non-working days, summer working days, and non-working days. The PM2.5 pollutant data of each district at different times can be seen in Table S1. As a whole, there was a large difference in the study area in the PM2.5 concentrations between winter and summer. According to China’s GB 3095-2012, it can be seen that the pollution concentration during the two winter days seriously exceeded the standard, and the average PM2.5 value ranged from 95.9 to 323.4 μg·m−3. The median values of the six moments in the two winter days were generally high, the upper and lower quartile intervals had a larger span, and the PM2.5 concentration values of the different monitoring sites had a greater degree of dispersion. In comparison, the PM2.5 concentration did not exceed the standard at six times on two summer days. The average PM2.5 value was between 22.4–46.6 μg·m−3, the maximum was between 42–69 μg·m−3, and the minimum was between 9–25 μg·m−3. In general, the PM2.5 concentration at the different monitoring stations in the study area was discrete.
After a normality test of the several sets of data, a Student’s t-test and Mann–Whitney U test were performed on the PM2.5 concentration at three times of the day according to the normal distribution of the PM2.5 concentration. During winter, there was no significant difference between 0:00 and 18:00 on winter workdays (p > 0.05). In addition, there were significant differences between 0:00 and 8:00 and between 8:00 and 18:00 (p < 0.05). There were significant differences in the PM2.5 concentration of the three non-working days in winter (p < 0.01). It can be seen that in winter, whether it was a working day or a non-working day, the PM2.5 concentration at the three times all presented a trend, with the strongest at 18:00 and the lowest concentration at 8:00. There was no significant difference between 0:00 and 18:00 in summer working days and non-working days (p > 0.05), but there were significant differences between 0:00 and 8:00 and 8:00 and 18:00 (p < 0.05). This meant that at different times in one day there were large differences in the PM2.5 concentrations at the same station.

3.2. LUR Models

Correlation analysis results can be seen in Table S2. Table 3 shows the results of a total of 12 LUR models at three times during winter working days, non-working days, summer working days, and non-working days. Overall, the fitness of the LUR models at various moments in winter were better than in summer, with the adjusted R2 ranging from 0.644 to 0.951, and the 10-fold cross-validation adjusted R2 ranging from 0.671 to 0.947. The adjusted R2 of the summer time models were between 0.312 and 0.628, and the 10-fold cross-validation adjusted R2 were between 0.299 and 0.605. In summary, the R2 varied greatly at different times, even in the same season. Compared with the summer LUR models, the RMSE of the winter models were generally larger. The winter models’ RMSE ranged from 19.79 to 50.76 μg·m−3, and the 10-fold cv RMSE were between 14.52 to 55.42 μg·m−3. This was related to winter’s higher concentration of pollutants in the study area. In the summer models, the RMSE were between 5.08–8.24 μg·m−3, and the 10-fold cv RMSE were between 4.84–9.55 μg·m−3. For the working days and non-working days during the same season, the R2 and RMSE gaps were not significant.
Table 4, Table 5, Table 6 and Table 7 show the final predictors of 12 LUR models. The final predictors retained in the LUR models included meteorological factors, roads, land use types, POIs, and elevation according to the correlation analysis. The partial R2 represented the contribution of the predictive variables in each model to the model. The meteorological factors were included in nearly all the models. During winter, the LUR models at different times and relative humidities were included in all six models, and this explained 54–86% of the variations. The second meteorological factor was temperature. Unlike the winter models, the meteorological factors had little effect on the summer LUR models, although the relative humidity and temperature were both included. This may have been due to the different dominant factors of the PM2.5 concentration in the different seasons. At 0:00 on the summer working days, the temperature explained 19.3% of the variations in the model. However, at 0:00 and 8:00 on non-working days in summer, the temperature and relative humidity only explained 6.9% and 6.3%, respectively, of the variations in the models. Land use and road data also explained a large portion of the model results. Grassland was included in the winter and summer models, accounting for 5.6% and 6.3% of the variations in the models, respectively, both of which were negatively correlated with the PM2.5 concentrations. Road data were included in both the winter and summer models; at 8:00 a.m. on winter and summer working days, and at 18:00 on summer non-working day. The models that included road data were both in the morning and evening peaks and explained 3.5%, 15.6%, and 8.3% of the variations in the models.
The POIs were included in the models of both seasons and had a certain influence on the PM2.5 concentrations. Eleven types of POIs were included: residential housing, catering services, life services, scenic spots, medical care, public facilities, science and education places, sports and leisure, shopping places, government agencies, and financial banks. The results showed that on working days or non-working days in the different seasons, different POIs had a certain influence on the PM2.5 concentration, and their relationships to the pollutants at specific moments had diversity. Compared with the summer results, the number of POIs included in the winter models was smaller because the relative humidity contributed most to the winter models. At 0:00 on a winter working day, sports leisure within 2500 m explained 7.3% of the variation in the model. At 0:00 on non-working days in winter, such POIs explained 20.1% of the variation in the model. At 8:00 and 18:00 on non-working days in winter, catering services and shopping places were included in the models, which explained 5.5% and 6.8%, respectively, of the variation. Furthermore, POIs were included in all the summer models, and these explained a great deal of the variations. At 0:00 on summer working days and non-working days, commercial residences explained 41.2% and 39.8% of variation in the model, respectively. This was consistent with the feature that most people gather in residential areas at midnight. At 8:00 on a summer working day, financial banks within 3000 m, government agencies within 300 m and life service within 300 m explained 8.2%, 6.5%, and 5.9% of the variation in the model, respectively. At 18:00 on a summer working day, all of the contributions to the PM2.5 pollutants came from the POIs, including commercial residences and scenic spots, explaining 25.7% and 9.7% of the variation in the model, respectively. At 8:00 on a non-working day in summer, the POIs were still important sources of PM2.5 pollutants. Medical care and public facilities explained 45.6% and 11.6% of the variation in the model, respectively. At 18:00 on summer working days, science and education places explained 51% of the LUR model. In summary, POIs have greatly improved the time resolution of LUR models for different moments.

3.3. PM2.5 Evaluation and Distribution

The center value of each grid point in the study area was estimated according to the regression equation of the LUR models. The Kriging interpolation method was used to predict the spatial distribution of PM2.5 in the study area. Figure 2 shows the temporal and spatial distribution of the PM2.5 at different times during winter working days and non-working days in Beijing. It can be seen that distribution of PM2.5 in study area had obvious temporal and spatial heterogeneity. Even in the different seasons, the pollutants were higher in the southeast and lower in the central urban and northwest mountainous areas. This was consistent with the high terrain in the northwest and gentle terrain southeast of the study area. This feature was more significant in winter. In summer, the high PM2.5 concentration is primarily concentrated in urban areas and south of the city, but it was more obvious in urban centers.
Figure 3 shows the Kriging’s predicted PM2.5 data compared with the actual values. The scatter plots of the predicted versus observed PM2.5 data are displayed. Although there were some outliers, as a whole it can be seen that the explanatory ability of the Kriging expansion was basically the same or slightly improved compared to that of the LUR models, according to a comparison of the PM2.5 concentration predicted values with actual values by Kriging interpolation. By using the linear regression prediction of the PM2.5 concentration at the different moments, the prediction values of 0:00, 8:00, and 18:00 on winter working days ranged from 9.27–288.29 μg·m−3, 15–236.51 μg·m−3, and 10–373.33 μg·m−3 respectively, with mean values of 160.11 μg·m−3, 106.98 μg·m−3, and 180.02 μg·m−3, respectively. The relative error between the forecast mean value and the actual value was −0.94%, 11.63%, and 1.66%, respectively. The prediction value at 0:00, 8:00, and 18:00 during the winter non-working days ranged from 39.46–423.12 μg·m−3, 67.60–322.18 μg·m−3, and 85.89–612.17 μg·m−3 respectively, with mean values of 205.84 μg·m−3, 150.53 μg·m−3, and 323.52 μg·m−3, respectively. The relative error between the predicted means and actual values were −2.77%, 0.35%, and 0.06%, respectively. In the winter models, the monitoring stations for the predicted maximum and minimum PM2.5 values were consistent with the monitoring stations for the actual maximum and minimum values. However, there were more outliers in summer working days and non-working days which illustrated summer models ‘predictive effect. The prediction effect during summer time was worse than the original LUR models. The prediction ranges of the PM2.5 concentration at 0:00, 8:00, and 18:00 during the summer working days were 12.22–47.91 μg·m−3, 33.46–72.21 μg·m−3, and 39.87–59.56 μg·m−3, with mean values being 33.36 μg·m−3, 52.04 μg·m−3, and 45.98 μg·m−3, and relative prediction errors of −3.26%, 11.68%, and 0.36%, respectively. The predicted range of the PM2.5 concentration at 0:00, 8:00, and 18:00 on the non-working days during summer were 20.87–40.87 μg·m−3, 22.50–55.43 μg·m−3, and 25.73–55.47 μg·m−3, with mean values of 30.36 μg·m−3, 37.31 μg·m−3, and 37.01 μg·m−3, respectively. The relative errors with the actual values were −0.03%, −1.82%, and −2.82%, respectively. In summary, the prediction accuracy of the summer models was relatively high, but the predicted maximum and minimum PM2.5 stations were not completely consistent with the actual stations due to the small difference in the overall PM2.5 concentration distribution during summer in the study area.

4. Discussion

The LUR models in this study well explained the temporal and spatial variations in the PM2.5 pollution in the study area, and hourly time precision was achieved. The winter models fitted best, and the explanatory degree of the models at several different moments ranged from 67.5% to 95.5%. These models were highly comparable compared with the LUR models utilized in previous studies. The explanation of these models was higher than the 53% explanation of the annual PM2.5 LUR simulation for the United States [42] and higher than 11.4–46.5% of the seasonal average PM2.5 models for Bangkok, Thailand [43]. In China, the model explanations were higher than that of Cai et al. [44] to explain 65% of the spatial variability in the PM2.5 in Taizhou, China and higher than the explanation of 61% of the PM2.5 of the Liaoning central urban agglomeration in the model of Shi et al. [20]. This difference can typically be explained by the measuring concentrations, predictive variables, and original variability of the geographical and socio-economic characteristics of the study area [45].
The models for the two seasons were quite different. PM2.5 pollutants in Beijing primarily originated from man-made emissions, including coal combustion, gasoline and diesel vehicle emissions, secondary source pollution, and straw burning on farmlands. Winter is heating season in Beijing. Due to coal combustion from Beijing and its surrounding areas, as well as less vegetation than in summer [46], the concentration of PM2.5 in winter was significantly higher than in summer. In addition, Beijing is affected by the high pressure from Mongolia during winter, which leads to the local accumulation of pollutants. During summer, the source of fine particulate matter primarily originates from regional traffic sources, of which there is also a portion from cross-regional transportation. Lv et al. [47] conducted a high-time resolution particle source apportionment in Beijing showed that Baoding and Langfang in Hebei Province contributed significantly to the short-distance transportation, and this portion was not included in these models. This demonstrated that the explanation of several summer models in this study ranged between 35.4% and 66.8%, which was worse than that in winter. These results were consistent with the seasonal models of Shi et al. [20] in which their LUR models explained 61% of the variability in winter and 52% of the variability in summer. The cross-regional transport of traffic pollution sources also explained why the models at 0:00 were greater than 8:00 on the same working days in this study. This was because during the morning rush hour at 8:00, many people from the surrounding areas enter Beijing for work, and the traffic influence produced by this portion could not be included in the models. However, the PM2.5 pollutants at 0:00 were primarily from local sources, and nearly each impact factor was included in the model.
Previous studies have generally used the RMSE to determine the accuracy of the models [45], but for LUR models with different spatial and temporal scales, a direct comparison of the RMSE values may have great variability. In this study, the RMSE in the winter models was generally larger than that in summer, which was due to the different background values of the pollutants in the different seasons and the spatial differences of the pollutants. Since the terrain in Beijing is high in the northwest and low in the southeast, the different topography and underlying surface affect the transmission of air pollutants [48]. In the suburbs, human activities are less affected, and the high concentration areas are primarily in the main urban area and the southern plain. As a result, the PM2.5 concentration fitted by this model in winter reached the highest level of nearly 700 μg·m−3, while the minimum area was less than 10 μg·m−3. The RMSE of our summer models ranged from 5.08 μg·m−3 to 8.24 μg·m−3, which was not significantly different from previous studies in China [20], but was larger than the figures from other global research [45]. This was because the background level of PM2.5 in China is several times or even dozens of times higher than that in Europe and the United States. The RMSE of the winter model was also larger than summer due to above reasons.
The predictive variables in the different LUR models are typically not constant due to city-specific conditions and the availability of data. Generally, most LUR studies have used annual or seasonal values. One reason is that the non-contemporaneous measurements of monitoring stations may cause temporal variability [44]. Therefore, one significant drawback of LUR models is their low temporal resolution. The simplest method to calibrate pollutant concentrations are using observations from fixed continuous monitoring stations [49]. This combined with meteorological, satellite data or adding other time-dependent predictive variables [50,51] can also improve the temporal resolution. Since the POIs presented different functional characteristics at different times, this study assumed that different types of POI variables also have time characteristics, and hourly LUR models were developed. The results showed that POIs explained the variation in pollutant concentrations at different times. Before modeling, the 13 types of POIs included in this study were all correlated with PM2.5 concentration, and 11 types of POIs were finally included in the models. The association between POIs and pollutants and its temporal characteristics are better reflected by the attractions of POIs to the crowd [29], which will affect the behavior of the crowd, thus explaining the variability of air pollution at that moment. The POIs included in the models, such as shopping, entertainment, medical care, science, education and culture, all have a large traffic flows near these places in daily life. The pollution produced by these vehicle traffic sources contributes to the PM2.5 pollution at this moment to a large extent.
In terms of the time characteristics of POIs, it can be seen from the results that at 0:00 on summer working days and non-working days, the commercial and residential housing variables were included. This was due to the poor mobility of people in the urban area at this moment, and the crowd being concentrated near the residences. Hence, there will be some traffic source emissions. However, commercial residential areas contain a lot of commercial office buildings. In many high-tech industrial companies in Beijing, it is a normal phenomenon to work and commute after 0:00. For example, in areas such as Xierqi and Houchangcun in the Haidian District, the taxi rush often occurs after 0:00, and this is also a particular social phenomenon in China. It was also confirmed that on weekdays, commercial residence explained 41.2% of the PM2.5 at that time, while on non-weekdays, the explanation was 39.8%. At 18:00 on summer working days, the commercial and residential variable explained 25.7% of the model. This was because people got off work during evening rush hour, and the traffic flow around business buildings and residential areas increased. In addition, there were many buses stops nearby, and this also caused pollutant increases. At 0:00 on working days and non-working days in winter, the POIs explained little of the model ranging from 5.8% to 14.3%. This was primarily because the higher background concentration of pollutants during winter is several times higher than the same time in summer. Compared with meteorological conditions, the POIs contributed a smaller proportion to the PM2.5. Catering service POIs were also included in these models. Previous LUR models for Vancouver, Canada, and Europe have also considered the number of restaurants near the monitoring sites [38,52]. Like automobile exhaust, cooking fumes are also a major source of air pollution [53]. The emission inventory performed by Jin et al. [54] showed that in 2017, the catering industry in China released approximately 38.2 kt PM2.5 and 47.8 kt PM10. In this study, at 8:00 on non-working days in winter, the catering POIs explained 5.5% of the model. On non-weekdays, the passenger flow of various restaurants was more than weekdays, which also increased the emissions of restaurant lampblack. At other times during the non-working day, such as in shopping places, medical care and public facilities were also included in the model. This was because on holidays, people tend to go to these places, and many people will drive private cars or take a taxi, which cause large increases in traffic flow near these POIs and this explains the contribution of the above POIs to the entire model. For example, at 8:00 on non-working days in summer, medical care POIs explained 45.6% of the model, and public facilities POIs explained 11.6%. However, there were several types of POIs included in the model that had no obvious correlation with the places where people gather at that time. For example, at 18:00 on non-working days in summer, there should be few people in schools, but science and education POIs at that time explained 51% of the model. This is because Beijing’s super-large city characteristics lead to POI overlaps in the region. Many schools and educational institutions in Beijing are primarily located in urban areas. These POIs are not independent, and there will be other functional areas nearby, which make the pollutant source unclear. In conclusion, this study confirmed that POIs had a temporal attribute because various POIs show different attractions to the crowd.
Meteorological variables also had time attributes in the models. Meteorological conditions are the primary factors that affect the variability of pollutants with high time resolution. Studies have shown that meteorological conditions contribute greater than 70% of the daily average concentration of pollutants in China [55]. In this study, nearly every model included meteorological variables. The relative humidity was included in all the winter models, and temperature and relative humidity were included in the summer models. Generally, except in the case of precipitation, the greater the relative humidity, the more particles attach to the water vapor, which increases the mass concentration. In the winter models, the relative humidity explained the PM2.5 concentration between 54.9% and 86.7%, and all of them were positively correlated. The influence of air temperature on pollutants is complex and often plays a role in combination with wind speed, terrain, atmospheric junction, and other factors. The results of this model showed that temperature at 0:00 in summer was positively correlated with PM2.5 concentration, which may have been due to air cooling at night, but land surface temperatures are higher, which makes it difficult for pollutants to diffuse. Statheropoulos et al. [56] analyzed the air pollution factors in Athens, and the results showed that the pollutants had a significant relationship with relative humidity and wind speed. However, in many annual LUR models, meteorological variables are not included [44,57]. This may be due to the large temporal and spatial variability of meteorological conditions, and the impact on pollutants is more reflected at daily or hourly concentrations.
Road data also had a temporal attribute in these models. Theoretically, traffic load and vehicle density data may be significant indicators that reflect vehicle exhaust emissions during a short period of time. However, due to their complexity and unavailability, road lengths were used to represent the traffic emissions. Actually, this method has achieved good results in many studies [17,38,58]. In this study, the road variables were divided into primary roads and secondary roads according to the width and vehicle capacity, and these two types of roads were all included in the models. The models containing this variable were all at 8:00 and 18:00, when the traffic flow was in the peak period and a large number of vehicle exhaust emissions become an important source of air pollutants. As can be seen from the results, the included variable buffers of the primary road were relatively large, 1500 m and 2500 m, while the buffer of the second road was only 300 m. The differences in the buffer radius reflected the characteristics of primary and secondary pollutant emissions. For the PM2.5, both primary and secondary emissions can significantly increase its concentration [59]. Different road types were reflected using their functions. The primary roads included urban expressways, national highways, and provincial highways, and they are mainly busy suburban roads. Due to various and complex types of motor vehicle emissions and high traffic densities, large buffer radii may be the result of the secondary discharge of pollutants. However, the secondary roads primarily included a wide distribution of roads, with a small flow of motor vehicles that resulted in a smaller possibility of pollutants spreading in a large area [60]. Thus, they tended to affect the monitoring data in a small area.
Different from previous LUR studies, land, population, and elevation had little influence on the models in this study. This was because these models were based on the hourly time scale of PM2.5 concentration, while predictors such as land use and population were more represented in annual or seasonal models. Son et al. [61] improved the LUR models on an hourly time scale in Mexico, which primarily included temporal variables, such as hourly traffic density, meteorological, and holiday variables. Factors such as land use and population had no significant effect. This is consistent with the results of this study.
There currently exist few LUR studies based on hourly pollutant concentrations. In the short-term or more precise individual exposure studies, using annual or seasonal models may lead to deviations [62]. The biggest breakthrough of this research was the realization of the hourly temporal LUR models, which provided a more accurate method for short-term exposure studies and more accurate micro-environment individual exposure studies. High-resolution simulation of regional pollutant concentrations is of great significance for travel prevention and control of residents. Currently in China, there are hourly concentration limits for pollutant emissions. In the future, there may be more precise timescale upgrades to healthy concentration standards. In addition, more pollution emission sources were identified within a short period of time, and this proved the temporal characteristics of the POIs in the models, which has significance for more accurate pollution prevention and control measures. Though we carried out LUR models for PM2.5 and there still many researches focused on multiple pollutants, this study provides a new methodological perspective for other types of high-resolution pollutant models which is the most important significance.
However, this study still has some limitations. The best explanation of variability in the summer models was only 66.8%. Hence, there are other factors that remain to be explored during a short period of time. Second, only 35 monitoring data were used in this study, and some data were missing. Hoek et al. [45] recommended using 40–80 pollutant monitoring stations. Additional modeling was performed at other times, but not each model could be successfully applied due to the models’ high time resolution and large pollutants variability. More accurate time variables are required to be explored to optimize the LUR models. Finally, due to the progressive linear regression principle of the LUR model itself, the model assumes a linear relationship between all predictive variables and pollutants, which is inherently limited for some variables. All of these problems require further study.

5. Conclusions

In this study, 12 high-resolution LUR models based on the hourly PM2.5 concentration were developed. The results showed that, even within the same day and at different times in different seasons, the PM2.5 concentration at the same monitoring station varied greatly. The fitness of the winter models at various times were better than that in summer, and the model explanation varied between 67.5% and 95.5%. The summer models explained 35.4–66.8% of the variability. The model results of the different seasons and times were quite different, and the model fitness were not robust. The RMSE of the winter models ranged from 19.79 μg·m−3 to 50.76 μg·m−3, while the RMSE of summer models ranged from 5.08 μg·m−3 to 8.24 μg·m−3. The final predictive variables showed that there were differences in explanations of the PM2.5 during the different seasons, and the explanations of the meteorological factors to the winter models were the most significant, ranging from 54% to 86%. The road length and POIs also contributed to the winter models, but the explanations were not significant due to winter’s high background value. The summer models showed that the POIs and road length were the primary contributors to the PM2.5 pollutants. Pollutant’s transportation is also an important reason. POIs have temporal characteristics, and the contribution of different POIs to the air PM2.5 is different on working days or non-working days. In these models, the POIs explained 5.5–51.0% of the variability. This study confirmed that POIs can improve the temporal resolution of LUR models, and this is significant for short-term exposure studies and microenvironment individual exposure studies.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/ijerph18116143/s1, Table S1: PM2.5 pollutant concentrations in each district of Beijing at different times, Table S2: The largest correlation coefficients of predictors in different buffer radii.

Author Contributions

Conceptualization, methodology, investigation, data curation, and writing original draft, Y.Z.; validation, investigation, and resources, H.C.; D.H.; C.F.; writing—review and editing, and supervision, H.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was financially supported by the National Key Research and Development Programs of China (Grant No. 2016YFC0206202) under Ministry of Science and Technology of the People’s Republic of China.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Zhou, M.; Wang, H.; Zeng, X.; Yin, P.; Zhu, J.; Chen, W.; Li, X.; Wang, L.; Wang, L.; Liu, Y.; et al. Mortality, morbidity, and risk factors in China and its provinces, 1990–2017: A systematic analysis for the Global Burden of Disease Study 2017. Lancet 2019, 394, 1145–1158. [Google Scholar] [CrossRef] [Green Version]
  2. Steinle, S.; Reis, S.; Sabel, C.E.; Semple, S.; Twigg, M.M.; Braban, C.F.; Leeson, S.R.; Heal, M.R.; Harrison, D.; Lin, C.; et al. Personal exposure monitoring of PM2.5 in indoor and outdoor microenvironments. Sci. Total Environ. 2015, 508, 383–394. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Adams, M.D.; Kanaroglou, P.S. Mapping real-time air pollution health risk for environmental management: Combining mobile and stationary air pollution monitoring with neural network models. J. Environ. Manag. 2016, 168, 133–141. [Google Scholar] [CrossRef]
  4. Anenberg, S.C.; Belova, A.; Brandt, J.; Fann, N.; Greco, S.; Guttikunda, S.; Heroux, M.-E.; Hurley, F.; Krzyzanowski, M.; Medina, S.; et al. Survey of ambient air pollution health risk assessment tools. Risk Anal. 2016, 36, 1718–1736. [Google Scholar] [CrossRef]
  5. Huang, M.; Ivey, C.; Hu, Y.; Holmes, H.A.; Strickland, M.J. Source apportionment of primary and secondary PM2.5: Associations with pediatric respiratory disease emergency department visits in the U.S. State of Georgia. Environ. Int. 2019, 133, 105167. [Google Scholar] [CrossRef]
  6. Sima, S.; Tajrishy, M. Developing water quality maps of a hyper-saline lake using spatial interpolation methods. Sci. Iran. 2015, 22, 30–46. [Google Scholar]
  7. Gulia, S.; Kumar, A.; Khare, M. Performance evaluation of CALPUFF and AERMOD dispersion models for air quality assessment of an industrial complex. J. Sci. Ind. Res. 2015, 74, 302–307. [Google Scholar]
  8. Meng, X.; Fu, Q.; Ma, Z.; Chen, L.; Zou, B.; Zhang, Y.; Xue, W.; Wang, J.; Wang, D.; Kan, H.; et al. Estimating ground-level PM10 in a Chinese city by combining satellite data, meteorological information and a land use regression model. Environ. Pollut. 2016, 208, 177–184. [Google Scholar] [CrossRef]
  9. Xiao, Q.; Chang, H.H.; Geng, G.; Liu, Y. An ensemble machine-learning model to predict historical PM2.5 concentrations in China from satellite data. Environ. Sci. Technol. 2018, 52, 13260–13269. [Google Scholar] [CrossRef]
  10. Feng, R.; Zheng, H.-J.; Zhang, A.-R.; Huang, C.; Gao, H.; Ma, Y.-C. Unveiling tropospheric ozone by the traditional atmospheric model and machine learning, and their comparison: A case study in hangzhou, China. Environ. Pollut. 2019, 252, 366–378. [Google Scholar] [CrossRef]
  11. Chen, Z.-Y.; Zhang, R.; Zhang, T.-H.; Ou, C.-Q.; Guo, Y. A kriging-calibrated machine learning method for estimating daily ground-level NO2 in mainland China. Sci. Total Environ. 2019, 690, 556–564. [Google Scholar] [CrossRef] [PubMed]
  12. Jerrett, M.; Arain, A.; Kanaroglou, P.; Beckerman, B.; Potoglou, D.; Sahsuvaroglu, T.; Morrison, J.; Giovis, C. A review and evaluation of intraurban air pollution exposure models. J. Expo. Sci. Environ. Epidemiol. 2005, 15, 185–204. [Google Scholar] [CrossRef] [PubMed]
  13. Briggs, D.J.; Collins, S.; Elliott, P.; Fischer, P.; Kingham, S.; Lebret, E.; Pryl, K.; Van Reeuwijk, H.; Smallbone, K.; Van Der Veen, A. Mapping urban air pollution using GIS: A regression-based approach. Int. J. Geogr. Inf. Sci. 1997, 11, 699–718. [Google Scholar] [CrossRef] [Green Version]
  14. Morley, D.W.; Gulliver, J. A land use regression variable generation, modelling and prediction tool for air pollution exposure assessment. Environ. Model. Softw. 2018, 105, 17–23. [Google Scholar] [CrossRef]
  15. Beelen, R.; Hoek, G.; Vienneau, D.; Eeftens, M.; Dimakopoulou, K.; Pedeli, X.; Tsai, M.-Y.; Künzli, N.; Schikowski, T.; Marcon, A.; et al. Development of NO2 and NOx land use regression models for estimating air pollution exposure in 36 study areas in Europe—The ESCAPE project. Atmos. Environ. 2013, 72, 10–23. [Google Scholar] [CrossRef]
  16. Ma, X.; Longley, I.; Gao, J.; Kachhara, A.; Salmond, J. A site-optimised multi-scale GIS based land use regression model for simulating local scale patterns in air pollution. Sci. Total Environ. 2019, 685, 134–149. [Google Scholar] [CrossRef] [PubMed]
  17. Yang, Z.; Freni-Sterrantino, A.; Fuller, G.W.; Gulliver, J. Development and transferability of ultrafine particle land use regression models in London. Sci. Total Environ. 2020, 740, 140059. [Google Scholar] [CrossRef]
  18. Saucy, A.; Röösli, M.; Künzli, N.; Tsai, M.-Y.; Sieber, C.; Olaniyan, T.; Baatjies, R.; Jeebhay, M.; Davey, M.; Flückiger, B.; et al. Land use regression modelling of outdoor NO2 and PM2.5 concentrations in three low income areas in the Western Cape Province, South Africa. Int. J. Environ. Res. Public Health 2018, 15, 1452. [Google Scholar] [CrossRef] [Green Version]
  19. Weissert, L.F.; Salmond, J.A.; Miskell, G.; Alavi-Shoshtari, M.; Williams, D.E. Development of a microscale land use regression model for predicting NO2 concentrations at a heavy trafficked suburban area in Auckland, NZ. Sci. Total Environ. 2018, 619–620, 112–119. [Google Scholar] [CrossRef]
  20. Shi, T.; Hu, Y.; Liu, M.; Li, C.; Zhang, C.; Liu, C. Land use regression modelling of PM2.5 spatial variations in different seasons in urban areas. Sci. Total Environ. 2020, 743, 140744. [Google Scholar] [CrossRef]
  21. Wang, J.; Cohan, D.S.; Xu, H. Spatiotemporal ozone pollution LUR models: Suitable statistical algorithms and time scales for a megacity scale. Atmos. Environ. 2020, 237, 117671. [Google Scholar] [CrossRef]
  22. Dong, J.; Ma, R.; Cai, P.; Liu, P.; Yue, H.; Zhang, X.; Xu, Q.; Li, R.; Song, X. Effect of sample number and location on accuracy of land use regression model in NO2 prediction. Atmos. Environ. 2021, 246, 118057. [Google Scholar] [CrossRef]
  23. Mortamais, M.; Pujol, J.; Martínez-Vilavella, G.; Fenoll, R.; Reynes, C.; Sabatier, R.; Rivas, I.; Forns, J.; Vilor-Tejedor, N.; Alemany, S.; et al. Effects of prenatal exposure to particulate matter air pollution on corpus callosum and behavioral problems in children. Environ. Res. 2019, 178, 108734. [Google Scholar] [CrossRef] [PubMed]
  24. Liu, W.; Li, X.; Chen, Z.; Zeng, G.; León, T.; Liang, J.; Huang, G.; Gao, Z.; Jiao, S.; He, X.; et al. Land use regression models coupled with meteorology to model spatial and temporal variability of NO2 and PM10 in Changsha, China. Atmos. Environ. 2015, 116, 272–280. [Google Scholar] [CrossRef]
  25. Song, W.; Jia, H.; Li, Z.; Tang, D.; Wang, C. Detecting urban land-use configuration effects on NO2 and NO variations using geographically weighted land use regression. Atmos. Environ. 2019, 197, 166–176. [Google Scholar] [CrossRef]
  26. Nori-Sarma, A.; Thimmulappa, R.K.; Venkataramana, G.V.; Fauzie, A.K.; Dey, S.K.; Venkareddy, L.K.; Berman, J.D.; Lane, K.J.; Fong, K.C.; Warren, J.L.; et al. Low-cost NO2 monitoring and predictions of urban exposure using universal kriging and land-use regression modelling in Mysore, India. Atmos. Environ. 2020, 226, 117395. [Google Scholar] [CrossRef]
  27. Chan, L.Y.; Kwok, W.S.; Chan, C.Y. Human exposure to respirable suspended particulate and airborne lead in different roadside microenvironments. Chemosphere 2000, 41, 93–99. [Google Scholar] [CrossRef]
  28. Qiu, X.; Duan, L.; Gao, J.; Wang, S.; Chai, F.; Hu, J.; Zhang, J.; Yun, Y. Chemical composition and source apportionment of PM10 and PM2.5 in different functional areas of Lanzhou, China. J. Environ. Sci. 2016, 40, 75–83. [Google Scholar] [CrossRef] [PubMed]
  29. Yuan, J.; Zheng, Y.; Xie, X. Discovering regions of different functions in a city using human mobility and POIs. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, China, 12–16 August 2012; ACM: New York, NY, USA, 2012; pp. 186–194. [Google Scholar] [CrossRef]
  30. Yuan, Q.; Cong, G.; Ma, Z.; Sun, A.; Thalmann, N.M. Time-aware point-of-interest recommendation. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, 28 July–1 August 2013; ACM: New York, NY, USA, 2013; pp. 363–372. [Google Scholar] [CrossRef]
  31. Han, Y.-J.; Kim, H.-W.; Cho, S.-H.; Kim, P.-R.; Kim, W.-J. Metallic elements in PM2.5 in different functional areas of Korea: Concentrations and source identification. Atmos. Res. 2015, 153, 416–428. [Google Scholar] [CrossRef]
  32. Gao, Y.; Guo, X.; Ji, H.; Li, C.; Ding, H.; Briki, M.; Tang, L.; Zhang, Y. Potential threat of heavy metals and PAHs in PM2.5 in different urban functional areas of Beijing. Atmos. Res. 2016, 178–179, 6–16. [Google Scholar] [CrossRef]
  33. Lu, T.; Lansing, J.; Zhang, W.; Bechle, M.J.; Hankey, S. Land Use Regression models for 60 volatile organic compounds: Comparing Google Point of Interest (POI) and city permit data. Sci. Total Environ. 2019, 677, 131–141. [Google Scholar] [CrossRef] [PubMed]
  34. Gao, J.; Peng, X.; Chen, G.; Xu, J.; Shi, G.-L.; Zhang, Y.-C.; Feng, Y.-C. Insights into the chemical characterization and sources of PM2.5 in Beijing at a 1-h time resolution. Sci. Total Environ. 2016, 542, 162–171. [Google Scholar] [CrossRef]
  35. Liu, Y.; Zheng, M.; Yu, M.; Cai, X.; Du, H.; Li, J.; Zhou, T.; Yan, C.; Wang, X.; Shi, Z.; et al. High-time-resolution source apportionment of PM2.5 in Beijing with multiple models. Atmos. Chem. Phys. 2019, 19, 6595–6609. [Google Scholar] [CrossRef] [Green Version]
  36. Wang, Y.; Bao, S.; Wang, S.; Hu, Y.; Shi, X.; Wang, J.; Zhao, B.; Jiang, J.; Zheng, M.; Wu, M.; et al. Local and regional contributions to fine particulate matter in Beijing during heavy haze episodes. Sci. Total Environ. 2017, 580, 283–296. [Google Scholar] [CrossRef] [PubMed]
  37. Sun, Y.; Jiang, Q.; Wang, Z.; Fu, P.; Li, J.; Yang, T.; Yin, Y. Investigation of the sources and evolution processes of severe haze pollution in Beijing in January 2013. J. Geophys. Res. Atmos. 2014, 119, 4380–4398. [Google Scholar] [CrossRef]
  38. van Nunen, E.; Vermeulen, R.; Tsai, M.-Y.; Probst-Hensch, N.; Ineichen, A.; Davey, M.; Imboden, M.; Ducret-Stich, R.; Naccarati, A.; Raffaele, D.; et al. Land use regression models for ultrafine particles in six European areas. Environ. Sci. Technol. 2017, 51, 3336–3345. [Google Scholar] [CrossRef] [Green Version]
  39. Henderson, S.B.; Beckerman, B.; Jerrett, M.; Brauer, M. Application of land use regression to estimate long-term concentrations of traffic-related nitrogen oxides and fine particulate matter. Environ. Sci. Technol. 2007, 41, 2422–2428. [Google Scholar] [CrossRef]
  40. Wilton, D.; Szpiro, A.; Gould, T.; Larson, T. Improving spatial concentration estimates for nitrogen oxides using a hybrid meteorological dispersion/land use regression model in Los Angeles, CA and Seattle, WA. Sci. Total Environ. 2010, 408, 1120–1130. [Google Scholar] [CrossRef]
  41. Kingham, S.; Briggs, D.; Elliott, P.; Fischer, P.; Erik, L. Spatial variations in the concentrations of traffic-related pollutants in indoor and outdoor air in Huddersfield, England. Atmos. Environ. 2000, 34, 905–916. [Google Scholar] [CrossRef]
  42. Reyes, J.M.; Serre, M.L. An LUR/BME framework to estimate PM2.5 explained by on road mobile and stationary sources. Environ. Sci. Technol. 2014, 48, 1736–1744. [Google Scholar] [CrossRef]
  43. Chalermpong, S.; Thaithatkul, P.; Anuchitchanchai, O.; Sanghatawatana, P. Land use regression modeling for fine particulate matters in Bangkok, Thailand, using time-variant predictors: Effects of seasonal factors, open biomass burning, and traffic-related factors. Atmos. Environ. 2021, 246, 118128. [Google Scholar] [CrossRef]
  44. Cai, J.; Ge, Y.; Li, H.; Yang, C.; Liu, C.; Meng, X.; Wang, W.; Niu, C.; Kan, L.; Schikowski, T.; et al. Application of land use regression to assess exposure and identify potential sources in PM2.5, BC, NO2 concentrations. Atmos. Environ. 2020, 223, 117267. [Google Scholar] [CrossRef]
  45. Hoek, G.; Beelen, R.; de Hoogh, K.; Vienneau, D.; Gulliver, J.; Fischer, P.; Briggs, D. A review of land-use regression models to assess spatial variation of outdoor air pollution. Atmos. Environ. 2008, 42, 7561–7578. [Google Scholar] [CrossRef]
  46. Yang, H.; Chen, J.; Wen, J.; Tian, H.; Liu, X. Composition and sources of PM2.5 around the heating periods of 2013 and 2014 in Beijing: Implications for efficient mitigation measures. Atmos. Environ. 2016, 124, 378–386. [Google Scholar] [CrossRef]
  47. Lv, L.; Chen, Y.; Han, Y.; Cui, M.; Wei, P.; Zheng, M.; Hu, J. High-time-resolution PM2.5 source apportionment based on multi-model with organic tracers in Beijing during haze episodes. Sci. Total Environ. 2021, 772, 144766. [Google Scholar] [CrossRef] [PubMed]
  48. Zhang, A.; Qi, Q.; Jiang, L.; Zhou, F.; Wang, J. Population exposure to PM2.5 in the urban area of Beijing. PLoS ONE 2013, 8, e63486. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Mölter, A.; Lindley, S.; de Vocht, F.; Simpson, A.; Agius, R. Modelling air pollution for epidemiologic research—Part II: Predicting temporal variation through land use regression. Sci. Total Environ. 2010, 409, 211–217. [Google Scholar] [CrossRef] [PubMed]
  50. Naughton, O.; Donnelly, A.; Nolan, P.; Pilla, F.; Misstear, B.D.; Broderick, B. A land use regression model for explaining spatial variation in air pollution levels using a wind sector based approach. Sci. Total Environ. 2018, 630, 1324–1334. [Google Scholar] [CrossRef] [Green Version]
  51. Nordio, F.; Kloog, I.; Coull, B.A.; Chudnovsky, A.; Grillo, P.; Bertazzi, P.A.; Baccarelli, A.A.; Schwartz, J. Estimating spatio-temporal resolved PM10 aerosol mass concentrations using MODIS satellite data and land use regression over Lombardy, Italy. Atmos. Environ. 2013, 74, 227–236. [Google Scholar] [CrossRef]
  52. Abernethy, R.C.; Allen, R.W.; McKendry, I.G.; Brauer, M. A land use regression model for ultrafine particles in Vancouver, Canada. Environ. Sci. Technol. 2013, 47, 5217–5225. [Google Scholar] [CrossRef] [Green Version]
  53. Zhang, D.-C.; Liu, J.-J.; Jia, L.-Z.; Wang, P.; Han, X. Speciation of VOCs in the cooking fumes from five edible oils and their corresponding health risk assessments. Atmos. Environ. 2019, 211, 6–17. [Google Scholar] [CrossRef]
  54. Jin, W.; Zhi, G.; Zhang, Y.; Wang, L.; Guo, S.; Zhang, Y.; Xue, Z.; Zhang, X.; Du, J.; Zhang, H.; et al. Toward a national emission inventory for the catering industry in China. Sci. Total Environ. 2021, 754, 142184. [Google Scholar] [CrossRef] [PubMed]
  55. He, J.; Gong, S.; Yu, Y.; Yu, L.; Wu, L.; Mao, H.; Song, C.; Zhao, S.; Liu, H.; Li, X.; et al. Air pollution characteristics and their relation to meteorological conditions during 2014–2015 in major Chinese cities. Environ. Pollut. 2017, 223, 484–496. [Google Scholar] [CrossRef]
  56. Statheropoulos, M.; Vassiliadis, N.; Pappa, A. Principal component and canonical correlation analysis for examining air pollution and meteorological data. Atmos. Environ. 1998, 32, 1087–1095. [Google Scholar] [CrossRef]
  57. Jin, L.; Berman, J.D.; Warren, J.L.; Levy, J.I.; Thurston, G.; Zhang, Y.; Xu, X.; Wang, S.; Zhang, Y.; Bell, M.L. A land use regression model of nitrogen dioxide and fine particulate matter in a complex urban core in Lanzhou, China. Environ. Res. 2019, 177, 108597. [Google Scholar] [CrossRef]
  58. Liu, C.; Henderson, B.H.; Wang, D.; Yang, X.; Peng, Z.-R. A land use regression application into assessing spatial variation of intra-urban fine particulate matter (PM2.5) and nitrogen dioxide (NO2) concentrations in City of Shanghai, China. Sci. Total Environ. 2016, 565, 607–615. [Google Scholar] [CrossRef]
  59. Dai, Q.; Bi, X.; Liu, B.; Li, L.; Ding, J.; Song, W.; Bi, S.; Schulze, B.C.; Song, C.; Wu, J.; et al. Chemical nature of PM2.5 and PM10 in Xi’an, China: Insights into primary emissions and secondary particle formation. Environ. Pollut. 2018, 240, 155–166. [Google Scholar] [CrossRef]
  60. Karner, A.A.; Eisinger, D.S.; Niemeier, D.A. Near-roadway air quality: Synthesizing the findings from real-world data. Environ. Sci. Technol. 2010, 44, 5334–5344. [Google Scholar] [CrossRef]
  61. Son, Y.; Osornio-Vargas, Á.R.; O’Neill, M.S.; Hystad, P.; Texcalac-Sangrador, J.L.; Ohman-Strickland, P.; Meng, Q.; Schwander, S. Land use regression models to assess air pollution exposure in Mexico City using finer spatial and temporal input parameters. Sci. Total Environ. 2018, 639, 40–48. [Google Scholar] [CrossRef]
  62. Wang, R.; Henderson, S.B.; Sbihi, H.; Allen, R.W.; Brauer, M. Temporal stability of land use regression models for traffic-related air pollution. Atmos. Environ. 2013, 64, 312–319. [Google Scholar] [CrossRef]
Figure 1. Study area and distribution of the monitoring stations.
Figure 1. Study area and distribution of the monitoring stations.
Ijerph 18 06143 g001
Figure 2. Spatial distribution of PM2.5 during the different seasons and times. (ac) show the spatial distribution of PM2.5 during winter working days at 0:00, 8:00, and 18:00; (df) show the spatial distribution of PM2.5 during winter non-working days at 0:00, 8:00, and 18:00; (gi) shows the spatial distribution of PM2.5 during summer working days at 0:00, 8:00, and 18:00; and (jl) shows the spatial distribution of PM2.5 during summer non-working days at 0:00, 8:00, and 18:00.
Figure 2. Spatial distribution of PM2.5 during the different seasons and times. (ac) show the spatial distribution of PM2.5 during winter working days at 0:00, 8:00, and 18:00; (df) show the spatial distribution of PM2.5 during winter non-working days at 0:00, 8:00, and 18:00; (gi) shows the spatial distribution of PM2.5 during summer working days at 0:00, 8:00, and 18:00; and (jl) shows the spatial distribution of PM2.5 during summer non-working days at 0:00, 8:00, and 18:00.
Ijerph 18 06143 g002aIjerph 18 06143 g002b
Figure 3. Scatter plots of the PM2.5 predictions from the Kriging method and the monitoring observations.
Figure 3. Scatter plots of the PM2.5 predictions from the Kriging method and the monitoring observations.
Ijerph 18 06143 g003
Table 1. Predictors in the LUR models.
Table 1. Predictors in the LUR models.
CategoryPredictor VariableUnitBuffer Radius
Land useArable landm2300 m, 500 m, 600 m, 700 m, 800 m, 900 m, 1000 m, 1300 m, 1500 m, 2000 m, 2500 m
Garden landm2
Woodlandm2
Grasslandm2
Commercial service landm2
Industrial and mining warehousing landm2
RoadPrimary roadm
Secondary roadm
POIsCatering services/
Scenic spots/
Public facilities/
Companies/
Shopping places/
Transportation/
Financial banks/
Science and education places/
Commercial and residential housing/
Life services/
Sports and leisure/
Medical care/
Government agencies/
Meteorological factorsTemperature°C/
Relative humidity%
Air pressurehPa
Wind speedm·s−1
PopulationPopulation/
ElevationElevationm
Industry sourceIndustry source/
LUR: Land use regression; POIs: Point of interests.
Table 2. Summary of the descriptive statistics of the PM2.5 data in the study area.
Table 2. Summary of the descriptive statistics of the PM2.5 data in the study area.
DayTimeMean ± SDMaxMinMedianIQR
Winter working days0:00159.5 ± 95.6271.013.0207.053.8–237.8
8:0095.9 ± 78.9319.018.066.043.8–127.5
18:00177.5 ± 107.8376.010.0184.084.0–258.0
Winter non-working days0:00213.5 ± 103.2402.064.0171.0123.3–321.5
8:00151.1 ± 84.7373.073.0118.0105.5–152.0
18:00323.4 ± 159.3624.088.0313.5157.8–466.5
Summer working days0:0034.5 ± 10.650.015.035.027.5–43.0
8:0046.6 ± 12.769.025.045.034.5–56.5
18:0038.5 ± 7.549.09.040.037.5–42.5
Summer non-working days0:0030.4 ± 7.847.011.029.026.0–36.5
8:0038.0 ± 12.962.015.037.028.0–49.8
18:0022.4 ± 7.542.010.023.016.0–27.0
SD: Standard deviation; IQR: Interquartile range.
Table 3. Summary of the basic parameters of the different models.
Table 3. Summary of the basic parameters of the different models.
DayTimeR2Adjusted R2Adjusted 10-Fold cv R2RMSE (μg·m−3)10-Fold cv RMSE (μg·m−3)
Winter working day0:000.9550.9510.94719.7929.39
8:000.6750.6440.67143.6852.64
18:000.8290.8180.80045.4827.33
Winter non-working day0:000.8030.7770.78344.7155.42
8:000.7320.7060.69842.7514.52
18:000.8920.8860.88050.7632.19
Summer working day0:000.6680.6280.5925.089.55
8:000.5670.5100.4668.244.84
18:000.3540.3120.2997.727.92
Summer non-working day0:000.4680.4340.4355.646.51
8:000.6350.5980.6057.879.23
18:000.5930.5670.5747.716.62
RMSE: Root-mean-square error.
Table 4. The LUR models at different times during a winter working day.
Table 4. The LUR models at different times during a winter working day.
TimeVariableCoefficient βtSigVIFPartial R2
0:00Intercept−673.896−15.4820.000NANA
Relative humidity13.27719.0980.0001.6830.867
Sports and leisure_25000.1217.6530.0001.1450.073
Elevation0.1143.2490.0031.7290.015
8:00Intercept−813.746−6.3030.000NANA
Relative humidity13.1377.2800.0001.5210.608
Primary road_15000.0012.4510.0201.3340.035
Elevation0.1391.7500.0901.8050.032
18:00Intercept−384.327−7.1260.000NANA
Relative humidity13.11711.5830.0002.9870.549
Temperature46.9927.1260.0002.9870.280
LUR: Land use regression; VIF: Variance inflation factor.
Table 5. The LUR models at different times during a winter non-working day.
Table 5. The LUR models at different times during a winter non-working day.
TimeVariableCoefficient βtSigVIFPartial R2
0:00Intercept−531.070−6.7340.000NANA
Relative humidity10.7708.7550.0001.0550.574
Shopping places_25001.8145.1220.0004.1160.143
Financial banks_3000−0.122−2.8970.0074.0200.058
Woodland_13004.649 × 10−52.0580.0481.1450.028
8:00Intercept−303.577−4.4390.000NANA
Relative humidity6.8387.4600.0001.0530.620
Catering services_900−0.402−3.1310.0041.1240.055
Grassland_800−2.017 × 10−4−2.5430.0161.0880.056
18:00Intercept−499.405−9.5980.000NANA
Relative humidity14.74215.1320.0001.0100.824
Shopping places_30000.6044.4860.0001.0100.068
LUR: Land use regression; VIF: Variance inflation factor.
Table 6. LUR models at different times during a summer working day.
Table 6. LUR models at different times during a summer working day.
TimeVariableCoefficient βtSigVIFPartial R2
0:00Intercept−130.226−3.0970.005NANA
Residential housing_5000.2963.2960.0031.2300.412
Temperature6.5813.8600.0011.0790.193
Grassland_3000−3.780 × 10−6−2.1790.0391.1470.063
8:00Intercept36.01914.2120.000NANA
Secondary road_3000.0094.3740.0001.9080.363
Financial banks_3000−0.023−3.1680.0043.3970.082
Government agencies_3000.2942.0990.0442.0500.065
Life services_3001.1483.7060.0011.4250.058
18:00Intercept40.45420.6230.000NANA
Residential housing_3000.6652.6340.0131.1140.257
Scenic spots_30000.0242.1570.0391.1140.097
LUR: Land use regression; VIF: Variance inflation factor.
Table 7. LUR models at different times during a summer non-working day.
Table 7. LUR models at different times during a summer non-working day.
TimeVariableCoefficient βtSigVIFPartial R2
0:00Intercept−32.405−1.1420.262NANA
Residential housing_25000.0092.3820.0231.7380.398
Temperature2.5532.0410.0501.7380.069
8:00Intercept−55.293−1.5010.144NANA
Medical care_3001.1543.4110.0021.3300.456
Public facilities_20000.0362.8830.0071.1850.116
Relative humidity1.0492.2830.0301.1880.063
18:00Intercept25.3589.3940.000NANA
Science and education places_5000.1883.1920.0031.7100.510
Primary road_25005.723 × 10−52.5590.0151.7100.083
LUR: Land use regression; VIF: Variance inflation factor.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhang, Y.; Cheng, H.; Huang, D.; Fu, C. High Temporal Resolution Land Use Regression Models with POI Characteristics of the PM2.5 Distribution in Beijing, China. Int. J. Environ. Res. Public Health 2021, 18, 6143. https://doi.org/10.3390/ijerph18116143

AMA Style

Zhang Y, Cheng H, Huang D, Fu C. High Temporal Resolution Land Use Regression Models with POI Characteristics of the PM2.5 Distribution in Beijing, China. International Journal of Environmental Research and Public Health. 2021; 18(11):6143. https://doi.org/10.3390/ijerph18116143

Chicago/Turabian Style

Zhang, Yan, Hongguang Cheng, Di Huang, and Chunbao Fu. 2021. "High Temporal Resolution Land Use Regression Models with POI Characteristics of the PM2.5 Distribution in Beijing, China" International Journal of Environmental Research and Public Health 18, no. 11: 6143. https://doi.org/10.3390/ijerph18116143

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop