Next Article in Journal
Tracking Mountain Degradation for the United Nations (UN) Sustainable Development Goals (SDGs) Using the State of Colorado (USA) as an Example
Previous Article in Journal
High-Resolution Urban Flood Susceptibility Mapping in Miami-Dade County: An AHP-Based GIS and Multi-Criteria Decision Analysis Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Source Integration for Assessing Air Quality Dynamics in China: The Interplay of Anthropogenic Drivers, Meteorology, and Topography

by
Hossam Aldeen Anwer
1,2,3 and
Yunfeng Hu
1,2,4,*
1
State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
2
College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China
3
Department of Surveying Engineering, Karary University, Omdurman 12304, Sudan
4
Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, China
*
Author to whom correspondence should be addressed.
Earth 2026, 7(2), 37; https://doi.org/10.3390/earth7020037
Submission received: 1 February 2026 / Revised: 24 February 2026 / Accepted: 26 February 2026 / Published: 1 March 2026

Abstract

Air pollution remains a major public health and environmental challenge in China, driven by complex non-linear interactions among anthropogenic activities, meteorological conditions, and topographic features that go beyond simple linear relationships. This study presents a comprehensive spatio-temporal assessment of key air pollutants (CO, NO2, SO2, and PM2.5) and their relationships with Total Column Ozone (TCO) across China’s provinces from 2019 to 2023. Multi-source high-resolution satellite data from Sentinel-5P/TROPOMI, the China High PM2.5 dataset, MODIS, and ERA5-Land reanalysis were integrated. A tiered analytical framework was applied, combining linear Pearson correlations, non-linear Spearman rank correlations, and interpretable XGBoost machine learning with SHAP values. Results reveal a distinct seasonal “seesaw” pattern, with primary pollutants peaking during winter stagnation and TCO reaching maximum levels in late winter and spring. Non-linear analyses uncover critical threshold effects, including exponential increases in PM2.5 and SO2 when surface temperatures drop below 0 °C, very strong SO2-TCO coupling (ρ = 0.93), and significant pollutant trapping in low-elevation regions (CO-elevation ρ = −0.82). These findings support the development of precision environmental policies with dynamic, temperature-threshold-based emission controls and topography-specific strategies to effectively mitigate air pollution in China.

1. Introduction

Air quality is a significant environmental issue, especially in urban areas, where industrial processes, vehicle exhaust, and high population density are major contributors to its deterioration [1,2,3,4]. Among the various pollutants, carbon monoxide (CO), sulfur dioxide (SO2), nitrogen dioxide (NO2), and PM2.5 are considered primary indicators due to their hazardous impacts on both human health and the environment [5,6,7,8,9].
A complex interplay of natural and anthropogenic factors governs the dynamics of these pollutants. Meteorological conditions, such as surface temperature, wind speed, and direction, crucially influence their dispersion and transformation [10,11,12,13,14]. Furthermore, topographic features including elevation, slope, and aspect can significantly alter local airflow patterns, potentially trapping pollutants in basins or facilitating their dispersal [15,16]. Concurrently, socio-economic activities such as industrial production, construction, and transportation are major contributors to emissions, while ecological factors such as vegetation cover can act as natural sinks [17,18,19].
To analyze these multi-scale processes, satellite remote sensing has been increasingly employed to overcome the spatial limitations of ground-based monitoring. Instruments like the Tropospheric Monitoring Instrument (TROPOMI) onboard the Sentinel-5P satellite provide high-resolution data on trace gases (e.g., NO2, SO2, CO), enabling the detection of emission sources and transport patterns that sparse surface stations might miss [20,21]. Similarly, the Moderate Resolution Imaging Spectroradiometer (MODIS) provides critical observations of aerosol optical depth (AOD) and active fires, facilitating the tracking of particulate matter and biomass-burning events [22]. Studies by Alam et al. [23] and Huang et al. [24] have demonstrated the power of such data to reveal seasonal variability, long-term trends, and the efficacy of pollution control policies. The integration of satellite data with reanalysis products (e.g., ERA5-Land) and socio-economic datasets has thus become a fundamental approach in atmospheric research [25,26].
Despite these advancements, significant knowledge gaps persist regarding the drivers of air pollution in China. A substantial portion of existing literature exhibits methodological or spatial limitations that hinder comprehensive national-scale analysis. Early foundational studies were often geographically constrained to megacities such as Beijing and Shanghai, establishing significant correlations but failing to capture broader, heterogeneous patterns [27]. While subsequent research expanded to national scales, many studies remained heavily reliant on ground-station data and spatial interpolation techniques (e.g., Kriging), which can introduce significant bias in regions with sparse networks or complex terrain [28]. Analytically, the preponderance of this work employs conventional linear statistical methods to link pollutant concentrations with potential drivers [29,30,31,32,33]. These linear approaches often fail to capture the non-linear interactions inherent in atmospheric systems, such as climatic threshold effects, synergistic pollutant interactions, or saturation points in socio-economic impacts [34]. Although recent work has begun to explore non-linear relationships—particularly in health impact studies using models such as Distributed Lag Non-linear Models (DLNM) [35,36]—a critical gap remains. Few studies apply sophisticated, interpretable non-linear frameworks to disentangle the complex influences of topography, meteorology, and human activity on pollution concentrations themselves, leveraging the synoptic coverage provided by modern satellite data.
To address these gaps, this study proposes an innovative synoptic approach that integrates multi-source satellite observations with interpretable machine learning. By synthesizing high-resolution data from the Sentinel-5P TROPOMI and MODIS instruments with ERA5-Land reanalysis fields, we overcome the spatial limitations of ground-based monitoring. The novelty of this research lies in its tiered analytical framework, which contrasts traditional linear statistics (Pearson correlation) and monotonic rankings (Spearman correlation) with non-linear techniques—specifically XGBoost regression and SHAP (Shapley Additive exPlanations) analysis.
Specifically, this study integrates data from 2019 to 2023 to accomplish three primary objectives:
  • To characterize spatial patterns and seasonal trends of air quality using continuous satellite coverage to overcome the limitations of sparse ground station interpolation.
  • To assess natural factors, specifically investigating how meteorological conditions (temperature, wind) and often-overlooked topographic features (elevation, slope) constrain pollutant dispersion.
  • To investigate the correlation of air pollution intensity with anthropogenic drivers, namely socio-economic activities (population, industry, transport, vehicle density, construction output) and ecological vegetation cover, employing a tiered analytical framework that contrasts linear Pearson and monotonic Spearman correlations with interpretable machine learning (XGBoost) to capture complex threshold effects missed by traditional statistical models.

2. Materials and Methods

2.1. Overview of the Study Area

The People’s Republic of China, located in East Asia, spans approximately 9.6 million square kilometers, stretching from latitudes 18° N to 54° N and longitudes 73° E to 135° E as shown in Figure 1. The country features a notable topographic gradient that ranges from lowland fertile plains in the east, around 4 m above sea level in coastal provinces like Shanghai and Jiangsu, to highland plateaus and the Himalayas in the west, where elevations exceed 4700 m in Tibet and Qinghai. As altitude increases, the terrain becomes more rugged; eastern coastal areas are relatively flat, while the mountainous regions of Sichuan and Yunnan have steeper slopes. This variety leads to a complex topographic and climatic pattern.
For this research, the focus is on the Chinese mainland, excluding Taiwan Province and the Special Administrative Regions of Hong Kong and Macao, to account for variations in administrative reporting and data.

2.2. Basic Data

This study examines the spatial distribution of key air pollutants—carbon monoxide (CO), nitrogen dioxide (NO2), sulfur dioxide (SO2), and particulate matter (PM2.5)—as well as Total Column Ozone (TCO) as shown in Table 1 across Chinese provinces from 2019 to 2023. We explore their interactions with meteorological variables, topographic features, ecological indicators, and socio-economic factors, including population density and industrial output, to enhance our understanding of air quality dynamics.

2.2.1. Atmospheric Composition

Data for carbon monoxide (CO), nitrogen dioxide (NO2), sulfur dioxide (SO2), and total carbon (TCO) were obtained from the Sentinel-5P satellite. Additionally, monthly mean data from the China High PM2.5 datasets were downloaded for the period from January 2019 to December 2023. This approach was taken to minimize short-term variability and ensure consistency with other environmental datasets.

2.2.2. Meteorological Factors

Data on surface temperature, wind speed, and wind direction were collected from the ECMWF ERA5-Land, which provides high-quality reanalysis meteorological fields. Monthly averages were computed to align with the temporal resolution of the pollution and vegetation datasets.

2.2.3. Socio-Economic Factors

Provincial-level socio-economic indicators—such as population, industrial output, construction output, transportation output, and the total number of vehicles—were obtained from the National Bureau of Statistics of China (NBS). All socio-economic data were matched to the corresponding environmental data across the same study period.

2.2.4. Topographic Factors

Digital Elevation Model (DEM) data were sourced from the Shuttle Radar Topography Mission (SRTM) dataset, which is available through Google Earth Engine (GEE) at a spatial resolution of 30 m. Slope and aspect were derived directly within GEE using its built-in terrain analysis functions. All topographic layers were then resampled in GEE to match the spatial resolution of the air pollutant datasets, ensuring spatial consistency across all variables.

2.2.5. Ecological Factors

Ecological factors were represented using vegetation indices derived from the MODIS MOD13A3 product via Google Earth Engine (GEE). This product provides monthly Normalized Difference Vegetation Index (NDVI) data at a 1 km resolution, allowing for reliable, cloud-screened vegetation information from January 2019 to December 2023. All vegetation data were processed in GEE for compatibility with air pollutant layers. Data on carbon monoxide (CO), nitrogen dioxide (NO2), sulfur dioxide (SO2), and total carbon (TCO) were sourced from the Sentinel-5P satellite, along with monthly mean data from the China High PM2.5 datasets for the same period to ensure consistency.

2.3. Overall Technical Approach and Analysis Methods

The technical workflow prioritized spatial and temporal consistency across multi-source datasets through systematic standardization and processing as shown in Figure 2. All datasets, including satellite imagery, meteorological fields, and administrative boundaries, were projected to the WGS 1984 coordinate system (EPSG:4326). Reprojection was performed using a dual-platform approach: Google Earth Engine (GEE) automated the reprojection of Sentinel-5P, ERA5-Land, MODIS, and SRTM data, while Python (Google Colab) with rioxarray and geopandas handled the high-resolution China PM2.5 dataset and provincial boundaries.
Following coordinate alignment, spatial resolutions were harmonized. Within GEE, gaseous pollutants (CO, NO2, SO2, TCO), meteorological variables, and topography layers were resampled to a uniform 7 km × 7 km grid to match the Sentinel-5P TROPOMI footprint. For PM2.5, the native 1 km × 1 km China High PM2.5 dataset was retained in Google Colab, and provincial means were derived via zonal statistics without resampling, preserving local accuracy while enabling aggregation.

2.3.1. Variable Derivation and Calculation

Several environmental indicators were derived from raw satellite bands and vector components to ensure physical meaningfulness:
  • (a) Vegetation (NDVI) The Normalized Difference Vegetation Index (NDVI) was calculated using the Near-Infrared (NIR) and Red spectral bands from the MODIS dataset to assess vegetation health and density as shown in Equation (1).
N D V I = P N I R P R E D P N I R + P R E D
where P N I R   and P R E D represent the surface reflectance in the near-infrared and red bands, respectively.
  • (b) Meteorological Factors (Wind and Temperature) Wind parameters were derived from the zonal (u) and meridional (v) components of the 10 m wind vector provided by the ERA5-Land dataset. Wind Speed (WS) was calculated as the magnitude of the wind vector as shown in Equation (2).
W s = u 2 + v 2
Wind Direction (WD) was calculated to represent the meteorological direction (the direction from which the wind is blowing, measured clockwise from North). This was derived using the four-quadrant inverse tangent function (atan2) as shown in Equation (3).
W D = 180 π   . a t a n 2 u , v + 360 ( m o d   360 )
where u represents the eastward component and v represents the northward component. The modulo operation ensures the final direction remains within the 0–360° range.
  • Additionally, surface temperature data were converted from Kelvin (K) to degrees Celsius (C) for consistency with standard reporting units as shown in Equation (4).
T c = T k 273.15
  • (c) Topographic Slope: Slope was calculated from the SRTM Digital Elevation Model (DEM) based on the rate of change in elevation in the horizontal (x) and vertical (y) directions as shown in Equation (5).
S l o p e = arctan ( ( z y ) 2 + ( z x ) 2 ) × 180 π
Aspect identifies the downslope direction of the maximum rate of change in value from each cell to its neighbors. It was calculated using the inverse tangent function of the gradients as shown in Equation (6).
A s p e c t = 180 π   . a t a n 2   z x , z y

2.3.2. Spatial and Temporal Aggregation

The calculation of mean values was performed using a multi-platform strategy that handled the specific data formats and computational requirements of each variable. Within the Google Earth Engine (GEE) platform, monthly mean values for gaseous pollutants (CO, NO2, SO2), Total Column Ozone (TCO), surface temperature, wind speed, wind direction, NDVI, and topographic factors were computed using cloud-based parallel processing to ensure temporal consistency across the five-year study period. For the PM2.5 data, a specialized Python workflow was implemented in Google Colab. This process used the xarray and netCDF4 libraries to manage high-resolution NetCDF files, and the rasterstats library to compute zonal statistics. This allowed for the precise extraction of monthly mean values for each province by masking the raster data with the corresponding administrative shapefiles.
The visualization and mapping phase used a hybrid approach that leveraged the cartographic precision of ArcGIS Pro 3.5.2 and the analytical flexibility of Python. ArcGIS Pro, in conjunction with Python scripts, was used to generate high-quality cartographic maps illustrating the spatial distribution of all pollutants, including PM2.5, at both monthly and seasonal scales. These maps, which also incorporated meteorological and environmental variables across China’s provinces, were instrumental in identifying regional trends and pollution “hotspots.” Additionally, Python’s specialized libraries were utilized to produce analytical maps and complex statistical charts to effectively visualize the temporal evolution and dynamic fluctuations of the studied factors from 2019 to 2023.

2.3.3. Statistical Correlation Analysis

To identify the primary drivers of air pollution, processed environmental and socio-economic datasets were integrated using the Pandas library. The analysis followed a multi-stage approach: first, Pearson correlation coefficients (r) were calculated to evaluate linear relationships, followed by Spearman’s rank correlation (ρ) to assess monotonic trends and handle non-normal distributions. To capture complex, non-linear interactions, XGBoost regression models were developed for each pollutant. Finally, model results were interpreted using SHAP (Shapley Additive Explanations) values, which quantified the specific contribution and importance of each environmental and socio-economic factor to variations in air quality.

2.3.4. Machine Learning Configuration and Interpretability

To quantify non-linear interactions, we developed distinct XGBoost regression models for each pollutant using 13 meteorological, topographic, and socio-economic features. To prevent overfitting on the spatio-temporal dataset, we restricted model complexity using conservative hyperparameters (adept = 4, learning_rate = 0.03, and n_estimators = 300) and validated performance on a 10% independent test set held out from the 90% training data.
Robustness was confirmed via out-of-sample on the test set, ensuring the learned patterns were not artifacts of overfitting. SHAP (Shapley Additive exPlanations) analysis was then applied to these validated models to resolve their “black box” nature and quantify non-linear thresholds.

3. Results

3.1. Spatial and Seasonal Patterns of Air Pollutants and Environmental Drivers

The mean CO distribution across China reveals a transparent east–west gradient, with elevated concentrations in the eastern and southeastern regions and minimal values in the western highlands, as shown in Figure 3. This pattern corresponds with land surface temperature anomalies, showing positive values in the east and negative anomalies over the Tibetan Plateau. Seasonally, temperature and CO exhibit opposite trends: peak temperatures occur in July (Summer), especially in southern China, as shown in Figures S1 and S2, while CO concentrations peak in January over northern industrial regions and reach their lowest levels in July.
NO2 and SO2 mean concentrations are significantly higher in eastern and southeastern China, as shown in Figure 4. Both pollutants display pronounced seasonal variability, peaking in December, particularly across northern industrial areas. High concentrations are concentrated in provinces such as Hebei, Shanxi, and Jilin, as shown in Figures S2 and S3, before declining to minimum levels in July. Similarly, reaches its maximum values in northern port regions, including Tianjin and Shandong, in December and decreases to its lowest levels in August, especially over the western highlands of Tibet and Yunnan.
PM2.5 and TCO mean concentrations are substantially higher in northern and central China, particularly over the North China Plain, and lowest in southern regions and western highlands, as shown in Figure 5. In contrast, mean Total Column Ozone (TCO) exhibits apparent latitudinal variability and a delayed seasonal peak relative to primary pollutants.
While concentrations peak in January, with provinces such as Henan and Shandong showing the highest values (Figures S2 and S4), levels decline to minima in July and August. Conversely, TCO reaches its maximum in late winter to early spring (February–March), peaking in northeastern provinces including Heilongjiang and Jilin, before decreasing to minimum values in late summer and early autumn, as shown in Figures S8 and S9.
The mean wind speed over China is higher in the northern and western regions, including the Mongolian and Tibetan Plateaus, and lowest across the central and southeastern plains, as shown in Figure 6. Wind direction shows substantial spatial variability driven by monsoonal circulation and topographic effects. Seasonally, wind speed peaks in late winter, as shown in Figure S6, with December recording the highest values in regions such as Hainan and Inner Mongolia (Figures S5 and S7), before declining to a minimum in August, particularly in interior provinces like Sichuan. Wind direction exhibits an apparent seasonal reversal, with prevailing northwesterly flows in northern cities during December shifting to southeasterly directions in July.
The mean Normalized Difference Vegetation Index (NDVI) over China indicates higher vegetation density and greener cover in the southern and southeastern regions, driven by subtropical climates, abundant rainfall, and dense forests, as shown in Figure 7. In contrast, the lowest NDVI values are observed in the northwestern deserts, the Tibetan Plateau, and arid northern areas, reflecting sparse vegetation and barren landscapes resulting from harsh climatic and topographic conditions.

3.2. Monthly and Seasonal Correlation Analysis of Drivers

Pearson and Spearman analyses (Tables S1 and S2, and Figures S8–S13) identify surface temperature as the primary regulator of air quality, exhibiting robust negative correlations with SO2 ( r = 0.63 , ρ = 0.65 ) and PM2.5 ( r = 0.55 ) annually. This confirms a seasonal pattern where cold temperatures drive heating emissions and atmospheric stagnation. Wind speed consistently acts as a dispersion factor, showing a negative correlation with PM2.5 across all seasons. Regarding inter-pollutant coupling, combustion markers NO2 and C O display the strongest annual association ( r = 0.74 ), reflecting shared industrial and vehicular sources. Notably, spring data reveals a unique meteorological synchronization between surface SO2 and stratospheric TCO ( r = 0.79 ). This strong correlation is likely not due to direct chemical interactions, but rather driven by large-scale synoptic weather systems that influence the entire atmospheric column. The same meteorological conditions that favor surface pollutant accumulation (such as stable boundary layers) coincide with the seasonal peak in stratospheric ozone transport, resulting in simultaneous highs for both variables.

3.3. Annual Correlation Analysis of Drivers

Pearson and Spearman analyses (Table S3; Figures S14 and S15) reveal surface temperature as a complex driver, showing robust negative correlations with SO2 (r = −0.65, ρ = −0.46) and PM2.5 (r = −0.26, ρ = −0.63). Wind speed acts as a purification force with stronger negative association to PM2.5 (r = −0.25, ρ = −0.43). Vegetation coverage exhibits non-linear mitigation of PM2.5 (r = −0.45, ρ = −0.60). Topography (DEM) controls pollution deposition with strong negative correlation to CO (r = −0.9, ρ = −0.82). Socioeconomically, vehicle ownership couples with CO levels (ρ = 0.55). Inter-pollutant synergy is profound: SO2-TCO strongest non-linearly (r = 0.51, ρ = 0.93) from synoptic weather; NO2-CO peaks (ρ = 0.83), confirming combustion cohesion.

3.4. XGBoost and SHAP Analysis of Drivers

Based on the XGBoost model and SHAP (Shapley Additive exPlanations) analysis, the non-linear contributions of meteorological, topographic, and socioeconomic drivers were quantified, as shown in Figure 8. To validate the reliability of these non-linear insights, the model’s performance was evaluated using the coefficient of determination (R2), indicating high predictive accuracy for most pollutants, as shown in Table S4.
Temperature emerges as the most critical determinant for PM2.5 and S o 2 , exhibiting a clear non-linear threshold effect where lower temperatures (below 10 °C) drastically increase output predicted values of pollutant variables values, corresponding to winter heating emissions. Specifically, the partial dependence plots reveal a sharp “elbow” pattern for PM2.5 where SHAP values rise steeply as temperatures drop below 0 °C, confirming the non-linear impact of cold-stagnation events as shown in Figure 9 and Figure S16. For NO2, Aspect (wind-facing direction) and Temperature are the top predictors, with vegetation also showing a strong mitigating effect where higher green cover leads to negative SHAP values as shown in Figure S17. In contrast, Elevation is the dominant factor for CO, showing a sharp decline in influence as altitude increases, confirming that carbon monoxide accumulates primarily in low-lying basins as shown in Figure S18. Furthermore, TCO displays a strong seasonal non-linearity, with SHAP values peaking in the warm months (May-August) and dropping significantly in winter, driven by photochemical reaction rates as shown in Figure 10. Regarding socioeconomic impacts, industrial output shows a saturation effect for NO2 where pollution impacts rise rapidly with initial activity but plateau at extremely high output levels, suggesting a dispersion limit, while vehicle ownership shows a logarithmic-like increase in its contribution to CO levels.

4. Discussion

4.1. Spatial and Temporal Patterns of Air Quality Dynamics

Air quality in China exhibits pronounced spatial heterogeneity, with a distinct east–west gradient aligned with topographic and socioeconomic divides. The eastern and southeastern regions, particularly the North China Plain (Hebei, Henan, Shandong) and coastal hubs (Jiangsu, Shanghai), exhibit elevated levels of primary pollutants (CO, NO2, SO2, PM2.5) due to high population density, industrial agglomeration, and vehicular emissions. In contrast, western highlands (e.g., Tibetan Plateau) maintain robust background air quality owing to limited anthropogenic activity. These patterns are modulated by topography, with low-lying basins (e.g., in Jiangsu, Hebei, Sichuan) acting as pollutant traps, as evidenced by CO accumulation in elevation-constrained zones. These regions are often hemmed in by rugged, high-level terrain, which facilitates local drainage flows and regional synoptic stagnation, further preventing the horizontal and vertical dispersion of pollutants.
Seasonally, air quality displays a “seesaw” pattern: primary pollutants (PM2.5, SO2, CO) peak in winter (December–January) due to heating emissions and stagnation under thermal inversions and low boundary layers. Conversely, total column ozone (TCO) peaks in late winter–spring, driven by Brewer-Dobson Circulation transporting stratospheric ozone, rather than surface photochemistry. These rhythms underscore the interplay of anthropogenic cycles and meteorological phenology in regional pollution dynamics.

4.2. Relationships Between Key Drivers and Air Quality

Our integrated analytical framework combining Pearson and Spearman correlations with XGBoost-SHAP machine learning reveals that while anthropogenic activities (e.g., industrial and urban emissions) establish the baseline pollution load, natural drivers such as temperature, topography, and vegetation critically amplify pollution spikes. Temperature exhibits a pronounced nonlinearity: although linear correlations indicate a general inverse relationship with pollutants, machine learning identifies a sharp threshold below 0 °C, where PM2.5 and SO2 concentrations increase exponentially. This reflects a feedback loop wherein cold stagnation and intensified heating emissions synergistically elevate pollution.
Wind speed acts as a consistent dispersion factor, showing significant negative correlations with PM2.5 (r = −0.25, ρ = −0.43) and SO2 (r = −0.65, ρ = −0.46), with a more substantial cleansing effect on SO2. Topography further modulates pollution persistence, evidenced by a strong negative correlation between elevation and CO (ρ = −0.82, r = −0.90), confirming that low-lying basins serve as pollutant traps due to restricted airflow. Notably, the reduced CO levels observed in high-altitude regions such as the Tibetan Plateau are partially attributable to atmospheric thinning rather than cleanliness alone.
Vegetation cover (NDVI) functions as a natural sink, with steadily decreasing pollutant levels (NO2, PM2.5, CO) as greenness increases, as indicated by SHAP dependence plots—an effect more continuous than threshold-driven. Although industrial output is a dominant source of emissions, its relationship with air quality is not strictly linear. At the highest levels of production, pollution impacts tend to level off or saturate. This trend suggests that mature industrial zones may be decoupling production from emissions through improved technology adoption and higher efficiency standards, preventing pollution from rising infinitely. Additionally, a strong non-linear coupling between SO2 and total column ozone (TCO, ρ = 0.93) suggests that synoptic weather systems concurrently influence both surface pollutants and stratospheric ozone, rather than shared emission pathways.

4.3. Comparison with National and Regional Studies

Our findings on spatio-temporal factors influencing air pollution in China validate and extend prior estimates in this domain, aligning with the documented decline in primary pollutants (PM2.5 and SO2) and rise in total column ozone (TCO) reported by Liu et al. [28] and Zhang et al. [27]. While these studies identified meteorological stagnation as a key winter driver using linear regression or panel models, our XGBoost and SHAP analyses reveal complex non-linear relationships, including a thermal threshold of 0 °C below which PM2.5 and SO2 concentrations increase in a nuance undetectable by linear approaches. This threshold elucidates the “cold-dry” conditions associated with elevated respiratory disease risks, as observed in recent epidemiological reports, such as influenza outbreaks in Shanghai and Huai’an, reported by Si et al. [32] and Wang et al. [30].
Regarding sources, winter peaks stem from stagnation and heating emissions. Still, autumn-winter elevation of pollutant concentration also involves agricultural straw open-field burning (ASOB), as noted by Huang et al. [29], contributing to a “seasonal seesaw” pattern via synoptic systems that trap pollutants and influence stratospheric ozone. This non-linear coupling between contaminants (e.g., SO2 and O3) and meteorology contrasts with traditional linear correlations [27,28]. It aligns with synergistic health impacts, including HFRS incidence Kuang et al. and tuberculosis Sun et al. Overall, non-linear methods better capture thresholds (<0 °C) and holistic interactions, thereby improving air quality and health assessments [30,31,32,33].

4.4. Limitations of This Study and Future Directions

This study provides a multi-scale, multi-source analysis of air quality drivers across China, but is subject to several limitations.
First, the use of monthly aggregated data, while suitable for identifying seasonal and climatological patterns, obscures short-term, high-intensity pollution episodes (e.g., 1–3-day haze events). Such acute events are critical for health impact assessments and are often driven by sub-daily meteorological variations that monthly averages cannot capture.
Second, although high-resolution inputs were employed (e.g., 1 km PM2.5 data), spatial aggregation to provincial means masks significant intra-regional variability. This approach may underestimate exposure in densely populated urban centers while overrepresenting pollution levels in rural areas, limiting the precision of localized exposure assessments.
For future research, we recommend moving beyond broad administrative units by adopting county-level or pixel-based analyses to preserve spatial heterogeneity and provide higher granularity within each province. Direct validation against in situ ground measurements should be prioritized to quantify the accuracy of satellite-derived products. Furthermore, incorporating precipitation data is essential, as wet deposition plays a key role in removing particulate matter and soluble gases, a process not fully addressed here. Finally, integrating chemical transport models (CTMs) with interpretable machine learning (XAI) could further disentangle the complex causal pathways underlying air quality dynamics.

5. Conclusions

This study presents an extensive spatio-temporal analysis of air quality across China from 2019 to 2023 by integrating multi-source satellite observations (Sentinel-5P, MODIS), ERA5-Land meteorological data, and provincial socio-economic datasets. By advancing beyond linear statistics to incorporate Spearman rank correlations and XGBoost-SHAP machine learning, we elucidate the non-linear interactions between anthropogenic drivers and natural modulating factors.
Key Findings
  • Distinct seasonal variability, with primary pollutants (CO, NO2, SO2, PM2.5), peaks in winter due to combustion emissions and atmospheric stagnation, while total column ozone (TCO) peaks in late winter–spring via stratospheric transport.
  • Temperature exhibits a non-linear threshold effect: PM2.5 and SO2 increase exponentially below 0 °C. Topography strongly modulates pollution, with low-lying basins acting as pollutant traps (e.g., ρ = −0.82 between elevation and CO).
  • Socio-economic activity drives baseline emissions but shows saturation at high industrial output, whereas vegetation (NDVI) serves as a non-linear sink for NO2 and CO.
  • Potent pollutant–meteorology couplings, such as between SO2 and TCO (ρ = 0.93), highlight systemic atmospheric linkages beyond shared emission sources.
These results underscore that while anthropogenic sources set emission baselines, natural filters (wind, vegetation) and traps (topography, thermal inversions) critically shape exposure intensity. We therefore advocate for precision environmental policies that target specific thermal thresholds (e.g., sub-zero periods) and topographically sensitive regions, rather than uniform emission caps, to more effectively enhance air quality mitigation.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/earth7020037/s1, Figure S1. Seasonal and Monthly Variations in (a) Carbon Monoxide (CO) Concentrations ( m m o l / m 2 ) and (b) Surface Temperature (°C) Across Chinese Provinces (2019–2023); Figure S2. Seasonal Spatial Distributions of Air Pollutants (CO, N O 2 , P M 2.5 , S O 2 , and TCO) Across China (Mean Values, 2019–2023); Figure S3. Seasonal and Monthly Variations in (a) Nitrogen Dioxide ( N O 2 ) Concentrations ( m m o l / m 2 ) and (b) Sulfur Dioxide ( S O 2 ) Concentrations ( m m o l / m 2 ) Across Chinese Provinces (2019–2023); Figure S4. Seasonal and Monthly Variations in (a) P M 2.5 Concentrations ( μ g / m 3 ) and (b) Total Column Ozone (TCO) Concentrations ( m m o l / m 2 ) Across Chinese Provinces (2019–2023); Figure S5. Seasonal and Monthly Variations in (a) Wind Speed ( m / s ) and (b) Wind Direction (°) Across Chinese Provinces (2019–2023); Figure S6. Seasonal Wind Rose Diagrams Depicting Direction Frequencies and Speeds ( m / s ) Over China (2019–2023); Figure S7. Seasonal Spatial Distributions of Climatic and Vegetation Parameters (Temperature, NDVI, Wind Speed, and Wind Direction) Across China (Mean Values, 2019–2023); Figure S8. Spring and Autumn Scatter Plots of Air Pollutant-Environmental Variable Relationships with Pearson Correlations, China (2019–2023); Figure S9. Winter and Summer Scatter Plots of Air Pollutant-Environmental Variable Relationships with Pearson Correlations, China (2019–2023); Figure S10. Spring and Autumn Scatter Plots of Air Pollutant-Environmental Variable Relationships with Spearman Correlations, China (2019–2023); Figure S11. Winter and Summer Scatter Plots of Air Pollutant-Environmental Variable Relationships with Spearman Correlations, China (2019–2023); Figure S12. Scatter Plots of Air Pollutant-Environmental Variable Relationships with Pearson Correlations, China (Monthly Means, 2019–2023); Figure S13. Scatter Plots of Air Pollutant-Environmental Variable Relationships with Spearman Correlations, China (Monthly Means, 2019–2023); Figure S14. Pearson’s Correlation of Annual Mean Atmospheric Composition with Natural and Socioeconomic Factors (2019–2023); Figure S15. Spearman’s Correlation of Annual Mean Atmospheric Composition with Natural and Socioeconomic Factors (2019–2023); Figure S16. SHAP dependence plots for the top 6 primary predictors of S O 2 ; Figure S17. SHAP dependence plots for the top 6 primary predictors of N O 2 ; Figure S18. SHAP dependence plots for the top 6 primary predictors of CO; Table S1. Seasonal Pearson and Spearman Correlation Coefficients Between Air Pollutants, Meteorological Factors, and Vegetation; Table S2. Monthly Pearson and Spearman Correlation Coefficients Between Air Pollutants, Meteorological Factors, and Vegetation; Table S3. Seasonal Pearson and Spearman Correlation Coefficients Between Air Pollutants, Meteorological Factors, Vegetation and Socio-Environmental Drivers; Table S4. model performance (R2).

Author Contributions

Conceptualization, H.A.A. and Y.H.; methodology, H.A.A.; software, H.A.A.; validation, H.A.A. and Y.H.; formal analysis, H.A.A.; investigation, H.A.A.; resources, Y.H.; data curation, H.A.A.; writing—original draft preparation, H.A.A.; writing—review and editing, H.A.A. and Y.H.; visualization, H.A.A.; supervision, Y.H.; project administration, Y.H.; funding acquisition, Y.H. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the National Natural Science Foundation of China (42371304); the National Key Research and Development Plan Program of China [2021YFD1300501]; the Key Project of Innovation LREIS (KPI011). The first author was supported by the ANSO-CAS-TWAS/UNESCO Scholarship.

Data Availability Statement

The data used in this study were obtained from the following sources: atmospheric composition data (CO, NO2, SO2, and TCO) were acquired from the Sentinel-5P TROPOMI satellite via Google Earth Engine (GEE); PM2.5 data were obtained from the China High PM2.5 dataset and supplemented by additional PM2.5 data from the study available at https://doi.org/10.5194/acp-20-3273-2020; meteorological data (surface temperature, wind speed, and wind direction) were derived from the ECMWF ERA5-Land reanalysis product via Google Earth Engine (GEE); topographic data (elevation, slope, aspect) were sourced from the Shuttle Radar Topography Mission (SRTM) DEM via Google Earth Engine (GEE); vegetation index (NDVI) data were obtained from the MODIS MOD13A3 product via Google Earth Engine (GEE); and socio-economic data (population, industrial output, construction output, transportation output, and vehicle numbers) were collected from the National Bureau of Statistics of China (https://www.stats.gov.cn/english/Statisticaldata/yearbook/, accessed on 7 October 2025). All data were processed and harmonized for the period January 2019 to December 2023.

Acknowledgments

We acknowledge the National Natural Science Foundation Air Pollution Complex Major Research Plan Data Integration Project (Project Number: 92044303, https://www.capdatabase.cn) for providing the data used in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

Administrative DivisionCode
BeijingBJ
TianjinTJ
HebeiHE
ShanxiSX
NeimengguNM
LiaoningLN
JilinJL
HeilongjiangHL
ShanghaiSH
ZhejiangZJ
AnhuiAH
FujianFJ
JiangxiJX
ShandongSD
HenanHA
HubeiHB
HunanHN
GuangdongGD
GuangxiGX
HainanHI
ChongqingCQ
SichuanSC
GuizhouGZ
YunnanYN
ShaanxiSN
GansuGS
QinghaiQH
NingxiaNX
XinjiangXJ
HongKongHK
MacaoMO
TaiwanTW
JiangsuJS
XizangXZ

References

  1. Smith, K.R.; Jerrett, M.; Anderson, H.R.; Burnett, R.T.; Stone, V.; Derwent, R.; Thurston, G. Public health benefits of strategies to reduce greenhouse-gas emissions: Health implications of short-lived greenhouse pollutants. Lancet 2009, 374, 2091–2103. [Google Scholar] [CrossRef]
  2. Mohamed, H.; Hassan, A.; Elhag, A. A five-year Study Using Sentinel-5P Data Observing Seasonal Dynamics and Long-term Trends of Atmospheric Pollutants. Int. J. Eng. Geosci. 2025, 10, 262–271. [Google Scholar] [CrossRef]
  3. Bell, M.L.; McDermott, A.; Zeger, S.L.; Samet, J.M.; Dominici, F. Ozone and short-term mortality in 95 US urban communities, 1987–2000. J. Am. Med. Assoc. 2004, 292, 2372–2378. [Google Scholar] [CrossRef] [PubMed]
  4. Bikis, A. Urban air pollution and greenness in relation to public health. J. Environ. Public Health 2023, 2023, 8516622. [Google Scholar] [CrossRef] [PubMed]
  5. Meo, S.A.; Salih, M.A.; Alkhalifah, J.M.; Alsomali, A.H.; Almushawah, A.A. Environmental pollutants particulate matter (PM2.5, PM10), Carbon Monoxide (CO), Nitrogen dioxide (NO2), Sulfur dioxide (SO2), and Ozone (O3) impact on lung functions. J. King Saud Univ.-Sci. 2024, 36, 103280. [Google Scholar] [CrossRef]
  6. Chen, Z.; Liu, N.; Tang, H.; Gao, X.; Zhang, Y.; Kan, H.; Deng, F.; Zhao, B.; Zeng, X.; Zhao, Z.; et al. Health effects of exposure to sulfur dioxide, nitrogen dioxide, ozone, and carbon monoxide between 1980 and 2019: A systematic review and meta-analysis. Indoor Air 2022, 32, e13170. [Google Scholar] [CrossRef]
  7. Seinfeld, J.H.; Pandis, S.N. Atmospheric Chemistry and Physics: From Air Pollution to Climate Change; John Wiley & Sons: Hoboken, NJ, USA, 2016. [Google Scholar]
  8. Yuwati, S.; Gulo, F.; Zulkarnain, M. Air Quality at Urban Transportation Nodes: A Case Study of SO2, NO2, and CO Concentrations in Palembang City. Biosci. J. Ilm. Biol. 2025, 13, 2000–2009. [Google Scholar] [CrossRef]
  9. Lu, X.; Zhang, L.; Shen, L. Meteorology and climate influences on tropospheric ozone: A review of natural sources, chemistry, and transport patterns. Curr. Pollut. Rep. 2019, 5, 238–260. [Google Scholar] [CrossRef]
  10. Levi, Y.; Khain, P.; Broday, D.M. Clustering of emission–dispersion dynamics in a state space defined by pollution sources and meteorological variables. Sci. Total Environ. 2025, 995, 180051. [Google Scholar]
  11. Rahman, A.; Khatun, M.T. Multivariate Analysis of Urban Air Pollution: Clustering and Patterns Across Major Asian Cities. In Proceedings of the 2024 IEEE International Conference on Future Machine Learning and Data Science (FMLDS); IEEE: New York, NY, USA, 2024; pp. 462–467. [Google Scholar]
  12. Heredia, B.I.; Canizo, B.V.; Diedrichs, A.L.; Altamirano, J.C.; Clausen, R.; Martinis, E.M.; Quintas, P.Y. Air quality monitoring in Mendoza, Argentina: Machine learning approaches for PM10 prediction. Environ. Sci. Pollut. Res. 2025, 32, 17511–17527. [Google Scholar] [CrossRef]
  13. Rede, G.D.; Dongre, S.; Kumar, C.R.; Kant, R.; Ramana, V.V.; Rathnamma, M.V. A Novel Approach to Mitigating Air Pollution and Climate Change with Graph Convolutional RNN Models. In Proceedings of the 2025 5th International Conference on Trends in Material Science and Inventive Materials (ICTMIM); IEEE: New York, NY, USA, 2025; pp. 1468–1473. [Google Scholar]
  14. Bernabeo, R.A.; Khalifeh, H.A.; Andretta, A.; Al Khatib, I.T.; Ghori, M.H. The impact of climate change and weather on air transport in the UAE: Reduction of CO2 emissions. In Proceedings of the 2018 Advances in Science and Engineering Technology International Conferences (ASET); IEEE: New York, NY, USA, 2018; pp. 1–7. [Google Scholar]
  15. Jiang, J.; Jin, A. Study on the Dispersion Law of Typical Pollutants in Winter by Complex Geographic Environment Based on the Coupling of GIS and CFD—A Case Study of the Urumqi Region. Appl. Sci. 2025, 15, 2469. [Google Scholar] [CrossRef]
  16. Yang, J.; Wu, L.; Li, R.; Liu, X.; Tao, C. Effects of surrounding urban morphology and wind direction on hazardous gas dispersion from chemical plants: Morphological analysis and fluid dynamics insights into industry–urban interactions. Phys. Fluids 2025, 37, 027103. [Google Scholar] [CrossRef]
  17. Sarkar, P.; Makkena, M.K.; Saha, S.; Saha, M. Analyzing the Severity of Air Pollution in an Industrialized Suburb. In Proceedings of the 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT); IEEE: New York, NY, USA, 2023; pp. 1–6. [Google Scholar]
  18. Zhang, Z.; Zhang, G.; Su, B. The spatial impacts of air pollution and socio-economic status on public health: Empirical evidence from China. Socio-Econ. Plan. Sci. 2022, 83, 101167. [Google Scholar] [CrossRef]
  19. Wang, H.; Xiao, S. Ecological Remediation Measures for Non-point Source Pollution Based on Source-Sink Landscape Theory: A Case Study of Huanghou Basin. J. Landsc. Res. 2024, 16, 72–76. [Google Scholar]
  20. Li, W.; Pacheco-Labrador, J.; Migliavacca, M.; Miralles, D.; Hoek van Dijke, A.; Reichstein, M.; Forkel, M.; Zhang, W.; Frankenberg, C.; Orth, R.; et al. Widespread and complex drought effects on vegetation physiology inferred from space. Nat. Commun. 2023, 14, 4640. [Google Scholar] [CrossRef]
  21. Man, X.; Liu, R.; Zhang, Y.; Yu, W.; Kong, F.; Liu, L.; Luo, Y.; Feng, T. High-spatial resolution ground-level ozone in Yunnan, China: A spatiotemporal estimation based on comparative analyses of machine learning models. Environ. Res. 2024, 251, 118609. [Google Scholar] [CrossRef]
  22. Mazza, A.; Guarino, G.; Scarpa, G.; Yuan, Q.; Vivone, G. Pm2.5 retrieval with sentinel-5p data over europe exploiting deep learning. IEEE Trans. Geosci. Remote Sens. 2025, 63, 5510717. [Google Scholar] [CrossRef]
  23. Alam, M.J.; Karim, I.; Zaman, S.U. Seasonal dynamics and trends in air pollutants: A comprehensive analysis of PM2.5, NO2, CO, SO2 and O3 in Houston, USA. Air Qual. Atmos. Health 2025, 18, 2625–2642. [Google Scholar] [CrossRef]
  24. Huang, H.N.; Yang, Z.; Guo, Y.; Ma, J.J.; Ming, B.W.; Yang, J.; Guo, C.; Li, L.; Ou, C.Q. Impact of agricultural straw open-field burning on concentrations of six criteria air pollutants in China. Environ. Pollut. 2025, 373, 126109. [Google Scholar] [CrossRef]
  25. Sheng, Q.; Ji, Y.; Zhou, C.; Zhang, H.; Zhu, Z. Spatiotemporal variation and pattern analysis of air pollution and its correlation with NDVI in Nanjing City, China: A Landsat-Based study. Forests 2023, 14, 2106. [Google Scholar] [CrossRef]
  26. Wang, X.; Hu, J.; Wang, Z.; Cai, Y.; He, D. Interactive effects of meteorological factors and ambient air pollutants on influenza incidences 2019–2022 in Huaian, China. Infect. Dis. Model. 2025, 10, 1384–1397. [Google Scholar] [CrossRef] [PubMed]
  27. Veefkind, J.P.; Aben, I.; McMullan, K.; Förster, H.; De Vries, J.; Otter, G.; Claas, J.; Eskes, H.J.; de Haan, J.F.; Kleipool, Q.; et al. TROPOMI on the ESA Sentinel-5 Precursor: A GMES mission for global observations of the atmospheric composition for climate, air quality and ozone layer applications. Remote Sens. Environ. 2012, 120, 70–83. [Google Scholar] [CrossRef]
  28. Sun, S.; Chang, Q.; He, J.; Wei, X.; Sun, H.; Xu, Y.; Soares Magalhaes, R.J.; Guo, Y.; Cui, Z.; Zhang, W. The association between air pollutants, meteorological factors and tuberculosis cases in Beijing, China: A seven-year time series study. Environ. Res. 2023, 216, 114581. [Google Scholar] [CrossRef]
  29. Kuang, W.; He, Y.; Luo, P.; Fu, L.; Ni, H.; Zhao, J.; Yi, S.; Zha, W.; Lv, Y. Impact and lag effects of meteorology, air pollution and their interaction on HFRS: A case study of cities in Central China. One Health 2025, 20, 101088. [Google Scholar] [CrossRef] [PubMed]
  30. Kumar, S.; Maharana, P. Air quality and its impact on urban environment. In Urban Ecology; Elsevier: Amsterdam, The Netherlands, 2020; pp. 185–200. [Google Scholar]
  31. Sokhi, R.S.; Moussiopoulos, N.; Baklanov, A.; Bartzis, J.; Coll, I.; Finardi, S.; Friedrich, R.; Geels, C.; Grönholm, T.; Kukkonen, J.; et al. Advances in air quality research–current and emerging challenges. Atmos. Chem. Phys. Discuss. 2021, 22, 4615–4703. [Google Scholar] [CrossRef]
  32. Zhang, H.; Wang, Y.; Hu, J.; Ying, Q.; Hu, X.M. Relationships between meteorological parameters and criteria air pollutants in three megacities in China. Environ. Res. 2015, 140, 242–254. [Google Scholar] [CrossRef]
  33. Liu, Y.; Zhou, Y.; Lu, J. Exploring the relationship between air pollution and meteorological conditions in China under environmental governance. Sci. Rep. 2020, 10, 14518. [Google Scholar] [CrossRef]
  34. Sangeetha, A.; Amudha, T. A computational study on the effect of climatic parameters on air pollution in Coimbatore city. In Proceedings of the 2019 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE); IEEE: New York, NY, USA, 2019; pp. 66–71. [Google Scholar]
  35. Si, X.; Mengersen, K.; Ye, C.; Hu, W. Interactive effect of air pollutant and meteorological factors on seasonal influenza transmission, Shanghai, China. Atmos. Environ. 2024, 318, 120208. [Google Scholar] [CrossRef]
  36. Zou, Z.; Cheng, C.; Shen, S. The complex nonlinear coupling causal patterns between PM2.5 and meteorological factors in Tibetan plateau: A case study in xining. IEEE Access 2021, 9, 150373–150382. [Google Scholar] [CrossRef]
Figure 1. Location of the study area and its topography and geomorphology. The country comprises 34 provincial-level administrative units, and its terrain gradually rises from east to west.
Figure 1. Location of the study area and its topography and geomorphology. The country comprises 34 provincial-level administrative units, and its terrain gradually rises from east to west.
Earth 07 00037 g001
Figure 2. Methodology Work flow.
Figure 2. Methodology Work flow.
Earth 07 00037 g002
Figure 3. (a) Mean spatial distribution of CO across China from 2019 to 2023 and (b) surface temperature across China from 2019 to 2023.
Figure 3. (a) Mean spatial distribution of CO across China from 2019 to 2023 and (b) surface temperature across China from 2019 to 2023.
Earth 07 00037 g003
Figure 4. (a) Mean spatial distribution of NO2 across China from 2019 to 2023 and (b) mean spatial distribution of SO2 across China from 2019 to 2023.
Figure 4. (a) Mean spatial distribution of NO2 across China from 2019 to 2023 and (b) mean spatial distribution of SO2 across China from 2019 to 2023.
Earth 07 00037 g004
Figure 5. (a) Mean spatial distribution of PM2.5 across China from 2019 to 2023 and (b) mean spatial distribution of TCO across China from 2019 to 2023.
Figure 5. (a) Mean spatial distribution of PM2.5 across China from 2019 to 2023 and (b) mean spatial distribution of TCO across China from 2019 to 2023.
Earth 07 00037 g005
Figure 6. (a) Monthly spatial distribution of wind speed across China from 2019 to 2023 and (b) monthly spatial distribution of wind direction across China from 2019 to 2023.
Figure 6. (a) Monthly spatial distribution of wind speed across China from 2019 to 2023 and (b) monthly spatial distribution of wind direction across China from 2019 to 2023.
Earth 07 00037 g006
Figure 7. Monthly spatial distribution of NDVI across China from 2019 to 2023.
Figure 7. Monthly spatial distribution of NDVI across China from 2019 to 2023.
Earth 07 00037 g007
Figure 8. SHAP summary plots illustrating the global feature importance and impact direction for (a) PM2.5, (b) NO2, (c) CO, (d) SO2, and (e) TCO.
Figure 8. SHAP summary plots illustrating the global feature importance and impact direction for (a) PM2.5, (b) NO2, (c) CO, (d) SO2, and (e) TCO.
Earth 07 00037 g008
Figure 9. SHAP dependence plots for the top 6 primary predictors of PM2.5. The plots display the non-linear relationship between feature values (x-axis) and their impact on model output (y-axis) for (a) Temperature, (b) Aspect, (c) Month, (d) Slope, (e) Elevation, and (f) Wind Speed. Color gradients indicate interaction effects with secondary variables (Elevation or Temperature).
Figure 9. SHAP dependence plots for the top 6 primary predictors of PM2.5. The plots display the non-linear relationship between feature values (x-axis) and their impact on model output (y-axis) for (a) Temperature, (b) Aspect, (c) Month, (d) Slope, (e) Elevation, and (f) Wind Speed. Color gradients indicate interaction effects with secondary variables (Elevation or Temperature).
Earth 07 00037 g009
Figure 10. SHAP dependence plots for the top 6 primary predictors of TCO. The plots display the non-linear relationship between feature values (x-axis) and their impact on model output (y-axis) for (a) Aspect, (b) Month, (c) Construction Output, (d) Temperature, (e) Elevation, and (f) Slope. Color gradients indicate interaction effects with secondary variables (Temperature, Month, or Slope).
Figure 10. SHAP dependence plots for the top 6 primary predictors of TCO. The plots display the non-linear relationship between feature values (x-axis) and their impact on model output (y-axis) for (a) Aspect, (b) Month, (c) Construction Output, (d) Temperature, (e) Elevation, and (f) Slope. Color gradients indicate interaction effects with secondary variables (Temperature, Month, or Slope).
Earth 07 00037 g010
Table 1. Summary of Data Types, Sensors, and Spatial Resolutions.
Table 1. Summary of Data Types, Sensors, and Spatial Resolutions.
Data TypeTypeSourceSpatial Resolution
Atmospheric CompositionCOSentinel-5P TROPOMI7 km × 7 km
NO2Sentinel-5P TROPOMI5.5 km × 3.5 km
SO2Sentinel-5P TROPOMI7 km × 5.5 km
TCOSentinel-5P TROPOMI7 km × 3.5 km
PM2.5China High PM2.5 Dataset1 km × 1 km
meteorological factorsSurface TemperatureERA5-Land0.1° (~9 km)
wind speedERA5-Land0.1° (~9 km)
wind directionERA5-Land0.1° (~9 km)
topographic factorsDEMSRTM30 m
SlopeDerived from SRTM DEM30
AspectDerived from SRTM DEM30
socio-economicpopulationStatistical Yearbook/Administrative dataAdministrative unit
industry outputStatistical Yearbook/Administrative dataAdministrative unit
construction outputStatistical Yearbook/Administrative dataAdministrative unit
transport outputStatistical Yearbook/Administrative dataAdministrative unit
total number of vehiclesStatistical Yearbook/Administrative dataAdministrative unit
Ecological Factorsvegetation areaMODIS MOD13A31 km
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Anwer, H.A.; Hu, Y. Multi-Source Integration for Assessing Air Quality Dynamics in China: The Interplay of Anthropogenic Drivers, Meteorology, and Topography. Earth 2026, 7, 37. https://doi.org/10.3390/earth7020037

AMA Style

Anwer HA, Hu Y. Multi-Source Integration for Assessing Air Quality Dynamics in China: The Interplay of Anthropogenic Drivers, Meteorology, and Topography. Earth. 2026; 7(2):37. https://doi.org/10.3390/earth7020037

Chicago/Turabian Style

Anwer, Hossam Aldeen, and Yunfeng Hu. 2026. "Multi-Source Integration for Assessing Air Quality Dynamics in China: The Interplay of Anthropogenic Drivers, Meteorology, and Topography" Earth 7, no. 2: 37. https://doi.org/10.3390/earth7020037

APA Style

Anwer, H. A., & Hu, Y. (2026). Multi-Source Integration for Assessing Air Quality Dynamics in China: The Interplay of Anthropogenic Drivers, Meteorology, and Topography. Earth, 7(2), 37. https://doi.org/10.3390/earth7020037

Article Metrics

Back to TopTop