Next Article in Journal
Three-Dimensional Ice-Flow Recovery from Ascending–Descending DInSAR Pairs and Surface-Parallel Flow Hypothesis: A Simplified Implementation in SNAP Software
Previous Article in Journal
Can Stereoscopic Density Replace Planar Density for Forest Aboveground Biomass Estimation? A Case Study Using Airborne LiDAR and Landsat Data in Daxing’anling, China
Previous Article in Special Issue
Calibration of Short-Wave Infrared Spectrometer for Atmosphere Methane Monitoring
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Deep Learning Methods for Inferring Industrial CO2 Hotspots from Co-Emitted NO2 Plumes

1
Anhui Institute of Optics and Fine Mechanics, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China
2
Science Island Branch of Graduate School, University of Science and Technology of China, Hefei 230026, China
3
Key Laboratory of General Optical Calibration and Characterization Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(7), 1167; https://doi.org/10.3390/rs17071167
Submission received: 7 February 2025 / Revised: 21 March 2025 / Accepted: 21 March 2025 / Published: 25 March 2025

Abstract

:
The “top-down” global stocktake (GST) requires the processing of vast volumes of hyperspectral data to derive emission information, placing greater demands on data processing efficiency. Deep learning, leveraging its strengths in the automated and rapid analysis of image datasets, holds significant potential to enhance the efficiency and effectiveness of data processing in the GST. This paper develops a method for detecting carbon dioxide (CO2) emission hotspots using a convolutional neural network (CNN) with short-lived and co-emitted nitrogen dioxide (NO2) as a proxy. To address the data gaps in model parameter training, we constructed a dataset comprising over 210,000 samples of NO2 plumes and emissions based on atmospheric dispersion models. The trained model performed well on the test set, with most samples achieving an identification accuracy above 80% and more than half exceeding 94%. The trained model was also applied to the NO2 column data from the TROPOspheric Monitoring Instrument (TROPOMI) for hotspot detection, and the detections were compared with the MEIC inventory. The results demonstrate that in high-emission areas, the proposed method successfully identifies emission hotspots with an average accuracy of over 80%, showing a high degree of consistency with the emission inventory. In areas with multiple observations from TROPOMI, we observed a high degree of consistency between high NO2 emission areas and high CO2 emission areas from the Global Open-Source Data Inventory for Anthropogenic CO2 (ODIAC), indicating that high NO2 emission hotspots can also indicate CO2 emission hotspots. In the future, as hyperspectral and high spatial resolution remote sensing data for CO2 and NO2 continue to grow, our methods will play an increasingly important role in global data preprocessing and global emission estimation.

1. Introduction

The Paris Agreement signed by 194 countries and the European Union committed to limiting the increase in global average temperature to below 2 °C above pre-industrial levels by 2100 to combat the dangers of global warming [1]. Compiling greenhouse gas emission inventories is a fundamental task in addressing global warming. Through these inventories, we can identify the main sources of greenhouse gas emissions, assess regional emission statuses, and estimate future mitigation potential, thereby aiding the formulation of relevant emission reduction policies [2,3]. CO2 is the most significant greenhouse gas, accounting for approximately 72% of total anthropogenic greenhouse gas emissions [4]. Currently, there are two main methods for estimating CO2 emissions: “bottom-up” statistical methods and “top-down” retrieval methods based on observed CO2 data [5]. The “top-down” method offers advantages in terms of inventory timeliness, result transparency, and global comparability, making it an internationally recognized approach for carbon verification [6,7]. However, a major challenge with the “top-down” method is distinguishing between natural and anthropogenic CO2 emissions [8,9,10], which makes the accurate determination of anthropogenic emissions difficult.
Industrial activities are the primary source of anthropogenic CO2 emissions, accounting for more than 70% of global CO2 emissions. Therefore, quantifying industrial emissions is a key step in estimating anthropogenic emissions. Point sources are a typical feature of industrial emissions, such as thermal power plants and steel mills, accounting for more than 40% of global annual anthropogenic CO2 emissions [11,12]. Due to its long lifetime, CO2 can remain in the atmosphere for extended periods, undergoing global transport and mixing [13,14]. As a result, atmospheric CO2 exhibits higher background concentrations and smaller gradient variations. This requires extremely high-precision observations to distinguish CO2 concentration changes ( Δ CO2) and enable accurate emission retrieval [15].
Currently, satellites dedicated to the remote sensing of CO2 include GOSAT-1/2, Gaofen-5, Tansat, and OCO-2 [16,17]. However, these satellites primarily observe in a point or strip pattern, with spatial resolutions greater than 1 km, observation intervals exceeding 200 km, and retrieval accuracies of approximately 1–4 ppm [18]. These satellites are suitable for climate research, offering spatial resolutions in the order of hundreds of kilometers and temporal resolutions at the scale of months. However, their coarse spatial resolution and distinct observation methods limit their suitability for point-source monitoring, as they struggle to detect subtle Δ XCO2 and have insufficient observation coverage. In 2019, OCO-3 was successfully launched on the International Space Station (ISS). Similar to OCO-2, OCO-3 features an additional 2-axis pointing mirror assembly (PMA) that can be observed in the Snapshot Area Map (SAM) mode. This upgrade allows for the observation of larger areas (80 km × 80 km), compensating for the narrow observation range of previous carbon monitoring satellites and enabling selective monitoring of high-emission power plants during transit [19,20]. However, OCO-3 still has a relatively coarse spatial resolution (about 1.3 × 2.3 km2), which results in spatial averaging effects on concentration measurements. Furthermore, CO2 concentration changes are influenced by various factors, including transportation, residential areas, and vegetation [21,22]. Therefore, accurately estimating industrial CO2 emissions and eliminating other influencing factors using the “top-down” method based on observed CO2 concentration data remain challenging.
CO2 and NO2 are co-emitted during the high-temperature combustion of fossil fuels [23]. Compared to CO2, the background concentration of NO2 in the atmosphere is relatively low, and anthropogenic emissions, such as those from industrial activities, create a distinct contrast with this background. Additionally, NO2 has active chemical properties and a relatively short lifetime—typically just a few hours—resulting in local concentration clustering. This clustering behavior provides a direct indication of anthropogenic emissions. Lei identified the spatial correlation between the NO2 and CO2 concentration increments from TROPOMI by reconstructing the concentration field using the WRF-Chem model [24]. Based on the NO X /CO2 ratio, Yang indirectly estimated daily CO2 emissions from fossil fuel sources using TROPOMI’s NO2 data [25]. In terms of satellite-based NO2 emission source identification, Finch developed an algorithm that uses TROPOMI’s NO2 data fields to automatically detect emission plumes and infer anthropogenic combustion emissions, achieving the accurate location identification of hotspot emissions [26]. The proposed method is based on a deep learning model, and the initial training data are provided by manual interpretation. However, manual plume identification is subjective, inaccurate, and time-consuming. More importantly, this approach yields qualitative results (i.e., identifying the presence or absence of NO2 plumes). Building on previous research, we advanced this work by not only determining the location of emission sources but also quantifying the emissions. By incorporating atmospheric dispersion models, we took a significant step forward in estimating NO2 emissions using deep learning.
In this study, atmospheric dispersion models were used to simulate NO2 plume-emission scenarios. This not only increases the number of training datasets but also enhances the reliability of model training and enables the trained model to quantitatively detect emissions. A convolutional neural network (CNN) model was constructed to identify NO2 emission sources from the simulated data. The model was then applied to TROPOMI data to identify NO2 emission sources. Finally, the co-emission of NO2 and CO2 in industrial emissions was leveraged to locate anthropogenic CO2 emissions, and the detection results were compared and analyzed with existing inventories.

2. Training Dataset

2.1. NO2 Plume Simulation

Datasets are crucial for training model parameters. However, existing research currently faces a shortage of adequate training data. Due to the chemically reactive nature of atmospheric NO2, with a lifetime of only a few hours, it is often modeled using an empirical Gaussian plume approach [27]. Given our focus on satellite observations, we adopted a vertically integrated Gaussian plume model for NO2, which simulates the NO2 column concentrations observed by satellites like TROPOMI. The model formula is as follows:
G ( x , y ) = Q H ( x ) 2 π u σ ( x ) exp y 2 2 σ ( x ) 2 + V bg ( x , y )
where Q is the NO2 emission rate, u is the wind speed, and V bg ( x , y ) represents the background column concentration. H ( x ) is the Heaviside step function, with coordinates x and y denoting the directions along and across the plume, respectively. And the coordinates x, y are along and across the plume direction. The diffusion width across the plume direction is given by Equation (2).
σ ( x ) = 2 K x κ u
where K is the eddy diffusion coefficient in m 2 / s , representing potential variations in the diffusion rate along the plume, depending on meteorological conditions.
For short-lived NO2, Equation (1) must include an attenuation term.
D ( x , τ ) = H ( x ) exp x u τ
where τ represents the lifetime of the gas, approximately 4 h for NO2.
To adapt the single-point-source model for multi-point-source simulation, we modified this model using the following steps: (1) unifying the coordinate systems; (2) rotating and translating the simulation results of the single-point source; and (3) superimposing the simulation results from multiple point sources.

2.2. Simulation Parameter Selection

To simulate realistic NO2 plumes, we used “true” parameters. NO2 emission is sourced from the Emissions Database for Global Atmospheric Research (EDGAR), while wind field data is obtained from ECMWF Reanalysis v5 (ERA5) meteorological reanalysis data [28]. EDGAR provides gridded data with a spatial resolution of 0.1° × 0.1° and a temporal resolution of months. EDGAR estimates NO2 emissions following IPCC guidelines, ensuring the results are both reliable and comparable with those reported by European member states and the UNFCCC. ERA5 meteorological data provide three-dimensional atmospheric wind reanalysis from the 1940s, with a spatial resolution of 0.25° × 0.25°, and are widely recognized globally. We transformed these data from a geographic coordinate system (in degrees) to a projected coordinate system (in meters). To ensure consistency with the spatial resolution of TROPOMI NO2 data (3.5 × 7 km2) [29], we randomly sampled both the emission and corresponding wind field data at a 4 km spatial resolution within a 64 × 64 pixel grid, covering a total area of 256 × 256 km2.

2.3. Background Noise

The TROPOMI NO2 data contain uncertainties arising from instrument and retrieval errors, necessitating the addition of noise to the simulated plume image to replicate the concentration observed by the instrument [30,31]. Common noise types include uniform noise, salt-and-pepper noise, and Gaussian noise. Uniform noise is randomly and evenly distributed within a specific range, with a mean value close to zero. Salt-and-pepper noise, also known as impulse noise, is characterized by random occurrences of white or black spots, typically caused by sudden strong interference or analog-to-digital conversion errors. In contrast, Gaussian noise follows a probability density function consistent with a normal distribution (Gaussian distribution) and has a well-defined mathematical expression. In satellite remote sensing, many types of noise can be approximated using Gaussian noise. Therefore, Gaussian noise was chosen to construct the “real” noise in this study. The range of the mean and variance of the normal distribution was determined through statistical analysis of TROPOMI concentration field data.
Taking the China as an example, we acquired emission and wind field data from EDGAR and ERA5 for the entire country. We randomly selected emission and corresponding wind field data at a spatial resolution of 4 km. These data were then modeled using a plume model with random noise added. Figure 1 illustrates the NO2 plume shapes for different emission levels, as well as the plume shapes after adding noise. As shown in the figure, when emissions are less than 350 tons/grid/year, the NO2 enhancement caused by the emissions is easily overshadowed by background noise, making the concentration increment nearly undetectable. At emission levels of around 1000 tons/grid/year, the NO2 concentration increment becomes detectable, although it remains influenced by background noise. When emissions exceed 5000 tons/grid/year, the concentration increment is significantly higher than the background noise, resulting in a more distinct plume shape.

3. CNN Networks Incorporating Spatial Neighborhood Information

A convolutional neural network (CNN) is a multi-layer network with a large number of parameters, capable of extracting effective features from input image data and establishing complex relationships. CNNs have broad application prospects in fields such as remote sensing and computer vision. The convolutional neural network (CNN) model in specific applications relies on the configuration of network parameters, which need to be optimized using a training dataset. Through the backpropagation mechanism, these parameters are iteratively adjusted to achieve an optimal configuration. In addition to data, a CNN model requires a foundational network structure. Several classic network architectures, such as U-net, have been developed. For example, Bruno had utilized U-net for CH4 plume detection and emission estimation, highlighting the strong potential of deep learning in CH4 identification and analysis [32]. However, as the depth of CNN models increases, issues such as vanishing gradients become more pronounced during training, affecting the model’s convergence. To address this issue, researchers have adopted residual neural networks (ResNet) [33]. ResNet introduces residual modules and connections, which not only enhance the accuracy and generalization ability of deep neural networks but also mitigate vanishing and exploding gradient problems, thereby accelerating network convergence.
The location of anthropogenic CO2 emission sources can be inferred using NO2 data. The process of identifying emission hotspots is illustrated in Figure 2. First, meteorological and emission data were used to simulate a large number of NO2 plumes. Next, a convolutional neural network (CNN)-based model was constructed, using the simulated plume-emission data as training input to enable learning. Finally, the trained model was applied to TROPOMI observational data to identify emission hotspots.

3.1. Core Network Structure

CNNs can extract pattern information such as edges, textures, and gradient changes in data through convolution operations, making them highly suitable for capturing spatial distribution characteristics in hotspot recognition. This is the primary reason we chose a CNN as the foundation for network design. Additionally, ResNet introduces residual connections, which facilitate better gradient propagation during training and enhance network performance. This is demonstrated as follows:
y = F ( x ) + x
where y represents the output of the residual block, and x is the input of the residual block added via the shortcut connection.
Figure 3 shows the network structure we designed, comprising an input layer, a spatial attention module, a residual neural network block, and an output layer. The spatial attention module identifies effective regions for spatial feature extraction and facilitates the fusion of spatial features from the input data. The 2D convolution (Conv) operations in the figure employ a kernel size of 3 × 3. Additionally, batch normalization (Batch Normalization) and max pooling (MaxPooling2D) are incorporated into each module, though they are not explicitly depicted here. The model’s final output is flattened and passed through a fully connected layer.
The essence of spatial attention is to extract the location information of interest within the network while suppressing invalid information. Assuming the initial input corresponds to the NO2 concentration field data, representing the spatial dimension, the activation function ultimately generates spatial attention by accepting the output from the spatial attention module, as shown in Figure 4. Its mathematical expression is:
F * = F σ M S F
where ⊗ represents the spatial product operator, σ denotes the activation function, and M S F signifies the spatial attention map.

3.2. Normalization and Loss Function

As shown in Figure 2, the designed model can be used to identify emissions based on concentration data. However, in practice, Gaussian diffusion simulations introduce biases, and discrepancies exist between the simulated and actual plumes. Accurately estimating these deviations requires a more advanced atmospheric transport model and long-term observational data. Given the limitations of current conditions, we adopted global normalization as a preliminary approach for quantitative emission estimation using deep learning methods. Global normalization involves using the maximum and minimum emission values across all samples (over 210,000 samples) as the basis for normalization rather than relying on emissions from a single 64 × 64 pixel grid in an individual simulation scenario. This approach offers two key advantages: (1) it ensures that emission identification results are comparable, and (2) more importantly, it reduces the risk of small emissions and weak plumes being mistaken for noise, thus preventing potential instability in model identification.
We not only identified the location of emissions but also quantified their relative magnitudes. Therefore, the mean square error ( M S E ) was employed as the loss function for the regression task, defined as follows:
M S E = Σ ( y i y l ^ ) 2 n
where n is the total number of data points, y i represents the true value of the i-th data point, and y l ^ denotes the predicted value of the i-th data point.

3.3. Training Parameters

Based on the data from 210,000 simulated NO2 plume scenarios and their corresponding emission distributions, the dataset was split into a training set for model training and a test set for evaluation. The resulting network model comprised 2,160,000 parameters. The model’s hyperparameters were configured as follows: the Adam optimizer with an initial learning rate of 10−3, which was dynamically adjusted to 10−5 using the Reduce-on-Plateau strategy; a dropout rate of 0.05; a batch size of 32; and a total of 200 epochs.

4. Results and Discussion

4.1. Application to Tested Dataset

The trained model was applied to the test dataset, which contains more than 36,000 samples. The comparison results are shown in Table 1, and the identification results for single and multiple point sources are presented in Figure 5. In Table 1, the emission strength is represented as a relative value, normalized based on the maximum and minimum values across all samples, including the training set, with a range of 0–1. Recognition accuracy refers to the pixel-level accuracy. For example, if the true emission spans 4 × 4 pixels (16 pixels) and the model recognizes an emission of 3 × 4 pixels (12 pixels), the recognition accuracy is 75%. In the table, 1 QR, 2 QR, and 3 QR represent the first, second, and third quartiles, respectively.
As shown in the table, the 1Q correct recognition rate for all emissions was 80%, with more than half of the samples achieving identification rates above 94%. High-emission sources had higher identification accuracy than low-emission sources. For example, the 1Q recognition rate for low emissions (relative emission values below 0.33) was 79%, whereas for high emissions (above 0.66), more than half of the recognition rates reached 83%. The poorer recognition performance for low emissions is primarily due to the indistinct plume shapes, which are more easily obscured by random noise. Overall, the analysis demonstrated that the model achieved good recognition results on the test set, particularly for high-emission point sources.

4.2. Application of TROPOMI Observation Data

Although the model performs well on the test set, how does it perform on actual observational data? To evaluate this, we used TROPOMI NO2 data to detect emission source locations. We first tested the model’s performance for large point-source emissions and compared the results with those of the EDGAR data. The outcomes are shown in Figure 6, which displays data from Wuhan on 1 February 2019 (First row), Taiyuan on 4 February (Second row), and Baotou–Hohhot on 5 February (Third row). Overall, the identified emission sources aligned well with the high-value areas in the observed plumes, indicating reasonable results. The results indicate discrepancies between the high emission values in the EDGAR data and the observed plume concentrations. For example, in Wuhan, the EDGAR data indicate two high-emission centers, but the observed NO2 results show only one. Similarly, in Taiyuan, the EDGAR data suggested a high-emission center in the lower central area, but no corresponding high-concentration region was observed in the TROPOMI data. A similar pattern was observed in the Baotou–Hohhot region. The second column of Figure 6, showing the emission sources identified for the NO2 plume data, demonstrates that our method effectively identifies high-emission areas, with high-value regions corresponding well to high-concentration areas. We examined the geographical conditions of high-concentration areas and identified high-emission sources such as thermal power plants, as shown in Figure 7.
It should be noted that the model’s ability to detect weak emission areas is limited. This is because small increases in the NO2 concentration caused by weak emissions can be easily confused with observational errors, resulting in unclear plume shapes.
Through analysis, we conclude that the EDGAR inventory lacks timeliness, whereas our model can detect high NO2 emission sources more accurately and promptly using TROPOMI data. This capability allows for timely updates to the location of emission sources. Furthermore, we extended our application to cover the entire country and compared the detection results in existing inventories, such as MEIC. Figure 8 shows the number of TROPOMI observations for each grid in 2019. A complete observation is defined as covering more than 75% of the grid area. The Beijing–Tianjin–Hebei region has the highest observation frequency, with more than 150 observations annually (over 40% of the days). In contrast, there are fewer observations in the western regions, especially in Sichuan and Tibet, because of the complex terrain, high altitude, and low NO2 content, making detection more challenging. Figure 9a presents the spatial distribution of NO2 emissions from MEIC, showing high emission densities in the Yangtze River Delta and the Beijing–Tianjin–Hebei region. In contrast, the NO2 emissions from Xinjiang, the three northeastern provinces, and Southern China are relatively sparse and scattered. Figure 9b shows the NO2 emission hotspots detected by the model, excluding detections with relative emission intensities below 0.2 because of poor performance in identifying weak plumes. The results indicate that emission hotspots are concentrated in the Yangtze River Delta and the Beijing–Tianjin–Hebei region, while emissions in Xinjiang, the three northeastern provinces, and Southern China are more scattered, consistent with MEIC observations.
According to statistics, the MEIC emission inventory includes 19 emission hotspots with emissions exceeding 5000 tons/grid (Refer to Table 2). Our model successfully identified 17 of these, achieving an accuracy rate of 89%. For hotspots with emissions between 4000–5000 tons/grid, the identification rate was 100%. For hotspots with emissions between 2000–4000 tons/grid, 64 out of 82 were correctly identified, yielding a 78% accuracy rate. However, for hotspots with emissions below 2000 tons/grid, the accuracy dropped further, with 143 out of 212 grids correctly identified, yielding a 67% accuracy rate. Similar to the performance on the test set, the lower accuracy in detecting weaker plumes is mainly due to confusion with observational errors and differences in plume characteristics from the training set. To improve the model’s performance, a more in-depth analysis of the observed NO2 plume error structures could be conducted, and these insights could be integrated into the simulation data.
Compared to previous qualitative models that only detect the presence or absence of a plume, our proposed model advances quantification by identifying the relative emission intensity. However, this quantification is still preliminary because the conversion of the NO2 concentration to the emission magnitude is inevitably influenced by simulation errors. To improve the reliability of the simulation results, future work could involve enhancing the simulation mechanism, for instance, by using regional atmospheric transport simulations that account for more factors, leading to more realistic results. Additionally, integrating observational data into the model and assimilating them into the simulation can further improve the accuracy of the concentration-to-emission conversion. Moreover, the weak plume concentrations are easily confused by errors, highlighting the need for further research on weak-plume emission monitoring.

4.3. Correlation Analysis of Industrial NO2 and CO2 Emissions

During the high-temperature combustion of fossil fuels, both NO2 and CO2 are released simultaneously. Integrating NO2 data with CO2 emission estimates can enhance the available observational data and improve estimation accuracy. Currently, remote sensing satellites for point-source CO2 monitoring, such as Tansat-2 (scheduled for launch in 2025) and CO2M, are still in the development or planning stages, resulting in insufficient observational data for joint satellite remote sensing analysis of CO2 and NO2. However, with the successful launch of these satellites, collaborative research on CO2 and NO2 emissions is expected to become an emerging research focus.
The detected NO2 emission hotspot areas overlap with the CO2 hotspot areas to some extent. To analyze this, we combined the data with ODIAC to assess their correlation. ODIAC includes emission sources such as thermal power plants and households, with a spatial resolution of 1 km × 1 km. First, we identified and plotted further CO2 emission points in the ODIAC, with a monthly emission of approximately 100,000 tons/grid/month, as shown in Figure 10. The red dots in the figure represent high-emission sources in the ODIAC, while the blue boxes highlight areas where NO2 hotspots overlap with CO2 hotspots. Overall, there are about 252 high-emission CO2 sources in the ODIAC and 102 areas where NO2 and CO2 hotspots overlap, accounting for approximately 40% of the total. In the East China region, the overlap between NO2 and CO2 hotspots is more prominent, accounting for over 60% of the cases, whereas the identification performance is weaker in the southwest region. Combining Figure 8, the main reasons for the missed detections are: (1) This is related to the effective coverage of the original data. There were fewer observations in the southwest region, whereas the observation frequency was higher in East China and Guangdong. As a result, more NO2 plumes and CO2 hotspots that meet stringent screening conditions were detected in East China and Guangdong, whereas fewer were detected in the southwest region. (2) The detection algorithm may not be sufficiently accurate for weak plumes, potentially leading to the omission of weak emissions and, consequently, the neglect of some emission hotspots.
Overall, our detection method performed well at identifying NO2 emission hotspots under high-emission and high-frequency observation conditions, with a good degree of overlap with CO2 emission hotspots. However, in low-frequency observation and weak emission scenarios, the algorithm’s sensitivity is insufficient, highlighting the need for further improvements to enhance its ability to detect weak plumes.

4.4. Limitations

Deep learning methods have great potential for the global carbon stocktake. In this study, we propose a deep-learning-based approach for identifying industrial CO2 emission point sources, achieving the preliminary identification of emission locations and their relative magnitudes. However, due to the current limited understanding of instrument errors, inversion errors, and atmospheric transport simulation errors, it is challenging to accurately account for their impact when estimating absolute emissions. In future work, we will conduct an in-depth study of various error characteristics and propagation patterns, quantify their impact on emission estimates, and provide uncertainty ranges alongside emission estimates. In the future, we plan to apply this model to a longer time series of TROPOMI data to assess improvements in China‘s emission reduction policies by analyzing changes in detected emission hotspots.

5. Conclusions

To overcome the current limitations in available observational data for global CO2 emission hotspot monitoring, and leveraging the co-emission of NO2 and CO2 from industrial sources, we propose a method for detecting NO2 hotspots using convolutional neural networks (CNNs). This approach provides valuable prior information for detecting CO2 emission hotspots. The key contributions of this study are as follows: (1) We utilized atmospheric dispersion models to generate a wide range of simulation scenarios, creating training data for NO2 plume detection and addressing the issue of missing samples in existing NO2 hotspot detection. (2) We developed a suitable network architecture based on a convolutional neural network model for hotspot detection and applied it to satellite-observed data. (3) We analyzed the correlation between NO2 emission hotspots and CO2 hotspots. The results demonstrate that our algorithm effectively identifies emission hotspots in high NO2 emission areas under multiple observation conditions, achieving an overlap rate of over 80% with existing NO2 emission inventories. In East China, where observations are more frequent, significant overlap is observed between NO2 and CO2 emission hotspots, with NO2 hotspots acting as indicators for CO2 emission regions. However, the study also reveals some limitations, particularly in the detection sensitivity for weak NO2 emission sources, where the performance of the detection algorithm is constrained. This represents an area for further improvement in future work.
When using Tansat-2 and CO2M for point-source and urban CO2 remote sensing monitoring, a large volume of collaborative CO2 and NO2 observation data will be generated. The monitoring methods developed using deep learning models provide fast and reliable reference information for preliminary data analysis and screening, enabling efficient management of the anticipated data growth. Additionally, this approach offers a technical solution for transitioning from “concentration-to-emission” estimates based on atmospheric transport models to those derived from deep learning models.

Author Contributions

Conceptualization, E.S. and S.W.; methodology, E.S., S.W. and X.W.; software, C.L. and H.Y.; validation, C.L. and S.W.; formal analysis, C.L., X.W. and Y.A.; investigation, Y.A.; resources, X.W., H.Y., H.S. and C.L.; data curation, C.L., S.W. and Y.A.; writing—original draft preparation, C.L.; writing—review and editing, E.S. and S.W.; visualization, E.S., S.W. and Y.A.; supervision, S.W. and C.L.; project administration, X.W., H.Y., H.S. and C.L.; funding acquisition, X.W., H.Y., H.S. and S.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the National Natural Science Foundation of China (NSFC) Young Scientist Fund (42205146) and in part by the National Key R&D Program of China (2021YFE0118000). This research was also supported in part by the Natural Science Foundation of Anhui Province-Jianghuai Meteorological Joint Fund (2408055UQ003).

Data Availability Statement

Data available on request due to privacy restrictions.

Acknowledgments

The Gaussian plume model provided by Gerrit Kuhlmann is acknowledged with our gratitude. The authors acknowledge the European Space Agency for TROPOMI NO2 data.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kikstra, J.S.; Nicholls, Z.R.; Smith, C.J.; Lewis, J.; Lamboll, R.D.; Byers, E.; Sandstad, M.; Meinshausen, M.; Gidden, M.J.; Rogelj, J. The IPCC Sixth Assessment Report WGIII climate assessment of mitigation pathways: From emissions to global temperatures. Geosci. Model Dev. 2022, 15, 9075–9109. [Google Scholar] [CrossRef]
  2. Pirani, A.; Fuglestvedt, J.S.; Byers, E.; O Neill, B.; Riahi, K.; Lee, J.; Marotzke, J.; Rose, S.K.; Schaeffer, R.; Tebaldi, C. Scenarios in IPCC assessments: Lessons from AR6 and opportunities for AR7. NPJ Clim. Action 2024, 3, 1. [Google Scholar] [CrossRef]
  3. Arioli, M.S.; Márcio De Almeida, D.; Amaral, F.G.; Cybis, H.B.B. The evolution of city-scale GHG emissions inventory methods: A systematic review. Environ. Impact Assess. Rev. 2020, 80, 106316. [Google Scholar]
  4. Olivier, J.G.; Schure, K.M.; Peters, J. Trends in global CO2 and total greenhouse gas emissions. PBL Neth. Environ. Assess. Agency 2017, 5, 1–11. [Google Scholar]
  5. Bellassen, V.; Stephan, N.; Afriat, M.; Alberola, E.; Barker, A.; Chang, J.; Chiquet, C.; Cochran, I.; Deheza, M.; Dimopoulos, C. Monitoring, reporting and verifying emissions in the climate economy. Nat. Clim. Change 2015, 5, 319–328. [Google Scholar]
  6. Zheng, B.; Chevallier, F.; Ciais, P.; Broquet, G.; Wang, Y.; Lian, J.; Zhao, Y. Observing carbon dioxide emissions over China’s cities and industrial areas with the Orbiting Carbon Observatory-2. Atmos. Chem. Phys. 2020, 20, 8501–8510. [Google Scholar]
  7. Bergamaschi, P.; Danila, A.; Weiss, R.F.; Ciais, P.; Thompson, R.L.; Brunner, D.; Levin, I.; Meijer, Y.; Chevallier, F.; Janssens-Maenhout, G. Atmospheric Monitoring and Inverse Modelling for Verification of Greenhouse Gas Inventories; Publications Office of the European Union: Luxembourg, 2018; ISBN 978-92-79-88939-4. [Google Scholar]
  8. Turnbull, J.C.; Sweeney, C.; Karion, A.; Newberger, T.; Lehman, S.J.; Tans, P.P.; Davis, K.J.; Lauvaux, T.; Miles, N.L.; Richardson, S.J. Toward quantification and source sector identification of fossil fuel CO2 emissions from an urban area: Results from the INFLUX experiment. J. Geophys. Res. Atmos. 2015, 120, 292–312. [Google Scholar]
  9. Liu, F.; Duncan, B.N.; Krotkov, N.A.; Lamsal, L.N.; Beirle, S.; Griffin, D.; McLinden, C.A.; Goldberg, D.L.; Lu, Z. A methodology to constrain carbon dioxide emissions from coal-fired power plants using satellite observations of co-emitted nitrogen dioxide. Atmos. Chem. Phys. 2020, 20, 99–116. [Google Scholar] [CrossRef]
  10. Wang, S.; Zhang, Y.; Hakkarainen, J.; Ju, W.; Liu, Y.; Jiang, F.; He, W. Distinguishing anthropogenic CO2 emissions from different energy intensive industrial sources using OCO-2 observations: A case study in northern China. J. Geophys. Res. Atmos. 2018, 123, 9462–9473. [Google Scholar]
  11. Nassar, R.; Hill, T.G.; McLinden, C.A.; Wunch, D.; Jones, D.B.; Crisp, D. Quantifying CO2 emissions from individual power plants from space. Geophys. Res. Lett. 2017, 44, 10,045–10,053. [Google Scholar] [CrossRef]
  12. Li, Y.; Yang, X.; Du, E.; Liu, Y.; Zhang, S.; Yang, C.; Zhang, N.; Liu, C. A review on carbon emission accounting approaches for the electricity power industry. Appl. Energy. 2024, 359, 122681. [Google Scholar] [CrossRef]
  13. Buchwitz, M.; Reuter, M.; Noël, S.; Bramstedt, K.; Schneising, O.; Hilker, M.; Fuentes Andrade, B.; Bovensmann, H.; Burrows, J.P.; Di Noia, A. Can a regional-scale reduction of atmospheric CO2 during the COVID-19 pandemic be detected from space? A case study for East China using satellite XCO2 retrievals. Atmos. Meas. Tech. 2021, 14, 2141–2166. [Google Scholar]
  14. Weir, B.; Crisp, D.; O Dell, C.W.; Basu, S.; Chatterjee, A.; Kolassa, J.; Oda, T.; Pawson, S.; Poulter, B.; Zhang, Z. Regional impacts of COVID-19 on carbon dioxide detected worldwide from space. Sci. Adv. 2021, 7, eabf9415. [Google Scholar] [PubMed]
  15. Li, T.; Zheng, X.; Liu, X.; Zhang, H.; Grieneisen, M.L.; He, C.; Ji, M.; Zhan, Y.; Yang, F. Enhancing Space-Based Tracking of Fossil Fuel CO2 Emissions via Synergistic Integration of OCO-2, OCO-3, and TROPOMI Measurements. Environ. Sci. Technol. 2025, 59, 1587–1597. [Google Scholar] [PubMed]
  16. Feldman, A.F.; Zhang, Z.; Yoshida, Y.; Chatterjee, A.; Poulter, B. Using OCO-2 column CO2 retrievals to rapidly detect and estimate biospheric surface carbon flux anomalies. Atmos. Chem. Phys. 2022, 23, 1545–1563. [Google Scholar]
  17. Ye, H.; Shi, H.; Wang, X.; Sun, E.; Li, C.; An, Y.; Wu, S.; Xiong, W.; Li, Z.; Landgraf, J. Improving atmospheric CO2 retrieval based on the collaborative use of Greenhouse gases Monitoring Instrument and Directional Polarimetric Camera sensors on Chinese hyperspectral satellite GF5-02. Geo-Spat. Inf. Sci. 2024, 27, 572–584. [Google Scholar]
  18. Taylor, T.E.; O’Dell, C.W.; Baker, D.; Bruegge, C.; Chang, A.; Chapsky, L.; Chatterjee, A.; Cheng, C.; Chevallier, F.; Crisp, D. Evaluating the consistency between OCO-2 and OCO-3 XCO2 estimates derived from the NASA ACOS version 10 retrieval algorithm. Atmos. Meas. Tech. Discuss. 2023, 2023, 31182. [Google Scholar]
  19. Eldering, A.; Taylor, T.E.; O’Dell, C.W.; Pavlick, R. The OCO-3 mission: Measurement objectives and expected performance based on 1 year of simulated data. Atmos. Meas. Tech. 2019, 12, 2341–2370. [Google Scholar]
  20. Yang, Y.; Zhou, M.; Wang, W.; Ning, Z.; Zhang, F.; Wang, P. Quantification of CO2 Emissions from Three Power Plants in China Using OCO-3 Satellite Measurements. Adv. Atmos. Sci. 2024, 41, 2276–2288. [Google Scholar]
  21. Verhulst, K.R.; Karion, A.; Kim, J.; Salameh, P.K.; Keeling, R.F.; Newman, S.; Miller, J.; Sloop, C.; Pongetti, T.; Rao, P.; et al. Carbon dioxide and methane measurements from the Los Angeles Megacity Carbon Project—Part 1: Calibration, urban enhancements, and uncertainty estimates. Atmos. Chem. Phys. 2017, 17, 8313–8341. [Google Scholar]
  22. Karion, A.; Lopez-Coto, I.; Gourdji, S.M.; Mueller, K.; Ghosh, S.; Callahan, W.; Stock, M.; DiGangi, E.; Prinzivalli, S.; Whetstone, J. Background conditions for an urban greenhouse gas network in the Washington, DC, and Baltimore metropolitan region. Atmos. Chem. Phys. 2021, 21, 6257–6273. [Google Scholar]
  23. Hakkarainen, J.; Szeląg, M.E.; Ialongo, I.; Retscher, C.; Oda, T.; Crisp, D. Analyzing nitrogen oxides to carbon dioxide emission ratios from space: A case study of Matimba Power Station in South Africa. Atmos. Environ. X 2021, 10, 100110. [Google Scholar]
  24. Lei, R.; Feng, S.; Xu, Y.; Tran, S.; Ramonet, M.; Grutter, M.; Garcia, A.; Campos-Pineda, M.; Lauvaux, T. Reconciliation of asynchronous satellite-based NO2 and XCO2 enhancements with mesoscale modeling over two urban landscapes. Remote Sens. Environ. 2022, 281, 113241. [Google Scholar]
  25. Yang, E.G.; Kort, E.A.; Ott, L.E.; Oda, T.; Lin, J.C. Using space-based CO2 and NO2 observations to estimate urban CO2 emissions. J. Geophys. Res. 2023, 128, e2022JD037736. [Google Scholar] [CrossRef]
  26. Finch, D.P.; Palmer, P.I.; Zhang, T. Automated detection of atmospheric NO2 plumes from satellite data: A tool to help infer anthropogenic combustion emissions. Atmos. Meas. Tech. 2022, 15, 721–733. [Google Scholar]
  27. Kuhlmann, G.; Koene, E.F.M.; Meier, S.; Santaren, D.; Broquet, G.; Chevallier, F.; Hakkarainen, J.; Nurmela, J.; Amorós, L.; Tamminen, J.; et al. The ddeq Python library for point source quantification from remote sensing images (Version 1.0). Geosci. Model Dev. 2024, 17, 4773–4789. [Google Scholar]
  28. Soci, C.; Hersbach, H.; Simmons, A.; Poli, P.; Bell, B.; Berrisford, P.; Horányi, A.; Muñoz Sabater, J.; Nicolas, J.; Radu, R. The ERA5 global reanalysis from 1940 to 2022. Q. J. R. Meteorol. Soc. 2024, 150, 4014–4048. [Google Scholar]
  29. Van Geffen, J.; Boersma, K.F.; Eskes, H.; Sneep, M.; Ter Linden, M.; Zara, M.; Veefkind, J.P. S5P TROPOMI NO2 slant column retrieval: Method, stability, uncertainties and comparisons with OMI. Atmos. Meas. Tech. 2020, 13, 1315–1335. [Google Scholar]
  30. Bauwens, M.; Compernolle, S.; Stavrakou, T.; Müller, J.F.; Van Gent, J.; Eskes, H.; Levelt, P.F.; Van Der A, R.; Veefkind, J.P.; Vlietinck, J. Impact of coronavirus outbreak on NO2 pollution assessed using TROPOMI and OMI observations. Geophys. Res. Lett. 2020, 47, e2020GL087978. [Google Scholar]
  31. Radman, A.; Mahdianpari, M.; Varon, D.J.; Mohammadimanesh, F. S2MetNet: A novel dataset and deep learning benchmark for methane point source quantification using Sentinel-2 satellite imagery. Remote Sens. Environ. 2023, 295, 113708. [Google Scholar]
  32. Bruno, J.H.; Jervis, D.; Varon, D.J.; Jacob, D.J. U-Plume: Automated algorithm for plume detection and source quantification by satellite point-source imagers. Atmos. Meas. Tech. 2024, 17, 2625–2636. [Google Scholar]
  33. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
Figure 1. Simulated NO2 plumes corresponding to different emission levels.
Figure 1. Simulated NO2 plumes corresponding to different emission levels.
Remotesensing 17 01167 g001
Figure 2. Emission hotspot identification process based on CNN.
Figure 2. Emission hotspot identification process based on CNN.
Remotesensing 17 01167 g002
Figure 3. Core structure of the ResNet-based network.
Figure 3. Core structure of the ResNet-based network.
Remotesensing 17 01167 g003
Figure 4. Spatial attention mechanism.
Figure 4. Spatial attention mechanism.
Remotesensing 17 01167 g004
Figure 5. NO2 emission detection results of test set.
Figure 5. NO2 emission detection results of test set.
Remotesensing 17 01167 g005
Figure 6. Emission detection applied to TROPOMI observation data: the first column shows TROPOMI NO2 observation data; the second column shows the emission sources identified by the model; the third column shows the results from EDGAR data.
Figure 6. Emission detection applied to TROPOMI observation data: the first column shows TROPOMI NO2 observation data; the second column shows the emission sources identified by the model; the third column shows the results from EDGAR data.
Remotesensing 17 01167 g006
Figure 7. The surface conditions of the three detection cases. The number 1 refers to position 1; The number 2 refers to position 2.
Figure 7. The surface conditions of the three detection cases. The number 1 refers to position 1; The number 2 refers to position 2.
Remotesensing 17 01167 g007
Figure 8. The annual effective observation count of TROPOMI.
Figure 8. The annual effective observation count of TROPOMI.
Remotesensing 17 01167 g008
Figure 9. (a) The spatial distribution of NO2 emissions in MEIC, unit: tons/grid. (b) The NO2 emission hotspots detected by the model we proposed.
Figure 9. (a) The spatial distribution of NO2 emissions in MEIC, unit: tons/grid. (b) The NO2 emission hotspots detected by the model we proposed.
Remotesensing 17 01167 g009
Figure 10. The spatial distribution of CO2 and NO2 emission hotspots. Red dots representing CO2 emission hotspots and blue dots representing detected NO2 emission hotspots.
Figure 10. The spatial distribution of CO2 and NO2 emission hotspots. Red dots representing CO2 emission hotspots and blue dots representing detected NO2 emission hotspots.
Remotesensing 17 01167 g010
Table 1. The identification performance of the model on the tested datasets.
Table 1. The identification performance of the model on the tested datasets.
Emission IntensityDetection Accuracy (Top 25%, 1Q)Detection Accuracy (Top 50%, 2Q)Detection Accuracy (Top 75%, 3Q)Count
All80%94%100%36,938
Low79%100%100%26,264
Moderate81%89%100%8222
High83%92%100%2452
Note: Emission intensity (normalized [0, 1]), All means (0 < emission ≤ 1); Low means (emission < 0.33); Moderate means (0.33 ≤ emission < 0.66); High means (emission ≥ 0.66).
Table 2. Identification results of NO2 hotspot.
Table 2. Identification results of NO2 hotspot.
Emission (Tons/Grid)Count of MEICCount of IdentificationsIdentifications Accuracy
>5000191789%
4000–500099100%
2000–4000826478%
1000–200021214367%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sun, E.; Wu, S.; Wang, X.; Ye, H.; Shi, H.; An, Y.; Li, C. Deep Learning Methods for Inferring Industrial CO2 Hotspots from Co-Emitted NO2 Plumes. Remote Sens. 2025, 17, 1167. https://doi.org/10.3390/rs17071167

AMA Style

Sun E, Wu S, Wang X, Ye H, Shi H, An Y, Li C. Deep Learning Methods for Inferring Industrial CO2 Hotspots from Co-Emitted NO2 Plumes. Remote Sensing. 2025; 17(7):1167. https://doi.org/10.3390/rs17071167

Chicago/Turabian Style

Sun, Erchang, Shichao Wu, Xianhua Wang, Hanhan Ye, Hailiang Shi, Yuan An, and Chao Li. 2025. "Deep Learning Methods for Inferring Industrial CO2 Hotspots from Co-Emitted NO2 Plumes" Remote Sensing 17, no. 7: 1167. https://doi.org/10.3390/rs17071167

APA Style

Sun, E., Wu, S., Wang, X., Ye, H., Shi, H., An, Y., & Li, C. (2025). Deep Learning Methods for Inferring Industrial CO2 Hotspots from Co-Emitted NO2 Plumes. Remote Sensing, 17(7), 1167. https://doi.org/10.3390/rs17071167

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop