This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Fusing Multi-Source Data with Machine Learning for Ship Emission Calculation in Inland Waterways
by
Chao Wang
Chao Wang 1,2,3
,
Hao Wu
Hao Wu 4,* and
Zhirui Ye
Zhirui Ye 2,3,5
1
School of Network &Communication Engineering, Jinling Institute of Technology, Nanjing 211169, China
2
Jiangsu Key Laboratory of Digital Intelligent Low-Carbon Transportation, Jinling Institute of Technology, Nanjing 211169, China
3
School of Transportation, Southeast University, Nanjing 211189, China
4
Business School, Jinling Institute of Technology, Nanjing 211169, China
5
School of Transportation Management, Nanjing Vocational Institute of Railway Technology, Nanjing 210031, China
*
Author to whom correspondence should be addressed.
Atmosphere 2026, 17(1), 72; https://doi.org/10.3390/atmos17010072 (registering DOI)
Submission received: 11 December 2025
/
Revised: 7 January 2026
/
Accepted: 7 January 2026
/
Published: 9 January 2026
Abstract
Accurate estimation of ship emissions is essential for the effective enforcement of emission control policies in inland waterways. However, existing “bottom-up” models face significant challenges owing to severe data scarcity for inland ships, particularly regarding ship static parameters. This study proposes a novel data fusion and machine learning framework to address this issue. The methodology integrates real-time SO2 and CO2 pollutant concentrations on the Nanjing Dashengguan Yangtze River Bridge, Automatic Identification System (AIS) data, and meteorological information. To address the scarcity of design data for inland ships, web scraping was used to extract basic parameters, which were then used to train five machine learning models. Among them, the XGBoost model demonstrated superior performance in predicting the main engine rated power. A refined activity-based emission model combines these predicted parameters, ship operational profiles, and specific emission factors to calculate real-time emission source strengths. Furthermore, the model was validated against field measurements by comparing the calculated and measured emission source strengths from ships, demonstrating high predictive accuracy with R2 values of 0.980 for SO2 and 0.977 for CO2, and MAPE below 13%. This framework provides a reliable and scalable approach for real-time emission monitoring and supports regulatory enforcement in inland waterways.
Share and Cite
MDPI and ACS Style
Wang, C.; Wu, H.; Ye, Z.
Fusing Multi-Source Data with Machine Learning for Ship Emission Calculation in Inland Waterways. Atmosphere 2026, 17, 72.
https://doi.org/10.3390/atmos17010072
AMA Style
Wang C, Wu H, Ye Z.
Fusing Multi-Source Data with Machine Learning for Ship Emission Calculation in Inland Waterways. Atmosphere. 2026; 17(1):72.
https://doi.org/10.3390/atmos17010072
Chicago/Turabian Style
Wang, Chao, Hao Wu, and Zhirui Ye.
2026. "Fusing Multi-Source Data with Machine Learning for Ship Emission Calculation in Inland Waterways" Atmosphere 17, no. 1: 72.
https://doi.org/10.3390/atmos17010072
APA Style
Wang, C., Wu, H., & Ye, Z.
(2026). Fusing Multi-Source Data with Machine Learning for Ship Emission Calculation in Inland Waterways. Atmosphere, 17(1), 72.
https://doi.org/10.3390/atmos17010072
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.