This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Product Carbon Footprint Emission Factor Matching Algorithm Based on Large Language Models and Semantic Retrieval
by
Jiawei Wen
Jiawei Wen 1,*
,
Chengxin Pang
Chengxin Pang 1
,
Yanxin Wang
Yanxin Wang 1 and
Xinhua Zeng
Xinhua Zeng 2
1
Faculty of Artificial Intelligence, Shanghai University of Electric Power, Shanghai 201306, China
2
Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
*
Author to whom correspondence should be addressed.
Sustainability 2026, 18(11), 5444; https://doi.org/10.3390/su18115444 (registering DOI)
Submission received: 12 April 2026
/
Revised: 18 May 2026
/
Accepted: 25 May 2026
/
Published: 28 May 2026
Abstract
Emission factor matching is the most critical step in product carbon footprint (PCF) accounting based on life cycle assessment (LCA). However, this step has long been hindered by several major challenges: a lack of standardization, overreliance on expert judgment, inconsistencies in raw data, and complex processing workflows. To address these issues, this study proposes an automated emission factor matching algorithm that combines large language models (LLMs) with semantic retrieval. The algorithm proceeds in two stages: first, an LLM identifies the reference product within the LCA database; then, an embedding model retrieves the most relevant emission factors through high-precision matching. Depending on practical requirements, the algorithm can either automatically select a single best-match factor or rank multiple best-match candidates in descending order of match precision to assist LCA experts in decision-making. We evaluate the algorithm on eight industrial products—tires, cement, ammonium phosphate, wood products, textiles, electronics and electrical appliances, steel, and lithium batteries—using the Ecoinvent 3.10 LCA database. Results demonstrate that the algorithm achieves high precision and low processing latency, significantly outperforming manual expert screening. These findings confirm that the proposed algorithm enables efficient and accurate emission factor matching, thereby providing a reliable technical solution and decision-making pathway for large-scale, automated PCF accounting.
Share and Cite
MDPI and ACS Style
Wen, J.; Pang, C.; Wang, Y.; Zeng, X.
Product Carbon Footprint Emission Factor Matching Algorithm Based on Large Language Models and Semantic Retrieval. Sustainability 2026, 18, 5444.
https://doi.org/10.3390/su18115444
AMA Style
Wen J, Pang C, Wang Y, Zeng X.
Product Carbon Footprint Emission Factor Matching Algorithm Based on Large Language Models and Semantic Retrieval. Sustainability. 2026; 18(11):5444.
https://doi.org/10.3390/su18115444
Chicago/Turabian Style
Wen, Jiawei, Chengxin Pang, Yanxin Wang, and Xinhua Zeng.
2026. "Product Carbon Footprint Emission Factor Matching Algorithm Based on Large Language Models and Semantic Retrieval" Sustainability 18, no. 11: 5444.
https://doi.org/10.3390/su18115444
APA Style
Wen, J., Pang, C., Wang, Y., & Zeng, X.
(2026). Product Carbon Footprint Emission Factor Matching Algorithm Based on Large Language Models and Semantic Retrieval. Sustainability, 18(11), 5444.
https://doi.org/10.3390/su18115444
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.