Artificial Intelligence- and Machine Learning-Driven Strategies for Catalyst Design and Sustainable Chemical Processes
Abstract
1. Introduction
1.1. Advances and Challenges in Modern Catalysis
1.2. The Central Role of Catalysis and the Complexity of Catalyst Design
1.3. Catalysis for Sustainable and Renewable Technologies
1.4. Computational, Digital, and Data Infrastructure for Modern Catalysis
2. Artificial Intelligence in Chemical Discovery and Engineering
2.1. Advances in AI and ML for Catalyst Design, Reaction Optimization, and Multi-Scale Process Engineering
2.2. AI in Catalyst Design and High-Throughput Platforms
2.3. Machine Learning and Vibrational Spectroscopy for Mechanistic Insights
2.4. ML for Nitrogen Reduction Reaction (NRR) Dual-Atom Catalysts
2.5. SHAP Feature Analysis and Workflow for Dual-Atom Catalysts
2.6. MLPs for Heterogeneous Catalysis
2.7. ML in Photocatalysis
2.8. QMOF Database for MOFs
2.9. ML and LLMs in MOF Development
3. Artificial Intelligence and Machine Learning in Photocatalysis and Catalyst Design for CO2 Conversion
3.1. AI-Driven Databases and Reaction Prediction in Photocatalysis
3.2. Data-Driven Kinetic Modeling and Mechanistic Insights
3.3. Machine Learning for Photocatalytic CO2 Reduction
3.4. ML in CO2 Capture
3.5. ML Applications in CO2 Capture Across Scales
3.6. AI-Enhanced Catalyst Design for Renewable Energy Applications
4. Ethical Issues, Safety Risks and Responsible Application of Generative AI
5. Challenges and Future Research Directions
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Baldea, M.; Broadbelt, L.J.; Ierapetritou, M.G.; Kwan, T.A.; Li, C.; Luo, Z.-H.; Ma, X.; Morbidelli, M.; Sahu, K.C.; Scurto, A.M.; et al. 2024 in retrospective: Trends in chemical engineering. Ind. Eng. Chem. Res. 2025, 64, 11615–11623. [Google Scholar] [CrossRef]
- Chakkingal, A.; Pirro, L.; Costa da Cruz, A.R.; Barrios, A.J.; Virginie, M.; Khodakov, A.Y.; Thybaut, J.W. Unravelling the influence of catalyst properties on light olefin production via Fischer–Tropsch synthesis: A descriptor space investigation using single-event microkinetics. Chem. Eng. J. 2021, 419, 129633. [Google Scholar] [CrossRef]
- Feng, J.; Lansford, J.L.; Katsoulakis, M.A.; Vlachos, D.G. Explainable and trustworthy artificial intelligence for correctable modeling in chemical sciences. Sci. Adv. 2020, 6, eabc3204. [Google Scholar] [CrossRef] [PubMed]
- Liu, J.; Liang, L.; Su, B.; Wu, D.; Zhang, Y.; Wu, J.; Fu, C. Transformative strategies in photocatalyst design: Merging computational methods and deep learning. J. Mater. Inform. 2024, 4, 33. [Google Scholar] [CrossRef]
- Yang, H.; Che, Y.; Cooper, A.I.; Chen, L.; Li, X. Machine learning accelerated exploration of ternary organic heterojunction photocatalysts for sacrificial hydrogen evolution. J. Am. Chem. Soc. 2023, 145, 27038–27044. [Google Scholar] [CrossRef] [PubMed]
- Sa, B.; Hu, R.; Zheng, Z.; Li, X.; Zhang, Y. High-throughput computational screening and machine learning modeling of Janus 2D III-VI van der Waals heterostructures for solar energy applications. Chem. Mater. 2022, 34, 6687–6701. [Google Scholar] [CrossRef]
- Singh, A.K.; Montoya, J.H.; Gregoire, J.M.; Persson, K.A. Robust and synthesizable photocatalysts for CO2 reduction: A data-driven materials discovery. Nat. Commun. 2019, 10, 443. [Google Scholar] [CrossRef]
- de la Hidalga, N.; Goodall, A.J.; Anyika, C.; Matthews, B.; Catlow, C.R.A. Designing a data infrastructure for catalysis science aligned to FAIR data principles. Catal. Commun. 2022, 162, 106384. [Google Scholar] [CrossRef]
- Spatenka, S.; Matzopoulos, M.; Urban, Z.; Cano, A. From laboratory to industrial operation: Model-based digital design and optimization of fixed-bed catalytic reactors. Ind. Eng. Chem. Res. 2019, 58, 12571–12585. [Google Scholar] [CrossRef]
- Nguyen, P.C.H.; Choi, J.B.; Udaykumar, H.S.; Baek, S. Challenges and opportunities for machine learning in multiscale computational modeling. J. Comput. Inf. Sci. Eng. 2023, 23, 060808. [Google Scholar] [CrossRef]
- Yuan, Q.; Wang, X.; Xu, D.; Liu, H.; Zhang, H.; Yu, Q.; Bi, Y.; Li, L. Machine learning-assisted catalysts for advanced oxidation processes: Progress, challenges, and prospects. Catalysts 2025, 15, 282. [Google Scholar] [CrossRef]
- Deng, C.; Su, Y.; Li, F.; Shen, W.; Chen, Z.; Tang, Q. Understanding activity origin for the oxygen reduction reaction on bi-atom catalysts by DFT studies and machine-learning. J. Mater. Chem. A 2020, 8, 24563–24571. [Google Scholar] [CrossRef]
- Ishioka, S.; Fujiwara, A.; Nakanowatari, S.; Takahashi, L.; Taniike, T.; Takahashi, K. Designing catalyst descriptors for machine learning in oxidative coupling of methane. ACS Catal. 2022, 12, 11541–11546. [Google Scholar] [CrossRef]
- Casillo, E.; Scattolin, T.; Nolan, S.P. Catalysis meets machine learning: A guide to data-driven discovery and design. Chem. Commun. 2025, 61, 18247–18272. [Google Scholar] [CrossRef] [PubMed]
- Wang, Z.; Li, W.; Wang, S.; Wang, X. The future of catalysis: Applying graph neural networks for intelligent catalyst design. WIREs Comput. Mol. Sci. 2025, 15, e70010. [Google Scholar] [CrossRef]
- Wang, S.; Jiang, J. Interpretable catalysis models using machine learning with spectroscopic descriptors. ACS Catal. 2023, 13, 7428–7436. [Google Scholar] [CrossRef]
- Zhao, Z.J.; Liu, S.; Zha, S.; Cheng, D.; Studt, F.; Henkelman, G.; Gong, J. Theory-guided design of catalytic materials using scaling relationships and reactivity descriptors. Nat. Rev. Mater. 2019, 4, 792–804. [Google Scholar] [CrossRef]
- Dalmau, D.; García-Abellán, S.; Alegre-Requena, J.V. Machine learning in homogeneous catalysis: Basic concepts and best practices. ACS Catal. 2026, 16, 1–11. [Google Scholar] [CrossRef]
- Zhu, Q.; Gu, Y.; Ma, J. Digital descriptors in predicting catalysis reaction efficiency and selectivity. J. Phys. Chem. Lett. 2025, 16, 2357–2368. [Google Scholar] [CrossRef]
- Choung, S.; Park, W.; Moon, J.; Han, J.W. Rise of machine learning potentials in heterogeneous catalysis: Developments, applications, and prospects. Chem. Eng. J. 2024, 494, 152757. [Google Scholar] [CrossRef]
- Mou, T.; Pillai, H.S.; Wang, S.; Wan, M.; Han, X.; Schweitzer, N.M.; Che, F.; Xin, H. Bridging the complexity gap in computational heterogeneous catalysis with machine learning. Nat. Catal. 2023, 6, 122–136. [Google Scholar] [CrossRef]
- Doan, H.A.; Li, C.; Ward, L.; Zhou, M.; Curtiss, L.A.; Assary, R.S. Accelerating the evaluation of crucial descriptors for catalyst screening via message passing neural network. Digit. Discov. 2023, 2, 59–68. [Google Scholar] [CrossRef]
- Spotti, M.; Maineri, K.; Viñes, F.; Illas, F.; Di Liberto, G.; Pacchioni, G. Scaling relations and catalytic descriptor for the nitrogen reduction on single-atom catalysts. Electrochim. Acta 2025, 542, 147389. [Google Scholar] [CrossRef]
- Cheng, Z.; Meng, Q.; Jiang, X.; Gun, S.; Fan, L.S. Machine learning-driven predictive design of catalytic oxygen carriers for chemical looping processes. Discov. Energy 2025, 5, 20. [Google Scholar] [CrossRef]
- German Research Foundation (DFG). National Research Data Infrastructure (NFDI). Available online: https://www.dfg.de/en/research-funding/funding-initiative/nfdi (accessed on 31 March 2026).
- NFDI4Cat. NFDI4Cat—NFDI for Catalysis-Related Sciences. Available online: https://nfdi4cat.org/nfdi4cat/en/ (accessed on 31 March 2026).
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR guiding principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef] [PubMed]
- NFDI4Cat. NFDI4Cat at EuropaCat 2025. 9 September 2025. Available online: https://nfdi4cat.org/nfdi4cat/en/News/NFDI4Cat+at+EuropaCat+2025.html (accessed on 31 March 2026).
- Huskova, N.; Dikova, Y.; Petrenko, T.; Bönisch, T. Improvement of data and metadata quality in catalysis research: A use case-driven methodology. Catal. Today 2025, 446, 115111. [Google Scholar] [CrossRef]
- DataCite. DataCite Metadata Schema. Available online: https://schema.datacite.org/ (accessed on 31 March 2026).
- Library of Congress. PREMIS Data Dictionary and Schema Revision Process. Available online: https://www.loc.gov/standards/premis/revision-process.html (accessed on 31 March 2026).
- FAIR-IMPACT. RSMD Guidelines: Research Software MetaData Guidelines. Available online: https://fair-impact.github.io/RSMD-guidelines/ (accessed on 31 March 2026).
- World Wide Web Consortium. Resource Description Framework (RDF). Available online: https://www.w3.org/RDF/ (accessed on 19 October 2025).
- Xie, E.; Wang, X.; Siepmann, J.I.; Chen, H.; Snurr, R.Q. Generative AI for design of nanoporous materials: Review and future prospects. Digit. Discov. 2025, 4, 2336–2363. [Google Scholar] [CrossRef]
- Back, S.; Aspuru-Guzik, Á.; Ceriotti, M.; Gryn’ova, G.; Grzybowski, B.; Gu, G.H.; Hein, J.; Hippalgaonkar, K.; Hormázabal, R.; Jung, Y.; et al. Accelerated chemical science with AI. Digit. Discov. 2024, 3, 23–33. [Google Scholar] [CrossRef] [PubMed]
- St. John, P.C.; Guan, Y.; Kim, Y.; Kim, S.; Paton, R.S.; Others. Prediction of organic homolytic bond dissociation enthalpies at near chemical accuracy with sub-second computational cost. Nat. Commun. 2020, 11, 2328. [Google Scholar] [CrossRef]
- Uhrin, M.; Huber, S.P.; Yu, J.; Marzari, N.; Pizzi, G. Workflows in AiiDA: Engineering a high-throughput, event-based engine for robust and modular computational workflows. Comput. Mater. Sci. 2021, 187, 110086. [Google Scholar] [CrossRef]
- Draxl, C.; Scheffler, M. Nomad: The FAIR concept for big data-driven materials science. MRS Bull. 2018, 43, 676–682. [Google Scholar] [CrossRef]
- Li, A.; Oh, R.; Ren, Y.; Zhao, L.; Huang, X.J. Machine learning for advanced oxidation catalysis: From descriptors to inverse design. Mol. Catal. 2026, 596, 115889. [Google Scholar] [CrossRef]
- Zhang, L.; Bing, Q.; Qin, H.; Yu, L.; Li, H.; Deng, D. Artificial intelligence for catalyst design and synthesis. Matter 2025, 8, 102138. [Google Scholar] [CrossRef]
- Günay, M.E.; Yıldırım, R. Machine learning for catalytic reaction systems: A framework for complex chemical processes. ACS Eng. Au 2026, 6, 48–67. [Google Scholar] [CrossRef]
- Roch, L.M.; Häse, F.; Kreisbeck, C.; Tamayo-Mendoza, T.; Yunker, L.P.E.; Hein, J.E.; Aspuru-Guzik, A. ChemOS: An orchestration software to democratize autonomous discovery. PLoS ONE 2020, 15, e0229862. [Google Scholar] [CrossRef] [PubMed]
- Tan, Z.; Yang, Q.; Luo, S. AI molecular catalysis: Where are we now? Org. Chem. Front. 2025, 12, 2759. [Google Scholar] [CrossRef]
- Li, A.; Cui, P.; Wang, X.; Fisher, A.; Li, L.; Cheng, D. The artificial intelligence-catalyst pipeline: Accelerating catalyst innovation from laboratory to industry. Front. Chem. Sci. Eng. 2025, 19, 55. [Google Scholar] [CrossRef]
- Ma, K. AI agents in chemical research: GVIM—An intelligent research assistant system. Digit. Discov. 2025, 4, 355–375. [Google Scholar] [CrossRef]
- Bran, A.M.; Cox, S.; Schilter, O.; Baldassari, C.; White, A.D.; Schwaller, P. Augmenting large language models with chemistry tools. Nat. Mach. Intell. 2024, 6, 525–535. [Google Scholar] [CrossRef]
- Boiko, D.A.; MacKnight, R.; Kline, B.; Gomes, G. Autonomous chemical research with large language models. Nature 2023, 624, 570–578. [Google Scholar] [CrossRef]
- Yu, B.; Baker, F.N.; Chen, Z.; Herb, G.; Gou, B.; Adu-Ampratwum, D.; Ning, X.; Sun, H. Tooling or not tooling? The impact of tools on language agents for chemistry problem solving. In Findings of the Association for Computational Linguistics: NAACL; Association for Computational Linguistics: Albuquerque, NM, USA, 2025; pp. 7620–7640. [Google Scholar] [CrossRef]
- McNaughton, A.D.; Ramalaxmi, G.; Kruel, A.; Knutson, C.R.; Varikoti, R.A.; Kumar, N. CACTUS: Chemistry Agent Connecting Tool-Usage to Science. arXiv 2024, arXiv:2405.00972. [Google Scholar] [CrossRef]
- Kumar, A.; Zavala, V.M. Editorial for the AI/ML in Chemical Engineering Special Issue. Ind. Eng. Chem. Res. 2025, 64, 9441–9442. [Google Scholar] [CrossRef]
- Daniel, T.; Xuan, J. Responsible use of generative AI in chemical engineering. Digit. Chem. Eng. 2024, 12, 100168. [Google Scholar] [CrossRef]
- Acosta-Herazo, R.; Cañaveral-Velásquez, B.; Pérez-Giraldo, K.; Mueses, M.A.; Pinzón-Cárdenas, M.H.; Machuca-Martínez, F. A MATLAB-Based Application for Modeling and Simulation of Solar Slurry Photocatalytic Reactors for Environmental Applications. Water 2020, 12, 2196. [Google Scholar] [CrossRef]
- Zhang, R.; He, H.; Tang, Y.; Zhang, Z.; Zhou, H.; Yu, J.; Zhang, L.; Dai, B. A review on Fe2O3-based catalysts for toluene oxidation: Catalysts design and optimization with the formation of abundant oxygen vacancies. ChemCatChem 2024, 16, e202400396. [Google Scholar] [CrossRef]
- Mine, S.; Takao, M.; Yamaguchi, T.; Toyao, T.; Maeno, Z.; Siddiki, S.; Takakusagi, S.; Shimizu, K.; Takigawa, I.; Shimizu, K. Analysis of updated literature data up to 2019 on the oxidative coupling of methane using an extrapolative machine-learning method to identify novel catalysts. ChemCatChem 2021, 13, 3636–3655. [Google Scholar] [CrossRef]
- de Araujo, L.G. Catalysis, meet the machine: From models to meaning. Catal. Res. 2025, 5, 005. [Google Scholar] [CrossRef]
- Abraham, B.M.; Viñes, F.; Jyothirmai, M.V.; Sinha, P.; Singh, J.K.; Illas, F. Catalysis in the digital age: Unlocking the power of data with machine learning. WIREs Comput. Mol. Sci. 2024, 14, e1730. [Google Scholar] [CrossRef]
- Rekkas, V.P.; Sotiroudis, S.; Sarigiannidis, P.; Wan, S.; Karagiannidis, G.K.; Goudos, S.K. Machine Learning in Beyond 5G/6G Networks—State-of-the-Art and Future Trends. Electronics 2021, 10, 2786. [Google Scholar] [CrossRef]
- Abdul Wahab, Y.; Shapril, N.N.; Johari, S.; Johan, M.R. Machine learning for mechanistic insights and optimization in CO2 cycloaddition catalysis. Appl. Catal. A General. 2026, 710, 120679. [Google Scholar] [CrossRef]
- Thalpage, N.S. Unlocking the black box: Explainable artificial intelligence (XAI) for trust and transparency in AI systems. J. Digit. Art. Humanit. 2023, 4, 31–36. [Google Scholar] [CrossRef]
- Semnani, P.; Bogojeski, M.; Bley, F.; Zhang, Z.; Wu, Q.; Kneib, T.; Herrmann, J.; Weisser, C.; Patcas, F.; Müller, K.-R. A machine learning and explainable AI framework tailored for unbalanced experimental catalyst discovery. J. Phys. Chem. C 2024, 128, 21349–21367. [Google Scholar] [CrossRef]
- Myllyaho, L.; Raatikainen, M.; Männistö, T.; Mikkonen, T.; Nurminen, J.K. Systematic literature review of validation methods for AI systems. J. Syst. Softw. 2021, 181, 111050. [Google Scholar] [CrossRef]
- Xu, Y.; Wang, H.; Zhang, W.; Xie, L.; Chen, Y.; Salim, F.; Zhang, Y.; Gooding, J.; Walsh, T. AI-empowered catalyst discovery: A survey from classical machine learning approaches to large language models. arXiv 2025. [Google Scholar] [CrossRef]
- Balcells, D. Co-intelligent design of catalysis research with large language models: Hype or reality? ACS Catal. 2025, 15, 16412–16420. [Google Scholar] [CrossRef]
- Li, J.; Wang, J.; He, T.; Li, Z.; Xu, H.; Fan, Z.; Liao, F.; Liu, Y.; Kang, Z. Dynamic machine learning-driven optimization of microwave-synthesized photocatalysts for enhanced hydrogen peroxide production. ChemCatChem 2025, 17, e00341. [Google Scholar] [CrossRef]
- Wang, Y.; Sun, Y.; Wang, H.; Li, J.; Liu, X.; Dong, F. Infrared spectra-based machine learning framework for photocatalytic reaction and performance. Adv. Intell. Syst. 2025, 5, 2500101. [Google Scholar] [CrossRef]
- Ozcan, A.; Coudert, F.-X.; Rogge, S.M.J.; Heydenrych, G.; Fan, D.; Sarikas, A.P.; Keskin, S.; Maurin, G.; Froudakis, G.E.; Wuttke, S.; et al. Artificial intelligence paradigms for next-generation metal−organic framework research. J. Am. Chem. Soc. 2025, 147, 23367–23380. [Google Scholar] [CrossRef]
- Wang, G.; Mine, S.; Chen, D.; Jing, Y.; Ting, K.W.; Yamaguchi, T.; Takao, M.; Maeno, Z.; Takigawa, I.; Matsushita, K.; et al. Accelerated discovery of multi-elemental reverse water-gas shift catalysts using extrapolative machine learning approach. Nat. Commun. 2023, 14, 5861. [Google Scholar] [CrossRef]
- Yu, Q.; Ma, N.; Leung, C.; Liu, H.; Ren, Y.; Wei, Z. AI in single-atom catalysts: A review of design and applications. J. Mater. Inform. 2025, 5, 9. [Google Scholar] [CrossRef]
- Bokhimi, X. Learning the use of artificial intelligence in heterogeneous catalysis. Front. Chem. Eng. 2021, 3, 740270. [Google Scholar] [CrossRef]
- Huang, J.; Zong, Z.; Wang, P.; Zhang, Y.; Gao, D.; Wang, Y.; Li, Z. Machine learning-driven simulation and optimization of phosphate adsorption on metal-organic frameworks. Sep. Purif. Technol. 2026, 394, 137479. [Google Scholar] [CrossRef]
- Srinivasan, K.; Bhakte, A.; Puliyanda, A.; Thosar, D.; Srinivasan, R.; Singh, K.; Addo, P.; Prasad, V. Artificial intelligence and machine learning at various stages and scales of process systems engineering. Can. J. Chem. Eng. 2025, 103, 22525. [Google Scholar] [CrossRef]
- Rosen, A.S.; Iyer, S.M.; Ray, D.; Yao, Z.; Aspuru-Guzik, Á.; Gagliardi, L.; Notestein, J.M.; Snurr, R.Q. Machine learning the quantum-chemical properties of metal–organic frameworks for accelerated materials discovery. Matter 2021, 4, 1578–1597. [Google Scholar] [CrossRef]
- de Araujo, L.G.; Vilcocq, L.; Fongarland, P.; Schuurman, Y. Recent developments in the use of machine learning in catalysis: A broad perspective with applications in kinetics. Chem. Eng. J. 2025, 508, 160872. [Google Scholar] [CrossRef]
- Mollick, E. Co-Intelligence: Living and Working with A.; Portfolio/Penguin: New York, NY, USA, 2024. [Google Scholar]
- Taniike, T.; Fujiwara, A.; Nakanowatari, S.; García-Escobar, F.; Takahashi, K. Automatic feature engineering for catalyst design using small data without prior knowledge of target catalysis. Commun. Chem. 2024, 7, 11. [Google Scholar] [CrossRef]
- Ntoutsi, E.; Fafalios, P.; Gadiraju, U.; Iosifidis, V.; Nejdl, W.; Vidal, M.-E.; Ruggieri, S.; Turini, F.; Papadopoulos, S.; Krasanakis, E.; et al. Bias in data-driven artificial intelligence systems—An introductory survey. WIREs Data Min. Knowl. Discov. 2020, 10, e1356. [Google Scholar] [CrossRef]
- Guendouzi, B.S.; Ouchani, S.; El Assaad, H.; El Zaher, M. A systematic review of federated learning: Challenges, aggregation methods, and development tools. J. Netw. Comput. Appl. 2023, 220, 103714. [Google Scholar] [CrossRef]
- Liu, Q.; Wang, X.; Wei, Y.; Xu, L.; Yang, Y.; Wang, J. Interpretable machine learning-assisted high-throughput screening of highly active nitrogen fixation dual-atom catalysts. AIChE J. 2025, 71, e18866. [Google Scholar] [CrossRef]
- Omranpour, A.; Elsner, J.; Lausch, K.N.; Behler, J. Machine learning potentials for heterogeneous catalysis. arXiv 2024. [Google Scholar] [CrossRef]
- Tunala, S.; Zhai, S.; Wu, F.; Chen, Y.-H. Machine learning in photocatalysis: Accelerating design, understanding, and environmental applications. Sci. China Chem. 2025, 68, 3415–3428. [Google Scholar] [CrossRef]
- Sumaria, V.; Rawal, T.B.; Li, Y.F.; Sommer, D.; Vikoren, J.; Bondi, R.J.; Rupp, M.; Prasad, A.; Prasad, D. Machine learning, density functional theory, and experiments to understand the photocatalytic reduction of CO2 on CuPt/TiO2. J. Phys. Chem. C 2024, 128, 14247–14258. [Google Scholar] [CrossRef]
- Xu, J.; Zhai, S.; Huang, P.; Yu, W.; Mao, Q.; Du, K.; Su, W.; Sun, B.; Jin, C.; Su, A. An artificial intelligence-driven synthesis planning platform (PhotoCat) for photocatalysis. Commun. Chem. 2026, 9, 92. [Google Scholar] [CrossRef]
- Xu, J.; Su, A.; Huang, P.; Yu, W.; Du, K.; Fan, Z.; Sun, B.; Zhong, Z.; Jin, C.; Su, W. PhotoCat: An artificial intelligence-driven synthesis planning platform for photocatalysis. ChemRxiv 2023. [Google Scholar] [CrossRef]
- Prabhu, S.; Kosir, N.; Kothare, M.V.; Rangarajan, S. Derivative-free domain-informed data-driven discovery of sparse kinetic models. Ind. Eng. Chem. Res. 2025, 64, 2601–2615. [Google Scholar] [CrossRef]
- Liu, X.; Wang, C.; Chen, C.; Pan, Z.; Gao, C.; Lai, W.; Zhao, J.; Tian, T.; Xiao, W. Recent advances in hierarchical porous materials for CO2 capture and utilization. Coord. Chem. Rev. 2025, 544, 21627. [Google Scholar] [CrossRef]
- Ali, M.M.; Hossen, M.A.; Abd Aziz, A. Progress in prediction of photocatalytic CO2 reduction using machine learning approach: A mini review. Next Mater. 2025, 8, 100522. [Google Scholar] [CrossRef]
- Orhan, I.B.; Zhao, Y.; Babarao, R.; Thornton, A.W.; Le, T.C. Machine learning descriptors for CO2 capture materials. Molecules 2025, 30, 650. [Google Scholar] [CrossRef] [PubMed]
- Tran, K.; Ulissi, Z.W. Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution. Nat. Catal. 2018, 1, 696–703. [Google Scholar] [CrossRef]
- Yang, H.; Kareck, T.L.; Wang, Q. New era of AI in chemical process safety: Foundation models. ACS Chem. Health Saf. 2026, 33, 171–179. [Google Scholar] [CrossRef]
- Hagendorff, T. Mapping the ethics of generative AI: A comprehensive scoping review. Minds Mach. 2024, 34, 39. [Google Scholar] [CrossRef]
- Gunasekara, L.; El-Haber, N.; Nagpal, S.; Moraliyage, H.; Issadeen, Z.; Manic, M.; De Silva, D. A systematic review of responsible artificial intelligence principles and practice. Appl. Syst. Innov. 2025, 8, 97. [Google Scholar] [CrossRef]



| Advantages | Disadvantages | Critical Perspectives | Ref. |
|---|---|---|---|
| Supervised ML | |||
|
| The most mature and practical ML strategy in catalysis. Often functions as an advanced interpolation tool rather than a true discovery engine. Published models perform well only within limited chemical spaces and fail under realistic industrial conditions. The absence of standardized negative-result databases remains a major limitation. | [57] |
| Unsupervised Learning | |||
|
| Less powerful for direct prediction, but essential for understanding high-dimensional chemical spaces and reducing dataset complexity. Results can become subjective if not supported by physical chemistry principles. | [14,21] |
| Reinforcement Learning (RL) | |||
|
| Attractive for self-driving laboratories, but remains limited by experimental throughput and imperfect reward definitions. Translating RL strategies from simulations to industrial chemistry is still challenging. | [34] |
| Performance Metric | Typical Gain |
|---|---|
| Catalyst screening speed | 100–1000 times faster |
| Experimental reduction | 50–90% fewer experiments |
| Prediction accuracy | R2 up to 0.99 |
| Optimization time | Week to hours or days |
| Throughput | 10–100 times more experiments per day |
| Yield/selectivity improvement | Often > 10–30% |
| Key Methods | Application | Main Findings | Limitations/Challenges | Improvements/Impacts | Ref. |
|---|---|---|---|---|---|
| PHOTOREAC (MATLAB-based photon absorption + kinetic modeling) | Modeling slurry solar photocatalytic reactors | Estimates radiation-independent kinetic constants, compares kinetic models, analyzes operational parameters | Limited to TiO2P25, assumes well-mixed system, ignores mass transport limitations | Enhanced understanding of reaction kinetics; guided reactor optimization | [52] |
| PhotoCat (transformer-based deep learning) + PhotoCatDB (curated database) | Multicomponent photocatalytic reactions | Predicts reaction outcomes (Top-1 accuracy 82.25%), interpretable, experimentally validated predictions | Dataset completeness, reaction conditions variation, predictive accuracy | Accelerated reaction prediction; improved catalyst selection; reduced experimental workload | [62] |
| Machine learning (linear regression, decision trees, random forests, ANNs, k-NN) | Photocatalytic CO2 reduction | Predicts catalyst performance, optimizes reaction conditions, uncovers correlations | Dataset quality, computational demands, overfitting, model bias | More efficient catalyst screening; optimized conditions; discovery of new catalyst relationships | [63] |
| Few-shot ML + iterative experiments | Microwave-assisted photocatalyst synthesis | Efficient optimization of H2O2 production; high performance in three iterations | Limited dataset availability | Reduced experimental time; faster catalyst development; resource savings | [64] |
| ML linked with IR spectroscopy | Catalyst screening | Predicts nitrate formation from adsorbed species; accelerates experiments; interpretable mechanistic insights | Transferability to other catalysts; data quality | Faster screening process; mechanistic understanding; targeted catalyst design | [65] |
| DF-SINDy (derivative-free sparse identification) | Kinetic model discovery | Recovers governing equations from experimental data, incorporates domain knowledge, interpretable | Sensitive to noise, multiphase system complexity | Better understanding of reaction mechanisms; data-driven model development | [66] |
| Key Methods | Application | Main Findings | Limitations/Challenges | Improvements/Impact | Ref. |
|---|---|---|---|---|---|
| ML potentials (MLPs) | Atomistic simulation of heterogeneous catalysis | Enables ab initio accuracy at larger scales; captures dynamic behaviors | Data selection, transferability, electronic structure references | Improved predictive accuracy; enabled larger-scale simulations; accelerated catalyst design | [50] |
| Structural engineering + defect modulation | Fe2O3-based VOC oxidation | Oxygen vacancies, morphology control, heteroatom doping improve activity | Low-temp activity, complex-gas mixtures challenge stability | Enhanced catalyst activity; better stability; tailored catalyst properties | [53] |
| Machine learning (Extra Trees Regressor, XGBoost, Random Forest) | Oxidative coupling of methane (OCM) | Predicts catalyst compositions, identifies elemental features affecting C2 yields | Dataset quality, feature representation | Accelerated catalyst discovery; identification of key elements; optimized catalyst formulations | [54] |
| Extrapolative ML + iterative experiments | Discovery of multi-element catalysts for reverse water–gas shift reaction | Identified over 100 highly active multi-element catalysts, including previously untested elements | Limited training data, generalizability | Rapid identification of promising catalysts; expanded catalyst space exploration | [67] |
| Machine learning, neural networks, high-throughput simulations, integration with DFT | AI in single-atom catalyst (SAC) design and optimization | Guides high-throughput simulations, predicts novel structures, integrates experimental data | Complexity, interpretability, data integration | Accelerated discovery of SACs; improved predictive tools; streamlined experimental validation | [68] |
| ML surrogate models + DFT descriptors | CO2 reduction and H2 evolution | Predicts promising candidates, screens chemical space, identifies activity descriptors | Stability, synthesis feasibility, adsorption effects, DFT uncertainties | Faster screening; targeted catalyst synthesis; better understanding of activity descriptors | [64] |
| Supervised learning | Hydrodesulfurization/heterogeneous catalysis | Predicts adsorption energies, surface areas, adsorption isotherms, sulfur content; accelerates design | Requires accurate input/output variables, model interpretability | Faster catalyst screening; more accurate property prediction; streamlined catalyst development | [69] |
| Key Methods | Application | Main Findings | Limitations/Challenges | Ref. |
|---|---|---|---|---|
| Machine learning + descriptor-based featurization | CO2 capture materials (MOFs, ionic liquids, membranes) | Identifies key descriptors for performance, predicts capture efficiency, guides design | Dataset diversity, synthetic feasibility, material stability | [55] |
| LLMs + generative AI (VAEs, GANs, RL) | MOF discovery and synthesis optimization | Automates literature review, predicts properties, enables inverse design, guides adaptive synthesis | Integration with experimental workflows, interpretability, data availability | [68] |
| Generative AI | Chemical engineering workflows (flowsheet/P&ID generation, process optimization) | Automates designs, optimizes chemical processes, accelerates innovation | Limited machine-readable data, poor integration with domain knowledge, risk of unsafe outputs | [71] |
| Quantum MOF (QMOF) database + ML (crystal graph CNN, SOAP, composition-based features) | MOF electronic property prediction | Predicts band gaps, identifies structure–property relationships, accelerates computational discovery | Vast chemical space, high DFT costs | [72] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Bratovčić, A.; Tomašić, V. Artificial Intelligence- and Machine Learning-Driven Strategies for Catalyst Design and Sustainable Chemical Processes. Processes 2026, 14, 1866. https://doi.org/10.3390/pr14121866
Bratovčić A, Tomašić V. Artificial Intelligence- and Machine Learning-Driven Strategies for Catalyst Design and Sustainable Chemical Processes. Processes. 2026; 14(12):1866. https://doi.org/10.3390/pr14121866
Chicago/Turabian StyleBratovčić, Amra, and Vesna Tomašić. 2026. "Artificial Intelligence- and Machine Learning-Driven Strategies for Catalyst Design and Sustainable Chemical Processes" Processes 14, no. 12: 1866. https://doi.org/10.3390/pr14121866
APA StyleBratovčić, A., & Tomašić, V. (2026). Artificial Intelligence- and Machine Learning-Driven Strategies for Catalyst Design and Sustainable Chemical Processes. Processes, 14(12), 1866. https://doi.org/10.3390/pr14121866
