Next Article in Journal
MoHyNet: Enhancing Session-Based Recommendations via Hypergraph Motifs and Contrastive Learning
Previous Article in Journal
An Empirical Evaluation of Large Language Models Applying Software Architectural Patterns
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Machine Learning Approaches for Filtering Organometallic Reactions: A Comparative Study of Molecular Descriptors

by
Walter Bonke Mahlangu
1,
Taurai Hungwe
2,
Nomasonto Rapulenyane
1,* and
Somandla Ncube
3
1
Department of Chemistry, Sefako Makgatho Health Sciences University, Medunsa, P.O. Box 60, Pretoria 0204, South Africa
2
Department of Computer Science and Information Technology, Sefako Makgatho Health Sciences University, Medunsa, P.O. Box 60, Pretoria 0204, South Africa
3
Department of Chemistry, Durban University of Technology, P.O. Box 1334, Durban 4000, South Africa
*
Author to whom correspondence should be addressed.
AI 2026, 7(6), 196; https://doi.org/10.3390/ai7060196
Submission received: 18 March 2026 / Revised: 28 April 2026 / Accepted: 6 May 2026 / Published: 27 May 2026
(This article belongs to the Section Chemical Artificial Intelligence)

Abstract

Organometallic chemistry deals with the synthesis, structure, reactivity, and applications of compounds containing metal–carbon covalent bonds. In recent years, there has been a growing interest in predicting the catalytic activity of organometallics using machine learning. However, the major drawback in developing algorithms that can be used in predicting organometallic reactions is the availability of organometallic reaction data and organometallic filtering tools. The main aim of the current study is to develop organometallic reaction-filtering tools that are crucial for building accurate and effective ML models in organometallic chemistry. Random Forest (RF), K-Nearest Neighbors (kNN), Support Vector Classifiers (SVC), and Multi-Layer Perceptrons (MLP) were employed, using feature subsets selected via Permutation Feature Importance from Morgan fingerprints and MACCS keys. The results demonstrate that the MACCS-based MLP architecture provides the most reliable filtering performance, achieving a superior F1 score of 0.85, a Recall of 0.85, and a high AUC-ROC of 0.837. Furthermore, the MACCS-MLP exhibited the highest predictive confidence, yielding the study’s lowest Log Loss of 0.312. In contrast, while Morgan fingerprints paired with kNN offered a specialized “strict” filter with absolute Precision (1.00), the sparse dimensionality of circular fingerprints generally resulted in lower calibration for probabilistic models. These findings underscore that dense, fragment-based descriptors refined by data-driven feature selection are most effective for identifying complex organometallic motifs. This study successfully provides a validated methodology for building precise filtering tools, establishing a critical foundation for automated catalyst discovery and the expansion of effective machine learning applications in organometallic chemistry. The study is limited to only identifying organometallic reactions and cannot filter based on organometallic reaction types. Future studies should also explore integrating multiple feature representations to classify or cluster the identified organometallic reactions based on the reaction types.
Keywords: organometallic compounds; machine learning; MACCS keys; Morgan fingerprints; random forest regression; multi-layer perceptron classifier; machine learning; partial least squares organometallic compounds; machine learning; MACCS keys; Morgan fingerprints; random forest regression; multi-layer perceptron classifier; machine learning; partial least squares

Share and Cite

MDPI and ACS Style

Mahlangu, W.B.; Hungwe, T.; Rapulenyane, N.; Ncube, S. Machine Learning Approaches for Filtering Organometallic Reactions: A Comparative Study of Molecular Descriptors. AI 2026, 7, 196. https://doi.org/10.3390/ai7060196

AMA Style

Mahlangu WB, Hungwe T, Rapulenyane N, Ncube S. Machine Learning Approaches for Filtering Organometallic Reactions: A Comparative Study of Molecular Descriptors. AI. 2026; 7(6):196. https://doi.org/10.3390/ai7060196

Chicago/Turabian Style

Mahlangu, Walter Bonke, Taurai Hungwe, Nomasonto Rapulenyane, and Somandla Ncube. 2026. "Machine Learning Approaches for Filtering Organometallic Reactions: A Comparative Study of Molecular Descriptors" AI 7, no. 6: 196. https://doi.org/10.3390/ai7060196

APA Style

Mahlangu, W. B., Hungwe, T., Rapulenyane, N., & Ncube, S. (2026). Machine Learning Approaches for Filtering Organometallic Reactions: A Comparative Study of Molecular Descriptors. AI, 7(6), 196. https://doi.org/10.3390/ai7060196

Article Metrics

Back to TopTop