AI-Driven Enzyme Engineering: Emerging Models and Next-Generation Biotechnological Applications
Abstract
1. Introduction
2. Core AI Techniques in Enzyme Engineering
2.1. Machine Learning (ML) Models
2.1.1. Predictive Models and Data-Driven Design
2.1.2. ML for Enzyme Function and Active Site Prediction
2.1.3. Emerging Trends in Data-Driven Enzyme Design
2.2. Deep Learning (DL) Models
2.2.1. Designing Dynamic and Functional Proteins
2.2.2. Predicting and Optimising Enzyme Function
2.2.3. Toward Intelligent, Generalizable Protein Design
2.3. Generative Models
2.3.1. Protein Language Models: Learning the Grammar of Life
2.3.2. Toward Predictive and Sustainable Enzyme Engineering
2.4. Reinforcement Learning (RL)
2.5. Quantum Computing
2.6. AI Tools Empowering Enzyme and Protein Engineering
2.6.1. Structure Prediction and Sequence-Based Models
2.6.2. Next-Generation Diffusion and Function-Conditioned Models
3. Advances in AI-Driven Enzyme Engineering
3.1. Catalytic Efficiency
3.1.1. AI Approaches for Predicting Catalytic Efficiency
3.1.2. Case Study: AI-Guided Thermostabilisation of Xylanase for Biomass Conversion
3.1.3. Implications and Future Prospects
3.2. Substrate Specificity and Selectivity
3.2.1. AI Approaches for Substrate Specificity Engineering
3.2.2. Case Study: Engineering Monoamine Oxidases for Chiral Drug Intermediates
3.2.3. Implications and Future Prospects
3.3. Stability in Extreme Milieu (Thermal and Extreme pH)
3.3.1. Mechanistic Basis of Extremostability and AI Innovations
3.3.2. Case Study: Designing Alkali- and Heat-Tolerant Cyanide Hydratases
3.3.3. Implications and Future Prospects
3.4. Solubility and Expression Efficiency
3.4.1. AI-Guided Mutational Design for Solubility Enhancement
3.4.2. Case Study: AI-Enabled Design of Detergent Lipases
3.4.3. Implications and Future Prospects
3.5. Novel Enzyme Functions (De Novo Design)
3.5.1. AI in De Novo Enzyme Design
3.5.2. Case Study: Ferric Enterobactin Esterase Syn-F4, an AI-Designed De Novo Synzyme
3.5.3. Implications and Future Prospects
3.6. Reduced Experimentation Time
3.6.1. AI Approaches to Reduce Experimentation Time
3.6.2. Case Study: ML-Accelerated Discovery of Transaminases for Sitagliptin
3.6.3. Implications and Future Prospects
3.7. Multi-Enzyme Pathway Optimisation
3.7.1. AI-Guided Frameworks for Pathway Optimisation
3.7.2. Case Study: AI-Modelled Biosynthetic Pathway for Oligosaccharide Production
3.7.3. Implications and Future Prospects
3.8. Environmental Adaptability
3.8.1. AI for Environmental Biocatalysis
3.8.2. Case Study: Engineering OP-Degrading Lactonases for Ecological Deployment
3.8.3. Implications and Future Prospects
4. Next-Generation Applications of AI-Engineered Enzymes
4.1. Pharmaceutical and Therapeutic Enzymes
4.2. Food and Agricultural Enzymes
4.3. Laundry and Detergent Enzymes
4.4. Biofuel Enzymes
4.5. Environmental Bioremediation Enzymes
5. Challenges in AI-Driven Enzyme Engineering
5.1. Data-Centric Challenges
5.1.1. Data Quality, Standardisation, and Coverage
5.1.2. Limited Access to Proprietary Data
5.1.3. Class Imbalance and Negative Dataset Scarcity
5.1.4. Incomplete Reaction Representation
5.2. Enzyme-Centric Challenges
5.2.1. Non-Canonical and Promiscuous Functions
5.2.2. Unseen or Non-Homologous Sequences
5.3. Reaction-Centric Challenges
5.3.1. Multistep Reaction Complexity
5.3.2. Prediction of Novel or Uncharacterised Reactions
5.4. Biochemical and Contextual Challenges
5.4.1. Limitations of EC Number Classification
5.4.2. In Vitro Versus in Vivo Conditions
5.4.3. Environmental Variability
5.4.4. Experimental Validation Bottlenecks
6. Conclusions and Future Directions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Fasim, A.; More, V.S.; More, S.S. Large-scale production of enzymes for biotechnology uses. Curr. Opin. Biotechnol. 2021, 69, 68–76. [Google Scholar] [CrossRef] [PubMed]
- Yang, J.; Li, F.Z.; Arnold, F.H. Opportunities and challenges for machine learning-assisted enzyme engineering. ACS Cent. Sci. 2024, 10, 226–241. [Google Scholar] [CrossRef] [PubMed]
- Mao, S.; Jiang, J.; Xiong, K.; Chen, Y.; Yao, Y.; Liu, L.; Liu, H.; Li, X. Enzyme engineering: Performance optimization, novel sources, and applications in the food industry. Foods 2024, 13, 3846. [Google Scholar] [CrossRef] [PubMed]
- Mazurenko, S.; Prokop, Z.; Damborsky, J. Machine learning in enzyme engineering. ACS Catal. 2019, 10, 1210–1223. [Google Scholar] [CrossRef]
- Singh, N.; Lane, S.; Yu, T.; Lu, J.; Ramos, A.; Cui, H.; Zhao, H. A generalized platform for artificial intelligence-powered autonomous enzyme engineering. Nat. Commun. 2025, 16, 5648. [Google Scholar] [CrossRef]
- Landwehr, G.M.; Bogart, J.W.; Magalhaes, C.; Hammarlund, E.G.; Karim, A.S.; Jewett, M.C. Accelerated enzyme engineering by machine-learning guided cell-free expression. Nat. Commun. 2025, 16, 865. [Google Scholar] [CrossRef]
- Zhou, J.; Huang, M. Navigating the landscape of enzyme design: From molecular simulations to machine learning. Chem. Soc. Rev. 2024, 53, 8202–8239. [Google Scholar] [CrossRef]
- Sun, R.; Wu, D.; Chen, P.; Zheng, P. Cutting-edge computational approaches in enzyme design and activity enhancement. Biochem. Eng. J. 2024, 212, 109510. [Google Scholar] [CrossRef]
- Palabiyik, A.A. Synzymes: The future of modern enzyme engineering. Appl. Biochem. Biotechnol. 2025, 197, 5584–5607. [Google Scholar] [CrossRef]
- Chen, A.; Peng, X.; Shen, T.; Zheng, L.; Wu, D.; Wang, S. Discovery, design, and engineering of enzymes based on molecular retrobiosynthesis. mLife 2025, 4, 107–125. [Google Scholar] [CrossRef]
- Ferreira, P.; Fernandes, P.A.; Ramos, M.J. Modern computational methods for rational enzyme engineering. Chem. Catal. 2022, 2, 2481–2498. [Google Scholar] [CrossRef]
- Markus, B.; Christian C, G.; Andreas, K.; Arkadij, K.; Stefan, L.; Gustav, O.; Elina, S.; Radka, S. Accelerating biocatalysis discovery with machine learning: A paradigm shift in enzyme engineering, discovery, and design. ACS Catal. 2023, 13, 14454–14469. [Google Scholar] [CrossRef] [PubMed]
- Tripathi, N.; Hérisson, J.; Faulon, J.L. Machine learning in predictive biocatalysis: A comparative review of methods and applications. Biotechnol. Adv. 2025, 84, 108698. [Google Scholar] [CrossRef] [PubMed]
- Hanna, C.; Blot, A.; Petke, J. Reinforcement learning for mutation operator selection in automated program repair. Autom. Softw. Eng. 2025, 32, 31. [Google Scholar] [CrossRef]
- Feehan, R.; Montezano, D.; Slusky, J.S. Machine learning for enzyme engineering, selection and design. Protein Eng. Des. Sel. 2021, 34, gzab019. [Google Scholar]
- Kouba, P.; Kohout, P.; Haddadi, F.; Bushuiev, A.; Samusevich, R.; Sedlar, J.; Damborsky, J.; Pluskal, T.; Sivic, J.; Mazurenko, S. Machine learning-guided protein engineering. ACS Catal. 2023, 13, 13863–13895. [Google Scholar] [CrossRef]
- Thomas, N.; Belanger, D.; Xu, C.; Lee, H.; Hirano, K.; Iwai, K.; Polic, V.; Nyberg, K.D.; Hoff, K.G.; Frenz, L.; et al. Engineering highly active nuclease enzymes with machine learning and high-throughput screening. Cell Syst. 2025, 16, 3. [Google Scholar] [CrossRef]
- Liu, S.H.; Bai, L.; Wang, X.D.; Wang, Q.Q.; Wang, D.X.; Bornscheuer, U.T.; Ao, Y.F. Machine learning-guided protein engineering to improve the catalytic activity of transaminases under neutral pH conditions. Org. Chem. Front. 2025, 12, 4788–4793. [Google Scholar] [CrossRef]
- Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009, 10, 421. [Google Scholar] [CrossRef]
- Sigrist, C.J.; De Castro, E.; Cerutti, L.; Cuche, B.A.; Hulo, N.; Bridge, A.; Bougueleret, L.; Xenarios, I. New and continuing developments at PROSITE. Nucleic Acids Res. 2012, 41, D344–D347. [Google Scholar] [CrossRef]
- Finn, R.D.; Coggill, P.; Eberhardt, R.Y.; Eddy, S.R.; Mistry, J.; Mitchell, A.L.; Potter, S.C.; Punta, M.; Qureshi, M.; Sangrador-Vegas, A.; et al. The Pfam protein families database: Towards a more sustainable future. Nucleic Acids Res. 2016, 44, D279–D285. [Google Scholar] [CrossRef]
- Shen, H.B.; Chou, K.C. EzyPred: A top–down approach for predicting enzyme functional classes and sub-classes. Biochem. Biophys. Res. Commun. 2007, 364, 53–59. [Google Scholar] [CrossRef]
- De Ferrari, L.; Aitken, S.; van Hemert, J.; Goryanin, I. EnzML: Multi-label prediction of enzyme classes using InterPro signatures. BMC Bioinform. 2012, 13, 61. [Google Scholar] [CrossRef] [PubMed]
- Li, Y.; Wang, S.; Umarov, R.; Xie, B.; Fan, M.; Li, L.; Gao, X. DEEPre: Sequence-based enzyme EC number prediction by deep learning. Bioinformatics 2018, 34, 760–769. [Google Scholar] [CrossRef]
- Dalkiran, A.; Rifaioglu, A.S.; Martin, M.J.; Cetin-Atalay, R.; Atalay, V.; Doğan, T. ECPred: A tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature. BMC Bioinform. 2018, 19, 334. [Google Scholar] [CrossRef] [PubMed]
- Zou, Z.; Tian, S.; Gao, X.; Li, Y. mlDEEPre: Multi-functional enzyme function prediction with hierarchical multi-label deep learning. Front. Genet. 2019, 9, 714. [Google Scholar] [CrossRef] [PubMed]
- Ryu, J.Y.; Kim, H.U.; Lee, S.Y. Deep learning enables high-quality and high-throughput prediction of enzyme commission numbers. Proc. Natl. Acad. Sci. USA 2019, 116, 13996–14001. [Google Scholar] [CrossRef]
- Song, J.; Li, F.; Takemoto, K.; Haffari, G.; Akutsu, T.; Chou, K.C.; Webb, G.I. PREvaIL, an integrative approach for inferring catalytic residues using sequence, structural, and network features in a machine-learning framework. J. Theor. Biol. 2018, 443, 125–137. [Google Scholar] [CrossRef]
- Torng, W.; Altman, R.B. High precision protein functional site detection using 3D convolutional neural networks. Bioinformatics 2019, 35, 1503–1512. [Google Scholar] [CrossRef]
- Ao, Y.F. Machine learning-assisted protein engineering for improving stereoselectivity. Chem. Catal. 2025, 5, 8. [Google Scholar] [CrossRef]
- Li, G.; Qin, Y.; Fontaine, N.T.; Chong, M.N.F.; Maria-Solano, M.A.; Feixas, F.; Cadet, X.F.; Pandjaitan, R.; Garcia-Borràs, M.; Cadet, F.; et al. Machine learning enables selection of epistatic enzyme mutants for stability against unfolding and detrimental aggregation. ChemBioChem 2021, 22, 904–914. [Google Scholar] [CrossRef] [PubMed]
- Casadevall, G.; Duran, C.; Osuna, S. AlphaFold2 and deep learning for elucidating enzyme conformational flexibility and its application for design. JACS Au 2023, 3, 1554–1562. [Google Scholar] [CrossRef] [PubMed]
- Chen, B.; Khan, M.T.; Goussetis, G.; Sellathurai, M.; Ding, Y.; Mota, J.F. COMET: Co-Optimization of a CNN Model using Efficient-Hardware OBC Techniques. arXiv 2025, arXiv:2510.03516. [Google Scholar] [CrossRef]
- Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
- Lin, Z.; Akin, H.; Rao, R.; Hie, B.; Zhu, Z.; Lu, W.; Smetanin, N.; Verkuil, R.; Kabeli, O.; Shmueli, Y.; et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 2023, 379, 1123–1130. [Google Scholar] [CrossRef]
- Khan, M.T.; Alhartomi, M.A. Digit-Serial DA-Based Fixed-Point RNNs: A Unified Approach for Enhancing Architectural Efficiency. IEEE Trans. Neural Netw. Learn. Syst. 2024, 36, 8240–8254. [Google Scholar] [CrossRef]
- Yalamarthy, K.P.; Dhall, S.; Khan, M.T.; Shaik, R.A. Low-complexity distributed-arithmetic-based pipelined architecture for an LSTM network. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 2019, 28, 329–338. [Google Scholar] [CrossRef]
- Khan, M.T.; Yantır, H.E.; Salama, K.N.; Eltawil, A.M. Architectural trade-off analysis for accelerating LSTM network using Radix-r OBC scheme. IEEE Trans. Circuits Syst. I Regul. Pap. 2022, 70, 266–279. [Google Scholar] [CrossRef]
- Alhartomi, M.A.; Khan, M.T.; Alzahrani, S.; Alzahmi, A.; Shaik, R.A.; Hazarika, J.; Alsulami, R.; Alotaibi, A.; Al-Harthi, M. Low-area and low-power VLSI architectures for long short-term memory networks. IEEE J. Emerg. Sel. Top. Circuits Syst. 2023, 13, 1000–1014. [Google Scholar] [CrossRef]
- Guo, A.B.; Akpinaroglu, D.; Stephens, C.A.; Grabe, M.; Smith, C.A.; Kelly, M.J.; Kortemme, T. Deep learning–guided design of dynamic proteins. Science 2025, 388, eadr7094. [Google Scholar] [CrossRef]
- Jiang, Y.; Ran, X.; Yang, Z.J. Data-driven enzyme engineering to identify function-enhancing enzymes. Protein Eng. Des. Sel. 2023, 36, gzac009. [Google Scholar] [CrossRef] [PubMed]
- Wang, Z.; Xie, D.; Wu, D.; Luo, X.; Wang, S.; Li, Y.; Yang, Y.; Li, W.; Zheng, L. Robust enzyme discovery and engineering with deep learning using CataPro. Nat. Commun. 2025, 16, 2736. [Google Scholar] [CrossRef] [PubMed]
- Wang, Y.; Cheng, L.; Zhang, Y.; Cao, Y.; Alghazzawi, D. DEKP: A deep learning model for enzyme kinetic parameter prediction based on pretrained models and graph neural networks. Brief. Bioinform. 2025, 26, bbaf187. [Google Scholar] [CrossRef] [PubMed]
- Li, G.; Zhang, N.; Dai, X.; Fan, L. EnzyACT: A novel deep learning method to predict the impacts of single and multiple mutations on enzyme activity. J. Chem. Inf. Model. 2024, 64, 5912–5921. [Google Scholar] [CrossRef]
- Liu, C.; Wu, J.; Chen, Y.; Liu, Y.; Zheng, Y.; Liu, L.; Zhao, J. Advances in zero-shot prediction-guided enzyme engineering using machine learning. ChemCatChem 2025, 17, e202401542. [Google Scholar] [CrossRef]
- Shi, Z.; Xu, S.; Xue, S.; Chen, K.; Lu, Y.; Wang, F.; Long, S.; Tian, Y.; Zhang, P.; Wang, J.; et al. From Machine Learning to Multimodal Models: The AI Revolution in Enzyme Engineering. BioDesign Res. 2025, in press, 100044. [Google Scholar] [CrossRef]
- Farhan, M.; Hasani, I.W.; Khafaga, D.S.R.; Ragab, W.M.; Ahmed Kazi, R.N.; Aatif, M.; Muteeb, G.; Fahim, Y.A. Enzymes as Catalysts in Industrial Biocatalysis: Advances in Engineering, Applications, and Sustainable Integration. Catalysts 2025, 15, 891. [Google Scholar] [CrossRef]
- Xie, W.J.; Warshel, A. Harnessing generative AI to decode enzyme catalysis and evolution for enhanced engineering. Natl. Sci. Rev. 2023, 10, nwad331. [Google Scholar] [CrossRef]
- Mardikoraem, M.; Wang, Z.; Pascual, N.; Woldring, D. Generative models for protein sequence modeling: Recent advances and future directions. Brief. Bioinform. 2023, 24, bbad358. [Google Scholar] [CrossRef]
- Barghout, R.A.; Xu, Z.; Betala, S.; Mahadevan, R. Advances in generative modeling methods and datasets to design novel enzymes for renewable chemicals and fuels. Curr. Opin. Biotechnol. 2023, 84, 103007. [Google Scholar] [CrossRef]
- Hawkins-Hooker, A.; Depardieu, F.; Baur, S.; Couairon, G.; Chen, A.; Bikard, D. Generating functional protein variants with variational autoencoders. PLoS Comput. Biol. 2021, 17, e1008736. [Google Scholar] [CrossRef]
- Schmitt, L.T.; Paszkowski-Rogacz, M.; Jug, F.; Buchholz, F. Prediction of designer-recombinases for DNA editing with generative deep learning. Nat. Commun. 2022, 13, 7966. [Google Scholar] [CrossRef] [PubMed]
- Repecka, D.; Jauniskis, V.; Karpus, L.; Rembeza, E.; Rokaitis, I.; Zrimec, J.; Poviloniene, S.; Laurynenas, A.; Viknander, S.; Abuajwa, W.; et al. Expanding functional protein sequence spaces using generative adversarial networks. Nat. Mach. Intell. 2021, 3, 324–333. [Google Scholar] [CrossRef]
- Rives, A.; Meier, J.; Sercu, T.; Goyal, S.; Lin, Z.; Liu, J.; Guo, D.; Ott, M.; Zitnick, C.L.; Ma, J.; et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl. Acad. Sci. USA 2021, 118, e2016239118. [Google Scholar] [CrossRef] [PubMed]
- Munsamy, G.; Illanes-Vicioso, R.; Funcillo, S.; Nakou, I.T.; Lindner, S.; Ayres, G.; Sheehan, L.S.; Moss, S.; Eckhard, U.; Lorenz, P.; et al. Conditional language models enable the efficient design of proficient enzymes. bioRxiv 2024. bioRxiv:2024-05. [Google Scholar]
- Wen, S.; Zheng, W.; Bornscheuer, U.T.; Wu, S. Generative artificial intelligence for enzyme design: Recent advances in models and applications. Curr. Opin. Green Sustain. Chem. 2025, 52, 101010. [Google Scholar] [CrossRef]
- Khan, M.T.; Shaik, R.A. Optimal complexity architectures for pipelined distributed arithmetic-based LMS adaptive filter. IEEE Trans. Circuits Syst. I Regul. Pap. 2018, 66, 630–642. [Google Scholar] [CrossRef]
- Khan, M.T.; Gustafsson, O. ASIC implementation trade-offs for high-speed LMS and block LMS adaptive filters. In Proceedings of the 2022 IEEE 65th International Midwest Symposium on Circuits and Systems (MWSCAS), Fukuoka, Japan, 7–10 August 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1–4. [Google Scholar]
- Grechishnikova, D. Transformer neural network for protein-specific de novo drug generation as a machine translation problem. Sci. Rep. 2021, 11, 321. [Google Scholar] [CrossRef]
- Dauparas, J.; Anishchenko, I.; Bennett, N.; Bai, H.; Ragotte, R.J.; Milles, L.F.; Wicky, B.I.; Courbet, A.; de Haas, R.J.; Bethel, N.; et al. Robust deep learning–based protein sequence design using ProteinMPNN. Science 2022, 378, 49–56. [Google Scholar] [CrossRef]
- Sun, H.; He, L.; Deng, P.; Liu, G.; Zhao, Z.; Jiang, Y.; Cao, C.; Ju, F.; Wu, L.; Liu, H.; et al. Accelerating protein engineering with fitness landscape modelling and reinforcement learning. Nat. Mach. Intell. 2025, 7, 1440–1466. [Google Scholar] [CrossRef]
- Pereira, T.O.; Abbasi, M.; Arrais, J.P. Enhancing reinforcement learning for de novo molecular design applying self-attention mechanisms. Brief. Bioinform. 2023, 24, bbad368. [Google Scholar] [CrossRef] [PubMed]
- Haddad, R.; Litsa, E.E.; Liu, Z.; Yu, X.; Burkhardt, D.; Bhisetti, G. Targeted molecular generation with latent reinforcement learning. Sci. Rep. 2025, 15, 15202. [Google Scholar] [CrossRef] [PubMed]
- Koch, M.; Duigou, T.; Faulon, J.L. Reinforcement learning for bioretrosynthesis. ACS Synth. Biol. 2019, 9, 157–168. [Google Scholar] [CrossRef]
- Damborsky, J.; Kouba, P.; Sivic, J.; Vasina, M.; Bednar, D.; Mazurenko, S. Quantum computing for faster enzyme discovery and engineering. Nat. Catal. 2025, 8, 872–880. [Google Scholar] [CrossRef]
- Andersson, M.P.; Jones, M.N.; Mikkelsen, K.V.; You, F.; Mansouri, S.S. Quantum computing for chemical and biomolecular product design. Curr. Opin. Chem. Eng. 2022, 36, 100754. [Google Scholar] [CrossRef]
- Gertig, C.; Fleitmann, L.; Hemprich, C.; Hense, J.; Bardow, A.; Leonhard, K. CAT-COSMO-CAMPD: Integrated in silico design of catalysts and processes based on quantum chemistry. Comput. Chem. Eng. 2021, 153, 107438. [Google Scholar] [CrossRef]
- Mandal, C.; Linthicum, D.S. PROGEN: An automated modelling algorithm for the generation of complete protein structures from the α-carbon atomic coordinates. J. Comput.-Aided Mol. Des. 1993, 7, 199–224. [Google Scholar] [CrossRef]
- Krokidis, M.G.; Koumadorakis, D.E.; Lazaros, K.; Ivantsik, O.; Exarchos, T.P.; Vrahatis, A.G.; Kotsiantis, S.; Vlamos, P. AlphaFold3: An Overview of Applications and Performance Insights. Int. J. Mol. Sci. 2025, 26, 3671. [Google Scholar] [CrossRef]
- Cho, Y.; Pacesa, M.; Zhang, Z.; Correia, B.E.; Ovchinnikov, S. BoltzDesign1: Inverting All-Atom Structure Prediction Model for Generalized Biomolecular Binder Design. bioRxiv 2025. bioRxiv:2025-04. [Google Scholar]
- Ahern, W.; Yim, J.; Tischer, D.; Salike, S.; Woodbury, S.M.; Kim, D.; Kalvet, I.; Kipnis, Y.; Coventry, B.; Altae-Tran, H.R.; et al. Atom Level Enzyme Active Site Scaffolding Using RFdiffusion2. bioRxiv 2025. bioRxiv:2025-04. [Google Scholar] [CrossRef]
- Alamdari, S.; Thakkar, N.; Van Den Berg, R.; Tenenholtz, N.; Strome, R.; Moses, A.M.; Lu, A.X.; Fusi, N.; Amini, A.P.; Yang, K.K. Protein Generation with Evolutionary Diffusion: Sequence Is All You Need. bioRxiv 2023. bioRxiv:2023-09. [Google Scholar]
- Yim, J.; Trippe, B.L.; De Bortoli, V.; Mathieu, E.; Doucet, A.; Barzilay, R.; Jaakkola, T. SE(3) Diffusion Model with Application to Protein Backbone Generation. arXiv 2023, arXiv:2302.02277. [Google Scholar] [CrossRef]
- Sánchez, B.J.; Zhang, C.; Nilsson, A.; Lahtvee, P.J.; Kerkhoven, E.J.; Nielsen, J. Improving the phenotype predictions of a yeast genome-scale metabolic model by incorporating enzymatic constraints. Mol. Syst. Biol. 2017, 13, 935. [Google Scholar] [CrossRef] [PubMed]
- Khodayari, A.; Maranas, C.D. A genome-scale Escherichia coli kinetic metabolic model k-ecoli457 satisfying flux data for multiple mutant strains. Nat. Commun. 2016, 7, 13806. [Google Scholar] [CrossRef]
- Alazmi, M. Enzyme catalytic efficiency prediction: Employing convolutional neural networks and XGBoost. Front. Artif. Intell. 2024, 7, 1446063. [Google Scholar] [CrossRef] [PubMed]
- Xuan, W.; Li, X.; Gao, H.; Zhang, L.; Hu, J.; Sun, L.; Kan, H. Artificial intelligence driven platform for rapid catalytic performance assessment of nanozymes. Sci. Rep. 2025, 15, 13305. [Google Scholar] [CrossRef]
- Teng, C.; Jiang, Y.; Xu, Y.; Li, Q.; Li, X.; Fan, G.; Xiong, K.; Yang, R.; Zhang, C.; Ma, R.; et al. Improving the thermostability and catalytic efficiency of GH11 xylanase PjxA by adding disulfide bridges. Int. J. Biol. Macromol. 2019, 128, 354–362. [Google Scholar] [CrossRef]
- Upadhyay, V.; Boorla, V.S.; Maranas, C.D. Rank-ordering of known enzymes as starting points for re-engineering novel substrate activity using a convolutional neural network. Metab. Eng. 2023, 78, 171–182. [Google Scholar] [CrossRef]
- Hedstrom, L. Enzyme Specificity and Selectivity. In Encyclopedia of Life Sciences (ELS); John Wiley & Sons, Ltd.: Chichester, UK, 2010. [Google Scholar] [CrossRef]
- Wu, L.; Qin, L.; Nie, Y.; Xu, Y.; Zhao, Y.L. Computer-aided understanding and engineering of enzymatic selectivity. Biotechnol. Adv. 2022, 54, 107793. [Google Scholar] [CrossRef]
- Arnold, F.H. Engineering proteins for nonnatural environments. FASEB J. 1993, 7, 744–749. [Google Scholar] [CrossRef]
- Bloom, J.D.; Arnold, F.H. In the light of directed evolution: Pathways of adaptive protein evolution. Proc. Natl. Acad. Sci. USA 2009, 106, 9995–10000. [Google Scholar] [CrossRef] [PubMed]
- Song, Z.; Zhang, Q.; Wu, W.; Pu, Z.; Yu, H. Rational design of enzyme activity and enantioselectivity. Front. Bioeng. Biotechnol. 2023, 11, 1129149. [Google Scholar] [CrossRef]
- Varadarajan, N.; Gam, J.; Olsen, M.J.; Georgiou, G.; Iverson, B.L. Engineering of protease variants exhibiting high catalytic activity and exquisite substrate selectivity. Proc. Natl. Acad. Sci. USA 2005, 102, 6855–6860. [Google Scholar] [CrossRef] [PubMed]
- Kim, S.; Ga, S.; Bae, H.; Sluyter, R.; Konstantinov, K.; Shrestha, L.K.; Kim, Y.H.; Kim, J.H.; Ariga, K. Multidisciplinary approaches for enzyme biocatalysis in pharmaceuticals: Protein engineering, computational biology, and nanoarchitectonics. EES Catal. 2024, 2, 14–48. [Google Scholar] [CrossRef]
- Li, T.; Liang, J.; Ambrogelly, A.; Brennan, T.; Gloor, G.; Huisman, G.; Lalonde, J.; Lekhal, A.; Mijts, B.; Muley, S.; et al. Efficient, chemoenzymatic process for manufacture of the boceprevir bicyclic [3.1.0] proline intermediate based on amine oxidase-catalyzed desymmetrization. J. Am. Chem. Soc. 2012, 134, 6467–6472. [Google Scholar] [CrossRef]
- Wilson, C.; Agard, D.A. Engineering substrate specificity. Curr. Opin. Struct. Biol. 1991, 1, 617–623. [Google Scholar] [CrossRef]
- Carter, P.; Wells, J.A. Engineering enzyme specificity by “substrate-assisted catalysis”. Science 1987, 237, 394–399. [Google Scholar] [CrossRef]
- Chakravorty, D.; Khan, M.F.; Patra, S. Multifactorial level of extremostability of proteins: Can they be exploited for protein engineering? Extremophiles 2017, 21, 419–444. [Google Scholar] [CrossRef]
- Khan, M.F. Enhancing stability of enzymes for industrial applications: Molecular insights and emerging approaches. World J. Microbiol. Biotechnol. 2025, 41, 362. [Google Scholar] [CrossRef]
- Khan, M.F.; Patra, S. Deciphering the rationale behind specific codon usage pattern in extremophiles. Sci. Rep. 2018, 8, 15548. [Google Scholar] [CrossRef]
- Saha, P.; Khan, M.F.; Patra, S. Truncated α-amylase: An improved candidate for textile processing. Prep. Biochem. Biotechnol. 2018, 48, 635–645. [Google Scholar] [CrossRef] [PubMed]
- Chakravorty, D.; Khan, M.F.; Patra, S. Thermostability of proteins revisited through machine learning methodologies: From nucleotide sequence to structure. Curr. Biotechnol. 2017, 6, 39–49. [Google Scholar] [CrossRef]
- Kang, L.; Wu, B.; Zhou, B.; Tan, P.; Kang, Y.K.; Yan, Y.; Zong, Y.; Li, S.; Liu, Z.; Hong, L. AI-enabled alkaline-resistant evolution of protein to apply in mass production. eLife 2025, 13, RP102788. [Google Scholar] [CrossRef] [PubMed]
- Bian, J.; Tan, P.; Nie, T.; Hong, L.; Yang, G.Y. Optimizing enzyme thermostability by combining multiple mutations using protein language model. MLife 2024, 3, 492–504. [Google Scholar] [CrossRef]
- Muellers, S.N.; Allen, K.N.; Whitty, A. MEnTaT: A machine-learning approach for the identification of mutations to increase protein stability. Proc. Natl. Acad. Sci. USA 2023, 120, e2309884120. [Google Scholar] [CrossRef]
- Zheng, N.; Cai, Y.; Zhang, Z.; Zhou, H.; Deng, Y.; Du, S.; Tu, M.; Fang, W.; Xia, X. Tailoring industrial enzymes for thermostability and activity evolution by the machine learning-based iCASE strategy. Nat. Commun. 2025, 16, 604. [Google Scholar] [CrossRef]
- Dou, Z.; Sun, Y.; Jiang, X.; Wu, X.; Li, Y.; Gong, B.; Wang, L. Data-driven strategies for the computational design of enzyme thermal stability: Trends, perspectives, and prospects. Acta Biochim. Biophys. Sin. 2023, 55, 343. [Google Scholar] [CrossRef]
- Teng, R.; Zhang, J.; Tu, Z.; He, Q.; Li, Y. Computer-Aided Design to Improve the Thermal Stability of Rhizomucor miehei Lipase. Foods 2024, 13, 4023. [Google Scholar] [CrossRef]
- Son, H.; Seo, H.; Han, S.; Kim, S.M.; Pham, L.T.M.; Khan, M.F.; Sung, H.J.; Kang, S.H.; Kim, K.J.; Kim, Y.H. Extra disulfide and ionic salt bridge improves the thermostability of lignin peroxidase H8 under acidic condition. Enzyme Microb. Technol. 2021, 148, 109803. [Google Scholar] [CrossRef]
- Wang, L.; Watermeyer, J.M.; Mulelu, A.E.; Sewell, B.T.; Benedik, M.J. Engineering pH-tolerant mutants of a cyanide dihydratase. Appl. Microbiol. Biotechnol. 2012, 94, 131–140. [Google Scholar] [CrossRef]
- Crum, M.A.; Sewell, B.T.; Benedik, M.J. Bacillus pumilus cyanide dihydratase mutants with higher catalytic activity. Front. Microbiol. 2016, 7, 1264. [Google Scholar] [CrossRef]
- Idicula-Thomas, S.; Balaji, P.V. Understanding the relationship between the primary structure of proteins and its propensity to be soluble on overexpression in Escherichia coli. Protein Sci. 2005, 14, 582–592. [Google Scholar] [CrossRef] [PubMed]
- Chen, J.P.; Gong, J.S.; Su, C.; Li, H.; Xu, Z.H.; Shi, J.S. Improving the soluble expression of difficult-to-express proteins in prokaryotic expression system via protein engineering and synthetic biology strategies. Metab. Eng. 2023, 78, 99–114. [Google Scholar] [CrossRef] [PubMed]
- Jo, B.H. Improved solubility and stability of a thermostable carbonic anhydrase via fusion with marine-derived intrinsically disordered solubility enhancers. Int. J. Mol. Sci. 2024, 25, 1139. [Google Scholar] [CrossRef] [PubMed]
- Han, X.; Wang, X.; Zhou, K. Develop machine learning-based regression predictive models for engineering protein solubility. Bioinformatics 2019, 35, 4640–4646. [Google Scholar] [CrossRef]
- Hirose, S.; Noguchi, T. ESPRESSO: A system for estimating protein expression and solubility in protein expression systems. Proteomics 2013, 13, 1444–1456. [Google Scholar] [CrossRef]
- Hebditch, M.; Carballo-Amador, M.A.; Charonis, S.; Curtis, R.; Warwicker, J. Protein–Sol: A web tool for predicting protein solubility from sequence. Bioinformatics 2017, 33, 3098–3310. [Google Scholar] [CrossRef]
- Khurana, S.; Rawi, R.; Kunji, K.; Chuang, G.Y.; Bensmail, H.; Mall, R. DeepSol: A deep learning framework for sequence-based protein solubility prediction. Bioinformatics 2018, 34, 2605–2613. [Google Scholar] [CrossRef]
- Klesmith, J.R.; Bacik, J.P.; Wrenbeck, E.E.; Michalczyk, R.; Whitehead, T.A. Trade-offs between enzyme fitness and solubility illuminated by deep mutational scanning. Proc. Natl. Acad. Sci. USA 2017, 114, 2265–2270. [Google Scholar] [CrossRef]
- Khan, M.F.; Kundu, D.; Hazra, C.; Patra, S. A strategic approach of enzyme engineering by attribute ranking and enzyme immobilization on zinc oxide nanoparticles to attain thermostability in mesophilic Bacillus subtilis lipase for detergent formulation. Int. J. Biol. Macromol. 2019, 136, 66–82. [Google Scholar] [CrossRef]
- Scherer, M.; Fleishman, S.J.; Jones, P.R.; Dandekar, T.; Bencurova, E. Computational enzyme engineering pipelines for optimized production of renewable chemicals. Front. Bioeng. Biotechnol. 2021, 9, 673005. [Google Scholar] [CrossRef] [PubMed]
- Zanghellini, A. De novo computational enzyme design. Curr. Opin. Biotechnol. 2014, 29, 132–138. [Google Scholar] [CrossRef] [PubMed]
- Cui, X.C.; Zheng, Y.; Liu, Y.; Yuchi, Z.; Yuan, Y.J. AI-driven de novo enzyme design: Strategies, applications, and future prospects. Biotechnol. Adv. 2025, 82, 108603. [Google Scholar] [CrossRef] [PubMed]
- Tiwari, V. In vitro engineering of novel bioactivity in natural enzymes. Front. Chem. 2016, 4, 39. [Google Scholar] [CrossRef]
- Hua, C.; Lu, J.; Liu, Y.; Zhang, O.; Tang, J.; Ying, R.; Jin, W.; Wolf, G.; Precup, D.; Zheng, S. Reaction-conditioned de novo enzyme design with Genzyme. arXiv 2024, arXiv:2411.16694. [Google Scholar]
- Braun, M.; Tripp, A.; Chakatok, M.; Kaltenbrunner, S.; Fischer, C.; Stoll, D.; Bijelic, A.; Elaily, W.; Totaro, M.G.; Moser, M.; et al. Computational enzyme design by catalytic motif scaffolding. Nature 2025. [Google Scholar] [CrossRef]
- Ruffolo, J.A.; Nayfach, S.; Gallagher, J.; Bhatnagar, A.; Beazer, J.; Hussain, R.; Russ, J.; Yip, J.; Hill, E.; Pacesa, M.; et al. Design of Highly Functional Genome Editors by Modelling CRISPR-Cas Sequences. Nature 2025, 645, 518–525. [Google Scholar] [CrossRef]
- Nguyen, E.; Poli, M.; Durrant, M.G.; Kang, B.; Katrekar, D.; Li, D.B.; Bartie, L.J.; Thomas, A.W.; King, S.H.; Brixi, G.; et al. Sequence Modeling and Design from Molecular to Genome Scale with Evo. Science 2024, 386, 6723. [Google Scholar] [CrossRef]
- Kurihara, K.; Umezawa, K.; Donnelly, A.E.; Sperling, B.; Liao, G.; Hecht, M.H.; Arai, R. Crystal structure and activity of a de novo enzyme, ferric enterobactin esterase Syn-F4. Proc. Natl. Acad. Sci. USA 2023, 120, e2218281120. [Google Scholar] [CrossRef]
- Orsi, E.; von Borzyskowski, L.S.; Noack, S.; Nikel, P.I.; Lindner, S.N. Automated in vivo enzyme engineering accelerates biocatalyst optimization. Nat. Commun. 2024, 15, 3447. [Google Scholar] [CrossRef]
- Wang, X.; Xu, K.; Tan, Y.; Liu, S.; Zhou, J. Possibilities of using de novo design for generating diverse functional food enzymes. Int. J. Mol. Sci. 2023, 24, 3827. [Google Scholar] [CrossRef]
- Hossack, E.J.; Hardy, F.J.; Green, A.P. Building enzymes through design and evolution. ACS Catal. 2023, 13, 12436–12444. [Google Scholar] [CrossRef]
- Naddaf, M. Scientists Use AI to Design Life-Like Enzymes from Scratch. Nature 2025. [Google Scholar] [CrossRef]
- Ming, Y.; Wang, W.; Yin, R.; Zeng, M.; Tang, L.; Tang, S.; Li, M. A review of enzyme design in catalytic stability by artificial intelligence. Brief. Bioinform. 2023, 24, bbad065. [Google Scholar] [CrossRef] [PubMed]
- Siedentop, R.; Siska, M.; Möller, N.; Lanzrath, H.; von Lieres, E.; Lütz, S.; Rosenthal, K. Bayesian optimization for an ATP-regenerating in vitro enzyme cascade. Catalysts 2023, 13, 468. [Google Scholar] [CrossRef]
- Putz, S.; Teetz, N.; Abt, M.; Jerono, P.; Meurer, T.; Franzreb, M. Optimized machine learning for autonomous enzymatic reaction intensification in a self-driving lab. Biotechnol. Bioeng. 2025, 122, 3018–3036. [Google Scholar] [CrossRef]
- Gantz, M.; Neun, S.; Medcalf, E.J.; van Vliet, L.D.; Hollfelder, F. Ultrahigh-throughput enzyme engineering and discovery in in vitro compartments. Chem. Rev. 2023, 123, 5571–5611. [Google Scholar] [CrossRef]
- Savile, C.K.; Janey, J.M.; Mundorff, E.C.; Moore, J.C.; Tam, S.; Jarvis, W.R.; Colbeck, J.C.; Krebber, A.; Fleitz, F.J.; Brands, J.; et al. Biocatalytic asymmetric synthesis of chiral amines from ketones applied to sitagliptin manufacture. Science 2010, 329, 305–309. [Google Scholar] [CrossRef]
- Desai, A.A. Sitagliptin manufacture: A compelling tale of green chemistry, process intensification, and industrial asymmetric catalysis. Angew. Chem. Int. Ed. 2011, 50, 1974–1976. [Google Scholar] [CrossRef]
- Nielsen, J.; Keasling, J.D. Engineering cellular metabolism. Cell 2016, 164, 1185–1197. [Google Scholar] [CrossRef]
- Dusséaux, S.; Wajn, W.T.; Liu, Y.; Ignea, C.; Kampranis, S.C. Transforming yeast peroxisomes into microfactories for the efficient production of high-value isoprenoids. Proc. Natl. Acad. Sci. USA 2020, 117, 31789–31799. [Google Scholar] [CrossRef] [PubMed]
- Guo, Q.; Yang, Y.X.; Li, D.X.; Ji, X.J.; Wu, N.; Wang, Y.T.; Ye, C.; Shi, T.Q. Advances in multi-enzyme co-localization strategies for the construction of microbial cell factory. Biotechnol. Adv. 2024, 77, 108453. [Google Scholar] [CrossRef] [PubMed]
- Delebecque, C.J.; Lindner, A.B.; Silver, P.A.; Aldaye, F.A. Organization of intracellular reactions with rationally designed RNA assemblies. Science 2011, 333, 470–474. [Google Scholar] [CrossRef] [PubMed]
- Boob, A.G.; Chen, J.; Zhao, H. Enabling pathway design by multiplex experimentation and machine learning. Metab. Eng. 2024, 81, 70–87. [Google Scholar] [CrossRef]
- Lee, M.E.; Aswani, A.; Han, A.S.; Tomlin, C.J.; Dueber, J.E. Expression-level optimization of a multi-enzyme pathway in the absence of a high-throughput assay. Nucleic Acids Res. 2013, 41, 10668–10678. [Google Scholar] [CrossRef]
- Xu, W.; Cai, J.; Wu, W.; Yuan, Q.; Mao, Z.; Ma, H. Improving metabolic engineering design with enzyme-thermo optimization. Metab. Eng. 2025, 91, 356–365. [Google Scholar] [CrossRef]
- Gao, D.Y.; Sun, X.B.; Liu, M.Q.; Liu, Y.N.; Zhang, H.E.; Shi, X.L.; Li, Y.N.; Wang, J.K.; Yin, S.J.; Wang, Q. Characterization of thermostable and chimeric enzymes via isopeptide bond-mediated molecular cyclization. J. Agric. Food Chem. 2019, 67, 6837–6846. [Google Scholar] [CrossRef]
- Radley, E.; Davidson, J.; Foster, J.; Obexer, R.; Bell, E.L.; Green, A.P. Engineering enzymes for environmental sustainability. Angew. Chem. Int. Ed. 2023, 62, e202309305. [Google Scholar] [CrossRef]
- Khan, M.F. Recent progress and challenges in microbial defluorination and degradation for sustainable remediation of fluorinated xenobiotics. Processes 2025, 13, 2017. [Google Scholar] [CrossRef]
- Khan, M.F.; Rama, M.; Murphy, C.D. Biodegradation of fluorinated β-triketone herbicide tembotrione by a bacterial–fungal consortium. Biocatal. Agric. Biotechnol. 2025, 70, 103828. [Google Scholar] [CrossRef]
- Khan, M.F.; Hof, C.; Niemcová, P.; Murphy, C.D. Recent advances in fungal xenobiotic metabolism: Enzymes and applications. World J. Microbiol. Biotechnol. 2023, 39, 296. [Google Scholar] [CrossRef] [PubMed]
- Khan, M.F.; Murphy, C.D. Nitroreduction of flutamide by Cunninghamella elegans NADPH: Cytochrome P450 reductase. Biochem. Biophys. Rep. 2022, 29, 101209. [Google Scholar] [CrossRef] [PubMed]
- Khan, M.F. Recent advances in microbial enzyme applications for sustainable textile processing and waste management. Sci 2025, 7, 46. [Google Scholar] [CrossRef]
- Griffiths, A.D.; Tawfik, D.S. Directed evolution of an extremely fast phosphotriesterase by in vitro compartmentalization. EMBO J. 2003, 22, 24–35. [Google Scholar] [CrossRef]
- Floor, R.J.; Wijma, H.J.; Colpa, D.I.; Ramos---Silva, A.; Jekel, P.A.; Szymański, W.; Feringa, B.L.; Marrink, S.J.; Janssen, D.B. Computational library design for increasing haloalkane dehalogenase stability. ChemBioChem 2014, 15, 1660–1672. [Google Scholar] [CrossRef]
- Khan, M.F. Fungi for sustainable pharmaceutical remediation: Enzymatic innovations, challenges, and applications—A review. Processes 2025, 13, 1034. [Google Scholar] [CrossRef]
- Jacquet, P.; Hiblot, J.; Daudé, D.; Bergonzi, C.; Gotthard, G.; Armstrong, N.; Chabrière, E.; Elias, M. Rational engineering of a native hyperthermostable lactonase into a broad spectrum phosphotriesterase. Sci. Rep. 2017, 7, 16745. [Google Scholar] [CrossRef]
- Poirier, L.; Pinault, L.; Armstrong, N.; Ghigo, E.; Daudé, D.; Chabrière, E. Evaluation of a robust engineered enzyme towards organophosphorus insecticide bioremediation using planarians as biosensors. Chem.-Biol. Interact. 2019, 306, 96–103. [Google Scholar] [CrossRef]
- Sung, H.J.; Khan, M.F.; Kim, Y.H. Recombinant lignin peroxidase-catalyzed decolorization of melanin using in situ generated H2O2 for application in whitening cosmetics. Int. J. Biol. Macromol. 2019, 136, 20–26. [Google Scholar] [CrossRef]
- Ndochinwa, O.G.; Wang, Q.Y.; Amadi, O.C.; Nwagu, T.N.; Nnamchi, C.I.; Okeke, E.S.; Moneke, A.N. Current status and emerging frontiers in enzyme engineering: An industrial perspective. Heliyon 2024, 10, 11. [Google Scholar] [CrossRef]
- Victorino da Silva Amatto, I.; Gonsales da Rosa-Garzon, N.; Antônio de Oliveira Simões, F.; Santiago, F.; Pereira da Silva Leite, N.; Raspante Martins, J.; Cabral, H. Enzyme engineering and its industrial applications. Biotechnol. Appl. Biochem. 2022, 69, 389–409. [Google Scholar] [CrossRef] [PubMed]
- Alcalde, M. (Ed.) Directed Enzyme Evolution: Advances and Applications; Springer: Cham, Switzerland, 2017. [Google Scholar]
- Shelat, N.Y.; Parhi, S.; Ostermeier, M. Development of a cancer-marker activated enzymatic switch from the herpes simplex virus thymidine kinase. Protein Eng. Des. Sel. 2017, 30, 95–103. [Google Scholar] [CrossRef] [PubMed]
- Khan, M.F.; Murphy, C.D. Bacterial degradation of the anti-depressant drug fluoxetine produces trifluoroacetic acid and fluoride ion. Appl. Microbiol. Biotechnol. 2021, 105, 9359–9369. [Google Scholar] [CrossRef] [PubMed]
- Shahbazmohammadi, H.; Sardari, S.; Lari, A.; Omidinia, E. Engineering an efficient mutant of Eupenicillium terrenum fructosyl peptide oxidase for the specific determination of hemoglobin A1c. Appl. Microbiol. Biotechnol. 2019, 103, 1725–1735. [Google Scholar] [CrossRef]
- Giessel, A.; Dousis, A.; Ravichandran, K.; Smith, K.; Sur, S.; McFadyen, I.; Zheng, W.; Licht, S. Therapeutic Enzyme Engineering Using a Generative Neural Network. Sci. Rep. 2022, 12, 1536. [Google Scholar] [CrossRef]
- Huisman, G.W.; Collier, S.J. On the development of new biocatalytic processes for practical pharmaceutical synthesis. Curr. Opin. Chem. Biol. 2013, 17, 284–292. [Google Scholar] [CrossRef]
- Huisman, G.W.; Liang, J.; Krebber, A. Practical chiral alcohol manufacture using ketoreductases. Curr. Opin. Chem. Biol. 2010, 14, 122–129. [Google Scholar] [CrossRef]
- Midelfort, K.S.; Kumar, R.; Han, S.; Karmilowicz, M.J.; McConnell, K.; Gehlhaar, D.K.; Mistry, A.; Chang, J.S.; Anderson, M.; Villalobos, A.; et al. Redesigning and characterizing the substrate specificity and activity of Vibrio fluvialis aminotransferase for the synthesis of imagabalin. Protein Eng. Des. Sel. 2013, 26, 25–33. [Google Scholar] [CrossRef]
- Ghislieri, D.; Green, A.P.; Pontini, M.; Willies, S.C.; Rowles, I.; Frank, A.; Grogan, G.; Turner, N.J. Engineering an enantioselective amine oxidase for the synthesis of pharmaceutical building blocks and alkaloid natural products. J. Am. Chem. Soc. 2013, 135, 10863–10869. [Google Scholar] [CrossRef]
- Fenel, F.; Leisola, M.; Jänis, J.; Turunen, O. A de novo designed N-terminal disulphide bridge stabilizes the Trichoderma reesei endo-1,4-β-xylanase II. J. Biotechnol. 2004, 108, 137–143. [Google Scholar] [CrossRef]
- Han, N.; Ma, Y.; Mu, Y.; Tang, X.; Li, J.; Huang, Z. Enhancing thermal tolerance of a fungal GH11 xylanase guided by B-factor analysis and multiple sequence alignment. Enzyme Microb. Technol. 2019, 131, 109422. [Google Scholar] [CrossRef]
- Zhou, C.; Xue, Y.; Ma, Y. Evaluation and directed evolution for thermostability improvement of a GH13 thermo-stable α-glucosidase from Thermus thermophilus TC11. BMC Biotechnol. 2015, 15, 97. [Google Scholar] [CrossRef] [PubMed]
- Lin, L.L.; Liu, J.S.; Wang, W.C.; Chen, S.H.; Huang, C.C.; Lo, H.F. Glutamic acid 219 is critical for the thermostability of a truncated α-amylase from alkaliphilic and thermophilic Bacillus sp. strain TS-23. World J. Microbiol. Biotechnol. 2008, 24, 619–626. [Google Scholar] [CrossRef]
- Duan, X.; Chen, J.; Wu, J. Improving the thermostability and catalytic efficiency of Bacillus deramificans pullulanase by site-directed mutagenesis. Appl. Environ. Microbiol. 2013, 79, 4072–4077. [Google Scholar] [CrossRef] [PubMed]
- Yu, X.J.; Huang, C.Y.; Xu, X.D.; Chen, H.; Liang, M.J.; Xu, Z.X.; Xu, H.X.; Wang, Z. Protein engineering of a pyridoxal-5′-phosphate-dependent l-aspartate-α-decarboxylase from Tribolium castaneum for β-alanine production. Molecules 2020, 25, 1280. [Google Scholar] [CrossRef]
- Lee, S.C.; Chang, Y.; Shin, D.M.; Han, J.; Seo, M.H.; Fazelinia, H.; Maranas, C.D.; Kim, H.S. Designing the substrate specificity of D-hydantoinase using a rational approach. Enzyme Microb. Technol. 2009, 44, 170–175. [Google Scholar] [CrossRef]
- Hegazy, U.M.; El-Khonezy, M.I.; Shokeer, A.; Abdel-Ghany, S.S.; Bassuny, R.I.; Barakat, A.Z.; Salama, W.H.; Azouz, R.A.; Fahmy, A.S. Revealing of a novel xylose-binding site of Geobacillus stearothermophilus xylanase by directed evolution. J. Biochem. 2019, 165, 177–184. [Google Scholar] [CrossRef]
- Madan, B.; Mishra, P. Directed evolution of Bacillus licheniformis lipase for improvement of thermostability. Biochem. Eng. J. 2014, 91, 276–282. [Google Scholar] [CrossRef]
- Zhao, H.Y.; Feng, H. Engineering Bacillus pumilus alkaline serine protease to increase its low-temperature proteolytic activity by directed evolution. BMC Biotechnol. 2018, 18, 34. [Google Scholar] [CrossRef]
- Jakob, F.; Martinez, R.; Mandawe, J.; Hellmuth, H.; Siegert, P.; Maurer, K.H.; Schwaneberg, U. Surface charge engineering of a Bacillus gibsonii subtilisin protease. Appl. Microbiol. Biotechnol. 2013, 97, 6793–6802. [Google Scholar] [CrossRef]
- Dror, A.; Shemesh, E.; Dayan, N.; Fishman, A. Protein engineering by random mutagenesis and structure-guided consensus of Geobacillus stearothermophilus lipase T6 for enhanced stability in methanol. Appl. Environ. Microbiol. 2014, 80, 1515–1527. [Google Scholar] [CrossRef] [PubMed]
- Zhao, G.; Wang, J.; Tang, Q.; Lan, D.; Wang, Y. Improving the catalytic activity and thermostability of MAS1 lipase by alanine substitution. Mol. Biotechnol. 2018, 60, 319–328. [Google Scholar] [CrossRef] [PubMed]
- Ma, C.; Wu, R.; Huang, R.; Jiang, W.; You, C.; Zhu, L.; Zhu, Z. Directed evolution of a 6-phosphogluconate dehydrogenase for operating an enzymatic fuel cell at lowered anodic pHs. J. Electroanal. Chem. 2019, 851, 113444. [Google Scholar] [CrossRef]
- Ali, M.; Husain, Q.; Alam, N.; Ahmad, M. Nano-peroxidase fabrication on cation exchanger nanocomposite: Augmenting catalytic efficiency and stability for the decolorization and detoxification of Methyl Violet 6B dye. Sep. Purif. Technol. 2018, 203, 20–28. [Google Scholar] [CrossRef]
- Harford-Cross, C.F.; Carmichael, A.B.; Allan, F.K.; England, P.A.; Rouch, D.A.; Wong, L.L. Protein engineering of cytochrome P450cam (CYP101) for the oxidation of polycyclic aromatic hydrocarbons. Protein Eng. 2000, 13, 121–128. [Google Scholar] [CrossRef]
- Syed, K.; Porollo, A.; Miller, D.; Yadav, J.S. Rational engineering of the fungal P450 monooxygenase CYP5136A3 to improve its oxidizing activity toward polycyclic aromatic hydrocarbons. Protein Eng. Des. Sel. 2013, 26, 553–557. [Google Scholar] [CrossRef]
- Bilal, M.; Iqbal, H.M. State-of-the-art strategies and applied perspectives of enzyme biocatalysis in food sector—current status and future trends. Crit. Rev. Food Sci. Nutr. 2020, 60, 2052–2066. [Google Scholar] [CrossRef]
- Vojcic, L.; Pitzler, C.; Körfer, G.; Jakob, F.; Martinez, R.; Maurer, K.H.; Schwaneberg, U. Advances in protease engineering for laundry detergents. New Biotechnol. 2015, 32, 629–634. [Google Scholar] [CrossRef]
- Khan, M.F.; Murphy, C.D. Environmental remediation by novel nanomaterials and fungi with high-degradation capacity of hazardous contaminants. In Bio and Nanoremediation of Hazardous Environmental Pollutants; CRC Press: Boca Raton, FL, USA, 2023; pp. 283–310. [Google Scholar]
- Khan, M.F.; Liao, J.; Liu, Z.; Chugh, G. Bacterial cytochrome P450 involvement in the biodegradation of fluorinated pyrethroids. J. Xenobiot. 2025, 15, 58. [Google Scholar] [CrossRef]
- Agrawal, K.; Bhatt, A.; Chaturvedi, V.; Verma, P. Bioremediation: An effective technology toward a sustainable environment via the remediation of emerging environmental pollutants. In Emerging Technologies in Environmental Bioremediation; Elsevier: Amsterdam, The Netherlands, 2020; pp. 165–196. [Google Scholar]
- Singh, N.; Malik, S.; Gupta, A.; Srivastava, K.R. Revolutionizing enzyme engineering through artificial intelligence and machine learning. Emerg. Top. Life Sci. 2021, 5, 113–125. [Google Scholar] [CrossRef]
- Sapoval, N.; Aghazadeh, A.; Nute, M.G.; Antunes, D.A.; Balaji, A.; Baraniuk, R.; Barberan, C.; Dannenfelser, R.; Dun, C.; Edrisi, M.; et al. Current progress and open challenges for applying deep learning across the biosciences. Nat. Commun. 2022, 13, 1728. [Google Scholar] [CrossRef] [PubMed]
- Nugroho, H. A review: Data quality problem in predictive analytics. IJAIT (Int. J. Appl. Inf. Technol.) 2023, 7, 79–91. [Google Scholar] [CrossRef]
- Heid, E.; Probst, D.; Green, W.H.; Madsen, G.K. EnzymeMap: Curation, validation and data-driven prediction of enzymatic reactions. Chem. Sci. 2023, 14, 14229–14242. [Google Scholar] [CrossRef] [PubMed]
- Probst, D.; Manica, M.; Nana Teukam, Y.G.; Castrogiovanni, A.; Paratore, F.; Laino, T. Biocatalysed synthesis planning using data-driven learning. Nat. Commun. 2022, 13, 964. [Google Scholar] [CrossRef]
- Tipton, K.F.; Armstrong, R.N.; Bakker, B.M.; Bairoch, A.; Cornish-Bowden, A.; Halling, P.J.; Hofmeyr, J.H.; Leyh, T.S.; Kettner, C.; Raushel, F.M.; et al. Standards for Reporting Enzyme Data: The STRENDA Consortium: What it aims to do and why it should be helpful. Perspect. Sci. 2014, 1, 131–137. [Google Scholar] [CrossRef]
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef]
- Sandve, G.K.; Nekrutenko, A.; Taylor, J.; Hovig, E. Ten simple rules for reproducible computational research. PLoS Comput. Biol. 2013, 9, e1003285. [Google Scholar] [CrossRef]
- Matsuta, Y.; Ito, M.; Tohsato, Y. ECOH: An enzyme commission number predictor using mutual information and a support vector machine. Bioinformatics 2013, 29, 365–372. [Google Scholar] [CrossRef]
- Baylon, J.L.; Cilfone, N.A.; Gulcher, J.R.; Chittenden, T.W. Enhancing retrosynthetic reaction prediction with deep learning using multiscale reaction classification. J. Chem. Inf. Model. 2019, 59, 673–688. [Google Scholar] [CrossRef]
- Tavakoli, M.; Shmakov, A.; Ceccarelli, F.; Baldi, P. Rxn hypergraph: A hypergraph attention model for chemical reaction representation. arXiv 2022, arXiv:2201.01196. [Google Scholar] [CrossRef]
- Zeng, K.; Liu, X.; Zhang, Y.; Yang, X.; Jin, Y.; Xu, Y. Learning chemical reaction representation with reactant-product alignment. arXiv 2024, arXiv:2411.17629. [Google Scholar] [CrossRef]
- Copley, S.D. Enzymes with extra talents: Moonlighting functions and catalytic promiscuity. Curr. Opin. Chem. Biol. 2003, 7, 265–272. [Google Scholar] [CrossRef]
- Delépine, B.; Duigou, T.; Carbonell, P.; Faulon, J.L. RetroPath2.0: A retrosynthesis workflow for metabolic engineers. Metab. Eng. 2018, 45, 158–170. [Google Scholar] [CrossRef]
- Dönertaş, H.M.; Martínez Cuesta, S.; Rahman, S.A.; Thornton, J.M. Characterising complex enzyme reaction data. PLoS ONE 2016, 11, e0147952. [Google Scholar] [CrossRef]
- Schnoes, A.M.; Brown, S.D.; Dodevski, I.; Babbitt, P.C. Annotation error in public databases: Misannotation of molecular function in enzyme superfamilies. PLoS Comput. Biol. 2009, 5, e1000605. [Google Scholar] [CrossRef]
- Kulmanov, M.; Hoehndorf, R. DeepGOPlus: Improved protein function prediction from sequence. Bioinformatics 2020, 36, 422–429. [Google Scholar] [CrossRef]
- Heijnen, J.J.; Verheijen, P.J. Parameter identification of in vivo kinetic models: Limitations and challenges. Biotechnol. J. 2013, 8, 768–775. [Google Scholar] [CrossRef]




| Tool | Description | Key Features/Applications | Developer/Source | Tool URL |
|---|---|---|---|---|
| Structure Prediction | ||||
| AlphaFold2 (v2.0.0) | Deep learning-based protein structure prediction | High-accuracy 3D folding from sequence data; catalytic site inference | DeepMind | https://www.deepmind.com/research/highlighted-research/alphafold (accessed on 1 October 2025) |
| RoseTTAFold (v1.0) | Multi-track neural network integrating sequence and structural information | Predicts structure and function; supports novel fold design | Baker Lab | https://boinc.bakerlab.org/rosetta/ (accessed on 1 October 2025) |
| Chai-1 | Enhanced structure prediction using multimodal inputs (MSAs, templates, embeddings) | Robust prediction across diverse biomolecules | Chai Discovery | https://neurosnap.ai/service/Chai-1 (accessed on 1 October 2025) |
| OmegaFold (v1.1.0) | Structure prediction without multiple sequence alignments (MSAs) | Accurate, MSA-free folding for low-homology sequences | HeliXon Protein Inc. | https://github.com/HeliXonProtein/OmegaFold or https://cosmic-cryoem.org/tools/omegafold/ (accessed on 1 October 2025) |
| HelixFold | Fast, efficient structure prediction framework | Optimised for industrial biodesign pipelines | PaddleHelix | https://github.com/PaddlePaddle/PaddleHelix/tree/dev/apps/protein_folding/helixfold (accessed on 26 October 2025) |
| FastFold | Speed-optimised AlphaFold implementation | GPU-parallelisation for rapid inference | ByteDance | https://github.com/hpcaitech/FastFold (accessed on 1 October 2025) |
| Protein and Enzyme Design | ||||
| NeuroFold | AI-guided enzyme design platform | Generates and screens enzyme variants with improved catalytic traits | Neurosnap | https://neurosnap.ai/service/NeuroFold (accessed on 1 October 2025) |
| ProGen2 (v2.0) | Transformer-based language model for protein generation | De novo protein sequence generation preserving function | Salesforce AI Research | https://github.com/salesforce/progen (accessed on 1 October 2025) |
| ESM-2 (v1.0.0) | Large-scale protein language model | Sequence embeddings, mutation effect, and function prediction | Meta AI (FAIR) | https://github.com/facebookresearch/esm (accessed on 1 October 2025) |
| RFdiffusion | Diffusion-based generative model for protein backbones | Enables creation of novel folds and catalytic sites | Baker Lab/UW | https://github.com/RosettaCommons/RFdiffusion (accessed on 1 October 2025) |
| ProteinMPNN (v1.0.0) | Sequence design conditioned on backbone structures | Rapid backbone-to-sequence mapping for design tasks | Baker Lab/UW | https://github.com/dauparas/ProteinMPNN (accessed on 1 October 2025) |
| Chroma | Controlled generative framework for structural design | Conditional structure generation with user-specified features | Generate:Biomedicines | https://generatebiomedicines.com/chroma (accessed on 26 October 2025) |
| Catalytic Site, Substrate Specificity and Metal-Binding Prediction | ||||
| MAHOMES II | RF-based site model | Predicts catalytic metal ions in enzymes | Robinson Lab | https://mahomes.ku.edu/help (accessed on 1 October 2025) |
| AdenylPred | RF-based classifier | Predicts functional and substrate classes of adenylate-forming enzymes | Robinson Lab | https://github.com/serina-robinson/adenylpred (accessed on 1 October 2025) |
| innov’SAR | PLSR framework | Predicts turnover, stability, and stereoselectivity | PEACCEL (The AI company for Life Science) | https://www.peaccel.com/technology/innovsar-artificial-intelligence-platform/ (accessed on 1 October 2025) |
| gcWGAN | Deep generative model | 3D active-site and sequence generation | Shen Lab | https://github.com/Shen-Lab/gcWGAN (accessed on 1 October 2025) |
| Molecular Docking | ||||
| DiffDock/DiffDock-L | Diffusion-based molecular docking model | Flexible ligand–protein docking; 3D pose generation | Baker Lab/UW | https://github.com/gcorso/DiffDock (accessed on 1 October 2025) |
| GNINA | Convolutional neural network (CNN)-based docking framework | ML-enhanced scoring and ligand ranking | GNINA Team | https://github.com/gnina/gnina or https://proteiniq.io/app/gnina (accessed on 26 October 2025) |
| PocketFlow | AI-based pocket prediction and docking | Predicts binding pockets and supports flexible docking | Tencent AI Lab | https://proteiniq.io/app/pocketflow or https://github.com/Saoge123/PocketFlow (accessed on 26 October 2025) |
| AutoDock Vina | Classical open-source docking program | Widely used for small-molecule and enzyme–ligand interactions | Scripps Research | http://vina.scripps.edu (accessed on 26 October 2025) |
| In silico Mutagenesis | ||||
| DeepMutScan | Predicts the functional impact of mutations | High-throughput mutational scanning via deep learning | Gamazon Lab | https://github.com/gamazonlab/DeepMutScan (accessed on 26 October 2025) |
| MuPIPR | Predicts protein–protein interaction changes upon mutation | Estimates ΔΔG and interface disruption | Zhou Lab | https://github.com/guangyu-zhou/MuPIPR (accessed on 26 October 2025) |
| ESM-IF1 (v2.0.1) | Inverse folding model (structure → sequence) | Recovers sequences from 3D backbones | Meta AI | https://neurosnap.ai/service/ESM-IF1 (accessed on 1 October 2025) |
| DynaMut2 (v2.0) | Predicts mutation effects on stability and dynamics | Visualises conformational and flexibility changes | Biosig Lab | https://biosig.lab.uq.edu.au/dynamut/ (accessed on 26 October 2025) |
| Sequence and Structure Analysis | ||||
| ProtNLM/ProtBERT/ESM-1b | Protein language models for representation learning | Embeddings for annotation, classification, and alignment | ProtNLM by Google Research/EBI; ProtBERT by Brandes et al.; ESM-1b by Meta AI/Hugging Face | https://310.ai/docs/function/protnlm (accessed on 1 October 2025); https://huggingface.co/Rostlab/prot_bert (accessed on 26 October 2025); https://huggingface.co/facebook/esm1b_t33_650M_UR50S (accessed on 26 October 2025) |
| MMseqs2 | Protein sequence clustering and alignment | Scalable similarity search and redundancy reduction | MPI for Developmental Biology | https://toolkit.tuebingen.mpg.de/tools/mmseqs2 (accessed on 1 October 2025) |
| Foldseek | Structure-based search engine | Fast comparison of protein 3D structures | MPI for Biology | https://search.foldseek.com/search (accessed on 1 October 2025) |
| MAFFT (v7.526) | Multiple sequence alignment algorithm | High-speed, accurate alignment for large datasets | Osaka University | https://mafft.cbrc.jp/alignment/software/ (accessed on 1 October 2025) |
| HMMER (v3.4) | Hidden Markov Model-based alignment and domain search | Detects conserved motifs and functional domains | HHMI/Sean Eddy | http://hmmer.org (accessed on 1 October 2025) |
| Enzymatic Classification and Functional Prediction | ||||
| DeepEC (v1.0) | CNN-based enzymatic classifier | Predicts complete EC numbers from sequence | KAIST Systems Biology | https://bitbucket.org/kaistsystemsbiology/deepec (accessed on 1 October 2025) |
| ECPred (v1.1) | Ensemble (SVMs, kNN) | Predicts full/partial EC numbers | Alperen Dalkıran | https://ecpred.kansil.org/ (accessed on 1 October 2025) |
| Expression and Safety Tools | ||||
| NetSolP (v1.0) | Predicts protein solubility using deep learning | AI-based solubility and expression classifier | DTU Health Tech | https://services.healthtech.dtu.dk/service.php?NetSolP-1.0 (accessed on 1 October 2025) |
| SoDoPE | Predicts heterologous expression efficiency | Assists codon optimisation and solubility enhancement | SoDoPE Team | https://tisigner.com/sodope (accessed on 1 October 2025) |
| ToxinPred2 (v2.0)/ADMET-AI (v1.4.0)/eTox (v0.97) | Predicts toxicity, allergenicity, and pharmacokinetics | Comprehensive in silico safety and ADMET profiling | Various Academic Teams | https://webs.iiitd.edu.in/raghava/toxinpred2/ (accessed on 1 October 2025) |
| Antibody and Binder Design | ||||
| NeuroBind | AI-based design of antibodies, peptides, and nanobodies | Generates high-affinity binders and optimises stability | Neurosnap Inc. | https://neurosnap.ai/service/NeuroBind (accessed on 1 October 2025) |
| ABlooper | Deep-learning tool for antibody CDR loop modelling | Rapid and accurate loop conformation prediction | Oxford Protein Informatics Group | https://github.com/oxpig/ABlooper (accessed on 1 October 2025) |
| IgFold (v0.4.0) | Transformer model for antibody structure prediction | Sequence-to-structure mapping for immunoglobulins | Johns Hopkins University | https://github.com/Graylab/IgFold (accessed on 26 October 2025) |
| NanoNet | Lightweight nanobody structure predictor | Fast prediction of VHH domains and single-chain binders | Dina Lab | https://github.com/dina-lab3D/NanoNet (accessed on 26 October 2025) |
| Feature | AlphaFold2 (v2.0.0) | RoseTTAFold (v1.0) | ProGen (v1.0) | ESM-2 (v1.0.0) |
|---|---|---|---|---|
| Primary purpose | High-accuracy 3D structure prediction | Fast protein/complex structure prediction | Generative protein sequence design | Sequence embeddings and functional prediction |
| Input | AA sequence; deep MSA; optional templates | AA sequence; shallow MSA | Sequence context; optional functional/family tags | AA sequence only |
| Output | Atomic 3D structure; pLDDT; PAE | 3D structure; confidence metrics | Novel sequences; residue probability profiles | Sequence embeddings; functional scores |
| Approach | Attention-based DL (EvoFormer); MSA-driven | 3-track network integrating sequence–distance–coordinates | Autoregressive protein language model | Transformer protein language model |
| Accuracy | Very high for proteins with homologs; moderate for orphan proteins | Good; slightly below AF2; more robust with shallow MSAs | Variable; sequence plausibility high, functional accuracy depends on filtering | High for function/mutation prediction; not for 3D structure |
| Usability | Moderate (requires GPUs and MSA tools); many community pipelines | High; faster and easier to run; less computationally heavy | High; simple interface; no structural prediction required | High; embeddings easy to compute; scalable to large datasets |
| Runtime | Hours per protein (depending on MSA depth) | Minutes to hours; typically faster than AF2 | Seconds–minutes for sequence generation | Seconds for embeddings; minutes with downstream predictors |
| Availability | Open-source (DeepMind) | Open-source (Baker lab) | Available via API (Meta/Anthropic; semi-open) | Open-source model weights (Meta) |
| Use in enzyme engineering | Widely adopted for fold validation, active-site inspection, and variant analysis | Increasingly used for cross-validation and modelling complexes | Used in published de novo enzyme design and function-conditioned libraries | Frequently used for mutational scanning, functional prediction, and variant prioritisation |
| Strengths | Near-experimental accuracy; robust confidence metrics | Fast; models complexes; lower MSA burden | Explores sequence space beyond natural diversity | Effective without MSAs; good for orphan proteins; strong functional predictions |
| Limitations | MSA-dependent; static structures; limited dynamics/ligand modelling | Lower accuracy than AF; static structures | No structure prediction; generated sequences may not fold/function | Does not output structures; requires downstream modelling |
| Best use cases | Structure-guided mutagenesis; variant stability screening | Complex modelling; alternative structural predictions | De novo enzyme design; generating diverse sequence libraries | Variant ranking; function prediction; prioritising mutations |
| Handling of distant homologs | Weak when MSA sparse | Slightly more robust than AF2 but still MSA-dependent | Can generate sequences without homologs | Excels at orphan proteins; MSA-independent |
| Interpretability | Medium; confidence metrics aid interpretation | Medium | Low | Low (requires downstream interpretation) |
| Integration with experimental workflow | Strong in structure-based design and active-site mapping | Useful for folding validation and complex modelling | Provides candidates for testing and directed evolution | Guides mutational experiments; prioritises variants |
| References | [32,34,69,70] | [70,71] | [68,72] | [35] |
| Enzyme/System | Source | Industrial Purpose | AI/Engineering Strategy | Mutation(s) | Key Performance Improvement | Reference |
|---|---|---|---|---|---|---|
| Pharmaceuticals and Healthcare | ||||||
| R-selective transaminase | Arthrobacter sp. | Sitagliptin synthesis | Directed evolution | Multiple rounds (not disclosed) | 99.95% enantiopurity | [130,131] |
| Monoamine oxidase | Aspergillus niger | Chiral amines for boceprevir | Directed evolution + computational optimisation | Not specified | 150% ↑ yield; 40% ↓ water use | [87] |
| Ketoreductases (KREDs) | Lactobacillus kefir, Sporobolomyces salmonicolor | Chiral intermediates (montelukast) | Rational mutagenesis | Not specified | >99.55% enantioselectivity | [159,160] |
| Transaminases and Monoamine Oxidases | Various microbial sources | Chiral intermediates (imagabalin, etc.) | Computation + directed evolution | Not specified | >97% R-enantiomer yield | [161,162] |
| Tissue Plasminogen Activator (tPA)/Te-necteplase (TNKase®) | Humans | Thrombolytic therapy | Rational engineering | Multiple substitutions | ↑ half-life; ↑ inhibitor resistance | [154] |
| Nucleoside Kinase (HSV-TK/CH1) | Herpes simplex type 1 | Prodrug activation | Structure-guided design | Fusion construct | Higher cancer-cell specificity | [154,155] |
| Fructosyl Peptide Oxidase (FPOX) | Eupenicillium terrenum | Diabetes biosensing | AI-guided design | Y261W | ↑ specificity; reduced cross-reactivity | [157] |
| Food and Agriculture | ||||||
| Endo-1,4-β-Xylanase | Trichoderma reesei | Thermostable xylanase | Error-prone PCR | T2C, T28C | Stability ↑ from 40 s to 20 min at 65 °C | [163] |
| GH11 Xylanase | Neocallimastix patriciarum | Heat-tolerant hydrolysis | ML (B-factor analysis) | Q87R, N88G, S89H, S90T | 50–60% residual activity at 60–65 °C | [164] |
| α-Amylase | Thermus thermophilus, Bacillus sp. TS-23 | High-temp starch hydrolysis | Directed evolution + site mutagenesis | Multiple | Improved high-temperature stability | [165,166] |
| Mannanase–Xylanase Chimera | Recombinant system (SpyTag/SpyCatcher) | Biomass breakdown | Isopeptide cyclisation | SpyTag/SpyCatcher fusion | ↑ stability; ↑ metal tolerance | [139] |
| Pullulanase | Bacillus deramificans | Syrup production | Site-directed mutagenesis | D437H, D503Y | ↑ thermostability | [167] |
| L-Aspartate α-Decarboxylase/D-Hydantoinase | Bacillus stearothermophilus | Amino acid/additive synthesis | Error-prone PCR + modelling | Multiple | 2.45–200× ↑ catalytic efficiency | [168,169] |
| Xylanase (xylose-tolerant mutants) | Bacillus spp. | Starch/biofuel processes | ML-guided random mutagenesis | L133V, M116I, L131P | 3–3.5× ↑ catalytic efficiency | [170] |
| Laundry and Detergent Industry | ||||||
| Lipase (eP-231-51 mutant) | Bacillus licheniformis | Thermostable detergent lipase | Directed evolution | D72G, D61G, Y129H, T101P | 13.5× ↑ thermostability; 30% ↑ activity | [171] |
| Lipase (bsl_the3 mutant) | Bacillus subtilis | Alkaline-stable lipase | Rational design | F41K, W42E, P119E, Q121K, V149K, Q150E | Active at pH 9; ↑ surfactant tolerance | [112] |
| Alkaline Serine Protease | Bacillus pumilus BA06 | Cold-active protease | Directed evolution + site mutagenesis | P9S/K27Q; P9S/T162I | 5× ↑ activity at 15 °C | [172] |
| Subtilisin Protease | Bacillus gibsonii | High-temperature protease | Charge engineering | N253D, Q256E | ↑ thermal resistance; shifted pH optimum | [173] |
| Biofuels | ||||||
| Xylanase | Penicillium janthinellum MA21601 | Biomass hydrolysis | Site mutagenesis | Disulfide bridge insertion | Optimum temp 50–70 °C; 4.76× ↑ activity | [77] |
| Xylanase (xylose-tolerant) | Bacillus spp. | Lignocellulose conversion | ML-guided evolution | M116I, L131P, L133V | 3× activity; 3.5× ↑ Ki | [170] |
| Lipase | Geobacillus stearothermophilus T6 | Biodiesel transesterification | Consensus design + mutagenesis | H86Y, A269T; Q185L | 66× ↑ stability; +4.3 °C Tm | [174] |
| Lipase | Streptomyces sp. W007 | Triacylglyceride conversion | Site-directed mutagenesis | F153A, H108A, V233A | ↑ activity and thermostability | [175] |
| 6-Phosphogluconate Dehydrogenase | Thermotoga maritima | Enzymatic fuel cells | Directed evolution | Not disclosed | 42× ↑ activity at pH 5.4 | [176] |
| Environmental Bioremediation | ||||||
| Cyanide Dihydratase (CynD) | Bacillus pumilus | Cyanide degradation | ML-assisted evolution | E35K, E327G, R322, etc. | ↑ alkali tolerance; ↑ thermostability | [102,103] |
| Ginger Peroxidase (GP) | Zingiber officinale | Dye degradation | ML-guided immobilisation | Immobilisation, no sequence change | 3× ↑ Vmax; ↑ stability | [177] |
| Cytochrome P450 (CYP101) | Pseudomonas putida | PAH degradation | AI-guided mutagenesis | Y96A; F87A–Y96F | 31% ↑ NADH turnover | [178] |
| Cytochrome P450 (CYP5136A3) | Phanerochaete chrysosporium | PAH oxidation | Structure-guided mutagenesis | L324F; W129F/L324F | 23–187% ↑ oxidation | [179] |
| Phosphotriesterase-like Lactonase (SsoPox-αsD6) | Sulfolobus solfataricus | OP pesticide degradation | Computational design + mutagenesis | V27A, Y97W, L228M, W263M | Tm = 82.5 °C; ↑ OP detoxification | [149,150] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Khan, M.F.; Khan, M.T. AI-Driven Enzyme Engineering: Emerging Models and Next-Generation Biotechnological Applications. Molecules 2026, 31, 45. https://doi.org/10.3390/molecules31010045
Khan MF, Khan MT. AI-Driven Enzyme Engineering: Emerging Models and Next-Generation Biotechnological Applications. Molecules. 2026; 31(1):45. https://doi.org/10.3390/molecules31010045
Chicago/Turabian StyleKhan, Mohd Faheem, and Mohd Tasleem Khan. 2026. "AI-Driven Enzyme Engineering: Emerging Models and Next-Generation Biotechnological Applications" Molecules 31, no. 1: 45. https://doi.org/10.3390/molecules31010045
APA StyleKhan, M. F., & Khan, M. T. (2026). AI-Driven Enzyme Engineering: Emerging Models and Next-Generation Biotechnological Applications. Molecules, 31(1), 45. https://doi.org/10.3390/molecules31010045

