Advances in AI-Guided CRISPR-Cas9 Engineering Strategies for Microbial Biotechnology
Abstract
1. Introduction
2. CRISPR-Cas gRNAs: Opportunities for AI Optimisation
2.1. gRNA: On- and Off-Target Events
2.2. sgRNA Scaffold
2.3. RNA Secondary Structure
2.4. PAM Sequence
2.5. Cas9 mRNA with Shine-Dalgarno Sequence
3. Data-Driven Training for Microbial Systems and AI Tools for sgRNA Design
3.1. Data Collection and Preprocessing
3.2. Data Labelling and Objective Definition
3.3. Data Diversity and Representativeness
3.4. Balancing Quantity and Quality
3.5. Model Selection and Training Data
3.6. Microbial Datasets and AI Tools-Associated
4. Applied Examples of CRISPR-Cas9–AI in Microbial Biotechnology
5. Challenges and Future Perspectives
5.1. FAIR Principles for CRISPR-Cas Data
5.2. CRISPR-Cas Databases for Precision Engineering
5.2.1. Dataset Standardisation
5.2.2. Creation of Databases for CRISPR-Cas Technology
5.3. Generative AI for CRISPR-Cas Technology
5.4. Ethical Concerns with CRISPR-Cas-Based GPT Models
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| 2D | Secondary |
| AI | Artificial intelligence |
| AMR | Antimicrobial resistance |
| aSD | Anti Shine-Dalgarno |
| BiLSTM | Bidirectional long short-term memory |
| BioGRID ORCS | Biological General Repository for Interaction Datasets Open Repository of CRISPR Screens |
| Cas | CRISPR-associated protein |
| CNN | Convolutional neural network |
| CRISPR | Clustered Regularly Interspaced Short Palindromic Repeats |
| DB | Database |
| DBN | Deep Belief Network |
| DL | Deep learning |
| DNA | Deoxyribonucleic acid |
| DSB | Double-strand break |
| EGA | Elitist Genetic Algorithm |
| FAIR | Findability, Accessibility, Interoperability, and Reusability |
| GAT | Graph attention network |
| GBRT | Gradient boosting regression tree |
| GNN | Graph neural network |
| GPT | Generative pre-trained transformer |
| gRNA | Guide RNA |
| HTS | High-throughput screening |
| Indels | Insertions and deletions |
| iRF | iterative Random Forest |
| LLM | Large language model |
| MFE | Minimum free energy |
| MIACS | Minimal Information About CRISPR Screens |
| ML | Machine learning |
| mRNA | Messenger RNA |
| nt | Nucleotides |
| PAM | Protospacer adjacent motif |
| RBM | Restricted Boltzmann machine |
| RBS | Ribosome-binding site |
| RNA | Ribonucleic acid |
| RNN | Restricted Boltzmann machine |
| SD | Shine-Dalgarno |
| sgRNA | Single guide ribonucleic acid |
| SpCas9 | Streptococcus pyogenes Cas9 |
| SVM | Support Vector Machine |
| XAI | Explainable AI |
References
- Li, X.; Liu, Y.; Ma, L.; Jiang, W.; Shi, T.; Li, L.; Li, C.; Chen, Z.; Fan, X.; Xu, Q. Metabolic Engineering of Escherichia coli for High-Yield Dopamine Production via Optimized Fermentation Strategies. Appl. Environ. Microbiol. 2025, 91, e00159-25. [Google Scholar] [CrossRef] [PubMed]
- Ye, C.; Zhang, Y.; Zhang, J.; Shi, M.; Nie, F.; Liu, Q. Metabolic Engineering of Escherichia coli BW25113 for the Production of Vitamin K2 Based on CRISPR/Cas9 Mediated Gene Knockout and Metabolic Pathway Modification. J. Biol. Eng. 2026, 20, 29. [Google Scholar] [CrossRef] [PubMed]
- Han, S.; Jang, H.W.; Park, S.; Kim, T.M.; Kim, H.J. Unlocking the Flavor Potential of Brewing Yeast with CRISPR/Cas9 Genome Editing. LWT 2025, 230, 118254. [Google Scholar] [CrossRef]
- Lee, H.-J.; Shin, D.J.; Nho, S.B.; Lee, K.W.; Kim, S.-K. Metabolic Engineering of Saccharomyces cerevisiae for Fermentative Production of Heme. Biotechnol. J. 2024, 19, e202400351. [Google Scholar] [CrossRef] [PubMed]
- Bush, K.; Corsi, G.I.; Yan, A.C.; Haynes, K.; Layzer, J.M.; Zhou, J.H.; Llanga, T.; Gorodkin, J.; Sullenger, B.A. Utilizing Directed Evolution to Interrogate and Optimize CRISPR/Cas Guide RNA Scaffolds. Cell Chem. Biol. 2023, 30, 879–892.e5. [Google Scholar] [CrossRef] [PubMed]
- Li, C.; Zou, Q.; Li, J.; Feng, H. Prediction of CRISPR-Cas9 on-Target Activity Based on a Hybrid Neural Network. Comput. Struct. Biotechnol. J. 2025, 27, 2098–2106. [Google Scholar] [CrossRef] [PubMed]
- Riesenberg, S.; Helmbrecht, N.; Kanis, P.; Maricic, T.; Pääbo, S. Improved gRNA Secondary Structures Allow Editing of Target Sites Resistant to CRISPR-Cas9 Cleavage. Nat. Commun. 2022, 13, 489. [Google Scholar] [CrossRef] [PubMed]
- Reynolds, S.A.; Beery, S.; Burgess, N.; Burgman, M.; Butchart, S.H.M.; Cooke, S.J.; Coomes, D.; Danielsen, F.; Di Minin, E.; Durán, A.P.; et al. The Potential for AI to Revolutionize Conservation: A Horizon Scan. Trends Ecol. Evol. 2025, 40, 191–207. [Google Scholar] [CrossRef] [PubMed]
- Zhou, G.; Rusnac, D.-V.; Park, H.; Canzani, D.; Nguyen, H.M.; Stewart, L.; Bush, M.F.; Nguyen, P.T.; Wulff, H.; Yarov-Yarovoy, V.; et al. An Artificial Intelligence Accelerated Virtual Screening Platform for Drug Discovery. Nat. Commun. 2024, 15, 7761. [Google Scholar] [CrossRef] [PubMed]
- Yang, X.; Jia, J.; Zhou, X.; Wang, S. The Future of Artificial Intelligence: Time to Embrace More International Collaboration. Innovation 2024, 5, 100703. [Google Scholar] [CrossRef] [PubMed]
- Xiang, X.; Corsi, G.I.; Anthon, C.; Qu, K.; Pan, X.; Liang, X.; Han, P.; Dong, Z.; Liu, L.; Zhong, J.; et al. Enhancing CRISPR-Cas9 gRNA Efficiency Prediction by Data Integration and Deep Learning. Nat. Commun. 2021, 12, 3238. [Google Scholar] [CrossRef] [PubMed]
- Guha, D.; Avtaran, D.; Lenka, R.; Yang, T.; Wang, L.; Rathore, R.S. Leveraging a Smart AI-Controlled GRNA in Genome Editing for Identification and Replacement of Genetic Mutations. In Proceedings of Fourth International Conference on Computing and Communication Networks; Kumar, A., Swaroop, A., Shukla, P., Eds.; Springer Nature: Singapore, 2025; pp. 649–656. [Google Scholar]
- Wan, S.; Liu, X.; Sun, W.; Lv, B.; Li, C. Current Advances for Omics-Guided Process Optimization of Microbial Manufacturing. Bioresour. Bioprocess. 2023, 10, 30. [Google Scholar] [CrossRef] [PubMed]
- Abbate, E.; Andrion, J.; Apel, A.; Biggs, M.; Chaves, J.; Cheung, K.; Ciesla, A.; Clark-ElSayed, A.; Clay, M.; Contridas, R.; et al. Optimizing the Strain Engineering Process for Industrial-Scale Production of Bio-Based Molecules. J. Ind. Microbiol. Biotechnol. 2023, 50, kuad025. [Google Scholar] [CrossRef] [PubMed]
- Sun, X.; Zhang, H.; Jia, Y.; Li, J.; Jia, M. CRISPR-Cas9-Based Genome-Editing Technologies in Engineering Bacteria for the Production of Plant-Derived Terpenoids. Eng. Microbiol. 2024, 4, 100154. [Google Scholar] [CrossRef] [PubMed]
- Sadanov, A.K.; Baimakhanova, B.B.; Orasymbet, S.E.; Ratnikova, I.A.; Turlybaeva, Z.Z.; Baimakhanova, G.B.; Amitova, A.A.; Omirbekova, A.A.; Aitkaliyeva, G.S.; Kossalbayev, B.D.; et al. Engineering Useful Microbial Species for Pharmaceutical Applications. Microorganisms 2025, 13, 599. [Google Scholar] [CrossRef] [PubMed]
- Wu, Z.; Chen, T.; Sun, W.; Chen, Y.; Ying, H. Optimizing Escherichia coli Strains and Fermentation Processes for Enhanced L-Lysine Production: A Review. Front. Microbiol. 2024, 15, 1485624. [Google Scholar] [CrossRef] [PubMed]
- Gao, H.; Qiu, Z.; Wang, X.; Zhang, X.; Zhang, Y.; Dai, J.; Liang, Z. Recent Advances in Genome-Scale Engineering in Escherichia coli and Their Applications. Eng. Microbiol. 2024, 4, 100115. [Google Scholar] [CrossRef] [PubMed]
- Ahmed, M.; Kayode, H.; Okesanya, O.; Ukoaka, B.; Eshun, G.; Mourid, M.; Adigun, O.; Ogaya, J.; Mohamed, Z.; Lucero-Prisno, D. CRISPR-Cas Systems in the Fight Against Antimicrobial Resistance: Current Status, Potentials, and Future Directions. Infect. Drug Resist. 2024, 17, 5229–5245. [Google Scholar] [CrossRef] [PubMed]
- Okesanya, O.J.; Ahmed, M.M.; Ogaya, J.B.; Amisu, B.O.; Ukoaka, B.M.; Adigun, O.A.; Manirambona, E.; Adebusuyi, O.; Othman, Z.K.; Oluwakemi, O.G.; et al. Reinvigorating AMR Resilience: Leveraging CRISPR–Cas Technology Potentials to Combat the 2024 WHO Bacterial Priority Pathogens for Enhanced Global Health Security—A Systematic Review. Trop. Med. Health 2025, 53, 43. [Google Scholar] [CrossRef] [PubMed]
- Wang, L.; Zhang, J. Prediction of sgRNA On-Target Activity in Bacteria by Deep Learning. BMC Bioinform. 2019, 20, 517. [Google Scholar] [CrossRef] [PubMed]
- Guo, J.; Wang, T.; Guan, C.; Liu, B.; Luo, C.; Xie, Z.; Zhang, C.; Xing, X.-H. Improved sgRNA Design in Bacteria via Genome-Wide Activity Profiling. Nucleic Acids Res. 2018, 46, 7052–7069. [Google Scholar] [CrossRef] [PubMed]
- Ham, D.T.; Browne, T.S.; Banglorewala, P.N.; Wilson, T.L.; Michael, R.K.; Gloor, G.B.; Edgell, D.R. A Generalizable Cas9/sgRNA Prediction Model Using Machine Transfer Learning with Small High-Quality Datasets. Nat. Commun. 2023, 14, 5514. [Google Scholar] [CrossRef] [PubMed]
- Wu, X.; Scott, D.A.; Kriz, A.J.; Chiu, A.C.; Hsu, P.D.; Dadon, D.B.; Cheng, A.W.; Trevino, A.E.; Konermann, S.; Chen, S.; et al. Genome-Wide Binding of the CRISPR Endonuclease Cas9 in Mammalian Cells. Nat. Biotechnol. 2014, 32, 670–676. [Google Scholar] [CrossRef] [PubMed]
- Xu, H.; Xiao, T.; Chen, C.H.; Li, W.; Meyer, C.A.; Wu, Q.; Wu, D.; Cong, L.; Zhang, F.; Liu, J.S.; et al. Sequence Determinants of Improved CRISPR sgRNA Design. Genome Res. 2015, 25, 1147–1157. [Google Scholar] [CrossRef] [PubMed]
- Moreno-Mateos, M.A.; Vejnar, C.E.; Beaudoin, J.-D.; Fernandez, J.P.; Mis, E.K.; Khokha, M.K.; Giraldez, A.J. CRISPRscan: Designing Highly Efficient sgRNAs for CRISPR-Cas9 Targeting In Vivo. Nat. Methods 2015, 12, 982–988. [Google Scholar] [CrossRef] [PubMed]
- Doench, J.G.; Hartenian, E.; Graham, D.B.; Tothova, Z.; Hegde, M.; Smith, I.; Sullender, M.; Ebert, B.L.; Xavier, R.J.; Root, D.E. Rational Design of Highly Active sgRNAs for CRISPR-Cas9–Mediated Gene Inactivation. Nat. Biotechnol. 2014, 32, 1262–1267. [Google Scholar] [CrossRef] [PubMed]
- Labun, K.; Montague, T.G.; Krause, M.; Torres Cleuren, Y.N.; Tjeldnes, H.; Valen, E. CHOPCHOP v3: Expanding the CRISPR Web Toolbox Beyond Genome Editing. Nucleic Acids Res. 2019, 47, W171–W174. [Google Scholar] [CrossRef] [PubMed]
- Wang, T.; Wei, J.J.; Sabatini, D.M.; Lander, E.S. Genetic Screens in Human Cells Using the CRISPR-Cas9 System. Science 2014, 343, 80–84. [Google Scholar] [CrossRef] [PubMed]
- Corsi, G.I.; Qu, K.; Alkan, F.; Pan, X.; Luo, Y.; Gorodkin, J. CRISPR/Cas9 gRNA Activity Depends on Free Energy Changes and on the Target PAM Context. Nat. Commun. 2022, 13, 3006. [Google Scholar] [CrossRef] [PubMed]
- Noshay, J.M.; Walker, T.; Alexander, W.G.; Klingeman, D.M.; Romero, J.; Walker, A.M.; Prates, E.; Eckert, C.; Irle, S.; Kainer, D.; et al. Quantum Biological Insights into CRISPR-Cas9 sgRNA Efficiency from Explainable-AI Driven Feature Engineering. Nucleic Acids Res. 2023, 51, 10147–10161. [Google Scholar] [CrossRef] [PubMed]
- Wang, D.; Zhang, C.; Wang, B.; Li, B.; Wang, Q.; Liu, D.; Wang, H.; Zhou, Y.; Shi, L.; Lan, F.; et al. Optimized CRISPR Guide RNA Design for Two High-Fidelity Cas9 Variants by Deep Learning. Nat. Commun. 2019, 10, 4284. [Google Scholar] [CrossRef] [PubMed]
- Xue, L.; Tang, B.; Chen, W.; Luo, J. Prediction of CRISPR SgRNA Activity Using a Deep Convolutional Neural Network. J. Chem. Inf. Model. 2019, 59, 615–624. [Google Scholar] [CrossRef] [PubMed]
- Jin, L.; Liyanage, R.; Duan, D.; Chen, S.-J. Machine-Learning-Inferred and Energy-Landscape-Guided Analyses Reveal Kinetic Determinants of CRISPR/Cas9 Gene Editing. PRX Life 2026, 4, 013028. [Google Scholar] [CrossRef]
- Moreb, E.A.; Lynch, M.D. A Meta-Analysis of gRNA Library Screens Enables an Improved Understanding of the Impact of GRNA Folding and Structural Stability on CRISPR-Cas9 Activity. CRISPR J. 2022, 5, 146–154. [Google Scholar] [CrossRef] [PubMed]
- Peng, H.; Zheng, Y.; Blumenstein, M.; Tao, D.; Li, J. CRISPR/Cas9 Cleavage Efficiency Regression through Boosting Algorithms and Markov Sequence Profiling. Bioinformatics 2018, 34, 3069–3077. [Google Scholar] [CrossRef] [PubMed]
- Kim, H.K.; Kim, Y.; Lee, S.; Min, S.; Bae, J.Y.; Choi, J.W.; Park, J.; Jung, D.; Yoon, S.; Kim, H.H. SpCas9 Activity Prediction by DeepSpCas9, a Deep Learning–Based Model with High Generalization Performance. Sci. Adv. 2019, 5, eaax9249. [Google Scholar] [CrossRef] [PubMed]
- Zhu, W.; Xie, H.; Chen, Y.; Zhang, G. CrnnCrispr: An Interpretable Deep Learning Method for CRISPR/Cas9 sgRNA On-Target Activity Prediction. Int. J. Mol. Sci. 2024, 25, 4429. [Google Scholar] [CrossRef] [PubMed]
- Zhang, G.; Zeng, T.; Dai, Z.; Dai, X. Prediction of CRISPR/Cas9 Single Guide RNA Cleavage Efficiency and Specificity by Attention-Based Convolutional Neural Networks. Comput. Struct. Biotechnol. J. 2021, 19, 1445–1457. [Google Scholar] [CrossRef] [PubMed]
- Dimauro, G.; Colagrande, P.; Carlucci, R.; Ventura, M.; Bevilacqua, V.; Caivano, D. CRISPRLearner: A Deep Learning-Based System to Predict CRISPR/Cas9 SgRNA On-Target Cleavage Efficiency. Electronics 2019, 8, 1478. [Google Scholar] [CrossRef]
- Chuai, G.; Ma, H.; Yan, J.; Chen, M.; Hong, N.; Xue, D.; Zhou, C.; Zhu, C.; Chen, K.; Duan, B.; et al. DeepCRISPR: Optimized CRISPR Guide RNA Design by Deep Learning. Genome Biol. 2018, 19, 80. [Google Scholar] [CrossRef] [PubMed]
- Konstantakos, V.; Nentidis, A.; Krithara, A.; Paliouras, G. CRISPR–Cas9 gRNA Efficiency Prediction: An Overview of Predictive Tools and the Role of Deep Learning. Nucleic Acids Res. 2022, 50, 3616–3637. [Google Scholar] [CrossRef] [PubMed]
- Doench, J.G.; Fusi, N.; Sullender, M.; Hegde, M.; Vaimberg, E.W.; Donovan, K.F.; Smith, I.; Tothova, Z.; Wilen, C.; Orchard, R.; et al. Optimized sgRNA Design to Maximize Activity and Minimize Off-Target Effects of CRISPR-Cas9. Nat. Biotechnol. 2016, 34, 184–191. [Google Scholar] [CrossRef] [PubMed]
- Haeussler, M.; Schönig, K.; Eckert, H.; Eschstruth, A.; Mianné, J.; Renaud, J.-B.; Schneider-Maunoury, S.; Shkumatava, A.; Teboul, L.; Kent, J.; et al. Evaluation of Off-Target and on-Target Scoring Algorithms and Integration into the Guide RNA Selection Tool CRISPOR. Genome Biol. 2016, 17, 148. [Google Scholar] [CrossRef] [PubMed]
- Labuhn, M.; Adams, F.F.; Ng, M.; Knoess, S.; Schambach, A.; Charpentier, E.M.; Schwarzer, A.; Mateo, J.L.; Klusmann, J.-H.; Heckl, D. Refined sgRNA Efficacy Prediction Improves Large- and Small-Scale CRISPR–Cas9 Applications. Nucleic Acids Res. 2018, 46, 1375–1385. [Google Scholar] [CrossRef] [PubMed]
- Jinek, M.; Chylinski, K.; Fonfara, I.; Hauer, M.; Doudna, J.A.; Charpentier, E. A Programmable Dual-RNA–Guided DNA Endonuclease in Adaptive Bacterial Immunity. Science 2012, 337, 816–821. [Google Scholar] [CrossRef] [PubMed]
- Dong, C.; Gou, Y.; Lian, J. SgRNA Engineering for Improved Genome Editing and Expanded Functional Assays. Curr. Opin. Biotechnol. 2022, 75, 102697. [Google Scholar] [CrossRef] [PubMed]
- Briner, A.E.; Donohoue, P.D.; Gomaa, A.A.; Selle, K.; Slorach, E.M.; Nye, C.H.; Haurwitz, R.E.; Beisel, C.L.; May, A.P.; Barrangou, R. Guide RNA Functional Modules Direct Cas9 Activity and Orthogonality. Mol. Cell 2014, 56, 333–339. [Google Scholar] [CrossRef] [PubMed]
- De Saeger, J. A Guide to Guides: An Overview of SpCas9 sgRNA Scaffold Variants and Modifications. SynBio 2025, 3, 19. [Google Scholar] [CrossRef]
- Wong, N.; Liu, W.; Wang, X. WU-CRISPR: Characteristics of Functional Guide RNAs for the CRISPR/Cas9 System. Genome Biol. 2015, 16, 218. [Google Scholar] [CrossRef] [PubMed]
- Kocak, D.D.; Josephs, E.A.; Bhandarkar, V.; Adkar, S.S.; Kwon, J.B.; Gersbach, C.A. Increasing the Specificity of CRISPR Systems with Engineered RNA Secondary Structures. Nat. Biotechnol. 2019, 37, 657–666. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Y.; Chen, G.; Liang, C.; Yang, B.; Lei, X.; Chen, T.; Jiang, H.; Xiong, W. MultiCRISPR-EGA: Optimizing Guide RNA Array Design for Multiplexed CRISPR Using the Elitist Genetic Algorithm. ACS Synth. Biol. 2025, 14, 919–930. [Google Scholar] [CrossRef] [PubMed]
- Jiang, Y.; Li, B.; Xiong, J.; Liu, X. Graph-CRISPR: A Gene Editing Efficiency Prediction Model Based on Graph Neural Network with Integrated Sequence and Secondary Structure Feature Extraction. Brief. Bioinform. 2025, 26, bbaf410. [Google Scholar] [CrossRef] [PubMed]
- Collias, D.; Beisel, C.L. CRISPR Technologies and the Search for the PAM-Free Nuclease. Nat. Commun. 2021, 12, 555. [Google Scholar] [CrossRef] [PubMed]
- Globyte, V.; Lee, S.H.; Bae, T.; Kim, J.; Joo, C. CRISPR/Cas9 Searches for a Protospacer Adjacent Motif by Lateral Diffusion. EMBO J. 2018, 38, EMBJ201899466. [Google Scholar] [CrossRef] [PubMed]
- Malbranke, C.; Rostain, W.; Depardieu, F.; Cocco, S.; Monasson, R.; Bikard, D. Computational Design of Novel Cas9 PAM-Interacting Domains Using Evolution-Based Modelling and Structural Quality Assessment. PLoS Comput. Biol. 2023, 19, e1011621. [Google Scholar] [CrossRef] [PubMed]
- Ruffolo, J.A.; Nayfach, S.; Gallagher, J.; Bhatnagar, A.; Beazer, J.; Hussain, R.; Russ, J.; Yip, J.; Hill, E.; Pacesa, M.; et al. Design of Highly Functional Genome Editors by Modelling CRISPR–Cas Sequences. Nature 2025, 645, 518–525. [Google Scholar] [CrossRef] [PubMed]
- Glass, Z.; Lee, M.; Li, Y.; Xu, Q. Engineering the Delivery System for CRISPR-Based Genome Editing. Trends Biotechnol. 2018, 36, 173–185. [Google Scholar] [CrossRef] [PubMed]
- Tudek, A.; Krawczyk, P.S.; Mroczek, S.; Tomecki, R.; Turtola, M.; Matylla-Kulińska, K.; Jensen, T.H.; Dziembowski, A. Global View on the Metabolism of RNA Poly(A) Tails in Yeast Saccharomyces cerevisiae. Nat. Commun. 2021, 12, 4951. [Google Scholar] [CrossRef] [PubMed]
- Wen, J.-D.; Kuo, S.-T.; Chou, H.-H.D. The Diversity of Shine-Dalgarno Sequences Sheds Light on the Evolution of Translation Initiation. RNA Biol. 2021, 18, 1489–1500. [Google Scholar] [CrossRef] [PubMed]
- Passmore, L.A.; Coller, J. Roles of mRNA Poly(A) Tails in Regulation of Eukaryotic Gene Expression. Nat. Rev. Mol. Cell Biol. 2022, 23, 93–106. [Google Scholar] [CrossRef] [PubMed]
- Poonia, P.; Valabhoju, V.; Li, T.; Iben, J.; Niu, X.; Lin, Z.; Hinnebusch, A.G. Yeast Poly(A)-Binding Protein (Pab1) Controls Translation Initiation in Vivo Primarily by Blocking mRNA Decapping and Decay. Nucleic Acids Res. 2025, 53, gkaf143. [Google Scholar] [CrossRef] [PubMed]
- Zhang, M.; Holowko, M.B.; Hayman Zumpe, H.; Ong, C.S. Machine Learning Guided Batched Design of a Bacterial Ribosome Binding Site. ACS Synth. Biol. 2022, 11, 2314–2326. [Google Scholar] [CrossRef] [PubMed]
- Goshisht, M.K. Machine Learning and Deep Learning in Synthetic Biology: Key Architectures, Applications, and Challenges. ACS Omega 2024, 9, 9921–9945. [Google Scholar] [CrossRef] [PubMed]
- Maharana, K.; Mondal, S.; Nemade, B. A Review: Data Pre-Processing and Data Augmentation Techniques. Glob. Transit. Proc. 2022, 3, 91–99. [Google Scholar] [CrossRef]
- Dixit, S.; Kumar, A.; Srinivasan, K.; Vincent, P.M.D.R.; Ramu Krishnan, N. Advancing Genome Editing with Artificial Intelligence: Opportunities, Challenges, and Future Directions. Front. Bioeng. Biotechnol. 2024, 11, 1335901. [Google Scholar] [CrossRef] [PubMed]
- Ortiz, B.L.; Gupta, V.; Kumar, R.; Jalin, A.; Cao, X.; Ziegenbein, C.; Singhal, A.; Tewari, M.; Choi, S.W. Data Preprocessing Techniques for AI and Machine Learning Readiness: Scoping Review of Wearable Sensor Data in Cancer Care. JMIR mHealth uHealth 2024, 12, e59587. [Google Scholar] [CrossRef] [PubMed]
- Dagal, I.; Harrison, A.; Ibrahim, A.-W.; Mbasso, W.F. Comprehensive Evaluation of Data Preprocessing and Visualization Techniques for Enhanced Classification and Sampling. Clust. Comput. 2025, 28, 476. [Google Scholar] [CrossRef]
- Yang, Y.; Li, J.; Zou, Q.; Ruan, Y.; Feng, H. Prediction of CRISPR-Cas9 off-Target Activities with Mismatches and Indels Based on Hybrid Neural Network. Comput. Struct. Biotechnol. J. 2023, 21, 5039–5048. [Google Scholar] [CrossRef] [PubMed]
- Cao, M.; Brennan, A.; Lee, C.M.; Park, S.; Bao, G. Deep Learning Based Models for CRISPR/Cas Off-Target Prediction. Small Methods 2025, 9, 2500122. [Google Scholar] [CrossRef] [PubMed]
- Abbasi, A.F.; Asim, M.N.; Dengel, A. Transitioning from Wet Lab to Artificial Intelligence: A Systematic Review of AI Predictors in CRISPR. J. Transl. Med. 2025, 23, 153. [Google Scholar] [CrossRef] [PubMed]
- Jasieniecka, A.; Domingues, I. CRISPR-Cas9 and Its Bioinformatics Tools: A Systematic Review. Curr. Issues Mol. Biol. 2025, 47, 307. [Google Scholar] [CrossRef] [PubMed]
- Tyagi, S.; Kumar, R.; Das, A.; Won, S.Y.; Shukla, P. CRISPR-Cas9 System: A Genome-Editing Tool with Endless Possibilities. J. Biotechnol. 2020, 319, 36–53. [Google Scholar] [CrossRef] [PubMed]
- Sun, Y.; Qu, K.; Corsi, G.I.; Anthon, C.; Pan, X.; Xiang, X.; Jensen, L.J.; Lin, L.; Luo, Y.; Gorodkin, J. Deep Learning Models Simultaneously Trained on Multiple Datasets Improve Base-Editing Activity Prediction. Nat. Commun. 2025, 16, 9821. [Google Scholar] [CrossRef] [PubMed]
- Bhat, A.A.; Nisar, S.; Mukherjee, S.; Saha, N.; Yarravarapu, N.; Lone, S.N.; Masoodi, T.; Chauhan, R.; Maacha, S.; Bagga, P.; et al. Integration of CRISPR/Cas9 with Artificial Intelligence for Improved Cancer Therapeutics. J. Transl. Med. 2022, 20, 534. [Google Scholar] [CrossRef] [PubMed]
- Agate, J. Artificial Intelligence Methods and Approaches to Improve Data Quality in Healthcare Data. Artif. Intell. Life Sci. 2025, 8, 100135. [Google Scholar] [CrossRef]
- Aussel, C.; Cathomen, T.; Fuster-García, C. The Hidden Risks of CRISPR/Cas: Structural Variations and Genome Integrity. Nat. Commun. 2025, 16, 7208. [Google Scholar] [CrossRef] [PubMed]
- Yan, Q.; Fong, S.S. Challenges and Advances for Genetic Engineering of Non-Model Bacteria and Uses in Consolidated Bioprocessing. Front. Microbiol. 2017, 8, 2060. [Google Scholar] [CrossRef] [PubMed]
- Call, S.N.; Andrews, L.B. CRISPR-Based Approaches for Gene Regulation in Non-Model Bacteria. Front. Genome Ed. 2022, 4, 892304. [Google Scholar] [CrossRef] [PubMed]
- Vercauteren, S.; Fiesack, S.; Maroc, L.; Verstraeten, N.; Dewachter, L.; Michiels, J.; Vonesch, S.C. The Rise and Future of CRISPR-Based Approaches for High-Throughput Genomics. FEMS Microbiol. Rev. 2024, 48, fuae020. [Google Scholar] [CrossRef] [PubMed]
- Moreb, E.A.; Hoover, B.; Yaseen, A.; Valyasevi, N.; Roecker, Z.; Menacho-Melgar, R.; Lynch, M.D. Managing the SOS Response for Enhanced CRISPR-Cas-Based Recombineering in E. coli through Transient Inhibition of Host RecA Activity. ACS Synth. Biol. 2017, 6, 2209–2218. [Google Scholar] [CrossRef] [PubMed]
- Ham, D.T.; Browne, T.S.; Zhang, C.Q.; Foo, G.W.; Uruthirapathy, A.S.; Gloor, G.B.; Edgell, D.R. Machine Learning Reveals Sequence and Methylation Determinants of SaCas9–PAM Interactions in Bacteria. Nucleic Acids Res. 2026, 54, gkaf1520. [Google Scholar] [CrossRef] [PubMed]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Xu, J.; Liu, C.; Tan, X.; Zhu, X.; Wu, A.; Wan, H.; Kong, W.; Li, C.; Xu, H.; Kuang, K.; et al. General Information Metrics for Improving AI Model Training Efficiency. Artif. Intell. Rev. 2025, 58, 289. [Google Scholar] [CrossRef]
- Kimata, K.; Satou, K. Improved CRISPR/Cas9 off-Target Prediction with DNABERT and Epigenetic Features. PLoS ONE 2025, 20, e0335863. [Google Scholar] [CrossRef] [PubMed]
- Trivedi, V.; Mohseni, A.; Lonardi, S.; Wheeldon, I. Balanced Training Sets Improve Deep Learning-Based Prediction of CRISPR sgRNA Activity. ACS Synth. Biol. 2024, 13, 3774–3781. [Google Scholar] [CrossRef] [PubMed]
- Yaish, O.; Orenstein, Y. Generating, Modeling and Evaluating a Large-Scale Set of CRISPR/Cas9 off-Target Sites with Bulges. Nucleic Acids Res. 2024, 52, 6777–6790. [Google Scholar] [CrossRef] [PubMed]
- Koike-Yusa, H.; Li, Y.; Tan, E.-P.; Velasco-Herrera, M.D.C.; Yusa, K. Genome-Wide Recessive Genetic Screening in Mammalian Cells with a Lentiviral CRISPR-guide RNA Library. Nat. Biotechnol. 2014, 32, 267–273. [Google Scholar] [CrossRef] [PubMed]
- Moreb, E.A.; Lynch, M.D. Genome Dependent Cas9/gRNA Search Time Underlies Sequence Dependent gRNA Activity. Nat. Commun. 2021, 12, 5034. [Google Scholar] [CrossRef] [PubMed]
- Trivedi, V.; Ramesh, A.; Wheeldon, I. Analyzing CRISPR Screens in Non-Conventional Microbes. J. Ind. Microbiol. Biotechnol. 2023, 50, kuad006. [Google Scholar] [CrossRef] [PubMed]
- Manghwar, H.; Li, B.; Ding, X.; Hussain, A.; Lindsey, K.; Zhang, X.; Jin, S. CRISPR/Cas Systems in Genome Editing: Methodologies and Tools for sgRNA Design, Off-Target Evaluation, and Strategies to Mitigate Off-Target Effects. Adv. Sci. 2020, 7, 1902312. [Google Scholar] [CrossRef] [PubMed]
- Moreb, E.A.; Hutmacher, M.; Lynch, M.D. CRISPR-Cas “Non-Target” Sites Inhibit On-Target Cutting Rates. CRISPR J. 2020, 3, 550–561. [Google Scholar] [CrossRef] [PubMed]
- Tafrishi, A.; Trivedi, V.; Xing, Z.; Li, M.; Mewalal, R.; Cutler, S.R.; Blaby, I.; Wheeldon, I. Functional Genomic Screening in Komagataella phaffii Enabled by High-Activity CRISPR-Cas9 Library. Metab. Eng. 2024, 85, 73–83. [Google Scholar] [CrossRef] [PubMed]
- Schwartz, C.; Cheng, J.-F.; Evans, R.; Schwartz, C.A.; Wagner, J.M.; Anglin, S.; Beitz, A.; Pan, W.; Lonardi, S.; Blenner, M.; et al. Validating Genome-Wide CRISPR-Cas9 Function Improves Screening in the Oleaginous Yeast Yarrowia lipolytica. Metab. Eng. 2019, 55, 102–110. [Google Scholar] [CrossRef] [PubMed]
- Baisya, D.; Ramesh, A.; Schwartz, C.; Lonardi, S.; Wheeldon, I. Genome-Wide Functional Screens Enable the Prediction of High Activity CRISPR-Cas9 and -Cas12a Guides in Yarrowia lipolytica. Nat. Commun. 2022, 13, 922. [Google Scholar] [CrossRef] [PubMed]
- Cho, S.; Shin, J.; Cho, B.-K. Applications of CRISPR/Cas System to Bacterial Metabolic Engineering. Int. J. Mol. Sci. 2018, 19, 1089. [Google Scholar] [CrossRef] [PubMed]
- Yang, J.; Song, J.; Feng, Z.; Ma, Y. Application of CRISPR-Cas9 in Microbial Cell Factories. Biotechnol. Lett. 2025, 47, 46. [Google Scholar] [CrossRef] [PubMed]
- Carruthers, D.N.; Kinnunen, P.C.; Li, Y.; Chen, Y.; Gin, J.W.; Yunus, I.S.; Galliard, W.R.; Tan, S.; Radivojevic, T.; Adams, P.D.; et al. Automation and Machine Learning Drive Rapid Optimization of Isoprenol Production in Pseudomonas putida. Nat. Commun. 2025, 16, 11489. [Google Scholar] [CrossRef] [PubMed]
- Tenkanen, T.; Ylinen, A.; Jouhten, P.; Penttilä, M.; Castillo, S. PHA Synthase Variant Design Using a Conditional Variational Autoencoder. PLoS Comput. Biol. 2026, 22, e1014087. [Google Scholar] [CrossRef] [PubMed]
- Wu, D.; Xu, F.; Xu, Y.; Huang, M.; Li, Z.; Chu, J. Towards a Hybrid Model-Driven Platform Based on Flux Balance Analysis and a Machine Learning Pipeline for Biosystem Design. Synth. Syst. Biotechnol. 2024, 9, 33–42. [Google Scholar] [CrossRef] [PubMed]
- Jiang, W.; Wang, S.; Ahlheit, D.; Fumagalli, T.; Yang, Z.; Ramanathan, S.; Jiang, X.; Weber, T.; Dahlin, J.; Borodina, I. High-Throughput Metabolic Engineering of Yarrowia lipolytica through Gene Expression Tuning. Proc. Natl. Acad. Sci. USA 2025, 122, e2426686122. [Google Scholar] [CrossRef] [PubMed]
- Iwai, K.; Wehrs, M.; Garber, M.; Sustarich, J.; Washburn, L.; Costello, Z.; Kim, P.W.; Ando, D.; Gaillard, W.R.; Hillson, N.J.; et al. Scalable and Automated CRISPR-Based Strain Engineering Using Droplet Microfluidics. Microsyst. Nanoeng. 2022, 8, 31. [Google Scholar] [CrossRef] [PubMed]
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.-W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for Scientific Data Management and Stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef] [PubMed]
- Plewnia, A.; Hoenig, B.D.; Lötters, S.; Heine, C.; Erens, J.; Böning, P.; Bending, G.D.; Krehenwinkel, H.; Williams, M.A. The Emergence of a CRISPR-Cas Revolution in Ecology: Applications, Challenges, and an Ecologist’s Overview of the Toolbox. Mol. Ecol. Resour. 2026, 26, e70086. [Google Scholar] [CrossRef] [PubMed]
- David, R.; Mabile, L.; Specht, A.; Stryeck, S.; Thomsen, M.; Yahia, M.; Jonquet, C.; Dollé, L.; Jacob, D.; Bailo, D.; et al. FAIRness Literacy: The Achilles’ Heel of Applying FAIR Principles. Data Sci. J. 2020, 19, 1–11. [Google Scholar] [CrossRef]
- D’Ambrosio, V.; Hansen, L.G.; Zhang, J.; Jensen, E.D.; Arsovska, D.; Laloux, M.; Jakočiūnas, T.; Hjort, P.; De Lucrezia, D.; Marletta, S.; et al. A FAIR-Compliant Parts Catalogue for Genome Engineering and Expression Control in Saccharomyces cerevisiae. Synth. Syst. Biotechnol. 2022, 7, 657–663. [Google Scholar] [CrossRef] [PubMed]
- Tao, J.; Bauer, D.E.; Chiarle, R. Assessing and Advancing the Safety of CRISPR-Cas Tools: From DNA to RNA Editing. Nat. Commun. 2023, 14, 212. [Google Scholar] [CrossRef] [PubMed]
- Zhang, D.; Hurst, T.; Duan, D.; Chen, S.-J. Unified Energetics Analysis Unravels SpCas9 Cleavage Activity for Optimal gRNA Design. Proc. Natl. Acad. Sci. USA 2019, 116, 8693–8698. [Google Scholar] [CrossRef] [PubMed]
- Natural Methods Editorial. CRISPR Standards. Nat. Methods 2017, 14, 541. [Google Scholar] [CrossRef]
- Takahashi, M.; Frøslev, T.G.; Paupério, J.; Thalinger, B.; Klymus, K.; Helbing, C.C.; Villacorta-Rath, C.; Silliman, K.; Thompson, L.R.; Jungbluth, S.P.; et al. A Metadata Checklist and Data Formatting Guidelines to Make EDNA FAIR (Findable, Accessible, Interoperable, and Reusable). Environ. DNA 2025, 7, e70100. [Google Scholar] [CrossRef]
- Oughtred, R.; Rust, J.; Chang, C.; Breitkreutz, B.-J.; Stark, C.; Willems, A.; Boucher, L.; Leung, G.; Kolas, N.; Zhang, F.; et al. The BioGRID Database: A Comprehensive Biomedical Resource of Curated Protein, Genetic, and Chemical Interactions. Protein Sci. 2021, 30, 187–200. [Google Scholar] [CrossRef] [PubMed]
- Karp, P.D.; Billington, R.; Caspi, R.; Fulcher, C.A.; Latendresse, M.; Kothari, A.; Keseler, I.M.; Krummenacker, M.; Midford, P.E.; Ong, Q.; et al. The BioCyc Collection of Microbial Genomes and Metabolic Pathways. Brief. Bioinform. 2018, 20, 1085–1093. [Google Scholar] [CrossRef]
- Chang, A.; Jeske, L.; Ulbrich, S.; Hofmann, J.; Koblitz, J.; Schomburg, I.; Neumann-Schaal, M.; Jahn, D.; Schomburg, D. BRENDA, the ELIXIR Core Data Resource in 2021: New Developments and Updates. Nucleic Acids Res. 2021, 49, D498–D508. [Google Scholar] [CrossRef] [PubMed]
- McDonald, A.G.; Boyce, S.; Tipton, K.F. ExplorEnz: The Primary Source of the IUBMB Enzyme List. Nucleic Acids Res. 2009, 37, D593–D597. [Google Scholar] [CrossRef] [PubMed]
- Kanehisa, M.; Furumichi, M.; Sato, Y.; Kawashima, M.; Ishiguro-Watanabe, M. KEGG for Taxonomy-Based Analysis of Pathways and Genomes. Nucleic Acids Res. 2023, 51, D587–D592. [Google Scholar] [CrossRef] [PubMed]
- Tang, Z.; Chen, S.; Chen, A.; He, B.; Zhou, Y.; Chai, G.; Guo, F.; Huang, J. CasPDB: An Integrated and Annotated Database for Cas Proteins from Bacteria and Archaea. Database 2019, 2019, baz093. [Google Scholar] [CrossRef] [PubMed]
- Pourcel, C.; Touchon, M.; Villeriot, N.; Vernadet, J.P.; Couvin, D.; Toffano-Nioche, C.; Vergnaud, G. CRISPRCasdb a Successor of CRISPRdb Containing CRISPR Arrays and Cas Genes from Complete Genome Sequences, and Tools to Download and Query Lists of Repeats and Spacers. Nucleic Acids Res. 2020, 48, D535–D544. [Google Scholar] [CrossRef] [PubMed]
- Adler, B.A.; Trinidad, M.I.; Bellieny-Rabelo, D.; Zhang, E.; Karp, H.M.; Skopintsev, P.; Thornton, B.W.; Weissman, R.F.; Yoon, P.H.; Chen, L.; et al. CasPEDIA Database: A Functional Classification System for Class 2 CRISPR-Cas Enzymes. Nucleic Acids Res. 2024, 52, D590–D596. [Google Scholar] [CrossRef] [PubMed]
- Consortium, T.U. UniProt: The Universal Protein Knowledgebase in 2025. Nucleic Acids Res. 2024, 53, D609–D617. [Google Scholar] [CrossRef] [PubMed]
- Störtz, F.; Minary, P. CrisprSQL: A Novel Database Platform for CRISPR/Cas off-Target Cleavage Assays. Nucleic Acids Res. 2021, 49, D855–D861. [Google Scholar] [CrossRef] [PubMed]
- Pu, Z.; Shi, C.-L.; Jeon, C.O.; Fu, J.; Liu, S.-J.; Lan, C.; Yao, Y.; Liu, Y.-X.; Jia, B. ChatGPT and Generative AI Are Revolutionizing the Scientific Community: A Janus-Faced Conundrum. iMeta 2024, 3, e178. [Google Scholar] [CrossRef] [PubMed]
- Leiter, C.; Zhang, R.; Chen, Y.; Belouadi, J.; Larionov, D.; Fresen, V.; Eger, S. ChatGPT: A Meta-Analysis after 2.5 Months. Mach. Learn. Appl. 2024, 16, 100541. [Google Scholar] [CrossRef]
- Moore, T. CRISPR AI Research Suite Version 0.1.0. 2026. Available online: https://github.com/Tmmoore286/crispr-ai-research-suite (accessed on 24 February 2026).
- Qu, Y.; Huang, K.; Yin, M.; Zhan, K.; Liu, D.; Yin, D.; Cousins, H.C.; Johnson, W.A.; Wang, X.; Shah, M.; et al. CRISPR-GPT for Agentic Automation of Gene-Editing Experiments. Nat. Biomed. Eng. 2026, 10, 245–258. [Google Scholar] [CrossRef] [PubMed]
- Secretaría de Ciencia, Humanidades, Tecnología e Innovación; Agencia de Transformación Digital y Telecomunicaciones. Declaración de Ética y Buenas Prácticas para el Uso y Desarrollo de la IA en México: SECIHTI y ATDT 2026. Available online: https://secihti.mx/sala-de-prensa/presentan-declaracion-de-etica-y-buenas-practicas-para-el-uso-y-desarrollo-de-la-ia-en-mexico-secihti-y-atdt/ (accessed on 25 February 2026).
- Resnik, D.B. Biosafety, Biosecurity, and Bioethics. Monash Bioeth. Rev. 2024, 42, 137–167. [Google Scholar] [CrossRef] [PubMed]
- Ostos Ortiz, O.L. Edición Genética e Inteligencia Artificial: Desafíos Éticos Frente a Los Avances Biotecnológicos. NOVA 2024, 22, 43. [Google Scholar] [CrossRef]
- AL-Eitan, L.; Alnemri, M. Biosafety and Biosecurity in the Era of Biotechnology: The Middle East Region. J. Biosaf. Biosecur. 2022, 4, 130–145. [Google Scholar] [CrossRef]
- Soleimani Sasani, M. The Importance of Biosecurity in Emerging Biotechnologies and Synthetic Biology. Avicenna J. Med. Biotechnol. 2024, 16, 223–232. [Google Scholar] [CrossRef] [PubMed]
- Federal Select Agent Program. Select Agents and Toxins List. 2025. Available online: https://selectagents.gov/sat/list.htm (accessed on 19 March 2026).
- The Australia Group. List of Human and Animal Pathogens and Toxins for Export Control 2024. Available online: https://www.dfat.gov.au/publications/minisite/theaustraliagroupnet/site/en/documents/common-control-lists/Common-Control-List-of-Dual-Use-Biological-Equipment.pdf (accessed on 19 March 2026).
- Bossi, P.; Garin, D.; Guihot, A.; Gay, F.; Crance, J.-M.; Debord, T.; Autran, B.; Bricaire, F. Biological Weapons. Cell. Mol. Life Sci. 2006, 63, 2196–2212. [Google Scholar] [CrossRef] [PubMed]
- Zarate, S.; Cimadori, I.; Roca, M.M.; Jones, M.S.; Barnhill-Dilling, K. Assessment of the Regulatory and Institutional Framework for Agricultural Gene Editing via CRISPR-Based Technologies in Latin America and the Caribbean; Inter-American Development Bank: Washington, DC, USA, 2023. [Google Scholar] [CrossRef]
- Flores-Coronado, J.A.; Alanis-Valdez, A.Y.; Herrera-Saldivar, M.F.; Flores-Flores, A.S.; Vazquez-Guillen, J.M.; Tamez-Guerra, R.S.; Rodriguez-Padilla, C. Awareness of the Dual-Use Dilemma in Scientific Research: Reflections and Challenges to Latin America. Front. Bioeng. Biotechnol. 2025, 13, 1649781. [Google Scholar] [CrossRef] [PubMed]
- Li, X.; Gao, Y.; Zhang, Z.; Deng, W.; Cao, W.; Wei, X.; Gao, Z.; Yao, L.; Wang, S.; Xie, Y.; et al. Biosafety Considerations Triggered by Genome-Editing Technologies. Biosaf. Health 2025, 7, 141–151. [Google Scholar] [CrossRef] [PubMed]



| Tool Name | Associated Algorithms | Core Features & Training Strategy | Strengths | Limitations | Target Species | Study |
|---|---|---|---|---|---|---|
| CRISPRon | CNN + Feedforward Layers | Appends thermodynamic parameters (RNA-DNA binding energy) directly into the network | Capture complex sequence features | Limited to on-target prediction | Human, Mouse, and Zebrafish | [11] |
| DeepSpCas9 | CNN (Multiple Filter Sizes) | End-to-end learning trained on a massive, uniformly generated high-throughput dataset | High generalisation performance; trained on direct DNA cleavage frequencies | Restricted to wild-type SpCas9 on-target activity | Human | [37] |
| DeepCas9 | 1D-CNN | One-hot encoding of 30-nt sequences; automated spatial feature extraction | Reliable capture of complex sgRNA sequence patterns | Susceptible to biases introduced by data inconsistencies | Human and Mouse | [33] |
| DeepHF | BiLSTM + Dense Layers | Combines sequential memory embeddings with hand-crafted biological/thermodynamic features | Demonstrates high accuracy for high-fidelity Cas9 variants | Lacks generalisation to other Cas orthologs | Human | [32] |
| CRISPR-Learner | CNN | Transfer learning capabilities; dynamic zero-padding for variable-length sequences | Supports custom dataset training for gRNA design | Currently restricted to the assessment of on-target cleavage efficiency | Human and Mouse | [40] |
| DeepCRISPR | Deep Belief Network (DBN)/ Autoencoder + CNN | Unsupervised pre-training on unlabelled data; integration of epigenetic features | Unifies on-target and off-target prediction within a single computational framework | Strictly limited to NGG-based SpCas9 systems. Currently validated only for human genomic data | Human and Mouse | [41] |
| Microorganism Species | Library Size | Associated Algorithms | Associated AI Tools | AI Tool Strengths | AI Tool Limitations | Study |
|---|---|---|---|---|---|---|
| Citrobacter rodentium | 31,796 | Transfer learning/CNN-RNN | crisprHAL | Distinguishes on-target cleavage from cellular toxicity and generalises predictive performance across diverse bacterial species | Necessitates transfer learning from larger datasets to ensure high performance | [23] |
| E. coli K12 MG1655 | 65,928 | Gradient boosting regression tree (GBRT) | sgRNA-cleavage-activity-prediction | Eliminates mammalian-specific biases like chromatin noise and DNA repair preferences | Displays reduced resolution in DNA-repair-deficient backgrounds | [22] |
| CNN | DeepSgRNAbacteria | Captures critical flanking sequence information | Lacks cross-domain validity between prokaryotes and eukaryotes | [21] | ||
| E. coli strains MG1655 and BW25113 | ~10,000 | N/A | N/A | N/A | N/A | [81] |
| E. coli W (ATCC 9637) | 6044 | N/A | N/A | N/A | N/A | [92] |
| K. phaffii GS115 | 31,984 | N/A | N/A | N/A | N/A | [93] |
| Y. lipolytica strain PO1f | 46,234 | CNN | DeepGuide | Accurately predicts high-activity Cas9 and Cas12a gRNAs in specific organisms like Y. lipolytica. Incorporates genomic context and epigenetic features to improve targeting precision | Struggles to identify low-activity gRNAs for Cas9 with accuracy | [94,95] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Delgado-Nungaray, J.A.; Pérez-Ponce, D.A.; Figueroa-Yáñez, L.J.; Reynaga-Delgado, E.; García-Ramírez, M.A.; Gonzalez-Reynoso, O. Advances in AI-Guided CRISPR-Cas9 Engineering Strategies for Microbial Biotechnology. J. Genome Biotechnol. Genet. 2026, 1, 10. https://doi.org/10.3390/jgbg1020010
Delgado-Nungaray JA, Pérez-Ponce DA, Figueroa-Yáñez LJ, Reynaga-Delgado E, García-Ramírez MA, Gonzalez-Reynoso O. Advances in AI-Guided CRISPR-Cas9 Engineering Strategies for Microbial Biotechnology. Journal of Genome Biotechnology and Genetics. 2026; 1(2):10. https://doi.org/10.3390/jgbg1020010
Chicago/Turabian StyleDelgado-Nungaray, Javier Alejandro, Dulce Alitzel Pérez-Ponce, Luis Joel Figueroa-Yáñez, Eire Reynaga-Delgado, Mario Alberto García-Ramírez, and Orfil Gonzalez-Reynoso. 2026. "Advances in AI-Guided CRISPR-Cas9 Engineering Strategies for Microbial Biotechnology" Journal of Genome Biotechnology and Genetics 1, no. 2: 10. https://doi.org/10.3390/jgbg1020010
APA StyleDelgado-Nungaray, J. A., Pérez-Ponce, D. A., Figueroa-Yáñez, L. J., Reynaga-Delgado, E., García-Ramírez, M. A., & Gonzalez-Reynoso, O. (2026). Advances in AI-Guided CRISPR-Cas9 Engineering Strategies for Microbial Biotechnology. Journal of Genome Biotechnology and Genetics, 1(2), 10. https://doi.org/10.3390/jgbg1020010

