A Machine Learning-Enabled Venom Peptide Platform for Rapid Drug Discovery
Abstract
1. Introduction
2. Results
2.1. Venom—Conotoxin Library Design
2.2. High-Throughput Recombinant Expression of Venom Peptides
2.3. Validation of the VCX Library by Panning Against Target Proteins
2.4. ML-Assisted Affinity Maturation Strategy for DLL3
3. Discussion
4. Materials and Methods
4.1. Venom—Conotoxin Library Design
4.2. Venom Library Construction
4.3. High Throughput Cloning, Recombinant Expression, and Purification of Trx-VCX Fusion Proteins
4.4. Expression and Purification of Target Proteins
4.4.1. CD47 and DLL3
4.4.2. IL33
4.4.3. P2X7R
4.5. Primary Screening of Venom Library
4.5.1. Selection of CD47, DLL3, and IL33
4.5.2. Selection of P2X7R
4.6. Next-Generation Sequencing (NGS)
4.7. NNK-Block Screening
4.8. Machine Learning Model Training and Prediction of Focused Library
4.9. Solid-Phase Peptide Synthesis
4.10. SPR Measurements
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| ML | Machine Learning |
| UMAP | Uniform Manifold Approximation and Projection |
| VCX | Venom-ConotoXin |
| PPI | Protein–Protein Interaction |
| DCP | Disulfide-Constrained Peptide |
| SPPS | Solid Phase Peptide Synthesis |
| Trx | Thioredoxin |
| GPCR | G-Protein-coupled Receptor |
| TFA | trifluoroacetic acid |
| DMSO | dimethyl sulfoxide |
| LC-MS | liquid chromatography-mass spectrometry |
References
- Freuville, L.; Matthys, C.; Quinton, L.; Gillet, J.P. Venom-derived peptides for breaking through the glass ceiling of drug development. Front. Chem. 2024, 12, 1465459. [Google Scholar] [CrossRef]
- Kim, E.; Hwang, D.H.; Mohan Prakash, R.L.; Asirvatham, R.D.; Lee, H.; Heo, Y.; Munawir, A.; Seyedian, R.; Kang, C. Animal Venom in Modern Medicine: A Review of Therapeutic Applications. Toxins 2025, 17, 371. [Google Scholar] [CrossRef]
- Marte, F.; Sankar, P.; Patel, P.; Cassagnol, M. Captopril; StatPearls: Treasure Island, FL, USA, 2025. [Google Scholar]
- Bridges, A.; Bistas, K.G.; Jacobs, T.F. Exenatide; StatPearls: Treasure Island, FL, USA, 2025. [Google Scholar]
- Wie, C.S.; Derian, A. Ziconotide; StatPearls: Treasure Island, FL, USA, 2025. [Google Scholar]
- Cai, F.; Wei, Y.; Kirchhofer, D.; Chang, A.; Zhang, Y. Rapid prediction of key residues for foldability by machine learning model enables the design of highly functional libraries with hyperstable constrained peptide scaffolds. PLoS Comput. Biol. 2024, 20, e1012609. [Google Scholar] [CrossRef]
- Zhou, L.; Cai, F.; Li, Y.; Gao, X.; Wei, Y.; Fedorova, A.; Kirchhofer, D.; Hannoush, R.N.; Zhang, Y. Disulfide-constrained peptide scaffolds enable a robust peptide-therapeutic discovery platform. PLoS ONE 2024, 19, e0300135. [Google Scholar] [CrossRef] [PubMed]
- UniProt, C. UniProt: The Universal Protein Knowledgebase in 2025. Nucleic Acids Res. 2025, 53, D609–D617. [Google Scholar]
- Kaas, Q.; Westermann, J.C.; Halai, R.; Wang, C.K.; Craik, D.J. ConoServer, a database for conopeptide sequences and structures. Bioinformatics 2008, 24, 445–446. [Google Scholar] [CrossRef]
- Kaas, Q.; Yu, R.; Jin, A.H.; Dutertre, S.; Craik, D.J. ConoServer: Updated content, knowledge, and discovery tools in the conopeptide database. Nucleic Acids Res. 2012, 40, D325–D330. [Google Scholar] [CrossRef] [PubMed]
- Schwalen, C.J.; Babu, C.; Phulera, S.; Hao, Q.; Wall, D.; Nettleton, D.O.; Pathak, T.P.; Siuti, P. Scalable Biosynthetic Production of Knotted Peptides Enables ADME and Thermodynamic Folding Studies. ACS Omega 2021, 6, 29555–29566. [Google Scholar] [CrossRef]
- Grabski, A.C. Advances in preparation of biological extracts for protein purification. Methods Enzymol. 2009, 463, 285–303. [Google Scholar]
- Kyte, J.; Doolittle, R.F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 1982, 157, 105–132. [Google Scholar] [CrossRef]
- Lin, Z.; Akin, H.; Rao, R.; Hie, B.; Zhu, Z.; Lu, W.; Smetanin, N.; Verkuil, R.; Kabeli, O.; Shmueli, Y.; et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 2023, 379, 1123–1130. [Google Scholar] [CrossRef]
- Chennamsetty, N.; Voynov, V.; Kayser, V.; Helk, B.; Trout, B.L. Prediction of aggregation prone regions of therapeutic proteins. J. Phys. Chem. B 2010, 114, 6614–6624. [Google Scholar] [CrossRef] [PubMed]
- Ahvati, H.; Roudi, R.; Sobhani, N.; Safari, F. CD47 as a potent target in cancer immunotherapy: A review. Biochim. Biophys. Acta Rev. Cancer 2025, 1880, 189294. [Google Scholar] [CrossRef] [PubMed]
- Calderon, A.A.; Dimond, C.; Choy, D.F.; Pappu, R.; Grimbaldeston, M.A.; Mohan, D.; Chung, K.F. Targeting interleukin-33 and thymic stromal lymphopoietin pathways for novel pulmonary therapeutics in asthma and COPD. Eur. Respir. Rev. 2023, 32, 220144. [Google Scholar] [CrossRef] [PubMed]
- Wang, H.; Zheng, T.; Xu, D.; Sun, C.; Huang, D.; Liu, X. Targeting DLL3: Innovative Strategies for Tumor Treatment. Pharmaceutics 2025, 17, 520. [Google Scholar] [CrossRef]
- Liu, X.; Li, Y.; Huang, L.; Kuang, Y.; Wu, X.; Ma, X.; Zhao, B.; Lan, J. Unlocking the therapeutic potential of P2X7 receptor: A comprehensive review of its role in neurodegenerative disorders. Front. Pharmacol. 2024, 15, 1450704. [Google Scholar] [CrossRef]
- Tonikian, R.; Zhang, Y.; Boone, C.; Sidhu, S.S. Identifying specificity profiles for peptide recognition modules from phage-displayed peptide libraries. Nat. Protoc. 2007, 2, 1368–1386. [Google Scholar] [CrossRef]
- Zhang, Y.; Ultsch, M.; Skelton, N.J.; Burdick, D.J.; Beresini, M.H.; Li, W.; Kong-Beltran, M.; Peterson, A.; Quinn, J.; Chiu, C.; et al. Discovery of a cryptic peptide-binding site on PCSK9 and design of antagonists. Nat. Struct. Mol. Biol. 2017, 24, 848–856. [Google Scholar] [CrossRef]
- Hsiao, M.H.; Miao, Y.; Liu, Z.; Schutze, K.; Limjunyawong, N.; Chien, D.C.; Monteiro, W.D.; Chu, L.S.; Morgenlander, W.; Jayaraman, S.; et al. Molecular Display of the Animal Meta-Venome for Discovery of Novel Therapeutic Peptides. Mol. Cell Proteom. 2025, 24, 100901. [Google Scholar] [CrossRef]
- Thurber, G.M.; Schmidt, M.M.; Wittrup, K.D. Antibody tumor penetration: Transport opposed by systemic and antigen-mediated clearance. Adv. Drug Deliv. Rev. 2008, 60, 1421–1434. [Google Scholar] [CrossRef]
- Vadevoo, S.M.P.; Gurung, S.; Lee, H.S.; Gunassekaran, G.R.; Lee, S.M.; Yoon, J.W.; Lee, Y.K.; Lee, B. Peptides as multifunctional players in cancer therapy. Exp. Mol. Med. 2023, 55, 1099–1109. [Google Scholar] [CrossRef] [PubMed]
- Armstrong, A.; Coburn, F.; Nsereko, Y.; Al Musaimi, O. Peptide-Drug Conjugates: A New Hope for Cancer. J. Pept. Sci. 2025, 31, e70040. [Google Scholar] [CrossRef] [PubMed]
- Yu, Y.; Lyu, J.; Muhadaisi, Y.; Shi, C.; Wang, D. Peptide based vesicles for cancer immunotherapy: Design, construction and applications. Front. Immunol. 2025, 16, 1609162. [Google Scholar] [CrossRef] [PubMed]
- Xiao, W.; Jiang, W.; Chen, Z.; Huang, Y.; Mao, J.; Zheng, W.; Hu, Y.; Shi, J. Advance in peptide-based drug development: Delivery platforms, therapeutics and vaccines. Signal Transduct. Target. Ther. 2025, 10, 74. [Google Scholar] [CrossRef]
- Sagar, B.; Gupta, S.; Verma, S.K.; Reddy, Y.V.M.; Shukla, S. Navigating cancer therapy: Harnessing the power of peptide-drug conjugates as precision delivery vehicles. Eur. J. Med. Chem. 2025, 283, 117131. [Google Scholar] [CrossRef]
- Daniel, J.T.; Clark, R.J. G-Protein Coupled Receptors Targeted by Analgesic Venom Peptides. Toxins 2017, 9, 372. [Google Scholar] [CrossRef]
- Mir, R.; Karim, S.; Kamal, M.A.; Wilson, C.M.; Mirza, Z. Conotoxins: Structure, Therapeutic Potential and Pharmacological Applications. Curr. Pharm. Des. 2016, 22, 582–589. [Google Scholar] [CrossRef]
- Nareoja, K.; Nasman, J. Selective targeting of G-protein-coupled receptor subtypes with venom peptides. Acta Physiol. 2012, 204, 186–201. [Google Scholar] [CrossRef]
- Saez, N.J.; Senff, S.; Jensen, J.E.; Er, S.Y.; Herzig, V.; Rash, L.D.; King, G.F. Spider-venom peptides as therapeutics. Toxins 2010, 2, 2851–2871. [Google Scholar] [CrossRef]
- Wulff, H.; Christophersen, P.; Colussi, P.; Chandy, K.G.; Yarov-Yarovoy, V. Antibodies and venom peptides: New modalities for ion channels. Nat. Rev. Drug Discov. 2019, 18, 339–357. [Google Scholar] [CrossRef]
- Kunkel, T.A.; Roberts, J.D.; Zakour, R.A. Rapid and efficient site-specific mutagenesis without phenotypic selection. Methods Enzymol. 1987, 154, 367–382. [Google Scholar]
- Delgado, B.D.; Long, S.B. Mechanisms of ion selectivity and throughput in the mitochondrial calcium uniporter. Sci. Adv. 2022, 8, eade1516. [Google Scholar] [CrossRef]
- Tombling, B.J.; Cai, F.; Chang, A.; Balana, A.; Miller, S.; Walters, A.L.; Wei, Y.; Tang, W.; Wendorff, T.; Liu, P.; et al. Developing high affinity peptide ligands of LRRC15 using machine learning. In Journal of Peptide Science; Wiley: Hoboken, NJ, USA, 2025. [Google Scholar]
- Ro, S.Y.; Jao, C.; Oh, A.; Kschonsak, M.; Li, T.; Austin, D.; Greiner, D.M.Z.; Zhou, L.; Zhang, Y.; Chen, J.; et al. Fab-Induced Stabilization of an Ion Channel Receptor Enables Mechanistic Characterization of Small-Molecule Therapeutics. Anal. Chem. 2025, 97, 5102–5108. [Google Scholar] [CrossRef]





| Number of Hits | Number of Scaffolds | Trx-Fusion Kd Range | |
|---|---|---|---|
| P2X7R | 16 | 11 | 21 nM–2 µM |
| IL33 | 57 | 39 | 213 nM–34 µM |
| DLL3 | 22 | 14 | 6.6–185 µM |
| CD47 | 33 | 28 | 1–555 µM |
| Target | Sequence | Trx-Fusion Kd (M) | SPPS Kd (M) |
|---|---|---|---|
| CD47 | QHRRNENQKAHDVNAQTYTWCCTQGPCRNTHRNGCS | 2.89 × 10−6 | 1.76 × 10−5 |
| CD47 | NWCPPRISLCNSDKHCCKYVRCQRRDARMDKEECSQ | 6.39 × 10−6 | 1.51 × 10−5 |
| CD47 | TRRRQCPPWCTYKICYESTC | 5.49 × 10−6 | 1.63 × 10−5 |
| CD47 | KCAKYHEVCGDDSKCCHSFDCPGEVIIYCEKSN | 4.27 × 10−4 | 3.91 × 10−5 |
| DLL3 | QWPFQQWIPCTIHWNCDGNWCCFPITCYEQTGMCD | 5.54 × 10−5 | 3.50 × 10−6 |
| DLL3 | SWDWTWTSWNDNHETSYQIEDCCPNLQEFCCP | 2.44 × 10−5 | 2.07 × 10−5 |
| DLL3 | YGKDHHEWVMYEWSQEEITCLDWGELCNLWFPTCCEYCIHPFCA | 1.39 × 10−5 | 7.50 × 10−6 |
| DLL3 | WSEEWTWISCPMTWNCDGNWCCWHWDCGWQTWMCD | 1.54 × 10−9 | 6.66 × 10−9 |
| DLL3 | WDPTWQWLPCPMHWNCDGNWCCWTWDCGESGWMCD | 1.10 × 10−9 | 1.26 × 10−8 |
| DLL3 | WDWEQWWIPCAMHWNCDGNWCCSWWDCTDQGGMCD | 1.36 × 10−8 | 8.77 × 10−8 |
| DLL3 | WTPECTWTCHWTTCNESWCSCWSWHECTWT | 1.05 × 10−7 | 3.26 × 10−6 |
| DLL3 | WWETWTWIPCYTSWNCDGNWCCMHTDCTESWWMCD | 1.51 × 10−8 | 6.53 × 10−9 |
| DLL3 | CGDTCYGLTCNTPFCTCKADRCWATWYWT | 2.17 × 10−7 | 2.71 × 10−8 |
| IL33 | NPRHGTCYYVKFRCEHRWCWIHVKKCPQTDADDAFN | 2.83 × 10−7 | 3.83 × 10−7 |
| IL33 | CTPNGGFCIMHYHCCKWTCFTITWNCN | 8.00 × 10−7 | 1.60 × 10−6 |
| IL33 | DRPRPSKRCIAWKQPCEPHRNHNCCQEHCWNFVCE | 2.96 × 10−6 | 2.93 × 10−6 |
| IL33 | NAAYWHPQPKWGFTQYHFICNASHCDWVWYCKFMKCYDCRNTRCT | 3.04 × 10−7 | 1.37 × 10−5 |
| IL33 | RRNLQTEWNPLSLFMWRRWCWWHCRWHSHCASHCICTFRGCGAVNG | 4.32 × 10−7 | 7.60 × 10−7 |
| IL33 | SKRKTTAHPWIEYECTYVHQTCHDQTPCCSGWCVFYCTGWR | 6.32 × 10−7 | 1.70 × 10−5 |
| IL33 | CWYCYCPWWPCPQDQDCPGECICMAHGFCG | 4.34 × 10−6 | 3.05 × 10−7 |
| IL33 | GLPVCGETCTLGKCYTSCCWCWWPWCYCR | 3.11 × 10−7 | 3.90 × 10−6 |
| IL33 | SIPIWRCYPACILDTCDSYGCDCGEWMLCYMAN | 8.99 × 10−7 | 1.79 × 10−7 |
| IL33 | GLPVCGEYCFTGKCYTWCCWCTPRRYCECR | 8.57 × 10−7 | 5.72 × 10−7 |
| IL33 | GLPLCAEECSLGTCWTSCCWCWWPWCYCR | 3.69 × 10−7 | 2.13 × 10−7 |
| IL33 | GLPLCGEECGSNSCFTTCCWCWWPWCYCR | 3.75 × 10−7 | 1.10 × 10−7 |
| IL33 | CWYCYCPYRYCPSWTDCPRHCYCRFHGFCG | 2.20 × 10−6 | 2.60 × 10−6 |
| IL33 | GIPVWRCYPACILDTCDSYGCECGEWMLCYMSD | 3.08 × 10−7 | 1.55 × 10−7 |
| IL33 | SIPVWRCYPACILDTCDSYGCECGEWMLCYMTD | 2.48 × 10−7 | 1.77 × 10−7 |
| IL33 | DSARKEVENPKASKWHWYWCQWRPRPCNSSVPCCGGSCGYFSCR | 4.70 × 10−7 | 1.73 × 10−7 |
| IL33 | EDTRKEVENPKASKWHWYWCQWRPRLCNSSVPCCSGSCGYFSCR | 5.44 × 10−7 | 3.09 × 10−7 |
| IL33 | CDCNYRCSPQYSPPCRCRWCWCHPLGLFVGFCIHPTG | 4.11 × 10−8 | 9.51 × 10−8 |
| IL33 | CWYCYCPWRYCVSAQTCSAHCWCSYHGFCG | 2.68 × 10−7 | 1.11 × 10−7 |
| P2X7R | RRDCRWYQCEFQCCETINGQERCREINCH | 1.94 × 10−6 | 5.17 × 10−6 |
| P2X7R | KSDHVHKKWRWDKTARDHSNRPPPCCNNPACLSNRC | 5.60 × 10−6 | 2.93 × 10−6 |
| P2X7R | SHGRNAARKASDLIALTVRECCSQPPCRWKHPELCS | 7.00 × 10−6 | 1.16 × 10−6 |
| P2X7R | CCHTQCSQQYNCGQ | 5.53 × 10−6 | 4.18 × 10−6 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Cai, F.; Zhou, L.; Delgado, B.; Chang, W.; Tom, J.; Hernandez, E.; Joshi, P.; Song, A.; Masureel, M.; Maun, H.R.; et al. A Machine Learning-Enabled Venom Peptide Platform for Rapid Drug Discovery. Pharmaceuticals 2026, 19, 288. https://doi.org/10.3390/ph19020288
Cai F, Zhou L, Delgado B, Chang W, Tom J, Hernandez E, Joshi P, Song A, Masureel M, Maun HR, et al. A Machine Learning-Enabled Venom Peptide Platform for Rapid Drug Discovery. Pharmaceuticals. 2026; 19(2):288. https://doi.org/10.3390/ph19020288
Chicago/Turabian StyleCai, Fei, Lijuan Zhou, Bryce Delgado, Wenping Chang, Jeffrey Tom, Evelyn Hernandez, Prajakta Joshi, Aimin Song, Matthieu Masureel, Henry R. Maun, and et al. 2026. "A Machine Learning-Enabled Venom Peptide Platform for Rapid Drug Discovery" Pharmaceuticals 19, no. 2: 288. https://doi.org/10.3390/ph19020288
APA StyleCai, F., Zhou, L., Delgado, B., Chang, W., Tom, J., Hernandez, E., Joshi, P., Song, A., Masureel, M., Maun, H. R., Chang, A., & Zhang, Y. (2026). A Machine Learning-Enabled Venom Peptide Platform for Rapid Drug Discovery. Pharmaceuticals, 19(2), 288. https://doi.org/10.3390/ph19020288

