SPINET-KSP: A Multi-Modal LLM-Graph Foundation Model for Contextual Prediction of Kinase-Substrate-Phosphatase Triads
Abstract
1. Introduction
2. Related Works
3. Results and Discussion
3.1. Comparative Analysis Utilising Advanced Techniques
3.2. Ablation Analyses and Controls
4. Materials and Methods
4.1. Data Sources and Integration
4.2. Workflow
4.3. Model Architecture
- a.
- Heterogeneous KSP Interaction Graph Construction
- b.
- Pretraining and Fine-Tuning
- c.
- Evaluation, Independent Testing, and System Configuration
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Moro, V.; Loh, C.; Dangovski, R.; Ghorashi, A.; Ma, A.; Chen, Z.; Kim, S.; Lu, P.Y.; Christensen, T.; Soljačić, M. Multimodal foundation models for material property prediction and discovery. Newton 2025, 1, 100016. [Google Scholar] [CrossRef]
- Wu, X.; Xu, M.; Geng, M.; Chen, S.; Little, P.J.; Xu, S.; Weng, J. Targeting protein modifications in metabolic diseases: Molecular mechanisms and targeted therapies. Signal Transduct. Target. Ther. 2023, 8, 220. [Google Scholar] [CrossRef] [PubMed]
- Weber, S.; Meyer-Roxlau, S.; Wagner, M.; Dobrev, D.; El-Armouche, A. Counteracting Protein Kinase Activity in the Heart: The Multiple Roles of Protein Phosphatases. Front. Pharmacol. 2015, 6, 270. [Google Scholar] [CrossRef] [PubMed]
- Chen, M.; Zhang, W.; Gou, Y.; Xu, D.; Wei, Y.; Liu, D.; Han, C.; Huang, X.; Li, C.; Ning, W.; et al. GPS 6.0: An updated server for prediction of kinase-specific phosphorylation sites in proteins. Nucleic Acids Res. 2023, 51, W243–W250. [Google Scholar] [CrossRef] [PubMed]
- DeMarco, A.G.; Hall, M.C. Phosphoproteomic Approaches for Identifying Phosphatase and Kinase Substrates. Molecules 2023, 28, 3675. [Google Scholar] [CrossRef]
- Verburgt, J.; Zhang, Z.; Kihara, D. AlphaFold model quality self-assessment improvement via deep graph learning. Protein Sci. 2025, 34, e70274. [Google Scholar] [CrossRef]
- Choudhuri, S.; Yendluri, M.; Poddar, S.; Li, A.; Mallick, K.; Mallik, S.; Ghosh, B. Recent Advancements in Computational Drug Design Algorithms through Machine Learning and Optimization. Kinases Phosphatases 2023, 1, 117–140. [Google Scholar] [CrossRef]
- Gavali, S.; Ross, K.; Chen, C.; Cowart, J.; Wu, C.H. A knowledge graph representation learning approach to predict novel kinase-substrate interactions. Mol. Omics 2022, 18, 853–864. [Google Scholar] [CrossRef]
- Anandakrishnan, M.; Ross, K.E.; Chen, C.; Shanker, V.; Cowart, J.; Wu, C.H. KSFinder—A knowledge graph model for link prediction of novel phosphorylated substrates of kinases. PeerJ 2023, 11, e16164. [Google Scholar] [CrossRef]
- Nováček, V.; McGauran, G.; Matallanas, D.; Vallejo Blanco, A.; Conca, P.; Muñoz, E.; Costabello, L.; Kanakaraj, K.; Nawaz, Z.; Walsh, B.; et al. Accurate prediction of kinase-substrate networks using knowledge graphs. PLoS Comput. Biol. 2020, 16, e1007578. [Google Scholar] [CrossRef]
- Anandakrishnan, M.; Ross, K.E.; Chen, C.; Vijay-Shanker, K.; Wu, C.H. KSMoFinder—Knowledge graph embedding of proteins and motifs for predicting kinases of human phosphosites. Bioinform. Adv. 2025, 5, vbaf289. [Google Scholar] [CrossRef] [PubMed]
- Ayati, M.; Yilmaz, S.; Blasco Tavares Pereira Lopes, F.; Chance, M.; Koyuturk, M. Prediction of Kinase-Substrate Associations Using The Functional Landscape of Kinases and Phosphorylation Sites. Pac. Symp. Biocomput. 2023, 28, 73–84. [Google Scholar] [PubMed]
- Varadi, M.; Bertoni, D.; Magana, P.; Paramval, U.; Pidruchna, I.; Radhakrishnan, M.; Tsenkov, M.; Nair, S.; Mirdita, M.; Yeo, J.; et al. AlphaFold Protein Structure Database in 2024: Providing structure coverage for over 214 million protein sequences. Nucleic Acids Res. 2024, 52, D368–D375. [Google Scholar] [CrossRef] [PubMed]
- Szklarczyk, D.; Kirsch, R.; Koutrouli, M.; Nastou, K.; Mehryary, F.; Hachilif, R.; Gable, A.L.; Fang, T.; Doncheva, N.T.; Pyysalo, S.; et al. The STRING database in 2023: Protein–protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023, 51, D638–D646. [Google Scholar] [CrossRef]
- Nita-Lazar, A.; Saito-Benz, H.; White, F.M. Quantitative phosphoproteomics by mass spectrometry: Past, present, and future. Proteomics 2008, 8, 4433–4443. [Google Scholar] [CrossRef]
- Simpson, L.M.; Fulcher, L.J.; Sathe, G.; Brewer, A.; Zhao, J.-F.; Squair, D.R.; Crooks, J.; Wightman, M.; Wood, N.T.; Gourlay, R.; et al. An affinity-directed phosphatase, AdPhosphatase, system for targeted protein dephosphorylation. Cell Chem. Biol. 2023, 30, 188–202.e6. [Google Scholar] [CrossRef]
- Yu, J.; Shan, D.; Song, H.; Yang, M. A novel hybrid machine learning model for predicting rate constants of the reactions between alkane and CH3 radical. Fuel 2022, 322, 124150. [Google Scholar] [CrossRef]
- Jiang, W.; Jaehnig, E.J.; Liao, Y.; Shi, Z.; Yaron-Barir, T.M.; Johnson, J.L.; Cantley, L.C.; Zhang, B. Deciphering the dark cancer phosphoproteome using machine-learned co-regulation of phosphosites. Nat. Commun. 2025, 16, 2766. [Google Scholar] [CrossRef]
- Crowl, S.; Jordan, B.T.; Ahmed, H.; Ma, C.X.; Naegle, K.M. KSTAR: An algorithm to predict patient-specific kinase activities from phosphoproteomic data. Nat. Commun. 2022, 13, 4283. [Google Scholar] [CrossRef]
- Ma, H.; Li, G.; Su, Z. KSP: An integrated method for predicting catalyzing kinases of phosphorylation sites in proteins. BMC Genom. 2020, 21, 537. [Google Scholar] [CrossRef]
- Zhang, X.-M.; Liang, L.; Liu, L.; Tang, M.-J. Graph Neural Networks and Their Current Applications in Bioinformatics. Front. Genet. 2021, 12, 690049. [Google Scholar] [CrossRef]
- Foody, G.M. Challenges in the real world use of classification accuracy metrics: From recall and precision to the Matthews correlation coefficient. PLoS ONE 2023, 18, e0291908. [Google Scholar] [CrossRef]









| Model | Triad Prediction | Multi-Modal Inputs | Context-Awareness | Phosphatase Handling | Quantitative Occupancy | Key Limitation |
|---|---|---|---|---|---|---|
| SPINET-KSP | Yes (KSP) | Yes (sequence, structure, interactomes, phosphoproteomics) | Yes (512D manifold) | Yes (antagonistic) | Yes (r = 0.613) | N/A |
| KSFinder | No (KS only) | No (knowledge graphs only) | No | No | No | No triad/context |
| KSMoFinder | No (KS only) | No (graphs + motifs) | No | No | No | Limited to known motifs |
| GPS 6.0 | No (KS only) | No (sequence only) | No | No | No | No structure/ network |
| Model | AUROC (Kinase Pred.) | AUROC (Phosphatase Pred.) | Pearson r (Occupancy) | Notes |
|---|---|---|---|---|
| SPINET-KSP | 0.998 | 0.992 | 0.922 | Full triad/context |
| GPS 6.0 | 0.912 | N/A | N/A | Sequence-only |
| KSFinder | 0.885 | N/A | N/A | Graph-only, no phosphatases |
| KSMoFinder | 0.902 | N/A | N/A | Motifs + graphs, no context |
| Ablation Variant | AUROC (Kinase) | Pearson r (Occupancy) | MCC | Notes |
|---|---|---|---|---|
| Full SPINET-KSP | 0.998 | 0.922 | 0.970 | Baseline |
| No Reversible Triad Attention | 0.912 | 0.745 | 0.812 | Loses antagonism modelling |
| No Multi-Modal (Sequence Only) | 0.885 | 0.682 | 0.765 | Matches sequence baselines |
| No Cell-State Manifold | 0.934 | 0.811 | 0.852 | Lacks context-awareness |
| Random Negative Sampling Only | 0.901 | 0.792 | 0.801 | Increases bias/leaks |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Arowolo, M.O.; Okon, M.E.; Austria, D.; Azam, M.; Abdulsalam, S.O. SPINET-KSP: A Multi-Modal LLM-Graph Foundation Model for Contextual Prediction of Kinase-Substrate-Phosphatase Triads. Kinases Phosphatases 2026, 4, 3. https://doi.org/10.3390/kinasesphosphatases4010003
Arowolo MO, Okon ME, Austria D, Azam M, Abdulsalam SO. SPINET-KSP: A Multi-Modal LLM-Graph Foundation Model for Contextual Prediction of Kinase-Substrate-Phosphatase Triads. Kinases and Phosphatases. 2026; 4(1):3. https://doi.org/10.3390/kinasesphosphatases4010003
Chicago/Turabian StyleArowolo, Michael Olaolu, Marian Emmanuel Okon, Davis Austria, Muhammad Azam, and Sulaiman Olaniyi Abdulsalam. 2026. "SPINET-KSP: A Multi-Modal LLM-Graph Foundation Model for Contextual Prediction of Kinase-Substrate-Phosphatase Triads" Kinases and Phosphatases 4, no. 1: 3. https://doi.org/10.3390/kinasesphosphatases4010003
APA StyleArowolo, M. O., Okon, M. E., Austria, D., Azam, M., & Abdulsalam, S. O. (2026). SPINET-KSP: A Multi-Modal LLM-Graph Foundation Model for Contextual Prediction of Kinase-Substrate-Phosphatase Triads. Kinases and Phosphatases, 4(1), 3. https://doi.org/10.3390/kinasesphosphatases4010003

