From Black Box to Biological Insight: AttentioFuse Unlocks Multi-Omics Dynamics in Lung Cancer
Simple Summary
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Acquisition and Processing
2.2. Model Design
2.2.1. Omics-Specific Encoders
2.2.2. Cross-Omics Attention Fusion
2.2.3. Interpretation Methods
2.3. Comparative Model Implementation
2.4. Model Training Protocol
3. Results
3.1. Introductory Summary of Omics-Model Characteristics
3.2. Quantitative Comparison Between 3F and 5X
3.3. Interpretability Analysis of LUAD and LUSC Staging Characteristics
3.3.1. Validated Pathways and Novel Mechanisms in LUSC
3.3.2. Validated Pathways and Novel Mechanisms in LUAD
3.4. NSCLC Common Mechanisms and Personalized Therapeutic Implications
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Alduais, Y. Non-small cell lung cancer (NSCLC): A review of risk factors, diagnosis, and treatment. Medicine 2023, 102, e32899. [Google Scholar] [CrossRef]
- Yates, J.; Van Allen, E.M. New horizons at the interface of artificial intelligence and translational cancer research. Cancer Cell 2025, 43, 708–727. [Google Scholar] [CrossRef]
- Lotter, W.; Hassett, M.; Schultz, N.; Kehl, K.; Van Allen, E.; Cerami, E. Artificial Intelligence in Oncology: Current Landscape, Challenges, and Future Directions. Cancer Discov. 2024, 14, 711–726. [Google Scholar] [CrossRef]
- Huang, D.; Li, Z.; Jiang, T.; Yang, C.; Li, N. Artificial intelligence in lung cancer: Current applications, future perspectives, and challenges. Front. Oncol. 2024, 14, 1486310. [Google Scholar] [CrossRef]
- Wysocka, M. A systematic review of biologically-informed deep learning models for cancer: Fundamental trends for encoding and interpreting oncology data. BMC Bioinform. 2023, 24, 198. [Google Scholar] [CrossRef]
- Elmarakeby, H.A.; Hwang, D.; Arafeh, R.; Crowdis, J.; Gang, S.; Liu, D.; AlDubayan, S.H.; Salari, K.; Kregel, S.; Richter, C.; et al. Biologically informed deep neural network for prostate cancer discovery. Nature 2021, 598, 348–352. [Google Scholar] [CrossRef]
- van Hilten, A.; Kushner, S.A.; Kayser, M.; Ikram, M.A.; Adams, H.H.H.; Klaver, C.C.W.; Niessen, W.J.; Roshchupkin, G.V. GenNet framework: Interpretable deep learning for predicting phenotypes from genetic data. Commun. Biol. 2021, 4, 1094. [Google Scholar] [CrossRef]
- Ghosh Roy, G.; Geard, N.; Verspoor, K.; He, S. MPVNN: Mutated Pathway Visible Neural Network architecture for interpretable prediction of cancer-specific survival risk. Bioinformatics 2022, 38, 5026–5032. [Google Scholar] [CrossRef]
- Huang, X.; Huang, K.; Johnson, T.; Radovich, M.; Zhang, J.; Ma, J.; Wang, Y. ParsVNN: Parsimony visible neural networks for uncovering cancer-specific and drug-sensitive genes and pathways. NAR Genom. Bioinform. 2021, 3, lqab097. [Google Scholar] [CrossRef]
- Hao, J.; Kim, Y.; Kim, T.-K.; Kang, M. PASNet: Pathway-associated sparse deep neural network for prognosis prediction from high-throughput data. BMC Bioinform. 2018, 19, 510. [Google Scholar] [CrossRef]
- Singh, A.; Shannon, C.P.; Gautier, B.; Rohart, F.; Vacher, M.; Tebbutt, S.J.; Lê Cao, K.A. DIABLO: An integrative approach for identifying key molecular drivers from multi-omics assays. Bioinformatics 2019, 35, 3055–3062. [Google Scholar] [CrossRef]
- Argelaguet, R.; Velten, B.; Arnol, D.; Dietrich, S.; Zenz, T.; Marioni, J.C.; Buettner, F.; Huber, W.; Stegle, O. Multi-Omics Factor Analysis—A framework for unsupervised integration of multi-omics data sets. Mol. Syst. Biol. 2018, 14, e8124. [Google Scholar] [CrossRef]
- Wang, B.; Mezlini, A.M.; Demir, F.; Fiume, M.; Tu, Z.; Brudno, M.; Haibe-Kains, B.; Goldenberg, A. Similarity network fusion for aggregating data types on a genomic scale. Nat. Methods 2014, 11, 333–337. [Google Scholar] [CrossRef]
- Liu, J.; Capurro, D.D.; Nguyen, A.; Verspoor, K. Attention-based multimodal fusion with contrast for robust clinical prediction in the face of missing modalities. J. Biomed. Inform. 2023, 145, 104466. [Google Scholar] [CrossRef]
- Beaude, A.; Augé, M.; Zehraoui, F.; Hanczar, B. CrossAttOmics: Multiomics data integration with cross-attention. Bioinformatics 2025, 41, btaf302. [Google Scholar] [CrossRef]
- Baião, A.R.; Cai, Z.; Poulos, R.C.; Robinson, P.J.; Reddel, R.R.; Zhong, Q.; Vinga, S.; Gonçalves, E. A technical review of multi-omics data integration methods: From classical statistical to deep generative approaches. Brief. Bioinform. 2025, 26, bbaf355. [Google Scholar] [CrossRef]
- Li, X.; Pan, B.; He, Y.; Wang, Z.; Tang, Y.; Zhang, Y.; Wang, L.; Han, J. Deep learning–driven multi-omics analysis: Enhancing cancer diagnostics and treatments. Brief. Bioinform. 2025, 26, bbaf440. [Google Scholar] [CrossRef]
- Ballard, J.L.; Wang, Z.; Li, W.; Shen, L.; Long, Q. Deep learning-based approaches for multi-omics data integration and analysis. BioData Min. 2024, 17, 38. [Google Scholar] [CrossRef]
- Toussaint, P.A.; Leiser, F.; Thiebes, S.; Schlesner, M.; Brors, B.; Sunyaev, A. Towards explainable artificial intelligence for omics data: A systematic mapping study. Brief. Bioinform. 2024, 25, bbad453. [Google Scholar] [CrossRef]
- Zhou, Q.; Madala, N.S.; Huang, C. Pathway-guided architectures for interpretable AI in biological research. Comput. Struct. Biotechnol. J. 2025, 27, 4779–4791. [Google Scholar] [CrossRef]
- Fernando, Z.T.; Singh, J.; Anand, A. A study on the interpretability of neural retrieval models using DeepSHAP. In Proceedings of the 42nd International ACM SIGIR Conference, Paris, France, 21–25 July 2019; ACM: New York, NY, USA, 2019. [Google Scholar]
- Han, H.; Wang, W.Y.; Mao, B.H. Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning. In Advances in Intelligent Computing; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
- Colaprico, A. TCGAbiolinks: An R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 2016, 44, e71. [Google Scholar] [CrossRef]
- Mermel, C.H. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 2011, 12, R41. [Google Scholar] [CrossRef]
- Asamura, H. IASLC Lung Cancer Staging Project: The new database to inform revisions in the ninth edition of the TNM classification of lung cancer. J. Thorac. Oncol. 2023, 18, 564–575. [Google Scholar] [CrossRef]
- GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 2020, 369, 1318–1330. [Google Scholar] [CrossRef]
- Zhang, J.; Baran, J.; Cros, A.; Guberman, J.M.; Haider, S.; Hsu, J.; Liang, Y.; Rivkin, E.; Wang, J.; Whitty, B.; et al. International Cancer Genome Consortium Data Portal—A one-stop shop for cancer genomics data. Database 2011, bar026. [Google Scholar] [CrossRef]
- PCAWG Consortium. Pan-cancer analysis of whole genomes. Nature 2020, 578, 82–93. [Google Scholar] [CrossRef]
- Croft, D. Reactome: A database of reactions, pathways and biological processes. Nucleic Acids Res. 2011, 39, D691–D697. [Google Scholar] [CrossRef]
- Vaswani, A. Attention is all you need. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2017. [Google Scholar]
- Fan, R.-E.; Chang, K.-W.; Hsieh, C.-J.; Wang, X.-R.; Lin, C.-J. LIBLINEAR: A library for large linear classification. J. Mach. Learn. Res. 2008, 9, 1871–1874. [Google Scholar]
- Tibshirani, R. Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. Ser. B 1996, 58, 267–288. [Google Scholar] [CrossRef]
- Zou, H.; Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B 2005, 67, 301–320. [Google Scholar] [CrossRef]
- Domingos, P.; Pazzani, M. On the optimality of the simple Bayesian classifier under zero-one loss. Mach. Learn. 1997, 29, 103–130. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA, 2016. [Google Scholar]
- Loshchilov, I.; Hutter, F. Decoupled weight decay regularization. arXiv 2017, arXiv:1711.05101. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; IEEE: New York, NY, USA, 2015. [Google Scholar]
- Zou, X. Targeting the PDGF/PDGFR signaling pathway for cancer therapy: A review. Int. J. Biol. Macromol. 2022, 202, 539–557. [Google Scholar] [CrossRef]
- Raz, A.A.; Yamashita, Y.M. Rewinding the clock: Mechanisms of dedifferentiation. Curr. Opin. Genet. Dev. 2025, 93, 102353. [Google Scholar] [CrossRef]
- Ivleva, E.A.; Grivennikov, S.I. Microbiota-driven mechanisms at different stages of cancer development. Neoplasia 2022, 32, 100829. [Google Scholar] [CrossRef]
- D’Mello, S.A. Signaling pathways in melanogenesis. Int. J. Mol. Sci. 2016, 17, 1144. [Google Scholar] [CrossRef]
- Yang, S.R. Cigarette smoke induces proinflammatory cytokine release by activation of NF-κB and posttranslational modifications of histone deacetylase in macrophages. Am. J. Physiol. Lung Cell. Mol. Physiol. 2006, 291, L46–L57. [Google Scholar] [CrossRef]
- Odaka, Y. Dihydroartemisinin inhibits the mammalian target of rapamycin-mediated signaling pathways in tumor cells. Carcinogenesis 2014, 35, 192–200. [Google Scholar] [CrossRef]
- Cheng, J. The Caspase-3/PKCδ/Akt/VEGF-A signaling pathway mediates tumor repopulation during radiotherapy. Clin. Cancer Res. 2019, 25, 3732–3743. [Google Scholar] [CrossRef]
- Han, X. Nonsense-mediated mRNA decay: A nonsense pathway makes sense in stem cell biology. Nucleic Acids Res. 2018, 46, 1038–1051. [Google Scholar] [CrossRef]
- Martínez-Martínez, E. The potential role of leptin in the vascular remodeling associated with obesity. Int. J. Obes. 2014, 38, 1565–1572. [Google Scholar] [CrossRef]
- Garbincius, J.F.; Elrod, J.W. Mitochondrial calcium exchange in physiology and disease. Physiol. Rev. 2022, 102, 893–992. [Google Scholar] [CrossRef]
- Pritchard, J.E. Non-canonical Hedgehog signaling mediates profibrotic hematopoiesis-stroma crosstalk in myeloproliferative neoplasms. Cell Rep. 2024, 43, 113608. [Google Scholar] [CrossRef]
- Park, J. tRNA-derived small RNAs: Novel epigenetic regulators. Cancers 2020, 12, 2773. [Google Scholar] [CrossRef]





| Model | Configuration (Fixed Across Folds/Tasks) |
|---|---|
| AttentioFuse (3F/5X) | AdamW; initial lr = 0.01; batch size = 128; early stopping (patience = 10); gradient clipping (1.0); Reduce-on-Plateau (factor = 0.5, patience = 5); Kaiming initialization; LeakyReLU ( = 0.01). |
| IntegratalNet/MLP | Same cross-validation protocol; MLP with 3 hidden layers (256 units), LeakyReLU ( = 0.01), dropout = 0.30. |
| LogReg (L2) | Standard implementation with fixed regularization strength; stratified 5-fold CV. |
| LASSO/Elastic-Net | Standard implementation with fixed regularization strength (and fixed ratio for Elastic-Net). |
| Random Forest/XGBoost | Standard implementations with default settings. |
| GaussianNB | Default settings. |
| Cohort | Stage | IntegratalNet | AttentioFuse-3F | MLP | LogReg | XGBoost | LASSO | Elastic-Net | GNB | RF |
|---|---|---|---|---|---|---|---|---|---|---|
| LUAD | T | ACC: 0.81 ± 0.06 F1: 0.80 ± 0.03 | ACC: 0.89 ± 0.07 F1: 0.89 ± 0.08 | ACC: 0.73 ± 0.17 F1: 0.73 ± 0.12 | ACC: 0.59 ± 0.03 F1: 0.66 ± 0.03 | ACC: 0.87 ± 0.01 F1: 0.81 ± 0.02 | ACC: 0.68 ± 0.03 F1: 0.73 ± 0.03 | ACC: 0.59 ± 0.04 F1: 0.66 ± 0.03 | ACC: 0.87 ± 0.00 F1: 0.81 ± 0.00 | ACC: 0.87 ± 0.01 F1: 0.82 ± 0.01 |
| N | ACC: 0.56 ± 0.12 F1: 0.53 ± 0.19 | ACC: 0.87 ± 0.15 F1: 0.87 ± 0.16 | ACC: 0.55 ± 0.16 F1: 0.49 ± 0.17 | ACC: 0.60 ± 0.05 F1: 0.61 ± 0.05 | ACC: 0.66 ± 0.04 F1: 0.64 ± 0.04 | ACC: 0.60 ± 0.05 F1: 0.61 ± 0.05 | ACC: 0.60 ± 0.05 F1: 0.60 ± 0.05 | ACC: 0.64 ± 0.02 F1: 0.53 ± 0.03 | ACC: 0.70 ± 0.03 F1: 0.66 ± 0.03 | |
| M | ACC: 0.92 ± 0.01 F1: 0.89 ± 0.01 | ACC: 0.94 ± 0.02 F1: 0.92 ± 0.03 | ACC: 0.91 ± 0.01 F1: 0.89 ± 0.00 | ACC: 0.60 ± 0.00 F1: 0.70 ± 0.00 | ACC: 0.93 ± 0.00 F1: 0.90 ± 0.00 | ACC: 0.74 ± 0.04 F1: 0.80 ± 0.02 | ACC: 0.61 ± 0.02 F1: 0.70 ± 0.02 | ACC: 0.93 ± 0.00 F1: 0.90 ± 0.00 | ACC: 0.93 ± 0.00 F1: 0.90 ± 0.00 | |
| LUSC | T | ACC: 0.74 ± 0.06 F1: 0.71 ± 0.03 | ACC: 0.81 ± 0.11 F1: 0.82 ± 0.10 | ACC: 0.72 ± 0.11 F1: 0.70 ± 0.07 | ACC: 0.59 ± 0.03 F1: 0.64 ± 0.02 | ACC: 0.79 ± 0.03 F1: 0.73 ± 0.04 | ACC: 0.65 ± 0.03 F1: 0.68 ± 0.03 | ACC: 0.59 ± 0.03 F1: 0.64 ± 0.02 | ACC: 0.82 ± 0.00 F1: 0.74 ± 0.00 | ACC: 0.82 ± 0.01 F1: 0.74 ± 0.00 |
| N | ACC: 0.55 ± 0.11 F1: 0.50 ± 0.13 | ACC: 0.78 ± 0.15 F1: 0.77 ± 0.16 | ACC: 0.61 ± 0.03 F1: 0.60 ± 0.04 | ACC: 0.58 ± 0.04 F1: 0.59 ± 0.05 | ACC: 0.63 ± 0.04 F1: 0.60 ± 0.05 | ACC: 0.60 ± 0.04 F1: 0.60 ± 0.04 | ACC: 0.57 ± 0.04 F1: 0.57 ± 0.04 | ACC: 0.65 ± 0.02 F1: 0.53 ± 0.03 | ACC: 0.62 ± 0.03 F1: 0.57 ± 0.05 | |
| M | ACC: 0.98 ± 0.01 F1: 0.98 ± 0.00 | ACC: 0.98 ± 0.01 F1: 0.98 ± 0.00 | ACC: 0.98 ± 0.01 F1: 0.98 ± 0.00 | ACC: 0.68 ± 0.00 F1: 0.80 ± 0.00 | ACC: 0.99 ± 0.00 F1: 0.98 ± 0.00 | ACC: 0.93 ± 0.02 F1: 0.95 ± 0.01 | ACC: 0.71 ± 0.03 F1: 0.81 ± 0.02 | ACC: 0.99 ± 0.00 F1: 0.98 ± 0.00 | ACC: 0.99 ± 0.00 F1: 0.98 ± 0.00 |
| Cohort | Task | Variant | ACC (Mean ± SD) | F1 (Mean ± SD) |
|---|---|---|---|---|
| LUAD | T | 3F | 0.89 ± 0.07 | 0.89 ± 0.08 |
| LUAD | T | 5X | 0.92 ± 0.04 | 0.95 ± 0.06 |
| LUAD | N | 3F | 0.87 ± 0.15 | 0.87 ± 0.16 |
| LUAD | N | 5X | 0.73 ± 0.16 | 0.76 ± 0.18 |
| LUAD | M | 3F | 0.94 ± 0.02 | 0.92 ± 0.03 |
| LUAD | M | 5X | 0.99 ± 0.00 | 0.99 ± 0.00 |
| LUSC | T | 3F | 0.81 ± 0.11 | 0.82 ± 0.10 |
| LUSC | T | 5X | 0.96 ± 0.02 | 0.93 ± 0.00 |
| LUSC | N | 3F | 0.78 ± 0.15 | 0.77 ± 0.16 |
| LUSC | N | 5X | 0.80 ± 0.15 | 0.80 ± 0.10 |
| LUSC | M | 3F | 0.98 ± 0.01 | 0.98 ± 0.00 |
| LUSC | M | 5X | 0.99 ± 0.00 | 1.00 ± 0.00 |
| Feature | LUAD Signature | LUSC Signature |
|---|---|---|
| Driver alterations | EGFR/KRAS/ALK mutations | PIK3CA/SOX2 amplifications |
| Metabolic reprogramming | CPT1A-driven fatty acid oxidation dominance | HK2/PKM2-driven glycolytic flux amplification |
| Immune landscape | CD68-marked PD-L1+ TAM-enriched microenvironment | CD8A-marked CD8+ T-cell infiltrated microenvironment |
| Metastatic pattern | Hematogenous (bone/brain) via AKT–VEGF angiogenesis | Locoregional (mediastinum) via Hippo–Notch crosstalk |
| Cohort | Modality | Cross-Omics | Feature-Level | Fusion-Layer | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| T | N | M | T | N | M | T | N | M | ||
| LUAD | SNV | 0.17 | 0.58 | 0.62 | 0.10 | 0.55 | 0.10 | 352.44 | 1969.28 | 363.14 |
| CNV | – | – | – | 0.10 | 0.54 | 0.10 | 356.30 | 1935.07 | 367.40 | |
| Transcriptome | – | – | – | 0.10 | 0.54 | 0.10 | 360.97 | 1963.64 | 352.20 | |
| LUSC | SNV | 0.16 | 0.19 | 0.21 | 0.12 | 0.12 | 0.12 | 514.42 | 509.82 | 515.76 |
| CNV | – | – | – | 0.12 | 0.12 | 0.12 | 518.30 | 512.27 | 527.57 | |
| Transcriptome | – | – | – | 0.12 | 0.12 | 0.12 | 521.04 | 515.37 | 517.99 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Huang, Y.; He, Y.; Zeng, L.; Liu, L.; Zhong, F. From Black Box to Biological Insight: AttentioFuse Unlocks Multi-Omics Dynamics in Lung Cancer. Cancers 2026, 18, 878. https://doi.org/10.3390/cancers18050878
Huang Y, He Y, Zeng L, Liu L, Zhong F. From Black Box to Biological Insight: AttentioFuse Unlocks Multi-Omics Dynamics in Lung Cancer. Cancers. 2026; 18(5):878. https://doi.org/10.3390/cancers18050878
Chicago/Turabian StyleHuang, Yuhang, Yungang He, Liyan Zeng, Lei Liu, and Fan Zhong. 2026. "From Black Box to Biological Insight: AttentioFuse Unlocks Multi-Omics Dynamics in Lung Cancer" Cancers 18, no. 5: 878. https://doi.org/10.3390/cancers18050878
APA StyleHuang, Y., He, Y., Zeng, L., Liu, L., & Zhong, F. (2026). From Black Box to Biological Insight: AttentioFuse Unlocks Multi-Omics Dynamics in Lung Cancer. Cancers, 18(5), 878. https://doi.org/10.3390/cancers18050878

