SCAPeSCLC: An Integrated Spatial Transcriptomic and Bayesian Pathway Enrichment Dataset for Survival Modeling in Extensive-Stage Small Cell Lung Cancer
Abstract
1. Summary
2. Data Description
2.1. Dataset Components and Characteristics
2.2. Patient Demographics and Baseline Clinical Characteristics
Principal Component Analysis of Cohort Structure
2.3. Expression of Cancer Transcriptome Atlas Genes
2.3.1. ROI Allocation and Global Gene Expression Distributions
2.3.2. SCLC Subtype Markers and Key Cancer-Related Genes
2.3.3. Patient Clustering Based on Subtype Marker Expression
2.4. Survival Metrics and Proportional Hazards
2.4.1. Survival Endpoints, Outcome Events and Time-to-Event Intervals
2.4.2. CTA Gene Expression Survival Analysis
2.5. Cancer Transcriptome Atlas Biological Pathway Enrichment and Proportional Hazards
2.5.1. CTA Biological Pathway Enrichment at ROI and Patient Level
2.5.2. Bayesian CTA Biological Pathway Enrichment Posteriors at Patient Level
2.5.3. Bayesian CTA Pathway Enrichment Survival Analysis
3. Methods
3.1. Data Structure and Cohort Integration
3.1.1. Source Datasets
3.1.2. Data Retrieval and File Processing
- Normalized DSP expression matrices (XLSX format)
- Series matrix metadata files (TXT format)
- CTA probe and pathway annotation files (PKC format)
3.1.3. Assessment of Cohort Compatibility
3.2. Gene Expression Processing
3.2.1. ROI and Patient-Level Expression Matrices
3.2.2. Exploratory SCLC Subtype Clustering
3.3. Clinical Metadata and Survival Endpoint Construction
3.3.1. Metadata Integration
3.3.2. Definition of Survival Endpoints
- Disease progression
- Death due to disease
- Death from any cause
- Time on treatment (ToT): last dose date—first dose date
- PFS: progression date—first dose date
- DSS: death of disease date—first dose date
- OS: death date—first dose date
3.4. CTA Biological Pathway Enrichment
3.4.1. ROI and Patient-Level Pathway Scoring
3.4.2. Patient-Level Bayesian Estimation of Pathway Enrichment Posteriors
3.5. Survival Modeling
Cox Proportional Hazards Modeling
- Patient-level scaled gene expression variables
- Patient-level Bayesian pathway enrichment posteriors
3.6. Statistical Analysis
3.7. SCAPeSCLC GitHub Repository
4. User Notes
Supplementary Materials
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| CPH | Cox Proportional Hazards |
| CTA | Cancer Transcriptome Atlas |
| DSP | Digital Spatial Profiler |
| DSS | Disease-Specific Survival |
| ES-SCLC | Extensive-Stage Small Cell Lung Cancer |
| GEO | Gene Expression Omnibus |
| KM | Kaplan–Meier |
| OS | Overall Survival |
| PCA | Principal Component Analysis |
| PFS | Progression-Free Survival |
| ROI | Region of Interest |
| SCLC | Small Cell Lung Cancer |
| ToT | Time on Treatment |
| TTP | Time to Progression |
References
- Yan, J.S.; Wang, Y.Z.; Zhao, S.Y.; Na, F.J.; Cui, Z.X.; Cai, L.T.; Li, H.M.; Zhao, M.F. Immunotherapy for Extensive-Stage Small Cell Lung Cancer: Current Status, Challenges and Future Strategies. Cancer Lett. 2026, 645, 218382. [Google Scholar] [CrossRef] [PubMed]
- Durer, S.; Fu, P.; Chen, Z.; Dowlati, A. Roles of Restricted Mean Survival Time and Restricted Mean Time Lost in Evaluating Immune Checkpoint Inhibitor Efficacy for Extensive-Stage Small Cell Lung Cancer. Cancer Res. Commun. 2026, 6, 77–84. [Google Scholar] [CrossRef] [PubMed]
- Cheema, P.K.; Perdrizet, K.A.; Sangha, R.S.; Breadner, D.; Daaboul, N.; Farley, S.; Jao, K.; Liu, G.; Logan, B.; Melosky, B.; et al. Early Experience with Tarlatamab (T-Cell Engagers) for Extensive-Stage Small Cell Lung Cancer (ES-SCLC) in Canada: Lessons Learned and Implementation Strategies. Curr. Oncol. 2026, 33, 84. [Google Scholar] [CrossRef] [PubMed]
- Paz-Ares, L.G.; O’Byrne, K.J.; Johnson, M.L.; Reck, M.; Girard, N.; Hayashi, H.; Zhou, C.; Gharpure, V.S.; Pisupati, R.; Pacius, M.; et al. TIGOS Trial: A Randomized, Double-Blind, Phase III Trial of Atigotatug and Nivolumab Fixed-Dose Combination With Chemotherapy Versus Atezolizumab With Chemotherapy as First-Line Therapy in Patients With Extensive-Stage Small Cell Lung Cancer. Clin. Lung Cancer 2026, 27, 140–145. [Google Scholar] [CrossRef] [PubMed]
- Loaiza-Moss, J.; Leitges, M. Pan-Cancer Landscape of Protein Kinase D3: An Integrative TCGA Multi-Omics Analysis of Clinical, Molecular, and Immunological Roles. PLoS ONE 2026, 21, e0346173. [Google Scholar] [CrossRef] [PubMed]
- Liu, N.; Bhuva, D.D.; Mohamed, A.; Bokelund, M.; Kulasinghe, A.; Tan, C.W.; Davis, M.J. StandR: Spatial Transcriptomic Analysis for GeoMx DSP Data. Nucleic Acids Res. 2024, 52, e2. [Google Scholar] [CrossRef] [PubMed]
- Thomsen, C.; Truumees, B.; Nielsen, S.; Schnack, B.; Sara, N.; Newell, R.; Thorsen, K.; Røge, R. GeoMx and RNAscope: A Comparative Assessment of Their Utility for Spatial MRNA Expression Profiling in Formalin-Fixed Breast Cancer Tissue. Cytom. Part A 2026, 109, 147–154. [Google Scholar] [CrossRef] [PubMed]
- Peressini, M.; Garcia-Campelo, R.; Massuti, B.; Martí, C.; Cobo, M.; Gutiérrez, V.; Dómine, M.; Fuentes, J.; Majem, M.; de Castro, J.; et al. Spatially Preserved Multi-Region Transcriptomic Subtyping and Biomarkers of Chemoimmunotherapy Outcome in Extensive-Stage Small Cell Lung Cancer. Clin. Cancer Res. 2024, 30, 3036–3049. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Z.; Sun, X.; Liu, Y.; Zhang, Y.; Yang, Z.; Dong, J.; Wang, N.; Ying, J.; Zhou, M.; Yang, L. Spatial Transcriptome-Wide Profiling of Small Cell Lung Cancer Reveals Intra-Tumoral Molecular and Subtype Heterogeneity. Adv. Sci. 2024, 11, 2402716. [Google Scholar] [CrossRef] [PubMed]
- del Rey-Vergara, R.; Galindo-Campos, M.A.; Rocha, P.; Carpes, M.; Martínez, C.; Masfarré, L.; Menéndez, S.; Quimis, F.; Rossell, A.; Iñañez, A.; et al. MET Pathway Inhibition Increases Chemo-Immunotherapy Efficacy in Small Cell Lung Cancer. Cell Rep. Med. 2025, 6, 102194. [Google Scholar] [CrossRef] [PubMed]
- Donati, B.; Manzotti, G.; Torricelli, F.; Ascione, C.; Valli, R.; Santandrea, G.; Ragazzi, M.; Zanetti, E.; Ciarrocchi, A.; Piana, S. Digital Spatial Profiling for Pathologists. Virchows Arch. 2025, 486, 971–981. [Google Scholar] [CrossRef] [PubMed]
- Merritt, C.R.; Ong, G.T.; Church, S.E.; Barker, K.; Danaher, P.; Geiss, G.; Hoang, M.; Jung, J.; Liang, Y.; McKay-Fleisch, J.; et al. Multiplex Digital Spatial Profiling of Proteins and RNA in Fixed Tissue. Nat. Biotechnol. 2020, 38, 586–599. [Google Scholar] [CrossRef] [PubMed]
- Yu, Z.; Zou, J.; Xu, F. The Molecular Subtypes of Small Cell Lung Cancer Defined by Key Transcription Factors and Their Clinical Significance. Lung Cancer 2024, 198, 108033. [Google Scholar] [CrossRef] [PubMed]
- Baine, M.K.; Hsieh, M.S.; Lai, W.V.; Egger, J.V.; Jungbluth, A.A.; Daneshbod, Y.; Beras, A.; Spencer, R.; Lopardo, J.; Bodd, F.; et al. SCLC Subtypes Defined by ASCL1, NEUROD1, POU2F3, and YAP1: A Comprehensive Immunohistochemical and Histopathologic Characterization. J. Thorac. Oncol. 2020, 15, 1823–1835. [Google Scholar] [CrossRef] [PubMed]
- Lo, Y.C.; Rivera-Concepcion, J.; Vasmatzis, G.; Aubry, M.C.; Leventakos, K. Subtype of SCLC Is an Intrinsic and Persistent Feature Through Systemic Treatment. JTO Clin. Res. Rep. 2023, 4, 100561. [Google Scholar] [CrossRef] [PubMed]
- Zhu, Y.; Li, S.; Wang, H.; Ren, W.; Chi, K.; Wu, J.; Mao, L.; Huang, X.; Zhuo, M.; Lin, D. Molecular Subtypes, Predictive Markers and Prognosis in Small-Cell Lung Carcinoma. J. Clin. Pathol. 2024, 78, 42–50. [Google Scholar] [CrossRef] [PubMed]
- Cheng, D.T.; Mitchell, T.N.; Zehir, A.; Shah, R.H.; Benayed, R.; Syed, A.; Chandramohan, R.; Liu, Z.Y.; Won, H.H.; Scott, S.N.; et al. Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT): A Hybridization Capture-Based next-Generation Sequencing Clinical Assay for Solid Tumor Molecular Oncology. J. Mol. Diagn. 2015, 17, 251–264. [Google Scholar] [CrossRef] [PubMed]
- Rudin, C.M.; Brambilla, E.; Faivre-Finn, C.; Sage, J. Small-Cell Lung Cancer. Nat. Rev. Dis. Prim. 2021, 7, 3. [Google Scholar] [CrossRef] [PubMed]
- Ireland, A.S.; Micinski, A.M.; Kastner, D.W.; Guo, B.; Wait, S.J.; Spainhower, K.B.; Conley, C.C.; Chen, O.S.; Guthrie, M.R.; Soltero, D.; et al. MYC Drives Temporal Evolution of Small Cell Lung Cancer Subtypes by Reprogramming Neuroendocrine Fate. Cancer Cell 2020, 38, 60–78.e12. [Google Scholar] [CrossRef] [PubMed]
- Cho, C.; Haddadi, N.S.; Kidacki, M.; Woodard, G.A.; Shakiba, S.; Yıldız-Altay, Ü.; Richmond, J.M.; Vesely, M.D. Spatial Transcriptomics in Inflammatory Skin Diseases Using GeoMx Digital Spatial Profiling: A Practical Guide for Applications in Dermatology. JID Innov. 2025, 5, 100317. [Google Scholar] [CrossRef] [PubMed]
- Hernandez, S.; Lazcano, R.; Serrano, A.; Powell, S.; Kostousov, L.; Mehta, J.; Khan, K.; Lu, W.; Solis, L.M. Challenges and Opportunities for Immunoprofiling Using a Spatial High-Plex Technology: The NanoString GeoMx® Digital Spatial Profiler. Front. Oncol. 2022, 12, 890410. [Google Scholar] [CrossRef] [PubMed]
- Liberzon, A.; Birger, C.; Thorvaldsdóttir, H.; Ghandi, M.; Mesirov, J.P.; Tamayo, P. The Molecular Signatures Database Hallmark Gene Set Collection. Cell Syst. 2015, 1, 417–425. [Google Scholar] [CrossRef] [PubMed]
- Hukku, A.; Quick, C.; Luca, F.; Pique-Regi, R.; Wen, X. BAGSE: A Bayesian Hierarchical Model Approach for Gene Set Enrichment Analysis. Bioinformatics 2020, 36, 1689–1695. [Google Scholar] [CrossRef] [PubMed]
- Tang, Z.; Kang, B.; Li, C.; Chen, T.; Zhang, Z. GEPIA2: An Enhanced Web Server for Large-Scale Expression Profiling and Interactive Analysis. Nucleic Acids Res. 2019, 47, W556–W560. [Google Scholar] [CrossRef] [PubMed]
- Kang, Y.J.; Pan, L.; Liu, Y.; Rong, Z.; Liu, J.; Liu, F. GEPIA3: Enhanced Drug Sensitivity and Interaction Network Analysis for Cancer Research. Nucleic Acids Res. 2025, 53, W283–W290. [Google Scholar] [CrossRef] [PubMed]
- Lánczky, A.; Győrffy, B. Web-Based Survival Analysis Tool Tailored for Medical Research (KMplot): Development and Implementation. J. Med. Internet Res. 2021, 23, e27633. [Google Scholar] [CrossRef] [PubMed]
- Wang, Z.; Lachmann, A.; Ma’ayan, A. Mining Data and Metadata from the Gene Expression Omnibus. Biophys. Rev. 2019, 11, 103–110. [Google Scholar] [CrossRef] [PubMed]
- Jäger, N. Bioinformatics Workflows for Clinical Applications in Precision Oncology. Semin. Cancer Biol. 2022, 84, 103–112. [Google Scholar] [CrossRef] [PubMed]
- Campisi, M.; Osaki, T.; Dryg, I.; Stornante, C.; Wolff, J.; Weirather, J.; Weaver, N.; Tarannum, M.; Gillanders, I.; Bers, A.; et al. Vascular STING Activation Facilitates NK Cell Anti-Tumor Immunity in Small Cell Lung Cancer. Cancer Cell 2026, 44, 858–878.e16. [Google Scholar] [CrossRef] [PubMed]









| Data Series | Platform | Sample (N) | Patient (N) | Cohort | Treatment | Ref. |
|---|---|---|---|---|---|---|
| GSE261348 | GeoMx DSP | 175 | 32 | IMfirst | Atezolizumab | [8] |
| GSE261345 | GeoMx DSP | 121 | 26 | CANTABRICO | Durvalumab | [8,10] |
| Variable | Combined Cohort (N = 58) | Cohort | ||||
|---|---|---|---|---|---|---|
| Atezolizumab (N = 32) | Durvalumab (N = 26) | p-Value | ||||
| Age (yr) | 64.5 ± 8.02 | 62.9 ± 8.57 | 66.3 ± 7.00 | 0.101 * | ||
| Sex | Female | 19 (32.8%) | 9 (28.1%) | 10 (38.5%) | 0.404 † | |
| Male | 39 (67.2%) | 23 (71.9%) | 16 (61.5%) | |||
| Smoking Status | Current | 28 (48.3%) | 13 (40.6%) | 15 (57.7%) | 0.196 † | |
| Former | 30 (51.7%) | 19 (59.4%) | 11 (42.3%) | |||
| Platinum Agent | Carboplatin | 50 (86.2%) | 27 (84.4%) | 23 (88.5%) | 0.720 ‡ | |
| Cisplatin | 8 (13.8%) | 5 (15.6%) | 3 (11.5%) | |||
| ECOG Performance | 0 | 16 (27.6%) | 10 (31.25%) | 6 (23.1%) | 0.833 ‡ | |
| 1 | 38 (65.5%) | 20 (62.50%) | 18 (69.2%) | |||
| 2 | 4 (6.9%) | 2 (6.25%) | 2 (7.7%) | |||
| Metastasis | CNS | Yes | 8 (13.8%) | 6 (18.75%) | 2 (7.7%) | 0.278 ‡ |
| No | 50 (86.2%) | 26 (81.25%) | 24 (92.3%) | |||
| Liver | Yes | 20 (34.5%) | 11 (34.4%) | 9 (34.6%) | 0.985 † | |
| No | 38 (65.5%) | 21 (65.6%) | 17 (65.4%) | |||
| Bone | Yes | 16 (27.6%) | 4 (12.5%) | 12 (46.15%) | 0.007 ‡ | |
| No | 42 (72.4%) | 28 (87.5%) | 14 (53.85%) | |||
| Overall | Yes | 36 (62.1%) | 17 (53.1%) | 19 (73.1%) | 0.119 † | |
| No | 22 (37.9%) | 15 (46.9%) | 7 (26.9%) | |||
| Cohort | Statistics | Endpoint | ||
|---|---|---|---|---|
| PFS | DSS | OS | ||
| Atezolizumab (N = 32) | Events (N) | 25 | 21 | 23 |
| Censored (N) | 7 | 11 | 9 | |
| Median (months) | 6.90 | 13.0 | 13.0 | |
| 95% CI (months) | 6.95–13.2 | 11.8–18.3 | 11.8–18.3 | |
| Durvalumab (N = 26) | Events (N) | 23 | 19 | 21 |
| Censored (N) | 3 | 7 | 5 | |
| Median (months) | 6.29 | 10.1 | 10.1 | |
| 95% CI (months) | 5.79–11.9 | 9.22–16.4 | 11.8–18.3 | |
| Combined (N = 58) | Events (N) | 48 | 40 | 44 |
| Censored (N) | 10 | 18 | 14 | |
| Median (months) | 6.75 | 11.8 | 11.8 | |
| 95% CI (months) | 7.38–11.7 | 11.7–16.4 | 11.7–16.4 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Shirvaliloo, M. SCAPeSCLC: An Integrated Spatial Transcriptomic and Bayesian Pathway Enrichment Dataset for Survival Modeling in Extensive-Stage Small Cell Lung Cancer. Data 2026, 11, 152. https://doi.org/10.3390/data11070152
Shirvaliloo M. SCAPeSCLC: An Integrated Spatial Transcriptomic and Bayesian Pathway Enrichment Dataset for Survival Modeling in Extensive-Stage Small Cell Lung Cancer. Data. 2026; 11(7):152. https://doi.org/10.3390/data11070152
Chicago/Turabian StyleShirvaliloo, Milad. 2026. "SCAPeSCLC: An Integrated Spatial Transcriptomic and Bayesian Pathway Enrichment Dataset for Survival Modeling in Extensive-Stage Small Cell Lung Cancer" Data 11, no. 7: 152. https://doi.org/10.3390/data11070152
APA StyleShirvaliloo, M. (2026). SCAPeSCLC: An Integrated Spatial Transcriptomic and Bayesian Pathway Enrichment Dataset for Survival Modeling in Extensive-Stage Small Cell Lung Cancer. Data, 11(7), 152. https://doi.org/10.3390/data11070152
