1. Introduction
Triple-negative breast cancer (TNBC) accounts for approximately 15–20% of all breast cancer cases and is defined by the absence of estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) expression [
1,
2]. This molecular profile renders TNBC particularly challenging from a clinical standpoint, resulting in an unfavorable prognosis, high rates of recurrence, and limited treatment options compared to other breast cancer subtypes [
3].
The global burden of TNBC is substantial: although TNBC represents a minority of breast cancer cases by histology, it disproportionately contributes to breast-cancer-related mortality and disability-adjusted life years (DALYs) due to its aggressive course and the absence of targeted therapeutic options analogous to those available for hormone-receptor-positive or HER2-positive disease [
3]. This unmet clinical need has driven extensive efforts to identify actionable molecular vulnerabilities in TNBC.
A central obstacle to durable treatment of TNBC is the emergence of drug resistance, which arises in large part from mutations and dysregulation of apoptotic pathways and from altered mitochondrial dynamics that together raise the apoptotic threshold of tumor cells [
4,
5,
6]. Because resistance is frequently rooted in the very survival/apoptosis circuitry that our Boolean model represents, computational strategies that explicitly target the malignant-to-apoptotic state transition are well positioned to address this challenge. Complementary to direct pathway targeting, epigenetic dysregulation is also a major driver of TNBC tumorigenesis, and histone deacetylase (HDAC) inhibitors have emerged as a promising therapeutic class for this subtype [
7]; notably, HDAC1 features as a transcriptional-repressor node in the regulatory network analyzed here, linking our network-based approach to this actively investigated drug class.
The computational identification of therapeutic targets in TNBC has evolved through distinct methodological generations. The first generation, exemplified by Tilli et al. [
8], pioneered the use of protein–protein interaction networks to identify five therapeutic targets. However, this approach faced critical limitations: low druggability (only two of five targets had available inhibitors) and high complexity, requiring the simultaneous inhibition of five genes. A second generation emerged with Sgariglia et al. [
9], who introduced a Boolean network model to reduce the target set to three. While an improvement, this approach relied on heuristic methods sensitive to noise, and the druggability of the identified targets remained a concern.
Boolean network modeling has a long and well-established history in systems biology as a framework for capturing the qualitative logic of gene regulatory networks [
10,
11]. It has been applied successfully to diverse signaling and disease contexts, including T-cell activation [
12], survival signaling in leukemia [
13], the tumor-suppressive and pro-apoptotic activities of p53 [
4], cell cycle control [
14], and the integrated p53–p21–pRB axis governing cell cycle progression [
5]. This body of work establishes Boolean modeling as a mature and appropriate abstraction for large-scale regulatory analysis in cancer, and motivates its use as the dynamical backbone of the present hybrid framework.
In parallel to computational target identification, experimental methodologies have advanced cancer research through complementary perspectives. Nanoscale techniques such as atomic force microscopy enable molecular-level characterization of cancer biomarkers and have been recently reviewed in the context of oncology and neurodegeneration, although they face challenges including calibration complexity and limited throughput [
15]. Microfluidic platforms and organ-on-chip systems provide dynamic environments for liquid biopsy and tumor-cell phenotyping, with recent integration of artificial intelligence for diagnostic applications, although typical implementations still lack inter-organ crosstalk [
16]. These experimental approaches and computational frameworks such as ours play complementary roles: experimental platforms validate molecular mechanisms in controlled settings, while computational methods enable systematic exploration of the combinatorial target space at a scale that is experimentally prohibitive.
To overcome these challenges, this work proposes a novel hybrid computational methodology that introduces a new level of rigor to therapeutic target identification. Our approach combines the strengths of Boolean network modeling with the mathematical robustness of semidefinite programming (SDP) optimization. The integration of deep learning into computational biology has, in recent years, transformed the field, from foundational advances in representation learning [
17,
18] and the transformer architecture [
19] to landmark applications such as protein-structure prediction [
20] and drug–target affinity modeling [
21]. We leverage this trajectory by incorporating a genomic deep learning model as an orthogonal source of computational support. The broader context of AI in drug discovery and target identification has been reviewed critically by Zhang et al. [
22], Wenteler et al. [
23], and Pun et al. [
24], who collectively emphasize that computational predictions—including structure-based [
25,
26] and generative approaches [
27]—must be regarded as hypothesis-generating tools rather than validated therapeutic conclusions, a framing we adopt explicitly throughout this work [
28,
29]. Crucially, we move beyond single-point estimates and introduce a comprehensive validation framework that includes (1) rigorous statistical analysis with uncertainty quantification, (2) sensitivity analysis to test the robustness of our findings, and (3) orthogonal computational support from AlphaGenome [
30], a state-of-the-art deep learning model from Google DeepMind trained on vast genomic and transcriptomic datasets. By applying this methodology, we identify a therapeutic pair with superior druggability, reduced complexity, and a synergistic mechanism computationally supported by multiple independent computational approaches.
3. Discussion
This study introduced a hybrid computational methodology that successfully identified TK1 and VIM as a robust and highly druggable therapeutic pair for TNBC. Our work addresses key limitations of previous computational approaches by integrating SDP-based optimization with a multi-level validation framework, setting a new standard for rigor in the field.
3.1. SDP Optimization vs. Heuristic Methods
The use of SDP represents a significant methodological advancement. Unlike heuristic methods, which can be sensitive to network topology and initial conditions, SDP provides a mathematically principled way to find globally optimal or near-optimal solutions for combinatorial problems. This robustness was evident in our sensitivity analysis, where the superiority of the TK1–VIM pair was maintained across a wide range of model parameters. This suggests that our finding is a fundamental property of the network’s structure, not an artifact of our chosen parameters.
3.2. The Scientific Rationale for TK1–VIM Synergy
Our results provide a strong computational basis for the synergistic effect of inhibiting TK1 and VIM. Thymidine Kinase 1 (TK1) is a key enzyme in the DNA synthesis salvage pathway, and its overexpression is a well-known marker of cell proliferation. Vimentin (VIM) is a cytoskeletal protein crucial for epithelial-to-mesenchymal transition (EMT), a process that enables cancer cells to metastasize and invade other tissues. By simultaneously targeting proliferation (TK1) and invasion (VIM), our proposed therapy attacks two fundamental pillars of cancer progression. The AlphaGenome predictions reinforce this rationale: the dual knockdown is expected to perturb cell-cycle- and EMT-related gene expression in tumor cells while inducing negligible transcriptomic disruption of normal mammary epithelium (mean
,
Section 2.6.3).
The corrected AlphaGenome analysis (
Section 2.6), stratified by mammary compartment, reveals a biologically coherent expression pattern (
Supplementary Table S4): VIM exhibits a 2.2-fold gradient between mammary stroma (0.317) and mammary epithelium (0.143), in line with its canonical role as a mesenchymal marker. The therapeutic window for VIM inhibition in TNBC, therefore, rests primarily on the strong additional upregulation of VIM in EMT-driven mesenchymal-like TNBC cells
over and above this epithelial baseline, rather than on a uniformly low baseline in normal mammary tissue. For TK1, although the predicted mRNA-seq signal in mammary epithelium is moderate (0.159), the well-documented absence of TK1 protein in quiescent non-dividing cells (immunohistochemistry,
) supports a wide therapeutic window in practice.
The role of the tumor microenvironment (TME) further supports the rationale for VIM targeting in mesenchymal-like TNBC. The stromal compartment of TNBC tumors is characterized by activated cancer-associated fibroblasts that secrete EMT-inducing signals—notably TGF-
, which is represented in our Boolean network as an upstream regulator of SMAD2 and VIM—promoting paracrine reinforcement of the mesenchymal phenotype in tumor cells [
31]. Disruption of VIM in this context is expected to act both cell-autonomously, by destabilizing the mesenchymal cytoskeleton of invasive tumor cells, and at the tumor–stroma interface, by impairing mechanotransduction-driven invasion. The pronounced VIM signal observed in our stromal compartment analysis (0.317 in mammary stroma vs. 0.143 in epithelium) is consistent with this dual relevance. We acknowledge, however, that the present computational framework does not directly model TME-specific dynamics; this is an important avenue for future work.
3.3. Implications of Multi-Level Validation
The convergence of results from multiple, independent computational methods is the cornerstone of our study’s confidence. That a curated, knowledge-driven Boolean model and a data-driven, large-scale deep learning model (AlphaGenome) produce coherent predictions in the same target context provides triangulation across modeling paradigms. The fundamental difference in the underlying paradigms—a mechanistic, logic-based Boolean simulation versus a statistical, sequence-to-expression deep learning model—ensures that the convergence is methodologically orthogonal. The agreement between these two distinct approaches strongly suggests that the identified therapeutic synergy of TK1–VIM is a robust computational prediction rather than an artifact of a specific modeling technique.
Data independence and potential biases: It is important to address the potential for circularity in our computational support. The Boolean model was constructed using data from the MDA-MB-231 cell line. While AlphaGenome is a foundational model trained on vast, diverse genomic datasets across multiple tissues and species, we acknowledge the possibility that MDA-MB-231 transcriptomic profiles were included in its extensive training corpus. Furthermore, AlphaGenome operates at the sequence-to-expression level on normal tissue ontologies and is not TNBC-specific; the analysis reported here, therefore, characterizes the predicted impact of inhibition in normal mammary tissue rather than functionally validating the dynamic response of TNBC cells.
This multi-level computational approach—combining mechanistic simulation, statistical perturbation, and orthogonal data-driven model support—provides a powerful template for future computational drug discovery efforts, increasing the likelihood that in silico findings will translate to preclinical and clinical success when complemented by experimental validation.
3.4. Limitations and Future Work
While our study provides strong computational evidence for the TK1–VIM pair, several limitations should be acknowledged explicitly.
Binary-state simplification. Boolean network modeling reduces complex biological processes to discrete ON/OFF states. This abstraction does not capture graded signaling dynamics, the continuous nature of gene expression, stochastic effects, or temporal regulatory behavior that operate on different timescales in TNBC systems. Consequently, the Boolean framework cannot represent the dose-dependent (IC50-like) nature of pharmacological inhibition: inhibiting TK1 or VIM clinically is a continuous concentration–response problem, whereas our model treats inhibition as a binary zero-clamp. This is an important caveat when translating in silico findings to dosing strategies in preclinical or clinical settings.
Network construction and assumptions. The 131-gene regulatory network employed here was inherited directly from Sgariglia et al. (2024) [
9] and was constructed by curation from the literature rather than ab initio inference from high-throughput data. The 29 core regulatory interactions reflect the canonical TNBC pathways (STAT3, PI3K/AKT, HSP90 chaperone network, p53/MDM2, intrinsic apoptosis, NF-
B, HIF1
, cell cycle, TGF
/SMAD2, HDAC1-mediated repression). The Boolean update rules are nested canalizing functions in the sense of Kauffman et al. (2004) [
32]. While these choices are consistent with established systems-biology practice, alternative logical encodings (random Boolean functions, threshold functions) or inferred edges would be expected to influence specific predictions. The convergence of three independent SDP formulations on the same TK1–VIM pair, across 75 weight combinations, supports interpretation of the result as a structural property of the canonical network rather than an artefact of any single rule set.
Absence of experimental validation. We acknowledge explicitly that the proposed TK1–VIM therapeutic pair has not been validated in vitro or in vivo within the present study; the framework reported here is entirely computational. The orthogonal computational support from AlphaGenome is supportive but is not a substitute for wet-lab validation. A staged experimental programme would include (i) genetic loss-of-function studies in MDA-MB-231 and additional TNBC cell lines representative of the major molecular subtypes (BL1, BL2, M, MSL, LAR; e.g., HCC1937, MDA-MB-468, BT-549) using siRNA, shRNA, or CRISPR-Cas9 ablation of TK1 and VIM, individually and in combination; (ii) pharmacological perturbation with available inhibitors (3′-azido-3′-deoxythymidine and related nucleoside analogues for TK1; Withaferin A and FiVe1 for VIM) with formal IC50 determination and Bliss- or Loewe-based synergy quantification; (iii) phenotypic readouts of proliferation, apoptosis, migration and invasion appropriate to a mesenchymal-invasive phenotype; (iv) in vivo validation in patient-derived xenograft or syngeneic murine TNBC models, with attention to the tumor-microenvironment interactions discussed in
Section 3.2.
Generalizability across TNBC subtypes. TNBC is molecularly heterogeneous, with at least four to six well-characterized subtypes (Lehmann classification: BL1, BL2, M, MSL, LAR) differing in transcriptomic profile and therapeutic vulnerability. Our analysis is based primarily on the mesenchymal-like MDA-MB-231 cell line; the identified TK1–VIM vulnerability is, therefore, most likely to generalize to other mesenchymal-like (M) or mesenchymal-stem-like (MSL) tumors, and may be less directly relevant to basal-like (BL1/BL2) or luminal-androgen-receptor (LAR) tumors that do not depend on the EMT axis. Extension of this computational framework to networks representative of other TNBC subtypes, and to patient-derived organoids or xenografts spanning the subtype spectrum, is a necessary next step.
AlphaGenome limitations. AlphaGenome is trained on bulk-tissue transcriptomic and epigenomic data and does not contain a TNBC-specific ontology. The mammary-tissue tracks used here represent normal mammary biology and aggregate cell-type heterogeneity within each compartment (in particular, our ‘mammary epithelium’ pool includes basal-myoepithelial and stem-cell populations that physiologically express some mesenchymal markers, partially elevating the apparent VIM baseline relative to a hypothetical pure luminal-epithelial sample). For these reasons, the AlphaGenome results in this work should be interpreted as supportive orthogonal computational evidence rather than as functional validation of in vivo behavior in TNBC.
3.5. Clinical Implications and Druggability
The identification of TK1–VIM as a high-confidence computationally nominated therapeutic pair has significant translational implications. A key advantage of this pair is its 100% perturbability—that is, both targets have existing pharmacological inhibitors, which we term “computationally druggable” to distinguish the availability of reported inhibitors from established clinical druggability in TNBC combination therapy, which would require formal pharmacokinetic, synergy, and toxicity evaluation. For TK1, compounds such as 3′-azido-3′-deoxythymidine (AZT) and other nucleoside analogs have established inhibitory effects; isoform-selective TK1 inhibitors are an active area of medicinal-chemistry development, given the need to minimize off-target effects on the closely related mitochondrial thymidine kinase TK2. For VIM, specific inhibitors like Withaferin A (WFA), a steroidal lactone from
Withania somnifera that binds covalently to Cys328 in the vimentin rod domain and inhibits cancer-cell invasion and metastasis [
33,
34] and FiVe1 (FOXO3-induced Vimentin effector 1), a small molecule identified from a synthetic-lethal screen that selectively and irreversibly inhibits the growth of mesenchymally transformed cancer cells by binding the VIM rod domain and triggering mitotic catastrophe [
35], have demonstrated efficacy in disrupting vimentin networks and impairing cancer cell motility in preclinical models; current limitations include moderate potency, suboptimal pharmacokinetic profiles, and limited clinical-grade chemistry.
To our knowledge, simultaneous pharmacological inhibition of TK1 and VIM has not been previously reported in TNBC. The computational rationale derived here, therefore, identifies a novel combinatorial strategy. While these inhibitors are primarily in preclinical or early clinical stages for solid tumors, their availability facilitates immediate experimental validation and provides a clear translational path. Future clinical translation will require careful evaluation of the combined toxicity profile, but the complementary mechanisms of action—targeting proliferation and metastasis simultaneously—offer a promising strategy for improving patient outcomes in TNBC.
4. Materials and Methods
4.1. Problem Formulation
The primary objective of this work is to identify a minimal set of therapeutic targets within a gene regulatory network that maximizes the transition of cancer cells from a malignant phenotype to an apoptotic one. We formalize this as follows:
Let be a directed gene regulatory network where V is the set of genes () and E is the set of regulatory interactions. Each gene has a binary state at time t, where 1 denotes active expression and 0 denotes inactive expression. The network dynamics are governed by Boolean update functions , where is the set of regulators of gene i.
Let denote the attractor representing the malignant phenotype (Attractor_0) and denote the attractor representing the apoptotic phenotype (Attractor_1). Our goal is to find an optimal target set with (where ) such that inhibiting the genes in causes the network to transition from the malignant state to a state maximally similar to the apoptotic phenotype.
Formally, the optimization problem is
where
returns the attractor state reached after Boolean simulation starting from the malignant state
with genes in
T held at state 0 (inhibited), and
measures the Hamming similarity between two states (defined in Equation (
8)). The solution must satisfy additional constraints: high statistical robustness across multiple initial conditions, insensitivity to model parameter variations, and high druggability (availability of specific inhibitors for both targets).
4.2. Data and Network Construction
We utilized the exact 131-gene regulatory network from Sgariglia et al. (2024) [
9], constructed from the MDA-MB-231 triple-negative breast cancer cell line (RRID:CVCL_0062). The network is a directed graph
, where
V is the set of 131 genes (
) and
E is the set of 29 curated regulatory interactions (
Supplementary Table S1). The 29 edges encode canonical TNBC signaling axes (STAT3 transcriptional program; PI3K/AKT survival signaling; HSP90 chaperone network; p53/MDM2 axis; intrinsic apoptosis cascade; NF-
B inflammation/survival; HIF1
hypoxic response; cell cycle regulation via CCND1; TGF
/SMAD2 EMT axis; and HDAC1-mediated transcriptional repression) and were curated from the primary literature by Sgariglia et al. (2024) [
9]. The Boolean update functions are modeled as nested canalizing functions, as defined by Kauffman et al. (2004) [
32]. This network has been experimentally validated and is publicly available in the
Supplementary Materials of the original publication.
4.3. Attractor Landscape and Phenotypic States
The published work by Sgariglia et al. (2024) [
9] identified three stable attractors representing distinct cellular phenotypes through analysis of 30 independent samples from the MDA-MB-231 cell line:
Attractor_0 (Malignant Phenotype): Characterized by high proliferation, survival gene activation, and low apoptosis markers. This represents the unperturbed cancer cell state.
Attractor_1 (Apoptotic Phenotype): Characterized by cell cycle arrest, apoptosis gene activation, and survival gene inhibition. This represents the desired therapeutic outcome.
Attractor_2 (Alternative Phenotype): A secondary stable state with intermediate characteristics.
The binary gene states for each attractor across all 30 samples are publicly available as
Supplementary Table S2 in Sgariglia et al. (2024) [
9] and form the basis of our analysis. For each sample
, we have state vectors
representing the gene states in each attractor.
4.4. Gene Variability Analysis and Candidate Selection
To identify genes with high potential to influence attractor transitions, we calculated a variability score for each gene. For each gene
, the variability score
quantifies the number of state changes across the three attractors and 30 samples:
where
is the indicator function. Genes with high variability are critical nodes for attractor transitions and, thus, strong therapeutic candidates. This analysis confirmed that VIM and TK1 exhibited the highest variability (20 differences each across samples and attractors), making them primary candidates for therapeutic targeting.
4.5. Target Identification via Semidefinite Programming
To identify nodes with the greatest potential to influence the transition from the malignant to the apoptotic phenotype, we formulated the problem using three complementary SDP optimizations. SDP is a subfield of convex optimization that provides tractable relaxations for NP-hard combinatorial problems, offering more robust solutions than purely heuristic methods [
36,
37].
Formulation 1—Max-Cut SDP: The objective is to partition the network nodes to maximize the weight of edges connecting survival and apoptosis gene groups, using the Goemans–Williamson SDP relaxation [
36] (Equation (
3)):
where
X is the positive semidefinite variable matrix and
W is the weighted adjacency matrix.
Formulation 2—Influence Maximization SDP: We seek to identify a set of “seed” nodes that maximizes the propagation of pro-apoptotic signals (Equation (
4)):
where
is the probability of selecting node
i,
is the influence propagation probability,
and
are the sets of survival and apoptosis genes,
is a penalty term, and
k is the maximum number of targets.
Formulation 3—Spectral Clustering SDP: We use spectral clustering based on the network’s Laplacian matrix
(Equation (
5)):
The scores from the three formulations are normalized (min–max to [0,1]) and combined into a single SDP score for each gene
i, as shown in Equation (
6):
where
. We used baseline weights of 0.4, 0.4, and 0.2, and performed a comprehensive sensitivity analysis on these parameters across 75 different weight combinations spanning the 2-simplex.
SDP Optimization Pipeline—Pseudocode
The integration of the three SDP formulations into a single combined target score proceeds as follows.
- 1.
Construct weighted adjacency matrix W from E (positive for activation, negative for inhibition).
- 2.
Solve Max-Cut SDP relaxation (Equation (
3)) via CVXPY with SCS solver →
for each
.
- 3.
Solve Influence Maximization SDP (Equation (
4)) under sparsity constraint
→
for each
.
- 4.
Solve Spectral Clustering SDP on Laplacian
(Equation (
5)) →
for each
.
- 5.
Min–max normalize each score vector to .
- 6.
Compute combined score
as in Equation (
6).
- 7.
Select top-
k genes by
and validate by Boolean simulation (
Section 4.6).
- 8.
Repeat steps 5–7 over a grid of on the 2-simplex (75 combinations) for sensitivity analysis.
The Python (Version 3.12.10) implementation is publicly available; SCS solver tolerances were left at default values, and the optimization pipeline runs in approximately five minutes on a standard workstation (16 GB RAM, no GPU required). The complete pipeline is summarized in
Figure 8.
4.6. Functional Validation via Boolean Simulation
To validate the efficacy of a candidate target pair
, we performed Boolean simulations. The network dynamics were modeled as a discrete-time system where the state of each gene
is updated based on the states of its regulators at time
t, following the nested canalizing functions defined by Sgariglia et al. [
9,
32] (Equation (
7)):
where
is the Boolean update function for gene
i and
are its regulators.
We simulated the network starting from the 30 malignant attractor states, with the target pair inhibited. The final state of each simulation was compared to the corresponding apoptotic attractor state to calculate three key metrics:
Attractor Similarity: The percentage of genes in the final simulated state that match the apoptotic attractor state (Equation (
8)).
Survival Gene Inhibition: The percentage of key survival genes (e.g., BCL2, MCL1) that are successfully inhibited (state 0) in the final state.
Apoptosis Gene Activation: The percentage of key apoptosis-promoting genes (e.g., BAX, CASP3) that are successfully activated (state 1) in the final state.
4.7. Statistical Validation and Robustness Analysis
To ensure the robustness of our findings, we performed bootstrap resampling (
n = 10,000) [
38] on the 30 initial states to calculate 95% confidence intervals (CIs) for all validation metrics. To compare the performance of our identified pair (TK1–VIM) against five alternative pairs, we used paired
t-tests and non-parametric permutation tests (
n = 10,000) [
39], with Bonferroni correction for multiple comparisons [
40]. Alternative false-discovery rate methods (Benjamini–Hochberg [
41]; Storey q-value [
42]) were evaluated and yielded qualitatively identical conclusions; Bonferroni correction is reported as it provides the most conservative family-wise error rate control appropriate for a small number of prespecified comparisons. Specifically, the five comparisons (TK1–VIM versus STAT3–BCL2L1, AKT2–MDM2, HSP90AB1–NFKB1, TP53–BAX, and CCND1–E2F1) were corrected at
= 0.05/5 = 0.01; all observed
p-values remained below 0.001 after correction. Cohen’s
d was calculated to quantify the effect size [
43]; observed values of
–
correspond to extremely large effects (conventional “large” threshold
). The 30 paired observations correspond to distinct biological replicates of the MDA-MB-231 cell line in the originally published dataset [
9], supporting the assumption of sample independence.
4.8. Orthogonal Computational Support via AlphaGenome
To provide an orthogonal layer of computational support, we used AlphaGenome [
30], a large-scale deep learning model trained on vast genomic and transcriptomic datasets, to predict the genome-wide effects of inhibiting TK1 and VIM in normal mammary tissue. We queried the model for predicted RNA-seq expression and for the effect of single-nucleotide variants 300 bp upstream of each gene’s transcription start site, across a 524,288 bp (512 kb) interval centered on TK1 (chr17:7,577,845; genome build GRCh38) and VIM (chr10:17,270,258; GRCh38). Ontology terms were validated against the AlphaGenome catalogue of 704 supported terms via a dedicated preflight script (
Supplementary Code S6) prior to any analytical query.
The mammary-tissue ontology terms were partitioned a priori into two compartments (mammary epithelium, 5 terms/13 tracks; mammary stroma, 3 terms/2 tracks; see
Section 2.6 for full term lists). Four reference tissues (liver UBERON:0002107; skeletal muscle UBERON:0001134; brain UBERON:0000955; heart UBERON:0000948) were queried as off-target organ comparators. Compartment- and tissue-specific means were computed by averaging across all returned tracks. The complete query log, including timestamp, ontology terms, and per-track values, is provided as
Supplementary Data alphagenome_v3_results.json; the ontology audit log is provided as
Supplementary Table S5.
The convergence of results from these two fundamentally different computational paradigms—a curated, knowledge-driven Boolean model and a data-driven, large-scale deep learning model—provides orthogonal computational support for the biological plausibility of the predicted therapeutic effects, without substituting for experimental validation.
5. Conclusions
We have demonstrated that a hybrid computational approach combining Boolean network modeling, semidefinite programming optimization, statistical validation, and deep learning support can identify robust therapeutic targets for TNBC. The TK1–VIM pair emerges as a high-confidence, druggable therapeutic strategy with strong biological plausibility. The compartment-stratified AlphaGenome analysis revealed a biologically coherent pattern in which VIM is enriched 2.2-fold in mammary stroma over mammary epithelium, consistent with its role as a mesenchymal marker, and in which simulated inhibition of either target induces near-zero transcriptomic perturbation of normal mammary epithelium (mean ). This provides a clear biological rationale for synergistic targeting: TK1 inhibition attacks the proliferative capacity of the bulk tumor, while VIM inhibition prevents the emergence of metastatic, invasive cell populations. This work establishes a rigorous methodological benchmark for computational drug target identification and provides a computationally grounded, druggable candidate pair as a starting point for experimental computational validation in TNBC treatment research.
The next concrete steps emerging from this work are (i) prospective experimental validation of dual TK1–VIM inhibition in a panel of TNBC cell lines representative of the molecular subtype spectrum, with formal synergy quantification (Bliss/Loewe); (ii) preclinical evaluation in patient-derived xenograft models with mesenchymal-like histology; (iii) extension of the hybrid SDP–Boolean framework to subtype-specific regulatory networks; and (iv) refinement of the computational pipeline to incorporate dose–response (IC50) representation and stochastic Boolean variants, addressing the binary-state limitation of the current model. Through these next steps, we aim to bridge the in silico-to-in vivo gap that remains the principal limitation of computational target identification.
Beyond the specific TK1–VIM finding, the model developed here is intended as a reusable, general-purpose framework. Because the pipeline is agnostic to the particular network supplied, it can be redeployed to other malignancies and regulatory contexts simply by substituting the input Boolean network and its attractor landscape: the SDP-based ranking, multi-formulation consensus, statistical robustness analysis, and orthogonal AlphaGenome support all transfer unchanged. Prospective applications of the developed model, therefore, include (a) systematic reanalysis of existing curated cancer Boolean networks to nominate minimal druggable target sets; (b) coupling the framework to automated network-inference methods so that patient-specific or subtype-specific networks can be generated and optimized in a single workflow; and (c) integration of the SDP ranking with druggability and toxicity priors as additional constraints, moving the model toward an end-to-end computational platform for combinatorial target discovery. We anticipate that this reusable design will allow the present methodology to serve as a benchmark and a practical tool for the broader computational drug-discovery community.