Next Article in Journal
Genetic Determinants of Radiosensitivity: Evidence of Radioresistance-Associated SNP Enrichment in Occupational Workers Chronically Exposed to Low-Dose Radiation
Previous Article in Journal
Sex- and Exercise-Dependent Modulation of Hypertrophic Remodeling by the MCT1 rs1049434 Polymorphism
Previous Article in Special Issue
Bulk and Single-Cell Transcriptomes Reveal Exhausted Signature in Prognosis of Hepatocellular Carcinoma
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integrated Transcriptomic and Histological Analysis of TP53/CTNNB1 Mutations and Microvascular Invasion in Hepatocellular Carcinoma

Department of Computer Engineering, Automation and Robotics, CITIC, University of Granada, 18071 Granada, Spain
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2026, 17(2), 190; https://doi.org/10.3390/genes17020190
Submission received: 31 December 2025 / Revised: 27 January 2026 / Accepted: 2 February 2026 / Published: 3 February 2026
(This article belongs to the Special Issue AI and Machine Learning in Cancer Genomics)

Abstract

Background/Objectives: Hepatocellular carcinoma (HCC) shows marked molecular and histopathological heterogeneity. Among the alterations most strongly associated with clinical outcome are mutations in TP53 and CTNNB1, as well as the presence of microvascular invasion (MVI). Although these factors are well established as prognostic indicators, how their molecular effects relate to tumor morphology remains unclear. In this work, we studied transcriptomic changes linked to TP53 and CTNNB1 mutational status and to MVI, and examined whether these changes are reflected in routine histology. Methods: RNA sequencing data from HCC samples annotated for mutations and vascular invasion were analyzed using differential expression analysis combined with machine learning-based feature selection to characterize the underlying transcriptional programs. In parallel, we trained a weakly supervised multitask deep learning model on hematoxylin and eosin-stained whole-slide images using slide-level labels only, without spatial annotations, to assess whether these features could be inferred from global histological patterns. Results: Distinct gene expression profiles were observed for TP53-mutated, CTNNB1-mutated, and MVI-positive tumors, involving pathways related to proliferation, metabolism, and invasion. Image-based models were able to capture morphological patterns associated with these states, achieving above-random discrimination with variable performance across tasks. Conclusions: Taken together, these results support the existence of coherent biological programs underlying key risk determinants in HCC and indicate that their phenotypic effects are, at least in part, detectable in routine histopathology. This provides a rationale for integrative morpho-molecular approaches to risk assessment in HCC.

1. Introduction

Hepatocellular carcinoma (HCC) is the most common primary malignant tumor of the liver and one of the leading causes of cancer mortality worldwide. Despite advances in clinical treatment, HCC remains a biologically heterogeneous disease, in which tumors of similar stage and histological grade often exhibit very different clinical behavior [1].
One way to differentiate between the different subtypes of HCC is based on different genetic mutations. For example, TP53 and CTNNB1 mutations are among the most common molecular alterations in HCC and are associated with specific tumor phenotypes. Inactivating mutations in the TP53 tumor suppressor gene occur in approximately 30–50% of cases worldwide (with higher rates in regions exposed to aflatoxin B1) [2,3]. These mutations disrupt cell cycle control, DNA repair, and apoptosis, often leading to increased proliferation and a more aggressive phenotype associated with poorer outcomes. The p53 protein, encoded by the TP53 gene, serves as a guardian of genomic integrity by responding to diverse cellular stresses, including oncogene activation and metabolic disturbances. It manages key protective mechanisms such as cell cycle arrest, DNA repair, and senescence and apoptosis, preventing malignant transformation. Extensive research has established that TP53 dysregulation, primarily through mutations, drives tumorigenesis in over half of human cancers [4]. On the other hand, mutations in CTNNB1 (encoding β -catenin) primarily in exon 3, are observed in about 25–40% of HCCs (typically higher in non-viral or alcohol-related cases) [5,6]. These lead to aberrant Wnt/ β -catenin pathway activation, promoting metabolic reprogramming, immune exclusion, and a generally less aggressive and better differentiated profile compared to TP53-mutated tumors [7]. Notably, TP53 and CTNNB1 mutations tend to be mutually exclusive, contributing to separate molecular subclasses and clinical features [8].
In addition to mutational status, vascular invasion is a potential pathological feature associated with poor prognosis in HCC [9,10]. Its presence is strongly correlated with early recurrence after resection or transplantation [11,12]. Although vascular invasion is assessed in surgical specimens, there is growing evidence to suggest that it reflects broader biological programs related to invasion and proliferation, rather than a purely focal process [13].
Genomic and transcriptomic alterations in HCC have been extensively characterized, but their relationship to routine histology remains unclear. Although somatic mutations act through downstream changes in gene expression, it is not always clear how these processes translate into consistent patterns at the tissue level. In practice, it remains unclear whether clinically relevant features, such as microvascular invasion, can be reliably inferred from transcriptomic data or standard hematoxylin and eosin (H&E)-stained preparations.
In this study, we present an integrated morpho-molecular approach to refine HCC risk stratification. Beyond identifying prognostic gene signatures, we frame our analysis around a unifying question: to what extent do distinct molecular alterations and pathological features leave detectable, global morphological signatures in routine histology?
To answer this, we first performed an extensive analysis of differential gene expression on TP53/CTNNB1 activation status and tried to gain an understanding of the profound molecular mechanisms regulating mutated HCCs. In a previous study, we established a 15-gene panel pairing differential gene expression with machine learning that accurately classified liver cancer stages [14]. Building on this work, we applied an identical feature selection procedure to derive a transcriptomic signature for vascular invasion. Subsequently, we trained weakly supervised multitask deep learning models on whole-slide images (WSI) using only slide-level labels. This experimental design allows us to determine if the downstream phenotypic effects of these molecular and focal states are sufficiently pervasive to be inferred from global histological patterns without local annotations.

2. Materials and Methods

2.1. Data Resource

The Cancer Genome Atlas (TCGA) represents a comprehensive, open-access repository containing extensive multiple omics and clinical information from numerous cancer types [15]. Initiated in 2006 as a joint effort by the National Cancer Institute and the National Human Genome Research Institute in the United States, TCGA has significantly advanced the field of oncology. It has generated critical knowledge about the underlying molecular alterations in cancer, facilitating the identification of therapeutic targets and potential biomarkers for diagnosis and prognosis. Despite the end of the TCGA project, its datasets continue to be available to the scientific community via the Genomic Data Commons (GDC) platform [16], which was the source of data we used in the present study.
Specifically, WSIs and RNA-Seq gene expression data of hepatocellular carcinoma samples were obtained from the TCGA-LIHC cohort. The cases presenting Single Somatic Mutations (SSMs) on the TP53 and CTNNB1 genes were obtained using the Mutation Frequency tool. Only coding variants predicted to be functionally damaging were considered. Specifically, variants annotated as possibly damaging or probably damaging by PolyPhen and classified as coding sequence alterations (e.g., missense or stop-gained variants) were retained, while benign or non-coding variants were excluded. Vascular invasion status was retrieved filtering by Pathology Details Vascular Invasion Type and selecting micro and macro statuses. Figure 1 displays the number of samples annotated for each condition. The retrieved files included the samplesheet and clinical data to ensure all samples were correctly identified.
For the histological image classification task, all analyses were performed exclusively on diagnostic (DX) whole-slide images corresponding to formalin-fixed paraffin-embedded (FFPE) tissue sections. Frozen section slides were excluded from this study. This choice was motivated by the higher and more consistent histomorphological quality of FFPE slides, which are known to be better suited for deep learning-based image analysis. In this task, only microvascular invasion cases were considered positive, while macrovascular invasion was excluded due to its low frequency and distinct morphological characteristics, which could confound the detection of subtle invasion-related patterns.

2.2. Methodology

2.2.1. Differential Expression and Gene Signature Extraction

Gene-level expression quantification and annotation were performed using the Knowseq package [17]. A quality assessment was carried out with Knowseq (including the Kolmogorov–Smirnov test, Median Absolute Deviation, and distance-based outlier detection), which led to the identification and removal of an outlier. Finally, batch effects were corrected using surrogate variable analysis [18].
Differentially expressed genes (DEGs) were identified using the limma package in R, which employs a moderated t-test based on an empirical Bayes approach. This method shrinks the gene-wise residual variances towards a common value, increasing statistical power and stability [19].
For each of the three comparisons considered in the study, separate models were fitted. Log2 fold changes and associated p-values were computed, followed by false discovery rate (FDR) adjustment using the Benjamini–Hochberg procedure. Genes were considered differentially expressed using the fold change thresholds and p-value cutoffs defined for each contrast in the following Table 1.
In the case of the vascular invasion contrast, a comprehensive machine learning pipeline was implemented using the Knowseq framework to develop a predictive gene signature. All the steps were carried out inside a 5-fold cross-validation procedure, starting with the DEG Extraction. Then, feature selection was performed using the previously extracted DEGs with the minimum redundancy maximum relevance (mRMR) algorithm [20], which generated five rankings. These rankings were used to train SVM models, iteratively adding the ranked genes. Using this, we can have an estimation of the performance of the model for each number of genes. The most frequently selected genes across the five folds were retained as the final proposed signature. To assess its performance, a 100-times repeated final 5-fold cross-validation was conducted. A support vector machine (SVM) was used as a classifier [21] and hyperparameter tuning was performed within each training fold using an inner cross-validation loop to avoid overfitting [22].

2.2.2. WSI Classification

The methodological workflow of this study integrates whole-slide histopathology images with weakly supervised deep learning to explore the relationship between tissue morphology and molecular alterations in hepatocellular carcinoma. An overview of the main processing steps, including image preprocessing, feature extraction, multiple instance learning, and evaluation, is provided below and illustrated in Figure 2.
1.
Preprocessing and Tissue Segmentation
To minimize computational demands and prioritize diagnostically relevant areas, we applied background filtering. Tissue regions were identified by generating a binary mask via Otsu’s thresholding [23], which optimally separates foreground (tissue) from background (empty glass or whitespace) using pixel intensity histograms.
Valid tissue areas were then divided into non-overlapping 512 × 512 pixel patches at 40× magnification, corresponding to an effective spatial resolution of approximately 0.25 microns-per-pixel, as obtained from WSI metadata. Patches with insufficient tissue coverage were filtered out to maintain high-quality inputs.
2.
Feature Extraction
Patch-level features were extracted using a pre-trained foundation model tailored for histopathology. Specifically, we employed weights from the Bioptimus H-optimus1 [24] model (a Vision Transformer-based architecture with over 1 billion parameters, pre-trained via self-supervised learning on a large-scale proprietary dataset of hundreds of millions of histopathology images from diverse patients and diseases).
Formally, for a WSI W, we derive a bag of N patches X = { x 1 , x 2 , , x N } . The feature extractor f ( · ) maps each patch x i to a fixed-dimensional embedding h i R D :
h i = f ( x i )
This yields a compact bag of embeddings H = { h 1 , , h N } , reducing dimensionality while retaining essential morphological details.
3.
Attention-Based MIL Aggregation
Given weak supervision (slide-level labels only), we evaluated several attention-based MIL methods to aggregate instance embeddings H into a single slide-level representation Z in a permutation-invariant manner. The compared aggregators include the following:
  • ABMIL (Attention-Based MIL) [25]: Employs a gated attention mechanism to assign learnable importance weights to individual instances, enabling the model to focus on the most informative patches while preserving permutation invariance.
  • TransMIL (Transformer-Based MIL) [26]: Utilizes a Transformer architecture to model global contextual relationships among instances, capturing long-range dependencies between patches within a slide.
  • DFTD (Dual-Feature Tokenization MIL) [27]: Decomposes each bag into complementary global and local feature representations, enabling the model to identify informative instances through interactions between discriminative features and contextual information, without explicitly constraining attention via clustering.
  • CLAM (Clustering-Constrained Attention MIL) [28]: Introduces instance-level clustering to explicitly separate positive and negative evidence within a bag, guiding the attention mechanism toward discriminative regions.
4.
Slide-Level Classification
The aggregated representation Z is passed to a multi-layer perceptron (MLP), which projects it into the label space. Predictions are produced via softmax (multi-class) or sigmoid (binary) activation.
5.
Interpretability and Visualization
For model explainability, attention weights from the MIL aggregator were visualized as heatmaps. Normalized scores a i were overlaid on the original WSI coordinates, highlighting regions of interest (ROIs) most influential to the prediction. This facilitates pathologist review and qualitative assessment of model decisions.
6.
Evaluation Criteria
Due to the limited number of samples in the minority class (≈60 WSI), we adopted a 5-fold cross-validation protocol over the full dataset to maximize data utilization and obtain a stable performance estimate. All models were evaluated using the same cross-validation folds, with fixed hyperparameters, and each WSI was used exactly once for testing. To address class imbalance, training employed a class-weighted cross-entropy loss, and model selection within each fold was based on the F1-score of the positive class.

3. Results and Discussion

3.1. TP53 Mutations

3.1.1. Differential Expression Analysis

Following the described preprocessing steps, a total of 249 statistically significant differentially expressed genes (DEGs) were identified, of which 123 were upregulated and 126 downregulated in TP53-mutated hepatocellular carcinoma compared to the wild-type group (Table 2).
One of the most direct transcriptomic consequences of TP53 mutation is the silencing of genes that strictly depend on p53 for their expression. Our analysis reveals a cluster of significantly downregulated genes, PPP1R7, EDA2R, PURPL, PTCHD4, HGFAC, and SPATA18, representing the collapse of critical surveillance mechanisms governing mitosis, apoptosis, and metabolism.
Among these, EDA2R is a direct transcriptional target of p53 [29]. Its signaling is predominantly involved in anoikis (cell death induced by detachment from the extracellular matrix). In normal epithelial cells, loss of anchorage triggers p53-dependent EDA2R expression, ensuring that displaced cells die rather than colonize inappropriate sites [30,31]. However, in TP53-mutated HCC cells, the failure to induce EDA2R may contribute to tumor cell survival in suspension or after detachment from the basement membrane. This anoikis resistance seems to have an important role in intrahepatic metastasis and systemic dissemination.
A similar tumor-suppressive role is observed for SPATA18, which encodes a p53-inducible protein that induces lysosome organelles within mitochondria, eliminating oxidized mitochondrial proteins and functioning as a tumor suppressor via mitochondrial quality control [32]. In both renal cell carcinoma and colorectal cancer, elevated SPATA18 expression has been associated with improved prognosis and overall survival [32,33], which further supports the observation that TP53-mutant tumors exhibit poorer prognosis in association with reduced SPATA18 expression.
Metabolic dysregulation is further highlighted by the downregulation of HGFAC, a serine protease that activates Hepatocyte Growth Factor (HGF). It is a marker of mature, functional hepatocytes. Beyond HGF activation, it maintains systemic glucose and lipid homeostasis [34]. The downregulation of this gene may exacerbate lipid dysregulation, enabling rapid cell proliferation, something that has previously been linked to the lipogenic phenotype of aggressive cancers [35]. Moreover, liver cancer patients with low HGFAC expression, linked to DNA hypermethylation, have been found to present significantly poorer survival than those with high expression [36,37].
In the same metabolic context, CYP2E1 is a key enzyme involved in ethanol and fatty acid metabolism. It has been found to be consistently downregulated in poorly differentiated HCC, and its expression correlated negatively with tumor size and vascular invasion [38]. Its loss in established tumors may protect cancer cells from excessive oxidative stress-induced death, contributing to the more aggressive phenotype of TP53-mutated HCC.
Finally, the significant downregulation of asialoglycoprotein receptors (ASGR1 and ASGR2) in TP53-mutated HCC has been suggested to signify a critical event of functional dedifferentiation, where tumor cells progressively lose their specific hepatocellular identity to acquire a more invasive phenotype [39]. Under healthy conditions, ASGRs are abundantly expressed on the surface of mature hepatocytes, functioning as a clearance system for desialylated glycoproteins [40]. However, in aggressive TP53-mutated tumors, the suppression of this system is consistent with a metastatic phenotype, potentially facilitating cell migration [41]. Again, this downregulation relates TP53-mutated cases with the most aggressive phenotypes of the disease.
In contrast to these losses of tumor-suppressive and differentiation programs, the specific upregulation of GABAergic signaling components (GABRA2 and GABRA3) observed in TP53-mutated HCC aligns with the emerging concept of neuronal mimicry, where aggressive tumors hijack neuronal pathways to support survival and progression [42,43]. While GABA typically works as the primary inhibitory neurotransmitter in the mature adult brain, it reverts to an excitatory and proliferative role in neoplastic tissues, mirroring the physiology of embryonic neural stem cells [44,45].

3.1.2. Gene Ontology Enrichment

The top Gene Ontology (GO) terms associated with the DEGs provide a summary of the biological shifts occurring in TP53-mutated HCC (see Figure 3).
At the level of biological processes, the most clear signature is a profound dysregulation of the cell cycle. This enrichment is driven by the strong upregulation of key mitotic regulators, including TOP2A, BUB1B, CDCA8, CDC20, and CCNB1 [46]. These genes collectively orchestrate the G2/M transition and faithful chromosome segregation. Their overexpression in the context of TP53 mutation reflects the collapse of the p53-p21-DREAM pathway [47]. The resulting mitotic storm is indicative of accelerated proliferation and may foster chromosomal instability.
Similarly, enrichment in the regulation of the apoptotic process underscores the evasion of apoptosis. This term is influenced by the downregulation of proapoptotic genes such as EDA2R, SPATA18, and PPP1R7, jointly with the functional loss of TP53 in the mutated group. The combined effect seems to disable apoptotic safeguards, allowing cells with severe mitotic defects to survive and propagate.
Kinetochore and spindle-associated terms are enriched in the cellular component analysis, driven by BUB1B, CDCA8, and PPP1R7, that collectively disrupt accurate chromosome alignment and segregation [48,49,50]. This dysregulation directly contributes to aneuploidy, which is a common feature of TP53-mutated tumors and a driver of intratumoral heterogeneity and poor prognosis [51].
Finally, the molecular function category reveals targeted enzymatic and ion channel reprogramming. GABA-A receptor activity reflects the functional gain of chloride conductance via GABRA2 and GABRA3, potentially altering membrane potential and intracellular signaling in a manner advantageous to tumor cell survival.

3.1.3. Slides

Next, we evaluated the ability of weakly supervised deep learning to predict TP53 mutation status directly from whole-slide images of hepatocellular carcinoma (LIHC). Several attention-based multiple instance learning (MIL) models were assessed using slide-level labels only, without any spatially localised annotations. All models were trained and evaluated under the same 5-fold cross-validation protocol on the TCGA-LIHC cohort. As reported in Table 3, CLAM achieved the highest ROC-AUC, while ABMIL showed slightly higher sensitivity and comparable F1-score for the TP53-mutated class. Based on its overall performance, CLAM was selected for further qualitative analysis.
For each fold, the model was trained on four subsets and evaluated on the remaining one. Across the five folds, CLAM achieved a mean ROC-AUC of 0.82 ± 0.07 and a mean accuracy of 0.77 ± 0.07. Although some variability was observed across folds, performance remained relatively consistent in the identification of TP53-mutated cases.
The aggregated confusion matrix shown in Figure 4 indicates that 112 out of 143 TP53 wild-type cases were correctly classified, corresponding to a specificity of 77%. For the TP53-mutated class, 43 out of 61 cases were correctly identified, yielding a sensitivity of 70%. These results indicate that a substantial fraction of TP53 mutations can be inferred from histomorphological patterns alone under a weakly supervised setting; however, a subset of mutated tumors remains difficult to detect in the absence of region-level annotations.
Importantly, these predictions were obtained using weak supervision only, without any spatial annotation of tumor regions. This suggests that attention-based MIL models are able to capture global morphological correlates of molecular alterations from whole-slide images. The reduced sensitivity observed for TP53-mutated cases is consistent with the heterogeneous phenotypic manifestations associated with TP53 alterations in hepatocellular carcinoma, indicating that not all mutations give rise to clearly distinguishable histomorphological patterns.
As illustrated in Figure 5, attention maps derived from both ABMIL and CLAM highlight spatially coherent tumor regions rather than isolated patches. Representative high-attention patches tend to exhibit increased cellular density and nuclear atypia, whereas low-attention regions often correspond to more structured or stromal areas. These observations support the ability of attention-based MIL models to capture biologically meaningful morphological patterns under weak supervision.

3.2. CTNNB1 Mutation

3.2.1. Differential Expression Analysis

Using the CTNNB1 annotation, we repeated the DEGs extraction process. In this contrast, 245 DEGs were identified, of which 152 were upregulated and 93 downregulated in the CTNNB1-mutated group. The results are consistent with the Wnt/ β -catenin pathway activation, revealing coordinated effects on signaling and metabolism and suggesting some molecular mechanisms of the typical immune exclusion and differentiation characteristics of the CTNNB1-mutant phenotype (Table 4).
Among the top upregulated genes, we found NKD1, a negative feedback regulator and transcriptional target of Wnt signaling activation in liver and intestinal tumors, and suspected to enhance cancer proliferation [52,53].
Another upregulated gene, REG3A, is a C-type lectin expressed in the intestine and pancreas but inducible in the liver during injury [54]. It has been identified as a direct target of β -catenin, linking Wnt activation to regenerative and survival programs. Its upregulation suggests that CTNNB1-mutated tumors exploit normal liver regeneration pathways to enable growth under oxidative or toxic stress. The clinical implications of REG3A expression appear to be context dependent. In hepatocellular carcinoma, REG3A has been associated with early-stage tumors without vascular invasion and a more favorable prognosis [55]. In contrast, high REG3A expression in colorectal cancer has been linked to enhanced proliferative ability and poorer patient outcomes [56], suggesting that in non-regenerative contexts REG3A may contribute directly to oncogenic progression [54].
We also observed strong induction of ODAM, a recently described and highly specific Wnt/ β -catenin target in liver cancer [57]. ODAM is typically not expressed in normal liver tissue but is greatly induced in CTNNB1-mutated HCC, making it almost diagnostic of aberrant Wnt activation [58]. Its detection in this cohort provides confirmation of this novel angle for stratifying patients with Wnt-driven disease. At the same time, ODAM expression has shown to improve apoptosis in breast cancer cells, suppress growth rate, and migratory activity [59]. These previous reports suggest a similar mechanism in CTNNB1-mutated HCC, given its typical less aggressive phenotype.
A defining feature of CTNNB1-mutated HCC is its distinct metabolic phenotype, resembling that of pericentral hepatocytes (where HCC typically arises), driving the expression of genes involved in glycolysis, xenobiotic metabolism, and lipogenesis [60,61]. Consistent with this model, CYP2E1, the canonical marker of Zone 3 hepatocytes, was strongly upregulated [60]. CYP2E1 activity generates high levels of reactive oxygen species (ROS), placing these tumors under oxidative stress. Tumor survival under these conditions often requires co-activation of antioxidant pathways [62]. This metabolic configuration correlates with the well-differentiated histological appearance typical of this subclass [63]. In line with elevated oxidative stress, ALDH3A1 was also upregulated. It has been explicitly identified as another marker of Wnt/ β -catenin activation in hepatocellular adenomas and carcinomas, and its overexpression likely represents an adaptation that may buffer against apoptosis in a high-ROS environment. Elevated ALDH activity has also been associated with intrinsic resistance to alkylating chemotherapeutic agents [64,65], suggesting a target for potential treatments specific for this subclass.
CTNNB1-mutated HCCs are typically classified as immunologically cold tumors, showing a suppressed immune microenvironment with reduced infiltration of effector cells and poor response to checkpoint inhibitors [66]. Some of the found DEGs are related to these mechanisms.
The found downregulation of CASP1 likely plays a central role in suppressing immune danger signals by inhibiting the inflammasome pathway. Under normal conditions, this pathway activates pro-inflammatory cytokines that initiate pyroptosis—a form of highly immunogenic cell death that promotes T-cell recruitment [67,68]. In CTNNB1-mutated HCCs, Wnt/ β -catenin signaling appears to mediate this suppression, elevating the threshold for pyroptosis and shifting toward non-immunogenic apoptosis or enhanced cell survival under stress, evading immune surveillance [69]. Additionally, we observed downregulation of CXCL1 and CXCL6 chemokines, which in wild-type tumors attract neutrophils and initiate innate immune response [70]. Clinically, this aligns with the resistance to therapies like transarterial chemoembolization, suggesting they are not a good option for patients with CTNNB1-mutation.

3.2.2. Gene Ontology Enrichment

At the level of biological processes (Figure 6), we observed a profound metabolic reprogramming. This enrichment is driven by the upregulation of key metabolic enzymes and regulators, including CYP1A1, CYP1A2, CYP2E1, CYP3A4, UGT1A7, UGT1A9, SLC22A11, SLC22A12, and ACSL6. These genes collectively facilitate shifts in amino acid biosynthesis, fatty acid metabolism, glutamate processing, and glycolysis. Their overexpression in the context of CTNNB1 mutation reflects activation of the Wnt/ β -catenin pathway [47], promoting adaptive metabolic flexibility that supports tumor cell proliferation and survival under stress. The resulting metabolic storm enables efficient energy production and resource allocation even under the harsh tumor microenvironment.
The cellular component ontology is exclusively defined by terms related to the apical domain of the cell, specifically the apical plasma membrane. In polarized hepatocytes, the apical membrane forms the bile canaliculi [71], which are essential for secreting metabolized and detoxified compounds (xenobiotics, and bile acids) into the biliary system [72]. The strong enrichment for this component supports and physically localizes the identified biological processes; the metabolic activities are functionally linked to the apical secretory apparatus. In the context of the CTNNB1 mutation, this may indicate an attempt by the tumor cells to maintain this critical detoxification function.
The molecular functions reveal a toolkit for metabolic transformation and transport. This includes oxidoreductase, an enzyme essential for detoxifying xenobiotics and metabolizing lipids, alongside various transporters that improve the movement of metabolites across cells. These functions integrate with the metabolic reprogramming observed in the biological processes, enabling the efficient handling and secretion of compounds through the apical plasma membrane. This reinforces the CTNNB1 tumor’s detoxification capabilities, potentially contributing to drug resistance.

3.2.3. Slides

After analyzing TP53 mutation status, we used the same weakly supervised framework to predict CTNNB1 mutations in hepatocellular carcinoma. We evaluated multiple attention-based MIL models under identical experimental conditions. As shown in Table 5, ABMIL achieved the highest overall performance, with results comparable to those obtained with CLAM.
The best model had an overall accuracy of 0.73 and an ROC-AUC of 0.79. Predictive performance was slightly lower than that observed for TP53. The difference was modest. This indicates that CTNNB1-associated alterations may also be reflected in histomorphological patterns detectable in routine H&E whole-slide images. However, these alterations are less consistent.
The discrepancy in performance between TP53 and CTNNB1 is biologically plausible and likely reflects the distinct functional roles of the two genes. At the histological level, pronounced nuclear atypia and architectural disruption are readily apparent features of TP53 inactivation. On the other hand, CTNNB1 mutations mostly trigger the Wnt/ β -catenin signaling pathway and are often linked to more subtle and varied morphological characteristics, such as trabecular growth patterns, focal cholestasis, or relatively preserved differentiation. These characteristics are not evenly spread across all mutated tumors.
The aggregated confusion matrix (Figure 7) shows that 180 out of 227 CTNNB1 wild-type slides were correctly classified, corresponding to a specificity of 79%. For the CTNNB1-mutated class, 70 out of 97 cases were correctly identified, yielding a sensitivity of 72%. Most errors arose from mutated cases misclassified as wild-type, consistent with the more heterogeneous and less distinctive histomorphological presentation associated with CTNNB1 alterations.
In line with the findings for TP53, attention maps for CTNNB1 (Figure 8) prediction showed a high degree of agreement between the best-performing models. Regions assigned high attention scores were spatially coherent across methods, while negatively weighted patches corresponded to areas with clearly different histological appearances. This qualitative consistency supports the notion that, despite the lower overall performance, CTNNB1-associated morphological cues are not random but are captured in a reproducible manner by attention-based MIL models.

3.3. Vascular Invasion

3.3.1. Gene Signature Identification via Machine Learning

We will now categorize the presence of vascular invasion in the tumors, again by gene expression values. In this case, we have integrated the DEG Extraction process into a 5-fold cross validated machine learning pipeline. For each training fold of the cross-validation procedure, we extracted DEGs (139/142/135/134/139 were found, 102 common across folds) and applied the mRMR algorithm to them, obtaining an importance ranking. Then, adding genes iteratively in the ranking obtained by the algorithm, SVM classifiers were trained and tested in the samples of the remaining fold.
Figure 9 displays the mean accuracies, sensitivities, and specificities, paired with the standard deviation across folds for each incremental gene set. As the evolution reaches stability with four genes (mean 96.85% accuracy), we selected it as a compromise between complexity and performance. Using the rankings obtained by mRMR across folds, we selected the most robust genes.
The proposed final signature comprises PGM5P4-AS1 (ranked 1st in 3 folds), IRX3 (appeared in the top 3 in 4 folds), LINC00683 (ranked 1st in 2 folds), and AC112206.3 (consistently in the top 10).
To estimate the performance of the selected signature, a new 100-times repeated, 5-fold cross-validation was performed. This model achieved an accuracy of 97.81 % ± 1.68 , a sensitivity of 96.98 % ± 3.66 , and a specificity of 98.25 % ± 1.88 .
While the cross-validation strategy provides an estimation of performance, we must interpret these high metrics with caution. Given the modest sample size, imbalance and the high-dimensionality, we acknowledge the potential risk of overfitting. Although the internal cross-validation mitigates this to an extent, it cannot rule out cohort-specific effects.
Moreover, the final fixed signature was determined by analyzing consensus rankings across the entire dataset, which introduces selection bias and prevents it from serving as an unbiased estimate of predictive performance. However, this approach is valuable because it prioritizes genes that consistently rank high across different data partitions, thereby filtering out stochastic noise and artifacts specific to individual subsets. It also yields a concrete panel that can be targeted and verified in future external validation studies.
Consequently, the reported performance represents a promising initial indication of the signature’s potential, while true generalizability requires further validation on larger external cohorts to address selection bias and confirm its utility.
The selected gene signature was further supported by the biological relevance of the genes in the literature:
PGM5P4-AS1 is a long non-coding RNA (lncRNA) that appears to act as a tumor suppressor. In lung cancer, PGM5P4-AS1 is downregulated in tumors, and its overexpression inhibits cell proliferation and invasion [73]. Although direct studies in HCC or vascular invasion are lacking, PGM5P4-AS1 downregulation was identified as part of a gallbladder cancer diagnostic signature [74] suggesting it may similarly mark tumor suppression. In our analysis, PGM5P4-AS1 was found to be downregulated in HCC samples presenting vascular invasion, consistent with the previously reported lung and bladder behavior.
Likewise, LINC00683 is another lncRNA that seems to act as a tumor suppressor. It is downregulated in prostate cancer and high levels correlate with better survival [75]. In that context, gene-set analysis showed LINC00683 is involved in cancer pathways such as Wnt/ β -catenin signaling. Moreover, LINC00683 appeared in a multi-RNA signature for cervical cancer recurrence [76]. Together, these suggest LINC00683 normally restrains oncogenic signaling. These conclusions were also proven in our study, showing that LINC00683 downregulation could be involved in the promotion of vascular invasion of adjacent tissues.
Although IRX3 has been shown experimentally to promote proliferation in HCC cell line [77], its underexpression in vascular invasive tumors in our analyzed cohort suggests a context-dependent role. Homeobox transcription factors like IRX3 are linked to differentiation and early proliferative programs, and their expression can vary significantly with tumor progression stage [78]. It is plausible that vascular invasive HCC represents a later phenotypic state in which alternative signaling pathways lead to relative IRX3 repression.
Finally, for AC112206, the available evidence related to cancer is still scarce. In non-small cell lung cancer, low expression of AC112206 has been reported to correlate with poorer overall survival [79], suggesting a potential tumor-suppressive role in that context. However, to date there are no clear or direct links between AC112206 and liver cancer or vascular invasion specifically.

3.3.2. Slides

Once more, in addition to looking at changes in genes, we studied whether models that are not strongly supervised could see signs of vascular microinvasion. Vascular microinvasion is a factor that doctors use to predict how well a patient will do with hepatocellular carcinoma. We evaluated all models using the same experimental setup applied to the mutation prediction tasks. As shown in Table 6, the overall performance was substantially lower than that observed for TP53 and CTNNB1.
Across models, ROC-AUC values ranged between 0.57 and 0.66, with the highest mean AUC (0.66 ± 0.10) being achieved by CLAM. However, no approach showed consistently strong performance across sensitivity and F1-score, indicating limited discriminative power at the whole-slide level.
The aggregated confusion matrix for vascular invasion prediction in Figure 10 shows that 158 out of 213 non-invasive cases were correctly classified, corresponding to a specificity of 74%. For cases with vascular invasion, 41 out of 94 slides were correctly identified, yielding a sensitivity of 44%. Most errors were associated with invasive tumors misclassified as non-invasive, indicating limited discriminative power at the whole-slide level.
This reduced performance is not unexpected. Molecular alterations may influence tumor morphology over large tissue areas, resulting in histological patterns that are broadly distributed across the slide. In contrast, vascular microinvasion is a focal pathological phenomenon, typically confined to small regions at the tumor–vascular interface, which may represent only a minor fraction of the whole-slide image or may not be captured among the sampled tiles at all. This creates a fundamental challenge for the MIL framework used under weak supervision. Because the model relies on attention mechanisms to aggregate features from thousands of tiles into a single slide-level prediction, the informative signal originating from a small number of invasion-positive tiles is easily diluted by the overwhelming number of non-informative background tiles. In the absence of pixel- or region-level annotations to explicitly guide the attention mechanism toward these rare events, the model faces intrinsic difficulty in isolating the relevant morphological events.
The low sensitivity seen in most models supports this idea and suggests that false-negative predictions are probably due to sampling limitations, not the lack of histological correlates. Although some models achieved moderate area under the curve (AUC) values, these results were not generalizable across the cohort, underscoring the heterogeneous and spatially restricted nature of vascular microinvasion in liver hepatocellular carcinoma.
For vascular invasion, although overall predictive performance was modest across models, the corresponding attention maps in Figure 11 showed consistent spatial patterns. In particular, regions highlighted by different MIL architectures largely overlapped, suggesting that the models converge on similar histomorphological cues despite limited discriminative power at the slide level.

3.4. Limitations and Generalizability

Finally, we acknowledge the limitations regarding generalizability inherent to the use of the TCGA-LIHC dataset. While this cohort provides a unique paired morpho-molecular resource, it introduces specific biases in case selection, as it predominantly comprises surgically resectable, treatment-naïve tumors. As a result, the models are trained on a population that may not fully represent the advanced or unresectable HCC cases commonly encountered in routine clinical practice.
Regarding slide preparation, TCGA whole-slide images exhibit substantial heterogeneity in staining protocols and scanning devices across contributing institutions. Although this variability poses challenges for model development, it may also act as a partial proxy for real-world heterogeneity, encouraging the learning of robust, scanner-invariant representations rather than overfitting to institution-specific artifacts. Nevertheless, successful deployment in external clinical settings will require rigorous validation on independent cohorts and, potentially, the application of domain adaptation strategies to address domain shift arising from differences in digitization pipelines and from the transition between resection specimens and diagnostic biopsies.

4. Conclusions

We showed that the transcriptomic landscape of TP53-mutated HCC is defined by a shift towards a proliferative and dedifferentiated state. These insights reinforce the central role of TP53 loss in the development of aggressive HCC phenotypes, but also highlight potential therapeutic entry points. Conversely, the CTNNB1-mutated group shows a profile dominated by the activation of the Wnt/ β -catenin pathway and metabolic reprogramming (displayed by the Gene Ontology Enrichment). This results in a cold tumor microenvironment, with downregulated chemokines like CXCL1 and CXCL6, limiting the activation of the immune system. While these tumors seem to suffer from oxidative and toxic stress, they utilize alternative survival strategies—such as pyroptosis inhibition to sustain growth. Additionally, the vascular invasion signature achieved high cross-validation accuracy. While these results are promising, the modest sample size and lack of external validation warrant caution regarding potential overfitting and cohort overspecification. Nonetheless, the consistent selection and performance of genes across folds suggest that they provide relevant information.
At the histological level, WSI-based classification revealed that TP53 and CTNNB1 mutation status can be predicted with comparable performance when restricting the analysis to alterations with clear functional impact, suggesting that both mutations leave detectable global morphological traces. In contrast, vascular invasion proved consistently harder to predict, in line with its focal and spatially restricted nature. These results indicate that weakly supervised MIL models can recover biologically meaningful signals from routine histology, but that their performance is inherently constrained by how directly a given molecular or pathological event manifests at the tissue scale.
Revisiting our central research question, our findings clarify the extent to which molecular and pathological features are reflected in routine histology. Molecular alterations associated with widespread cellular reprogramming—such as TP53 and CTNNB1 mutations—imprint detectable global signatures that are amenable to slide-level prediction. By contrast, focal pathological phenomena like microvascular invasion induce a resolution mismatch under weak supervision, limiting achievable performance. Consequently, while deep learning offers powerful tools for decoding diffuse molecular states from H&E slides, more granular supervision will likely be required to robustly capture spatially localized events. This distinction is critical for the future development of integrative morpho-molecular biomarkers in HCC.

Author Contributions

Conceptualization, N.H. and I.G.; methodology, N.H., I.G., L.J.H., F.M.O. and I.R.; software, N.H. and I.G.; validation, N.H., I.G., L.J.H., F.M.O. and I.R.; formal analysis, N.H. and I.G.; investigation, N.H. and I.G.; resources, N.H., I.G., L.J.H., F.M.O. and I.R.; data curation, N.H. and I.G.; writing—original draft preparation, N.H. and I.G.; writing—review and editing, N.H., I.G., L.J.H., F.M.O. and I.R.; visualization, N.H. and I.G.; supervision, L.J.H., F.M.O. and I.R.; project administration, L.J.H., F.M.O. and I.R.; funding acquisition, L.J.H., F.M.O. and I.R. All authors have read and agreed to the published version of the manuscript.

Funding

This work was part of the grants PID2024-160318OB-I00 and PCI2023-146016-2 funded by MICIU/AEI/10.13039/501100011033 and co-funded by the European Union.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in the GDC Portal at https://portal.gdc.cancer.gov/.

Acknowledgments

Large language models were used to improve the flow of certain paragraphs and to find grammatical errors. The use of the LLM does not negatively affect the empirical data and conclusions. The authors have thoroughly reviewed the text and are fully responsible for its content.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ABMILAttention-Based Multiple Instance Learning
AUCArea Under the Curve
CLAMClustering-Constrained Attention Multiple Instance Learning
CVCross Validation
DFTDDual-Feature Tokenization Multiple Instance Learning
DEGDifferentially Expressed Gene
FDRFalse Discovery Rate
GDCGenomic Data Commons
GOGene Ontology
H&EHematoxylin and Eosin
HCCHepatocellular Carcinoma
MILMultiple Instance Learning
MLPMulti-Layer Perceptron
mRMRMinimum Redundancy Maximum Relevance
MVIMicrovascular Invasion
ROCReceiver Operating Characteristic
ROIRegion of Interest
ROSReactive Oxygen Species
SSMSingle Somatic Mutation
SVMSupport Vector Machine
TCGAThe Cancer Genome Atlas
TransMILTransformer-based Multiple Instance Learning
WSIWhole-Slide Image
WSLWeakly Supervised Learning

References

  1. Zheng, J.; Wang, S.; Xia, L.; Sun, Z.; Chan, K.M.; Bernards, R.; Qin, W.; Chen, J.; Xia, Q.; Jin, H. Hepatocellular carcinoma: Signaling pathways and therapeutic advances. Signal Transduct. Target. Ther. 2025, 10, 35. [Google Scholar] [CrossRef] [PubMed]
  2. Alvarez, C.S.; Ortiz, J.; Bendfeldt-Avila, G.; Xie, Y.; Wang, M.; Wu, D.; Higson, H.; Lee, E.; Teshome, K.; Barnoya, J.; et al. Analysis of TP53 aflatoxin signature mutation in hepatocellular carcinomas from Guatemala: A cross-sectional study (2016--2017). Health Sci. Rep. 2020, 3, e155. [Google Scholar] [CrossRef] [PubMed]
  3. Lee, C.C.; Liu, J.Y.; Lin, J.K.; Chu, J.S.; Shew, J.Y. p53 point mutation enhanced by hepatic regeneration in aflatoxin B1-induced rat liver tumors and preneoplastic lesions. Cancer Lett. 1998, 125, 1–7. [Google Scholar] [CrossRef] [PubMed]
  4. Kastenhuber, E.R.; Lowe, S.W. Putting p53 in Context. Cell 2017, 170, 1062–1078. [Google Scholar] [CrossRef]
  5. Xue, Y.; Tian, T.; Ottallah, M.; Mannan, M.; Barkin, J.; Jin-Smith, B.; Pi, L. Alcohol-Associated Hepatocarcinogenesis: Wnt/β-Catenin in Action. Am. J. Pathol. 2025, 196, 151–160. [Google Scholar] [CrossRef]
  6. Trépo, E.; Caruso, S.; Yang, J.; Imbeaud, S.; Couchy, G.; Bayard, Q.; Letouzé, E.; Ganne-Carrié, N.; Moreno, C.; Oussalah, A.; et al. Common genetic variation in alcohol-related hepatocellular carcinoma: A case-control genome-wide association study. Lancet Oncol. 2022, 23, 161–171. [Google Scholar] [CrossRef]
  7. Dantzer, C.; Dif, L.; Vaché, J.; Basbous, S.; Billottet, C.; Moreau, V. Specific features of ß-catenin-mutated hepatocellular carcinomas. Br. J. Cancer 2024, 131, 1871–1880. [Google Scholar] [CrossRef]
  8. Guichard, C.; Amaddeo, G.; Imbeaud, S.; Ladeiro, Y.; Pelletier, L.; Maad, I.B.; Calderaro, J.; Bioulac-Sage, P.; Letexier, M.; Degos, F.; et al. Integrated analysis of somatic mutations and focal copy-number changes identifies key genes and pathways in hepatocellular carcinoma. Nat. Genet. 2012, 44, 694–698. [Google Scholar] [CrossRef]
  9. Rodríguez-Perálvarez, M.; Luong, T.V.; Andreana, L.; Meyer, T.; Dhillon, A.P.; Burroughs, A.K. A systematic review of microvascular invasion in hepatocellular carcinoma: Diagnostic and prognostic variability. Ann. Surg. Oncol. 2013, 20, 325–339. [Google Scholar] [CrossRef]
  10. Lim, K.W.; Chow, P.K.H.; Allen, J.C.; Chia, G.S.; Lim, M.; Cheow, P.C.; Chung, A.; Ooi, L.; Tan, S.B. Microvascular invasion is a better predictor of tumor recurrence and overall survival following surgical resection for hepatocellular carcinoma compared to the Milan criteria. Ann. Surg. 2011, 254, 108. [Google Scholar] [CrossRef]
  11. Sumie, S.; Kuromatsu, R.; Okuda, K.; Ando, E.; Takata, A.; Fukushima, N.; Watanabe, Y.; Kojiro, M.; Sata, M. Microvascular Invasion in Patients with Hepatocellular Carcinoma and Its Predictable Clinicopathological Factors. J. Hepatol. 2008, 15, 1375–1382. [Google Scholar] [CrossRef]
  12. Wei, K.; Peng, C.; Ou, Y.; Wang, P.; Zhan, C.; Wei, H.; Na, J.; Zhang, Z. Decoding Hepatocellular Carcinoma Metastasis: Molecular Mechanisms, Targeted Therapies, and Potential Biomarkers. Curr. Issues Mol. Biol. 2025, 47, 263. [Google Scholar] [CrossRef] [PubMed]
  13. Huang, H.; Wu, F.; Yu, Y.; Xu, B.; Chen, D.; Huo, Y.; Li, S. Multi-transcriptomics analysis of microvascular invasion-related malignant cells and development of a machine learning-based prognostic model in hepatocellular carcinoma. Front. Immunol. 2024, 15, 1436131. [Google Scholar] [CrossRef] [PubMed]
  14. Álvarez, M.; Heredia, A.J.; Garach, I.; Rojas, I.; Herrera, L.J.; Ortuño, F.M. Machine Learning-Based Diagnosis and Staging of Liver Cancer Using RNA-Seq Data. In Bioinformatics and Biomedical Engineering, Proceedings of the 11th International Conference, IWBBIO 2025, Meloneras, Gran Canaria, Spain, 16–18 July 2025; Rojas, I., Ortuño, F., Rojas Ruiz, F., Herrera, L.J., Valenzuela, O., Escobar, J.J., Eds.; Springer Nature: Cham, Switzerland, 2026; pp. 220–234. [Google Scholar] [CrossRef]
  15. National Cancer Institute (NCI); National Institutes of Health (NIH). The Cancer Genome Atlas Program (TCGA). 2022. Available online: https://www.cancer.gov/tcga (accessed on 26 December 2025).
  16. Heath, A.P.; Ferretti, V.; Agrawal, S.; An, M.; Angelakos, J.C.; Arya, R.; Bajari, R.; Baqar, B.; Barnowski, J.H.B.; Burt, J.; et al. The NCI Genomic Data Commons. Nat. Genet. 2021, 53, 257–262. [Google Scholar] [CrossRef] [PubMed]
  17. Castillo-Secilla, D.; Gálvez, J.M.; Carrillo-Perez, F.; Verona-Almeida, M.; Redondo-Sánchez, D.; Ortuno, F.M.; Herrera, L.J.; Rojas, I. KnowSeq R-Bioc package: The automatic smart gene expression tool for retrieving relevant biological knowledge. Comput. Biol. Med. 2021, 133, 104387. [Google Scholar] [CrossRef]
  18. Leek, J.T.; Storey, J.D. Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis. PLoS Genet. 2007, 3, e161. [Google Scholar] [CrossRef]
  19. Smyth, G.K. limma: Linear Models for Microarray Data. In Bioinformatics and Computational Biology Solutions Using R and Bioconductor; Gentleman, R., Carey, V.J., Huber, W., Irizarry, R.A., Dudoit, S., Eds.; Springer: New York, NY, USA, 2005; pp. 397–420. [Google Scholar] [CrossRef]
  20. Ding, C.; Peng, H. Minimum redundancy feature selection from microarray gene expression data. In Proceedings of the 2003 IEEE Bioinformatics Conference—CSB2003, Stanford, CA, USA, 11–14 August 2003; pp. 523–528. [Google Scholar] [CrossRef]
  21. Hearst, M.; Dumais, S.; Osuna, E.; Platt, J.; Scholkopf, B. Support vector machines. IEEE Intell. Syst. Their Appl. 1998, 13, 18–28. [Google Scholar] [CrossRef]
  22. Cawley, G.C.; Talbot, N.L. On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation. J. Mach. Learn. Res. 2010, 11, 2079–2107. [Google Scholar]
  23. Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man. Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]
  24. Bioptimus. H-optimus-1. Software, 2025. Available online: https://www.bioptimus.com (accessed on 26 December 2025).
  25. Ilse, M.; Tomczak, J.M.; Welling, M. Attention-based Deep Multiple Instance Learning. In Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018. [Google Scholar]
  26. Shao, Z.; Bian, H.; Chen, Y.; Wang, Y.; Zhang, J.; Ji, X.; Zhang, Y. TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification. In Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, Virtual, 6–14 December 2021; Ranzato, M., Beygelzimer, A., Dauphin, Y., Liang, P., Vaughan, J.W., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2021; Volume 34, pp. 2136–2147. [Google Scholar]
  27. Zhang, H.; Meng, Y.; Zhao, Y.; Qiao, Y.; Yang, X.; Coupland, S.E.; Zheng, Y. DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification. arXiv 2022, arXiv:2203.12081. [Google Scholar]
  28. Lu, M.Y.; Williamson, D.F.; Chen, T.Y.; Chen, R.J.; Barbieri, M.; Mahmood, F. Data-efficient and weakly supervised computational pathology on whole-slide images. Nat. Biomed. Eng. 2021, 5, 555–570. [Google Scholar] [CrossRef]
  29. Tanikawa, C.; Ri, C.; Kumar, V.; Nakamura, Y.; Matsuda, K. Crosstalk of EDA-A2/XEDAR in the p53 signaling pathway. Mol. Cancer Res. 2010, 8, 855–863. [Google Scholar] [CrossRef]
  30. Jiao, Y.; Ji, F.; Hou, L.; Zhang, J. A novel gene signature associated with anoikis predicts prognosis and unveils immune infiltration in breast cancer patients. Discov. Oncol. 2025, 16, 447. [Google Scholar] [CrossRef] [PubMed]
  31. Tanikawa, C.; Furukawa, Y.; Yoshida, N.; Arakawa, H.; Nakamura, Y.; Matsuda, K. XEDAR as a putative colorectal tumor suppressor that mediates p53-regulated anoikis pathway. Oncogene 2009, 28, 3081–3092. [Google Scholar] [CrossRef] [PubMed]
  32. Sugimura-Nagata, A.; Koshino, A.; Nagao, K.; Nagano, A.; Komura, M.; Ueki, A.; Ebi, M.; Ogasawara, N.; Tsuzuki, T.; Kasai, K.; et al. SPATA18 Expression Predicts Favorable Clinical Outcome in Colorectal Cancer. Int. J. Mol. Sci. 2022, 23, 2753. [Google Scholar] [CrossRef] [PubMed]
  33. Xie, L.; Liu, W.; Wang, Y.; Zhou, Y. High SPATA18 Expression and its Diagnostic and Prognostic Value in Clear Cell Renal Cell Carcinoma. Med. Sci. Monit. Int. Med. J. Exp. Clin. Res. 2023, 29, e938474. [Google Scholar] [CrossRef]
  34. Sargsyan, A.; Doridot, L.; Hannou, S.A.; Tong, W.; Srinivasan, H.; Ivison, R.; Monn, R.; Kou, H.H.; Haldeman, J.M.; Arlotto, M.; et al. HGFAC is a ChREBP-regulated hepatokine that enhances glucose and lipid homeostasis. JCI Insight 2022, 8, e153740. [Google Scholar] [CrossRef]
  35. Loughran, R.M.; Emerling, B.M. Mechanistic roles of mutant p53 governing lipid metabolism. Adv. Biol. Regul. 2022, 83, 100839. [Google Scholar] [CrossRef]
  36. Zou, Q.; Cao, S. miR-4270 suppresses hepatocellular carcinoma progression by inhibiting DNMT3A-mediated methylation of HGFAC promoter. PeerJ 2023, 11, e16566. [Google Scholar] [CrossRef]
  37. Yin, L.; Mu, Y.; Lin, Y.; Xia, Q. HGFAC expression decreased in liver cancer and its low expression correlated with DNA hypermethylation and poor prognosis. J. Cell. Biochem. 2019, 120, 9692–9699. [Google Scholar] [CrossRef]
  38. Zhu, L.; Yang, X.; Feng, J.; Mao, J.; Zhang, Q.; He, M.; Mi, Y.; Mei, Y.; Jin, G.; Zhang, H. CYP2E1 plays a suppressive role in hepatocellular carcinoma by regulating Wnt/Dvl2/β-catenin signaling. J. Transl. Med. 2022, 20, 194. [Google Scholar] [CrossRef] [PubMed]
  39. Roa-Colomo, A.; Lopez Garrido, M.A.; Molina-Vallejo, P.; Rojas, A.; Sanchez, M.G.; Aranda-Garcia, V.; Salmeron, J.; Romero-Gomez, M.; Muntane, J.; Padillo, J.; et al. Hepatocellular carcinoma risk-stratification based on ASGR1 in circulating epithelial cells for cancer interception. Front. Mol. Biosci. 2022, 9, 1074277. [Google Scholar] [CrossRef] [PubMed]
  40. Stockert, R.J. The asialoglycoprotein receptor: Relationships between structure, function, and expression. Physiol. Rev. 1995, 75, 591–609. [Google Scholar] [CrossRef] [PubMed]
  41. Gu, D.; Jin, H.; Jin, G.; Wang, C.; Wang, N.; Hu, F.; Luo, Q.; Chu, W.; Yao, M.; Qin, W. The asialoglycoprotein receptor suppresses the metastasis of hepatocellular carcinoma via LASS2-mediated inhibition of V-ATPase activity. Cancer Lett. 2016, 379, 107–116. [Google Scholar] [CrossRef]
  42. Bloomer, H.; Dame, H.B.; Parker, S.R.; Oudin, M.J. Neuronal mimicry in tumors: Lessons from neuroscience to tackle cancer. Cancer Metastasis Rev. 2025, 44, 31. [Google Scholar] [CrossRef]
  43. Zhang, Y.; Liao, Q.; Wen, X.; Fan, J.; Yuan, T.; Tong, X.; Jia, R.; Chai, P.; Fan, X. Hijacking of the nervous system in cancer: Mechanism and therapeutic targets. Mol. Cancer 2025, 24, 44. [Google Scholar] [CrossRef]
  44. Ben-Ari, Y. Excitatory actions of gaba during development: The nature of the nurture. Nat. Rev. Neurosci. 2002, 3, 728–739. [Google Scholar] [CrossRef]
  45. Liu, Y.; Li, Y.H.; Guo, F.J.; Wang, J.J.; Sun, R.L.; Hu, J.Y.; Li, G.C. Gamma-aminobutyric acid promotes human hepatocellular carcinoma growth through overexpressed gamma-aminobutyric acid A receptor alpha 3 subunit. World J. Gastroenterol. 2008, 14, 7175–7182. [Google Scholar] [CrossRef]
  46. Fischer, M.; Quaas, M.; Steiner, L.; Engeland, K. The p53-p21-DREAM-CDE/CHR pathway regulates G2/M cell cycle genes. Nucleic Acids Res. 2016, 44, 164–174. [Google Scholar] [CrossRef]
  47. Uxa, S.; Bernhart, S.H.; Mages, C.F.S.; Fischer, M.; Kohler, R.; Hoffmann, S.; Stadler, P.F.; Engeland, K.; Müller, G.A. DREAM and RB cooperate to induce gene repression and cell-cycle arrest in response to p53 activation. Nucleic Acids Res. 2019, 47, 9087–9103. [Google Scholar] [CrossRef]
  48. Wurzenberger, C.; Held, M.; Lampson, M.A.; Poser, I.; Hyman, A.A.; Gerlich, D.W. Sds22 and Repo-Man stabilize chromosome segregation by counteracting Aurora B on anaphase kinetochores. J. Cell Biol. 2012, 198, 173–183. [Google Scholar] [CrossRef]
  49. Long, Z.; Wu, T.; Tian, Q.; Carlson, L.A.; Wang, W.; Wu, G. Expression and prognosis analyses of BUB1, BUB1B and BUB3 in human sarcoma. Aging 2021, 13, 12395–12409. [Google Scholar] [CrossRef]
  50. Peng, B.; Zhang, J.; Xiang, Y. CDCA8 and its multifaceted role in tumorigenesis. Biomed. Pharmacother. 2025, 185, 117951. [Google Scholar] [CrossRef]
  51. Redman-Rivera, L.N.; Shaver, T.M.; Jin, H.; Marshall, C.B.; Schafer, J.M.; Sheng, Q.; Hongo, R.A.; Beckermann, K.E.; Wheeler, F.C.; Lehmann, B.D.; et al. Acquisition of aneuploidy drives mutant p53-associated gain-of-function phenotypes. Nat. Commun. 2021, 12, 5184. [Google Scholar] [CrossRef] [PubMed]
  52. Stancikova, J.; Krausova, M.; Kolar, M.; Fafilek, B.; Svec, J.; Sedlacek, R.; Neroldova, M.; Dobes, J.; Horazna, M.; Janeckova, L.; et al. NKD1 marks intestinal and liver tumors linked to aberrant Wnt signaling. Cell. Signal. 2015, 27, 245–256. [Google Scholar] [CrossRef] [PubMed][Green Version]
  53. Lu, W.; Tang, J.; Wang, Y.; Gu, X.; Zhang, H.; Liu, Y.; Xiao, Y.; Zhu, Q.; Deng, J.; Shen, Y.; et al. NKD1 enhances colon cancer progression by inhibiting the autophagic degradation of MYC. Cell Death Dis. 2025, 16, 532. [Google Scholar] [CrossRef]
  54. Faivre, J.; Shalhoub, H.; Nguyen, T.S.; Xie, H.; Moniaux, N. REG3A: A Multifunctional Antioxidant Lectin at the Crossroads of Microbiota Regulation, Inflammation, and Cancer. Cancers 2025, 17, 2395. [Google Scholar] [CrossRef] [PubMed]
  55. Yuan, R.H.; Jeng, Y.M.; Chen, H.L.; Hsieh, F.J.; Yang, C.Y.; Lee, P.H.; Hsu, H.C. Opposite Roles of Human Pancreatitis-Associated Protein and REG1A Expression in Hepatocellular Carcinoma: Association of Pancreatitis-Associated Protein Expression with Low-Stage Hepatocellular Carcinoma, β-Catenin Mutation, and Favorable Prognosis. Clin. Cancer Res. 2005, 11, 2568–2575. [Google Scholar] [CrossRef]
  56. Ye, Y.; Xiao, L.; Wang, S.J.; Yue, W.; Yin, Q.S.; Sun, M.Y.; Xia, W.; Shao, Z.Y.; Zhang, H. Up-regulation of REG3A in colorectal cancer cells confers proliferation and correlates with colorectal cancer risk. Oncotarget 2015, 7, 3921–3933. [Google Scholar] [CrossRef]
  57. Yamaguchi, K.; Horie, C.; Takane, K.; Ikenoue, T.; Nakagawa, S.; Isobe, Y.; Ota, Y.; Ushiku, T.; Tanaka, M.; Fujishiro, J.; et al. Identification of odontogenic ameloblast associated as a novel target gene of the Wnt/β-catenin signaling pathway. Cancer Sci. 2023, 114, 948–960. [Google Scholar] [CrossRef]
  58. Désert, R.; Rohart, F.; Canal, F.; Sicard, M.; Desille, M.; Renaud, S.; Turlin, B.; Bellaud, P.; Perret, C.; Clément, B.; et al. Human hepatocellular carcinomas with a periportal phenotype have the lowest potential for early recurrence after curative resection. Hepatology 2017, 66, 1502–1518. [Google Scholar] [CrossRef] [PubMed]
  59. Kestler, D.P.; Foster, J.S.; Bruker, C.T.; Prenshaw, J.W.; Kennel, S.J.; Wall, J.S.; Weiss, D.T.; Solomon, A. ODAM Expression Inhibits Human Breast Cancer Tumorigenesis. Breast Cancer Basic Clin. Res. 2011, 5, 73–85. [Google Scholar] [CrossRef] [PubMed]
  60. Krutsenko, Y.; Singhi, A.D.; Monga, S.P. β-Catenin Activation in Hepatocellular Cancer: Implications in Biology and Therapy. Cancers 2021, 13, 1830. [Google Scholar] [CrossRef] [PubMed]
  61. Raven, A.; Gilroy, K.; Jin, H.; Waldron, J.A.; Leslie, H.; Munro, J.; Hall, H.; Ridgway, R.A.; Ford, C.A.; Gulhan, D.C.; et al. Hepatic zonation determines tumorigenic potential of mutant β-catenin. Nature 2025, 649, 739–748. [Google Scholar] [CrossRef]
  62. Cederbaum, A.I. Nrf2 and antioxidant defense against CYP2E1 toxicity. Sub-Cell. Biochem. 2013, 67, 105–130. [Google Scholar] [CrossRef]
  63. Lehrich, B.M.; Tao, J.; Liu, S.; Hirsch, T.Z.; Yasaka, T.M.; Cao, C.; Delgado, E.R.; Guan, X.; Lu, S.; Pan, L.; et al. Development of mutated β-catenin gene signature to identify CTNNB1 mutations from whole and spatial transcriptomic data in patients with HCC. JHEP Rep. Innov. Hepatol. 2024, 6, 101186. [Google Scholar] [CrossRef]
  64. Calderaro, J.; Nault, J.C.; Bioulac-Sage, P.; Laurent, A.; Blanc, J.F.; Decaens, T.; Zucman-Rossi, J. ALDH3A1 is overexpressed in a subset of hepatocellular carcinoma characterised by activation of the Wnt/β-catenin pathway. Virchows Arch. Int. J. Pathol. 2014, 464, 53–60. [Google Scholar] [CrossRef]
  65. Marchitti, S.A.; Brocker, C.; Stagos, D.; Vasiliou, V. Non-P450 aldehyde oxidizing enzymes: The aldehyde dehydrogenase superfamily. Expert Opin. Drug Metab. Toxicol. 2008, 4, 697–720. [Google Scholar] [CrossRef]
  66. Pinyol, R.; Sia, D.; Llovet, J.M. Immune exclusion-Wnt/CTNNB1 class predicts resistance to immunotherapies in HCC. Clin. Cancer Res. Off. J. Am. Assoc. Cancer Res. 2019, 25, 2021–2023. [Google Scholar] [CrossRef]
  67. Li, M.J.; Wen, C.L.; Cheng, H.T.; Lyu, H.N.; Han, Y.Y. PANoptosis in hepatocellular carcinoma: Underlying mechanisms. World J. Hepatol. 2025, 17, 109051. [Google Scholar] [CrossRef]
  68. Arrè, V.; Negro, R.; Giannelli, G. The role of inflammasomes in hepatocellular carcinoma: Mechanisms and therapeutic insights. Ann. Hepatol. 2025, 30, 101772. [Google Scholar] [CrossRef]
  69. Hu, M.; Deng, F.; Song, X.; Zhao, H.; Yan, F. The crosstalk between immune cells and tumor pyroptosis: Advancing cancer immunotherapy strategies. J. Exp. Clin. Cancer Res. 2024, 43, 190. [Google Scholar] [CrossRef] [PubMed]
  70. Kinzler, M.N.; Bankov, K.; Bein, J.; Döring, C.; Schulze, F.; Reis, H.; Mahmoudi, S.; Koch, V.; Grünewald, L.D.; Stehle, A.; et al. CXCL1 and CXCL6 Are Potential Predictors for HCC Response to TACE. Curr. Oncol. 2023, 30, 3516–3528. [Google Scholar] [CrossRef] [PubMed]
  71. Treyer, A.; Müsch, A. Hepatocyte Polarity. Compr. Physiol. 2013, 3, 243–287. [Google Scholar] [CrossRef] [PubMed]
  72. Elferink, R.P.; Tytgat, G.N.; Groen, A.K. Hepatic canalicular membrane 1: The role of mdr2 P-glycoprotein in hepatobiliary lipid transport. FASEB J. Off. Publ. Fed. Am. Soc. Exp. Biol. 1997, 11, 19–28. [Google Scholar] [CrossRef]
  73. Feng, J.; Li, J.; Qie, P.; Li, Z.; Xu, Y.; Tian, Z. Long non-coding RNA (lncRNA) PGM5P4-AS1 inhibits lung cancer progression by up-regulating leucine zipper tumor suppressor (LZTS3) through sponging microRNA miR-1275. Bioengineered 2021, 12, 196–207. [Google Scholar] [CrossRef]
  74. Rajput, M.; Pandey, M.; Dixit, R.; Shukla, V.K. Is cross-species horizontal gene transfer responsible for gallbladder carcinogenesis. World J. Surg. Oncol. 2024, 22, 201. [Google Scholar] [CrossRef]
  75. Liu, Y.; Yang, B.; Su, Y.; Xiang, Q.; Li, Q. Downregulation of long noncoding RNA LINC00683 associated with unfavorable prognosis in prostate cancer based on TCGA. J. Cell. Biochem. 2019, 120, 14165–14174. [Google Scholar] [CrossRef]
  76. Li, M.; Tian, X.; Guo, H.; Xu, X.; Liu, Y.; Hao, X.; Fei, H. A novel lncRNA-mRNA-miRNA signature predicts recurrence and disease-free survival in cervical cancer. Braz. J. Med Biol. Res. 2021, 54, e11592. [Google Scholar] [CrossRef]
  77. Wang, P.; Zhuang, C.; Huang, D.; Xu, K. Downregulation of miR-377 contributes to IRX3 deregulation in hepatocellular carcinoma. Oncol. Rep. 2016, 36, 247–252. [Google Scholar] [CrossRef]
  78. Liu, K.Y.; Wang, L.T.; Hsu, S.H.; Wang, S.N. Homeobox Genes and Hepatocellular Carcinoma. Cancers 2019, 11, 621. [Google Scholar] [CrossRef]
  79. Yu, Y.; Ren, K. Five long non-coding RNAs establish a prognostic nomogram and construct a competing endogenous RNA network in the progression of non-small cell lung cancer. BMC Cancer 2021, 21, 457. [Google Scholar] [CrossRef]
Figure 1. Cohort selection and class distribution from the TCGA-LIHC cohort.
Figure 1. Cohort selection and class distribution from the TCGA-LIHC cohort.
Genes 17 00190 g001
Figure 2. Weakly supervised learning (WSL) pipeline for whole-slide image (WSI) classification. The pipeline comprises four key stages: (1) tissue segmentation and preprocessing, (2) patch tessellation and feature extraction, (3) MIL-based feature aggregation, and (4) slide-level classification with interpretability support (Red indicates high attention, and blue indicates low attention).
Figure 2. Weakly supervised learning (WSL) pipeline for whole-slide image (WSI) classification. The pipeline comprises four key stages: (1) tissue segmentation and preprocessing, (2) patch tessellation and feature extraction, (3) MIL-based feature aggregation, and (4) slide-level classification with interpretability support (Red indicates high attention, and blue indicates low attention).
Genes 17 00190 g002
Figure 3. TP53 DEGs Gene Ontology enrichment analysis.
Figure 3. TP53 DEGs Gene Ontology enrichment analysis.
Genes 17 00190 g003
Figure 4. Confusion matrix of the CLAM model for TP53 mutation prediction at the whole-slide level.
Figure 4. Confusion matrix of the CLAM model for TP53 mutation prediction at the whole-slide level.
Genes 17 00190 g004
Figure 5. Representative example of TP53 mutation prediction at the whole-slide level. (A) Original H&E-stained whole-slide image. (B,C) Attention heatmaps generated by the two best-performing models (CLAM and ABMIL), highlighting regions contributing most to the slide-level prediction (Red indicates high attention, and blue indicates low attention). (D) Patch with the highest positive attention score, corresponding to strong evidence for TP53 mutation. (E) Patch with the lowest attention score, representing negative evidence.
Figure 5. Representative example of TP53 mutation prediction at the whole-slide level. (A) Original H&E-stained whole-slide image. (B,C) Attention heatmaps generated by the two best-performing models (CLAM and ABMIL), highlighting regions contributing most to the slide-level prediction (Red indicates high attention, and blue indicates low attention). (D) Patch with the highest positive attention score, corresponding to strong evidence for TP53 mutation. (E) Patch with the lowest attention score, representing negative evidence.
Genes 17 00190 g005
Figure 6. CTNNB1 DEGs Gene Ontology enrichment analysis.
Figure 6. CTNNB1 DEGs Gene Ontology enrichment analysis.
Genes 17 00190 g006
Figure 7. Confusion matrix of the ABMIL model for CTNNB1 mutation prediction at the whole-slide level.
Figure 7. Confusion matrix of the ABMIL model for CTNNB1 mutation prediction at the whole-slide level.
Genes 17 00190 g007
Figure 8. Representative example of CTNNB1 mutation prediction at the whole-slide level. (A) Original H&E-stained whole-slide image. (B,C) Attention heatmaps generated by the two best-performing models (CLAM and ABMIL), highlighting regions contributing most to the slide-level prediction (Red indicates high attention, and blue indicates low attention). (D) Patch with the highest positive attention score, corresponding to strong evidence for the CTNNB1 mutation. (E) Patch with the lowest attention score, representing negative evidence.
Figure 8. Representative example of CTNNB1 mutation prediction at the whole-slide level. (A) Original H&E-stained whole-slide image. (B,C) Attention heatmaps generated by the two best-performing models (CLAM and ABMIL), highlighting regions contributing most to the slide-level prediction (Red indicates high attention, and blue indicates low attention). (D) Patch with the highest positive attention score, corresponding to strong evidence for the CTNNB1 mutation. (E) Patch with the lowest attention score, representing negative evidence.
Genes 17 00190 g008
Figure 9. Evolution of cross-validation classification metrics across the 5-fold mRMR rankings. Solid lines denote mean performance, while the shaded colored areas indicate the standard deviation across the folds.
Figure 9. Evolution of cross-validation classification metrics across the 5-fold mRMR rankings. Solid lines denote mean performance, while the shaded colored areas indicate the standard deviation across the folds.
Genes 17 00190 g009
Figure 10. Confusion matrix of the CLAM model for vascular invasion prediction at the whole-slide level.
Figure 10. Confusion matrix of the CLAM model for vascular invasion prediction at the whole-slide level.
Genes 17 00190 g010
Figure 11. Representative example of vascular invasion prediction at the whole-slide level. (A) Original H&E-stained whole-slide image. (B,C) Attention heatmaps generated by CLAM and ABMIL, highlighting regions contributing most to the slide-level prediction (Red indicates high attention, and blue indicates low attention). (D) Patch with the highest positive attention score, corresponding to strong evidence for vascular invasion. (E) Patch with the lowest attention score, representing negative evidence.
Figure 11. Representative example of vascular invasion prediction at the whole-slide level. (A) Original H&E-stained whole-slide image. (B,C) Attention heatmaps generated by CLAM and ABMIL, highlighting regions contributing most to the slide-level prediction (Red indicates high attention, and blue indicates low attention). (D) Patch with the highest positive attention score, corresponding to strong evidence for vascular invasion. (E) Patch with the lowest attention score, representing negative evidence.
Genes 17 00190 g011
Table 1. Differentially expressed genes identification thresholds and number of DEGs retained for the three comparisons.
Table 1. Differentially expressed genes identification thresholds and number of DEGs retained for the three comparisons.
Comparison | log 2   Fold   Change | FDR p-ValueDEGs Retained
TP53 Mutant vs. Wild Type0.80.001249
CTNNB1 Mutant vs. Wild Type1.00.001245
Vascular Invasion Presence0.50.001102 common
Table 2. Key differentially expressed genes in TP53-mutated HCC.
Table 2. Key differentially expressed genes in TP53-mutated HCC.
GeneLogFCMechanistic Association in TP53 Mutant Context
PPP1R7 2.69 Loss of p53-dependent mitotic control; resistance to apoptosis.
EDA2R 2.61 Failure of p53-mediated anoikis signaling; facilitates metastasis.
SPATA18 1.86 Metabolic reprogramming; poor prognosis.
HGFAC 1.89 Disrupts lipid and glucose homeostasis.
CYP2E1 1.44 Loss of ethanol and fatty acid metabolism; reduces oxidative stress.
ASGR2 1.09 Loss of glycoprotein clearance promotes metastasis.
GABRA2 + 1.26 GABAergic signaling; proliferation and neuronal mimicry.
TOP2A + 0.85 Derepression of the p53–p21–DREAM pathway.
BUB1B + 0.84 Aneuploidy and mitotic progression despite errors.
Table 3. Comparison of attention-based MIL models for TP53 mutation prediction. Sensitivity and F1-score correspond to the positive (mutated) class. The best results are marked in bold.
Table 3. Comparison of attention-based MIL models for TP53 mutation prediction. Sensitivity and F1-score correspond to the positive (mutated) class. The best results are marked in bold.
ModelROC-AUC (±std)SensitivityF1-Score
ABMIL0.79 ± 0.070.720.63
TransMIL0.73 ± 0.090.430.46
DFTD0.75 ± 0.070.590.52
CLAM0.82 ± 0.070.700.63
Table 4. Key differentially expressed genes in CTNNB1-mutated HCC.
Table 4. Key differentially expressed genes in CTNNB1-mutated HCC.
GeneLogFCMechanistic Association in CTNNB1 Mutant Context
CXCL1 1.42 Limits neutrophil recruitment; weakens innate immunity.
CXCL6 1.29 Suppresses chemokine-mediated immune cell infiltration.
CASP1 1.02 Inhibition of inflammasome activation; immune escape.
NKD1 + 2.58 Negative feedback regulator of Wnt/ β - signaling; proliferation.
REG3A + 2.70 Resistance to apoptosis under oxidative or toxic stress.
ODAM + 2.31 Wnt/ β -catenin target in liver cancer; less aggressive phenotype.
CYP2E1 + 2.32 Metabolic reprogramming; increased oxidative stress.
ALDH3A1 + 2.69 Survival under stress; resistance to alkylating chemotherapies.
Table 5. Comparison of attention-based MIL models for CTNNB1 mutation prediction. Sensitivity and F1-score correspond to the positive (mutated) class. The best results are marked in bold.
Table 5. Comparison of attention-based MIL models for CTNNB1 mutation prediction. Sensitivity and F1-score correspond to the positive (mutated) class. The best results are marked in bold.
ModelROC-AUC (±std)SensitivityF1-Score
ABMIL0.79 ± 0.020.720.65
TransMIL0.66 ± 0.050.470.43
DFTD0.67 ± 0.040.560.49
CLAM0.79 ± 0.050.650.62
Table 6. Comparison of attention-based MIL models for vascular invasion prediction. Sensitivity and F1-score correspond to the positive (presence of invasion) class. The best results are marked in bold.
Table 6. Comparison of attention-based MIL models for vascular invasion prediction. Sensitivity and F1-score correspond to the positive (presence of invasion) class. The best results are marked in bold.
ModelROC-AUC (±std)SensitivityF1-Score
ABMIL0.61 ± 0.100.520.45
TransMIL0.60 ± 0.080.540.47
DFTD0.57 ± 0.130.620.45
CLAM0.66 ± 0.100.440.43
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Garach, I.; Hernandez, N.; Herrera, L.J.; Ortuño, F.M.; Rojas, I. Integrated Transcriptomic and Histological Analysis of TP53/CTNNB1 Mutations and Microvascular Invasion in Hepatocellular Carcinoma. Genes 2026, 17, 190. https://doi.org/10.3390/genes17020190

AMA Style

Garach I, Hernandez N, Herrera LJ, Ortuño FM, Rojas I. Integrated Transcriptomic and Histological Analysis of TP53/CTNNB1 Mutations and Microvascular Invasion in Hepatocellular Carcinoma. Genes. 2026; 17(2):190. https://doi.org/10.3390/genes17020190

Chicago/Turabian Style

Garach, Ignacio, Nerea Hernandez, Luis J. Herrera, Francisco M. Ortuño, and Ignacio Rojas. 2026. "Integrated Transcriptomic and Histological Analysis of TP53/CTNNB1 Mutations and Microvascular Invasion in Hepatocellular Carcinoma" Genes 17, no. 2: 190. https://doi.org/10.3390/genes17020190

APA Style

Garach, I., Hernandez, N., Herrera, L. J., Ortuño, F. M., & Rojas, I. (2026). Integrated Transcriptomic and Histological Analysis of TP53/CTNNB1 Mutations and Microvascular Invasion in Hepatocellular Carcinoma. Genes, 17(2), 190. https://doi.org/10.3390/genes17020190

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop