Transcriptomics Unveil Canonical and Non-Canonical Heat Shock-Induced Pathways in Human Cell Lines

Reinschmidt, Andrew; Solano, Luis; Chavez, Yonny; Hulsy, William Drew; Nikolaidis, Nikolas

doi:10.3390/ijms26031057

by Andrew Reinschmidt, Luis Solano †, Yonny Chavez, William Drew Hulsy and Nikolas Nikolaidis *

Department of Biological Science, Center for Applied Biotechnology Studies, and Center for Computational and Applied Mathematics, California State University Fullerton, Fullerton, CA 92831, USA

* Author to whom correspondence should be addressed.

† Current address: Center for Complex Biological Systems, University of California, Irvine, CA 92697, USA.

Int. J. Mol. Sci. 2025, 26(3), 1057; https://doi.org/10.3390/ijms26031057

Submission received: 27 December 2024 / Revised: 20 January 2025 / Accepted: 22 January 2025 / Published: 26 January 2025

(This article belongs to the Collection Feature Papers in Molecular Biophysics)


Abstract

The cellular stress response (CSR) is a conserved mechanism that protects cells from environmental and physiological stressors. The heat shock response (HSR), a critical component of the CSR, utilizes molecular chaperones to mitigate proteotoxic stress caused by elevated temperatures. We hypothesized that while the canonical HSR pathways are conserved across cell types, specific cell lines may exhibit unique transcriptional responses to heat shock. To test this, we compared the transcriptomic responses of HEK293, HepG2, and HeLa cells under control conditions, immediately following heat shock, and after an 8-h recovery period. RNA sequencing revealed the conserved activation of canonical HSR pathways, including the unfolded protein response, alongside the enrichment of the non-canonical “Receptor Ligand Activity” pathway across all cell lines. Cell-line-specific variations were observed, with HepG2 cells exhibiting significantly higher expression levels of certain genes compared to other cell lines under stress conditions, as well as greater fold changes in gene expression relative to their control conditions. Validation by qPCR confirmed the activation of key genes within the “Receptor Ligand Activity” pathway across time points. These findings provide insights into conserved and context-specific aspects of the HSR, contributing to a more comprehensive understanding of stress response mechanisms across mammalian cells.

Keywords:

heat shock response (HSR); RNA sequencing; cellular stress response (CSR)

1. Introduction

The cellular stress response (CSR) is a critical survival mechanism that helps cells cope with environmental stressors, and the heat shock response (HSR) is one of its most essential components [1,2,3,4,5,6,7]. The HSR is primarily activated by temperature increases and other stressors, triggering the activation of heat shock factors (HSFs), especially HSF1, which then drive the expression of heat shock proteins (HSPs). These molecular chaperones, including HSP70 and HSP90, facilitate protein homeostasis by refolding misfolded proteins, preventing aggregation, and modulating apoptosis [3,4,5,8,9,10,11,12]. While the canonical HSR is well-characterized, the full scope of its regulatory mechanisms and potential adaptations to different cellular contexts remains incompletely understood [13]. In particular, non-canonical HSR pathways and their roles in cellular stress adaptation are less explored [14], despite their potential significance in various disease processes.

In cancer, the HSR’s molecular machinery is hijacked for cellular survival, enabling tumor cells to withstand the proteotoxic stress generated by rapid proliferation, hypoxia, and therapeutic interventions. Cancer cells frequently upregulate key HSR components, including HSF1, to maintain proteostasis and resist apoptosis induced by therapies [15,16,17,18]. While much is known about the activation of canonical HSR pathways in cancer, a significant gap remains in understanding how these pathways are modulated in cancer cells and whether non-canonical HSR pathways contribute to the enhanced stress resistance observed in tumors. Current research broadly focuses on the well-established roles of HSPs and HSF1. Still, the complexity of the HSR—particularly its dynamic crosstalk with other stress response networks like the unfolded protein response (UPR) and DNA damage response (DDR)—and how these interactions regulate stress resistance in tumors remain underexplored [1,2,3,4,5,6,7,8,9,10,11,12].

This study aims to fill these gaps by conducting a comprehensive RNA sequencing (RNA-seq) analysis comparing cancerous (HeLa, HepG2) and non-cancerous (HEK293) cell lines exposed to heat stress. By comparing these cell lines, we aim to identify both conserved and cell-specific mechanisms regulating the HSR, with particular emphasis on uncovering non-canonical pathways that may play a significant role in cellular stress adaptation. We hypothesize that, while canonical HSR pathways are activated across all cell types, cancer cells will exhibit unique regulatory modifications and interactions, particularly involving non-canonical HSR pathways. These candidate pathways may contribute to enhanced survival and therapy resistance phenotypes.

By identifying these novel non-canonical HSR targets, we anticipate uncovering new insights into how cells adapt to stress, particularly in the context of cancer. Moreover, our study will provide a deeper understanding of the full spectrum of HSR mechanisms, including those understudied. This research has broad implications not only for cancer biology but also for understanding the role of stress responses in a variety of other diseases, including neurodegenerative disorders and aging, where similar proteotoxic stress is a hallmark [19,20]. By uncovering the molecular mechanisms driving non-canonical HSR pathways, this work can potentially inform the development of novel therapeutic strategies that target stress responses, providing new avenues for treating cancer and other stress-related diseases.

2. Results

2.1. Principal Component Analysis

To identify key drivers of variance in the transcriptomic dataset, we conducted a principal component analysis (PCA) using VST-normalized read counts of the entire dataset [all three cell lines (HEK293, HepG2, and HeLa) under three conditions (control, 0 h recovery after heat shock, and 8 h recovery after heat shock) across two experimental batches (Batch 1 and Batch 2)]. PCA results of the entire dataset reveal that samples primarily cluster by cell line, regardless of heat shock condition or batch, indicating that cell line type is the most significant source of variance in gene expression (Figure 1).

Further PCA analyses on individual cell lines show that within-cell line variance is predominantly driven by experimental batch effects (Supplementary Figure S1). These findings necessitated batch-specific analyses to mitigate confounding effects.

When the data were stratified by cell line and batch, PCA showed that heat shock condition (0 h or 8 h recovery) explained most of the variance within each cell line (Figure 2). For the HEK293 cell line, heat shock (0 h recovery) explained 81% of the variance in batch 1 and 65% in batch 2 (Figure 2A,D), indicating a robust transcriptional response immediately following stress. For HeLa cells, heat shock accounted for 77% of the variance in batch 1, while recovery from heat shock explained 53% of the variance in batch 2 (Figure 2B,E). HepG2 cells displayed a distinct pattern: most of the variance was explained by recovery from heat shock (8 h), with PC1 capturing 80% of the variance in batch 1 and 78% in batch 2 (Figure 2C,F). This pattern highlights cell-line-specific differences in the heat shock response (HSR). Collectively, these findings establish that cell type, batch, and heat shock are the principal drivers of transcriptomic variance and reveal distinct stress response and recovery dynamics across cell lines.

2.2. Differential Gene Expression Analysis

We performed a differential gene expression (DGE) analysis to evaluate transcriptional changes across conditions for each cell line. The number of differentially expressed genes (DEGs) varied across conditions and batches, reflecting biological and technical factors. Differences in cell line-specific responses to heat shock, recovery dynamics, and batch-specific technical variability likely contributed to these discrepancies. Additionally, the increased sample size in Batch 2 provided greater statistical power to detect smaller changes in gene expression.

Histograms (Figure 3) illustrate significant differences in DEG counts, with HepG2 cells consistently showing higher numbers of DEGs than HEK293 and HeLa cells. For batch 1 (Figure 3 top panels), the DEG counts for HEK293 were 3444 (0 h vs. control), 2096 (8 h vs. control) and 3802 (8 h vs. 0 h). Similarly, HeLa showed 2168, 1590 and 3916 DEGs for these comparisons, respectively, while HepG2 exhibited 4609, 9948 and 9219 DEGs. In batch 2 (Figure 3 bottom panels), HEK293 had 1639, 947 and 2955 DEGs; HeLa had 2613, 4877 and 5076 DEGs; and HepG2 exhibited 3547, 9665 and 8791 DEGs for the same comparisons.

Volcano plots (Figure 4 and Supplementary Figure S2) show the fold change and statistical significance of DEGs for each comparison. In Batch 1, for the 0 h vs. control comparison, 1867 genes were overexpressed and 1577 were underexpressed in HEK293; 1661 were overexpressed and 507 were underexpressed in HeLa; and 2188 were overexpressed and 2421 were underexpressed in HepG2.
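As a minimal sketch of how the over- and underexpressed counts behind a volcano plot can be tallied, the following Python snippet splits a results table by the sign of the log2 fold change among genes that pass an adjusted p-value cutoff. The cutoff of 0.05, the function and field names, and the placeholder genes are illustrative assumptions; they are not the thresholds or data used in this study.

```python
from typing import Iterable, NamedTuple


class GeneResult(NamedTuple):
    gene: str                # placeholder identifier (hypothetical)
    log2_fold_change: float  # treated vs. control
    padj: float              # multiple-testing adjusted p-value


def tally_degs(results: Iterable[GeneResult], padj_cutoff: float = 0.05) -> dict:
    """Count significant genes by direction of change.

    padj_cutoff is an assumed significance threshold, not the study's.
    """
    over = under = 0
    for r in results:
        if r.padj >= padj_cutoff:
            continue  # not significant: neither over- nor underexpressed
        if r.log2_fold_change > 0:
            over += 1
        elif r.log2_fold_change < 0:
            under += 1
    return {"over": over, "under": under, "total": over + under}


# Toy table with made-up values, for illustration only.
toy_results = [
    GeneResult("gene_a", 6.0, 1e-30),
    GeneResult("gene_b", 2.5, 1e-4),
    GeneResult("gene_c", -1.2, 0.003),
    GeneResult("gene_d", 0.1, 0.80),
]
toy_counts = tally_degs(toy_results)
```

The same additive relationship links the numbers reported in the text: for HEK293 in Batch 1, 1867 overexpressed plus 1577 underexpressed genes give the 3444 DEGs listed for 0 h vs. control.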

Key heat shock response genes, including HSPA6, HSPA1A, HSPA1B, BAG3, and DNAJB1, were significantly overexpressed across all cell lines and conditions, with HepG2 cells demonstrating the most robust transcriptional response. These genes play critical roles in protein folding, chaperoning, and cellular recovery from stress, highlighting their centrality in the heat shock response. Notably, HSPA6 and HSPA1A exhibited a continuous increase in expression at 8 h post-recovery in HepG2 cells, contrasting with the other cell lines, where their expression dropped at 8 h (remaining higher than control but lower than 0 h). This distinct pattern suggests a prolonged activation of the heat shock response in HepG2 cells, potentially reflecting their reliance on enhanced proteostasis under stress. A comprehensive summary of HSP expression, including log2 fold changes and adjusted p-values, is provided in Supplementary Table S1.

The next step involved identifying conserved genes shared between the cell lines to further elucidate the similarities in gene expression during mild heat shock across HeLa, HEK293, and HepG2 cells. Conserved DEGs across all three cell lines and batches included 405 genes overexpressed at 0 h post-heat shock vs. Control, 196 genes overexpressed at 8 h post-heat shock vs. 0 h, and 159 genes overexpressed at 8 h post-heat shock vs. Control (Figure 5A,C,E).

Underexpressed genes shared across all cell lines and batches included 123 genes at 0 h post-heat shock vs. Control, 263 genes at 8 h post-heat shock vs. 0 h, and 50 genes at 8 h post-heat shock vs. Control (Figure 5B,D,F).
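Conserved DEGs of this kind amount to a set intersection over every cell line and batch. The Python sketch below illustrates that idea under stated assumptions: the dictionary layout, the function name, and the placeholder gene identifiers are hypothetical and do not reproduce the authors' pipeline.

```python
def conserved_genes(deg_sets):
    """Return genes present in every DEG set.

    deg_sets maps a (cell_line, batch) key to the set of genes called
    over- (or under-) expressed for one comparison.
    """
    sets = [set(genes) for genes in deg_sets.values()]
    if not sets:
        return set()
    # A gene is conserved only if it appears in all sets.
    return sets[0].intersection(*sets[1:])


# Placeholder gene identifiers, for illustration only.
toy_overexpressed = {
    ("HEK293", 1): {"g1", "g2", "g3"},
    ("HEK293", 2): {"g1", "g2", "g4"},
    ("HeLa", 1): {"g1", "g2", "g5"},
    ("HeLa", 2): {"g1", "g2"},
    ("HepG2", 1): {"g1", "g2", "g3", "g6"},
    ("HepG2", 2): {"g1", "g2", "g7"},
}
shared = conserved_genes(toy_overexpressed)
```

Running the intersection separately for overexpressed and underexpressed genes keeps a gene that changes direction between cell lines from being counted as conserved.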

2.3. Functional Enrichment Analysis

To elucidate biological processes associated with DEGs, we performed functional enrichment analyses using GSEA and STRING, conducting these analyses separately for each batch due to batch effects. Dot plots (Figure 6 and Supplementary Figure S3) highlight the top ten gene sets with the highest and lowest normalized enrichment scores (NESs) across comparisons. In HEK293 (0 h vs. Control), enriched gene sets related to protein folding and response to unfolded proteins were consistently observed in both batches. However, HeLa and HepG2 showed less overlap in enriched gene sets between batches, suggesting a more significant batch-dependent variability.
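Selecting the gene sets shown in such dot plots is a ranking step on the normalized enrichment score. The following Python sketch shows one way to pick the highest and lowest scoring sets; the function name, the tuple layout, and the toy scores are assumptions made for illustration, and the paper's plots use ten sets per direction rather than the two used here.

```python
def top_and_bottom_by_nes(gene_sets, n=10):
    """Return the n highest- and n lowest-NES gene sets.

    gene_sets is a list of (name, nes) tuples. The lowest sets are
    returned most negative first, and a set is never listed twice
    when fewer than 2 * n sets are available.
    """
    ranked = sorted(gene_sets, key=lambda item: item[1], reverse=True)
    top = ranked[:n]
    bottom = [item for item in reversed(ranked) if item not in top][:n]
    return top, bottom


# Made-up scores, for illustration only.
toy_sets = [
    ("set_a", 2.4),
    ("set_b", -1.9),
    ("set_c", 1.1),
    ("set_d", -2.2),
    ("set_e", 0.3),
]
toy_top, toy_bottom = top_and_bottom_by_nes(toy_sets, n=2)
```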

Distribution plots (Figure 7 and Supplementary Figure S4) of enriched genes within the top 15 NES gene sets revealed similar enrichment profiles for genes involved in protein folding and chaperone activity in HEK293. In contrast, unique enrichment distributions were observed in HeLa and HepG2, reflecting differences in stress response pathways.

Enrichment maps (Figure 8 and Supplementary Figure S5C–F) visualized interactions among enriched gene sets. In HEK293, gene sets related to protein folding and heat shock response formed cohesive clusters. In HeLa and HepG2, fewer interconnections were observed, with unexpected enrichments in immune-related pathways, such as B cell-mediated immunity and immunoglobulin receptor binding, in HepG2.

A combined analysis of enriched Gene Ontology (GO) terms across HEK293, HeLa, and HepG2 cells revealed conserved and cell-line-specific responses to heat shock (Supplementary Figures S6 and S7). Universal enrichment in “response to unfolded protein”, “protein folding”, and “response to heat” highlights shared proteostasis mechanisms critical for managing proteotoxic stress. HEK293 cells uniquely emphasized transcriptional regulation and proteome stability, with enrichment in “RNA catabolic process” and “negative regulation of transcription from RNA polymerase II promoter”. HeLa cells included proteostasis and metabolic flexibility, with pathways like “response to topologically incorrect protein” and “response to oxygen levels”, reflecting tumor-specific adaptations. HepG2 cells displayed liver-specific adaptations, including “response to glucocorticoid” and “fat cell differentiation”, alongside general stress responses, demonstrating a dual focus on proteotoxicity and metabolic regulation.

Although several pathways were unique, we also identified some conserved gene sets enriched across all cell lines and conditions. Compared to the control sample, four gene sets were enriched and shared between HEK293, HepG2, and HeLa cell lines immediately following mild heat shock. These gene sets are Receptor Ligand Activity (GO:0048018), Signaling Receptor Activator Activity (GO:0030545), Protein Folding Chaperone (GO:0044183), and Class A/1 Rhodopsin-like Receptors (HSA-373076) (Table 1 and Supplementary Table S2). Notably, the Receptor Ligand Activity pathway (GO:0048018) was consistently enriched across all conditions, cell lines, and batches (Table 1).

2.4. Analysis of Receptor Ligand Activity (GO:0048018) Genes

Of the 75 genes associated with the Receptor Ligand Activity pathway, 13 were consistently expressed across both batches and all three cell lines under each condition. These genes, including TNF, GNRH2, PSPN, MIA, SEMA4D, and HBEGF, are involved in cell survival, repair, and growth processes (Figure 9). Interaction networks (Figure 10) further illustrate how these genes interact to coordinate stress responses. As anticipated, the expression patterns of the conserved genes are similar between the cell lines and batches throughout heat shock, although some variation in the magnitude of expression can be observed. Supplementary Table S3 provides an overview of the biological processes associated with these 13 genes, many of which promote cell survival, repair, and growth.

2.5. Gene Expression Assessment via qPCR

We conducted qPCR on selected genes to validate the DGE analysis findings. This approach confirmed our transcriptomic results and provided additional insight into the induction of the heat shock response. The Heat Acclimation gene set (GO:0010286) was chosen for its relevance, containing six genes known to exhibit significant expression changes during heat shock.

Figure 11 summarizes the expression changes in HEK293 (Figure 11A,B), HeLa (Figure 11C,D), and HepG2 (Figure 11E,F) cell lines across batches and conditions. These results strongly support the activation of the heat shock response. HSPA1A, HSPA1B, and HSPA6 were consistently upregulated across all cell lines, while RBBP7 was downregulated. However, expression patterns of HSBP1 and HSBP1L1 varied. In HEK293 cells, HSBP1 decreased at 0 and 8 h post-heat shock, whereas it increased in HeLa and HepG2 cells under the same conditions. Conversely, HSBP1L1 increased in HEK293 and HeLa cells but exhibited batch-specific behavior in HepG2, decreasing in Batch 1 at 8 h and remaining unchanged in Batch 2. Despite this variability, the consistent expression trends of HSPA1A, HSPA1B, HSPA6, and RBBP7 across all conditions underscore the canonical activation of the heat shock response.

qPCR analysis was further performed on mRNA from HeLa cells at 0- and 8-h post-heat shock, along with control samples maintained at 37 °C. The primers targeted four “Heat Acclimation” genes (HSPA1A, HSPA6, BAG3, DNAJB1) and three “Signal Receptor Ligand Activity” genes (LTA, MIA, TNF), with ACTB and GAPDH as reference controls.

As shown in Figure 12, HSPA1A and HSPA6 exhibited the highest fold changes at 0 and 8 h post-heat shock (Figure 12A), aligning with their central roles in the heat shock response. BAG3 and DNAJB1 also displayed significant changes in expression (Figure 12B). For the “Signal Receptor Ligand Activity” genes, LTA, MIA, and TNF were upregulated immediately after heat shock but returned to near-control levels by 8 h post-recovery (Figure 12C). The observed patterns closely mirrored those revealed by RNA-seq analyses, except for HSPA1A and HSPA6, which continued to increase at 8 h post-recovery in the qPCR data. Although the log2 fold change values varied slightly between the two methods, the overall trends were consistent (Supplementary Figure S8). Differences between RNA-seq and qPCR results, particularly for HSPA1A and HSPA6 at 8 h post-recovery, highlight the complementary nature of these methods. RNA-seq provides a transcriptome-wide perspective, normalized across all detected genes, while qPCR measures expression changes relative to housekeeping genes, offering enhanced sensitivity for specific transcripts. The observed differences suggest that while RNA-seq indicates a relative decrease from the immediate heat shock response at 0 h, qPCR reflects the ongoing transcriptional activity of these genes at 8 h, underscoring the dynamic nature of their regulation during recovery.
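The quantification formula used for the qPCR data is not stated in this part of the text. As a hedged illustration of how fold changes relative to housekeeping genes are commonly derived, the Python sketch below applies the widely used 2^(-ΔΔCt) calculation with two reference genes, matching the use of ACTB and GAPDH as controls. The choice of method, the assumption of 100% amplification efficiency, the function name, and all Ct values are assumptions for illustration only.

```python
from statistics import mean


def log2_fold_change_ddct(target_ct_treated, ref_cts_treated,
                          target_ct_control, ref_cts_control):
    """Return the log2 fold change from Ct values via the ddCt approach.

    Reference Ct values (for example two housekeeping genes) are averaged
    within each condition. Assumes one Ct unit equals a two-fold change,
    i.e. 100% amplification efficiency.
    """
    delta_treated = target_ct_treated - mean(ref_cts_treated)
    delta_control = target_ct_control - mean(ref_cts_control)
    ddct = delta_treated - delta_control
    return -ddct  # a lower Ct after treatment means higher expression


# Made-up Ct values, for illustration only.
toy_log2fc = log2_fold_change_ddct(
    target_ct_treated=20.0, ref_cts_treated=[17.0, 18.0],
    target_ct_control=26.0, ref_cts_control=[17.0, 18.0],
)
toy_fold_change = 2 ** toy_log2fc
```

Because the result is expressed relative to reference genes rather than to the whole transcriptome, a value computed this way is not guaranteed to match an RNA-seq log2 fold change exactly, which is consistent with the small differences between methods noted above.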

These results validate the involvement of key “Heat Acclimation” and “Signal Receptor Ligand Activity” genes in the heat shock response. The consistency between qPCR findings and DGE and functional enrichment analyses reinforces the robustness of our conclusions and highlights specific pathways central to stress adaptation mechanisms.

3. Discussion

This study characterizes the heat shock response (HSR) in three human cell lines—HeLa, HEK293, and HepG2—by subjecting them to mild heat shock and examining recovery periods of 0 and 8 h. Using principal component analysis (PCA), differential gene expression (DGE), and functional enrichment analyses, we identified conserved and cell-specific genes and pathways activated during and after heat shock. PCA revealed that the primary variance in gene expression was driven by cell type, batch effects, and heat shock recovery time points (Figure 1 and Figure 2). To address the loss of statistical power caused by splitting into two experimental batches, we analyzed the batches individually, effectively gaining a second independent repetition and ensuring that observed trends were consistent across both sets of experiments.

These findings align with prior transcriptomic studies, which emphasize cell type [21,22] and experimental conditions [21,23] as dominant sources of variance in gene expression under stress. Notably, our results highlight how heat shock as a specific stressor influences transcriptomic variability, echoing previous research that identified stressor-specific transcriptional signatures while reaffirming the universal role of cell type as a major determinant of gene expression patterns [13,21,24,25].

Variability in DEG counts across batches and conditions reflects the dynamic nature of the heat shock response (HSR) and the inherent biological and technical factors influencing transcriptomic analyses [26,27]. Differences in sample size and experimental timing between batches likely contributed to these discrepancies [13,28]. Despite these discrepancies, consistent trends across batches reinforce the robustness of our findings and the conserved nature of key HSR pathways.

While the heat shock response was conserved across all cell lines, transcriptional responses suggested cell-type-specific adaptations. Universal enrichment in proteostasis pathways indicates shared strategies for maintaining protein integrity under heat stress, while cell-specific differences point to unique biological priorities. In HEK293 cells, the data suggest a focus on transcriptomic and proteomic stability to support homeostasis, whereas HeLa cells appear to utilize proteostasis and metabolic reprogramming to sustain rapid proliferation. HepG2 cells show evidence of liver-specific adaptations, including glucocorticoid signaling and differentiation processes, highlighting a dual focus on systemic regulation and localized stress responses. These differences may be reflective of the inherent stress environments that cancer cells encounter, such as hypoxia [29] and metabolic dysregulation [29,30]. Cancer cells like HeLa and HepG2 are known to rely on enhanced proteostasis to maintain homeostasis under stress [31], which may explain their more dynamic transcriptional response compared to non-cancerous HEK293 cells [31,32,33,34,35,36,37].

A consistent finding was the enrichment of the “Receptor Ligand Activity” pathway across all conditions and cell lines. Traditionally associated with extracellular signaling, this pathway may mediate cellular responses to proteotoxic stress by coordinating adaptive responses such as apoptotic regulation and survival signaling [38,39]. For example, genes like TNF and HBEGF contribute to inflammatory signaling and tissue repair, while PSPN and MIA support survival under adverse conditions [40,41,42,43]. In multicellular systems, receptor-ligand interactions may synchronize stress responses across cell populations, balancing survival and apoptosis to maintain tissue integrity [44,45,46]. Further exploration of receptor-ligand pairs involved in the HSR could uncover novel mechanisms of stress adaptation and resilience, offering potential therapeutic strategies targeting stress communication networks. Identifying receptors uniquely or commonly expressed on the surfaces of HeLa, HepG2, and HEK293 cells could provide valuable insights into how receptor-ligand signaling mediates the HSR. Receptors conserved across all cell lines or specific to cancer cells could represent likely candidates for immediate regulation by the HSR, either following heat shock or during recovery. Targeting receptor-ligand signaling could offer a novel therapeutic strategy by disrupting stress communication networks, complementing existing therapies such as heat shock protein inhibitors to reduce tumor resilience to proteotoxic stress.

The upregulation of heat shock protein (HSP) genes, including HSPA1A, HSPA1B, and HSPA6, underscores their central role in managing proteotoxic stress across all cell lines. These genes support proteome stability in HEK293 cells but are more dynamically expressed in HeLa and HepG2 cells, reflecting their dependence on HSPs to withstand chronic stress conditions. This reliance on HSPs highlights their potential as therapeutic targets, particularly in cancer cells, where stress response pathways are critical for resilience.

It is important to note that these experiments were conducted in established cell lines, which, while offering precise control over experimental variables, lack the complexity and tissue-level interactions present in primary cells or animal models. This limitation restricts our ability to fully capture the influence of microenvironmental factors, cell-cell interactions, and in vivo stress responses. Future studies incorporating primary cells or in vivo models could provide deeper insights into how these pathways function in more physiologically relevant settings [47].

In conclusion, this study highlights the conserved nature of cellular stress responses across diverse cell types and the unique adaptations cancer cells employ to survive under stress. The consistent activation of “Receptor Ligand Activity” offers a novel perspective on stress communication mechanisms and their role in cancer resilience. By uncovering both conserved pathways and cell-type-specific adaptations, this study lays the groundwork for developing interventions that exploit the vulnerabilities of stress-adapted cells.

4. Materials and Methods

4.1. Cell Culture

To examine the differential heat shock response, we utilized three human cell lines: human embryonic kidney cells (HEK293; ATCC® CRL-1573™) (American Type Culture Collection, Manassas, VA, USA), HeLa cells derived from Henrietta Lacks (ATCC® CCL-2™), and hepatocellular carcinoma cells (HepG2; ATCC® HB-8065) obtained from ATCC in December 2016 and verified bi-annually. HEK293 cells were cultured in Dulbecco’s Modified Eagle Medium (DMEM; Corning, Corning, NY, USA), while HeLa and HepG2 cells were grown in Minimum Essential Medium (MEM; Corning, Corning, NY, USA). Both media were supplemented with 10% fetal bovine serum (FBS; Gibco, Thermo Fisher Scientific, Waltham, MA, USA), 2 mM L-glutamine, and penicillin-streptomycin. HeLa and HepG2 media were additionally supplemented with 0.1 mM non-essential amino acids (NEAA; Corning, Corning, NY, USA) and 1 mM sodium pyruvate (Corning, Corning, NY, USA). All cultures were maintained in a humidified atmosphere containing 5% CO₂ at 37 °C.

4.2. Heat Shock Treatment

To assess the effect of heat shock on transcription, cells were either maintained at 37 °C or subjected to heat stress at 42 °C for 60 min in a humidified CO₂ incubator. Following heat shock, cells were allowed to recover at 37 °C for 8 h. This procedure was independently conducted on HeLa, HEK293, and HepG2 cell lines. Newly thawed, low-passage cells (passages 4–7) were cultured in 175 cm² flasks until reaching approximately 80% confluency. For each experimental condition, three flasks were used: one remained at 37 °C as a control, while two others were exposed to 42 °C for one hour to induce heat shock. Of the heat-shocked flasks, one was processed immediately after heat shock (0 h recovery, 0R), and the other was allowed to recover at 37 °C for 8 h (8 h recovery, 8R) (Supplementary Figure S9). This procedure was repeated using freshly thawed cells to generate biological replicates. After treatment, cells were harvested through trypsinization, pelleted, and frozen for subsequent analysis.

4.3. Sample Preparation, cDNA Library Preparation, and Sequencing

Cells from the control, 0R, and 8R conditions were sent to Novogene (Sacramento, CA, USA) for RNA isolation, library preparation, and sequencing. Total RNA was isolated using the Qiagen RNeasy series kit (Hilden, Germany), and quality control (QC) was meticulously performed at each step to ensure reliable data. RNA degradation and contamination were assessed on 1% agarose gels, purity was verified using a NanoPhotometer® spectrophotometer (IMPLEN, Westlake Village, CA, USA), and integrity and quantification were evaluated using the RNA Nano 6000 Assay Kit on the Bioanalyzer 2100 system (Agilent Technologies, Santa Clara, CA, USA). For transcriptome sequencing, 1 μg of total RNA per sample was used to generate sequencing libraries with the NEBNext® Ultra™ RNA Library Prep Kit for Illumina® (NEB, Ipswich, MA, USA) following the manufacturer’s protocol. mRNA was isolated using poly-T oligo-attached magnetic beads, fragmented under elevated temperature, and reverse transcribed into cDNA. First- and second-strand cDNA synthesis involved random hexamer primers, M-MuLV Reverse Transcriptase, DNA Polymerase I, and RNase H, followed by end repair, adenylation, and ligation of NEBNext adaptors. cDNA fragments of 150–200 bp were selected using AMPure XP beads (Beckman Coulter, Brea, CA, USA), processed with USER Enzyme, and amplified via PCR with Phusion High-Fidelity DNA polymerase and indexed primers. Libraries were purified with the AMPure XP system and validated on the Agilent Bioanalyzer 2100 system. Clustering was conducted on a cBot Cluster Generation System using a PE Cluster Kit cBot-HS (Illumina, San Diego, CA, USA), and paired-end sequencing was performed on an Illumina NovaSeq S4 PE100 platform, generating at least 30 million reads per sample across all conditions.

4.4. Transcriptomics Analyses (Detailed Methodology in Supplementary Data S1)

Two experimental batches were conducted to evaluate the heat shock response (HSR) in three cell lines (HEK293, HeLa, and HepG2). Batch 1 included three biological replicates per condition, and Batch 2 included six biological replicates per condition. Power analysis using powsimR indicated that three and six replicates detect ~58% and ~70% of DEGs at FDRs of 15% and 10%, respectively. While combining the batches would theoretically provide greater statistical power (~75% of DEGs at 8% FDR), the experiments were conducted separately, leading to batch-specific variability. To address this, batches were analyzed independently, ensuring that trends and biological patterns observed in the datasets were consistent and reproducible.

Read Quality Control Overview: Adapter sequences, PhiX library sequences, and low-quality reads are common byproducts of library preparation and next-generation sequencing (NGS). To ensure that high-quality reads are used for downstream analysis, BBDuk was employed to perform adapter trimming, PhiX filtering, and quality filtering based on Phred scores [48,49].

Adapter and PhiX Trimming: To eliminate unwanted sequences from downstream analysis, we used BBDuk’s k-mer trimming functionality for both adapter and PhiX filtering. Each paired read was processed with a BBDuk shell script to remove reads that matched reference k-mers associated with adapter sequences or the PhiX control library.
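The idea behind k-mer based filtering can be sketched in a few lines of Python: build the set of k-mers found in the reference sequences, then discard any read pair in which either mate shares a k-mer with that set. This is a simplified illustration of the principle as described above, not BBDuk's implementation; it ignores reverse complements and mismatches, and the function names, the short k of 5, and the toy sequences are all hypothetical.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def filter_pairs(pairs, reference_seqs, k=5):
    """Keep read pairs in which neither mate shares a k-mer with a reference.

    k=5 is a toy value chosen for the short example sequences below.
    """
    ref_kmers = set()
    for ref in reference_seqs:
        ref_kmers |= kmers(ref, k)
    kept = []
    for mate1, mate2 in pairs:
        clean1 = kmers(mate1, k).isdisjoint(ref_kmers)
        clean2 = kmers(mate2, k).isdisjoint(ref_kmers)
        if clean1 and clean2:
            kept.append((mate1, mate2))
    return kept


# Toy sequences, not real adapter or PhiX sequences.
toy_reference = ["ACGTACGTAC"]
toy_pairs = [
    ("TTTTACGTATTTT", "GGGGGCCCCCGGGG"),   # mate 1 contains TACGT
    ("GGGGGCCCCCGGGG", "CCCCCAAAAACCCC"),  # no shared k-mers
]
kept_pairs = filter_pairs(toy_pairs, toy_reference)
```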

Phred Quality Score: To ensure high-quality reads were retained, BBDuk’s Phred quality trimming function was applied. This function scans the right end of paired reads, trimming until the Phred quality score meets the specified threshold. If the score remains unsatisfactory, the entire paired read is discarded. These processes were executed via command line for each pair of reads.
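A minimal Python sketch of right-end quality trimming, following the description above, is given here. It decodes Phred+33 quality characters, trims bases from the right end until one meets the threshold, and discards the whole pair if either mate becomes too short. The threshold of 10, the minimum length of 3, and the function names are illustrative assumptions; BBDuk's own trimming algorithm and the parameters used in the study may differ.

```python
def trim_right(seq, qual, min_q=10):
    """Trim low-quality bases from the right end of one read.

    qual is a Phred+33 encoded string, so score = ord(char) - 33.
    """
    end = len(seq)
    while end > 0 and ord(qual[end - 1]) - 33 < min_q:
        end -= 1
    return seq[:end], qual[:end]


def trim_pair(pair, min_q=10, min_len=3):
    """Trim both mates; return None if either mate ends up too short."""
    trimmed = [trim_right(seq, qual, min_q) for seq, qual in pair]
    if any(len(seq) < min_len for seq, _ in trimmed):
        return None  # discard the entire pair
    return trimmed


# 'I' encodes Phred 40 and '#' encodes Phred 2 in Phred+33.
good_pair = [("ACGTAC", "IIII##"), ("TTGCA", "IIIII")]
bad_pair = [("ACGT", "####"), ("TTGCA", "IIIII")]
trimmed_good = trim_pair(good_pair)
trimmed_bad = trim_pair(bad_pair)
```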

Reference Genome Indexing and Alignment of Reads to an Indexed Reference Genome: To quantify gene expression, it is essential to map high-quality reads to the human reference genome to identify the features in the dataset. STAR was employed for this purpose, accepting reads cleaned by BBDuk and outputting binary sequence alignment map files (.BAM) after performing splice-aware mapping against the indexed human reference genome (GRCh38.p13) [50]. Before alignment, the indexed reference genome was generated.

Sort and Index Mapped Reads for Feature Counting: Samtools was used to sort an input .BAM file and create a corresponding index file (.BAI), which is necessary for downstream feature counting steps [51]. The splice-aware and mapped .BAM files generated by STAR, as detailed in “Reference Genome Indexing and Alignment of Reads to an Indexed Reference Genome”, served as the input.

Feature Counts: HTSeq was utilized to create a raw gene count matrix by determining the number of reads mapped to each feature. The inputs required for HTSeq include a sorted aligned .BAM file, its corresponding index .BAI file, and a reference human genome annotation file (GRCh38.104.gtf) [52]. The sorted and indexed alignment files were produced as outlined in “Sort and Index Mapped Reads for Feature Counting”.
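Conceptually, building a raw count matrix means assigning each aligned read to the gene it overlaps. The Python sketch below shows a simplified version of that logic, in which a read overlapping exactly one gene is counted for it, a read overlapping none is tallied as having no feature, and a read overlapping several is tallied as ambiguous. It is not HTSeq's code; it ignores strand, spliced alignments, and read pairs, and the overlap rule, function name, and toy coordinates are assumptions for illustration.

```python
from collections import Counter


def count_reads(reads, genes):
    """Count reads per gene from half-open genomic intervals.

    reads: iterable of (chrom, start, end)
    genes: iterable of (chrom, start, end, gene_name)
    """
    genes = list(genes)
    counts = Counter({name: 0 for _, _, _, name in genes})
    for chrom, start, end in reads:
        hits = {
            name
            for g_chrom, g_start, g_end, name in genes
            if g_chrom == chrom and start < g_end and g_start < end
        }
        if len(hits) == 1:
            counts[hits.pop()] += 1
        elif not hits:
            counts["__no_feature"] += 1
        else:
            counts["__ambiguous"] += 1
    return counts


# Toy annotation and alignments, for illustration only.
toy_genes = [("chr1", 100, 200, "geneA"), ("chr1", 180, 300, "geneB")]
toy_reads = [
    ("chr1", 110, 160),  # geneA only
    ("chr1", 185, 195),  # overlaps geneA and geneB
    ("chr1", 250, 290),  # geneB only
    ("chr1", 400, 450),  # no gene
    ("chr1", 120, 170),  # geneA only
]
toy_gene_counts = count_reads(toy_reads, toy_genes)
```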

Quality Control: BBDuk, STAR, Samtools, and HTSeq generated multiple output files distributed across various directories. To summarize and evaluate these files, MultiQC was utilized, which provides an overview of file types associated with next-generation sequencing read processing [53]. By running MultiQC from a directory containing all relevant subdirectories with output files, users can verify the expected number of files and their types for each sample. Supplementary Tables S4 and S5 display the quality control metrics for each sample in batch 1 and batch 2, respectively.

Generating and Visualizing DEGs: The negative binomial distribution is a suitable model for raw count data that are overdispersed and influenced by multiple sources of variance, which makes DESeq2 a suitable choice for identifying differentially expressed genes (DEGs) [54]. DESeq2 estimates the mean and models the dispersion of gene expression for each gene observed in the dataset.
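
The mean and dispersion mentioned above are linked through the negative binomial variance, Var = mean + dispersion × mean². The sketch below shows that relationship with a simple method-of-moments dispersion estimate. This estimator is an illustrative simplification, not DESeq2's shrinkage-based procedure, and the counts are invented.

```python
from statistics import mean, variance


def nb_variance(mu, alpha):
    """Negative binomial variance for mean mu and dispersion alpha."""
    return mu + alpha * mu ** 2


def moment_dispersion(counts):
    """Crude dispersion estimate from the sample mean and variance (floored at 0)."""
    m = mean(counts)
    v = variance(counts)
    return max((v - m) / m ** 2, 0.0)


# Hypothetical replicate counts for one gene.
gene_counts = [90, 110, 150, 70, 80]
alpha_hat = moment_dispersion(gene_counts)
model_var = nb_variance(mean(gene_counts), alpha_hat)
poisson_var = mean(gene_counts)  # a Poisson model would force variance == mean
```

Here the sample variance is ten times the mean, which a Poisson model cannot accommodate but a negative binomial with a positive dispersion can.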

Gene Counts Normalization and Dimensionality Reduction: DESeq2’s built-in variance stabilizing transformation (VST) reduces the impact of outliers and addresses heteroscedasticity [54]. The VST-normalized counts were used as inputs for principal component analysis (PCA) and row-scaled heatmap analysis, which condense the VST-normalized expression data, encompassing over 20,000 gene features across nine samples, into a more concise and interpretable format. PCA was chosen because it summarizes high-dimensional data while retaining the principal variance in gene expression patterns, is computationally efficient, and is widely used in transcriptomic studies to identify clustering patterns and primary drivers of variance. Here, it was used to identify the primary sources of variation within the dataset and to estimate the effect sizes between samples based on the gene features contributing to that variance. To ensure robustness, we evaluated PCA results across multiple normalization methods, including VST, log-transformation, and Counts Per Million (CPM). The clustering patterns and primary drivers of variance remained consistent across all normalization methods, supporting the reliability of the observed results. PCA was conducted using R’s prcomp function, along with the tidyverse and ggfortify libraries.
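
For readers unfamiliar with what PCA computes, the sketch below extracts the first principal component of a tiny samples-by-genes matrix using power iteration on the covariance matrix. It is a Python stand-in for illustration only; the analysis itself used R's prcomp, and the expression values here are invented.

```python
import math


def first_principal_component(samples, iterations=200):
    """Return (loadings, scores, explained_fraction) for PC1 via power iteration."""
    n, p = len(samples), len(samples[0])
    col_means = [sum(row[j] for row in samples) / n for j in range(p)]
    centered = [[row[j] - col_means[j] for j in range(p)] for row in samples]
    # Gene-by-gene covariance matrix of the centered data.
    cov = [
        [sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(p)]
        for i in range(p)
    ]
    # Non-uniform start vector, so it is unlikely to be orthogonal to PC1.
    v = [1.0 / (j + 1) for j in range(p)]
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    eigenvalue = sum(v[i] * sum(cov[i][j] * v[j] for j in range(p)) for i in range(p))
    total_var = sum(cov[j][j] for j in range(p))
    scores = [sum(r[j] * v[j] for j in range(p)) for r in centered]
    return v, scores, eigenvalue / total_var


# Hypothetical transformed expression: rows are samples, columns are genes.
expr = [
    [10.0, 1.0, 5.0],   # group 1
    [11.0, 2.0, 5.0],   # group 1
    [1.0, 10.0, 5.0],   # group 2
    [2.0, 11.0, 5.0],   # group 2
]
loadings, scores, explained = first_principal_component(expr)
```

In this toy matrix the two groups fall on opposite sides of PC1, which is the kind of clustering a PCA plot makes visible.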

Histograms, Volcano Plots, and Venn Diagrams: DEGs were obtained from the normalized log2FC data produced by DESeq2. For each cell line, the over- and under-expressed genes in each condition comparison (0R vs. Control, 8R vs. Control, 8R vs. 0R) were categorized as log2FC > 0.5 and log2FC < −0.5, respectively. These were plotted as histograms in R using ggplot2 to visualize the number of DEGs present within a cell line for each pairwise condition comparison. Volcano plots were then generated for each pairwise condition comparison with R’s ggplot2 library to visualize statistically significant fold changes in gene expression. To visualize the significant DEGs conserved across batches within each cell line at a given condition, Venn diagrams were created using the tidyverse’s ggplot2 library. Significant DEGs were defined as having |log2FC| > 0.5 and an adjusted p-value < 0.05 (Benjamini-Hochberg). Tables with DEGs from all comparisons are in Supplementary Data S2.
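
The significance rule above (|log2FC| > 0.5 and Benjamini-Hochberg adjusted p < 0.05) can be sketched as follows. The gene labels, fold changes, and p-values are hypothetical, and the adjustment is a plain re-implementation for illustration rather than DESeq2's own code path.

```python
def benjamini_hochberg(pvalues):
    """Return BH-adjusted p-values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):  # walk from the largest p-value down
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def classify(log2fc, padj, lfc_cut=0.5, alpha=0.05):
    """Label a gene as over-expressed, under-expressed, or not significant."""
    if padj < alpha and log2fc > lfc_cut:
        return "up"
    if padj < alpha and log2fc < -lfc_cut:
        return "down"
    return "ns"


# Hypothetical results for four genes: (name, log2FC, raw p-value).
results = [("geneA", 2.0, 0.001), ("geneB", -1.2, 0.01),
           ("geneC", 0.3, 0.02), ("geneD", 0.9, 0.5)]
padj = benjamini_hochberg([p for _, _, p in results])
labels = {name: classify(lfc, q) for (name, lfc, _), q in zip(results, padj)}
```

Note that geneC passes the adjusted p-value cutoff but not the fold-change cutoff, and geneD shows the opposite pattern; both are therefore left unclassified.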

Functional Enrichment Analysis Using GSEA: Gene Set Enrichment Analysis (GSEA) was run against the ontological gene set collections (C5) defined by the Molecular Signatures Database [55,56]. For this study, only human collections were used. GSEA takes the ranked log2FC output from DESeq2 as input and assigns an enrichment score to each pathway within the gene set collection. The enrichment score is a running-sum statistic that increases when a gene is in the gene set and decreases when it is not; the normalized enrichment score (NES) then enables comparison across gene sets by accounting for differences in set size and correlations between the gene set and expression dataset. For this reason, the NES was used in this analysis. Tables with GSEA analyses are in Supplementary Data S2.
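
The running-sum statistic behind the enrichment score can be sketched as below, using the weighted form in which hits are weighted by the magnitude of the ranking metric. The ranked list and gene sets are hypothetical, and the permutation step needed to turn an enrichment score into an NES is deliberately omitted.

```python
def enrichment_score(ranked, gene_set, p=1.0):
    """ranked: list of (gene, score) sorted by decreasing score.

    Returns the running-sum value with the largest absolute deviation from zero.
    """
    hit_weights = [abs(s) ** p for g, s in ranked if g in gene_set]
    n_miss = len(ranked) - len(hit_weights)
    hit_total = sum(hit_weights)
    running, best = 0.0, 0.0
    for gene, score in ranked:
        if gene in gene_set:
            running += abs(score) ** p / hit_total  # step up on a hit
        else:
            running -= 1.0 / n_miss                 # step down on a miss
        if abs(running) > abs(best):
            best = running
    return best


# Hypothetical genes ranked by log2FC.
ranked = [("g1", 3.0), ("g2", 2.0), ("g3", 1.0), ("g4", -1.0), ("g5", -2.0)]
es_top = enrichment_score(ranked, {"g1", "g2"})     # set at the top of the list
es_bottom = enrichment_score(ranked, {"g4", "g5"})  # set at the bottom of the list
```

A set concentrated at the top of the ranking yields a positive score and one concentrated at the bottom yields a negative score, which is why raw scores need normalizing before sets of different sizes are compared.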

Functional Enrichment Analysis Using STRING: STRING analysis was used to investigate protein–protein interactions across batches, cell lines, and condition comparisons. The STRING database aggregates, scores, and integrates publicly available information on protein–protein interactions, leveraging these data sources to generate predictive models of interaction networks [57]. We used our differentially expressed gene (DEG) data obtained from DESeq2 to carry out an enrichment analysis. To determine conserved pathways, the resulting tables were compared, and gene sets present in both batches and all three cell lines at a given condition were compiled in a table. Tables with STRING analyses are in Supplementary Data S2.
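
The compilation of conserved gene sets amounts to a set intersection over every batch and cell line, which can be sketched as follows. The identifiers are borrowed from Table 1 purely as labels; which sets appear in which batch or cell line here is invented for the example.

```python
def conserved_gene_sets(enriched):
    """Return gene set IDs present in every (batch, cell line) result."""
    return set.intersection(*enriched.values())


# Hypothetical enrichment results for one condition comparison.
enriched = {
    ("batch1", "HEK293"): {"GO:0048018", "GO:0044183", "GO:0030545"},
    ("batch1", "HeLa"):   {"GO:0048018", "GO:0044183"},
    ("batch1", "HepG2"):  {"GO:0048018", "GO:0044183", "hsa04080"},
    ("batch2", "HEK293"): {"GO:0048018", "GO:0044183"},
    ("batch2", "HeLa"):   {"GO:0048018", "GO:0044183", "hsa04080"},
    ("batch2", "HepG2"):  {"GO:0048018", "GO:0044183"},
}
conserved = sorted(conserved_gene_sets(enriched))
```

Requiring presence in all six results is a strict criterion: a gene set missing from a single batch or cell line is excluded.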

Visualization of Gene Expression Patterns Using Heatmaps: Row-scaled heatmaps built with the ComplexHeatmap library were used to visualize the VST-normalized expression patterns of genes found in the “Heat Acclimation” and “Receptor Ligand Activity” gene sets. Thirteen genes from “Receptor Ligand Activity” with conserved gene expression in both batches, three cell lines, and three pairwise condition comparisons were identified. All six genes from “Heat Acclimation” were visualized with row-scaled heatmaps for each batch [54,58].
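
Row scaling converts each gene's values to Z-scores across samples, so that genes with very different baseline expression share one color scale. A minimal sketch, with invented expression values attached to two gene names used elsewhere in this article:

```python
from statistics import mean, stdev


def row_zscores(matrix):
    """Scale each gene (row) to mean 0 and sample standard deviation 1."""
    scaled = {}
    for gene, values in matrix.items():
        m, s = mean(values), stdev(values)
        scaled[gene] = [(v - m) / s for v in values]
    return scaled


# Hypothetical transformed expression at control, 0 h, and 8 h recovery.
expression = {
    "HSPA1A": [8.0, 14.0, 11.0],
    "HSPA6": [5.0, 9.0, 7.0],
}
z = row_zscores(expression)
```

Although the two rows differ in absolute level and range, both scale to the same pattern (−1, 1, 0), which is what a row-scaled heatmap displays.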

Cytoscape Generated Network: Cytoscape’s [59] predicted network analysis was performed to determine the interactions of genes based on the acquired expression data. The network analysis uses a list of DEGs and the known interaction networks in the MSigdbr database.

4.5. Molecular Validation Using qPCR

RNA was isolated from 4 million HeLa cells per condition (control cells, 0 h recovery, 8 h recovery; different batches from the ones used for RNA-seq) using the Direct-Zol RNA mini-prep Kit (ZymoResearch, Irvine, CA, USA), following the manufacturer’s protocol. cDNA was synthesized from 1 µg of total RNA using the Superscript IV First-Strand synthesis system (ThermoFisher Scientific, Waltham, MA, USA) and Oligo (dT)₂₀ primers, also following the manufacturer’s protocol. cDNA samples were diluted to a concentration of 50 ng/µL. qPCR reactions were prepared with the Power SYBR™ Green PCR Master Mix (ThermoFisher Scientific, Waltham, MA, USA) according to the manufacturer’s instructions. Three biological replicates were run for each gene and condition; gene names and primers (generated using NCBI’s Primer-BLAST utility) are shown in Supplementary Table S6. qPCR was performed using the CFX96 Touch Real-Time Detection System (Bio-Rad, Hercules, CA, USA). The relative normalized expression [60] of the raw transcript levels was calculated for each gene using the Livak method [61] with the software provided with the instrument [62]. The reference genes used in this method were ACTB and GAPDH. Statistical significance was assessed using one-way ANOVA (Analysis of Variance) followed by post-hoc Tukey HSD (Honestly Significant Difference) and Bonferroni tests. A p value < 0.05 was considered statistically significant. Results were plotted as boxplots using BoxPlotR [63].
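
The Livak (2^−ΔΔCt) calculation can be sketched as below. The Ct values are hypothetical, averaging the Ct values of the two reference genes is an assumption about how they were combined, and the formula presumes roughly 100% amplification efficiency; the instrument software may differ in detail.

```python
from statistics import mean


def relative_expression(ct_target_treated, ct_refs_treated,
                        ct_target_control, ct_refs_control):
    """Fold change of a target gene by the 2^-ddCt method.

    Reference-gene Ct values are averaged within each condition before
    subtraction (an assumed way of combining multiple reference genes).
    """
    d_treated = ct_target_treated - mean(ct_refs_treated)
    d_control = ct_target_control - mean(ct_refs_control)
    dd_ct = d_treated - d_control
    return 2 ** (-dd_ct)


# Hypothetical Ct values: target gene and two reference genes (ACTB, GAPDH).
fold_change = relative_expression(
    ct_target_treated=22.0, ct_refs_treated=[18.0, 20.0],
    ct_target_control=27.0, ct_refs_control=[18.0, 20.0],
)
```

With these numbers the target crosses threshold five cycles earlier after treatment while the reference genes are unchanged, corresponding to a 32-fold increase.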

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms26031057/s1.

Author Contributions

Conceptualization, N.N.; Data curation, A.R. and N.N.; Formal analysis, A.R., L.S., Y.C., W.D.H. and N.N.; Funding acquisition, N.N.; Methodology, A.R., L.S. and N.N.; Resources, N.N.; Writing—original draft, A.R. and N.N.; Writing—review & editing, A.R., L.S., Y.C., W.D.H. and N.N. All authors have read and agreed to the published version of the manuscript.

Funding

Research reported in this publication was supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number SC3GM121226 and the National Cancer Institute under award number P20 CA253251. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data reported are provided in the text, Supplementary Figures, or Supplementary Data S1 and S2. The raw data are hosted at NCBI (GEO: GSE285497). The scripts used can be found at (https://github.com/ajreinschmidt/MSc-Code/blob/main/iSEE%20Analysis%20Script.R, accessed on 1 January 2024).

Acknowledgments

We thank Dimitra Chalkia for her valuable comments and help with the analysis of the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Fulda, S.; Gorman, A.M.; Hori, O.; Samali, A. Cellular stress responses: Cell survival and cell death. Int. J. Cell Biol. 2010, 2010, 214074.
2. Galluzzi, L.; Yamazaki, T.; Kroemer, G. Linking cellular stress responses to systemic homeostasis. Nat. Rev. Mol. Cell Biol. 2018, 19, 731–745.
3. Kultz, D. Evolution of the cellular stress proteome: From monophyletic origin to ubiquitous function. J. Exp. Biol. 2003, 206, 3119–3124.
4. Kultz, D. Molecular and evolutionary basis of the cellular stress response. Annu. Rev. Physiol. 2005, 67, 225–257.
5. Richter, K.; Haslbeck, M.; Buchner, J. The heat shock response: Life on the verge of death. Mol. Cell 2010, 40, 253–266.
6. Somero, G.N. The cellular stress response and temperature: Function, regulation, and evolution. J. Exp. Zool. A Ecol. Integr. Physiol. 2020, 333, 379–397.
7. Sun, S.; Zhou, J. Molecular mechanisms underlying stress response and adaptation. Thorac. Cancer 2018, 9, 218–227.
8. Camberg, J.L.; Doyle, S.M.; Johnston, D.M.; Wickner, S. Molecular Chaperones. In Brenner’s Encyclopedia of Genetics; Elsevier: Amsterdam, The Netherlands, 2013; pp. 456–460.
9. Lindquist, S.; Craig, E.A. The heat-shock proteins. Annu. Rev. Genet. 1988, 22, 631–677.
10. McCallister, C.; Siracusa, M.C.; Shirazi, F.; Chalkia, D.; Nikolaidis, N. Functional diversification and specialization of cytosolic 70-kDa heat shock proteins. Sci. Rep. 2015, 5, 9363.
11. Radons, J. The human HSP70 family of chaperones: Where do we stand? Cell Stress Chaperones 2016, 21, 379–404.
12. Morimoto, R.I. Regulation of the heat shock transcriptional response: Cross talk between a family of heat shock factors, molecular chaperones, and negative regulators. Genes Dev. 1998, 12, 3788–3796.
13. Mahat, D.B.; Salamanca, H.H.; Duarte, F.M.; Danko, C.G.; Lis, J.T. Mammalian heat shock response and mechanisms underlying its genome-wide transcriptional regulation. Mol. Cell 2016, 62, 63–78.
14. Zhao, P.; Javed, S.; Shi, X.; Wu, B.; Zhang, D.; Xu, S.; Wang, X. Varying architecture of heat shock elements contributes to distinct magnitudes of target gene expression and diverged biological pathways in heat stress response of bread wheat. Front. Genet. 2020, 11, 30.
15. Zuo, W.F.; Pang, Q.; Zhu, X.; Yang, Q.Q.; Zhao, Q.; He, G.; Han, B.; Huang, W. Heat shock proteins as hallmarks of cancer: Insights from molecular mechanisms to therapeutic strategies. J. Hematol. Oncol. 2024, 17, 81.
16. Kunachowicz, D.; Krol-Kulikowska, M.; Raczycka, W.; Sleziak, J.; Blazejewska, M.; Kulbacka, J. Heat shock proteins, a double-edged sword: Significance in cancer progression, chemotherapy resistance and novel therapeutic perspectives. Cancers 2024, 16, 1500.
17. Cyran, A.M.; Zhitkovich, A. Heat shock proteins and HSF1 in cancer. Front. Oncol. 2022, 12, 860320.
18. Somu, P.; Mohanty, S.; Basavegowda, N.; Yadav, A.K.; Paul, S.; Baek, K.H. The interplay between heat shock proteins and cancer pathogenesis: A novel strategy for cancer therapeutics. Cancers 2024, 16, 638.
19. Ruz, C.; Alcantud, J.L.; Vives Montero, F.; Duran, R.; Bandres-Ciga, S. Proteotoxicity and neurodegenerative diseases. Int. J. Mol. Sci. 2020, 21, 5646.
20. Douglas, P.M.; Dillin, A. Protein homeostasis and aging in neurodegeneration. J. Cell Biol. 2010, 190, 719–729.
21. Flati, T.; Gioiosa, S.; Chillemi, G.; Mele, A.; Oliverio, A.; Mannironi, C.; Rinaldi, A.; Castrignano, T. A gene expression atlas for different kinds of stress in the mouse brain. Sci. Data 2020, 7, 437.
22. Murray, J.I.; Whitfield, M.L.; Trinklein, N.D.; Myers, R.M.; Brown, P.O.; Botstein, D. Diverse and specific gene expression responses to stresses in cultured human cells. Mol. Biol. Cell 2004, 15, 2361–2374.
23. Gao, H.; Wang, Y.; Liu, X.; Yan, T.; Wu, L.; Alm, E.; Arkin, A.; Thompson, D.K.; Zhou, J. Global transcriptome analysis of the heat shock response of Shewanella oneidensis. J. Bacteriol. 2004, 186, 7796–7803.
24. Xu, P.; Wang, M.; Song, W.M.; Wang, Q.; Yuan, G.C.; Sudmant, P.H.; Zare, H.; Tu, Z.; Orr, M.E.; Zhang, B. The landscape of human tissue and cell type specific expression and co-regulation of senescence genes. Mol. Neurodegener. 2022, 17, 5.
25. Pessa, J.C.; Joutsen, J.; Sistonen, L. Transcriptional reprogramming at the intersection of the heat shock response and proteostasis. Mol. Cell 2024, 84, 80–93.
26. Leek, J.T.; Scharpf, R.B.; Bravo, H.C.; Simcha, D.; Langmead, B.; Johnson, W.E.; Geman, D.; Baggerly, K.; Irizarry, R.A. Tackling the widespread and critical impact of batch effects in high-throughput data. Nat. Rev. Genet. 2010, 11, 733–739.
27. Love, M.I.; Hogenesch, J.B.; Irizarry, R.A. Modeling of RNA-seq fragment sequence bias reduces systematic errors in transcript abundance estimation. Nat. Biotechnol. 2016, 34, 1287–1291.
28. Vihervaara, A.; Mahat, D.B.; Guertin, M.J.; Chu, T.; Danko, C.G.; Lis, J.T.; Sistonen, L. Transcriptional response to stress is pre-wired by promoter and enhancer architecture. Nat. Commun. 2017, 8, 255.
29. Al Tameemi, W.; Dale, T.P.; Al-Jumaily, R.M.K.; Forsyth, N.R. Hypoxia-modified cancer cell metabolism. Front. Cell Dev. Biol. 2019, 7, 4.
30. Belisario, D.C.; Kopecka, J.; Pasino, M.; Akman, M.; De Smaele, E.; Donadelli, M.; Riganti, C. Hypoxia dictates metabolic rewiring of tumors: Implications for chemoresistance. Cells 2020, 9, 2598.
31. Kotsantis, P.; Silva, L.M.; Irmscher, S.; Jones, R.M.; Folkes, L.; Gromak, N.; Petermann, E. Increased global transcription activity as a mechanism of replication stress in cancer. Nat. Commun. 2016, 7, 13087.
32. Rios, F.; Uriostegui-Arcos, M.; Zurita, M. Transcriptional stress induces the generation of DoGs in cancer cells. Noncoding RNA 2024, 10, 5.
33. Ackerman, D.; Simon, M.C. Hypoxia, lipids, and cancer: Surviving the harsh tumor microenvironment. Trends Cell Biol. 2014, 24, 472–478.
34. Calderwood, S.K.; Khaleque, M.A.; Sawyer, D.B.; Ciocca, D.R. Heat shock proteins in cancer: Chaperones of tumorigenesis. Trends Biochem. Sci. 2006, 31, 164–172.
35. Ciocca, D.R.; Cayado-Gutierrez, N.; Maccioni, M.; Cuello-Carrion, F.D. Heat shock proteins (HSPs) based anti-cancer vaccines. Curr. Mol. Med. 2012, 12, 1183–1197.
36. Zoppino, F.C.M.; Guerrero-Gimenez, M.E.; Castro, G.N.; Ciocca, D.R. Comprehensive transcriptomic analysis of heat shock proteins in the molecular subtypes of human breast cancer. BMC Cancer 2018, 18, 700.
37. Ramilowski, J.A.; Goldberg, T.; Harshbarger, J.; Kloppmann, E.; Lizio, M.; Satagopam, V.P.; Itoh, M.; Kawaji, H.; Carninci, P.; Rost, B.; et al. A draft network of ligand-receptor-mediated multicellular signalling in human. Nat. Commun. 2015, 6, 7866.
38. Kurop, M.K.; Huyen, C.M.; Kelly, J.H.; Blagg, B.S.J. The heat shock response and small molecule regulators. Eur. J. Med. Chem. 2021, 226, 113846.
39. Kuo, D.; Ding, J.; Cohn, I.S.; Zhang, F.; Wei, K.; Rao, D.A.; Rozo, C.; Sokhi, U.K.; Shanaj, S.; Oliver, D.J.; et al. HBEGF(+) macrophages in rheumatoid arthritis induce fibroblast invasiveness. Sci. Transl. Med. 2019, 11, eaau8587.
40. Rocourt, D.V.; Mehta, V.B.; Wu, D.; Besner, G.E. Heparin-binding EGF-like growth factor decreases neutrophil-endothelial cell interactions. J. Surg. Res. 2007, 141, 262–266.
41. Rampioni Vinciguerra, G.L.; Capece, M.; Reggiani Bonetti, L.; Nigita, G.; Calore, F.; Rentsch, S.; Magistri, P.; Ballarin, R.; Di Benedetto, F.; Distefano, R.; et al. Nutrient restriction-activated Fra-2 promotes tumor progression via IGF1R in miR-15a downmodulated pancreatic ductal adenocarcinoma. Signal Transduct. Target. Ther. 2024, 9, 31.
42. Onodera, T.; Momose, I.; Adachi, H.; Yamazaki, Y.; Sawa, R.; Ohba, S.I.; Kawada, M. Human pancreatic cancer cells under nutrient deprivation are vulnerable to redox system inhibition. J. Biol. Chem. 2020, 295, 16678–16690.
43. Shankaran, H.; Resat, H.; Wiley, H.S. Cell surface receptors for signal transduction and ligand transport: A design principles study. PLoS Comput. Biol. 2007, 3, e101.
44. Arguello-Miranda, O.; Marchand, A.J.; Kennedy, T.; Russo, M.A.X.; Noh, J. Cell cycle-independent integration of stress signals by Xbp1 promotes Non-G1/G0 quiescence entry. J. Cell Biol. 2022, 221, e202103171.
45. Chung, K.M.; Kim, H.; Roque, C.G.; McCurdy, E.P.; Nguyen, T.T.T.; Siegelin, M.D.; Hwang, J.Y.; Hengst, U. A systemic cell stress signal confers neuronal resilience toward oxidative stress in a Hedgehog-dependent manner. Cell Rep. 2022, 41, 111488.
46. Kolodkin, A.; Sahin, N.; Phillips, A.; Hood, S.R.; Bruggeman, F.J.; Westerhoff, H.V.; Plant, N. Optimization of stress response through the nuclear receptor-mediated cortisol signalling network. Nat. Commun. 2013, 4, 1792.
47. Phillips, N.L.H.; Roth, T.L. Animal models and their contribution to our understanding of the relationship between environments, epigenetic modifications, and behavior. Genes 2019, 10, 47.
48. Grigoriev, I.V.; Nordberg, H.; Shabalov, I.; Aerts, A.; Cantor, M.; Goodstein, D.; Kuo, A.; Minovitsky, S.; Nikitin, R.; Ohm, R.A.; et al. The genome portal of the Department of Energy Joint Genome Institute. Nucleic Acids Res. 2012, 40, D26–D32.
49. Nordberg, H.; Cantor, M.; Dusheyko, S.; Hua, S.; Poliakov, A.; Shabalov, I.; Smirnova, T.; Grigoriev, I.V.; Dubchak, I. The genome portal of the Department of Energy Joint Genome Institute: 2014 updates. Nucleic Acids Res. 2014, 42, D26–D31.
50. Dobin, A.; Davis, C.A.; Schlesinger, F.; Drenkow, J.; Zaleski, C.; Jha, S.; Batut, P.; Chaisson, M.; Gingeras, T.R. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics 2013, 29, 15–21.
51. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M.; et al. Twelve years of SAMtools and BCFtools. Gigascience 2021, 10, giab008.
52. Anders, S.; Pyl, P.T.; Huber, W. HTSeq—A Python framework to work with high-throughput sequencing data. Bioinformatics 2015, 31, 166–169.
53. Ewels, P.; Magnusson, M.; Lundin, S.; Käller, M. MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics 2016, 32, 3047–3048.
54. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550.
55. Liberzon, A.; Subramanian, A.; Pinchback, R.; Thorvaldsdóttir, H.; Tamayo, P.; Mesirov, J.P. Molecular signatures database (MSigDB) 3.0. Bioinformatics 2011, 27, 1739–1740.
56. Subramanian, A.; Tamayo, P.; Mootha, V.K.; Mukherjee, S.; Ebert, B.L.; Gillette, M.A.; Paulovich, A.; Pomeroy, S.L.; Golub, T.R.; Lander, E.S.; et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 2005, 102, 15545–15550.
57. Szklarczyk, D.; Gable, A.L.; Lyon, D.; Junge, A.; Wyder, S.; Huerta-Cepas, J.; Simonovic, M.; Doncheva, N.T.; Morris, J.H.; Bork, P.; et al. STRING v11: Protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019, 47, D607–D613.
58. Durinck, S.; Spellman, P.T.; Birney, E.; Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 2009, 4, 1184–1191.
59. Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504.
60. Vandesompele, J.; De Preter, K.; Pattyn, F.; Poppe, B.; Van Roy, N.; De Paepe, A.; Speleman, F. Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol. 2002, 3, 91.
61. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(−Delta Delta C(T)) Method. Methods 2001, 25, 402–408.
62. Hellemans, J.; Mortier, G.; De Paepe, A.; Speleman, F.; Vandesompele, J. qBase relative quantification framework and software for management and automated analysis of real-time quantitative PCR data. Genome Biol. 2007, 8, R19.
63. Spitzer, M.; Wildenhain, J.; Rappsilber, J.; Tyers, M. BoxPlotR: A web tool for generation of box plots. Nat. Methods 2014, 11, 121–122.

Figure 1. Principal component analysis (PCA) of variance-stabilizing transformed (VST) counts reveals that the primary source of variance in the entire dataset is the cell line type, regardless of batch or heat shock treatment. PCA reduces high-dimensional gene expression data to principal components, visually representing clustering patterns based on sample similarities. Each point represents a sample, with closer proximity indicating higher similarity.

Figure 2. PCA of VST-normalized counts, stratified by batch and cell line, highlights clustering based on heat shock conditions. Individual PCA plots for HEK293 (A,D), HeLa (B,E), and HepG2 (C,F) show distinct groupings between heat-shocked and control samples, indicating a robust transcriptional response to heat shock across all cell lines. The separation along principal components reflects the contribution of gene expression differences induced by heat shock and recovery time.

Figure 3. The Number of DEGs varies by Cell Line and Batch. Histogram depicting the number of DEGs (|log2 fold change| > 0.5; adjusted p-value < 0.05) in HEK293 (A,B), HeLa (C,D), and HepG2 (E,F) cell lines during heat shock for batch 1 (top rows) and batch 2 (bottom rows). The height of each bar represents the count of upregulated and downregulated genes, emphasizing the scale of transcriptional changes in response to heat shock. Note that the y-axis scale varies across panels to improve visualization of the differences in gene expression dynamics between cell lines and batches.

Figure 4. Volcano plots show the distribution of DEGs for condition comparisons within each cell line. Each point represents a gene, with the x-axis indicating log2 fold change (magnitude of expression change) and the y-axis representing statistical significance (negative log10 adjusted p-value). Genes upregulated (log2FC > 1) are shown in red, while downregulated genes (log2FC < −1) are in blue. Examples include HEK293 (A,B), HeLa (C,D), and HepG2 (E,F) comparisons at 0 h post-heat shock vs. control, visualizing the most significant transcriptional changes.

Figure 5. Venn diagrams summarizing conserved DEGs between cell lines under specific conditions. The size of overlapping circles corresponds to the number of shared genes across comparisons. Overexpressed genes (log2FC > 0.5; p-adj. < 0.05) and underexpressed genes (log2FC < −0.5; p-adj. < 0.05) are visualized for 0 h post-heat shock vs. control (A,B), 8 h vs. 0 h (C,D), and 8 h vs. control (E,F).

Figure 6. Dot plot visualization of Gene Set Enrichment Analysis (GSEA) results. The top 15 positively and negatively enriched gene sets, ranked by normalized enrichment scores (NES), are shown for HEK293 at 0 h post-heat shock vs. control (Batch 1 and Batch 2). Each dot represents a gene set, with size proportional to the number of enriched genes and color reflecting the statistical significance (adjusted p-value).

Figure 7. Distribution of Log2 Fold Changes in Enriched Gene Sets. Distribution of log2 fold changes for the top 15 positively enriched GSEA hits in HEK293, HeLa, and HepG2 cell lines at 0 h post-heat shock vs. control. The peak height represents the number of genes within each range of log2 fold change, highlighting the degree of enrichment within gene sets. Separate plots depict Batch 1 and 2 results for each cell line.

Figure 8. Enrichment maps of the top 10 positively enriched gene sets (GSEA NES) for HEK293 cells at 0 h post-heat shock vs. control. Each node represents a gene set, with edges indicating shared genes between sets. The network visualizes functional overlap and relationships among the most significant pathways for each batch: (A) protein folding and (B) protein folding and DNA-binding transcription activator activity.

Figure 9. Distribution of Log2 Fold Changes in Enriched Gene Sets. Heatmaps of Z-score-scaled log2 fold changes for 13 conserved genes within the Signal Receptor Ligand Activity (GO:0048018) pathway. Rows represent genes, while columns correspond to cell lines and conditions. Colors correspond to Z-scores, with red indicating positive Z-scores (maximal upregulation) and blue indicating negative Z-scores (maximal downregulation). Separate heatmaps are provided for Batch 1 (A,C,E) and Batch 2 (B,D,F), illustrating consistent patterns of gene regulation across batches and cell lines.

Figure 10. Network Map of GO:0048018 Pathway Genes. Cytoscape-generated network map of the 13 conserved DEGs within the Signal Receptor Ligand Activity (GO:0048018) pathway. Nodes represent genes, and edges indicate predicted functional interactions, offering a systems-level view of pathway dynamics.

Figure 11. Heatmaps of Heat Acclimation Genes Across Conditions. Heat maps show expression changes for Heat Acclimation (GO:0010286) genes at control, 0 h post-heat shock, and 8 h post-heat shock conditions. Colors correspond to Z-scores, with red indicating positive Z-scores (maximal upregulation) and blue indicating negative Z-scores (maximal downregulation). Separate heatmaps are provided for Batch 1 (A,C,E) and Batch 2 (B,D,F) across HEK293, HeLa, and HepG2 cell lines.

Figure 12. qPCR Validation of Key Gene Sets. mRNA fold change measured by qPCR for Heat Acclimation (GO:0010286) and Signal Receptor Ligand Activity (GO:0048018) genes in HeLa cells at control, 0 h post-heat shock, and 8 h post-heat shock. Results are normalized against housekeeping genes (ACTB and GAPDH). Heat shock genes (HSPA1A, HSPA6) exhibit the highest induction, while signal receptor ligand genes (LTA, MIA, TNF) show transient upregulation (A–C). Note that the Y-axis scale varies across panels. The experiment was repeated using three biological replicates (black dots). Statistical significance was determined using one-way ANOVA (Analysis of Variance) followed by post-hoc Tukey HSD (Honestly Significant Difference) and Bonferroni tests.

Table 1. Enriched gene sets were conserved in both batches and HEK293, HepG2, and HeLa cell lines (associated functions can be found in Supplementary Table S2).

0R vs. Control		8R vs. Control		8R vs. 0R
ID	Name	ID	Name	ID	Name
GO:0048018	Receptor Ligand Activity	GO:0048018	Receptor Ligand Activity	GO:0048018	Receptor Ligand Activity
GO:0030545	Signaling receptor activator activity	hsa04080	Neuroactive ligand-receptor interaction	hsa04080	Neuroactive ligand-receptor interaction
GO:0044183	Protein folding chaperone	HSA-500792	GPCR ligand binding	HSA-500792	GPCR ligand binding
HSA-373076	Class A/1 (Rhodopsin-like receptors)			HSA-373076	Class A/1 (Rhodopsin-like receptors)

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
