1. Introduction
Gastrointestinal stromal tumors (GISTs) are the most common mesenchymal tumor of the gastrointestinal tract [
1], mostly occurring in the stomach and small intestine, and less commonly in the esophagus, colorectum and outside the gastrointestinal tract. The GIST annual incidence is approximately 1–2/100,000, with no significant sex or racial differences [
2]. Depending on the size and location of the tumor, common clinical symptoms include abdominal pain, bloating, dysphagia, bleeding, and obstruction in the digestive tract. GISTs are highly heterogeneous, with the incidence of micro GISTs in the elderly population being as high as 30% with little or no malignant potential, while large and high mitotic count GISTs are at high risk of recurrence and prone to peritoneal metastases, which are often fatal [
3].
At the beginning of this century, the median survival time of patients with advanced GIST was approximately one year [
4]. The subsequent introduction of imatinib, a tyrosine kinase inhibitor (TKI), significantly improved outcomes for intermediate- and high-risk GIST patients [
5,
6]; however, many patients develop resistance within 1–3 years [
2]. The second- and third-line multitargeted TKIs sunitinib [
7] and regorafenib [
8] only provide limited improvement in prognosis, with a median progression-free survival (PFS) that is only 4–5 months longer compared to placebo. Therefore, there is an urgent requirement to further explore the GIST’s pathogenesis and find potential new therapeutic targets.
Long non-coding RNA (lncRNA) is defined as non-coding RNA greater than 200 nucleotides in length. LncRNAs can influence gene expression at various epigenetic, transcriptional and post-transcriptional levels and play an oncogenic and/or cancer-inhibiting role in neoplastic diseases [
9,
10]. Moreover, lncRNAs also play an important role in tumor metastasis, and are involved in multiple processes including local invasion, epithelial-mesenchymal transition (EMT), metastatic colonization, regulation of organ-specific tropism, and the metastatic microenvironment [
11]. In addition, most lncRNAs are expressed with tissue and cell-type specificity [
12] and thus have theoretical potential to become efficient targets for tumor therapy. Moreover, their high tissue specificity and ability to regulate specific facets of the cellular networks may make lncRNAs superior to proteins in terms of their potential targeting-related adverse effects. Features such as lack of translation, rapid turnover, and low basal expression levels allow lncRNAs to exert effects at low doses [
13]. These features have made lncRNAs one of the foci of oncology research.
To better elucidate the potential roles of lncRNAs in the progression of intermediate- to high-risk GISTs and to identify novel therapeutic targets or biomarkers, we conducted a comprehensive lncRNA expression profiling study. Specifically, three pairs of imatinib-untreated intermediate- to high-risk GISTs and matched peritumoral tissues were sequenced for lncRNAs and subjected to differential analysis and GO/KEGG enrichment analysis to explore the possible involvement of differentially expressed genes in biochemical, metabolic, and signal transduction pathways. Subsequently, high-content screening (HCS) was employed to identify lncRNAs that significantly promote cell proliferation, leading to the selection of CTD-2245E15.3 for functional validation as a potential therapeutic target or biomarker.
2. Materials and Methods
2.1. Clinical Specimens
All patient samples were collected from Zhongshan Hospital, Fudan University, whose medical ethics committee has reviewed and approved this study. We collected three pairs of fresh frozen specimens of primary GISTs and paratumor normal tissues with different locations for lncRNA sequencing. The detailed clinicopathological characteristics of the three pairs of sequenced samples are listed in
Table 1. The pathological diagnosis for each case was confirmed by at least two senior pathologists. A total of 507 primary GIST tissue specimens were collected for RNA in situ hybridization (ISH) analysis of CTD-2245E15.3 expression. Patients were stratified into high- and low-expression groups according to the median ISH score. Clinicopathological characteristics and survival outcomes were compared between the two groups. PFS and overall survival (OS) were obtained through regular outpatient visits and telephone follow-up. PFS was calculated from the date of initial tumor resection to the occurrence of documented disease progression or the date of last follow-up. OS was calculated from the date of surgery to death from any cause or the last known follow-up.
2.2. Library Construction and Sequencing Process
We extracted RNAs from the tissues using Trizol reagent (Invitrogen, Carlsbad, CA, USA) and applied agarose gel electrophoresis to analyze the RNA integrity and possible presence of DNA contamination [
14]. The RNA concentration and purity were then checked initially by Nanodrop 2000 (Thermo Fisher, Wilmington, DE, USA) and precisely by the Agilent 2100 bioanalyzer (Agilent, Santa Clara, CA, USA) for RNA integrity. Then, ribosome RNAs were removed from the total RNA using the rRNA Depletion Kit (NEBNext, Ipswich, MA, USA) according to the manufacturer’s instructions, and strand-specific libraries were constructed. After the libraries were qualified, Illumina PE150 sequencing (Illumina, Inc., San Diego, CA, USA) was performed after pooling according to the effective concentration of the libraries and data output requirements.
2.3. Sequencing and Screening of Hub lncRNAs
The lncRNA screening workflow is illustrated in
Figure 1. Following transcriptome sequencing, differentially expressed lncRNAs between tumor and adjacent normal tissues were identified using the criteria |log2FC| ≥ 1 and adjusted
p value (Padj) < 0.05, with
p values corrected by the Benjamini–Hochberg method to control the false discovery rate (FDR). Venn analysis was subsequently performed to identify lncRNAs that were consistently upregulated or downregulated across all three paired samples.
2.4. Cluster Analysis
Cluster analysis is another way to display differentially expressed genes, which brings together those with similar expression patterns. These genes may have common functions or participate in common metabolic and signal pathways. The values of log10 (FPKM + 1) were transformed and analyzed by cluster analysis.
2.5. Gene Ontology (GO) and KEGG Pathway Enrichment Analysis
The GO database can describe gene and protein functions in detail by tree stratification and can clarify the hierarchical relationship between gene functions, which are divided into molecular functions (MFs), biological processes (BPs), and cellular components (CCs). GO enrichment analysis is based on the GO database. The differential genes are analyzed by Fisher’s exact test, and the target functional genes are analyzed statistically. KEGG is a database for systematic analysis of gene function and genome information, and signal pathway analysis is based on this database. The pathways involved in the target genes were statistically analyzed by Fisher’s exact test, and the signal pathways with significant differences were screened out according to Padj < 0.05.
2.6. High-Content Screening (HCS)
A high-content screening (HCS) approach was applied as a data-driven strategy to prioritize lncRNAs with potential functional relevance in GIST cells. For each candidate gene, three RNA interference targets were designed, and the corresponding plasmids were mixed in equal proportions for lentiviral packaging. GIST-T1 cells were seeded into 96-well plates at a density of 2 × 10
3 cells per well and infected with appropriate amounts of lentivirus. The list of lentiviral constructs used in the screening is shown in
Table 2. Cell proliferation was monitored once daily for seven consecutive days using the Celigo imaging cytometer (Nexcelom, Lawrence, MA, USA). Cells expressing green fluorescence were automatically identified, photographed, and quantified. Cell proliferation fold changes were calculated for each group at each time point, and growth curves were generated to evaluate the effects of lncRNA knockdown on cell proliferation. Candidate lncRNAs were initially prioritized based on a predefined proliferation fold-change threshold and the consistency of their growth-inhibitory effects across time points. CTD-2245E15.3 was selected for subsequent functional validation based on its robust and reproducible anti-proliferative phenotype in the HCS assay, together with efficient knockdown performance.
2.7. Cell Culture and Lentiviral Transduction
The GIST-T1 cell line (harboring a KIT exon 11: V560_Y578del mutation, RRID: CVCL_7044) was purchased from Cosmo Bio Co., Ltd. (Tokyo, Japan). The GIST-882 cell line (bearing a KIT exon 13 K642E mutation) was kindly provided by Dr. Fletcher (Harvard Medical School). GIST-T1 cells were cultured in Iscove’s Modified Dulbecco’s Medium (IMDM, Corning, New York, NY, USA), whereas GIST-882 cells were maintained in RPMI-1640 medium (Corning, USA). Both media were supplemented with 10% fetal bovine serum (FBS, Ausbian) in a 37 °C incubator containing 5% CO2. Short hairpin RNAs (shRNAs) targeting lncRNAs and control lncRNAs were synthesized by GeneChem (Shanghai, China). shCTD-2245E15.3 (5′-GCATTCTTAGGAAATGCCTCC-3′) was inserted into the GV112 lentiviral vector and a non-silencing shRNA was designed as a negative control (shCtrl group). The cells were infected with lenti-shlncRNA or lenti-shCtrl as recommended by the manufacturer. Briefly, GIST-T1 cells were incubated with viral supernatants for 72 h, then the infected cells were selected with puromycin (2 µg/mL) for 2 days before the experiments. The knockdown efficacy was determined using quantitative polymerase chain reaction (qPCR).
2.8. Cell Proliferation
CCK-8 reagent (Dojindo, Kumamoto, Japan) was used to measure GIST-T1 cell proliferation. The GIST-T1 cells were seeded in 96-well plates with 2 × 103 cells/well. Two hours before detection, 10 µL of CCK reagent was added to each well, and then the absorbance value at 450 nm was measured. Each experiment had three replicates, was repeated three times, and was measured continuously for 5 days.
2.9. Cell Apoptosis Analysis
The cells were seeded in 6-well plates at a density of 1 × 105 cells per well and cultured for 2 days. Cell apoptosis was detected with an apoptosis assay kit (eBioscience, San Diego, CA, USA) according to the instructions. Briefly, cells in 6-well plates were harvested with 0.25% trypsin-EDTA, washed twice with cold phosphate-buffered saline (PBS) buffer and centrifuged at 1300 rpm for 3 min. Then, 200 μL of 1× binding buffer and 10 µL of Annexin V-APC were added and mixed for 15 min at room temperature in the dark. The results were determined on a flow cytometer (BD, San Jose, CA, USA).
2.10. Cell Migration
A Transwell kit (8-μm pores, Corning Inc., Corning, New York, NY, USA) was used for cell migration analysis. A total of 100 µL of serum-free IMDM was added to the upper chamber and placed in the incubator at 37 °C for 1 h. The medium in the upper chamber was then removed and 1 × 105 cells with serum-free medium were added to the coated filter membrane, and 600 µL of IMDM with 30% FBS was added to the lower chamber. The cells were incubated for another 18 h. Then, the chamber was placed in 4% paraformaldehyde solution for 30 min and the migrated cells were stained with crystal violet. Nine random regions were selected to count the cell number.
2.11. Colony Formation Assay
GIST-T1 cells were inoculated in 6-well plates at a density of 1000 cells/well and cultured continuously for 14 days, with medium changes every four days. Then, the cells were washed twice with PBS and 4% paraformaldehyde was added to each hole at 4 °C for 1 h. Next, the cells were stained with 0.1% crystal violet staining solution for 10–20 min. Finally, the number of cell colonies containing > 50 cells in each well was calculated and photographed.
2.12. Real-Time Quantitative Polymerase Chain Reaction (qPCR)
Total RNA was isolated from GIST-T1 using Trizol reagent (SuperfecTRI, Shanghai, China) following the manufacturer’s instructions. Complementary DNA (cDNA) was synthesized from 1 μg of total RNA using the PrimeScript™ RT Reagent Kit (Takara, Kusatsu, Japan) and following the manufacturer’s protocol, with reverse transcription performed at 37 °C for 15 min and 85 °C for 5 s. Gene-specific primers were used for amplification: CTD-2245E15.3-F: GTCACATCGGTCAAACCCTAC; CTD-2245E15.3-R: GTTCCGTTCTGGAGTTAGTCG; β-actin-F: GCGTGACATTAAGGAGAAGC; and β-actin-R: CCACGTCACACTTCATGATGG. Real-time qPCR was performed using SYBR Green qPCR MasterMix (Bio-Rad, Hercules, CA, USA) on a CFX96 Real-Time PCR Detection System (Bio-Rad) with the following cycling conditions: initial denaturation at 95 °C for 3 min, followed by 40 cycles of 95 °C for 10 s, 60 °C for 30 s, and 72 °C for 30 s. The CT values were normalized to the value of β-actin, and the relative RNA expression levels of the genes were calculated using the 2−∆∆Ct method.
2.13. RNA In Situ Hybridization
RNA in situ hybridization was performed on formalin-fixed, paraffin-embedded (FFPE) tissue sections (4 μm) using the RNAscope® 2.5 HD Detection Kit (Advanced Cell Diagnostics, Newark, CA, USA) according to the manufacturer’s instructions. After deparaffinization, target retrieval and protease treatment, sections were hybridized with a specific probe targeting CTD-2245E15.3 at 40 °C for 2 h. Signal amplification and chromogenic detection were subsequently performed. Positive (PPIB) and negative (DapB) control probes were included. Slides were counterstained with hematoxylin and examined under a light microscope. The shRNA target sequence and RNAscope probe region were designed based on the annotated CTD-2245E15.3 transcript in the Ensembl database. RNA in situ hybridization signals were evaluated by experienced gastrointestinal pathologists who were blinded to clinical outcomes, with stromal and inflammatory areas excluded from scoring.
2.14. Statistical Analysis
Continuous variables are expressed as mean ± standard deviation (SD). The statistical differences between the two groups were analyzed using the two-tailed Mann–Whitney U test, and comparisons among multiple groups were performed using the Kruskal–Wallis test. With
p < 0.05 considered statistically significant. All statistical analyses were performed using GraphPad Prism 8.0.2 (GraphPad Software, San Diego, CA, USA). Kaplan–Meier method with log-rank test was utilized for survival analysis. Univariate and multivariate Cox proportional hazards regression analyses were performed to identify independent prognostic factors. Two-sided
p < 0.05 was considered statistically significant. Statistical analysis was performed using SPSS 26.0 (IBM Corp., Armonk, NY, USA) and R 4.3.0 (
http://www.r-project.org/) software.
4. Discussion
GIST, the most common tumor of mesenchymal origin in the gastrointestinal tract, can be graded for risk based on tumor location, size, mitotic count, and the presence or absence of rupture. Considering both practicality and convenience, the 2008 modified NIH classification is most commonly employed [
15]. Given that intermediate- and high-risk GISTs are more likely to develop implantation and metastasis, we conducted lncRNA sequencing on frozen specimens from three pairs of intermediate- and high-risk tumors along with matched peritumoral tissues. The specimens originated from the stomach, duodenum, and small intestine, respectively. While the inclusion of tissues from different anatomical sites introduces biological variability, it also serves as a strength of this study. Identification of dysregulated lncRNAs across these anatomically distinct samples suggests that these genes, including CTD-2245E15.3, may represent fundamental oncogenic drivers common to high-risk GISTs regardless of their site of origin. This implies that CTD-2245E15.3 could potentially serve as a broad-spectrum therapeutic target, although future studies with larger, site-stratified cohorts are needed to dissect site-specific nuances.
There are several previous reports on lncRNA sequencing and differential screening of GISTs and normal paratumor tissue. Jingyi Yan et al. compared lncRNA expression in primary GISTs, secondary imatinib-resistant GISTs, and normal gastric wall tissues, and performed bioinformatic analysis of differentially expressed lncRNAs to predict their potential roles in tumor development, recurrence, metastasis, and drug resistance, but did not perform functional validation and mechanistic exploration [
16]. Ugne Gyvyte et al. examined GIST and peritumor tissue and identified nine significantly dysregulated lincRNAs, of which MALAT1, H19, and FENDRR were validated by quantitative real-time PCR, and their correlation with GIST-related oncogenes and miRNAs was analyzed. However, the study used formalin-fixed paraffin specimens with severe RNA degradation, which may have affected the reliability of the results [
17]. Xiaonan Yin et al. performed RNA sequencing of GISTs with different risk grades to demonstrate the lncRNA expression profile during malignant transformation of GISTs. They showed that DNM3OS correlated with tumor volume, mitotic count, risk classification, and survival in GIST patients, and that it promoted GIST cells’ proliferation and mitosis by regulating the expression of GLUT4 and CD36 [
18]. Yebo Shao et al. [
19] performed high-throughput RNA sequencing of ten pairs of GIST tissues and adjacent normal tissues to characterize the transcription and dysregulation of lncRNAs in GISTs. The research identified RP11-616M22.7 as being associated with imatinib resistance, a lncRNA that binds to the RASSF1 protein and is involved in regulating the Hippo signaling pathway. Given the relatively limited availability of lncRNA data for GISTs in commonly used databases such as TCGA, Oncomine, and GEO, using freshly frozen surgical specimens to obtain first-hand sequencing data for a particular patient population can help to gain insight into the pathogenesis and therapeutic targets of GISTs.
The sequencing results of our study demonstrated many differentially expressed lncRNAs, and the GO and KEGG enrichment analyses provided preliminary insights into the biochemical, metabolic, and signaling pathways that may be involved in these differentially expressed lncRNAs. GO and KEGG analyses revealed that differentially expressed lncRNAs were mainly enriched in cellular metabolism-related pathways, and metabolic reprogramming was closely associated with tumor occurrence and development [
20].
However, it is worth noting that the degree of difference in lncRNA expression does not necessarily correlate with its importance in tumor development, and the bioinformatics analysis results are not sufficient to clarify the physiopathological functions and acting mechanisms of lncRNAs [
21]. We combined the degree of expression variation and HCS to initially select the lncRNA CTD-2245E15.3, and performed CCK-8 cell proliferation, transwell cell migration assay, colony formation assay, and cell apoptosis analysis to verify that knockdown of this lncRNA inhibits the proliferation and migration ability of GIST-T1 cells and induces apoptosis. The results in GIST-882 cells are consistent with those observed in GIST-T1, showing that CTD-2245E15.3 knockdown suppresses proliferation and migration and promotes apoptosis. These results suggest that CTD-2245E15.3 may be associated with the biological behaviors of GISTs, such as tumorigenesis, invasion, and metastasis.
Reports on the lncRNA CTD-2245E15.3 are relatively rare. Chen Wang et al. employed lncRNA microarray analysis to analyze differentially expressed lncRNAs in non-small-cell lung cancer (NSCLC), from which CTD-2245E15.3, with the highest expression level, was selected and its tumor growth-promoting effect was demonstrated in in vitro and in vivo experiments [
22]. The authors also explored the mechanism of CTD-2245E15.3 and found that the down-regulated genes after the lncRNA knockdown were mainly involved in biomolecular metabolic processes and fatty acid synthesis through GO analysis. The study confirmed that CTD-2245E15.3 binds to two key enzymes of anabolic metabolism, acetyl-CoA carboxylase 1 (ACC1) and pyruvate carboxylase (PC), and enhances cell proliferation by promoting anabolism, thus exerting pro-tumorigenic effects.
Although bulk RNA sequencing enables comprehensive transcriptomic profiling, it inherently reflects composite signals derived from tumor cells as well as surrounding stromal and immune components. Therefore, differential expression identified from bulk tissue may partially capture microenvironmental influences rather than tumor cell–intrinsic alterations alone. To mitigate this limitation, we performed RNA in situ hybridization in an independent clinical cohort, which demonstrated that CTD-2245E15.3 expression was predominantly localized within tumor regions. Nevertheless, we acknowledge that bulk transcriptomic analyses cannot fully disentangle cellular heterogeneity. Future studies integrating single-cell RNA sequencing or computational deconvolution approaches would further clarify the cell-type–specific expression pattern and functional contribution of CTD-2245E15.3 in GIST progression.
Our study depicted a panel of differentially expressed lncRNAs from different sites of intermediate- to high-risk GIST, which were mostly enriched in metabolism-related pathways as shown by GO and KEGG analyses. We selected CTD-2245E15.3 based on the degree of differential expression and HCS results, in which it was shown to promote GIST cell proliferation and migration and reduce apoptosis by in vitro functional cellular assays. To further strengthen the translational relevance of CTD-2245E15.3, future mechanistic studies are planned to elucidate its molecular interactions and regulatory networks in GISTs. Specifically, we aim to perform RNA pull-down and RNA immunoprecipitation (RIP) assays to identify binding proteins and potential effector molecules. Additionally, overexpression and complementary knockdown experiments will be conducted to validate the causal roles of CTD-2245E15.3 in modulating proliferation, migration, and apoptosis. These planned studies will provide mechanistic insights and may inform the development of targeted therapeutic strategies in the future. The main challenge in lncRNA research is determining how to select effective therapeutic targets or tumor markers from the vast amount of sequencing data. We will continue to attempt to validate new lncRNAs with therapeutic or diagnostic potential among differentially expressed lncRNAs, and will validate the expression of lncRNAs in more clinical cases and their relationship with survival, recurrence, metastasis, and drug resistance.