1. Introduction
Lung cancer constitutes the leading cause of cancer death worldwide, with an estimated 1.8 million deaths and 2.2 million new cases registered in 2020 [
1,
2,
3,
4]. Approximately 85% of all lung cancer cases correspond to non-small cell lung cancer (NSCLC) [
3,
4,
5], with adenocarcinoma and squamous cell carcinoma being the most common histological types [
1]. A major clinical difficulty lies in the fact that a high percentage of patients, around 60–65%, are diagnosed at locally advanced or metastatic stages [
4,
5], significantly limiting curative treatment options.
In recent decades, the management and therapeutic approach to NSCLC have undergone a radical transformation, evolving from a unique approach based on cytotoxic chemotherapy towards a precision medicine model guided by molecular biomarkers [
2,
6,
7]. This strategy is based on the identification of specific genomic alterations, known as oncogenic drivers, which drive tumor growth and can be selectively inhibited [
8]. This has allowed for the identification of patient subgroups that benefit from targeted therapies, significantly improving survival [
2,
6]. The identification of actionable biomarkers has become essential to guide therapeutic decisions, enabling more effective and less toxic treatments.
National and international clinical guidelines, such as those of the Spanish Society of Pathology (SEAP) and the Spanish Society of Medical Oncology (SEOM), the National Comprehensive Cancer Network (NCCN), or the European Society for Medical Oncology (ESMO), have rapidly expanded the list of predictive biomarkers whose determination is considered essential for therapeutic decision-making in NSCLC. Thus, key molecular biomarkers that must be mandatorily determined in patients with NSCLC include mutations in
EGFR,
BRAF,
KRAS, and
MET, as well as rearrangements or fusions in
ALK,
ROS1,
NTRK, and
RET, in addition to PD-L1 expression, which is a crucial biomarker for immunotherapy [
1,
9,
10,
11]. This essential panel includes genomic alterations with approved therapies. Among them are
KRAS mutations, which are the most frequent, present in 25–30% [
11,
12,
13]; activating mutations of the
EGFR gene, present in 10–20% of the Caucasian population and up to 50% of the Asian population [
2,
3,
14];
BRAF V600E mutations (2–4%) [
2,
3]; splicing mutations causing
MET exon 14 skipping (3–4%) [
2,
13,
15,
16]; and rearrangements of the
ALK (2–7%),
ROS1 (1–2%),
RET (1–2%), and
NTRK (0.1–1%) genes [
2,
11,
17,
18,
19].
Furthermore, the field of precision oncology continues to expand with the consolidation of emerging biomarkers. Mutations in
ERBB2 (
HER2), mainly exon 20 insertions occurring in 2–4% of adenocarcinomas [
3,
11], have become actionable thanks to new drugs such as antibody–drug conjugates [
2,
20,
21,
22]. Along with these, the identification of concomitant mutations in genes such as
TP53,
STK11, or
KEAP1 has an increasingly relevant prognostic and predictive value, influencing the response to targeted therapies and immunotherapy [
10,
13,
23,
24,
25]; for example, the coexistence of a
TP53 mutation with a main driver (e.g.,
EGFR or
ALK) is consistently associated with a worse prognosis and shorter duration of response to targeted therapy [
26].
The growing number of clinically relevant biomarkers has surpassed the capabilities of traditional single-gene testing methods, such as real-time polymerase chain reaction (RT-PCR) or fluorescence in situ hybridization (FISH), making sequential gene-by-gene analysis strategies inefficient, as they consume a large amount of tissue—often scarce—and delay the acquisition of a complete molecular profile [
2,
8,
23,
27,
28]. In this context, next-generation sequencing (NGS) has consolidated itself as the standard of care recommended by major scientific societies [
1,
2,
3,
8,
9,
10,
11]. NGS allows for the simultaneous analysis of multiple genes and types of alterations—including single-nucleotide variants (SNVs), small insertions and deletions (indels), copy number alterations (CNAs), and gene fusions—from a single sample, being more cost-effective, saving time, and maximizing the detection of actionable alterations [
23,
27,
28,
29,
30,
31]. This capability for comprehensive molecular profiling is crucial, especially because NSCLC is a molecularly heterogeneous disease where the coexistence of multiple alterations, or the presence of rare and uncommon mutations, can influence treatment response and resistance.
For comprehensive genomic characterization, an approach combining DNA and RNA analysis is considered ideal [
10]. RNA-based NGS has proven to be superior and more sensitive for the detection of gene fusions (
ALK,
ROS1,
RET,
NTRK), whose breakpoints may be located in large introns that hinder their detection by DNA NGS, and for splicing events, such as
METex14 [
6,
10,
13,
15,
16,
17,
18,
32,
33]. The implementation of an NGS-based workflow in routine clinical practice is, therefore, a fundamental pillar of modern thoracic oncology.
This study aims to describe the experience of the first two years of the Molecular Diagnosis Unit at the Hospital Universitario de Gran Canaria Dr. Negrín (HUGCDN) in sequencing a non-selected cohort of patients with NSCLC, detailing the clinicopathological characteristics and the spectrum of genomic variants found using NGS panels.
4. Discussion
The implementation of NGS in our center has allowed for the characterization of the genomic landscape of NSCLC in the Canary Islands population. Our findings reveal a prevalence of actionable alterations (55.1%) consistent with other Western and Spanish cohorts, such as the ATLAS study [
36], but with distinct local particularities. Beyond confirming the utility of NGS over sequential testing, our real-world data provide crucial insights into the complexity of co-mutation profiles and age-dependent genomic patterns, which have direct implications for patient management and prognosis.
For the analysis of our results, we stratified the cohort into three age groups (<50, 50–69, and ≥70 years). This division, although variable in the literature, is based on recurrent cut-off points that have demonstrated biological and clinical relevance [
37]. Age significantly influences the molecular presentation of tumors, likely reflecting different exposures to carcinogens and the cellular aging process [
38]. The definition of “young patient” often uses thresholds of 40 or 50 years, while the advanced age group is usually defined from 60 or 70 years onwards [
37,
39,
40]. This stratification allows us to explore if our cohort replicates the age-dependent genomic patterns previously described.
Several large-scale studies have reported that younger patients present a higher frequency of gene fusions (
ALK,
ROS1,
RET) and certain
EGFR mutations, while older patients tend to show a higher prevalence of
KRAS mutations,
METex14 splicing alterations, and a higher tumor mutational burden (TMB) [
37,
39,
40]. Therefore, analyzing our findings across these age groups will allow us to contextualize the molecular profile of our population and evaluate the distribution of actionable biomarkers across different age segments.
Our findings, obtained in a cohort of 448 tumors from predominantly Canarian patients, largely reflect the genomic landscape described in Caucasian populations, albeit with certain peculiarities deserving discussion compared to the literature [
41,
42]. The overall frequency of actionable alterations detected (55.1% of tumors with at least one variant in
KRAS,
EGFR,
BRAF,
MET,
ALK,
RET,
NTRK, or
ROS1) underscores the importance of implementing broad genomic profiles in our clinical practice.
Activating mutations in
EGFR were detected in 14.06% of our tumors, a prevalence fitting perfectly within the 10–20% range described for Caucasian patients [
11,
43] and specific European (12.8–14.1%) and Spanish (14–14.5%) data [
41,
44]. However, the distribution of canonical variants showed a slight deviation: exon 19 deletions represented 35.62% of
EGFR mutations and L858R 16.44%, whereas in the literature, they usually account for around 57% and 23%, respectively [
45]. This lower relative proportion of L858R could be a particular characteristic of our population or due to sample size. Exon 20 insertions constituted 5.48% of
EGFR mutations, within the expected range of 4–12% [
44,
46,
47].
A notable finding was the detection of
EGFR mutations in 14 squamous cell carcinomas (22.22% of total mutated
EGFR). Of these alterations, three corresponded to deletions in exon 19 and one alteration to L858R. Although traditionally associated with adenocarcinoma, this result highlights the importance of not ruling out testing in squamous cell carcinoma histologies; although guidelines indicate that alterations in all genes should only be sought in patients with low or no smoking history or <50 years [
11], in our population, there are four possible cases that could benefit from targeted therapy with tyrosine kinase inhibitors.
KRAS mutations were the most frequent alteration in our series (27.23%), a figure that aligns perfectly with prevalences reported in non-squamous NSCLC of Western populations (15–30%) [
2,
24] and adenocarcinomas (20–25%) [
13], although slightly lower than the 36% found in some Spanish cohorts analyzed by NGS [
41]. The distribution of variants within
KRAS was also consistent, with G12C being the most common (37.40%), followed by G12V (26.84%) and G12A (6.50%), consistent with data placing G12C between 39 and 42% of total
KRAS mutations [
13,
24], although slightly below the 53.6% reported in the Spanish ATLAS cohort [
36]. This predominance of G12C reinforces the relevance of recently developed specific inhibitors.
Regarding histology, 81.97% of KRAS mutations were found in adenocarcinomas, as expected, but it is interesting to note their presence in 33.33% (17/51) of NOS, confirming their relevant role also in less differentiated tumors. Regarding squamous histology, three alterations of the G12C variant were detected in patients > 50 years, which again highlights the importance of rethinking variant testing in tumors with this histology.
The prevalence of
BRAF mutations (5.36%) in our cohort was at the high end of the 2–8% range reported in the literature [
8,
9,
11,
41]. The proportion of the V600E variant was 33.33% (8/24), lower than the 50% usually described [
8,
11], indicating a significant representation of non-V600E variants in our population [66.67%], whose clinical relevance is being actively investigated.
MET alterations involving exon 14 skipping (
METex14 skipping) were found in 8 of the 448 analyzed tumors (1.80%), a frequency below the 3–4% described in the literature [
11,
16,
41,
46,
48].
MET amplification, on the other hand, was less frequent in our series (2/448; 0.4%) compared to reported rates of 1–6% as a primary event [
13,
36,
49,
50]. This low frequency could be real or influenced by the detection thresholds and amplification definitions used.
For
ERBB2 (
HER2), we identified alterations in 3.57% of cases, in line with the expected 2–4% [
11,
20,
22]. However, the variant distribution was atypical. The Y772_A775dup insertion accounted for 25% (4/16) in our cohort, even though in large cohorts of exon 20 insertions, Y772_A775dup constitutes the majority: 58% in the Chinese cohort and 41.6% in the US cohort analyzed in a multicenter study of 3000 patients. In the real-world HaploX database, the Y772_A775dup alteration was observed in 71.5% of the 284
ERBB2 exon 20 insertions detected [
51,
52,
53]. This could suggest a different diversity in the Canarian population.
ERBB2 amplifications (CNAs) were detected in five cases of the 448 analyzed tumors [1.12%], consistent with the 1–4% described in the consulted bibliography [
11,
22,
36]. Regarding single-nucleotide variants, analysis of the 448 tumors showed seven alterations [43.75%].
Regarding gene fusions, our results show rates within the expected range for
ALK (2.23% vs. 2–7%) [
9,
21,
54], which were lower than expected for
ROS1 (0.45 vs. 1–2%) [
2,
11,
44,
54] but slightly higher than expected for
RET (2.23% vs. 1–2%) [
11,
55] and
NTRK (1.12% vs. <1%) [
11]. The lower frequency of
ROS1 could be due to population factors, although the implemented RNA-based detection should maximize sensitivity [
2,
4,
11,
16,
22,
26,
45,
46,
47]. The slightly elevated prevalence of
RET and
NTRK is an interesting finding that could indicate an enrichment in our geographic area, although absolute numbers are small.
Analysis by age groups (<50, 50–69, ≥70 years) confirms some trends described in the literature on NSCLC in young patients (AYA, ≤50 years) [
19]. The most striking observation is the concentration of
ALK fusions: four of the ten
ALK-positive cases (40%) occurred in the <50 years group, which only represents 2.7% of the total cohort. This translates to an
ALK prevalence of 33.33% (4/12) in this group, much higher than the 2.2% globally and in line with the 10–25% enrichment described in AYA [
19]. Conversely,
KRAS mutations were infrequent in those under 50 years (16.7%, 2/12), concentrating mostly in the 50–69 years group (66.4% of all
KRAS, 81/122), supporting the association of
KRAS with older ages and possibly greater cumulative tobacco exposure [
19,
46]. The single
EGFR mutation detected in <50 years (8.3%) suggests a low prevalence in this group in our cohort, although the literature varies on this [
19]. Our data, although limited in the youngest group, reinforce the concept of distinct molecular profiles according to age, highlighting the importance of screening for fusions in young patients. Therefore, age should not be a deterrent for testing actionable fusions, particularly
ALK, even in the absence of other clinical risk factors.
Correlation with sex also showed patterns consistent with the literature. There was a higher frequency of
EGFR mutations in women (39 women vs. 24 men), which was especially marked in the ≥70 years group (18 vs. 10), and there was a higher prevalence of
KRAS (75 men vs. 47 women) and
BRAF (16 men vs. 8 women) mutations in men [
2,
22,
46]. These findings reaffirm the epidemiological and possibly biological differences linked to sex in NSCLC.
Analysis of smoking status in our cohort (
Table 3 and
Table 8) confirms several key associations and reveals interesting findings. As expected, we observed an almost absolute association of squamous cell carcinoma with smoking (99.1% of cases in smokers/former smokers), as well as the expected high prevalence of
KRAS (89.3%) and
TP53 (81.1%) mutations in this same patient group. This agrees with the literature, which links these alterations to high tumor mutational burden (TMB) and exposure to tobacco carcinogens [
3]. Specifically, our data showed that
KRAS G12C was the most prevalent variant (45.1%), aligning with the description of G12C and G12V variants as dominant in smokers [
8,
11,
13,
36,
46]. We also observed that the vast majority of
BRAF mutations (22/24 cases, 91.7%) were associated with smoking. This fact could correlate with our high proportion of non-V600E variants (58.3%), as the literature describes that non-V600
BRAF mutations (Class II and III) are more common in patients with a smoking habit [
11,
24,
46]. Regarding
MET and
HER2, our data (7/11 and 11/16 in smokers/former smokers, respectively) support the lack of a clear statistical association with smoking, as described in the literature [
2,
3,
22,
24,
41,
50].
However, the most striking finding of our cohort is the distribution of driver alterations classically associated with non-smokers (
EGFR,
ALK,
ROS1,
RET). Although the literature indicates an
EGFR prevalence of up to three times higher in non-smokers [
2,
26,
44], in our series, the majority of
EGFR mutations (34/63, 54.0%) were detected in patients with a history of smoking (18 smokers, 16 former smokers) compared to only 22 cases in non-smokers. Similarly,
ALK fusions, traditionally linked to non-smokers [
11,
46,
54,
56], were found in five out of ten cases (50%) in patients with a smoking history. Although we did not find
ROS1 or
RET fusions in non-smokers, our numbers are too small to establish a trend. These data on
EGFR and
ALK, together with the 14
EGFR cases in squamous cell carcinomas, suggest that, in our population, smoking should not be an exclusion factor for seeking key actionable alterations. These results strongly support the implementation of universal NGS testing for all NSCLC patients, regardless of their smoking history.
Finally, the high prevalence of
TP53 alterations (30.8%, underestimated due to the panel change) agrees with its role as the most frequently altered gene in NSCLC and its frequent co-occurrence with other drivers [
25,
41,
42,
55]. Full characterization of
TP53 variants and their correlation with other genes and prognosis will be the subject of future analysis once complete data with the OPA panel is available.
One of the most relevant findings enabled by NGS through the use of panels is the identification of concomitant genomic alterations in the same tumor. The traditional paradigm considering driver mutations as mutually exclusive events has been superseded, as current evidence demonstrates that the coexistence of multiple alterations is a more frequent phenomenon than initially estimated, with profound clinical implications [
42,
57]. Although the prevalence of the coexistence of two or more actionable drivers is low, reported in around 1.5–1.7% of NSCLC patients [
42,
58], comprehensive genomic analysis reveals that up to 82.8% of tumors with a known driver harbor at least one additional pathogenic co-alteration [
57]. In our series, we identified 172 cases with more than one alteration [69.6% of tumors with molecular alterations], and notably, 86 cases presented concomitant alterations among the ten main genes analyzed in this series [34.8% of tumors with molecular alterations]. This underscores the limited scope of single-gene testing.
Our data confirms
TP53 as the predominant co-driver in NSCLC, present in 29.5% of cases and frequently co-occurring with
KRAS (5.6%) and
EGFR (2.5%). This high prevalence is clinically critical because
TP53 co-mutations are not merely passenger events; they are established negative prognostic factors associated with reduced responsiveness to tyrosine kinase inhibitors in
EGFR-mutant patients [
59] and variable responses to immunotherapy in
KRAS-mutant tumors [
25]. By capturing these complex
TP53-driven profiles, our study demonstrates that comprehensive genomic profiling is essential not just for diagnosis but for accurate prognostic stratification.
Finally, our study has limitations inherent to its retrospective, single-center design, which may limit the generalizability of the findings to broader populations. Furthermore, the technological transition during the study period introduces a bias in the estimation of variant frequencies. The switch from the Oncomine Focus Assay (OFA) to the Oncomine Precision Assay (OPA) means that genes such as TP53 were not sequenced in samples from the first period (2023), leading to an underestimation of their overall prevalence in the total cohort. Conversely, genes like ARAF, present in OFA coverage but not in OPA, were only assessed in the first cohort.