Molecular Landscape and Validation of New Genomic Classification in 2668 Adult AML Patients: Real Life Data from the PETHEMA Registry

Simple Summary Next–Generation Sequencing (NGS) has provided a deeper genetic understanding of acute myeloid leukemia (AML) that has been recently incorporated into AML classification and risk–stratification guidelines. Single molecular analysis has become inefficient and molecular testing based on NGS is emerging as an irreplaceable diagnostic tool in clinical settings. The PETHEMA cooperative group has constituted a nationwide NGS network with centralized analysis in seven high–skilled laboratories. The study of molecular profiles in the “real–life” PETHEMA cohort supports the increasing role of NGS on the clinical management of AML patients. Abstract Next–Generation Sequencing (NGS) implementation to perform accurate diagnosis in acute myeloid leukemia (AML) represents a major challenge for molecular laboratories in terms of specialization, standardization, costs and logistical support. In this context, the PETHEMA cooperative group has established the first nationwide diagnostic network of seven reference laboratories to provide standardized NGS studies for AML patients. Cross–validation (CV) rounds are regularly performed to ensure the quality of NGS studies and to keep updated clinically relevant genes recommended for NGS study. The molecular characterization of 2856 samples (1631 derived from the NGS–AML project; NCT03311815) with standardized NGS of consensus genes (ABL1, ASXL1, BRAF, CALR, CBL, CEBPA, CSF3R, DNMT3A, ETV6, EZH2, FLT3, GATA2, HRAS, IDH1, IDH2, JAK2, KIT, KRAS, MPL, NPM1, NRAS, PTPN11, RUNX1, SETBP1, SF3B1, SRSF2, TET2, TP53, U2AF1 and WT1) showed 97% of patients having at least one mutation. The mutational profile was highly variable according to moment of disease, age and sex, and several co–occurring and exclusion relations were detected. Molecular testing based on NGS allowed accurate diagnosis and reliable prognosis stratification of 954 AML patients according to new genomic classification proposed by Tazi et al. Novel molecular subgroups, such as mutated WT1 and mutations in at least two myelodysplasia–related genes, have been associated with an adverse prognosis in our cohort. In this way, the PETHEMA cooperative group efficiently provides an extensive molecular characterization for AML diagnosis and risk stratification, ensuring technical quality and equity in access to NGS studies.


Introduction
Introduction of Next-Generation Sequencing (NGS) into routine molecular diagnosis has provided deep molecular knowledge of acute myeloid leukemia (AML). These findings have allowed for the refinement of classification and risk stratification systems based on recurrent genetic abnormalities.
In 2016, Papaemmanuil et al. proposed the first genomic classification of AML that identifies 11 molecular classes, each with distinct diagnostic features and clinical outcomes [1]. This classification has been recently revised and updated in Tazi et al., 2022, proposing 16 molecular classes based on cytogenetics and the mutational status of 32 genes [2]. The importance of genomic characterization has also been reflected in the recently revised World Health Organization (WHO) Classification [3], new International Consensus Classification (ICC) [4] and European LeukemiaNet (ELN) risk stratification [5], which prioritize genetic abnormalities to establish diagnosis and prognosis to evaluate measurable residual disease (MRD) and to select treatment.
In this situation, molecular analysis by single-gene techniques has become inefficient in order to provide a complete characterization of AML. In contrast, NGS represents a more sensitive tool to capture all the relevant molecular markers in one assay and is widely recommended to study the molecular landscape of this disease [6].
NGS implementation to perform accurate diagnosis in AML is currently demanded by physicians and patients. However, introduction of NGS into clinical routine faces novel challenges [7]. NGS requires large batches of samples in order to be cost-effective, workflows are time-consuming, and interpretation needs highly qualified specialists. Moreover, the diversity of NGS panels, platforms and quality control criteria might prevent the success of the approach [8]. Hence, to efficiently introduce NGS into routine molecular diagnostics, it is necessary to establish quality requirements and to standardize gene panels and variant reporting.
In order to provide comprehensive NGS studies to AML patients and to guarantee equity of access, the PETHEMA cooperative group (Programa Español de Tratamientos en Hematología) has established a nationwide network of central laboratories aimed to harmonize NGS results under consensual criteria in newly diagnosed and relapsed/refractory AML patients [9].
This study summarized the NGS-AML project (NCT03311815), reporting quality control assays and the molecular profile of 2668 AML patients reported in the PETHEMA AML registry. We show co-occurring and mutual exclusion relationships among genes and distinct molecular profiles according to disease stage, age and sex and genomic classification in the "real-life PETHEMA cohort".

Development of the Diagnostic Platform
Implementing NGS studies in the routine molecular diagnosis of AML patients requires specialization, budgetary stability and logistical support. The PETHEMA cooperative group established a nationwide network of NGS studies for fast and standardized molecular diagnosis of AML. This strategy aims to provide coverage of NGS studies to 38.5 million inhabitants distributed in geographical areas ranging from 2.2 to 8.9 million habitants. For this purpose, seven centers with logistical and technical capacity for the management of a high number of samples were designed as reference laboratories for the centralization of samples submitted by PETHEMA institutions in each area. In this way, a large territory and population was covered. The platform was supported by PETHEMA in logistical management, as well as, closer and well-established relationships between the sample referral institution and the assigned central laboratory. The designated reference centers for NGS analysis concentrate a large number of AML samples, allowing for the rapid completion of the sequencing runs and their management by highly specialized staff.

Consensus Genes Establishment
The development of the diagnostic platform required several meetings to coordinate criteria on which genes should be analyzed based on their clinical relevance in AML. After extensive bibliographic revision of current molecular basis of AML, all reference laboratories should assess by NGS the mutational status of genes that define the diagnosis and prognosis as well as guide treatment options (ASXL1, BCOR, CEBPA, EZH2, FLT3, IDH1, IDH2, NPM1, RUNX1, SF3B1, SRSF2, STAG2, U2AF1, ZRSR2 and TP53). Moreover, there was a recommendation for the study of other genes with proven evidence on their relevance in AML pathogenesis (ABL1, BRAF, CALR, CBL, CSF3R, DNMT3A, ETV6, GATA2, HRAS, JAK2, KIT, KRAS, MPL, NRAS, PTPN11, SETBP1, TET2 and WT1).
The establishment of consensus genes has enabled laboratories to work with the NGS platforms and panels according to their individual requirements, which have largely enabled the development and maintenance of the diagnostic platform. NGS panels and platforms used by each center are described in Supplementary Materials Tables S1 and S2.

NGS Standardization Procedures and Cross-Validation Rounds
The diagnostic platform established a quality control assay by exchanging control samples among reference laboratories every 9-12 months. To date, three cross validation (CV) rounds have been performed. As previously reported, in the first and second CV rounds, minimum quality parameters [uniformity (>85%) and mean read depth of 1000X] and consensus recommendations for variant report [All centers should report: (1) mutations in relevant genes for clinical guidelines, targeted therapy and risk stratification and (2) all pathogenic variants detected with VAF >5% excepting those described at hotspot regions which will be reported up to 1% VAF] were stablished. For both CV rounds, 10 samples harboring 54 variants were distributed. In first CV round, VAF for all variants was >5% while in the second CV round, 5 from 30 total variants had low VAF ranging from 1. 8-4.9%. In the third CV round, to explore the accuracy and reliability of low VAF variants (<5%) detection, 4 samples harboring 32 variants (11 VAF < 5%) were analyzed, according to previously established criteria [9].

Patients and Inclusion Criteria
All adult patients (≥18 years) with newly diagnosed or relapsed/refractory AML (excluding acute promyelocytic leukemia) according to the World Health Organization criteria (2008 and 2016), regardless of the treatment received, were eligible for mutational profile study by NGS. The Institutional Ethics Committee for Clinical Research of each institution approved this study. Written informed consent in accordance with the recommendations of the Declaration of Human Rights, the Conference of Helsinki, and institutional regulations were obtained from all patients.
This classification also proposes an integrated risk score based on the 16 molecular classes: "NPM1", "inv(16)", "t(8;21)", "CEBPAbi" and "No events" define the favorable risk group; "sAML1", "t(6;9)", "WT1", "mNOS", "t(11;X)", "DNMT3A-IDH1/2" and "trisomies" the intermediate and "TP53-CK", "sAML2" and "inv(3)" the adverse risk group. Genomic groups and sub-classifications are summarized in Supplementary Table S3. Internal tandem duplications (ITD) in FLT3 are the only genetic alterations with independent prognosis value from class membership. These mutations were not considered as "class defining" alterations as they are represented in all classes but modulate risk groups classification as follows: In the favorable risk group, patients with mutated NPM1 who also harbored a FLT3-ITD mutation were reclassified to the intermediate risk group. Similarly, a FLT3-ITD mutation reclassifies intermediate risk patients to an adverse risk group. Moreover, in order to assess the prognostic impact of TP53 configurations, we classified patients according TP53 mono-allelic or multi-hit as described by Tazi et al.: Mono-allelic: One TP53 mutation with VAF ≤ 65%; and multi-hit: Two TP53 mutations or one TP53 mutation with VAF > 65% or one TP53 mutation + del (17).

Statistics
All statistics were performed using SPSS version 22 (IBM, Armonk, NY, USA) and GraphPad Prism 4 (GraphPad, La Jolla, CA, USA) software programs. Chi square test was used to assess associations between categorical variables. Survival analyses were performed using the Kaplan-Meier method and the log-rank test. The Cox proportional-hazards model was used to evaluate the risk of death among groups. p-value (p) < 0.05 was considered as a statistically significant test.

Third Cross Validation Round
In the third CV round, the error rate (ER) for variants with VAF > 5% decreased from previous rounds: 1st: 39%, 2nd: 14.4% and 3rd: 4.76%. However, the ER in variants with VAF < 5% increased from 28.6% (five variants with mean VAF 3.3%) to 59.6% (11 variants with mean VAF 1.2%). ER, mean VAF and standard deviation (SD) for the last CV round are summarized in Supplementary Table S4 and Figure S1. Therefore, the diagnostic platform maintained: (1) the cut-off of VAF > 5% to report clinically relevant variants and (2) the criteria to report only low VAF variants (<5%) in hotspot regions with strong clinical evidence, suggesting variant confirmation in an additional sample.

Baseline Demographics and Molecular Profile in NGS-AML Protocol Cohort
From October 2017 to October 2019, NGS analyses were performed in 1631 samples from 1471 AML patients enrolled in the NGS-AML protocol (NCT03311815), with available clinical date (i.e., treatment approach). Disease status at samples collection was: 1268 diagnosis (DX), 204 relapse (REL) and 159 refractoriness (RES) ( Table 1). An additional cohort of 1225 samples analyzed by NGS between November 2019-May 2021 has been included only for molecular characterization of AML: 1166 (DX), 47 (REL) and 12 (RES). Only those samples that met NGS quality requirements established by the diagnostic platform were included in the study. Overall, 2856 samples were analyzed (2434 DX, 251 REL, and 171 RES).

Paired Samples and Mutation Stability
Molecular profiling of paired samples was conducted in 97 patie relapse (DX-REL) and 59 patients at diagnosis-refractoriness (DX-R comparison, loss events were more frequently observed than gain events in relapse samples. Interestingly, 17.2% of patients showed simultaneous gain events, while 24.1% maintain the diagnosis' mutational profile. The m genes were TP53, WT1 and NPM1, with stability rates of 81.3%, 80% and 7 In contrast, signaling activating genes were found to be highly unstable: K FLT3-TKD mutations, NRAS, KRAS and PTPN11 showed stability

Paired Samples and Mutation Stability
Molecular profiling of paired samples was conducted in 97 patients at diagnosis-relapse (DX-REL) and 59 patients at diagnosis-refractoriness (DX-RES). In DX-REL comparison, loss events were more frequently observed than gain events (43.7% vs. 26.4%) in relapse samples. Interestingly, 17.2% of patients showed simultaneous mutation loss and gain events, while 24.1% maintain the diagnosis' mutational profile. The most stable mutated genes were TP53, WT1 and NPM1, with stability rates of 81.3%, 80% and 77.8%, respectively. In contrast, signaling activating genes were found to be highly unstable: KIT, FLT3-ITD and FLT3-TKD mutations, NRAS, KRAS and PTPN11 showed stability rates below 50%. Moreover, while mutations in KIT, FLT3-TKD, KRAS and PTPN11 were almost equally lost and acquired, mutations in NRAS and FLT3-ITD were predominantly lost (Supplementary Figure S3).
In refractory AML, 49.2% of patients retained the mutational status found at diagnosis. NPM1 mutations were also the most stable gene at refractory AML (83.3%). Signaling activating genes were highly unstable: NRAS, KRAS, PTPN11 and FLT3-TKD showed stability rates below 45%, being more frequent the loss of these mutations (Supplementary Figure S4).

Integrated Risk Score
We also assessed the integrated risk score based on cytogenetic and gene mutations proposed in the new genomic classification [2]. In the evaluable cohort (N = 954), 23.8% of patients were classified to a favorable risk group, 27.1% were included in the intermediate risk group and 49.1% in the adverse risk group. We also evaluated the impact of age at

Integrated Risk Score
We also assessed the integrated risk score based on cytogenetic and gene mutations proposed in the new genomic classification [2]. In the evaluable cohort (N = 954), 23.8% of patients were classified to a favorable risk group, 27.1% were included in the intermediate risk group and 49.1% in the adverse risk group. We also evaluated the impact of age at diagnosis in the new genomic classification. Risk stratification distribution was significantly different between age groups: <65 years (intensive = 416; non-intensive = 10) vs. ≥65 years (intensive = 150; non-intensive = 378). Young patients showed homogeneous distribution of the different risk groups (Favorable: 31.2%, Intermediate: 32.4% and adverse: 36.4%), while older patients were predominantly included in the adverse risk group (Favorable: 17.8%, Intermediate: 22.9% and adverse: 59.3%; p < 0.001).
In terms of outcomes, median OS was 30.8 months (95%CI NR) in the favorable risk group,  in the intermediate group and .00) in the adverse risk group (p < 0.001) ( Figure 8A). Intermediate and adverse risk patients had 1.5 (95%CI 1.1-2.0; p < 0.01) and 2.7 (95%CI 2.1-3.5; p < 0.001) increased risk of death relative to favorable risk group ( Figure 7B, Supplementary   in the intermediate group and .00) in the adverse risk group (p < 0.001) ( Figure 8A). Intermediate and adverse risk patients had 1.5 (95%CI 1.1-2.0; p < 0.01) and 2.7 (95%CI 2.1-3.5; p < 0.001) increased risk of death relative to favorable risk group ( Figure 7B, Supplementary Table S7B). We also found differences in terms of OS in both age groups. Median OS in favorable-risk group was not reached at 42.8 months in patients < 65 years, while intermediate and adverse risk groups reached a median OS of 23.6 (95%CI NR) months and , respectively (p < 0.001) ( Figure 8B). In contrast, median OS of older patients was significantly decreased for all risk groups: Favorable (14.0;, intermediate (14.6;) and adverse (6.9;3) (p < 0.001) ( Figure 8C).

Comparison between the Integrated Risk Score and 2022 ELN Risk Classification
According to current 2022 ELN risk stratification, we selected 546 fit patients for a tentative comparison with the AML genomic classification risk score. We did not find a distinct OS (2022 ELN vs. Tazi   We also found differences in terms of OS in both age groups. Median OS in favorablerisk group was not reached at 42.8 months in patients < 65 years, while intermediate and adverse risk groups reached a median OS of 23.6 (95%CI NR) months and , respectively (p < 0.001) ( Figure 8B). In contrast, median OS of older patients was significantly decreased for all risk groups: Favorable (14.0;, intermediate (14.6;) and adverse (6.9;3) (p < 0.001) ( Figure 8C).
Regarding risk score, young patients classified in the intermediate and adverse risk group showed 2.6 (95%CI 1.5-4.4; p < 0.001) and 3.5 (95%CI 2.1-5.8; p < 0.001) higher risk of death relative to those patients classified in the favorable risk group ( Figure 7C, Supplementary Table S7C). On the other hand, patients >65 years classified in the adverse risk group showed an increased risk of death of 1.9 (95%CI 1.4-2.5; p < 0.001) compared to those classified as favorable risk. No statistically significant results were found in the risk of death of intermediate risk patients: 1.0 (95%CI 0.7-1.4; p = 0.864) ( Figure 7D, Supplementary  Table S7D). In general, patients >65 years had a dismal prognosis and higher risk of death compared to younger ones for all risk groups: Favorable: 4.7 (95%CI 2.8-8.0; p < 0.001), intermediate: ; p < 0.01) and adverse: 2.6 (95%CI 2.0-3.5; p < 0.001).

Comparison between the Integrated Risk Score and 2022 ELN Risk Classification
According to current 2022 ELN risk stratification, we selected 546 fit patients for a tentative comparison with the AML genomic classification risk score. We did not find a distinct OS (2022 ELN vs. Tazi

Discussion
NGS has become the preferred technology to capture the heterogeneous molecular landscape of AML [2][3][4][5]. These approaches have been rapidly adopted as a potential routine tool for molecular diagnosis in AML patients [11]. However, its translation into clinical practice is hampered by specific requirements, such as the necessity of highly skilled laboratories, the increased cost compared to single-gene assays and the expected high turnaround time [7,12]. In this scenario, our results show that an NGS diagnostic platform, established by the PETHEMA cooperative group, was able to provide NGS reliable results with all relevant molecular data currently required for diagnosis and prognosis stratification and therapy choice. In addition, this kind of cooperative study allows for the assessment of the genetic heterogeneity of AML in large cohorts of patients and provides an extensive quality molecular data to evaluate current genomic knowledge in "real-life" AML cohorts.
Due to the rapid NGS implementation into routine molecular diagnosis of AML, NGS workflows and quality specifications are heavily reliant on laboratory specific procedures [13,14]. Therefore, current NGS analyses are characterized by the lack of standardized procedures, the diversity of quality metrics criteria, and the high variability of the assessed genes and variant reporting criteria [15]. To address the need for harmonization procedures, the PETHEMA cooperative group implemented the first Spanish nationwide NGS testing strategy. Regular rounds of cross-validation were planned in order to identify weaknesses and to establish consensus quality metrics criteria for variant reporting among seven central laboratories in AML molecular diagnosis.
Networking for NGS studies allowed us to identify challenges in its clinical implementation [9]. The first CV round revealed the absence of AML key genes in some NGS approaches, and consequently a consensus set of relevant genes for the clinical management was defined. Next steps of the diagnostic platform enabled us to unify variant reporting according to the role of genes in diagnostic classification, prognostic utility and targeted therapy. Cooperative studies have also allowed us to address technical challenges. Comparison of results between centers facilitates discrimination of polymorphic variants and sequencing artifacts from real AML-related variants [7].
Our results demonstrated that NGS standardization in the context of a cooperative group is possible with a concordance of 96.6% in variant detection (VAF > 5%). Noteworthy, the detection of low VAF (<5%) variants (concordance 41.2%) was also consistent with previous studies, which report that accurate detection of low VAF variants by NGS could be compromised. CV rounds results reflected the improvement of the diagnostic platform performance as the error rate decreased in variants VAF > 5% as a result of the experience gained in NGS studies. Although the second CV round included 5 variants with a mean VAF of 3.3%, in the third CV round we aim to explore the

Discussion
NGS has become the preferred technology to capture the heterogeneous molecular landscape of AML [2][3][4][5]. These approaches have been rapidly adopted as a potential routine tool for molecular diagnosis in AML patients [11]. However, its translation into clinical practice is hampered by specific requirements, such as the necessity of highly skilled laboratories, the increased cost compared to single-gene assays and the expected high turnaround time [7,12]. In this scenario, our results show that an NGS diagnostic platform, established by the PETHEMA cooperative group, was able to provide NGS reliable results with all relevant molecular data currently required for diagnosis and prognosis stratification and therapy choice. In addition, this kind of cooperative study allows for the assessment of the genetic heterogeneity of AML in large cohorts of patients and provides an extensive quality molecular data to evaluate current genomic knowledge in "real-life" AML cohorts.
Due to the rapid NGS implementation into routine molecular diagnosis of AML, NGS workflows and quality specifications are heavily reliant on laboratory specific procedures [13,14]. Therefore, current NGS analyses are characterized by the lack of standardized procedures, the diversity of quality metrics criteria, and the high variability of the assessed genes and variant reporting criteria [15]. To address the need for harmonization procedures, the PETHEMA cooperative group implemented the first Spanish nationwide NGS testing strategy. Regular rounds of cross-validation were planned in order to identify weaknesses and to establish consensus quality metrics criteria for variant reporting among seven central laboratories in AML molecular diagnosis.
Networking for NGS studies allowed us to identify challenges in its clinical implementation [9]. The first CV round revealed the absence of AML key genes in some NGS approaches, and consequently a consensus set of relevant genes for the clinical management was defined. Next steps of the diagnostic platform enabled us to unify variant reporting according to the role of genes in diagnostic classification, prognostic utility and targeted therapy. Cooperative studies have also allowed us to address technical challenges. Comparison of results between centers facilitates discrimination of polymorphic variants and sequencing artifacts from real AML-related variants [7].
Our results demonstrated that NGS standardization in the context of a cooperative group is possible with a concordance of 96.6% in variant detection (VAF > 5%). Noteworthy, the detection of low VAF (<5%) variants (concordance 41.2%) was also consistent with previous studies, which report that accurate detection of low VAF variants by NGS could be compromised. CV rounds results reflected the improvement of the diagnostic platform performance as the error rate decreased in variants VAF > 5% as a result of the experience gained in NGS studies. Although the second CV round included 5 variants with a mean VAF of 3.3%, in the third CV round we aim to explore the performance of NGS studies in very low VAF variants. For this purpose, we included 11 variants with a mean VAF of 1.2%. Data analysis revealed an increase of the ER between both CV rounds due to the very low VAF variants of the third CV round. Consequently, we established VAF 5% cut-off for variant reporting, although according to other specific studies, variants in hotspot regions were reported up to 1% VAF even at borderline technical quality [16][17][18].
Our results reflect that a comprehensive NGS approach is suitable for defining the molecular profile of AML as over 96% of patients of the "real-life" PETHEMA cohort harbored at least one mutated gene. Our study also reflects the genomic heterogeneity that encompasses AML (13,14) as several gene-to-gene interactions but also co-occurrence and mutual exclusivity patterns across functional categories were described. Recently, published studies have highlighted the impact of co-mutations in modulating treatment response [19,20].
Similarly, distinct mutational profiles were detected among disease stages, reflecting the clonal evolution of AML [1,21]. In fact, when compared to the diagnosis' molecular profile we detected fewer mutational changes in resistance to induction therapy (51%) than relapse AML patients (87%) which may suggest different mechanisms underlying both moments [22][23][24][25]. Clonal evolution of the disease could be especially relevant for treatment response [26,27], and in relapsed AML patients, our results supported that molecular testing should be conducted again in order to identify targetable abnormalities such as FLT3 mutations, which showed a stability rate of 43.3% in our cohort [28].
In addition to mutational changes according to disease stages, specific mutational profiles have been associated to patient's clinical characteristics, such as sex and age, with significant impact on prognosis, therapeutic allocation and disease monitoring [29]. In fact, we report that young patients show a mutational profile very similar to female AML patients, which allows them to benefit more from targeted therapies due to higher frequency of FLT3 mutations [30][31][32][33]. In contrast, male and elderly patients show a molecular profile with a higher frequency of adverse risk genetic abnormalities and limited targeted therapy opportunities [34][35][36]. Indeed, some studies suggest that these features should be considered as an essential variable in clinical trials to deepen understanding of the disease and to identify new treatment opportunities [37,38].
Molecular diagnosis of AML has shifted towards a comprehensive mutational study driven by NGS, yielding large amounts of data. Updated diagnostic (WHO and ICC) and prognostic (ELN) classifications [3][4][5] include an increasing number of genetic abnormalities, which may challenge its applicability in many countries and institutions who cannot afford molecular data integration. However, comprehensive molecular analyses, as the recent revision of the genomic classification of AML are needed to understand the clinical relevance of molecular biomarkers to define novel clusters with prognostic value. In our comparison between genomic AML classification and 2022 ELN risk stratification we did not find differences in OS for favorable, intermediate and adverse risk groups. However, for a comprehensive comparison it would be necessary to evaluate the prognosis impact of individual genetic abnormalities in larger cohorts. Mutations in sAML genes are considered in both risk classification proposals with different prognostic impact [2,5]. Although ELN guidelines associate mutations in "myelodisplasia-related genes" as adverse risk regardless of the number of mutations, our results found significant differences in the OS between sAML1 and sAML2 patients, as described by Tazi et al.,(p = 0.018). In this sense, further studies are needed in order to certainly clarify the impact of sAML mutations on prognosis.
Regarding TP53 mutations, in the PETHEMA cohort, 47 patients were included in the mono-allelic group (32.9%), while 96 were included in the multi-hit group (67.1%). Similar results were described in the Tazi et al. cohort: mono-allelic (31.1%) vs. multi-hit: (68.9%). We did not find a distinct outcome between these groups, although the sample size was limited to draw solid conclusions. Our results are concordant with Tazi et al., who concluded that the allelic state of TP53 (mono allelic or multi-hit) provide no further prognostic information in AML.
On the other hand, recent studies reported that instead of biallelic mutations, only in-frame mutations in the bZIP domain of CEBPA should be considered as a favorable prognosis marker [39]. In our cohort, the percentage of patients with CEBPA bZIP mutations is similar to the CEBPAbi subgroup reported by Tazi et al.,(1.2% vs. 1.8%) and in both cases, OS analysis includes them within a favorable risk subgroup. However, CEBPAbi and CEBPA bZIP may be overlapping categories in some patients as several studies suggest that CEBPA double mutated frequently includes mutations in the C-terminal region which allocated bZIP domain [40]. Moreover, we believe that the recent recognition of in-frame mutations in the bZIP domain of CEBPA as a prognostic biomarker may allow for more homogeneous analysis by reducing the variability in the interpretation of the biallelic character of these mutations [41].

Conclusions
This report reflects the efforts of the PETHEMA cooperative scientific group to adopt a nationwide strategy network of reliable and consistent NGS analyses. The establishment of consensus subset genes and the periodic CV rounds have strengthened the diagnostic network by unifying analysis criteria and decreasing reporting variability. Molecular analyses through NGS are routinely performed for AML patients and a comprehensive molecular profile of the disease is offered to clinicians in order to individualize the therapeutic strategy. Moreover, NGS results have provided a large amount of molecular data that has revealed the molecular complexity of the disease. In this cohort, mutual exclusion and mutational co-occurrences among genes and functional categories have been deciphered and a distinct molecular profile between age groups at diagnosis and sex has been detected. Moreover, clinical validation of the current genomic classification in the "real-life" PETHEMA cohort has demonstrated the correlation of the molecular subgroups with clinical prognosis, reflecting the utility of the cooperative NGS studies in routine molecular diagnostics.
Supplementary Materials: The following supporting information can be downloaded at: https:// www.mdpi.com/article/10.3390/cancers15020438/s1, Table S1: Description of the sequencing technology, NGS panel and NGS platform used by each reference laboratory; Table S2: Detailed gene panel composition; Table S3: Genomic groups and integrated risk score based on 16 molecular classes defined by Tazi et al., 2022 in the updated genomic classification of AML; Table S4: Third Cross-validation round results; Table S5: Molecular alterations pertaining to described functional categories. Table S6: Mutational frequency distribution according to moment disease, age and sex; Table S7: Hazard ratio for (A) molecular classes, (B) Risk score in global cohort, (C) Risk score in <65 years-old, (D) Risk score in ≥65 years-old; Figure S1: Third Cross-validation round results; Figure S2: Mutational frequency of PETHEMA consensus genes in the global cohort; Figure S3: Stability rates in diagnosis-relapse samples. Figure S4: Stability rates in diagnosis-refractoriness samples; Figure S5: Overall survival curves according to AML molecular classes; Figure S6: Overall survival curves of TP53 mutated patients according to mono-allelic vs. multi-hit configurations.  Informed Consent Statement: Informed consent was obtained from all subjects involved in the study.

Data Availability Statement:
The data presented in this study are available on request from the corresponding author.

Acknowledgments:
The authors would like to thank María D. García, Carlos Pastorini, Rafael Vianney, Yolanda Mendizabal and Alvaro Fernández for data collection and management.

Conflicts of Interest:
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.