Bioinformatics Analysis of Common Differential Genes of Viral Myocarditis and Dilated Cardiomyopathy: Screening for Potential Pharmacological Compounds

(1) Background: The mechanism of viral myocarditis (VMC) progression to dilated cardiomyopathy (DCM) remains unclear. The aim of this study was to identify key genes in the progression of VMC to DCM, so as to find potential therapeutic drugs and provide insights for future research. (2) Methods: Differential expression analysis of GSE4172 and GSE17800 from the Gene Expression Omnibus (GEO) database was performed using GEO2R, which contained genome-wide analysis of myocardial biopsies from VMC and DCM, respectively. We used the Venn diagram analysis to screen the common differentially expressed genes (DEGs). GO functional enrichment analysis and KEGG pathway analysis were also performed. Then we conducted protein–protein interaction (PPI) networks using STRING and identified hub genes using Cytoscape. Finally, we used cMAP to screen out candidate compounds targeting these hub genes; (3) Results: In total, 2143 DEGs for VMC and 1365 DEGs for DCM were found. Then a total of 191 common DEGs were identified. Biological processes and pathway involved in these genes mainly include GABA−gated chloride ion channel activity and Rap1 signaling pathway. A total of 14 hub genes were identified. PPI network showed these hubs mainly enriched in regulation of WNT signaling pathway and GABA-gated chloride ion channel activity. Subgroup analysis of Severe VMC cohort revealed 10 hub genes which mainly clustered in GABA channel activity, extracellular matrix remodeling and sarcomere dysfunction. Using cMAP, we obtained top 10 potential medications, but only amlodipine is currently viable; (4) Conclusions: Our study finds the hub genes and reveals the important role of GABA-gated chloride ion channel, Rap1 signaling pathway, WNT signaling pathway, extracellular matrix remodeling and sarcomere dysfunction in the progression from VMC to DCM. Amlodipine is a potential viable drug in preventing the progression of VMC to DCM.


Introduction
Viral myocarditis (VMC) is a localized or diffuse inflammatory disease of the myocardium associated with viral infection, which may have an acute or chronic course [1]. Most patients have a good prognosis, but 10%-15% may progress to dilated cardiomyopathy (DCM) [2]. DCM is a myocardial disease characterized by unilateral or bilateral enlargement of heart cavities and decreased myocardial contractility. Its clinical manifestations are mainly progressive congestive heart failure, often accompanied by various types of arrhythmias, which has a serious impact on patients' quality of life and life expectancy [3]. Currently, there is a lack of effective clinical treatment. Patients with dilated cardiomyopathy have only a 55% 5-year survival rate with current heart failure therapy, suggesting an urgent need for targeted therapy strategies [4].
The viruses known to cause myocarditis include coxsackie virus (A,B), echovirus, adenovirus, influenza virus (A,B), cytomegalovirus, and chiropoliovirus [5]. The hypothesis of the progression from VMC to DCM has been proposed for a long time. Kawai postmortem examined 44 DCM patients, 8 of whom had definite histological myocarditis [6]. It was found that neutralizing antibody titers of Coxsackie GROUP B (CB) virus were greater than 1:1024 in approximately 30% of DCM patients [7]. Of 165 serologically confirmed CB viral myocarditis patients with long-term follow-up, 13% developed DCM [8]. Due to the close relationship between VMC and DCM, it is often impossible to separate them completely in clinical practice, so some scholars believe that they are different stages of the same pathological process. However, the actual pathogenesis is not well understood. In recent years, with the great improvement of molecular biology techniques such as nucleic acid hybridization, polymerase chain reaction (PCR), endocardial biopsy, and modern immunohistochemistry, more evidence has been obtained on the mechanism of progression from VMC to DCM. Virus infection, immune response, and apoptosis are considered to be important factors [9][10][11]. However, the key genes in the pathogenesis from VMC to DCM have not been identified so far.
In this study, through bioinformatics, we searched for the key genes in the transition from viral myocarditis to non-familial dilated cardiomyopathy by screening the common differentially expressed genes (DEGs) between VMC and DCM, with the hope of finding potential drugs that could prevent the progression of VMC to DCM to some extent and providing clues for further research.

Acquisition of Data of Gene Expression Profiles
The datasets of gene expression profiles GSE4172 and GSE17800 were acquired by logging in the Gene Expression Omnibus (GEO) database (accessed on 5 June 2022, https: //www.ncbi.nlm.nih.gov/gds/) [12,13]. In GSE4172, there were 8 samples from VMC patients infected with parvovirus B19 and 4 samples from controls. In GSE17800, the number of samples from DCM patients and controls was 40 and 8, respectively. Both of the two chips were from endomyocardial biopsies (EMBs), the detailed information of which is shown in Table 1. We used fold change and corrected p-values to draw Volcano plots. Boxplots were drawn using the R package ggplot2; Expression heatmaps were drawn using the R package heatmap. DEGs were analyzed using the GEO2R tool. We defined p < 0.05 and |log2 fold change (FC)| > 0.58 (namely fold change >1.5 times) as statistically significant DEGs.

Screening of Common Differentially Expressed Genes
The differential genes selected from the two chips underwent Venn diagram analysis and a minority of DEGs shared by both were defined as common DEGs, which will be used for subsequent analysis.

Gene Ontology (GO) Enrichment Analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway Analysis
GO functional enrichment analysis which included molecular function (MF), biological process (BP), and cellular component [CC] analysis and KEGG pathway analysis of the DEGs were performed using the R package clusterProfiler.

Construction of Protein-Protein Interaction (PPI) Networks and Screening of Hub Genes
PPI network analysis was performed for the common DEGs of VMC and DCM by using String (accessed on 6 June 2022, https://www.string-db.org/) to predict the interaction between proteins encoded by DEGs. Confidence interaction score was set at 0.15 for the significant criterion. According to the result of PPI network, we used Cytoscape and the MCODE module to screen the hub genes.

Screening of Potential Pharmacological Targets
The ConnectivityMap (cMAP) (accessed on 6 June 2022, https://clue.io) database contains data on changes in gene expression profiles caused by more than 30,000 smallmolecule compounds acting on multiple cell lines and can be used to compare similarities between drug-induced gene profiles and gene expressions, with connectivity scores ranging from −100 to 100. A positive score suggests that compounds can cause changes similar to those in uploaded genes; a negative score suggests that the compound causes changes opposite to those in the uploaded genes. Therefore, for those up-regulated genes, a negative score suggests that the compound may have a therapeutic effect on the disease. On the contrary, for those down-regulated genes, a positive score suggests that the agent may have a therapeutic effect on the disease. Agents with absolute value of the connectivity scores >80 were considered promising predictors.

Subgroup Analysis
We additionally selected patients with severe myocarditis (sVMC) from GSE4172 for subgroup analysis. We defined sVMC as an EF < 40%. Four patients met the definition of sVMC.

Screening of Differentially Expressed Genes in GSE417
A total of 2143 DEGs were selected from GSE4172, including 1217 up-regulated genes and 926 downregulated genes. A part of the major DEGs is presented in Table 2. Clustering analysis of these DEGs was performed, as shown in the volcano plot ( Figure 1A). Data normalization and cross-comparability were also evaluated. As shown in Figure 1B, the selected samples were centered and the numerical distribution met the standard, indicating that the microarray data were of high quality and had cross-comparability. Based on the p-value, the former 50 DEGs with the lowest p-values were analyzed in the heatmap ( Figure 1C).  Totally, 1365 DEGs were selected from GSE17800, of which 869 were upregulated genes and 496 were downregulated genes. A part of major DEGs is presented in Table 3. Similarly, clustering analysis, data normalization and cross comparability and heatmap of these DEGs were also performed. As a result, the volcano plot is shown in Figure 2A.

Screening of Differentially Expressed Genes in GSE17800
Totally, 1365 DEGs were selected from GSE17800, of which 869 were upregulated genes and 496 were downregulated genes. A part of major DEGs is presented in Table 3. Similarly, clustering analysis, data normalization and cross comparability and heatmap of these DEGs were also performed. As a result, the volcano plot is shown in Figure 2A. The box plot showing a concentrated sample distribution is presented in Figure 2B. The heatmap of the former 50 DEGs with the lowest p-values was shown in Figure 2C.

Screening of Common Differentially Expressed Genes
Using Venn analysis, a total of 191 common DEGs were screened from GSE4172 and GSE17800 ( Figure 3A). Among these genes, 66 were common up-regulated DEGs ( Figure 3B) and 70 were common down-regulated DEGs ( Figure 3C). The former ten common DEGs with the highest average |log2FC| are shown in Table 4.

. Screening of Common Differentially Expressed Genes
Using Venn analysis, a total of 191 common DEGs were screened from GSE4172 GSE17800 ( Figure 3A). Among these genes, 66 were common up-regulated DEGs ( Fig  3B) and 70 were common down-regulated DEGs ( Figure 3C). The former ten comm DEGs with the highest average |log2FC| are shown in Table 4.

PPI Network Analysis and Screening of Hub Genes
To further explore the roles of these common DEGs, we used protein-protein interaction data from STRING to construct a network which contained 109 nodes and 306 edges ( Figure 5). By importing the results of STRING into Cytoscape, we identified 14  The gene ontology enrichment analysis of the common down-regulated DEGs of VMC and DCM were mainly enriched in biological process (BP) and molecular function (MF), especially in "ventricular cardiac muscle tissue morphogenesis" (GO-ID:0055010, p = 1.96 × 10 −4 ) and "ventricular cardiac muscle tissue development" (GO-ID:0003229, p = 2.94 × 10 −4 ). KEGG enrichment results showed that these genes were mainly involved in Rap1 signaling pathway (Table 6, Figure 4B). These results suggest that ventricular myocardial tissue development and Rap1 signaling pathway are inhibited in patients with VMC and DCM.

PPI Network Analysis and Screening of Hub Genes
To further explore the roles of these common DEGs, we used protein-protein interaction data from STRING to construct a network which contained 109 nodes and 306 edges ( Figure 5). By importing the results of STRING into Cytoscape, we identified 14 hub genes: FGFR2, AQP4, LTBP2, BMP6, WISP1, SFRP4, FRZB, ABCG2, HBEGF, UCHL1, VSNL1, GABRB1, GABRA4, and RGS4 (Figure 6), of which FGFR2 and AQP4 were down-regulated DEGs and others were up-regulated DEGs. The proteins in the network were mainly enriched in regulation of WNT signaling pathway (GO-ID:0030111, p = 0.0413), cellular response to stimulus (GO-ID:0051716, p = 0.0262), and GABA-gated chloride ion channel activity (GO-ID:0022851, p = 0.0467). . Figure 5. Results of the protein-protein interaction (PPI) network analysis of common differentially expressed genes (DEG) of VMC and DCM constructed using STRING.

Screening for Potential Pharmacological Targets
Potential targeted drugs of these hub genes were searched in the cMAP database, and the drugs were sorted and screened according to their connectivity score. The top 10 compounds suggested for these common DEGs were HG-6-64-01, selamectin, tetrindole, PD-198306, elvitegravir, amsacrine, GSK-461364, gefitinib, amlodipine, and AZ-628 (Table 7).

Screening for Potential Pharmacological Targets
Potential targeted drugs of these hub genes were searched in the cMAP database, and the drugs were sorted and screened according to their connectivity score. The top 10 compounds suggested for these common DEGs were HG-6-64-01, selamectin, tetrindole, PD-198306, elvitegravir, amsacrine, GSK-461364, gefitinib, amlodipine, and AZ-628 (Table 7).

Subgroup Analysis
From the GSE4172 dataset, four VMC patients with EF < 40% formed the sVMC subgroup. Their differential gene analysis with four normal controls revealed a total of 2121 DEGs. There were 180 common DEGs for sVMC and DCM ( Figure 7A). GO/KEGG analysis showed that these genes were mainly enriched in the collagen-containing extracellular matrix ( Figure 7B). A PPI network containing 143 nodes and 447 edges were constructed. By importing these data into Cytoscape, we identified 10 hub genes (4 upregulated and 6 downregulated): VSNL1, GABRA4, GABRB1, ACTG2, SCN1A, NFASC, AQP4, MYH6, COL1A1 and CSRP2, respectively ( Figure 7C). The proteins encoded by these genes were mainly clustered in the GABA-A receptor complex (GO:1902711) and myosin filaments (GO:0032982).

Subgroup Analysis
From the GSE4172 dataset, four VMC patients with EF < 40% formed the sVMC subgroup. Their differential gene analysis with four normal controls revealed a total of 2121 DEGs. There were 180 common DEGs for sVMC and DCM ( Figure 7A). GO/KEGG analysis showed that these genes were mainly enriched in the collagen-containing extracellular matrix ( Figure 7B). A PPI network containing 143 nodes and 447 edges were constructed. By importing these data into Cytoscape, we identified 10 hub genes (4 upregulated and 6 downregulated): VSNL1, GABRA4, GABRB1, ACTG2, SCN1A, NFASC, AQP4, MYH6, COL1A1 and CSRP2, respectively ( Figure 7C). The proteins encoded by these genes were mainly clustered in the GABA-A receptor complex (GO:1902711) and myosin filaments (GO:0032982).

Discussion
Dilated cardiomyopathy is a heterogeneous group of diseases that can be divided into two categories: primary (familial inheritance) and secondary. Viral myocarditis has traditionally been considered the most common cause of secondary dilated cardiomyo-

Discussion
Dilated cardiomyopathy is a heterogeneous group of diseases that can be divided into two categories: primary (familial inheritance) and secondary. Viral myocarditis has traditionally been considered the most common cause of secondary dilated cardiomyopathy [14,15]. However, the pathogenesis of transition from viral myocarditis to dilated cardiomyopathy is poorly understood, and the key genes and pathways among this pathophysiological process await further discovery. In this study, we first obtained the common DEGs between VMC and DCM based on gene chips from myocardial biopsy. A total of 191 DEGs of VMC and DCM were unearthed.
When compared with the studies that originally generated the reevaluated datasets, the original study that generated the GSE4172 dataset [12] found that cysteine-rich angiogenic inducer 61 (CYR61) and adiponectin (APN) played an important role in inflammatory cardiomyopathy caused by parvovirus B19-infected viral myocarditis. However, in our study, the APN gene was not among the DEGs. Although CYR61 was among the DEGs, it was not found in the final common DEGs, indicating that although CYR61 and APN may play an important role in viral myocarditis, they are not the key genes in the progression of viral myocarditis to dilated cardiomyopathy. The original study that generated the GSE4172 dataset [13] compared the differential genes of responders and non-responders to immunoadsorption with subsequent immunoglobulin G substitution (IA/IgG) treatment in 40 DCM patients with normal controls. Because our study overall compared genetic differences between DCM patients and normal controls, the results of the DEGs differed from this study.
GO enrichment analysis and KEGG pathway analysis showed that the common upregulated GEGs are mainly involved in GABA receptor complex, GABA−gated chloride ion channel activity, xenobiotic transmembrane transporter activity, and GABAergic synapse, indicating that GABA-related pathway functions are enhanced in VMC and DCM patients. This is a novel finding. Currently, it is known that γ-aminobutyric acid (GABA) is widely distributed in the cardiovascular system activity regulatory region of the central nervous system (CNS) and directly regulates cardiac sympathetic nerve activity (C-SNA) [16,17]. On the other hand, there is strong evidence that cardiac sympathetic hyperactivation is associated with DCM [18][19][20]. Therefore, whether GABA dysregulation in VMC leads to the development of DCM through this pathophysiological mechanism is worth exploring.
For the common downregulated DEGs, GO enrichment analysis found these genes were mainly enriched in ventricular cardiac muscle tissue morphogenesis and ventricular cardiac muscle tissue development, which means that ventricular myocardial tissue development are suppressed in patients with VMC and DCM. The KEGG pathway analysis found that these downregulated DEGs were mainly enriched in Rap1 signaling pathway. Rap1 (Ras-proximate-1) is a small GTPase protein which plays a crucial role in mediating cAMP signaling in isolated cardiac tissues and cell lines [21]. Similar to our study, a recent study in found a significant enrichment in Rap1 signaling pathway in VMC mice [22]. Further, basic studies show that suppression of Rap1 impairs cardiac myofibrils and conduction system in zebrafish [23]. Our study suggests that inhibition of the Rap1 signaling pathway in VMC may contribute to the progression of DCM.
Fourteen genes were identified as hub genes in the progression from VMC to DCM. Of them, FGFR2 (Fibroblast growth factor receptor 2) is a tyrosine-protein kinase that plays an essential role in the regulation of cell proliferation, differentiation, migration and apoptosis [24]. AQP4 (Aquaporin-4) forms a water-specific channel and plays an important role in brain water homeostasis and in glymphatic solute transport [25]. LTBP2 (Latent-transforming growth factor beta-binding protein 2) may play an integral structural role in elastic-fiber architectural organization and/or assembly [26]. BMP6 (Bone morphogenetic protein 6) plays an important role in bone formation and iron metabolism [27]. WISP1 (WNT1 inducible signaling pathway protein 1) is a downstream regulator in the Wnt/Frizzled-signaling pathway and is associated with cell survival [28]. SFRP4 (Secreted frizzled-related protein 4) and FRZB (frizzled related protein) are both soluble frizzledrelated proteins (sFRPS) that function as modulators of Wnt signaling through direct interaction with Wnts. [29,30]. ABCG2 (ATP-binding cassette subfamily G member 2) is a broad substrate-specificATP-binding cassette transporter that plays an important role in porphyrin homeostasis and the cellular export of heme [31]. HBEGF (heparin binding EGF-like growth factor) is a Growth factor that may be involved in macrophage-mediated cellular proliferation and fibroblast mitosis [32]. UCHL1 (Ubiquitin carboxyl-terminal hydrolase isozyme L1) involved both in the processing of ubiquitin precursors and of ubiquitinated proteins. [33]. VSNL1 (Visinin-like protein 1) regulates the inhibition of rhodopsin phosphorylation in a calcium-dependent manner [34]. GABRB1 (Gamma-aminobutyric acid receptor subunit beta-1) and GABRA4 (Gamma-aminobutyric acid receptor subunit alpha-4) are both components of receptor for GABA, which functions as a ligand-gated chloride channel, and play an important role in ion transport [35,36]. RGS4 (Regulator of G-protein signaling 4) inhibits signal transduction by increasing the GTPase activity of G protein alpha subunits, which is associated with negative regulation of cardiac muscle cell development [37]. PPI network showed that these hub genes were mainly clustered in regulation of WNT signaling pathway and GABA-gated chloride ion channel activity. Consistent with our study, several studies have reported the downregulation of the Wnt pathway in animal models of DCM. In contrast, activation of the Wnt pathway in different ways helps prevent DCM development [38][39][40]. Our findings suggest that the suppression of the Wnt signaling pathway may play an important role in the progression of VMC to DCM. These hub genes may be potential therapeutic targets for the protection against progression of VMC to DCM.
As shown in Table 7, the top 10 agents predicted by cMAP for these hub genes were more concentrated in the RAF/MEK/MAPK signaling axis. To evaluate the feasibility of these drugs, HG-6-64-01 has not yet entered clinical trials. Selamectin is a potent nematocide but is highly toxic (neurotoxic). Tetrindole, as a selective inhibitor of monoamine oxidase A, is a new type of antidepressant but has a marked negative effect on detoxification. PD198306 is a selective inhibitor of MEK-1/2, which has not been used in clinical practice. Elvitegravir is a human immunodeficiency virus (HIV) integrase inhibitor which is used largely in a four-drug combination with cobicistat, emtricitabine, and tenofovir as therapy of HIV infection, but no cardiovascular applications have been reported. Az-628 is an RAF inhibitor that has not yet entered clinical trials. Amsacrine is an acridine derivative that is active in the treatment of acute leukemias and lymphomas, but no cardiovascular applications have been reported. GSK461364 is an ATP-competitive inhibitor of polo-like kinase 1 (Plk1) that currently has only one completed phase I clinical trial for non-Hodgkin's lymphoma. Gefitinib, an EGFR inhibitor, is widely used in the treatment of non-small cell lung cancer but not cardiovascular disease. Amlodipine, as a calcium channel blocker, is one of the most widely used antihypertensive drugs. Although calcium channel blockers have not been shown to be beneficial for the treatment of patients with heart failure, several studies have shown a favorable effect of amlodipine on nonischemic dilated cardiomyopathy [41,42]. In addition, amlodipine has been reported to have a protective effect on myocardial injury in mice with congestive heart failure caused by viral myocarditis, and this therapeutic effect may be in part resulting from the inhibition of NO overproduction [42]. In conclusion, only amlodipine is currently viable among these potential therapeutic agents. Clinical trials are needed to verify its effectiveness in preventing the progression of viral myocarditis to dilated cardiomyopathy.
Because of evidence of a strong association between severe myocarditis and dilated cardiomyopathy, we created a subgroup of patients with severe myocarditis. Excitingly, the subgroup analysis yielded some novel findings. Firstly, the common DEGs of cVMC and DCM were enriched in the collagen-containing extracellular matrix, which was not found before. Currently, there is evidence that extracellular matrix remodeling plays an important role in dilated cardiomyopathy [43]. Our study suggests that viral myocarditis may develop dilated cardiomyopathy through changes in extracellular matrix related genes. Second, AQP4, VSNL1, GABRA4, and GABRB1 were still among the hub genes obtained by subgroup analysis, indicating that these genes were strongly associated with dilated cardiomyopathy caused by myocarditis. Among the newly discovered hub genes, ACTG2 was up-regulated and the rest were down-regulated. ACTG2 is involved in coding for actin, which is a major component of the sarcomere and plays an important role in cardiac contraction [44]. SCN1A encodes Sodium channel protein type 1 subunit alpha, which mediates the membrane permeability of voltage-dependent sodium channels and plays an important role in the water-sodium balance [45]. NFASC encodes Neurofascin, an ankyrinbinding protein that promotes cell adhesion [46]. Myosin-6 encoded by MYH6 is an actinbased motor protein that plays an important role in myocardial contraction [47]. Col1A1 encodes the collagen alpha-1 (1) chain, which plays an important role in extracellular matrix and fiber formation [48]. CSRP2 encodes cysteine and glycine-rich protein 2, which promotes smooth muscle cell proliferation and dedifferentiation [49]. Thirdly, the PPI network showed that these genes were clustered in the GABA-A receptor complex and myosin filaments. This is consistent with our previous findings of global hub genes clustering at GABA receptors, whereas myosin filament clustering is a novel finding. Combined with the functions of these hub genes, we can infer that severe myocarditis develops dilated cardiomyopathy through genetic changes related to sarcomere dysfunction and extracellular matrix remodeling. However, these need to be verified experimentally.
For dilated cardiomyopathy, the known and recognized mutated genes need to be taken seriously. Currently the major mutations identified in familial dilated cardiomyopathy include TIN, LMNA, MYH7, MYH6, TNNT2, ACTC1, BAG3, DSP, MYBPC3, RBM20, SCN5A, and TPM1. Our differential gene analysis did detect MYH7 and MYH6, and MYH6 was also among the hub genes of the severe myocarditis cohort. However, no other known genes were detected, probably because most of the included patients in the original study producing GSE17800 had sporadic dilated cardiomyopathy.
There are several limitations in this study. Firstly, the data set for myocarditis in this study used parvovirus B19 myocarditis samples, and whether it is representative is debatable. However, this is the largest genome-wide sample of viral myocarditis currently available, and the Venn method of screening for common DEGs has itself largely eliminated confounding factors such as different viral infections. Secondly, there were no RNAseq data, which limits the detection of moderate/weak expression changes. Unfortunately, there are currently no high-throughput datasets from myocardial biopsy samples from human viral myocarditis. The lack of RNAseq data does reduce the sensitivity of this study, but in turn increases the specificity of the detection of differential expression. Thirdly, our study lacks protein expression data and thus the most direct representation of the biological process of VMC to DCM. We will add the content of proteomics in a subsequent study. Fourthly, although we screened out potential therapeutic compounds targeting hub genes, most of them seemed infeasible, and thus further verification by animal experiments is required.

Conclusions
In conclusion, our study is the first to identify hub genes in the progression of VMC to DCM, and reveals the important role of GABA-gated chloride ion channel activity, RAP1 signaling pathway, and WNT signaling pathway. Hub genes of severe VMC and DCM include VSNL1, GABRA4, GABRB1, ACTG2, SCN1A, NFASC, AQP4, MYH6, COL1A1, and CSRP2, which mainly cluster in extracellular matrix remodeling and sarcomere dysfunction. These genes can be considered as genetic markers of early high-risk myocarditis. Amlodipine is a potential viable drug in preventing the progression of VMC to DCM. Further basic studies are needed to verify the role of these genes and pathways in the progression of VMC to DCM.  Data Availability Statement: GSE4172: accessed on 5 June 2022, https://www.ncbi.nlm.nih.gov/ geo/query/acc.cgi?acc=GSE4172. GSE17800: accessed on 5 June 2022, https://www.ncbi.nlm.nih. gov/geo/query/acc.cgi.