Next Article in Journal
Psoralen and Isopsoralen from Psoralea corylifolia Suppress NSCLC by Dual Mechanisms: STAT3 Inhibition and ROS Modulation
Next Article in Special Issue
Integrated Computational Investigation of Cannabis sativa Phytoconstituents as Putative Multi-Target Inhibitors in Skin Cancer: A Molecular Docking, Dynamics, and ADMET Profiling Study
Previous Article in Journal
Quality-by-Design Development of a Clofazimine–Pyrazinamide Dermal Emulsion and Its Diffusion Behavior in Strat-M® and Human Skin
Previous Article in Special Issue
Advancing PROTAC Discovery Through Artificial Intelligence: Opportunities, Challenges, and Future Directions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Integrated Network Biology and Molecular Dynamics Approach Identifies CD44 as a Promising Therapeutic Target in Multiple Sclerosis

by
Mohammad Abdullah Aljasir
Department of Medical Laboratories, College of Applied Medical Sciences, Qassim University, Buraydah 51431, Saudi Arabia
Pharmaceuticals 2026, 19(2), 254; https://doi.org/10.3390/ph19020254
Submission received: 22 December 2025 / Revised: 22 January 2026 / Accepted: 29 January 2026 / Published: 1 February 2026
(This article belongs to the Special Issue Computer-Aided Drug Design and Drug Discovery, 2nd Edition)

Abstract

Background: Multiple sclerosis (MS) is a neuroinflammatory disease characterized by autoimmune-driven inflammation in the central nervous system that damages axons and destroys myelin. It is difficult to diagnose multiple sclerosis due to its complexity, and different people may react differently to different treatments. While the exact cause of multiple sclerosis (MS) and the reasons for its increasing prevalence remain unclear, it is widely believed that a combination of genetic predisposition and environmental influences plays a significant role. Methods: Finding biomarkers for complicated diseases like multiple sclerosis (MS) is made more promising by the emergence of network and system biology technologies. Currently, using tools like Network Analyst to apply network-based gene expression profiling provides a novel approach to finding potential medication targets followed by molecular docking and MD Simulations. Results: There were 1200 genes found to be differentially expressed, with CD44 showing the highest degree score of 15, followed by CDC42 and SNAP25 genes, each with a degree score of 14. To explore the regulatory kinases involved in the protein–protein interaction network, we utilized the X2K online tool. The present study examines the binding interactions and the dynamic stability of four ligands (Obeticholic acid, Chlordiazepoxide, Dextromethorphan, and Hyaluronic acid) in the Hyaluronan binding site of the human CD44 receptor using molecular docking and molecular dynamics (MD) simulations. Docking studies demonstrated a significant docking score for Obeticholic acid (−6.3 kcal/mol), underscoring its medicinal potential. MD simulations conducted over a 100 ns period corroborated these results, revealing negligible structural aberrations (RMSD 1.3 Å) and consistent residue flexibility (RMSF 0.7 Å). Comparative examinations of RMSD, RMSF, Rg, and β-factor indicated that Obeticholic acid exhibited enhanced stability and compactness, establishing it as the most promising choice. Conclusions: This integrated method underscores the significance of dynamic validations for dependable drug design aimed at CD44 receptor-mediated pathways. Future experimental techniques are anticipated to further hone these findings, which further advance our understanding of putative biomarkers in multiple sclerosis (MS).

Graphical Abstract

1. Introduction

Multiple sclerosis (MS) is a chronic, inflammatory, and demyelinating disease of the central nervous system (CNS) that predominantly affects young adults, with a higher prevalence among women [1]. Multiple sclerosis is defined by a complex interaction of genetic susceptibility, environmental influences, and immune system dysfunction, resulting in myelin degradation and axonal injury [2]. The global incidence of multiple sclerosis varies considerably, with estimates between 2 and 150 cases per 100,000 persons yearly, reflecting regional, ethnic, and environmental disparities. The incidence is greatest in temperate climbs, including North America and Northern Europe, and diminished in tropical regions [3]. Multiple sclerosis (MS) presents in several clinical forms, notably relapsing-remitting MS (RRMS), which constitutes over 85% of first diagnoses, alongside progressive variants, including secondary progressive MS (SPMS) and primary progressive MS (PPMS). The variability in illness development highlights the need for individualized strategies in diagnosis and therapy [4]. Genetic study has clarified the hereditary factors involved in the etiology of multiple sclerosis. The human leukocyte antigen (HLA) region, namely the HLA-DRB1*15:01 allele, is significantly correlated with heightened vulnerability to multiple sclerosis (MS) [5]. Genome-wide association studies, or GWASs, have found more than 200 genetic loci linked to multiple sclerosis (MS), involving genes involved in immune modulation, including IL7R, IL2RA, and TNFRSF1A [6]. In addition to these loci, epigenetic processes and gene–environment interactions, such as exposure to Epstein–Barr virus (EBV) and vitamin D insufficiency, further influence the likelihood of developing multiple sclerosis (MS) [7]. At the molecular level, multiple sclerosis pathogenesis is defined by the infiltration of autoreactive T cells, B cells, and macrophages into the central nervous system [8]. This immune response results in the development of inflammatory lesions, demyelination, and axonal degeneration, which are characteristic features of the illness [9]. Genetic mutations affecting immune response regulation and myelin repair are associated with illness severity and progression, providing prospective indicators for prognosis and therapeutic intervention [10]. Therapeutic strategies for multiple sclerosis seek to alter disease progression, diminish relapse frequency, and mitigate symptoms [11]. Initial therapy includes disease-modifying drugs (DMTs) including interferon-β and glatiramer acetate, which regulate immunological function [12]. Recent breakthroughs include monoclonal antibodies that target CD20 (e.g., ocrelizumab) and integrins (e.g., natalizumab), in addition to sphingosine-1-phosphate receptor modulators (e.g., fingolimod). Treatment options for progressive types of MS are restricted; however, high-efficacy disease-modifying therapies and neuroprotective techniques are now being researched [13]. Novel therapeutics concentrate on remyelination and neuroprotection, targeting the fundamental causes of neuronal damage [14]. Experimental methodologies, such as technology for gene editing and transplantation of stem cells, provide potential for the healing of CNS damage and the restoration of function [15]. Moreover, personalized medicine, using genomic and molecular profiling, has the capacity to transform the management of MS according to each patient’s profile [16].
The primary objectives of this work are to identify hub genes and investigate their involvement in the pathophysiology of multiple sclerosis (MS) via the utilization of microarray datasets and analysis using bioinformatics. The project aims to identify crucial regulatory genes involved in MS pathophysiology and examine their connections within biological pathways via the analysis of extensive gene expression data. Furthermore, the research seeks to identify prospective pharmacological targets by merging gene expression data with pharmaceutical databases, therefore paving the way for innovative therapeutic strategies. This study aims to deepen our comprehension of MS and provide a basis for the advancement of precision medicine in its treatment [17].
The ideal hub gene will be used in molecular docking studies, later succeeded by molecular dynamics (MD) simulations to evaluate the effectiveness of the inhibitor against the receptor. These results will enhance future experimental studies as well as the development of novel biomarkers and therapeutic strategies for MS.
Though the analytical methods used in this paper adhere to the well-established bioinformatics workflows, the originality of the work is that the combination of these systems-level layers of differentiation gene expression, protein–protein interaction network topology, upstream kinase-transcription factor regulation, drug-target mapping, and molecular docking with MD simulation are specifically applied to identify CD44-centered molecular signatures in multiple sclerosis. There is no previous research that has combined these studies to the GSE135511 data to emphasize CD44-linked pathways and ligand associations as possible treatment entry points. This combination framework consequently offers fresh biological understanding on top of the individual parts of the analysis.

2. Results

2.1. PCA and Quality Control Analysis

The information on all individuals was obtained from Table S1 following the analysis of expression profiles employing the MS gene datasets (GSE135511), that were derived from the GEO database of NCBI. Log2FC values and p-values obtained from the Network Analyst program were used to discover gene expression differences in both cohorts. DEGs were defined as genes that met the cutoff criteria of p < 0.05. The transcriptome expressions were estimated using the VSN quantile normalization method. Figure 1A,B display the mean values and boxplots before and after normalization. The graphics demonstrate that normalization ensures uniform mean values across specimens and eliminates variations in the data. The Principal Componential Analysis (PCA) was conducted to assess sample clustering in terms of gene expressions profile. The normalization process has resulted in PC1 (6.8%) and PC2 (5.2%) of the total variance distinguishing between the control (pink) and the MS (blue) samples, and there is evident clustering between the two groups (Figure 1B). This contributes to the validity of the DEGs between the disease and control cohorts.

2.2. Differential Expression Genes DEGs and PPI Network Analysis

The healthy and sick groups were analyzed using various statistical tests, such as the Benjamini–Hochberg method, the Student t-test, and the Pearson correlation test, which led to the recognition of 45 DEGs. In these DEGs, we identified 4 significant up-regulated and 41 significant down-regulated genes. The final DEGs were ranked according to the adjusted p-value. PPI network analysis is a crucial tool for understanding the mechanisms of cellular interaction. By examining the differences in protein networks between healthy and diseased states, we gain insights into the condition’s harshness. This computer model includes representations of both genes and proteins as nodes, with relationships between them shown as edges. A network of interactions for the identified DEGs was created by mapping these proteins and genes. Using Cytoscape, the DEGs were visualized, and interactions between them were retrieved from the STRING database with a medium confidence level of 0.400. The network that all the DEGs create is shown in Figure 2. The network has 169 edges and 150 nodes. It was determined that the mean node degree was 2.25 and the mean local clustering index was 0.364, demonstrating the importance of network connections. The volcano plot (Figure 3A) was plotted based on the significance cutoffs of |log2FC| ≥ 1, and FDR < 0.05 distinctly demonstrated significantly up-regulated and down-regulated genes. VST-normalized and row-scaled (Z-score) expression values were used to create the heatmap (Figure 3B) where differential expression patterns of control and MS samples were observed.

2.3. Analysis of KEGG Pathway Enrichment and GO Functional Analysis

FDR was used as a critical measure to identify significant keywords used in an examination of enrichment, which included GO cellular components, biological processes molecular functions, and KEGG pathways. Based on the GO cellular component analysis of the DEGs, the proteins were primarily associated with specific granules, tertiary granules, specific granule lumens, tertiary granule lumens, secretory granule lumens, cytoplasmic vesicle lumens, vesicle lumens, and secretory granule membranes. For molecular purposes, two key functions identified for the proteins were carbohydrate binding and lipopolysaccharide binding. Additionally, the enriched biological processes (BPs) associated with the DEGs included the immune response to microorganisms, the humoral immune response, the antimicrobial humoral immune response mediated by antimicrobial peptides, the defensive response, and erythrocyte formation.
The functional enrichment analysis revealed several highly enriched GO terms, KEGG pathways related to immune response and neuroinflammatory signaling. The statistics of enrichment, such as enrichment scores, gene ratios, and FDR-adjusted p-values, are summarized in Supplementary Tables S2 (GO) and S3 (KEGG).

2.4. Identification of Hub Genes

The hub genes were considered based on node connectivity (degree) to identify the central regulators in the PPI network through the CytoHubba plugin in Cytoscape. The degree values of nodes that are higher than 10 were chosen as hub genes, which contain the largest number of interactions in the network. This cutoff was identified according to the topology of the network, as well as the distribution of node degrees, to obtain only the most generally connected and possibly functionality-important genes that were subjected to downstream analyses. Such kinds of studies have also been established while integrating the computational analysis with experimental approaches to investigate the identified hub genes of the complicated diseases [18,19].
Biologically, hub genes are frequently associated with fundamental proteins that are involved in the coordination of essential cell processes. The nodes that are very closely linked, like CD44, SNAP25, GFAP, PVALB, and HSPB1, are involved in immune modulation, neuronal signaling, and glial cell activity, which are pertinent to disease pathogenesis in the context of MS. For example, CD44, which facilitates leukocyte migration and Hyaluronan signaling, facilitates inflammation and demyelination [20]. On the same note, SNAP 25 is needed to help in the trafficking of synaptic vesicles and the release of neurotransmitters, whereas GFAP and PVALB indicate the activities of the astrocytes and the interneurons, which are disturbed in MS lesions.
The top 10 hub genes plus their connectivity scores are tabulated in Table 1. These genes have the shortest path interactions, as shown in Figure 4, and that further demonstrates their central functions in network integrity and their potential applications to MS pathology.

2.5. Transcription Factor Analysis

This research revealed significant protein kinases and transcription factors associated with the DEGs for their involvement in enhancing the regulated network. A regulatory network was established using transcription factors, kinases, and the transitory proteins to which they were linked throughout the evolution of the regulatory complexity. The first phase was the integration of target genes for transcription factors discovered using assays (ChEA) to forecast the principal transcriptional regulators (TFs), thereafter mapping them into the PPI network. Figure 5A,B illustrate the network and the TFs that were found constructed from PPI analysis. Based on the hypergeometric p-value, the key transcriptional regulators in this context include CREB1, MAX, UBTF, SPI1, USF2, and RCOR1. Furthermore, kinases that are likely regulators of the extended PPI networks were identified and mapped onto the PPI network. Figure 5C,D show the key kinases and their connections in the PPI network. Based on the hypergeometric p-value, the main kinases linked to these DEGs were identified as CSNK2A1, K2ALPHA, CDK1, MAPK14, CDK4, CDC2, HIPK2, GSK3B, MAPK3, and AKT1.

2.6. Hub Genes Survival Investigations

GEPIA survival evaluation was utilized to examine the overall effect on the survival of ten significant hub genes selected from both the down-regulated and up-regulated DEGs. Figure 6 displays the results of the current investigation, with a hazard ratio (HR) of 1.4 and 1.6, respectively. CD44, HSPB1, and SERPINH1 were the only three of the ten genes being studied to exhibit worse overall survival in the group with higher expression.

2.7. CD44 and Drug Interactions

Utilizing the best 10 hub genes, prospective pharmaceuticals were discovered from a drug library database. In this investigation, we conducted a manual search for relevant drugs aimed at each hub gene. The query panels included pharmaceuticals from several classifications, namely experimental, FDA-approved, and nutraceuticals. We conducted a thorough evaluation of these target medications to identify those that communicate with each protein designated as a hub gene. A total of 54 pharmacological agents were found for all the hub genes. We have found 1 drug for GFAP, 3 drugs for SNAP25, 3 drugs for CDC42, no drug for PVALB, 10 drugs for CD44, 5 drugs for HSPB1, no drug for PRKACB, 8 drugs for CDKN1A, and 13 for KCNC2. The identified drugs for all these targets are given in Table 2.

2.8. Molecular Docking

The principal protein discovered via network-based analysis was subjected to a molecular docking method. The crystal structure of the target protein was obtained from the protein database via PyRx. A blind docking methodology was employed to identify and confirm the precise active site using information derived from the literature. Drug compounds, including Hyaluronic acid, Chlordiazepoxide, Dextromethorphan, and Obeticholic acid, anticipated to interact with the target protein, were formulated based on an examination of the DrugBank database. Hyaluronan (HA) may exhibit proinflammatory functions in the realm of central nervous system autoimmunity. It aggregates in demyelinated lesions of multiple sclerosis (MS), facilitates antigen presentation, and amplifies T cell activation and proliferation [21].
The results of molecular docking provide essential insights into the efficacy of these medicines for treating multiple sclerosis (MS). MD simulations elucidate the dynamic interactions between ligands and the Hyaluronan binding domain of the human CD44 receptor. This domain pertains to immunological regulation and cell adhesion mechanisms relevant to multiple sclerosis. Under physiological circumstances, Obeticholic acid (Top1) has consistent associations with residues GLU75 and THR76, suggesting its efficacy in inhibiting CD44-mediated pathways, as indicated by a significant docking score of −6.3 kcal/mol (Figure 7A). The stability of Chlordiazepoxide (Top2) is encouraging, as indicated by its interaction with HIS35, suggesting its ability to modulate immunological responses (Figure 7B). The docking score is −6.0 kcal/mol. Dextromethorphan (Top4), with a weak propensity for the binding of −5.3 kcal/mol, may demonstrate variable stability while retaining the therapeutic promise (Figure 7D). Conversely, Hyaluronic acid (Top3) is expected to demonstrate restricted effectiveness in targeting CD44 because of its low docking score (−6.0 kcal/mol), as shown in (Figure 7C). Through the analysis of hydrogen bond dynamics, RMSD, and interaction energies, molecular dynamics simulations substantiate these findings, offering an in-depth understanding of the therapeutic significance and stability of each medication in the context of multiple sclerosis treatment.
The illustration presents four protein–ligand complexes designated as A, B, C, and D in Figure 8, depicted using ribbon diagrams to emphasize the protein’s secondary structures, including alpha helices (red), beta sheets (cyan), and loops (gray). The ligands are shown in stick format, with binding locations indicated by hydrogen bond donor areas (pink) and acceptor regions (green). In Complex A, the ligand establishes hydrogen bonds with residues GLY40 and ASN25, exhibiting bond lengths of 2.8 Å and 3.1 Å, respectively, signifying robust and stable interactions. Complex B demonstrates the ligand interaction with residues GLU127 and ASP151, exhibiting bond lengths of 3.5 Å and 3.8 Å, indicative of modest binding potential. Complex C exhibits diminished contacts, as the ligand establishes hydrogen bonds with SER50 at 4.2 Å, indicating a weak electrostatic stability. Complex D has moderate binding, characterized by hydrogen bonds between the ligand and residues ARG75 (3.9 Å) and THR100 (3.6 Å), suggesting moderate stability and effectiveness. This visualization highlights the spatial distribution of hydrogen bond donors and acceptors, offering insights into the binding specificity and stability of each ligand inside the protein’s active site (Figure 8).

2.9. MD Simulation

The MD simulation findings for the Top1–Top4 complexes provide essential insights into their behavior and stability during interactions with the receptor protein under dynamic settings. Top1 (Obeticholic acid) is recognized as the optimal complex owing to its exceptional stability parameters, with the lowest average RMSD value of 1.3 Å, minimum deviations (maximum RMSD 2 Å, standard deviation 0.18 Å) (Figure 9A), and a compact structure (average Rg 16.6 Å, standard deviation 0.1 Å) (Figure 9C). The data indicate that Top1 preserves structural integrity and stable binding inside the receptor pocket, essential for successful therapeutic activity. Moreover, the low RMSF values (mean 0.7 Å, highest fluctuation 3 Å at residue 149, lowest 0.3 Å at residue 67) signify restricted residue flexibility, hence reinforcing its stability under physiological circumstances (Figure 9B).
In comparison, Top3 (Hyaluronic acid) exhibits the least favorable outcomes, characterized by the greatest average RMSD value (1.9 Å), maximum deviation (3.5 Å), and standard deviation (0.4 Å), indicating considerable variability and diminished binding stability. The elevated RMSF values (mean 0.9 Å, highest fluctuation 4.3 Å at residue 79, minimum 0.4 Å at residue 29) and inconsistent β-factor (mean 37.1 Å, maximum 505.8 Å at residue 79) underscore its diminished dependability and efficacy as a therapeutic candidate (Figure 9D). Top2 (Chlordiazepoxide) and Top4 (Dextromethorphan) exhibit moderate stability, with average RMSDs of 1.5 Å and 1.6 Å, respectively, alongside balanced residue flexibility (mean RMSF of 0.8 Å for both) and compactness (Rg values of 16.5 Å and 16.6 Å, with a standard deviation of 0.1 Å for both), suggesting reasonable yet less robust interactions in comparison to Top1 (Figure 9B,C). Further, the binding free energy calculation predicted all four complexes as stable complexes, where van der Waals forces dominated the overall energy terms, followed by electrostatic energy. The solvation energy, in contrast, contributed non-favorably to complex formation. The net binding energy of Top1, Top2, Top3, and Top4 was estimated as −66.16 kcal/mol, −56.00 kcal/mol, −56.36 kcal/mol, and −49.59 kcal/mol, respectively (Table 3).
Conducting MD simulations after molecular docking is vital for validation and a comprehensive knowledge of ligand–receptor interactions. Docking scores offer a static assessment of binding compatibility but fail to consider dynamic alterations, including conformational changes, flexibility, or stability in real-world settings. MD simulations provide the temporal monitoring of ligand and receptor dynamics, documenting structural deviations (RMSDs), residue fluctuations (RMSFs), compactness (Rg), and atomic displacement (β-factor). These studies facilitate the identification of the most stable and attractive candidates for therapeutic applications, ensuring that computational predictions correspond with practical and physiological relevance. Validation using MD simulations mitigates the likelihood of drug failure in later experimental phases and establishes dependable foundations for drug design and development.

3. Discussion

Multiple sclerosis (MS) is a chronic inflammatory disorder that results in demyelination inside the central nervous system (CNS), producing localized lesions in both gray and white matter, as well as extensive neurodegeneration throughout the brain [8]. It is an autoimmune disorder and the primary cause of nontraumatic neurological disability in young adults. It is defined by two key pathological features: (1) inflammation leading to demyelination and (2) astroglial proliferation (gliosis) along with neurodegeneration [12]. MS is influenced by multiple genetic factors that slightly elevate susceptibility, along with well-established environmental contributors such as vitamin D levels, ultraviolet B (UVB) exposure, Epstein–Barr virus (EBV) infection, obesity, and smoking [1]. In recent years, advancements in understanding the mechanisms of RRMS have resulted in the developing of several disease-modifying therapies that help reduce relapse severity and frequency by modulating or suppressing the immune system [15]. Managing MS is challenging and requires multiple drugs with different mechanisms of action, depending on the disease type and progression. While no definitive treatment exists for the primary progressive form, some medications can help alleviate symptoms in the secondary progressive stage and effectively regulate disease activity in relapsing-remitting MS [9].
This research focused on discovering new biomarker candidates and associated protein–protein interaction networks relevant to MS pathways. Transcriptome signature profile data, encompassing microarray and Illumina analysis, were collected from the NCBI GEO database. Gray matter tissues of 50 samples were obtained from multiple sclerosis patients. There were 4 significant up-regulated genes and 41 down-regulated genes. The final DEGs were ranked based on the adjusted p-value. The PPI network serves as an essential tool for revealing the structural organization of cellular interactions. GO cellular component (CC) analysis showed that the identified proteins were predominantly localized in the specific granule. The relationship between hub genes and nodes was analyzed and ranked, with red nodes indicating high connectivity. CytoHubba was used to assess node degrees, and those exceeding a degree value of 10 were classified as hub nodes. In addition, significant protein kinases and transcription factors linked to differentially expressed genes (DEGs) were found. The research examined pharmaceuticals from many classifications, including FDA-approved, experimental, and nutraceutical agents. A manual screening process was conducted to identify medications that interacted with each hub gene-associated protein. A total of 54 drugs were identified for each protein. Subsequently, drug molecules were prepared based on predictions from DrugBank database analysis to target the identified proteins. Molecular docking and dynamics simulations were performed, highlighting their significance in elucidating ligand–receptor interactions and assessing the potential therapeutic effectiveness of the examined complexes. The docking data establish a preliminary static framework, elucidating the binding strength and hydrogen bond interactions of the ligands inside the Hyaluronan binding domain of the human CD44 receptor. Among the substances, Obeticholic acid (Top1) had the highest binding affinity, facilitated by significant hydrogen bonds with residues essential for receptor functionality. Conversely, Hyaluronic acid (Top3) had a low binding affinity, indicating its restricted use for medicinal purposes. These findings emphasize the significance of preliminary docking in the identification of lead compounds.
To corroborate and enhance the docking results, molecular dynamics simulations were conducted over a 100 ns period, enabling the observation of real-time ligand–receptor interactions. Top1, aligning with its robust docking score, demonstrated exceptional stability, exhibiting minor structural variations (average RMSD of 1.3 Å) and a compact conformation (average radius of gyration of 16.6 Å). The low RMSF values suggested limited flexibility, demonstrating stable interactions between the ligand and critical residues. Top2 and Top4 exhibited intermediate stability, shown by somewhat elevated RMSD and RMSF values, implying adequate, although less resilient, binding. Conversely, Top3 exhibited considerable variability and instability in both RMSD and RMSF assessments, which corresponded with its diminished docking efficacy. The RMSDs of the complexes can be seen on average to be between 1 and 3 Å, which can be believed as stable dynamics. Some small plot jumps can be seen, which reflect complex loops’ deviation, which are naturally unstable during a dynamic environment, to support proteins’ natural functions [22]. In the RMSFs, the central and N-terminal domains revealed higher local variations compared to the C-terminal domain; however, the net residue fluctuations are well within a stable range. The flexible loops correspond to the unstable fluctuations, which are considerably higher in the central and N-terminal domains. Additionally, the Rg analysis predicted small value changes among the complexes. This illustrates that, as the proteins are the same, the compact nature of the receptor is likely to remain the same, though binding ligands impact the protein compact/relax nature, but in this case, it was observed less.
The amalgamation of the radius of gyration and β-factor investigations yielded more insights into the structural compactness and residue flexibility of the complexes under dynamic settings. Top1 exhibited constant compactness and little atomic displacement, hence reinforcing its potential as a therapeutic candidate. Conversely, Top3 exhibited elevated β-factor values and considerable residue flexibility, underscoring its unstable binding within the receptor pocket.
These findings highlight the need of integrating molecular docking with molecular dynamics simulations to thoroughly assess prospective medicinal medicines. Docking offers a valuable overview of ligand–receptor interactions, whereas molecular dynamics simulations give a dynamic viewpoint, encompassing conformational alterations, binding stability, and residue flexibility. Integrative analyses enhance the trustworthiness of computational predictions, facilitating the identification of stable and effective therapeutic candidates for experimental and clinical investigation. Obeticholic acid (Top1) is identified as the most promising candidate in this study, exhibiting a decent binding profile and consistent interaction dynamics, necessitating additional exploration for therapeutic uses.
Some of the candidate drugs discovered, such as monoclonal antibodies, act on extracellular antigens or cell-surface receptors and may not affect intracellular targets. This limitation clarifies that in silico drug predictions can only be used with caution, given the pharmacological possibility and cellular uptake.
The study is entirely a computational study and no experiments to verify the functional roles of the identified DEGs or the predicted drug candidates were conducted. In silico workflows offer potent funneling insights; however, the inability to determine causal connections between the suggested targets and the pathology of multiple sclerosis is constrained by the lack of in vitro or in vivo validation. Besides that, drug–gene interactions and docking results need additional experimental validation in the form of cell-based assays, molecular characterization, and preclinical models to support their therapeutic potential. Further experiments will be conducted to confirm the biological significance of such computational results through structure and mechanistic methods such as qPCR, Western blotting, and transmission electron microscopy, functional and activity assays (enzyme inhibition assay, cell-based bioassays, flow cytometry, and antimicrobial susceptibility testing), and binding and affinity assays (X-ray crystallography, surface plasmon resonance, isothermal titration calorimetry, etc.).

4. Materials and Methods

The integrative systems biology analytical framework enables the classification of novel molecular gene pathways, and the identification of unique genetic signatures associated with multiple sclerosis (MS) is shown in Figure 10.

4.1. Retrieval Datasets from GEO Database

For this integrated bioinformatics analysis, GSE135511 was retrieved from the NCBI GEO database [23,24]. GEO is an open-access open database that hosts a wide range of high-volume functional genomic information, including microarray studies, next-generation sequencing analyses, and other datasets contributed by research groups worldwide [25]. The GSE135511 database contains 50 samples, of which 10 samples are healthy controls and the other 40 samples have multiple sclerosis; all 50 samples are run on GPL6883 (Illumina HumanRef-8 v3.0 expression beadchip). Gene expression datasets were acquired from GEO datasets. In this study, several analyses were performed, including the identification of sub-networks, sample clustering using principal component analysis (PCA), heatmap visualization of differentially expressed genes (DEGs), construction of the DEG protein–protein interaction (PPI) network, differential expression analysis, identification of hub genes, exploration of drug–protein interaction networks, and molecular docking studies.

4.2. Differentially Expressed Gene (DEG) Identification

Network Analyst tool, which is a unified platform for gene expression data analysis, was used to determine and evaluate the genes exhibiting differential expression between normal controls and multiple sclerosis samples, based on data that has statistical significance criteria [26]. The gene set was organized with rows representing individual gene entries and columns corresponding to the specimens. The experimental results indicated an equitable distribution of samples, consisting of 10 controls and 30 sick specimens. Although no further weightings and corrections were made, the normalization and LIMMA analysis were conducted in the same way in all samples. Furthermore, all gene assay identities were correlated with their corresponding Entrez IDs. Network Analyst 3.0 is a sophisticated web-based visual analytics application designed for comprehensive profiling, meta-analysis, and systems-level examination of gene expression data [27]. Each dataset underwent normalization using VSN, log2, and quantile approaches, with normality confirmed by the analysis of box plots and PCA plots. The LIMMA (Linear Models of Microarray Data) workflow of Network Analyst was used to analyze data through a differential expression analysis. The mention of the Student t-test is the moderated t-test in the LIMMA empirical Bayes framework and is not an independent statistical test. LIMMA calculates moderated t-statistics, log ww2-fold changes, and adjusted p-values on each gene. To control the false discovery rate (FDR), the Benjamini–Hochberg method was used. Only the exploratory assessment of co-expression was conducted by Pearson correlation analysis, which did not play a role in the determination of the final list of 45 DEGs. Differential gene expressions were considered with FDR less than 0.05 and |log2FC| ≥ 1. These thresholds were used to produce volcano plots that could be used to visually depict highly up/down-regulated genes. To visualize the heatmap, the expression matrix was transformed into a normalized expression using the variance-stabilizing transformation (VST) and each gene (each represented as the number of its numeric identifier) within each row was standardized to a Z-score to present the differences in relative expression.

4.3. Enrichment and Expression of Genes Analysis

The Network Analyst program enables extensive gene expression profiling and offers support for generic annotation using KEGG orthologs (KOs) [26]. The modules, including differentially expressed genes and core genes, were uploaded with defined parameters to examine functional and pathway enrichment. These included functional mode, gene IDs, species classification such as Homo sapiens, molecular functions, biological processes, and pertinent ontologies. Metadata from Gene Ontology (GO) annotations and KEGG pathway databases were used to guarantee a thorough study. Gene Ontology (GO) offers a hierarchical framework for the analysis of gene functions and interactions, inferring functional similarity across genes based on common GO annotations and using information content provided from species data or GO structure [28].
The significance of enrichment was assessed by a hypergeometric test, and the enrichment scores, the ratios of the genes, and adjusted p-values were obtained per term. The enrichment outcome of the whole GO and KEGG, along with the enrichment score, the gene ratio, and the FDR-adjusted p-value, can be found in Supplementary Tables S1 and S2. To correct the multiple testing, p-values obtained after GO and KEGG enrichment analysis were corrected by the Benjamini–Hochberg correction to control the false discovery rate (FDR). Pathways whose FDR was less than 0.05 were deemed to be significant. Herein, the Enrichr server was used to investigate the GO and KEGG with a common technique (Benjamini–Hochberg (BH) approach) for adjusting p-values for multiple testing https://maayanlab.cloud/Enrichr/ (accessed on 10 September 2025) [29].

4.4. Protein–Protein Interaction (PPI) Network Analysis

The STRING database was used to create and display the protein–protein interaction (PPI) network for differentially expressed genes (DEGs) [30]. The comprehensive database STRING includes both established and predicted protein–protein interactions. These relationships include both direct (physical) and indirect (functional) linkages, which are deduced by computational techniques and forecasts [31]. The preliminary interactions among proteins were shown in Cytoscape 3.10.2 and then assessed for their synergistic effects. Scores must be below 0.75 (showing moderate confidence) to be considered significant. The network configuration was originally restricted to the differentially expressed genes (DEGs), and the full dataset for both DEGs was uploaded. To ensure accurate visualization of the A zero-order interaction method for protein–protein interactions (PPI), networks were selected for the overall network design, which were subsequently constrained to the original DEGs. Cytoscape includes an integrated/built-in “Network Analyzer” tool designed to analyze relationships and interactions within a network of genes and proteins [32].
A medium confidence level STRING score cutoff of 0.400 was chosen in this paper because of its established sensitivity and specificity balance when used to build exploratory networks. Only experimentally studied and database-verifiable interactions were kept, providing sensitivity check with comparison between the network topology at 0.400 and 0.700 thresholds, which revealed stability in identifying the top hub genes (CD44, SNAP25, GFAP). This shows that the selection of hub genes was not contingent on one arbitrary cutoff.

4.5. Hub Gene Identification

Cytoscape 3.10.2 was utilized to visualize the protein–protein interaction network derived from the STRING database. This software platform is developed for the visualization, analysis, and interpretation of complex biological systems and molecular interaction information [33]. The nodes with significant degree values were discovered and categorized as hub genes within the network, selected according to their degree metrics. CytoHubba, a Cytoscape plugin, was employed to assess the hub genes within the established PPI network. The ten hub genes exhibiting the shortest pathways were identified as novel therapeutic targets for the treatment of multiple sclerosis.

4.6. Correlation of Network of Transcription Factors and Gene Regulators

The correlation between the genes and the regulatory factor was determined using the Expression2Kinase (X2K) program https://maayanlab.cloud/X2K/ (accessed on 10 September 2025). The complete set of DEGs, along with the specified gene markers, was submitted to the X2K software version 1.6.1207 (accessed on 10 September 2025) for analysis. The top ten most important kinases and controls of transcription factors enrichment values were calculated by applying module of kinases and transcription factors (TFs) data. These were constructed from the ChEA69 database and assessed based on Fisher’s exact test p-values. X2K Web predicts upstream regulatory networks based on signatures of differentially expressed genes. Through the integration of transcription factors (TFs), kinase enrichment analysis, and PPI network expansion, it generates inferred networks comprising transcription factors, proteins, and kinases that are likely to regulate the expression of the provided gene list. When the network was developed, it was ensured that the edge nodes were properly linked [34].
The ChEA69 database provided the transcription factor–gene interaction via the X2K platform. The curated kinase datasets included in X2K were used to obtain the kinase–gene interactions. The calculated p-values based on Fisher’s exact test served as the measure of confidence to rank and select the transcription factors and kinases in the regulatory networks.

4.7. Investigation of Targeted Hub Genes Survival

A complete online tool for quick and adaptable gene expression data analysis is the Gene Expression Profiling Interactive Analysis (GEPIA2) platform [35,36]. It provides several analytical characteristics by combining data from the Genotype-Tissue Expression (GTEx) database and The Cancer Genome Atlas (TCGA). GEPIA2 makes it easier to assess which genes show differential expressions, in particular cancer samples, as well as how these genes affect expression patterns and survival outcomes. Through single-gene analysis, the platform was utilized to evaluate the prognostic importance of significant hub genes in the context of (MS), computing the hazard ratio (HR) with a 95% confidence interval and the log-rank p-value.

4.8. Protein and Drug Interaction Network Construction

To investigate drug–gene interactions, we analyzed the top 10 hub genes. Drug data and target information were retrieved from the latest version of DrugBank Online, which was integrated with Network Analyzer for the analysis. DrugBank includes 500,000+ drugs and their targets. In the latest version of DrugBank 6.0, FDA-approved drugs increased by 72%, increasing from 2646 to 4563. Investigational drugs grew by 38%, from 3394 to 6231. Drug–drug interactions saw a 300% surge, increasing from 365,984 to 1,413,412, while drug–food interactions extended by 200%, from 1195 to 2475 [37]. DrugBank feature entries encompassed approved small-molecule drugs, accepted biologics such as proteins, peptides, allergenics, and vaccines, and nutraceuticals and experimental drugs in the detection phase.

4.9. CD44 Molecular Docking with Therapeutics Drugs

The therapeutic target, sourced via the RCSB Protein Data Bank (PDB) having Uniprot ID: P16070, CD44_HUMAN, and PDB ID: 1UUH, had been generated for molecular docking using UCSF Chimera. The docking receptor and ligands were generated using UCSF Chimera, a comprehensive visualization application for structural investigation and energy minimization [38,39,40]. Gasteiger charges were allocated to proteins, and structural limitations were eliminated via 1500 minimization iterations with the ff03.rl force field with a 0.02 step size [41,42]. The reduced protein was confirmed and utilized for docking analyses. The anticipated cavity coordinates were established as center_x = −0.6073, center_y = 1.8726, and center_z = 15.3886, serving as the grid center parameters for the x-, y-, and z-axes in AutoDock Vina version 1.1.2. The grid dimensions were established as size_x = 25, size_y = 25, and size_z = 25 Å, with a spacing of 1.00 Å, guaranteeing exact alignment at the center of the active site. The ligands’ binding affinities were assessed via AutoDock Vina [43]. During the docking process, the most favorable docked compounds were chosen for further analysis utilizing UCSF Chimera [44]. The docking process was validated by considering PDB ID: 4QDI [45], where the co-crystalized ATP molecule was removed from the MurF ligase enzyme and blindly redocked to the protein using the same steps discussed above. The intermolecular conformation of complex after redocking was aligned 3D to the crystalized structure and RMSD was calculated, which was 0.13 Å. The same binding mode of the ATP to the MurF ligase enzyme was reported in both crystalized structures and reported complexes, which implies the accuracy of the docking procedure.

4.10. MD Simulation

The dynamic behavior of the docked complex comprising the highest-ranking chemicals was examined using molecular dynamics (MD) simulations. The simulation procedure was executed using AMBER (Assisted Model Building with Energy Refinement), and subsequent analysis was conducted using its various components [46]. The primary coordinates were obtained from the docked complexes, and topology files were generated using the tLEAP interface of AMBER16 [46]. Likewise, tLEAP was employed for the incorporation of absent atoms. Heavy atom side chains of leucine, glutamine, lysine, and aspartate, together with hydrogen atoms, were incorporated into all residue side chains. System solvation was performed using three-point transferable intermolecular potential (TIP3P) water [47], whereas the force fields utilized for calculations comprise GAFF [48] and ff99SB [49]. The docked protein system was neutralized by the addition of 11 Na+ ions, while the ligand atom types and properties were rectified using the antechamber module 17.3. To eliminate steric conflicts, the docked protein complex underwent minimization through 1500 steepest descent steps and 1000 conjugate gradient steps. The system was heated using Langevin dynamics for 10 ps, followed by equilibration for 100 ps in the canonical (NVT) ensemble [50]. During the production run, the SHAKE algorithm was implemented to constrain the lengths of covalent bonds involving hydrogen atoms [51]. The production run lasted a total of 90 nanoseconds. Periodic boundary conditions were employed for all minimization and simulations, with the system size established at a minimum of 8 Å from the protein, while volume, temperature (300 K), and pressure (1 atm) were held constant [46]. The MD production run was performed in triplicate, each time with a different initial velocity, and average structure plots were derived and interpreted. The produced trajectories were examined using the AMBER PTRAJ module [52]. The analysis encompassed Root Mean Square Deviation (RMSD), Beta-Factor (β-factor), Root Mean Square Fluctuation (RMSF), and radius of gyration (Rg) [53]. These post-MD simulation analyses were performed using the CPPTRAJ module of AMBER and are based on carbon alpha atoms [52]. Xmgrace v 5.0 was utilized for graphical demonstration and analytical assessment of these values [54]. The MMGBSA-based binding free energy of complexes was estimated using the MMPBSA.py module of AMBER [55]. The prmtop files were generated through the ante-MMPBSA.py module. In total, 1000 frames were considered from the simulation trajectories at equal time intervals [56].

5. Conclusions

In summary, our research highlights the significance of evaluating transcription factors and PPI networks as a sound framework in addition to offering significant new insights into potential biomarkers connected to patient prognoses for multiple sclerosis (MS). Our research primarily aims to identify possible biomarkers and provide a comprehensive understanding of the underlying factors influencing the prognosis of patients with MS. With good binding scores, the molecular docking technique verified all four of the drugs as potential CD44 inhibitors. Also, MD simulations demonstrated the remarkable stability of receptor complexes, such as Top1 and Top3, over 100 ns time in comparison to other complexes, because of structural changes in a physiochemical environment. Remarkably, these inhibitors’ stability was barely affected by minor changes to their side chain and loop mobility. The structural stability seen in the docked complexes following simulation validated the possibility of the selected ligands as lead-like compounds. High binding affinities of the complexes across time intervals were demonstrated by the binding energies that were ultimately computed. Considering these findings, our study urges researchers to examine the development of a customized and precisely targeted medication against MS using the powerful insights gathered from this research. The system’s consistent stability throughout the simulations offers a strong foundation for further study and the creation of effective treatments for multiple sclerosis.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ph19020254/s1, Table S1: Sample information from the GEO database’s GSE135511 datasets. Table S2: Shows the gene of interest involved in GO Biological process. Table S3: Presenting the genes with KEGG enrichment pathways.

Funding

The researchers would like to thank the Deanship of Graduate Studies and Scientific Research at Qassim University for its financial support (QU-APC-2026).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Acknowledgments

The researchers would like to thank the Deanship of Graduate Studies and Scientific Research at Qassim University for its financial support (QU-APC-2026).

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Dobson, R.; Giovannoni, G. Multiple Sclerosis—A Review. Eur. J. Neurol. 2019, 26, 27–40. [Google Scholar] [CrossRef] [PubMed]
  2. Leray, E.; Moreau, T.; Fromont, A.; Edan, G. Epidemiology of Multiple Sclerosis. Rev. Neurol. 2016, 172, 3–13. [Google Scholar] [CrossRef]
  3. Howard, J.; Trevick, S.; Younger, D.S. Epidemiology of Multiple Sclerosis. Neurol. Clin. 2016, 34, 919–939. [Google Scholar] [CrossRef]
  4. Ghasemi, N.; Razavi, S.; Nikzad, E. Multiple Sclerosis: Pathogenesis, Symptoms, Diagnoses and Cell-Based Therapy. Cell J. 2017, 19, 1–10. [Google Scholar] [CrossRef] [PubMed]
  5. Michalik, J.; Čierny, D.; Kantorová, E.; Kantárová, D.; Juraj, J.; Párnická, Z.; Kurča, E.; Dobrota, D.; Lehotský, J. The Association of HLA-DRB1 and HLA-DQB1 Alleles with Genetic Susceptibility to Multiple Sclerosis in the Slovak Population. Neurol. Res. 2015, 37, 1060–1067. [Google Scholar] [CrossRef]
  6. Cotsapas, C.; Mitrovic, M. Genome-Wide Association Studies of Multiple Sclerosis. Clin. Transl. Immunol. 2018, 7, e1018. [Google Scholar] [CrossRef] [PubMed]
  7. Olsson, T.; Barcellos, L.F.; Alfredsson, L. Interactions between Genetic, Lifestyle and Environmental Risk Factors for Multiple Sclerosis. Nat. Rev. Neurol. 2017, 13, 25–36. [Google Scholar] [CrossRef] [PubMed]
  8. Lassmann, H. Multiple Sclerosis Pathology. Cold Spring Harb. Perspect. Med. 2018, 8, a028936. [Google Scholar] [CrossRef]
  9. Huang, W.-J.; Chen, W.-W.; Zhang, X. Multiple Sclerosis: Pathology, Diagnosis and Treatments. Exp. Ther. Med. 2017, 13, 3163–3166. [Google Scholar] [CrossRef]
  10. Patil, M.S.; Lin, L.Y.; Marsh-Wakefield, F.; James, E.J.; Palendira, M.; Hawke, S.; Grau, G.E. Multiple Sclerosis: Immune Cells, Histopathology, and Therapeutics. Sclerosis 2024, 2, 117–139. [Google Scholar] [CrossRef]
  11. Faissner, S.; Plemel, J.R.; Gold, R.; Yong, V.W. Progressive Multiple Sclerosis: From Pathophysiology to Therapeutic Strategies. Nat. Rev. Drug Discov. 2019, 18, 905–922. [Google Scholar] [CrossRef] [PubMed]
  12. Hauser, S.L.; Cree, B.A.C. Treatment of Multiple Sclerosis: A Review. Am. J. Med. 2020, 133, 1380–1390.e2. [Google Scholar] [CrossRef] [PubMed]
  13. Gholamzad, M.; Ebtekar, M.; Ardestani, M.S.; Azimi, M.; Mahmodi, Z.; Mousavi, M.J.; Aslani, S. A Comprehensive Review on the Treatment Approaches of Multiple Sclerosis: Currently and in the Future. Inflamm. Res. 2019, 68, 25–38. [Google Scholar] [CrossRef]
  14. Scolding, N.J.; Pasquini, M.; Reingold, S.C.; Cohen, J.A. Cell-based therapeutic strategies for multiple sclerosis. Brain 2017, 140, 2776–2796. [Google Scholar] [CrossRef]
  15. Correale, J.; Gaitán, M.I.; Ysrraelit, M.C.; Fiol, M.P. Progressive Multiple Sclerosis: From Pathogenic Mechanisms to Treatment. Brain 2017, 140, 527–546. [Google Scholar] [CrossRef]
  16. Plemel, J.R.; Liu, W.-Q.; Yong, V.W. Remyelination Therapies: A New Direction and Challenge in Multiple Sclerosis. Nat. Rev. Drug Discov. 2017, 16, 617–634. [Google Scholar] [CrossRef]
  17. Liu, Y.; Wang, Q.; Zhao, Y.; Liu, L.; Hu, J.; Qiao, Y.; Chen, J.; Qin, C. Identification of Novel Drug Targets for Multiple Sclerosis by Integrating Plasma Genetics and Proteomes. Exp. Gerontol. 2024, 194, 112505. [Google Scholar] [CrossRef]
  18. Zhang, S.; Ma, Y.; Luo, X.; Xiao, H.; Cheng, R.; Jiang, A.; Qin, X. Integrated Analysis of Immune Infiltration and Hub Pyroptosis-Related Genes for Multiple Sclerosis. J. Inflamm. Res. 2023, 16, 4043–4059. [Google Scholar] [CrossRef]
  19. Tong, Y.; Chen, R.; Ling, H.; Shen, Y.; Tian, H.; Zeng, L. Network Pharmacology and Molecular Dynamics Simulation Elucidate the Potential Mechanism of Batatasin-III in Bletilla Striata against Ulcerative Colitis. Sci. Rep. 2025, 16, 1032. [Google Scholar] [CrossRef]
  20. Pedrycz-Wieczorska, A.; Chylińska-Wrzos, P.; Grzywacz, A.; Zieliński, E.; Bartosiński, A.; Kędziora-Kornatowska, K.; Lis-Sochocka, M.; Mertowska, P.; Mertowski, S.; Bojarski, K.; et al. CD44 as a Central Integrator of Inflammation and Fibrosis: From Molecular Signaling to Environmental Modulation. Int. J. Mol. Sci. 2025, 26, 8870. [Google Scholar] [CrossRef] [PubMed]
  21. Mueller, A.M.; Yoon, B.H.; Sadiq, S.A. Inhibition of Hyaluronan Synthesis Protects Against Central Nervous System (CNS) Autoimmunity and Increases CXCL12 Expression in the Inflamed CNS. J. Biol. Chem. 2014, 289, 22888–22899. [Google Scholar] [CrossRef] [PubMed]
  22. Corbella, M.; Pinto, G.P.; Kamerlin, S.C.L. Loop Dynamics and the Evolution of Enzyme Activity. Nat. Rev. Chem. 2023, 7, 536–547. [Google Scholar] [CrossRef]
  23. Fitzgerald, K.C.; Kim, K.; Smith, M.D.; Aston, S.A.; Fioravante, N.; Rothman, A.M.; Krieger, S.; Cofield, S.S.; Kimbrough, D.J.; Bhargava, P.; et al. Early Complement Genes Are Associated with Visual System Degeneration in Multiple Sclerosis. Brain 2019, 142, 2722–2736. [Google Scholar] [CrossRef] [PubMed]
  24. Liberati, A.; Altman, D.G.; Tetzlaff, J.; Mulrow, C.; Gøtzsche, P.C.; Ioannidis, J.P.A.; Clarke, M.; Devereaux, P.J.; Kleijnen, J.; Moher, D. The PRISMA Statement for Reporting Systematic Reviews and Meta-Analyses of Studies That Evaluate Health Care Interventions: Explanation and Elaboration. J. Clin. Epidemiol. 2009, 62, e1–e34. [Google Scholar] [CrossRef]
  25. Clough, E.; Barrett, T. The Gene Expression Omnibus Database. Methods Mol. Biol. 2016, 1418, 93–110. [Google Scholar] [CrossRef]
  26. Xia, J.; Gill, E.E.; Hancock, R.E.W. NetworkAnalyst for Statistical, Visual and Network-Based Meta-Analysis of Gene Expression Data. Nat. Protoc. 2015, 10, 823–844. [Google Scholar] [CrossRef]
  27. Zhou, G.; Soufan, O.; Ewald, J.; Hancock, R.E.W.; Basu, N.; Xia, J. NetworkAnalyst 3.0: A Visual Analytics Platform for Comprehensive Gene Expression Profiling and Meta-Analysis. Nucleic Acids Res. 2019, 47, W234–W241. [Google Scholar] [CrossRef]
  28. Merico, D.; Isserlin, R.; Stueker, O.; Emili, A.; Bader, G.D. Enrichment Map: A Network-Based Method for Gene-Set Enrichment Visualization and Interpretation. PLoS ONE 2010, 5, e13984. [Google Scholar] [CrossRef]
  29. Kuleshov, M.V.; Jones, M.R.; Rouillard, A.D.; Fernandez, N.F.; Duan, Q.; Wang, Z.; Koplev, S.; Jenkins, S.L.; Jagodnik, K.M.; Lachmann, A.; et al. Enrichr: A Comprehensive Gene Set Enrichment Analysis Web Server 2016 Update. Nucleic Acids Res. 2016, 44, W90–W97. [Google Scholar] [CrossRef] [PubMed]
  30. Szklarczyk, D.; Franceschini, A.; Kuhn, M.; Simonovic, M.; Roth, A.; Minguez, P.; Doerks, T.; Stark, M.; Muller, J.; Bork, P.; et al. The STRING Database in 2011: Functional Interaction Networks of Proteins, Globally Integrated and Scored. Nucleic Acids Res. 2011, 39, D561–D568. [Google Scholar] [CrossRef]
  31. Szklarczyk, D.; Gable, A.L.; Nastou, K.C.; Lyon, D.; Kirsch, R.; Pyysalo, S.; Doncheva, N.T.; Legeay, M.; Fang, T.; Bork, P.; et al. The STRING Database in 2021: Customizable Protein-Protein Networks, and Functional Characterization of User-Uploaded Gene/Measurement Sets. Nucleic Acids Res. 2021, 49, D605–D612. [Google Scholar] [CrossRef]
  32. Hameed, A.R.; Fakhri Ali, S.; Almanaa, T.N.; Aljasir, M.A.; Alruwetei, A.M.; Sanami, S.; Ayaz, H.; Ali, I.; Ahmad, F.; Ahmad, S. Exploring the Hub Genes and Potential Drugs Involved in Fanconi Anemia Using Microarray Datasets and Bioinformatics Analysis. J. Biomol. Struct. Dyn. 2023, 43, 3297–3310. [Google Scholar] [CrossRef]
  33. Su, G.; Morris, J.H.; Demchak, B.; Bader, G.D. Biological Network Exploration with Cytoscape 3. Curr. Protoc. Bioinformatics 2014, 47, 8.13.1–8.13.24. [Google Scholar] [CrossRef]
  34. Clarke, D.J.B.; Kuleshov, M.V.; Schilder, B.M.; Torre, D.; Duffy, M.E.; Keenan, A.B.; Lachmann, A.; Feldmann, A.S.; Gundersen, G.W.; Silverstein, M.C.; et al. eXpression2Kinases (X2K) Web: Linking Expression Signatures to Upstream Cell Signaling Networks. Nucleic Acids Res. 2018, 46, W171–W179. [Google Scholar] [CrossRef] [PubMed]
  35. Liu, C.; Chen, Y.; Deng, Y.; Dong, Y.; Jiang, J.; Chen, S.; Kang, W.; Deng, J.; Sun, H. Survival-Based Bioinformatics Analysis to Identify Hub Genes and Key Pathways in Non-Small Cell Lung Cancer. Transl. Cancer Res. 2019, 8, 1188–1198. [Google Scholar] [CrossRef] [PubMed]
  36. Tang, Z.; Kang, B.; Li, C.; Chen, T.; Zhang, Z. GEPIA2: An Enhanced Web Server for Large-Scale Expression Profiling and Interactive Analysis. Nucleic Acids Res. 2019, 47, W556–W560. [Google Scholar] [CrossRef]
  37. Knox, C.; Wilson, M.; Klinger, C.M.; Franklin, M.; Oler, E.; Wilson, A.; Pon, A.; Cox, J.; Chin, N.E.L.; Strawbridge, S.A.; et al. DrugBank 6.0: The DrugBank Knowledgebase for 2024. Nucleic Acids Res. 2024, 52, D1265–D1275. [Google Scholar] [CrossRef]
  38. Ahmad, F.; Azam, S.S. Role of Ring Positioning and Preferential Occupation of Ligand Obtained through Molecular Dynamics Simulation of Peptidoglycan Associated Lipoprotein (Pal). J. Mol. Graph. Model. 2020, 98, 107585. [Google Scholar] [CrossRef]
  39. Kaliappan, S.; Bombay, I.I.T. UCSF Chimera-Superimposing and Morphing; UCSF: San Francisco, CA, USA, 2016. [Google Scholar]
  40. Mavra, A.; Petrou, C.C.; Vlasiou, M.C. Ligand and Structure-Based Virtual Screening in Combination, to Evaluate Small Organic Molecules as Inhibitors for the XIAP Anti-Apoptotic Protein: The Xanthohumol Hypothesis. Molecules 2022, 27, 4825. [Google Scholar] [CrossRef] [PubMed]
  41. Ahmad, F.; Azam, S.S. From Pan-Genome to Protein Dynamics: A Computational Hierarchical Quest to Identify Drug Target in Multi-Drug Resistant Burkholderia Cepacia. J. Mol. Liq. 2020, 317, 113904. [Google Scholar] [CrossRef]
  42. Ahmad, S.; Shaker, B.; Ahmad, F.; Raza, S.; Azam, S.S. Moleculer Dynamics Simulaiton Revealed Reciever Domain of Acinetobacter Baumannii BfmR Enzyme as the Hot Spot for Future Antibiotics Designing. J. Biomol. Struct. Dyn. 2018, 37, 2897–2912. [Google Scholar] [CrossRef] [PubMed]
  43. Dallakyan, S.; Olson, A.J. Small-Molecule Library Screening by Docking with PyRx. In Chemical Biology; Springer: Berlin/Heidelberg, Germany, 2015; pp. 243–250. [Google Scholar]
  44. Huang, C.C.; Meng, E.C.; Morris, J.H.; Pettersen, E.F.; Ferrin, T.E. Enhancing UCSF Chimera through Web Services. Nucleic Acids Res. 2014, 42, W478–W484. [Google Scholar] [CrossRef]
  45. Cha, S.-S.; An, Y.J.; Jeong, C.-S.; Yu, J.H.; Chung, K.M. ATP-Binding Mode Including a Carbamoylated Lysine and Two Mg(2+) Ions, and Substrate-Binding Mode in Acinetobacter Baumannii MurF. Biochem. Biophys. Res. Commun. 2014, 450, 1045–1050. [Google Scholar] [CrossRef] [PubMed]
  46. Case, D.A.; Betz, R.M.; Cerutti, D.S.; Cheatham, T.; Darden, T.; Duke, R.E.; Giese, T.J.; Gohlke, H.; Götz, A.W.; Homeyer, N.; et al. Amber 16, University of California, San Francisco; University of California: San Francisco, CA, USA, 2016. [Google Scholar] [CrossRef]
  47. Brice, A.R.; Dominy, B.N. Examining Electrostatic Influences on Base-Flipping: A Comparison of TIP3P and GB Solvent Models. Commun. Comput. Phys. 2013, 13, 223–237. [Google Scholar] [CrossRef]
  48. Sprenger, K.G.; Jaeger, V.W.; Pfaendtner, J. The General AMBER Force Field (GAFF) Can Accurately Predict Thermodynamic and Transport Properties of Many Ionic Liquids. J. Phys. Chem. B 2015, 119, 5882–5895. [Google Scholar] [CrossRef]
  49. Tian, C.; Kasavajhala, K.; Belfon, K.A.A.; Raguette, L.; Huang, H.; Migues, A.N.; Bickel, J.; Wang, Y.; Pincay, J.; Wu, Q.; et al. ff19SB: Amino-Acid-Specific Protein Backbone Parameters Trained against Quantum Mechanics Energy Surfaces in Solution. J. Chem. Theory Comput. 2020, 16, 528–552. [Google Scholar] [CrossRef]
  50. Pastor, R.W.; Brooks, B.R.; Szabo, A. An Analysis of the Accuracy of Langevin and Molecular Dynamics Algorithms. Mol. Phys. 1988, 65, 1409–1419. [Google Scholar] [CrossRef]
  51. Kräutler, V.; van Gunsteren, W.F.; Hünenberger, P.H. A Fast SHAKE Algorithm to Solve Distance Constraint Equations for Small Molecules in Molecular Dynamics Simulations. J. Comput. Chem. 2001, 22, 501–508. [Google Scholar] [CrossRef]
  52. Roe, D.R.; Cheatham, T.E. PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. J. Chem. Theory Comput. 2013, 9, 3084–3095. [Google Scholar] [CrossRef]
  53. Soremekun, C.; Jjingo, D.; Kateete, D.; Nash, O.; Grallert, H.; Peters, A.; Chikowore, T.; Batini, C.; Soremekun, O.; Fatumo, S. Structural Insights into Conformational Stability of Both Wild-Type and Mutant Insulin Receptor Gene. Next Res. 2024, 1, 100041. [Google Scholar] [CrossRef]
  54. 385 PDFs|Review Articles in XMGRACE. Available online: https://www.researchgate.net/topic/Xmgrace/publications (accessed on 31 December 2023).
  55. Miller, B.R.; McGee, T.D.; Swails, J.M.; Homeyer, N.; Gohlke, H.; Roitberg, A.E. MMPBSA.Py: An Efficient Program for End-State Free Energy Calculations. J. Chem. Theory Comput. 2012, 8, 3314–3321. [Google Scholar] [CrossRef] [PubMed]
  56. Alqahtani, S.M.; Altharawi, A.; Alabbas, A.; Ahmad, F.; Ayaz, H.; Nawaz, A.; Rahman, S.; Alossaimi, M.A. System Biology Approach to Identify the Novel Biomarkers in Glioblastoma Multiforme Tumors by Using Computational Analysis. Front. Pharmacol. 2024, 15, 1364138. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Both boxplots and expression plots were used for the sample visualization, which is shown in the image both before (A) and after (B) the normalization procedure. Variance stabilization normalization (VSN) and quantile normalization were used to adjust the mean and lower noise.
Figure 1. Both boxplots and expression plots were used for the sample visualization, which is shown in the image both before (A) and after (B) the normalization procedure. Variance stabilization normalization (VSN) and quantile normalization were used to adjust the mean and lower noise.
Pharmaceuticals 19 00254 g001
Figure 2. Depicts the network formed by all the DEGs retrieved from the STRING database.
Figure 2. Depicts the network formed by all the DEGs retrieved from the STRING database.
Pharmaceuticals 19 00254 g002
Figure 3. (A) A volcano plot displays the intensity distribution of each gene, with red highlighting up-regulated genes, blue highlighting down-regulated genes, and gray highlighting non-significant genes, based on |log2 fold change| ≥ 1 and FDR < 0.05. (B) Heatmap showing the patterns of the differentially expressed genes in different samples. The expression values were normalized and row-scaled where the low expression was expressed in blue and the high expression in red.
Figure 3. (A) A volcano plot displays the intensity distribution of each gene, with red highlighting up-regulated genes, blue highlighting down-regulated genes, and gray highlighting non-significant genes, based on |log2 fold change| ≥ 1 and FDR < 0.05. (B) Heatmap showing the patterns of the differentially expressed genes in different samples. The expression values were normalized and row-scaled where the low expression was expressed in blue and the high expression in red.
Pharmaceuticals 19 00254 g003
Figure 4. The most direct route interactions for all identified hub genes are displayed, with node colors representing their degree scores.
Figure 4. The most direct route interactions for all identified hub genes are displayed, with node colors representing their degree scores.
Pharmaceuticals 19 00254 g004
Figure 5. (A) The anticipated transcription factors for the DEGs list are displayed, with a bar chart showing hypergeometric scores and p-values for each identified transcription factor. (B) A ball-and-stick diagram illustrates a sub-network of interconnected transcription factors and their interacting proteins. Pink nodes represent transcription factors, while gray nodes indicate the associated proteins. The size of each node corresponds to its degree within the network. (C) A bar graph presents a ranking of the most anticipated kinases, based on their scores derived from hypergeometric p-values. (D) Illustrating the transcription factors.
Figure 5. (A) The anticipated transcription factors for the DEGs list are displayed, with a bar chart showing hypergeometric scores and p-values for each identified transcription factor. (B) A ball-and-stick diagram illustrates a sub-network of interconnected transcription factors and their interacting proteins. Pink nodes represent transcription factors, while gray nodes indicate the associated proteins. The size of each node corresponds to its degree within the network. (C) A bar graph presents a ranking of the most anticipated kinases, based on their scores derived from hypergeometric p-values. (D) Illustrating the transcription factors.
Pharmaceuticals 19 00254 g005
Figure 6. Kaplan–Meier analysis was used to examine the hub genes expressed in patients with multiple sclerosis (MS) to assess overall survival. Survival curves were produced using the Gene-Expression Profiling Interactive Analysis (GEPIA) tool, with statistical significance set at p ≤ 0.01. whereas, Differential survival outcomes over time are depicted by the dotted lines that separate the groups with high and low gene expression.
Figure 6. Kaplan–Meier analysis was used to examine the hub genes expressed in patients with multiple sclerosis (MS) to assess overall survival. Survival curves were produced using the Gene-Expression Profiling Interactive Analysis (GEPIA) tool, with statistical significance set at p ≤ 0.01. whereas, Differential survival outcomes over time are depicted by the dotted lines that separate the groups with high and low gene expression.
Pharmaceuticals 19 00254 g006
Figure 7. Binding interactions of ligands with the Hyaluronan binding domain of the human CD44 receptor: a comparative analysis of Obeticholic acid (Top1), (A), Chlordiazepoxide (Top2), (B), Hyaluronic acid shows significant hydrogen bond interactions with active residues and interactions between THR76 and glucose moieties, suggesting a possible glycosylation site or substrate binding interface vital for protein function (Top3), (C), and Dextromethorphan (Top4), (D) via docking scores and hydrogen bond dynamics.
Figure 7. Binding interactions of ligands with the Hyaluronan binding domain of the human CD44 receptor: a comparative analysis of Obeticholic acid (Top1), (A), Chlordiazepoxide (Top2), (B), Hyaluronic acid shows significant hydrogen bond interactions with active residues and interactions between THR76 and glucose moieties, suggesting a possible glycosylation site or substrate binding interface vital for protein function (Top3), (C), and Dextromethorphan (Top4), (D) via docking scores and hydrogen bond dynamics.
Pharmaceuticals 19 00254 g007
Figure 8. The binding cavity with interactions illustrates hydrogen bond acceptor and donor regions: (A) denotes Top1 (Obeticholic acid), (B) denotes Top2 (Chlordiazepoxide), (C) denotes Top3 (Hyaluronic acid), and (D) denotes Top4 (Dextromethorphan), emphasizing the spatial arrangement of donor and acceptor clouds within the Hyaluronan binding domain of the human CD44 receptor. Herein, A distinct binding pocket made of α-helices (red) and β-sheets (blue) holds the ligand. A stable and functionally significant binding mode is supported by these additional structural components, which also allow conserved hydrogen-bond and electrostatic interactions with important active-site residues.
Figure 8. The binding cavity with interactions illustrates hydrogen bond acceptor and donor regions: (A) denotes Top1 (Obeticholic acid), (B) denotes Top2 (Chlordiazepoxide), (C) denotes Top3 (Hyaluronic acid), and (D) denotes Top4 (Dextromethorphan), emphasizing the spatial arrangement of donor and acceptor clouds within the Hyaluronan binding domain of the human CD44 receptor. Herein, A distinct binding pocket made of α-helices (red) and β-sheets (blue) holds the ligand. A stable and functionally significant binding mode is supported by these additional structural components, which also allow conserved hydrogen-bond and electrostatic interactions with important active-site residues.
Pharmaceuticals 19 00254 g008
Figure 9. MD simulation results for Top1–Top4 complexes: Panels (A,B) present RMSF values, indicating residue flexibility; Panel (C) illustrates the radius of gyration, demonstrating structural compactness; and Panel (D) depicts β-factor analysis, highlighting atomic displacement and dynamic behavior over a 100 ns duration.
Figure 9. MD simulation results for Top1–Top4 complexes: Panels (A,B) present RMSF values, indicating residue flexibility; Panel (C) illustrates the radius of gyration, demonstrating structural compactness; and Panel (D) depicts β-factor analysis, highlighting atomic displacement and dynamic behavior over a 100 ns duration.
Pharmaceuticals 19 00254 g009
Figure 10. A flowchart showing the thorough bioinformatics analysis method used in this study. The DEGs were identified and subjected to mini up-regulated DEG and down-regulated DEG analyses. The construction of the PPI network for the down-regulated DEGs was performed to ensure the lack of a high-confidence cellular network.
Figure 10. A flowchart showing the thorough bioinformatics analysis method used in this study. The DEGs were identified and subjected to mini up-regulated DEG and down-regulated DEG analyses. The construction of the PPI network for the down-regulated DEGs was performed to ensure the lack of a high-confidence cellular network.
Pharmaceuticals 19 00254 g010
Table 1. The top 10 genes from the DEGs network analysis were graded based on data from the STRING interaction database.
Table 1. The top 10 genes from the DEGs network analysis were graded based on data from the STRING interaction database.
RankGene SymbolScoreAdj. p-ValueLog FC
1CD44150.002061.368089
2SNAP25140.0083515−1.2745
2GFAP140.0021351−1.23569
4PVALB130.0050701−2.02376
5CDC4290.03157181.485748
5HSPB190.00634391.111093
5PRKACB90.1050604−0.45806
5CDKN1A90.03694241.079374
9KCNC280.0059254−1.31174
9SERPINH180.00532811.410022
Table 2. Identification of potential drugs targeting specific proteins was conducted through analysis of the DrugBank database.
Table 2. Identification of potential drugs targeting specific proteins was conducted through analysis of the DrugBank database.
Sr. No.ProteinUniprot IDDrug NameDrugBank ID
1GFAPP14136Isopropyl alcoholDB02325
2SNAP25P60880LetibotulinumtoxinADB16820
P11473CalcifediolDB00146
P12319OmalizumabDB00043
3CDC42Q96RI1Obeticholic acidDB05990
P10912SomatrogonDB14960
P42345EverolimusDB01590
4PVALB...
5CD44P20701EfalizumabDB00095
O75330Hyaluronic acidDB08818
Q8N1C3ChlordiazepoxideDB00475
P04234MuromonabDB00075
P11836OfatumumabDB06650
P11836GlofitamabDB16371
Q8TCU5DextromethorphanDB00514
Q9NZD1TalquetamabDB16678
P15391InebilizumabDB12530
Q96RI1Obeticholic acidDB05990
6HSPB1P35367ClemastineDB00283
P02766Iodide I-123DB09420
P05181tioconazoleDB01007
P11473Dihydrotachysterol DB01070
P41143TapentadolDB06204
7PRKACB...
8CDKN1AP353637ClemastineDB00283
P11473DihydrotachysterolDB01070
P23975IobenguaneDB06704
P61073MotixafortideDB14939
P16850TioconazoleDB01007
P08185TriamcinoloneDB00620
P35367AcrivastineDB09488
P35367AzelastineDB00972
9KCNC2P25021CimetidineDB00501
P41143TapentadolDB06204
P04150ClocortoloneDB00838
P04035RosuvastatinDB01098
P9Y5N1BetahistineDB06698
P42338Copanlisib DB12483
Q99456GriseofulvinDB00400
P30968DegarelixDB06699
P07550SalmeterolDB00938
P12821PerindoprilDB00790
Q9UKR5ErgosterolDB04038
Q14524EncainideDB01228
P23075LevomilnacipranDB08918
10SERPINH1P35367ClemastineDB00283
P16422HypromelloseDB11075
Q16602ZavegepantDB15688
Q5T9C2TamoxifenDB00675
P23975VenlafaxineDB00285
P08185AlclometasoneDB00240
P31639CanagliflozinDB08907
P23975GuanethidineDB01170
P03372EstramustineDB01196
Table 3. MMGBSA-based binding free energies. The energies were estimated based on 1000 frames collected at equal intervals of the MD trajectories.
Table 3. MMGBSA-based binding free energies. The energies were estimated based on 1000 frames collected at equal intervals of the MD trajectories.
TechniqueEnergy SectionTop1Top2Top3Top4
MMGBSAVan der Waals Energy (kcal/mol)−58.20 (±3.41)−52.31 (±2.84)−51.00 (±1.36)−45.89 (±3.68)
Electrostatic Energy (kcal/mol)−14.49 (±2.01)−12.71 (±2.01)−13.04 (±1.52)−11.05 (±1.02)
Polar Salvation Energy (SE) (kcal/mol)10.19 (±1.09)11.50 (±1.36)10.11 (±1.22)9.43 (±0.69)
Non-Polar SE (kcal/mol)−3.66 (±0.59)−2.48 (±0.58)−2.46 (±0.87)−2.08 (±0.67)
Gas Phase Energy (kcal/mol)−72.69 (±4.69)−65.02 (±3.67)−64.01 (±4.05)−56.94 (±3.12)
Total (kcal/mol)−66.16 (±4.63)−56.00 (±3.20)−56.36 (±3.09)−49.59 (±2.84)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Aljasir, M.A. An Integrated Network Biology and Molecular Dynamics Approach Identifies CD44 as a Promising Therapeutic Target in Multiple Sclerosis. Pharmaceuticals 2026, 19, 254. https://doi.org/10.3390/ph19020254

AMA Style

Aljasir MA. An Integrated Network Biology and Molecular Dynamics Approach Identifies CD44 as a Promising Therapeutic Target in Multiple Sclerosis. Pharmaceuticals. 2026; 19(2):254. https://doi.org/10.3390/ph19020254

Chicago/Turabian Style

Aljasir, Mohammad Abdullah. 2026. "An Integrated Network Biology and Molecular Dynamics Approach Identifies CD44 as a Promising Therapeutic Target in Multiple Sclerosis" Pharmaceuticals 19, no. 2: 254. https://doi.org/10.3390/ph19020254

APA Style

Aljasir, M. A. (2026). An Integrated Network Biology and Molecular Dynamics Approach Identifies CD44 as a Promising Therapeutic Target in Multiple Sclerosis. Pharmaceuticals, 19(2), 254. https://doi.org/10.3390/ph19020254

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop