Aspartame is a widely used artificial sweetener, but its possible relationship with rheumatoid arthritis (RA) remains insufficiently understood. This study aimed to explore, rather than prove, potential molecular links between aspartame-related targets and RA-associated gene networks. Three public RA transcriptomic datasets (GSE55235, GSE55457,
[...] Read more.
Aspartame is a widely used artificial sweetener, but its possible relationship with rheumatoid arthritis (RA) remains insufficiently understood. This study aimed to explore, rather than prove, potential molecular links between aspartame-related targets and RA-associated gene networks. Three public RA transcriptomic datasets (GSE55235, GSE55457, and GSE77298) from the Gene Expression Omnibus (GEO) database were integrated as discovery/training data. Because these datasets included different tissue origins, batch correction was used to reduce dataset-level technical variation, whereas tissue-origin-related biological variation was not assumed to be fully removable. After differential expression analysis, RA-associated differentially expressed genes (DEGs) were identified. The single-cell dataset GSE200815 was used for cell annotation and cellular expression visualization; because its comparator group consists of psoriatic arthritis (PsA) samples rather than healthy controls, single-cell results were interpreted as RA-vs-PsA observations and were not treated as disease-versus-healthy-control evidence. Potential targets of aspartame were retrieved from ChEMBL, SwissTargetPrediction, and the Similarity Ensemble Approach (SEA), and were intersected with RA-related DEGs to construct an aspartame-gene-RA regulatory network. Diagnostic models were developed using 113 machine-learning algorithm combinations to determine an optimal multigene model and its core genes.
HLA-DRB1 was selected for exploratory scTenifoldKnk-based virtual knockout mainly because it was included in the optimal model and has a well-established role in RA immunogenetics; the single-cell analysis was used only to describe cellular distribution in the RA/PsA dataset. Molecular docking was then used to evaluate the possible interaction between aspartame and
HLA-DRB1. Forty-four intersected genes linked the predicted aspartame targets with RA DEGs. The random forest plus partial least-squares generalized linear model (RF + plsRglm) identified 16 core genes. Network-level interpretation indicated that these genes were distributed across immune/antigen-processing, inflammatory-signaling, protease/extracellular-matrix-remodeling, adhesion, metabolic, and proliferation-related modules; therefore,
HLA-DRB1 was treated as a prioritized immune-module candidate rather than as the sole driver of the network. Following virtual knockout of
HLA-DRB1, affected genes were enriched in extracellular matrix organization, extracellular structure organization, extracellular matrix, collagen trimer, extracellular matrix structural constituent, and collagen binding. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways included integrin signaling, focal adhesion, proteoglycans in cancer, cytoskeleton in muscle, and phosphoinositide 3-kinase/protein kinase B (PI3K/AKT) signaling. Molecular docking showed a minimum binding energy of −6.7 kcal/mol, which was more negative than the preset stability criterion of −5.0 kcal/mol, and the docking pose suggested contacts around ARG-146. This integrative analysis suggests a hypothesis-generating association between aspartame-related predicted targets and RA-relevant molecular networks involving
HLA-DRB1 and other core genes. The findings do not establish causality and require experimental, epidemiological, biophysical, and tissue-stratified validation before any causal or clinical inference can be made.
Full article