Metabolomics-Based Machine Learning Diagnostics of Post-Acute Sequelae of SARS-CoV-2 Infection
Abstract
1. Introduction
2. Methods
2.1. Approach Overview
2.2. Data Collection and Preprocessing
2.3. Molecular Descriptor Calculation
2.4. Machine Learning Model Training
2.5. Validation on Independent Testing Sets and Other Diseases
2.6. Threshold Determination
3. Results
4. Discussion
5. Limitations
6. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| API | application programming interface |
| AUC | area under curve |
| CACTUS | CADD Group Chemoinformatic Tools and User Services |
| CADD | computer-aided design and drafting |
| FM | fibromyalgia |
| HMDB | Human Metabolite Data Base |
| IBS | irritable bowel syndrome |
| LMT | logistic model tree |
| ME/CFS | myalgic encephalomyelitis/chronic fatigue syndrome |
| ML | machine learning |
| MLP | multi-layer perceptron |
| NCI | National Cancer Institute |
| PASC | Post-Acute Sequelae of COVID-19 |
| PCC | post-COVID-19 condition |
| PCS | post-COVID-19 syndrome |
| POTS | postural orthostatic tachycardia syndrome |
| PR | precision–recall |
| RECOVER | Researching COVID to Enhance Recovery |
| ROC | receiver operating characteristic |
| SGD | stochastic gradient descent |
| SMILES | Simplified Molecular Input Line Entry System |
| SMO | sequential minimal optimization |
| SVM | support vector machine |
| WEKA | Waikato Environment for Knowledge Analysis |
| WHO | World Health Organization |
References
- Global COVID-19 Overview|WHO COVID-19 Dashboard. Available online: https://data.who.int/dashboards/covid19/summary (accessed on 5 November 2025).
- Post COVID-19 Condition (Long COVID). Available online: https://www.who.int/europe/news-room/fact-sheets/item/post-COVID-19-condition (accessed on 5 November 2025).
- Nalbandian, A.; Sehgal, K.; Gupta, A.; Madhavan, M.V.; McGroder, C.; Stevens, J.S.; Cook, J.R.; Nordvig, A.S.; Shalev, D.; Sehrawat, T.S.; et al. Post-Acute COVID-19 Syndrome. Nat. Med. 2021, 27, 601–615. [Google Scholar] [CrossRef] [PubMed]
- López-Hernández, Y.; Monárrez-Espino, J.; López, D.A.G.; Zheng, J.; Borrego, J.C.; Torres-Calzada, C.; Elizalde-Díaz, J.P.; Mandal, R.; Berjanskii, M.; Martínez-Martínez, E.; et al. The Plasma Metabolome of Long COVID Patients Two Years after Infection. Sci. Rep. 2023, 13, 12420. [Google Scholar] [CrossRef] [PubMed]
- New Insights on Long COVID Symptoms in Adults Highlight Update to RECOVER Study Findings|RECOVER COVID Initiative. Available online: https://recovercovid.org/news/new-insights-long-covid-symptoms-adults-highlight-update-recover-study-findings (accessed on 5 November 2025).
- Long COVID (Post-COVID Conditions, PCC). Available online: https://www.yalemedicine.org/conditions/long-covid-post-covid-conditions-pcc (accessed on 5 November 2025).
- Babalola, T.K.; Clouston, S.A.P.; Sekendiz, Z.; Chowdhury, D.; Soriolo, N.; Kawuki, J.; Meliker, J.; Carr, M.; Valenti, B.R.; Fontana, A.; et al. SARS-CoV-2 Re-Infection and Incidence of Post-Acute Sequelae of COVID-19 (PASC) among Essential Workers in New York: A Retrospective Cohort Study. Lancet Reg. Health Am. 2025, 42, 100984. [Google Scholar] [CrossRef]
- Long COVID. Available online: https://www.health.ny.gov/diseases/long_covid/ (accessed on 5 November 2025).
- Frontera, J.A.; Thorpe, L.E.; Simon, N.M.; De Havenon, A.; Yaghi, S.; Sabadia, S.B.; Yang, D.; Lewis, A.; Melmed, K.; Balcer, L.J.; et al. Post-Acute Sequelae of COVID-19 Symptom Phenotypes and Therapeutic Strategies: A Prospective, Observational Study. PLoS ONE 2022, 17, e0275274. [Google Scholar] [CrossRef]
- Deer, R.R.; Rock, M.A.; Vasilevsky, N.; Carmody, L.; Rando, H.; Anzalone, A.J.; Basson, M.D.; Bennett, T.D.; Bergquist, T.; Boudreau, E.A.; et al. Characterizing Long COVID: Deep Phenotype of a Complex Condition. eBioMedicine 2021, 74, 103722. [Google Scholar] [CrossRef]
- O’Hare, A.M.; Vig, E.K.; Iwashyna, T.J.; Fox, A.; Taylor, J.S.; Viglianti, E.M.; Butler, C.R.; Vranas, K.C.; Helfand, M.; Tuepker, A.; et al. Complexity and Challenges of the Clinical Diagnosis and Management of Long COVID. JAMA Netw. Open 2022, 5, e2240332. [Google Scholar] [CrossRef]
- Goldenberg, D.L. How to Understand the Overlap of Long COVID, Chronic Fatigue Syndrome/Myalgic Encephalomyelitis, Fibromyalgia and Irritable Bowel Syndromes. Semin. Arthritis Rheum. 2024, 67, 152455. [Google Scholar] [CrossRef]
- POTS or Long COVID? How to Tell the Difference|Brain|COVID|Heart|UT Southwestern Medical Center. Available online: https://utswmed.org/medblog/pots-long-covid-research/ (accessed on 5 November 2025).
- Thor, D.C.; Suarez, S. Corona With Lyme: A Long COVID Case Study. Cureus 2023, 15, e36624. [Google Scholar] [CrossRef]
- Sisó-Almirall, A.; Brito-Zerón, P.; Conangla Ferrín, L.; Kostov, B.; Moragas Moreno, A.; Mestres, J.; Sellarès, J.; Galindo, G.; Morera, R.; Basora, J.; et al. Long COVID-19: Proposed Primary Care Clinical Guidelines for Diagnosis and Disease Management. Int. J. Environ. Res. Public Health 2021, 18, 4350. [Google Scholar] [CrossRef]
- Raveendran, A.V. Long COVID-19: Challenges in the Diagnosis and Proposed Diagnostic Criteria. Diabetes Metab. Syndr. Clin. Res. Rev. 2021, 15, 145–146. [Google Scholar] [CrossRef] [PubMed]
- Azhir, A.; Hügel, J.; Tian, J.; Cheng, J.; Bassett, I.V.; Bell, D.S.; Bernstam, E.V.; Farhat, M.R.; Henderson, D.W.; Lau, E.S.; et al. Precision Phenotyping for Curating Research Cohorts of Patients with Unexplained Post-Acute Sequelae of COVID-19. Med 2025, 6, 100532. [Google Scholar] [CrossRef]
- Buonsenso, D.; Cotugno, N.; Amodio, D.; Pascucci, G.R.; Di Sante, G.; Pighi, C.; Morrocchi, E.; Pucci, A.; Olivieri, G.; Colantoni, N.; et al. Distinct Pro-Inflammatory/pro-Angiogenetic Signatures Distinguish Children with Long COVID from Controls. Pediatr. Res. 2025, 98, 928–935. [Google Scholar] [CrossRef] [PubMed]
- Yap, C.W. PaDEL-descriptor: An Open Source Software to Calculate Molecular Descriptors and Fingerprints. J. Comput. Chem. 2011, 32, 1466–1474. [Google Scholar] [CrossRef]
- Yao, J.Z.; Tsigelny, I.F.; Kesari, S.; Kouznetsova, V.L. Diagnostics of Ovarian Cancer via Metabolite Analysis and Machine Learning. Integr. Biol. 2023, 15, zyad005. [Google Scholar] [CrossRef]
- Dong, J.; Cao, D.-S.; Miao, H.-Y.; Liu, S.; Deng, B.-C.; Yun, Y.-H.; Wang, N.-N.; Lu, A.-P.; Zeng, W.-B.; Chen, A.F. ChemDes: An Integrated Web-Based Platform for Molecular Descriptor and Fingerprint Computation. J. Cheminform 2015, 7, 60. [Google Scholar] [CrossRef]
- Yang, R.; Tsigelny, I.F.; Kesari, S.; Kouznetsova, V.L. Colorectal Cancer Detection via Metabolites and Machine Learning. Curr. Issues Mol. Biol. 2024, 46, 4133–4146. [Google Scholar] [CrossRef]
- Wishart, D.S.; Guo, A.; Oler, E.; Wang, F.; Anjum, A.; Peters, H.; Dizon, R.; Sayeeda, Z.; Tian, S.; Lee, B.L.; et al. HMDB 5.0: The Human Metabolome Database for 2022. Nucleic Acids Res. 2022, 50, D622–D631. [Google Scholar] [CrossRef]
- Tetko, I.V.; Gasteiger, J.; Todeschini, R.; Mauri, A.; Livingstone, D.; Ertl, P.; Palyulin, V.A.; Radchenko, E.V.; Zefirov, N.S.; Makarenko, A.S.; et al. Virtual Computational Chemistry Laboratory—Design and Description. J. Comput. Aided Mol. Des. 2005, 19, 453–463. [Google Scholar] [CrossRef] [PubMed]
- Witten, I.H.; Frank, E.; Hall, M.A.; Pal, C.J. Data Mining: Practical Machine Learning Tools and Techniques, 4th ed.; Morgan Kaufmann Series in Data Management Systems; Morgan Kaufmann: San Francisco, CA, USA, 2017; ISBN 9780128042915. [Google Scholar]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-Learn: Machine Learning in Python. Available online: https://jmlr.csail.mit.edu/papers/v12/pedregosa11a.html (accessed on 8 December 2025).
- Wang, K.; Khoramjoo, M.; Srinivasan, K.; Gordon, P.M.K.; Mandal, R.; Jackson, D.; Sligl, W.; Grant, M.B.; Penninger, J.M.; Borchers, C.H.; et al. Sequential Multi-Omics Analysis Identifies Clinical Phenotypes and Predictive Biomarkers for Long COVID. Cell Rep. Med. 2023, 4, 101254. [Google Scholar] [CrossRef] [PubMed]
- Marino, C.; Grimaldi, M.; Sabatini, P.; Amato, P.; Pallavicino, A.; Ricciardelli, C.; D’Ursi, A.M. Fibromyalgia and Depression in Women: An 1H-NMR Metabolomic Study. Metabolites 2021, 11, 429. [Google Scholar] [CrossRef]
- Molins, C.R.; Ashton, L.V.; Wormser, G.P.; Hess, A.M.; Delorey, M.J.; Mahapatra, S.; Schriefer, M.E.; Belisle, J.T. Development of a Metabolic Biosignature for Detection of Early Lyme Disease. Clin. Infect. Dis. 2015, 60, 1767–1775. [Google Scholar] [CrossRef] [PubMed]
- Germain, A.; Barupal, D.K.; Levine, S.M.; Hanson, M.R. Comprehensive Circulatory Metabolomics in ME/CFS Reveals Disrupted Metabolism of Acyl Lipids and Steroids. Metabolites 2020, 10, 34. [Google Scholar] [CrossRef]
- Li, Y.; Bai, B.; Wang, H.; Wu, H.; Deng, Y.; Shen, C.; Zhang, Q.; Shi, L. Plasma Metabolomic Profile in Orthostatic Intolerance Children with High Levels of Plasma Homocysteine. Ital. J. Pediatr. 2024, 50, 52. [Google Scholar] [CrossRef]
- Han, L.; Zhao, L.; Zhou, Y.; Yang, C.; Xiong, T.; Lu, L.; Deng, Y.; Luo, W.; Chen, Y.; Qiu, Q.; et al. Altered Metabolome and Microbiome Features Provide Clues in Understanding Irritable Bowel Syndrome and Depression Comorbidity. ISME J. 2022, 16, 983–996. [Google Scholar] [CrossRef]
- Weininger, D. SMILES, a Chemical Language and Information System. 1. Introduction to Methodology and Encoding Rules. J. Chem. Inf. Comput. Sci. 1988, 28, 31–36. [Google Scholar] [CrossRef]
- Kim, S.; Chen, J.; Cheng, T.; Gindulyte, A.; He, J.; He, S.; Li, Q.; Shoemaker, B.A.; Thiessen, P.A.; Yu, B.; et al. PubChem 2025 Update. Nucleic Acids Res. 2025, 53, D1516–D1525. [Google Scholar] [CrossRef] [PubMed]
- NCI/CADD Group Chemoinformatics Tools and User Services. Available online: https://cactus.nci.nih.gov/index.html (accessed on 5 November 2025).
- E-Dragon Software. Available online: https://vcclab.org/lab/edragon/ (accessed on 5 November 2025).
- Khoja, O.; Mulvey, M.; Astill, S.; Tan, A.L.; Sivan, M. New-Onset Chronic Musculoskeletal Pain Following COVID-19 Infection Fulfils the Fibromyalgia Clinical Syndrome Criteria: A Preliminary Study. Biomedicines 2024, 12, 1940. [Google Scholar] [CrossRef]
- Leonardo, S.; Fregni, F. Exploring the Inflammatory Basis of Fibromyalgia and Long-COVID: Insights from Human and Animal Studies. Princ. Pract. Clin. Res. 2025, 1, 15022. [Google Scholar] [CrossRef]
- Häuser, W.; Ablin, J.; Fitzcharles, M.-A.; Littlejohn, G.; Luciano, J.V.; Usui, C.; Walitt, B. Fibromyalgia. Nat. Rev. Dis. Primers 2015, 1, 15022. [Google Scholar] [CrossRef]
- Merino, A.S.; Simon, D. Fatores genéticos associados à fibromialgia: Uma revisão narrativa. Res. Soc. Dev. 2022, 11, e11211326421. [Google Scholar] [CrossRef]
- Nunn, A.V.W.; Guy, G.W.; Brysch, W.; Bell, J.D. Understanding Long COVID; Mitochondrial Health and Adaptation—Old Pathways, New Problems. Biomedicines 2022, 10, 3113. [Google Scholar] [CrossRef] [PubMed]
- Molnar, T.; Lehoczki, A.; Fekete, M.; Varnai, R.; Zavori, L.; Erdo-Bonyar, S.; Simon, D.; Berki, T.; Csecsei, P.; Ezer, E. Mitochondrial Dysfunction in Long COVID: Mechanisms, Consequences, and Potential Therapeutic Approaches. GeroScience 2024, 46, 5267–5286. [Google Scholar] [CrossRef] [PubMed]
- Cordero, M.D.; De Miguel, M.; Moreno Fernández, A.M.; Carmona López, I.M.; Garrido Maraver, J.; Cotán, D.; Gómez Izquierdo, L.; Bonal, P.; Campa, F.; Bullon, P.; et al. Mitochondrial Dysfunction and Mitophagy Activation in Blood Mononuclear Cells of Fibromyalgia Patients: Implications in the Pathogenesis of the Disease. Arthritis Res. Ther. 2010, 12, R17. [Google Scholar] [CrossRef]
- Cordero, M.D.; De Miguel, M.; Moreno-Fernández, A.M. La disfunción mitocondrial en la fibromialgia y su implicación en la patogénesis de la enfermedad. Med. Clínica 2011, 136, 252–256. [Google Scholar] [CrossRef]
- Marino, Y.; Inferrera, F.; D’Amico, R.; Impellizzeri, D.; Cordaro, M.; Siracusa, R.; Gugliandolo, E.; Fusco, R.; Cuzzocrea, S.; Di Paola, R. Role of Mitochondrial Dysfunction and Biogenesis in Fibromyalgia Syndrome: Molecular Mechanism in Central Nervous System. Biochim. Biophys. Acta (BBA) Mol. Basis Dis. 2024, 1870, 167301. [Google Scholar] [CrossRef]
- COVID and Fibromyalgia: Is There a Link? Available online: https://www.healthline.com/health/fibromyalgia/covid-and-fibromyalgia (accessed on 5 November 2025).
- Davis, L.; Higgs, M.; Snaith, A.; Lodge, T.A.; Strong, J.; Espejo-Oltra, J.A.; Kujawski, S.; Zalewski, P.; Pretorius, E.; Hoerger, M.; et al. Dysregulation of Lipid Metabolism, Energy Production, and Oxidative Stress in Myalgic Encephalomyelitis/Chronic Fatigue Syndrome, Gulf War Syndrome and Fibromyalgia. Front. Neurosci. 2025, 19, 1498981. [Google Scholar] [CrossRef] [PubMed]





| Characteristic | PASC Training (n = 117) | Healthy Training (n = 28) | PASC Testing (n = 48) | Healthy Testing (n = 37) |
|---|---|---|---|---|
| Age (years) | 62 (53–73) | 55 (52–59) | 51.5 (43.5–60.8) | 40.5 (37–53.3) |
| Male | 66 (56.4) | 16 (57.1) | 28 (58.3) | 17 (44.7) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cai, E.; Kouznetsova, V.L.; Tsigelny, I.F. Metabolomics-Based Machine Learning Diagnostics of Post-Acute Sequelae of SARS-CoV-2 Infection. Metabolites 2025, 15, 801. https://doi.org/10.3390/metabo15120801
Cai E, Kouznetsova VL, Tsigelny IF. Metabolomics-Based Machine Learning Diagnostics of Post-Acute Sequelae of SARS-CoV-2 Infection. Metabolites. 2025; 15(12):801. https://doi.org/10.3390/metabo15120801
Chicago/Turabian StyleCai, Ethan, Valentina L. Kouznetsova, and Igor F. Tsigelny. 2025. "Metabolomics-Based Machine Learning Diagnostics of Post-Acute Sequelae of SARS-CoV-2 Infection" Metabolites 15, no. 12: 801. https://doi.org/10.3390/metabo15120801
APA StyleCai, E., Kouznetsova, V. L., & Tsigelny, I. F. (2025). Metabolomics-Based Machine Learning Diagnostics of Post-Acute Sequelae of SARS-CoV-2 Infection. Metabolites, 15(12), 801. https://doi.org/10.3390/metabo15120801

