You are currently viewing a new version of our website. To view the old version click .
  • This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
  • Article
  • Open Access

7 December 2025

A Machine Learning Model Based on First-Trimester Lipidomic Signatures for Predicting Metabolic Pregnancy Complications

,
,
,
,
,
and
1
V.I. Kulakov National Medical Research Center for Obstetrics, Gynecology and Perinatology, Ministry of Healthcare of Russian Federation, Moscow 117997, Russia
2
Laboratory of Translational Medicine, Siberian State Medical University, Tomsk 634050, Russia
3
Department of Obstetrics, Gynecology, Perinatology and Reproductology, Institute of Professional Education, Federal State Autonomous Educational Institution of Higher Education, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation, Moscow 119991, Russia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci.2025, 26(24), 11824;https://doi.org/10.3390/ijms262411824 
(registering DOI)
This article belongs to the Section Molecular Endocrinology and Metabolism

Abstract

Gestational diabetes mellitus (GDM) and macrosomia are crucial for improving maternal and neonatal outcomes. Molecular dysregulations can manifest long before clinical symptoms appear. This study aimed to leverage first-trimester serum lipidomic signatures to build early predictive models for these complications. A case–control study was conducted using serum samples from 119 women during first-trimester screening: 40 cases and 79 controls for GDM prediction and 45 cases and 74 controls for macrosomia prediction (newborn weight more than 90 percentile). Lipidomic profiling was performed using shotgun mass spectrometry in both positive and negative electrospray ionization modes. After feature selection based on Shapley values, machine learning models—including Random Forest and XGBoost—were constructed and evaluated via 10-fold cross-validation. For GDM, potential early biomarkers included elevated levels of triacylglycerol (TG) 55:7 and decreased levels of 13-Docosenamide, plasmenyl-phosphatidylcholine (PC P)-36:2, and phosphatidylcholine (PC) 42:7. For macrosomia, phosphatidylglycerol (PG) (i-, a- 29:0), 4-Hydroxybutyric acid, and Pantothenol were significantly altered. The model for GDM prediction achieved a sensitivity of 87% and specificity of 89%. For macrosomia, the model demonstrated a sensitivity of 87% and specificity of 93%. The Random Forest and XGBoost models demonstrated comparable performance metrics on average. The risk ratio between the high- and low-risk groups defined by the models was 11.9 for GDM and 11.1 for macrosomia. Our findings demonstrate that first-trimester serum lipidomic profiles, combined with clinical data and interpreted by advanced machine learning, can accurately identify patients at high risk for GDM and macrosomia. This integrated approach holds significant promise for developing a clinical tool for timely intervention and personalized pregnancy management.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.