Next Article in Journal
Genomic Surveillance of SARS-CoV-2 in Malaysia during the Era of Endemic COVID-19
Next Article in Special Issue
The Task Matters: A Comprehensive Review and Proposed Literature Score of the Effects of Chemical and Physical Parameters on Embryo Developmental Competence
Previous Article in Journal
Applications of Artificial Intelligence and Radiomics in Molecular Hybrid Imaging and Theragnostics for Neuro-Endocrine Neoplasms (NENs)
Previous Article in Special Issue
Outcomes of Pregnancy in COVID-19-Positive Mothers in a Tertiary Centre
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Risk Prediction Model of Early-Onset Preeclampsia Based on Risk Factors and Routine Laboratory Indicators

1
Department of Laboratory Medicine, Peking University Third Hospital, Beijing 100191, China
2
Department of Blood Transfusion, Peking University Third Hospital, Beijing 100191, China
3
Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China
4
Research Center of Clinical Epidemiology, Peking University Third Hospital, Beijing 100191, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Life 2023, 13(8), 1648; https://doi.org/10.3390/life13081648
Submission received: 15 June 2023 / Revised: 16 July 2023 / Accepted: 19 July 2023 / Published: 28 July 2023
(This article belongs to the Special Issue Obstetrics and Gynecology Medicine: Go From Bench to Bedside)

Abstract

:
Background: Globally, 10–15% of maternal deaths are statistically attributable to preeclampsia. Compared with late-onset PE, the severity of early-onset PE remains more harmful with higher morbidity and mortality. Objective: To establish an early-onset preeclampsia prediction model by clinical characteristics, risk factors and routine laboratory indicators were investigated from pregnant women at 6 to 10 gestational weeks. Methods: The clinical characteristics, risk factors, and 38 routine laboratory indicators (6–10 weeks of gestation) including blood lipids, liver and kidney function, coagulation, blood count, and other indicators of 91 early-onset preeclampsia patients and 709 normal controls without early-onset preeclampsia from January 2010 to May 2021 in Peking University Third Hospital (PUTH) were retrospectively analyzed. A logistic regression, decision tree model, and support vector machine (SVM) model were applied for establishing prediction models, respectively. ROC curves were drawn; area under curve (AUCROC), sensitivity, and specificity were calculated and compared. Results: There were statistically significant differences in the rates of diabetes, antiphospholipid syndrome (APS), kidney disease, obstructive sleep apnea (OSAHS), primipara, history of preeclampsia, and assisted reproductive technology (ART) (p < 0.05). Among the 38 routine laboratory indicators, there were no significant differences in the levels of PLT/LYM, NEU/LYM, TT, D-Dimer, FDP, TBA, ALP, TP, ALB, GLB, UREA, Cr, P, Cystatin C, HDL-C, Apo-A1, and Lp(a) between the two groups (p > 0.05). The levels of the rest indicators were all statistically different between the two groups (p < 0.05). If only 12 risk factors of PE were analyzed with the logistic regression, decision tree model, and support vector machine (SVM), and the AUCROC were 0.78, 0.74, and 0.66, respectively, while 12 risk factors of PE and 38 routine laboratory indicators were analyzed with the logistic regression, decision tree model, and support vector machine (SVM), and the AUCROC were 0.86, 0.77, and 0.93, respectively. Conclusions: The efficacy of clinical risk factors alone in predicting early-onset preeclampsia is not high while the efficacy increased significantly when PE risk factors combined with routine laboratory indicators. The SVM model was better than logistic regression model and decision tree model in early prediction of early-onset preeclampsia incidence.

1. Introduction

Globally, 10–15% of all maternal deaths can be attributed to preeclampsia or eclampsia, a placentally derived disease of pregnancy [1,2]. Maternal complications associated with preeclampsia include placental abruption, acute kidney disease, pulmonary edema, and heart failure. In severe cases, preeclampsia leads to eclamptic seizures and life-threatening hemolysis, elevated liver enzymes, and low platelet count (HELLP) syndrome [3]. Moreover, fetal complications related to preeclampsia include impaired fetal growth, neonatal respiratory distress syndrome, and stillbirth. Preeclampsia can be classified as early-onset preeclampsia, which develops before 34 weeks’ gestation, and the more common late-onset preeclampsia, which develops at or after 34 weeks’ gestation [4]. Compared with late-onset PE, the severity of early-onset PE remains more harmful with higher morbidity and mortality [5].
Despite the serious clinical consequences, there is no effective preventive measure for preeclampsia currently. Timely identification and management of preeclampsia can significantly improve maternal and perinatal outcomes [6]. Therefore, risk prediction of preeclampsia and preeclampsia-related disorders has received considerable attention over the past two decades. A practical prediction model would allow for increased surveillance of at-risk patients and reduce the surveillance of patients who are less likely to develop preeclampsia, which makes medical resources fully and reasonably allocated and utilized. Although previous studies have analyzed clinical features and evaluated biomarkers for effective prediction, few have demonstrated clinically sufficient properties [7,8,9,10,11].
Machine learning (ML) techniques provide the possibility to infer important connections between items from different data sets that would otherwise be difficult to correlate [12,13]. Due to the vast amount and complexity of medical information, ML is considered a promising method for diagnosing diseases or predicting clinical outcomes. Multiple ML techniques have been used in clinical settings and shown to be more accurate than traditional methods in predicting disease [14].
This study was aimed to develop ML models to predict early-onset preeclampsia by using risk factors and routine laboratory indicators and to compare the performance of different models.

2. Materials and Methods

2.1. Study Population

Pregnant women with non-singleton, miscarriage or fetal death, intrauterine chromosomal disorders or fetal malformations, and missing laboratory data were excluded. Preeclampsia (PE) is defined as systolic blood pressure at ≥140 mm Hg and/or diastolic blood pressure at ≥90 mm Hg on at least two occasions measured 4 h apart in previously normotensive women and is accompanied by one or more of the following new-onset conditions at or after 20 weeks of gestation: 1. Proteinuria (i.e., ≥30 mg/mol protein:creatinine ratio; ≥300 mg/24 h; or ≥2 + dipstick); 2. Evidence of other maternal organ dysfunction, including acute kidney injury (creatinine ≥90 μmol/L; 1 mg/dL); liver involvement (elevated transaminases, e.g., alanine aminotransferase or aspartate aminotransferase >40 IU/L) with or without right upper quadrant or epigastric abdominal pain; neurological complications (e.g., eclampsia, altered mental status, blindness, stroke, clonus, severe headaches, and persistent visual scotomata); or hematological complications (thrombocytopenia–platelet count <150,000/μL, disseminated intravascular coagulation, hemolysis); 3. Uteroplacental dysfunction (such as fetal growth restriction, abnormal umbilical artery Doppler waveform analysis, or stillbirth) according to the FIGO guidelines [6]. Pregnant women who met the diagnostic criteria for preeclampsia and with delivery at <34+0 weeks of gestation can be subclassified into early-onset preeclampsia. A total of 91 Chinese pregnant women who were diagnosed with early-onset preeclampsia in the Department of Obstetrics and Gynecology of Peking University Third Hospital from January 2010 to May 2021 were included as PE group. Meanwhile, 709 Chinese pregnant women who had normal delivery and single live birth in the department of Obstetrics and Gynecology of Peking University Third Hospital during the same period were selected as the control group (CON). The retrospective study protocol was approved by the Peking University Third Hospital Medical Science Research Ethics Committee (IRB00006761-M2021032).

2.2. Clinical and Biochemical Data Collection

Clinical characteristics of patients, such as age of admission, gestational age, disease history, pregnancy history, and blood pressure (1 mmHg = 0.133 kPa), were obtained from electronic medical records. There are 12 risk factors [15] for preeclampsia, which include diabetes, thrombotic diseases, systemic lupus erythematosus (SLE), antiphospholipid syndrome (APS) and kidney diseases, assisted reproductive technology (ART), obstructive sleep apnea, pre-pregnancy body mass index (BMI) > 30 kg/m2, age > 35 years old, multiple pregnancies, primipara, and history of eclampsia or preeclampsia. Routine laboratory indicators including albumin (ALB), alanine transaminase (ALT), aspartate transaminase (AST), alkaline phosphatase (ALP), complement C1q, calcium (Ca), creatinine (Cr), C-reactive protein (CRP), cystatin C, γ-glutamyl transpeptidase (GGT), globulin (GLB), triglyceride (TG), total cholesterol (TC), high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), lipoprotein (a) [Lp (a)], apolipoprotein A1 (ApoA1), apolipoprotein B (ApoB), small dense low density lipoprotein (sdLDL-C), total protein (TP), total bile acid (TBA), total bilirubin (T-Bil), direct bilirubin (D-Bil), uric acid (UA), Urea (UREA), phosphorus (P), absolute value of lymphocyte (LYM), absolute value of neutrophil (NEU), platelet count (PLT), NEU /LTM ratio, PLT /LYM ratio, prothrombin time (PT), prothrombin activity (PTA), activated partial thrombin time (APTT), fibrinogen (FIB), D-Dimer, fibrinogen degradation Products (FDP), thrombin time (TT).

2.3. Instruments and Reagents

Fasting blood samples of the participants were collected from elbow venous using vacutainer containing separation glue at 6–10 weeks of gestation. The blood samples were centrifuged at 2793× g for 5 min. The serum was separated and stored at −80 °C refrigerator for subsequent detection. Serum liver and kidney function, lipid metabolism, and complement indexes were detected by AU5800 automatic biochemical analyzer (Beckman Coulter, Brea, CA, USA).
The peripheral blood samples were obtained with venipuncture and collected into vacuum blood collection tubes containing sodium citrate as the anticoagulant (INSEPACK® Sekisui, Beijing, China). The plasma was obtained by centrifuging the samples at 1500× g for 5 min. Automatic coagulation analyzer (ACL-TOP 700®, Werfen, Barcelona, Spain) was used to detect coagulation items.
The peripheral blood samples were obtained with venipuncture and collected in vacuum blood collection tubes containing EDTA-K2 as the anticoagulant (INSEPACK®, Sekisui, Beijing, China). The peripheral leukocytes were counted and classified into neutrophils, eosinophils, basophils, lymphocytes, and monocytes in the traditional five subtype classification method with an automatic blood count analyzer (SYSMEX XN-2000 Automated Hematology Analyzer, Kobe, Japan).
Instrument calibration, calibration, quality control were matched and applied in strict accordance with the standard operation procedure.

2.4. Statistical Analysis

SPSS 24.0 and MATLAB software (R2022a) were used for data analysis. The K-S normal distribution was used to detect the normality of data; measurement data conformed to normal distribution with x ± s description and non-normal distribution with a median (interquartile range). Mann–Whitney U test was used for pairwise comparison of skewed distribution data between groups. The count data were tested with chi-square test, and the number of use cases (percentage) was described. p < 0.05 was considered statistically significant.

2.4.1. Logistic Regression Model

All routine laboratory indicators were analyzed with univariate binary Logistic regression; multivariate binary Logistic regression analysis was performed for the variable of p < 0.05. The maximum Youden index was taken as the cut-off point, the risk degree was expressed as the OR value [95% confidence interval (CI), 95%CI], and the receiver operating characteristic (ROC) curve was made.

2.4.2. Machine Learning

We used 2 machine learning algorithms: the decision tree model and support vector machine (SVM). For the development of machine learning models, we obtained 12 risk factors and 38 routine laboratory indicators mentioned above. The predictive value of individual risk factors and the models combining risk factors and laboratory indicators were explored, respectively. Machine learning models were trained with all variables as inputs to classify patients likely to have favorable outcomes. Among the study population, 80% were randomly selected for the training set, and the remaining 20% were used as the test set to prevent overfitting of the models. MATLAB version R 2022a was used to train the machine learning models.

Decision Tree Model

Decision tree is a commonly used supervised learning algorithm. It uses Gini coefficient, entropy, and other parameters to select features and generate a tree structure, and classifies the original data set into a series of smaller subgroups. This method had the advantages of strong interpretability, low computational costs, and strong robustness. Similarly, the ROC curve was made compared with other models.

Support Vector Machine (SVM)

The support vector machine is a learning system that uses a hypothesis space of linear functions in a high-dimensional feature space. This method maximizes the separation boundary of the two classes under the assumption of improving the generalization ability of the classifier. It makes all samples of different classes well discriminated by finding a projection direction and obtaining the optimal hyperplane. In addition, this method can also achieve nonlinear mapping through the kernel function so as to obtain a stronger fitting ability. Among them, the commonly used kernel functions are as follows: gaussian kernel function, polynomial kernel function, sigmoid kernel function, etc. In this study, considering the strong linear relationship between laboratory indicators and predicted results and the objective situation due to limited sample size, a linear kernel with lower complexity was used. We used the ten-fold cross-validation method to verify the ability of the model, and the results of the sensitivity, specificity, and other indicators were good and consistent, which proved that the model had good fitting and generalization ability.

3. Results

3.1. Participants’ Clinical Characteristics

There were significant differences in maternal age and pre-pregnancy BMI between the two groups (p < 0.05), and the PE group had a higher pre-pregnancy BMI compared with the control group (Table 1).

3.2. Comparison of Risk Factors

There was no significant difference in the proportion of pregnant women with thrombotic disease or systemic lupus erythematosus (SLE) between the two groups (p > 0.05). However, there were statistically significant differences in the rates of diabetes, antiphospholipid syndrome (APS), kidney disease, obstructive sleep apnea (OSAHS), primipara, history of preeclampsia, and assisted reproductive technology (ART) (p < 0.05). The proportion of thrombotic diseases in the PE group was lower than that in the control group, and the other proportions were higher than that in the control group (Table 1).

3.3. Comparison of Routine Laboratory Indicators

Among the 38 routine laboratory indicators, there were no significant differences in the levels of PLT/LYM, NEU/LYM, TT, D-Dimer, FDP, TBA, ALP, TP, ALB, GLB, UREA, Cr, P, Cystatin C, HDL-C, Apo-A1, and Lp(a) between the two groups (p > 0.05). The levels of the rest indicators were all statistically different (p < 0.05) (Table 2).

3.4. Results of Each Model and Receiver Operating Curve (ROC) Analysis

  • Logistic regression analysis
When the risk factors were analyzed with a univariate logistic regression, the results were shown in Supplementary Table S1 (p < 0.05).
If only 12 risk factors of PE were analyzed with a multivariate binary logistic regression and an ROC curve analysis was performed, the maximum Youden index of logistic regression was 0.110, the sensitivity of the model was 12.1%, the specificity was 98.9%, and the AUCROC = 0.78.
Multivariate binary logistic regression analysis was performed on 12 risk factors of PE and 38 routine laboratory indicators. An ROC curve analysis was performed according to the above methods, the maximum Youden index of logistic regression was 0.701, the sensitivity of the model was 73.6%, the specificity was 96.5%, and the AUCROC = 0.86.
b.
Decision tree model analysis
Using a decision tree learning algorithm, if only 12 risk factors of PE were included in the model, the maximum Youden index of logistic regression was 0.130, the sensitivity of the model was 15.4%, the specificity was 97.6%, and the AUCROC = 0.74; when 12 risk factors of PE and 38 routine laboratory indicators were included in the model, the maximum Youden index of logistic regression was 0.616, the sensitivity of the model was 64.8%, the specificity was 96.8%, and the AUCROC = 0.77.
c.
Support vector machine (SVM) analysis
Using SVM learning system, if only 12 risk factors of PE were included in the model, the maximum Youden index of logistic regression was 0.055, the sensitivity of model was 6.6%, the specificity of model was 98.9%, and the AUCROC = 0.66. When 12 risk factors of PE and 38 routine laboratory indicators were included in the model, the maximum Youden index of logistic regression was 0.669, the sensitivity of the model was 67.0%, the specificity was 99.9%, and the AUCROC = 0.93.
The results of the ROC analysis based on 12 risk factors of PE are shown in Figure 1, and the results of the ROC analysis based on 12 risk factors of PE combining with 38 routine laboratory indicators are shown in Figure 2.
d.
Delong test of ROCs differ between models
Delong tests were used to explore whether there were statistical differences in the area under the curve between the three models. If only 12 risk factors of PE were included in the models, the results of the pairwise comparison of ROC curves between support vector machine and decision tree models and those between support vector machine and logistic regression models were statistically different. The results are shown in Table 3 (p < 0.05).
While 12 risk factors of PE and 38 routine laboratory indicators were included in the models, the results of pairwise comparison of ROC curves between support vector machine and decision tree models and that between support vector machine and logistic regression models were also statistically different. The results are shown in Table 4 (p < 0.05).

4. Discussion

The incidence of PE is related to spiral artery remodeling disorder, endothelial dysfunction, vasospasm, oxidative stress, and micro-embolism. Therefore, factors affecting placenta formation and endothelial function damage are the risk factors for PE [16]. Consistent with other studies [17,18,19], previous history of preeclampsia, diabetes mellitus, thrombotic disease, systemic lupus erythematosus (SLE), antiphospholipid syndrome (APS), kidney disease, assisted reproductive technology, obstructive sleep apnea hypopnea syndrome (OSAHS), BMI > 30 kg/m2, age over 35, multiple pregnancies, and primipara were included as risk factors in the model. In this study, there was no thrombotic disease in the PE group, which may be because pregnant women with thrombotic disease tendency continued to take anticoagulant drugs, such as aspirin in the first trimester, effectively preventing abnormal blood flow status and thrombosis and reducing the risk of PE.
The pathologic lesions of preeclampsia and eclampsia are characterized by widespread endothelial lesions in various organ beds [20], such as liver lesions with periportal and portal necrosis and hepatic arterial medial necrosis, based on an autopsy series of 317 mothers who died of eclampsia. Therefore, when PE has not progressed in the first trimester of pregnancy, slight changes in liver vessels may have occurred in pregnant women, and the liver function is affected, which is manifested as elevated liver enzymes, abnormal coagulation function, and abnormal substance metabolism. Similarly, renal tissue demonstrated hallmarks of glomerular endotheliosis reported in previous studies [21]. Glomerular endothelial cell lesions, impaired mechanical barrier and charge barrier, and increased filtration membrane permeability lead to abnormal renal function and proteinuria in PE patients [22].
Previous studies have shown an association between abnormal lipid metabolism and inflammatory activation with preeclampsia [23,24,25,26]. In this study, except for HDL-C and ApoA1, the other blood lipid indicators in the PE group were higher than those in the control group. HDL is involved in the reverse transport of cholesterol (as a vascular protective factor that has an anti-atherosclerosis effect while ApoA1 is a tool to carry HDL), is also a component of HDL, and has a relatively important role in preventing the occurrence of atherosclerosis. HDL-C and ApoA1 levels of the PE group were lower than those of the control group, which was consistent with previous studies [27].
In this study, it is not hard to see that the models established combining routine laboratory indicators with risk factors improve the accuracy of prediction rather than only with risk factors. In this study, the SVM model had the best prediction ability of early-onset PE. Machine learning has received a lot of attention in recent years. The advantages and disadvantages of machine learning and traditional statistical models vary with different research questions, research designs, and research data. Compared to the other machine learning methods, the SVM is very powerful at recognizing subtle patterns in complex datasets, greatly improves the prediction performance of the model, and has a good clinical application prospect [28]. The SVM loss function has its own regular term, so SVM is a structural risk minimization algorithm. The so-called structural risk minimization means to seek a balance between a training error and model complexity to prevent overfitting so as to minimize the real error. In order to better minimize structural risks, regular terms were added to the SVM model construction to further reduce potential overfitting.
In previous studies, mean arterial pressure, uterine arterial pulse index, and serum placental growth factor were selected as biomarkers for early-onset PE prediction [29,30]. Although the accuracy and specificity are relatively high, the collection cost is high, and the operation is difficult. Similar to this study, Jong et al. used logistic regression, decision tree model, naive Bayes classification, support vector machine, random forest algorithm, and stochastic gradient boosting method to build a prediction model for delayed preeclampsia by collecting general clinical data, medical history and biochemical laboratory data. The stochastic gradient boosting model had the best prediction performance with an accuracy and false positive rate of 0.973 and 0.009 [31]. Although different from the variables included in this study, it also shows that machine learning algorithms can effectively predict preeclampsia to a certain extent. The routine laboratory indicators adopted in this study are included in the routine prenatal examination, which is convenient to obtain, simple and rapid, and at the same time reduces the extra cost for patients and the prediction cost. However, the deficiency also lies in this; the established model lacks the specific index of early-onset PE, and the specificity of the model is not high. The sample size of the PE group is smaller than that of the control group, which may have a certain impact on the model. Subsequent studies will improve upon this. The sample size of this study is small, which does not meet the requirements of EPV (Event Per Variable), so the results of the logistic regression may not be robust enough. However, considering that this type of patient is rare and that the results are somewhat interpretable, it is still presented. Further research is needed to confirm the reliability of the results.

5. Conclusions

The performance of clinical risk factors alone in predicting early-onset PE is poor, and the performance significantly improved when combing risk factors with routine laboratory indicators. The support vector machine (SVM) model showed the best AUCROC, specificity, and sensitivity compared with the logistic regression model and decision tree model.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/life13081648/s1, The univariate logistic regression analysis of risk factors for PE was shown in Supplementary Table S1 (p < 0.05).

Author Contributions

Y.X.: Conceptualization, Investigation, Visualization, Writing—original draft. N.Y.: Investigation, Data curation. X.G.: Investigation, Data curation. Y.W.: Writing-review & editing, Project administration, Funding acquisition. H.Z.: Data curation. K.J.: Conceptualization, Writing—review & editing, Project administration and Funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the General Project of National Natural Science Foundation of China (Project No.62071007).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Peking University Third Hospital (protocol code IRB00006761-M2016122, date of approval 22 September 2016).

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used to support the findings of this study are available from the corresponding author upon request.

Acknowledgments

We would like to thank all study participants who were involved in this investigation but are not in the author list.

Conflicts of Interest

We declare that we have no financial and personal relationships with other people or organizations that can inappropriately influence our work, there is no professional or other personal interest of any nature or kind in any product, service and/or company that could be construed as influencing the position presented in, or the review of, the manuscript entitled.

Abbreviations

PE: preeclampsia; ML, machine learning; BMI, body mass index; SLE, systemic lupus erythematosus; APS, antiphospholipid syndrome; ART, assisted reproductive technology; OSAHS, obstructive sleep apnea; ALB, albumin; ALT, alanine transaminase; AST, aspartate transaminase; ALP, alkaline phosphatase; C1q, complement C1q; Ca, calcium; Cr, creatinine; CRP, C-reactive protein; GGT, γ-glutamyl transpeptidase; GLB, globulin; TG, triglyceride; TC, total cholesterol; HDL-C, high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; Lp (a), lipoprotein (a); ApoA1, apolipoprotein A1; ApoB, apolipoprotein B; sdLDL-C, small dense low density lipoprotein; TP, total protein; TBA, total bile acid; T-Bil, total bilirubin; D-Bil, direct bilirubin; UA, uric acid; UREA, urea; P, phosphorus; LYM, absolute value of lymphocyte; NEU, absolute value of neutrophil; PLT, platelet count; PT, prothrombin time; PTA, prothrombin activity; APTT, activated partial thrombin time; FIB, fibrinogen; FDP, fibrinogen degradation Products; TT, thrombin time; SVM, support vector machine, CI, confidence interval.

References

  1. Khan, K.S.; Wojdyla, D.; Say, L.; Gülmezoglu, A.M.; Van Look, P.F. WHO analysis of causes of maternal death: A systematic review. Lancet 2006, 367, 1066–1074. [Google Scholar] [CrossRef]
  2. Duley, L. The Global Impact of Pre-eclampsia and Eclampsia. Semin. Perinatol. 2009, 33, 130–137. [Google Scholar] [CrossRef]
  3. Habli, M.; Eftekhari, N.; Wiebracht, E.; Bombrys, A.; Khabbaz, M.; How, H.; Sibai, B. Long-term maternal and subsequent pregnancy outcomes 5 years after hemolysis, elevated liver enzymes, and low platelets (HELLP) syndrome. Am. J. Obs. Gynecol. 2009, 201, 381–385. [Google Scholar] [CrossRef]
  4. Nelson, D.B.; Ziadie, M.S.; McIntire, D.D.; Rogers, B.B.; Leveno, K.J. Placental pathology suggesting that preeclampsia is more than one disease. Am. J. Obs. Gynecol. 2014, 210, 61–66. [Google Scholar] [CrossRef]
  5. De Kat, A.C.; Hirst, J.; Woodward, M.; Kennedy, S.; Peters, S.A. Prediction models for preeclampsia: A systematic review. Pregnancy Hypertens. 2019, 16, 48–66. [Google Scholar] [CrossRef]
  6. Poon, L.C.; Shennan, A.; Hyett, J.A.; Kapur, A.; Hadar, E.; Divakar, H.; McAuliffe, F.; da Silva Costa, F.; von Dadelszen, P.; McIntyre, H.D.; et al. The International Federation of Gynecology and Obstetrics (FIGO) initiative on pre-eclampsia: A pragmatic guide for first-trimester screening and prevention. Int. J. Gynaecol. Obstet. 2019, 145 (Suppl. S1), 1–33. [Google Scholar] [CrossRef] [Green Version]
  7. Payne, B.A.; Hutcheon, J.A.; Ansermino, J.M.; Hall, D.R.; Bhutta, Z.A.; Bhutta, S.Z.; Biryabarema, C.; Grobman, W.A.; Groen, H.; Haniff, F.; et al. A risk prediction model for the assessment and triage of women with hypertensive disorders of pregnancy in low-resourced settings: The miniPIERS (Pre-eclampsia Integrated Estimate of RiSk) multi-country prospective cohort study. PLoS Med. 2014, 11, e1001589. [Google Scholar] [CrossRef] [Green Version]
  8. Thangaratinam, S.; Allotey, J.; Marlin, N.; Mol, B.W.; Von Dadelszen, P.; Ganzevoort, W.; Akkermans, J.; Ahmed, A.; Daniels, J.; Deeks, J.; et al. Development and validation of Prediction models for Risks of complications in Early-onset Pre-eclampsia (PREP): A prospective cohort study. Health Technol. Assess. 2017, 21, 1–100. [Google Scholar] [CrossRef] [Green Version]
  9. Chappell, L.C.; Duckworth, S.; Seed, P.T.; Griffin, M.; Myers, J.; Mackillop, L.; Simpson, N.; Waugh, J.; Anumba, D.; Kenny, L.C.; et al. Diagnostic accuracy of placental growth factor in women with suspected preeclampsia: A prospective multicenter study. Circulation 2013, 128, 2121–2131. [Google Scholar] [CrossRef] [Green Version]
  10. Zeisler, H.; Llurba, E.; Chantraine, F.; Vatish, M.; Staff, A.C.; Sennström, M.; Olovsson, M.; Brennecke, S.P.; Stepan, H.; Allegranza, D.; et al. Predictive Value of the sFlt-1:PlGF Ratio in Women with Suspected Preeclampsia. N. Engl. J. Med. 2016, 374, 13–22. [Google Scholar] [CrossRef]
  11. North, R.A.; McCowan, L.M.; Dekker, G.A.; Poston, L.; Chan, E.H.; Stewart, A.W.; Black, M.A.; Taylor, R.S.; Walker, J.J.; Baker, P.N.; et al. Clinical risk prediction for pre-eclampsia in nulliparous women: Development of model in international prospective cohort. BMJ 2011, 342, d1875. [Google Scholar] [CrossRef] [Green Version]
  12. Obermeyer, Z.; Emanuel, E.J. Predicting the Future—Big Data, Machine Learning, and Clinical Medicine. N. Engl. J. Med. 2016, 375, 1216–1219. [Google Scholar] [CrossRef] [Green Version]
  13. Darcy, A.M.; Louie, A.K.; Roberts, L.W. Machine Learning and the Profession of Medicine. JAMA 2016, 315, 551–552. [Google Scholar] [CrossRef]
  14. Frizzell, J.D.; Liang, L.; Schulte, P.J.; Yancy, C.W.; Heidenreich, P.A.; Hernandez, A.F.; Bhatt, D.L.; Fonarow, G.C.; Laskeyet, W.K. Prediction of 30-Day All-Cause Readmissions in Patients Hospitalized for Heart Failure: Comparison of Machine Learning and Other Statistical Approaches. JAMA Cardiol. 2017, 2, 204–209. [Google Scholar] [CrossRef] [Green Version]
  15. Gestational Hypertension and Preeclampsia: ACOG Practice Bulletin Summary, Number 222. Obstet. Gynecol. 2020, 135, 1492–1495. [CrossRef]
  16. Ives, C.W.; Sinkey, R.; Rajapreyar, I.; Tita, A.T.N.; Oparil, S. Preeclampsia—Pathophysiology and Clinical Presentations: JACC State-of-the-Art Review. J. Am. Coll. Cardiol. 2020, 76, 1690–1702. [Google Scholar] [CrossRef]
  17. Phipps, E.A.; Thadhani, R.; Benzing, T.; Karumanchi, S.A. Pre-eclampsia: Pathogenesis, novel diagnostics and therapies. Nat. Rev. Nephrol. 2019, 15, 275–289. [Google Scholar] [CrossRef]
  18. Good clinical practice advice: First trimester screening and prevention of pre-eclampsia in singleton pregnancy. Int. J. Gynaecol. Obstet. 2019, 144, 325–329. [CrossRef] [Green Version]
  19. ACOG Practice Bulletin No. 202: Gestational Hypertension and Preeclampsia. Obs. Gynecol. 2019, 133, 1.
  20. Hecht, J.L.; Ordi, J.; Carrilho, C.; Ismail, M.R.; Zsengeller, Z.K.; Karumanchi, S.A.; Rosen, S. The pathology of eclampsia: An autopsy series. Hypertens. Pregnancy 2017, 36, 259–268. [Google Scholar] [CrossRef]
  21. Gaber, L.W.; Spargo, B.H.; Lindheimer, M.D. Renal pathology in pre-eclampsia. Baillieres Clin. Obs. Gynaecol. 1994, 8, 443–468. [Google Scholar] [CrossRef] [PubMed]
  22. Garovic, V.D.; Wagner, S.J.; Turner, S.T.; Rosenthal, D.W.; Watson, W.J.; Brost, B.C.; Rose, C.H.; Gavrilova, L.; Craigo, P.; Bailey, K.R.; et al. Urinary podocyte excretion as a marker for preeclampsia. Am. J. Obs. Gynecol. 2007, 196, 320–321. [Google Scholar] [CrossRef] [PubMed]
  23. Dey, M.; Arora, D.; Narayan, N.; Kumar, R. Serum Cholesterol and Ceruloplasmin Levels in Second Trimester can Predict Development of Pre-eclampsia. N. Am. J. Med. Sci. 2013, 5, 41–46. [Google Scholar] [CrossRef] [PubMed]
  24. van Rijn, B.B.; Veerbeek, J.H.; Scholtens, L.C.; Post, U.E.; Koster, M.P.; Peeters, L.L.; Koenen, S.V.; Bruinse, H.W.; Franx, A. C-reactive protein and fibrinogen levels as determinants of recurrent preeclampsia: A prospective cohort study. J. Hypertens. 2014, 32, 408–414. [Google Scholar] [CrossRef]
  25. Alma, L.J.; Bokslag, A.; Maas, A.; Franx, A.; Paulus, W.J.; de Groot, C. Shared biomarkers between female diastolic heart failure and pre-eclampsia: A systematic review and meta-analysis. ESC Heart Fail. 2017, 4, 88–98. [Google Scholar] [CrossRef]
  26. Mary, S.; Kulkarni, M.J.; Malakar, D.; Joshi, S.R.; Mehendale, S.S.; Giri, A.P. Placental Proteomics Provides Insights into Pathophysiology of Pre-Eclampsia and Predicts Possible Markers in Plasma. J. Proteome Res. 2017, 16, 1050–1060. [Google Scholar] [CrossRef]
  27. Wang, Y.; Shi, D.; Chen, L. Lipid profile and cytokines in hypertension of pregnancy: A comparison of preeclampsia therapies. J. Clin. Hypertens. 2018, 20, 394–399. [Google Scholar] [CrossRef] [Green Version]
  28. Huang, S.; Cai, N.; Pacheco, P.P.; Narrandes, S.; Wang, Y.; Xu, W. Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. Cancer Genom. Proteom. 2018, 15, 41–51. [Google Scholar]
  29. Serra, B.; Mendoza, M.; Scazzocchio, E.; Meler, E.; Nolla, M.; Sabrià, E.; Rodríguez, I.; Carreras, E. A new model for screening for early-onset preeclampsia. Am. J. Obs. Gynecol. 2020, 222, 601–608. [Google Scholar] [CrossRef] [Green Version]
  30. Akolekar, R.; Syngelaki, A.; Poon, L.; Wright, D.; Nicolaides, K.H. Competing Risks Model in Early Screening for Preeclampsia by Biophysical and Biochemical Markers. Fetal Diagn. Ther. 2013, 33, 8–15. [Google Scholar] [CrossRef] [Green Version]
  31. Jhee, J.H.; Lee, S.; Park, Y.; Lee, S.E.; Kim, Y.A.; Kang, S.-W.; Kwon, J.-Y.; Park, J.T. Prediction model development of late-onset preeclampsia using machine learning-based methods. PLoS ONE 2019, 14, e221202. [Google Scholar] [CrossRef] [PubMed]
Figure 1. ROC curves of three models based on 12 risk factors (RF).
Figure 1. ROC curves of three models based on 12 risk factors (RF).
Life 13 01648 g001
Figure 2. ROC curves of three models based on 12 risk factors (RF) and 38 routine laboratory indicators.
Figure 2. ROC curves of three models based on 12 risk factors (RF) and 38 routine laboratory indicators.
Life 13 01648 g002
Table 1. Clinical characteristics and risk factors of Control group and early-onset PE group.
Table 1. Clinical characteristics and risk factors of Control group and early-onset PE group.
VariablesControl Group
(n = 709)
PE Group
(n = 91)
Statistical Magnitudep Value
Age, year35 (32–38) a33 (31–35.5) a4.32 b<0.001 *
Body mass index, BMI21.71 (19.93–23.81) a23.73 (23.12–24.43) a−4.68 b<0.001 *
Medical history, n (%)
Diabetes155 (21.86)29 (31.87)4.56 c0.033 *
Thrombotic disease2 (0.28)0 (0.00)<0.001 c0.998
Systemic lupus erythematosus (SLE)5 (0.71)3 (3.30)3.17 c0.075
Antiphospholipid syndrome (APS)20 (2.82)7 (7.69)7.31 c0.026 *
Kidney disease5 (0.71)6 (6.59)16.51 c<0.001 *
Obstructive sleep apnea hypopnea syndrome (OSAHS)0 (0.00)2 (2.20)8.05 c0.005 *
History of eclampsia or preeclampsia4 (0.56)6 (6.59)19.12 c<0.001 *
History of gestation, n (%)591 (83.36)51 (56.04)46.69 c<0.001 *
Primipara, n (%)238 (33.57)72 (79.12)74.97 c<0.001 *
Fertilization way, n (%)
Assisted reproductive technology (ART)119 (16.78)32 (35.16)16.61 c<0.001 *
Natural conception590 (83.22)59 (64.84)
a Expressed as the median (interquartile range). b Rank sum test: Z value c Chi-square value * p values were statistically different, p < 0.05.
Table 2. Routine laboratory indicators at 6–10 weeks of gestation between Control group and early-onset PE group.
Table 2. Routine laboratory indicators at 6–10 weeks of gestation between Control group and early-onset PE group.
VariableControl Group
(n = 709)
PE Group
(n = 91)
Statistical Magnitudep Value
Blood cell count
PLT, ×109/L244 (211–283) a256.5 (226.25–300.5) a−2.58 b<0.001 *
LYM, ×109/L1.79 (1.48–2.1) a1.98 (1.74–2.40) a−3.56 b<0.001 *
NEU, ×109/L5.24 (4.28–6.55) a6.25 (5.24–8.35) a−4.63 b<0.001 *
PLT/LYM135.19 (114.35–163.73) a125.46 (106.26–158.16) a1.51 b0.13
NEU/LYM2.95 (2.29–3.73) a2.97 (2.42–4.14) a−0.97 b0.33
Index of coagulation function
PT, s11.3 (11–11.7) a10.9 (10.5–11.3) a5.53 b<0.001 *
PTA, %91 (88–96) a97 (92–102) a−5.15 b<0.001 *
APTT, s31.2 (29.3–33.1) a29.6 (27.7–31.3) a4.30 b<0.001 *
TT, s13.5 (12.9–14) a13.4 (12.7–14) a1.06 b0.29
FIB, g/L3.27 (2.92–3.66) a3.51 (3.1–3.98) a−3.38 b<0.001 *
D-Dimer, mg/L0.15 (0.15–0.18) a0.15 (0.15–0.17) a1.13 b0.26
FDP, μg/mL2.5 (2.5–2.5) a2.5 (2.5–2.5) a−0.79 b0.43
Liver function index
ALT, U/L13 (10–18) a18 (13–27.5) a−4.67 b<0.001 *
AST, U/L16 (14–19) a19 (15–25) a−4.70 b<0.001 *
T-Bil, umol/L12 (10–14.8) a10 (8.9–12.6) a4.71 b<0.001 *
D-Bil, umol/L1.3 (1–1.8) a1.1 (0.6–1.8) a2.27 b0.02 *
TBA, umol/L1.6 (1–2.3) a1.4 (1–2.35) a0.54 b0.59
ALP, U/L49 (43–58) a53 (44–64) a−1.92 b0.05
TP, g/L72.4 (70.1–74.7) a72 (70.9–74.9) a−0.24 b0.81
ALB, g/L43.01 ± 2.42 c43.04 ± 3.00 c−0.08 d0.94
GGT, U/L14 (11–18) c17 (14–28) c−4.86 b<0.001 *
GLB, g/L29 (27–31) c30 (28–32) c−1.12 b0.26
Renal function index
UREA, mmol/L3.1 (2.7–3.7) a3.2 (2.8–3.7) a−1.09 b0.28
Cr, umol/L59 (54–63) a60 (54–65) a−0.87 b0.39
Ca, mmol/L2.3 (2.24–2.35) a2.33 (2.27–2.4) a−3.40 b<0.001 *
P, mmol/L1.232 ± 0.136 c1.228 ± 0.138 c0.21 d0.83
UA, umol/L210 (184–239) a238 (214–279.75) a−5.57 b<0.001 *
Cystatin C, mg/L0.62 (0.57–0.68) a0.59 (0.54–0.66) a1.18 b0.24
Blood lipid indicators
TCHO, mmol/L4.01 (3.56–4.48) a4.27 (3.87–4.79) a−3.09 b<0.001 *
TG, mmol/L1.06 (0.8–1.43) a1.39 (1.15–1.83) a−5.00 b<0.001 *
HDL-C, mmol/L1.43 (1.24–1.63) a1.33 (1.14–1.58) a1.40 b0.16
LDL-C, mmol/L2.16 (1.79–2.53) a2.38 (2.11–2.93) a−3.84 b<0.001 *
ApoA1, g/L1578 (1355–1893) a1508 (1293–2057) a0.10 b0.92
ApoB, g/L621 (519–738) a702 (520–828) a−2.35 b0.02 *
Lp(a), mg/L87 (44–188) a87 (42–188.5) a0.29 b0.77
sdLDL-C, mmol/L0.7 (0.56–0.87) a0.93 (0.77–1.00) a−3.33 b<0.001 *
Complement/inflammatory markers
CRP, mg/dL1.04 (0.5–2.6) a1.71 (0.895–3.68) a−2.68 b0.007 *
C1q, mg/L184 (166–211) a194 (171.75–224.75) a−2.11 b0.03 *
a Expressed as the median (interquartile range). b Rank sum test: Z value; c Expressed as mean ± standard deviation (SD); d Student’s t-test: t-value; * p-values were statistically different, p < 0.05.
Table 3. Delong test of ROCs differ between models (12 risk factors). Pairwise comparison of ROC curves.
Table 3. Delong test of ROCs differ between models (12 risk factors). Pairwise comparison of ROC curves.
SVM~Decision Tree
Difference between areas 0.08
Standard Error a0.044
95% Confidence Interval0.047 to 0.220
z statistic3.02
Significance levelp = 0.0025
Logistic Regression~Decision Tree
Difference between areas 0.04
Standard Error a0.025
95% Confidence Interval−0.009 to 0.088
z statistic1.57
Significance levelp = 0.1162
Logistic Regression~SVM
Difference between areas 0.12
Standard Error a0.034
95% Confidence Interval0.106 to 0.238
z statistic5.08
Significance levelp < 0.0001
Table 4. Delong test of ROCs differ between models (12 risk factors and 38 routine laboratory indicators).
Table 4. Delong test of ROCs differ between models (12 risk factors and 38 routine laboratory indicators).
SVM~Decision Tree
Difference between areas 0.07
Standard Error a0.027
95% Confidence Interval0.026 to 0.132
z statistic2.92
Significance levelp = 0.0035
Logistic Regression~Decision Tree
Difference between areas 0.09
Standard Error a0.033
95% Confidence Interval−0.050 to 0.077
z statistic0.41
Significance levelp = 0.6837
Logistic Regression~SVM
Difference between areas 0.07
Standard Error a0.023
95% Confidence Interval0.020 to 0.112
z statistic2.82
Significance levelp = 0.0049
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xue, Y.; Yang, N.; Gu, X.; Wang, Y.; Zhang, H.; Jia, K. Risk Prediction Model of Early-Onset Preeclampsia Based on Risk Factors and Routine Laboratory Indicators. Life 2023, 13, 1648. https://doi.org/10.3390/life13081648

AMA Style

Xue Y, Yang N, Gu X, Wang Y, Zhang H, Jia K. Risk Prediction Model of Early-Onset Preeclampsia Based on Risk Factors and Routine Laboratory Indicators. Life. 2023; 13(8):1648. https://doi.org/10.3390/life13081648

Chicago/Turabian Style

Xue, Yuting, Nan Yang, Xunke Gu, Yongqing Wang, Hua Zhang, and Keke Jia. 2023. "Risk Prediction Model of Early-Onset Preeclampsia Based on Risk Factors and Routine Laboratory Indicators" Life 13, no. 8: 1648. https://doi.org/10.3390/life13081648

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop