Next Article in Journal
Arterial and Venous Doppler Parameters in Fetal Growth Restriction: A Comparative Evaluation of Early- and Late-Onset Subtypes
Previous Article in Journal
Analytical Validation of MyProstateScore 2.0—Active Surveillance: A Urinary-Based Clinical RT-PCR Prostate Cancer Assay
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integrating Clinical Factors and Parity-Specific Models with Molecular Biomarkers to Better Predict the Risk of Preterm Birth in Asymptomatic Women

1
Sera Prognostics, Inc., Salt Lake City, UT 84109, USA
2
Department of Obstetrics & Gynecology, The University of Texas Medical Branch, Galveston, TX 77555, USA
*
Author to whom correspondence should be addressed.
Diagnostics 2026, 16(10), 1487; https://doi.org/10.3390/diagnostics16101487
Submission received: 17 March 2026 / Revised: 4 May 2026 / Accepted: 12 May 2026 / Published: 14 May 2026
(This article belongs to the Section Clinical Diagnosis and Prognosis)

Abstract

Background/Objectives: Prior spontaneous preterm birth (sPTB) and short cervical length predict the occurrence of sPTB with low sensitivity, highlighting the need for better detectors of at-risk pregnancies. PreTRM® is a validated, biomarker-based sPTB predictor that we aimed to improve in this study by developing models that incorporate parity and key risk factors. Methods: A Model was developed and validated through retrospective analysis of a cohort of singleton pregnancies that resulted in a live term or preterm birth (PTB) (n = 976). The Model’s ability to predict sPTB and PTB was assessed and its clinical utility compared to PreTRM. Results: The Model predicted sPTB with 77.1% sensitivity, 74.4% specificity, 21.4% positive predictive value (PPV) and 97.3% negative predictive value (NPV), an improvement over PreTRM’s sensitivity (75.0%) and PPV (14.6%), and a higher PPV than short cervix (16.2%). PTB was predicted by the Model with 76.8% sensitivity, 74.6% specificity, 31.6% PPV and 95.5% NPV. Relative to PreTRM, the Model achieved significantly higher area under the receiver operating characteristic curve (AUC) results when predicting whether a PTB or sPTB would be associated with a neonatal hospital stay ≥5 days (p = 0.001 for PTB; p = 0.044 for sPTB). The Model also achieved significantly higher sensitivity than PreTRM in predicting a ≥5 day hospital stay associated with PTB (p = 0.009) and higher sensitivity for a ≥5 day hospital stay associated with sPTB, showing improved clinical utility over PreTRM. Conclusions: The Model achieved substantially higher performance than standard of care risk predictors, and an improvement in clinical utility over PreTRM, demonstrating the robustness of the Model as a sPTB and PTB predictor.

1. Introduction

Spontaneous preterm birth (sPTB), defined as birth that occurs before 37 weeks of gestational age following spontaneous labor or rupture of the membranes, accounts for an estimated one third of infant deaths in the United States and is a major contributor to neonatal morbidities [1,2,3,4,5]. Various mitigating interventions have been proposed to prevent sPTB, including vaginal progesterone, low-dose aspirin, and/or focused care management [6,7,8,9]. The two best clinical indicators of pregnancies at risk of preterm delivery are prior sPTB and short cervix detected at 180/7 to 226/7 weeks gestation [10,11,12]. However, as the majority of women who will experience a sPTB do not have a prior sPTB or a short cervix, the impact of these clinical factors alone is limited [13,14,15]. Published findings and calculations made from the literature [6,13,16,17,18] (Supplemental Figure S1) indicate that prior sPTB and short cervix identify at-risk women with low sensitivity (6–11%), highlighting the need for sPTB predictors with better detection rates at similar positive predictive values (PPVs).
PreTRM is a validated commercially available risk predictor that is used in clinical practice. It is indicated for the early detection of asymptomatic women at higher risk of sPTB who lack identifiable risk factors, including a prior sPTB or short cervix. The test predicts a binary at risk/not at risk result based on the ratio of maternal serum insulin-like growth factor binding protein 4 (IGFBP4) to sex hormone binding globulin (SHBG) measured between 180/7 and 206/7 weeks of gestation [19,20]. IGFBP4 is important for maintaining adequate delivery of nutrients to the fetus and is a marker of placental insufficiency, while SHBG is thought to play a role in the response to placental inflammation via the regulation of androgen- and estrogen-driven pathways [21,22,23,24]. Placental insufficiency and inflammation are independent causative factors of preterm birth [25,26,27].
As stated in the “American College of Obstetricians and Gynecologists (ACOG) Practice Bulletin 234” [28], to be effective, a screening program for identifying women at highest risk for sPTB must have the following: (1) an adequately high detection rate, (2) appropriately low false positive rate, and (3) an acceptable PPV. Bulletin 234 endorses the use of short cervix and prior sPTB in screening pregnancies for risk of a sPTB. As both indicators demonstrate sensitivities in the 6–11% range and PPVs in the 16–22% range, these guidelines set the performance bar for screening tests like PreTRM. In its first validation, the PreTRM test predicted sPTB with 75.0% sensitivity and 74.0% specificity in a racially diverse, BMI-restricted population with blood drawn between 191/7 and 206/7 weeks of (134–146 days) gestation [19,20]. The test demonstrated clinical utility in two intervention studies that compared clinical utility between standard of care and intervention that was guided by the PreTRM test. In these studies, participants in the screen-guided intervention arm experienced fewer PTBs compared to the control arm, and the offspring of mothers in the intervention arm had shorter hospital stays, spent fewer days in the neonatal intensive care unit (NICU), and had lower morbidity [29,30]. In a separate cost-effectiveness analysis, screen-guided intervention resulted in a 20% reduction in total costs associated with PTB relative to costs incurred under standard of care [31].
Although PreTRM is a validated predictor of sPTB, it does not incorporate parity or clinical factors other than pre-pregnancy BMI in its algorithm. This can be limiting because the clinical history of prior pregnancies is an important predictor of sPTB, and risk prediction is impacted by parity regardless of prior history. Moreover, at the time of PreTRM’s development, multiparous patients with a prior sPTB were not considered in intended use because their care path included an existing intervention. Updated guidelines currently recommend vaginal progesterone for patients with a prior sPTB only if they develop premature cervical shortening [32,33]. For this reason, we aimed to develop an improved version of PreTRM that has parity-specific submodels integrated with known clinical risk factors, including prior sPTB. In addition to developing an improved test, the goals of this study were to compare the improved performance to the standard clinical indicators of prior sPTB and cervical length screening and compare the performance of the new test to PreTRM with respect to clinical utility based on how effectively the new test could identify preterm pregnancies destined for long hospital stays. Throughout the paper, we refer to this new test (i.e., the improved version of PreTRM) as the ‘Model’.

2. Materials and Methods

2.1. Cohorts and Study Design

This study was conducted through retrospective analysis of a large, multicenter cohort referred to as PAPR (Proteomic Assessment of Preterm Birth, NCT01371019), which was used to develop the original PreTRM Test [19]. The cohort analyzed in this study was restricted to singleton pregnancies with complete sets of demographic and clinical variables along with serum protein measurements. Only including pregnancies with full data sets ensured that the data analyzed in this study had no missing values. This was important because the training methods would not allow the differentiation of missing data from data that was not reported, which would impact submodel coefficients. Gestations with known or suspected major fetal abnormalities were excluded from the study.
We developed two submodels specific to nulliparous and multiparous pregnancies that each used PTB (spontaneous and medically induced) as the clinical outcome. Linear models were chosen for interpretability, simplicity of implementation, and limited size of available training data. A parity-specific modeling approach was adopted for two reasons. First, the literature shows that biology may differ between nulliparous and multiparous pregnancies due to alterations in uterine vascular architecture, immune tolerance, and placental development [34,35,36,37,38]. Second, parity-specific submodels accommodate predictors unique to specific parity groups (e.g., previous obstetric history), which maintains model parsimony. We chose this approach over the inclusion of interaction terms to ensure stable coefficient estimation and avoid overfitting, given the available sample size. Both parity-specific submodels included the log IGFBP4/SHBG ratio (the PreTRM Score) as a predictive factor. Clinical factors were selected for parity-specific submodel development (Table 1) based on the guidance in the ACOG Practice Bulletin 234 [28]. Nulliparous submodels that included combinations of maternal age, chronic hypertension, and BMI were rejected during feature selection due to overfitting.

2.2. Model Development

The PAPR dataset was temporally split into subsets by sample collection date with approximately 50% for training and 50% withheld for validation purposes (Figure 1). A temporal split was chosen to better test the ability for the Model to generalize to a new population by ensuring the validation subset included an external dimension. The development of the improved test had three phases. In the first phase, parity-specific submodels were developed on the first temporal half of the PAPR cohort, which included feature selection and bootstrap cross-validation. The second phase validated these parity-specific submodels on the second temporal half of the cohort to measure performance on an independent set of subjects. Once the parity-specific submodel coefficients were validated and demonstrating acceptable performance measurements, they were updated using the validated training pipeline (excluding feature selection steps) with the full cohort dataset to ensure that the best possible final Model would be deployed for clinical testing [39,40].
Parity-specific submodels were fit using ridge regression [41] and the PreTRM Score was standardized to values between 0 and 1 for compatibility with this method. Bootstrap iterations were generated with replacement, each requiring that at least 30% of data be withheld as a cross-validation testing set. The outcomes in the remaining 70% training portion were upsampled randomly with replacement to match the control count and allow fitting methods to prioritize the sensitivity and specificity equally. Average coefficients for the 200 bootstrap iterations were obtained as submodel parameters to reduce the risk of overfitting (Figure 2). Area under the receiver operating characteristic curves (AUCs) were generated for each model category. Models met acceptance criteria if they achieved an AUC > 0.5, a Wilcoxon rank-sum test result with a significance (alpha) of 0.05, and some overlap between AUC 95% confidence intervals for the testing and training sets. Overfitting was also assessed using the train–test AUC difference across 200 bootstrap iterations with a Student’s t-test to determine if the difference contained zero. Feature down-selection included fitting submodels with combinations of features. Any features that introduced significant overfitting or did not improve the mean test set performance were removed. Decision thresholds were chosen to meet a prespecified screen positive rate.

2.3. Model Validation

Submodels were validated on the 50% withheld temporal split of the cohort. A threshold enrichment and test of significance for the difference in PTB (spontaneous and medically induced) and non-PTB were computed. A secondary analysis was used to test a comprehensive set of submodels with different outcomes and corresponding thresholds. Correction for multiple testing was not applied during the secondary analysis. Once validated, submodel coefficients were retrained on the full dataset (i.e., training plus validation sets combined) using the validated training pipeline (Figure 2). Mean coefficients from 200 bootstrap iterations were obtained as the final submodel parameters. Decision thresholds were then selected to meet a prespecified screen positive rate and fine-tuned for PTB and sPTB outcomes. The final nulliparous and multiparous submodels are used for nulliparous and multiparous samples, respectively, as the Model (Figure 3).

2.4. Clinical Utility

The Model’s ability to predict a long neonatal hospital stay associated with sPTB or PTB was assessed on the full cohort. For this analysis, cases were defined as a neonatal hospital stay of at least 5 days that occurred in conjunction with either a PTB or sPTB. Non-cases were defined as any other outcome. The Model’s performance was measured by sensitivity and AUC for these clinical utility outcomes. Performance of the Model and the PreTRM Test were compared.

2.5. Statistical Analyses

Statistical power for all AUC calculations was determined as previously described [42]. For model training and validation, the ~50% temporal split of the cohort provided n = 476 for training and n = 500 for validation (total n = 976) to give a statistical power of 80% for AUC values ≥ 0.59 in the validation (≥0.64 for nulliparous and ≥0.62 for multiparous). For the entire cohort of n = 976, the requirement for a statistical power of 80% was AUC ≥ 0.56 (≥0.61 for nulliparous, and ≥0.58 for multiparous). Fisher’s Exact test was used to compute the significance of threshold enrichment with a significance (alpha) of 0.05. Stratification of all PTB and non-PTB was tested using AUC and the Wilcoxon rank-sum test with a significance (alpha) of 0.05. Model calibration was assessed to evaluate the agreement between predicted probabilities and observed outcomes. The degree of miscalibration was quantified using the Estimated Calibration Index (ECI), where an ECI value of 0 indicates perfect calibration and values below 0.2 represent a high degree of clinical reliability [43]. The statistical significance of differential AUC performance for clinical utility between PreTRM and the Model was computed using the DeLong test with significance (alpha) 0.05. Sensitivities for clinical utility measurements were compared between PreTRM and the Model using the McNemar test, with a significance level (alpha) of 0.05. All 95% confidence intervals (CI) were computed using the Wilson score method. All p-values associated with multiple measurements were adjusted using Bonferroni corrections. All analyses were performed using R version 4.4.2 [44].

3. Results

3.1. Study Participant Demographics

The cohort was comprised of nulliparous (n = 356) and multiparous (n = 620) women with blood drawn between 180/7 and 206/7 weeks of gestational age, the intended gestational age for the PreTRM Test (Table 2). Demographics of study participants grouped into the training and validation sets are shown in Supplementary Tables S1 and S2. A breakdown of cases by the prespecified risk factors to address confounding with cases is shown in Supplementary Table S3.

3.2. Performance of Parity-Specific Submodels

Performance estimates of the trained nulliparous and multiparous submodels were evaluated. Both submodels met acceptance criteria by achieving statistical significance (p < 0.05) and AUCs > 0.5 in both training and test measurements (Table 3).
Submodel calibration measurements showed little difference between observed and predicted probabilities. For predicted probabilities, estimated calibration indices (ECI) fell well within the established threshold for clinical reliability (ECI < 0.2) and high R2 values were observed, indicating that the submodels provide accurate estimates. Similar performance and calibration were demonstrated when the submodels were independently validated on the withheld temporal split of the dataset (Table 4, first and second rows). The performance of both submodels combined (i.e., the Model) was validated and met acceptance criteria (Table 4, third row).

3.3. Performance of the Model

After validation, the nulliparous and multiparous submodel coefficients were retrained on the combined training and validation groups of the cohort to produce the final Model parameters. The performance of the Model was then evaluated on two clinical subgroups in the cohort. The first subgroup included the full range of gestational age at blood draw (GABD) (126–146 days) and all BMIs, which is the full intended use population for the PreTRM test. The second subgroup was restricted to 136–146 days GABD and BMIs ≥ 21 because the PreTRM Score has been shown to have higher sensitivity under these conditions. The Model produced AUCs ranging from 0.71 to 0.79, achieving statistical significance for PTB and sPTB in both subgroups (p < 0.05) (Table 5). Corresponding Receiver Operating Characteristics (ROC) curves and calibration curves are shown in Supplementary Figure S2 and Figure S3, respectively.
We next evaluated the threshold performance of the Model. In the restricted subgroup, the Model detected sPTB with 77.1% sensitivity (95% CI, 59.4–89.0), 74.4% specificity (95% CI, 69.7–78.6), 21.4% PPV, and 97.3% NPV. In this same population, it detected PTB with 76.8% sensitivity (95% CI, 63.3–86.6), 74.6% specificity (95% CI, 69.7–78.9), 31.6% PPV, and 95.5% NPV (Table 6). The Model demonstrated similar specificities and lower sensitivities in the unrestricted subgroup. The number needed to screen (NNS) to detect either a single PTB or a single sPTB were both higher in the unrestricted subgroup than in the restricted subgroup. Per Supplementary Table S3, risk factors are associated with approximately 80% of sPTB and PTB cases (81.0% and 78.9% respectively); however, they are also associated with 58.5% of pregnancies, leading to an unacceptably high screen positive rate for risk factor-based prediction alone. The Model reaches similar sensitivity (up to 77.1% for sPTB in the restricted cohort) but with a lower false detection rate. Contingency tables are provided in Supplementary Table S4.
Importantly, the Model demonstrated similar PPV (21.4%) to the risk predictors currently used, prior sPTB (22.5%) and cervical length screening (16.2%), while having a far superior sensitivity (77.1% vs. 8% and 11%) (Table 7).

3.4. Assessment of Clinical Utility

We evaluated the clinical utility of the Model relative to PreTRM based on the ability to predict PTBs or sPTBs associated with a neonatal hospital stay lasting 5 or more days (length of stay (LOS) ≥ 5 days). This assessment is important because the PreTRM Test is currently being used clinically and has been shown to improve clinical outcomes [29,30]. This comparison can determine whether the Model is as good or better than PreTRM at selecting pregnancies that could benefit from intervention. PreTRM and the Model both demonstrated statistically significant AUC values for PTB predictions, and the Model was significant for sPTB when corrected for multiple comparisons (Table 8). The Model achieved significantly higher AUC and sensitivity than PreTRM (p = 0.001, and p = 0.009 respectively) when predicting LOS ≥ 5 days associated with PTB. Relative to PreTRM, the Model demonstrated a higher AUC and sensitivity in predicting LOS associated with sPTB, which was significant for AUC but did not reach statistical significance for sensitivity (p = 0.044, and p = 0.263 respectively). Overall, these findings show that the clinical utility demonstrated by the Model is an improvement over that of PreTRM.

4. Discussion

Survival and long-term health outcomes of premature infants can be improved by extending the length of gestation. Tools that reliably predict pregnancies at risk of preterm birth, and that can guide the use of treatment strategies shown to impact clinical outcomes, are a significant need [30]. Here, we aimed to improve the performance of the PreTRM Test by developing and validating a Model that integrated parity and other clinical factors with the log IGFBP4/SHBG ratio (PreTRM Score) to predict sPTB and PTB. The resulting Model outperformed PreTRM in sensitivity and PPV, important metrics for tests intended to screen patients for intervention. Importantly, in contrast to PreTRM, the Model demonstrated improved performance for a wider BMI range, and training and validation were based on a larger number of subjects (n = 476 and n = 500). Finally, in a direct comparison, the Model outperformed PreTRM in detecting pregnancies susceptible to sPTB and PTB. By showing that the Model performed better than PreTRM in these assessments, the primary goal of this study was achieved.
An additional goal of this study was to evaluate the Model’s performance as a screening tool for pregnancies at risk of sPTB relative to the performances of short cervix and prior sPTB screening methods. The Model predicted sPTB with a higher PPV (21.4%) than short cervix (16.4%) and a similar PPV to prior sPTB (22.5%) while having a substantially higher sensitivity (77%) than those calculated for both short cervix (8%) and prior sPTB (11%). These values indicate overall higher performance than the commonly used risk predictors, thus achieving the second study goal. Given that prior sPTB and short cervix are accepted screening tools, their sensitivities and PPVs are viewed as adequate. However, these methods still fail to detect the majority of sPTBs in asymptomatic women, which underscores the need for better screening tools. The high sensitivity and improved PPV demonstrated by the Model are particularly noteworthy as they bring the Model in line with ACOG clinical performance expectations and endorsed screening approaches [19,30,45].
We utilized the grading scale developed by ACOG to contextualize this study’s results within the cumulative body of evidence that supports the stated performance and clinical utility of PreTRM (Table 9) [46]. The original PreTRM test was developed and validated via retrospective analysis of the PAPR cohort using a nested case–control design [19]. In that study, PreTRM demonstrated 75% sensitivity and 74% specificity at a threshold of maximum accuracy. In a follow-up study, PreTRM was prospectively validated on an independent cohort in a blinded manner and demonstrated similar performance [45]. In a subsequent randomized clinical trial of test-guided management, clinical interventions applied in response to a positive PreTRM result significantly reduced the incidence of PTB [30]. The current study addressed the performance of PreTRM in the context of Bulletin 234.
ACOG guidelines specify that sPTB risk predictors must: (1) demonstrate adequately high sensitivity and acceptable PPV, (2) effectively reduce preterm birth, and (3) be feasible, cost-effective, and accessible to all patients [28]. Here, the Model demonstrated improved performance over PreTRM and commonly used screening tools, satisfying the first requirement of a sPTB risk predictor. The Model met the second requirement by demonstrating improved clinical utility over PreTRM, which was shown to effectively reduce NICU admission, NICU length of stay, and PTB prior to 35 weeks [30]. Finally, the Model fulfills the third requirement of being feasible, cost-effective, and accessible in low resource settings by being non-invasive, amenable to sample self-collection, and not requiring cold chain storage. Overall, the findings of this study, in conjunction with results of our previous studies, show that the Model satisfies all ACOG requirements for predictors of preterm delivery [16,17,47].
Although the results of this study are promising, we acknowledge its limitations. First, the study was retrospective, which confined our dataset to patients who had complete PreTRM testing results and clinical data and introduced the potential for results to be impacted by selection bias. Second, although the Model was validated on temporally distinct groups within the full dataset, the validation was internal. An independent validation is needed to more thoroughly characterize the Model’s behavior in its intended use population. Third, we acknowledge that, at present, the impact of Model-guided care on individuals across different healthcare systems, such as Medicaid and commercial insurance, remains unknown. However, since our latest clinical and health economics studies showed that the PreTRM Test significantly reduced PTBs, infant morbidity, and overall costs associated with PTB in various healthcare systems [30,31], we expect the Model to have similar effects, especially given its improved performance. Addressing these aspects of the Model will be a focus of our future work. Overall, our findings on the performance of the Model represent another important step towards improving risk stratifiers to help identify and prevent preterm deliveries and subsequent complications.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/diagnostics16101487/s1, Figure S1: Performance metrics of prior sPTB as a predictor of at-risk pregnancies; Table S1: Demographics of PAPR training set; Table S2: Demographics of PAPR validation set; Figure S2: ROC curves and corresponding AUC values for the Model; Figure S3: Calibration curves for sPTB and PTB outcomes; Table S3: Association of risk factors with outcomes; Table S4: Contingency table for the Model performance for the full cohort for all parities combined.

Author Contributions

Conceptualization, A.P., C.C., J.J.B., G.S. and P.K.; methodology, A.P., C.C. and J.J.B.; validation, A.P., C.C., H.B., M.B.B., J.J.B. and P.K.; formal analysis, A.P., C.C., H.B., M.B.B. and J.J.B.; investigation, A.P., C.C., J.J.B. and P.K.; data curation, A.P., C.C., H.B. and M.B.B.; writing—original draft preparation, A.P., C.C., L.J.S., J.J.B. and P.K.; writing—review and editing, A.P., C.C., H.B., M.B.B., L.J.S., J.J.B., G.S. and P.K.; supervision, J.J.B. and P.K.; project administration, L.J.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The PAPR study (NCT01371019) was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Boards/Ethics Committees of all 11 participating study sites. The following is a list of approvals: Intermountain Healthcare Institutional Review Board (1022138, 22 December 2010), Office of Human Research Ethics, University of North Carolina (11-1641, 13 September 2011), Institutional Review Board for Human Research, Office of Research Integrity, Medical University of South Carolina (Pro00012552, 11 October 2011), Maricopa Integrated Health System Institutional Review Board (2011-078, 18 October 2011), Western IRB (used by Ohio State University (20112063, 13 December 2011), Baystate Medical Center Institutional Review Board (BH-12-020, 23 December 2011), Research Integrity Office, Oregon Health & Science University (IRB00008131, 7 February 2012), San Diego Perinatal Center (20112063, 10 February 2012), University of Texas Medical Branch Institutional Review Board (12-046, 29 March 2012), Regional Obstetrical Consultants (20112063, 20 November 2012), and Christiana Care Institutional Review Board (32234, 28 December 2012). All necessary patient/participant consent was obtained, and the appropriate institutional forms have been archived.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors acknowledge the data science team at Sera Prognostics, Inc., all study coordinators, and all study participants.

Conflicts of Interest

A.P., C.C., H.B., M.B.B., J.B., and P.K. are employees of Sera Prognostics, Inc. L.S. is a consultant for Sera Prognostics, Inc. All other authors have no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AUCArea Under the Receiver Operating Characteristic Curve
BMIBody Mass Index
CIConfidence Interval
ECIEstimated Calibration Index
GABDGestational Age at Blood Draw
IGFBP4Insulin-like Growth Factor Binding Protein 4
LOSLength of (Hospital) Stay
LRLikelihood Ratio
NICUNeonatal Intensive Care Unit
NNLOSNeonatal Length of (Hospital) Stay
NNSNumber Needed to Screen
NPVNegative Predictive Value
PAPRProteomic Assessment of Preterm Birth
PEPreeclampsia
PPVPositive Predictive Value
PTBPreterm Birth
SDStandard Deviation
SHBGSex Hormone Binding Globulin
sPTBSpontaneous Preterm Birth

References

  1. Vogel, J.P.; Chawanpaiboon, S.; Moller, A.B.; Watananirun, K.; Bonet, M.; Lumbiganon, P. The global epidemiology of preterm birth. Best Pract. Res. Clin. Obstet. Gynaecol. 2018, 52, 3–12. [Google Scholar] [CrossRef] [PubMed]
  2. American College of Obstetricians and Gynecologists. Practice Bulletin No. 171: Management of Preterm Labor. Obstet. Gynecol. 2016, 128, e155–e164. [Google Scholar] [CrossRef]
  3. Allin, M.; Rooney, M.; Griffiths, T.; Cuddy, M.; Wyatt, J.; Rifkin, L.; Murray, R. Neurological abnormalities in young adults born preterm. J. Neurol. Neurosurg. Psychiatry 2006, 77, 495–499. [Google Scholar] [CrossRef]
  4. Cao, G.; Liu, J.; Liu, M. Global, Regional, and National Incidence and Mortality of Neonatal Preterm Birth, 1990–2019. JAMA Pediatr. 2022, 176, 787–796. [Google Scholar] [CrossRef]
  5. Fernandez de Gamarra-Oca, L.; Ojeda, N.; Ontanon, J.M.; Loureiro-Gonzalez, B.; Gomez-Gastiasoro, A.; Pena, J.; Ibarretxe-Bilbao, N.; Garcia-Guerrero, M.A.; Zubiaurre-Elorza, L. Long-term brain structural and cognitive outcomes in a low-risk preterm-born sample. Sci. Rep. 2024, 14, 21110. [Google Scholar] [CrossRef]
  6. Hassan, S.S.; Romero, R.; Vidyadhari, D.; Fusey, S.; Baxter, J.K.; Khandelwal, M.; Vijayaraghavan, J.; Trivedi, Y.; Soma-Pillay, P.; Sambarey, P.; et al. Vaginal progesterone reduces the rate of preterm birth in women with a sonographic short cervix: A multicenter, randomized, double-blind, placebo-controlled trial. Ultrasound Obstet. Gynecol. 2011, 38, 18–31. [Google Scholar] [CrossRef]
  7. Fonseca, E.B.; Celik, E.; Parra, M.; Singh, M.; Nicolaides, K.H. Fetal Medicine Foundation Second Trimester Screening Group. Progesterone and the risk of preterm birth among women with a short cervix. N. Engl. J. Med. 2007, 357, 462–469. [Google Scholar] [CrossRef] [PubMed]
  8. Hoffman, M.K.; Goudar, S.S.; Kodkany, B.S.; Metgud, M.; Somannavar, M.; Okitawutshu, J.; Lokangaka, A.; Tshefu, A.; Bose, C.L.; Mwapule, A.; et al. Low-dose aspirin for the prevention of preterm delivery in nulliparous women with a singleton pregnancy (ASPIRIN): A randomised, double-blind, placebo-controlled trial. Lancet 2020, 395, 285–293. [Google Scholar] [CrossRef] [PubMed]
  9. Garite, T.J.; Manuck, T.A. Should case management be considered a component of obstetrical interventions for pregnancies at risk of preterm birth? Am. J. Obstet. Gynecol. 2023, 228, 430–437. [Google Scholar] [CrossRef]
  10. Romero, R.; Conde-Agudelo, A.; Da Fonseca, E.; O’Brien, J.M.; Cetingoz, E.; Creasy, G.W.; Hassan, S.S.; Nicolaides, K.H. Vaginal progesterone for preventing preterm birth and adverse perinatal outcomes in singleton gestations with a short cervix: A meta-analysis of individual patient data. Am. J. Obstet. Gynecol. 2018, 218, 161–180. [Google Scholar] [CrossRef]
  11. Cobo, T.; Kacerovsky, M.; Jacobsson, B. Risk factors for spontaneous preterm delivery. Int. J. Gynaecol. Obstet. 2020, 150, 17–23. [Google Scholar] [CrossRef]
  12. Gulersen, M.; Divon, M.Y.; Krantz, D.; Chervenak, F.A.; Bornstein, E. The risk of spontaneous preterm birth in asymptomatic women with a short cervix (≤25 mm) at 23–28 weeks’ gestation. Am. J. Obstet. Gynecol. MFM 2020, 2, 100059. [Google Scholar] [CrossRef]
  13. Petrini, J.R.; Callaghan, W.M.; Klebanoff, M.; Green, N.S.; Lackritz, E.M.; Howse, J.L.; Schwarz, R.H.; Damus, K. Estimated effect of 17 alpha-hydroxyprogesterone caproate on preterm birth in the United States. Obstet. Gynecol. 2005, 105, 267–272. [Google Scholar] [CrossRef] [PubMed]
  14. Esplin, M.S.; Elovitz, M.A.; Iams, J.D.; Parker, C.B.; Wapner, R.J.; Grobman, W.A.; Simhan, H.N.; Wing, D.A.; Haas, D.M.; Silver, R.M.; et al. Predictive Accuracy of Serial Transvaginal Cervical Lengths and Quantitative Vaginal Fetal Fibronectin Levels for Spontaneous Preterm Birth Among Nulliparous Women. JAMA 2017, 317, 1047–1056. [Google Scholar] [CrossRef] [PubMed]
  15. Berghella, V.; Saccone, G. Cervical assessment by ultrasound for preventing preterm delivery. Cochrane Database Syst. Rev. 2019, 9, CD007235. [Google Scholar] [CrossRef]
  16. Esplin, M.S.; O’Brien, E.; Fraser, A.; Kerber, R.A.; Clark, E.; Simonsen, S.E.; Holmgren, C.; Mineau, G.P.; Varner, M.W. Estimating recurrence of spontaneous preterm delivery. Obstet. Gynecol. 2008, 112, 516–523. [Google Scholar] [CrossRef]
  17. Biggio, J.; Society for Maternal-Fetal Medicine (SMFM); SMFM Publications Committee. SMFM Consult Series #70: Management of short cervix in individuals without a history of spontaneous preterm birth. Am. J. Obstet. Gynecol. 2024, 231, B2–B13. [Google Scholar] [CrossRef] [PubMed]
  18. Martin, J.A.; Hamilton, B.E.; Sutton, P.D.; Ventura, S.J.; Menacker, F.; Munson, M.L. Births: Final data for 2002. Natl. Vital. Stat. Rep. 2003, 52, 10. [Google Scholar]
  19. Saade, G.R.; Boggess, K.A.; Sullivan, S.A.; Markenson, G.R.; Iams, J.D.; Coonrod, D.V.; Pereira, L.M.; Esplin, M.S.; Cousins, L.M.; Lam, G.K.; et al. Development and validation of a spontaneous preterm delivery predictor in asymptomatic women. Am. J. Obstet. Gynecol. 2016, 214, 633.e1–633.e24. [Google Scholar] [CrossRef]
  20. Markenson, G.R.; Saade, G.R.; Laurent, L.C.; Heyborne, K.D.; Coonrod, D.V.; Schoen, C.N.; Baxter, J.K.; Haas, D.M.; Longo, S.; Grobman, W.A.; et al. Performance of a proteomic preterm delivery predictor in a large independent prospective cohort. Am. J. Obstet. Gynecol. MFM 2020, 2, 100140. [Google Scholar] [CrossRef]
  21. Forbes, K.; Westwood, M. Maternal growth factor regulation of human placental development and fetal growth. J. Endocrinol. 2010, 207, 1–16. [Google Scholar] [CrossRef]
  22. Qiu, Q.; Bell, M.; Lu, X.; Yan, X.; Rodger, M.; Walker, M.; Wen, S.W.; Bainbridge, S.; Wang, H.; Gruslin, A. Significance of IGFBP-4 in the development of fetal growth restriction. J. Clin. Endocrinol. Metab. 2012, 97, E1429–E1439. [Google Scholar] [CrossRef]
  23. Nawathe, A.R.; Christian, M.; Kim, S.H.; Johnson, M.; Savvidou, M.D.; Terzidou, V. Insulin-like growth factor axis in pregnancies affected by fetal growth disorders. Clin. Epigenet. 2016, 8, 11. [Google Scholar] [CrossRef]
  24. Gladych-Macioszek, A.; Ozegowska, K.; Radzicka-Mularczyk, S.; Sibiak, R.; Desoye, G.; Wender-Ozegowska, E. Insulin-Like Growth Factors, Binding Proteins and Their Role in Pregnancy in Patients With Diabetes. J. Diabetes Res. 2025, 2025, 3330482. [Google Scholar] [CrossRef] [PubMed]
  25. Preston, M.; Hall, M.; Shennan, A.; Story, L. The role of placental insufficiency in spontaneous preterm birth: A literature review. Eur. J. Obstet. Gynecol. Reprod. Biol. 2024, 295, 136–142. [Google Scholar] [CrossRef]
  26. Habelrih, T.; Augustin, T.L.; Mauffette-Whyte, F.; Ferri, B.; Sawaya, K.; Cote, F.; Gallant, M.; Olson, D.M.; Chemtob, S. Inflammatory mechanisms of preterm labor and emerging anti-inflammatory interventions. Cytokine Growth Factor Rev. 2024, 78, 50–63. [Google Scholar] [CrossRef] [PubMed]
  27. Cappelletti, M.; Della Bella, S.; Ferrazzi, E.; Mavilio, D.; Divanovic, S. Inflammation and preterm birth. J. Leukoc. Biol. 2016, 99, 67–78. [Google Scholar] [CrossRef]
  28. American College of Obstetricians and Gynecologists. ACOG Practice Bulletin No. 234. Obstet. Gynecol. 2021, 138, e65–e90. [Google Scholar] [CrossRef]
  29. Hoffman, M.K.; Kitto, C.; Zhang, Z.; Shi, J.; Walker, M.G.; Shahbaba, B.; Ruhstaller, K. Neonatal Outcomes after Maternal Biomarker-Guided Preterm Birth Intervention: The AVERT PRETERM Trial. Diagnostics 2024, 14, 1462. [Google Scholar] [CrossRef]
  30. Iriye, B.K.; O’Brien, J.M.; Ennen, C.S.; Barrilleaux, P.S.; Berkin, J.A.; Palatnik, A.; Son, M.; Gyamfi-Bannerman, C.; McDonnold, M.; Markenson, G.R.; et al. Neonatal impact of maternal biomarker screening for risk of preterm birth with targeted interventions (PRIME): A multicenter, randomized, controlled trial. Pregnancy 2026, 2, e70202. [Google Scholar] [CrossRef]
  31. Grabner, M.; Burchard, J.; Nguyen, C.; Chung, H.; Gangan, N.; Boniface, J.J.; Zupancic, J.A.F.; Stanek, E. Cost-Effectiveness of a Proteomic Test for Preterm Birth Prediction. Clin. Outcomes Res. 2021, 13, 809–820. [Google Scholar] [CrossRef]
  32. American College of Obstetricians and Gynecologists. Updated Clinical Guidance for the Use of Progestogen Supplementation for the Prevention of Recurrent Preterm Birth. 2025. Available online: https://www.acog.org/clinical/clinical-guidance/practice-advisory/articles/2023/04/updated-guidance-use-of-progestogen-supplementation-for-prevention-of-recurrent-preterm-birth (accessed on 20 March 2026).
  33. Society for Maternal-Fetal Medicine (SMFM); SMFM Publications Committee. Society for Maternal-Fetal Medicine Statement: Response to the Food and Drug Administration’s withdrawal of 17-alpha hydroxyprogesterone caproate. Am. J. Obstet. Gynecol. 2023, 229, B2–B6. [Google Scholar] [CrossRef]
  34. Ballering, G.; Leijnse, J.; Eijkelkamp, N.; Peeters, L.; de Heus, R. First-trimester placental vascular development in multiparous women differs from that in nulliparous women. J. Matern. Fetal Neonatal Med. 2018, 31, 209–215. [Google Scholar] [CrossRef]
  35. Haizler-Cohen, L.; Wang, G.; Habtewold, T.D.; Wijesiriwardhana, P.; Grantz, K.L.; Tekola-Ayele, F. Placental gene expression signatures based on maternal parity. Placenta 2026, 174, 214–224. [Google Scholar] [CrossRef]
  36. Ding, W.; Lau, S.L.; Wang, C.C.; Zhang, T.; Getsko, O.; Lee, N.M.W.; Chim, S.S.C.; Wong, C.K.; Leung, T.Y. Dynamic changes in maternal immune biomarkers during labor in nulliparous vs multiparous women. Am. J. Obstet. Gynecol. 2022, 227, 627.e1–627.e23. [Google Scholar] [CrossRef] [PubMed]
  37. Wolf, M.; Shah, A.; Lam, C.; Martinez, A.; Smirnakis, K.V.; Epstein, F.H.; Taylor, R.N.; Ecker, J.L.; Karumanchi, S.A.; Thadhani, R. Circulating levels of the antiangiogenic marker sFLT-1 are increased in first versus second pregnancies. Am. J. Obstet. Gynecol. 2005, 193, 16–22. [Google Scholar] [CrossRef]
  38. Quezada, M.S.; Rodriguez-Calvo, J.; Villalain, C.; Gomez-Arriaga, P.I.; Galindo, A.; Herraiz, I. sFlt-1/PlGF ratio and timing of delivery in early-onset fetal growth restriction with antegrade umbilical artery flow. Ultrasound Obstet. Gynecol. 2020, 56, 549–556. [Google Scholar] [CrossRef]
  39. Wilimitis, D.; Walsh, C.G. Practical Considerations and Applied Examples of Cross-Validation for Model Development and Evaluation in Health Care: Tutorial. JMIR AI 2023, 2, e49023. [Google Scholar] [CrossRef] [PubMed]
  40. Steyerberg, E.W. Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating; Springer: New York, NY, USA, 2009. [Google Scholar]
  41. Hastie, T.; Tibshirani, R.; Friedman, J. The Elembents of Statistical Learning; Springer: New York, NY, USA, 2009. [Google Scholar]
  42. Obuchowski, N.A.; McClish, D.K. Sample size determination for diagnostic accuracy studies involving binormal ROC curve indices. Stat. Med. 1997, 16, 1529–1542. [Google Scholar] [CrossRef]
  43. Van Calster, B.; Nieboer, D.; Vergouwe, Y.; De Cock, B.; Pencina, M.J.; Steyerberg, E.W. A calibration hierarchy for risk models was defined: From utopia to empirical data. J. Clin. Epidemiol. 2016, 74, 167–176. [Google Scholar] [CrossRef]
  44. R Foundation for Statistical Computing. R: A Language and Environment for Statistical Computing. 2021. Available online: https://www.R-project.org/ (accessed on 3 February 2026).
  45. Burchard, J.; Polpitiya, A.D.; Fox, A.C.; Randolph, T.L.; Fleischer, T.C.; Dufford, M.T.; Garite, T.J.; Critchfield, G.C.; Boniface, J.J.; Saade, G.R.; et al. Clinical Validation of a Proteomic Biomarker Threshold for Increased Risk of Spontaneous Preterm Birth and Associated Clinical Outcomes: A Replication Study. J. Clin. Med. 2021, 10, 5088. [Google Scholar] [CrossRef] [PubMed]
  46. ACOG Practice Bulletin. Clinical management guidelines for obstetrician-gynecologists. No. 82 June 2007. Management of herpes in pregnancy. Obstet. Gynecol. 2007, 109, 1489–1498. [Google Scholar] [CrossRef]
  47. Mercer, B.M.; Goldenberg, R.L.; Das, A.; Moawad, A.H.; Iams, J.D.; Meis, P.J.; Copper, R.L.; Johnson, F.; Thom, E.; McNellis, D.; et al. The preterm prediction study: A clinical risk assessment system. Am. J. Obstet. Gynecol. 1996, 174, 1885–1893; discussion 1893–1895. [Google Scholar] [CrossRef]
Figure 1. Development process of the Model. Development of the Model included three phases: (1) training of the parity-specific submodels, (2) validation of the parity-specific submodels, and (3) retraining the parity-specific submodels using all available data to obtain the Model. Nullips denotes the nulliparous samples and multips denotes the multiparous samples.
Figure 1. Development process of the Model. Development of the Model included three phases: (1) training of the parity-specific submodels, (2) validation of the parity-specific submodels, and (3) retraining the parity-specific submodels using all available data to obtain the Model. Nullips denotes the nulliparous samples and multips denotes the multiparous samples.
Diagnostics 16 01487 g001
Figure 2. The Model training pipeline. The pipeline involved 200 bootstrap iterations, and in each iteration data was split into training and testing subsets. Cases in the training subset were upsampled to balance the data (50/50), and ridge regression was performed. The regression model was recalibrated and assessed with the testing set of the samples. Average coefficients from the 200 bootstraps were computed as the final set of submodel coefficients. The testing set performance summary was also computed.
Figure 2. The Model training pipeline. The pipeline involved 200 bootstrap iterations, and in each iteration data was split into training and testing subsets. Cases in the training subset were upsampled to balance the data (50/50), and ridge regression was performed. The regression model was recalibrated and assessed with the testing set of the samples. Average coefficients from the 200 bootstraps were computed as the final set of submodel coefficients. The testing set performance summary was also computed.
Diagnostics 16 01487 g002
Figure 3. The Model workflow.
Figure 3. The Model workflow.
Diagnostics 16 01487 g003
Table 1. Clinical factors included in the Model, according to parity.
Table 1. Clinical factors included in the Model, according to parity.
Chronic DiabetesBMI ≥ 30Maternal Age ≥ 35 YearsPrior sPTBPrior PEChronic Hypertension
Nulliparous     
Multiparous
BMI = Body mass index, sPTB = spontaneous preterm birth, PE = preeclampsia.
Table 2. Cohort outcomes and demographics.
Table 2. Cohort outcomes and demographics.
Clinical VariableNulliparousMultiparousCombined
NTotal356620976
N(%) sPTB < 37 Outcome27 (7.6%)57 (9.2%)84 (8.6%)
N(%) PTB < 37 Outcome49 (13.8%)93 (15%)142 (14.5%)
N(%) sPTB < 37 & NNLOS ≥ 5 days13 (3.7%)23 (3.7%)36 (3.7%)
N(%) PTB < 37 & NNLOS ≥ 5 days29 (8.1%)36 (5.8%)65 (6.7%)
N(%) Chronic Diabetes24 (6.7%)31 (5.0%)55 (5.6%)
N(%) Chronic Hypertension17 (4.8%)44 (7.1%)61 (6.3%)
N(%) Prior PEN/A60 (9.7%)60 (6.1%)
N(%) Prior PTBN/A165 (26.6%)165 (16.9%)
NNLOS (mean, median, SD, min, max)4.07, 3, 6.99, 0, 994.06, 3, 7.93, 0, 884.06, 3, 7.6, 0, 99
N(%) NNLOS ≥ 5 days53 (14.9%)72 (11.6%)125 (12.8%)
Maternal Age (mean, median, SD, min, max)25.29, 24, 5.81, 18, 4628.98, 28, 5.59, 18, 4427.64, 27, 5.94, 18, 46
N(%) Maternal Age ≥ 3531 (8.7%)122 (19.7%)153 (15.7%)
BMI (mean, median, SD, min, max)27.45, 25.8, 7.23, 15.2, 58.129.26, 28.3, 7.59, 15.8, 75.628.6, 27.3, 7.51, 15.2, 75.6
N(%) BMI ≥ 30108 (30.3%)248 (40.0%)356 (36.5%)
N(%) BMI ≥ 21301 (84.6%)554 (89.4%)855 (87.6%)
N(%) White257 (72.2%)450 (72.6%)707 (72.4%)
N(%) Black68 (19.1%)101 (16.3%)169 (17.3%)
N(%) Asian6 (1.7%)7 (1.1%)13 (1.3%)
N(%) Hispanic98 (27.5%)257 (41.5%)355 (36.4%)
N(%) Other Race25 (7.0%)62 (10.0%)87 (8.9%)
PE = Preeclampsia; NNLOS = neonatal length of (hospital) stay; N/A = not applicable.
Table 3. Submodel performance and calibration measurements in training/test (bootstrap cross validation) splits of the ~50% training split of the cohort with PTB as the outcome.
Table 3. Submodel performance and calibration measurements in training/test (bootstrap cross validation) splits of the ~50% training split of the cohort with PTB as the outcome.
Model Training AUC (95% CI)Mean AUC of Cross Validation MeasurementsWilcoxon p-Value 1ECIR2
Nulliparous submodel0.74 (0.62–0.86)0.73<0.0010.120.99
Multiparous submodel0.78 (0.71–0.84)0.74<0.0010.070.99
1 p-value showing significance of AUCs (p < 0.05) with a Bonferroni correction for multiple testing. ECI = Estimated calibration index.
Table 4. Model performance and calibration measurements in the ~50% validation subset of the cohort with PTB as the outcome.
Table 4. Model performance and calibration measurements in the ~50% validation subset of the cohort with PTB as the outcome.
Model AUCAUC 95% CIWilcoxon
p-Value 1
ECIR2
Nulliparous submodel0.710.60–0.82<0.0010.110.88
Multiparous submodel0.720.63–0.80<0.0010.090.72
Model (Combined Parity)0.700.64–0.76<0.0010.100.85
1 p-value showing significance of AUCs (p < 0.05) with a Bonferroni correction for multiple testing. ECI = Estimated calibration index.
Table 5. Model AUCs for all parities combined for the full cohort.
Table 5. Model AUCs for all parities combined for the full cohort.
SubgroupOutcomeAUCAUC 95% CIWilcoxon
p-Value 1
Full GABD range, all BMIssPTB0.710.65–0.77<0.001
PTB0.730.69–0.78<0.001
Restricted GABD range and BMIsPTB0.770.70–0.85<0.001
PTB0.790.72–0.85<0.001
1 p-value showing significance of AUCs (p < 0.05) with a Bonferroni correction for multiple testing.
Table 6. Model performance on the full cohort for all parities combined. The 95% confidence intervals were computed using binomial proportions.
Table 6. Model performance on the full cohort for all parities combined. The 95% confidence intervals were computed using binomial proportions.
SubgroupOutcomeFisher’s p-Value 1Sensitivity
(95% CI)
Specificity
(95% CI)
PPV
(95% CI)
NPV
(95% CI)
LR+
(95% CI)
LR−
(95% CI)
NNS (95% CI)
Full GABD range, all BMIssPTB<0.00161.9%
(50.6–72.1)
76.7%
(73.7–79.4)
20.0%
(16.9–23.5)
95.5%
(94.2–96.6)
2.65
(2.16–3.26)
0.5
(0.38–0.65)
19
(16–23)
PTB<0.00163.4%
(54.8–71.2)
76.7%
(73.7–79.5)
31.7%
(28.0–35.6)
92.5%
(90.8–93.9)
2.72
(2.29–3.25)
0.48
(0.38–0.59)
11
(10–13)
Restricted GABD range and BMIsPTB<0.00177.1%
(59.4–89.0)
74.4%
(69.7–78.6)
21.4%
(17.6–25.9)
97.3%
(95.1–98.5)
3.02
(2.35–3.86)
0.31
(0.17–0.57)
16
(14–20)
PTB<0.00176.8%
(63.3–86.6)
74.6%
(69.7–78.9)
31.6%
(26.9–36.7)
95.5%
(92.9–97.1)
3.02
(2.41–3.79)
0.31
(0.19–0.5)
10
(9–12)
1 p-values showing the significance of the chosen threshold (p < 0.05) with a Bonferroni correction for multiple testing. PPV = positive predictive value; NPV = negative predictive value; LR+ = Positive Likelihood Ratio; LR− = Negative Likelihood Ratio; NNS = number needed to screen to detect one (s)PTB.
Table 7. Comparisons between the Model, the PreTRM Test, and commonly accepted clinical predictors of spontaneous preterm birth (sPTB) using key performance metrics as specified by ACOG (Bulletin 234). NNS was calculated using the PAPR sPTB prevalence.
Table 7. Comparisons between the Model, the PreTRM Test, and commonly accepted clinical predictors of spontaneous preterm birth (sPTB) using key performance metrics as specified by ACOG (Bulletin 234). NNS was calculated using the PAPR sPTB prevalence.
sPTB PredictorSensitivity PPV NNSLR+LR−
Model77.1% 21.4%163.00.3
PreTRM Test75.0% [19]14.6% [19]162.9 [28]0.3 [28]
Cervical Length Screening 8% [6,17,28]16.2% [6,17,28]1513.7 [14]0.9 [14]
Prior sPTB 11% [13,17,18]22.5% [13,17,18]1093.4 [13] *0.9 [13] *
* Numbers were computed based on the calculation table reported in [18]. Full calculations are shown in Supplementary Figure S1. PPV = positive predictive value; NNS = number needed to screen to detect one sPTB; LR+ = Positive Likelihood Ratio; LR− = Negative Likelihood Ratio.
Table 8. Performance comparisons between the Model and the PreTRM Test in predicting a clinical outcome. Note that LOS ≥ 5 refers to a neonatal hospital stay of 5 or more days.
Table 8. Performance comparisons between the Model and the PreTRM Test in predicting a clinical outcome. Note that LOS ≥ 5 refers to a neonatal hospital stay of 5 or more days.
ModelPreTRM TestDeLong
p-Value 2
McNemar
p-Value 3
MetricAUC
(95% CI)
Wilcoxon
p-Value 1
Sensitivity
(95% CI)
Specificity
(95% CI)
AUC
(95% CI)
Wilcoxon
p-Value 1
Sensitivity
(95% CI)
Specificity
(95% CI)
LOS ≥ 5 + sPTB0.74
(0.66–0.82)
<0.00169.4%
(51.7–83.1)
75.0%
(72.1–77.7)
0.61
(0.50–0.72)
0.05455.6
(38.3–71.7)
67.9%
(64.8–70.8)
0.0440.263
LOS ≥ 5 + PTB0.76
(0.71–0.82)
<0.00172.3%
(59.6–82.3)
74.0%
(71.0–76.8)
0.62
(0.54–0.70)
0.00252.3
(39.6–64.7)
68.4%
(65.2–71.4)
<0.0010.009
1 p-value showing significance of AUC (p < 0.05) with a Bonferroni correcting for multiple testing. 2 p-values comparing AUCs (p < 0.05) between the Model and PreTRM with a Bonferroni correction for multiple testing. 3 p-values comparing sensitivities (p < 0.05) between the Model and PreTRM with Bonferroni correction for multiple testing.
Table 9. A self-assessment of the evidence supporting the performance and clinical utility of the PreTRM test, using the grading scale developed by ACOG.
Table 9. A self-assessment of the evidence supporting the performance and clinical utility of the PreTRM test, using the grading scale developed by ACOG.
Study ScenarioProgressive GradePreTRM Evidence
Retrospective case–control study (PAPR) [19]Grade C (consensus or very limited/poor quality data)Development of PreTRM
Internal, retrospective validation of PreTRM
Prospective blinded validation in an independent cohort (TREETOP) [45]Grade B (consistent evidence but still limited)External, prospective validation of PreTRM
Randomized clinical trial of test-guided management (PRIME) [30]Grade A (strong recommendation based on consistent RCT and meta-analysis evidence)PreTRM guides treatment that reduces sPTB
Performance Enhancement (the Model presented in the current study)Grade A (meets performance requirements in ACOG practice Bulletin 234)Enhanced PPV and outcome prediction
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Polpitiya, A.; Cox, C.; Butler, H.; Badsha, M.B.; Sommerville, L.J.; Boniface, J.J.; Saade, G.; Kearney, P. Integrating Clinical Factors and Parity-Specific Models with Molecular Biomarkers to Better Predict the Risk of Preterm Birth in Asymptomatic Women. Diagnostics 2026, 16, 1487. https://doi.org/10.3390/diagnostics16101487

AMA Style

Polpitiya A, Cox C, Butler H, Badsha MB, Sommerville LJ, Boniface JJ, Saade G, Kearney P. Integrating Clinical Factors and Parity-Specific Models with Molecular Biomarkers to Better Predict the Risk of Preterm Birth in Asymptomatic Women. Diagnostics. 2026; 16(10):1487. https://doi.org/10.3390/diagnostics16101487

Chicago/Turabian Style

Polpitiya, Ashoka, Charles Cox, Heather Butler, Md. Bahadur Badsha, Laura J. Sommerville, J. Jay Boniface, George Saade, and Paul Kearney. 2026. "Integrating Clinical Factors and Parity-Specific Models with Molecular Biomarkers to Better Predict the Risk of Preterm Birth in Asymptomatic Women" Diagnostics 16, no. 10: 1487. https://doi.org/10.3390/diagnostics16101487

APA Style

Polpitiya, A., Cox, C., Butler, H., Badsha, M. B., Sommerville, L. J., Boniface, J. J., Saade, G., & Kearney, P. (2026). Integrating Clinical Factors and Parity-Specific Models with Molecular Biomarkers to Better Predict the Risk of Preterm Birth in Asymptomatic Women. Diagnostics, 16(10), 1487. https://doi.org/10.3390/diagnostics16101487

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop