1. Introduction
Female sexual function is a multidimensional component of health and quality of life, commonly conceptualized across desire, arousal, lubrication, orgasm, satisfaction, and pain. In reproductive health research, these domains are frequently quantified with the Female Sexual Function Index (FSFI), which provides both domain-level information and a total score suited for longitudinal cohort tracking [
1]. Although postpartum sexual concerns are common, they remain under-discussed in routine care, and structured measurement can facilitate normalization, counseling, and timely referral [
2,
3].
Across pregnancy and the first postpartum year, sexual function typically shows a predictable time course: progressive decline into late pregnancy, a nadir in early postpartum, and gradual recovery thereafter [
4,
5]. However, recovery is heterogeneous; some women return to near pre-pregnancy levels by late postpartum, while others experience persistent difficulties in specific domains—most commonly pain, low desire, and reduced satisfaction [
6,
7,
8,
9,
10]. This heterogeneity suggests that average trends may not capture clinically meaningful subgroups with distinct recovery pathways.
Obstetric factors plausibly contribute through tissue injury and pain mechanisms. Perineal trauma, episiotomy, and higher-degree lacerations have been linked to postpartum dyspareunia and delayed sexual recovery, with evidence suggesting graded relationships between trauma severity and persistence of symptoms at 6–12 months postpartum [
5,
8,
9]. Yet obstetric exposures alone do not fully explain late-postpartum sexual outcomes, and mode of delivery has not consistently shown a protective long-term effect on global sexual function across studies [
4,
6].
Psychosocial context appears particularly important for distinguishing recovery from persistent dysfunction. Depressive symptoms are consistently associated with lower FSFI scores in postpartum cohorts, and relationship quality has emerged as a robust correlate of sexual satisfaction and function across the transition to parenthood [
11,
12,
13]. In parallel, sleep disruption is near universal postpartum and may affect sexual health through fatigue, stress reactivity, mood dysregulation, dyadic functioning, and pain sensitivity [
14,
15]. However, sleep is often treated as a background context rather than an independent determinant in multivariable models of late-postpartum sexual function.
Accordingly, this prospective cohort study followed women from early pregnancy to 6–12 months postpartum with repeated FSFI assessment and integrated late-postpartum measurement of sleep quality and relationship satisfaction. We aimed to (i) describe FSFI trajectories across six timepoints spanning pregnancy through 12 months postpartum and (ii) evaluate whether sleep quality and relationship satisfaction are independently associated with late-postpartum sexual function beyond obstetric factors and depressive symptoms, using both continuous FSFI outcomes and clinically interpretable subgroup analyses.
2. Materials and Methods
2.1. Study Design, Setting, and Ethics
This was a single-center, prospective cohort study conducted at the Department of Obstetrics and Gynecology and affiliated outpatient services of Victor Babes University of Medicine and Pharmacy Timisoara. The protocol was designed as a pragmatic longitudinal observational study aligned with routine antenatal and postpartum follow-up schedules, enabling repeated measurement of sexual function without requiring additional clinical procedures.
Ethics approval was obtained from the institutional ethics committee, and all participants provided written informed consent. Given the intimate nature of the questionnaire content, confidentiality procedures were emphasized: surveys were completed privately without partners present, study identifiers were separated from personal identifiers, and analyses were performed on de-identified datasets.
To minimize social desirability bias, study staff were trained to provide neutral clarification of questionnaire items without prompting specific responses. Participants were explicitly informed that participation would not affect clinical care and that they could skip any question they found uncomfortable, while being encouraged to complete as much as they felt able.
Participant recruitment occurred between March 2023 and March 2024 during routine antenatal visits at the Department of Obstetrics and Gynecology and affiliated outpatient services of Victor Babeș University of Medicine and Pharmacy Timișoara (Timișoara, Romania). Postpartum follow-up and data collection continued until November 2026, with the last assessment performed at 6–12 months postpartum.
To reduce selection bias, recruitment was consecutive among eligible clinic attendees, and follow-up assessments were aligned with standard care to maximize retention. To reduce information and social desirability bias, questionnaires were self-administered privately, partners were not present, and staff provided neutral clarification without prompting responses. Exposure classification (delivery mode/trauma category) was abstracted from medical records using standardized operational definitions. To reduce confounding, multivariable models included prespecified obstetric and psychosocial covariates selected a priori based on the literature.
2.2. Participants, Follow-Up Schedule, and Procedures
Eligible participants were pregnant women aged ≥18 years with singleton pregnancy who enrolled at ≤20 weeks gestation during routine antenatal care. Exclusion criteria were conditions expected to strongly preclude sexual activity at baseline (medically mandated abstinence), major cognitive impairment preventing questionnaire completion, or anticipated inability to complete follow-up through 12 months postpartum.
Participants completed FSFI at six timepoints: first trimester (T1), second trimester (T2), third trimester (T3), 6–8 weeks postpartum (PP1), 3 months postpartum (PP2), and 6–12 months postpartum (PP3). This study prioritized capturing both the early postpartum nadir and later recovery period, recognizing that recovery trajectories may not be evident by the standard postpartum visit alone.
At PP3, participants completed an expanded psychosocial battery: PHQ-9 for depressive symptoms, WHOQOL-BREF domains for quality of life, a body-image disturbance scale focused on intimacy-related self-consciousness, PSQI for sleep quality, and a brief relationship satisfaction scale (1–5). Obstetric data (delivery mode and trauma category) were abstracted from the medical record using standardized definitions.
Follow-up reminders were implemented via phone or messaging to reduce attrition, and visits were aligned with routine appointments whenever possible. For PP3, in-person completion was preferred, but secure remote completion was permitted to reflect pragmatic postpartum constraints while preserving privacy.
Participants were recruited consecutively from eligible women attending antenatal care in the study setting. Study staff introduced this study in a private setting, confirmed eligibility, and obtained written informed consent. Follow-up was aligned with routine visits; participants received standardized reminders (telephone/text) before postpartum assessments. When an in-person visit was not feasible, questionnaires were completed remotely using a secure link, with instructions to complete surveys privately without partner presence.
Attrition was handled by complete-case analysis for each model; in this cohort, no enrolled participants were lost to follow-up, and all 112 completed the scheduled assessments.
The late-postpartum assessment was scheduled within a 6–12-month window to accommodate routine-care variability; when participants attended within this window, the completed assessment was treated as the PP3 timepoint.
2.3. Measures and Instruments
Sexual function was assessed using the Female Sexual Function Index (FSFI), a 19-item measure producing a total score from 2 to 36, where higher scores indicate better function. Although FSFI cutoffs were originally derived in non-pregnant populations, we used the widely reported threshold of FSFI < 26.55 to define “FSFI-defined sexual dysfunction” at PP3 for descriptive and comparative purposes, while analyzing FSFI primarily as a continuous outcome.
Depressive symptoms were measured with PHQ-9 (0–27), higher scores indicating greater symptom burden. Sleep quality was measured with PSQI (0–21), higher scores indicating worse sleep. Relationship satisfaction was assessed on a 1–5 scale (higher = better). Quality of life was measured using WHOQOL-BREF domain scores (0–100). Body-image disturbance was summarized on a 0–4 scale, where higher values reflect more dissatisfaction and intimacy-related self-consciousness.
Obstetric exposure was categorized into three delivery groups to balance clinical interpretability and statistical stability: (1) vaginal birth with low trauma, (2) vaginal birth with higher trauma (episiotomy and/or higher-degree lacerations as operationalized in the study dataset), and (3) cesarean delivery. Early postpartum perineal pain at PP1 was dichotomized into none/mild vs. moderate/severe. Pelvic-floor symptoms at PP3 were defined as the presence of at least one bothersome urinary or bowel symptom on a structured postpartum checklist.
An optional antenatal sexual-health counseling session (yes/no) was included as an exposure reflecting a feasible “real clinic” educational add-on rather than a formal randomized intervention. Attendance was recorded during third-trimester visits. The counseling content was standardized (normalizing common sexual changes, guidance on timing and comfort, and brief communication strategies), but uptake was voluntary.
All instruments used in this study are widely validated patient-reported measures with established feasibility and responsiveness in clinical and reproductive-health research. The FSFI has demonstrated strong psychometric performance and sensitivity to longitudinal change across reproductive life stages. Depressive symptoms (PHQ-9), sleep quality (PSQI), and quality of life (WHOQOL-BREF) were assessed using standardized instruments with established validity; Romanian-language versions have been previously used and/or validated in Romanian samples [
16,
17,
18,
19].
2.4. Statistical Analysis
This cohort was designed as a pragmatic longitudinal study embedded in routine care; therefore, the sample size was primarily determined by feasibility (clinic volume during the recruitment window) and the goal of achieving adequate precision for the primary descriptive endpoint (late-postpartum dysfunction prevalence) and stable estimation for prespecified multivariable models. With n = 112, the prevalence estimate for FSFI-defined dysfunction at 6–12 months postpartum has acceptable precision for cohort description, and the number of dysfunction events (n = 65) supports multivariable modeling with a limited set of a priori predictors. Missing data were minimal; participants included in the analysis completed all scheduled FSFI timepoints, and models were fit using complete-case data.
Continuous variables were summarized as mean ± standard deviation (SD) and categorical variables as counts and percentages. Normality was assessed by Shapiro–Wilk testing and distribution inspection; given the typical mild deviations and unequal variances between groups, Welch’s t-tests were used for most two-group comparisons of continuous outcomes, while χ2 tests were used for categorical comparisons (Fisher’s exact test was reserved for sparse 2 × 2 tables when applicable).
Longitudinal change in FSFI across the six timepoints was evaluated using repeated-measures ANOVA with time as the within-subject factor, reporting the overall time effect with F statistics and p-values. Subgroup comparisons across more than two groups used one-way ANOVA. To evaluate whether sleep quality and relationship satisfaction jointly shaped PP3 FSFI, a two-way ANOVA tested main effects and their interaction. Pearson correlations quantified associations between PP3 FSFI and psychosocial variables.
Multivariable modeling was prespecified to examine predictors beyond bivariate comparisons. Linear regression modeled PP3 FSFI as a continuous outcome, with key predictors entered simultaneously: standardized PHQ-9, PSQI, relationship satisfaction, body-image disturbance, delivery group indicators, perineal pain, pelvic-floor symptoms, counseling attendance, and age. Logistic regression modeled PP3 dysfunction (FSFI < 26.55), reporting odds ratios (ORs) with 95% confidence intervals. Two-sided p < 0.05 defined statistical significance.
3. Results
A total of 112 women were enrolled at ≤20 weeks gestation and completed the FSFI at all six scheduled timepoints (first trimester, second trimester, third trimester, 6–8 weeks postpartum, 3 months postpartum, and 6–12 months postpartum). All enrolled participants completed the late-postpartum psychosocial battery at 6–12 months postpartum, and all 112 were included in the final analysis. Delivery group distribution was vaginal low trauma (n = 45), vaginal higher trauma (n = 29), and cesarean delivery (n = 38). Participant flow from enrollment to analysis is summarized in
Figure 1.
Women with PP3 dysfunction (FSFI < 26.55; n = 65) were similar to those without dysfunction (n = 47) in age (29.7 ± 4.3 vs. 28.5 ± 4.3 years;
p = 0.172), BMI (24.3 ± 3.9 vs. 25.1 ± 3.3 kg/m
2;
p = 0.216), primiparity (53.8% vs. 57.4%;
p = 0.847), perineal pain at PP1 (29.2% vs. 34.0%;
p = 0.681), exclusive breastfeeding at PP3 (58.5% vs. 42.6%;
p = 0.126), pelvic-floor symptoms at PP3 (33.8% vs. 31.9%;
p = 1.000), and counseling attendance (29.2% vs. 31.9%;
p = 0.836). However, delivery mode/trauma distribution differed significantly (
p = 0.012), with higher-trauma vaginal birth more common among women with dysfunction (33.8% vs. 14.9%) and low-trauma vaginal birth less common (29.2% vs. 55.3%). Psychosocial and health-related measures showed clear separation: dysfunction was associated with higher depressive symptoms (PHQ-9: 8.6 ± 4.1 vs. 6.4 ± 3.3;
p = 0.002), worse sleep (PSQI: 8.6 ± 2.8 vs. 7.0 ± 2.4;
p = 0.002), lower relationship satisfaction (3.6 ± 0.5 vs. 3.9 ± 0.4;
p < 0.001), and lower WHOQOL-Physical scores (66.4 ± 12.1 vs. 71.8 ± 12.0;
p = 0.021), while WHOQOL-Psychological and body-image disturbance did not differ significantly (
p = 0.13 and
p = 0.485, respectively), as seen in
Table 1.
FSFI total scores declined progressively across pregnancy from T1 (26.5 ± 4.1) to T2 (25.2 ± 4.2) and T3 (21.8 ± 4.5), reaching the lowest mean in early postpartum at PP1 (18.8 ± 4.3). Scores then improved at PP2 (20.6 ± 4.1) and showed substantial recovery by PP3 (25.4 ± 5.0), approaching first-trimester levels. Repeated-measures ANOVA confirmed a strong overall time effect across the six timepoints (F(5555) = 268.0,
p < 0.001), supporting a marked late-pregnancy/early-postpartum nadir followed by gradual recovery over the first postpartum year (
Table 2).
Table 3 shows a clear imbalance in late-postpartum FSFI-defined dysfunction (PP3; FSFI < 26.55) across delivery/trauma categories. Low-trauma vaginal birth had the lowest dysfunction burden (19/45, 42.2%) and the highest proportion without dysfunction (26/45, 57.8%). In contrast, higher-trauma vaginal birth clustered disproportionately in the dysfunction group (22/29, 75.9%), with only 7/29 (24.1%) classified as no dysfunction. Cesarean delivery also showed a higher dysfunction proportion than low-trauma vaginal birth (24/38, 63.2%).
Early postpartum sexual function (PP1) differed across delivery groups (
p = 0.011), with the highest mean FSFI in the low-trauma vaginal group (20.3 ± 3.7; n = 45), lower scores after cesarean (18.2 ± 5.0; n = 38), and the lowest scores in the higher-trauma vaginal group (17.5 ± 3.4; n = 29). At PP3, group differences persisted (
p = 0.027): low-trauma vaginal birth remained highest (26.7 ± 4.6), cesarean was intermediate (25.3 ± 5.6), and higher-trauma vaginal birth remained lowest (23.5 ± 4.4). Clinically, dysfunction prevalence at PP3 also varied significantly by group (
p = 0.012), affecting 42.2% of low-trauma vaginal births, 63.2% of cesareans, and 75.9% of higher-trauma vaginal births, indicating a graded relationship between obstetric trauma category and late-postpartum sexual outcomes, as seen in
Table 4 and
Figure 2).
At PP3, sexual function showed strong gradients across sleep quality: mean FSFI was highest in the good-sleep tertile (27.4 ± 5.0; n = 38), slightly lower in the intermediate group (26.0 ± 5.1; n = 36), and lowest in the poor-sleep tertile (22.8 ± 3.8; n = 38), with dysfunction rates rising from 44.7% and 47.2% to 81.6%, respectively. A similar pattern was observed for relationship satisfaction: women with low relationship satisfaction had lower FSFI (23.4 ± 4.4; n = 38) and markedly higher dysfunction prevalence (84.2%) compared with moderate (25.8 ± 5.2; 45.9%) and high satisfaction tertiles (27.0 ± 4.9; 43.2%). Overall, between-tertile differences were significant for FSFI (one-way ANOVA
p = 0.007) and for dysfunction prevalence (χ
2 p < 0.001). In two-way ANOVA, both sleep (
p < 0.001) and relationship satisfaction (
p = 0.022) had independent main effects on PP3 FSFI, while their interaction was not significant (
p = 0.826), suggesting additive rather than synergistic associations in this cohort (
Table 5).
Correlation analyses indicated that worse depressive symptoms and poorer sleep were moderately associated with lower PP3 FSFI (PHQ-9: r = −0.4,
p < 0.001; PSQI: r = −0.4,
p < 0.001). Relationship satisfaction and WHOQOL-BREF physical domain showed smaller but statistically significant positive correlations with FSFI (both r = 0.2;
p = 0.015 and
p = 0.017, respectively), while WHOQOL-BREF psychological domain showed a similar magnitude that did not reach conventional significance (r = 0.2,
p = 0.077). Body-image disturbance, anxiety (GAD-7), age, BMI, and breastfeeding duration showed weak and non-significant associations (|r| ≤ 0.1;
p > 0.05 for all), suggesting that mood and sleep factors were the most prominent correlates of late-postpartum sexual function in this dataset (
Table 6).
In multivariable linear regression, higher depressive symptoms and worse sleep quality were independently associated with lower PP3 FSFI (PHQ-9 per SD: β = −1.3, 95% CI −2.2 to −0.4;
p = 0.004; PSQI per SD: β = −1.3, 95% CI −2.2 to −0.4;
p = 0.004). Obstetric trauma also remained significant: higher-trauma vaginal delivery was associated with a 2.9-point lower FSFI compared with low-trauma vaginal delivery (β = −2.9, 95% CI −5.0 to −0.8;
p = 0.008). Cesarean delivery showed a negative but non-significant association (β = −1.7, 95% CI −3.7 to 0.3;
p = 0.086). Relationship satisfaction trended positively but did not reach significance (β = 0.8, 95% CI −0.1 to 1.6;
p = 0.08), and body-image disturbance, perineal pain, pelvic-floor symptoms, counseling attendance, and age were not significant predictors (
p ≥ 0.194). Overall model performance was moderate (R
2 = 0.3; adjusted R
2 = 0.2; overall
p < 0.001), as seen in
Table 7.
In logistic regression, worse sleep quality increased the odds of PP3 dysfunction (PSQI per SD: OR = 1.9, 95% CI 1.2 to 3.2;
p = 0.009), whereas higher relationship satisfaction was protective (per SD: OR = 0.5, 95% CI 0.3 to 0.8;
p = 0.005). Delivery group effects were substantial: higher-trauma vaginal birth was associated with markedly higher odds of dysfunction vs. low-trauma vaginal birth (OR = 5.7, 95% CI 1.7 to 19.5;
p = 0.005), and cesarean delivery also increased odds (OR = 3.9, 95% CI 1.3 to 11.7;
p = 0.016). Depressive symptoms showed a positive trend but did not reach statistical significance (PHQ-9 per SD: OR = 1.5, 95% CI 0.9 to 2.5;
p = 0.097). Body-image disturbance, perineal pain, pelvic-floor symptoms, counseling attendance, and age were not significant (
p ≥ 0.355). The overall model was significant (likelihood ratio test
p < 0.001) with modest explanatory power (pseudo-R
2 = 0.2), as presented in
Table 8.
Table 9 quantifies the delivery-group association with PP3 dysfunction using multiple effect estimates. Compared with vaginal low-trauma birth, higher-trauma vaginal birth showed consistently elevated risk across models: unadjusted OR 4.30 (95% CI 1.56–11.87), bias-reduced OR 4.00 (1.55–10.34), unadjusted RR 1.80 (1.23–2.63), and adjusted OR 5.7 (1.7–19.5). Cesarean delivery showed a smaller and less consistent elevation: unadjusted OR 2.35 (0.96–5.77) and bias-reduced OR 2.26 (0.95–5.41) crossed/approached the null, while the adjusted OR remained significant at 3.9 (1.3–11.7).
Compared with the resilient phenotype, the pain-dominant phenotype was primarily associated with higher depressive symptoms (PHQ-9 per point: RRR = 1.1, 95% CI 1.0 to 1.2;
p = 0.028) and higher obstetric trauma (vaginal higher trauma vs. low: RRR = 2.6, 95% CI 1.0 to 6.8;
p = 0.047), while sleep quality and relationship satisfaction showed only borderline/non-significant associations (PSQI:
p = 0.217; relationship satisfaction:
p = 0.064). In contrast, the global low phenotype showed a stronger psychosocial pattern: worse sleep (PSQI per point: RRR = 1.3, 95% CI 1.1 to 1.6;
p = 0.003) and higher depressive symptoms (PHQ-9 per point: RRR = 1.2, 95% CI 1.1 to 1.4;
p < 0.001) increased risk, whereas greater relationship satisfaction reduced risk (RRR = 0.5, 95% CI 0.3 to 0.8;
p = 0.006). Delivery group and pelvic-floor symptoms were not significant predictors for membership in the global-low phenotype (
p = 0.089 to 0.65). The overall multinomial model fit was significant (likelihood ratio test
p < 0.001) with McFadden pseudo-R
2 = 0.2, supporting distinct predictor profiles for pain-dominant vs. global-low dysfunction phenotypes (
Table 10).
Figure 3 shows clear separation in postpartum sexual-function recovery across sleep tertiles. Women in the good-sleep tertile had consistently higher FSFI means at each postpartum visit compared with the poor-sleep tertile, with divergence increasing over time. At PP1, mean FSFI was 20.2 (good sleep) vs. 16.5 (poor sleep), a difference of 3.7 points; at PP2, it was 22.1 vs. 18.7 (3.4 points). By PP3, mean FSFI reached 27.4 in good sleep compared with 22.6 in poor sleep, expanding the gap to 4.8 points, indicating both a lower early postpartum level and less complete recovery among those with worse sleep quality. The intermediate group tracked between these curves (PP3 26.1), supporting a graded relationship between sleep quality and FSFI recovery across the postpartum period.
Figure 4 demonstrates that postpartum sexual dysfunction can express distinct domain phenotypes rather than a uniform reduction across all FSFI components. The resilient group (n = 37) had high median domain scores across all domains, including pain (4.8) and lubrication (4.6). In contrast, the pain-dominant phenotype (n = 33) showed relatively preserved medians for desire (4.0), arousal (3.9), orgasm (4.0), and satisfaction (4.1) but a pronounced reduction in the pain domain (median 3.0), separating it from the resilient phenotype by 1.9 points in pain (4.8 vs. 3.0). The global low phenotype (n = 42) showed broadly lower medians across domains (desire 3.2, orgasm 3.1, satisfaction 3.1, pain 3.2), indicating a more generalized impairment profile. These patterns support phenotype-driven stratification (pain-focused vs. global psychosocial/biobehavioral pathways) rather than a single FSFI cutoff alone.