4.3. Comparison of the Fitted Count Data Models
This is a comparison of all fitted models for maternal death counts using Akaike information criteria and Bayesian information criteria.
Table 3 below provides the AIC and BIC values used to choose the model that best fits the data.
The comparison of the performance of the model in
Table 3 reveals that the RZIP model provided the best fit to the data, outperforming all other models based on AIC and BIC. The NB model showed considerable improvement over the standard Poisson model, indicating its ability to handle overdispersion in the data more effectively. Among the robust models, the RZINB and RHP models showed competitive performance, especially the RZINB, but were still outperformed by the RZIP model. The RHNB model showed a slightly poorer fit compared to the RZIP, RHP, and RZINB models but performed better than the standard count (ZINB and HNB) and traditional (Poisson and NB) models. Looking at the ZINB, showed a better fit over the HNB and traditional models. These findings generally emphasize the suitability of the RZIP model for this dataset, demonstrating its ability to effectively address zero inflation and outliers at approximately 5%.
Figure 4 shows the observed histogram of maternal deaths overlaid with fitted probability curves from six different count regression models (Poisson, NB, RZIP, RZINB, RHP, and RHNB) that included covariates for assisted deliveries, breech delivery, early teen pregnancy, late teen pregnancy, attending at least four ANC visits (ANC4Visits), cesarean section (CS), and stillbirth. For each model, fitted probabilities were computed for individual observations using the estimated parameters and included covariates and then averaged across all observations to generate the marginal predicted distribution. This approach allows for a fair comparison with the empirical distribution of maternal deaths. The observed distribution exhibits significant excess zeros beyond what would be expected under standard count model assumptions, as also shown in
Table 2. This zero inflation necessitates the evaluation of models explicitly designed to handle both structural and sampling zeros. The standard Poisson model performs poorly due to its assumption of equidispersion (equal mean and variance), leading it to severely underestimate the frequency of zero outcomes. Although the NB model partially addresses this by accommodating overdispersion, it still lacks a mechanism to model excess structural zeros. In contrast, the robust hurdle models (RHNB and RHP) and robust zero-inflated models (RZINB and RZIP) offer superior fits by explicitly modeling the dual processes that generate zero and positive counts separately. Among these, the RZIP model provides the best overall fit. Its two-part structure, separating the not-at-risk group (structural zeros) from the at-risk group (count component), enables it to accurately capture both the high frequency of zeros and the distribution of non-zero maternal deaths. These results are consistent with those in
Table 3, where the RZIP achieved the lowest AIC and BIC scores.
4.4. Robust Count Regression Estimation Results
A significant frequency of zero counts and outliers was observed in the maternal mortality data. These observations deviate substantially from the expected patterns of standard count data distributions such as the Poisson and NB models. Such deviations suggest that the data may arise from two latent subpopulations, one that is not at risk of maternal mortality and thus always yields a count of zero, and another that is at risk and generates counts according to a standard count process (possibly zero or positive). In maternal mortality analysis, at-risk facilities are those that conduct deliveries with potential complications, high-risk pregnancies, or limited access to advanced obstetric care, making maternal deaths possible. Not-at-risk facilities are structurally unlikely to report maternal deaths because they either do not offer delivery services, handle only low-risk births, transfer complicated cases, or operate solely as antenatal/postnatal clinics. Furthermore, the presence of outliers in the data requires the use of robust estimation techniques, which the standard count models cannot account for. Given these challenges, an RZIP model is identified as the most appropriate for this analysis, as shown in
Figure 4 and supported by model comparison results in
Table 3. In the following
Table 4, the study displays parameter estimates and the robust standard error of the best-performing model, which is RZIP.
Table 4 displays the estimated coefficients of the RZIP model for two parts, namely the count model and the zero model. The zero model explains the excess zeros in the data, while the count model concentrates on the count (frequency) of the event, which is maternal mortality in our case. Each component describes a distinct feature of the data. In the count component, the RZIP model estimates the expected number of maternal deaths per 1000 births among the at-risk group. The estimated intercept is
, which corresponds to an expected count of 0.4245 maternal deaths when all predictors are at their baseline levels. This significantly lower baseline suggests that, in the absence of specific risk factors, maternal mortality remains below the mean of the sample. Among predictors, BreachDelivery has a statistically significant positive effect on maternal mortality among the at-risk group, with an estimated coefficient of 0.2813 (
p = 0.0251), corresponding to a
increase in the expected number of maternal deaths per 1000 births
, reinforcing its role as a critical risk factor. Conversely, AssistedDeliveries do not exhibit a meaningful impact, with an estimated effect of
p = 0.8435), in the at-risk group
, corresponding to a 40.13% decrease in the expected number of maternal deaths per 1000 births of an individual being in an at-risk group in maternal mortality, suggesting a limited influence on maternal mortality among individuals at risk. EarlyTeenPreg, however, shows a notable positive association in the at-risk group, with an estimated coefficient of
,
p = 0.0185), indicating a
increase in the expected count of maternal deaths per 1000 births among individuals at risk
. The statistical significance of this effect suggests that early teenage pregnancy is an important predictor of maternal mortality within the at-risk group.
LateTeenPreg, on the other hand, has an estimated effect of , p = 0.7900), in the at-risk group corresponding to a 8.26% decrease in the expected count of maternal deaths per 1000 births . However, this effect is not statistically significant ( 0.05), suggesting that late teenage pregnancy may not be a strong predictor of maternal mortality among individuals at risk. ANC4Visits and CS are both negatively associated with maternal mortality among individuals at risk, based on the count component of the model. ANC4Visits shows a statistically significant effect, with a decrease in the expected count of maternal deaths per 1000 births , Although CS is associated with a decrease in the expected number of maternal death per 1000 births , this effect is not statistically significant, indicating limited evidence of its independent contribution within the at-risk population. In contrast, StillBirth shows a positive significant effect, with a increase in the expected count of maternal deaths per 1000 births . This finding strongly strengthens the critical role of stillbirth as a key risk factor for maternal mortality among at-risk individuals and emphasizes the need for targeted interventions addressing this factor to reduce maternal deaths.
The zero inflation component models the log-odds of belonging to the not-at-risk group, those for whom maternal mortality cannot occur (i.e., they always yield a zero count). Among predictors, BreachDelivery has an estimated effect of , p = 0.5500), implying a decreased odds of an individual being in the not-at-risk group , suggesting a limited influence on maternal mortality among individuals that are not-at-risk group. Similarly, AssistedDeliveries has an estimated effect of , p = 0.8560), suggesting a limited influence on maternal mortality among individuals who are not-at-risk group. EarlyTeenPreg has an estimated effect of , p = 0.2140), suggesting that for each unit increase in early teenage pregnancies, the odds of belonging to the not-at-risk group , but there is limited influence on maternal mortality among individuals not-at-risk 0.05) indicating no strong evidence. In contrast, LateTeenPreg demonstrates a significant association in the zero-inflation component, with an estimated coefficient of , p = 0.0306), indicating a increase in the odds of being in the not-at-risk group . This implies that regions with higher late teenage pregnancy rates may be more likely to fall into a subgroup structurally unlikely to report maternal deaths, possibly reflecting reporting differences or underlying protective mechanisms.
ANC4Visits has a statistically significant positive effect, with an estimated coefficient of , p = 0.0164), suggesting that for each unit increase in ANC4Visits pregnancies, the odds of belonging to the not-at-risk group . This result suggests that higher ANC4Visits is associated with a substantially greater probability of avoiding maternal death, indicating that antenatal care may play a protective role in identifying or preventing risk before it manifests. The significance of this effect strengthens the importance of ANC4Visits in reducing the likelihood of being at risk for maternal mortality. CS has an estimated coefficient of , , corresponding to a increase in the odds of being in the not-at-risk group per unit increase . This indicates that individuals with higher values of CS are significantly more likely to be in the not-at-risk group. Similarly, StillBirth has a negative estimated coefficient of , , suggesting a decrease in the odds of being in the not-at-risk group . This estimate is also not statistically significant, and the large standard error reflects substantial uncertainty. Thus, there is no reliable evidence that StillBirth affects the likelihood of being in the not-at-risk group.