SARIMA vs. Prophet: Comparative Efficacy in Forecasting Traffic Accidents Across Ecuadorian Provinces

Chango, Wilson; Salguero, Ana; Landivar, Tatiana; Vásconez, Roberto; Silva, Geovanny; Peñafiel-Arcos, Pedro; Núñez, Lucía; Velasteguí-Izurieta, Homero

doi:10.3390/computation14010005

Open AccessArticle

SARIMA vs. Prophet: Comparative Efficacy in Forecasting Traffic Accidents Across Ecuadorian Provinces

by

Wilson Chango

^1,2,*

,

Ana Salguero

²

,

Tatiana Landivar

²

,

Roberto Vásconez

²

,

Geovanny Silva

²

,

Pedro Peñafiel-Arcos

²

,

Lucía Núñez

²

and

Homero Velasteguí-Izurieta

^1,*

¹

Department of Systems and Computation, Pontifical Catholic University of Ecuador, Esmeraldas Campus PUCESE, Esmeraldas 080101, Ecuador

²

Faculty of Informatics and Electronics, Escuela Superior Politécnica de Chimborazo (ESPOCH), Riobamba 060155, Ecuador

^*

Authors to whom correspondence should be addressed.

Computation 2026, 14(1), 5; https://doi.org/10.3390/computation14010005

Submission received: 14 November 2025 / Revised: 9 December 2025 / Accepted: 16 December 2025 / Published: 31 December 2025

(This article belongs to the Topic Intelligent Optimization Algorithm: Theory and Applications)

Download

Browse Figures

Versions Notes

Abstract

This study aimed to evaluate the comparative predictive efficacy of the SARIMA statistical model and the Prophet machine learning model for forecasting monthly traffic accidents across the 24 provinces of Ecuador, addressing a critical research gap in model selection for geographically and socioeconomically heterogeneous regions. By integrating classical time series modeling with algorithmic decomposition techniques, the research sought to determine whether a universally superior model exists or if predictive performance is inherently context-dependent. Monthly accident data from January 2013 to June 2025 were analyzed using a rolling-window evaluation framework. Model accuracy was assessed through Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE) metrics to ensure consistency and comparability across provinces. The results revealed a global tie, with 12 provinces favoring SARIMA and 12 favoring Prophet, indicating the absence of a single dominant model. However, regional patterns of superiority emerged: Prophet achieved exceptional precision in coastal and urban provinces with stationary and high-volume time series—such as Guayas, which recorded the lowest MAPE (4.91%)—while SARIMA outperformed Prophet in the Andean highlands, particularly in non-stationary, medium-to-high-volume provinces such as Tungurahua (MAPE 6.07%) and Pichincha (MAPE 13.38%). Computational instability in MAPE was noted for provinces with extremely low accident counts (e.g., Galápagos, Carchi), though RMSE values remained low, indicating a metric rather than model limitation. Overall, the findings invalidate the notion of a universally optimal model and underscore the necessity of adopting adaptive, region-specific modeling frameworks that account for local geographic, demographic, and structural factors in predictive road safety analytics.

Keywords:

SARIMA; Prophet; traffic accident prediction; time series forecasting; model selection; Ecuador

Graphical Abstract

1. Introduction

Injuries from traffic accidents constitute a global public health crisis of epidemic proportions, being the leading cause of death among children and young people worldwide according to the World Health Organization, which reports approximately 1.3 million annual fatalities and between 20 and 50 million people who suffer non-fatal injuries [1]. This situation represents a particularly devastating socioeconomic burden for developing countries, where healthcare systems often exhibit greater structural vulnerabilities.

This worrying global context underscores the urgent need to develop sophisticated predictive tools that enable proactive management of road safety. The conceptual framework of our study is based on the comparative analysis of time series models for accident prediction, specifically evaluating the predictive efficacy of traditional statistical approaches versus machine learning methodologies. The relevance of this research transcends the methodological, addressing a critical public health problem where the ability to anticipate fluctuations in road traffic crashes can optimize the allocation of healthcare and emergency resources, design more effective preventive interventions and, ultimately, save lives.

The justification for this work emerges from a research gap identified in the scientific literature: there is a limited number of exhaustive comparative studies that evaluate the performance of predictive models across diverse geographical regions with heterogeneous socioeconomic and topographical characteristics [2]. This deficiency has prevented the establishment of clear guidelines for model selection in specific operational contexts, limiting the transferability of findings across different territorial realities.

Investigative background reveals that while SARIMA (Seasonal Autoregressive Integrated Moving Average) models have demonstrated robustness in capturing seasonal and trend patterns in transportation data [3], modern algorithms like Prophet, developed by Facebook, offer superior flexibility in handling nonlinear patterns and outliers [4]. The theories supporting this study are rooted in the integration of classic statistical time series analysis, based on Box–Jenkins theory, with the computational principles of machine learning, creating a hybrid theoretical framework that allows for rigorously evaluating the assumptions, advantages, and limitations of each predictive paradigm.

The context in which this study is conducted—the 24 provinces of Ecuador between 2013 and 2024—offers an ideal natural laboratory due to its geographical diversity (coastal, Andean, and Amazonian regions), climatic variability, and disparities in road infrastructure, providing a robust empirical scenario for testing the models’ generalizability. The pioneering innovation of this research lies in constituting the first comparative analysis at the Ecuadorian national level of classical versus ML approaches for accident prediction, generating locally contextualized evidence.

Consequently, this study seeks to answer the following research question: To what extent does the predictive efficacy differ between the SARIMA and Prophet models when forecasting traffic accidents across the diverse provinces of Ecuador, and what regional factors account for these variations in performance?

2. Background

2.1. Road Safety and Risk Prediction as a Global and Regional Public Health Challenge

Injuries resulting from traffic accidents represent one of the main causes of morbidity and mortality worldwide, surpassing in some countries the mortality rates for infectious diseases [5]. It is estimated that more than 1.19 million people die each year in road traffic crashes, and that traffic accidents constitute the leading cause of death among young people aged 15–29, with an economic impact exceeding 3% of the global GDP [6].

In the Latin American context, recent studies highlight the lack of evidence-based predictive mechanisms to identify critical risk zones and periods [7]. Road safety is, therefore, recognized as a structural public health challenge that requires preventive strategies oriented toward proactive risk management, supported by quantitative and predictive data analysis.

In particular, the use of time series models allows for the analysis of historical trends and the prediction of crash occurrences, facilitating the optimization of resources in health, transportation, and police control [8]. The integration of forecasting tools into road safety management not only offers operational advantages but constitutes a key component of public policies for the prevention of avoidable deaths, especially in countries with territorial disparities in infrastructure and vehicular control such as Ecuador [9] (see Table 1).

2.2. Fundamentals of Classic Time Series Models

The SARIMA (Seasonal Autoregressive Integrated Moving Average) model is an extension of the ARIMA model that incorporates seasonal components in order to capture patterns that repeat at regular time intervals, following the methodology proposed by Box and Jenkins [15].

Mathematically, SARIMA is denoted as:

SARIMA (p, d, q) {(P, D, Q)}_{s}

where

p and q represent the orders of the autoregressive and moving average components.
d represents the necessary differentiation to achieve stationarity.
$(P, D, Q)$ are the analogous parameters for the seasonal component with periodicity s [16].

Stationarity, understood as the statistical invariance of the mean and variance over time, is an essential requirement to ensure the model’s validity [17]. Its application in accident prediction contexts has proven effective for series with regular and seasonal patterns, such as monthly vehicular traffic fluctuations [18].

Furthermore, SARIMA models are validated using statistical tests such as the Ljung–Box test, the Partial Autocorrelation Function (PACF) analysis, and the selection of optimal orders based on the AIC and BIC information criteria [19]. This methodological structure has made the SARIMA model a classic and robust tool in forecasting variables related to mobility and road safety.

2.3. Modern Forecasting Algorithms: The Facebook Prophet Model

The Prophet model, developed by Taylor and Letham at Meta (Facebook), constitutes a modern machine learning approach for time series forecasting. Unlike traditional models based on stationarity, Prophet employs an additive model that decomposes the time series into three fundamental components: trend, seasonality, and holiday or special event effects [20].

This flexible approach allows capturing complex and nonlinear patterns, common in traffic data influenced by climatic, tourist, or socioeconomic variations [21]. Prophet employs optimization algorithms based on Bayesian regression, facilitating the dynamic updating of parameters and the incorporation of new observations in real-time [22]. Various studies have shown that its performance is superior to that of SARIMA models in scenarios with high temporal irregularity or the presence of outliers [23]. In the Ecuadorian context, its application is particularly relevant given the climatic and topographic diversity of the 24 provinces, which condition the occurrence of traffic accidents in a differentiated manner (See Table 2).

2.4. Evaluation and Validation Criteria for Multi-Regional Prediction Models

The evaluation of predictive models in multi-regional contexts requires the use of quantitative performance metrics and statistical tests that determine the superiority of one model over another. Two of the most commonly used metrics are the Mean Absolute Percentage Error (MAPE) and the Root Mean Square Error (RMSE), which measure prediction accuracy and error stability across regions [29]. These indicators are especially useful when analyzing datasets originating from provinces with heterogeneous temporal behaviors, such as those of Ecuador. Recent studies emphasize that MAPE and RMSE continue to be fundamental metrics in traffic forecasting, particularly when models must handle complex temporal patterns and varying demand conditions [30].

Additionally, the Diebold–Mariano (DM) test provides a robust statistical framework for comparing the predictive performance of two models over the same time series, evaluating whether differences in forecasting errors are statistically significant [31]. In the context of intelligent transportation systems, recent advances highlight the relevance of statistically grounded comparisons when working with deep learning or hybrid forecasting architectures, as shown in the multi-regional evaluation framework proposed by [32].

The integration of these metrics and statistical procedures ensures an objective and reproducible evaluation of the relative performance of SARIMA and Prophet. This approach allows determining which model offers greater accuracy and generalization capacity across heterogeneous regions. Complementary comparative error plots and cross-validation tables further strengthen methodological transparency, an essential aspect of predictive research applied to road safety.

3. Materials and Methods

The comparative analysis of predictive models was conducted under the principles of Design Science Research (DSR), aiming to develop and rigorously evaluate a predictive artifact (an automated multi-regional forecasting framework) capable of identifying the most effective time series model for traffic accident prediction across different Ecuadorian provinces. The DSR structure integrates the PRISMA framework for the initial literature review and the CRISP-DM methodology for the artifact’s design and development, ensuring both theoretical relevance and operational rigor [33] (see Figure 1).

3.1. Relevance Phase

3.1.1. Problem Identification

Traffic accidents constitute a significant public health crisis in Ecuador, necessitating advanced predictive tools for preemptive resource allocation and policy development [34]. Traditional forecasting methods often fail to capture the complex seasonality, non-linear trends, and regional heterogeneity present in accident data. This gap justifies the comparative study between the classical, stationary-based SARIMA model and the modern, decomposition-based Prophet model.

3.1.2. Systematic Literature Review (PRISMA Framework)

To define the research gap and justify the model comparison, a systematic review was performed following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines [35]. The search focused on comparative studies of time series forecasting in traffic safety or public health, with a temporal range from 2020 to 2025 (see Figure 2).

Identification: An exhaustive search was carried out in scientific databases and registers, initially identifying $n = 140$ records (120 from databases and 20 from registers). Combined keywords with Boolean operators were used: (“SARIMA” OR “ARIMA”) AND (“Prophet”) AND (“traffic accidents” OR “road safety”) AND (“forecasting”). The temporal range considered was from 2020 to 2025.
Screening: From the identified records, $n = 20$ were removed before screening (5 duplicate records, 5 marked as ineligible by automation tools, and 10 removed for other reasons). The remaining $n = 100$ records were screened by title and abstract, of which $n = 20$ were excluded (e.g., non-empirical publications or studies that did not use time series models).
Eligibility: $n = 80$ full-text reports were sought for retrieval. These $n = 80$ reports were assessed for eligibility. $n = 30$ reports were excluded based on detailed criteria: $n = 10$ due to methodological flaws (e.g., insufficient data), $n = 10$ due to a lack of performance comparison between key models, and $n = 10$ for not utilizing flexible decomposition models.
Inclusion: Finally, $n = 50$ studies were included for qualitative synthesis. These documents demonstrate the existing gap and justify the need for the systematic comparison between traditional modeling (SARIMA) and modern additive models (Prophet), serving as the foundation for the proposed artifact’s justification.

3.2. Design and Development Phase (CRISP-DM)

3.2.1. Data Preparation and Exploratory Analysis

This study uses monthly traffic accident records from Ecuador’s national database, covering the 24 provinces from January 2013 to June 2025. The data were aggregated at a monthly resolution (

s = 12

) to capture annual seasonality and separated into 24 individual time series, one per province.

Data Source and Granularity. The information was structured at a monthly frequency to allow the identification of seasonal patterns and to ensure comparability across provinces.

Preprocessing. A comprehensive preprocessing phase was conducted. This included: (a) identifying the proportion and temporal distribution of missing values, (b) detecting potential outliers that could distort model fitting, and (c) carefully selecting an interpolation strategy. Sporadic missing values were imputed using linear interpolation; although simple, this method was validated against the short gaps observed and the smooth monthly dynamics of the series. Structural shifts were normalized when necessary to avoid distortions in trend estimation.

Stationarity Analysis. For the SARIMA approach, stationarity was assessed with the Augmented Dickey–Fuller (ADF) test for each provincial series. Differencing (parameter d) was applied only when required to stabilize the mean and variance [36].

3.2.2. Predictive Modeling

Two modeling frameworks were implemented to capture the temporal behavior of the multi-regional data.

SARIMA Model Specification. Optimal SARIMA orders were selected per province through an iterative search guided by the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). Model adequacy was validated using the Ljung–Box test to confirm that no significant autocorrelation remained in the residuals. For baseline comparison, a SARIMA(1,1,1)(1,1,1)₁₂ configuration was evaluated, as it effectively captures the annual seasonality commonly present in monthly accident data.

Prophet Model Specification. The Prophet model, based on an additive decomposition framework [37], was also implemented. Prophet is formally expressed as:

y (t) = g (t) + s (t) + h (t) + ε_{t}

(1)

where

y (t)

is the observed series,

g (t)

is the non-linear trend (piecewise linear or logistic),

s (t)

models seasonal components (weekly and annual),

h (t)

accounts for holiday effects (not included in the baseline comparison), and

ε_{t}

is the error term. In this study, the model automatically considered the annual seasonality component and estimated the underlying non-linear trend.

3.2.3. Model Evaluation Setup and Visualization

A rolling-window evaluation scheme was applied using a fixed training period and a 12-month sliding forecast window. This approach simulates real forecasting conditions and enables assessing model stability over time [38]. Comparative error plots were generated for each province to visually contrast the performance of the SARIMA and Prophet models.

3.3. Rigor Phase—Artifact Evaluation

The effectiveness of the predictive artifact was assessed using quantitative accuracy metrics and statistical significance testing.

3.3.1. Performance Metrics

Model performance for both SARIMA and Prophet was evaluated using two primary metrics:

Mean Absolute Percentage Error (MAPE). This metric evaluates forecast accuracy as a percentage and is especially useful for comparing provinces of different scales.

MAPE = \frac{1}{n} \sum_{t = 1}^{n} |\frac{A_{t} - F_{t}}{A_{t}}| \times 100 %

(2)

Root Mean Square Error (RMSE). This metric captures the standard deviation of forecasting errors and penalizes large deviations more heavily, thus reflecting model stability.

RMSE = \sqrt{\frac{1}{n} \sum_{t = 1}^{n} {(A_{t} - F_{t})}^{2}}

(3)

where in Formulas (2) and (3),

A_{t}

is the actual value,

F_{t}

the forecast value, and n the number of predictions.

3.3.2. Diebold–Mariano Statistical Test

The Diebold–Mariano (DM) test was applied to determine whether the prediction accuracy of the two models differed significantly in each province [39]. The test evaluates the null hypothesis that both models exhibit equal forecasting accuracy. Rejection of the null at

α = 0.05

indicates that one model is statistically superior. This province-specific analysis provides rigorous statistical support to identify the most effective forecasting approach for broader spatial generalization [40].

4. Results

4.1. Coast Region

The analysis of the traffic accident time series in Guayas (101,179 total accidents) reveals a stationary series (p-value of

0.018

), which simplifies the modeling process.

The comparative evaluation between the models confirms a significant superiority of the Prophet model over SARIMA.

Prophet demonstrated much higher accuracy with a Mean Absolute Percentage Error (MAPE) of 4.91% and a Root Mean Square Error (RMSE) of 33.76.
SARIMA recorded considerably higher metrics (MAPE of 9.77% and RMSE of 64.64).

Prophet’s performance represents a reduction of almost 50% in the average prediction error, positioning it as the most effective and stable model for this province. Prophet’s future projections suggest a seasonal pattern, predicting a peak in accidents in December 2025 (733) and a trough in February 2026 (568) (see Figure 3).

The Figure 4 illustrates the temporal behavior and forecast performance of traffic accidents in Manabí province (2013–2025). The time series reveals fluctuations with peaks in 2014 and 2021, followed by a moderate decline during 2023–2024. The trend shows a gradual decrease with slight recovery in early 2025. Forecasts for July 2025–June 2026 indicate a mild downward trend, with SARIMA providing smoother and more accurate results (MAPE 14.22%, RMSE 26.32) compared to Prophet. The ADF test (p = 0.008) confirmed the stationarity of the series, and the predicted monthly accidents range between 73 and 109, suggesting moderate variability and stability in the short term.

The Figure 5 depicts the monthly evolution and forecasts of traffic accidents in Los Ríos province (2013–2025). The time series reveals fluctuations with peaks above 130 accidents during 2014–2015 and a downward trend after 2022. The recent trend shows a steady decline until mid-2024 with a mild recovery in 2025. Both SARIMA and Prophet models produced consistent forecasts for July 2025–June 2026, stabilizing around 70–100 accidents per month. Quantitatively, Prophet achieved slightly better accuracy (MAPE 17.91%, RMSE 15.14) than SARIMA (MAPE 19.88%, RMSE 16.85). The ADF test (p = 0.004) confirmed the series’ stationarity, supporting SARIMA’s validity and indicating a stable, moderately variable trend in road accidents for the province.

The Figure 6 shows the monthly evolution of traffic accidents between 2013 and 2025, with projections extending to 2026. A total of 11,356 accidents were recorded, with a monthly average of 75.7 and values ranging between 28 and 159. Peaks were observed in 2014–2015 and 2019–2020, followed by stabilization after 2021. The stationarity test (p-value = 0.002) confirms that the series is suitable for ARIMA/SARIMA models. Both models demonstrate high predictive accuracy (MAPE < 15%), although Prophet outperforms SARIMA with a lower error (MAPE 9.67% vs. 12.35%). Forecasts for 2025–2026 indicate stability between 73 and 104 accidents, with a peak in December 2025 and stabilization around 82 by mid-2026. Overall, the series exhibits a predictable pattern, and Prophet provides the best overall performance.

The Figure 7 summarizes the temporal evolution and predictive analysis of traffic accidents in El Oro province (2013–2025).

Accidents peaked above 100 cases between 2014–2016, followed by a steady decline after 2018, as shown in the full time series. The recent trend (2023–2024) confirms a continued downward pattern, suggesting improved safety conditions.

Model validation indicates consistent predictive performance, while 12-month forecasts (July 2025–June 2026) show a stable range of 33–48 accidents per month. Quantitatively, SARIMA outperformed Prophet, with MAPE = 18.21% and RMSE = 7.97, versus Prophet’s MAPE = 40.60% and RMSE = 16.11. Despite the non-stationary series (

p = 0.331

), SARIMA demonstrated superior stability and accuracy in forecasting accident trends.

The Figure 8 summarizes the temporal evolution and predictive modeling of traffic accidents in Santa Elena. The complete time series (top-left) shows high variability with peaks near 100 accidents (2014–2015) and stabilization between 40 and 60 after 2020. The trend (top-right) indicates a decline until early 2024, followed by a slight recovery in 2025. The model validation (bottom-left) reveals strong consistency between real and test data, while the 12-month forecasts (bottom-right) suggest a stable accident frequency between 41 and 51 cases per month. Quantitatively, the SARIMA model outperformed Prophet (MAPE = 15.29%, RMSE = 10.31 vs. MAPE = 20.58%, RMSE = 12.78). The ADF test (p = 0.000) confirmed stationarity, validating SARIMA’s assumptions and indicating stable medium-term accident trends.

The analysis focuses on the traffic accident time series for the province of Esmeraldas, which is characterized by low volume (3039 total accidents, 20.3 monthly average) and extreme volatility (minimum of 2, maximum of 54). Methodologically, the series is NON-stationary (p-value of

0.411

), implying the need for differentiation in the SARIMA model (see Figure 9).

In the comparative evaluation, although both models exhibit a high Mean Absolute Percentage Error (MAPE) (Prophet:

73.93 %

; SARIMA:

92.37 %

), which is typical for low-volume series, the Prophet model demonstrates clear superiority. Prophet records an MAPE that is 18.44 percentage points lower and an RMSE (Root Mean Square Error) of

6.10

(compared to

6.81

for SARIMA). This lower metric in absolute terms positions Prophet as the most accurate and stable model for forecasting accidents in this non-stationary, low-volume time series (see Table 3).

4.2. Andean Region

90,636 accidents were recorded with a monthly average of 604.2, showing high variability between 115 and 1617 monthly accidents. The series is not stationary (p-value: 0.463), justifying the use of SARIMA. This model demonstrated high performance (MAPE: 13.38%, RMSE: 55.33), while Prophet had low performance (MAPE: 66.62%, RMSE: 248.24). SARIMA predictions for the next 12 months range between 380–494 accidents, with a peak in December 2025, showing overall stability with seasonal variations (see Figure 10).

15,398 accidents were analyzed with a monthly average of 102.7, showing variability between 17 and 173 monthly incidents. The series is not stationary (p-value: 0.076). The SARIMA model showed poor performance (MAPE: 72.51%, RMSE: 46.95), while Prophet demonstrated superior performance (MAPE: 48.34 RMSE: 33.64). SARIMA predictions for the next 12 months range between 111–132 accidents, showing stability but with limited reliability due to high error. It is recommended to use Prophet over SARIMA (see Figure 11).

Our analysis of 15,280 traffic accidents in Tungurahua revealed a monthly average of 101.9, though the data showed significant fluctuations from as few as 6 to as many as 228 accidents in a single month. The data pattern over time is not stationary, confirming the presence of underlying trends or seasonal effects.

In our model evaluation, the SARIMA model demonstrated exceptional performance, with a very low prediction error, significantly outperforming the Prophet model.

Looking ahead, the SARIMA model forecasts a relatively stable trend for the next 12 months, with predicted accidents ranging between 79 and 105 per month. A noticeable peak of 105 accidents is anticipated for December 2025 (see Figure 12).

The accident data in Imbabura reveals a complex picture, with 8850 total accidents and a monthly average of 59, but with high variability: ranging from 13 to 236 accidents in different months, suggesting the influence of extraordinary factors.

When comparing predictive models, SARIMA achieved an error margin of 41%, which, while high, significantly outperforms Prophet’s 77% error, making it the best available option, though it should be used with caution.

Projections for the next year estimate between 37 and 51 monthly accidents, generally below the historical average. It is recommended to investigate the causes of historical peaks and explore complementary methods to improve predictability (see Figure 13).

When analyzing Loja, a total of 7844 accidents were found with a monthly average of 52.3. However, monthly figures have varied dramatically—from just 6 accidents in the calmest month to 205 in the most critical one—indicating exceptional events in the province’s history.

The good news is that the data shows consistent patterns over time, making accident prediction more reliable. When comparing forecasting methods, both models perform acceptably, with Prophet having a slight advantage, predicting with a 16.4% error compared to SARIMA’s 18.07%.

Projections for the next year indicate between 47 and 65 monthly accidents, remaining close to the historical average. It is observed that December would be the most critical month with 65 projected accidents, while January would show the lowest activity with 48. This stability in the predictions gives us greater confidence for preventive planning (see Figure 14).

Chimborazo recorded 7342 accidents with a monthly average of 48.9, showing moderate variability between 10 and 89 monthly accidents. Although the series is stationary (p-value: 0.001), a condition that normally favors prediction, both models performed poorly. SARIMA had an error of 67.48% and Prophet 66.55%, levels that are unacceptable for practical use. SARIMA projections for the next 12 months range between 26–47 accidents, below the historical average. The contradiction between stationarity and the high errors suggests that the models are not capturing non-linear patterns, dominant external factors, or possible random behavior in the data (see Figure 15).

Cañar recorded 1993 accidents in total, making it the province with the lowest volume analyzed. Its monthly average is 13.3 accidents, but with significant relative variability, having fluctuated between 1 and 38 accidents per month. The series is not stationary (p-value: 0.688), which complicates predictive modeling.

When comparing the models, SARIMA showed an error of 42.53%, marginally outperforming Prophet (46.76%), with both showing low absolute errors. The performance is acceptable considering the reduced scale of the data.

SARIMA projections for the next 12 months predict between 3 and 6 monthly accidents, well below the historical average, with the highest values in October and December 2025 (6 accidents) and the lowest in January and June 2026 (3 accidents) (see Figure 16).

Cotopaxi recorded 4371 accidents with a monthly average of 29.1, showing high variability between 4 and 85 monthly accidents. The series is not stationary (p-value: 0.132). Both predictive models failed severely: SARIMA showed an extreme error of 269.65%, while Prophet, although better, maintained a critical error of 100.83%. The SARIMA projections (64–79 monthly accidents) lack credibility as they double or triple the historical average (see Figure 17).

Bolívar recorded 1787 accidents with a monthly average of 11.9, showing controlled variability between 0 and 27 monthly accidents. The series is not stationary (p-value: 0.174). SARIMA demonstrated better performance (MAPE: 40.72%, RMSE: 5.03) than Prophet (MAPE: 43.11%, RMSE: 7.20), showing clear superiority in both metrics. SARIMA projections for the next 12 months range between 9–17 accidents, aligning with the historical average, with the highest peak in May 2026 (17 accidents) and the lowest values in August 2025 and March 2026 (9 accidents). The results indicate a favorable scenario with recognizable seasonal patterns and stable projections that enable reliable planning (see Figure 18).

Carchi recorded 1293 accidents with a very low monthly average of 8.6, ranging between 0 and 25 monthly accidents. The series is not stationary. A critical technical failure was identified in both models: the MAPE values are numerically absurd (trillions of percent), indicating a computational error, possibly due to division by zero or corrupted data. However, the RMSE is low (2.47–2.97) and the SARIMA projections (0–6 monthly accidents) are consistent with the low historical volume, although well below the average (see Figure 19).

The analysis of the provinces shows that Pichincha has the highest accident volume (90,636), while Carchi has the lowest (1293). SARIMA demonstrated better performance in high-volume provinces like Pichincha (MAPE 13.38%) and Tungurahua (6.07%), while Prophet was superior in Azuay (48.34%) and Loja (16.40%). Most series are not stationary, except for Loja and Chimborazo. Predictive errors vary significantly, from an excellent 6.07% in Tungurahua to a critical 100.83% in Cotopaxi, with lower-volume provinces presenting greater predictive challenges with MAPE generally above 40%. (see Table 4).

4.3. Amazon Region

Morona Santiago recorded 1553 accidents with a monthly average of 11.3, showing moderate variability between 1 and 26 monthly accidents. The series is not stationary (p-value: 0.324). Prophet demonstrated better performance (MAPE: 43.36%, RMSE: 5.73) than SARIMA (MAPE: 52.62%, RMSE: 6.16), clearly outperforming it in both metrics. The SARIMA projections for the next 12 months range between 3–8 accidents, well below the historical average, with the highest peak in December 2025 (8 accidents) and the lowest values in January and April 2026 (3 accidents) (see Figure 20).

Orellana recorded 1175 accidents with a very low monthly average of 7.8, showing high relative variability between 0 and 29 monthly accidents. The series is stationary (p-value: 0.012), a favorable characteristic. SARIMA demonstrated better performance (MAPE: 31.86%, RMSE: 2.27) than Prophet (MAPE: 37.87%, RMSE: 3.00), consistently outperforming it in both metrics. The SARIMA projections for the next 12 months range between 3–9 accidents, close to the historical average, with the highest peak in November 2025 (9 accidents) and the lowest values in August 2025 and June 2026 (3 accidents) (see Figure 21).

Zamora Chinchipe recorded 1143 accidents with a very low monthly average of 7.6, showing significant variability between 0 and 25 monthly accidents. The series is not stationary (p-value: 0.251). Prophet demonstrated better performance (MAPE: 52.55%) than SARIMA (MAPE: 68.58%), although both models show high errors. The SARIMA projections for the next 12 months range between 2–9 accidents, close to the historical average, with peaks in October and November 2025 (9 accidents) and the lowest value in September 2025 (2 accidents) (see Figure 22).

Napo recorded 1091 accidents with an extremely low monthly average of 7.3, showing high variability between 0 and 35 monthly accidents. The series is not stationary (p-value: 0.383). SARIMA marginally outperformed Prophet with a MAPE of 72.63% versus 79.10%, although both models showed poor performance. The SARIMA projections for the next 12 months range between 0–4 accidents, well below the historical average, with the highest peak in December 2025 (4 accidents) and the lowest value in September 2025 (0 accidents) (see Figure 23).

Pastaza recorded 1041 accidents with an extremely low monthly average of 6.9, showing high variability between 0 and 28 monthly accidents. The series is not stationary. A critical computational failure was identified in both models, with numerically impossible MAPE values (trillions of percent), which completely invalidates this metric. However, the RMSE is very low (1.49–1.57) and the SARIMA projections (0–3 monthly accidents) are consistent with the low historical volume, although well below the average. The predictions show maximum values in December 2025 and February 2026 (3 accidents) and the minimum in August 2025 (0 accidents) (see Figure 24).

Sucumbíos recorded 828 accidents, the lowest volume analyzed, with a monthly average of 5.5 and high variability between 0 and 23 monthly accidents. The series is not stationary. Both models showed critical failures in MAPE calculation with numerically absurd values (billions of percent), following the error pattern observed in other low-volume provinces. The SARIMA projections range between 0–4 monthly accidents, with 7 of the 12 months projecting 0 accidents and only December 2025 showing a significant value (4 accidents), a trend well below the historical average (see Figure 25).

Among the analyzed Amazonian provinces, Morona Santiago recorded the highest accident volume (1553) while Sucumbíos had the lowest (828). Orellana was the only province with a stationary series, where SARIMA achieved the best performance (MAPE 31.86%). In the other non-stationary provinces, Prophet showed an advantage in Morona Santiago (43.36%) and Sucumbíos (533.73%), although the latter presents an extremely high error. Predictive errors vary dramatically, from the acceptable 31.86% in Orellana to critically high values in Pastaza (1703.95%) and Sucumbíos (533.73%), reflecting the challenges of modeling very low-volume series (see Table 5).

4.4. Insular Region

The Insular Region (Galápagos) has the lowest accident volume in the country, with a monthly average of only 0.5. Its time series is not stationary and the MAPE values are extremely high and unstable, showing numerically absurd figures that confirm a systematic error pattern in provinces with minimal volumes. This MAPE failure appears to be related to the high frequency of zero or near-zero values in the data (see Figure 26).

However, the RMSE values are very low (0.56–0.62), indicating that the numerical predictions are accurate in scale. The projections of 0–1 monthly accidents are consistent with historical data, showing a random distribution of zeros and ones without a clear seasonal pattern. The projected average of 0.33 accidents per month is slightly lower than the historical average of 0.5 (see Table 6).

Despite the Prophet model registering the highest individual accuracy (4.91% MAPE in Guayas), the analysis reveals a Global Tie (12 vs. 12) in the number of provinces where each model is superior according to the MAPE.

SARIMA is more effective in the Sierra Region (provinces with Non-Stationary series but with medium-to-high volume, such as Pichincha and Tungurahua). Prophet is more effective in the Coast Region (provinces with high volume and stationary series, such as Guayas and Sto. Domingo de los Tsáchilas).

There is no single general "Best Model" when counting the victories per province; both models are equally competitive, with the selection of the optimal model depending on the intrinsic characteristics (stationarity, volume, volatility) of each province’s time series (see Table 7).

The accident analysis compared the SARIMA and Prophet models for all 24 provinces, most having 150 monthly records, selecting the best model for each. Guayas has the highest accident volume (101,179 total, 674.5 monthly), and Galápagos the lowest (81 total, 0.5 monthly).The best overall precision was achieved with Prophet in Guayas (

M A P E : 4.9 %

), while SARIMA’s best precision was in Tungurahua (

M A P E : 6.1 %

). Only 7 provinces were classified as stationary time series (including Guayas, Manabí, and Sto. Domingo), while 17 provinces were non-stationary. The model choice was critical: Pichincha strongly favored SARIMA (

13.4 %

MAPE), and Guayas favored Prophet (

4.9 %

MAPE) (Figure 27).

4.5. Diebold–Mariano Test Results

The Final Table 8 presents the results of the Diebold–Mariano (DM) test applied to each province of Ecuador, with the purpose of evaluating whether statistically significant differences exist between the two forecasting models used to estimate accident time series. The DM test directly compares the forecast errors generated by both models and determines whether one model systematically outperforms the other.

For each province, three key elements are reported:

dm_stat: the DM test statistic, which quantifies the magnitude and direction of the difference in predictive accuracy between the models.
dm_p: the associated p-value, reflecting the probability that the observed difference in errors occurred by chance.
significance: a categorical interpretation based on the p-value that facilitates the identification of statistically relevant differences.

For interpretation, the following standard criteria were adopted:

$p \geq 0.10$ : No evidence, indicating no statistical support for differences in model performance.
$0.05 \leq p < 0.10$ : Weak evidence (not present in the analyzed data).
$0.01 \leq p < 0.05$ : Significant, indicating statistically significant evidence favoring one model.
$p < 0.01$ : Highly significant, reflecting very strong evidence that one model consistently outperforms the other.

The results indicate that, for most provinces, the p-values are relatively high (

p \geq 0.10

), meaning that no statistical evidence exists to claim that one model predicts better than the other. However, several provinces exhibit clearly significant differences. Specifically, Cotopaxi, El Oro, Guayas, Imbabura, Morona, Pichincha, and Tungurahua show p-values below 0.01, providing highly significant evidence of superior performance by one of the models. Additionally, three provinces—Zamora Chinchipe, Sucumbíos, and Santa Elena—show statistically significant differences at the 5% level.

Overall, the results reveal a geographically heterogeneous pattern. While provinces in the central-northern Sierra and parts of the Coastal region present pronounced differences in model performance, other regions exhibit statistically indistinguishable accuracy between the two models. This heterogeneous behavior suggests that local factors—such as seasonal patterns, regional variability, or specific characteristics of the time series—may influence the relative predictive performance of the models.

5. Discussion

The empirical comparison between the SARIMA and Prophet models enabled us to address the research question, showing that the predictive effectiveness of both approaches depends strongly on the regional context and the temporal behavior of the accident series. Although the results revealed an overall balance—12 provinces favored each model—SARIMA achieved a slight advantage in national average accuracy (MAPE: 12.3% versus 15.7% for Prophet). This finding rules out the existence of a universally superior model for road-accident prediction in Ecuador and indicates that the suitability of each model is closely linked to the local characteristics of each province.

The regional analysis identified consistent patterns explaining the differences in model performance. Provinces with a high volume of accidents, such as Guayas and Pichincha, generate denser and more structurally coherent time series, facilitating model fitting. In these cases, the choice between SARIMA and Prophet is closely related to the stationarity of the series. Prophet showed outstanding results in coastal provinces with high volume and stationary behavior—for example, Guayas, where it achieved a MAPE of 4.91%—due to its ability to model complex seasonal patterns without requiring differencing [41]. In contrast, SARIMA outperformed Prophet in Andean provinces with medium-to-high volumes but non-stationary series, such as Pichincha (MAPE: 13.38%) and Tungurahua (MAPE: 6.07%). Its methodological structure, which incorporates differencing to achieve stationarity, enables it to more effectively handle the nonlinear trends and structural shifts characteristic of these regions [42].

These differences are not explained solely by the mathematical structure of the models but also by underlying geographic and socioeconomic factors. In the coastal region, dense traffic, moderate climate variability, and more homogeneous economic activity generate marked and relatively stable seasonal patterns, conditions that favor Prophet. In contrast, the Andean region exhibits complex topography, higher climatic variability, and a more heterogeneous road network, factors that introduce irregularities, abrupt peaks, and less predictable trends. In these settings, the parametric structure of SARIMA provides a clear advantage in the presence of potential structural breaks [43]. These findings reinforce the importance of considering contextual variables—such as topography, climate, and traffic dynamics—when selecting predictive models tailored to geographic reality.

Despite the overall strong performance, the study faced methodological limitations in provinces with very low data volume and a high frequency of zero or near-zero values, such as Galápagos, Carchi, Pastaza, and Sucumbíos. In such cases, MAPE becomes mathematically unstable, as its formulation disproportionately magnifies small absolute errors when real values are very low. This can produce artificially inflated percentage errors that do not reflect the true performance of the model. However, RMSE values in these provinces remained consistently low, indicating that the limitations stem from the metric rather than from the predictive models themselves. Therefore, conclusions based solely on MAPE for these provinces should be interpreted with caution.

For future analyses in low-incident contexts, alternative metrics less sensitive to zero values are recommended, such as the Mean Absolute Error (MAE) or the Symmetric Mean Absolute Percentage Error (sMAPE). These indicators provide a more stable basis for comparison across regions with extreme heterogeneity in data volume [44].

From an applied perspective, the results offer valuable guidance for road-safety policy, supporting a model-selection approach tailored to regional needs. Prophet is more suitable for urban coastal provinces with stable seasonal patterns, whereas SARIMA is preferable in Andean regions where volatile and structurally complex trends predominate.

Finally, several avenues for future research arise from these findings: developing hybrid models that combine the parametric robustness of SARIMA with the decomposition flexibility of Prophet; incorporating exogenous variables—such as weather conditions, traffic flow, or economic indicators—to improve explanatory power; extending the analysis to sub-provincial scales; and comparing these results with deep learning architectures, such as LSTM and Transformers, to further validate predictive performance in complex transportation systems [45].

6. Conclusions

The present study aimed to rigorously evaluate the predictive performance of the SARIMA and Prophet models in estimating traffic accidents across Ecuador’s 24 provinces, with the objective of determining whether a universally superior model exists or whether performance depends on regional conditions. This objective addressed the central research question concerning the contextual nature of prediction within heterogeneous transportation systems, considering both the geographic particularities and the structural variability of the time series associated with road accidents.

The results revealed that no single model consistently outperformed its counterpart across the entire national territory, with a global tie observed in performance between both approaches (12 provinces favoring SARIMA and 12 favoring Prophet). However, the regional analysis demonstrated distinct patterns of effectiveness: Prophet exhibited outstanding performance in coastal and urban provinces with high density and stationary behavior, such as Guayas, which achieved the lowest MAPE in the study (4.91%), while SARIMA excelled in the Andean region, particularly in provinces with non-stationary series and more pronounced structural variability, such as Tungurahua (MAPE 6.07%) and Pichincha. These findings confirm that the predictive accuracy of the models is strongly conditioned by the geographic and structural dynamics of the data, invalidating the hypothesis of a single, universally optimal model for traffic accident forecasting.

From a theoretical perspective, the study contributes to closing a gap in the literature by providing the first systematic comparison between a classical statistical model (SARIMA) and a machine learning-based model (Prophet) applied to the Ecuadorian context, characterized by high topographic and demographic heterogeneity. The results offer strong evidence that the process of predictive model selection must incorporate criteria related to localization and series structure, beyond global accuracy alone. In this sense, the differentiated performance of each model across regions reinforces the notion that geographical conditions, urbanization levels, and the intrinsic seasonality of the data determine the methodological suitability of each approach.

Despite the robustness of the results obtained in high-volume provinces such as Guayas and Tungurahua, the study acknowledged important methodological limitations, particularly concerning low-volume series in provinces such as Galápagos, Carchi, Pastaza, and Sucumbíos. In these cases, the MAPE metric exhibited severe computational failures, generating numerically implausible values due to the high frequency of zeros and the sensitivity of the formula’s denominator. Nonetheless, the RMSE remained low, suggesting that the issue lies within the metric itself rather than the underlying model. These observations highlight the need to reassess traditional accuracy metrics in low-incidence contexts and propose the adoption of more robust indicators for interprovincial comparison.

From a practical perspective, the findings provide a valuable methodological guide for the design of road safety policies and the allocation of predictive resources tailored to regional contexts. Rather than adopting a single model for the entire country, a localized model selection framework is recommended—one that leverages the specific advantages of SARIMA in non-stationary environments and Prophet in urban and stable settings. Likewise, future research should integrate exogenous variables such as weather conditions, economic indicators, or mobility data to enhance the explanatory capacity of the models. The development of hybrid or ensemble frameworks that combine the strengths of both approaches represents a promising direction for improving multiregional forecasting.

Overall, this study reaffirms the importance of understanding the contextual nature of predictive modeling in complex transportation systems. By demonstrating that model performance depends on the structural and geographical characteristics of each province, the research provides a solid empirical and methodological foundation for the adoption of adaptive forecasting approaches. Beyond its technical contribution, the results underscore the relevance of predictive analytics as a strategic tool for accident prevention and evidence-based policymaking, strengthening the link between data science and the intelligent management of road safety.

Author Contributions

Conceptualization, T.L.; methodology, R.V. and H.V.-I.; software, A.S. and P.P.-A.; validation, L.N.; investigation, G.S.; Project administration, W.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

All data and files related to the predictive models are available and organized in the following Google Drive link: https://drive.google.com/drive/folders/1W3poLPpuC-Tgjqh2VS1s_GMWbASUCJs7?usp=sharing (accessed on 8 December 2025). In this repository, you will be able to access all the necessary information, including datasets, scripts, documentation, and any other relevant resources for the development and analysis of the models.

Acknowledgments

We express our sincere gratitude to Wilson Chango from PUCE Esmeraldas for his invaluable technical assistance and guidance in configuring the sensor systems and data acquisition processes for this research. This study was supported by PUCE Esmeraldas. Additionally, we acknowledge the facilities and general support provided by the Department of Computer Sciences, which greatly contributed to the successful completion of this work.

Conflicts of Interest

The authors declare that they have no conflicts of interest related to this article.

References

Mohmed, A.; Allah, G. Incidence and Causes of Injury Deaths among Children under Five Years of Age in El-Giza-Egypt. J. Med. Pharm. Sci. 2019, 4, 49–56. [Google Scholar]
Kumar, R.; Bhanu, M.; Mendes-Moreira, J.; Chandra, J. Spatio-Temporal Predictive Modeling Techniques for Different Domains: A Survey. ACM Comput. Surv. 2024, 57, 38. [Google Scholar] [CrossRef]
Rabbani, M.B.A.; Musarat, M.A.; Alaloul, W.S.; Rabbani, M.S.; Maqsoom, A.; Ayub, S.; Bukhari, H.; Altaf, M. A Comparison Between Seasonal Autoregressive Integrated Moving Average (SARIMA) and Exponential Smoothing (ES) Based on Time Series Model for Forecasting Road Accidents. Arab. J. Sci. Eng. 2021, 46, 11113–11138. [Google Scholar] [CrossRef]
Sivaramakrishnan, S.; Rathish, C.R.; Premalatha, S.; Niranjana, C. Introduction to AI Technique and Analysis of Time Series Data Using Facebook Prophet Model. In Innovative Engineering with AI Applications; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2023; pp. 171–188. [Google Scholar] [CrossRef]
Niohuru, I. Disease Burden and Mortality. In Healthcare and Disease Burden in Africa; Springer: Cham, Switzerland, 2023; pp. 35–85. [Google Scholar] [CrossRef]
Liu, L.; Villavicencio, F.; Yeung, D.; Perin, J.; Lopez, G.; Strong, K.L.; Black, R.E. National, regional, and global causes of mortality in 5–19-year-olds from 2000 to 2019: A systematic analysis. Lancet Glob. Health 2022, 10, e337–e347. [Google Scholar] [CrossRef]
Muelbert, M.M.; Copertino, M.; Cotrim da Cunha, L.; Lewis, M.N.; Polejack, A.; Peña-Puch, A.d.C.; Rivera-Arriaga, E. The Ocean and Cryosphere in a Changing Climate in Latin America: Knowledge Gaps and the Urgency to Translate Science Into Action. Front. Clim. 2021, 3, 748344. [Google Scholar] [CrossRef]
Balawi, M.; Tenekeci, G. Time series traffic collision analysis of London hotspots: Patterns, predictions and prevention strategies. Heliyon 2024, 10, e25710. [Google Scholar] [CrossRef]
Holguín-Carvajal, J.P.; Otzen, T.; Sanhueza, A.; Castillo, Á.; Manterola, C.; Muñoz, G.; García-Aguilera, F.; Salgado-Castillo, F. Trends in traffic accident mortality and social inequalities in Ecuador from 2011 to 2022. BMC Public Health 2024, 24, 1951. [Google Scholar] [CrossRef] [PubMed]
Dong, W.; Liu, Y.; Zhu, W.; Sun, J.; Bai, R. Temporal trends in the incidence and mortality of road injuries in China: Current trends and future predictions. Injury 2023, 54, 111139. [Google Scholar] [CrossRef] [PubMed]
Rosen, H.E.; Bari, I.; Paichadze, N.; Peden, M.; Khayesi, M.; Monclús, J.; Hyder, A.A. Global road safety 2010–18: An analysis of Global Status Reports. Injury 2025, 56, 110266. [Google Scholar] [CrossRef]
Seif, M.; Edalat, S.; Majidpour Azad Shirazi, A.; Alipouri, S.; Bayati, M. Prediction of the burden of road traffic injuries in Iran by 2030: Prevalence, death, and disability-adjusted life years. Chin. J. Traumatol.—Engl. Ed. 2024, 27, 242–248. [Google Scholar] [CrossRef]
Ahmed, S.K.; Mohammed, M.G.; Abdulqadir, S.O.; El-Kader, R.G.; El-Shall, N.A.; Chandran, D.; Rehman, M.E.; Dhama, K. Road traffic accidental injuries and deaths: A neglected global health issue. Health Sci. Rep. 2023, 6, e1240. [Google Scholar] [CrossRef]
Erfanpoor, S.; Hasani, J.; Mirtorabi, S.; Manouchehri, R.; Hashemi Nazari, S. Trend of mortality rate due to traffic accidents in Iran from 2006 to 2020: A cross-sectional study. Int. J. Crit. Illn. Inj. Sci. 2023, 13, 73–77. [Google Scholar] [CrossRef]
Arumugam, V.; Natarajan, V. Time Series Modeling and Forecasting Using Autoregressive Integrated Moving Average (ARIMA) and Seasonal Autoregressive Integrated Moving Average (SARIMA) Models. Instrum. Mes. Métrol. 2023, 22, 161–168. [Google Scholar] [CrossRef]
Webel, K. A Review of Some Recent Developments in the Modelling and Seasonal Adjustment of Infra-Monthly Time Series. SSRN Electron. J. 2022. [Google Scholar] [CrossRef]
Ryan, O.; Haslbeck, J.M.; Waldorp, L.J. Non-Stationarity in Time-Series Analysis: Modeling Stochastic and Deterministic Trends. Multivar. Behav. Res. 2025, 60, 556–588. [Google Scholar] [CrossRef]
Marcillo, P.; Valdivieso Caraguay, Á.L.; Hernández-álvarez, M. A Systematic Literature Review of Learning-Based Traffic Accident Prediction Models Based on Heterogeneous Sources. Appl. Sci. 2022, 12, 4529. [Google Scholar] [CrossRef]
Yavuz, V.S. Forecasting monthly rainfall and temperature patterns in Van Province, Türkiye, using ARIMA and SARIMA models: A long-term climate analysis. J. Water Clim. Change 2025, 16, 800–818. [Google Scholar] [CrossRef]
Serrano, A.L.M.; Rodrigues, G.A.P.; Martins, P.H.S.; Saiki, G.M.; Filho, G.P.R.; Gonçalves, V.P.; Albuquerque, R.O. Statistical Comparison of Time Series Models for Forecasting Brazilian Monthly Energy Demand Using Economic, Industrial, and Climatic Exogenous Variables. Appl. Sci. 2024, 14, 5846. [Google Scholar] [CrossRef]
Gkontzis, A.F.; Kotsiantis, S.; Feretzakis, G.; Verykios, V.S. Temporal Dynamics of Citizen-Reported Urban Challenges: A Comprehensive Time Series Analysis. Big Data Cogn. Comput. 2024, 8, 27. [Google Scholar] [CrossRef]
Zeng, S.; Liu, C.; Zhang, H.; Zhang, B.; Zhao, Y. Short-Term Load Forecasting in Power Systems Based on the Prophet–BO–XGBoost Model. Energies 2025, 18, 227. [Google Scholar] [CrossRef]
Szostek, K.; Mazur, D.; Drałus, G.; Kusznier, J. Analysis of the Effectiveness of ARIMA, SARIMA, and SVR Models in Time Series Forecasting: A Case Study of Wind Farm Energy Production. Energies 2024, 17, 4803. [Google Scholar] [CrossRef]
Chaturvedi, S.; Rajasekar, E.; Natarajan, S.; McCullen, N. A comparative assessment of SARIMA, LSTM RNN and Fb Prophet models to forecast total and peak monthly energy demand for India. Energy Policy 2022, 168, 113097. [Google Scholar] [CrossRef]
Arslan, S. A hybrid forecasting model using LSTM and Prophet for energy consumption with decomposition of time series data. PeerJ Comput. Sci. 2022, 8, e1001. [Google Scholar] [CrossRef]
Becker, A.S.; Erinjeri, J.P.; Chaim, J.; Kastango, N.; Elnajjar, P.; Hricak, H.; Vargas, H.A. Automatic Forecasting of Radiology Examination Volume Trends for Optimal Resource Planning and Allocation. J. Digit. Imaging 2021, 35, 1–8. [Google Scholar] [CrossRef] [PubMed]
Hossain, M.M.; Faisal Anwar, A.H.; Garg, N.; Prakash, M.; Bari, M. Monthly Rainfall Prediction at Catchment Level with the Facebook Prophet Model Using Observed and CMIP5 Decadal Data. Hydrology 2022, 9, 111. [Google Scholar] [CrossRef]
Arnaut, F.; Cvetkov, V.; Đurić, D.; Samardžić-Petrović, M. Short-term forecasting of PM10 and PM2.5 concentrations with Facebook’s Prophet Model at the Belgrade-Zeleno brdo. Geofizika 2023, 40, 162–177. [Google Scholar] [CrossRef]
Mariño, J.A.; Arrieta-Prieto, M.E.; Calderón V, S.A. Comparison between statistical models and machine learning for forecasting multivariate time series: An empirical approach. Commun. Stat. Case Stud. Data Anal. Appl. 2025, 11, 56–91. [Google Scholar] [CrossRef]
Ma, T.; Antoniou, C.; Toledo, T. Hybrid machine learning algorithm and statistical time series model for network-wide traffic forecast. Transp. Res. Part Emerg. Technol. 2020, 111, 352–372. [Google Scholar] [CrossRef]
Kaur, G.; Saini, S. Comparative analysis of RMSE and MAP metrics for evaluating CNN and LSTM models. AIP Conf. Proc. 2024, 3121, 040003. [Google Scholar] [CrossRef]
Pan, Y.A.; Guo, J.; Chen, Y.; Cheng, Q.; Li, W.; Liu, Y. A fundamental diagram based hybrid framework for traffic flow estimation and prediction by combining a Markovian model with deep learning. Expert Syst. Appl. 2024, 238, 122219. [Google Scholar] [CrossRef]
Chango, W.; Buñay, P.; Erazo, J.; Aguilar, P.; Sayago, J.; Flores, A.; Silva, G. Predicting Urban Traffic Congestion with VANET Data. Computation 2025, 13, 92. [Google Scholar] [CrossRef]
Secretaría Nacional de Planificación. Plan de Desarrollo para el Nuevo Ecuador 2024–2025; Secretaría Nacional de Planificación: Quito, Ecuador, 2024; pp. 1–300. ISBN 978-9942-22-592-4. Available online: https://www.planificacion.gob.ec/plan-de-desarrollo-para-el-nuevo-ecuador-2024-2025/ (accessed on 8 December 2025).
Parums, D.V. Editorial: Review Articles, Systematic Reviews, Meta-Analysis, and the Updated Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 Guidelines. Med. Sci. Monit. Int. Med. J. Exp. Clin. Res. 2021, 27, e934475. [Google Scholar] [CrossRef]
Basnayake, B.R.P.M.; Chandrasekara, N.V. Forecasting exchange rates in Sri Lanka: A comparison of the double seasonal autoregressive integrated moving average models (DSARIMA) and SARIMA models. J. Sci. Univ. Kelaniya 2022, 15, 192–209. [Google Scholar] [CrossRef]
Wei, X. Prediction and influence factors analysis of IP backbone network traffic based on Prophet model and variance reduction. Heliyon 2025, 11, e41472. [Google Scholar] [CrossRef] [PubMed]
Yang, J.; Li, P.; Cui, Y.; Han, X.; Zhou, M. Multi-Sensor Temporal Fusion Transformer for Stock Performance Prediction: An Adaptive Sharpe Ratio Approach. Sensors 2025, 25, 976. [Google Scholar] [CrossRef]
Mendoza, A.P. Dengue incidence forecasting model in Magalang, Pampanga using time series analysis. Inform. Med. Unlocked 2024, 44, 101439. [Google Scholar] [CrossRef]
Cre¸tan, R.C.; Abula, K.; Aihemaiti, Y. Digitalization and Culture–Tourism Integration in China: The Moderated Mediation Effects of Employment Quality, Infrastructure, and New-Quality Productivity. Sustainability 2025, 17, 8792. [Google Scholar] [CrossRef]
Adebayo, P.; Ahmed, I.; Oyeleke, K.T. Comparative Analysis of Prophet and ARIMA Models for Forecasting Economic Indicators: A Case Study of Personal Consumption Expenditures. SSRN 2025. [Google Scholar] [CrossRef]
Liu, J.; Yu, F.; Song, H. Application of SARIMA model in forecasting and analyzing inpatient cases of acute mountain sickness. BMC Public Health 2023, 23, 56. [Google Scholar] [CrossRef]
Letessier, C.; Cardi, J.; Dussel, A.; Ebtehaj, I.; Bonakdari, H. Enhancing Flood Prediction Accuracy through Integration of Meteorological Parameters in River Flow Observations: A Case Study Ottawa River. Hydrology 2023, 10, 164. [Google Scholar] [CrossRef]
Keil, A.; Bernat, E.M.; Cohen, M.X.; Ding, M.; Fabiani, M.; Gratton, G.; Kappenman, E.S.; Maris, E.; Mathewson, K.E.; Ward, R.T.; et al. Recommendations and publication guidelines for studies using frequency domain and time-frequency domain analyses of neural time series. Psychophysiology 2022, 59, e14052. [Google Scholar] [CrossRef] [PubMed]
Mienye, I.D.; Swart, T.G.; Obaido, G. Recurrent Neural Networks: A Comprehensive Review of Architectures, Variants, and Applications. Information 2024, 15, 517. [Google Scholar] [CrossRef]

Figure 1. Methodological Framework (DSR) Integrating PRISMA and CRISP-DM.

Figure 2. PRISMA Framework for Systematic Variable Selection.

Figure 3. Accident Forecasting Analysis for Guayas Province Using SARIMA and Prophet Models.

Figure 4. Accident Forecasting Analysis for Manabí Province Using SARIMA and Prophet Models.

Figure 5. Accident Forecasting Analysis for Los Ríos Province Using SARIMA and Prophet Models.

Figure 6. Temporal Evolution and Prediction of Traffic Accidents, SARIMA Model vs. Prophet.

Figure 7. Forecasting Monthly Traffic Accidents in El Oro Province (2013–2026).

Figure 8. Traffic Accident Forecasting and Temporal Trends in Santa Elena Province (2013–2026).

Figure 9. Comparative Analysis of SARIMA vs. Prophet and Accident Prediction in Esmeraldas.

Figure 10. Traffic Accident Forecasting and Temporal Trends in Pichincha Province.

Figure 11. Traffic Accident Forecasting and Temporal Trends in Azuay Province.

Figure 12. Traffic Accident Forecasting and Temporal Trends in Tungurahua Province.

Figure 13. Traffic Accident Forecasting and Temporal Trends in Imbabura Province.

Figure 14. Traffic Accident Forecasting and Temporal Trends in Loja Province.

Figure 15. Traffic Accident Forecasting and Temporal Trends in Chimborazo Province.

Figure 16. Traffic Accident Forecasting and Temporal Trends in Cañar Province.

Figure 17. Traffic Accident Forecasting and Temporal Trends in Cotopaxi Province.

Figure 18. Traffic Accident Forecasting and Temporal Trends in Bolivar Province.

Figure 19. Traffic Accident Forecasting and Temporal Trends in Carchi Province.

Figure 20. Traffic Accident Forecasting and Temporal Trends in Morona Santiago Province.

Figure 21. Traffic Accident Forecasting and Temporal Trends in Orellana Santiago Province.

Figure 22. Traffic Accident Forecasting and Temporal Trends in Zamora Chinchipe Province.

Figure 23. Traffic Accident Forecasting and Temporal Trends in Napo Province.

Figure 24. Traffic Accident Forecasting and Temporal Trends in Pastaza Province.

Figure 25. Traffic Accident Forecasting and Temporal Trends in Sucumbios Province.

Figure 26. Traffic Accident Forecasting and Temporal Trends in Galapagos Province.

Figure 27. Results of Prophet and SARIMA modeling of accident-prone traffic for provinces in Ecuador.

Table 1. Global Mortality Trends from Road Traffic Crashes and Methodological Approaches.

Paper	Global Mortality Trends (Summary)	Methodological Approach (Summary)
[10]	Trend: Decreasing (China). Total Deaths: 191,000 (2015) to 169,000 (2030). Rate/100k: 13.7 (2015) to 11.8 (2030).	Model: Log-linear models. Country/Period: China (2015–2030). Sources: UN, WHO, USDA.
[11]	Trend: Decreasing (then plateauing). Rate/100k: Decreased from 20.6 (2010) to 19.9 (2013). Annual % Change: −1.12%.	Model: Negative binomial regression. Countries/Period: 188 countries (2010–2013). Techniques: Age-adjustment.
[12]	Trend: Increasing (in low-income countries and the African Region). Global Deaths: 1.35 million annually.	Model: Descriptive analysis, t-test, Wilcoxon. Sources: WHO Global Status Reports (2013–2018). Countries: 161.
[13]	Trend: Increasing (projected seventh leading cause by 2030). Global Deaths: 1.3 million (2019 est.). Difference: 93% of mortality occurs in low- and middle-income countries.	Approach: N/A (Review Article, no primary data analysis methodology described).
[14]	Trend: N/A (Focus is specific to Iran).	Model: Joinpoint regression (based on log-linear model). Country/Period: Iran (2006–2020). Source: Iranian Legal Medicine Organization.

Table 2. Summary of Applications and Performance of the Prophet Model in Various Domains.

Paper	Application Domain (Summary)	Model Performance
[24]	Sector: Energy. Data: Monthly total and peak energy demand. Purpose: Planning and management of generation facilities.	Superiority: Prophet demonstrated superior performance to SARIMA and LSTM by capturing complex temporal features and handling periodicity. Metrics (MAPE): 3.3% (Total Demand), 3.01% (Peak Demand).
[25]	Sector: Energy. Data: Total monthly electric energy consumption (GWh). Purpose: Improve prediction accuracy for decision-making.	Comparison: Outperformed by neural networks (LSTM/GRU) in Brazil, but showed good performance for Turkey. Alternative Models: ARIMA, SVR, Holt-Winters.
[26]	Sector: Healthcare (Radiology). Data: Daily volume of CT and MRI exams. Purpose: Optimal resource allocation and planning.	Effectiveness: Reduced prediction error. CT: Error reduced from 19 to <1 per day (in monthly total). Improvement: Significantly better than manual predictions.
[27]	Sector: Water Management/ Agriculture. Data: Monthly rainfall (decadal scale). Characteristics: Strong seasonality.	Superiority: Outperformed six regression models (incl. MLPs and Boosting). Strength: Excellent in predicting dry periods due to its multiplicative seasonality function. Limitation: Difficulty with extreme peak values (very wet events).
[28]	Sector: Environmental Monitoring (Air Quality). Data: Hourly concentrations of PM10 and PM2.5. Purpose: Mitigate health risks from pollution.	Performance: RMSE below 6.26 μg/m³ for PM2.5 and 9.99 μg/m³ for PM10 (in 50% of cases). Strength: Simplicity and speed. Condition: Performed better for PM2.5 than for PM10.

Table 3. Comparative Performance of Forecasting Models in Coastal Region Provinces.

Province	Total Accidents	Monthly Average	Best Model	Best Model MAPE	Stationarity
GUAYAS	101,179	674.5	Prophet	4.91%	Stationary
MANABÍ	17,592	117.3	SARIMA	14.22%	Stationary
LOS RÍOS	13,063	87.1	Prophet	17.91%	Stationary
SANTO DOMINGO	11,356	75.7	Prophet	9.67%	Stationary
EL ORO	8764	58.4	SARIMA	18.21%	Non-Stationary
SANTA ELENA	7027	46.8	SARIMA	15.29%	Stationary
ESMERALDAS	3039	20.3	Prophet	73.93%	Non-Stationary

Table 4. Comparative Performance of Forecasting Models in Andean Region Provinces.

Province	Total Accidents	Monthly Average	Best Model	Best Model MAPE	Stationarity
PICHINCHA	90,636	604.2	SARIMA	13.38%	Non-stationary
AZUAY	15,398	102.7	Prophet	48.34%	Non-stationary
TUNGURAHUA	1528	101.9	SARIMA	6.07%	Non-stationary
IMBABURA	885	59.0	SARIMA	41.12%	Non-stationary
LOJA	7844	52.3	Prophet	16.40%	Stationary
CHIMBORAZO	7342	48.9	Prophet	66.55%	Stationary
COTOPAXI	4371	29.1	Prophet	100.83%	Non-stationary
CAÑAR	1993	13.3	SARIMA	42.53%	Non-stationary
BOLÍVAR	1787	11.9	SARIMA	40.72%	Non-stationary
CARCHI	1293	8.6	SARIMA	43.51%	Non-stationary

Table 5. Comparative Performance of Forecasting Models in Amazon Region Provinces.

Province	Total Accidents	Monthly Average	Best Model	Best Model MAPE	Stationarity
MORONA SANTIAGO	1553	11.3	Prophet	43.36%	Non-stationary
ORELLANA	1175	7.8	SARIMA	31.86%	Stationary
ZAMORA CHINCHIPE	1143	7.6	Prophet	52.55%	Non-stationary
NAPO	1091	7.3	SARIMA	72.63%	Non-stationary
PASTAZA	1041	6.9	SARIMA	1703.95%	Non-stationary
SUCUMBÍOS	828	5.5	Prophet	533.73%	Non-stationary

Table 6. Comparative Performance of Forecasting Models in Galapagos Region Provinces.

Province	Total Accidents	Monthly Average	Best Model	Best Model MAPE	Stationarity
GALÁPAGOS	81	0.5	Prophet	3302.35%	Non-stationary

Table 7. Comparative Analysis of Forecasting Model Performance by Geographic Region.

Region	Prophet Wins	SARIMA Wins	Dominant Model
Coast	4	3	Prophet
Highlands	4	6	SARIMA
Amazon	3	3	Tie
Insular	1	0	Prophet
OVERALL TOTAL (24 Provinces)	12	12	Tie

Table 8. Diebold–Mariano Test Statistics for Forecast Accuracy Comparison Across Ecuadorian Provinces.

Province	dm_stat	p-Value	Significance
AZUAY	1.635274	0.10199	No evidence
BOLIVAR	−1.416653	0.15658	No evidence
CANAR	0.127985	0.89816	No evidence
CARCHI	−1.351865	0.17641	No evidence
COTOPAXI	3.674933	0.00023791	Highly significant (p < 0.01)
CHIMBORAZO	1.288091	0.19771	No evidence
EL_ORO	−3.435728	0.00059096	Highly significant (p < 0.01)
ESMERALDAS	0.630382	0.52844	No evidence
GUAYAS	3.465814	0.00052862	Highly significant (p < 0.01)
IMBABURA	−2.857548	0.0042692	Highly significant (p < 0.01)
LOJA	−0.380962	0.70323	No evidence
LOS_RIOS	1.281541	0.2	No evidence
MANABI	−0.431551	0.66606	No evidence
MORONA	3.699555	0.00021597	Highly significant (p < 0.01)
NAPO	−0.856829	0.39153	No evidence
PASTAZA	0.426604	0.66966	No evidence
PICHINCHA	−6.766831	1.3163 × 10⁻¹¹	Highly significant (p < 0.01)
TUNGURAHUA	−4.927065	8.3474 × 10⁻⁷	Highly significant (p < 0.01)
ZAMORA_CHINCHIPE	−2.054503	0.039927	Significant (p < 0.05)
GALAPAGOS	−0.739158	0.45981	No evidence
SUCUMBIOS	−2.225582	0.026042	Significant (p < 0.05)
ORELLANA	−1.203761	0.22868	No evidence
STO_D_TSACHILAS	1.575166	0.11521	No evidence
SANTA_ELENA	−2.078903	0.037626	Significant (p < 0.05)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chango, W.; Salguero, A.; Landivar, T.; Vásconez, R.; Silva, G.; Peñafiel-Arcos, P.; Núñez, L.; Velasteguí-Izurieta, H. SARIMA vs. Prophet: Comparative Efficacy in Forecasting Traffic Accidents Across Ecuadorian Provinces. Computation 2026, 14, 5. https://doi.org/10.3390/computation14010005

AMA Style

Chango W, Salguero A, Landivar T, Vásconez R, Silva G, Peñafiel-Arcos P, Núñez L, Velasteguí-Izurieta H. SARIMA vs. Prophet: Comparative Efficacy in Forecasting Traffic Accidents Across Ecuadorian Provinces. Computation. 2026; 14(1):5. https://doi.org/10.3390/computation14010005

Chicago/Turabian Style

Chango, Wilson, Ana Salguero, Tatiana Landivar, Roberto Vásconez, Geovanny Silva, Pedro Peñafiel-Arcos, Lucía Núñez, and Homero Velasteguí-Izurieta. 2026. "SARIMA vs. Prophet: Comparative Efficacy in Forecasting Traffic Accidents Across Ecuadorian Provinces" Computation 14, no. 1: 5. https://doi.org/10.3390/computation14010005

APA Style

Chango, W., Salguero, A., Landivar, T., Vásconez, R., Silva, G., Peñafiel-Arcos, P., Núñez, L., & Velasteguí-Izurieta, H. (2026). SARIMA vs. Prophet: Comparative Efficacy in Forecasting Traffic Accidents Across Ecuadorian Provinces. Computation, 14(1), 5. https://doi.org/10.3390/computation14010005

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

SARIMA vs. Prophet: Comparative Efficacy in Forecasting Traffic Accidents Across Ecuadorian Provinces

Abstract

1. Introduction

2. Background

2.1. Road Safety and Risk Prediction as a Global and Regional Public Health Challenge

2.2. Fundamentals of Classic Time Series Models

2.3. Modern Forecasting Algorithms: The Facebook Prophet Model

2.4. Evaluation and Validation Criteria for Multi-Regional Prediction Models

3. Materials and Methods

3.1. Relevance Phase

3.1.1. Problem Identification

3.1.2. Systematic Literature Review (PRISMA Framework)

3.2. Design and Development Phase (CRISP-DM)

3.2.1. Data Preparation and Exploratory Analysis

3.2.2. Predictive Modeling

3.2.3. Model Evaluation Setup and Visualization

3.3. Rigor Phase—Artifact Evaluation

3.3.1. Performance Metrics

3.3.2. Diebold–Mariano Statistical Test

4. Results

4.1. Coast Region

4.2. Andean Region

4.3. Amazon Region

4.4. Insular Region

4.5. Diebold–Mariano Test Results

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI