4.1. Formulation of the SARIMA Framework
To build a reliable forecasting framework, first it is necessary to understand the time-series structure and potential anomalies. Therefore, a box plot, as shown in
Figure 2 and
Figure 3, was constructed to visually inspect the distribution of monthly crash counts for both ADAS and ADS crashes in order to identify potential outliers in the training data.
The monthly ADAS crash time series can be visualized from the above box-plot diagram (
Figure 2). The lowest number of ADAS crashes occurs in February, where the median count is approximately 29, making it the most stable and least dangerous month across the study period. Looking at the broader seasonal pattern, crashes remain relatively moderate during the first quarter (January–March), though March exhibits notable variability, with a wide interquartile range, suggesting inconsistent crash frequency across years. The spring months of April and May show moderate increases, with May recording a consistently higher median of approximately 48. A slight dip is observed from April through June, with relatively tight distributions indicating stable but moderate crash activity. August emerges as a secondary peak month, with elevated crash activity, while July shows relatively high year-to-year variability. The final quarter shows a clear upward trend, with December emerging as the most dangerous month, recording the highest median crash count of approximately 65 or above and a wide interquartile range. The distribution of monthly crashes exhibits a bimodal seasonal pattern, with two distinct peaks observed in January and May–June, and two troughs in February and August, confirming the presence of strong seasonality in the data. Furthermore, no data points were found beyond the whisker boundaries in any month, indicating the absence of statistical outliers in the training dataset. This suggests that the data is sufficiently clean and no outlier treatment or replacement is necessary prior to model development. The ratio between the maximum median monthly value (December, ~65) and the minimum median monthly value (February, ~29) is approximately 2.24, reflecting a substantial seasonal difference, underscoring a strong seasonal pattern present in the ADAS crash data that must be accounted for in forecasting.
The monthly ADS crash time series can be visualized from the box-plot diagram in
Figure 3. The lowest number of ADS crashes occurs in January, where the median count is approximately 10, making it the least active month across the study period. February shows a slight increase, with a tight distribution, indicating consistent but low crash activity. A notable rise is observed in March, with a median of approximately 19 and a narrow interquartile range, suggesting relatively stable and moderate crash frequency. April and May follow a similar moderate trend, while June shows a marginal decline. July and August emerge as the most active months, with August recording the highest median of approximately 21 and the widest interquartile range, reflecting substantial year-to-year variability. September and October show elevated but declining activity, while November and December record the lowest medians in the latter half of the year. There are no visible outliers. The distribution of monthly ADS crashes exhibits a trimodal seasonal pattern, with three discernible peaks observed in March, July, and August, and troughs in January, November, and December. The ratio between the maximum median monthly value (August) and the minimum median monthly value (January) reflects a considerable seasonal difference, underscoring the importance of accounting for seasonal dynamics in any ADS crash forecasting.
To further understand the structural dynamics of monthly crash counts, a preliminary time-series decomposition was performed using an additive model. This method assumes that the observed time series is the sum of three distinct components, which are:
Here, is the trend component (the long-term progression or direction of the series), is the seasonal component (the recurring, periodic fluctuations), and is the residual component, the random noise or irregular variation not explained by trend or seasonality.
Figure 4 shows that for ADAS, monthly crash counts fluctuate between 20 and 70, with significant variability and sharp fluctuations throughout the series. For ADS, as shown in
Figure 5, the series shows counts fluctuating smoothly between 10 and 40. The trend component shows a steady upward trajectory from late 2021 through early 2024, suggesting a gradual increase in ADAS incidents. This could be the result of rising ADAS adoption, improved reporting, or system-specific vulnerabilities. The trend component in ADS also shows an increase in incidents, especially from mid-2022 onward. The seasonal patterns for ADAS and ADS are relatively stable, with consistent peaks and troughs across years. This suggests that for both systems crashes may be influenced by recurring temporal factors, such as weather, traffic density and user behavior. The residual components in both series are approximately centered around zero, with no evident autocorrelation or systematic bias, which suggests that the additive decomposition has effectively captured the underlying trend and seasonal structure, leaving only random noise.
To evaluate the temporal dependency of the ADAS crash and preliminary SARIMA model specification, Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF) plots were generated as shown in
Figure 6 and
Figure 7.
The ACF plot in
Figure 6 shows a gradual decline across lags, with initial values exceeding the confidence interval. This pattern is consistent with Moving Average behavior and may also indicate non-stationarity, supporting the need for differencing. In the PACF plot for ADAS in
Figure 7, a prominent spike at lag 1, followed by rapid decay within the confidence bounds, suggests a short autoregressive structure, likely AR (1). The absence of significant partial autocorrelations beyond lag 1 indicates limited long-range dependency.
Figure 8 and
Figure 9 show the ACF and PACF plots for ADS crashes. The autocorrelation at lag 1 is strong and statistically significant, followed by a gradual decay across subsequent lags. This pattern suggests the presence of Moving Average components and possible non-stationarity, suggesting that differencing may be required. In the PACF plot, a sharp spike at lag 1, with all subsequent lags falling within the confidence bounds, points to a short autoregressive structure, likely AR (1). The clean cutoff after lag 1 supports a low-order AR term. These insights guide the initial parameter bounds for SARIMA grid search and model tuning.
To assess the stationarity of the crash count series, both the Augmented Dickey–Fuller (ADF) and Kwiatkowski–Phillips–Schmidt–Shin (KPSS) tests were applied as shown in
Table 1. These tests offer complementary perspectives, with ADF testing the null hypothesis of a unit root (non-stationarity), while KPSS tests the null hypothesis of stationarity [
36,
37].
The ADF statistics for ADAS in
Table 1 reject the null hypothesis of non-stationarity, denoting that the series is stationary, while the KPSS statistics are close to the rejection threshold, suggesting mild non-stationarity. For the ADS series, both tests suggest non-stationarity.
Guided by decomposition, autocorrelation diagnostics, and stationarity testing, a comprehensive SARIMA grid search was implemented for both datasets. This was achieved by using nested loops over candidate values for (p, d, q) and (P, D, Q), with a seasonal period set to 12. The search included combinations of non-seasonal parameters p [0, 2], d [0, 2], q [0, 2] and seasonal parameters P [0, 2], D [0, 2], Q [0, 2] with a seasonal period of m = 12, which resulted in a total of 729 unique SARIMA combinations for each dataset. Each configuration was fitted using python libraries (i.e., ). Given the ACF and PACF plots, which showed at most one dominant spike and no indication of higher-order lag structure, and the limited length of the training series (30 months), values of (p, d, q, P, D, Q) were restricted to values ≤ 2 to maintain parsimony and avoid over-fitting. Their Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) were recorded to evaluate model fit during training. Each model was then validated on a separate validation set. Predictions were performed on the validation dataset and multiple performance metrics (AIC, BIC, RMSE, MSE, MAE, MAPE and Theil’s U1) were calculated to assess statistical fit and predictive accuracy. The best model was selected based on the lowest AIC and BIC, but only after confirming its predictive ability through validation metrics. This two-stage evaluation ensured that the final model was both statistically efficient and practically reliable for forecasting.
4.3. Model Evaluation
Table 2 shows the top 10 models across all combinations for the ADAS series.
The AIC and validation metrics were used to choose the best model. The best performing SARIMA (p, d, q, P, D, Q, S) model for the ADAS series was (0, 0, 2) (2, 0, 2, 12), selected based on lowest AIC (21.637), BIC (15.103) and strong validation metrics: RMSE: 5.731, MSE:32.84, MAE:4.527, MAPE: 13.687%, and Theil’s U: 0.103, as shown in
Table 2. Although the configuration did not include autoregressive terms, it consistently outperformed other candidates across both statistical and predictive criteria, suggesting that the selected structure was sufficient to capture the series’ underlying dynamics.
The best-performing SARIMA (p, d, q, P, D, Q, S) configuration for the ADS series was (2, 2, 0) (0, 2, 0, 12), selected based on the lowest AIC value of −31.872, BIC −35.793 and robust predictive metrics (RMSE: 5.292, MSE: 28.007, MAE:4.721, MAPE: 13.458%, and Theil’s U: 0.102), as shown in
Table 3. Despite the absence of Moving Average terms in the seasonal component, this model consistently outperformed alternatives across both statistical and forecast accuracy criteria. Its structure effectively captured the underlying seasonal and trend dynamics of the series, offering a parsimonious yet powerful fit.
The
Table 4 below shows Prophet’s accuracy on the validation dataset across four metrics for ADAS. The top 10 models’ results have been shown here. The model was tuned based on three parameters, changepoint prior scale, seasonality prior scale and seasonality mode. The best values of the tuning parameters were changepoint prior scale 0.1, seasonality prior scale 0.1, and the seasonality mode was found to be multiplicative, for which the model yielded an MSE of 7.330, RMSE of 2.710, MAPE of 6.90%, and a Theil’s U1 value of 0.089, reflecting the magnitude and relative scale of forecast deviations. From the previous table, the best SARIMA configuration (0, 0, 2, 2, 0, 2, 12) for ADAS yielded higher error values (MSE of 32.84, RMSE of 5.731, MAPE of 13.687%, and Theil’s U1 of 0.103).
Table 5 below shows the Prophet model’s accuracy on four different metrics of the top 10 models for ADS. The changepoint prior scale of 0.05, seasonality prior scale of 1 and the additive seasonality mode were found to be the most optimized values for the ADS data. The model yielded RMSE of 2.242, MSE of 5.026, Mape of 8.850%, and Theil’s U1 of 0.095.
The results from
Table 2,
Table 3,
Table 4 and
Table 5 suggest that the Prophet model yielded better outputs than the SARIMA model across both datasets in terms accuracy and error metrics. The differences in Theil’s U1 and MAPE further show the model’s robustness in capturing temporal dynamics. As the Prophet model consistently showed higher accuracy across both ADAS and ADS datasets, it was selected for future projection of crashes. To transparently demonstrate Prophet’s performance, a detailed month-wise error analysis was conducted over the validation period for both ADAS and ADS series, as shown in
Table 6 and
Table 7.
The table includes standard error metrics such as absolute error, squared error, etc. The forecasted crash counts are presented in
Table 6 and
Table 7. The tables show a precise error computation and more accurate performance diagnostics. The low MAE, MSE, RMSE, MAPE and Theil’s U1 values show that the Prophet model consistently provided accurate forecasts for both ADAS and ADS crash series. The optimal seasonality mode differed between datasets. The best performance was achieved for ADAS with multiplicative seasonality and for ADS with additive seasonality. This difference reflects the underlying data characteristics, with ADAS crashes showing proportional seasonal variations (multiplicative), whereas ADS crashes show more constant seasonal fluctuations (additive).
4.4. Forecast of Future ADAS Crash Counts
The Prophet model was trained on the dataset from July 2021 to December 2023. The model’s accuracy was validated using predictions on the January–June 2024 validation dataset. The previous section clearly demonstrated that FB Prophet offers the best compromise between managing complexities and providing predictions that are close to actual outcomes. Therefore, to forecast for future data, Prophet has been used. A future forecast has been generated for the next six months (July to December 2024), as shown in
Figure 10. It indicates a rising trend with seasonal fluctuations based on historical data, and the validation predictions reasonably match actual values. The July to December 2024 forecast (red line) suggests that monthly crashes might stay high, possibly between 40 and 80. Although the confidence interval becomes wider, reflecting uncertainty, it still provides a range that prevents expectations of uncontrollable spikes. In practical terms, without specific actions such as better driver training, system improvements, or policy changes, ADAS-related crashes are likely to remain at current levels or increase slightly but predictably.
Figure 11 compares the theoretical quantiles (expected under a normal distribution) with the actual residuals from the Prophet model’s forecasts of ADAS crashes. Most points align closely with the red reference line, which indicates that the residuals are broadly consistent with normality. A slight deviation in the upper tail (right side) suggests that the model may slightly underpredict some high values. To further assess residual behavior, the Ljung–Box test was applied at lag 10, yielding a
p-value of 0.01. This
p-value suggests no statistically significant autocorrelation. This supports the forecasts’ reliability and affirms that the model has captured the major temporal structure in the data.
Figure 12 shows the trend component extracted from the Prophet model for ADAS crash data. The steadily rising blue line indicates a consistent upward trajectory in the underlying crash trend, independent of seasonal or irregular fluctuations. This trend suggests that over time, ADAS-related crashes have been increasing. Possible reasons include, but are not limited to, the broader adoption of the ADAS technology, changing traffic patterns, or system limitations. While the trend does not capture short-term variations, it provides a clear signal that the baseline risk is gradually increasing.
Figure 13 shows the yearly seasonality component of ADAS crash data from the Prophet model. The blue line reveals recurring peaks and troughs. It indicates that crash counts tend to rise sharply around January and mid-year, then drop in between. The pattern exhibits two major seasonal peaks: the highest occurs around early January, reaching approximately +0.43 to +0.45, and a moderate peak in May–June reaches approximately +0.20 to +0.21. Conversely, major troughs appear around February–March (approximately −0.30) and August–September (approximately −0.15), indicating periods when seasonal effects reduce crash counts below the baseline. This cyclical pattern suggests that certain months consistently experience higher numbers of crashes, possibly due to weather, traffic volume, or behavioral factors such as holiday travel or commuting peaks. The amplitude of the seasonal effect ranges from about −0.30 to +0.40, indicating that seasonality can significantly influence monthly crash counts.
The graph (
Figure 14) shows the average annual pattern of ADAS crashes, with the x-axis representing the day of year and the y-axis showing the deviation from the yearly mean crash count. Positive values suggest above-average crash activity, while negative values indicate below-average levels. The pattern exhibits a bimodal distribution with two distinct peaks. The highest peak occurs around early January (~+18) and a moderate peak appears around May–June (approximately +8). Conversely, two major troughs appear around late February–March (approximately −10 to −13) and August–September (~−7) indicating periods when crash activity falls significantly below the annual average. These patterns suggest that early January and June consistently experience higher levels of crash activity, while March and September represent relatively safe periods. The wide amplitude from roughly −13 to +18 highlights the strong influence of intra-year dynamics on crash frequency.
The observed bimodal pattern in ADAS crashes, with a peak in early January and a secondary peak in May–June, may be associated with several plausible external mechanisms. The January peak coincides with post-holiday traffic resumption, increased winter precipitation and reduced road surface friction, which may degrade sensor performance in camera and radar-based ADAS systems. Holiday travel in December–January also increases vehicle miles traveled on unfamiliar routes, potentially straining system capabilities. The May–June secondary peak may correspond to end-of-quarter vehicle delivery cycles and fleet deployment surges, higher summer traffic volumes, and increased highway driving associated with seasonal travel. The troughs in February–March and August–September may reflect post-holiday traffic normalization and a mid-year lull in deployment activity, respectively. It must be emphasized that these are hypothesized mechanisms drawn from domain knowledge and the prior literature. The present univariate framework does not permit causal attribution of these patterns to any specific external factor.
It is important to note that the seasonal patterns identified in this study represent empirically observed temporal regularities derived from univariate time-series analysis. Without the inclusion of external covariates such as monthly vehicle miles traveled (VMT), ADAS fleet size, weather indices (e.g., temperature, precipitation, visibility), or software update logs, these observed peaks and troughs cannot be causally attributed to specific external factors. The hypothesized mechanisms described above including holiday travel patterns, seasonal weather conditions affecting sensor performance, end-of-quarter vehicle delivery cycles, and seasonal testing schedules are plausible explanations based on domain knowledge and industry practices. However, formal attribution would require a multivariate modeling framework that explicitly incorporates such covariates. Future research incorporating these external variables would allow for more definitive causal inferences regarding the drivers of observed seasonality.
The stacked area chart below (
Figure 15) shows the monthly breakdown of ADAS crashes by accident type from July 2021 to July 2024. The chart reveals an upward trend in total crash volume, rising from approximately 20 to 35 crashes per month in 2021 and 2022 to peaks exceeding 65 crashes by early 2024. Three accident types dominate the composition. Unknown/non-contact crashes (dark gray), particularly during peak periods, frontal impact (light blue) and multi-point/complex crashes (orange). Rear-end impacts (light green) and side impacts (purple) also contribute, but to a lesser degree, while corner impacts and vertical impacts remain relatively rare. Notable peaks in total crash volume occur around January 2023 (~65 crashes) and May–June 2024 (~67 crashes), with distinct seasonal fluctuations evident throughout the series. The large portion of unknown crashes indicates that the crash data report should be improved.
4.5. Forecast of Future ADS Crash Counts
The graph below (
Figure 16) shows the ADS crash forecast for the 6 months using the Prophet model. It reveals a rising trend with seasonal fluctuations. Validation predictions closely follow actual values, suggesting the model captures temporal patterns well. The forecast for July to December 2024 suggests monthly crash counts will likely range between 20 and 40. The narrowing confidence interval reflects increased certainty of the predicted values. Unless targeted interventions such as improved driver monitoring, system recalibration or regulatory oversight are taken, ADS-related crashes are expected to rise in the near future.
The Ljung–Box test was applied at lag 10, yielding a
p-value of 0.929 (
Figure 17). This high
p-value suggests no significant autocorrelation, reinforcing the assumption of temporal independence and supporting the robustness of the Prophet model’s forecasts.
The Prophet trend component for ADS in
Figure 18 shows a steady, near linear increase from approximately 11.5 to 29.5 crashes per month from September 2021 to mid-2024, representing a 2.5-fold increase. This upward trajectory suggests growing baseline crash risk. Which could be due to increased ADS deployment, improved reporting and cumulative exposure. The consistent linearity from 2023 to 2024 indicates that ADS crash frequencies may continue to rise in the near future.
The annual seasonality component of the ADS crashes in
Figure 19 shows recurring fluctuations in crash risk throughout the year. There is a trimodal pattern with three distinct peaks. April–May (reaching approximately +4), August (~+3) and October (~+3.5). Conversely, three major troughs appear in January–February (approximately −4), September (transition period around −4), and December (~−7, the deepest dip). These seasonal patterns may correspond to operational cycles such as increased deployment during spring and late summer testing periods, environmental factors such as varying weather conditions, or behavioral trends such as seasonal traffic patterns. The amplitude of seasonal effects, ranging from approximately −7 to +4, indicates that seasonality plays a substantial role in monthly crash variation. This should be considered when planning safety measures. The consistent recurrence of this trimodal pattern across multiple years confirms that these are statistically significant seasonal effects rather than random fluctuations.
The average annual pattern (
Figure 20) reveals the typical within-year crash frequency variation, averaged across all observed years. The curve shows a complex wave-like structure with multiple peaks and troughs, indicating strong intra-year cyclicality. Three distinct peaks emerge in April (~+3.5 to +4), August (~+4, the highest annual peak) and October (~+3.5). This suggests that crash rates significantly exceed the annual average during these periods. Conversely, three major troughs occur in February–March (~−4), September (brief dip during transition), and December (~−7, the deepest annual trough), indicating relatively safe periods. The variation ranging from approximately −7 to +4 shows that these seasonal effects are substantial and should be considered for policy planning. Notably, the trimodal pattern of spring, late summer and fall suggests that ADS crashes are influenced by multiple distinct seasonal risk factors operating at different times. Therefore, necessary measures should be taken considering this.
The trimodal ADS crash pattern, with peaks in April–May, August, and October, may reflect a distinct set of operational drivers compared to ADAS. Spring testing and deployment cycles, which are common among ADS manufacturers conducting expanded road trials ahead of annual reporting periods, may contribute to elevated crash counts in April–May. The August peak could be associated with high summer traffic volumes, heat-related sensor degradation (particularly for LiDAR and thermal camera systems), and intensified operational testing. The October peak may correspond to fall fleet deployment announcements and pre-winter testing campaigns, as well as increased traffic variability from school reopening and seasonal commuting changes. The December trough, the deepest in the ADS series, may partly reflect winter operational restrictions imposed on ADS fleets, reduced deployment, and lower vehicle miles traveled. As with the ADAS analysis, these remain domain-informed hypotheses: the observed patterns are empirical temporal regularities and causal attribution requires future multivariate investigation.
As with ADAS, the trimodal seasonal pattern observed in ADS crashes represents an empirically identified temporal regularity rather than a causally established relationship. However, without monthly VMT data, ADS fleet size estimates, weather indices, and deployment logs, these remain plausible hypotheses rather than empirically validated causal factors. Future work incorporating such covariates in a multivariate Prophet model or hierarchical time series framework would enable the disentanglement of exposure growth effects from intrinsic seasonal risk variations.
The stacked area chart (
Figure 21) shows the monthly distribution of ADS crashes by accident type from July 2021 to October 2024. Frontal impacts (light orange) form the largest and most consistent base layer throughout the observation period. Then rear-end crashes (brown) maintain a substantial and stable contribution. Multi-point/complex crashes (red) show sporadic spikes, particularly during peak periods. Side-impact (pink) and unknown/non-contact cases (yellow) provide moderate contributions. Overall crash volumes show clear peaks around July 2023 and July 2024, suggesting seasonal or operational surges in risk, with total monthly counts reaching approximately 35–40 during peak periods. Corner and vertical impacts remain relatively rare. The chart underscores the need to prioritize mitigation strategies for high frequency crash types, particularly Frontal and Rear-End collisions, while also addressing the mid-year seasonal surges.