Prediction of Traffic Volume Based on Deep Learning Model for AADT Correction

Han, Dae Cheol

doi:10.3390/app14209436

Open AccessHypothesis

Prediction of Traffic Volume Based on Deep Learning Model for AADT Correction

by

Dae Cheol Han

Department of Highway & Transportation Research, Korea Institute of Civil Engineering and Building Technology, Goyang 10223, Republic of Korea

Appl. Sci. 2024, 14(20), 9436; https://doi.org/10.3390/app14209436

Submission received: 19 August 2024 / Revised: 3 October 2024 / Accepted: 10 October 2024 / Published: 16 October 2024

(This article belongs to the Special Issue Intelligent Transportation System Technologies and Applications)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Accurate traffic volume data are crucial for effective traffic management, infrastructure development, and demand forecasting. This study addresses the challenges associated with traffic volume data collection, including, notably, equipment malfunctions that often result in missing data and inadequate anomaly detection. We have developed a deep-learning-based model to improve the reliability of predictions for annual average daily traffic volume. Utilizing a decade of traffic survey data (2010–2020) from the Korea Institute of Civil Engineering and Building Technology, we constructed a univariate time series prediction model across three consecutive sections. This model incorporates both raw and adjusted traffic volume data from 2017 to 2019, employing long short-term memory (LSTM) techniques to manage data discontinuities. A power function was integrated to simulate various error correction scenarios, thus enhancing the model’s resilience to prediction inaccuracies. The performance of the model was evaluated using certain metrics, such as the mean absolute error, the root mean squared error, and the coefficient of determination, thus validating the effectiveness of the deep learning approach in refining traffic volume estimations.

Keywords:

annual average daily traffic volume (AADT); traffic volume; long short-term memory (LSTM); power function; deep learning

1. Introduction

The annual average daily traffic volume (AADT) is a crucial metric in road traffic management extensively used for traffic planning, road infrastructure development, and forecasting traffic demand. It also provides a comparative basis for assessing factors that adversely affect economic growth or quality of life across different cities. The metric is typically used to evaluate the road traffic congestion coefficient, which can highlight instances where the traffic exceeds road capacities, leading to congestion.

Predicting long-term traffic volume requires substantial computational resources and depends heavily on hardware capabilities; thus, short-term forecasting of traffic conditions often takes precedence. Extensive research has been conducted on these short-duration predictions, focusing on approximately one-hour intervals. Despite the development and adoption of various short-term prediction models, their reliability can be compromised by incomplete datasets. Hence, the acquisition of precise and comprehensive raw data is crucial.

Road traffic volume is defined as the total number of vehicles traversing a particular point or segment of a road within a specific time unit. Traffic data collection, which includes differentiating the traffic volume by vehicle type and direction on an hourly basis, is known as a traffic volume survey. Current practices in traffic volume surveys are categorized into occasional and continuous types. Occasional surveys, which provide essential data for analyzing the overall road usage, are performed annually and target specific points and sections. These are further subdivided by the targeted roads—general national roads and, for the October survey, expressways and local roads. Techniques for occasional surveys range from mechanical methods using portable traffic equipment to video surveys with closed-circuit television (CCTV) and manual counts by personnel.

Continuous surveys deploy automatic vehicle classification (AVC) systems equipped with sensors that continuously record traffic data around the clock at designated points. These systems gather detailed information on the vehicle type, direction, and timing. Technologies used in continuous surveys include loop and piezoelectric sensors embedded in the roadway, artificial-intelligence-driven video analysis from CCTV footage, and vehicle license plate recognition systems that integrate CCTV with a vehicle specifications database.

Among the continuous traffic volume survey methods, those utilizing buried sensors encounter several challenges. Frequent sensor damage caused by road wear from heavy vehicles often results in gaps in the traffic volume data. Similarly, video survey methods face data losses arising from equipment malfunctions and adverse weather conditions. Furthermore, the traffic volume survey equipment designated for 365-day measurement suffers from inconsistent maintenance times due to communication disruptions, controller malfunctions, and sensor damage. These factors result in prolonged periods where there are missing or abnormal data, rendering it challenging to provide accurate observed traffic volume data. Consequently, the traffic volume information collected from such continuous survey equipment may not accurately reflect the actual traffic volumes owing to a combination of technical limitations, controller defects, sensor errors, external environmental factors, and communication issues.

This study aims to develop and verify a deep-learning-based traffic volume correction and prediction algorithm to enhance the reliability of high-quality traffic volume information and raw data for use in national infrastructure planning, as defined in Article 102 of the Road Act and under Article 88 of the National Transport System Efficiency Act. The spatial scope of this study encompasses 551 points where general national road traffic volume survey equipment (AVC) is installed to calculate the AADT. The temporal scope includes hourly and daily continuous traffic volume data from the past decade (2010 to 2020) collected in real-time from the AVC equipment installed on general national roads.

The main objective is to correct and predict traffic volume by applying a cyclic model-based algorithm to short-cycle complex seasonal time series information while specifically addressing the issue of frequent missing data during the traffic volume data collection process. Most importantly, this study aims to minimize the out-of-sample issues that commonly occur during model construction by comparing and verifying the actual observed traffic volume data against those obtained using the proposed methodology, thereby ensuring reliable traffic volume information.

Additionally, this study focuses accurately predicting traffic volume, which has various potential applications in the field of road traffic. Specifically, it aims to provide more reliable traffic volume information by addressing the issue of missing data owing to the failure of traffic information collection equipment, thereby contributing to improved travel reliability and enhanced transportation planning and management.

In this study, a power-function-based long short-term memory (LSTM) model algorithm is proposed to enhance the model’s ability to recognize uncertainty and long-term trend patterns that accurately reflect the characteristics of traffic volume.

2. Review of Related Studies

The introduction of advanced traffic volume survey equipment capable of automatically detecting traffic volumes has allowed for addressing the limitations of traditional traffic surveys by reducing outliers and missing values. However, the unpredictability of the traffic volume data due to equipment malfunctions and communication disruptions continues to pose significant challenges. Consequently, extensive research has been conducted to correct and predict the traffic volume data. These studies fall into three primary categories: (1) traffic volume correction studies utilizing historical data, (2) traffic volume prediction studies employing statistical learning algorithms, and (3) traffic-volume-related prediction studies using deep learning techniques.

2.1. Traffic Volume Correction and Methods Based on Historical Data

In Texas and Georgia, USA, the missing rates of traffic volume collection equipment have been reported at approximately 15–90% and 5–14%, respectively. In Alberta, Canada, an analysis over seven years revealed average missing rates ranging from 10% to 40% [1,2]. Although several studies have attempted to address these discrepancies, the methods predominantly rely on historical data from the same locations. These statistical techniques and algorithms, although useful, cannot fully eliminate the uncertainty associated with traffic volume. Relying on historical data for traffic volume correction is time-consuming and often ineffective when traffic patterns undergo significant changes. Moreover, if the reliability of the historical traffic volume data is questionable, corrections based on subjective analyst judgment or experience can lead to distorted interpretations of the time series data.

Mei et al. [3] enhanced short-term traffic volume prediction by integrating historical data with transition matrices and proposing an absorbing Markov chain (AMC) model. In terms of prediction accuracy, their findings suggest that the AMC model outperforms both the seasonal autoregressive integrated moving average (ARIMA) and neural network models.

Myung-Sik et al. [4] developed an algorithm designed to eliminate outliers and correct missing values in the toll collection system data. They determined the algorithm’s applicability to various traffic flow scenarios across different sections of highways during weekdays, holidays, and special event periods.

Ji-Yeon et al. [5] implemented traffic volume corrections using profile-based mean correction, regression correction with time-specific data from similar detectors, and autoregression with complete daily traffic data from similar detectors through significance tests of monthly and daily traffic volume change patterns, comparisons of similar traffic volume change patterns, and cluster analysis of daily average traffic volume change patterns.

Jeong-Yeon et al. [6] reported that historical-data-based correction methods were more accurate than the adjacent point reference method. Among three techniques using historical data, applying the same period’s historical data, regression analysis, and lane usage rates, the replacement of historical data from the same period and regression analysis of front and rear points presented the most reliable estimates.

Seung-Weon and Ju-Sam [7] suggested correcting the missing traffic volumes by calculating the axle correction coefficients derived from the cumulative number of axles detected by piezo sensors. This method exhibited a lower error rate than the spatial trend correction method for specific points.

Ha et al. [8] developed an autoregressive analysis method using traffic volume data from the same time as the target for correction and a seasonal time series analysis correction method using data from the same point as the target. Their findings indicated that the autoregressive method, which considers the current time points, presented fewer errors when compared to the time series method, indicating that present conditions influence the traffic volumes more significantly than historical data.

2.2. Traffic Volume Prediction Based on Statistical Learning

Traffic volume prediction typically utilizes a data-driven approach based on statistical techniques, which can be classified into parametric and nonparametric estimation methods [9]. The parametric techniques include traditional linear and nonlinear regression methods [10,11,12,13], historical average algorithms, moving averages, smoothing techniques [14], and autoregressive linear processes [12,13,15]. The selection of the algorithm depends on the periodicity and variability of the traffic volume data, with the autoregressive moving average (ARMA) and ARIMA being the most commonly employed linear parametric models.

In particular, the ARIMA model is notable for its application to differenced time series, addressing the non-stationarity and lack of mean reversion in the original series through differencing. This model involves detrending and deseasonalizing the data, applying the ARMA model, and subsequently reintegrating the trends and seasonality during the estimation phase. The ARIMA model is known for its effectiveness and is often improved by incorporating seasonal adjustments, as traffic volume data typically exhibit pronounced seasonal trends. This enhancement is captured in the seasonal ARIMA (SARIMA) model, which has been shown to provide more accurate predictions for traffic data with strong seasonal patterns [16].

Nonlinear parametric models, such as the generalized linear mixed models, fuzzy logic, and genetic algorithms, require analysts to make subjective judgments regarding the form of nonlinearity, and their complex estimation processes can deter widespread usage. Spatial statistical methods, such as the Kriging interpolation, are used to predict traffic volumes by utilizing data from spatially related areas. However, these methods are typically suitable only for road segments with strong spatial interrelationships and are considered to be limited in their explanatory power for univariate analysis.

Table 1 lists the studies conducted on traffic volume prediction based on statistical learning algorithms.

2.3. Deep-Learning-Based Traffic Volume Prediction

Existing studies on traffic volume correction and prediction have primarily relied on historical data for corrections and statistical learning algorithms for both correction and prediction. However, with recent advancements in data collection, management, and analysis technologies, there has been an increasing focus on employing various methods to correct missing data and predict future traffic volumes.

Various machine-learning-based anomaly detection techniques have been developed, including classification-based, nearest-neighbor-based, clustering-based, and statistical anomaly detection methods. Additionally, deep-learning-based techniques, which integrate dimensionality reduction with anomaly detection, have been introduced [10].

Yisheng et al. [18] employed a deep learning approach using the stacked auto-encoder model, applying a greedy layer-wise unsupervised learning algorithm for initial deep network pre-training followed by a fine-tuning process to enhance prediction performance.

Fusco et al. [19] analyzed the effectiveness of explicit models, such as dynamic traffic assignment, and implicit (data-driven) models, including artificial neural networks and Bayesian networks, for short-term prediction in urban traffic networks, which are crucial for intelligent transportation system applications. Explicit models make predictions at 15 min intervals, whereas implicit models present predictions at 5 min intervals.

Cho et al. [20] developed deep learning models to predict public bicycle rental volumes, evaluating exponential smoothing, ARIMA, and LSTM models and finding that deep learning models showed superior performance.

2.4. Distinctions from Prior Research

Prior research on traffic volume prediction has predominantly utilized historical data and conventional statistical methods. These approaches typically require substantial time for generating predictions and may fail to forecast volumes for specific traffic segments. With recent advances in computational technology, deep learning techniques have been introduced for predicting traffic volumes. However, to date, no studies have employed LSTM models for this purpose. This study distinguishes itself by enhancing the time series deep learning framework, specifically through modifications to the LSTM forget gate, thereby improving the accuracy of traffic volume predictions.

3. Analysis and Procedure of the Development Model

3.1. Analysis of the Development Model

This study evaluates the traditional statistical models (ARIMA, SARIMA) alongside deep learning models (recurrent neural network (RNN), LSTM, power-function-based LSTM) to select suitable models and propose effective algorithms by addressing their limitations.

(1): ARIMA

The ARIMA model is a well-established time series analysis method that incorporates trends and seasonality by performing seasonal and first differencing, rendering the series stationary, as expressed in Equation (1):

γ_{t} = T_{t} + S_{t} + E_{t} .

(1)

Here,

T

represents the trend,

S

represents the seasonality, and

E

represents the randomness.

The ARIMA model extends a typical ARMA model where, if the first difference is zero, it represents a stationary series. The formula for estimating the ARIMA model, which combines first-differenced ARMA, is expressed in Equation (2),

{γ^{'}}_{t} = I + α_{1} {y^{'}}_{t - 1} + \dots + α_{p} {y^{'}}_{t - p} + e_{t} + θ_{1} e_{t - 1} + \dots + θ_{q} e_{t - q},

(2)

where

e

denotes the error term. However, the ARIMA model is best suited to contexts with short seasonal cycles owing to its structure, which limits its effectiveness in analyzing long seasonal cycles and handling multiple seasonalities.

(2): SARIMA

The SARIMA model integrates the seasonal autoregressive model and seasonal moving average model within the ARIMA framework. The model, as formulated in Equation (3), accommodates the probabilistic characteristics of seasonality (m),

S A R I M A (p, d, q) {(P, D, Q)}_{m},

(3)

where p denotes the non-seasonal autoregressive order, d denotes the degree of differencing, q denotes the non-seasonal moving average order, P denotes the seasonal autoregressive order, D denotes the degree of seasonal differencing, Q denotes the seasonal moving average order, and m represents the number of observations per year (indicating annual seasonality).

The distinction between ARIMA and SARIMA lies in SARIMA’s capacity to encapsulate the seasonal cycle (m), as expressed in Equation (4),

Φ_{p} (B^{s}) Φ_{p} (B) (1 - B)^{d} (1 - B^{s})^{D} Y_{t} = 1 + θ_{Q} (B^{s}) θ_{q} (B) ε_{t},

(4)

where

ε_{t}

denotes the error term (also known as white noise) and

B

represents the backward shift operator. If the traffic volume data cycle is recorded monthly, this equation can be modified to Equation (5):

(1 - Φ_{p}) (1 - Φ_{1} B^{12}) (1 - B) (1 - B^{12}) y_{t} = (1 + θ_{1} B) (1 + θ_{1} B^{12}) ε_{t} .

(5)

(3): RNN

RNN is predominantly utilized for time series analysis and is particularly adept at processing sequential data. A defining feature of the RNN is its state vector (also known as the hidden unit), which forms a singular neural structure for the sequence and retains the memory of all preceding elements. The schematic recursive structure of an RNN incorporating the backpropagation through a time algorithm is presented in Figure 1 [21].

A major assumption for different traditional statistical learning models is the independence among data samples. However, this assumption does not hold for sequential data, where dependencies exist among individual elements over time. This characteristic is particularly relevant to the traffic volume data analyzed in this study, negating the typical neural network advantage of processing each data sample independently.

Within an RNN, the transfer of information through the network’s weight matrices is described by the following equations:

s_{t} = σ ({U x}_{t} + {W s}_{t - 1} + b_{s}),

(6)

h_{t} = s o f t m a x ({V s}_{t} + b_{h}) .

(7)

In RNNs, the weights are designed to handle variable-length sequences. Each timestep receives a new input value, thus updating the hidden state,

s_{t}

. The information is sequentially relayed throughout the network over time, with each prior input stored as memory, thus enhancing the processing of incoming data.

(4): LSTM

LSTM networks, a type of traditional RNN, are designed to address the gradient vanishing problem that occurs during the training processes, particularly when there is a significant time gap between the relevant information and its point of use in long-term trend data. This issue causes the gradient to diminish progressively during backpropagation, substantially reducing the network’s learning capability. In contrast to that of the standard RNNs, the LSTM architecture incorporates a cell state along with the hidden state, which helps in preserving information over longer sequences.

In this study, the LSTM model was employed to compare its predictive power against that of the newly developed power-function-based LSTM model. The standard LSTM model is structured as a relatively simple multi-layered neural network optimized for predicting short-term time steps (e.g., one hour ahead) using a single window. The predictive accuracy of the LSTM model varies with the time steps. Structurally, a linear transformation layer is inserted between the data input and prediction layers owing to limitations of the algorithm. While enabling fast computation, this layer reduces the model’s predictive performance. The model relies primarily on the values and features of the immediately preceding time step (t = 0) to predict the next time step, often failing to capture long-term trends and characteristics effectively.

(5): Proposed Power-Function-Based LSTM Model

This study introduces a power-function-based LSTM model to address the challenges presented by the forget gate in the conventional LSTM models. The employed LSTM model is an RNN that includes an LSTM hidden layer. When dealing with lengthy input streams without explicitly identified start and end points, the state of the LSTM can expand indefinitely, thus potentially destabilizing the network. To overcome this issue, we implemented a method of resetting the memory cell contents before initiating a new sequence. The use of an LSTM with a forget gate aims to prevent instability or interruptions during the learning process.

3.2. Development Model Analysis Procedure

Figure 2 depicts the procedure for developing the deep-learning-based model. Initially, the process involved establishing baseline values for traffic volume data, detecting anomalies in the raw data, and conducting data validation and preprocessing to correct and predict the traffic volume. Subsequently, datasets were organized for anomaly detection in traffic volume data, including a training set exclusively comprising normal values, a validation set with normal values, a secondary validation set containing both normal values and anomalies, and a test set comprising both normal values and anomalies. The third step involved defining the characteristics of the data to ensure appropriate data transfer suitable for each deep learning structure before the actual implementation of the deep learning model. The fourth step addressed the challenge of gradient vanishing during data analysis by setting input sequences for traffic volume prediction to construct the deep learning model. An architecture was then developed to evaluate and validate the predictive power of the devised model.

4. Statistical Learning Verification and Machine Learning Model Analysis and Evaluation

4.1. Statistical Learning Verification Design

Previous studies have commonly employed a preprocessing process to correct data anomalies before training the deep learning models. This approach, although necessary in the absence of actual values, tends to complicate the analysis process and significantly reduces the processing speed. Moreover, the use of simple interpolation methods during the correction phase can degrade the data quality. In this study, where actual values are available, the predictive power was verified without prior data correction, thereby simplifying the analysis process and enhancing efficiency. This study evaluated the model’s performance using certain metrics, such as the mean absolute percentage error (MAPE), the MAE, the root mean squared error (RMSE), and the coefficient of determination (R²), and formulated additional data-missing scenarios for analysis. Figure 3 depicts the design procedure for the time series prediction performance verification.

Scenario 1: Assumes 10% of the data is missing.
Scenario 2: Assumes 50% of the data is missing.

Bayesian optimization was conducted using the GPyOpt 1.2.6 library, thus necessitating the determination of candidate values for each hyperparameter. Values for certain parameters, such as the learning rate and the dropout rate, were established based on the existing literature. Additionally, this study explored three distinct and arbitrarily selected network architecture variations.

For the coding and implementation of experiments, the Python 3.12.4 programming language and Keras 2.3 were employed. Keras is an open-source project that offers a high-level API that supports the implementation of deep neural networks along with libraries, such as TensorFlow 2.10 and Theano 1.0.0. In this study, Keras was utilized in addition to TensorFlow, thus providing advantages in rapid prototyping and experimental flexibility through its simple API. This API supports modular neural network construction via the integration of various layers, activation functions, loss functions, and optimizers, and it includes provisions for most conventional deep learning components. Despite its broad utility, Keras may present limitations for constructing custom or novel solutions, for which TensorFlow may be a more appropriate choice. Notably, Keras incorporates an LSTM implementation featuring a forget gate, as described by Gers et al. (2000) [22].

4.2. Machine Learning Model Analysis and Evaluation

(1): SARIMA Model Analysis Results

In evaluating the SARIMA model, the analysis of sample errors revealed no discernible patterns in the residuals between the estimated and actual values, suggesting the absence of significant heterogeneity issues; therefore, no additional log transformations were necessary. The Ljung–Box (LB) test was conducted to verify the accuracy of the model judgments. The LB test yielded a significance probability of 0.04, which is below the threshold of 0.1, indicating minimal autocorrelation. Consequently, the optimal parameters for the SARIMA model were determined through the auto grid searching method without the need for further log transformations or differencing.

The selection of optimal parameters was guided by Akaike’s information criterion (AIC), the Bayesian information criterion (BIC), and various significance tests. The results are summarized in Table 2.

The overall prediction results indicated excellent performance, averaging approximately 85%; however, there was a notable decline in the prediction accuracy due to long-term missing data from 2019, which adversely affected the model’s ability to reflect the periodicity and trends accurately. Table 3 presents the evaluation results of the SARIMA (4,1,3)(4,0,3) 12 model, revealing the discrepancy between the actual and predicted values, which ranged from a minimum of 3.28% to a maximum of 14.59%. Although the SARIMA model typically uses MAPE to estimate prediction errors, for consistency in comparisons across the statistical, machine learning, and deep learning models, MAPE was converted to MAE. The converted MAE was analyzed to be 0.158.

(2): Prophet Model Analysis

A separate analysis was conducted using the min–max scaling method to standardize the range of data values. Evaluation metrics included RMSE, MAE, and R². The analysis focused on two specific points, as detailed in Table 4. Given its nature as a univariate regression model, R², which indicates the linear fit, serves as a measure of prediction accuracy.

Figure 4 depicts the results of predicting the hourly traffic volume at point 10001 using the Prophet model, where MAE is 0.117, RMSE is 0.147, and R² is 0.914, indicating considerably high predictive performance.

Figure 5 presents the results of predicting the hourly traffic volume at point 10004 using the Prophet model. The blue line represents the actual traffic values, and the yellow line represents the predicted values. For point 10001, the regularity in the past time series patterns (seasonality, periodicity) was confirmed, leading to a high predictive performance, with an MAE of 0.117 and an RMSE of 0.147.

However, for point 10004, irregular past time series patterns presented lower predictive performance, with an MAE of 111.687, an RMSE of 158.050, and an R² of 0.847, when compared to point 10001. This indicates the presence of uncertainties that are not recognized by the data. To address these limitations, analyses utilizing deep-learning-based models were conducted.

(3): LSTM Model

The LSTM model was utilized to evaluate the predictive power of the proposed LSTM model, which incorporates a power function. The predictive accuracy of the LSTM model varies based on the time point, as shown in Figure 6. Owing to the structural limitations of the algorithm, a linear transformation layer is incorporated between the input values (inputs) and the predicted outputs (predictions). This architecture lacks densely nonlinear hidden layers between the input and output layers, which, while enhancing the computational speed, considerably diminishes the predictive performance.

Furthermore, the model predicts the subsequent time step based solely on the values and characteristics of the immediately preceding time step (t = 0). This approach does not adequately capture the long-term trends and characteristics inherent in the data.

(4): Proposed LSTM Model Based on Power Function

Previous efforts to modify the LSTM architectures have aimed at enhancing the inference capabilities for data characterized by long-term trends, typically yielding similar performance levels. However, our analysis identified inconsistencies in the LSTM performance. In particular, within the forget gate, the memory cell,

c_{t}

, which stores past data, is described by Equation (8):

c_{t} = f_{t} ⊙ c_{t - 1} + i_{t} ⊙ \bar{c_{t}} .

(8)

Despite the superior performance of the LSTM models with forget gates over the standard LSTM or RNN models, they frequently suffer significant information loss during the decay process, where non-essential information is discarded using a Sigmoid function. Shivangi et al. [23] suggested an improved decay function using exponential functions to address this issue. However, exponential functions still led to excessive information loss in datasets with pronounced long-term trends. This effect was notably severe in data with substantial long-term dependencies, such as the hourly traffic volume data. Therefore, this study proposes an LSTM model that employs a power function as the decay function within the forget gate, designed to accommodate long-term patterns and reduce unnecessary information decay in the time series data with significant long-term dependencies.

The memory cell in the forget gate is redefined by the power function to reduce the rate of information decay:

c_{t} = c_{0} ⊙ {(t - t_{0} + 1)}^{- p} = (\frac{t - t_{0} + 1}{t - t_{0}}) ⊙ c_{t - 1} .

(9)

To evaluate the absolute predictive performance of the proposed LSTM model, it was compared against those of linear combination models, multi-step combination models, and convolution neural networks. Figure 7 depicts the comparative prediction performance of the power-function-based LSTM model against that of the traditional LSTM model, demonstrating enhanced prediction accuracy for the traffic volume.

To further substantiate these findings, Table 5 presents the prediction performance metrics, demonstrating that the LSTM model achieves an MAE of 0.1288, whereas the power-function-based LSTM model significantly improves this with an MAE of 0.0733. This confirms that the LSTM model developed with the power function exhibits superior predictive performance when compared to the standard LSTM model.

(5): Traffic Volume Prediction Using the Power-Function-Based LSTM

Figure 8 depicts the 24 h traffic volume predictions at various random points and times using the power-function-based LSTM model developed in this study.

Table 6 presents the evaluation results of the LSTM model at point 10001 under two scenarios, each assuming different data missing rates. The analysis indicated that scenario 2, with a higher missing rate, outperformed scenario 1, which had a lower missing rate. This suggests that the power-function-based LSTM model can effectively capture long-term trends and uncertainties in the traffic volume data. However, a higher missing rate does not universally ensure better predictions. If the points or times with significant missing data are randomly selected, the prediction performance at a 50% missing rate might actually deteriorate. Figure 9 and Figure 10 present the results for each scenario.

5. Model Prediction Performance Verification Results

This study aimed to develop an optimal model to address the missing traffic volume data by comparing and analyzing statistical learning, machine learning, and deep learning models. The SARIMA model was employed to account for seasonality in the traffic volume data, the Prophet model was used for univariate predictions specializing in time series estimation, such as traffic volume, and the LSTM model was utilized to address long-term trends and uncertainties.

Table 7 presents the prediction results based on MAE for the analyzed statistical learning, machine learning, and deep learning models. The verification results indicated that the deep learning models performed the best in predicting traffic volume, followed by machine learning models and statistical learning models. Notably, the LSTM model in deep learning exhibited a higher MAE than the Prophet model in machine learning.

To address this problem, this study proposed the power-function-based LSTM model. This modified LSTM model, utilizing the developed power function, achieved superior performance in correcting and predicting traffic volume compared to the existing models.

Table 8 compares the error rates among the statistical learning, machine learning, and deep learning models, revealing that the power-function-based LSTM model demonstrated lower errors when compared to SARIMA (84.18%), Prophet (78.63%), and LSTM (80.47%). The power-function-based LSTM model recorded an MAE of 0.025, indicating the best performance among all evaluated models.

To further verify the prediction accuracy of the proposed power-function-based LSTM model, the AADT at point 10001 was assumed to have a 50% missing rate when compared to the actual values. The verification results demonstrated a difference of approximately 24 vehicles/day using the existing methods, with an error rate of 0.164%. The power-function-based LSTM model exhibited a difference of approximately four vehicles/day, with an error rate of 0.025%.

The accuracy verification for the AADT indicated a difference of 20 vehicles/day and 8760 vehicles/year, demonstrating that the proposed model significantly outperformed existing methods in correcting traffic volume based on actual values. Table 9 presents the accuracy verification results of the power-function-based LSTM model.

6. Conclusions

The challenge of uncertainty in traffic volume data persists as an unavoidable issue for next-generation traffic systems. In particular, sensor-based time series traffic volume data, which accumulate vast amounts of real-time information, consistently exhibit anomalies and missing values despite the sophistication of the collection equipment. Therefore, it is crucial to accurately assess the uncertainty of the collected data and execute appropriate correction tasks.

This study applied statistical learning, machine learning, and deep learning models—all primarily utilized for time series algorithms—to develop and evaluate methods that accurately reflect the characteristics of the traffic volume data and enhance the accuracy of correcting and predicting the missing traffic volume data. The key findings of this study are summarized as follows.

First, a statistical learning model, SARIMA (4,1,3)(4,0,3) 12 [9], effectively captured the periodicity, seasonality, trend, and residual characteristics of traffic volume data. Predictions were also obtained using the Prophet model, a machine-learning-based additive algorithm that performs iterative computations. The SARIMA model predicted traffic volume with an average accuracy of 85%; a comparison of actual values to predicted values exhibited a variation ranging from a minimum of 3.28% to a maximum of 14.59%. For the Prophet model, the performance was evaluated by deleting 10% of the data arbitrarily, revealing an MAE of 0.117, an RMSE of 0.147, and an R² of 0.914 for point 10001 and an MAE of 111.687, an RMSE of 158.050, and an R² of 0.847 for point 10004.

Second, to address the long-term trends and data uncertainties evident in SARIMA and Prophet models, the performance of the LSTM model—a deep learning approach—was assessed for traffic volume prediction. The results indicated that the LSTM model outperformed the SARIMA model but was less accurate than the Prophet model. To enhance the LSTM model’s performance, this study introduced an improved LSTM model using a power-function-based forget gate memory cell. This adaptation aims to reduce information loss over time and more accurately reflect long-term patterns in time series data with high dependencies.

Third, the traffic volume prediction performance of the proposed power-function-based LSTM model was compared with those of the statistical learning model SARIMA, the machine learning model Prophet, and the deep learning model LSTM. The results confirmed that the proposed model achieved the lowest MAE for correcting and predicting the missing traffic volume.

Therefore, the proposed power-function-based LSTM model is expected to significantly contribute to enhancing the efficiency of traffic surveys and statistics. By improving traffic volume corrections and predictions, it can help prepare accurate and reliable road traffic statistics, which are imperative for national planning and decision making. Moreover, its ability to handle time series data collected in various formats from diverse sources demonstrates its versality and potential applications beyond traffic volume analysis, which is another significant achievement of this study.

Funding

This study was prepared with funding from the 2024 Traffic Information Provision System Service (Grant No: 20240266-004). The author extends appreciation to the related organizations for their cooperation.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

This study was conducted using data provided by the Korea Institute of Civil Engineering and Building Technology (Project Title: 2024 Traffic Volume Information Provision System Service), and the author extends appreciation to the related organizations for their cooperation.

Conflicts of Interest

The author declares no conflicts of interest.

References

Satish, S.; Lingras, P.; Zhong, M. Effect of missing value imputations on traffic parameters estimations from permanent traffic counts. In Proceedings of the 80th Annual Meeting, Washington, DC, USA, 9–12 April 1986; Transportation Research Board: Washington, DC, USA, 2003; pp. 1–33. [Google Scholar]
Tim, L.; Shawn, T.; Richard, M. Monitoring Urban Roadway in 2000: Using Archived Operations Data for Reliability and Mobility Measurement; Texas Transportation Institute: College Station, TX, USA, 2001. [Google Scholar]
Mei, H.; Ma, A.; Poslad, S.; Oshin, T.O. Short-term traffic volume prediction for sustainable transportation in an urban area. J. Comput. Civ. Eng. 2015, 1, 11. [Google Scholar] [CrossRef]
Do, M.-S.; Lee, H.-M.; NamKoong, S. Outlier filtering and missing data imputation algorithm using TCS data. Korean Soc. Transp. 2008, 26, 241–250. [Google Scholar]
Lee, J.; Do, M.-S.; Kim, S.-H.; Ryu, S.-K. Real-time adjustment of traffic volume—Based on the national highway Route 3. Korean J. Appl. Stat. 2003, 16, 203–215. [Google Scholar] [CrossRef]
Kim, J.-Y.; Lee, Y.-I.; Baek, S.-G.; Nam, G.-S. A study on the imputation for missing data in dual-loop vehicle detector system. Korean Soc. Transp. 2006, 24, 27–40. [Google Scholar]
Jung, S.-W.; Oh, J.-S. Traffic volume correction system using axle counts from piezo sensors. J. Korea Contents Assoc. 2021, 21, 277–283. [Google Scholar] [CrossRef]
Ha, J.A.; Park, J.H.; Kim, S.H. Missing data imputation using permanent traffic counts on national highways. Korean Soc. Transp. 2007, 25, 121–132. [Google Scholar]
Han, D.; Lee, D.W.; Jung, D. A study on the traffic volume correction and prediction using SARIMA algorithm. J. Korea Inst. Intelligent Transp. Syst. 2021, 20, 1–13. [Google Scholar] [CrossRef]
Korea Institute for Health and Social Affairs. Study on Machine Learning-Based Anomaly Detection Techniques; Korea Institute for Health and Social Affairs: Sejong-si, Republic of Korea, 2018.
Korea Institute of Civil Engineering and Building Technology. Study on Improving and Evaluating the Reliability of ITS Information; Korea Institute of Civil Engineering and Building Technology: Goyang-si, Republic of Korea, 2008.
Steffen, M.; Bartz-Beielstein, T. ‘impute TS: Time Series Missing Value Imputation in R,’ Contributed research article. R J. 2017, 9, 207–218. [Google Scholar]
Zhou, Y.; Meng, P. Diagnosis of causes for high railway traffic based on Bayesian network. Math Modell. Eng. Probl. 2019, 6, 136–140. [Google Scholar] [CrossRef]
Li, L.; Su, X.; Zhang, Y.; Lin, Y.; Li, Z. Trend modeling for traffic time series analysis: An integrated study. IEEE Trans. Intell. Transp. Syst. 2015, 16, 3430–3439. [Google Scholar] [CrossRef]
Hamed, M.M.; Al-Masaeid, H.R.; Said, Z.M.B. Short-term prediction of traffic volume in urban arterials. J. Transp. Eng. 1995, 121, 249–254. [Google Scholar] [CrossRef]
Castro-Neto, M.; Jeong, Y.S.; Jeong, M.K.; Han, L.D. Online-SVR for short-term traffic flow prediction under typical and atypical traffic conditions. Expert Syst. Appl. 2009, 36, 6164–6173. [Google Scholar] [CrossRef]
Van der Voort, M.; Dougherty, M.; Watson, S. Combining Kohonen maps with Arima time series models to forecast traffic flow. Transp. Res. C 1996, 4, 307–318. [Google Scholar] [CrossRef]
Lv, Y.; Duan, Y.; Kang, W.; Li, Z.; Wang, F.Y. Traffic flow prediction with big data: A deep learning approach. IEEE Trans. Intell. Transp. Syst. 2015, 16, 865–873. [Google Scholar] [CrossRef]
Fusco, G.; Colombaroni, C.; Comelli, L.; Isaenko, N. Short-term traffic predictions on large urban traffic networks: Applications of network-based machine learning models and dynamic traffic assignment models. In Proceedings of the International Conference on Models and Technologies for Intelligent Transportation Systems (MT-ITS), Budapest, Hungary, 3–5 June 2015; pp. 93–101. [Google Scholar] [CrossRef]
Cho, K.; Lee, S.S.; Nam, D. Forecasting of rental demand for public bicycles using a deep learning model. J. Korea Inst. Intelligent Transp. Syst. 2020, 19, 28–37. [Google Scholar] [CrossRef]
Won-Jae, J.; Korea Transport Institute. Application of Deep Learning in the Transportation Sector; Korea Transport Institute: Sejong, Republic of Korea, 2019.
Gers, F.A.; Schmidhuber, J.; Cummins, F. Learning to Forget: Continual Prediction with LSTM. Neural Comput. 2000, 12, 2451–2471. [Google Scholar] [CrossRef]
Shivangi, M.; Vo Vy, A.; Turek Javier, S.; Huth Alexander, G. Multi-timescale representation learning in LSTM language models. ICLR 2021, 2021, 1–19. [Google Scholar]

Figure 1. Basic recursive structure of recurrent neural network.

Figure 2. Development of the architecture for deep-learning-based models.

Figure 3. Design procedure for verifying time series prediction performance.

Figure 4. Prophet model’s prediction results (point 10001).

Figure 5. Prophet model’s prediction results (point 10004).

Figure 6. LSTM model’s prediction results (24 h).

Figure 7. Comparison of performances between LSTM and power-function-based LSTM.

Figure 8. Traffic volume prediction results of the power-function-based LSTM (24 h).

Figure 9. LSTM model’s results (point 10001, 10% data missing).

Figure 10. LSTM model’s prediction results (point 10001, 50% data missing).

Table 1. Traffic volume prediction studies based on statistical learning algorithms.

Category	Methodology
Steffen and Bartz-Beielstein [12] Zhou and Meng [13] Korea Institute for Health and Social Affairs [10]	Linear and nonlinear regression
Li et al. [14]	Historical average algorithms Moving average Smoothing techniques
Steffen and Bartz-Beielstein [12] Zhou and Meng [13]	Autoregressive linear processes
Hamed et al. [15] Van der Voort et al. [17]	ARMA ARIMA

Table 2. Optimal parameter estimation results for the SARIMA model.

Model: SARIMA (4,1,3)(4,0,3) 12
Type	Factor	SE Factor	p > \|z\|
Autoregressive (AR) (time lag 1)	−1.1503	0.041	0.000
AR (time lag 2)	−0.1593	0.051	0.002
AR (time lag 3)	0.0759	0.061	0.210
AR (time lag 4)	−0.2939	0.040	0.000
Moving average (MA) (time lag 1)	0.7886	0.035	0.000
MA (time lag 1)	−0.6714	0.034	0.000
MA (time lag 1)	−0.7927	0.034	0.000
AR season (time lag 1)	0.2791	0.309	0.367
AR season (time lag 2)	0.0555	0.353	0.875
AR season (time lag 3)	−0.7049	0.296	0.017
AR season (time lag 4)	−0.1349	0.049	0.006
MA season (time lag 1)	−0.3960	0.307	0.197
MA season (time lag 1)	−0.0459	0.372	0.902
MA season (time lag 1)	−0.7397	0.316	0.019
AIC	6983	Ljung–Box (Q)	0.04
BIC	7068	Log-likelihood	−3201

Source: traffic volume correction and prediction using the SARIMA algorithm, 2021.

Table 3. Evaluation results of the SARIMA (4,1,3)(4,0,3) 12 model.

Date	Actual Values	Predicted	% Difference	Severity (0–3)
1 December 2019	11,307	12,049	−6.568	0
30 November 2019	8161	9352	−14.596	0
29 November 2019	8759	9684	−10.565	1
28 November 2019	8324	7618	8.478	0
27 November 2019	7936	8196	−3.277	0
26 November 2019	8835	7921	10.348	0

Table 4. Evaluation results of the Prophet model.

Point	MAE	RMSE	R²
10001	0.117	0.147	0.914
10004	111.687	158.050	0.847

Table 5. Prediction performance results of LSTM and power-function-based LSTM.

Model	LSTM	Power-Function-Based LSTM
Evaluation metric (MAE)	0.1288	0.0733

Table 6. Evaluation results of the LSTM model at point 10001.

Category	MAE	RMSE	R²
Scenario 1 (10% missing assumption)	0.025	0.036	0.948
Scenario 2 (50% missing assumption)	0.025	0.035	0.953

Table 7. Model prediction performance verification results.

Model		MAE
Statistical learning	SARIMA	0.158
Machine learning	Prophet	0.117
Deep learning	LSTM	0.128
Deep learning	Power-function-based LSTM model	0.025

Table 8. Comparative analysis of error rates among models.

Model		MAE	Comparison
Statistical learning	SARIMA	0.158	84.18%
Machine learning	Prophet	0.117	78.63%
Deep learning	LSTM	0.128	80.47%
Deep learning	Power-function-based LSTM model	0.025	-

Table 9. Accuracy verification results of the power-function-based LSTM model.

Category	Actual Values	Existing Methods	Developed LSTM
AADT (vehicles/day)	14,623	14,647	14,619
Traffic volume difference (vehicles/day)	-	24	4
Total annual traffic volume (vehicles)	5,337,395	5,346,155	5,335,935
Traffic volume difference (vehicles)	-	8760	1460
Error rate (%)	-	0.164	0.025

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Han, D.C. Prediction of Traffic Volume Based on Deep Learning Model for AADT Correction. Appl. Sci. 2024, 14, 9436. https://doi.org/10.3390/app14209436

AMA Style

Han DC. Prediction of Traffic Volume Based on Deep Learning Model for AADT Correction. Applied Sciences. 2024; 14(20):9436. https://doi.org/10.3390/app14209436

Chicago/Turabian Style

Han, Dae Cheol. 2024. "Prediction of Traffic Volume Based on Deep Learning Model for AADT Correction" Applied Sciences 14, no. 20: 9436. https://doi.org/10.3390/app14209436

APA Style

Han, D. C. (2024). Prediction of Traffic Volume Based on Deep Learning Model for AADT Correction. Applied Sciences, 14(20), 9436. https://doi.org/10.3390/app14209436

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Prediction of Traffic Volume Based on Deep Learning Model for AADT Correction

Abstract

1. Introduction

2. Review of Related Studies

2.1. Traffic Volume Correction and Methods Based on Historical Data

2.2. Traffic Volume Prediction Based on Statistical Learning

2.3. Deep-Learning-Based Traffic Volume Prediction

2.4. Distinctions from Prior Research

3. Analysis and Procedure of the Development Model

3.1. Analysis of the Development Model

3.2. Development Model Analysis Procedure

4. Statistical Learning Verification and Machine Learning Model Analysis and Evaluation

4.1. Statistical Learning Verification Design

4.2. Machine Learning Model Analysis and Evaluation

5. Model Prediction Performance Verification Results

6. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI