Machine Learning Traffic Flow Prediction Models for Smart and Sustainable Traffic Management

Abduljabbar, Rusul; Dia, Hussein; Liyanage, Sohani

doi:10.3390/infrastructures10070155


by Rusul Abduljabbar, Hussein Dia and Sohani Liyanage *

Department of Civil and Construction Engineering, Swinburne University of Technology, Melbourne, VIC 3122, Australia

* Author to whom correspondence should be addressed.

Infrastructures 2025, 10(7), 155; https://doi.org/10.3390/infrastructures10070155

Submission received: 16 May 2025 / Revised: 20 June 2025 / Accepted: 23 June 2025 / Published: 24 June 2025

(This article belongs to the Special Issue Sustainable Road Design and Traffic Management)


Abstract

Sustainable traffic management relies on accurate traffic flow prediction to reduce congestion, fuel consumption, and emissions and minimise the external environmental impacts of traffic operations. This study contributes to this objective by developing and evaluating advanced machine learning models that leverage multisource data to predict traffic patterns more effectively, allowing for the deployment of proactive measures to prevent or reduce traffic congestion and idling times, leading to enhanced eco-friendly mobility. Specifically, this paper evaluates the impact of multisource sensor inputs and spatial detector interactions on machine learning-based traffic flow prediction. Using a dataset of 839,377 observations from 14 detector stations along Melbourne’s Eastern Freeway, Bidirectional Long Short-Term Memory (BiLSTM) models were developed to assess predictive accuracy under different input configurations. The results demonstrated that incorporating speed and occupancy inputs alongside traffic flow improves prediction accuracy by up to 16% across all detector stations. This study also investigated the role of spatial flow input interactions from upstream and downstream detectors in enhancing prediction performance. The findings confirm that including neighbouring detectors improves prediction accuracy, increasing performance from 96% to 98% for eastbound and westbound directions. These findings highlight the benefits of optimised sensor deployment, data integration, and advanced machine-learning techniques for smart and eco-friendly traffic systems. Additionally, this study provides a foundation for data-driven, adaptive traffic management strategies that contribute to sustainable road network planning, reducing vehicle idling, fuel consumption, and emissions while enhancing urban mobility and supporting sustainability goals. 
Furthermore, the proposed framework aligns with key United Nations Sustainable Development Goals (SDGs), particularly those promoting sustainable cities, resilient infrastructure, and climate-responsive planning.

Keywords:

sustainable traffic management; smart road infrastructure; machine learning; bidirectional LSTM; traffic flow prediction; spatial interactions; IoT in traffic systems; environmental impact; predictive maintenance; sustainable mobility

1. Introduction

Sustainable traffic management relies on accurate short-term traffic flow prediction to optimise road network performance, reduce congestion, and minimise environmental impact. Effective forecasting enables proactive traffic control, improving mobility while lowering fuel consumption and emissions. However, due to the non-linear dynamics of traffic flow, accurately predicting flow patterns within short horizons, up to 1 h into the future, remains challenging. These complexities impact the prediction accuracy for short-term forecasting, with significant implications for intelligent transportation system (ITS) applications [1,2]. Although considerable research has been undertaken to improve prediction accuracy for ITS, challenges continue in achieving reliable accuracies across varying traffic conditions.

Traditional models such as ARIMA, the Kalman Filter, and simple neural networks have been widely used, but they have limitations when predicting traffic data, especially with multiple input variables such as speed, flow, and occupancy. ARIMA and the Kalman Filter rely on linear assumptions and are typically suited to single-variable or simple multivariable time series, so they struggle to capture the complex, non-linear relationships and sudden fluctuations characteristic of traffic systems. While the Kalman Filter works well for real-time noise reduction, it struggles with long-term forecasting and with modelling non-linear dynamics. Simple feedforward neural networks can model non-linear relationships but do not naturally capture temporal dependencies, treating inputs independently without considering the sequential nature of traffic flow. In contrast, BiLSTM networks are specifically designed to handle time-series data with long-term dependencies, capturing both past and future temporal patterns due to their bidirectional architecture. They perform better at modelling complex, non-linear interactions between multiple variables, making them significantly more effective for predicting traffic conditions in dynamic, real-world scenarios [3]. In a previous study [4], the authors compared the performance, fault tolerance, and transferability of several machine learning models, including Recurrent Neural Networks (RNNs), Elman networks, unidirectional Long Short-Term Memory (LSTM), Deep Learning Backpropagation (DLBP), and Bidirectional LSTM (BiLSTM), using traffic flow and speed data from Melbourne’s Eastern Freeway. That evaluation concluded that BiLSTM models consistently outperformed the others, particularly under data corruption or noise scenarios.
Therefore, this study builds on that foundation by focusing exclusively on the BiLSTM model and seeking to enhance its predictive performance by optimising input configurations.

This study specifically examines the impact of different input configurations on the accuracy of short-term traffic flow predictions, focusing on the BiLSTM model’s performance. A large dataset was utilised, comprising data from multiple detector stations along the Eastern Freeway in Melbourne, Australia. This dataset, collected over a 31-day period at 1 min intervals, includes 839,377 observations from 14 cleaned detectors and was used to test short-term traffic flow prediction for both travel directions. The observations represent diverse and complex traffic characteristics, such as peak, non-peak, weekday, weekend, incident, and non-incident periods, to capture a broad spectrum of traffic conditions.

This study has two primary objectives. First, it evaluates the effect of different combinations of input variables, such as flow, speed, and occupancy, on the predictive accuracies for 14 detectors for eastbound and westbound travel directions. Second, it explores the influence of spatial interactions by incorporating data from upstream and downstream detectors, assessing whether data from neighbouring detectors can enhance prediction accuracies. This analysis ultimately seeks to identify optimal input configurations for short-term traffic flow prediction, aiming to enhance model adaptability to the diverse and complex traffic patterns encountered in real-world applications.

By optimising traffic flow predictions with data-driven, adaptive models, this research contributes to developing sustainable road networks that are more efficient and environmentally conscious. This study offers insights for designing intelligent, sustainable transportation systems through machine learning and advanced sensor integration.

2. Related Literature

Accurate traffic prediction plays an important role in the effectiveness of ITS, particularly in urban environments where congestion presents a significant challenge. Congestion can be classified into two types: recurrent, such as routine peak-hour traffic, and non-recurrent, which includes incidents such as crashes, adverse weather conditions, or roadworks [5,6,7,8]. Non-recurrent events are especially problematic due to their unpredictable and complex nature [9,10]. Traditional approaches such as the California Algorithm [11] and statistical models like Autoregressive Integrated Moving Average (ARIMA) and Kalman Filters were commonly employed but proved insufficient in capturing the non-linear and dynamic characteristics of traffic flow, particularly on arterial roads [12,13]. This section reviews significant studies focusing on methodologies for traffic flow prediction.

2.1. Parametric Approaches

Parametric methods, such as ARIMA and Kalman Filters, depend on predefined mathematical structures that assume fixed relationships among traffic variables [14,15,16]. These approaches are effective for short-term traffic predictions under stable conditions but struggle to account for non-linear traffic dynamics [17,18,19]. For instance, ARIMA models are commonly used to forecast traffic speeds and volumes by analysing patterns in historical time-series data [20]. However, their performance declines in unpredictable scenarios, such as those caused by weather changes or incidents [12,21].

Despite these shortcomings, parametric models remain widely utilised in traffic prediction. Kalman Filters, in particular, are valuable for real-time traffic monitoring due to their ability to update predictions dynamically using incoming sensor data [22,23]. However, similar to ARIMA, their effectiveness diminishes in highly dynamic settings, such as arterial roads prone to frequent disruptions [24].

In order to address these challenges, newer approaches like the AdaBoost algorithm have been developed for real-time vehicle detection, enhancing the accuracy of incident detection and response [25]. Integrating such algorithms with nonparametric traffic prediction models provides a more robust solution, especially for urban arterial roads where weather and incidents significantly disrupt traffic flow.

2.2. Nonparametric Approaches

Unlike parametric models, nonparametric methods do not rely on a fixed model structure, allowing them to adapt more effectively to the non-linear characteristics of traffic patterns [26,27,28,29]. Techniques such as Neural Networks (NNs) and Support Vector Regression (SVR) have shown remarkable success in processing complex, multisource traffic data, particularly on arterial roads where traditional methods often fall short [30,31,32].

For example, NNs have demonstrated their ability to accurately predict traffic speed and volume by leveraging large datasets and adapting to fluctuating traffic conditions [33,34,35,36,37]. These models are particularly suited for real-time ITS applications, where rapid changes in traffic conditions demand flexibility.

Long Short-Term Memory (LSTM) networks, a specialised form of Recurrent Neural Networks (RNNs) [38,39,40,41], have become a leading tool in traffic forecasting. Their ability to model temporal dependencies in data makes them highly effective for capturing traffic patterns over time [42]. LSTM models have also been used to monitor and evaluate sequential patterns in vehicular live load data collected through Weigh-In-Motion (WIM) systems, offering a consistent method for detecting anomalies and system drift over time [43]. LSTMs are especially advantageous on arterial roads, where external factors like weather and incidents can cause significant variability. By integrating real-time sensor data, LSTMs deliver more accurate and reliable predictions than traditional parametric approaches [44].

2.3. LSTM Models for Traffic Prediction

Predicting short-term traffic conditions, typically up to 60 min ahead, is critical for the success of ITS, such as adaptive traffic management and travel advisory systems. LSTM networks have gained significant attention for traffic prediction because they can utilise real-world data from sources like inductive loop detectors, CCTV, and probe vehicles [45]. Designed to capture long-term dependencies in sequential data, LSTMs excel in forecasting traffic parameters such as speeds, flows, and travel times [42,46].

For instance, Ref. [45] demonstrated that LSTM models achieve greater speed prediction accuracy than traditional methods. Similarly, Ref. [44] highlighted the effectiveness of LSTMs for predicting irregular travel times, showing minimal errors in one-step-ahead forecasts. LSTMs have also been successfully applied to traffic flow predictions, delivering high accuracy across various time horizons. Research by [47,48] showed that LSTMs outperform other models in predicting speed and flow on different road types, including arterial roads, which experience more significant traffic variability.

Moreover, LSTM models have been employed in studies on car-following behaviour, providing insights into vehicle acceleration and deceleration across diverse road networks. These models are particularly beneficial in urban environments, where traffic patterns are influenced by external factors such as weather and incidents [45,49,50,51].

2.4. BiLSTM Models

Although LSTM models have demonstrated strong performance in traffic prediction, BiLSTMs offer an enhanced capability by learning temporal dependencies in both forward and backward directions. This bidirectional processing allows BiLSTM models to excel in predicting traffic speeds and flows, particularly under complex and variable traffic conditions. By accounting for bidirectional dependencies, BiLSTMs provide improved handling of the stochastic nature of traffic flow, resulting in more accurate predictions than their unidirectional counterparts [44].

For example, Ref. [47] proposed an end-to-end deep learning framework with BiLSTM layers for traffic flow forecasting. Their model effectively addressed overfitting challenges and accurately predicted various traffic scenarios. Similarly, a comparative study found that stacked BiLSTM architectures outperformed both Uni-LSTM and standalone BiLSTM models in forecasting network-wide traffic speeds [52].

In summary, while parametric models provide foundational methods for traffic prediction, their limitations in handling dynamic, non-linear data make them less suited to the complexities of urban roads. The flexibility and adaptability of LSTM and BiLSTM models, especially when paired with multisource data, allow them to handle real-time traffic variations more effectively. Therefore, this research focuses on utilising LSTM and BiLSTM models for traffic prediction by integrating data from multiple sources, including neighbouring detectors, to enhance the accuracy and resilience of traffic forecasting in urban settings.

3. Data Description

This study utilised real-world traffic data collected from inductive loop detectors installed along the Eastern Freeway in Melbourne, Australia. This 18-kilometre freeway extends between East Link at Nunawading (eastbound) and Alexandra Parade (westbound). The dataset comprised speed measurements recorded over 31 days, from 1 July to 31 July 2016, for eastbound (EB) and westbound (WB) directions. Data was aggregated at 1 min intervals across all lanes at each site for the entire 24 h period daily.

The information was sourced from multiple detectors distributed along the freeway’s mainline carriageway, with spacing between detectors ranging from 450 to 1070 m. Each detector provided approximately 42,000 data points for key traffic metrics: speed (km/h), flow (vehicle count), and occupancy (percentage of time vehicles occupy the detector). The data was split into two subsets for the analysis: 60% (25,200 observations) for model training and 40% (16,800 observations) for testing and validation.
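The 60/40 partition described above can be sketched as follows. This is a minimal illustration that assumes the split is chronological (the leading 60% of each detector's series for training, the trailing 40% for testing and validation); the text does not state how the split was drawn, and the function name `chronological_split` and the placeholder series are hypothetical.

```python
def chronological_split(series, train_fraction=0.6):
    """Split a time-ordered list into a leading training part and a trailing test part.

    Assumption: the split is chronological; the article does not specify this.
    """
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    cut = round(len(series) * train_fraction)
    return series[:cut], series[cut:]


# Placeholder standing in for one detector's ~42,000 one-minute records.
observations = list(range(42_000))
train, test = chronological_split(observations)
print(len(train), len(test))  # 25200 16800
```

A chronological split keeps future observations out of the training set, which is the usual precaution for time-series forecasting; a random split would be a different design choice.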

Seven detector locations were selected in each direction, resulting in 14 locations covering the freeway’s length. Figure 1 illustrates the positioning of these detectors along the carriageway.

For model development, data was obtained from seven detection stations in the westbound direction (designated as EW1, EW2, EW3, EW4, EW5, EW6, and EW7). These stations were spaced sequentially at intervals of 550, 450, 550, 500, 600, and 600 m. Similarly, data was collected from seven detection stations in the eastbound direction (designated as WE1, WE2, WE3, WE4, WE5, WE6, and WE7). The spacing between the eastbound stations was approximately 1000, 550, 550, 450, 800, and 700 m. The layout of these detection stations is depicted in Figure 2. Table 1 summarises the total dataset used in both directions for the model development.

4. Methodology

The BiLSTM model was assessed using the datasets described in Section 3. This model has demonstrated exceptional accuracy in forecasting traffic conditions in previous studies [47,48]. The BiLSTM architecture consists of four layers: a sequence input layer (with one feature), BiLSTM layers (300 hidden units), a fully connected layer (producing one response), and a regression layer.

Hyperparameters were selected through a preliminary grid search process to optimise the model. Several combinations of learning rate, number of hidden units, batch size, and training epochs were evaluated based on validation performance using mean absolute error (MAE) and root mean square error (RMSE) across a subset of detectors. The final configuration was chosen as it consistently achieved the lowest validation error and best generalisation performance. The selected hyperparameters are gradient decay factor (0.9), initial learning rate (0.005), mini-batch size (128), maximum epochs (300), training optimiser (Adaptive Moment Estimation, Adam), learning rate schedule (piecewise), learning rate drop period (125 epochs), and learning rate drop factor (0.2).
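To make the piecewise schedule concrete, the short sketch below computes the learning rate that would apply at a given epoch under the stated settings (initial rate 0.005, drop period 125 epochs, drop factor 0.2, 300 epochs). It assumes epochs are counted from 1 and that the rate is multiplied by the drop factor once each full period has elapsed; the function name is illustrative and is not part of any toolbox.

```python
def piecewise_learning_rate(epoch, initial_rate=0.005, drop_period=125, drop_factor=0.2):
    """Learning rate for a 1-indexed epoch under a piecewise drop schedule.

    Assumption: the rate is multiplied by drop_factor after every
    drop_period completed epochs.
    """
    if epoch < 1:
        raise ValueError("epoch is 1-indexed")
    completed_drops = (epoch - 1) // drop_period
    return initial_rate * drop_factor ** completed_drops


for epoch in (1, 125, 126, 250, 251, 300):
    print(epoch, f"{piecewise_learning_rate(epoch):.6f}")
```

Under these assumptions the 300 epochs fall into three stages: 0.005 for epochs 1 to 125, 0.001 for epochs 126 to 250, and 0.0002 for the remainder.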

The experiments utilised MATLAB R2019b with the Deep Learning Toolbox functions, including trainNetwork, trainingOptions, and predictAndUpdateState [54].

5. LSTM and BiLSTM

LSTM is an advanced type of RNN designed to address the limitations of traditional RNNs [55,56,57,58]. RNNs often face challenges with vanishing or exploding gradients during training due to using backpropagation algorithms, which affect weight adjustments and make it challenging to model long-term dependencies effectively. In order to overcome this issue, LSTM was introduced.

LSTM employs multiple gates—namely the input gate, output gate, and forget gate—which regulate the flow of information, ensuring that only relevant data is retained in the hidden layers and carried forward for predictions (see Figure 3). This architecture enables LSTM to capture long-term dependencies and outperform traditional RNN models in generating accurate predictions.

In these models, the predicted values are computed using the following equations [59]:

Input gate: $i_{t} = \sigma_{g}(W_{i} X_{t} + R_{i} h_{t-1} + b_{i})$ (1)

Forget gate: $f_{t} = \sigma_{g}(W_{f} X_{t} + R_{f} h_{t-1} + b_{f})$ (2)

Cell candidate: $g_{t} = \sigma_{c}(W_{c} X_{t} + R_{c} h_{t-1} + b_{c})$ (3)

Output gate: $o_{t} = \sigma_{g}(W_{o} X_{t} + R_{o} h_{t-1} + b_{o})$ (4)

where

$\sigma_{g}$ represents the gate activation function and $\sigma_{c}$ the state activation function.
$W_{i}$, $W_{f}$, $W_{c}$, and $W_{o}$ are input weight matrices.
$R_{i}$, $R_{f}$, $R_{c}$, and $R_{o}$ are recurrent weight matrices.
$X_{t}$ is the input at time t.
$h_{t-1}$ is the output from the previous time step (t − 1).
$b_{i}$, $b_{f}$, $b_{c}$, and $b_{o}$ are bias vectors.

The input gate determines which new information should be added to the cell state, while the forget gate controls the removal of previous memory from the cell state. The LSTM’s cell state and output at time t are calculated as follows:

Cell state: $c_{t} = f_{t} \odot c_{t-1} + i_{t} \odot g_{t}$ (5)

Hidden state: $h_{t} = o_{t} \odot \sigma_{c}(c_{t})$ (6)

where ⊙ denotes the Hadamard product (element-wise multiplication of vectors).
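Equations (1) to (6) can be traced step by step in a few lines of code. The sketch below is illustrative only and is not the implementation used in the experiments. It assumes a sigmoid gate activation and a tanh state activation (common defaults, not stated in the text), treats the output of Equation (3) as the candidate term that enters Equation (5), and uses hypothetical names (`lstm_step`, `affine`, `params`) and toy all-zero weights.

```python
import math


def sigmoid(z):
    """Gate activation (assumed to be the logistic sigmoid)."""
    return 1.0 / (1.0 + math.exp(-z))


def affine(W, x, R, h_prev, b):
    """Compute W x + R h_prev + b for list-of-lists matrices and list vectors."""
    return [
        sum(w * xj for w, xj in zip(W_row, x))
        + sum(r * hj for r, hj in zip(R_row, h_prev))
        + b_k
        for W_row, R_row, b_k in zip(W, R, b)
    ]


def lstm_step(x, h_prev, c_prev, params):
    """One LSTM time step following Equations (1)-(6)."""
    p = params
    i = [sigmoid(z) for z in affine(p["W_i"], x, p["R_i"], h_prev, p["b_i"])]    # Eq. (1)
    f = [sigmoid(z) for z in affine(p["W_f"], x, p["R_f"], h_prev, p["b_f"])]    # Eq. (2)
    g = [math.tanh(z) for z in affine(p["W_c"], x, p["R_c"], h_prev, p["b_c"])]  # Eq. (3)
    o = [sigmoid(z) for z in affine(p["W_o"], x, p["R_o"], h_prev, p["b_o"])]    # Eq. (4)
    c = [f_k * c_k + i_k * g_k for f_k, c_k, i_k, g_k in zip(f, c_prev, i, g)]   # Eq. (5)
    h = [o_k * math.tanh(c_k) for o_k, c_k in zip(o, c)]                         # Eq. (6)
    return h, c


# Toy parameters (hypothetical): one input feature, one hidden unit, all zeros.
params = {
    f"{kind}_{gate}": ([0.0] if kind == "b" else [[0.0]])
    for kind in ("W", "R", "b")
    for gate in ("i", "f", "c", "o")
}
h, c = lstm_step(x=[1.0], h_prev=[0.0], c_prev=[1.0], params=params)
print(c, h)
```

With all-zero weights every gate evaluates to 0.5 and the candidate to 0, so the new cell state is simply half of the previous one. This makes the role of the forget gate in Equation (5) easy to see.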

BiLSTM models expand upon a standard LSTM by processing input data in both forward and backward directions. This approach allows the model to capture relationships from both past and future data, offering a more comprehensive understanding of sequential information (see Figure 4).
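The bidirectional idea can be illustrated independently of the cell internals. In the sketch below, a deliberately trivial running-sum recurrence stands in for the LSTM cell so that the output is easy to verify by hand. The names `run_recurrent` and `bidirectional` are hypothetical, and pairing the forward and backward states per time step is one common way of combining them.

```python
def run_recurrent(sequence, step, initial_state):
    """Apply a recurrent step function along the sequence, collecting every state."""
    states = []
    state = initial_state
    for x in sequence:
        state = step(x, state)
        states.append(state)
    return states


def bidirectional(sequence, forward_step, backward_step, initial_state=0):
    """Pair forward-pass and backward-pass states for each time step."""
    forward = run_recurrent(sequence, forward_step, initial_state)
    # Process the reversed sequence, then flip the states back into time order.
    backward = run_recurrent(sequence[::-1], backward_step, initial_state)[::-1]
    return list(zip(forward, backward))


def running_sum(x, state):
    """Stand-in for an LSTM cell: the state is the sum of inputs seen so far."""
    return state + x


paired = bidirectional([1, 2, 3], running_sum, running_sum)
print(paired)  # [(1, 6), (3, 5), (6, 3)]
```

At each time step the first element summarises the inputs up to and including that step, and the second summarises the inputs from that step to the end of the sequence. This is the sense in which a bidirectional layer sees both past and future context within an input window.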

The BiLSTM architecture captures temporal–spatial dependencies in sequential data through its internal memory cells and gating mechanisms, which are specifically designed to retain both short-term and long-term patterns. Each LSTM unit uses three gates (the input gate, forget gate, and output gate) to regulate the flow of information, allowing the model to decide what new information to store (e.g., recurring peak-hour patterns), what past information to forget (discarding irrelevant noise), and what to output at each time step. This enables the model to maintain relevant long-term trends, such as recurring congestion patterns, while also responding to short-term fluctuations like sudden changes in traffic flow. The bidirectional structure further enhances this capability by processing the sequence in both forward and backward directions during training, providing the model with full temporal–spatial context.

The chosen hyperparameters and training strategies play a crucial role in enhancing the BiLSTM model’s ability to learn complex temporal and spatial traffic patterns effectively. A mini-batch size of 128 balances computational efficiency and stable gradient estimation, enabling the model to generalise better across varied traffic conditions. Setting the maximum epochs to 300 allows sufficient training iterations for the model to converge and capture long-term dependencies without overfitting. The use of the Adaptive Moment Estimation (Adam) optimiser helps accelerate convergence by adaptively adjusting learning rates for each parameter, improving the model’s capacity to learn from noisy and complex traffic data. Incorporating a piecewise learning rate schedule with a drop period of 125 epochs and a drop factor of 0.2 gradually reduces the learning rate during training, which stabilises learning by allowing larger initial updates for faster convergence and smaller updates later to fine-tune the model.

In addition, the number of hidden units (300 in our model) controls the capacity to capture complex features, while the initial learning rate (0.005) guides how effectively the model updates during training. Together, these factors optimise the training process, enabling the BiLSTM to more accurately model the intricate temporal sequences and spatial correlations inherent in multi-source traffic flow data, leading to improved prediction performance. Through these mechanisms, the BiLSTM effectively identifies patterns and dependencies in both time and space, which are critical for accurate traffic flow prediction.

6. Experiment 1

6.1. Variable Inputs Testing

In the first experiment, various combinations of flow (veh/h), speed (km/h), and occupancy (%) were tested for all the detectors for 5 min prediction horizons into the future. First, flow was trained and tested as the only input in the BiLSTM model, followed by (speed and flow), (occupancy and flow), and (speed, flow, and occupancy) as the inputs to the BiLSTM model in the variable inputs testing experiment. A representation of the model is provided in Figure 5. This results in four combinations (C1, C2, C3, and C4) as summarised in Table 2 below. The total number of data points used for model development is 216,084 observations for the speed, flow, and occupancy for each detector in the eastbound direction (1,512,588 for all detectors). The total number of observations in the westbound direction for speed, flow, and occupancy for each detector is 143,685 (1,005,795 for all detectors). The detector station WE4 is the main targeted eastbound detector for traffic flow prediction. WE1, WE2, and WE3 are the upstream detectors, and WE5, WE6, and WE7 are the downstream detectors. For the westbound direction, EW4 is the main targeted detector for traffic flow prediction. EW1, EW2, and EW3 are the downstream detectors, and EW5, EW6, and EW7 are the upstream detectors.
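The four input combinations and the 5 min horizon can be expressed as a small data-preparation sketch. This is a hedged illustration, not the authors' pipeline. It assumes 1 min records stored as dictionaries with `speed`, `flow`, and `occupancy` keys, a hypothetical lookback window of 10 steps (the text does not state the window length), and a target equal to the flow 5 steps after the last observed record.

```python
# Input combinations as described in the text (C1-C4).
COMBINATIONS = {
    "C1": ("flow",),
    "C2": ("speed", "flow"),
    "C3": ("occupancy", "flow"),
    "C4": ("speed", "flow", "occupancy"),
}


def build_samples(records, combination, lookback=10, horizon=5):
    """Pair each lookback window of selected features with the flow `horizon` steps ahead.

    Assumptions: `lookback` is illustrative; `horizon=5` matches a 5 min
    prediction horizon on 1 min data.
    """
    features = COMBINATIONS[combination]
    samples = []
    for end in range(lookback, len(records) - horizon + 1):
        window = [[row[name] for name in features] for row in records[end - lookback:end]]
        target = records[end - 1 + horizon]["flow"]
        samples.append((window, target))
    return samples


# Synthetic records (hypothetical values) for 20 consecutive minutes.
records = [{"speed": 100 - t, "flow": t, "occupancy": t / 2} for t in range(20)]
c2_samples = build_samples(records, "C2")
c4_samples = build_samples(records, "C4")
print(len(c2_samples), c2_samples[0][1], len(c4_samples[0][0][0]))  # 6 14 3
```

Only the feature tuple changes between C1 and C4; the target is always the future flow at the same detector, so differences in accuracy can be attributed to the inputs alone.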

In order to develop the BiLSTM model, 60% of the data was used for training (129,650 observations), and 40% of the data was used for testing and validation (86,434 observations) for each traffic feature (speed, flow, and occupancy). The mean absolute percentage error (MAPE) is employed to assess the accuracy of model predictions across various time horizons. It computes the average of the absolute differences between the predicted values (Y1) and the actual observed values (Y), each divided by the actual value and expressed as a percentage:

$\mathrm{MAPE}\ (\%) = \left( \frac{1}{n} \sum_{i=1}^{n} \frac{| Y_{i} - Y1_{i} |}{Y_{i}} \right) \times 100$ (7)

$\mathrm{Accuracy}\ (\%) = 100 - \mathrm{MAPE}$ (8)
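Equations (7) and (8) translate directly into code. The sketch below uses made-up flow values purely for illustration. It also rejects non-positive observed values, since the ratio in Equation (7) is undefined when the observed flow is zero; how such intervals were handled in the study is not stated, so that check is an assumption.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, Equation (7), in percent."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    if any(y <= 0 for y in actual):
        # Assumption: zero-flow intervals are excluded before scoring.
        raise ValueError("observed values must be positive for MAPE")
    total = sum(abs(y - y1) / y for y, y1 in zip(actual, predicted))
    return 100.0 * total / len(actual)


def accuracy(actual, predicted):
    """Accuracy as defined in Equation (8): 100 minus MAPE."""
    return 100.0 - mape(actual, predicted)


# Hypothetical observed and predicted flows (veh/h).
observed = [100.0, 200.0, 400.0]
forecast = [90.0, 210.0, 400.0]
print(f"{mape(observed, forecast):.2f} {accuracy(observed, forecast):.2f}")  # 5.00 95.00
```

Because each error is scaled by the observed value, the same absolute error weighs more heavily at low flows, which is worth bearing in mind when comparing detectors with different traffic levels.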

6.2. Experiment 1 Results (Eastbound)

The results of the four input combinations are presented in Table 3, showing that adding speed and occupancy as inputs generally improves flow prediction accuracy across all eastbound detectors.

For WE1 and WE2, the highest accuracies are achieved when speed is added to the flow input (C2: 97.82% and 97.93%, respectively), while adding only occupancy (C3) results in the lowest accuracies. A similar pattern is seen with WE5, where the flow input alone yields 86.19% accuracy, which improves markedly to 97.08% with the addition of speed.

For detectors WE3 and WE4, the best performance is observed when both speed and occupancy are included alongside flow (C4), achieving 97.68% and 96.80%, respectively. Notably, WE3 sees a slight decrease in accuracy when only speed is added (C2), suggesting the complementary value of combining both additional inputs.

Detector WE6 starts with the lowest baseline (C1: 81.05%) but shows significant improvements across all combinations, with the highest performance recorded when only occupancy is added (C3: 96.48%), closely followed by the full combination (C4: 96.20%).

For WE7, the flow-only model already performs well (97.29%), and adding both speed and occupancy (C4) slightly improves the result to 97.69%. C3 (flow with occupancy only) again yields the lowest performance.

The best-performing combinations are shaded green in Table 3, while the lowest performances are highlighted in yellow.

Figure 6 compares the prediction accuracies for each of the seven eastbound detectors under the four input combinations (C1–C4). This visualisation highlights the consistent improvements observed with the inclusion of speed and occupancy, especially under C4.

6.3. Experiment 1 Results (Westbound)

The results for the seven westbound detectors are shown in Table 4. In most cases, adding one or more variables to the flow improves prediction accuracy.

EW1, EW3, EW5, EW6, and EW7 show consistent improvements when additional variables are added to the flow. The highest accuracies for EW1 (96.60%), EW5 (96.52%), EW6 (96.58%), and EW7 (97.54%) occur with all three variables (C4), while EW3 performs best with flow and speed only (C2: 97.36%).

In contrast, EW2 and EW4 are more sensitive to input selection. For EW2, adding speed alone (C2) yields the best performance (95.90%), whereas adding all three inputs (C4) reduces accuracy to 90.07%. Similarly, for EW4, adding speed or occupancy individually degrades performance (C2: 83.27%, C3: 74.74%), while the full combination (C4) slightly improves over the flow-only input (93.43% vs. 92.43%).

As in Table 3, the best-performing combinations are shaded green, and the worst-performing ones are in yellow.

A similar comparison for westbound detectors is shown in Figure 7. The visual clearly demonstrates the varying degrees of benefit from different input combinations across detectors, highlighting that while C4 often yields the best results, some detectors perform best under C2 or C3 due to site-specific traffic dynamics.

7. Experiment 2

7.1. Neighbouring Detector Impacts

In this experiment, 32 short-term traffic BiLSTM models were trained and tested with different input settings in both the eastbound and westbound directions. The purpose is to test the impact of the upstream and downstream detectors on the flow prediction performance.

In this experiment, detectors WE4 (eastbound) and EW4 (westbound) were selected as the targeted detectors, and the improvement of flow prediction was investigated using the neighbouring detectors around those two targeted detectors. For the eastbound direction, the inputs of each BiLSTM model include the historical flow (veh/h) observations from the following detectors (WE1, WE2, WE3, WE4, WE5, WE6, and WE7). The expected output is the flow (veh/h) for the targeted detector WE4 at a prediction horizon of 5 min into the future.

For the westbound direction, the inputs of each BiLSTM model include the historical flow (veh/h) observations from the following detectors (EW1, EW2, EW3, EW4, EW5, EW6, and EW7). The expected output is the flow (veh/h) for the targeted detector EW4 at a prediction horizon of 5 min into the future. Figure 8 illustrates the model.

7.2. Experiment 2 Results (Eastbound)

The results of all model combinations are presented in Table 5. These indicate that incorporating flow data from the neighbouring detectors generally improves short-term traffic flow prediction accuracy in the eastbound direction.

Starting with only the target detector WE4 (Model 1), the prediction accuracy was 96.01%. Adding downstream detectors progressively improved performance: WE3 alone (Model 2) increased accuracy to 96.84%, while WE2 and WE3 (Model 4) yielded 96.66%. Including three downstream detectors (WE1, WE2, and WE3 in Model 7) further improved accuracy to 97.67%.

Upstream inputs had an even stronger influence. Adding WE5 (Model 3) raised the accuracy significantly to 98.16%. Extending this to WE5 and WE6 (Model 6) produced 97.26%, and including WE7 as well (Model 14) gave the best result, 98.22%, marked in green in Table 5.

Mixed upstream and downstream combinations showed varying results. Models 5, 10, and 12, each including two upstream and two downstream detectors, produced accuracies of 97.32%, 97.57%, and 97.70%, respectively. However, not all combinations were beneficial: Model 8, using WE1–WE3 and WE5, resulted in a lower accuracy of 94.32%.

In general, models that included a balanced and broader range of upstream and downstream detectors tended to perform better. The poorest-performing model was Model 8 (94.32%), while the best-performing was Model 14 (98.22%).

7.3. Experiment 2 Results (Westbound)

The westbound prediction results, shown in Table 6, similarly demonstrate the value of including data from neighbouring detectors.

With only the target detector EW4 (Model 1), the prediction accuracy was 92.43%. Adding downstream detectors like EW3 (Model 2) improved it to 93.81%, while including both EW2 and EW3 (Model 4) raised it to 96.77%. Three downstream detectors (EW1, EW2, and EW3 in Model 7) achieved 94.99%.

Adding upstream detectors showed strong effects: EW5 alone (Model 3) improved accuracy to 96.23%, and the combination of EW5–EW6–EW7 (Model 10) reached the highest performance of 97.32%. This result is highlighted in green in Table 6.

Combinations of upstream and downstream detectors, such as in Models 9, 10, and 14, produced high accuracies ranging from 96.32% to 97.12%. As with the eastbound results, a broader input range tended to yield better accuracy, though not all combinations improved results equally. For example, Model 8, with upstream and downstream detectors, only reached 94.18%.

8. Relevance of Prediction Framework to the United Nations’ Sustainable Development Goals

The integration of machine learning for traffic flow prediction in urban transport management aligns strongly with the United Nations’ Sustainable Development Goals (SDGs). This study used the SDG framework to analyse how each dimension of the proposed traffic flow prediction system, comprising advanced deep learning models, multisource data integration, and practical urban transport applications, relates to broader urban sustainability challenges. The SDGs were selected from among various frameworks (such as the New Urban Agenda or the Global Sustainable Mobility Initiative) for their comprehensive global vision, integrative structure, and institutional relevance. Their focus on sustainable cities, climate-responsive planning, and infrastructure innovation aligns directly with the research problem addressed in this study. Through the application of advanced deep learning models such as LSTM and BiLSTM, combined with multisource data fusion and environmental consideration, the proposed research contributes directly and indirectly to several SDG targets related to sustainable infrastructure, climate resilience, and inclusive mobility.

Following a critical review of traffic forecasting literature, urban mobility strategies, and smart city applications, the authors examined how the elements of the proposed prediction framework map onto specific SDG targets. Table 7 systematically maps the core dimensions of the traffic flow prediction approach, including methodological design, data integration, model innovation, and expected outcomes, to the relevant SDG targets and highlights the specific mechanisms of impact. The direct connections include targets related to transport safety (SDG 3.6), sustainable infrastructure (SDG 9.1), inclusive access (SDG 11.2), and climate-responsive policy integration (SDG 13.2). Indirect contributions are also evident, such as enhancing awareness (SDG 12.8), enabling inclusive decision-making (SDG 16.7), and promoting technological capacity (SDG 17.6).

Figure 9 visualises these cross-dimensional relationships, highlighting both the strong direct relationships (solid lines) and the broader indirect associations (dotted lines) between the traffic flow prediction framework and key SDG targets. The strongest alignment was observed with SDG 11 (sustainable cities), SDG 9 (infrastructure), SDG 13 (climate action), and SDG 17 (technology and governance), with supporting roles for SDG 3 (health), SDG 12 (responsible consumption), and others.

This alignment underscores that AI-driven traffic flow prediction is both a tool for enhancing operational transport efficiency and a strategic enabler of equitable, low-emission, and resilient urban futures. However, as Brussel et al. [72] found, current SDG indicators may underrepresent the complexity of real-world transport accessibility and behavioural patterns. Despite this, the SDG framework remains a valuable reference point for aligning transport innovations with global policy and sustainability discourse.

9. Discussion of Results

The above results demonstrate that multisource traffic data inputs, such as occupancy, flow, and speed from the same detector, can improve the accuracy of the flow prediction model. For the westbound direction, the accuracy improvement ranges from 1% to 6% for all detectors. Using speed, occupancy, and flow together as inputs produced the highest accuracy for five of the seven westbound detectors. The remaining two detectors achieved their highest accuracy when speed and flow were used together as inputs to the model. Moreover, adding occupancy alone to flow produced the lowest improvement in model performance for most detectors in both travel directions. Overall, the accuracy improvement reaches up to 15% for all detectors in the eastbound direction.

In the eastbound direction, using either (speed, occupancy, and flow) or (speed and flow) together as inputs produced the highest accuracy for six detectors. The remaining detector achieved the highest accuracy when speed and occupancy were tested together as inputs to the model. However, for the other six detectors, adding occupancy generally resulted in the lowest improvement in model performance.
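The per-detector comparison above can be reproduced directly from the accuracy values in Tables 3 and 4. The short sketch below identifies the best-performing combination for each detector and its gain over C1. Treating C1 as the baseline is an assumption made here for illustration, and the function names are hypothetical.

```python
# Accuracy (%) per combination (C1, C2, C3, C4), copied from Tables 3 and 4.
EASTBOUND = {
    "WE1": (95.22, 97.82, 88.31, 96.63),
    "WE2": (97.37, 97.93, 85.18, 96.89),
    "WE3": (96.77, 94.60, 86.74, 97.68),
    "WE4": (96.01, 93.59, 89.12, 96.80),
    "WE5": (86.19, 97.08, 88.27, 91.78),
    "WE6": (81.05, 95.14, 96.48, 96.20),
    "WE7": (97.29, 96.38, 93.94, 97.69),
}
WESTBOUND = {
    "EW1": (90.81, 96.10, 94.99, 96.60),
    "EW2": (88.19, 95.90, 92.03, 90.07),
    "EW3": (93.19, 97.36, 96.43, 96.11),
    "EW4": (92.43, 83.27, 74.74, 93.43),
    "EW5": (90.96, 96.49, 96.15, 96.52),
    "EW6": (91.11, 95.17, 82.49, 96.58),
    "EW7": (95.92, 96.85, 90.73, 97.54),
}
COMBOS = ("C1", "C2", "C3", "C4")


def summarise(table):
    """Map each detector to (best combination, its accuracy, gain over C1).

    The gain is in percentage points; C1 as baseline is an assumption.
    """
    summary = {}
    for detector, row in table.items():
        best = max(range(len(row)), key=lambda i: row[i])
        summary[detector] = (COMBOS[best], row[best], round(row[best] - row[0], 2))
    return summary


for name, table in (("Eastbound", EASTBOUND), ("Westbound", WESTBOUND)):
    for detector, (combo, acc, gain) in summarise(table).items():
        print(f"{name} {detector}: best={combo} ({acc:.2f}%), gain={gain:+.2f} points")
```

Expressing the gains in percentage points rather than relative percentages keeps them directly comparable with the accuracy figures quoted in the text.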

In addition, the results above demonstrate that multisource traffic flow data input from the neighbouring detectors improved the accuracy of the flow prediction model. The accuracy improvement ranges from 1% to 5% for the westbound direction. The most significant improvement was achieved when flow data from three neighbouring upstream detectors was added, with an accuracy of 97.32% compared to 92.43% when the flow was predicted from a single detector flow input.

Using traffic flow data from the adjacent detectors improved the accuracy by around 2% for the eastbound direction. The most significant improvement resulted when flow data from two neighbouring upstream detectors and three neighbouring downstream detectors was added, with an accuracy of 98.22% compared to 96.01% when the flow was predicted from a single detector flow input.
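The size of the neighbouring-detector effect can be checked against the model accuracies listed in Tables 5 and 6. The sketch below computes, for each direction, the best model and the spread of gains relative to Model 1 (target detector only). The function name `neighbour_gains` is hypothetical, and gains are expressed in percentage points.

```python
# Accuracy (%) for Models 1..16, copied from Table 5 (eastbound) and Table 6 (westbound).
EASTBOUND_MODELS = [96.01, 96.84, 98.16, 96.66, 97.32, 97.26, 97.67, 94.32,
                    96.07, 97.57, 97.06, 97.70, 96.48, 98.22, 97.45, 96.35]
WESTBOUND_MODELS = [92.43, 93.81, 96.23, 96.77, 95.80, 95.43, 94.99, 94.18,
                    97.12, 97.32, 94.82, 94.62, 96.32, 96.73, 96.11, 96.43]


def neighbour_gains(accuracies):
    """Compare Models 2..N with Model 1 (single-detector baseline)."""
    baseline = accuracies[0]
    gains = {
        model: round(acc - baseline, 2)
        for model, acc in enumerate(accuracies[1:], start=2)
    }
    best_model = max(gains, key=gains.get)
    return {
        "best_model": best_model,
        "best_gain": gains[best_model],
        "smallest_gain": min(gains.values()),
    }


print("Westbound:", neighbour_gains(WESTBOUND_MODELS))
print("Eastbound:", neighbour_gains(EASTBOUND_MODELS))
```

A negative `smallest_gain` indicates that at least one neighbouring-detector configuration performed below the single-detector baseline, which is consistent with the observation that not all combinations improve results equally.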

The results demonstrate that the most accurate flow predictions are obtained when speed, flow, and occupancy inputs are used from the same detector to train the model. Excluding the speed or occupancy from the inputs leads to a deterioration in flow prediction performance. The results also demonstrate the direct influence of using additional inputs from upstream detector stations and show that, generally, these additional inputs result in better prediction accuracies.

These findings validate the effectiveness of incorporating multiple features and spatially distributed data in traffic flow prediction and carry significant implications for sustainable urban transport development. The improved accuracy in predicting flow can contribute to more efficient traffic control, reduced congestion, and better utilisation of existing infrastructure, aligning directly with Sustainable Development Goal (SDG) 9.1, which promotes reliable and sustainable infrastructure.

From a broader sustainability perspective, the following applies:

The enhancement of prediction accuracy using multisource and spatial data supports the development of intelligent transport systems, which are important for SDG 11.2 (providing access to safe, affordable, and sustainable transport systems for all).
More accurate flow predictions also enable traffic management systems to proactively adjust control measures, reduce emissions from congestion, and promote fuel efficiency, supporting SDG 13.2, for climate-related planning and the reduction of greenhouse gas emissions.
Indirectly, these improvements can reduce exposure to traffic-related air pollution, contributing to SDG 3.9 (reducing illnesses from hazardous air and environmental pollution), and enhance safety by anticipating congestion-related incidents in support of SDG 3.6 (reducing the number of global deaths and injuries from road traffic accidents).
Finally, integrating such high-quality, granular data into model development aligns with SDG 17.18, highlighting the importance of timely, reliable, and disaggregated data for decision-making.

In summary, the observed improvements in prediction accuracy, whether through data variety (speed, flow, occupancy) or spatial diversity (input from upstream/downstream detectors), highlight the potential of advanced machine learning models to support technical transport outcomes and broader sustainability goals.

Practical Implications and Limitations

The results presented in this study demonstrate the potential of incorporating multisource and spatially distributed traffic data to significantly improve short-term flow prediction accuracy using BiLSTM models. The practical implementation of such models in real-world traffic management systems, however, warrants further discussion.

The enhanced accuracy of flow predictions offers substantial benefits for existing traffic management practices. For example, integrating the predictions into adaptive traffic signal control systems could enable dynamic optimisation of signal timings based on predicted demand, reducing congestion and making more efficient use of road infrastructure. Similarly, predictive outputs from the model could inform real-time traveller information systems, allowing road users to receive early warnings about potential delays and make informed route or departure time decisions. Short-term flow predictions can also support incident management by detecting irregular traffic patterns earlier than traditional threshold-based methods, enabling a prompt operator response.
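As an illustration of the incident management use case, the sketch below flags intervals in which the observed flow departs from the predicted flow by more than a relative tolerance for several consecutive intervals. This is a hypothetical residual check and not a method evaluated in this study; the function name, the 25% tolerance, and the two-interval persistence rule are assumptions chosen for illustration.

```python
def flag_irregular(observed, predicted, rel_tol=0.25, min_run=2):
    """Return indices of intervals where |observed - predicted| / predicted
    exceeds `rel_tol` for at least `min_run` consecutive intervals.

    Thresholds are illustrative assumptions, not calibrated values.
    """
    flagged, run = [], []
    for i, (obs, pred) in enumerate(zip(observed, predicted)):
        deviates = pred > 0 and abs(obs - pred) / pred > rel_tol
        if deviates:
            run.append(i)
            continue
        if len(run) >= min_run:
            flagged.extend(run)
        run = []
    if len(run) >= min_run:  # close a run that reaches the end of the series
        flagged.extend(run)
    return flagged


# Toy example: flow (veh/h) drops well below the prediction for two intervals.
observed = [1000, 980, 600, 580, 990]
predicted = [1000, 1000, 1000, 1000, 1000]
print(flag_irregular(observed, predicted))
```

Requiring a deviation to persist over consecutive intervals is one simple way to avoid reacting to a single noisy reading; in practice the tolerance would need to be tuned against the prediction error observed at each site.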

However, transitioning from an experimental study to operational use involves several challenges. Real-time performance is essential; the model must deliver accurate predictions with minimal delay. This requires efficient data preprocessing pipelines and computationally lightweight inference procedures that can run in parallel with existing control systems. Moreover, the reliability of input data plays a central role in maintaining prediction accuracy. Incomplete or noisy data from traffic detectors can degrade the quality of model outputs; therefore, data cleaning methods are necessary as part of the operational framework.
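A minimal sketch of one possible cleaning step is shown below, assuming that missing readings are represented as `None` and that physically implausible values (for example, negative flows) should be treated as missing. Gaps are filled by linear interpolation between the nearest valid readings. The function name and the bounds are hypothetical, and this is not the preprocessing used in the study.

```python
def clean_series(values, lower=0.0, upper=None):
    """Replace missing or out-of-range readings by linear interpolation.

    A reading is invalid if it is None, below `lower`, or above `upper`
    (when given). Gaps at either end are filled with the nearest valid value.
    """
    def is_valid(v):
        return v is not None and v >= lower and (upper is None or v <= upper)

    valid_idx = [i for i, v in enumerate(values) if is_valid(v)]
    if not valid_idx:
        raise ValueError("no valid readings to interpolate from")

    cleaned = list(values)
    for i, v in enumerate(values):
        if is_valid(v):
            continue
        prev = max((j for j in valid_idx if j < i), default=None)
        nxt = min((j for j in valid_idx if j > i), default=None)
        if prev is None:
            cleaned[i] = values[nxt]
        elif nxt is None:
            cleaned[i] = values[prev]
        else:
            weight = (i - prev) / (nxt - prev)
            cleaned[i] = values[prev] + weight * (values[nxt] - values[prev])
    return cleaned


# Toy flow series (veh/h) with two dropouts and one negative reading.
raw = [None, 100, None, None, 160, -5, 180]
print(clean_series(raw))
```

Linear interpolation is only reasonable for short gaps; longer outages would typically call for a different strategy, such as falling back on historical profiles or excluding the affected period.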

Another key issue is the adaptability of the model over time. Traffic patterns are influenced by a wide range of dynamic factors, such as seasonal trends, roadworks, and changes in demand. Therefore, periodic retraining of the model using up-to-date data is essential to maintain its accuracy in a continuously evolving traffic environment. Future development could explore real-time learning approaches or feedback mechanisms that enable incremental model updating without full retraining.
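One simple way to decide when retraining is due is to track the prediction error over a rolling window and raise a flag once it exceeds a tolerance. The sketch below illustrates this idea with a rolling mean absolute percentage error. The class name, the window length, and the 5% threshold are assumptions for illustration only and would need to be set by the operating agency.

```python
from collections import deque


class DriftMonitor:
    """Track a rolling mean absolute percentage error (MAPE) of predictions.

    `window` is the number of recent intervals considered and
    `threshold_pct` is the rolling MAPE above which retraining is suggested.
    Both defaults are hypothetical.
    """

    def __init__(self, window=288, threshold_pct=5.0):
        self.errors = deque(maxlen=window)
        self.threshold_pct = threshold_pct

    def rolling_mape(self):
        return sum(self.errors) / len(self.errors) if self.errors else 0.0

    def needs_retraining(self):
        # Only decide once the window is full, to avoid reacting to a few samples.
        window_full = len(self.errors) == self.errors.maxlen
        return window_full and self.rolling_mape() > self.threshold_pct

    def update(self, observed, predicted):
        """Record one interval and report whether retraining is suggested."""
        if observed > 0:  # skip zero-flow intervals, where MAPE is undefined
            self.errors.append(abs(observed - predicted) / observed * 100.0)
        return self.needs_retraining()


monitor = DriftMonitor(window=3, threshold_pct=5.0)
for obs, pred in [(100, 98), (100, 97), (100, 99), (100, 80)]:
    print(monitor.update(obs, pred), round(monitor.rolling_mape(), 2))
```

A monitor of this kind only signals that performance has degraded; whether to retrain fully or update the model incrementally remains a separate design decision.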

Future work should include collaboration with transport agencies to pilot the model in operational settings, assess scalability, and ensure integration into existing traffic management platforms. Such partnerships will help bridge the gap between model development and practical deployment, increasing the value of predictive models in real-world scenarios.

10. Conclusions

This study has demonstrated the effectiveness of using multisource traffic data inputs to improve short-term traffic flow prediction accuracy, a key aspect of sustainable traffic management. By incorporating occupancy, flow, and speed data from individual detectors, the prediction accuracy improved significantly for both eastbound and westbound directions. The results indicate that integrating multiple input variables enhances model performance, particularly for specific input combinations. In the westbound direction, accuracy improvements between 1% and 6% were observed for all detectors, with the most substantial improvements achieved when speed, occupancy, and flow were combined. Notably, five westbound detectors achieved their highest accuracy with this combination, while the other two stations reached optimal accuracy when speed and flow were used without occupancy. On the other hand, adding only occupancy to flow produced minimal improvements, highlighting that the combined use of speed and flow is particularly valuable for accurate traffic flow predictions in the westbound direction.

In the eastbound direction, the findings show that incorporating occupancy, speed, and flow yields an accuracy improvement of up to 15% across all detectors, underscoring the benefit of using multisource data inputs. Similarly, spatial input interactions from neighbouring detectors were found to be beneficial, especially for predicting westbound traffic flows, where incorporating data from three upstream detectors improved accuracy from 92.43% to 97.32%. For the eastbound direction, leveraging data from two upstream and three downstream neighbouring detectors boosted accuracy to 98.22%, compared to 96.01% when using single-detector input.

In summary, this study emphasises the effectiveness of multisource data and spatial interactions for enhancing traffic flow predictions in both travel directions. By identifying optimal input configurations, mainly through the inclusion of speed and occupancy, this research offers a valuable approach for refining short-term traffic prediction models. The study also highlights the value of machine learning and multisource data integration in reducing congestion, lowering emissions, and improving mobility, contributing to the development of sustainable, data-driven traffic management systems. This research supports future strategies for intelligent transportation systems that encourage efficient, environmentally friendly, and resilient road networks, aligning with the sustainability goals for urban mobility.

Additionally, the study incorporated a Sustainable Development Goals (SDGs) analysis, highlighting how improved traffic flow predictions contribute to SDG 9 (infrastructure), SDG 11 (sustainable cities), SDG 13 (climate action), and others, emphasising the broader societal and environmental relevance of data-driven transport modelling.

Author Contributions

H.D. and R.A.: Conceptualisation and research planning. R.A.: Methodology development and results generation. R.A. and S.L.: Paper drafting, content editing, and revisions. H.D.: Reviewing, editing, structuring, and providing supervision and mentorship to PhD students. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author(s).

Acknowledgments

Rusul Abduljabbar and Sohani Liyanage would like to express their gratitude to Swinburne University of Technology for awarding them PhD scholarships. Rusul Abduljabbar also extends her thanks to the Iraqi Government for supporting her scholarship.

Conflicts of Interest

The authors declare no conflict of interest.

References

Bartlett, Z.; Han, L.; Nguyen, T.T.; Johnson, P. A machine learning based approach for the prediction of road traffic flow on urbanised arterial roads. In Proceedings of the 2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS), Exeter, UK, 28–30 June 2018; pp. 1285–1292.
Mackenzie, J.; Roddick, J.F.; Zito, R. An evaluation of HTM and LSTM for short-term arterial traffic flow prediction. IEEE Trans. Intell. Transp. Syst. 2018, 20, 1847–1857.
Liyanage, S.P. Modelling the Impacts of On-Demand Public Transport. Doctoral Dissertation, Swinburne University of Technology, Hawthorn, Australia, 2022.
Abduljabbar, R.; Dia, H.; Tsai, P.-W. Fault tolerance and transferability of short-term traffic forecasting hybrid AI models. In Handbook on Artificial Intelligence and Transport; Edward Elgar Publishing: Cheltenham, UK, 2023; pp. 47–79.
Lippi, M.; Bertini, M.; Frasconi, P. Short-term traffic flow forecasting: An experimental comparison of time-series analysis and supervised learning. IEEE Trans. Intell. Transp. Syst. 2013, 14, 871–882.
Lund, R. Time Series Analysis and Its Applications: With R Examples; Taylor & Francis: Abingdon, UK, 2007.
Papageorgiou, M.; Diakaki, C.; Dinopoulou, V.; Kotsialos, A.; Wang, Y. Review of road traffic control strategies. Proc. IEEE 2003, 91, 2043–2067.
Vlahogianni, E.I.; Karlaftis, M.G.; Golias, J.C. Short-term traffic forecasting: Where we are and where we’re going. Transp. Res. Part C Emerg. Technol. 2014, 43, 3–19.
Jiang, B.; Fei, Y. Vehicle speed prediction by two-level data driven models in vehicular networks. IEEE Trans. Intell. Transp. Syst. 2016, 18, 1793–1801.
Song, Z.; Guo, Y.; Wu, Y.; Ma, J. Short-term traffic speed prediction under different data collection time intervals using a SARIMA-SDGM hybrid prediction model. PLoS ONE 2019, 14, e0218626.
Yang, F.; Yin, Z.; Liu, H.; Ran, B. Online recursive algorithm for short-term traffic prediction. Transp. Res. Rec. 2004, 1879, 1–8.
Kalman, R.E. A New Approach to Linear Filtering and Prediction Problems. J. Basic Eng. 1960, 82, 35–45.
Lv, Y.; Duan, Y.; Kang, W.; Li, Z.; Wang, F.-Y. Traffic flow prediction with big data: A deep learning approach. IEEE Trans. Intell. Transp. Syst. 2014, 16, 865–873.
Van Hinsbergen, C.I.; Van Lint, J.; Van Zuylen, H. Bayesian committee of neural networks to predict travel times with confidence intervals. Transp. Res. Part C Emerg. Technol. 2009, 17, 498–509.
Ahmed, M.S.; Cook, A.R. Analysis of Freeway Traffic Time-Series Data by Using Box-Jenkins Techniques; Transportation Research Board: Washington, DC, USA, 1979.
Guo, J.; Huang, W.; Williams, B.M. Adaptive Kalman filter approach for stochastic short-term traffic flow rate prediction and uncertainty quantification. Transp. Res. Part C Emerg. Technol. 2014, 43, 50–64.
Davis, G.A.; Nihan, N.L. Nonparametric regression and short-term freeway traffic forecasting. J. Transp. Eng. 1991, 117, 178–188.
Chan, K.Y.; Dillon, T.S.; Singh, J.; Chang, E. Neural-network-based models for short-term traffic flow forecasting using a hybrid exponential smoothing and Levenberg–Marquardt algorithm. IEEE Trans. Intell. Transp. Syst. 2011, 13, 644–654.
Zahid, M.; Chen, Y.; Jamal, A.; Mamadou, C.Z. Freeway short-term travel speed prediction based on data collection time-horizons: A fast forest quantile regression approach. Sustainability 2020, 12, 646.
Fusco, G.; Colombaroni, C.; Isaenko, N. Short-term speed predictions exploiting big data on large urban road networks. Transp. Res. Part C Emerg. Technol. 2016, 73, 183–201.
Karlaftis, M.G.; Vlahogianni, E.I. Memory properties and fractional integration in transportation time-series. Transp. Res. Part C Emerg. Technol. 2009, 17, 444–453.
Kashinath, S.A.; Mostafa, S.A.; Mustapha, A.; Mahdin, H.; Lim, D.; Mahmoud, M.A.; Mohammed, M.A.; Al-Rimy, B.A.S.; Fudzee, M.F.M.; Yang, T.J. Review of Data Fusion Methods for Real-Time and Multi-Sensor Traffic Flow Analysis. IEEE Access 2021, 9, 51258–51276.
Ross, P. Exponential Filtering of Traffic Data; Transportation Research Board: Washington, DC, USA, 1982.
Dougherty, M. A review of neural networks applied to transport. Transp. Res. Part C Emerg. Technol. 1995, 3, 247–260.
Sivaraman, S.; Trivedi, M.M. Real-time vehicle detection using parts at intersections. In Proceedings of the 2012 15th International IEEE Conference on Intelligent Transportation Systems, Anchorage, AK, USA, 16–19 September 2012; pp. 1519–1524.
Gu, Y.; Lu, W.; Qin, L.; Li, M.; Shao, Z. Short-term prediction of lane-level traffic speeds: A fusion deep learning model. Transp. Res. Part C Emerg. Technol. 2019, 106, 1–16.
Castro-Neto, M.; Jeong, Y.-S.; Jeong, M.-K.; Han, L.D. Online-SVR for short-term traffic flow prediction under typical and atypical traffic conditions. Expert Syst. Appl. 2009, 36, 6164–6173.
Smith, B.L.; Demetsky, M.J. Short-term traffic flow prediction models-a comparison of neural network and nonparametric regression approaches. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, San Antonio, TX, USA, 2–5 October 1994; pp. 1706–1709.
Sun, C.; Hu, X.; Moura, S.J.; Sun, F. Velocity predictors for predictive energy management in hybrid electric vehicles. IEEE Trans. Control Syst. Technol. 2014, 23, 1197–1204.
Chen, C.; Hu, J.; Meng, Q.; Zhang, Y. Short-time traffic flow prediction with ARIMA-GARCH model. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden-Baden, Germany, 5–9 June 2011; pp. 607–612.
Vanajakshi, L.; Rilett, L.R. Support vector machine technique for the short term prediction of travel time. In Proceedings of the 2007 IEEE Intelligent Vehicles Symposium, Istanbul, Turkey, 13–15 June 2007; pp. 600–605.
Wang, J.; Shi, Q. Short-term traffic speed forecasting hybrid model based on chaos–wavelet analysis-support vector machine theory. Transp. Res. Part C Emerg. Technol. 2013, 27, 219–232.
Laña, I.; Lobo, J.L.; Capecci, E.; Del Ser, J.; Kasabov, N. Adaptive long-term traffic state estimation with evolving spiking neural networks. Transp. Res. Part C Emerg. Technol. 2019, 101, 126–144.
Li, L.; Qin, L.; Qu, X.; Zhang, J.; Wang, Y.; Ran, B. Day-ahead traffic flow forecasting based on a deep belief network optimized by the multi-objective particle swarm algorithm. Knowl.-Based Syst. 2019, 172, 1–14.
Kuang, X.; Xu, L.; Huang, Y.; Liu, F. Real-time forecasting for short-term traffic flow based on general regression neural network. In Proceedings of the 2010 8th World Congress on Intelligent Control and Automation, Jinan, China, 7–9 July 2010; pp. 2776–2780.
Lefèvre, S.; Sun, C.; Bajcsy, R.; Laugier, C. Comparison of parametric and non-parametric approaches for vehicle speed prediction. In Proceedings of the 2014 American Control Conference, Portland, OR, USA, 4–6 June 2014; pp. 3494–3499.
Ma, X.; Dai, Z.; He, Z.; Ma, J.; Wang, Y.; Wang, Y. Learning traffic as images: A deep convolutional neural network for large-scale transportation network speed prediction. Sensors 2017, 17, 818.
Manibardo, E.L.; Laña, I.; Del Ser, J. Deep learning for road traffic forecasting: Does it make a difference? IEEE Trans. Intell. Transp. Syst. 2021, 23, 6164–6188.
Gers, F.A.; Schmidhuber, J.; Cummins, F. Learning to forget: Continual prediction with LSTM. Neural Comput. 2000, 12, 2451–2471.
Guo, J.; Luo, Y.; Li, K. Adaptive neural-network sliding mode cascade architecture of longitudinal tracking control for unmanned vehicles. Nonlinear Dyn. 2017, 87, 2497–2510.
Yeon, K.; Min, K.; Shin, J.; Sunwoo, M.; Han, M. Ego-vehicle speed prediction using a long short-term memory based recurrent neural network. Int. J. Automot. Technol. 2019, 20, 713–722.
Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780.
Sinha, A.; Chorzepa, M.G.; Yang, J.J.; Kim, S.-H.S.; Durham, S. Deep-Learning-Based Temporal Prediction for Mitigating Dynamic Inconsistency in Vehicular Live Loads on Roads and Bridges. Infrastructures 2022, 7, 150.
Li, Y.; Yu, R.; Shahabi, C.; Liu, Y. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. arXiv 2017, arXiv:1707.01926.
Cui, Z.; Ke, R.; Pu, Z.; Wang, Y. Deep bidirectional and unidirectional LSTM recurrent neural network for network-wide traffic speed prediction. arXiv 2018, arXiv:1801.02143.
Morton, J.; Wheeler, T.A.; Kochenderfer, M.J. Analysis of recurrent neural networks for probabilistic modeling of driver behavior. IEEE Trans. Intell. Transp. Syst. 2016, 18, 1289–1298.
Abduljabbar, R.L.; Dia, H.; Tsai, P.-W.; Liyanage, S. Short-Term Traffic Forecasting: An LSTM Network for Spatial-Temporal Speed Prediction. Future Transp. 2021, 1, 21–37.
Abduljabbar, R.; Dia, H. A deep learning approach for freeway vehicle speed and flow prediction. In Proceedings of the Australasian Transport Research Forum, Canberra, Australia, 30 September–2 October 2019.
Yu, H.; Wu, Z.; Wang, S.; Wang, Y.; Ma, X. Spatiotemporal recurrent convolutional networks for traffic prediction in transportation networks. Sensors 2017, 17, 1501.
Vlahogianni, E.I.; Golias, J.C.; Karlaftis, M.G. Short-term traffic forecasting: Overview of objectives and methods. Transp. Rev. 2004, 24, 533–557.
Vlahogianni, E.I.; Karlaftis, M.G.; Golias, J.C. Spatio-temporal short-term urban traffic volume forecasting using genetically optimized modular networks. Comput.-Aided Civ. Infrastruct. Eng. 2007, 22, 317–325.
Siami-Namini, S.; Tavakoli, N.; Namin, A.S. The performance of LSTM and BiLSTM in forecasting time series. In Proceedings of the 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 9–12 December 2019; pp. 3285–3292.
Google-Earth. Google Earth. Available online: https://earth.google.com/web/@0,-1.9336001,0a,22251752.77375655d,35y,0h,0t,0r/data=CgRCAggBOgMKATBKDQj___________8BEAA (accessed on 31 August 2024).
Matlab. Recurrent Neural Network (RNN). Recurrent Neural Network (RNN)—MATLAB & Simulink. Available online: https://www.mathworks.com/discovery/rnn.html (accessed on 9 February 2024).
Abduljabbar, R.; Dia, H.; Liyanage, S. Machine Learning Models for Traffic Prediction on Arterial Roads Using Traffic Features and Weather Information. Appl. Sci. 2024, 14, 11047.
Liyanage, S.; Dia, H.; Abduljabbar, R.; Tsai, P.-W. Neural network approaches for forecasting short-term on-road public transport passenger demands. In Handbook on Artificial Intelligence and Transport; Edward Elgar Publishing: Cheltenham, UK, 2023; pp. 176–220.
Liyanage, S.; Abduljabbar, R.; Dia, H.; Tsai, P.-W. AI-based neural network models for bus passenger demand forecasting using smart card data. J. Urban Manag. 2022, 11, 365–380.
Liyanage, S.; Dia, H. On-Demand Technologies for Public Transport: Insights From a Melbourne Survey. IEEE Open J. Intell. Transp. Syst. 2025, 6, 653–672.
Matlab. Long Short-Term Memory Networks. Long Short-Term Memory (LSTM) Networks—MATLAB & Simulink. Available online: https://www.mathworks.com/help/deeplearning/ug/long-short-term-memory-networks.html (accessed on 9 February 2024).
Ahmad, S.; de Oliveira, J.A.P. Determinants of urban mobility in India: Lessons for promoting sustainable and inclusive urban transportation in developing countries. Transp. Policy 2016, 50, 106–114.
United-Nations. The Sustainable Development Goals Report. Available online: https://unstats.un.org/sdgs/report/2024/The-Sustainable-Development-Goals-Report-2024.pdf (accessed on 23 April 2025).
United-Nations. Mobilizing Sustainable Transport for Development; United Nation Official Publication: New York, NY, USA, 2021; Available online: https://unstats.un.org/sdgs/report/2021/The-Sustainable-Development-Goals-Report-2021.pdf (accessed on 23 April 2025).
Nilsson, M.; Griggs, D.; Visbeck, M. Policy: Map the interactions between Sustainable Development Goals. Nature 2016, 534, 320–322.
Sachs, J.; Schmidt-Traub, G.; Kroll, C.; Durand-Delacre, D.; Teksoz, K.; SDG Index and Dashboards—Global Report 2016. Sustainable Development Solutions and Network. Available online: http://prohumana.cl/wp-content/uploads/2016/07/sdg_index_and_dashboards_compact.pdf (accessed on 23 April 2025).
Alawadi, K. Rethinking Dubai’s urbanism: Generating sustainable form-based urban design strategies for an integrated neighborhood. Cities 2017, 60, 353–366.
Boulange, C.; Gunn, L.; Giles-Corti, B.; Mavoa, S.; Pettit, C.; Badland, H. Examining associations between urban design attributes and transport mode choice for walking, cycling, public transport and private motor vehicle trips. J. Transp. Health 2017, 6, 155–166.
Crayton, T.J.; Meier, B.M. Autonomous vehicles: Developing a public health research agenda to frame the future of transportation policy. J. Transp. Health 2017, 6, 245–252.
Ramirez-Rubio, O.; Daher, C.; Fanjul, G.; Gascon, M.; Mueller, N.; Pajín, L.; Plasencia, A.; Rojas-Rueda, D.; Thondoo, M.; Nieuwenhuijsen, M.J. Urban health: An example of a “health in all policies” approach in the context of SDGs implementation. Glob. Health 2019, 15, 87.
Lee, C. Impacts of two-scale urban form and their combined effects on commute modes in US metropolitan areas. J. Transp. Geogr. 2020, 88, 102821.
Alipour, D.; Dia, H. A systematic review of the role of land use, transport, and energy-environment integration in shaping sustainable cities. Sustainability 2023, 15, 6447.
United-Nations. The 17 Goals. Department of Economics and Social Affairs. Available online: https://sdgs.un.org/goals (accessed on 25 April 2025).
Brussel, M.; Zuidgeest, M.; Pfeffer, K.; Van Maarseveen, M. Access or accessibility? A critique of the urban transport SDG indicator. ISPRS Int. J. Geo-Inf. 2019, 8, 67.

Figure 1. Detector locations on the mainstream of Eastern Freeway eastbound (EB) and westbound (WB) [53]. Note: Yellow diamonds represent eastbound detector locations, orange diamonds represent westbound detector locations. A yellow diamond with a red dot indicates the mid-eastbound detector location, while an orange diamond with a yellow dot indicates the mid-westbound detector location.

Figure 2. Eastern Freeway section and location of detectors (source: authors).

Figure 3. LSTM Architecture [59].

Figure 4. Unidirectional LSTM and Bi-directional LSTM Architecture (source: authors).

Figure 5. Model representation (source: authors).

Figure 6. Flow prediction accuracy (%) of different input variable combinations for the seven detectors (eastbound direction).

Figure 7. Flow prediction accuracy (%) of different input variable combinations for the seven detectors (westbound direction).

Figure 8. Model representations (source: authors).

Figure 9. Visual representation of the alignment between the machine learning-based traffic flow prediction framework and the United Nations Sustainable Development Goals (SDGs). Note: Solid lines indicate direct contributions to specific SDGs, while dotted lines reflect indirect/supporting relationships.

Table 1. Data used for model development.

Location	Total Data Set
Eastern Freeway, Westbound	335,265 observations
Eastern Freeway, Eastbound	504,112 observations
Total	839,377 observations

Table 2. Various flow, speed, and occupancy combinations for eastbound and westbound directions.

Combinations	Flow (veh/h)	Speed (km/h)	Occupancy (%)
C1
C2
C3
C4

Note: Orange circles represent the variable inputs included in each combination.

Table 3. Performance of different input variable combinations for flow prediction accuracies for the seven detectors (eastbound direction).

Directions	Detectors	Accuracy (%) per Combination
		C1	C2	C3	C4
Eastbound	WE1	95.22	97.82	88.31	96.63
	WE2	97.37	97.93	85.18	96.89
	WE3	96.77	94.60	86.74	97.68
	WE4	96.01	93.59	89.12	96.80
	WE5	86.19	97.08	88.27	91.78
	WE6	81.05	95.14	96.48	96.20
	WE7	97.29	96.38	93.94	97.69

Note: The green cells represent the best-performing models, while the yellow cells represent the worst-performing models.

Table 4. Performance of different input variable combinations for westbound flow prediction accuracies.

Directions	Detectors	Accuracy (%) per Combination
		C1	C2	C3	C4
Westbound	EW1	90.81	96.10	94.99	96.60
	EW2	88.19	95.90	92.03	90.07
	EW3	93.19	97.36	96.43	96.11
	EW4	92.43	83.27	74.74	93.43
	EW5	90.96	96.49	96.15	96.52
	EW6	91.11	95.17	82.49	96.58
	EW7	95.92	96.85	90.73	97.54

Note: The green cells represent the best-performing models, while the yellow cells represent the worst-performing models.

Table 5. Performance of neighbouring detectors on short-term traffic flow prediction for the eastbound direction.

Model	Eastbound Neighbouring Detector Inputs							Accuracy (%)
Model	WE1	WE2	WE3	WE4	WE5	WE6	WE7	Accuracy (%)
1								96.01
2								96.84
3								98.16
4								96.66
5								97.32
6								97.26
7								97.67
8								94.32
9								96.07
10								97.57
11								97.06
12								97.70
13								96.48
14								98.22
15								97.45
16								96.35

Note: Orange circles represent the variable inputs included in each model. The green cells represent the best-performing model.

Table 6. Performance of neighbouring detectors on short-term traffic flow prediction for the westbound direction.

Model	Westbound Neighbouring Detector Inputs							Accuracy (%)
	EW1	EW2	EW3	EW4	EW5	EW6	EW7	
1								92.43
2								93.81
3								96.23
4								96.77
5								95.80
6								95.43
7								94.99
8								94.18
9								97.12
10								97.32
11								94.82
12								94.62
13								96.32
14								96.73
15								96.11
16								96.43

Note: Orange circles represent the detector inputs included in each model. The green cells represent the best-performing model.
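The best-performing model in Tables 5 and 6, and its gain over Model 1, can be read directly from the accuracy columns. The Python sketch below does that arithmetic. It assumes Model 1 is the baseline without neighbouring detector inputs; this is suggested by its accuracies matching the C1 values for WE4 and EW4 in Tables 3 and 4, but it is not stated in the tables themselves. The names `EASTBOUND`, `WESTBOUND`, and `summarise` are illustrative.

```python
# Accuracy (%) for Models 1..16, transcribed from Table 5 (eastbound)
# and Table 6 (westbound). List index 0 corresponds to Model 1.
EASTBOUND = [96.01, 96.84, 98.16, 96.66, 97.32, 97.26, 97.67, 94.32,
             96.07, 97.57, 97.06, 97.70, 96.48, 98.22, 97.45, 96.35]
WESTBOUND = [92.43, 93.81, 96.23, 96.77, 95.80, 95.43, 94.99, 94.18,
             97.12, 97.32, 94.82, 94.62, 96.32, 96.73, 96.11, 96.43]


def summarise(accuracies):
    """Return (best_model_number, best_accuracy, gain_over_model_1).

    Assumption: Model 1 is treated as the baseline. The gain is expressed
    in percentage points and rounded to two decimals.
    """
    best_accuracy = max(accuracies)
    best_model = accuracies.index(best_accuracy) + 1  # models are 1-indexed
    gain = round(best_accuracy - accuracies[0], 2)
    return best_model, best_accuracy, gain


for direction, accuracies in (("Eastbound", EASTBOUND), ("Westbound", WESTBOUND)):
    model, accuracy, gain = summarise(accuracies)
    print(f"{direction}: best model = {model} ({accuracy:.2f}%), "
          f"gain over Model 1 = {gain:.2f} percentage points")
```

The gain is reported in percentage points rather than as a relative percentage, which keeps it directly comparable with the accuracy columns.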

Table 7. Linkage of machine learning traffic flow prediction to the UN’s Sustainable Development Goals (SDGs).

Framework Components	Directly Connected SDG Targets	Indirectly Connected SDG Targets
Methodological Approach	9.1—Develop quality, reliable, sustainable, and resilient infrastructure for supporting economic development and human well-being, with a focus on affordable and equitable access for all	4.7—Ensure learners acquire knowledge and skills to promote sustainable development
	11.2—Provide access to safe, affordable, accessible, and sustainable transport systems for all	10.7—Facilitate orderly, safe, regular and responsible migration and mobility of people
	13.2—Integrate climate change measures into national policies, strategies, and planning	17.6—Enhance access to science, technology, and innovation
Multisource Data Integration	17.18—Enhance capacity-building support to developing countries to increase the availability of high-quality, timely, and reliable data	12.8—Ensure that people everywhere have relevant information and awareness for sustainable development and lifestyles
Multisource Data Integration	11.6—Reduce the adverse per capita environmental impact of cities, including by paying special attention to air quality and municipal and other waste management	13.3—Improve education and awareness on climate change; build capacity for climate-related planning
Advanced Model Development (BiLSTM)	9.5—Enhance scientific research and upgrade technological capabilities of industrial sectors in developing countries	8.4—Improve progressively, through 2030, global resource efficiency in consumption and production
Advanced Model Development (BiLSTM)	11.3—Enhance inclusive and sustainable urbanisation and capacity for participatory, integrated and sustainable human settlement planning and management	1.5—Build resilience of people with low incomes and those in vulnerable situations, and reduce exposure to climate-related extreme events
Impact on Urban Traffic Management	13.2—Integrate climate change measures into national policies, strategies, and planning	3.9—Substantially reduce the number of deaths and illnesses from air pollution
	3.6—Halve the number of global deaths and injuries from road traffic accidents	12.2—Achieve sustainable management and efficient use of natural resources
	11.2—Provide access to safe, affordable, accessible, and sustainable transport systems for all	16.7—Ensure responsive, inclusive, participatory, and representative decision-making
Sustainable Mobility Contributions	13.2—Integrate environmental concerns into planning and operations	16.7—Ensure inclusive and participatory decision-making
	9.1—Develop quality, reliable, sustainable infrastructure	12.4—Reduce release of hazardous substances
	11.6—Minimise negative environmental impacts from urban traffic	11.a—Link urban–rural development planning

Table source: Developed by the authors based on publicly available SDG data [60,61,62,63,64,65,66,67,68,69,70,71].

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

