A Distributed Machine Learning-Based Scheme for Real-Time Highway Traffic Flow Prediction in Internet of Vehicles
Abstract
:1. Introduction
- Build a scalable distributed machine learning system that can predict highway traffic flows. The scheme divide a highway into small complex segments, where each complex segment is located between two exits to control the level of congestion and allow vehicles to exit the highway before entering the congestion zone.
- Build a distributed machine learning model to improve the system performance for abnormal traffic flows. The distributed machine learning model is trained based on different hyper-parameters’ values, where these values are determined based on the traffic flow patterns for each segment of the complex segment. So each segment has a model with its adjusted hyper-parameters’ values based on RSCV technique. Then all models participate to build the proposed distributed learning model. Stacking ensemble learning method is utilized to build the DMLRF model.
2. Related Work
3. Preliminaries
3.1. Kappa Architecture
3.2. Ensemble Learning and Hyper-Parameters Tuning
3.2.1. Ensemble Learning
- Boosting: When some data are misclassified by a model during the training process, a new model will train the misclassified data in order to improve the classification accuracy.
- Bagging: Is the process of combining different machine learning models into one model.
- Bucketing: Selecting the best model that achieved the highest accuracy evaluation metrics.
- Stacking: Is the process of reducing the variance of machine learning classifiers by taking the output of multiple classifiers/regressors as inputs of a new classifier/regressor.
- Random Forests: Increases the overall performance by averaging the prediction of multiple decision trees into an individual tree [48].
3.2.2. Hyper-Parameters Tuning
4. Research Methodology
4.1. Research Methodology
4.2. Model Design
4.2.1. Random Forest Regression
Algorithm 1 Random Forest Base-line Model |
Precondition: A training set T: = , features Z, and number of tree in forest N. 1 function Random-Forest(T, Z) 2 3 for do a bootstrap sample of T RandomizeTreeLearn 4 end for 5 return H 6 end function 7 function RandomizeTreeLearn At each node: very small subset of Z split on best feature in z 8 return the learned tree 9 end function |
Algorithm 2 DMLRF model creation |
Input: Split traffic dataset D into K subsets D = {}, such that is the data subset of Segment i. Combine segments between exits into J complex segments Output: A Distributed Machine Learning Random Forest (DMLRF) model for real-time traffic flow prediction. 1 Step 1: for i ← 1 to K: (a) determine the hyperparameters’ values using RSCV for data subset . (b) First level training: train random forest regression (RFR) model for data subset . End for 2 Step 2: for i ← 1 to J Second level training: stack the output of individual estimators (RFR) and use a meta model (DMLRF) to compute the final prediction for Complex Segment i. End for 3 return DMLRF |
4.2.2. Stacking Ensemble Learning
4.3. Research Challenges
5. Data Description and Preprocessing
5.1. Data Description
5.2. Data Preprocessing and Transformation
6. Proposed System Architecture and Components
6.1. Proposed System Architecture
6.2. Proposed System Components
- Kappa Architecture: KA is a real-time streaming component that can process the real-life data that is obtained from a cluster (segment) head node. This is a one pipeline machine learning process that consists of two main layers: The streaming and serving layers.
- Distributed machine learning model: It is the proposed model that is built based on different random forest algorithms with different hyper-parameters values. The Stacking ensemble learning approach is utilized to build our proposed model.
7. Performance Evaluation and Results
7.1. Classification Models
7.2. Performance Metrics
7.3. Results and Discussion
7.3.1. Complex Segment 1
7.3.2. Complex Segment 2
7.3.3. Complex Segment 3
Complex Segment | Algorithm | Segment | MSE | RMSE | MAE | R2 | AR2 |
---|---|---|---|---|---|---|---|
DMLRF | Segment A | 7.9 | 2.8 | 2.1 | 61% | 61% | |
LR | Segment A | 8.4 | 2.9 | 2.2 | 59% | 58% | |
LogR | Segment A | 9.9 | 3.1 | 2.3 | 15% | 14% | |
KNN | Segment A | 10.9 | 3.3 | 2.6 | 7% | 7% | |
DMLRF | Segment B | 1.3 | 1.1 | 0.9 | 75% | 75% | |
LR | Segment B | 4.3 | 2.0 | 1.7 | 18% | 17% | |
LogR | Segment B | 6.5 | 2.5 | 1.4 | 22% | 22% | |
KNN | Segment B | 1.5 | 1.2 | 1.0 | 71% | 71% | |
Complex Segment 1 | DMLRF | Segment C | 3.5 | 1.8 | 1.5 | 85% | 85% |
LR | Segment C | 10.5 | 3.2 | 2.7 | 56% | 55% | |
LogR | Segment C | 21.1 | 4.5 | 3.5 | 12% | 8% | |
KNN | Segment C | 4.9 | 2.2 | 1.7 | 79% | 79% | |
DMLRF | Segment D | 5.0 | 2.2 | 1.8 | 71% | 71% | |
LR | Segment D | 7.9 | 2.8 | 2.2 | 54% | 54% | |
LogR | Segment D | 9.2 | 3.0 | 2.2 | 46% | 13% | |
KNN | Segment D | 10.5 | 2.6 | 3.2 | 38% | 38% | |
DMLRF | Segment A | 7.0 | 2.6 | 2.0 | 52% | 52% | |
LR | Segment A | 11.4 | 3.3 | 2.4 | 22% | 23% | |
LogR | Segment A | 15.1 | 3.8 | 2.5 | 11% | 11% | |
KNN | Segment A | 10.3 | 3.2 | 2.4 | 30% | 30% | |
DMLRF | Segment B | 5.2 | 2.2 | 1.7 | 52% | 52% | |
LR | Segment B | 6.7 | 2.6 | 1.9 | 38% | 38% | |
LogR | Segment B | 7.7 | 2.7 | 2.0 | 30% | 17% | |
KNN | Segment B | 8.4 | 2.9 | 2.2 | 23% | 23% | |
Complex Segment 2 | DMLRF | Segment C | 5.6 | 2.3 | 1.8 | 69% | 68% |
LR | Segment C | 7.6 | 2.7 | 2.1 | 57% | 57% | |
LogR | Segment C | 8.4 | 2.9 | 2.1 | 52% | 16% | |
KNN | Segment C | 11.6 | 3.4 | 2.5 | 35% | 35% | |
DMLRF | Segment D | 8.6 | 2.9 | 2.3 | 76% | 76% | |
LR | Segment D | 18.4 | 4.2 | 3.2 | 49% | 49% | |
LogR | Segment D | 26.3 | 5.1 | 3.7 | 28% | 9% | |
KNN | Segment D | 33.0 | 5.7 | 4.5 | 9% | 8% | |
DMLRF | Segment A | 5.9 | 2.4 | 1.8 | 35% | 35% | |
LR | Segment A | 7.3 | 2.7 | 2.0 | 20% | 19% | |
LogR | Segment A | 7.1 | 2.6 | 1.9 | 21% | 17% | |
KNN | Segment A | 13.2 | 3.6 | 2.8 | 13% | 13% | |
DMLRF | Segment B | 6.4 | 2.5 | 1.9 | 59% | 60% | |
LR | Segment B | 11.4 | 3.3 | 2.5 | 30% | 30% | |
LogR | Segment B | 15.7 | 3.9 | 2.8 | 13% | 13% | |
KNN | Segment B | 7.7 | 2.7 | 2.1 | 50% | 50% | |
Complex Segment 3 | DMLRF | Segment C | 8.6 | 2.9 | 2.2 | 50% | 50% |
LR | Segment C | 16.4 | 4.0 | 3.1 | 5% | 5% | |
LogR | Segment C | 7.7 | 2.7 | 2.0 | 55% | 17% | |
KNN | Segment C | 19.3 | 4.4 | 3.2 | 12% | 12% | |
DMLRF | Segment D | 6.8 | 2.6 | 1.9 | 58% | 58% | |
LR | Segment D | 11.3 | 3.3 | 2.4 | 31% | 30% | |
LogR | Segment D | 14.2 | 3.7 | 2.6 | 13% | 13% | |
KNN | Segment D | 10.2 | 3.1 | 2.3 | 37% | 37% | |
DMLRF | Segment A | 9.7 | 3.1 | 2.3 | 44% | 43% | |
LR | Segment A | 13.9 | 3.7 | 2.7 | 20% | 19% | |
LogR | Segment A | 16.4 | 4.0 | 2.8 | 14% | 14% | |
KNN | Segment A | 11.9 | 3.4 | 2.6 | 31% | 31% | |
DMLRF | Segment B | 5.4 | 2.3 | 1.9 | 80% | 80% | |
LR | Segment B | 7.2 | 2.6 | 2.0 | 73% | 72% | |
LogR | Segment B | 8.1 | 2.8 | 2.1 | 69% | 20% | |
KNN | Segment B | 13.8 | 3.7 | 2.8 | 48% | 48% | |
Complex Segment 4 | DMLRF | Segment C | 10.7 | 3.2 | 2.5 | 70% | 70% |
LR | Segment C | 12.3 | 3.5 | 2.8 | 65% | 65% | |
LogR | Segment C | 12.2 | 3.5 | 2.7 | 65% | 10% | |
KNN | Segment C | 20.0 | 4.4 | 3.3 | 43% | 43% | |
DMLRF | Segment D | 6.8 | 2.6 | 2.0 | 81% | 81% | |
LR | Segment D | 9.7 | 3.1 | 2.5 | 72% | 72% | |
LogR | Segment D | 12.6 | 3.5 | 2.8 | 64% | 10% | |
KNN | Segment D | 13.7 | 3.7 | 2.8 | 61% | 61% |
7.3.4. Complex Segment 4
8. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Shi, Q.; Abdel-Aty, M. Big data applications in real-time traffic operation and safety monitoring and improvement on urban expressways. Transp. Res. Part C Emerg. Technol. 2015, 58, 380–394. [Google Scholar]
- Abdullah, T.; Nyalugwe, S. A Data Mining Approach for Analysing Road Traffic Accidents. In Proceedings of the 2019 2nd International Conference on Computer Applications & Information Security (ICCAIS), Riyadh, Saudi Arabia, 1–3 May 2019; pp. 1–6. [Google Scholar]
- Micko, K.; Papcun, P.; Zolotova, I. Review of IoT sensor systems used for monitoring the road infrastructure. Sensors 2023, 23, 4469. [Google Scholar] [CrossRef] [PubMed]
- Ozbayoglu, M.; Kucukayan, G.; Dogdu, E. A real-time autonomous highway accident detection model based on big data processing and computational intelligence. In Proceedings of the 2016 IEEE International Conference on Big Data (Big Data), Washington, DC, USA, 5–8 December 2016; pp. 1807–1813. [Google Scholar]
- Dong, C.; Shao, C.; Li, J.; Xiong, Z. An Improved Deep Learning Model for Traffic Crash Prediction. J. Adv. Transp. 2018, 2018, 1–13. [Google Scholar] [CrossRef]
- Ifthikar, A.; Hettiarachchi, S. Analysis of historical accident data to determine accident prone locations and cause of accidents. In Proceedings of the 2018 8th International Conference on Intelligent Systems, Modelling and Simulation (ISMS), Kuala Lumpur, Malaysia, 8–10 May 2018; pp. 11–15. [Google Scholar]
- Kumeda, B.; Zhang, F.; Zhou, F.; Hussain, S.; Almasri, A.; Assefa, M. Classification of Road Traffic Accident Data Using Machine Learning Algorithms. In Proceedings of the 2019 IEEE 11th International Conference on Communication Software and Networks (ICCSN), Chongqing, China, 12–15 June 2019; pp. 682–687. [Google Scholar]
- Huang, W.; Song, G.; Hong, H.; Xie, K. Deep Architecture for Traffic Flow Prediction: Deep Belief Networks With Multitask Learning. IEEE Trans. Intell. Transp. Syst. 2014, 15, 2191–2201. [Google Scholar] [CrossRef]
- Chen, Y.; Chen, H.; Ye, P.; Lv, Y.; Wang, F.Y. Acting as a Decision Maker: Traffic-Condition-Aware Ensemble Learning for Traffic Flow Prediction. IEEE Trans. Intell. Transp. Syst. 2022, 23, 3190–3200. [Google Scholar] [CrossRef]
- Jiang, Y.; Fan, J.; Liu, Y.; Zhang, X. Deep Graph Gaussian Processes for Short-Term Traffic Flow Forecasting From Spatiotemporal Data. IEEE Trans. Intell. Transp. Syst. 2022, 23, 20177–20186. [Google Scholar] [CrossRef]
- Oladimeji, D.; Gupta, K.; Kose, N.A.; Gundogan, K.; Ge, L.; Liang, F. Smart transportation: An overview of technologies and applications. Sensors 2023, 23, 3880. [Google Scholar] [CrossRef]
- Al Najada, H.; Mahgoub, I. Anticipation and alert system of congestion and accidents in VANET using Big Data analysis for Intelligent Transportation Systems. In Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence (SSCI), Athens, Greece, 6–9 December 2016; pp. 1–8. [Google Scholar]
- Al Najada, H.; Mahgoub, I.; Mohammed, I. Highway cluster density and average speed prediction in vehicular ad hoc networks (VANETs). In Proceedings of the 2018 IEEE Symposium Series on Computational Intelligence (SSCI), Bangalore, India, 18–21 November 2018; pp. 96–103. [Google Scholar]
- Bello-Orgaz, G.; Jung, J.J.; Camacho, D. Social big data: Recent achievements and new challenges. Inf. Fusion 2016, 28, 45–59. [Google Scholar] [CrossRef] [PubMed]
- Chen, M.; Mao, S.; Liu, Y. Big data: A survey. Mob. Netw. Appl. 2014, 19, 171–209. [Google Scholar]
- An, S.H.; Lee, B.H.; Shin, D.R. A survey of intelligent transportation systems. In Proceedings of the 2011 Third International Conference on Computational Intelligence, Communication Systems and Networks, Bali, Indonesia, 26–28 July 2011; pp. 332–337. [Google Scholar]
- El Faouzi, N.E.; Leung, H.; Kurian, A. Data fusion in intelligent transportation systems: Progress and challenges–A survey. Inf. Fusion 2011, 12, 4–10. [Google Scholar]
- Zhang, J.; Wang, F.Y.; Wang, K.; Lin, W.H.; Xu, X.; Chen, C. Data-driven intelligent transportation systems: A survey. IEEE Trans. Intell. Transp. Syst. 2011, 12, 1624–1639. [Google Scholar]
- Lin, X.; Wang, P.; Wu, B. Log analysis in cloud computing environment with Hadoop and Spark. In Proceedings of the 2013 5th IEEE International Conference on Broadband Network & Multimedia Technology, Guilin, China, 17–19 November 2013; pp. 273–276. [Google Scholar]
- Zaharia, M.; Chowdhury, M.; Das, T.; Dave, A.; Ma, J.; Mccauley, M.; Franklin, M.; Shenker, S.; Stoica, I. Fast and interactive analytics over Hadoop data with Spark. Usenix Login 2012, 37, 45–51. [Google Scholar]
- Shahriari, S.; Ghasri, M.; Sisson, S.; Rashidi, T. Ensemble of ARIMA: Combining parametric and bootstrapping technique for traffic flow prediction. Transp. A Transp. Sci. 2020, 16, 1552–1573. [Google Scholar]
- Sun, P.; Aljeri, N.; Boukerche, A. Machine learning-based models for real-time traffic flow prediction in vehicular networks. IEEE Netw. 2020, 34, 178–185. [Google Scholar]
- Hou, Q.; Leng, J.; Ma, G.; Liu, W.; Cheng, Y. An adaptive hybrid model for short-term urban traffic flow prediction. Phys. A Stat. Mech. Its Appl. 2019, 527, 121065. [Google Scholar]
- Zhao, C.; Chen, C.; Cai, Z.; Shi, M.; Du, X.; Guizani, M. Classification of small UAVs based on auxiliary classifier wasserstein GANs. In Proceedings of the 2018 IEEE Global Communications Conference (GLOBECOM), Abu Dhabi, United Arab, 9–13 December 2018; pp. 206–212. [Google Scholar]
- Ahn, J.; Ko, E.; Kim, E.Y. Highway traffic flow prediction using support vector regression and Bayesian classifier. In Proceedings of the 2016 International Conference on Big Data and Smart Computing (BigComp), Hong Kong, China, 18–20 January 2016; pp. 239–244. [Google Scholar]
- Lv, Y.; Duan, Y.; Kang, W.; Li, Z.; Wang, F.Y. Traffic flow prediction with big data: A deep learning approach. IEEE Trans. Intell. Transp. Syst. 2014, 16, 865–873. [Google Scholar]
- Xu, J.; Deng, D.; Demiryurek, U.; Shahabi, C.; Van der Schaar, M. Mining the situation: Spatiotemporal traffic prediction with big data. IEEE J. Sel. Top. Signal Process. 2015, 9, 702–715. [Google Scholar]
- Jiber, M.; Lamouik, I.; Ali, Y.; Sabri, M.A. Traffic flow prediction using neural network. In Proceedings of the 2018 International Conference on Intelligent Systems and Computer Vision (ISCV), Fez, Morocco, 2–4 April 2018; pp. 1–4. [Google Scholar]
- Lin, J. The lambda and the kappa. IEEE Internet Comput. 2017, 21, 60–66. [Google Scholar]
- Bisong, E. Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners; Apress: New York, NY, USA, 2019. [Google Scholar]
- Florida Department of Transportation. S.S.F.D. of Transportation District 4 (FDOT-D4) US Department of Energy, Florida Department of Transportation—District 4 (FDOT-D4). SSFDOT. 2015. Available online: https://www.d4fdot.com/ (accessed on 23 January 2025).
- Tan, K.; Bremner, D.; Le Kernec, J.; Zhang, L.; Imran, M. Machine learning in vehicular networking: An overview. Digit. Commun. Netw. 2022, 8, 18–24. [Google Scholar]
- Olugbade, S.; Ojo, S.; Imoize, A.L.; Isabona, J.; Alaba, M.O. A review of artificial intelligence and machine learning for incident detectors in road transport systems. Math. Comput. Appl. 2022, 27, 77. [Google Scholar] [CrossRef]
- Abdullah, S.M.; Periyasamy, M.; Kamaludeen, N.A.; Towfek, S.; Marappan, R.; Kidambi Raju, S.; Alharbi, A.H.; Khafaga, D.S. Optimizing traffic flow in smart cities: Soft GRU-based recurrent neural networks for enhanced congestion prediction using deep learning. Sustainability 2023, 15, 5949. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhou, Y.; Lu, H.; Fujita, H. Traffic network flow prediction using parallel training for deep convolutional neural networks on spark cloud. IEEE Trans. Ind. Inform. 2020, 16, 7369–7380. [Google Scholar] [CrossRef]
- Lee, M.C.; Lin, J.C. DALC: Distributed automatic LSTM customization for fine-grained traffic speed prediction. In Proceedings of the International Conference on Advanced Information Networking and Applications; Springer: Berlin/Heidelberg, Germany, 2020; pp. 164–175. [Google Scholar]
- Boukerche, A.; Wang, J. A performance modeling and analysis of a novel vehicular traffic flow prediction system using a hybrid machine learning-based model. Ad. Hoc. Netw. 2020, 106, 102224. [Google Scholar] [CrossRef]
- Zheng, G.; Chai, W.K.; Duanmu, J.L.; Katos, V. Hybrid deep learning models for traffic prediction in large-scale road networks. Inf. Fusion 2023, 92, 93–114. [Google Scholar] [CrossRef]
- Alsubai, S.; Dutta, A.K.; Sait, A.R.W. Hybrid deep learning-based traffic congestion control in IoT environment using enhanced arithmetic optimization technique. Alex. Eng. J. 2024, 105, 331–340. [Google Scholar] [CrossRef]
- Ning, Y.; Samonte, M.J.C.; Li, Y. A Review of Research on Traffic Flow Prediction Methods Based on Deep Learning. In Proceedings of the 2024 International Conference on Digital Society and Artificial Intelligence, Qingdao, China, 24–26 May 2024; pp. 166–170. [Google Scholar]
- Fouladgar, M.; Parchami, M.; Elmasri, R.; Ghaderi, A. Scalable deep traffic flow neural networks for urban traffic congestion prediction. In Proceedings of the 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, USA, 14–19 May 2017; pp. 2251–2258. [Google Scholar]
- Wang, Z.; Chu, R.; Zhang, M.; Wang, X.; Luan, S. An improved hybrid highway traffic flow prediction model based on machine learning. Sustainability 2020, 12, 8298. [Google Scholar] [CrossRef]
- Warren, J.; Marz, N. Big Data: Principles and Best Practices of Scalable Realtime Data Systems; Simon and Schuster: New York, NY, USA, 2015. [Google Scholar]
- Zaharia, M.; Xin, R.S.; Wendell, P.; Das, T.; Armbrust, M.; Dave, A.; Meng, X.; Rosen, J.; Venkataraman, S.; Franklin, M.J.; et al. Apache spark: A unified engine for big data processing. Commun. ACM 2016, 59, 56–65. [Google Scholar] [CrossRef]
- Ji, G.; Ling, X. Ensemble learning based distributed clustering. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining; Springer: Berlin/Heidelberg, Germany, 2007; pp. 312–321. [Google Scholar]
- Dong, X.; Yu, Z.; Cao, W.; Shi, Y.; Ma, Q. A survey on ensemble learning. Front. Comput. Sci. 2020, 14, 241–258. [Google Scholar] [CrossRef]
- Fernández-Delgado, M.; Cernadas, E.; Barro, S.; Amorim, D. Do we need hundreds of classifiers to solve real world classification problems? J. Mach. Learn. Res. 2014, 15, 3133–3181. [Google Scholar]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Muntasir Nishat, M.; Faisal, F.; Jahan Ratul, I.; Al-Monsur, A.; Ar-Rafi, A.M.; Nasrullah, S.M.; Reza, M.T.; Khan, M.R.H. A Comprehensive Investigation of the Performances of Different Machine Learning Classifiers with SMOTE-ENN Oversampling Technique and Hyperparameter Optimization for Imbalanced Heart Failure Dataset. Sci. Program. 2022, 2022, 3649406. [Google Scholar]
- Asif, M.; Nishat, M.M.; Faisal, F.; Dip, R.R.; Udoy, M.H.; Shikder, M.; Ahsan, R. Performance Evaluation and Comparative Analysis of Different Machine Learning Algorithms in Predicting Cardiovascular Disease. Eng. Lett. 2021, 29, 1–11. [Google Scholar]
- Bergstra, J.; Bengio, Y. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 2012, 13, 281–305. [Google Scholar]
- Lana, I.; Del Ser, J.; Velez, M.; Vlahogianni, E.I. Road traffic forecasting: Recent advances and new challenges. IEEE Intell. Transp. Syst. Mag. 2018, 10, 93–109. [Google Scholar]
- Cheng, Z.; Pang, M.S.; Pavlou, P.A. Mitigating traffic congestion: The role of intelligent transportation systems. Inf. Syst. Res. 2020, 31, 653–674. [Google Scholar]
- Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Routledge: London, UK, 2017. [Google Scholar]
- Combine Predictors Using Stacking. Available online: https://scikit-learn.org/stable/auto_examples/ensemble/plot_stack_predictors.html (accessed on 23 January 2025).
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
- Mqadi, N.M.; Naicker, N.; Adeliyi, T. Solving Misclassification of the Credit Card Imbalance Problem Using Near Miss. Math. Probl. Eng. 2021, 2021, 7194728. [Google Scholar]
- Cano, A.; Krawczyk, B. ROSE: Robust Online Self-Adjusting Ensemble for Continual Learning on Imbalanced Drifting Data Streams. Mach. Learn. 2022, 111, 2561–2599. [Google Scholar]
- Alnami, H.M.; Mahgoub, I.; Al-Najada, H. Highway Accident Severity Prediction for Optimal Resource Allocation of Emergency Vehicles and Personnel. In Proceedings of the 2021 IEEE 11th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, 27–30 January 2021; pp. 1231–1238. [Google Scholar]
- Alnami, H.M.; Mahgoub, I.; Al Najada, H. Segment Based Highway Traffic Flow Prediction in VANET Using Big Data Analysis. In Proceedings of the 2021 IEEE Symposium Series on Computational Intelligence (SSCI), Orlando, FL, USA, 5–7 December 2021; pp. 1–8. [Google Scholar]
- Kursa, M.B.; Rudnicki, W.R. Feature selection with the Boruta package. J. Stat. Softw. 2010, 36, 1–13. [Google Scholar]
# | Parameters | Description |
---|---|---|
1 | Detector ID | The ID number of the detector |
2 | Total Volume | The total number of vehicles detected within a 15-min interval that was recorded by the available detector. |
3 | Average Speed | The average speed of vehicles that are calculated based on a 15-min interval for all the lanes. |
4 | Archive Lane ID | ID of each lane on each bound of the highway. Each lane on each segment has a different lane id. |
5 | Average Occupancy | Average number of vehicles that occupy the segment. |
6 | Detector Lat and log | Latitude and Longitude of the detector. |
7 | Traffic Timestamp | The time-stamp of the traffic record (Year-Month-Day, Hour:Minute:Second). |
8 | Average Latency | The average latency that recorded by a detector. |
Hyper-Parameters | Description |
---|---|
N estimators | Number of trees in the forest |
Max features | Max number of features considered for splitting a node |
Max depth | Max number of levels in each decision tree |
Min samples split | Min number of data points placed in a node before the node is split |
Min samples leaf | Min number of data points allowed in a leaf node |
Bootstrap | Method for sampling data points (with or without replacement) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alnami, H.; Mahgoub, I.; Al-Najada, H.; Alalwany, E. A Distributed Machine Learning-Based Scheme for Real-Time Highway Traffic Flow Prediction in Internet of Vehicles. Future Internet 2025, 17, 131. https://doi.org/10.3390/fi17030131
Alnami H, Mahgoub I, Al-Najada H, Alalwany E. A Distributed Machine Learning-Based Scheme for Real-Time Highway Traffic Flow Prediction in Internet of Vehicles. Future Internet. 2025; 17(3):131. https://doi.org/10.3390/fi17030131
Chicago/Turabian StyleAlnami, Hani, Imad Mahgoub, Hamzah Al-Najada, and Easa Alalwany. 2025. "A Distributed Machine Learning-Based Scheme for Real-Time Highway Traffic Flow Prediction in Internet of Vehicles" Future Internet 17, no. 3: 131. https://doi.org/10.3390/fi17030131
APA StyleAlnami, H., Mahgoub, I., Al-Najada, H., & Alalwany, E. (2025). A Distributed Machine Learning-Based Scheme for Real-Time Highway Traffic Flow Prediction in Internet of Vehicles. Future Internet, 17(3), 131. https://doi.org/10.3390/fi17030131