Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (5)

Search Parameters:
Keywords = YOLOv7–Deep SORT and LSTM

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 6129 KB  
Article
A Novel Machine Vision-Based Collision Risk Warning Method for Unsignalized Intersections on Arterial Roads
by Zhongbin Luo, Yanqiu Bi, Qing Ye, Yong Li and Shaofei Wang
Electronics 2025, 14(6), 1098; https://doi.org/10.3390/electronics14061098 - 11 Mar 2025
Cited by 1 | Viewed by 1070
Abstract
To address the critical need for collision risk warning at unsignalized intersections, this study proposes an advanced predictive system combining YOLOv8 for object detection, Deep SORT for tracking, and Bi-LSTM networks for trajectory prediction. To adapt YOLOv8 for complex intersection scenarios, several architectural [...] Read more.
To address the critical need for collision risk warning at unsignalized intersections, this study proposes an advanced predictive system combining YOLOv8 for object detection, Deep SORT for tracking, and Bi-LSTM networks for trajectory prediction. To adapt YOLOv8 for complex intersection scenarios, several architectural enhancements were incorporated. The RepLayer module replaced the original C2f module in the backbone, integrating large-kernel depthwise separable convolution to better capture contextual information in cluttered environments. The GIoU loss function was introduced to improve bounding box regression accuracy, mitigating the issues related to missed or incorrect detections due to occlusion and overlapping objects. Furthermore, a Global Attention Mechanism (GAM) was implemented in the neck network to better learn both location and semantic information, while the ReContext gradient composition feature pyramid replaced the traditional FPN, enabling more effective multi-scale object detection. Additionally, the CSPNet structure in the neck was substituted with Res-CSP, enhancing feature fusion flexibility and improving detection performance in complex traffic conditions. For tracking, the Deep SORT algorithm was optimized with enhanced appearance feature extraction, reducing the identity switches caused by occlusions and ensuring the stable tracking of vehicles, pedestrians, and non-motorized vehicles. The Bi-LSTM model was employed for trajectory prediction, capturing long-range dependencies to provide accurate forecasting of future positions. The collision risk was quantified using the predictive collision risk area (PCRA) method, categorizing risks into three levels (danger, warning, and caution) based on the predicted overlaps in trajectories. In the experimental setup, the dataset used for training the model consisted of 30,000 images annotated with bounding boxes around vehicles, pedestrians, and non-motorized vehicles. Data augmentation techniques such as Mosaic, Random_perspective, Mixup, HSV adjustments, Flipud, and Fliplr were applied to enrich the dataset and improve model robustness. In real-world testing, the system was deployed as part of the G310 highway safety project, where it achieved a mean Average Precision (mAP) of over 90% for object detection. Over a one-month period, 120 warning events involving vehicles, pedestrians, and non-motorized vehicles were recorded. Manual verification of the warnings indicated a prediction accuracy of 97%, demonstrating the system’s reliability in identifying potential collisions and issuing timely warnings. This approach represents a significant advancement in enhancing safety at unsignalized intersections in urban traffic environments. Full article
(This article belongs to the Special Issue Computer Vision and Image Processing in Machine Learning)
Show Figures

Figure 1

19 pages, 3968 KB  
Article
A Novel Three-Stage Collision-Risk Pre-Warning Model for Construction Vehicles and Workers
by Wenxia Gan, Kedi Gu, Jing Geng, Canzhi Qiu, Ruqin Yang, Huini Wang and Xiaodi Hu
Buildings 2024, 14(8), 2324; https://doi.org/10.3390/buildings14082324 - 27 Jul 2024
Cited by 3 | Viewed by 1622
Abstract
Collision accidents involving construction vehicles and workers frequently occur at construction sites. Computer vision (CV) technology presents an efficient solution for collision-risk pre-warning. However, CV-based methods are still relatively rare and need an enhancement of their performance. Therefore, a novel three-stage collision-risk pre-warning [...] Read more.
Collision accidents involving construction vehicles and workers frequently occur at construction sites. Computer vision (CV) technology presents an efficient solution for collision-risk pre-warning. However, CV-based methods are still relatively rare and need an enhancement of their performance. Therefore, a novel three-stage collision-risk pre-warning model for construction vehicles and workers is proposed in this paper. This model consists of an object-sensing module (OSM), a trajectory prediction module (TPM), and a collision-risk assessment module (CRAM). In the OSM, the YOLOv5 algorithm is applied to identify and locate construction vehicles and workers; meanwhile, the DeepSORT algorithm is applied to the real-time tracking of the construction vehicles and workers. As a result, the historical trajectories of vehicles and workers are sensed. The original coordinates of the data are transformed to common real-world coordinate systems for convenient subsequent data acquisition, comparison, and analysis. Subsequently, the data are provided to a second stage (TPM). In the TPM, the optimized transformer algorithm is used for a real-time trajectory prediction of the construction vehicles and workers. In this paper, we enhance the reliability of the general object detection and trajectory prediction methods in the construction environments. With the assistance afforded by the optimization of the model’s hyperparameters, the prediction horizon is extended, and this gives the workers more time to take preventive measures. Finally, the prediction module indicates the possible trajectories of the vehicles and workers in the future and provides these trajectories to the CRAM. In the CRAM, the worker’s collision-risk level is assessed by a multi-factor-based collision-risk assessment rule, which is innovatively proposed in the present work. The multi-factor-based assessment rule is quantitatively involved in three critical risk factors, i.e., velocity, hazardous zones, and proximity. Experiments are performed within two different construction site scenarios to evaluate the effectiveness of the collision-risk pre-warning model. The research results show that the proposed collision pre-warning model can accurately predict the collision-risk level of workers at construction sites, with good tracking and predicting effect and an efficient collision-risk pre-warning strategy. Compared to the classical models, such as social-GAN and social-LSTM, the transformer-based trajectory prediction model demonstrates a superior accuracy, with an average displacement error of 0.53 m on the construction sites. Additionally, the optimized transformer model is capable of predicting six additional time steps, which equates to approximately 1.8 s. The collision pre-warning model proposed in this paper can help improve the safety of construction vehicles and workers. Full article
(This article belongs to the Section Construction Management, and Computers & Digitization)
Show Figures

Figure 1

20 pages, 3211 KB  
Article
Multi-Adjacent Camera-Based Dangerous Driving Trajectory Recognition for Ultra-Long Highways
by Liguo Zhao, Zhipeng Fu, Jingwen Yang, Ziqiao Zhao and Ping Wang
Appl. Sci. 2024, 14(11), 4593; https://doi.org/10.3390/app14114593 - 27 May 2024
Cited by 1 | Viewed by 1359
Abstract
Fast detection of the trajectory is the key point to improve the further emergency proposal. Especially for ultra-long highway, prompt detection is labor-intensive. However, automatic detection relies on the accuracy and speed of vehicle detection, and tracking. In multi-camera surveillance system for ultra-long [...] Read more.
Fast detection of the trajectory is the key point to improve the further emergency proposal. Especially for ultra-long highway, prompt detection is labor-intensive. However, automatic detection relies on the accuracy and speed of vehicle detection, and tracking. In multi-camera surveillance system for ultra-long highways, it is often difficult to capture the same vehicle without intervals, which makes vehicle re-recognition crucial as well. In this paper, we present a framework that includes vehicle detection and tracking using improved DeepSORT, vehicle re-identification, feature extraction based on trajectory rules, and behavior recognition based on trajectory analysis. In particular, we design a network architecture based on DeepSORT with YOLOv5s to address the need for real-time vehicle detection and tracking in real-world traffic management. We further design an attribute recognition module to generate matching individuality attributes for vehicles to improve vehicle re-identification performance under multiple neighboring cameras. Besides, the use of bidirectional LSTM improves the accuracy of trajectory prediction, demonstrating its robustness to noise and fluctuations. The proposed model has a high advantage from the cumulative matching characteristic (CMC) curve shown and even improves above 15.38% compared to other state-of-the-art methods. The model developed on the local highway vehicle dataset is comprehensively evaluated, including abnormal trajectory recognition, lane change detection, and speed anomaly recognition. Experimental results demonstrate the effectiveness of the proposed method in accurately identifying various vehicle behaviors, including lane changes, stops, and even other dangerous driving behavior. Full article
(This article belongs to the Special Issue Traffic Emergency: Forecasting, Control and Planning)
Show Figures

Figure 1

23 pages, 11765 KB  
Article
Traffic Flow Optimization at Toll Plaza Using Proactive Deep Learning Strategies
by Habib Talha Hashmi, Sameer Ud-Din, Muhammad Asif Khan, Jamal Ahmed Khan, Muhammad Arshad and Muhammad Usman Hassan
Infrastructures 2024, 9(5), 87; https://doi.org/10.3390/infrastructures9050087 - 15 May 2024
Cited by 4 | Viewed by 3110
Abstract
Global urbanization and increasing traffic volume have intensified traffic congestion throughout transportation infrastructure, particularly at toll plazas, highlighting the critical need to implement proactive transportation infrastructure solutions. Traditional toll plaza management approaches, often relying on manual interventions, suffer from inefficiencies that fail to [...] Read more.
Global urbanization and increasing traffic volume have intensified traffic congestion throughout transportation infrastructure, particularly at toll plazas, highlighting the critical need to implement proactive transportation infrastructure solutions. Traditional toll plaza management approaches, often relying on manual interventions, suffer from inefficiencies that fail to adapt to dynamic traffic flow and are unable to produce preemptive control strategies, resulting in prolonged queues, extended travel times, and adverse environmental effects. This study proposes a proactive traffic control strategy using advanced technologies to combat toll plaza congestion and optimize traffic management. The approach involves deep learning convolutional neural network models (YOLOv7–Deep SORT) for vehicle counting and an extended short-term memory model for short-term arrival rate prediction. When projected arrival rates exceed a threshold, the strategy proactively activates variable speed limits (VSLs) and ramp metering (RM) strategies during peak hours. The novelty of this study lies in its predictive and adaptive capabilities, ensuring efficient traffic flow management. Validated through a case study at Ravi Toll Plaza Lahore using PTV VISSIMv7, the proposed method reduces queue length by 57% and vehicle delays by 47% while cutting fuel consumption and pollutant emissions by 28.4% and 34%, respectively. Additionally, by identifying the limitations of conventional approaches, this study presents a novel framework alongside the proposed strategy to bridge the gap between theory and practice, making it easier for toll plaza operators and transportation authorities to adopt and benefit from advanced traffic management techniques. Ultimately, this study underscores the importance of integrated and proactive traffic control strategies in enhancing traffic management, minimizing congestion, and fostering a more sustainable transportation system. Full article
Show Figures

Figure 1

15 pages, 3380 KB  
Article
A General Framework for Reconstructing Full-Sample Continuous Vehicle Trajectories Using Roadside Sensing Data
by Guimin Su, Zimu Zeng, Andi Song, Cong Zhao, Feng Shen, Liangxiao Yuan and Xinghua Li
Appl. Sci. 2023, 13(5), 3141; https://doi.org/10.3390/app13053141 - 28 Feb 2023
Cited by 2 | Viewed by 2722
Abstract
Vehicle trajectory data play an important role in autonomous driving and intelligent traffic control. With the widespread deployment of roadside sensors, such as cameras and millimeter-wave radar, it is possible to obtain full-sample vehicle trajectories for a whole area. This paper proposes a [...] Read more.
Vehicle trajectory data play an important role in autonomous driving and intelligent traffic control. With the widespread deployment of roadside sensors, such as cameras and millimeter-wave radar, it is possible to obtain full-sample vehicle trajectories for a whole area. This paper proposes a general framework for reconstructing continuous vehicle trajectories using roadside visual sensing data. The framework includes three modules: single-region vehicle trajectory extraction, multi-camera cross-region vehicle trajectory splicing, and missing trajectory completion. Firstly, the vehicle trajectory is extracted from each video by YOLOv5 and DeepSORT multi-target tracking algorithms. The vehicle trajectories in different videos are then spliced by the vehicle re-identification algorithm fused with lane features. Finally, the bidirectional long-short-time memory model (LSTM) based on graph attention is applied to complete the missing trajectory to obtain the continuous vehicle trajectory. Measured data from Donghai Bridge in Shanghai are applied to verify the feasibility and effectiveness of the framework. The results indicate that the vehicle re-identification algorithm with the lane features outperforms the vehicle re-identification algorithm that only considers the visual feature by 1.5% in mAP (mean average precision). Additionally, the bidirectional LSTM based on graph attention performs better than the model that does not consider the interaction between vehicles. The experiment demonstrates that our framework can effectively reconstruct the continuous vehicle trajectories on the expressway. Full article
Show Figures

Figure 1

Back to TopTop