Connected Perception Between Lightweight Robot and External Camera for Blind-Spot Awareness
Abstract
1. Introduction
- Designing a targeted perception framework specifically for scenarios where onboard LiDAR is physically obstructed, demonstrating how infrastructure-based vision acts as a crucial auxiliary system to resolve blind-spot challenges in autonomous navigation.
- Proposing an integrated multi-module system that orchestrates YOLOv11 for high-accuracy object detection, Multiple Linear Regression (MLR) for coordinate alignment, and ANN-based position prediction to provide the vehicle with reliable environmental feedback.
- Implementing a direct communication pipeline that transmits external perception coordinates straight to the vehicle’s local control architecture. This includes establishing an onboard-only baseline experiment to empirically demonstrate the absolute failure of localized sensors in occluded environments, thereby proving that the integrated infrastructure system successfully enables safety-critical maneuvers, such as pre-emptive halting, to prevent otherwise unavoidable collisions.
2. Methodology
2.1. Preliminary Testing and Baseline Validation
2.1.1. Obstacle Velocity Characterization
2.1.2. Onboard Sensing Evaluation in Line-of-Sight Scenarios
2.1.3. Onboard Baseline Testing in the Blind-Spot Geometry
- Target Vehicle (RC Car) Priority: Under the hypothesis that the RC obstacle reaches the corner first, the target vehicle passes through the intersection before a collision occurs due to its higher operational velocity. However, in a real-world deployment context, this scenario still introduces an unacceptable safety risk due to the critically low spatial separation margins.
- Autonomous Rover Priority: Under the hypothesis that the autonomous rover arrives at the corner first, a catastrophic side-impact collision occurs. Because the physical architecture completely occludes the upcoming obstacle from the onboard LiDAR’s line of sight, the local perception stack fails to register the threat entirely.
2.2. Blind-Spot Scenario and Experimental Setup
2.3. Lightweight Robot
2.4. External Camera-Based Perception System
2.5. Camera Calibration Procedure
- Physical Grid Setup: A camera configured at a resolution of pixels is positioned to monitor a predefined experimental area. This physical space is marked with a localized physical coordinate grid, as illustrated in Figure 8.
- Virtual Grid Mapping: Four discrete reference points defining a bounding rectangle are selected within the camera’s initial pixel coordinate space. This designated region of interest is subsequently segmented into a corresponding virtual grid overlay, as depicted in Figure 9.
- Perspective Transformation and Coordinate Alignment: The bounded pixel region undergoes a perspective transformation to generate a bird’s-eye-view projection (the specific mathematical formulations for this transformation are detailed in Section 2.6). Through this transformation, image pixel coordinates are mapped directly to physical grid coordinates; for example, the image pixel coordinate is mapped onto the localized grid origin at the bottom-left corner, as shown in Figure 10.
- Target Tracking Integration: Following the spatial transformation layer, the target tracking framework utilizes a YOLOv11 pipeline for continuous object detection (detailed in Section 2.7) paired with the DeepSORT algorithm for multi-object tracking and ID retention (detailed in Section 2.8) to output real-time state vectors relative to the calibrated grid.
2.6. Perspective Transformation
Homography Transformation
2.7. YOLOv11-Based Object Detection
2.8. DeepSORT-Based Object Tracking
2.9. Connected Perception-Based Decision Making
2.9.1. Position Prediction Models
2.9.2. Collision Risk Analysis
2.9.3. Collision-Aware Motion Control
3. Experiments and Results
3.1. Results of Preliminary Testing and Baseline Validation
3.1.1. Obstacle Velocity Characterization
3.1.2. Onboard Baseline Testing in the Blind-Spot Geometry
3.2. Main Experimental Setup
3.3. Detection Results
3.4. Localization Results
3.4.1. External Camera Localization
3.4.2. Lightweight Robot Localization
3.5. Position Prediction Results
3.6. Collision Avoidance Results
4. Conclusions
Author Contributions
Funding
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| LiDAR | Light Detection and Ranging |
| IMU | Inertial Measurement Unit |
| V2I | Vehicle-to-Infrastructure |
| BEV | Bird’s Eye View |
| MLR | Multiple Linear Regression |
References
- Malik, S.; Khan, M.A.; El-Sayed, H. Collaborative autonomous driving—A survey of solution approaches and future challenges. Sensors 2021, 21, 3783. [Google Scholar] [CrossRef]
- Zhang, Y.; Carballo, A.; Yang, H.; Takeda, K. Perception and sensing for autonomous vehicles under adverse weather conditions: A survey. ISPRS J. Photogramm. Remote Sens. 2023, 196, 146–177. [Google Scholar] [CrossRef]
- George, R.; Clancy, J.; Brophy, T.; Sistu, G.; O’Grady, W.; Chandra, S.; Collins, F.; Mullins, D.; Jones, E.; Deegan, B.; et al. Infrastructure Assisted Autonomous Driving: Research, Challenges, and Opportunities. IEEE Open J. Veh. Technol. 2025, 6, 662–716. [Google Scholar] [CrossRef]
- Zhou, X.; Wang, C.; Xie, Q.; Qiu, T. V2I-Coop: Accurate object detection for connected automated vehicles at accident black spots with V2I cross-modality cooperation. IEEE Trans. Mob. Comput. 2025, 24, 2043–2055. [Google Scholar] [CrossRef]
- Mo, Y.; Vijay, R.; Rufus, R.; de Boer, N.; Kim, J.; Yu, M. Enhanced Perception for Autonomous Vehicles at Obstructed Intersections: An Implementation of Vehicle to Infrastructure (V2I) Collaboration. Sensors 2024, 24, 936. [Google Scholar] [CrossRef] [PubMed]
- Xiang, C.; Zhang, L.; Xie, X.; Zhao, L.; Ke, X.; Niu, Z.; Wang, F. Multi-Sensor Fusion Algorithm in Cooperative Vehicle-Infrastructure System for Blind Spot Warning. Int. J. Distrib. Sens. Netw. 2022, 18, 15501329221100412. [Google Scholar] [CrossRef]
- Lang, A.H.; Vora, S.; Caesar, H.; Zhou, L.; Yang, J.; Beijbom, O. PointPillars: Fast Encoders for Object Detection from Point Clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; IEEE: New York, NY, USA, 2019; pp. 12689–12697. [Google Scholar] [CrossRef]
- Li, S.; Shu, K.; Chen, C.; Cao, D. Planning and Decision-Making for Connected Autonomous Vehicles at Road Intersections: A Review. Chin. J. Mech. Eng. 2021, 34, 133. [Google Scholar] [CrossRef]
- Cui, G.; Zhang, W.; Xiao, Y.; Yao, L.; Fang, Z. Cooperative Perception Technology of Autonomous Driving in the Internet of Vehicles Environment: A Review. Sensors 2022, 22, 5535. [Google Scholar] [CrossRef]
- Müller, J.; Strohbeck, J.; Herrmann, M.; Buchholz, M. Motion Planning for Connected Automated Vehicles at Occluded Intersections With Infrastructure Sensors. arXiv 2021, arXiv:2110.11246. [Google Scholar] [CrossRef]
- Zhang, Z. A flexible new technique for camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 1330–1334. [Google Scholar] [CrossRef]
- Thrun, S.; Burgard, W.; Fox, D. Probabilistic Robotics; MIT Press: Cambridge, MA, USA, 2005. [Google Scholar]
- Biber, P.; Straßer, W. The normal distributions transform: A new approach to laser scan matching. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 27–31 October 2003; IEEE: New York, NY, USA, 2003. [Google Scholar]
- Lim, H.; Hwang, S.; Shin, S.; Myung, H. Normal Distributions Transform Is Enough: Real-Time 3D Scan Matching for Pose Correction of Mobile Robot under Large Odometry Uncertainties. In Proceedings of the International Conference on Control, Automation and Systems (ICCAS), Busan, Republic of Korea, 13–16 October 2020; IEEE: New York, NY, USA, 2020. [Google Scholar] [CrossRef]
- Bao, Z.; Hossain, S.; Lang, H.; Lin, X. A Review of High-Definition Map Creation Methods for Autonomous Driving. Eng. Appl. Artif. Intell. 2023, 122, 106125. [Google Scholar] [CrossRef]
- Shin, D.; Park, K.-m.; Park, M. High Definition Map-Based Localization Using ADAS Environment Sensors for Application to Automated Driving Vehicles. Appl. Sci. 2020, 10, 4924. [Google Scholar] [CrossRef]
- Peri, N.; Li, M.; Wilson, B.; Wang, Y.-X.; Hays, J.; Ramanan, D. An Empirical Analysis of Range for 3D Object Detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCV Workshops), Paris, France, 1–6 October 2023; IEEE: New York, NY, USA, 2023. [Google Scholar]
- Zhang, J.; Yan, Y.; Cheng, Z.; Wang, W. Lightweight Attention Pyramid Network for Object Detection and Instance Segmentation. Appl. Sci. 2020, 10, 883. [Google Scholar] [CrossRef]
- Mallot, H.A.; Bülthoff, H.H.; Little, J.J.; Bohrer, S. Inverse Perspective Mapping Simplifies Optical Flow Computation and Obstacle Detection. Biol. Cybern. 1991, 64, 177–185. [Google Scholar] [CrossRef]
- Khosravi, K.; Ghaemifar, M.; Ebadollahi, S. Enhancing Spatial Awareness: A Survey of Camera-Based Frontal View to Bird’s-Eye-View Conversion. SSRN Electron. J. 2024. [Google Scholar] [CrossRef]
- Gong, S.; Ye, X.; Tan, X.; Wang, J.; Ding, E.; Zhou, Y.; Bai, X. GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation. In Proceedings of the European Conference on Computer Vision (ECCV), Tel Aviv, Israel, 23–27 October 2022. [Google Scholar]
- Justs, D.J.; Novickis, R.; Ozols, K.; Greitans, M. Bird’s-Eye-View Image Acquisition from Simulated Scenes Using Geometric Inverse Perspective Mapping. In Proceedings of the 2020 IEEE 17th Biennial Baltic Electronics Conference (BEC), Tallinn, Estonia, 6–8 October 2020; IEEE: New York, NY, USA, 2020. [Google Scholar] [CrossRef]
- Hartley, R.; Zisserman, A. Multiple View Geometry in Computer Vision; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; IEEE: New York, NY, USA, 2016; pp. 779–788. [Google Scholar] [CrossRef]
- Al Rabbani, M.A. YOLOv11 for vehicle detection: Advancements, performance, and applications in intelligent transportation systems. arXiv 2024, arXiv:2410.22898. [Google Scholar] [CrossRef]
- He, L.-H.; Zhou, Y.-Z.; Liu, L.; Cao, W.; Ma, J.-H. Research on Object Detection and Recognition in Remote Sensing Images Based on YOLOv11. Sci. Rep. 2025, 15, 14032. [Google Scholar] [CrossRef]
- Wojke, N.; Bewley, A.; Paulus, D. Simple online and realtime tracking with a deep association metric. In Proceedings of the IEEE International Conference on Image Processing (ICIP), Beijing, China, 17–20 September 2017; IEEE: New York, NY, USA, 2017; pp. 3645–3649. [Google Scholar]
- Bharilya, V.; Kumar, N. Machine learning for autonomous vehicle’s trajectory prediction: A comprehensive survey, challenges, and future research directions. Array 2024, 46, 100733. [Google Scholar] [CrossRef]
- Xu, M.; Liu, Z.; Wang, B.; Li, S. A Survey of Autonomous Driving Trajectory Prediction: Methodologies, Challenges, and Future Prospects. Machines 2025, 13, 818. [Google Scholar] [CrossRef]
- Altché, F.; de La Fortelle, A. An LSTM Network for Highway Trajectory Prediction. In Proceedings of the 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, 16–19 October 2017; IEEE: New York, NY, USA, 2017; pp. 353–359. [Google Scholar] [CrossRef]
- Deo, N.; Trivedi, M.M. Convolutional Social Pooling for Vehicle Trajectory Prediction. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA, 18–22June 2018; IEEE: New York, NY, USA, 2018. [Google Scholar] [CrossRef]
- Dahal, P.; Mentasti, S.; Paparusso, L.; Arrigoni, S.; Braghin, F. RobustStateNet: Robust ego vehicle state estimation for autonomous driving. Robot. Auton. Syst. 2024, 172, 104585. [Google Scholar] [CrossRef]
- Deo, N.; Trivedi, M.M. Multi-modal trajectory prediction of surrounding vehicles with maneuver based LSTMs. In Proceedings of the IEEE Intelligent Vehicles Symposium (IV), Changshu, China, 26–30 June 2018; IEEE: New York, NY, USA, 2018; pp. 1179–1184. [Google Scholar]
- Baek, M.; Jeong, D.; Choi, D.; Lee, S. Vehicle Trajectory Prediction and Collision Warning via Fusion of Multisensors and Wireless Vehicular Communications. Sensors 2020, 20, 288. [Google Scholar] [CrossRef]
- Zhang, R.; Cao, L.; Bao, S.; Tan, J. A Method for Connected Vehicle Trajectory Prediction and Collision Warning Algorithm Based on V2V Communication. Int. J. Crashworthiness 2017, 22, 15–25. [Google Scholar] [CrossRef]
- Goudarzi, P.; Hassanzadeh, B. Collision Risk in Autonomous Vehicles: Classification, Challenges, and Open Research Areas. Vehicles 2024, 6, 157–190. [Google Scholar] [CrossRef]
- Hamidaoui, M.; Talhaoui, M.Z.; Li, M.; Midoun, M.A.; Haouassi, S.; Mekkaoui, D.E.; Smaili, A.; Cherraf, A.; Benyoub, F.Z. Survey of Autonomous Vehicles’ Collision Avoidance Algorithms. Sensors 2025, 25, 395. [Google Scholar] [CrossRef]
- Arun, A.; Haque, M.M.; Washington, S.; Sayed, T.; Mannering, F. A Systematic Review of Traffic Conflict-Based Safety Measures with a Focus on Application Context. Anal. Methods Accid. Res. 2021, 32, 100185. [Google Scholar] [CrossRef]
- Mahmud, S.M.S.; Ferreira, L.; Hoque, M.S.; Tavassoli, A. Application of Proximal Surrogate Indicators for Safety Evaluation: A Review of Recent Developments and Research Needs. IATSS Res. 2017, 41, 153–163. [Google Scholar] [CrossRef]
- Nikolaou, D.; Ziakopoulos, A.; Yannis, G. A Review of Surrogate Safety Measures Uses in Historical Crash Investigations. Sustainability 2023, 15, 7580. [Google Scholar] [CrossRef]
- Wang, P.; Yu, H.; Liu, C.; Wang, Y.; Ye, R. Real-Time Trajectory Prediction Method for Intelligent Connected Vehicles in Urban Intersection Scenarios. Sensors 2023, 23, 2950. [Google Scholar] [CrossRef]
- Shi, J.; Sun, D.; Guo, B. Vehicle Trajectory Prediction Based on Graph Convolutional Networks in Connected Vehicle Environment. Appl. Sci. 2023, 13, 13192. [Google Scholar] [CrossRef]


























| Trial Number | Average Speed (m/s) | Standard Deviation (SD) |
|---|---|---|
| Trial 1 | 3.045 | 0.648 |
| Trial 2 | 2.292 | 0.637 |
| Trial 3 | 2.978 | 0.622 |
| Trial 4 | 2.922 | 0.658 |
| Trial 5 | 3.012 | 0.657 |
| Trial 6 | 2.988 | 0.691 |
| Trial 7 | 2.972 | 0.667 |
| Trial 8 | 3.018 | 0.749 |
| Trial 9 | 2.961 | 0.698 |
| Trial 10 | 2.937 | 0.747 |
| Mean Speed (All Trials) | 2.9125 | 0.6774 |
| Global Baseline Velocity | 2.975 | 0.681 |
| Initial Condition | Case Scenario Outcome | Obstacle Found Event | Obstacle in Front Event | Reaction to Obstacle (After Found) | |||
|---|---|---|---|---|---|---|---|
| Time (s) | Speed Cmd (m/s) | Time (s) | Speed Cmd (m/s) | Time (s) | Speed Cmd (m/s) | ||
| Condition 1: Robot 1.0 m/RC 4.5 m (Both Vehicles Far) | Pre-Intersection Target Clearance (High-Risk) | 1.79 | 1.284 | 2.11 | 1.284 | 1.79 | 0.7746 |
| 2.20 | 1.095 | 2.62 | 1.284 | - | - | ||
| 1.95 | 1.284 | 2.27 | 1.284 | - | - | ||
| 2.08 | 1.284 | 2.50 | 1.095 | 2.27 | 0.5478 | ||
| 2.20 | 1.095 | 3.10 | 1.095 | 2.21 | 0.5478 | ||
| 2.33 | 1.200 | 3.10 | 1.095 | - | - | ||
| 2.24 | 1.200 | 2.84 | 1.095 | 2.52 | 0.5477 | ||
| 1.92 | 1.200 | 2.56 | 1.200 | - | - | ||
| 2.22 | 1.200 | 2.79 | 1.095 | - | - | ||
| 2.24 | 1.095 | 2.88 | 1.095 | - | - | ||
| Condition 2: Robot 1.0 m/RC 3.0 m (RC Obstacle Near Corner) | Early Target Intersection Clearance | 1.98 | 1.095 | 2.49 | 1.095 | - | - |
| 2.05 | 1.095 | 2.56 | 1.095 | - | - | ||
| 1.76 | 1.200 | 2.34 | 1.200 | 2.4 | 0.7746 | ||
| 1.70 | 1.200 | 2.24 | 1.200 | 2.59 | 0.7746 | ||
| 1.70 | 1.200 | 2.31 | 1.200 | - | - | ||
| 2.27 | 1.200 | 2.82 | 1.200 | - | - | ||
| 1.99 | 1.200 | 2.69 | 1.095 | - | - | ||
| 1.41 | 1.200 | 1.95 | 1.095 | - | - | ||
| 1.95 | 1.095 | 2.59 | 1.095 | - | - | ||
| 1.95 | 1.200 | 2.49 | 1.200 | - | - | ||
| Condition 3: Robot 0.5 m/RC 4.5 m (Autonomous Robot Near Corner) | Cross-Traffic Interception Collision | 2.08 | 1.095 | 3.65 | 1.095 | - | - |
| 2.02 | 1.095 | 3.49 | 0.949 | - | - | ||
| 1.85 | 1.095 | 3.58 | 0.776 | - | - | ||
| 2.01 | 1.095 | 3.58 | 0.949 | - | - | ||
| 1.60 | 0.949 | 3.07 | 1.095 | - | - | ||
| 1.88 | 0.949 | 3.16 | 0.949 | - | - | ||
| 1.96 | 0.949 | 3.08 | 1.095 | - | - | ||
| 1.79 | 0.949 | 2.98 | 0.949 | - | - | ||
| 0.93 | 1.225 | 2.05 | 0.949 | - | - | ||
| 1.85 | 1.225 | 2.81 | 1.095 | - | - | ||
| Component | Specification |
|---|---|
| LiDAR sensor | Velodyne VLP-16 (16 channels, 100 m range) |
| IMU | Pixhawk 3 Pro |
| Camera | 1080p RGB IP camera (2.5 m height, 90° FOV) |
| Main computer | Intel Core i7-11700K |
| Robot hardware | NVIDIA Jetson AGX Xavier, 32 GB RAM |
| Metric | Value |
|---|---|
| Precision | 1.00 |
| Recall | 0.99 |
| F1-score | 0.98 |
| mAP@0.5 | 0.991 |
| Metric | Value (m) |
|---|---|
| Mean error (X-axis) | 0.02 |
| Mean error (Y-axis) | 0.03 |
| RMSE | 0.04 |
| Max error | 0.08 |
| Metric | Value |
|---|---|
| Mean squared error (MSE) | 0.00387 |
| Mean absolute error (MAE) | 0.03969 |
| Coefficient of determination () | 0.99588 |
| Metric | Value |
|---|---|
| Mean squared error (MSE) | 0.0058 |
| Mean absolute error (MAE) | 0.0493 |
| Coefficient of determination () | 0.9813 |
| Test | Test | Distance from | System Response | Braking | Stopping | Autonomous |
|---|---|---|---|---|---|---|
| Case | Scenarios | Object (Meters) | Time (Seconds) | Distance (m) | Status | Speed (m/s) |
| 1 | Static object (front) | 1.3 | 0.3 | 1 | Stopped successfully | 2 |
| 2 | Static object (side) | 1.1 | 0.4 | 0.9 | Stopped successfully | 2.5 |
| 3 | Slowly moving object (constant speed) | 1.5 | 0.25 | 1.2 | Stopped successfully | 2.3 |
| 4 | Slowly moving object (constant speed) | 1 | 0.5 | 0.8 | Stopped successfully | 2 |
| 5 | Static object in low-light conditions | 1.7 | 0.35 | 1.3 | Stopped successfully | 2.1 |
| 6 | Slowly moving object (constant speed) | 1.4 | 0.2 | 1.1 | Stopped successfully | 2.2 |
| 7 | Slowly moving object (constant speed) | 1.2 | 0.45 | 0.95 | Stopped successfully | 2.3 |
| 8 | Fast moving object (constant speed) | 1 | 0.3 | 0.9 | Stopped successfully | 2.1 |
| 9 | Environment with water on the ground | 1.4 | 0.4 | 1 | Stopped successfully | 2.2 |
| 10 | Slowly moving object (constant speed) | 1.3 | 0.3 | 1.1 | Stopped successfully | 2.3 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Tantrairatn, S.; Phinphimai, P.; Phuangmalee, N.; Karaked, P.; Petcharat, N.; Pichitkul, A.; Ariyarit, A. Connected Perception Between Lightweight Robot and External Camera for Blind-Spot Awareness. Technologies 2026, 14, 338. https://doi.org/10.3390/technologies14060338
Tantrairatn S, Phinphimai P, Phuangmalee N, Karaked P, Petcharat N, Pichitkul A, Ariyarit A. Connected Perception Between Lightweight Robot and External Camera for Blind-Spot Awareness. Technologies. 2026; 14(6):338. https://doi.org/10.3390/technologies14060338
Chicago/Turabian StyleTantrairatn, Suradet, Poommin Phinphimai, Nattapong Phuangmalee, Pawarut Karaked, Nutchanan Petcharat, Auraluck Pichitkul, and Atthaphon Ariyarit. 2026. "Connected Perception Between Lightweight Robot and External Camera for Blind-Spot Awareness" Technologies 14, no. 6: 338. https://doi.org/10.3390/technologies14060338
APA StyleTantrairatn, S., Phinphimai, P., Phuangmalee, N., Karaked, P., Petcharat, N., Pichitkul, A., & Ariyarit, A. (2026). Connected Perception Between Lightweight Robot and External Camera for Blind-Spot Awareness. Technologies, 14(6), 338. https://doi.org/10.3390/technologies14060338

