A Low-Cost Vision–GPS Framework for the Unified Mapping of Vertical and Horizontal Road Assets Using Deep Learning
Abstract
1. Introduction
2. Materials and Methods
2.1. Vertical Road Signs Setup
2.2. Horizontal Road Marking Acquisition Setup
2.3. System Activation and Data Alignment
2.4. Quantitative Ablation of the Distance-Aware Sampling Strategy
3. Dataset Creation
3.1. Vertical Signage Dataset
Synthetic Data Generation and Class Balancing
3.2. Horizontal Road Marking Dataset
Polygonal Annotation and Class Balancing for Horizontal Road Markings
4. Deep Learning Models and Training Setup
- YOLO11m [24]: an efficient one-stage architecture used for vertical traffic sign detection. Its segmentation variant, YOLO11m-seg, was adopted for horizontal road marking instance segmentation.
- RT-DETR [25]: a transformer-based detector selected for its ability to exploit global image context.
- Faster R-CNN [26]: a consolidated two-stage baseline based on region proposals, used as a reference architecture for object localization.
Training Configuration and Evaluation Metrics
- Architectural Constraints of Alternatives: Two-stage networks (e.g., Mask R-CNN) and transformer-based models can provide accurate segmentation in offline environments, but they generally involve higher computational complexity and memory requirements. In low-cost vehicle-integrated workflows, this may reduce processing throughput and increase the risk of missed observations when operating on dense video streams.
- Optimization and Structural Alignment: Pavement markings feature highly specific, elongated, and continuous geometries (e.g., directional arrows). YOLO11m-seg utilizes advanced single-stage cross-stage spatial attention modules that capture these long, narrow pixel structures effectively without the massive VRAM overhead of vision transformers.
5. Results
5.1. Vertical Traffic Sign Detection Results
5.2. Horizontal Road Marking Results
5.3. Georeferenced Road-Asset Mapping Output
5.4. Quantitative Evaluation of Georeferencing Accuracy
5.5. Independent Operational Validation on Heterogeneous Road Types
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| CV | Coefficient of Variation |
| DFL | Distribution Focal Loss |
| EIS | Electronic Image Stabilization |
| FOV | Field of View |
| GNSS | Global Navigation Satellite System |
| GPS | Global Positioning System |
| HD | High Definition |
| IoU | Intersection over Union |
| ITS | Intelligent Transportation Systems |
| mAP | Mean Average Precision |
| RT-DETR | Real-Time Detection Transformer |
| SD | Standard Deviation of Spacing |
| YOLO | You Only Look Once |
References
- World Health Organization. Global Status Report on Road Safety 2023; WHO: Geneva, Switzerland, 2023. [Google Scholar]
- Seif, H.G.; Hu, X. Autonomous Driving in the iCity—HD Maps as a Key Challenge of the Automotive Industry. Engineering 2016, 2, 159–162. [Google Scholar]
- Mobileye. Road Experience Management (REM). Available online: https://www.mobileye.com (accessed on 10 April 2024).
- Cyclomedia. Condition Assessment for Traffic Signs & Pavement Striping. Available online: https://www.cyclomedia.com (accessed on 10 April 2024).
- Elhashash, M.; Qin, R.; Elbaz, G.; Shoeb, M.A.; Haidary, J.S.; Chow, S.C.Y. A review of mobile mapping systems: From sensors to applications. Sensors 2022, 22, 4262. [Google Scholar] [CrossRef] [PubMed]
- Qiu, Z.; Martínez-Sánchez, J.; Brea, V.M.; Lopez, P.; Arias, P. Low-cost mobile mapping system solution for traffic sign detection and localization in urban environments. Int. J. Appl. Earth Obs. Geoinf. 2022, 112, 102895. [Google Scholar]
- Profumo, D.; de León, G.; Monticelli, A.; Fredianelli, L.; Licitra, G. Real-Time Low-Cost Traffic Monitoring Based on Quantized Convolutional Neural Networks for the CNOSSOS-EU Noise Model. Sensors 2026, 26, 1736. [Google Scholar] [PubMed]
- Stallkamp, J.; Schlipsing, M.; Salmen, J.; Igel, C. The German traffic sign recognition benchmark: A multi-class classification competition. In Proceedings of the International Joint Conference on Neural Networks (IJCNN), San Jose, CA, USA, 31 July–5 August 2011. [Google Scholar]
- Houben, S.; Stallkamp, J.; Schlipsing, M.; Salmen, J.; Igel, C. Detection of traffic signs in real-world images: The German traffic sign detection benchmark. In Proceedings of the International Joint Conference on Neural Networks (IJCNN), Dallas, TX, USA, 4–9 August 2013. [Google Scholar]
- Zhu, Z.; Liang, D.; Zhang, S.; Huang, X.; Li, B.; Hu, S. Traffic-sign detection and classification in the wild. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
- Ertler, C.; Mislej, J.; Ollmann, T.; Porzi, L.; Neuhold, G.; Kuang, Y. The Mapillary traffic sign dataset for detection and classification on a global scale. In Proceedings of the European Conference on Computer Vision (ECCV), Glasgow, UK, 23–28 August 2020. [Google Scholar]
- Flores-Calero, M.; Astudillo, C.A.; Guevara, D.; Maza, J. Traffic Sign Detection and Recognition Using YOLO Object Detection Algorithm: A Systematic Review. Mathematics 2024, 12, 297. [Google Scholar] [CrossRef]
- Pan, X.; Shi, J.; Luo, P.; Wang, X.; Tang, X. Spatial as deep: Spatial CNN for traffic scene understanding. In Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
- Lee, S.; Kim, J.; Shin Yoon, J.; Shin, S.; Bailo, O.; Kim, N.; Lee, T.H.; Seok Hong, H.; Han, S.H.; So Kweon, I. VPGNet: Vanishing point guided network for lane and road marking detection and recognition. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017. [Google Scholar]
- Neven, D.; De Brabandere, B.; Georgoulis, S.; Proesmans, M.; Van Gool, L. Towards end-to-end lane detection: An instance segmentation approach. In Proceedings of the IEEE Intelligent Vehicles Symposium (IV), Changshu, China, 26–30 June 2018. [Google Scholar]
- Wang, H.; Tian, J.; Yuan, J.; Liu, H. Road marking detection based on Mask R-CNN instance segmentation model. In Proceedings of the IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates, 25–28 October 2020; pp. 1–5. [Google Scholar]
- Fan, S.; Zhang, Y.; Lu, S.; Bi, X. A deep learning approach for lane marking detection applying encode-decode instance segmentation network. Sci. Rep. 2023, 13, 20310. [Google Scholar] [PubMed]
- Mamun, A.A.; Em, P.P.; Hossen, M.J.; Jahan, B.; Tahabilder, A. A comprehensive review on lane marking detection using deep neural networks. Sensors 2022, 22, 7682. [Google Scholar] [CrossRef] [PubMed]
- Dewi, C.; Chen, R.C.; Liu, Y.T.; Tai, S.K. Synthetic data generation using DCGAN for improved traffic sign recognition. Neural Comput. Appl. 2022, 34, 21465–21480. [Google Scholar]
- Konushin, A.; Faizov, B.; Shakhuro, V. Road Images Augmentation with Synthetic Traffic Signs Using Neural Networks. arXiv 2021, arXiv:2101.04927. [Google Scholar]
- da Silva, D.L.; Fernandes, A.R. Traffic Sign Repositories: Bridging the Gap Between Real and Synthetic Data. In Deep Learning Theory and Applications; Springer: Cham, Switzerland, 2023; pp. 56–77. [Google Scholar]
- Roboflow. Roboflow (Version 1.0). Available online: https://roboflow.com (accessed on 10 April 2024).
- Kaggle. Kaggle: Your Home for Data Science. Available online: https://www.kaggle.com (accessed on 10 April 2024).
- Khanam, R.; Hussain, M. YOLOv11: An Overview of the Key Architectural Enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar]
- Zhao, Y.; Lv, W.; Xu, S.; Wei, J.; Wang, G.; Dang, Q.; Liu, Y.; Chen, J. DETRs Beat YOLOs on Real-time Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 17–21 June 2024. [Google Scholar]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
- Golcarenarenji, G.; Martinez-Alpiste, I.; Wang, Q.; Alcaraz-Calero, J.M. Robust Real-Time Traffic Light Detector on Small-Form Platform for Autonomous Vehicles. J. Intell. Transp. Syst. 2024, 28, 668–678. [Google Scholar]
- Wu, J.; Liu, W.; Maruyama, Y. Street View Image-Based Road Marking Inspection System Using Computer Vision and Deep Learning Techniques. Sensors 2024, 24, 7724. [Google Scholar] [CrossRef] [PubMed]
- Chen, R.-C.; Chao, W.-K.; Manongga, W.E.; Sub-r-pa, C. Instance Segmentation of Road Marking Signs Using YOLO Models. J. Adv. Inf. Technol. 2024, 15, 1131–1137. [Google Scholar] [CrossRef]
- Madeira, S.; Yan, W.; Bastos, L.; Gonçalves, J.A. Accuracy assessment of the integration of GNSS and a MEMS IMU in a terrestrial platform. Sensors 2014, 14, 23504–23525. [Google Scholar] [CrossRef]
- Laconte, J.; Deray, J.; Underwood, J.; Megret, R.; Peynot, T. Accurate collaborative globally-referenced digital mapping with standard GNSS. Sensors 2018, 18, 2721. [Google Scholar] [CrossRef]














| Pipeline | Strategy | N | Target (m) | Mean (m) | SD (m) | CV (%) | Short | Long | ±20% |
|---|---|---|---|---|---|---|---|---|---|
| Horizontal markings | Distance-aware | 7978 | 6.5 | 6.5 | 0.77 | 11.92 | 0 | 0 | 89.14 |
| Horizontal markings | Fixed-time | 7978 | 6.5 | 6.5 | 2.76 | 42.45 | 968 | 1020 | 35.39 |
| Vertical signs | Distance-aware | 2544 | 13 | 13.00 | 0.44 | 3.40 | 0 | 0 | 100.0 |
| Vertical signs | Fixed-time | 2544 | 13 | 13.01 | 4.76 | 36.57 | 216 | 194 | 57.09 |
| Class | Train | Val | Test | Total | Percentage |
|---|---|---|---|---|---|
| Danger | 2003 | 571 | 286 | 2860 | 13% |
| Prohibition | 2464 | 731 | 363 | 3558 | 16% |
| Priority | 1033 | 298 | 147 | 1478 | 7% |
| Mandatory | 1921 | 519 | 316 | 2756 | 12% |
| Stop | 1233 | 361 | 166 | 1750 | 8% |
| Give-way | 1459 | 411 | 185 | 2055 | 9% |
| No-stopping/No-parking | 1370 | 399 | 251 | 2020 | 9% |
| No-entry | 1206 | 361 | 187 | 1754 | 8% |
| Pedestrian crossing | 2336 | 632 | 366 | 3334 | 15% |
| Traffic light | 767 | 227 | 99 | 1093 | 5% |
| Class | Train | Val | Test | Total | Percentage |
|---|---|---|---|---|---|
| Give-way | 627 | 116 | 143 | 886 | 11% |
| Left arrow | 599 | 138 | 126 | 863 | 10% |
| Right arrow | 613 | 120 | 142 | 875 | 10% |
| Stop | 623 | 135 | 140 | 898 | 11% |
| Straight arrow | 681 | 144 | 150 | 975 | 12% |
| Straight-left arrow | 633 | 147 | 135 | 915 | 11% |
| Straight-right arrow | 622 | 153 | 120 | 895 | 11% |
| Triangular give-way marking | 671 | 135 | 145 | 951 | 11% |
| Pedestrian crossing | 780 | 156 | 159 | 1095 | 13% |
| Parameter | Vertical Sign Pipeline | Horizontal Marking Pipeline |
|---|---|---|
| Recognition task | Object detection | Instance segmentation and detection comparison |
| Input resolution | 640 × 640 pixels | 640 × 640 pixels |
| Annotation type | Bounding boxes | Polygonal masks |
| Evaluated models | YOLO11m, RT-DETR, Faster R-CNN | YOLO11m-seg, RT-DETR, Faster R-CNN |
| Dataset split | 70% train, 20% validation, 10% test | 70% train, 15% validation, 15% test |
| Main metrics | Precision, Recall, mAP50, mAP50-95, F1-score | Precision, Recall, mAP50, mAP50-95, F1-score |
| Final model selection | Test-set performance | Test-set performance |
| Class | Instances | Precision | Recall | mAP50 | mAP50-95 | F1 |
|---|---|---|---|---|---|---|
| Danger | 286 | 0.96 | 0.92 | 0.97 | 0.89 | 0.94 |
| Prohibition | 363 | 0.96 | 0.96 | 0.98 | 0.92 | 0.96 |
| Priority | 147 | 1.00 | 0.95 | 0.97 | 0.96 | 0.97 |
| Mandatory | 316 | 0.98 | 0.84 | 0.95 | 0.86 | 0.90 |
| Stop | 143 | 0.99 | 1.00 | 0.99 | 0.99 | 0.99 |
| Give-way | 171 | 0.96 | 0.95 | 0.98 | 0.93 | 0.95 |
| No-parking/No-stopping | 230 | 0.98 | 0.74 | 0.92 | 0.85 | 0.84 |
| No-entry | 170 | 0.98 | 0.93 | 0.98 | 0.94 | 0.95 |
| Pedestrian crossing | 329 | 0.96 | 0.90 | 0.98 | 0.90 | 0.93 |
| Traffic light | 70 | 0.85 | 0.66 | 0.74 | 0.60 | 0.74 |
| ALL | 2225 | 0.96 | 0.88 | 0.94 | 0.88 | 0.92 |
| Class | Instances | Precision | Recall | Mask mAP50 | Mask mAP50-95 | F1 |
|---|---|---|---|---|---|---|
| Give-way | 142 | 0.972 | 0.973 | 0.992 | 0.94 | 0.97 |
| Left arrow | 126 | 0.981 | 0.976 | 0.989 | 0.967 | 0.98 |
| Right arrow | 142 | 0.993 | 0.983 | 0.995 | 0.978 | 0.99 |
| Stop | 140 | 0.98 | 0.964 | 0.983 | 0.92 | 0.97 |
| Straight arrow | 145 | 1 | 0.968 | 0.988 | 0.91 | 0.98 |
| Straight-left arrow | 135 | 0.991 | 1 | 0.995 | 0.959 | 0.99 |
| Straight-right arrow | 120 | 0.99 | 0.992 | 0.995 | 0.948 | 0.99 |
| Triangular give-way marking | 136 | 0.977 | 0.899 | 0.952 | 0.794 | 0.94 |
| Pedestrian crossing | 156 | 0.95 | 0.912 | 0.949 | 0.822 | 0.93 |
| ALL | 1242 | 0.98 | 0.96 | 0.98 | 0.91 | 0.96 |
| Asset Type | Length (km) | Real Assets | Predicted Assets | Difference | Absolute Error | Error per km | Relative Error |
|---|---|---|---|---|---|---|---|
| Vertical signs | 22 | 216 | 245 | +29 | 31 | 1.41 | 14.35% |
| Horizontal markings | 22 | 63 | 74 | +11 | 21 | 0.95 | 33.33% |
| All assets | 22 | 279 | 319 | +40 | 52 | 2.36 | 18.64% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Profumo, D.; Akbar, R.; Fiorella, L.; Fredianelli, L.; Ascari, E.; D’Alessandro, F.; Fidecaro, F.; Licitra, G. A Low-Cost Vision–GPS Framework for the Unified Mapping of Vertical and Horizontal Road Assets Using Deep Learning. Sensors 2026, 26, 4042. https://doi.org/10.3390/s26134042
Profumo D, Akbar R, Fiorella L, Fredianelli L, Ascari E, D’Alessandro F, Fidecaro F, Licitra G. A Low-Cost Vision–GPS Framework for the Unified Mapping of Vertical and Horizontal Road Assets Using Deep Learning. Sensors. 2026; 26(13):4042. https://doi.org/10.3390/s26134042
Chicago/Turabian StyleProfumo, Domenico, Raza Akbar, Laura Fiorella, Luca Fredianelli, Elena Ascari, Francesco D’Alessandro, Francesco Fidecaro, and Gaetano Licitra. 2026. "A Low-Cost Vision–GPS Framework for the Unified Mapping of Vertical and Horizontal Road Assets Using Deep Learning" Sensors 26, no. 13: 4042. https://doi.org/10.3390/s26134042
APA StyleProfumo, D., Akbar, R., Fiorella, L., Fredianelli, L., Ascari, E., D’Alessandro, F., Fidecaro, F., & Licitra, G. (2026). A Low-Cost Vision–GPS Framework for the Unified Mapping of Vertical and Horizontal Road Assets Using Deep Learning. Sensors, 26(13), 4042. https://doi.org/10.3390/s26134042

