Urban Intelligent Transportation-Oriented License Plate Recognition Model for Severe Environments Based on Hybrid Architecture of YOLOv12, GAN and Mamba-SSM
Abstract
1. Introduction
- We propose a unified adverse-weather license plate recognition framework CLEI, which integrates YOLOv12-based detection, GAN-based enhancement, and Mamba-based recognition into a collaborative pipeline to alleviate error accumulation under severe degradation.
- We design a CNN–Mamba Network (CMN) that replaces traditional recurrent sequence modeling with a selective state-space model for character recognition, improving both inference efficiency and robustness to blurred characters.
- We conduct extensive experiments under rain/snow, fog, and low-light conditions on publicly available datasets, and verify that the proposed framework outperforms classical methods in recognition accuracy while maintaining efficiency.
- We evaluate the contribution of each module via ablation studies, confirming that both the enhancement and recognition components improve performance, and their combination achieves the best results.
2. Related Work
2.1. Research on License Plate Detection
2.2. Research on Degraded Image Enhancement
2.3. Research on Character Recognition and Sequence Modeling
2.4. Research on Collaborative Recognition for Harsh Environments Is Insufficient
3. Methodology
3.1. Overall Architecture of CLEI
3.2. License Plate Detection Module
3.3. Recognition-Oriented Enhancement Module
3.4. CMN Recognition Module
- (1)
- Superior Inference Efficiency. BiLSTMs process sequences sequentially, resulting in quadratic complexity growth with sequence length. While capable of handling the typical 7–8 character plates, this creates a latency bottleneck when integrated with real-time front-end modules like YOLO and GAN. In contrast, Mamba, based on a State Space Model (SSM), achieves linear time complexity. It captures long-range character dependencies in parallel, offering an inference speed that is 1–2 orders of magnitude faster than BiLSTM, thereby meeting the real-time demands of the application.
- (2)
- Enhanced Robustness to Degradations. BiLSTMs exhibit limited robustness to noise and blur. When characters are degraded, fragmented, or deformed in harsh conditions, they tend to lose long-range contextual information. Mamba’s “selective state update” mechanism dynamically focuses on relevant character features. Combined with convolutional layers that supplement local texture information, this results in significantly more stable sequence modeling for blurred or incomplete license plates.
4. Experiment and Results
4.1. Dataset Descriptions
4.2. Implementation Details and Results
4.2.1. License Plate Detection Experiment
- (1)
- Test environment
- (2)
- Test Evaluation Indicators
- (3)
- Analysis of Test Results
4.2.2. DeblurGANv2ProcessingExperiment
- (1)
- Test environment
- (2)
- Test Evaluation Indicators
- (3)
- Analysis of Test Results
4.2.3. License Plate Recognition Experiment
- (1)
- Test environment
- (2)
- Test Evaluation Indicators
- (3)
- Analysis of Test Results
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zhou, Y.; Peng, S.; Lyu, H.; Tong, F.; Huang, C.; Niu, J. KLOTSKI: Towards Consensus Enabled Collaborative Vehicles in Intelligent Transportation. IEEE Trans. Veh. Technol. 2025, 74, 18620–18634. [Google Scholar] [CrossRef]
- Sonnara, F.; Chihaoui, H.; Filali, F. Efficient real-time license plate recognition using deep learning on edge devices. J. Real-Time Image Process. 2025, 22, 159. [Google Scholar] [CrossRef]
- Mustafa, T.; Karabatak, M. Real time car model and plate detection system by using deep learning architectures. IEEE Access 2024, 12, 107616–107630. [Google Scholar] [CrossRef]
- Solla, M.; Pérez-Gracia, V.; Fontul, S. A review of GPR application on transport infrastructures: Troubleshooting and best practices. Remote Sens. 2021, 13, 672. [Google Scholar] [CrossRef]
- Bensouilah, M.; Zennir, M.N.; Taffar, M. An ALPR System-based Deep Networks for the Detection and Recognition. In Proceedings of the ICPRAM, Virtual, 4–6 February 2021; pp. 204–211. [Google Scholar]
- Wang, J.; Wu, Z.; Liang, Y.; Tang, J.; Chen, H. Perception methods for adverse weather based on vehicle infrastructure cooperation system: A review. Sensors 2024, 24, 374. [Google Scholar] [CrossRef] [PubMed]
- Shashirangana, J.; Padmasiri, H.; Meedeniya, D.; Perera, C. Automated license plate recognition: A survey on methods and techniques. IEEE Access 2020, 9, 11203–11225. [Google Scholar] [CrossRef]
- Ali, M.; Zhang, Z. The YOLO framework: A comprehensive review of evolution, applications, and benchmarks in object detection. Computers 2024, 13, 336. [Google Scholar] [CrossRef]
- Ismail, A.; Mehri, M.; Sahbani, A.; Amara, N.E.B. Automatic license plate recognition in in-the-wild scenarios: A comprehensive review, open issues, and future directions. IEEE Access 2025, 13, 145387–145415. [Google Scholar] [CrossRef]
- Qin, H.; Wang, D.; Cai, Z.; Zeng, J. Real-Time Traffic Arrival Prediction for Intelligent Signal Control Using a Hidden Markov Model-Filtered Dynamic Platoon Dispersion Model and Automatic License Plate Recognition Data. Appl. Sci. 2025, 15, 11537. [Google Scholar] [CrossRef]
- Meesad, P.; Thumthong, W. Advanced deep learning techniques for automated license plate recognition. Sci. Rep. 2025, 15, 41194. [Google Scholar] [CrossRef]
- Sapkota, R.; Flores-Calero, M.; Qureshi, R.; Badgujar, C.; Nepal, U.; Poulose, A.; Zeno, P.; Vaddevolu, U.B.P.; Khan, S.; Shoman, M. YOLO advances to its genesis: A decadal and comprehensive review of the You Only Look Once (YOLO) series. Artif. Intell. Rev. 2025, 58, 274. [Google Scholar] [CrossRef]
- Tang, Y.; Zhang, S.; Liu, W.; Wang, Z.; Wang, J. Ultra-lightweight automatic license plate recognition system for microcontrollers: A cost-effective and energy-efficient solution. IEEE Trans. Intell. Transp. Syst. 2024, 25, 20419–20434. [Google Scholar] [CrossRef]
- Li, W.; Pun, C.-M. A single-target license plate detection with attention. In Proceedings of the 2022 International Workshop on Advanced Imaging Technology (IWAIT); SPIE: Bellingham, WA, USA, 2022; pp. 396–401. [Google Scholar] [CrossRef]
- Yogheedha, K.; Nasir, A.; Jaafar, H.; Mamduh, S. Automatic vehicle license plate recognition system based on image processing and template matching approach. In Proceedings of the 2018 International Conference on Computational Approach in Smart Systems Design and Applications (ICASSDA); IEEE: New York, NY, USA, 2018; pp. 1–8. [Google Scholar]
- Samantaray, M.; Biswal, A.K.; Singh, D.; Samanta, D.; Karuppiah, M.; Joseph, N.P. Optical character recognition (ocr) based vehicle’s license plate recognition system using python and opencv. In Proceedings of the 2021 5th International Conference on Electronics, Communication and Aerospace Technology (ICECA); IEEE: New York, NY, USA, 2021; pp. 849–853. [Google Scholar]
- Chowdhury, D.; Mandal, S.; Das, D.; Banerjee, S.; Shome, S.; Choudhary, D. An adaptive technique for computer vision based vehicles license plate detection system. In Proceedings of the 2019 International Conference on Opto-Electronics and Applied Optics (Optronix); IEEE: New York, NY, USA, 2019; pp. 1–6. [Google Scholar] [CrossRef]
- Menon, A.; Omman, B. Detection and recognition of multiple license plate from still images. In Proceedings of the 2018 International Conference on Circuits and Systems in Digital Enterprise Technology (ICCSDET); IEEE: New York, NY, USA, 2018; pp. 1–5. [Google Scholar] [CrossRef]
- Mathur, M.; Devi, G.U.; Chauhan, S. Robust License Plate Detection and Recognition in Adverse Weather Using Drone-Captured Data and a Hybrid YOLOv8-Swin-BiFAN Framework. In Proceedings of the 2025 13th International Conference on Intelligent Embedded, MicroElectronics, Communication and Optical Networks (IEMECON); IEEE: New York, NY, USA, 2025; pp. 1–6. [Google Scholar]
- Rahmani, M.; Sabaghian, M.; Moghadami, S.M.; Talaie, M.M.; Naghibi, M.; Keyvanrad, M.A. Ir-lpr: A large scale iranian license plate recognition dataset. In Proceedings of the 2022 12th International Conference on Computer and Knowledge Engineering (ICCKE); IEEE: New York, NY, USA, 2022; pp. 053–058. [Google Scholar] [CrossRef]
- Al-Shemarry, M.S.; Li, Y.; Abdulla, S. An efficient texture descriptor for the detection of license plates from vehicle images in difficult conditions. IEEE Trans. Intell. Transp. Syst. 2019, 21, 553–564. [Google Scholar] [CrossRef]
- Khanam, R.; Hussain, M. A review of YOLOv12: Attention-based enhancements vs. previous versions. arXiv 2025, arXiv:2504.11995. [Google Scholar] [CrossRef]
- Zhang, C.; Zhang, C.; Wang, S.; Dong, Y.; Guan, X.; Fan, H.; Zhao, R.; Xu, G. EVF-YOLO: A lightweight network for license plate detection under severe weather conditions. In Proceedings of the 2024 International Conference on Intelligent Computing; Springer: Singapore, 2024; pp. 131–142. [Google Scholar]
- Sugiharto, A.; Kusumaningrum, R. Enhanced automatic license plate detection and recognition using clahe and yolov11 for seat belt compliance detection. Eng. Technol. Appl. Sci. Res. 2025, 15, 20271–20278. [Google Scholar] [CrossRef]
- Na, K.; Park, G.; Kim, I. CharDiff-LP: A Diffusion Model with Character-Level Guidance for License Plate Image Restoration. arXiv 2025, arXiv:2510.17330. [Google Scholar]
- Hu, L.; Zeng, W.; Cai, Z.; Guo, J.; Wang, Y.; Wei, N. Research and implementation of a license plate detection algorithm for complex environments. In Proceedings of the 2023 8th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA); IEEE: New York, NY, USA, 2023; pp. 551–555. [Google Scholar]
- Nguyen, C.; Tran, D.T.; Nguyen, H.; Phan, X.-V.; Nguyen, N.-P. VRAE: Vertical Residual Autoencoder for License Plate Denoising and Deblurring. arXiv 2025, arXiv:2509.08392. [Google Scholar] [CrossRef]
- Zhu, C.; Zhang, H.; He, M.; Li, Y.; Qiao, X. Nighttime Hazy Image Enhancement via Progressively and Mutually Reinforcing Night-Haze Priors. arXiv 2026, arXiv:2601.01998. [Google Scholar] [CrossRef]
- Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2014; Curran Associates, Inc.: Red Hook, NY, USA, 2014; Volume 27. [Google Scholar]
- Song, Y.; He, Z.; Qian, H.; Du, X. Vision transformers for single image dehazing. IEEE Trans. Image Process. 2023, 32, 1927–1941. [Google Scholar] [CrossRef]
- Xu, G.; Ke, Z.; Zuo, P.; Lei, B. TransLPRNet: Lite Vision-Language Network for Single/Dual-line Chinese License Plate Recognition. arXiv 2025, arXiv:2507.17335. [Google Scholar]
- Qin, C.; Schlemper, J.; Caballero, J.; Price, A.N.; Hajnal, J.V.; Rueckert, D. Convolutional recurrent neural networks for dynamic MR image reconstruction. IEEE Trans. Med. Imaging 2018, 38, 280–290. [Google Scholar] [CrossRef]
- Tomonaga, S.; Doya, K.; Murata, N. Lag Operator SSMs: A Geometric Framework for Structured State Space Modeling. arXiv 2025, arXiv:2512.18965. [Google Scholar] [CrossRef]
- Shim, S.-O.; Imtiaz, R.; Siddiq, A.; Khan, I.R. License plates detection and recognition with multi-exposure images. Int. J. Adv. Comput. Sci. Appl. 2022, 13, 300–305. [Google Scholar] [CrossRef]
- Li, Z. A method for license plate recognition in low-resolution conditions. In Proceedings of the 2025 ITM Web of Conferences; EDP Sciences: Les Ulis, France, 2025; p. 02028. [Google Scholar] [CrossRef]
- Joshi, S.; Jejure, P.; Jadhav, C.; Jankar, V.; Mote, A. Automatic Number Plate Recognition Using YOLOv8 Model. Int. J. Sci. Res. Sci. Technol. 2025, 12, 1088–1097. [Google Scholar] [CrossRef]
- Tisanarada, A.; Giap, Y.C. Optical Character Recognition (OCR) Of License Plates Using the KNN Method. Indones. Appl. Res. Comput. Inform. 2025, 1, 10–19. [Google Scholar]
- Shabaninia, E.; Asadi-zeydabadi, F.; Nezamabadi-pour, H. Layout-Independent License Plate Recognition via Integrated Vision and Language Models. arXiv 2025, arXiv:2510.10533. [Google Scholar] [CrossRef]
- Du, Y.; Chen, Z.; Jia, C.; Yin, X.; Li, C.; Du, Y.; Jiang, Y.-G. Context perception parallel decoder for scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2025, 47, 4668–4683. [Google Scholar] [CrossRef]
- Fang, S.; Xie, H.; Wang, Y.; Mao, Z.; Zhang, Y. Read like humans: Autonomous, bidirectional and iterative language modeling for scene text recognition. In Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE Computer Society: Los Alamitos, CA, USA, 2021; pp. 7098–7107. [Google Scholar]
- Atienza, R. Vision transformer for fast and efficient scene text recognition. In Proceedings of the 2021 International Conference on Document Analysis and Recognition; Springer: Berlin/Heidelberg, Germany, 2021; pp. 319–334. [Google Scholar]
- Li, M.; Lv, T.; Chen, J.; Cui, L.; Lu, Y.; Florencio, D.; Zhang, C.; Li, Z.; Wei, F. Trocr: Transformer-based optical character recognition with pre-trained models. In Proceedings of the 2023 AAAI Conference on Artificial Intelligence; AAAI Press: Washington, DC, USA, 2023; pp. 13094–13102. [Google Scholar]
- Amin, A.; Mumtaz, R.; Bashir, M.J.; Zaidi, S.M.H. Next-generation license plate detection and recognition system using yolov8. In Proceedings of the 2023 IEEE 20th International Conference on Smart Communities: Improving Quality of Life using AI, Robotics and IoT (HONET); IEEE: New York, NY, USA, 2023; pp. 179–184. [Google Scholar] [CrossRef]
- Vargoorani, Z.E.; Ghoreyshi, A.M.; Suen, C.Y. Efficient License Plate Recognition via Pseudo-Labeled Supervision with Grounding DINO and YOLOv8. In Proceedings of the 2025 IEEE 35th International Workshop on Machine Learning for Signal Processing (MLSP); IEEE: New York, NY, USA, 2025; pp. 1–6. [Google Scholar] [CrossRef]
- Shafiezadeh, B.; Mashmool, A.; Eshghi, F.; Kelarestaghi, M. A new Hybrid Model of Generative Adversarial Network and You Only Look Once Algorithm for Automatic License-Plate Recognition. arXiv 2025, arXiv:2509.06868. [Google Scholar] [CrossRef]
- Viswanathan, K.; Goel, V.; Gholap, S.; Ghosh, D.; Gupta, M.; Ganatra, D.; Potdar, S.; Sethi, A. FANVID: A Benchmark for Face and License Plate Recognition in Low-Resolution Videos. arXiv 2025, arXiv:2506.07304. [Google Scholar]
- Vargoorani, Z.E.; Suen, C.Y. License plate detection and character recognition using deep learning and font evaluation. In Proceedings of the 2024 IAPR Workshop on Artificial Neural Networks in Pattern Recognition; Springer: Berlin/Heidelberg, Germany, 2024; pp. 231–242. [Google Scholar] [CrossRef]
- Rahman, M.J.; Beauchemin, S.S.; Bauer, M.A. License plate detection and recognition: An empirical study. In Proceedings of the 2019 Science and Information Conference; Springer: Cham, Switzerland, 2019; pp. 339–349. [Google Scholar]
- Moussaoui, H.; Akkad, N.E.; Benslimane, M.; El-Shafai, W.; Baihan, A.; Hewage, C.; Rathore, R.S. Enhancing automated vehicle identification by integrating YOLO v8 and OCR techniques for high-precision license plate detection and recognition. Sci. Rep. 2024, 14, 14389. [Google Scholar] [CrossRef]
- Ismagilov, T.; Ferrarini, B.; Milford, M.; Tuyen, N.T.V.; Ramchurn, S.D.; Ehsan, S. On motion blur and deblurring in visual place recognition. IEEE Robot. Autom. Lett. 2025, 10, 4746–4753. [Google Scholar] [CrossRef]








| References | Main Objective | Methods |
|---|---|---|
| Rahmani et al., 2022 [20] | Summary of Research on Detection Enhancement and Recognition | YOLOv11 Positioning + CLAHE Enhancement + YOLOv11 Recognition |
| Zhu et al., 2026 [28] | Solve the problem of license plate detection in monitoring scenarios | Optimize the C2f, SPPF, detection head and loss function of YOLOv8n |
| Wang et al., 2024 [6] | Improve the license plate recognition accuracy in complex scenarios | Optimize YOLOv5s and LPRNet, and add attention and correction modules |
| Li, 2025 [35] | Improve the overall security of multi-image steganography | Multi-scale texture evaluation + image enhancement + adversarial embedding |
| Xu et al., 2025 [31] | Solve the problems of blurry laser stripe images and reduced measurement accuracy caused by vibration | Improved DeblurGAN: Incorporate HDC, RRDB, Skip Connections and L1 Loss |
| Joshi et al., 2025 [36] | Achieve high-precision and robust automatic license plate recognition | YOLOv8 Detection + Image Preprocessing + Character Segmentation + OCR |
| Amin et al., 2023 [43] | Real-time high-precision recognition of complex Korean license plates at the edge terminal | Lightweight Model (SSD-lite/YOLOv7-tiny) + Model Compression |
| Qin et al., 2018 [32] | Build a High-precision and Secure License Plate Recognition System with AI and PyTorch | Implement the entire process of image detection and LPR based on PyTorch, including collection, preprocessing, positioning, segmentation and recognition |
| Model | P/% | R/% | F1/% | mAP50/% | mAP50-95/% | Params (m) | GFLOPs |
|---|---|---|---|---|---|---|---|
| YOLOv5s | 99.64 | 1 | 99.82 | 99.50 | 96.22 | 7.01 | 15.8 |
| YOLOv8s | 99.82 | 99.83 | 99.82 | 99.50 | 97.27 | 11.13 | 28.4 |
| YOLOv10s | 99.67 | 99.55 | 99.61 | 99.50 | 96.62 | 7.22 | 21.4 |
| YOLOv12s | 99.99 | 1 | 99.99 | 99.50 | 96.78 | 9.11 | 19.3 |
| RT-DETR | 98.11 | 96.73 | 97.42 | 98.88 | 96.79 | 42.7 | 178.32 |
| Model | P/% | R/% | mAP50/% | mAP50-95/% | Latency (ms) | FPS |
|---|---|---|---|---|---|---|
| YOLOv5s | 99.9 | 1 | 99.50 | 98 | 6.5 | 154 |
| YOLOv8s | 99.9 | 1 | 99.50 | 98.2 | 9.1 | 110 |
| YOLOv10s | 99.6 | 99.9 | 99.50 | 98 | 8.5 | 118 |
| YOLOv12s | 99.9 | 1 | 99.50 | 97.8 | 4.8 | 208 |
| Model | P/% | R/% | mAP50/% | mAP50-95/% | Latency (ms) | FPS |
|---|---|---|---|---|---|---|
| YOLOv5s | 1 | 1 | 99.50 | 97.7 | 6.6 | 152 |
| YOLOv8s | 1 | 1 | 99.50 | 98.1 | 9.6 | 104 |
| YOLOv10s | 99.6 | 99.7 | 99.50 | 97.3 | 8.9 | 112 |
| YOLOv12s | 1 | 1 | 99.50 | 97.3 | 5.3 | 189 |
| Model | P/% | R/% | mAP50/% | mAP50-95/% | Latency (ms) | FPS |
|---|---|---|---|---|---|---|
| YOLOv5s | 1 | 1 | 99.50 | 98.2 | 6.8 | 147 |
| YOLOv8s | 1 | 1 | 99.50 | 98.5 | 9.2 | 109 |
| YOLOv10s | 99.8 | 99.9 | 99.50 | 98 | 8.7 | 115 |
| YOLOv12s | 1 | 1 | 99.50 | 97.8 | 4.9 | 204 |
| Indicator | Poor | Medium | Good | Excellent |
|---|---|---|---|---|
| PSNR (dB) | <15 | 15~20 | 20~25 | >25 |
| SSIM | <0.7 | 0.7~0.8 | 0.8~0.9 | >0.9 |
| LPIPS | >0.3 | 0.2~0.3 | 0.1~0.2 | <0.1 |
| Indicator | Test Results | Corresponding Level |
|---|---|---|
| PSNR (dB) | 16.61 dB | Medium |
| SSIM | 0.8776 | Good |
| LPIPS | 0.1151 | Good |
| Model Configuration | GAN | Mamba-SSM | Acc (%) | Params (k) |
|---|---|---|---|---|
| CRNN | × | × | 91.00 ± 0.51 | 638 |
| CRNN | √ | × | 91.33 ± 0.48 | -- |
| LPRnet | × | × | 88.50 ± 0.55 | 486 |
| LPRnet | √ | × | 88.83 ± 0.53 | -- |
| Ours (CMN) | × | √ | 93.30 ± 0.42 | 718 |
| Ours (Full Model) | √ | √ | 93.67 ± 0.41 | -- |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Tang, F.; Chen, L.; Zeng, L.; Nie, Y.; Yang, J. Urban Intelligent Transportation-Oriented License Plate Recognition Model for Severe Environments Based on Hybrid Architecture of YOLOv12, GAN and Mamba-SSM. Urban Sci. 2026, 10, 325. https://doi.org/10.3390/urbansci10060325
Tang F, Chen L, Zeng L, Nie Y, Yang J. Urban Intelligent Transportation-Oriented License Plate Recognition Model for Severe Environments Based on Hybrid Architecture of YOLOv12, GAN and Mamba-SSM. Urban Science. 2026; 10(6):325. https://doi.org/10.3390/urbansci10060325
Chicago/Turabian StyleTang, Feng, Lei Chen, Lingxuan Zeng, Yaqin Nie, and Jian Yang. 2026. "Urban Intelligent Transportation-Oriented License Plate Recognition Model for Severe Environments Based on Hybrid Architecture of YOLOv12, GAN and Mamba-SSM" Urban Science 10, no. 6: 325. https://doi.org/10.3390/urbansci10060325
APA StyleTang, F., Chen, L., Zeng, L., Nie, Y., & Yang, J. (2026). Urban Intelligent Transportation-Oriented License Plate Recognition Model for Severe Environments Based on Hybrid Architecture of YOLOv12, GAN and Mamba-SSM. Urban Science, 10(6), 325. https://doi.org/10.3390/urbansci10060325
