Scale-Invariant Multidirectional License Plate Detection with the Network Combining Indirect and Direct Branches
Abstract
:1. Introduction
- We propose a novel network that combines indirect and direct branches for license plate detection in the wild. The indirect detection branch utilizes vehicle–plate relation and can precisely locate the license plate in a coarse-to-fine scheme. The direct detection branch localizes the license plate in the input image directly, reducing false negatives in the indirect detection branch due to the miss of vehicles’ detection.
- We propose to detect the multidirectional license plate by localizing the four corners of the license plate. This universal detection module can be easily integrated into standard detection networks.
- Notably, the whole model is constructed in an end-to-end trainable manner. By utilizing the post-processing operations, such as NMS, the final detection results are obtained by merging the indirect and direct branches. Hence, the whole model benefits from joint learning of all tasks. To our knowledge, our model is the first one that combines indirect and direct methods into an end-to-end network for license plate detection.
2. Related Work
2.1. Direct License Plate Detection
2.2. Indirect License Plate Detection
2.3. Multidirectional License Plate Detection
3. Materials and Methods
3.1. Overall Architecture
3.2. Indirect Detection Branch
3.2.1. Approximate License Plate Detection (ALPD)
3.2.2. Local Region Estimation and Aggregation (LREA)
3.2.3. Multidirectional License Plate Refinement (MLPR)
3.3. Direct Detection Branch
3.4. End-to-End Trainable Detection Network
3.5. Post Processing
4. Results
4.1. Datasets
4.2. Evaluation Protocols
4.3. Ablation Study
4.4. Evaluation of Horizontal Bounding Box
4.5. Evaluation of Multidirectional License Plate
4.6. Evaluation of Small-Sized License Plate
4.7. Qualitative Results
5. Discussions
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
LP | License plate |
LPD | License plate detection |
CRF | Conditional random field |
RPN | Region proposal network |
STN | Spatial transformer networks |
IoU | Intersection over union |
RoI | Region of interest |
SSD | Single shot multibox detector |
YOLO | You only look once |
VGG | Visual geometry group |
NMS | Non-maximum suppression |
DLPD | Direct license plate detection |
ALPD | Approximate license plate detection |
LREA | Local region estimation and aggregation |
MLPR | Multidirectional license plate refinement |
FC | Four corners |
References
- Tian, Y.; Song, J.; Zhang, X.; Shen, P.; Zhang, L.; Gong, W.; Wei, W.; Zhu, G. An algorithm combined with color differential models for license-plate location. Neurocomputing 2016, 212, 22–35. [Google Scholar] [CrossRef]
- Zhou, W.; Li, H.; Lu, Y.; Tian, Q. Principal visual word discovery for automatic license plate detection. IEEE Trans. Image Process. 2012, 21, 4269–4279. [Google Scholar] [CrossRef] [PubMed]
- Li, B.; Tian, B.; Li, Y.; Wen, D. Component-based license plate detection using conditional random field Model. IEEE Trans. Intell. Transp. Syst. 2013, 14, 1690–1699. [Google Scholar] [CrossRef]
- Al-Ghaili, A.M.; Mashohor, S.; Ramli, A.R.; Ismail, A. Vertical-edge-based car-license-plate detection method. IEEE Trans. Veh. Technol. 2013, 62, 26–38. [Google Scholar] [CrossRef] [Green Version]
- Han, J.; Yao, J.; Zhao, J.; Tu, J.; Liu, Y. Multi-Oriented and Scale-Invariant License Plate Detection Based on Convolutional Neural Networks. Sensors 2019, 19, 1175. [Google Scholar] [CrossRef] [Green Version]
- Yuan, Y.; Zou, W.; Zhao, Y.; Wang, X.; Hu, X.; Komodakis, N. A robust and efficient approach to license plate detection. IEEE Trans. Image Process. 2017, 26, 1102–1114. [Google Scholar] [CrossRef]
- Al-qudah, R.; Suen, C.Y. Enhancing YOLO deep networks for the detection of license plates in complex scenes. In Proceedings of the Second International Conference on Data Science, E-Learning and Information Systems (DATA), Dubai, United Arab Emirates, 2–5 December 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Pu, D.; Gu, N.; Zhang, X. A robust and real-time approach for license plate detection. In Proceedings of the International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), Huangshan, China, 28–30 July 2018; pp. 1–7. [Google Scholar] [CrossRef]
- Chen, S.; Yang, C.; Ma, J.; Chen, F.; Yin, X. Simultaneous end-to-end vehicle and license plate detection with multi-branch attention neural network. IEEE Trans. Intell. Transp. Syst. 2020, 21, 3686–3695. [Google Scholar] [CrossRef]
- Xu, Z.; Yang, W.; Meng, A.; Lu, N.; Huang, H.; Ying, C.; Huang, L. Towards end-to-end license plate detection and recognition: A large dataset and baseline. In Proceedings of the 15th European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 261–277. [Google Scholar] [CrossRef]
- Li, H.; Wang, P.; Shen, C. Towards end-to-end car license plates detection and recognition with deep neural networks. IEEE Trans. Intell. Transp. Syst. 2019, 20, 1126–1136. [Google Scholar] [CrossRef]
- SG, K.; HG, J.; HI, K. Deep-learning-based license plate detection method using vehicle region extraction. Electron. Lett. 2017, 53, 1034–1036. [Google Scholar]
- Fu, Q.; Shen, Y.; Guo, Z. License plate detection using deep cascaded convolutional neural networks in complex scenes. In Proceedings of the 24th International Conference on Neural Information Processing (ICONIP), Guangzhou, China, 14–18 November 2017; pp. 696–706. [Google Scholar] [CrossRef]
- Silva, S.M.; Jung, C.R. License plate detection and recognition in unconstrained scenarios. In Proceedings of the 15th European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 593–609. [Google Scholar] [CrossRef]
- Yonetsu, S.; Iwamoto, Y.; Chen, Y. Two-stage YOLOv2 for accurate license-plate detection in complex scenes. In Proceedings of the IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 11–13 January 2019; pp. 1–4. [Google Scholar] [CrossRef]
- Laroca, R.; Severo, E.; Zanlorensi, L.A.; Oliveira, L.S.; Gonçalves, G.R.; Schwartz, W.R.; Menotti, D. A robust real-time automatic license plate recognition based on the YOLO detector. In Proceedings of the International Joint Conference on Neural Network (IJCNN), Rio de Janeiro, Brazil, 8–13 July 2018; pp. 1–10. [Google Scholar] [CrossRef] [Green Version]
- Chen, S.L.; Tian, S.; Ma, J.W.; Liu, Q.; Yang, C.; Chen, F.; Yin, X.C. End-to-end trainable network for degraded license plate detection via vehicle-plate relation mining. arXiv 2020, arXiv:2010.14266v1. [Google Scholar]
- Silva, S.M.; Jung, C.R. Real-time brazilian license plate detection and recognition using deep convolutional neural networks. In Proceedings of the Conference on Graphics, Patterns and Images (SIBGRAPI), Niterói, Brazil, 17–20 October 2017; pp. 55–62. [Google Scholar] [CrossRef]
- Jin, L.; Xian, H.; Bie, J.; Sun, Y.; Hou, H.; Niu, Q. License Plate Recognition Algorithm for Passenger Cars in Chinese Residential Areas. Sensors 2012, 12, 8355–8370. [Google Scholar] [CrossRef]
- Ullah, F.; Anwar, H.; Shahzadi, I.; Rehman, A.U.; Mehmood, S.; Niaz, S.; Awan, K.M.; Khan, A.; Kwak, D. Barrier Access Control Using Sensors Platform and Vehicle License Plate Characters Recognition. Sensors 2019, 19, 3015. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Do Vale Dalarmelina, N.; Teixeira, M.A.; Meneguette, R.I. A Real-Time Automatic Plate Recognition System Based on Optical Character Recognition and Wireless Sensor Networks for ITS. Sensors 2020, 20, 55. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Islam, K.T.; Raj, R.G.; Islam, S.M.S.; Wijewickrema, S.N.R.; Hossain, M.S.; Razmovski, T.; O’Leary, S.J. A Vision-Based Machine Learning Method for Barrier Access Control Using Vehicle License Plate Authentication. Sensors 2020, 20, 3578. [Google Scholar] [CrossRef]
- Dong, M.; He, D.; Luo, C.; Liu, D.; Zeng, W. A CNN-Based Approach for Automatic License Plate Recognition in the Wild. In Proceedings of the British Machine Vision Conference (BMVC), London, UK, 4–7 September 2017. [Google Scholar]
- Chen, S.L. Open Source Code for the Network Combining Indirect and Direct Branches. Github (2020). Available online: https://github.com/chensonglu/LPD-indirect-direct (accessed on 3 February 2021).
- Hsieh, J.; Yu, S.; Chen, Y. Morphology-based license plate detection in images of differently illuminated and oriented cars. J. Electron. Imaging 2002, 11, 507–516. [Google Scholar] [CrossRef]
- Zheng, D.; Zhao, Y.; Wang, J. An efficient method of license plate location. Pattern Recognit. Lett. 2005, 26, 2431–2438. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.B.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS), Montreal, QC, Canada, 7–12 December 2015; pp. 91–99. [Google Scholar]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar] [CrossRef] [Green Version]
- Redmon, J.; Farhadi, A. YOLO9000: Better, faster, stronger. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 6517–6525. [Google Scholar] [CrossRef] [Green Version]
- Xie, L.; Ahmad, T.; Jin, L.; Liu, Y.; Zhang, S. A new CNN-based method for multi-directional car license plate detection. IEEE Trans. Intell. Transp. Syst. 2018, 19, 507–517. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.K.; Girshick, R.B.; Farhadi, A. You Only Look Once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar] [CrossRef] [Green Version]
- Tian, J.; Wang, G.; Liu, J. Semantic region proposals for adaptive license plate detection in open environment. J. Electron. Imaging 2019, 28, 023017. [Google Scholar] [CrossRef]
- Lu, N.; Yang, W.; Meng, A.; Xu, Z.; Huang, H.; Huang, L. Automatic Recognition for Arbitrarily Tilted License Plate. In Proceedings of the 2nd International Conference on Video and Image Processing (ICVIP), Hong Kong, China, 29–31 December 2018; pp. 23–28. [Google Scholar] [CrossRef]
- Jaderberg, M.; Simonyan, K.; Zisserman, A.; Kavukcuoglu, K. Spatial Transformer Networks. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS), Montreal, QC, Canada, 7–12 December 2015; pp. 2017–2025. [Google Scholar]
- Dai, J.; He, K.; Sun, J. Instance-aware semantic segmentation via multi-task network cascades. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 3150–3158. [Google Scholar] [CrossRef] [Green Version]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.E.; Fu, C.Y.; Berg, A.C. SSD: Single shot multiBox detector. In Proceedings of the 14th European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands, 11–14 October 2016; pp. 21–37. [Google Scholar] [CrossRef] [Green Version]
- Girshick, R. Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. In Proceedings of the 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M.; et al. ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vis. 2015, 115, 211–252. [Google Scholar] [CrossRef] [Green Version]
- Glorot, X.; Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS), Chia Laguna Resort, Sardinia, Italy, 13–15 May 2010; pp. 249–256. [Google Scholar]
- Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- Everingham, M.; Gool, L.J.V.; Williams, C.K.I.; Winn, J.M.; Zisserman, A. The Pascal Visual Object Classes (VOC) Challenge. Int. J. Comput. Vis. 2010, 88, 303–338. [Google Scholar] [CrossRef] [Green Version]
- Wu, Y.; Deng, L.; He, W. BwimNet: A Novel Method for Identifying Moving Vehicles Utilizing a Modified Encoder-Decoder Architecture. Sensors 2020, 20, 7170. [Google Scholar] [CrossRef]
- Shieh, J.; Haq, M.A.; Karam, S.; Chondro, P.; Gao, D.; Ruan, S. Continual Learning Strategy in One-Stage Object Detection Framework Based on Experience Replay for Autonomous Driving Vehicle. Sensors 2020, 20, 6777. [Google Scholar] [CrossRef]
- Stuparu, D.; Ciobanu, R.; Dobre, C. Vehicle Detection in Overhead Satellite Images Using a One-Stage Object Detection Model. Sensors 2020, 20, 6485. [Google Scholar] [CrossRef]
- Barbosa, R.C.; Ayub, M.S.; Rosa, R.L.; Rodríguez, D.Z.; Wuttisittikulkij, L. Lightweight PVIDNet: A Priority Vehicles Detection Network Model Based on Deep Learning for Intelligent Traffic Lights. Sensors 2020, 20, 6218. [Google Scholar] [CrossRef]
- Chen, X.; Chang, C.; Yu, C.; Chen, Y. A Real-Time Vehicle Detection System under Various Bad Weather Conditions Based on a Deep Learning Model without Retraining. Sensors 2020, 20, 5731. [Google Scholar] [CrossRef] [PubMed]
- Liao, M.; Shi, B.; Bai, X.; Wang, X.; Liu, W. TextBoxes: A fast text detector with a single deep neural network. In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI), San Francisco, CA, USA, 4–9 February 2017; pp. 4161–4167. [Google Scholar]
- Wang, H.; He, S.; Yu, J.; Wang, L.; Liu, T. Research and Implementation of Vehicle Target Detection and Information Recognition Technology Based on NI myRIO. Sensors 2020, 20, 1765. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Chen, X.; Sui, H.; Fang, J.; Zhou, M.; Wu, C. Multi-Proxy Constraint Loss for Vehicle Re-Identification. Sensors 2020, 20, 5142. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Z.; Liu, H.; Rai, L.; Zhang, S. Vehicle Trajectory Prediction Method Based on License Plate Information Obtained from Video-Imaging Detectors in Urban Road Environment. Sensors 2020, 20, 1258. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Method | LREA | MLPR | DLPD | IoU = 0.5 | IoU = 0.75 | ||
---|---|---|---|---|---|---|---|
TILT720 | TILT1080 | TILT720 | TILT1080 | ||||
ALPD | 76.71% | 77.71% | 26.27% | 35.27% | |||
Indirect | √ | 40.35% | 40.62% | 7.48% | 10.76% | ||
√ | √ | 89.19% | 87.67% | 54.51% | 56.92% | ||
Direct | √ | 86.85% | 86.01% | 47.52% | 53.34% | ||
Two-branch | √ | √ | √ | 89.30% | 87.79% | 56.54% | 57.94% |
Method | IoU = 0.5 | IoU = 0.75 | ||
---|---|---|---|---|
TILT720 | TILT1080 | TILT720 | TILT1080 | |
Faster R-CNN [27] | 81.65% | 73.88% | 13.63% | 14.29% |
TextBoxes [48] | 69.67% | 67.56% | 37.24% | 38.66% |
Method [14] | 74.67% | 64.78% | 42.67% | 38.61% |
Method [9] | 84.05% | 82.05% | 45.35% | 53.42% |
YOLOv2 [29] | 80.80% | 79.58% | 51.66% | 49.32% |
SSD [36] | 86.63% | 86.34% | 47.06% | 53.88% |
Method [17] | 89.19% | 87.67% | 54.51% | 56.92% |
Ours (Direct) | 86.85% | 86.01% | 47.52% | 53.34% |
Ours (Indirect) | 89.13% | 87.11% | 54.48% | 56.96% |
Ours (Two-branch) | 89.30% | 87.79% | 56.54% | 57.94% |
Method | TILT720 (IoU = 0.5/0.75) | TILT1080 (IoU = 0.5/0.75) | ||||
---|---|---|---|---|---|---|
Precision | Recall | -Score | Precision | Recall | -Score | |
SSD [36] | 98.66/65.10 | 58.80/38.80 | 73.68/48.62 | 93.88/75.92 | 40.38/30.07 | 56.47/43.08 |
Method [14] | 88.79/53.27 | 76.00/45.60 | 81.90/49.14 | 83.53/55.08 | 68.97/45.48 | 75.55/49.83 |
SSD+FC | 97.47/75.32 | 61.60/47.60 | 75.49/58.33 | 97.57/84.67 | 42.61/36.98 | 59.32/51.48 |
Method [17] | 90.61/60.41 | 88.80/59.20 | 89.70/59.80 | 88.17/61.51 | 87.89/61.32 | 88.03/61.42 |
Ours (Direct) | 98.69/82.31 | 60.40/48.40 | 74.94/60.96 | 96.96/85.95 | 44.00/39.00 | 60.53/53.66 |
Ours (Indirect) | 88.93/60.87 | 90.00/61.60 | 89.46/61.23 | 88.72/61.65 | 87.78/61.00 | 88.25/61.32 |
Ours (Two-branch) | 89.68/61.90 | 90.40/62.40 | 90.04/62.15 | 87.85/62.09 | 89.16/63.02 | 88.50/62.55 |
Method | TILT720 (IoU = 0.5/0.75) | TILT1080 (IoU = 0.5/0.75) | ||||
---|---|---|---|---|---|---|
Large | Medium | Small | Large | Medium | Small | |
SSD [36] | 88.46/69.23 | 74.42/56.59 | 29.47/6.32 | 76.43/59.24 | 45.28/34.45 | 10.87/5.43 |
Method [14] | 92.31/88.46 | 85.27/62.02 | 58.95/11.58 | 88.54/82.80 | 78.74/54.33 | 39.86/7.97 |
SSD+FC | 96.15/92.31 | 77.52/65.12 | 30.53/11.58 | 80.25/73.25 | 47.83/42.13 | 11.59/6.88 |
Method [17] | 96.15/88.46 | 98.45/78.29 | 73.68/25.26 | 99.36/86.62 | 96.65/77.17 | 65.22/17.75 |
Ours (Direct) | 96.15/92.31 | 79.07/67.44 | 25.26/10.53 | 82.17/76.43 | 50.20/45.47 | 10.87/5.80 |
Ours (Indirect) | 96.15/88.46 | 98.45/79.07 | 76.84/30.53 | 99.36/84.71 | 96.65/77.17 | 64.86/17.75 |
Ours (Two-branch) | 96.15/92.31 | 98.45/80.62 | 77.89/29.47 | 100.00/86.62 | 98.43/80.12 | 65.94/18.12 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, S.-L.; Liu, Q.; Ma, J.-W.; Yang, C. Scale-Invariant Multidirectional License Plate Detection with the Network Combining Indirect and Direct Branches. Sensors 2021, 21, 1074. https://doi.org/10.3390/s21041074
Chen S-L, Liu Q, Ma J-W, Yang C. Scale-Invariant Multidirectional License Plate Detection with the Network Combining Indirect and Direct Branches. Sensors. 2021; 21(4):1074. https://doi.org/10.3390/s21041074
Chicago/Turabian StyleChen, Song-Lu, Qi Liu, Jia-Wei Ma, and Chun Yang. 2021. "Scale-Invariant Multidirectional License Plate Detection with the Network Combining Indirect and Direct Branches" Sensors 21, no. 4: 1074. https://doi.org/10.3390/s21041074
APA StyleChen, S.-L., Liu, Q., Ma, J.-W., & Yang, C. (2021). Scale-Invariant Multidirectional License Plate Detection with the Network Combining Indirect and Direct Branches. Sensors, 21(4), 1074. https://doi.org/10.3390/s21041074