MDPI - Publisher of Open Access Journals

26 pages, 4334 KB

Open AccessArticle

RKF-YOLO: A Lightweight Dual-Task Model for Illegal Parking Detection and License Plate Recognition on Edge Devices

by Hao Chen, Yao Li, Yong Jia, Guangle Yao and Ruipeng Zhu

Electronics 2026, 15(12), 2638; https://doi.org/10.3390/electronics15122638 (registering DOI) - 15 Jun 2026

To address the joint requirements of illegal parking detection and license plate recognition under complex traffic scenarios and limited edge-device resources, this study proposes RKF-YOLO, a lightweight dual-task model based on improved YOLOv11n that integrates Rep-CSP structural optimization, knowledge-transfer-enhanced training (KTET), and Focal-CIoU [...] Read more.

To address the joint requirements of illegal parking detection and license plate recognition under complex traffic scenarios and limited edge-device resources, this study proposes RKF-YOLO, a lightweight dual-task model based on improved YOLOv11n that integrates Rep-CSP structural optimization, knowledge-transfer-enhanced training (KTET), and Focal-CIoU loss. Compared with YOLOv11n, RKF-YOLO reduces parameters and FLOPs by 38.2% and 38.1%, respectively, while improving

mAP @ 0.5

and

mAP @ 0.5 : 0.95

by 0.6 and 1.1 percentage points for parking detection; for plate detection, Focal-CIoU improves

mAP @ 0.5 : 0.95

by 1.3 percentage points and contributes to a recognition accuracy of 95.7%. The unified framework uses a shared backbone and task-oriented detection heads to support vehicle-level illegal parking detection and license-plate-oriented localization. Rep-CSP enhances multi-scale feature representation, asymmetric channel reduction with feature compensation reduces redundant computation, and KTET improves convergence through optimizer and learning-rate migration. Deployment on RK3588 achieves 59.5 FPS for parking detection and 95.1% recognition accuracy, demonstrating real-time performance and practical applicability on resource-constrained edge devices. Full article

(This article belongs to the Topic Advanced Artificial Intelligence Solutions for Modern Engineering Applications)

► Show Figures

Figure 1

25 pages, 18029 KB

Open AccessArticle

Urban Intelligent Transportation-Oriented License Plate Recognition Model for Severe Environments Based on Hybrid Architecture of YOLOv12, GAN and Mamba-SSM

by Feng Tang, Lei Chen, Lingxuan Zeng, Yaqin Nie and Jian Yang

Urban Sci. 2026, 10(6), 325; https://doi.org/10.3390/urbansci10060325 - 11 Jun 2026

Viewed by 151

Abstract

Adverse weather and low-illumination conditions in urban road scenarios substantially degrade license plate image quality, posing a major challenge to robust automatic license plate recognition for urban intelligent transportation systems and smart city construction. To address the limitations of conventional pipelines that optimize [...] Read more.

Adverse weather and low-illumination conditions in urban road scenarios substantially degrade license plate image quality, posing a major challenge to robust automatic license plate recognition for urban intelligent transportation systems and smart city construction. To address the limitations of conventional pipelines that optimize detection, enhancement, and recognition in isolation, this study proposes CLEI, a unified framework integrating YOLOv12-based detection, GAN-based image enhancement, and a novel CNN–Mamba network (CMN) for character recognition. Using a curated dataset of 3000 license plate images captured under rain, snow, fog, and nighttime urban roadside conditions, we first benchmarked several mainstream detectors and identified YOLOv12s as the most effective model in terms of accuracy, inference speed, and computational efficiency. To mitigate blur and low-quality degradation in cropped plate regions, DeblurGAN-v2 was employed for adaptive enhancement, achieving PSNR of 16.61 dB, SSIM of 0.8776, and LPIPS of 0.1151. For recognition, the proposed CMN replaces the recurrent module in CRNN with a Mamba-based state-space model, improving sequence modeling efficiency and robustness. CMN achieved 93.3% plate accuracy, outperforming CRNN (91.0%) and LPRNet (88.5%), while the full CLEI framework reached 93.67% accuracy after enhancement. These results demonstrate that collaborative optimization across detection, restoration, and recognition enables accurate and efficient license plate recognition in severely degraded urban traffic environments, providing a reliable technical support for urban traffic monitoring, public security governance and smart city infrastructure construction. Full article

(This article belongs to the Section Intelligent Cities and Technology)

► Show Figures

Figure 1

34 pages, 36077 KB

Open AccessArticle

Modular Multi-Attribute Vehicle Analysis by Color, License Plate, Make and Sub-Model Using YOLO and OCR: A Benchmark Across YOLO Versions

by Cristian Japhet Islas-Yañez, Viridiana Hernández-Herrera and Moisés Márquez-Olivera

Sensors 2026, 26(9), 2785; https://doi.org/10.3390/s26092785 - 29 Apr 2026

Viewed by 1091

Abstract

We present a modular multi-attribute vehicle analysis pipeline that integrates YOLO-based models and an OCR engine into a single workflow. The system detects vehicles, classifies color, recognizes make and sub-model, detects license plates, and extracts plate characters to generate a structured vehicle record. [...] Read more.

We present a modular multi-attribute vehicle analysis pipeline that integrates YOLO-based models and an OCR engine into a single workflow. The system detects vehicles, classifies color, recognizes make and sub-model, detects license plates, and extracts plate characters to generate a structured vehicle record. Vehicle detection is reported with standard metrics (precision, recall, and mAP@0.5), while license plate detection is reported at IoU = 0.3 to reflect the small-object nature of plates and downstream OCR usability. Among the evaluated versions, YOLOv8 provides the most balanced overall performance across modules, while maintaining real-time-equivalent throughput of approximately 18–22 FPS for the full pipeline on recorded traffic videos, depending on scene complexity. We emphasize module-level evaluation and runtime benchmarking; instance-level end-to-end identification across unique vehicles is defined as future work once track-based ground truth becomes available. Full article

(This article belongs to the Topic Deep Visual Recognition: Methods, and Applications)

► Show Figures

Figure 1

23 pages, 5408 KB

Open AccessFeature PaperArticle

Regional Coordinated Traffic Signal Control Based on Improved Chaotic Particle Swarm Optimization

by Ke Ji and Jinjun Tang

Mathematics 2026, 14(8), 1374; https://doi.org/10.3390/math14081374 - 19 Apr 2026

Viewed by 429

Abstract

In urban traffic systems, traditional signal control can no longer meet the increasing traffic demand, and local congestion is severe during peak hours. Fixed detector data is characterized by high deployment density, full sample detection and restorable vehicle paths, providing new data support [...] Read more.

In urban traffic systems, traditional signal control can no longer meet the increasing traffic demand, and local congestion is severe during peak hours. Fixed detector data is characterized by high deployment density, full sample detection and restorable vehicle paths, providing new data support for coordinated signal control. We propose an optimization method for regional coordinated control, with the Changsha road network as the study area. Firstly, based on License Plate Recognition (LPR) data, the road network is divided into sub-networks and combined with the boundary control for regional coordinated control. Then, the critical path is taken as the control object, and the phase coordination rate is introduced as the optimization objective. The particle swarm optimization algorithm improved by the logistic chaotic map is used as the global searcher, and sequential quadratic programming is adopted as the local searcher to solve the optimization strategy for the objective function. Finally, a simulated road network is constructed in VISSIM 6.0 simulation software to verify the effectiveness of the strategy. The results show that the optimization strategy reduces intersection delay and saturation by 20.3% and 19.3% in the critical path coverage area. Road travel time and the average number of vehicle stops are reduced by 21% and 22.1%. This indicates that the regional coordinated control based on the improved particle swarm algorithm can better alleviate the peak hour traffic congestion. Full article

(This article belongs to the Special Issue Application of Mathematical Methods to Transportation: Modeling and Analysis)

► Show Figures

Figure 1

33 pages, 5973 KB

Open AccessArticle

Smart Enforcement of Disability Parking: A Drone-Based License Plate Recognition and Staged Optimization Framework

by Hanaa ZainEldin, Tamer Ahmed Farrag, Shymaa G. Eladl, Malik Almaliki, Mahmoud Badawy and Mostafa A. Elhosseini

Urban Sci. 2026, 10(4), 212; https://doi.org/10.3390/urbansci10040212 - 15 Apr 2026

Viewed by 609

Abstract

Unauthorized occupation of parking spaces designated for individuals with disabilities remains a persistent challenge in urban environments, limiting accessibility and inclusive mobility. This paper proposes an integrated UAV-assisted enforcement framework that combines drone-based imaging, onboard license plate recognition (LPR), IoT connectivity, and a [...] Read more.

Unauthorized occupation of parking spaces designated for individuals with disabilities remains a persistent challenge in urban environments, limiting accessibility and inclusive mobility. This paper proposes an integrated UAV-assisted enforcement framework that combines drone-based imaging, onboard license plate recognition (LPR), IoT connectivity, and a staged optimization strategy for energy-aware surveillance. The framework employs a two-phase approach: first, it derives energy-efficient UAV activation patterns via sleep–active scheduling, followed by coverage maximization under energy constraints. The inherently multi-objective problem—balancing energy consumption, coverage, and redundancy—is addressed via a weighted-aggregation formulation, enabling efficient optimization with classical metaheuristic algorithms. Seven algorithms—Genetic Algorithm (GA), Particle Swarm Optimization (PSO), Simulated Annealing (SA), Ant Colony Optimization (ACO), Differential Evolution (DE), Artificial Bee Colony (ABC), and a Greedy baseline—are implemented in both conventional and staged variants to enable comprehensive evaluation. Experimental results demonstrate 32–45% reductions in energy consumption, over 95% coverage effectiveness, and 50–60% faster convergence compared to single-phase approaches, with all improvements statistically significant (p < 0.001). The proposed framework provides a scalable, practically deployable solution for intelligent enforcement of disability parking regulations while also enabling energy-efficient UAV coordination in smart urban monitoring systems. Full article

► Show Figures

Figure 1

32 pages, 3916 KB

Open AccessArticle

An Automated Detection Method for Motor Vehicles Encroaching on Non-Motorized Lanes Based on Unmanned Aerial Vehicle Imagery and Civilized Behavior Monitoring

by Zichan Tan, Yin Tan, Peijing Lin, Wenjie Su, Tian He and Weishen Wu

Sensors 2026, 26(7), 2027; https://doi.org/10.3390/s26072027 - 24 Mar 2026

Viewed by 476

Abstract

Motor vehicle encroachment into non-motorized lanes is a common but hard-to-verify violation in urban intersections, especially when monitored from unmanned aerial vehicles (UAVs) or high-mounted overhead views. Existing rule-based solutions built on horizontal bounding boxes and center-point/line-crossing criteria are sensitive to perspective distortion, [...] Read more.

Motor vehicle encroachment into non-motorized lanes is a common but hard-to-verify violation in urban intersections, especially when monitored from unmanned aerial vehicles (UAVs) or high-mounted overhead views. Existing rule-based solutions built on horizontal bounding boxes and center-point/line-crossing criteria are sensitive to perspective distortion, occlusion, and frame-to-frame jitter, resulting in unstable decisions and low evidential value. This paper presents a cascaded UAV-view system that closes the loop from perception to evidence output through detection–segmentation–recognition–decision. First, we adopt a two-stage detection cascade: a lightweight vehicle detector localizes vehicles using axis-aligned bounding boxes, and a dedicated YOLOv5n-based oriented bounding box (OBB) license plate detector, constructed via architecture grafting and weight transfer, is then applied within each vehicle region of interest (ROI) to localize rotated license plates under large pose variation and small-target conditions. Second, a U-Net lane region segmentation module provides pixel-level spatial constraints to define an enforceable lane occupancy region. Third, a perspective rectification step is integrated with the PP-OCRv4 optical character recognition (OCR) framework to improve license plate recognition reliability for tilted plates. Finally, an area ratio criterion and an N-frame temporal counter are used to suppress transient misdetections and stabilize alarms. On a representative 100-sample controlled encroachment benchmark, the proposed system improves detection accuracy from 67.0% to 92.0% and reduces the false positive rate from 32.35% to 5.88% compared with a baseline horizontal bounding box (HBB)-based rule. The system outputs both violation alarms and license plate evidence, supporting practical deployment for multi-view traffic governance. Full article

(This article belongs to the Section Vehicular Sensing)

► Show Figures

Figure 1

23 pages, 2867 KB

Open AccessFeature PaperArticle

SDR-Net: A Stage-Wise Degradation-Aware Restoration Network for Robust License Plate Recognition in Complex Port Environments

by Hyungseok Kim, Sungan Yoon and Jeongho Cho

Mathematics 2026, 14(6), 934; https://doi.org/10.3390/math14060934 - 10 Mar 2026

Viewed by 431

Abstract

Port areas are core hubs for national logistics and high-risk security zones that require constant vehicle access control. However, ensuring the reliability of automatic license plate recognition (ALPR) systems in port environments is severely challenged by complex image degradations, such as dense haze, [...] Read more.

Port areas are core hubs for national logistics and high-risk security zones that require constant vehicle access control. However, ensuring the reliability of automatic license plate recognition (ALPR) systems in port environments is severely challenged by complex image degradations, such as dense haze, low light, and motion blur. In this study, we propose a stage-wise degradation-aware restoration network (SDR-Net), which effectively addresses harsh port conditions by sequentially restoring photometric and structural degradations. Particularly, SDR-Net first secures visual cues lost to haze and low light through a photometric restoration module involving a dark-channel-prior-based dehazing and adaptive brightness adjustment. Next, a structural restoration module based on a generative adversarial network featuring edge-guided structural feature blocks and edge-aware refinement blocks is employed to precisely reconstruct character strokes and outlines damaged by motion blur, stably restoring license plate legibility even under complex degradation conditions. Experiments across various intensities of complex degradation demonstrate that SDR-Net maintains high character recognition accuracies of over 97.35% under mild motion blur and low-concentration haze conditions, indicating its superiority over state-of-the-art models. Notably, the performance gap between SDR-Net and comparison models widened as the degradation intensity increased, and SDR-Net achieved the highest multiscale structural similarity index scores across all intervals. Full article

(This article belongs to the Section E1: Mathematics and Computer Science)

► Show Figures

Figure 1

39 pages, 10175 KB

Open AccessArticle

EdgeML-Driven Real-Time Vehicle Tracking and Traffic Control for Traffic Management in Smart Cities

by Hyago V. L. B. Silva, Davi Rosim, Felipe A. P. de Figueiredo, Samuel B. Mafra, Ahmed S. Khwaja and Alagan Anpalagan

Appl. Sci. 2026, 16(5), 2216; https://doi.org/10.3390/app16052216 - 25 Feb 2026

Viewed by 1078

Abstract

The escalating global rates of traffic accidents in urban areas and the growing demands of smart cities underscore the urgent need for advanced real-time monitoring solutions. This paper presents an EdgeML-based system for vehicle tracking that performs real-time speed and distance analysis and [...] Read more.

The escalating global rates of traffic accidents in urban areas and the growing demands of smart cities underscore the urgent need for advanced real-time monitoring solutions. This paper presents an EdgeML-based system for vehicle tracking that performs real-time speed and distance analysis and traffic violation detection. This is achieved by deploying a YOLOv8 object detection model on a Raspberry Pi 5 with a Coral USB Edge TPU accelerator. The system integrates computer vision and IoT technologies to enable real-time processing. It utilizes the Message Queuing Telemetry Transport (MQTT) protocol to allow scalable communication between distributed edge devices and a central MongoDB database, facilitating real-time storage and analysis of traffic data. A synthetic dataset generated via the Blender 3D modeling tool validates the system’s accuracy, demonstrating average speed and distance measurement errors of ±2.11 km/h and ±0.58 m, respectively. These findings are further supported by preliminary practical experiments in a real-world environment, where speed estimation errors remained within 0–2 km/h and distance errors stayed below 0.11 m. Key innovations of this work include license plate recognition, speeding and collision detection, and context analysis using Google’s Gemini-2.5-Flash API. A Streamlit dashboard provides real-time visualization of traffic metrics, violations, and aggregated data. A comparative evaluation of YOLOv5n, YOLOv8n, YOLOv11n, and YOLOv12n identifies YOLOv8n as the most suitable model for embedded deployment, achieving 91.07 ± 0.61% mAP@0.5 without quantization, 88.77 ± 3.31% mAP@0.5 with quantization, while maintaining real-time performance of 30–43 frames per second (FPS) on the Edge TPU. The system’s modular architecture, low latency, and robust performance highlight its suitability for smart city applications, enhancing traffic safety and enabling data-driven urban mobility management. Full article

(This article belongs to the Special Issue Smart Cities: AI-Enhanced Urban Living)

► Show Figures

Figure 1

28 pages, 21245 KB

Open AccessArticle

A Comparative Study of OCR Architectures for Korean License Plate Recognition: CNN–RNN-Based Models and MobileNetV3–Transformer-Based Models

by Seungju Lee and Gooman Park

Sensors 2026, 26(4), 1208; https://doi.org/10.3390/s26041208 - 12 Feb 2026

Viewed by 833

Abstract

This paper presents a systematic comparative study of optical character recognition (OCR) architectures for Korean license plate recognition under identical detection conditions. Although recent automatic license plate recognition (ALPR) systems increasingly adopt Transformer-based decoders, it remains unclear whether performance differences arise primarily from [...] Read more.

This paper presents a systematic comparative study of optical character recognition (OCR) architectures for Korean license plate recognition under identical detection conditions. Although recent automatic license plate recognition (ALPR) systems increasingly adopt Transformer-based decoders, it remains unclear whether performance differences arise primarily from sequence modeling strategies or from backbone feature representations. To address this issue, we employ a unified YOLOv12-based license plate detector and evaluate multiple OCR configurations, including a CNN with an Attention-LSTM decoder and a MobileNetV3 with a Transformer decoder. To ensure a fair comparison, a controlled ablation study is conducted in which the CNN backbone is fixed to ResNet-18 while varying only the sequence decoder. Experiments are performed on both static image datasets and tracking-based sequential datasets, assessing recognition accuracy, error characteristics, and processing speed across GPU and embedded platforms. The results demonstrate that the effectiveness of sequence decoders is highly dataset-dependent and strongly influenced by feature quality and region-of-interest (ROI) stability. Quantitative analysis further shows that tracking-induced error accumulation dominates OCR performance in sequential recognition scenarios. Moreover, Korean license plate–specific error patterns reveal failure modes not captured by generic OCR benchmarks. Finally, experiments on embedded platforms indicate that Transformer-based OCR models introduce significant computational and memory overhead, limiting their suitability for real-time deployment. These findings suggest that robust license plate recognition requires joint consideration of detection, tracking, and recognition rather than isolated optimization of OCR architectures. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

15 pages, 2410 KB

Open AccessArticle

Smart Vision Traffic Surveillance: Vehicle Re-Identification and Tracking Using Vision Transformer

by Muhammad Shoaib Hanif, Zubair Nawaz and Muhammad Kamran Malik

Vehicles 2026, 8(2), 36; https://doi.org/10.3390/vehicles8020036 - 10 Feb 2026

Cited by 1 | Viewed by 912

Abstract

Intelligent transportation systems (ITSs) are crucial for modern traffic management and law enforcement. This paper addresses the challenge of monitoring and managing extensive vehicle traffic in large cities like Lahore, Pakistan. We propose a deep learning based ITS utilizing Vision Transformers combined with [...] Read more.

Intelligent transportation systems (ITSs) are crucial for modern traffic management and law enforcement. This paper addresses the challenge of monitoring and managing extensive vehicle traffic in large cities like Lahore, Pakistan. We propose a deep learning based ITS utilizing Vision Transformers combined with convolutional feature extraction to accurately identify vehicle type, color, make/model, and license plates. Experiments were conducted on a comprehensive dataset collected from multiple checkpoints across Lahore under varying environmental conditions. Our proposed model achieved high accuracy rates: 98.0% for vehicle type classification, 96.0% for color detection, 95.0% for make/model identification, and 89.0% for license plate recognition. These results demonstrate the system’s potential to significantly enhance traffic management and road safety and support law enforcement operations in developing urban environments. Full article

(This article belongs to the Special Issue Intelligent Connected Vehicles)

► Show Figures

Figure 1

20 pages, 2164 KB

Open AccessArticle

Automatic Vehicle Recognition: A Practical Approach with VMMR and VCR

by Andrei Istrate, Madalin-George Boboc, Daniel-Tiberius Hritcu, Florin Rastoceanu, Constantin Grozea and Mihai Enache

AI 2025, 6(12), 329; https://doi.org/10.3390/ai6120329 - 18 Dec 2025

Viewed by 1544

Abstract

Background: Automatic vehicle recognition has recently become an area of great interest, providing substantial support for multiple use cases, including law enforcement and surveillance applications. In real traffic conditions, where for various reasons license plate recognition is impossible or license plates are forged, [...] Read more.

Background: Automatic vehicle recognition has recently become an area of great interest, providing substantial support for multiple use cases, including law enforcement and surveillance applications. In real traffic conditions, where for various reasons license plate recognition is impossible or license plates are forged, alternative solutions are required to support human personnel in identifying vehicles used for illegal activities. In such cases, appearance-based approaches relying on vehicle make and model recognition (VMMR) and vehicle color recognition (VCR) can successfully complement license plate recognition. Methods: This research addresses appearance-based vehicle identification, in which VMMR and VCR rely on inherent visual cues such as body contours, stylistic details, and exterior color. In the first stage, vehicles passing through an intersection are detected, and essential visual characteristics are extracted for the two recognition tasks. The proposed system employs deep learning with semantic segmentation and data augmentation for color recognition, while histogram of oriented gradients (HOG) feature extraction combined with a support vector machine (SVM) classifier is used for make-model recognition. For the VCR task, five different neural network architectures are evaluated to identify the most effective solution. Results: The proposed system achieves an overall accuracy of 94.89% for vehicle make and model recognition. For vehicle color recognition, the best-performing models obtain a Top-1 accuracy of 94.17% and a Top-2 accuracy of 98.41%, demonstrating strong robustness under real-world traffic conditions. Conclusions: The experimental results show that the proposed automatic vehicle recognition system provides an efficient and reliable solution for appearance-based vehicle identification. By combining region-tailored data, segmentation-guided processing, and complementary recognition strategies, the system effectively supports real-world surveillance and law-enforcement scenarios where license plate recognition alone is insufficient. Full article

(This article belongs to the Topic State-of-the-Art Object Detection, Tracking, and Recognition Techniques)

► Show Figures

Figure 1

27 pages, 4420 KB

Open AccessArticle

Real-Time Quarry Truck Monitoring with Deep Learning and License Plate Recognition: Weighbridge Reconciliation for Production Control

by Ibrahima Dia, Bocar Sy, Ousmane Diagne, Sidy Mané and Lamine Diouf

Mining 2025, 5(4), 84; https://doi.org/10.3390/mining5040084 - 14 Dec 2025

Viewed by 1459

Abstract

This paper presents a real-time quarry truck monitoring system that combines deep learning and license plate recognition (LPR) for operational monitoring and weighbridge reconciliation. Rather than estimating load volumes directly from imagery, the system ensures auditable matching between detected trucks and official weight [...] Read more.

This paper presents a real-time quarry truck monitoring system that combines deep learning and license plate recognition (LPR) for operational monitoring and weighbridge reconciliation. Rather than estimating load volumes directly from imagery, the system ensures auditable matching between detected trucks and official weight records. Deployed at quarry checkpoints, fixed cameras stream to an edge stack that performs truck detection, line-crossing counts, and per-frame plate Optical Character Recognition (OCR); a temporal voting and format-constrained post-processing step consolidates plate strings for registry matching. The system exposes a dashboard with auditable session bundles (model/version hashes, Region of Interest (ROI)/line geometry, thresholds, logs) to ensure replay and traceability between offline evaluation and live operations. We evaluate detection (precision, recall, mAP@0.5, and mAP@0.5:0.95), tracking (ID metrics), and (LPR) usability, and we quantify operational validity by reconciling estimated shift-level tonnage T against weighbridge tonnage T* using Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), R², and Bland–Altman analysis. Results show stable convergence of the detection models, reliable plate usability under varied optics (day, dusk, night, and dust), low-latency processing suitable for commodity hardware, and close agreement with weighbridge references at the shift level. The study demonstrates that vision-based counting coupled with plate linkage can provide regulator-ready KPIs and auditable evidence for production control in quarry operations. Full article

(This article belongs to the Special Issue Mine Management Optimization in the Era of AI and Advanced Analytics)

► Show Figures

Graphical abstract

31 pages, 9303 KB

Open AccessArticle

Automatic Quadrotor Dispatch Missions Based on Air-Writing Gesture Recognition

by Pu-Sheng Tsai, Ter-Feng Wu and Yen-Chun Wang

Processes 2025, 13(12), 3984; https://doi.org/10.3390/pr13123984 - 9 Dec 2025

Cited by 1 | Viewed by 860

Abstract

This study develops an automatic dispatch system for quadrotor UAVs that integrates air-writing gesture recognition with a graphical user interface (GUI). The DJI RoboMaster quadrotor UAV (DJI, Shenzhen, China) was employed as the experimental platform, combined with an ESP32 microcontroller (Espressif Systems, Shanghai, [...] Read more.

This study develops an automatic dispatch system for quadrotor UAVs that integrates air-writing gesture recognition with a graphical user interface (GUI). The DJI RoboMaster quadrotor UAV (DJI, Shenzhen, China) was employed as the experimental platform, combined with an ESP32 microcontroller (Espressif Systems, Shanghai, China) and the RoboMaster SDK (version 3.0). On the Python (version 3.12.7) platform, a GUI was implemented using Tkinter (version 8.6), allowing users to input addresses or landmarks, which were then automatically converted into geographic coordinates and imported into Google Maps for route planning. The generated flight commands were transmitted to the UAV via a UDP socket, enabling remote autonomous flight. For gesture recognition, a Raspberry Pi integrated with the MediaPipe Hands module was used to capture 16 types of air-written flight commands in real time through a camera. The training samples were categorized into one-dimensional coordinates and two-dimensional images. In the one-dimensional case, X/Y axis coordinates were concatenated after data augmentation, interpolation, and normalization. In the two-dimensional case, three types of images were generated, namely font trajectory plots (T-plots), coordinate-axis plots (XY-plots), and composite plots combining the two (XYT-plots). To evaluate classification performance, several machine learning and deep learning architectures were employed, including a multi-layer perceptron (MLP), support vector machine (SVM), one-dimensional convolutional neural network (1D-CNN), and two-dimensional convolutional neural network (2D-CNN). The results demonstrated effective recognition accuracy across different models and sample formats, verifying the feasibility of the proposed air-writing trajectory framework for non-contact gesture-based UAV control. Furthermore, by combining gesture recognition with a GUI-based map planning interface, the system enhances the intuitiveness and convenience of UAV operation. Future extensions, such as incorporating aerial image object recognition, could extend the framework’s applications to scenarios including forest disaster management, vehicle license plate recognition, and air pollution monitoring. Full article

(This article belongs to the Special Issue Artificial Intelligence-Based Analytics for Data-Driven Decision-Making in Industrial Process Engineering)

► Show Figures

Figure 1

21 pages, 34323 KB

Open AccessArticle

Ship-RT-DETR: An Improved Model for Ship Plate Detection and Identification

by Chang Qin, Xiaoyu Ji, Zhiyi Mo and Jinming Mo

J. Mar. Sci. Eng. 2025, 13(11), 2205; https://doi.org/10.3390/jmse13112205 - 19 Nov 2025

Cited by 1 | Viewed by 963

Abstract

Ship License Plate Recognition (SLPR) technology serves as a fundamental technological foundation for maritime transportation management. Automated ship identification enhances both regulatory oversight and operational efficiency. However, current recognition models demonstrate significant limitations, including their inability to detect objects in complex environments and [...] Read more.

Ship License Plate Recognition (SLPR) technology serves as a fundamental technological foundation for maritime transportation management. Automated ship identification enhances both regulatory oversight and operational efficiency. However, current recognition models demonstrate significant limitations, including their inability to detect objects in complex environments and challenges in maintaining real-time performance while ensuring accuracy, thereby limiting their practical applicability. This study proposes a novel cascaded framework that integrates RT-DETR-based detection with OCR capabilities. The framework incorporates several key methodological innovations: optimizing the RT-DETR backbone through efficient partial convolutions during training to improve computational efficiency; implementing Conv3XC to modify the ResNet18-backbone BasicBlock using a triple convolutional layer configuration with an enhanced RepC3 kernel design for better feature extraction; and integrating learned position encoding (LPE) to improve the AIFI position encoding mechanism, thereby enhancing detection capabilities. After region detection, PP-OCRv3 is used for character recognition. Experimental results demonstrate the superior performance of our approach: Ship-RT-DETR achieves 96.2% detection accuracy with a 28.5% reduction in parameters and 67.3 FPS, while PP-OCRv3 achieves 91.6% recognition accuracy. Extensive environmental validation across diverse weather conditions (sunny, cloudy, rainy, and foggy) confirms the framework’s robustness, maintaining a detection accuracy above 90% even in challenging foggy conditions, with minimal performance degradation (a 7.7% decrease from optimal conditions). The system’s consistent performance across various environmental conditions (detection standard deviation: 2.84%, OCR confidence standard deviation: 0.0295) establishes a novel and robust methodology for practical SLPR applications. Full article

(This article belongs to the Section Ocean Engineering)

► Show Figures

Figure 1

23 pages, 32401 KB

Open AccessArticle

An Integrated Rule-Based and Deep Learning Method for Automobile License Plate Image Generation with Enhanced Geometric and Radiometric Details

by Yuanrui Dong, Zhe Peng, Wende Liu and Haiyong Gan

Appl. Sci. 2025, 15(22), 11990; https://doi.org/10.3390/app152211990 - 12 Nov 2025

Viewed by 931

Abstract

Automobile license plate image generation represents a pivotal technology for the development of intelligent transportation systems. However, existing methods are constrained by their inability to simultaneously preserve geometric structure and radiometric properties of both license plates and characters. To overcome this limitation, we [...] Read more.

Automobile license plate image generation represents a pivotal technology for the development of intelligent transportation systems. However, existing methods are constrained by their inability to simultaneously preserve geometric structure and radiometric properties of both license plates and characters. To overcome this limitation, we propose a novel framework for generating geometrically and radiometrically consistent license plate images. The proposed radiometric enhancement framework integrates two specialized modules, which are precise geometric rectification and radiometric property learning. The precise geometric rectification module exploits the perspective transformation consistency between character regions and license plate boundaries. By employing a feature matching algorithm based on character endpoint correspondence, this module achieves precise plate rectification, thereby establishing a geometric foundation for maintaining character structural integrity in generated images. The radiometric property learning module implements a precise character inpainting strategy with fluctuation compensation inpainting to reconstruct background regions, followed by a character-wise style transfer approach to ensure both geometric and radiometric consistency with realistic automobile license plates. Furthermore, we introduce a physical validation and evaluation method to quantitatively assess image quality. Comprehensive evaluation on real-world datasets demonstrate that our method achieves superior performance, with a peak signal-to-noise ratio (PSNR) of 13.83 dB and a structural similarity index measure (SSIM) of 0.57, representing significant improvements over comparative methods in preserving both structural integrity and radiometric properties. This framework effectively enhances the visual fidelity and reliability of generated automobile license plate images, thereby providing high-quality data for intelligent transportation recognition systems while advancing license plate image generation technology. Full article

► Show Figures

Figure 1

Search Results (122)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (122)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI