1. Introduction
Seat belt compliance remains a critical public safety concern, with road traffic fatalities continuing to claim thousands of lives annually. In 2022 alone, over 25,000 passenger vehicle occupants died in crashes, nearly half of whom were unrestrained at the time of impact [1]. The risks are even more pronounced in nighttime crashes, where 57% of fatalities involved individuals not wearing seat belts [2]. Research consistently demonstrates that seat belt use significantly reduces the risk of fatal injuries [3,4], by up to 50% for front seat passengers and 25% for rear seat occupants [5]. In 2017, an estimated 2549 lives could have been saved had all occupants properly worn seat belts [1]. Despite the well-documented life-saving potential of seat belts, noncompliance persists due to behavioral resistance, enforcement challenges, and limited public awareness [6].
Current seat belt compliance monitoring relies primarily on manual observational surveys conducted by transportation agencies [7]. While these surveys provide valuable data, they have inherent limitations. Geographic coverage is restricted, as surveyors typically monitor only one lane and direction per site, which may introduce sampling bias and limit the representativeness of the data. Visibility constraints—such as tinted windows, low lighting, and obstructions—can make it challenging to accurately assess compliance, potentially affecting the reliability of reported rates. Additionally, surveyors may need to reposition themselves along routes or use vantage points like overpasses and exit ramps to improve visibility [8], which can introduce variability in observation conditions. These limitations highlight the need for more scalable and efficient approaches to seat belt compliance monitoring to ensure reliable and comprehensive data collection.
Beyond observational surveys, various interventions and advancements in vehicle occupant safety technology have aimed to promote seat belt use, including automated warning systems, mass media campaigns, and legislative enforcement [9,10]. Automated seat belt reminders, commonly integrated into modern vehicles, encourage compliance through audible or visual alerts. However, these systems serve only as reminders, leaving the decision to comply solely to the driver’s discretion. Furthermore, they depend on in-vehicle technology that is often absent in older or less advanced vehicle models, leaving many drivers without access to these safety features. Legislative measures, such as primary seat belt laws in 35 U.S. states [11] and NHTSA enforcement initiatives like “Click It or Ticket” [12] and “Buckle Up America” [13], also effectively promote seat belt use through extensive advertising and increased enforcement periods.
Despite these measures, seat belt compliance remains inconsistent, underscoring the need for innovative solutions to address the gaps in existing strategies. Real-time automated seat belt compliance detection systems offer a promising approach [14], particularly when integrated with surveillance technologies. Recognizing the importance of such advancements, the U.S. Department of Transportation (U.S. DOT) has emphasized the development of innovative systems through its Small Business Innovation Research (SBIR) program. A recent SBIR solicitation specifically called for the development of devices capable of automatic seat belt use detection, data collection, and driver feedback, highlighting the urgency of advancing research in this area [15].
The application of computer vision for seat belt detection has gained significant traction in recent years. Early approaches primarily relied on traditional image-processing techniques such as edge detection, salient gradient feature mapping, and color segmentation to identify seat belt presence [4,16]. However, these methods were highly sensitive to variations in lighting conditions, vehicle interiors, and camera angles, limiting their practical applicability. More recent advancements have shifted towards deep learning models, particularly convolutional neural networks (CNNs), which offer significantly improved accuracy and robustness [17]. By extracting hierarchical features from image data, CNN-based models effectively differentiate between buckled and unbuckled seat belts with high precision. For instance, [18] proposed a real-time, single-stage detection model using YOLOv7 and discussed how it compares conceptually to multi-stage approaches such as those incorporating Region Proposal Networks (RPNs). While these models achieved promising results, they primarily relied on datasets of high-resolution images captured inside vehicles, which differ substantially from the perspectives and image quality of road surveillance systems, posing challenges for real-world deployment. Further advancements have incorporated multi-scale feature extraction techniques combined with deep learning. A study by [19] extracted features from key regions of interest such as vehicles, windshields, and seat belts to train CNN models for seat belt detection. By incorporating support vector machines (SVMs) to refine detection scores and spatial relationships, the method improved accuracy and robustness on road surveillance images, highlighting its real-world applicability.
Innovative methods have also explored thermal imaging for seat belt detection, with [20] employing the Fully Convolutional One-Stage (FCOS) detection network on thermal images captured from 5 to 20 m, demonstrating effectiveness across varying lighting conditions. Additionally, research has integrated seat belt detection into intelligent transportation systems (ITS), proposing IoT-enabled vehicle management systems that combine seat belt monitoring with speed control and alcohol detection for comprehensive road safety solutions [21].
Building upon these advancements, this study presents an automated seat belt detection system that leverages a YOLO11 neural network and video footage captured by a tethered UAV to identify and quantify seat belt compliance among drivers at stop-controlled intersections. By using aerial surveillance, this approach addresses several key challenges associated with manual surveys and in-vehicle detection methods. The system analyzes recorded UAV footage to detect seat belt use during post-processing, allowing for accurate evaluation under real-world conditions. The primary objectives of this research include the following: (i) developing a robust detection system for monitoring seat belt usage at stop-controlled intersections, (ii) evaluating key factors that impact seat belt detection accuracy, and (iii) demonstrating the scalability and reliability of automated UAV-based seat belt data collection as an alternative to traditional manual surveys. The system’s performance was evaluated through three focused studies designed to reflect common real-world challenges:
Seat Belt–Shirt Color Contrast: Investigated the impact of varying color contrasts on detection accuracy, including challenging cases where seat belt and shirt colors closely matched.
Sunlight Direction: Assessed the model’s robustness under diverse lighting conditions, including glare and shadows, across different times of the day.
Vehicle Type: Examined how differences in vehicle design and interior layouts influenced seat belt visibility and detection accuracy.
To validate real-world applicability, the seat belt detection system was deployed at two stop-controlled intersections in Iowa. Video footage from each site was manually reviewed and compared with the system’s detections. Additionally, performance was benchmarked against OpenAI’s GPT-4V(ision) to evaluate its feasibility for large-scale deployment. As part of this evaluation, a confidence threshold of 0.70 was applied to determine seat belt status: detections below this threshold were categorized as “unknown” and excluded from compliance analysis, reducing false detections and ensuring reliable results.
This study introduces a reliable and adaptable approach for real-world seat belt compliance monitoring, offering valuable insights for policy and safety initiatives. The remainder of this paper is organized as follows: the Materials and Methods section outlines the experimental setup, data collection process, and model development; the Results section highlights the model’s performance and findings; the Discussion explores practical applications and associated challenges; and the Conclusion summarizes key insights and recommendations for future improvements.
2. Materials and Methods
2.1. Experimental Setup and Data Collection
Controlled video data collection was conducted at Iowa State University’s Institute for Transportation (InTrans) parking facility to develop a high-quality dataset for training and evaluating the seat belt detection model. The experimental setup, depicted in Figure 1, simulated real-world stop-controlled intersection dynamics, where drivers were required to yield at designated points before proceeding.
A tethered UAV was used to capture video footage, positioned overhead and slightly to the left of the lane to achieve a near head-on perspective of approaching vehicles. This strategic placement minimized potential visual obstructions from the vehicle’s A-pillar, which could otherwise obscure the driver’s chest area and hinder seat belt detection. The UAV operated at two altitudes, 15 and 18 feet, to assess the optimal elevation for maximizing seat belt visibility while maintaining high detection accuracy. Additionally, the camera was zoomed in on the windshield, precisely focusing on the driver’s chest area to enhance image clarity and improve seat belt detection reliability.
During operation, the UAV was manually piloted into position and stabilized by the tether system, which allowed for controlled hovering and limited lateral movement based on the tether’s length and tension. Drone mounting, lateral movement, elevation adjustment, camera zoom, and landing were all remotely controlled using a handheld controller. A live video feed displayed on the connected tablet allowed operators to view the UAV’s real-time perspective and make fine adjustments to framing, zoom, and positioning as needed. This ensured that each recording was optimized for visibility and clarity, while maintaining safe, flexible, and stable operation.
Data collection was conducted over two days across multiple sessions during varying daylight conditions, including morning, midday, and late afternoon. To enhance visual diversity, footage was captured from different UAV angles and perspectives, providing varied views of approaching vehicles and driver seating positions. While data were not collected during extreme weather conditions due to equipment and safety constraints, the recordings captured a range of lighting scenarios and weather variability common to daytime driving. The dataset included a diverse mix of vehicle types (e.g., sedans, SUVs, trucks) and driver appearances, supporting broader model generalization.
2.2. Evaluation Studies
To evaluate the robustness of our seat belt detection model in real-world conditions, we designed three experimental studies, each targeting key factors that could impact detection accuracy. For each study, we curated distinct training and testing datasets to ensure a controlled and systematic assessment of the model’s performance. These studies allowed us to pinpoint potential limitations and refine the model to improve its generalizability. The following sections provide a detailed overview of each study.
2.2.1. Seat Belt–Shirt Color Contrast
This study examined how seat belt–shirt color contrast affects detection accuracy, especially when the seat belt closely matches the driver’s clothing. Footage was collected of drivers wearing shirts in various colors to ensure contrast variations. The dataset included high-contrast cases (e.g., black seat belt on a white shirt) and low-contrast cases (e.g., gray seat belt on a light-colored shirt) for a thorough performance evaluation.
Beyond assessing detection accuracy across contrast levels, we identified edge cases where the model struggled, such as misinterpreting clothing folds as seat belts or detecting patterned fabrics as seat belt-like features. By analyzing accuracy variations and failure modes, we uncovered the model’s limitations and explored refinements to enhance robustness in diverse real-world conditions.
2.2.2. Sunlight Direction
Lighting variability—particularly direct sunlight, glare, and shadows—poses a significant challenge for seat belt detection. This study examined how sunlight direction affects detection accuracy by capturing footage at different times of the day under varying illumination.
The UAV was positioned in two setups: one facing the sun and another with the sun behind it, allowing us to assess how changing light angles influenced seat belt visibility. The analysis revealed key failure cases, including overexposure washing out seat belt details and shadows creating false seat belt-like features. Based on these findings, we explored targeted improvements, such as exposure-based model adjustments (e.g., normalization and adaptive contrast correction) and data augmentation techniques to enhance detection accuracy under dynamic lighting conditions.
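As one illustration of the adaptive contrast correction mentioned above, the following sketch applies CLAHE to a frame’s luminance channel; the clip limit and tile size are assumed values rather than settings from this study:

```python
import cv2

def adaptive_contrast(frame_bgr):
    """Apply CLAHE to the luminance channel to recover washed-out detail.

    Illustrative only: clipLimit and tileGridSize are assumed values,
    not parameters reported in this study.
    """
    lab = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2LAB)
    l, a, b = cv2.split(lab)
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
    lab = cv2.merge((clahe.apply(l), a, b))
    return cv2.cvtColor(lab, cv2.COLOR_LAB2BGR)
```

Working in LAB color space confines the correction to luminance, so overexposed regions regain local contrast without shifting the colors that distinguish the seat belt from clothing.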
2.2.3. Vehicle Type
Variations in vehicle design—such as windshield height, tint, and interior layout—significantly impact seat belt visibility. Tall, clear windshields provide an unobstructed view, improving detection, while shorter or steeply angled windshields may obscure parts of the seat belt. Tinted windows further complicate detection by reducing contrast and masking seat belt details under certain lighting conditions.
This study assessed model performance across a diverse range of vehicles, including sedans, mini-SUVs, and trucks, to ensure consistent detection accuracy despite design differences. By analyzing detection variations across vehicle types, we identified model limitations and applied adaptive fine-tuning to enhance robustness across different vehicle designs.
2.3. Model Selection
This study utilized YOLO11, one of the latest additions to the You Only Look Once (YOLO) family of object detection models, selected for its superior performance, flexibility, and practical deployment potential [22]. As a deep learning network designed for real-time object detection, YOLO11 identifies object locations within an image and classifies them using pre-trained weights. By analyzing pixel intensities, the model predicts bounding boxes, class labels, and associated probabilities, providing a comprehensive solution for detection tasks. The integration of advanced components—such as the Cross Stage Partial with Spatial Attention (C2PSA) block and Spatial Pyramid Pooling-Fast (SPPF) block—enhances YOLO11’s ability to focus on critical regions within images while preserving computational efficiency. These features make YOLO11 particularly suitable for complex real-world applications, such as detecting seat belt usage from aerial footage, where both accuracy and inference speed are critical [23].
The choice of YOLO11 was also guided by its demonstrated balance between detection performance and computational efficiency. As shown in the benchmark comparison from Ultralytics [24], YOLO11 consistently outperforms earlier YOLO versions in terms of COCO mAP while maintaining competitive latency across different model sizes. When compared with other state-of-the-art object detectors, YOLO11 ranks among the top-performing models, offering a favorable trade-off between accuracy and speed. This scalability allows for flexible deployment depending on hardware constraints. For example, smaller variants such as YOLO11s and YOLO11m are lightweight and fast, making them well-suited for real-time applications on resource-constrained platforms. In contrast, larger models like YOLO11x provide higher detection accuracy and are more appropriate for offline processing scenarios, such as post-processed seat belt detection.
YOLO11’s architecture is particularly well-suited for seat belt detection in challenging conditions. The C2PSA block enables the effective isolation of critical image regions, improving detection accuracy for objects like seat belts, which can vary significantly in size and orientation. Furthermore, its integration of multi-scale feature extraction via the SPPF block allows the model to adapt to diverse lighting scenarios, such as shadows and direct sunlight, ensuring reliable performance under varying environmental conditions.
A major challenge in seat belt detection is low color contrast between the seat belt and the driver’s clothing or vehicle interior. YOLO11 addresses this through high-resolution feature integration and residual connections, capturing subtle differences in texture and color. Specifically, the C3k2 block (a Cross Stage Partial bottleneck variant that replaces one large convolution with two smaller ones) captures intricate details efficiently and distinguishes the seat belt from the surrounding background. This capability is crucial in addressing contrast-related challenges, such as when seat belt colors closely match the driver’s shirt or the vehicle’s interior.
For this study, the system was configured with a confidence threshold of 0.70, based on empirical validation to balance precision and recall while minimizing false detections. Detections were categorized into three outcomes: buckled, unbuckled, or unknown. The “unknown” category captured cases where the model’s confidence score fell below the threshold, allowing low-confidence predictions to be excluded from compliance analysis and ensuring uncertain outputs were explicitly accounted for.
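A minimal sketch of this three-way decision rule, assuming the Ultralytics inference API, a hypothetical weights file name, and the “Buckled”/“Unbuckled” labels used during annotation:

```python
from ultralytics import YOLO

CONF_THRESHOLD = 0.70  # threshold used in this study
model = YOLO("seatbelt_yolo11.pt")  # hypothetical fine-tuned weights file

def classify_driver(frame):
    """Return 'buckled', 'unbuckled', or 'unknown' for a single frame."""
    result = model.predict(frame, verbose=False)[0]
    if len(result.boxes) == 0:
        return "unknown"
    # Keep the highest-confidence detection in the frame.
    best = max(result.boxes, key=lambda box: float(box.conf))
    if float(best.conf) < CONF_THRESHOLD:
        return "unknown"  # low-confidence outputs are excluded from compliance analysis
    label = result.names[int(best.cls)]
    return "buckled" if label == "Buckled" else "unbuckled"
```

The explicit “unknown” branch is what keeps uncertain predictions out of the compliance rate rather than forcing them into either class.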
2.4. Training YOLO11 for Seat Belt Detection
The YOLO11 architecture was fine-tuned on a composite dataset of 3922 images, comprising UAV-captured footage from our two-day field data collection and external raw images from the Seat Belt Detection dataset on Roboflow Universe [25]. UAV footage was recorded at 30 frames per second as vehicles approached the intersection and, together with the external images, provided a diverse and high-resolution source for training and validation. The dataset was split into 80% for training and 20% for validation.
To account for extreme environmental conditions not captured during field data collection—such as nighttime, rain, and degraded image quality—targeted data augmentation techniques were applied during training. These included brightness and contrast adjustments, random cropping, scaling, rotation, perspective transformations, and the addition of noise and blur. These augmentations improved the model’s generalization to diverse and challenging real-world conditions.
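For illustration, a pipeline covering these techniques could be assembled with the Albumentations library; the transforms mirror the list above, but every parameter value is an assumption rather than the study’s exact setting:

```python
import albumentations as A

# Illustrative augmentation pipeline mirroring the techniques listed above.
# All probabilities and limits are assumed values, not the study's settings.
train_aug = A.Compose(
    [
        A.RandomBrightnessContrast(brightness_limit=0.4, contrast_limit=0.4, p=0.7),
        A.BBoxSafeRandomCrop(p=0.3),          # random cropping that preserves boxes
        A.RandomScale(scale_limit=0.2, p=0.5),
        A.Rotate(limit=10, p=0.5),
        A.Perspective(scale=(0.02, 0.08), p=0.3),
        A.GaussNoise(p=0.3),
        A.MotionBlur(blur_limit=5, p=0.3),
    ],
    bbox_params=A.BboxParams(format="yolo", label_fields=["class_labels"]),
)
```

Passing `bbox_params` in YOLO format keeps the “Buckled”/“Unbuckled” bounding boxes consistent with the geometric transforms.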
Model testing was conducted separately and included two components: (1) overall model testing using general test data to evaluate baseline performance across varied conditions, and (2) study-specific testing using carefully selected frames from the three experimental studies to analyze the model’s performance under targeted scenarios, including seat belt–shirt color contrast, sunlight direction, and vehicle type.
Annotations for all data—including training, validation, and both test sets—were performed using the Computer Vision Annotation Tool (CVAT), with drivers labeled as either “Buckled” or “Unbuckled”. This rigorous annotation process ensured high-quality ground truth labels to support reliable and consistent seat belt detection across diverse scenarios.
The YOLO11 training process optimized hyperparameters and validation strategies to balance performance and efficiency for real-world applications. Training began with an initial learning rate of 0.001667, which gradually decayed under a cosine learning rate scheduler to ensure smooth convergence and minimize loss function oscillations in later stages. A batch size of 16 images was used, leveraging available GPU resources for efficient processing. To stabilize gradient updates, momentum was set to 0.9, and a weight decay factor of 0.0005 was applied to regularize the model and prevent overfitting.
The model was trained for 200 epochs to ensure sufficient dataset exposure for convergence. The AdamW optimizer was chosen for its effectiveness in large-scale deep learning tasks, leveraging momentum and weight decay to improve convergence while reducing overfitting. These carefully designed strategies enhanced detection accuracy and ensured robustness across diverse operational conditions.
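As a sketch of how these settings map onto a training run, assuming the Ultralytics API, a placeholder dataset YAML (`seatbelt.yaml`), and the YOLO11x variant (not stated in the text, but consistent with the offline-processing use case noted earlier):

```python
from ultralytics import YOLO

# Training sketch using the hyperparameters reported above; the dataset
# config name and choice of model variant are assumptions for illustration.
model = YOLO("yolo11x.pt")
model.train(
    data="seatbelt.yaml",   # assumed dataset config (80/20 train/val split)
    epochs=200,
    batch=16,
    optimizer="AdamW",
    lr0=0.001667,           # initial learning rate
    cos_lr=True,            # cosine learning-rate decay
    momentum=0.9,
    weight_decay=0.0005,
)
```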
2.5. Evaluation Metrics
Model performance was assessed based on detection accuracy and computational efficiency to ensure effectiveness and real-world applicability. Detection accuracy was measured using Mean Average Precision (mAP) across IoU thresholds (0.5–0.95) and further evaluated with Precision, Recall, and F1-Score to balance false positives and false negatives. Computational efficiency was analyzed by comparing the model’s detections with OpenAI’s GPT-4V(ision) on the same dataset, assessing resource utilization, cost-effectiveness, and performance consistency to determine feasibility for large-scale deployment.
Precision: reflects the proportion of true positive detections among all predicted positives, computed as follows:
\[ \text{Precision} = \frac{TP}{TP + FP} \]
Recall: measures the proportion of true positives among all actual positives and was calculated as follows:
\[ \text{Recall} = \frac{TP}{TP + FN} \]
F1-Score: balances Precision and Recall using their harmonic mean, capturing the trade-off between the two metrics:
\[ \text{F1-Score} = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \]
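All three metrics follow directly from the true positive (TP), false positive (FP), and false negative (FN) counts, as in this short helper:

```python
def detection_metrics(tp, fp, fn):
    """Compute Precision, Recall, and F1-Score from raw detection counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return precision, recall, f1
```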
2.6. Real-World Deployment
The system was deployed at two stop-controlled intersections with distinct geometric and traffic characteristics. The first, a simple single-lane intersection, was located at 330th Street and Linn Street (330th–Linn) in Slater, Iowa, while the second, a complex multi-lane intersection with three lanes per approach, was at North Dakota Avenue and Ontario Street (North Dakota–Ontario) in Ames, Iowa.
These sites were selected for their representative traffic patterns and their location along routes historically used for annual manual surveys in Iowa. The goal was to replicate real-world data collection conditions, where surveyors adjust to accessible vantage points along designated routes.
Table 1 lists each site’s name, geographic coordinates (latitude and longitude), and weather conditions during deployment. At both locations, one hour of video footage was recorded to capture a representative sample of traffic patterns and environmental conditions.
Setup at Sites
At the North Dakota–Ontario intersection, researchers replicated the setup used during the experimental phase by positioning the drone directly overhead at the opposing approach with a slight lateral adjustment (Figure 1) and zooming in on the target approach. This setup provided a clear view of vehicles as they approached, stopped, and exited the intersection, optimizing detection by enhancing imaging clarity, particularly for slow-moving and stationary vehicles.
Data collection at 330th–Linn faced challenges in capturing ideal slow–stop–go scenarios. A roadside billboard required positioning the drone farther away to prevent tether interference, limiting footage to instances where drivers had just begun reducing speed. In some cases, drivers only slowed upon reaching the stop line, further complicating data collection. While other approaches allowed for broader coverage, the UAV was deliberately positioned with the sun behind it to avoid glare, which the evaluation studies showed significantly reduced detection accuracy. This adjustment optimized imaging conditions but constrained the ability to capture complete driver behavior sequences.
4. Discussion
4.1. UAV Deployment Strategy
This study demonstrates the effectiveness of integrating UAV-captured footage with advanced detection models to automate seat belt compliance monitoring at stop-controlled intersections. Data collection utilized a tethered UAV system: the tether secures the UAV to a base station on the ground, delivers power up to the UAV, and carries data down from it. Depending on the power source, the system can operate continuously, be accessed remotely, adjust elevation at any time, and be transported to different locations within minutes, making it adaptable for various field settings. In our implementation, the UAV was powered by a high-capacity, rechargeable battery stationed at the base. This setup supported continuous operation for an average of 5–6 h per session, which was sufficient for each day of experimental trials and real-world deployments. The battery could also be recharged while connected to the UAV, enabling near-continuous monitoring when necessary.
The customized YOLO11 model achieved high detection accuracy across the experimental studies, even under challenging scenarios like low seat belt–shirt contrast and tinted windows. A key finding was the impact of direct sunlight, where glare significantly reduced visibility and detection accuracy, emphasizing the importance of strategic UAV positioning. The study identified an altitude of 18 feet with a 2.3× camera zoom as the optimal setup for detection accuracy. This configuration ensured clear visibility of drivers during critical moments as they decelerated while approaching the stop sign, came to a full stop, and began to accelerate again. These low-speed movements reduced motion blur, allowing the model to reliably detect seat belt status. The zoom level also provided detailed views of the vehicle interior, further enhancing detection performance.
UAV positioning and deployment adaptability are important considerations for real-world implementation. While positioning the UAV with its back to the sun was found to improve visibility and reduce glare, this optimal setup may not always be feasible due to fixed road orientations, nearby obstructions (e.g., buildings, billboards, utility poles), or limited UAV placement zones. As observed at the 330th Street and Linn Street intersection, physical constraints can restrict ideal placement. In such cases, compromises may be necessary, such as scheduling data collection during periods with favorable lighting, prioritizing observation of specific intersection approaches where lighting is more manageable, or adjusting UAV positioning to balance visibility, safety, and sun direction. These factors must be considered during deployment planning to ensure system scalability and reliability across diverse field environments. Future improvements could explore adaptive solutions, such as rotating UAV mounts or polarizing camera filters, to further mitigate glare-related impacts.
In addition to lighting and placement, weather conditions—particularly wind speed—should also be considered during deployment planning. For safety and image stability, data collection should be prioritized on days when wind speeds remain below the recommended 25 mph threshold. Strong winds can cause the tethered line to shift erratically, compromising both UAV stability and flight safety. Ensuring stable flight conditions is critical for capturing clear, consistent footage for accurate seat belt detection.
4.2. Automating Seat Belt Compliance Monitoring–Model vs. Manual Review
The deployment of the seat belt detection model at two stop-controlled intersections, 330th–Linn with a simple one-lane-per-approach setting and North Dakota–Ontario with a complex three-lane-per-approach setting, demonstrated its reliability under real-world monitoring challenges. Manual reviews, simulating traditional methods, were compared against the performance of the automated system.
The comparison criteria for manual reviews were designed to simulate the conditions faced by traditional surveyors who often stand on level ground. However, even these conditions pose significant challenges for accurate observations. Surveyors have been known to stand behind trucks to gain a better view of windshields or observe from overpasses when possible. For example, in a national seat belt compliance report [8], surveyors explicitly stated the advantages of observing from overpasses: “Whenever possible, observations for high-volume, limited-access roadways were made from an overpass, allowing for easy viewing of seat belt use for both the driver and the passenger.” This comparison highlights that the UAV-based system, which provides a consistent top-down view and enhanced zoom capabilities, replicates and even surpasses the preferred observational settings for manual surveys, offering a fair and practical benchmark.
At North Dakota–Ontario, during single-pass observations—reflective of real-world monitoring—reviewers struggled to determine seat belt status reliably. When all lanes were occupied, with multiple vehicles lined up in some lanes and moving through the intersection, reviewers found it difficult to keep pace. Conditions such as low color contrast between seat belt and occupant clothing, tinted windshields, fast-moving vehicles, and overcast conditions on-site overwhelmed reviewers, resulting in many missed detections and a very low accuracy of 53.30%. In contrast, the YOLO11 model handled these complexities with ease, detecting seat belt status with an accuracy of 90.66% and no misclassifications, and reporting a high compliance rate. The model’s ability to simultaneously process multiple lanes and trailing vehicles, unaffected by visibility constraints, showcased its clear advantage over manual methods and reinforced its suitability for real-world seat belt compliance monitoring.
Ideally, the tethered UAV is positioned at a safe distance from utility poles to prevent any potential interference with the tether line during flight. However, at the 330th Street and Linn Street intersection, even a reasonable and typically acceptable distance from nearby utility poles was complicated by the presence of a large billboard adjacent to the roadway. In the event of strong winds, this billboard posed an additional risk of tether entanglement or obstruction. As a result, the UAV’s position had to be adjusted to maintain flight safety and tether clearance. Alternative UAV placements that would have enabled full capture of the stop-and-go cycle would have required the UAV to face direct sunlight. However, the experimental results from this study (Section 3.3) showed that UAVs facing sunlight experienced glare and shadow interference, which negatively affected detection accuracy. To avoid these issues, we prioritized camera positioning with favorable lighting. This further limited our ability to record the complete stop cycle, restricting footage to vehicles approaching the intersection and beginning to decelerate. In some cases, drivers did not begin slowing down until they reached the stop bar.
Despite these limitations, the model demonstrated strong performance, correctly detecting seat belt usage in 103 out of 114 vehicle pass observations (90.35%), compared to 72.81% achieved through manual video review under the same conditions. It is also worth noting that the posted speed limit at this intersection was 25 mph, which may have contributed to favorable detection outcomes, even when some vehicles did not fully decelerate.
Although the model achieved commendable accuracy, there is room for refinement to further reduce the number of instances classified as “unknown”. The 0.70 confidence threshold significantly enhanced reliability by minimizing false detections, but it also excluded detections with potential for improvement. For example, at North Dakota–Ontario, applying the threshold resulted in zero misclassifications, compared with seven without it, but 11 instances were still categorized as “unknown”. Each detected and classified instance of driver seat belt compliance was meticulously verified, yielding highly satisfactory results. Future iterations of the model should aim to reduce these ambiguous outcomes by improving confidence in challenging conditions, such as low-contrast settings or partially obstructed views.
4.3. Automating Seat Belt Compliance Monitoring–Model vs. LLMs
Despite GPT-4V(ision)’s ability to detect seat belt usage with slightly higher accuracy, its computational cost and inference time present scalability concerns. Each inference requires token-based API calls, making large-scale deployment costly, whereas our model runs inference at no additional expense, ensuring seamless and cost-effective implementation.
The study also highlights the inherent challenges in seat belt detection. GPT-4V’s justifications point to common sources of uncertainty, such as low image resolution, lighting variations, and seat belt blending with clothing. However, our model was specifically designed to address these limitations through targeted evaluation studies and training on diverse datasets. This structured approach enhances its ability to perform reliably across different conditions, reinforcing its adaptability to real-world scenarios.
By balancing accuracy, efficiency, and scalability, our model provides a practical and deployable solution for automated seat belt compliance monitoring, making it well-suited for large-scale enforcement and traffic safety applications.
Cost Analysis of GPT-4V(ision) for Seat Belt Detection
The cost of using OpenAI’s GPT-4V(ision) for seat belt detection is determined by its token-based pricing model, which accounts for both input tokens (image encoding and prompt) and output tokens (AI-generated response). GPT-4V processes images in 512 × 512-pixel tiles, with the input token count calculated as follows:
\[ \text{Tokens} = 85 + 170 \times n \]
where n represents the number of 512 × 512 tiles required to cover the image, i.e., \( n = \lceil \text{width}/512 \rceil \times \lceil \text{height}/512 \rceil \). For a 1365 × 768-pixel image, dividing the width and height by 512 and rounding up gives \( \lceil 1365/512 \rceil = 3 \) and \( \lceil 768/512 \rceil = 2 \), so \( n = 3 \times 2 = 6 \), leading to an estimated \( 85 + 170 \times 6 = 1105 \) input tokens.
Additionally, GPT-4V generates a text-based response (300 output tokens) for seat belt classification, confidence score, and reasoning.
Using OpenAI’s pricing (USD 0.01 per 1000 input tokens, USD 0.03 per 1000 output tokens), the cost breakdown is shown in Table 11.
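The arithmetic behind this estimate can be reproduced in a few lines (prices as quoted above):

```python
import math

def gpt4v_image_tokens(width, height):
    """Input tokens for one image: 85 base tokens plus 170 per 512x512 tile."""
    tiles = math.ceil(width / 512) * math.ceil(height / 512)
    return 85 + 170 * tiles

input_tokens = gpt4v_image_tokens(1365, 768)  # 3 x 2 tiles -> 1105 tokens
output_tokens = 300                           # classification, confidence, reasoning
cost_per_image = input_tokens / 1000 * 0.01 + output_tokens / 1000 * 0.03
print(f"{input_tokens} input tokens, ${cost_per_image:.4f} per image")  # about $0.02
```

At roughly two cents per image, per-frame analysis of hour-long deployments quickly becomes expensive, which underlies the scalability concern raised above.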
4.4. Implications and Future Research Directions
By employing a tethered UAV system and a customized YOLO11 model, this study demonstrates a practical and reliable method for monitoring seat belt use at stop-controlled intersections. Traffic enforcement agencies can adopt this system to enhance compliance monitoring, inform safety policies, and ultimately reduce severe injuries and fatalities.
2. Insights into Factors Affecting Seat Belt Detection
The study provides valuable insights into factors influencing seat belt detection performance, including seat belt–shirt color contrast, sunlight direction, and vehicle type. Understanding these variables is essential for optimizing deployment strategies and ensuring consistent detection accuracy in diverse field conditions.
While this study confirms the feasibility and accuracy of UAV-based automated seat belt compliance detection, several future improvements are worth exploring. The current system relies on manual UAV positioning; however, future deployments may incorporate pre-programmed or semi-autonomous flight control—potentially supported by Simultaneous Localization and Mapping (SLAM) techniques—for enhanced localization, obstacle avoidance, and dynamic repositioning in response to fluctuations in traffic flow or changing approach patterns and geometric constraints. SLAM integration could also facilitate smarter navigation around roadside obstructions and ensure consistent viewpoint framing despite varying infrastructure layouts.
In addition, addressing the challenges posed by direct sunlight remains critical. The strategic placement of the UAV with its back to the sun was shown to minimize glare and shadow interference; however, future systems may benefit from adaptive hardware enhancements such as polarizing camera filters, automatic gimbal adjustments, or exposure correction algorithms to maintain visibility when ideal positioning is not feasible.
Seat belt compliance systems can also evolve beyond detection to support real-time feedback and enforcement. For instance, dynamic message signs could be installed to encourage seat belt use, similar to speed feedback signs. In cases where a driver is detected without a seat belt, an immediate alert could be sent to law enforcement, or repeated violations could be logged to prompt targeted interventions. A regular analysis of these data could also reveal patterns of noncompliance, enabling safety implementations tailored to specific locations or demographic groups. By integrating detection, enforcement, education, and incentives, future systems can promote long-term behavioral change and contribute to broader traffic safety goals.
4.5. Study Limitations
Although data augmentation processes simulated extreme conditions, actual extreme weather conditions and nighttime footage were not tested. These factors could significantly impact detection accuracy. Future studies should include data collection under a broader range of environmental conditions, such as heavy rain, snow, fog, and low-light or nighttime scenarios, to thoroughly evaluate the model’s robustness.
2. Front Seat Passenger Detection
The model performs exceptionally well in detecting drivers but struggles with the consistent detection of front seat passengers in certain scenarios. It performs well when vehicles are directly facing the UAV, but detection accuracy decreases when vehicles are inclined, and the windshield pillar obstructs the view of the front passenger. Addressing this limitation in future iterations could further improve the model’s reliability.
3. Higher-Speed Environments
This study focused on seat belt detection at stop-controlled intersections, where drivers are generally expected to slow down or stop. As such, the model’s performance in higher-speed traffic environments remains untested. Future work should investigate detection accuracy in scenarios where vehicles are moving at higher speeds, such as arterial roads or highways, to assess the model’s applicability in broader traffic contexts.