Fusion of Driving Behavior and Monitoring System in Scenarios of Driving Under the Influence: An Experimental Approach

Jan-Philipp Göbel; Niklas Peuckmann; Thomas Kundinger; Andreas Riener

doi:10.3390/app15105302

,

and

¹

CARIAD SE, Major-Hirst-Straße 7, 38442 Wolfsburg, Germany

²

CARISSMA Institute of Automatic Driving, Technische Hochschule Ingolstadt (THI), Esplanade 10, 85049 Ingolstadt, Germany

³

Faculty of Computer Science, Johannes Kepler University (JKU), Altenberger Straße 69, 4040 Linz, Austria

^*

Author to whom correspondence should be addressed.

Appl. Sci.2025, 15(10), 5302;https://doi.org/10.3390/app15105302

This article belongs to the Special Issue Human-Centered Approaches to Automated Vehicles

Version Notes

Order Reprints

Featured Application

The findings of this study and the developed classification models for detecting driving states offer a foundation for future series development of driver state detection systems. The results highlight how alcohol impacts driving and viewing behavior and demonstrate effective methods for identifying these changes. Importantly, the fusion of multiple sensor sources significantly enhances detection accuracy. While the current outcomes are not yet ready for mass production, they provide valuable insights for the future development of such systems.

Abstract

Driving under the influence of alcohol (DUI) remains a leading cause of accidents globally, with accident risk rising exponentially with blood alcohol concentration (BAC). This study aims to distinguish between sober and intoxicated drivers using driving behavior analysis and driver monitoring system (DMS), technologies that align with emerging EU regulations. In a driving simulator, twenty-three participants (average age: 32) completed five drives (one practice and two each while sober and intoxicated) on separate days across city, rural, and highway settings. Each 30-minute drive was analyzed using eye-tracking and driving behavior data. We applied significance testing and classification models to assess the data. Our study goes beyond the state of the art by a) combining data from various sensors and b) not only examining the effects of alcohol on driving behavior but also using these data to classify driver impairment. Fusing gaze and driving behavior data improved classification accuracy, with models achieving over 70% accuracy in city and rural conditions and a Long Short-Term Memory (LSTM) network reaching up to 80% on rural roads. Although the detection rate is, of course, still far too low for a productive system, the results nevertheless provide valuable insights for improving DUI detection technologies and enhancing road safety.

Keywords:

driving under the influence; intoxication detection; study; fusion; classification; driving behavior; gaze behavior; in-cabin monitoring systems

1. Introduction

Driving under the influence (DUI), especially under the influence of alcohol, is a global problem that costs many lives, seriously injures people, causes immense costs, and is one of the leading causes of car accidents [1,2]. In Europe, it is responsible for 25% of accidents [3]. Not every accident is fatal, but alcohol was responsible for 6.4% of all accident victims in Germany in 2021 [4,5]. According to the National Highway Traffic Safety Administration (NHTSA), about 10,000 people die each year in the U.S. [6]. This makes driving under the influence of alcohol (DUI) the number one cause of fatal accidents in the U.S., and 31% of fatal car accidents are correlated with alcohol [6]. The relative probability of causing an accident increases exponentially with blood alcohol concentration (BAC) [7,8]. This is visualized in Figure 1. Figure 1 shows the relative crash risk in relation to the BAC, where the relative risk is set in relation to the accident risk of driving sober (BAC = 0).

Figure 1. Blue: relative crash risk in relation to blood alcohol concentration (BAC) [7,8]; red: the legal limit in most EU countries.

This is why countries around the world have set limits above which it is illegal to drive a car. In Germany, the legal limit is 0.05% BAC. The relative crash risk at the legal limit is only 38% higher than driving sober [7]. This is why this study examined a blood alcohol concentration just above the legal limit of 0.06%. Various approaches exist in the literature to distinguish between sober and impaired drivers. These include methods based on thermal (Far-Infrared (FIR)) cameras [9], near-infrared (NIR) cameras [10,11], gas sensors [1,2,12], electrocardiography (ECG) [13], driving behavior analysis [14,15], and multi-sensor systems [2,15,16,17]. Most of today’s production vehicles do not integrate these sensors, but the available driving data can still be utilized. In addition, camera-based driver monitoring systems (DMSs) are being developed (e.g., DMSs from the Frauenhofer Institute [18]). According to a study by ABI Research, DMSs with driver monitoring cameras are considered a key technology for systems for driver condition recognition required by EU regulations (General Safety Regulations (GSRs) [19]) from 2024 and for semi-autonomous driving [18,20]. Therefore, the combination of driving behavior analysis and DMSs should increase the accuracy of detecting DUI.

3. Study Design

3.1. Participants

In the study, 23 participants (18 men and 5 women) took part. Their ages ranged from 24 to 47, with a mean age of 32.14. Two participants completed only the test drives, resulting in 21 test subjects (17 men and 4 women) who completed all drives. Each participant completed one test drive, consisting of two sober and two drunk drives, all conducted on separate days. The target BAC for drunk driving was 0.06%. More on this is covered in Section 3.2. The weight of the test persons was determined to calculate the correct amount of alcohol. The calculation of the amount of alcohol to be consumed in g was calculated using the modified Widmark formula [36]:

a = p \cdot r \cdot (c t + β \cdot t)

(1)

where a is the amount of alcohol in grams, p is the body weight in kg, r is the reduction factor (man 0.7, woman 0.6), ct is the target value of the BAC at the time of collection,

β

is the rate of alcohol degradation, and t is the time elapsed between the alcohol test and the blood sample. In this case, t = 0 as only the breath alcohol and not the blood alcohol is measured, and this is recorded immediately during the measurement. As the weight of the subjects was between 53 and 125 kg, the amount of alcohol administered varied greatly (24–60 g).

Recruiting Criteria

In Section 2, the correlation between driving experience and driving behavior is determined. In order to filter out novice drivers, the prerequisites for participation were defined as holding a driving license for more than 3 years and driving a car at least once a week. This minimizes the variance due to driving experience. All test subjects were able to tolerate alcohol. Therefore, only subjects who drank alcohol at least once a month participated. The same applied to subjects with Alcohol Use Disorder (AUD), further called alcoholics. Through filtering out non-drinkers and alcoholics, the variance within the test group was reduced. This increases the significance of the results. The smaller the test group, the more homogeneous the group of test subjects must be. The detailed requirements of the recruitment criteria are in Appendix B.

3.2. Experimental Procedure

The experimental procedure for this study consisted of three main phases: an intake screening, a practice session, and test sessions for each participant. Initially, potential participants who met the recruiting criteria outlined in Section 3.1 were contacted via email or phone, where the purpose of the study was explained. Those interested underwent a telephone screening, which assessed their demographics, health status, drinking habits, and driving experience. Participants who did not meet the eligibility requirements were excluded from the study, while qualified individuals provided informed consent in line with GDPRs. On the day of testing, participants were again screened to ensure they met the acute recruiting criteria.

Following screening, participants engaged in a practice session to minimize learning effects and assess simulator sickness. Participants had one hour to familiarize themselves with the procedure and complete a full driving scenario. Simulator sickness was evaluated through pre- and post-session questionnaires, leading to the exclusion of two participants.

The remaining participants proceeded to the test sessions, completing two sober and two drunk drives on different days within 14 days. A target BAC of 0.06% was aimed for during alcohol sessions, with driving sessions alternating to mitigate time series effects. Half began with sober driving, while the other half started with the alcoholized drive. Participants completed questionnaires before and after each session to assess health status and potential simulator sickness.

Prior to sober driving, a breathalyzer (AL5500) verified a BAC of 0.0%. Based on the available measurement methods, this study approximated the BAC using the BrAC. On alcohol test days, participants consumed a vodka (40%) and orange juice mixture calibrated to achieve a BAC of 0.06%. Alcohol was administered incrementally every 10 min. Participants self-assessed their intoxication level. The BrAC was measured approximately every 15 min during the session. After about 60 min, subjects completed their next simulator ride.

Post drive, participants self-assessed their experience and completed a simulator sickness questionnaire. Supervisors ensured that participants had an alcohol level below 0.01% before release. A trained first aider was present to manage any medical emergencies, though none occurred during the study.

3.3. Simulator

A dynamic simulator was used for the test. The simulator consisted of 3 65-inch screens with full HD resolution (1920 × 1080), reflecting the driving content. The driver’s seat was located on a movable platform that moves to match the driving action. This aims to make the ride feel more realistic and immerse the driver more deeply in the driving experience. The simulator also consisted of the following hardware and software:

Three 65-inch screens;
Next-level motion platform for the driver seat;
Fanatec steering wheels and pedals (16-bit encoder for steering);
Four IR cameras;
Three webcams for driver and scene supervision;
iMotions software for data collection;
Coherent light source for proper lighting conditions.

The setup is shown in Figure 2. The individual sensors were placed in the same positions as in a car. A CAD model of a car was used to position the sensors.

Figure 2. Simulator setup: overview of the driving simulator.

3.4. Scenarios

Scenarios were created using Unity (version 2022.3.14) software. They were divided into city, country road, and highway scenarios. Each driving scenario lasted around 10 min and had its own driving events in which the driver’s behavior could be explicitly observed. The events made the drunk driving and the sober driving scenarios more comparable.

The rides always started in the city. A route is shown in Figure 3. The orange, green, and red events are traffic light events with the respective colors of a traffic light. The blue events are stop events, where the driver has to stop due to an event. The yellow arrows symbolize the course of the route. The occurring events are listed in Table A1 in Appendix C. Each event is associated with an expected driver behavior, serving as a clear indicator of whether the intoxicated driver can still meet expectations.

Figure 3. City Map with overview over the events (E). The orange, green, and red events are traffic light events with the respective colors of a traffic light. The blue events are stop events, where the driver has to stop due to an event.

After the city scenario, participants drive onto a country road. The country road consists mainly of straight stretches and has three tight bends. The main idea behind this setup was to focus on the intersections of the middle lanes and the differences in steering behavior due to the curved stretches combined with emergency events. The rural road map is shown in Figure 4a.

Figure 4. Overview of the driving routes.

In the last section of the test track, the driver completes a section on the highway. The route shown in Figure 4b was designed so that the driver can be observed without interruption for as long as possible.

4. Method

The primary objective of the data analysis is to identify distinct features that allow for reliable classification. Additionally, various classification models will be developed and compared to determine the most effective ones for distinguishing between sober and intoxicated drivers. In integrating data from multiple sensors, the aim is to achieve the highest possible classification accuracy, demonstrating that sensor fusion enhances the precision of the results.

4.1. Method of Significance Analysis

Data preparation followed the principles of exploratory data analysis (EDA). Initially, the data were validated and checked for completeness and plausibility. Subsequently, the data were correctly labeled and cleansed of impurities. This process included a graphical representation of the signals, computational checks, and comparison with video recordings of the experiment. Larger outliers were filtered, and the overall data distribution was analyzed. In cases of a non-normal data distribution, the Wilcoxon signed-rank test was applied as an alternative.

The literature describes behavioral changes under the influence of alcohol, particularly affecting driving and gaze behavior, which serve as indicators of alcohol influence. Specific changes in driving behavior, such as acceleration, speed, braking, distance, lane keeping, steering behavior, and reaction times, were examined. Additionally, camera-based tracking showed changes in gaze patterns (fixations and saccades), facial expressions, and eye-steering coordination. In the next step, the recorded signals were compared with the indicators described in the literature. Paired t-tests were employed to identify significant differences between the data collected under the influence of alcohol and in a sober state.

In the study design, each driver completed five drives: one test drive, two sober drives, and two drives under the influence of alcohol. To compare the sober drives with the alcohol-affected drives, whether significant differences existed within the groups (sober and alcoholized) was first examined. If no significant differences were found within the groups, the drives were pooled and then compared.

Each drive included three scenarios: city driving, rural roads, and highways. Therefore, the analysis was conducted both across all scenarios and individually for each scenario. This approach was necessary because calculating averages, variances, and other statistical values for the entire route might obscure effects that become evident when analyzing individual scenarios. Furthermore, it was crucial to determine in which scenarios the most significant differences occurred to identify the situations where a drunk driver could be detected most reliably.

After identifying significant differences between the individual indicators, each was analyzed in detail to assess the nature of the observed changes. For instance, speed behavior was examined to determine how it evolved—whether it tended to increase or decrease, how the variance shifted, and in which scenarios the changes were most pronounced. While a paired t-test can confirm significant changes between two paired groups, it does not reveal the direction or characteristics of those changes. Therefore, a closer speed analysis was conducted, focusing on trends and variance.

Additionally, reaction behavior was specifically evaluated using events from the driving scenarios. These events allowed for the straightforward measurement of reaction times and the corresponding changes. Reaction time was assessed in two ways: first, during events requiring an emergency brake or stop, and second, in response to a traffic light turning from red to green, where acceleration was measured.

4.2. Method of Classification

The first step involved determining whether significant differences existed depending on the scenario and the specific indicators under consideration. Building on this, a classification algorithm was developed to distinguish between sober and intoxicated drivers. Given the complexity of alcohol’s influence—which varies depending on individual factors and blood alcohol concentration—a machine learning approach is well suited for detection.

This approach employs classification models based on calculated indicators and key values, which are compared against a more complex machine learning model that leverages the entire time series dataset. Among the most promising models based on the indicators are logistic regression, random forest, and gradient boosting.

Logistic regression is advantageous due to its simplicity and ease of interpretation. It models the probability of a binary state by applying a logistic function to a linear combination of the indicators. Random forest, by contrast, is a more complex model that constructs multiple decision trees from randomly selected subsets of indicators, and then classifies based on a majority vote. This method is particularly effective for capturing nonlinear relationships and is more resistant to outliers and overfitting. Gradient boosting also uses decision trees, but it builds them sequentially, with each tree correcting the errors of its predecessor. Although this method can achieve higher accuracy, it is more prone to overfitting and can be sensitive to outliers.

For the more complex machine learning model, a Long Short-Term Memory (LSTM) network was utilized. The LSTM model’s essential advantage over the others is its ability to process entire time series without the need for feature extraction. In contrast, simpler models rely on summary statistics such as the mean, median, standard deviation, and variance, which can result in a loss of important information. The LSTM’s ability to retain more of the original data is expected to yield better classification results.

In comparing the models, each driving scenario is analyzed individually, as well as the entire driving sequence. Additionally, the models are trained and evaluated based on four configurations:

Driving data only.
Camera data only.
Data-level fusion of driving and camera data.
Interpretation-level fusion of driving and camera data.

The aim was to demonstrate that fusing data from multiple sensors enhances the classification accuracy. Data-level fusion first combines the two data sources—driving data and camera data—before using them to train the models. In contrast, interpretation-level fusion trains and tests models separately on the driving and camera data. During testing, each model provides a prediction about the driver’s state, along with a confidence value. Ultimately, the prediction with the higher confidence value is selected as the final result.

For training our classification methods, we applied supervised machine learning. The sample set, consisting of only 21 participants (each with two drives: sober and intoxicated), limited the possibilities for machine learning approaches. Using unsupervised machine learning with all observed indicators (see Appendix D) led to overfitting and poor results. Therefore, we opted to select only a subset of indicators for model training. Similarly, for the LSTM model, we selected specific time series rather than key values for training. Data augmentation was not applied, as the number of sober drives matched the number of intoxicated drives, offering no clear advantage from augmentation.

Given the relatively small dataset, a leave-one-out cross-validation (LOOCV) approach was applied. In this method, one subject serves as the test set while the remaining subjects were used for training. This process was repeated until each subject was used as a test set once. This technique ensures meaningful insights despite a limited sample size.

Due to lengthy and costly computational demands, the LSTM model was not tested using the LOOCV method. LOOCV requires substantial computing power, as the LSTM model would need to be trained and tested for each subject across every dataset and test track. Instead, a traditional approach was adopted, using approximately 90% of the subjects for training and 10% for testing. This high training proportion was selected due to the relatively small sample size. The differing approach, compared to other models, may have influenced the results. Therefore, an initial comparison was made among the three simpler models, followed by a separate comparison with the LSTM model.

A detailed analysis and comparison of the various models and data configurations provided valuable insights into the most effective methods for detecting alcohol-impaired driving behavior and assessing their real-world applicability. In examining the various data configurations, the benefits of sensor data fusion are highlighted, showcasing its added value in improving detection accuracy.

5. Results

This study involved simulated drives conducted both under the influence of alcohol and in a sober state. The main objective was to identify the most effective classification algorithm for distinguishing between these two conditions and show that combining multiple data sources improves the classification accuracy. To achieve this, the characteristic differences between the two groups were analyzed, following the approach outlined in Section 4.

5.1. Significance Testing

To compare sober driving with driving under the influence of alcohol, it was first necessary to determine whether significant differences existed within each group (sober and intoxicated). Identifying these internal variations was crucial for isolating the key features that are essential for a reliable classification.

Using paired t-tests, no significant differences were found within the groups—neither in the recorded driving data nor in the camera-based data. This was consistent across all individual scenarios (city, country road, and highway) as well as the entire route. Therefore, merging the data from the sober and intoxicated drives within each group was possible for further analysis.

In the next step, the sober and intoxicated groups were compared to identify significant differences, again using paired t-tests.

In terms of gaze behavior, it became clear that the presence of significant differences depended heavily on the driving scenario. In the city scenario, 6 out of 10 indicators showed significant differences, while only 3 out of 10 did so on the country road, and no significant differences were observed on the highway (0 out of 10). When analyzing the entire route, only two of the ten indicators revealed significant differences.

Conversely, the significant differences in driving behavior were less dependent on the scenario. Across the entire drive, 16 out of 19 indicators showed significant differences; in the city scenario, 17 out of 20; on the country road, 21 out of 26; and on the highway, 17 out of 22 (See Table 1). However, not every indicator was recorded in each scenario, complicating direct comparisons. This variability also influenced the number of available indicators for each scenario. A list of all observed indicators is in Appendix D.

Table 1. Number of significant indicators in driving behavior.

5.2. Analysis of Gaze and Driving Behavior

5.2.1. Acceleration Behavior

Changes in acceleration behavior were most pronounced on the rural road. Not only did the majority of indicators show significant differences in this scenario, but the magnitude of these differences was also the greatest. For instance, positive acceleration increased by 7.47% over the entire route, 5.95% in urban areas, 9.56% in rural areas, and 9.53% on the highway. However, significant effects for average acceleration were only found in rural areas. The most reliable indicators of intoxication include average positive acceleration, average throttle position, average acceleration speed, overall deceleration, and acceleration variance. Intoxicated drivers tend to accelerate more abruptly and erratically, which correlates with difficulties in maintaining a steady speed and a general tendency to drive faster [21].

5.2.2. Braking Behavior

Braking behavior, in contrast, proves to be a highly reliable indicator across all scenarios. Indicators such as the average brake pedal position, the standard deviation of pedal position, the average brake pedal speed, and the standard deviation of pedal speed all show significant differences. These indicators suggest that intoxicated drivers brake more frequently or forcefully or do so less consistently, resulting in greater variability in brake pedal position.

5.2.3. Speed Behavior

Speed behavior is closely linked to acceleration and braking patterns. These factors can partially explain the observed differences in speed between sober and intoxicated drives. Intoxicated drivers tend to drive faster on country roads, highways, and across the entire route, especially in rural curves and at maximum speeds. This confirms the expected increase in speed due to intoxication. Additionally, speed variability increases (higher variance and standard deviation), likely due to erratic acceleration and braking. Reliable indicators include average speed, speed standard deviation, speed variance, and the relative time spent above the speed limit.

5.2.4. Steering Behavior

Changes in steering behavior, as described in the literature, were confirmed. Intoxicated drivers exhibited faster steering movements and an increased number of steering corrections, which affects lane-keeping ability. Drunk drivers struggle to maintain a stable lane position, resulting in more frequent steering adjustments. Significant differences in steering behavior were observed across all road types, with additional effects noted on rural roads during curves. Reliable indicators across all scenarios include the number of steering reversals greater than 5° and 10° per minute and the average steering speed in both clockwise and counterclockwise directions.

5.2.5. Reaction Behavior

Reaction times were analyzed through events simulating hazardous situations that required a braking response. Reaction time was measured as the time between the appearance of the hazard and the driver’s initial braking action. In urban areas, hazardous events included pedestrians suddenly crossing the road, and while in rural areas, they involved an animal crossing or a rock slide. Three events remained for analysis (two pedestrian crossings and one animal crossing) due to traffic-related braking obscuring other reactions. However, learning effects overshadowed the results, preventing significant conclusions about reaction times.

To supplement the analysis, reaction times were also measured when traffic lights changed from red to yellow. A significant difference in reaction time was found at only one of four traffic lights, with drunk drivers consistently showing slower reaction times. However, in all four traffic light scenarios, there was a tendency toward a slower reaction time.

5.2.6. Fixations and Saccades

Fixations and saccades were identified as the most meaningful gaze indicators. The analysis of the speed, amplitude, duration, and frequency of fixations and saccades revealed that the significance of these indicators varied depending on the driving scenario. In urban areas, the number of fixations and saccades decreased, fixation duration increased, and saccade amplitude decreased significantly. Saccade speed, however, remained essentially unchanged. On country roads, only fixation frequency and saccade amplitude showed significant differences, while on highways, no significant differences were observed.

The results align with findings in the literature, where alcohol consumption is associated with tunnel vision. This manifests as a reduction in the number of fixations and saccades, longer fixation durations, and a narrower field of vision, as indicated by a decreased saccade amplitude. Scenario-specific differences can be explained by varying demands on the driver. For example, urban driving requires frequent changes in the line of sight due to turning and lane changes, whereas highway driving involves a more constant, forward-focused vision. As a result, significant differences in gaze behavior are harder to detect on highways.

5.2.7. Eye-Steering Coordination

Eye-steering coordination was analyzed by examining two aspects: the temporal delay between eye and steering movements and the frequency of turns or lane changes made without prior sideways glances. Normally, drivers look into a curve or turn before initiating a steering maneuver. Under the influence of alcohol, this foresight is reduced, and a shorter time lead between eye and steering movements is expected. However, no significant changes in this behavior were observed in any scenario.

On the other hand, significant differences were noted in lane-changing and turning behavior in urban and rural road scenarios, where intoxicated drivers showed fewer sideways glances before making these maneuvers. This suggests a tendency for drunk drivers to drive more recklessly and carelessly.

5.3. Classification Algorithms for DUI

To ensure a reliable classification of driving under the influence of alcohol, various classification algorithms were tested and compared. The performance of each algorithm was evaluated across different driving scenarios.

5.3.1. Logistic Regression

Logistic regression proved to be an effective method for classifying DUI across all scenarios. As demonstrated in Table 2, accuracy generally improved when data from multiple sensor sources were combined. The fusion of data at both the data and interpretation levels yielded comparable results.

Table 2. Model performance of logistic regression.

In the training data, the fusion of both datasets outperforms each individual data source, both at the data level and interpretation level. This trend appears to continue when predicting the test data; however, the highest classification accuracy was observed when using only the camera data for the complete scenario. This may result from the small dataset size, potentially leading to overfitting. The situation was further examined using a confusion matrix.

Figure 5 illustrates that the logistic regression model significantly outperforms random chance in predicting DUI. The figure highlights where the model made correct predictions and where it erred. The true positive rate (drunk drivers correctly classified) and true negative rate are roughly equal, indicating that the model is not biased toward classifying only sober or only drunk drivers. Instead, both groups were accurately identified to a large extent. Consequently, the precision and recall result in an overall accuracy of 66.76% and 70%, respectively.

Figure 5. Confusion matrix of logistic regression: complete track with camera data.

5.3.2. Random Forest

Table 3 presents the results of the random forest model. Similar to logistic regression, accuracy improves when multiple sensor sources are fused. However, as with the previous model, the training accuracy exceeds the test accuracy. In most scenarios, the random forest model achieves 100% accuracy on the training data, indicating significant overfitting and noticeably poorer performance on the test data. Despite this, the test accuracy remains comparable to that of logistic regression. Due to the inherent randomness of the random forest algorithm, the results are not fully reproducible, with slight variations occurring in each run. Nevertheless, these variations are minimal, and the overall accuracy remains consistent with the values shown in Table 3.

Table 3. Model performance of random forest.

In the best-case scenario on the country road, the random forest model achieves an accuracy of over 70% (see Figure 6). It also attains precision and recall values of 75% and 70%, respectively. These results indicate both a high true positive rate and a strong true negative rate, reflecting the model’s ability to accurately classify both drunk and sober drivers.

Figure 6. Confusion matrix of random forest: rural road with fusion at data level.

5.3.3. XGBoost

The results of the XGBoost model are presented in Table 4. The notably high training accuracies suggest significant overfitting, leading to relatively poor performance on the test data. In urban scenarios, the model performs reasonably well, with an accuracy ranging from 56% to 63%. On rural roads, the model shows moderate performance, achieving between 51% and 63% accuracy. However, on the highway, prediction accuracy is notably low, even falling below the threshold of random guessing (50%). Overall, XGBoost performs better in urban scenarios, particularly when data are fused at the interpretation level (see Figure 7).

Table 4. Model performance of XGBoost.

Figure 7. Confusion matrix of XGBoost: city with fusion on interpretation level.

5.3.4. Comparison of the Models

When comparing the models, their performance across different scenarios was evaluated. In terms of training accuracy, all models show very high performance, reaching up to 100%. While XGBoost and random forest consistently achieved close to 100%, logistic regression reaches between 67% and 97%, which suggests a lower risk of overfitting compared to the other models. The near-perfect training accuracy of XGBoost and random forest indicates a strong tendency for overfitting, as they almost perfectly memorize the training data.

In the case of logistic regression, there is a clear trend toward higher training accuracy as the datasets are fused. While this increases the risk of overfitting, it also improves the overall accuracy.

However, the focus should be on test accuracy, which is a more reliable indicator of model performance. Logistic regression maintains consistent test accuracy across all scenarios, with a slight reduction in performance on rural roads. In contrast, XGBoost and random forest exhibit greater variability in test accuracy across scenarios. Test accuracy is notably higher in city and rural road scenarios compared to the highway. When the entire route is considered, the models achieve test accuracy values that fall between the scenario-specific results.

Additionally, the results demonstrate that data fusion tends to improve test accuracy. As shown in Table 5, the table highlights which model performs best in each scenario and indicates the data sources used to achieve the corresponding test accuracy.

Table 5. Best model performance in scenarios.

In general, logistic regression and random forest tend to outperform XGBoost. Random forest achieves classification accuracies of up to 73%. However, across all three models, overfitting remains a significant issue, with near-perfect training accuracies contrasting sharply with the fluctuating test accuracies and F1 scores.

5.4. LSTM Network

Another classification model, based on Long Short-Term Memory (LSTM) networks, was developed to predict the driver’s state more accurately. Unlike the previous models, the LSTM model uses the full time series of measured values rather than rely on calculated metrics such as average speed. It was anticipated that using time series data would lead to better classification accuracy by preserving more of the dynamic information inherent in the signals.

The model was trained with approximately 90% of the participants’ data, while the remaining 10% were used for testing. This high training proportion was chosen due to the relatively small sample size. Each scenario was analyzed individually, and the model was trained and tested with the respective datasets. The model was trained over 100 epochs, with training improvements generally stabilizing around 60–70 epochs. In line with previous results, the fusion of multiple data sources tended to yield better results than using a single dataset, though the accuracy varied across scenarios:

The average testing accuracy in urban scenarios ranged from approximately 50% to 70%.
The best testing accuracy was observed on rural roads, with maximum values exceeding 80%.
A poor testing accuracy was noted on the highway, with values around 50%.

During training and testing, it became evident that classification accuracy was more influenced by driving behavior than by eye-tracking data. This explains why the test accuracy was higher on rural roads, where driving behavior showed the most pronounced differences between sober and intoxicated states.

However, the small sample size posed a significant challenge for the LSTM model. Overfitting was a persistent issue during training, and attempts to mitigate this by augmenting the data with synthetic samples did not yield any meaningful improvement in classification performance.

6. Discussion

By focusing our study on a highly homogeneous group of participants, we minimized confounding factors such as variations in drinking habits and driving experience [28,30,35]. This approach allowed us to draw significant conclusions despite the relatively small sample size of 21 subjects.

6.1. Features

In the study, each subject completed five drives. Excluding the test drive, each driver performed two sober and two intoxicated drives. The results indicate that the features considered for drives in the same state (sober or intoxicated) did not differ significantly. This suggests that learning effects did not notably influence the driving and gaze behavior. Consequently, it was possible to merge the drives with the same alcohol state and compare them with those from the opposite state.

When comparing sober and intoxicated states, the features showed significant differences, aligning with the expected results from the literature. This study confirms the findings regarding risky and more aggressive driving behaviors, such as increased speed, acceleration, and variance [21,22,26]. We also observed a tendency toward slower reaction times. However, the changes were not significant enough to be detected with the paired t-test, nor did we find an increase in errors when stopping at red traffic lights [21]. This absence of notable changes may be attributed to learning effects from the repeated driving scenarios. Fortunately, the learning effects were limited to the events that occurred and did not influence general driving and viewing behavior.

Changes in lateral stability and steering behavior were also noted [22,23,26]. The most pronounced changes were observed on the rural road. Although significant differences were found in all scenarios (see Table 1), the extent of these changes was greatest on the rural road. It is hypothesized that the overlay of numerous events in urban settings results in less noticeable changes in driving behavior. At the same time, the more monotonous driving on highways may lead to more minor changes.

Gaze behavior showed different patterns of change, with a stronger dependence on the driving scenario. Gaze behavior exhibited the most changes in the city, which is expected due to increased driver activity. In urban environments, drivers must frequently shift their gaze when turning or changing lanes. On rural roads, drivers still need to look around, whereas on highways, they can focus ahead for extended periods. The reduced gaze activity in certain scenarios makes it more challenging to detect differences between sober and intoxicated states. As noted by Makowski et al., Watten et al., Roche et al., and Silva et al., fixation and saccades are effective indicators of changing gaze patterns [31,32,33,35].

Figure 1 demonstrates that the relative risk of causing an accident increases exponentially with the BAC. This indicates that driving behavior deteriorates more severely with higher BAC levels. In our study, participants consumed alcohol only up to a BAC of 0.6 per mile. As the graph illustrates, the risk at this BAC is still relatively low (only 68% higher risk than driving sober), which might explain why we did not reproduce some of the changes reported in the literature. For instance, we did not observe significant changes in eye-steering coordination, contrary to the findings of Marple-Horvat et al. [15]. Additionally, smaller changes are harder to classify compared to more pronounced changes at higher BACs. Nonetheless, detecting impairment just above the legal limit is particularly valuable. We hypothesize that our models would perform better at higher BACs, although the opposite conclusion may not be achievable.

6.2. Models

While it is anticipated that the selected models will achieve better classification accuracy at higher BACs, Section 5.3 shows that the classification accuracy is influenced by other factors as well. Various scenarios and datasets were examined during the training and testing of different models. While the literature typically focuses on identifying changes that occur, we went further by evaluating how reliably these changes can be used to detect a driver’s state. Moreover, we advanced the analysis by integrating multiple sensor sources for state classification and comparing the outcomes. Various classification algorithms were employed to assess detection accuracy (see Section 5.3.4). The focus was on three simplified models—logistic regression, random forest, and XGBoost—which were then compared to a more complex LSTM network.

As expected, combining multiple datasets improved model performance. Fusion at both the interpretation and data levels produced similar results. However, the driving scenario had a more significant impact on test accuracy than the choice of dataset. On average, the models performed best in urban and rural road scenarios, with significantly worse performance on highways. This observation aligns with the findings from the feature analysis, where the most significant differences were noted in city and rural conditions. The analysis across all scenarios resulted in test accuracies between those observed in city and highway settings.

Similar trends were observed with the LSTM model, which achieved particularly high test accuracies on rural roads. A significant challenge for all models, especially the LSTM, was the relatively small sample size. The limited number of participants was insufficient for training the models effectively, leading to overfitting issues.

6.3. Limitations

Our models and conclusions about altered driving behavior are based on a narrowly defined, homogeneous group of participants. It is essential to verify whether these findings can be generalized to a broader population, as the classification accuracy may shift with a more diverse subject pool. Factors like varying drinking habits and driving experience could impact the reliability of detection [28,30,35].

Additionally, this study was conducted under controlled, optimal conditions. Participants were required to appear sober, having consumed only a light snack, and to abstain from alcohol, caffeine, or medication prior to the study. Given these conditions and the fact that the study took place in a driving simulator rather than a real vehicle—where the driving routes were identical each time—the applicability of the results to real-world scenarios must be considered. Furthermore, individual factors such as daily physical and mental conditions, as well as inherent differences between participants, could strongly influence the results. Factors like adrenaline rushes, fatigue after partying, or other variables that impact real driving behavior were not accounted for.

However, this study’s primary objective was to provide a direct comparison between sober and intoxicated driving. The controlled simulator environment, with its consistent conditions, was the most suitable approach for achieving this comparison. Nonetheless, the validity of these findings could be strengthened by increasing the number of participants and conducting tests in a more realistic driving environment.

7. Conclusions

Road safety continues to be a paramount concern worldwide, with driving under the influence (DUI) remaining one of the leading causes of traffic accidents. Investigating DUI is a logical step to enhance safety measures. Our study extends previous research by not only confirming the known effects of alcohol on driving and gaze behavior but also using these indicators to classify a driver’s state. In doing so, we took an innovative step further by classifying a driver’s impairment state using multiple-sensor data. By combining eye-tracking and driving behavior data, we were able to fuse these sources of information in various ways, leading to valuable insights and high-performing classification models.

This study confirms the well-documented impact of alcohol on driving, particularly highlighting risky behaviors such as increased speed, more aggressive maneuvers, and changes in steering and lateral stability. These changes were most pronounced in rural road scenarios, where monotonous driving heightened the visibility of alcohol-induced impairments. Urban environments, while complex and event-heavy, also demonstrated noticeable shifts in driving behavior, though these were less distinct due to the multitude of visual and driving tasks required. Interestingly, our findings reveal that gaze behavior (specifically fixation and saccades) was strongly dependent on the driving scenario, with city settings displaying the most dynamic shifts due to the frequent need for attention shifts when turning or changing lanes.

One of the novel contributions of our study was the application of classification models to detect intoxication based on sensor data. The fusion of eye-tracking and driving behavior features allowed for a strong classification performance, particularly in urban and rural settings. Our models achieved test accuracies exceeding 70%, with the more complex LSTM model reaching up to 80% in accuracy on rural roads. These results illustrate that combining data sources enhances the ability to detect a driver’s state, offering a more robust method for identifying impairment compared to relying on a single data stream.

However, while our models showed strong performance, they were hindered by overfitting, primarily due to the relatively small sample size of participants. This overfitting was evident in the gap between training and testing accuracies, especially on highways, where the more consistent driving environment made it harder for the models to distinguish between sober and intoxicated states. Additionally, the moderate BAC level of 0.06% may have limited the detection of more severe impairments that might occur at higher levels of intoxication. Nonetheless, detecting impairment just above the legal limit is particularly important for real-world applications, where even mild impairment can pose significant risks.

Looking ahead, future research should focus on addressing the limitations encountered in this study. A larger sample size would help mitigate the overfitting issue and improve the generalizability of the models. Furthermore, investigating higher BAC levels could reveal more pronounced behavioral changes and offer deeper insights into the relationship between alcohol consumption and driving performance. Additionally, exploring correlations between gaze behavior and driving performance could provide valuable insights into how attentional focus is impacted by intoxication. By refining these approaches, future studies could further enhance the accuracy and reliability of classification algorithms, offering powerful tools for improving road safety and potentially informing the development of real-time DUI detection systems in vehicles.

In conclusion, this study not only validated previous research on the effects of alcohol on driving but also advanced the field by demonstrating how multiple sensor sources can be leveraged to classify a driver’s intoxication level accurately. With improvements in sample size and expanded research into higher BAC levels, these models hold great potential for contributing to future road safety technologies, making our roads safer for everyone.

Author Contributions

Conceptualization, J.-P.G.; Methodology, J.-P.G. and N.P.; Software, J.-P.G. and N.P.; Validation, J.-P.G. and N.P.; Formal analysis, J.-P.G. and N.P.; Investigation, J.-P.G. and N.P.; Resources, J.-P.G.; Writing—original draft, J.-P.G.; Writing—review & editing, J.-P.G., T.K. and A.R.; Visualization, J.-P.G.; Supervision, T.K. and A.R.; Project administration, J.-P.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study protocol was reviewed by a qualified medical professional responsible for overseeing the research. The study involved healthy adult volunteers, moderate alcohol consumption, and anonymized data collection in a simulated driving environment. The anonymized data was received by VW and the data protection departments of CARIAD SE and VW, ensuring compliance with relevant data protection regulations. The ethical aspects of the study were reviewed and approved by the Editor-in-Chief during the editorial process.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data are not available due to German data protection law and confidentiality requirements.

Acknowledgments

The authors would like to thank VW for planning and conducting the study and providing the data. In this paper, artificial intelligence was used in places to correct grammatical errors and make stylistic improvements. These adjustments were made carefully, taking into account the original content, and are intended to help optimize the quality and comprehensibility of this paper.

Conflicts of Interest

Authors Jan-Philipp Goebel, Niklas Peuckmann and Thomas Kundinger were employed by the company CARIAD SE. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

BAC	Blood alcohol concentration;
BrAC	Breath alcohol concentration;
LOOCV	Leave-one-out cross-validation;
NHTSA	National Highway Traffic Safety Administration;
DUI	Driving under the influence;
DMS	Driving monitoring system;
GSR	General Safety Regulation;
SPAVG	Variance in average speed;
LPAVG	Variance in average of lane position;
SPSD	Standard deviation of speed;
LPSD	Variance in standard deviation of lane position;
CS	Contrast sensitivity;
SA	Steering angle;
SS	Steering speed;
SRR	Steering reversal rate.

Appendix A. Ethical Framework

Appendix A.1. Fundamental Points

The breath alcohol concentration (BrAC) should be examined and not the blood alcohol concentration (BAC).
A certified measuring device should be used to measure the BAC.
The target value for the alcohol concentration for the complete experiment is a maximum of 0.6‰.
If the target value is not reached, it is not allowed to be “readjusted”.
Clear alcohol is best suited for setting the alcohol concentration.
Subjects taking acute or long-term medication are not allowed to participate in the study.

Appendix A.2. Study Preparation

No acute medication (pain medication, etc.) may be taken in the 24 h prior to the start of the experiment.
The subjects must be sober (no food or alcohol before the test), just a small snack.
The necessary amount of alcohol is calculated individually for each subject (Widmark formula)
At least one first aider must be on hand, ready to call an ambulance immediately in the event of vomiting.
Study participants test the driving simulator in a sober state to see if they experience simulator sickness.

Appendix A.3. Study Conduct

Alcohol intake is evenly distributed over a period of 45 min; do not drink a large amount of alcohol at once. The BrAC is to be determined for each test subject every 15 min.
Immediately before each test drive, the BrAC must be determined again. Depending on the duration of the drive, it may also be necessary to determine the BrAC again during the course of the drive.
After the end of the study, the subjects must remain under observation on site until an $B r A C \leq 0.2$ ‰ is reached. Note: the rate of alcohol breakdown is approx. 0.1–0.15‰/h

Appendix B. Recruting Criteria

Exclude participants with Alcohol Use Disorder.
Exclude drivers with driving under influence history.
Exclude drivers with crash history.
Exclude high-risk drivers.
Exclude participants with neuropathology.
Exclude participants with metabolism problems (at least based on their BMI).
Focus on a single and relatively narrow age group.
Participants cannot consume alcohol at least 24 h before the study.
Participants should have 8 h sleep before the study.
All participants have a good physical and mental health status (on the day of testing).
No food two hours before test day.
-
No medical conditions that can interact with alcohol.
-
No acute (taken in the last 24 h) or long-term medication that can interact with alcohol.
-
No eye disease or deviation (good visual acuity).

Appendix C. Description of the City Events

Table A1. City event description: events from Figure 3.

Event	Event Explanation	Expectation
E1, E12, E18, E20	Green intersection	Driver continues as usual.
E2, E10, E14, E21	Red intersection	Driver stops and waits for the green signal.
E3, E22	Pedestrian crossing	Driver stops and lets the pedestrian cross.
E4, E19	STOP—No Car	Driver stops, looks both ways to ensure clear pass, and then continues.
E5, E16	STOP—Car	Driver stops, lets the car pass, then looks both ways to ensure clear pass and continues.
E6	Pedestrian + 30 km/h	Driver stops and lets the pedestrian cross and then continues at a lower speed.
E7, E8, E9, E17	Orange intersection	Driver has the option to either speed up, or slow down and wait at the traffic lights.
E11, E15	Priority by right	Driver stops and lets the car pass.
E13	Turn right + pedestrian	Driver is surprised by the pedestrian just after the right turn; the driver stops and lets the pedestrian cross.

Appendix D. Features for Significant Test

Table A2. Übersicht der Features für verschiedene Auswahlbereiche.

Indicators	Features
Acceleration Behavior	- Average acceleration - Average deceleration - Variance of acceleration - Average throttle position - Average duration of throttle engagement - Average speed of throttle engagement - Throttle activity
Braking Behavior	- Average brake pedal position - Brake pedal position standard deviation - Average brake pedal application speed - Standard deviation of brake pedal application speed
Speed behavior	- Average speed - Average maximum speed - Speed standard deviation - Speed variance - Average speed in curve situations - Speed standard deviation in curve situations - Average duration above 100 km/h
Steering behavior	- Lane deviation standard deviation - Lane position standard deviation - Lane position variance - Number of line crossings - Average duration of line crossings - Standard deviation of time outside lane
Gaze	- Mean eye time lead - Median eye time lead - Proportion without side view - Speed standard deviation - Speed variance - Number of fixations - Number of fixations per minute - Mean fixation length - Number of saccades - Number of saccades per minute - Mean saccade amplitude - Mean saccade velocity

References

Garg, D.; Kumar Srivastava, A.; Paliwal, D.; Shekhar, S.; Singh Chauhan, A. Alcohol Detection System in Vehicle Using Arduino. J. Xi’an Shiyou Univ. Nat. Sci. Ed. 2020, 16, 105–108. [Google Scholar]
Dairi, A.; Harrou, F.; Sun, Y. Efficient Driver Drunk Detection by Sensors: A Manifold Learning-Based Anomaly Detector. IEEE Access 2022, 10, 119001–119012. [Google Scholar] [CrossRef]
Euro NCAP. Euro NCAP Vision 2030: A Safer Future for Mobility. 2022. Available online: https://cdn.euroncap.com/media/74468/euro-ncap-roadmap-vision-2030.pdf (accessed on 29 September 2024).
Statistisches Bundesamt. Verkehrsunfälle und Verunglückte im Zeitvergleich (ab 1950). 2023. Available online: https://www.destatis.de/DE/Themen/Gesellschaft-Umwelt/Verkehrsunfaelle/Tabellen/liste-strassenverkehrsunfaelle.html (accessed on 29 September 2024).
Deutsche Hauptstelle für Suchtfragen e. V. FS_Alkohol_im_Strassenverkehr. 2017. Available online: https://www.dhs.de/fileadmin/user_upload/pdf/Broschueren/FS_Alkohol_im_Strassenverkehr.pdf (accessed on 29 September 2024).
National Highway Traffic Safety Administration and U.S. Department of Transportation. Traffic Safety Facts 2013 Data: Alcohol-Impaired Driving. 2013. Available online: https://crashstats.nhtsa.dot.gov/Api/Public/ViewPublication/812102 (accessed on 29 September 2024).
Blomberg, R.D.; Peck, R.C.; Moskowitz, H.; Burns, M.; Fiorentino, D. The Long Beach/Fort Lauderdale relative risk study. J. Saf. Res. 2009, 40, 285–292. [Google Scholar] [CrossRef]
Blincoe, L.; Miller, T.; Wang, J.-S.; Swedler, D.; Coughlin, T.; Lawrence, B.; Guo, F.; Klauer, S.; Dingus, T. The Economic and Societal Impact of Motor Vehicle Crashes, 2019 (Revised). 2023. Available online: https://www.researchgate.net/publication/367460305_The_Economic_and_Societal_Impact_of_Motor_Vehicle_Crashes_2019_Revised (accessed on 29 September 2024).
Koukiou, G. Intoxication Identification Using Thermal Imaging: 8. In Human-Robot Interaction; Anbarjafari, G., Escalera, S., Eds.; IntechOpen: Rijeka, Croatia, 2017. [Google Scholar] [CrossRef]
Tapia, J.E.; Droguett, E.L.; Valenzuela, A.; Benalcazar, D.P.; Causa, L.; Busch, C. Semantic Segmentation of Periocular Near-Infra-Red Eye Images Under Alcohol Effects. IEEE Access 2021, 9, 109732–109744. [Google Scholar] [CrossRef]
Makowski, S.; Prasse, P.; Jager, L.A.; Scheffer, T. Oculomotoric Biometric Identification under the Influence of Alcohol and Fatigue. In Proceedings of the 2022 IEEE International Joint Conference on Biometrics (IJCB), IEEE, Abu Dhabi, United Arab Emirates, 10–13 October 2022; pp. 1–9. [Google Scholar] [CrossRef]
Dharani, N.P.; Ismail, M.; Vidhya, M. Drunk and Drive Detection System for Safety Driving. In Proceedings of the 2022 IEEE International Conference on Current Development in Engineering and Technology (CCET), IEEE, Bhopal, India, 23–24 December 2022; pp. 1–5. [Google Scholar] [CrossRef]
Wu, C.K.; Tsang, K.F.; Chi, H.R.; Hung, F.H. A Precise Drunk Driving Detection Using Weighted Kernel Based on Electrocardiogram. Sensors 2016, 16, 659. [Google Scholar] [CrossRef] [PubMed]
Li, Z.; Wang, H.; Zhang, Y.; Zhao, X. Random forest–based feature selection and detection method for drunk driving recognition. Int. J. Distrib. Sens. Netw. 2020, 16, 1550147720905234. [Google Scholar] [CrossRef]
Marple-Horvat, D.E.; Cooper, H.L.; Gilbey, S.L.; Watson, J.C.; Mehta, N.; Kaur-Mann, D.; Wilson, M.; Keil, D. Alcohol badly affects eye movements linked to steering, providing for automatic in-car detection of drink driving. Neuropsychopharmacol. Off. Publ. Am. Coll. Neuropsychopharmacol. 2008, 33, 849–858. [Google Scholar] [CrossRef]
Islam, M.H.; Khandoker, A.A.; Safi Sami, T.M.; Talukder, T.I.; Rahman, M.I.; Sarkar, P.K. Car Accident Prevention And Health Monitoring System For Drivers. In Proceedings of the 2021 IEEE Region 10 Symposium (TENSYMP), IEEE, Jeju, Republic of Korea, 23–25 August 2021; pp. 1–6. [Google Scholar] [CrossRef]
Ljungblad, J.; Hök, B.; Allalou, A.; Pettersson, H. Passive in-vehicle driver breath alcohol detection using advanced sensor signal acquisition and fusion. Traffic Inj. Prev. 2017, 18, S31–S36. [Google Scholar] [CrossRef]
Fraunhofer-Institut für Integrierte Schaltungen IIS. In-Cabin Sensing für die Analyse des Fahrerzustands. 2024. Available online: https://www.iis.fraunhofer.de/de/ff/sse/sse-automotive.html (accessed on 29 September 2024).
EU Parlament. Regulation (EU) 2019/2144 of the European Parliament and of the Council. 2019. Available online: https://eur-lex.europa.eu/eli/reg/2019/2144/oj (accessed on 29 September 2024).
ABI Research. Camera-Based Driver Monitoring Systems to be Chief Enablers of Safe, Semi-Autonomous Driving. 2016. Available online: https://www.abiresearch.com/press/camera-based-driver-monitoring-systems-be-chief-en (accessed on 29 September 2024).
Fillmore, M.T.; Blackburn, J.S.; Harrison, E.L.R. Acute disinhibiting effects of alcohol as a factor in risky driving behavior. Drug Alcohol Depend. 2008, 95, 97–106. [Google Scholar] [CrossRef]
Zhang, X.; Zhao, X.; Du, H.; Ma, J.; Rong, J. Effect of different breath alcohol concentrations on driving performance in horizontal curves. Accid. Anal. Prev. 2014, 72, 401–410. [Google Scholar] [CrossRef]
Li, Z.; Li, X.; Zhao, X.; Zhang, Q. Effects of Different Alcohol Dosages on Steering Behavior in Curve Driving. Hum. Factors 2019, 61, 139–151. [Google Scholar] [CrossRef]
Helland, A.; Jenssen, G.D.; Lervåg, L.E.; Westin, A.A.; Moen, T.; Sakshaug, K.; Lydersen, S.; Mørland, J.; Slørdal, L. Comparison of driving simulator performance with real driving after alcohol intake: A randomised, single blind, placebo-controlled, cross-over trial. Accid. Anal. Prev. 2013, 53, 9–16. [Google Scholar] [CrossRef] [PubMed]
Helland, A.; Lydersen, S.; Lervåg, L.E.; Jenssen, G.D.; Mørland, J.; Slørdal, L. Driving simulator sickness: Impact on driving performance, influence of blood alcohol concentration, and effect of repeated simulator exposures. Accid. Anal. Prev. 2016, 94, 180–187. [Google Scholar] [CrossRef]
Yao, Y.; Zhao, X.; Du, H.; Zhang, Y.; Zhang, G.; Rong, J. Classification of Fatigued and Drunk Driving Based on Decision Tree Methods: A Simulator Study. Int. J. Environ. Res. Public Health 2019, 16, 1935. [Google Scholar] [CrossRef] [PubMed]
Ramaekers, J.G.; Robbe, H.W.J.; O’Hanlon, J.F. Marijuana, alcohol and actual driving performance. Hum. Psychopharmacol. Clin. Exp. 2000, 15, 551–558. [Google Scholar] [CrossRef]
Yadav, A.K.; Velaga, N.R. Modelling the relationship between different Blood Alcohol Concentrations and reaction time of young and mature drivers. Transp. Res. Part F Traffic Psychol. Behav. 2019, 64, 227–245. [Google Scholar] [CrossRef]
Čulík, K.; Kalašová, A.; Štefancová, V. Evaluation of Driver’s Reaction Time Measured in Driving Simulator. Sensors 2022, 22, 3542. [Google Scholar] [CrossRef] [PubMed]
Li, Y.C.; Sze, N.N.; Wong, S.C.; Yan, W.; Tsui, K.L.; So, F.L. A simulation study of the effects of alcohol on driving performance in a Chinese population. Accid. Anal. Prev. 2016, 95, 334–342. [Google Scholar] [CrossRef]
Makowski, S.; Bätz, A.; Prasse, P.; Jäger, L.A.; Scheffer, T. Detection of Alcohol Inebriation from Eye Movements. Procedia Comput. Sci. 2023, 225, 2086–2095. [Google Scholar] [CrossRef]
Silva, J.B.S.; Cristino, E.D.; de Almeida, N.L.; de Medeiros, P.C.B.; Santos, N.A.D. Effects of acute alcohol ingestion on eye movements and cognition: A double-blind, placebo-controlled study. PLoS ONE 2017, 12, e0186061. [Google Scholar] [CrossRef]
Watten, R.G.; Lie, I. The effects of alcohol on eye movements during reading. Alcohol Alcohol. 1997, 32, 275–280. [Google Scholar] [CrossRef] [PubMed]
Zhang, P.; Guo, Y.; Qiao, Y.; Yan, N.; Zhang, Y.; Ren, W.; Zhang, S.; Wu, D. Acute Alcohol Intake Affects Internal Additive Noise and the Perceptual Template in Visual Perception. Front. Neurosci. 2022, 16, 873671. [Google Scholar] [CrossRef] [PubMed]
Roche, D.J.O.; King, A.C. Alcohol impairment of saccadic and smooth pursuit eye movements: Impact of risk factors for alcohol dependence. Psychopharmacology 2010, 212, 33–44. [Google Scholar] [CrossRef] [PubMed]
Gressner, A.M.; Gressner, O.A. Widmark-Formel. In Lexikon der Medizinischen Laboratoriumsdiagnostik; Gressner, A.M., Arndt, T., Eds.; Springer: Berlin/Heidelberg, Germany, 2017; p. 1. [Google Scholar] [CrossRef]

Figure 1. Blue: relative crash risk in relation to blood alcohol concentration (BAC) [7,8]; red: the legal limit in most EU countries.

Figure 2. Simulator setup: overview of the driving simulator.

Figure 3. City Map with overview over the events (E). The orange, green, and red events are traffic light events with the respective colors of a traffic light. The blue events are stop events, where the driver has to stop due to an event.

Figure 4. Overview of the driving routes.

Figure 5. Confusion matrix of logistic regression: complete track with camera data.

Figure 6. Confusion matrix of random forest: rural road with fusion at data level.

Figure 7. Confusion matrix of XGBoost: city with fusion on interpretation level.

Table 1. Number of significant indicators in driving behavior.

Driving Behavior	Complete Track	City	Rural Road	Highway
Acceleration Behavior	5 of 7	4 of 6	6 of 8	5 of 8
Braking Behavior	4 of 4	4 of 4	3 of 3	3 of 3
Speed Behavior	3 of 4	5 of 5	5 of 6	5 of 7
Steering Behavior	4 of 4	5 of 7	7 of 9	4 of 4
Total	16 of 19	17 of 20	21 of 26	17 of 22

Table 2. Model performance of logistic regression.

Data	Scenario	Accuracy_Train	Accuracy_Test	F1_Value	Precision	Recall
Vehicle	City	67.38	56.67	55.17	57.14	53.33
Camera	City	70.83	58.33	56.14	59.26	53.33
Fusion_Data	City	83.21	56.67	56.67	56.67	56.67
Fusion_Interpretation	City	78.81	60.00	60.00	60.00	60.00
Vehicle	Rural	66.79	55.00	55.74	54.84	56.67
Camera	Rural	68.69	55.00	54.24	55.17	53.33
Fusion_Data	Rural	75.00	51.67	47.27	52.00	43.33
Fusion_Interpretation	Rural	70.36	50.00	50.00	50.00	50.00
Vehicle	Highway	65.71	56.67	53.57	57.69	50.00
Camera	Highway	62.38	51.67	50.85	51.72	50.00
Fusion_Data	Highway	82.86	61.67	62.30	61.29	63.33
Fusion_Interpretation	Highway	74.40	58.33	60.32	57.58	63.33
Vehicle	Complete Track	69.40	60.00	60.00	60.00	60.00
Camera	Complete Track	78.69	68.33	68.85	67.74	70.00
Fusion_Data	Complete Track	96.79	65.00	63.16	66.67	60.00
Fusion_Interpretation	Complete Track	82.98	66.67	66.67	66.67	66.67

Table 3. Model performance of random forest.

Data	Scenario	Accuracy_Train	Accuracy_Test	F1_Value	Precision	Recall
Vehicle	City	99.76	58.33	59.02	58.06	60.00
Camera	City	99.88	58.33	57.63	58.62	56.67
Fusion_Data	City	100.00	63.33	59.26	66.67	53.33
Fusion_Interpretation	City	100.00	70.00	68.97	71.43	66.67
Vehicle	Rural	99.52	58.33	57.63	58.62	56.67
Camera	Rural	99.52	58.33	52.83	60.87	46.67
Fusion_Data	Rural	99.88	73.33	72.41	75.00	70.00
Fusion_Interpretation	Rural	100.00	60.00	53.85	63.64	46.67
Vehicle	Highway	99.52	48.33	52.31	48.57	56.67
Camera	Highway	99.76	51.67	52.46	51.61	53.33
Fusion_Data	Highway	99.88	45.00	44.07	44.83	43.33
Fusion_Interpretation	Highway	100.00	48.33	53.73	48.65	60.00
Vehicle	Complete Track	99.76	65.00	64.41	65.52	63.33
Camera	Complete Track	99.64	56.67	58.06	56.25	60.00
Fusion_Data	Complete Track	99.76	58.33	56.14	59.26	53.33
Fusion_Interpretation	Complete Track	100.00	58.33	56.14	59.26	53.33

Table 4. Model performance of XGBoost.

Data	Scenario	Accuracy_Train	Accuracy_Test	F1_Value	Precision	Recall
Vehicle	City	100.00	56.67	56.67	56.67	56.67
Camera	City	100.00	60.00	62.50	58.82	66.67
Fusion_Data	City	100.00	56.67	58.06	56.25	60.00
Fusion_Interpretation	City	100.00	63.33	63.33	63.33	63.33
Vehicle	Rural	100.00	51.67	50.85	51.72	50.00
Camera	Rural	99.76	58.33	54.55	60.00	50.00
Fusion_Data	Rural	100.00	63.33	60.71	65.38	56.67
Fusion_Interpretation	Rural	100.00	55.00	50.91	56.00	46.67
Vehicle	Highway	100.00	41.67	46.15	42.86	50.00
Camera	Highway	99.88	41.67	40.68	41.38	40.00
Fusion_Data	Highway	100.00	48.33	49.18	48.39	50.00
Fusion_Interpretation	Highway	100.00	45.00	47.62	45.45	50.00
Vehicle	Complete Track	100.00	61.67	62.30	61.29	63.33
Camera	Complete Track	99.88	60.00	62.50	58.82	66.67
Fusion_Data	Complete Track	100.00	56.67	56.67	56.67	56.67
Fusion_Interpretation	Complete Track	100.00	63.33	63.33	63.33	63.33

Table 5. Best model performance in scenarios.

Scenario	Model	Data	Accuracy
City	Random Forest	Fusion Interpretation Level	70
Rural Road	Random Forest	Fusion Data Level	73
Highway	Logistic Regression	Fusion Data Level	61.67
Complete Track	Logistic Regression	Camera	68.3

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Fusion of Driving Behavior and Monitoring System in Scenarios of Driving Under the Influence: An Experimental Approach

Featured Application

Abstract

1. Introduction

2. Related Work

3. Study Design

3.1. Participants

Recruiting Criteria

3.2. Experimental Procedure

3.3. Simulator

3.4. Scenarios

4. Method

4.1. Method of Significance Analysis

4.2. Method of Classification

5. Results

5.1. Significance Testing

5.2. Analysis of Gaze and Driving Behavior

5.2.1. Acceleration Behavior

5.2.2. Braking Behavior

5.2.3. Speed Behavior

5.2.4. Steering Behavior

5.2.5. Reaction Behavior

5.2.6. Fixations and Saccades

5.2.7. Eye-Steering Coordination

5.3. Classification Algorithms for DUI

5.3.1. Logistic Regression

5.3.2. Random Forest

5.3.3. XGBoost

5.3.4. Comparison of the Models

5.4. LSTM Network

6. Discussion

6.1. Features

6.2. Models

6.3. Limitations

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Ethical Framework

Appendix A.1. Fundamental Points

Appendix A.2. Study Preparation

Appendix A.3. Study Conduct

Appendix B. Recruting Criteria

Appendix C. Description of the City Events

Appendix D. Features for Significant Test

References

Article Metrics

Citations

Article Access Statistics