Next Article in Journal
A Network Analysis Approach to Detecting Social Issues with Web-Based Data
Previous Article in Journal
Mobile Charging Scheduling Approach for Wireless Rechargeable Sensor Networks Based on Multiple Discrete-Action Space Deep Q-Network
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Decision-Making Model for Dynamic Scenario Vehicles in Autonomous Driving Simulations

The State Key Laboratory of Automotive Simulation and Control, Jilin University, Changchun 130025, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(14), 8515; https://doi.org/10.3390/app13148515
Submission received: 14 June 2023 / Revised: 14 July 2023 / Accepted: 21 July 2023 / Published: 23 July 2023

Abstract

:
A scenario vehicle in autonomous driving simulations is a dynamic entity that is expected to perform trustworthy bidirectional interaction tasks with the autonomous vehicle under test. Modeling interactive behavior can not only facilitate better prediction of human drivers’ intentions and motions but also be valuable in generating more human-like decisions and trajectories for autonomous vehicle testing. However, simulations of most of the available scenario vehicles on existing platforms behave conservatively. This study summarizes five driving motivations based on human-need theories of multiple psychologists, namely safety, dominance, achievement, order, and relatedness, and organizes the framework using a behavior tree. The proposed model generates different driving behaviors by simulating the changing psychological needs of human drivers during vehicle operation. Using a self-developed two-dimensional simulator, experiments were conducted by considering multiple scenarios in urban, rural, and highway road sections. The obtained results indicate that the scenario vehicles controlled by the proposed model exhibit a significant interactive nature, facilitating proactive communication rather than providing simple responses.

1. Introduction

As fidelity significantly affects the response of a tested autonomous vehicle, in a more realistic virtual world, the validity and acceptance of tests in a simulation are higher [1,2]. Therefore, to enhance their realism, important elements such as vehicle dynamics, sensor models, and rendering technologies can be easily incorporated into the existing automated driving simulations. This is because the nature of their work remains unchanged before or after the integration and provides an adequate balance between performance and efficiency [3]. However, a primary element of this test environment is the scenario vehicle, which is expected to perform trustworthy interaction bidirectional tasks with the autonomous vehicle under test. With this emerging demand for realistic behaviors of scenario vehicles [1], the accurate simulation of the behaviors of human drivers is becoming increasingly challenging [4].
One of the major issues in the existing models is their conservative behavior, as they can only respond negatively to the environment or other agents instead of taking assertive actions. Another problem is the underlying methodology, wherein a large proportion of the models use the finite-state machine (FSM), Markov decision process, or neural networks, which cannot be easily modified. However, rapid adjustments to the behaviors of the scenario vehicles are often necessary during simulation tests.
To address the aforementioned problems, this study proposes a dynamic scenario vehicle model considering the perspective of a human driver. Based on the theories of Maslow and Reiss [5,6], every action is considered to be determined by an underlying motive. The basic framework of the model is organized using a behavior tree, which describes switching between the finite sets of tasks in a modular manner. This implies that the actions performed, decisions made, and transfers between states are visible and easy to compose.
Simulations were conducted using a self-developed two-dimensional (2D) simulator, which currently supports an earlier version of the OpenDRIVE format map description file. The results indicate that scenario vehicles equipped with the proposed model successfully perform long-distance driving tasks to reach their targets and demonstrate interactions with other vehicles.

2. Related Work

Typically, scenario vehicle models in open-source autonomous driving simulators are designed to follow certain simple yet hard-coded rules. Their behaviors may exhibit a limited number of predefined driving styles in terms of lane changing, car following, and collision avoidance [7,8]. Nevertheless, in comparison with the more conservative tracking modes of open-source platforms, commercial software, such as Automotive Artificial Intelligence (AAI) and Cognata, generally uses artificial intelligence to build decision-making systems for scenario vehicles in simulations. As the development and adjustment of such models are expensive, the range of scenarios is extremely limited.
Since the proposal of the car-following model in the 1950s, the continuous optimization of the microscopic traffic simulation has rendered it highly mature. Several researchers have attempted to create a dynamic driving environment in the aforementioned simulator, with traffic participants provided by microscopic traffic simulations [9,10,11]. However, some researchers have argued that directly deploying microscopic traffic simulations may cause significantly biased evaluation results during the testing of autonomous vehicles. Therefore, the parameter values of scenario vehicles are adjusted such that different levels of aggressiveness are produced [12] or their behaviors appear statistically closer to reality [2]. However, the background requirement of traffic flow simulation is significant in traffic engineering, which is used to accurately reproduce the macroscopic characteristics [13]. Although these models have achieved the desired granularity in autonomous driving tests, they are over-simplified in their dynamics [14], implicitly use safe rather than actual behaviors [15], and are unrealistic at the inter-individual level despite calibrations with real-world data [4].
Recently, certain researchers have attempted to eliminate the conventional framework of microscopic traffic simulation and develop more realistic interactive models. Chao et al. [16] proposed a force-based framework that could replicate the sophisticated behaviors of various road users and their interactions in a unified manner. Wang et al. [17] introduced a cognition-driven traffic simulation method capable of generating virtual drivers with heterogeneous behaviors in unstructured road networks. Additionally, researchers at the University of Michigan have addressed the need for a more realistic and interactive model using the “level-k” game theory [18,19,20]. Several researchers have also borrowed popular development methods from autonomous driving. O’Kelly et al. [21] trained the policies of scenario vehicles using imitation learning to mimic human-like driving behaviors. However, these new models provide staged results as they can be applied only in specific scenarios.

3. Driving Motives Analysis

Psychologists have proposed numerous theories to explain what motives are and how they function, and these theories are now widely used in various fields. However, only a few studies have been conducted on the systematic evaluation of driver’s behavior [22]. This section presents different research studies conducted on psychology theories of motivation and human needs. The aim is to analyze the motives that affect the driver’s operation during driving, which are safety, dominance, achievement, order, and relatedness.

3.1. Safety

Safety is one of the most basic motives for survival. When faced with external threats, it compels individuals to avoid risks and harm. Safety motives find their closest associations with “flight” and “repulsion” in McDougall’s list of instincts [23], “defendance” and “harmavoidance” in Murray’s personology [24], “safety” in Maslow’s hierarchy of needs [5], “existence” in Existence, Relatedness, and Growth (ERG) theory [25], and “tranquility” in Reiss’s theory of 16 basic desires [6].
For drivers, the safety motive represents the most basic requirement to ensure their survival and prevent collision. Only when the safety motive is met can the driver feel safe and comfortable while driving without excessive psychological burden. Summala’s “Safety Margin” is a model used to describe safety needs in time and space [22]. In addition, Taubman–Ben-Ari’s “life endangerment” in the Driving Costs and Benefits Questionnaire (DCBQ) [26] and Hoedemaeker’s “risk or safety” are also manifestations of safety motives [27].

3.2. Dominance

The motive for dominance pertains to the need to exert influence on the environment to fulfill one’s desires. The motive for dominance is related to the “dominance” of Murray’s personology [24], the “power” of McClelland’s achievement motivation theory [28], the “power” of Reiss’s 16 basic desires [6], and the “competence” of Deci and Ryan’s Self-Determination Theory (SDT) [29].
In the context of driving, the dominance motive is evident in the driver’s active control of speed and direction. Summala suggested that drivers derive pleasure from maintaining both normal speeds and pushing the limits for thrills, likely because of their sense of dominance over the vehicle [22]. This notion is also supported by Taubman–Ben-Ari’s “sense of control” in the DCBQ [26] and Rumar’s goal of speed control [30].

3.3. Achievement

Achievement motives typically involve purposefulness, encompassing the need to accomplish specific goals or tasks and actualize personal ideals. “Achievement” in Murray’s personology [24], “achievement” in McClelland’s achievement motivation theory [28], “self-actualization” in Maslow’s hierarchy of needs [5], and “growth” in Alderfer’s ERG Theory [25] are all related to the achievement requirements during driving presented in this section.
For drivers, the primary objective is to reach their destination [30], which represents the “achievement” they strive to accomplish during a driving task. Summala’s “good or expected progress of trip” also reflects the driver’s achievement motive [22]. The forced lane change in the micro-traffic flow model is a typical lane-changing behavior driven by the motive for achievement in order to reach the destination [31].

3.4. Order

The order motive refers to the need to adhere to norms and avoid punishment. Although there are limited discussions on the importance of order in various theories of needs, Murray’s personology [24] explores the concept of “blamavoidance”, while Reiss’s 16 basic desires touch upon the concept of “honor” [6], which is somewhat related.
For drivers, the order motivation translates into specific rules that must be followed. These rules include formal traffic laws such as obeying traffic signals, driving in designated lanes, and adhering to speed limits [30,32]. Additionally, there are informal rules to consider [33]. Adhering to these rules not only helps drivers avoid accidents and penalties associated with rule violations [22] but also enhances overall driving efficiency, prevents traffic congestion and accidents, and saves time and money.

3.5. Relatedness

Relatedness refers to the desired state of being with others, which can manifest in positive ways such as acceptance, respect, and reciprocity or negative ways such as refusal to cooperate, exclusion, and confrontation. In McDougall’s list of instincts, relational motives are related to two instincts: “pugnacity” and “gregariousness” [23]. Murray’s personology identifies interacting with others as “abasement”, “rejection”, “narcissism”, “affiliation”, and “succorance” [24]. In contrast to the previous two theories, where there are several distinct states, in Reiss’s theory of 16 basic desires, the two desires “acceptance” and “social contact” are relatively positive expressions [6]. In addition, there are “affiliative motives” in McClelland’s achievement motivation theory [28], “belonging” in Maslow’s hierarchy of needs [5], “relatedness” summarized in Alderfer’s ERG theory [25], and “relatedness” in SDT [29].
The desire for connection involves drivers’ inclination to cooperate with fellow traffic participants, seek approval and acknowledgment, or express disagreement when alerted by others. For example, if a vehicle in a nearby lane merges into the driver’s lane, the driver may adapt their speed and increase the following distance to facilitate the merging process if they accept or approve of the other vehicle’s actions. Conversely, if the driver disapproves or opposes the merging, they may maintain or slightly increase their speed while decreasing the following distance to impede the other vehicle from merging.

4. Decision-Making Model for Dynamic Scenario Vehicles

Maslow’s theory states that human needs can be expressed hierarchically from a low to high level [5]. However, the motivations for driving are not identical to those in the regular pyramid of needs. Therefore, the five motivations mentioned in the previous subsection are prioritized as follows in this study.
P r i o r i t y : s a f e t y > o r d e r > a c h i e v e m e n t > r e l a t e d n e s s > d o m i n a n c e
For instance, the priority of safety motivation is higher than that of order motivation. Therefore, the scenario vehicle may consider order motivation only when the environment is sufficiently safe. Additionally, once a high-priority motivation appears, the scenario vehicle should discard the current motivation, despite the implemented behavior being generated from a low-priority motivation. For instance, if the safety motivation occurs when the scenario vehicle is implementing the behavior generated by the achievement motivation, the vehicle should immediately terminate this behavior and perform the behavior associated with the safety motivation.
The study assumed that the behavior layer of the scenario vehicle determines the three main types of behaviors B : changing speed ( B c h a n g e S p e e d ), changing lane ( B c h a n g e L a n e ), and maintaining current motion ( B i d l e ). Each behavior contains several actions. Among these, the B c h a n g e S p e e d set includes five actions: maintaining safe headway ( b h e a d w a y c s ), emergent deceleration ( b e m e r g e n t c s ), slowing down and stopping ( b s t o p c s ), complying with the speed limit ( b l i m i t c s ), and giving way to others ( b m o r a l i t y c s ). The B c h a n g e L a n e set includes four actions—that is, changing lanes to the target lane ( b d e s t i n a t i o n c l ), changing lanes to the faster lane ( b e x c i t e m e n t c l ), changing lanes to a random adjacent lane ( b r a n d o m c l ), and aborting changing lanes ( b a b o r t c l ). The B i d l e set includes two actions: maintaining the current speed and lane ( b n o r m a l c n ) and maintaining current speed and lane while alerting the front vehicle ( b a l e r t c n ).
Figure 1 depicts the overall structure of the proposed decision-making model. At the core of the behavior tree lies a selector node, comprising five child nodes representing different levels of motivation, along with a default state to ensure the tree’s accuracy (Figure 1a). In this research, behavior trees were utilized to represent behavioral decisions, as they are widely employed in designing intelligent behaviors across various systems. These trees offer a hierarchical framework that facilitates the organization of smaller behavioral units, known as nodes, enabling the creation of complex and adaptable behaviors.
The execution of a behavior tree initiates from the root, which sends periodic ticks to its child nodes. A tick serves as a triggering signal for the execution of a child node. When a node in the behavior tree becomes eligible for execution, it communicates its status to the parent node, indicating whether it is “running” (ongoing execution), “success” (goal achieved), or “failure” (goal not achieved).
For this study, three control nodes (sequence, selector, and random selector) and two execution nodes (action and condition) were employed. The sequence node searches and executes the first child that has not yet succeeded. The selector node identifies and executes the first child that does not fail. The random selector node randomly selects and executes one of its child nodes. The condition node evaluates a specific condition and returns a success or failure status based on the evaluation. Finally, an action node represents a specific action or behavior that the agent or system should perform.

4.1. Safety

Typically, drivers do not intend to cause or be victims in accidents when driving vehicles. To this end, they maintain a certain distance from their surrounding vehicles. As depicted in Figure 1b, the longitudinal safety motivation originates from the judgment of the distance between a scenario vehicle ( v e h s v ) and the vehicle in front of it ( v e h f r o n t ). If v e h s v is extremely close to v e h f r o n t , v e h s v adjusts its distance from v e h f r o n t via the action b h e a d w a y c s . If this action fails to protect v e h s v from the dangerous situation, v e h s v applies emergency brakes through b e m e r g e n t c s , immediately lowering the speed and increasing its distance from v e h f r o n t until the scenario vehicle stops or avoids danger. Considering that b e m e r g e n t c s may encounter a failure in motion planning, an action b e m e r g e n t c s is added beyond the sequence nodes to handle emergency situations.
The lateral safety motivation originates from the judgment of the distance between v e h s v and adjacent vehicles on the left and right sides ( v e h s i d e ). If v e h s v is laterally close to v e h s i d e and is changing lanes, v e h s v should immediately stop changing lanes and return to the original lane via the action b a b o r t c l . If v e h s v is not changing lanes, it is expected to avoid collision with adjacent vehicles on both sides because of the longitudinal emergency deceleration behavior ( b e m e r g e n t c s ).

Safety Distance

Figure 2 depicts the rectangular safety field that exists around each v e h s v . The safety field changes in real time with the speed and acceleration of v e h s v . A potentially dangerous situation occurs when the safety fields of two vehicles intersect. Based on the responsibility-sensitive safety model, we calculated the size of the safety field as follows:
L b s a f e = 0.5 + κ 1 l + v l o n g + a a c c m a x ρ 2 2 a b r k
L a s a f e = v l o n g ρ + 1 2 a a c c m a x ρ 2 + L b s a f e
W a = κ 2 w + v l a t ρ + 1 2 a l a t ρ 2 + v l a t + a l a t ρ 2 2 a l a t m a x
W b = 2 κ 2 w W a
where L a s a f e and L b s a f e denote the distances from the center of v e h s v to the front and rear edges of the safety field, respectively; W a and W b indicate the distances from the left and right edges of v e h s v respectively, to the corresponding edges of the safety field; l and w represent the length and width of v e h s v , respectively; κ 1 and κ 2 denote the initial expansion coefficients of the length and width, respectively; ρ indicates the driver’s reaction time; v l o n g represents the current longitudinal velocity; a a c c m a x denotes the longitudinal maximum acceleration; a b r k indicates the current longitudinal deceleration; v l a t represents the current lateral velocity; a l a t denotes the current lateral acceleration; and a l a t m a x indicates the maximum lateral acceleration.

4.2. Order

The traffic rules at intersections without signal lights are temporarily applied in the model. In other words, all vehicles pass in the order in which they reach the intersection. If other vehicles exist at the intersection ahead, a scenario vehicle should stop before the stop line ( b s t o p c s ) and wait in line ( b n o r m a l c n ), as depicted in Figure 1c.

4.3. Achievement

As depicted in Figure 1d, v e h s v estimates whether the remaining distance d l e f t in the current road section is sufficient to change from the current lane to the nearest target lane for a forced lane change. The achievement motivation does not emerge if the remaining distance is sufficient. If the remaining distance is insufficient, v e h s v further checks whether the current lane is the target lane for forced lane change. If this criterion is satisfied, v e h s v maintains the current driving state via the action b n o r m a l c n ; otherwise, v e h s v switches to the target lane ( b d e s t i n a t i o n c l ).

Remaining Distance Estimation

Multiple target lanes exist for the forced lane change, which can be considered as l a n e s on road section R .
L a n e s = i = 1 N l a n e s i , N 1
The target lane l a n e t closest to the current lane l a n e s v is selected from L a n e s . The function I D x is the identification (ID) of the acquired lane.
L a n e D i s t a n c e l a n e = a b s I D l a n e I D l a n e s v
l a n e t = a r g m i n L a n e D i s t a n c e l a n e s , l a n e s L a n e s
For the sake of computational convenience, the following condition was assumed:
0 < I D l a n e s v I D l a n e t
Considering the average vehicle speed v ¯ R i on each lane of the road section R , we estimated the longitudinal distance d s v t from l a n e s v to l a n e t as follows:
d s v t = i = I D l a n e s v I D l a n e t 1 v R i τ w + v R i + v R i + 1 2 τ c
where τ w denotes the time that v e h s v waits to change lanes, and τ c indicates the time required by v e h s v to change lanes. Furthermore,
d l e f t = d l e f t d 0
where d 0 denotes the distance reserved in the road section, and d l e f t represents the actual remaining distance of the road section.
The goal motivation emerges only if d s v t > d l e f t .

4.4. Relatedness

As depicted in Figure 1e, the entire decision-making process for relatedness motivation involves two perspectives. The first perspective is when v e h s v is considered as the front vehicle. During the lane changing process (Figure 3a), if the rear vehicle v e h r e a r sends a signal to v e h s v expecting v e h s v to give way, then v e h s v can choose to continue changing lanes based on a preset probability ( b d e l e g a t e c l ) or abort the lane change and return to the original lane ( b a b o r t c l ). When the motivation for lane change is unclear, action b d e l e g a t e c l can be used to represent the lane change from a lower level. When v e h r e a r sends a negotiation signal during normal driving along the lane (Figure 3b), v e h s v can increase speed ( b l i m i t c s ), maintain current speed ( b n o r m a l c n ), or randomly change lanes and move to the adjacent lane ( b r a n d o m c l ).
The second perspective considers v e h s v as a rear vehicle. A possible scenario is that a vehicle in the adjacent lane v e h s i d e merges with the lane in front of v e h s v and expects v e h s v to maintain a safe distance or reduce its speed (Figure 3c). v e h s v can agree to this request and adjust its speed ( b m o r a l i t y c s ) to help v e h s i d e change lanes or maintain the current state ( b n o r m a l c n ) and prevent v e h s i d e from changing lanes.

4.5. Dominance

As depicted in Figure 1f, the decision-making process for dominance motivation comprises two stages. In the first stage, v e h s v compares its speed v s v with that of v e h f r o n t in the current lane. If v f r o n t < v s v , v e h s v can remind v e h f r o n t to increase speed appropriately ( b a l e r t c n ) or move to the faster adjacent lane ( b e x c i t e m e n t c l ). If the current lane is not blocked by v e h f r o n t , it implies that the speed of v e h f r o n t is greater than that of v e h s v or that no vehicle exists in the front. In this case, v e h s v proceeds to the subsequent stage, wherein the current speed v s v is compared with the expected speed v s v e of v e h s v . If v s v < v s v e , v e h s v increases the speed via the action b l i m i t c s . If v s v = v s v e , the model exits the judgment of dominance motivation.

4.6. Default

When no motivation is satisfied, a default state must be set to ensure normal operation of the entire decision model. In this state, the driver chooses to maintain the current state b n o r m a l c n or randomly change lanes b r a n d o m c l , as depicted in Figure 1g.

5. Experiments

The experiments were conducted on a MacBook Pro (16-inch, 2019), with 2.3 GHz Intel Core i9, 32 GB 2667 MHz DDR4, and AMD Radeon Pro 5500M 8 GB. The simulation platform used in this study was a self-developed 2D simulator based on Unity 2020.3.23f1. The platform comprises four key components: camera, map, static environment, and dynamic environment. The camera serves the purpose of capturing the surrounding environment and monitoring the movements of the scenario vehicles. The map component converts map data from OpenDRIVE format into visual images that are then integrated into the simulation environment. The static environment oversees the management of roads and intersections, providing accurate information for scenario vehicle perception. Regarding the dynamic environment, it exclusively encompasses the scenario vehicles themselves. The programs were primarily coded in C#, with the exception of the map drawing program, which was developed using Python. Figure 4 illustrates the map used for the experiments; it comprises urban, rural, and highway areas on which the experiments were performed to evaluate the proposed model. All the scenario vehicles used in the simulation were equipped with the same decision-making model considering different preferences. As the model contains multiple states, different features were validated in three separate experiments.
  • Weaving ability in multilane traffic (Scenario 1): a scenario vehicle is expected to cross groups of vehicles from the rear on multilane highways to increase its speed.
  • Handling unsignalized traffic junctions (Scenario 2): three vehicles converge at a junction and wait for the vehicles that arrived earlier to leave before entering the junction.
  • Cooperation between vehicles on rural roads (Scenario 3): a vehicle is expected to slow down to match the speed of an adjacent vehicle to change lanes.

5.1. Scenario 1

Figure 5 depicts that at t = 0 , V e h 1000 , V e h 1001 , and V e h 1002 were driving on lanes 2, 3, and 4 of R o a d 5188 , respectively, and V e h 1003 was driving approximately 50 m behind V e h 1000 . All four vehicles were in the default motivation and maintained the current state (Figure 6a) with a constant speed of approximately 10 m/s. V e h 1004 accelerated from a standstill; after t = 3 , the speed of the vehicle was stable at 15 m/s. As the front vehicle V e h 1003 was slow, V e h 1004 entered the dominance motivation and began to change lanes (Figure 6b). The lane change was completed at approximately t = 6 , and V e h 1004 successfully steered into lane 3 from lane 2. Subsequently, V e h 1004 was in the default motivation, maintaining the vehicle speed constant with other vehicles. This continued until V e h 1004 gradually approached V e h 1001 , at this point V e h 1004 was blocked again and was ready to change lanes (Figure 6c). V e h 1004 completed the lane change at approximately 11 s, successfully switching from lane 3 to lane 4. After a certain period, V e h 1004 re-entered the dominance motivation and began to change lanes because the vehicle in front of it was slower (Figure 6d). However, V e h 1001 failed to notice V e h 1004 in the adjacent lane and continued to maintain its current vehicle speed. Both vehicles eventually entered the safety motivation. Although V e h 1001 applied emergency brakes and V e h 1004 reduced speed accordingly, V e h 1004 inevitably collided with V e h 1001 from the side and rear, as depicted in Figure 6e.

5.2. Scenario 2

Figure 7 depicts the motivation, action, longitudinal displacement, and speed during the simulation, and Figure 8 shows the screenshots of the top views of the five processes. Initially (Figure 8a), V e h 1000 was driving on R o a d 5188 at a speed of 10 m/s and intended to move to R o a d 5189 when passing J u n c t i o n 35 . Additionally, in the rear of V e h 1000 , V e h 1001 intended to drive towards R o a d 5106 at an initial speed of 10 m/s while passing J u n c t i o n 35 . V e h 1002 set out from standstill and intended to drive toward R o a d 5189 by passing R o a d 5106 and J u n c t i o n 35 . At approximately 12 s, V e h 1000 first arrived at J u n c t i o n 35 , slowed down, and observed the traffic. As no vehicles existed at the junction at the time, the vehicle was allowed to pass (Figure 8b). Therefore, V e h 1000 continued to increase its speed, left the junction according to the preset destination, and continued to drive along R o a d 5189 . When V e h 1001 arrived at J u n c t i o n 35 after V e h 1000 , V e h 1001 had to wait because V e h 1000 was at the junction at that time. At approximately 14 s, V e h 1000 moved away from the junction, and only then did V e h 1001 begin to speed up and pass the junction (Figure 8c). V e h 1002 slowed down and waited when V e h 1001 was passing J u n c t i o n 35 . At approximately 21 s, V e h 1001 switched to R o a d 5189 , and V e h 1002 gradually accelerated to pass J u n c t i o n 35 and drove towards R o a d 5106 .

5.3. Scenario 3

As depicted in Figure 9, V e h 1000 , V e h 1001 , and V e h 1002 drove on the left side of the reference direction on R o a d 2291 . V e h 1000 was the foremost vehicle, traveling at a speed of 9 m/s in lane 3. V e h 1001 was in lane 2, approximately 40 m behind V e h 1000 .   V e h 1002 was in the rear position, rapidly approaching V e h 1001 with the speed of 12 m/s in lane 3. After a short period, V e h 1000 believed that its speed might be affected by V e h 1001 in the adjacent lane and reminded V e h 1001 to change its current state. V e h 1001 received the reminder and attempted to increase the speed b l i m i t c s ; however, the vehicle reached its speed limit. At approximately 8 s, V e h 1002 approached V e h 1000 and reminded V e h 1000 to change its current state. Similar to V e h 1001 , V e h 1000 was unable to increase its speed despite receiving the signal from the rear vehicle. Consequently, V e h 1002 had to change lanes. As only two lanes existed in this road section, V e h 1002 could only select the adjacent lane 3 as the target lane. In this case, V e h 1002 reminded the rear vehicle V e h 1001 to cooperate for changing lanes successfully. Although V e h 1001 agreed to cooperate, it was extremely close to V e h 1002 . Therefore, V e h 1001 chose to slow down using b h e a d w a y c s , and V e h 1002 merged into the lane from the front. After V e h 1002 changed the lane, V e h 1001 entered the dominance motivation, adjusted speed appropriately, and returned to the default state m d e f a u l t . Figure 10 depicts the screenshots of the top views of the four scenarios indicating the entire process.

6. Conclusions

This study proposed a scenario vehicle decision-making model for virtual testing of autonomous driving. The developed model could decide behaviors by simulating the changing psychological needs of human drivers during driving. The motivation-based approach offered an intuitive showcase of the motivation generation process and decision-making process for actions. In every test scenario, the motives and actions chosen by the scenario vehicle were observable throughout the simulation. This addressed the interpretability challenge present in current scenario vehicle behaviors for autonomous driving simulations. Moreover, it provided reliable reference data for autonomous driving systems to accurately predict and analyze the behavior of surrounding vehicles. Several experiments were conducted to evaluate the performance of the model. Despite the promising simulation results, the model is in its preliminary stage. In certain simple scenarios, such as Scenario 3, the vehicles controlled by the model successfully completed the driving tasks and significantly increased the interactions between the vehicles. Notably, other autonomous driving simulation platforms such as Carla and VTD had limited inter-vehicle communication capabilities. Consequently, their models exhibited a high level of caution, often maintaining significant distances or resorting to emergency stops when confronted with vehicles attempting lane changes. However, in scenarios with increased traffic participants, such as Scenario 1, accidents were observed. This implies that despite several trials being performed on the same scenario without recurring accidents, the model needs to be strengthened in terms of safety. Another aspect that must be addressed is the behavioral adaptation at signalized junctions, which are crucial in urban scenarios.
In the future, in addition to refining the aforementioned aspects of the model, the efforts will be extended to explore and incorporate new methods of trajectory generation and macro statistical features to improve the overall performance and feasibility.

Author Contributions

Theoretical formalism and numerical simulations execution, Y.L.; writing the final version of the manuscript, X.J. and C.D.; project supervision, H.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Acknowledgments

I would like to thank my supervisor, Hsin Guan, for his guidance through each stage of the process.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yu, R.-J.; Ye, T.; Sun, J. Highly automated vehicle virtual testing: A review of recent developments and research frontiers. China J. Highw. Transp. 2020, 33, 125–138. [Google Scholar] [CrossRef]
  2. Yan, X.; Feng, S.; Sun, H.; Liu, H.X. Distributionally consistent simulation of naturalistic driving environment for autonomous vehicle testing. arXiv 2021, arXiv:2101.02828. [Google Scholar] [CrossRef]
  3. Fadaie, J. The state of modeling, simulation, and data utilization within industry: An autonomous vehicles perspective. arXiv 2019, arXiv:1910.06075. [Google Scholar] [CrossRef]
  4. Suo, S.; Regalado, S.; Casas, S.; Urtasun, R. Trafficsim: Learning to Simulate Realistic Multi-Agent Behaviors; IEEE Computer Society: Washington, DC, USA, 2021. [Google Scholar] [CrossRef]
  5. Maslow, A.H. A theory of human motivation. Psychol. Rev. 1943, 50, 370–396. [Google Scholar] [CrossRef] [Green Version]
  6. Reiss, S. Multifaceted nature of intrinsic motivation: The theory of 16 basic desires. Rev. Gen. Psychol. 2004, 8, 179–193. [Google Scholar] [CrossRef]
  7. Dosovitskiy, A.; Ros, G.; Codevilla, F.; Lopez, A.; Koltun, V. CARLA: An open urban driving simulator. In Proceedings of the PMLR Conference on Robot Learning, Mountain View, CA, USA, 13–15 November 2017; pp. 1–16. [Google Scholar]
  8. Rong, G.; Shin, B.H.; Tabatabaee, H.; Lu, Q.; Lemke, S.; Mozeiko, M.; Boise, E.; Uhm, G.; Gerow, M.; Mehta, S.; et al. Lgsvl simulator: A high fidelity simulator for autonomous driving. In Proceedings of the 23rd International Conference on Intelligent Transportation Systems, Rhodes, Greece, 20–23 September 2020; IEEE Publications: Piscataway, NJ, USA, 2020; Volume 2020, pp. 1–6. [Google Scholar] [CrossRef]
  9. Garzón, M.; Spalanzani, A. An hybrid simulation tool for autonomous cars in very high traffic scenarios. In Proceedings of the 15th International Conference on Control, Automation, Robotics and Vision, Singapore, 18–21 November 2018; IEEE Publications: Piscataway, NJ, USA, 2018; pp. 803–808. [Google Scholar] [CrossRef] [Green Version]
  10. Klischat, M.; Dragoi, O.; Eissa, M.; Althoff, M. Coupling sumo with a motion planning framework for automated vehicles. In Proceedings of the SUMO User Conference 2019, Berlin, Germany, 13–15 May 2019; pp. 1–9. [Google Scholar]
  11. Li, P.; Kusari, A.; LeBlanc, D.J. A novel traffic simulation framework for testing autonomous vehicles using SUMO and CARLA. arXiv 2021, arXiv:2110.07111. [Google Scholar] [CrossRef]
  12. Mavrogiannis, A.; Chandra, R.; Manocha, D. B-GAP: Behavior-rich simulation and navigation for autonomous driving. IEEE Robot. Autom. Lett. 2022, 7, 4718–4725. [Google Scholar] [CrossRef]
  13. Li, Z. Traffic Engineering, 2nd ed.; People China’s Communications Press: Beijing, China, 2017. (In Chinese) [Google Scholar]
  14. Xu, W.; ZhaYao, W.; Zhao, H.; Zha, H. A vehicle model for micro-traffic simulation in dynamic urban scenarios. In Proceedings of the IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; Volume 2011, pp. 2267–2274. [Google Scholar] [CrossRef]
  15. Young, W.; Sobhani, A.; Lenné, M.G.; Sarvi, M. Simulation of safety: A review of the state of the art in road safety simulation modelling. Accid. Anal. Prev. 2014, 66, 89–103. [Google Scholar] [CrossRef] [PubMed]
  16. Chao, Q.; Jin, X.; Huang, H.W.; Foong, S.; Yu, L.F.; Yeung, S.K. Force-based heterogeneous traffic simulation for autonomous vehicle testing. In Proceedings of the IEEE International Conference on Robotics and Automation, Montreal, QC, USA, 20–24 May 2019; Volume 2019, pp. 8298–8304. [Google Scholar] [CrossRef]
  17. Wang, H.; He, X.Y.; Chen, L.Y.; Yin, J.R.; Han, L.; Liang, H.; Zhu, F.; Zhu, R.; Gao, Z.; Xu, M. Cognition-driven traffic simulation for unstructured road networks. J. Comput. Sci. Technol. 2020, 35, 875–888. [Google Scholar] [CrossRef]
  18. Li, N.; Oyler, D.W.; Zhang, M.; Yildiz, Y.; Kolmanovsky, I.; Girard, A.R. Game theoretic modeling of driver and vehicle interactions for verification and validation of autonomous vehicle control systems. IEEE Trans. Control Syst. Technol. 2017, 26, 1782–1797. [Google Scholar] [CrossRef] [Green Version]
  19. Tian, R.; Li, N.; Kolmanovsky, I.; Yildiz, Y.; Girard, A.R. Game-theoretic modeling of traffic in unsignalized intersection network for autonomous vehicle control verification and validation. IEEE Trans. Intell. Transport. Syst. Trans. IEEE 2020, 23, 2211–2226. [Google Scholar] [CrossRef]
  20. Wang, X.; Peng, H.; Zhang, S.; Lee, K.H. An interaction-aware evaluation method for highly automated vehicles. In Proceedings of the IEEE International Intelligent Transportation Systems Conference, Indianapolis, IN, USA, 19–22 September 2021; Volume 2021, pp. 394–401. [Google Scholar] [CrossRef]
  21. O’Kelly, M.; Sinha, A.; Namkoong, H.; Tedrake, R.; Duchi, J.C. Scalable End-to-End Autonomous Vehicle Testing via Rare-Event Simulation. Adv. Neural Inf. Process. Syst. 2018, 31, 9849–9860. [Google Scholar]
  22. Summala, H. Traffic psychology theories: Towards understanding driving behaviour and safety factors. In Proceedings of the International Conference of Traffic and Transport Psychology, Nottingham, UK, November 2004. [Google Scholar]
  23. McDougall, W. An Introduction to Social Psychology; Psychology Press: London, UK, 2015; pp. 42–69. [Google Scholar]
  24. Murray, H.A.; McAdams, D.P. Explorations in Personality; Oxford University Press: Singapore, 2007. [Google Scholar]
  25. Alderfer, C.P. An empirical test of a new theory of human needs. Organ. Behav. Hum. Perform. 1969, 4, 142–175. [Google Scholar] [CrossRef]
  26. Taubman Ben-Ari, O. Motivational sources of driving and their associations with reckless driving cognitions and behavior. Eur. Rev. Appl. Psychol. 2008, 58, 51–64. [Google Scholar] [CrossRef]
  27. Boer, E.R.; Hoedemaeker, M. Modeling driver behavior with different degrees of automation: A hierarchical decision framework of interacting mental models. In Proceedings of the Conference on Human Decision Making and Manual Control, Valenciennes, France, 14–16 December 1998; LAMIH: Famars, France, 1998. [Google Scholar]
  28. McClelland, D.C. Human Motivation; Cambridge University Press: New York, NY, USA, 1987; pp. 595–601. [Google Scholar]
  29. Ryan, R.M.; Deci, E.L. Self-determination theory and the facilitation of intrinsic motivation, social development, and well-being. Am. Psychol. 2000, 55, 68–78. [Google Scholar] [CrossRef] [PubMed]
  30. Parkes, A.M.; Franzen, S. Road user need. In Driving Future Vehicles; CRC Press: Boca Raton, FL, USA, 1993; pp. 37–45. [Google Scholar]
  31. Erdmann, J. Lane-changing model in SUMO. In Proceedings of the SUMO2014 Modeling Mobility with Open Data, Berlin, Germany, 15–16 May 2014; Volume 24, pp. 77–88. [Google Scholar]
  32. Florent, V.; Marie-Axelle, G.; Laurent, C.; Frédéric, M.; Marie, P.; Anthony, P. The role of perceived legitimacy in understanding traffic rule compliance: A scoping review. Accid. Anal. Prev. 2021, 159, 106299. [Google Scholar] [CrossRef] [PubMed]
  33. Björklund, G.M.; Åberg, L. Driver behaviour in intersections: Formal and informal traffic rules. Transp. Res. Part F Traffic Psychol. Behav. 2005, 8, 239–253. [Google Scholar] [CrossRef]
Figure 1. General structure of the decision-making model. As (a) depicts, the root of the behavior tree is a selector node, with five child nodes as different levels of motivation and a default state to ensure the accurate operation of the tree. From (bg), the six figure panels represent the behavior subtrees for safety motivation, order motivation, achievement motivation, relatedness motivation, dominance motivation, and default state.
Figure 1. General structure of the decision-making model. As (a) depicts, the root of the behavior tree is a selector node, with five child nodes as different levels of motivation and a default state to ensure the accurate operation of the tree. From (bg), the six figure panels represent the behavior subtrees for safety motivation, order motivation, achievement motivation, relatedness motivation, dominance motivation, and default state.
Applsci 13 08515 g001
Figure 2. Safety field of a scenario vehicle. The cyan rectangle represents the scenario vehicle, the arrow inside denotes the vehicle’s heading, and the red rectangle indicates the safety field of the vehicle.
Figure 2. Safety field of a scenario vehicle. The cyan rectangle represents the scenario vehicle, the arrow inside denotes the vehicle’s heading, and the red rectangle indicates the safety field of the vehicle.
Applsci 13 08515 g002
Figure 3. Typical interaction scenarios: (a) The rear vehicle v e h r e a r sends a signal to the front vehicle v e h f r o n t , requesting the driver to give way. v e h f r o n t may continue to (I) change lane or (II) return to the original lane. (b) When v e h r e a r sends a signal during normal driving along the lane, v e h f r o n t can (I) increase its speed, (II) maintain the current speed, or (III) randomly move to an adjacent lane. (c) A vehicle in the adjacent lane ( v e h f r o n t ) merges into the lane in front of v e h r e a r . Therefore, v e h f r o n t sends a signal to v e h r e a r requesting the driver to maintain a safe distance. v e h r e a r can (I) agree to the request of v e h f r o n t and adjust the speed to help v e h f r o n t change the lane or (II) maintain the current state to prevent v e h f r o n t from changing the lane.
Figure 3. Typical interaction scenarios: (a) The rear vehicle v e h r e a r sends a signal to the front vehicle v e h f r o n t , requesting the driver to give way. v e h f r o n t may continue to (I) change lane or (II) return to the original lane. (b) When v e h r e a r sends a signal during normal driving along the lane, v e h f r o n t can (I) increase its speed, (II) maintain the current speed, or (III) randomly move to an adjacent lane. (c) A vehicle in the adjacent lane ( v e h f r o n t ) merges into the lane in front of v e h r e a r . Therefore, v e h f r o n t sends a signal to v e h r e a r requesting the driver to maintain a safe distance. v e h r e a r can (I) agree to the request of v e h f r o n t and adjust the speed to help v e h f r o n t change the lane or (II) maintain the current state to prevent v e h f r o n t from changing the lane.
Applsci 13 08515 g003
Figure 4. Part of the map showing locations where the experiments were performed.
Figure 4. Part of the map showing locations where the experiments were performed.
Applsci 13 08515 g004
Figure 5. Scenario 1: Weaving ability in multi-lane traffic. From (af), the six figure panels represent motivation, action, lane identification, longitudinal displacement, speed, and lateral displacement versus time of the five scenario vehicles.
Figure 5. Scenario 1: Weaving ability in multi-lane traffic. From (af), the six figure panels represent motivation, action, lane identification, longitudinal displacement, speed, and lateral displacement versus time of the five scenario vehicles.
Applsci 13 08515 g005
Figure 6. Scenario 1: Top-view screenshots of the weaving ability in multi-lane traffic. (a) V e h 1000 , V e h 1001 , V e h 1002 , and V e h 1003 are driving steadily, and V e h 1004 accelerates from standstill. The green arrow represents the direction of movement for all the scenario vehicles. (b) After being blocked by V e h 1003 , V e h 1004 moves to the adjacent lane. (c) V e h 1004 changes the lane again after being blocked by V e h 1001 . (d) Five vehicles continue to move towards their target. (e) V e h 1004 collides with V e h 1001 when changing lanes.
Figure 6. Scenario 1: Top-view screenshots of the weaving ability in multi-lane traffic. (a) V e h 1000 , V e h 1001 , V e h 1002 , and V e h 1003 are driving steadily, and V e h 1004 accelerates from standstill. The green arrow represents the direction of movement for all the scenario vehicles. (b) After being blocked by V e h 1003 , V e h 1004 moves to the adjacent lane. (c) V e h 1004 changes the lane again after being blocked by V e h 1001 . (d) Five vehicles continue to move towards their target. (e) V e h 1004 collides with V e h 1001 when changing lanes.
Applsci 13 08515 g006
Figure 7. Scenario 2: Handling the unsignalized traffic junctions. From (ad), the four figure panels represent motivation, action, longitudinal displacement, and speed versus time of the three scenario vehicles. The blue solid, orange dashed, and green dash-dotted lines represent V e h 1000 , V e h 1001 , and V e h 1002 , respectively.
Figure 7. Scenario 2: Handling the unsignalized traffic junctions. From (ad), the four figure panels represent motivation, action, longitudinal displacement, and speed versus time of the three scenario vehicles. The blue solid, orange dashed, and green dash-dotted lines represent V e h 1000 , V e h 1001 , and V e h 1002 , respectively.
Applsci 13 08515 g007
Figure 8. Scenario 2: Top view screenshots of handling the unsignalized traffic junctions. (a) Initially, V e h 1000 and V e h 1001 are on the same road, whereas V e h 1002 is on another road. V e h 1000 and V e h 1002 share the same destination. The green arrows represent the direction of movement for the scenario vehicles. (b) V e h 1000 reaches the junction before V e h 1002 . (c) V e h 1002 waits as V e h 1000 is passing the junction. (d) V e h 1001 waits while V e h 1002 is passing the junction. (e) All the three scenario vehicles pass the junction.
Figure 8. Scenario 2: Top view screenshots of handling the unsignalized traffic junctions. (a) Initially, V e h 1000 and V e h 1001 are on the same road, whereas V e h 1002 is on another road. V e h 1000 and V e h 1002 share the same destination. The green arrows represent the direction of movement for the scenario vehicles. (b) V e h 1000 reaches the junction before V e h 1002 . (c) V e h 1002 waits as V e h 1000 is passing the junction. (d) V e h 1001 waits while V e h 1002 is passing the junction. (e) All the three scenario vehicles pass the junction.
Applsci 13 08515 g008
Figure 9. Scenario 3: Cooperation between vehicles on rural roads. From (af), the six figure panels represent motivation, action, lane ID, longitudinal displacement, speed, and lateral displacement versus time of the three scenario vehicles. The blue solid, orange dashed, and green dash-dotted lines represent V e h 1000 , V e h 1001 , V e h 1002 , respectively.
Figure 9. Scenario 3: Cooperation between vehicles on rural roads. From (af), the six figure panels represent motivation, action, lane ID, longitudinal displacement, speed, and lateral displacement versus time of the three scenario vehicles. The blue solid, orange dashed, and green dash-dotted lines represent V e h 1000 , V e h 1001 , V e h 1002 , respectively.
Applsci 13 08515 g009
Figure 10. Scenario 3: Top view screenshots of the cooperation between vehicles on rural roads. (a) V e h 1000 is in the front-most position, V e h 1001 is in the middle, and V e h 1002 is in the rear-most position. The green arrow represent the direction of movement for all the scenario vehicles. (b) V e h 1002 overtakes V e h 1000 . (c) V e h 1002 gradually approaches V e h 1000 . (d) V e h 1002 overtakes V e h 1000 and V e h 1001 .
Figure 10. Scenario 3: Top view screenshots of the cooperation between vehicles on rural roads. (a) V e h 1000 is in the front-most position, V e h 1001 is in the middle, and V e h 1002 is in the rear-most position. The green arrow represent the direction of movement for all the scenario vehicles. (b) V e h 1002 overtakes V e h 1000 . (c) V e h 1002 gradually approaches V e h 1000 . (d) V e h 1002 overtakes V e h 1000 and V e h 1001 .
Applsci 13 08515 g010
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, Y.; Guan, H.; Jia, X.; Duan, C. Decision-Making Model for Dynamic Scenario Vehicles in Autonomous Driving Simulations. Appl. Sci. 2023, 13, 8515. https://doi.org/10.3390/app13148515

AMA Style

Li Y, Guan H, Jia X, Duan C. Decision-Making Model for Dynamic Scenario Vehicles in Autonomous Driving Simulations. Applied Sciences. 2023; 13(14):8515. https://doi.org/10.3390/app13148515

Chicago/Turabian Style

Li, Yanfeng, Hsin Guan, Xin Jia, and Chunguang Duan. 2023. "Decision-Making Model for Dynamic Scenario Vehicles in Autonomous Driving Simulations" Applied Sciences 13, no. 14: 8515. https://doi.org/10.3390/app13148515

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop