Distributed Stochastic Model Predictive Control for a Microscopic Interactive Traffic Model

Ni Dang; Tim Brüdigam; Zengjie Zhang; Fangzhou Liu; Marion Leibold; Martin Buss

doi:10.3390/electronics12061270

,

and

¹

Chair of Automatic Control Engineering, TUM School of Computation, Information and Technology, Technical University of Munich, 80333 Munich, Germany

²

Department of Electrical Engineering, Eindhoven University of Technology, 5600 MB Eindhoven, The Netherlands

³

School of Astronautics, Harbin Institute of Technology, Harbin 150001, China

^*

Author to whom correspondence should be addressed.

Electronics2023, 12(6), 1270;https://doi.org/10.3390/electronics12061270

This article belongs to the Special Issue Advanced Application of Artificial Intelligence in Networked Control Systems

Version Notes

Order Reprints

Abstract

Stochastic Model Predictive Control (SMPC) has attracted increasing attention for autonomous driving in recent years, since it enables collision-free maneuvers and trajectory planning and can deal with uncertainties in a non-conservative way. Many promising strategies have been proposed on how to use SMPC to select appropriate maneuvers and plan safe trajectories in uncertain environments. The limitation of these approaches is that they focus on scenarios where only one vehicle is controlled by SMPC and is, thus, reacting to the surrounding vehicles; however, the surrounding vehicles do not react to the SMPC-controlled vehicle, which means there is no mutual interaction. However, when multiple autonomous vehicles are driving on the road, each individual vehicle will take the behavior of the other surrounding vehicles into account and adjust its individual decisions accordingly in trajectory planning. This paper, therefore, examines in simulations how the interactive control system of multiple SMPC-controlled vehicles behave based on a Distributed SMPC (DSMPC) framework. For a three-lane highway scenario, we first investigate the effects of the risk parameter of the collision avoidance probabilistic constraint on non-interactive and interactive vehicle systems and provide insights into how to parameterize the controllers in interactive vehicle systems.

Keywords:

Model Predictive Control; autonomous vehicles; interactive systems

1. Introduction

Recent decades have witnessed rapid development in autonomous driving. Autonomous vehicles driving in dynamic highway environments must be able to manage the uncertainty resulting from the behavior of other traffic participants [1]. Nominal control approaches that cannot deal with system uncertainties might cause hazardous performance [1]. In contrast, robust control approaches can consider uncertainties but are too conservative because they also consider worst-case scenarios [2]. SMPC has been used to control autonomous vehicles because of its ability to consider uncertainties and simultaneously avoid overly conservative behaviors [1,3,4,5].

MPC iteratively solves a constrained optimal-control problem on a finite prediction horizon. That means a cost function is minimized while satisfying multiple constraints [6], including a system dynamic model used for generating predictions and safety constraints. In contrast, SMPC allows constraint violation with a specified small probability by applying probabilistic chance constraints, resulting in non-conservative behaviors [7,8]. Autonomous vehicles controlled by SMPC treat collision-avoidance constraints with adjustable risk parameters as probabilistic constraints [1,4]. However, in previous studies [1,3,4,5], simulations that confirmed the suitability of SMPC-based controllers for autonomous vehicles assumed that the surrounding vehicles used a much simpler controller and, in particular, did not consider predictions of the surrounding vehicles and, thus, also did not react to SMPC-controlled vehicles.

In the simulations of this paper, for simplicity of description, vehicles that react to other vehicles are called reactive vehicles, and those that do not react to other vehicles are non-reactive vehicles. A system consisting of only one reactive vehicle and multiple non-reactive vehicles is a non-interactive system. In real traffic, all vehicles tend to react to the vehicles in the environment. Thus, after designing an individual SMPC algorithm for a vehicle treating all surrounding vehicles as non-reactive vehicles, it is essential to also embed this controlled vehicle system in an environment where all surrounding vehicles are reactive. We, thus, obtain an interactive system. Not only is it important to investigate the performance of the overall interactive control system but the findings from this investigation must be included in the design of the SMPC for an individual autonomous vehicle.

Evaluating the performance of a novel controller requires simulating it in different scenarios employing either microscopic traffic models or macroscopic traffic models. Microscopic traffic models focus on studying the traffic phenomena of individual vehicles and analyzing how they interact with each other [9]. In microscopic traffic models, the dynamics of each traffic participant are individually modeled [10]; this allows us to know each vehicle’s detailed information, including the location, velocity, inertial heading, acceleration, and steering angle [10]. Macroscopic traffic models research the overall characteristics, e.g., the intensity, density, and mean speed, of the traffic flow, in which the details of individual interactions between vehicles are ignored [9,11]. In this paper, we study the performance of individual SMPC-controlled vehicles and the interactions between them and, consequently, select a microscopic traffic model.

In a multi-vehicle highway environment, considering the interactions between reactive vehicles contributes to more precise traffic prediction [12], which is fundamentally required in intelligent transportation systems [11]. Interactions between reactive vehicles have previously been investigated in microscopic traffic simulations [12,13]; however, to the best of our knowledge, the interactions between SMPC-controlled vehicles, where it is of interest to see the impact and the interplay of different risk parameters that determine the aggressiveness/conservativeness level of vehicles when reacting to other vehicles, have not been investigated. This motivated us to examine the interactions between SMPC-controlled vehicles in a multi-vehicle interactive system for a highway environment in this paper.

To do this, we model the multi-vehicle interactive system using a Distributed SMPC (DSMPC) framework [14,15,16]. In this framework, each vehicle interacts with its surrounding vehicles by observing their current states and predicting their future behaviors and avoiding potential collisions. Distributed MPC (DMPC) has been applied to solve vehicle platooning problems [17,18,19,20], where multiple vehicles are typically involved and are controlled to cruise at a constant speed. However, DMPC has not been used for problems where individual vehicles do not have a common driving goal. In this paper, we use a DSMPC framework to model multi-vehicle interactive systems, where individual vehicles have unique driving goals, which are usually different. Here, we assume that all vehicles have the same controller but with different parameterizations. In particular, the risk parameter is chosen differently.

We summarize the contributions of this paper as follows:

Investigating the effects of SMPC risk parameters on non-interactive and interactive vehicle-control systems on highways.
Providing guidelines on how to set risk parameters for vehicles in interactive systems.

Our work regarding interactive systems of SMPC-controlled vehicles is based on the hypotheses below:

Hypothesis 1.

The behaviors of a vehicle are determined not only by its own controller but also by the controllers of other vehicles.

Hypothesis 2.

The behaviors of one vehicle can influence the performance of the whole system.

The rest of the paper is organized as follows. Section 2 presents the communication topology used in the distributed framework and formulates the individual SMPC controller and how all individual SMPC controllers are combined together. In Section 3, we introduce elements of these SMPC problems and transform the stochastic optimal-control problem into a deterministic problem. Finally, our simulation results are presented in Section 4 followed by our conclusions in Section 5.

Notation:

R

is the set of real numbers.

R^{m}

denotes the set of all column vectors with m elements, which are real numbers.

R^{m \times n}

stands for the set of all

m \times n

matrices whose elements are real numbers. We use

S^{n}

to denote the set of symmetric matrices of order n.

{∥ x ∥}_{Q}^{2} = x^{⊺} Q x

.

2. Model of the Multi-Vehicle System

In our multi-vehicle interactive control system model, each vehicle detects the position in the lateral and longitudinal direction, velocity, and inertial heading angle of all currently neighboring vehicles, which we refer to as ’information’ in the following; a graph theoretic time-varying communication topology [19] models this information transmission.

In this section, we introduce the communication topology and the SMPC problem that is solved by an individual vehicle and how all SMPC problems are combined into the distributed SMPC control framework.

2.1. Communication Topology

The communication topology shows which of the surrounding vehicles is considered in the controller of one particular vehicle. We assume that all vehicles are equipped with sensors to detect information about their surrounding vehicles at a specified detectable distance. This distance depends on the detection ability of each vehicle’s sensors. For simplicity, we assume that the sensors of all vehicles have the same detection ability, which means the detectable distance is the same.

Here, we introduce a communication topology at one time step as shown in Figure 1. The communication topology is updated at each time step to account for changing vehicle positions.

Figure 1. Communication topology at one time step. This topology shows, at one time step, which of the surrounding vehicles is considered in the controller of one particular vehicle. This topology is updated at each time step.

It is modeled as an undirected graph

G = {V, E}

, where

V = {1, 2, \dots, N_{v}}

is the set of nodes, which represent vehicles, and

E \subseteq V \times V

is the set of edges describing the information detection among vehicles. The number of nodes (vehicles) in the graph is given by

N_{v}

. The graph

G

can be denoted with an adjacency matrix

A \in R^{N_{v} \times N_{v}}

A = [a_{i j}] = \{\begin{matrix} a_{i j} & = 1, if {i, j} \in E \\ a_{i j} & = 0, if {i, j} \notin E \end{matrix},

(1)

where

{i, j} \in E

means vehicle i senses the information about vehicle j, which is within the detectable distance of vehicle i. Vehicle j is, therefore, a neighbor of vehicle i. The set consisting of the neighbors of vehicle i is denoted by

N_{i} = {j ∣ a_{i j} = 1, j \in V}

. We define a dual set

O_{i} = {j ∣ a_{j i} = 1, j \in V}

, which includes all vehicles that identify i as a neighbor. The union of the two sets

N_{i}

and

O_{i}

is

N_{i} \cup O_{i}

. All vehicles in

N_{i} \cup O_{i}

categorize i as one of their neighbors and are themselves simultaneously neighbors of vehicle i. The sets

N_{i}

,

O_{i}

, and

N_{i} \cup O_{i}

are updated at each sampling time. Assuming that all vehicles have the same detectable distance, the sets

N_{i}

,

O_{i}

, and

N_{i} \cup O_{i}

are equal.

Each controlled vehicle will attempt to avoid collisions with its neighboring vehicles, and probabilistic constraints in the optimal-control problem reflect this requirement. Any vehicle i incorporates the information about its neighbors from

N_{i}

into its collision-avoidance constraints. All vehicles in

O_{i}

take the information of vehicle i into account in their collision-avoidance constraints.

2.2. Vehicle Controllers

The multi-vehicle interactive control system consists of a number of individual vehicles that are interactive because their individual controllers consider information about the current states of surrounding vehicles. We assume that each vehicle is controlled by SMPC; thus, the overall system is modeled using a distributed SMPC framework. The SMPC optimal-control problem that is solved by each vehicle at every sampling time is introduced in this section.

To decide on the current optimal control, each controlled vehicle, denoted as an Ego Vehicle (EV), must consider its predicted behaviors as well as those of vehicles, denoted as Target Vehicles (TVs), within a detectable distance. Simultaneously, an EV might be a TV of other Ego Vehicles (EVs). For each vehicle i (

i \in V

) and each prediction time step k

(k = 0, \dots, N - 1)

, we define the predicted state

ξ_{i, k}^{p}

and predicted control input

u_{i, k}^{p}

that will later be optimized over a prediction horizon of length N. Further, we introduce assumed states

ξ_{i, k}^{a}

and assumed control inputs

u_{i, k}^{a}

[18,19] to describe what other vehicles (vehicles in

O_{i}

) assume about the future behaviors of vehicle i. Finally,

ξ_{i, k}^{*}

and

u_{i, k}^{*}

define the optimal trajectories that vehicle i determines by solving the SMPC optimal control problem.

The SMPC optimal-control problem for EV

i \in V

is specified by a cost function

J_{i}

and constraints. The cost function is minimized over all admissible control input trajectories

u_{i}^{p} = {(u_{i, 0}^{p}, u_{i, 1}^{p}, \dots, u_{i, N - 1}^{p})}^{⊺}

, where admissibility means that the control inputs

u_{i}^{p}

as well as the corresponding state trajectory

ξ_{i}^{p} = {(ξ_{i, 0}^{p}, ξ_{i, 1}^{p}, \dots, ξ_{i, N}^{p})}^{⊺}

, which is found by iterating the system dynamics

ξ_{i, k + 1}^{p} = f^{p} (ξ_{i, k}^{p}, u_{i, k}^{p}), i \in V, k = 0, \dots, N - 1,

(2a)

do not violate constraints. The initial predicted state

ξ_{i, 0}^{p}

is the current state of the EV i. A first version of the optimal-control problem is, thus, given by

min_{u_{i}^{p}} J_{i} (ξ_{i}^{p}, u_{i}^{p})

(2b)

subject to state and input constraints

ξ_{i}^{\min} \leq ξ_{i, k}^{p} \leq ξ_{i}^{\max}, k = 0, \dots, N

(2c)

u_{i}^{\min} \leq u_{i, k}^{p} \leq u_{i}^{\max}, k = 0, \dots, N - 1 .

(2d)

Remark: We used only box constraints here, though more general constraints would be allowed.

We still have to add collision-avoidance constraints that involve assumptions on the surrounding vehicles’ behaviors. Summarizing the assumed models of all TVs in

ξ_{TV, k + 1}^{a} = f_{TV}^{a} (ξ_{TV, k}^{a}, u_{TV, k}^{a}, ω_{TV, k}^{a}), k = 0, \dots, N - 1,

(2e)

where

ω_{TV, k}^{a}

is the uncertainty in the prediction of TV behaviors, we obtain assumptions for all times in the prediction horizon used to formulate probabilistic collision-avoidance constraints for each TV summed up in

\Pr (ξ_{i, k}^{p} \in Ξ_{i, k}^{safe, TV}) \geq p_{i}, p_{i} \in [0.5, 1], k = 1, \dots, N .

(2f)

These constraints are probabilistic constraints in our approach. The requirement

ξ_{i, k}^{p} \in Ξ_{i, k}^{safe, TV}

means that states

ξ_{i, k}^{p}

have to be in the safe set

Ξ_{i, k}^{safe, TV}

to avoid potential collisions with the TVs at prediction step k. The set

Ξ_{i, k}^{safe, TV}

is determined from the predicted states of EV i and the assumed states of all its TVs

ξ_{TV, k}^{a}

. Employing

\Pr (*) \geq p_{i}

, we ensure that the event ∗ occurs with a probability of not less than

p_{i}

. The probabilistic constraints (2f) are designed to soften the collision-avoidance constraint between the EV i and its TVs.

A small probability of collisions between the EV i and its TVs is acceptable. This softening prevents overly conservative driving behaviors caused by hard constraints in robust MPC. In the following,

p_{i}

in constraints (2f) is identified as a risk parameter of EV i and is specified in advance. A smaller risk parameter

p_{i}

corresponds to more aggressive driving behaviors, which might increase the probability of collisions. Conversely, a larger

p_{i}

results in more conservative behaviors, a defensive driving mode.

We refer to the expressions (2a)–(2f) as ’the SMPC optimal-control problem’ in the following. The model in (2e) collects the system models of all TVs of EV i. If we assume that EV i takes m TVs labeled

i_{1}, i_{2}, \dots, i_{m}

(

i_{1}, i_{2}, \dots, i_{m} \in N_{i}

) into account, then (2e) summarizes

\{\begin{matrix} ξ_{i_{1}, k + 1}^{a} = f^{a} (ξ_{i_{1}, k}^{a}, u_{i_{1}, k}^{a}, ω_{i_{1}, k}^{a}) \\ ξ_{i_{2}, k + 1}^{a} = f^{a} (ξ_{i_{2}, k}^{a}, u_{i_{2}, k}^{a}, ω_{i_{2}, k}^{a}) \\ ⋮ \\ ξ_{i_{m}, k + 1}^{a} = f^{a} (ξ_{i_{m}, k}^{a}, u_{i_{m}, k}^{a}, ω_{i_{m}, k}^{a}) \end{matrix}, k = 0, \dots, N - 1 .

(3)

The assumed states

ξ_{i_{1}, k}^{a}

,

ξ_{i_{2}, k}^{a}

, and

ξ_{i_{m}, k}^{a}

correspond to TVs

i_{1}

,

i_{2}

, and

i_{m}

, respectively. Similarly, the assumed control inputs are

u_{i_{1}, k}^{a}

,

u_{i_{2}, k}^{a}

, and

u_{i_{m}, k}^{a}

; the prediction uncertainties are denoted by

ω_{i_{1}, k}^{a}

,

ω_{i_{2}, k}^{a}

, and

ω_{i_{m}, k}^{a}

. The dynamic model of the EV and TVs will be discussed in more detail in Section 3.

In the same way, expression (2f) contains the collision-avoidance constraints between EV i and all its TVs (

i_{1}, i_{2}, \dots, i_{m} \in N_{i}

):

\{\begin{matrix} \Pr (ξ_{i, k}^{p} \in Ξ_{i, k}^{safe, i_{1}}) \geq p_{i} \\ \Pr (ξ_{i, k}^{p} \in Ξ_{i, k}^{safe, i_{2}}) \geq p_{i} \\ ⋮ \\ \Pr (ξ_{i, k}^{p} \in Ξ_{i, k}^{safe, i_{m}}) \geq p_{i} \end{matrix}, p_{i} \in [0.5, 1], k = 1, \dots, N .

(4)

Here,

Ξ_{i, k}^{safe, i_{1}}

,

Ξ_{i, k}^{safe, i_{2}}

and

Ξ_{i, k}^{safe, i_{m}}

are the sets of safe states of EV i for preventing collisions with TVs

i_{1}

,

i_{2}

, ⋯,

i_{m}

at prediction step k, respectively.

3. Elements of the SMPC Problem

In this section, we introduce the elements of the SMPC optimal-control problem, including the vehicle models in Section 3.1, constraints in Section 3.2, and cost function in Section 3.3.

Additionally, due to the presence of stochastic disturbances

ω_{TV, k}^{a}

in the TV model (2e) and the probabilistic chance constraints (2f), the SMPC optimal-control problem cannot be solved directly [1]. To solve this, we transfer the stochastic optimal-control problem into a deterministic one by (1) reformulating the dynamic model of the TV, as shown in Section 3.1.4; and (2) tightening the probabilistic constraints as shown in Section 3.2.3.

3.1. Vehicle Models

A predictive controller requires a system model (2a). Thus, for our application, we need a system model of each EV, which is used by the EV to decide on optimal controls. In addition, we need a system model of each TV, which EVs use to predict TV trajectories to avoid potential collisions.

Vehicle models with different modeling depths have been proposed in the literature, including, e.g., the Fiala tire model, the dynamic bicycle model, and the kinematic bicycle model [1,21,22,23]. In this paper, we use the kinematic bicycle model [24] because it is a relatively coarse model and, thus, contributes to avoiding excessive computational load in optimizations [24].

The kinematic bicycle model consists of nonlinear differential equations (see, e.g., [24]), which are summarized as

\dot{ξ} = f^{c} (ξ, u)

in this paper. The state vector

ξ = {(x, y, ψ, v)}^{⊺}

contains the longitudinal position x and lateral position y of the center of mass of the vehicle as well as the velocity of the vehicle v and inertial heading

ψ

. The control inputs

u = {(a, δ)}^{⊺}

contain the acceleration a and steering angle

δ

. The nonlinear differential equations and all notations for describing the kinematic bicycle model can be found in Appendix A. In simulations, we use a linearized, discretized version of the model (see [5,25]).

3.1.1. Linear Discrete-Time Model

The linearized and discretized kinematic bicycle model (see [5]) is denoted as

ξ_{k + 1} = ξ_{0} + T f^{c} (ξ_{0}, 0) + A (ξ_{k} - ξ_{0}) + B u_{k}, k = 0, \dots, N - 1,

(5)

where the state and control input at prediction step k are represented by

ξ_{k}

and

u_{k}

, respectively. The initial state is

ξ_{0}

, and the sampling time is T. The system matrices A and B are given in Appendix B.

3.1.2. Model of EVs

We use the model (5) for each EV i to generate predictions:

ξ_{i, k + 1}^{p} = ξ_{i, 0}^{p} + T f^{c} (ξ_{i, 0}^{p}, 0) + A_{i} (ξ_{i, k}^{p} - ξ_{i, 0}^{p}) + B_{i} u_{i, k}^{p}, i \in V, k = 0, \dots, N - 1,

(6)

where

ξ_{i, k}^{p} = {(x_{i, k}^{p}, y_{i, k}^{p}, ψ_{i, k}^{p}, v_{i, k}^{p})}^{⊺} \in R^{N_{ξ, i}}

and

u_{i, k}^{p} = {(δ_{i, k}^{p}, a_{i, k}^{p})}^{⊺} \in R^{N_{u, i}}

are the predicted states and control inputs of EV i in prediction step k, respectively.

3.1.3. Model of TVs

For TVs, we choose a slightly adapted version of model (5) to include prediction uncertainty. Let vehicle

\overset{˘}{i}

be one TV of any EV i (

\overset{˘}{i} \in N_{i}

), then

ξ_{\overset{˘}{i}, k}^{a}

is the assumed trajectory of TV

\overset{˘}{i}

at prediction step k, and the TV model is

ξ_{\overset{˘}{i}, k + 1}^{a} = ξ_{\overset{˘}{i}, 0}^{a} + T f^{c} (ξ_{\overset{˘}{i}, 0}^{a}, 0) + A_{\overset{˘}{i}} (ξ_{\overset{˘}{i}, k}^{a} - ξ_{\overset{˘}{i}, 0}^{a}) + B_{\overset{˘}{i}} u_{\overset{˘}{i}, k}^{a} + G_{\overset{˘}{i}} ω_{\overset{˘}{i}, k}^{a}, k = 0, \dots, N - 1,

(7)

where

ξ_{\overset{˘}{i}, k}^{a} = {(x_{\overset{˘}{i}, k}^{a}, y_{\overset{˘}{i}, k}^{a}, ψ_{\overset{˘}{i}, k}^{a}, v_{\overset{˘}{i}, k}^{a})}^{⊺} \in R^{N_{ξ, \overset{˘}{i}}}

and

u_{\overset{˘}{i}, k}^{a} = {(δ_{\overset{˘}{i}, k}^{a}, a_{\overset{˘}{i}, k}^{a})}^{⊺} \in R^{N_{u, \overset{˘}{i}}}

are the assumed states and control inputs of TV

\overset{˘}{i}

at prediction step k, respectively. The system matrices

A_{\overset{˘}{i}}

and

B_{\overset{˘}{i}}

can be found in [25]. The vector

ω_{\overset{˘}{i}, k}^{a} \in R_{ω, \overset{˘}{i}}^{N}

is included to account for the uncertainty at any prediction step k, which comes from the imprecision of the prediction.

The uncertainties

ω_{\overset{˘}{i}, k}^{a} \in R_{ω, \overset{˘}{i}}^{N}

are assumed to be subject to a Gaussian distribution with zero mean and covariance matrix

\sum_{ω_{\overset{˘}{i}}^{a}}

, and thus

ω_{\overset{˘}{i}, k}^{a} \sim N (0, \sum_{ω_{\overset{˘}{i}}^{a}})

.

3.1.4. Reformulation of the TV Model

The SMPC optimal-control problem in expressions (2a)–(2f) is replaced by an equivalent deterministic problem that is numerically tractable. Here, we prepare this replacement by splitting the TV model into deterministic and stochastic equations (see [26]).

The state of TV

\overset{˘}{i}

at prediction step k is decomposed into two components: the deterministic, nominal component

z_{\overset{˘}{i}, k}^{a}

(

z_{\overset{˘}{i}, k}^{a} = {({\tilde{x}}_{\overset{˘}{i}, k}^{a}, {\tilde{y}}_{\overset{˘}{i}, k}^{a}, {\tilde{ψ}}_{\overset{˘}{i}, k}^{a}, {\tilde{v}}_{\overset{˘}{i}, k}^{a})}^{⊺} \in R^{N_{z, \overset{˘}{i}}}

) and a zero-mean stochastic error component

e_{\overset{˘}{i}, k}^{a}

ξ_{\overset{˘}{i}, k}^{a} = z_{\overset{˘}{i}, k}^{a} + e_{\overset{˘}{i}, k}^{a} .

(8)

The following assumption is made (see [1]):

Assumption 1.

The state feedback is perfect, i.e.,

ξ_{\overset{˘}{i}, 0}^{a} = z_{\overset{˘}{i}, 0}^{a}

, which suggests

e_{\overset{˘}{i}, 0}^{a} = 0

with a probability of 1.

We incorporate a prestabilizing error feedback (see [7]) into the control input

u_{\overset{˘}{i}, k}^{a} = K_{\overset{˘}{i}} e_{\overset{˘}{i}, k}^{a} + v_{\overset{˘}{i}, k}^{a},

(9)

where

K_{\overset{˘}{i}}

is a stabilizing feedback gain that is obtained by applying a linear quadratic control strategy, and

v_{\overset{˘}{i}, k}^{a} = {({\tilde{δ}}_{i, k}^{a}, {\tilde{a}}_{i, k}^{a})}^{⊺} \in R^{N_{v, \overset{˘}{i}}}

is the assumed control input used for an EV to predict the behaviors of its TV

\overset{˘}{i}

. In the following, we set

v_{\overset{˘}{i}, k}^{a} = 0 (k = 0, \dots, N - 1)

, so that the EVs assume that TVs will drive with almost constant speed in the prediction horizon. The equations for the TV model are summarized as

z_{\overset{˘}{i}, k + 1}^{a} = z_{\overset{˘}{i}, 0}^{a} + T f^{c} (z_{\overset{˘}{i}, 0}^{a}, 0) + A_{\overset{˘}{i}} (z_{\overset{˘}{i}, k}^{a} - z_{\overset{˘}{i}, 0}^{a}) + B_{\overset{˘}{i}} v_{\overset{˘}{i}, k}^{a},

(10a)

e_{\overset{˘}{i}, k + 1}^{a} = Φ_{\overset{˘}{i}} e_{\overset{˘}{i}, k}^{a} + G_{\overset{˘}{i}} ω_{\overset{˘}{i}, k}^{a},

(10b)

where

Φ_{\overset{˘}{i}} = A_{\overset{˘}{i}} + B_{\overset{˘}{i}} K_{\overset{˘}{i}}

is strictly stable for the system

(A_{\overset{˘}{i}}, B_{\overset{˘}{i}})

of TV

\overset{˘}{i}

. The deterministic equation (10a) will generate predictions of TV behavior, while the stochastic equation (10b) will be used to evaluate the collision-avoidance constraints.

The distribution of all predicted errors

e_{\overset{˘}{i}, k}^{a}

is determined iteratively from the distributions of the initial error

e_{\overset{˘}{i}, 0}^{a}

and the disturbances

ω_{\overset{˘}{i}, k}^{a}

. Let

e_{\overset{˘}{i}, k}^{a} \sim N (0, \sum_{\overset{˘}{i}, k})

, then

e_{\overset{˘}{i}, k + 1}^{a} \sim N (0, \sum_{\overset{˘}{i}, k + 1})

, where

\sum_{\overset{˘}{i}, k + 1} = Φ_{\overset{˘}{i}} \sum_{\overset{˘}{i}, k} Φ_{\overset{˘}{i}}^{⊺} + G_{\overset{˘}{i}} \sum_{ω_{\overset{˘}{i}}^{a}} G_{\overset{˘}{i}}^{⊺}

(see [1]). From Assumption 1, we find that the covariance of the initial error

e_{\overset{˘}{i}, 0}^{a}

is 0, and thus

\sum_{\overset{˘}{i}, 0} = 0

.

3.2. Constraints

In this subsection, we introduce constraints on states and inputs for the SMPC optimal-control problems of EVs. We consider (1) road boundaries, limitations on the inertial heading, speed, and acceleration; and (2) collision avoidance, where collision-avoidance constraints are probabilistic constraints, and all others are hard constraints.

3.2.1. Hard Constraints

For any EV i, we incorporate the following hard constraints into the SMPC problem

ξ_{i}^{\min} \leq ξ_{i, k} \leq ξ_{i}^{\max}

(11a)

u_{i}^{\min} \leq u_{i, k} \leq u_{i}^{\max}

(11b)

where

ξ_{i}^{\min} = {(0, y^{r, l} + w_{i}^{veh} / 2, ψ_{i}^{\min}, v_{i}^{\min})}^{⊺}

,

ξ_{i}^{\max} = {(l^{road}, y^{r, u} - w_{i}^{veh} / 2, ψ_{i}^{\max}, v_{i}^{\max})}^{⊺}

,

u_{i}^{\min} = {(a_{i}^{\min}, δ_{i}^{\min})}^{⊺}

, and

u_{i}^{\max} = {(a_{i}^{\max}, δ_{i}^{\max})}^{⊺}

. The lower and upper boundaries of the road are represented by

y^{r, l}

and

y^{r, u}

, respectively. The length of the road is denoted by

l^{road}

. The width of EV i is given by

w_{i}^{veh}

. The lower bounds of the inertial heading angle

ψ_{i}^{\min}

, speed

v_{i}^{\min}

, acceleration

a_{i}^{\min}

, and front steering angle

δ_{i}^{\min}

are considered. We also consider the upper bounds of these states, denoted by

ψ_{i}^{\max}

and

v_{i}^{\max}

, and the control inputs, which are represented by

a_{i}^{\max}

and

δ_{i}^{\max}

. The values of these parameters are shown in Table A1 in Appendix D.

3.2.2. Collision-Avoidance Constraints

In the following, we explain the calculation of the safe sets

Ξ_{i, k}^{safe, \overset{˘}{i}}

in the collision-avoidance constraints (4), where ellipse regions approximate the occupied area of one vehicle that other vehicles are not allowed to enter (see [1,4]).

The non-accessible region around TV

\overset{˘}{i}

is given by an inequality constraint

d_{\overset{˘}{i}, k} = \frac{{(Δ x_{\overset{˘}{i}, k})}^{2}}{s_{a}^{2}} + \frac{{(Δ y_{\overset{˘}{i}, k})}^{2}}{s_{b}^{2}} - 1 \geq 0

(12)

that defines an ellipse where the center of vehicle

\overset{˘}{i}

is the center of the ellipse (see [4]). The size of the ellipse is determined by the semi-major axis

s_{a}

and the semi-minor axis

s_{b}

. The longitudinal distance and lateral distance between EV i and its TV

\overset{˘}{i}

are given by

Δ x_{\overset{˘}{i}, k}

and

Δ y_{\overset{˘}{i}, k}

, respectively, and are defined below:

[\begin{matrix} Δ x_{\overset{˘}{i}, k} \\ Δ y_{\overset{˘}{i}, k} \end{matrix}] = [\begin{matrix} x_{i, k}^{p} - {\tilde{x}}_{\overset{˘}{i}, k}^{a} \\ y_{i, k}^{p} - {\tilde{y}}_{\overset{˘}{i}, k}^{a} \end{matrix}] .

(13)

The constraint (12) is usually overly conservative because, when the ellipse region around the TV

\overset{˘}{i}

is larger than the actual vehicle shape, a vehicle might enter the ellipse region without causing a collision. For this reason, we employ the probabilistic chance constraint for collision avoidance that allows vehicles a small probability to enter the safety ellipse of another vehicle:

\Pr (d_{\overset{˘}{i}, k} \geq 0) \geq p_{i} .

(14)

3.2.3. Constraint Tightening

In order to directly solve the SMPC optimal-control problem, we replace the probabilistic chance constraint (2f) by a tightened version of

d_{\overset{˘}{i}, k} \geq 0

, where the upper bounds of the tightened constraints depend on the risk parameter

p_{i}

and the distribution of the prediction uncertainties

ω

in the TV models. This allows for replacing the stochastic optimal-control problem with a deterministic optimal-control problem. We adopt the constraint tightening from [1,4] and summarize it as follows.

From (8) in Section 3.1.4, the error between the actual and nominal states of TV

\overset{˘}{i}

is

e_{\overset{˘}{i}, k}^{a} = ξ_{\overset{˘}{i}, k}^{a} - z_{\overset{˘}{i}, k}^{a}

. Given (13), the constraint (12) is linearized around the nominal state

z_{\overset{˘}{i}, k}^{a}

of TV

\overset{˘}{i}

, resulting in

d_{\overset{˘}{i}, k} + \nabla d_{\overset{˘}{i}, k} e_{\overset{˘}{i}, k}^{a} \geq 0

(15)

where

\nabla d_{\overset{˘}{i}, k} = \frac{\partial d_{\overset{˘}{i}, k}}{\partial z_{\overset{˘}{i}, k}^{a}} = (\frac{- 2 Δ x_{\overset{˘}{i}, k}}{s_{a}^{2}}, \frac{- 2 Δ y_{\overset{˘}{i}, k}}{s_{b}^{2}}, 0, 0) .

(16)

Using inequality (15), the probabilistic chance constraint (14) is rewritten as

\Pr (- \nabla d_{\overset{˘}{i}, k} e_{\overset{˘}{i}, k}^{a} \leq d_{\overset{˘}{i}, k}) \geq p_{i}, p_{i} \in [0.5, 1], k = 1, \dots, N,

(17)

which can be divided into a deterministic inequality and a probabilistic equation:

d_{\overset{˘}{i}, k} \geq γ_{\overset{˘}{i}, k}

(18a)

\Pr (- \nabla d_{\overset{˘}{i}, k} e_{\overset{˘}{i}, k}^{a} \leq γ_{\overset{˘}{i}, k}) = p_{i}, p_{i} \in [0.5, 1], k = 1, \dots, N .

(18b)

Then, according to Theorem 1 in [4], the probabilistic equation in (18b) is tightened by choosing

γ_{\overset{˘}{i}, k}

as

γ_{\overset{˘}{i}, k} = \sqrt{2 \nabla d_{\overset{˘}{i}, k} \sum_{\overset{˘}{i}, k} {(\nabla d_{\overset{˘}{i}, k})}^{⊺}} \erf^{- 1} (2 p_{i} - 1) .

(19)

With the deterministic part of the TV model (10a) and the deterministic constraint (18a), the SMPC optimal-control problem in expressions (2a)–(2f) can be transformed into a deterministic problem (see Appendix C for the deterministic collision-avoidance constraints for multiple TVs).

3.3. Cost Function

In this subsection, we explain how the cost function (2b) in the SMPC optimal-control problem is designed to enable the tracking of reference states as well as to minimize control inputs.

For any EV i, the cost function in expression (2b) [18] is chosen as

J_{i} (ξ_{i}^{p}, u_{i}^{p}) = \sum_{k = 0}^{N - 1} ∥ ξ_{i, k}^{p} - ξ_{i, k}^{ref} ∥_{Q_{i}}^{2} + ∥ u_{i, k}^{p} ∥_{R_{i}}^{2} + {∥ ξ_{i, N}^{p} - ξ_{i, N}^{ref} ∥}_{Q_{i}}^{2} .

(20)

We define reference states for EV i as

ξ_{i, k}^{ref}

and

ξ_{i, N}^{ref}

to command EV i to enter or maintain a target lane at a desired velocity for every prediction step

k (k = 1, \dots, N)

.

The weighting matrices

Q_{i} \in S^{4}

and

R_{i} \in S^{2}

are symmetric and positive definite.

3.4. Control Algorithm for One Vehicle

We summarize the process of solving the SMPC optimal-control problem by any EV i in Algorithm 1.

Note that, in order to simplify the notation, we omitted a symbol for the current time t in the previous sections, when we defined predictions starting from time t. Here, however, in addition to the current time t, we use

t + T

for the successor time and use

ξ_{k | t}

and

u_{k | t}

, instead of

ξ_{k}

and

u_{k}

to describe the states and control inputs at prediction step k ahead of current time t. In simulations, we chose the system dynamic (5) as the real dynamics.

Algorithm 1 The SMPC problem for each EV i

Input:: $A_{i}$ , $B_{i}$ , $p_{i}$ , $t_{0}$ , $t_{end}$ , $ξ_{0}$ .
Output:: $u_{i}^{*}$

1:: $t = t_{0}$
2:: while $t < t_{end}$ do
3:: Detect the current states of EV i and its TVs
4:: Update $N_{i}$
5:: Solve the deterministic SMPC optimal-control problem to find the optimal control input trajectory $u_{i, k | t}^{*} (k = 0, 1, \dots, N - 1)$
6:: Apply first entry $u_{i, 0 | t}^{*}$ to real dynamics (5) and obtain successor state $ξ_{i, 1 | t}^{*}$
7:: $t = t + T$
8:: end while

4. Simulation Results

The performance of the multi-vehicle interactive system was examined via simulations of multiple vehicles on a three-lane highway. The right-most lane is the slow lane. For simplicity, in our simulations, we assumed that all three lanes have the same width and that all vehicles are the same size. The simulation setup, including the parameters of the highway, vehicles, and controller, can be found in Table A1 in Appendix D. The simulations were executed in MATLAB on a desktop computer with an Intel (R) Core (TM) i3-7100 CPU @ 3.90GHz 3.91 GHz processor. The solving algorithm for the SMPC is based on the NMPC toolbox [27], in which fmincon is used as a solver.

We first investigated the effects of the risk parameters on the behaviors of individual SMPC-controlled vehicles in non-interactive systems. Then, we examined how various settings of risk parameters of the SMPC-controlled vehicles influence the performance of interactive systems and provide insight into how to set risk parameters.

4.1. The Effects of Risk Parameters on an Individual Vehicle

We studied the effects of risk parameters on the distance between vehicles in a non-interactive system based on a two-vehicle scenario as shown in Figure 2. Here, one vehicle is controlled by SMPC, and the other vehicle is non-reactive. The two vehicles start in different lanes with different initial velocities. Vehicle 1 (the non-reactive vehicle) stays in the center lane, and Vehicle 2 (the SMPC-controlled vehicle) merges into the center lane. The simulation lasts 10 s. The corresponding initial settings, including the initial states

x_{0}

,

y_{0}

,

ψ_{0}

, and

v_{0}

and reference states

y_{ref}

and

v_{ref}

for the vehicles, are shown in Table 1.

Figure 2. A two-vehicle scenario. There are two vehicles on a three-lane highway. Vehicle 1, in red, is non-reactive in a non-interactive system but reactive in an interactive system and will remain in the center lane. Vehicle 2, in blue, is an SMPC-controlled vehicle, starting in the right, slow lane and later changing into the center lane. The longitudinal and lateral directions are represented by x and y, respectively.

Table 1. Initial settings for a non-interactive two-vehicle scenario.

We define the distance

d_{\overset{˘}{i}, t}

between EV i and its TV

\overset{˘}{i}

at any iteration

t / T

(current time t) by the evaluation of the collision-avoidance constraint (12) along the resulting closed-loop trajectories.

d_{\overset{˘}{i}, t} = \sqrt{\frac{{(Δ x_{\overset{˘}{i}, t})}^{2}}{s_{a}^{2}} + \frac{{(Δ y_{\overset{˘}{i}, t})}^{2}}{s_{b}^{2}}}

(21)

where the distance between EV i and its TV

\overset{˘}{i}

at iteration

t / T

(time t) in the longitudinal direction is denoted as

Δ x_{\overset{˘}{i}, t}

, and its lateral counterpart is

Δ y_{\overset{˘}{i}, t}

.

We investigated how the risk parameters influence the distances between vehicles. The risk parameter determines the probability of collision and, thus, controls the distance between two vehicles. Small risk parameters indicate more-aggressive, less-conservative driving with a higher probability of collision and small distances.

To better visualize the influence of the risk parameters on the distances between vehicles, we chose the distance for risk parameter 0.95 as a baseline and evaluated the deviations between the baseline (minuend) and the resulting distances for each of the other risk parameters 0.70, 0.75, 0.80, 0.85, and 0.90. Each risk parameter setting was simulated 100 times, and in each simulation, the initial states of the vehicles were slightly different.

They were generated from a normal distribution with the initial states

(x_{0}, y_{0}, ψ_{0}, v_{0})

, as presented in Table 1, as the mean values and a covariance matrix

diag (0.1, 0.01, 0, 0.01)

. The simulation results are presented in Figure 3, and it was confirmed that the greater the risk parameter of the SMPC-controlled vehicle (the more conservative), the smaller the average distance deviations—meaning larger distances between the two vehicles.

Figure 3. Distance deviations in a non−interactive two−vehicle scenario. The six colored lines represent deviations between the distances for all risk parameters (0.70, 0.75, 0.80, 0.85, 0.90, and 0.95) and the distance for the risk parameter 0.95 during the whole 12 iterations, respectively. The iteration is represented by

t / T

, where t is the time, and T denotes the sampling time.

4.2. The Effects of Risk Parameters on Interactive Systems

In principle, the risk parameter will also determine the distance between vehicles in an interactive system during close interaction. The performance of an individual vehicle depends not only on its own risk parameter but also on the risk parameters of its surrounding vehicles.

4.2.1. The Same vs. Different Risk Parameters

We investigated the state trajectories of two vehicles for different pairs of risk parameters

(p_{1}, p_{2})

, including

(0.70, 0.70)

,

(0.70, 0.95)

,

(0.95, 0.70)

, and

(0.95, 0.95)

, based on the two vehicle scenario in Figure 2. Here, both vehicles were reactive and controlled by SMPC. In order to simulate a highly interactive scenario, we slightly adjusted the initial settings of Vehicle 2, as described in Table 1, by (1) moving the longitudinal initial position

x_{0}

of Vehicle 2 to 67

m

, (2) increasing the initial velocity to 25

{m s}^{- 1}

, and (3) decreasing the reference velocity

v_{ref}

of Vehicle 2 to 27

{m s}^{- 1}

as summarized in Table 2. We depict the lateral position y of these two vehicles as shown in Figure 4.

Table 2. Initial settings for an interactive two-vehicle scenario.

Figure 4. Lateral positions in an interactive two-vehicle scenario. The risk parameter pairs

(p_{1}, p_{2})

for Vehicle 1 and Vehicle 2 are specified in the legend of the figures.

Figure 4 shows that both Vehicle 1 and Vehicle 2 finally reach their target lanes. We first studied the performance for if both vehicles use the same risk parameter by comparing their lateral positions for risk parameter pairs

(0.70, 0.70)

(red) and

(0.95, 0.95)

(purple). Both vehicles reach their target lane slightly earlier when the common risk parameter is

0.70

. Thus, setting a smaller risk parameter helped the vehicles reach the target lane earlier but not significantly. In total, the resulting trajectories for the risk parameter

(0.70, 0.70)

did not differ too much from those with

(0.95, 0.95)

.

We next investigated how the vehicles behave if they use different risk parameters. Comparing the trajectories for the risk parameter pair

(0.70, 0.70)

(red) with that for

(0.95, 0.70)

(blue), we see that if Vehicle 1 chooses a small risk parameter (driving more aggressively), it only slightly adjusts its behavior to avoid potential collisions before reaching the target lane. Vehicle 2 behaves similarly when Vehicle 1 has a greater risk parameter (driving more conservatively). These results are comparable to those for adjusting the risk parameter of Vehicle 2, which can be found in the comparison of the plots for

(0.70, 0.70)

(red) and

(0.70, 0.95)

(green). These results align with the symmetric roles that the two vehicles play in the two-vehicle interactive system, where both vehicles are EVs and treat the other vehicle as a TV.

We summarize the findings for the two-vehicle interactive system as follows:

When two vehicles have the same risk parameters:
- Driving more aggressively can help both to reach the target lane slightly earlier.
- Changing the risk parameters for all vehicles in the same way does not affect the resulting trajectories significantly and only introduces slightly more or less distance between vehicles.
When two vehicles have different risk parameters:
- The more aggressive that an EV drives, the fewer collision avoidance adjustments to its behavior are required before reaching the target lane.
- An EV’s more-aggressive driving style can contribute to reaching its target lane earlier.
- A TV’s more-conservative driving style can help the EV to reach the target lane earlier with fewer collision avoidance adjustments.

4.2.2. Resolving Conflicts

We now examine the role of the risk parameters in conflict situations. These conflicts were observed in the simulations of the previous subsection where two vehicles could not decide which of them had a higher priority to enter the target lane. This resulted in unnecessarily long lane change durations with oscillations around the target lane. We see that the aggressive vehicle typically dominated the behavior and reached the target lane earlier. When both vehicles had the same risk parameter, target lane and reference velocity, conflict situations often occurred.

We created this kind of conflict by (1) moving Vehicle 2 closer to Vehicle 1 in the longitudinal direction of the initial settings, adjusting the longitudinal initial position

x_{0}

of Vehicle 2, as described in Table 2, from 67 to 66

m

; and (2) setting the same risk parameter

0.95

for both vehicles. Thus, the vehicles were initially close to each other, shared the same target velocity of 27

{m s}^{- 1}

, and had the same target lane, the center lane; thus, they competed to occupy the center lane.

We investigated how the choice of risk parameters affects vehicle dominance by observing the position and the steering angle

δ

of the vehicles for different risk parameter pairs, including

(0.95, 0.95)

,

(0.95, 0.75)

, and

(0.75, 0.95)

as shown in Figure 5. We mark the time periods where an obvious conflict appears in gray. The oscillating behavior, which is seen in the steering angles in particular, indicates that both vehicles repetitively switched between attempting to approach the target lane and moving away from the target lane to avoid collisions.

Figure 5. The trajectories and steering angles of Vehicles 1 and 2 for different risk parameter pairs in an interactive scenario. The gray regions in the plots mark time periods of conflict. In sub-figure (a), to display the relative positions of vehicles, we drew the vehicles as small squares every 10 iterations and colored the squares in different shades of red and blue. (a) The trajectories of the vehicles. (b) The steering angles of the vehicles.

Figure 5a displays the trajectories of the vehicles for different risk parameter pairs. Figure 5b shows the corresponding steering angles. When both vehicles used the same risk parameter

0.95

, they remained in conflict until they longitudinally reached around 420 m at approximately iteration 70 and then exited the conflict situation. Reducing the risk parameter of Vehicle 2 from

0.95

to

0.75

helped both vehicles escape from the conflict situation even earlier—at around 160 m in the longitudinal direction and after around 20 iterations. Later, Vehicle 2 occupied the target lane most of the time, playing the dominant role (see the trajectories for

(0.95, 0.75)

). However, if we reduced the risk parameter of Vehicle 1 from

0.95

to

0.75

, the conflict situation did not appear anymore, and Vehicle 1 played the dominant role in terms of occupying the target lane (see the trajectories for

(0.75, 0.95)

).

Therefore, we can conclude that (1) reducing the risk parameter of one vehicle in the two-vehicle interactive system shortened or fully eliminates conflict; (2) the vehicle with a smaller risk parameter (more aggressive) tended to be the dominant one; additionally, (3) the same amount of risk parameter reduction for Vehicle 1 and Vehicle 2 had different effects on the conflict situations.

4.2.3. Risk Differences

In the previous discussion in Section 4.2.2, we found that maintaining a difference between the risk parameters of the two vehicles helped to either shorten or completely avoid conflict. However, it is also important to know whether the absolute value of the difference matters because this determines how much a vehicle should adjust its behaviors to escape from a conflict situation. Consequently, we decided to further investigate how gradually adjusting the risk parameters of one vehicle affected the resolution of the conflict.

We incrementally increased the risk parameter of Vehicle 1 from

0.75

to

0.95

, and the risk parameter of Vehicle 2 remained unchanged,

0.95

, resulting in the following risk parameter pairs:

(0.75, 0.95)

,

(0.80, 0.95)

,

(0.85, 0.95)

,

(0.90, 0.95)

, and

(0.95, 0.95)

. We evaluated the effects of these risk parameter pairs employing two metrics, the Distance Deviation (DD) and State Deviation (SD), introduced as follows:

DD: We consider the Euclidean distance between the centers of the two vehicles (different from the distance definition in Section 4.1). The DD is defined as the deviation between the Euclidean distances for any risk parameter pair and the Euclidean distance for the risk parameter pair $(0.75, 0.95)$ .
SD: The deviation between states and reference states, as defined below:

${err}_{ξ} = \sqrt{\frac{1}{N_{ite} + 1} \sum_{n = 0}^{N_{ite}} {(ξ_{n} - ξ_{n}^{ref})}^{2}}$

(22)

where ${err}_{ξ}$ ( ${err}_{ξ} = {({err}_{x}, {err}_{y}, {err}_{ψ}, {err}_{v})}^{⊺}$ ) represents the deviation between the real states $ξ_{n}$ and the corresponding reference states $ξ_{n}^{ref}$ during all $N_{ite}$ iterations.

The results for DD and SD are illustrated in Figure 6 and Figure 7, respectively.

Figure 6. Distance Deviations (DDs) for the vehicles with different pairs of risk parameters,

(0.75, 0.95)

,

(0.80, 0.95)

,

(0.85, 0.95)

,

(0.90, 0.95)

, and

(0.95, 0.95)

in an interactive scenario. To better see the details, we enlarged the first 32 iterations of the plot and show them on the top left side of the figure.

Figure 7. State Deviations (SDs) for the vehicles with different risk parameter pairs,

(0.75, 0.95)

,

(0.80, 0.95)

,

(0.85, 0.95)

,

(0.90, 0.95)

, and

(0.95, 0.95)

in an interactive scenario.

In Figure 6, the oscillations reflect conflict where both vehicles are struggling between reaching/maintaining the common target lane and moving away from the target lane to ensure safety, which causes variations in the distances between them. We conclude from the figure that: (1) the greater the risk parameter of Vehicle 1 (the more conservative), the larger the distance between the two vehicles, which is safer; and (2) a smaller risk parameter of Vehicle 1 can help the two-vehicle interactive system escape from the conflict situation earlier as demonstrated by the results that, for the risk parameter pairs

(0.85, 0.95)

,

(0.90, 0.95)

, and

(0.95, 0.95)

, the conflict situations end roughly after 14, 30, and 68 iterations, respectively.

We show the effect of different pairs of the risk parameters on the SD, including the deviations of the lateral position

{err}_{y}

, inertial heading

{err}_{ψ}

, and velocity

{err}_{v}

, in Figure 7. A greater risk parameter of Vehicle 1 (more conservative) causes larger state deviations for Vehicle 1, smaller deviations in the lateral positions and velocities for Vehicle 2, and larger inertial heading deviations for Vehicle 2. Therefore, when Vehicle 1 drives more conservatively, Vehicle 2 can benefit from the conservatism more in terms of reaching the target lane and reference velocity (see the first and third sub-figures in Figure 7). In contrast, this results in larger inertial heading deviations for both vehicles (see the second sub-figure in Figure 7) because they are trapped in the conflict situations for a longer time.

The effects of one vehicle’s driving style on the two-vehicle interactive system in conflict situations when the other vehicle drives conservatively are summarized as follows:

The vehicles benefit from the conservative driving style in terms of safety.
An aggressive driving style can help the two-vehicle interactive system escape conflict situations.
A vehicle driving more aggressively tends to reach its target lane and reference velocity earlier.

5. Conclusions and Future Work

In this paper, we introduced a Distributed Stochastic Model Predictive Control (DSMPC) framework for a system of vehicles that are coupled through their interactive controllers. Within this framework, each vehicle is controlled by Stochastic Model Predictive Control (SMPC), and each SMPC-controlled vehicle interacts with its TVs, attempting to drive safely at a certain level through the consideration of probabilistic collision-avoidance constraints. Based on this distributed control framework, we studied the effects of risk parameters, which decide vehicles’ driving styles, on non-interactive and interactive systems and provide insights into how to set risk parameters in a multi-SMPC-vehicle interactive system.

The simulation in non-interactive systems showed that, when an SMPC-controlled vehicle drives more conservatively, with a greater risk parameter, safety is increased. We found the same results in the simulations in interactive systems. Further, in interactive systems, an aggressive vehicle can reach its driving goals earlier, thus, requiring fewer adjustments to its behaviors. An individual vehicle driving conservatively can also help another vehicle to reach its driving goals earlier. Moreover, one vehicle can also influence the whole system by adjusting its own risk parameter. Vehicles might be trapped in conflict situations; therefore, they cannot decide which one has the higher priority to attain one’s driving goals if there are conflicts among the goals. Modifying the risk parameters of one vehicle can help both escape conflict situations; however, the vehicle with a smaller risk parameter tends to dominate the situations.

The results in interactive systems confirmed our hypotheses that the behaviors of one vehicle are not only determined by its own control and influenced by other vehicles’ behaviors but also can influence the performance of the whole system. These results can be generalized to vehicles that are controlled by other controllers in the future.

In our future controller design, incorporating a more realistic prediction of TV’s behaviors into the SMPC optimal-control problem will also be considered. In our current SMPC optimal-control problem, any EV assumes that its TVs will stay in their current lanes and maintain their current velocities. This is overly simplified and might cause huge deviations between the TVs’ real trajectories and the assumed ones from the perspective of the EV. Therefore, methods that provide more precise predictions of TV behaviors are required. Research into this will be performed in the future.

We performed simulations with two vehicles. In the future, we will research more complicated scenarios with multiple vehicles interacting with the surrounding vehicles.

Author Contributions

Conceptualization, N.D., M.L. and M.B.; methodology, N.D.; software, N.D. and T.B.; validation, N.D. and M.L.; formal analysis, N.D.; investigation, N.D., M.L. and Z.Z.; resources, N.D. and T.B.; data curation, N.D.; writing—original draft preparation, N.D.; writing—review and editing, N.D., M.L., T.B., Z.Z. and F.L.; visualization, N.D.; supervision, M.L. and M.B.; project administration, M.B.; funding acquisition, M.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data is unavailable due to privacy.

Acknowledgments

We gratefully acknowledge the valuable discussions on simulations with Tommaso Benciolini and academic English with Stephen Starck.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

MPC	Model Predictive Control
SMPC	Stochastic Model Predictive Control
DMPC	Distributed Model Predictive Control
DSMPC	Distributed Stochastic Model Predictive Control
EV	Ego Vehicle
TV	Target Vehicle
DD	Distance Deviation
SD	State Deviation

Appendix A. Kinematic Bicycle Model

The kinematic bicycle model is described by the following nonlinear continuous time equations [24],

\dot{x} = v cos (ψ + β)

(A1a)

\dot{y} = v sin (ψ + β)

(A1b)

\dot{ψ} = \frac{v}{l_{r}} sin β

(A1c)

\dot{v} = a

(A1d)

β = {tan}^{- 1} (\frac{l_{r}}{l_{f} + l_{r}} tan δ)

(A1e)

where x and y represent the longitudinal position and lateral position of the the center of mass of the vehicle, respectively. The inertial heading is given by

ψ

, and the velocity of the vehicle is denoted by v. The distances from the center of mass of the vehicle to the front and rear axles are

l_{f}

and

l_{r}

, respectively. The angle of the vehicle with respect to the longitudinal axis of the road is shown by

β

. The acceleration of the center of mass of the vehicle is represented by a. The front steering angle is

δ

. The state and input vectors are

ξ = {(x, y, ψ, v)}^{⊺}

and

u = {(a, δ)}^{⊺}

, respectively. The nonlinear continuous kinematic bicycle model are summarized as

\dot{ξ} = f^{c} (ξ, u)

.

Appendix B. Linearized and Discretized System Matrices

The linearized, discretized system matrices A and B [25] in model (5) are given by

A = [\begin{matrix} 1 & 0 & - T v sin z_{1} & T cos z_{1} - \frac{z_{2} sin z_{1}}{2 z_{4}} \\ 0 & 1 & T v cos z_{1} & T sin z_{1} - \frac{z_{2} cos z_{1}}{2 z_{4}} \\ 0 & 0 & 1 & \frac{T tan δ}{z_{4}} \\ 0 & 0 & 0 & 1 \end{matrix}]

and

B = [\begin{matrix} \frac{T^{2} cos z_{1}}{2} & - \frac{T^{2} v z_{7} sin z_{1}}{2} - \frac{z_{8} sin z_{1}}{z_{9}} \\ \frac{T^{2} sin z_{1}}{2} & \frac{T^{2} v z_{7} cos z_{1}}{2} + \frac{z_{8} cos z_{1}}{z_{9}} \\ \frac{T^{2} tan δ}{2 z_{4}} & T z_{7} \\ T & 0 \end{matrix}]

with

z_{1} = ψ + arctan (\frac{l_{r} tan δ}{l_{r} + l_{f}})

,

z_{2} = T^{2} v tan δ

,

z_{3} = {(l_{r} tan δ)}^{2}

,

z_{4} = (l_{r} + l_{f}) {(\frac{z_{3}}{{(l_{r} + l_{f})}^{2}} + 1)}^{\frac{1}{2}}

,

z_{5} = v ({(tan δ)}^{2} + 1)

,

z_{6} = {(l_{r} + l_{f})}^{3} {(\frac{z_{3}}{{(l_{r} + l_{f})}^{2}} + 1)}^{\frac{3}{2}}

,

z_{7} = \frac{z_{5}}{z_{4}} - \frac{z_{3} z_{5}}{z_{6}}

,

z_{8} = T l_{r} z_{5}

and

z_{9} = (l_{r} + l_{f}) (\frac{z_{3}}{{(l_{r} + l_{f})}^{2}} + 1)

.

Appendix C. Deterministic Collision-Avoidance Constraints for Multiple TVs

We introduce the deterministic collision-avoidance constraints for multiple TVs in this section.

With the tightened constraints in expressions (18a) and (19), the probabilistic chance constraints in (4) can be rewritten as the following deterministic expressions:

\{\begin{matrix} \{\begin{matrix} d_{i_{1}, k} \geq γ_{i_{1}, k} \\ γ_{i_{1}, k} = \sqrt{2 \nabla d_{i_{1}, k} \sum_{i_{1}, k} {(\nabla d_{i_{1}, k})}^{⊺}} e r f^{- 1} (2 p_{i} - 1) \end{matrix} \\ \{\begin{matrix} d_{i_{2}, k} \geq γ_{i_{2}, k} \\ γ_{i_{2}, k} = \sqrt{2 \nabla d_{i_{2}, k} \sum_{i_{2}, k} {(\nabla d_{i_{2}, k})}^{⊺}} e r f^{- 1} (2 p_{i} - 1) \end{matrix} \\ ⋮ \\ \{\begin{matrix} d_{i_{m}, k} \geq γ_{i_{m}, k} \\ γ_{i_{m}, k} = \sqrt{2 \nabla d_{i_{m}, k} \sum_{i_{m}, k} {(\nabla d_{i_{m}, k})}^{⊺}} e r f^{- 1} (2 p_{i} - 1) \end{matrix} \end{matrix}

(A2)

where

i_{1}, i_{2}, \dots, i_{m} \in N_{i}

and

k = 1, \dots, N

.

Appendix D. Simulation Setup

We describe the parameter settings in the simulations in Table A1.

Table A1. Parameter Settings.

Physical Meaning	Notation	Value
Length of road	$l^{road}$	1500 $m$
Width of lane	$w^{lane}$	5.25 $m$
Length of vehicle	$l^{veh}$	5 $m$
Width of vehicle	$w^{veh}$	2 $m$
Distance from mass center to front axle	$l_{f}$	2 $m$
Distance from mass center to rear axle	$l_{r}$	2 $m$
Lower boundary of road	$y^{r, l}$	0 $m$
Upper boundary of road	$y^{r, u}$	15.75 $m$
Minimum speed	$v^{\min}$	0 ${m s}^{- 1}$
Maximum allowable speed	$v^{\max}$	70 ${m s}^{- 1}$
Minimum inertial heading	$ψ^{\min}$	−1.2 $rad$
Maximum inertial heading	$ψ^{\max}$	1.2 $rad$
Minimum acceleration	$a^{\min}$	−9 ${m s}^{- 2}$
Maximum acceleration	$a^{\max}$	6 ${m s}^{- 2}$
Minimum front steering angle	$δ^{\min}$	−0.2 $rad$
Maximum front steering angle	$δ^{\max}$	0.2 $rad$
Semi-major axis	$s_{a}$	9 $m$
Semi-minor axis	$s_{b}$	5.5 $m$
Prediction horizon	N	10
Sampling time	T	0.2 $s$
Weighting matrix	Q	$diag (0, 0.5, 0.1, 1)$
Weighting matrix	R	$diag (3, 5)$

References

Carvalho, A.; Gao, Y.; Lefevre, S.; Borrelli, F. Stochastic predictive control of autonomous vehicles in uncertain environments. In Proceedings of the 12th International Symposium on Advanced Vehicle Control, Tokyo, Japan, 22–26 September 2014; pp. 712–719. [Google Scholar]
Soloperto, R.; Köhler, J.; Allgöwer, F.; Müller, M.A. Collision avoidance for uncertain nonlinear systems with moving obstacles using robust Model Predictive Control. In Proceedings of the 2019 18th European Control Conference (ECC), Naples, Italy, 25–28 June 2019; pp. 811–817. [Google Scholar] [CrossRef]
Suh, J.; Chae, H.; Yi, K. Stochastic Model-Predictive Control for Lane Change Decision of Automated Driving Vehicles. IEEE Trans. Veh. Technol. 2018, 67, 4771–4782. [Google Scholar] [CrossRef]
Brüdigam, T.; Olbrich, M.; Leibold, M.; Wollherr, D. Combining Stochastic and Scenario Model Predictive Control to Handle Target Vehicle Uncertainty in an Autonomous Driving Highway Scenario. In Proceedings of the 2018 21st International Conference on Intelligent Transportation Systems (ITSC), Maui, HI, USA, 4–7 November 2018; pp. 1317–1324. [Google Scholar] [CrossRef]
Brüdigam, T.; Olbrich, M.; Wollherr, D.; Leibold, M. Stochastic Model Predictive Control with a Safety Guarantee for Automated Driving. IEEE Trans. Intell. Veh. 2023, 8, 22–36. [Google Scholar] [CrossRef]
Dang, N.; Brüdigam, T.; Leibold, M.; Buss, M. Combining Event-Based Maneuver Selection and MPC Based Trajectory Generation in Autonomous Driving. Electronics 2022, 11, 1518. [Google Scholar] [CrossRef]
Heirung, T.A.N.; Paulson, J.A.; O’Leary, J.; Mesbah, A. Stochastic model predictive control—How does it work? Comput. Chem. Eng. 2018, 114, 158–170. [Google Scholar] [CrossRef]
Kouvaritakis, B.; Cannon, M.; Raković, S.V.; Cheng, Q. Explicit use of probabilistic distributions in linear predictive control. In Proceedings of the UKACC International Conference on Control 2010, Coventry, UK, 7–10 September 2010; pp. 1–6. [Google Scholar] [CrossRef]
Maroto, J.; Delso, E.; Felez, J.; Cabanellas, J.M. Real-Time Traffic Simulation with a Microscopic Model. IEEE Trans. Intell. Transp. Syst. 2006, 7, 513–527. [Google Scholar] [CrossRef]
Zambrano-Martinez, J.L.; Calafate, C.T.; Soler, D.; Cano, J.C.; Manzoni, P. Modeling and Characterization of Traffic Flows in Urban Environments. Sensors 2018, 18, 2020. [Google Scholar] [CrossRef]
Min, W.; Wynter, L. Real-time road traffic prediction with spatio-temporal correlations. Transp. Res. Part C Emerg. Technol. 2011, 19, 606–616. [Google Scholar] [CrossRef]
Althoff, M.; Stursberg, O.; Buss, M. Model-Based Probabilistic Collision Detection in Autonomous Driving. IEEE Trans. Intell. Transp. Syst. 2009, 10, 299–310. [Google Scholar] [CrossRef]
Yang, B.; Monterola, C. A simple distributed algorithm for lightless intersection control based on non-linear interactions between vehicles. In Proceedings of the 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, 16–19 October 2017; pp. 1–6. [Google Scholar] [CrossRef]
Dai, L.; Xia, Y.; Gao, Y.; Kouvaritakis, B.; Cannon, M. Cooperative distributed stochastic MPC for systems with state estimation and coupled probabilistic constraints. Automatica 2015, 61, 89–96. [Google Scholar] [CrossRef]
Dai, L.; Xia, Y.; Gao, Y.; Cannon, M. Distributed Stochastic MPC of Linear Systems with Additive Uncertainty and Coupled Probabilistic Constraints. IEEE Trans. Autom. Control 2017, 62, 3474–3481. [Google Scholar] [CrossRef]
Zhao, G.; Yang, S. Distributed stochastic MPC for linear systems with probabilistic constraints and quantisation. IET Control Theory Appl. 2020, 14, 396–404. [Google Scholar] [CrossRef]
Dunbar, W.B.; Murray, R.M. Distributed receding horizon control for multi-vehicle formation stabilization. Automatica 2006, 42, 549–558. [Google Scholar] [CrossRef]
Dunbar, W.B.; Caveney, D.S. Distributed Receding Horizon Control of Vehicle Platoons: Stability and String Stability. IEEE Trans. Autom. Control 2012, 57, 620–633. [Google Scholar] [CrossRef]
Zheng, Y.; Li, S.E.; Li, K.; Borrelli, F.; Hedrick, J.K. Distributed Model Predictive Control for Heterogeneous Vehicle Platoons Under Unidirectional Topologies. IEEE Trans. Control Syst. Technol. 2017, 25, 899–910. [Google Scholar] [CrossRef]
Liu, P.; Kurt, A.; Ozguner, U. Distributed Model Predictive Control for Cooperative and Flexible Vehicle Platooning. IEEE Trans. Control Syst. Technol. 2019, 27, 1115–1128. [Google Scholar] [CrossRef]
Gao, Y.; Lin, T.; Borrelli, F.; Tseng, E.; Hrovat, D. Predictive Control of Autonomous Ground Vehicles with Obstacle Avoidance on Slippery Roads. In Proceedings of the ASME 2010 Dynamic Systems and Control Conference, Cambridge, MA, USA, 12–15 September 2010. [Google Scholar] [CrossRef]
Levinson, J.; Askeland, J.; Becker, J.; Dolson, J.; Held, D.; Kammel, S.; Kolter, J.Z.; Langer, D.; Pink, O.; Pratt, V.; et al. Towards fully autonomous driving: Systems and algorithms. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden-Baden, Germany, 5–9 June 2011; pp. 163–168. [Google Scholar] [CrossRef]
Carvalho, A.; Gao, Y.; Gray, A.; Tseng, H.E.; Borrelli, F. Predictive control of an autonomous ground vehicle using an iterative linearization approach. In Proceedings of the 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013), Hague, The Netherlands, 6–9 October 2013; pp. 2335–2340. [Google Scholar] [CrossRef]
Kong, J.; Pfeiffer, M.; Schildbach, G.; Borrelli, F. Kinematic and dynamic vehicle models for autonomous driving control design. In Proceedings of the 2015 IEEE Intelligent Vehicles Symposium (IV), Seoul, Republic of Korea, 28 June–1 July 2015; pp. 1094–1099. [Google Scholar]
Brüdigam, T.; Olbrich, M.; Wollherr, D.; Leibold, M. Stochastic Model Predictive Control with a Safety Guarantee for Automated Driving: Extended Version. arXiv 2020, arXiv:2009.09381. [Google Scholar] [CrossRef]
Lorenzen, M.; Dabbene, F.; Tempo, R.; Allgöwer, F. Constraint-Tightening and Stability in Stochastic Model Predictive Control. IEEE Trans. Autom. Control 2017, 62, 3165–3177. [Google Scholar] [CrossRef]
Grüne, L.; Pannek, J. Nonlinear Model Predictive Control: Theory and Algorithms, 2nd ed.; Communications and Control Engineering; Springer: Cham, Switzerland, 2017. [Google Scholar]

Figure 1. Communication topology at one time step. This topology shows, at one time step, which of the surrounding vehicles is considered in the controller of one particular vehicle. This topology is updated at each time step.

Figure 2. A two-vehicle scenario. There are two vehicles on a three-lane highway. Vehicle 1, in red, is non-reactive in a non-interactive system but reactive in an interactive system and will remain in the center lane. Vehicle 2, in blue, is an SMPC-controlled vehicle, starting in the right, slow lane and later changing into the center lane. The longitudinal and lateral directions are represented by x and y, respectively.

Figure 3. Distance deviations in a non−interactive two−vehicle scenario. The six colored lines represent deviations between the distances for all risk parameters (0.70, 0.75, 0.80, 0.85, 0.90, and 0.95) and the distance for the risk parameter 0.95 during the whole 12 iterations, respectively. The iteration is represented by

t / T

, where t is the time, and T denotes the sampling time.

Figure 4. Lateral positions in an interactive two-vehicle scenario. The risk parameter pairs

(p_{1}, p_{2})

for Vehicle 1 and Vehicle 2 are specified in the legend of the figures.

Figure 5. The trajectories and steering angles of Vehicles 1 and 2 for different risk parameter pairs in an interactive scenario. The gray regions in the plots mark time periods of conflict. In sub-figure (a), to display the relative positions of vehicles, we drew the vehicles as small squares every 10 iterations and colored the squares in different shades of red and blue. (a) The trajectories of the vehicles. (b) The steering angles of the vehicles.

Figure 6. Distance Deviations (DDs) for the vehicles with different pairs of risk parameters,

(0.75, 0.95)

,

(0.80, 0.95)

,

(0.85, 0.95)

,

(0.90, 0.95)

, and

(0.95, 0.95)

in an interactive scenario. To better see the details, we enlarged the first 32 iterations of the plot and show them on the top left side of the figure.

Figure 7. State Deviations (SDs) for the vehicles with different risk parameter pairs,

(0.75, 0.95)

,

(0.80, 0.95)

,

(0.85, 0.95)

,

(0.90, 0.95)

, and

(0.95, 0.95)

in an interactive scenario.

Table 1. Initial settings for a non-interactive two-vehicle scenario.

	$x_{0}$	$y_{0}$	$ψ_{0}$	$v_{0}$	$y_{ref}$	$v_{ref}$
Vehicle 1	50	7.875	0	27	7.875	27
Vehicle 2	72	2.625	0	24	7.875	30

Table 2. Initial settings for an interactive two-vehicle scenario.

	$x_{0}$	$y_{0}$	$ψ_{0}$	$v_{0}$	$y_{ref}$	$v_{ref}$
Vehicle 1	50	7.875	0	27	7.875	27
Vehicle 2	67	2.625	0	25	7.875	27

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Distributed Stochastic Model Predictive Control for a Microscopic Interactive Traffic Model

Abstract

1. Introduction

2. Model of the Multi-Vehicle System

2.1. Communication Topology

2.2. Vehicle Controllers

3. Elements of the SMPC Problem

3.1. Vehicle Models

3.1.1. Linear Discrete-Time Model

3.1.2. Model of EVs

3.1.3. Model of TVs

3.1.4. Reformulation of the TV Model

3.2. Constraints

3.2.1. Hard Constraints

3.2.2. Collision-Avoidance Constraints

3.2.3. Constraint Tightening

3.3. Cost Function

3.4. Control Algorithm for One Vehicle

4. Simulation Results

4.1. The Effects of Risk Parameters on an Individual Vehicle

4.2. The Effects of Risk Parameters on Interactive Systems

4.2.1. The Same vs. Different Risk Parameters

4.2.2. Resolving Conflicts

4.2.3. Risk Differences

5. Conclusions and Future Work

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Kinematic Bicycle Model

Appendix B. Linearized and Discretized System Matrices

Appendix C. Deterministic Collision-Avoidance Constraints for Multiple TVs

Appendix D. Simulation Setup

References

Article Metrics

Citations

Article Access Statistics