1. Introduction and Motivation
Providing control strategies for automated vehicles has become a subject of significant focus in research and development centers of vehicle industry. An important task is to design a velocity profile for vehicles in order to guarantee effective, comfortable, safe and economical traffic by exploiting vehicle dynamics and environmental circumstances, e.g., certain characteristics of fuel consumption, delivered cargo, road inclinations, speed limits, traffic flow and traffic forecast. The complexity of the control task results in various performance requirements which must be simultaneously guaranteed by the control system [
1].
Existing control solutions result in speed profiles for automated vehicles, which differ from the speed selection strategy of human drivers. The reason for this is that the control systems of automated vehicles can obtain information on the road ahead, e.g., the usage of the road capacity or the upcoming downhill terrain characteristics. At the same time, most of these data are not available for human drivers. It can be shown that the speed selection of automated vehicles and human-driven vehicles are not independent from each other. Consequently, the motion of human-driven vehicles must be adapted to automated vehicles. Moreover, the ratio of automated vehicles () in the entire traffic network also influences the characteristics of the traffic flow through the modification of traffic speed.
1.1. Motivation Example on the Variation of Traffic Flow
Figure 1 presents an illustration of the impact of
on the variation in average traffic speed. In this scenario, a 20 km-long three-lane (outer, middle and inner lanes) segment of the hilly Hungarian M1 highway is modeled in a VISSIM traffic simulator. In the simulation scenario, the speed limit is 130 km/h, except for the section between 6 and 8 km, where the speed limit is 90 km/h. Without automated vehicles
, the average traffic speed is close to the speed limit, as can be seen in
Figure 1a. When there is an increased number of automated vehicles on the highway
, the average traffic speed varies due to the automated vehicles, whose speed profiles are also influenced by the uphill and downhill sections. The latter case results in an approximately
decrease in energy consumption, within the entire traffic, compared to the previous case, i.e., without automated vehicles.
Figure 2 shows further examinations of the traffic flow in the distinct segments of the highway.
q denotes the inflow of the motorway (veh/h).
Figure 2a,b show the results of the scenario in the cases of the previous example, i.e.,
and
, respectively. In these examples, the inflow into the highway section is
veh/h.
Figure 2c presents the result of the scenario, in which the ratio of automated vehicles is
. Moreover,
Figure 2d presents the result of the scenario, in which the inflow into the highway section is significantly increased to
veh/h, and
. Through the illustrations, it can be concluded that increasing the ratio
and/or increasing the inflow
has a significant impact on traffic flow.
1.2. Brief Literature Overview on the Related Achievements
Since automated vehicles have a significant impact on traffic flow, it is necessary to take the modeling and control design of the traffic system into consideration. A comprehensive overview of different classical control solutions for ramp metering has been proposed by [
2,
3]. Moreover, the vehicle-to-infrastructure (V2I) communication provides new perspectives in the control of the traffic system, because a huge amount of data about the motion of the vehicles in the traffic network can be obtained [
4]. This allows a more sophisticated prediction of the upcoming traffic scenario, in which the effectiveness of the traffic control can be improved.
The exploitation of the result of the traffic flow analysis in the modeling and control of automated vehicles is one of the hot topics among researchers, as can be seen in (e.g., [
5]). The most important approaches of traffic flow modeling were summarized by [
6]. The analysis of the traffic flow, in which semi-automated and automated vehicles travel alongside human-driven vehicles, was proposed by [
7]. Stability issues of the traffic flow of connected and automated vehicles were examined by [
8]. In [
9], it has been shown that automated vehicles have only slightly negative effects but significant positive effects on the traffic flow depending on the penetration rate of the automated vehicles and the traffic scenario. Interactions between automated and human-driven vehicles were presented in [
10]. The authors in [
11] highlighted new achievements in the construction of a macroscopic fundamental diagram for traffic flow with automated vehicles. The presented data analysis results revealed that the traditional triangular fundamental diagram structure remains applicable to describe the traffic flow characteristics of traffic with automated vehicles. For mixed traffic, a parsimonious formula was provided to estimate the fundamental diagram with measures from pure human-driven traffic and pure automated vehicle traffic. Control-oriented applications are strongly connected to the modeling of traffic flow. For example, in [
12], a network-level coordination for automated vehicle control and traffic light control was presented, i.e., a distributed optimization scheme to reduce the computational complexity and to improve the effectiveness of coordination was developed. A more complex problem was that of control in mixed traffic, because the motion of automated vehicles and the motion of human-driven vehicles simultaneously impact the traffic flow. The interactions between human-driven and autonomous vehicles in optimal control synthesis for tolls were studied by [
13].
Based on the huge amount of data on traffic, a novel data-driven approach for the analysis and modeling of traffic flow dynamics was proposed by [
14]. Traffic flow prediction using a deep-learning algorithm was presented by [
15]. In that study, a deep-learning architecture model was applied by using auto-encoders as building blocks to represent traffic flow features for prediction. Similarly, Lasso regression was used for traffic flow prediction in [
16]. Cell phone information-based big data analysis and control for transportation purposes was proposed by [
17]. The work of [
18] focused on generating models for microscopic traffic simulation, which was built upon real-world data. The identification and prediction of traffic flow states based on the big data analysis method was presented by [
19]. An increased number of achievable information on traffic flow was used for training deep neural networks, which were then able to predict traffic flow under urban traffic scenarios, as can be seen in [
20]. A deep learning method using convolutional neural network and long short-term memory architectures for monitoring traffic flow in urban region was provided by [
21]. The fusion-based technique resulted in high accuracy based on the evaluation through simulation scenarios.
1.3. Proposed Methodology of the Paper
The overview of the literature shows that several approaches exist for modeling traffic flow dynamics, but most of these are related to pure human-driven or automated vehicles. The modeling and analysis of mixed traffic flow is an emerging research field, in which partial solutions have been achieved. Modeling methods with classical model-based approaches do exist [
11], as do those with unconventional, e.g., network-level [
12] or data-driven approaches [
20]. Methodologically, the classical traffic modeling methods are based on physical relationships, in which the nonlinear characteristics of the traffic flow are described. The advantage of these methods is that the traffic flow model provides theoretical fundamentals for designing a controller with guaranteed performances [
22]. Nevertheless, their drawback is the increased uncertainty concerning the modeling of short-time traffic flow, due to robustness features. In the case of data-driven modeling methods, the actual measured information on the traffic flow is highly relevant, i.e., the short-time prediction of forthcoming traffic flow can be more accurate due to the actual information. However, the data-driven modeling solutions also have disadvantages. In the case of neural-network-based formulation, the evaluation of the prediction/control process is challenging, while in the case of a regression-based formulation, the accuracy of long-time prediction can be limited.
In spite of the various existing methods, it is difficult to find a systematic control-oriented modeling method which exploits the advantages of data-driven analysis. The aim of this paper was to integrate data-driven traffic flow prediction in a polynomial model-based traffic flow model and thus, improve the accuracy of the prediction. The proposed novel prediction method for control design purposes was applied, i.e., an optimization problem of traffic flow volume with a ramp metering by the min–max principle was provided. The resulting control system provides robustness against disturbances of the system, i.e., the uncontrolled inflows of the traffic network. A novel contribution of this paper is that the proposed prediction model is able to handle the presence of automated vehicles in the traffic network at the level of data-driven prediction as well as at the level of the polynomial traffic flow model.
The prediction and control process is illustrated in
Figure 3. The proposed prediction method has two components, namely data-driven prediction and model-based prediction. Their outputs are the predicted flow volumes
at the end of the highway section on a
horizon. The outputs of the predictions were integrated into a final prediction
, which is used in the control process. The output of the control comprises the controlled inflows on
N number of ramps
on horizon
. The control inputs on the entire horizon are used in the prediction process and their values at the
step are used as control inputs of the traffic network. Different measurements on the traffic network for each prediction block are used as inputs and thus, the control loop is closed.
The paper is organized as follows. In
Section 2, the data-driven estimation of the traffic flow is presented. This is performed through a two-step analysis. First, a subset selection method is applied, which is able to prioritize the main attributes based on their relations to the measured signals. Second, a linear regression model using the least squares (LS) method is applied to derive a relationship between the attributes and the traffic flow.
Section 3 proposes an enhanced traffic flow model, which results from the interconnection of the data-driven prediction and the classical traffic flow modeling.
Section 4 proposes the traffic flow control. The effectiveness of the novel optimal control is demonstrated through simulation scenarios. In this paper, the VISSIM complex traffic simulator was used the modeling and simulation of the traffic network, while the WEKA data-mining software was used in the LS-based analysis, as can be seen in [
23]. Finally,
Section 6 summarizes with some concluding remarks.
3. Formulating Control-Oriented Traffic Flow Prediction Model
In this section, the control-oriented model for traffic flow dynamics, in which the data-driven prediction model and traffic modeling principles are used together, is proposed.
In the formulation of traffic dynamics, the traffic network is gridded into
N number of segments. The traffic flow of each segment is represented by a dynamical equation, which is based on the law of conservation. The relationship contains the sum of inflows and outflows for a given segment
i. Traffic density
(veh/km) is expressed as
where
k is the index of the discrete time step;
T is the discrete sample time;
is the length of the segment;
(veh/h) and
(veh/h) are the inflow of the traffic in segments
i and
, and finally,
(veh/h) is the sum of the controlled ramp inflow.
Another important relation of the traffic dynamics is the fundamental relationship which creates a connection between outflow
, traffic density
and average traffic speed
[
26]. The fundamental relationship is formed as
Conventionally, the fundamental relationship is derived through historic measurements and depends on several factors, as can be seen in, e.g., [
27,
28].
and
are formed in the traffic flow model as nonlinear functions of traffic density [
29], such as
and
, where
are nonlinear functions. In the modeling of the traffic flow dynamics, the impact of
on
can be considered in the nonlinear function of
, such as
Function
can be effectively formulated through polynomial relationships [
30]. An example of
, which depends on the
and the presence of automated vehicles in the traffic, is illustrated in
Figure 6.
The dynamics of the traffic flow based on (
15) and (
17) is written as
which contains the nonlinear characteristics of the fundamental diagram
. The advantage of the expression (
18) is that it incorporates the nonlinear behavior of the traffic flow dynamics, and thus, (
18) can be effective for the long-term prediction of
through
. However,
is derived based on historic measurements, which means that it has an increased error in terms of short-term prediction. Consequently, (
18) uses only a small number of actual data.
The following model combines the data-driven prediction model and the conventional traffic model. The purpose of this solution is to eliminate the drawbacks of each method in the prediction. The highway in the case of the conventional traffic model is formed as a queue with one segment
, while in the case of the data-driven model, the highway is divided into
N number of sections. The prediction of the outflow of the highway is as follows:
where
is the prediction from the data-driven model and
is the prediction from the traffic model. The following form is applied:
, in which
. Moreover, the forgetting factor
, is introduced, which depends on the step of the prediction
p. In the case of
,
has high value, while the increase in
p leads to the reduction in
. Through the modification of
, a balance of the prediction can be achieved. In a short-term prediction,
has priority, while in a long-term prediction,
has priority.
The prediction of the outflow at
is formed as
The factors are the following.
is expressed in the following form:
and thus,
is also used in the prediction.
is expressed based on (
14), such as
where
results from (
14) and
for all
k.
Through the proposed traffic model, the prediction of the traffic flow can be updated through the results of the data-driven analysis. Through the increase in the value p, the impact of the data-driven model through is reduced; consequently, the emphasis of the nonlinear characteristics of the traffic flow is considered.
4. Optimal Control Design for Traffic Flow Maximization
The design of an optimal control was based on the previously formed enhanced traffic flow model. The purpose of the control design is to guarantee the maximum outflow of the traffic network through the inflow of the controlled ramp . Since the system contains disturbances, their impact on must be reduced. This leads to an optimization task.
In the control design of the traffic system, the following performance requirements must be guaranteed.
It is necessary to achieve the maximum outflow of the traffic network
, such as:
where
represents the length of the horizon. This performance specification is advantageous compared to the classical solution, which is based on the setting of
related to critical density
, as can be seen in, e.g., [
1]. Namely, the setting of
requires preliminary knowledge on the critical density of the traffic flow. Nevertheless, it depends on several factors, as can be seen in [
31]. At the same time, in the proposed control strategy, data are obtained from the traffic system through data-driven analysis. This means that the result of the optimization can be effective without a fixed fundamental diagram.
In the control task, the control inputs must be as small as possible. The control input
,
is a positive variable, which has physical limits. Moreover, it is necessary to limit its variation to prevent the rapid change during the actuation. Therefore, two constraints on
are defined, such as
where
is the maximum of
and
is its maximum variation.
The traffic system has also disturbances within the inflow data,
because the number of future entering vehicles is unknown, even if the routes of the automated vehicles are assumed to be known. Due to the
ratio of automated vehicles,
can be divided into known (
) and unknown (
) disturbances in the following way:
The known disturbances can be incorporated in the traffic flow maximization procedure as constant values. Thus, only a part of the disturbances, i.e., the unknown disturbances, can be handled in a worst-case scenario.
The form of the control task requires three components. First, the performance in (
23) was used as an objective of optimization. Second, the performance (24) is handled as a constraint in the control design. Third, disturbance
in (
25) must be minimized in
. Thus, the control design leads to a min–max task:
with the following constraints:
where
are the bounds of the unknown disturbance. The results of the optimization are the intervention signals
and the values of the unknown disturbances
. The control signal at time step
k is
.
The solution of task (
26) requires the joint handling of the minimization and maximization tasks. In practice, these are separated and an iterative solution is applied.
First, the minimization task is solved for initial fixed values
. The solution is achieved by an optimization algorithm, which is able to handle nonlinear constraints, as can be seen in, e.g., [
32]. This results in
values, which are used as fixed values during the solution of the maximization task in (
26).
Second, the maximization task is also solved by using values. The maximization task also results in values.
The iteration procedure is stopped, when the relative errors of the solutions , in the actual and the previous steps are smaller than a predefined value.
5. Simulation Examples
In this section, the effectiveness of the proposed method is illustrated through simulation examples. In the examples, a 5 km-long highway section with three lanes for was selected illustration purposes. In the example,
was set, meaning that the traffic flow contains a significant number of automated vehicles. Several simulations were performed in the VISSIM traffic simulator in order to determine the function
. The illustration of a selected scenario is shown in
Figure 7a.
Figure 7a shows the variation of traffic flow volume, depending on highway section (axis X) and on time (axis Y). Here, the values of the volume in several small sections of the highway depending on the time are shown. There is a congestion at the end of the highway section at 10 min, which has a significant impact on the entire highway section. The reduced traffic flow has reached the 1 km section point at 20 min. The example shows that the dynamics of the traffic flow is close to the experience in the context of highway scenarios. In the illustration, the simulated data are synchronized, which is guaranteed by the data acquisition process in VISSIM.
Figure 7b shows the derived fundamental diagram of the highway. The data for the determination of
were obtained by various VISSIM simulations. The volume–density pairs of all simulation scenarios on this figure were matched through their time stamp in VISSIM.
was approximated to a sixth-order polynomial form.
The effectiveness of the outflow prediction is illustrated in
Figure 8. The example presents the effectiveness of the integration of the data-driven and the classical traffic models for the prediction of the forthcoming traffic flow.
Figure 8 shows the real outflow, which is approximated in three ways. First,
illustrates the prediction made by the classical traffic model (
18). In the simulation, the model uses
as new data and
as its state in all time step
k. Although the model predicts the forthcoming congestion for high
p values, it has a higher prediction error at low
p values. Second, the data-driven prediction is shown by
. The prediction model (
14) uses
as new data for all
k step and
states in the computation of
. This model is unable to predict non-linearities due to its linear form. Consequently,
predicts the real outflow for small
p values, while for increased
p values,
significantly differs from the real-traffic flow. Third, the results of the interconnected model are shown in
Figure 8. In the interconnection between
and
, the
function is selected as
if
, otherwise
. The selection guarantees the smooth transition between the interconnected signals. The predicted outflow approximates the real outflow on the entire term of the prediction.
The operation of the controlled system is shown in
Figure 9. In the example, the 5 km-long-highway section has an on-ramp at the highway segment 1 km. The inflow on the ramp
r can be controlled by traffic light. The purpose of the control is to guarantee the maximum outflow at the end of the highway section through the actuation of
r. The inflows represent a traffic scenario, in which there is a rush hour in the middle of the simulation. The
inflow is shown in
Figure 9a and the inflow demand on the on-ramp
is shown in
Figure 9b. The inflow
is limited by the computed
r, which results in the real inflow
. In the simulation, the variation limit is set to
veh/h. Another result of the min–max optimization task is
, which is illustrated in
Figure 9c. The limits of
are
veh/h and
veh/h. The role of the computation of
is to characterize the worst-case scenario. For example, between 250 s and 1000 s, the worst-case is
, which facilitates reduced outflow through a congestion. Although
overestimates the real value of the unknown
, it guarantees the avoidance of the traffic jam. The achieved outflow of the highway is illustrated in
Figure 9d. It can be seen that the performance of the control, i.e., the maximization of the outflow, is achieved.
Figure 10 presents two counter examples to illustrate the effectiveness of the proposed control strategy.
Figure 10a illustrates the scenario without control, which means that the
inflow enters the highway. Due to the increasing
and
, the traffic flow on the highway is over saturated, which leads to a congestion, between 900 s and 1500 s. The comparison of
Figure 9d and
Figure 10a shows the effectiveness of the traffic control system, i.e., the outflow is significantly higher due to the avoidance of congestion.
The following simulation example is shown in
Figure 10b,c. In this scenario, the control strategy is modified by the simplification of the optimization task (
26), which leads to the following form:
such that
This means that the worst-case scenario is not considered during optimization. This results in increased inflow on the ramp, as can be seen in
Figure 10b. In this scenario, the limitation of the inflow started at 450 s, while in the scenario of
Figure 10b,
r is reduced after 250 s. Although the achieved outflow is similar until 1000 s, its significant reduction in the simulation between 1000 s and 1500 s was due to the over saturation of the traffic flow.