Hybrid Model-Based Traffic Network Control Using Population Games

Amaya, Sindy Paola; Ñañez, Pablo Andrés; Martínez Vásquez, David Alejandro; Calderón Chávez, Juan Manuel; Mateus Rojas, Armando

doi:10.3390/asi8040102

Open AccessArticle

Hybrid Model-Based Traffic Network Control Using Population Games

by

Sindy Paola Amaya

^1,*,

Pablo Andrés Ñañez

²,

David Alejandro Martínez Vásquez

^1,3,*

,

Juan Manuel Calderón Chávez

^1,4 and

Armando Mateus Rojas

¹

Electronic Engineering Faculty, Universidad Santo Tomás, Bogotá 110231, Colombia

²

Lockheed Martin, RMS (Rotary and Mission Systems), Moorestown, NJ 08057, USA

³

Department of Technology, Universidad Pedagógica Nacional, Bogotá 110221, Colombia

⁴

Department of Computer Science & Engineering, Bethune-Cookman University, Daytona Beach, FL 32114, USA

^*

Authors to whom correspondence should be addressed.

Appl. Syst. Innov. 2025, 8(4), 102; https://doi.org/10.3390/asi8040102

Submission received: 20 June 2025 / Revised: 17 July 2025 / Accepted: 23 July 2025 / Published: 25 July 2025

Download

Browse Figures

Review Reports Versions Notes

Abstract

Modern traffic management requires sophisticated approaches to address the complexities of urban road networks, which continue to grow in complexity due to increasing urbanization and vehicle usage. Traditional methods often fall short in mitigating congestion and optimizing traffic flow, inducing the exploration of innovative traffic control strategies based on advanced theoretical frameworks. In this sense, we explore different game theory-based control strategies in an eight-intersection traffic network modeled by means of hybrid systems and graph theory, using a software simulator that combines the multi-modal traffic simulation software VISSIM and MATLAB to integrate traffic network parameters and population game criteria. Across five distinct network scenarios with varying saturation conditions, we explore a fixed-time scheme of signaling by means of fictitious play dynamics and adaptive schemes, using dynamics such as Smith, replicator, Logit and Brown–Von Neumann–Nash (BNN). Results show better performance for Smith and replicator dynamics in terms of traffic parameters both for fixed and variable signaling times, with an interesting outcome of fictitious play over BNN and Logit.

Keywords:

population dynamics; hybrid system; traffic network; game theory; graph theory

1. Introduction

The continuous growth of cities, especially in developing countries, has produced a considerable increase in the number of vehicles circulating through roads that were not originally designed to accommodate such heavy traffic loads, particularly during rush hours, in which, according to the INRIX Global Traffic Scorecard [1] and the TOMTOM Traffic Index [2], people spent more than 157 h in cities like Bogotá and Lima in 2023. This problem has historically been addressed by means of limited alternatives such as the increase in lanes, circulation restrictions depending on the last number of the car plate, and the banning for heavy vehicles to transit on weekends in some city zones or even country regions, to mention a few, which only tackle the problem in a partial form. In this sense, it is crucial to optimize the use of the available infrastructure for traffic control, such as traffic lights, cameras, and general signaling [3], in order to involve decision making about the green times in intersections considering street lengths, input and output flows, and street capacity to store vehicles, among other traffic parameters. Often, researchers focus their work on traffic signal timing (TST) and traffic signal control (TSC) optimization to develop mechanisms that assure the right of way (ROW) for multiple road users in two different strategy types, one based on fixed time, whose scheme defines the traffic lights plan according to historical data, and other that follows an adaptive schedule considering the traffic variability to assign dynamic times for traffic lights [4].

Fixed-time strategies [5], in spite of their limited applicability in environments with high traffic variability, continue to be widely employed in situations where technological infrastructure upgrades are not feasible. However, modern techniques have emerged to address this challenge, including nonlinear combinatorial programming models for pedestrian and vehicular signal plans [6], and the implementation of platoon size control based on lane dimensions for connected and automated vehicles (CAVs) [7]. Recently, fixed-time strategies have primarily served as benchmarks for comparing performance with novel adaptive approaches. These approaches leverage machine learning algorithms, IoT-based traffic control focused on CAVs [8], semi-actuated networks [8], and multi-agent methods [9,10], as well as many game theory-based controllers that consider not only vehicles but also pedestrian dynamics [11].

In terms of adaptive strategies, multiple solutions have been proposed to integrate multi-agent and game theory concepts [12,13,14,15,16,17]. These strategies aim to improve real-time control over green light cycles and tackle vehicular congestion challenges in contexts involving conventional car drivers, pedestrians, and autonomous vehicles (AV). In this regard, in [18], authors propose a game theory based multi-agent algorithm called GAMEPLAN that determines the optimal action for each agent based on their driving style (passive or aggressive), considering AV and drivers as agents with hidden intentions. In [19], a mixed traffic environment including CAVs and human-driven vehicles (HDVs) is studied using a model involving adaptive dynamic programming (ADP) techniques, alongside quadratic zero-sum games to minimize external disturbances that cause stop and go waves in vehicular platoons. Another mixed study, this time involving pedestrians and vehicles, is proposed in [16]. In this analysis, authors employ a Nash bargaining solution to determine the optimal green time that reduces pedestrian and vehicle delays. Bargaining games, supported by predictive control techniques, have also been employed in [20] to establish an operating agreement among vehicles within a platoon, demonstrating acceptable performance in vehicular network coordination. In general, game theory and multi-agent systems have provided approaches for vehicular time delay reduction [17,21,22,23], space allocation control in vehicular platoons [24], AV driving strategies based on velocity control considering road networks’ structure [25], and traffic signal time modeling in intersections, where cooperating players or agents represent phase sequences as in [26], or agents, which model intersections, work in non-cooperative schemes based on reinforcement learning, or in a cooperative way using Pareto optimal strategy [27].

From the perspective of traffic system modeling, simulation tools offer relevant capabilities for analyzing a wide range of traffic conditions. These tools allow to set control parameters, such as cycle time, phase sequence, and cycle length, enabling detailed examination of various scenarios. Traditionally, simulation tools have been classified based on the level of detail that they consider regarding to the traffic actors in three types: microscopic, macroscopic and mesoscopic. Microscopic models delve into individual interactions between drivers and pedestrians. Between the most popular platforms of this type of models are AIMSUN, CORSIM, MATSim, Paramics, SUMO and VISSIM, with the latter being the most widely accepted in the research community [4]. Macroscopic models, exemplified in platforms such as VISUM, TRANSYT-7F, and FREFLO, among others, provide a general view of the entire traffic network. A Mesoscopic model, on the other hand, combines the above mentioned approaches. Most of the mentioned modeling approaches are focused on traffic parameters for vehicular and pedestrian congestion control without the incorporation of additional and novel techniques, such as game theory, which is of primary interest in this work. Additionally, none of these works, which are heavily focused on techniques related to game theory and multi-agent algorithms, have modeled the traffic system as a hybrid system that represents vehicle storage as a continuous system and the transitions between network modes caused by traffic light changes as a discrete system. Furthermore, these studies have primarily focused on analyzing delay times, without considering other important factors of the traffic network, such as the number of stops, total travel time, average speed, and total distance traveled, among others, which are crucial for determining the relevance of an algorithm.

In this sense, the main contributions of this work, which, to the best of our knowledge, have not been considered in previous research, are described below.

First, we implement a traffic simulator that integrates the microscopic simulation tool VISSIM with MATLAB to evaluate the dynamics of various evolutionary game-theoretic techniques within a traffic network. This network employs a hybrid signaling system, combining continuous dynamics—modeling vehicular storage changes—with discrete dynamics—representing transitions between network modes driven by traffic light changes. This hybrid approach, combined with control strategies based on evolutionary game theory that consider queue lengths and traffic light times as utility functions both in fixed-time and adaptive traffic control schemes, allows for a more realistic representation of traffic behavior, which, to the best of our knowledge, has not been addressed in the literature.
Second, beyond traditional metrics like delay, we comprehensively analyze a range of performance parameters that are often overlooked in prior studies involving game theory and multi-agent algorithms, such as the number of stops, the total travel time, average speed, total distance traveled, and average stop time per vehicle. By incorporating these metrics, we provide a more holistic evaluation of traffic control strategies, capturing their impact on efficiency.
Third, we implement and compare multiple traffic control strategies based on population dynamics and payoff functions that consider queue lengths and green times of network links. These strategies include the following:
–
Fictitious play, implemented from a fixed-time perspective due to its offline nature, and
–
Adaptive strategies such as Smith, Replicator, Logit, and Brown–Von Neumann–Nash (BNN), which leverage their ability to be continuously updated in real time.

This approach allows us to contrast the performance of offline and adaptive control methods, highlighting their respective strengths and limitations in managing traffic flow under varying conditions.

The remainder of this paper is structured as follows. First, in Section 3, we provide a mathematical description of a vehicular traffic network using graph theory, modeling intersections as nodes and streets as links. In Section 3.1, we describe the mathematical details of the continuous traffic model implemented in the proposed traffic simulator. In Section 4, we show the different population dynamics implemented within the traffic simulator, along with the fitness function definitions of each case, which were obtained considering network parameters such as the queue length and number of vehicles in links. In Section 6, we test the game theory-based model and the traffic software simulator in a nine-intersection network in order to evaluate traffic parameters such as number of stops (NS), total travel time (TTT), total distance traveled (TDT), average speed (AS), total stop time (TST) and average stop time per vehicle (ASTV). Finally, we evaluate the performance of each game-theory based control strategy in Section 7.

2. Notations

In this section, we describe the main notations used in the mathematical model description of Section 3.

$G = (V, E, ψ)$ : Directed graph representing a vehicular traffic network.
$V = \{1, 2, 3, \dots, m\}$ : Set of network nodes. They represent the traffic network intersections.
$E = \{1, 2, \dots, n\}$ : Set of network links. They represent the links between traffic network intersections (streets).
$(p, r)$ : Input and output nodes to link $i \in E$ , respectively.
$ψ (i)$ : Incidence function that matches each edge $i \in E$ to an ordered pair $(p, r) \in V$ .
$I_{p}, O_{p}$ : Sets of input and output links from node p, respectively.
$L_{p}$ : Idle time for node p. Includes red–yellow transition and yellow light time.
$g_{i}$ : Green time for link $i \in E$ .
$C_{p}$ : Cycle time for node p. Includes the idle and green times.
$o_{i}$ : Output flow capacity of link $i \in E$ , given in $[v e h / h]$ .
$w_{i}$ : Storing capacity of vehicles in link $i \in E$ .
$τ (i, j)$ : Flow rate of vehicles from link i to j, $τ (i, j) \in {0, 1}$ .
$s_{i}$ : Traffic light state of link $i \in E$ , $s_{i} \in {0, 1}$ .
$σ$ : Network mode given by the traffic light states.
$s_{i} (σ)$ : Traffic light state in a link i and mode $σ$ , $s_{i} \in {0, 1}$ .
$l_{i}$ : Queue length. Number of vehicles in a given link.
$q_{i}$ : Queue input flow, given in $[v e h / h]$ .
$h_{i}$ : Queue output flow, given in $[v e h / h]$ .
$F_{p}$ : Set of stages for node p. One stage is the time at which an input link to a node p has the right of way (ROW).

3. Mathematical Description of a Urban Traffic Network

According to [28], a vehicular traffic network can be represented as a directed graph

G = (V, E, ψ)

, where

V = \{1, 2, 3, \dots, m\}

and

E = \{1, 2, \dots, n\}

are the sets of m nodes and n edges of the network, respectively.

ψ

is the incidence function that matches each edge

i \in E

to an ordered pair

(p, r) \in V

,

ψ (i) = \{(p, r) : p, r \in V\} .

(1)

Each network node p has a set

I_{p} \subset E

of input edges given by

I_{p} = \{i : i \in E, a n d (l, p) \in \leftarrow_{G} (E)\}

(2)

with

(l, p)

as the input link to p from node l.

The set

O_{p} \subset E

of output edges for node p is given by

O_{p} = \{i : i \in E, a n d (p, r) \in \leftarrow_{G} (E)\}

(3)

with

(p, r)

as the output link from node p to node r.

To illustrate this, consider the vehicular traffic network shown in Figure 1a, whose graph representation is shown in Figure 1b, and traffic light behavior described in Figure 1c, where (1,2) represents the link between streets 1 and 2, whereas (3,2) represents the link between streets 3 and 2. The traffic light scheme shows two idle periods in (1,2), one that represents the red–yellow transition, and one that represents the yellow light after the green one. The total idle time is denoted as

L_{p}

. During the period composed of the idle and green light times in (1,2), the traffic light in (3,2) must be in red. The same scheme applies for (3,2) in the second half of the cycle time

C_{p}

. Considering the above definitions, the following constraint applies to node p:

\sum_{i \in I_{p}} g_{i} + L_{p} = C_{p},

(4)

where

g_{i}

is the green light time for link i.

One stage is defined as the time at which an input link to a node p has the right of way (ROW), whereas the set of stages for a node p is denoted as

F_{p}

. The saturation flow

o_{i}

, with

i \in I_{p}

, represents the output vehicular flow capacity for the link i while the traffic light is green. The set of values

o = {[o_{1}, \dots, o_{n}]}^{T}

is normally known as

\forall i \in E

. The turn rate on each node, denoted by

τ_{(i, j)} \in [0, 1]

, with

i \in I_{p}

and

j \in O_{p}

, represents the distribution of vehicles turning from link i to j. In other words,

τ_{(i, j)}

is the turn rate from link i to j, with

ψ (i) = (p, r)

and

ψ (j) = (l, p)

.

The traffic light state in a link

i \in I_{p}

, denoted as

s_{i} \in \{0, 1\}

, can be equal to 1 when the traffic light is green (ROW) and 0 in a different case (yellow or red). In general, the set of network states is

s = {[s_{1}, \dots, s_{n}]}^{T}

,

\forall i \in E

.

The expressions

c_{i}

and

w_{i}

, with

i \in I_{p}

, represent the external vehicular input flow and the vehicular storing capacity of i, respectively. In general terms,

c = {[c_{i}, \dots, c_{n}]}^{T}

,

Ω = [w_{i}, \dots, w_{n}]

, and

\forall i \in E

represent the vehicular input flows and storing capacity of all the network links.

Network Restrictions. The geometry restriction on each link is given by

0 \leq l_{i} \leq {\bar{l}}_{i}

, and the control input restriction is represented as

{\underset{̲}{g}}_{i} \leq g_{i} \leq {\bar{g}}_{i}

, where

l_{i}

and

{\bar{l}}_{i}

are the current and average queue lengths of link i, respectively, and

{\underset{̲}{g}}_{i}

is the minimum value in seconds for the green light, which depends on the system properties (e.g., it can be equal to zero when there are no vehicles in link i). Finally, the term

{\bar{g}}_{i}

represents the addition of all green times in intersection p and is given by

{\bar{g}}_{i} = \sum_{i \in I_{p}} g_{i} \leq C_{p} - L_{p}

. Observe that this value must not exceed the difference between the cycle time and the total idle time.

3.1. Continuous Modeling of a Urban Traffic Network

The vehicular storing change in a link

i \in E

can be modeled as a mass balance, given by

w_{i} {\dot{l}}_{i} = c_{i} + q_{i} - h_{i}, w i t h I_{p} \neq \{ϕ\},

(5)

where

h_{i}

and

{\dot{l}}_{i}

represent the queue output flow and queue length changes, respectively. The input flow change

q_{i}

is given by

\begin{matrix} q_{i} = \sum_{\begin{matrix} j \in I_{p} \end{matrix}} τ_{(j, i)} h_{j}, \end{matrix}

(6)

with

τ_{(j, i)}

being the turn rate from the input link

j \in I_{p}

to node

p \in V

direct towards the output link

i \in O_{p}

. Using (6) and (5), we have

\begin{matrix} w_{i} {\dot{l}}_{i} = c_{i} + \sum_{j \in I_{p}} τ_{(j, i)} h_{j} - h_{i} . \end{matrix}

(7)

The output traffic flow change of link i is modeled as

\begin{matrix} \begin{matrix} {\dot{h}}_{i} & = & o_{i} - h_{i} & i f s_{i} & = & 1, \\ h_{i} & = & 0 & i f s_{i} & = & 0, \end{matrix} \end{matrix}

(8)

i.e., the traffic flow change is given by the difference between the output vehicular flow capacity for link i while the traffic light is green (saturation flow

o_{i}

), and the queue output flow change is

h_{i}

.

Additionally, the green time variation

{\dot{g}}_{i}

is expressed as

\begin{matrix} \begin{matrix} {\dot{g}}_{i} & = & 1 & i f s_{i} & = & 1, \\ g_{i} & = & 0 & i f s_{i} & = & 0, \end{matrix} \end{matrix}

(9)

which is constant when the traffic light state

s_{i}

is equal to 1 (green time), since the green time

g_{i}

increases linearly from its minimum value

{\underset{̲}{g}}_{i}

to its maximum value

{\bar{g}}_{i}

. The second term in (9) indicates that there is no green time variation if the traffic light state

s_{i}

is equal to 0 (yellow or red).

Finally, considering (7)–(9) and a

n = |E|

link network, we can represent the system as the differential-algebraic system of equations (DAE)

E_{σ} \dot{ξ} = A_{σ} ξ + B_{σ}

, with the discontinuous coefficients matrix form

\begin{matrix} [\begin{matrix} E_{Ω} & 0 & 0 \\ 0 & s_{σ} & 0 \\ 0 & 0 & s_{σ} \end{matrix}] [\begin{matrix} \dot{x} \\ \dot{h} \\ \dot{g} \end{matrix}] & = \\ [\begin{matrix} 0 & A_{τ} & 0 \\ 0 & {\hat{s}}_{σ} & 0 \\ 0 & 0 & {\bar{s}}_{σ} \end{matrix}] [\begin{matrix} x \\ h \\ g \end{matrix}] & + [\begin{matrix} c \\ s_{σ} o \\ s_{σ}^{T} \end{matrix}] \end{matrix},

(10)

where

s_{σ} = [s_{1}^{*}, \dots, s_{n}^{*}]

is the network configuration in a given mode

σ

.

E_{Ω} \in R^{n \times n}

is a diagonal matrix whose elements are the

E_{Ω}^{i} = w_{i} \in Ω

.

{\bar{s}}_{i}^{*} = 1 - s_{i}^{*}

, and

{\bar{s}}_{σ} = d i a g ([{\bar{s}}_{1}^{*}, \dots, {\bar{s}}_{n}^{*}])

.

{\hat{s}}_{i}^{*} = - 2 s_{i}^{*} + 1

, and

{\hat{s}}_{σ}^{*} = d i a g ([{\hat{s}}_{1}^{*}, \dots, {\hat{s}}_{n}^{*}])

. The matrix

A_{τ} \in R^{n \times n}

is built by the proper selection of the turn rates.

Rewriting (10) as an homogeneous system, we have

E_{σ} \dot{ξ} = A_{σ} ξ,

(11)

whose matrix form is

\begin{matrix} [\begin{matrix} E_{Ω} & 0 & 0 & 0 & 0 & 0 \\ 0 & s_{σ} & 0 & 0 & 0 & 0 \\ 0 & 0 & s_{σ} & 0 & 0 & 0 \\ 0 & 0 & 0 & I_{n} & 0 & 0 \\ 0 & 0 & 0 & 0 & I_{n} & 0 \\ 0 & 0 & 0 & 0 & 0 & I_{n} \end{matrix}] [\begin{matrix} \dot{l} \\ \dot{h} \\ \dot{g} \\ \dot{s} \\ \dot{c} \\ \dot{o} \end{matrix}] & = [\begin{matrix} 0 & A_{τ} & 0 & 0 & I_{n} & 0 \\ 0 & {\hat{s}}_{σ} & 0 & 0 & 0 & s_{σ} I_{n} \\ 0 & 0 & {\bar{s}}_{σ} & I_{n} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix}] [\begin{matrix} l \\ h \\ g \\ s \\ c \\ o \end{matrix}] . \end{matrix}

3.2. Hybrid Urban Traffic System

The urban traffic network integrates the continuous scheme described in Section 3.1 with the discrete behavior resulting from transitions between different network modes. This integration allows us to model the network as a hybrid system. To illustrate this, consider the network depicted in Figure 1a and its representation by the automaton in Figure 2. In this case, we have four traffic lights, each one with two possible states

{0, 1}

, i.e., we have

2^{4} = 16

possible network configurations. However, since

s_{1} = \sim s_{2}

and

s_{3} = \sim s_{4}

, this number can be reduced to

2^{2} = 4

, which is the number of ovals in the automaton. The set of modes is denoted as

Σ = {σ_{1}, σ_{2}, σ_{3}, σ_{4}}

. The transitions between states are governed by a threshold condition based on the difference between the green time of individual links (

g_{i}

) and the total green time at each node (

{\bar{g}}_{i}

). For instance, the transition from mode

σ_{2} = [1, 0, 0, 1]

to

σ_{1} = [1, 0, 1, 0]

is triggered when

g_{4} \geq {\bar{g}}_{4}

, i.e., when the green time in link 4 is greater or equal than the sum of green times of links 3 and 4 (

{\bar{g}}_{4} = g_{4} + g_{3}

). In this sense, the switching scheme between modes is dynamic and depends on the variation of green times of links, which changes according to the fitness function used in the control strategies described in Section 4.

In general, the state vector for this system is

z = (ξ, σ) \in R^{n} \times Σ,

where

Σ

is a finite discrete set of modes. The DAE hybrid system representation is

H_{D A E} \{\begin{matrix} [\begin{matrix} E_{σ} & 0 \\ 0 & 1 \end{matrix}] & [\begin{matrix} \dot{ξ} \\ \dot{σ} \end{matrix}] = [\begin{matrix} f_{σ} (ξ) \\ 0 \end{matrix}] = F (z), z \in C \\ [\begin{matrix} ξ^{+} \\ σ^{+} \end{matrix}] \in [\begin{matrix} {\tilde{g}}_{σ} (ξ) \\ φ_{σ} (ξ) \end{matrix}] = G (z), z \in D \end{matrix},

where

\begin{matrix} f_{σ} (ξ) & = & A_{σ} ξ \\ C & = & ⋃_{σ \in \sum} (C_{σ} \cap O_{σ}) \\ {\tilde{g}}_{σ} (ξ) & = & g_{D} (z) \cup g_{O} (z) \\ D & = & ⋃_{σ \in \sum} ((D_{σ} \cap O_{σ}) \cup ((R^{n} \times \{σ\}) ∖ O_{σ})) \end{matrix}

and

\begin{matrix} g_{D} (z) & = & \{\begin{matrix} \prod_{φ σ (ξ)} g_{σ} (ξ) & i f z \in D_{σ} \cap O_{σ} \\ 0 & o t h e r w i s e \end{matrix} \end{matrix}

\begin{matrix} g_{O} (z) & = & \{\begin{matrix} \prod_{φ σ (ξ)} ξ & i f z \in (R^{n} \times \{σ\}) ∖ O_{σ} \\ 0 & o t h e r w i s e \end{matrix} \end{matrix}

O_{σ}

and

\prod_{σ}

represent the consistency sets and the projectors, respectively [29].

C_{σ} = \{z : g_{i} \in [0, {\bar{g}}_{i}], s = [s_{1}, \dots, s_{n}], σ \in \sum\}

, and

D_{σ} = \{z : g_{i} \geq {\bar{g}}_{i}, s = [s_{1}, \dots, s_{n}], σ \in \sum\}

are subsets

\in R^{n}

that define the system evolution according to F and G. In the transitions between modes, the changes of

ξ

are defined by the map

g_{σ} (ξ) = {[l, h, y_{g} (g), y_{s} (g, σ), c, o]}^{T}

, where

y_{g} (g) = \begin{matrix} \{{[g_{1}, \dots, g_{n}]}^{T} : g_{i} \{\begin{matrix} g_{i} & i f & g_{i} & \leq & {\bar{g}}_{i} \\ 0 & i f & g_{i} & \geq & {\bar{g}}_{i} \\ g_{i}, 0 & i f & g_{i} & = & {\bar{g}}_{i} \end{matrix}\} \end{matrix}

and

y_{s} (g, σ) = \{{[s_{1}, \dots, s_{n}]}^{T} \begin{matrix} : & g_{i} \geq {\bar{g}}_{i}, \end{matrix} σ \in \sum\} .

Additionally, the changes of modes in the network are determined by

φ_{σ} (ξ) = \{\begin{matrix} σ : & g_{i} \geq {\bar{g}}_{i}, σ \in \sum \end{matrix}\} .

4. Population Game-Based Model for Urban Traffic Control

In the urban traffic system, each node

p \in V : I_{p} \neq ϕ

is represented as a population of agents that share the effective green time

m = \sum_{i \in I_{p}} g_{i}

. In this sense, agents form a mass m distributed between the set of pure strategies

S = \{1, \dots, n\}

of the population. The set S, corresponds to the set of stages

F_{p}

described in Section 3, i.e., the proportion of m that the input links of node p have for the right of way (ROW). Thus, each link

j \in I_{p}

is assigned a proportion

x_{j}

of m, which is a pure strategy of the population.

The whole behavior of the agents defines the population state x, which belongs to the simplex

Δ = {x \in R_{+}^{n} : \sum_{j \in S} x_{j} = m} .

(12)

The set of agents assuming a strategy

j \in S

allows us to obtain a payoff provided by the fitness function

F_{j} (x) : Δ \to R

, in a determined population state x. The average fitness function is defined as

\bar{F} (x) = \frac{1}{m} \sum_{j \in S} x_{j} F_{j} (x)

, and the excess payoff of j is

{\hat{F}}_{j} (x) = F_{j} (x) - \bar{F} (x)

.

In this approach, the fitness function for a given link j is the relation between its queue length and its assigned proportion of m. This lead us to Definition 1.

Definition 1.

The fitness function for the adaptive control strategies based in population games depends on the relation between the link queue capacity and its green time assignment. This is given by

F_{j} (x) = \frac{l_{j}}{x_{j}},

(13)

which must be minimized.

4.1. Mean Dynamics and Revision Protocols

In mean dynamics, each of the n agents receive equally likely revision opportunities at rate R, which means that agents playing strategy j have an expected number of opportunities given by

n x_{j} R d t

in a time space

d t

to observe their opponents behavior in order to make decisions. On the other hand, the expected number of opportunities that the agents have to change from strategy j to k is

n x_{j} ρ_{j k} (x) d t

, where

ρ_{j k}

represents the conditional switch rate to change from j to k. In this way, the expected change of agents using strategy j is given by

(\sum_{k \in S} x_{k} ρ_{k j} - x_{j} \sum_{k \in S} ρ_{j k}) d t,

(14)

from which we define an expected-value-based dynamic known as the mean dynamic, given by

{\dot{x}}_{j} = \sum_{k \in S} x_{k} ρ_{k j} - x_{j} \sum_{k \in S} ρ_{j k} .

(15)

The above expression determines how the proportion of agents playing strategy j varies [30].

The switch change

ρ_{j k}

also defines the revision protocols shown in Table 1. These revision protocols determine the population dynamic used by the agent population. We describe them in detail in Section 4.2, with a particular focus on the fictitious play case, since it uses historical data to define agent strategies, demonstrating its viability within a fixed-time traffic framework.

4.2. Population Dynamics

In this section, we describe the population dynamics employed in this approach to optimize the green times for each node in the traffic network. First, we describe the dynamics that, due to their online nature, are used for adaptive control strategies. These dynamics are replicator, Brown–Von Neuman–Nash, Logit and Smith, which use (13) as a fitness function. Subsequently, we describe fictitious play as a fixed-time control strategy, since it uses historical data for agent decision making. In this case, the payoff function is given by (22).

4.2.1. Replicator Dynamics

The replicator dynamic (RD) describes how the frequencies of different strategies change over time based on the fitness associated with each one, i.e., strategies having higher payoffs tend to increase in frequency within the population, while less successful strategies decrease. An agent using this type of dynamics chooses an opponent and applies the imitation protocol. The opponent’s strategy is followed by this when its utility is better than the current one. This dynamic is described as the difference between the fitness of the strategy j and the average fitness, as follows:

{\dot{x}}_{j} = x_{j} [F_{j} (x) - \bar{F} (x)] .

(16)

4.2.2. Brown–Von Neumann–Nash

The Brown–Von Neumann–Nash (BNN) dynamic relies on the excess payoff dynamic, since it compares the excess payoff of strategy j with the excess payoff caused by the switch from strategy j to k. It can be expressed as

{\dot{x}}_{j} = {[{\hat{F}}_{j} (x)]}_{+} - x_{j} \sum_{k \in S} {[{\hat{F}}_{k} (x)]}_{+} .

(17)

4.2.3. Logit Dynamic

The Logit dynamic adopts the Gibbs–Boltzmann form of (18), in which the rationality parameter

β

is included to increase the probability of strategies having better payoffs or to assign them the same probability to be chosen. In the first case, when

β \to \infty

, agents play following the best response rule, and in the second case, when

β \approx 0

, agents play under a uniform distribution scheme to select strategies.

\begin{matrix} {\dot{x}}_{j} = \frac{exp (β F_{j} (x))}{\sum_{j \in S} exp (β F_{j} (x))} - x_{j} \end{matrix}

(18)

Rationality

β

can be interpreted as the inverse of the noise

η

, and it is included in the Logit selection protocol shown in Table 1.

4.2.4. Smith Dynamic

In Smith dynamics, agents randomly select a strategy and evaluate its payoff against the current strategy. If the new payoff surpasses the current one, the agent switches to the new strategy with a probability proportional to the difference between the payoffs. Unlike the BNN dynamic, which carries out a comparison against the average payoff, or RD, which compares against an opponent strategy, the Smith dynamic involves a direct comparison between two strategies, which avoids some strategies being eventually discarded. This dynamic is described by

\begin{matrix} {\dot{x}}_{j} = \sum_{k \in S} x_{k} {[F_{j} (x) - F_{k} (x)]}_{+} - x_{j} \sum_{k \in S} {[F_{k} (x) - F_{j} (x)]}_{+} \end{matrix}

(19)

4.3. Fictitious Play

In fictitious play, a player i plays a pure best response (BR) action

a_{i}

based on the joint distribution of the selected actions of its opponents. This can be expressed by

a_{i}^{t} \in B R_{i} (p_{_i}^{t}),

(20)

where

a_{i}^{t}

represents the action performed by agent i at time t, and

p_{_i}^{t}

is the joint probability distribution of actions performed by all players except i. The marginal probability for each player i up to time t is calculated using the expression

p_{i}^{t} = \frac{γ_{i}^{t} (a_{i})}{t},

(21)

where

γ_{i}^{t} (a_{i}) = \sum_{τ = 0}^{t - 1} I {a_{i}^{τ} = a_{i}}

is an S-dimensional vector that counts the times that player i has played each action. According to the above expressions, it is noticeable that in fictitious play, each agent requires historical data in order to make decisions. Consequently, in our context, this control strategy is assumed to follow a fixed-time approach.

The payoff for an agent i is a function that depends on its current action and the action set

a_{_i}

played by the agents different to i. In our case, this payoff depends on the queue length of links converging to a node. This is presented in Definition 2.

Definition 2.

The payoff function in a fixed-time scheme modeled as a fictitious play based-control strategy depends on the accumulative queue length of the incidents links. This is given by

U (a_{i}, a_{_i}) = \overset{n}{\sum_{i = 1}} l_{i},

(22)

which must be minimized.

According to Definitions 1 and 2, fitness and payoff functions determine the control strategies’ dynamics in adaptive and fixed-time schemes, respectively. For the adaptive control case, given by the online population dynamics described in Section 4.2, the green time is continuously updated according to the fitness function variation, which impacts the transition between the modes of the hybrid system described in Section 3.2. In the fixed-time case, the payoff function depends exclusively on the queue lengths of incident links to a specific node, which means that the mode transitions are determined by the current network configuration and the modes that have provided the best queue length reduction historically.

Both functions are strongly related with the traffic network dynamics for which they were taught, i.e., to associate good fitness or payoffs in links with low queue lengths and acceptable green times. The results of the dynamic changes in the functions’ values are described through the performance parameters for different scenarios, as shown in Section 7.

5. Traffic Simulator

The implemented traffic simulator is shown in Figure 3. This integrates, through a COM server, the VISSIM platform, in which the traffic network is implemented, and MATLAB, where the control strategies described in Section 4.1 and the hybrid model described in Section 3.2 are implemented.

The following are the main steps in the information flow between both simulator sections:

First of all, the vehicular traffic network is implemented in VISSIM. Subsequently, MATLAB obtains all its properties, such as number of controllers, traffic lights, queue counters, number of links, cycle time ( $C_{p}$ ), duration of yellow and red lights (intermediate times), and simulation time, among others.
The runtime environment, which is implemented in an object-oriented scheme, allows for the creation of class instances that represent the traffic network (implemented in VISSIM) and the virtual traffic controller associated with each node that is modeled by means of the hybrid dynamics described in Section 3.2.
The initial green times are loaded through class instances that represent the control strategies described in Section 4.1. These times, along with the cycle time ( $C_{p}$ ) and the intermediate times (red and yellow), are used to generate the signaling plan for each controller, which is sent to VISSIM.
The update of the signaling plan, as well as the reading of the queue counters, are made in sample intervals $δ$ . After this time, the queue lengths calculated in VISSIM are sent to the control strategy instance in order to recalculate the green times. This process is repeated until the simulation time expires.
Finally, the communication between both modules ends and the information related to performance parameters is stored in VISSIM.

VISSIM includes a COM programming module that allows for an interaction with external programming languages and platforms such as MATLAB in a stable form. In our case, the traffic simulator architecture has an object-oriented scheme implemented in MATLAB, whose main classes are the following:

COMInterfaceVISSIM: Builds the VISSIM and MATLAB COM connection. It also contains methods to send information to VISSIM (cycle time, simulation time, performance parameters to be measured, etc).
HybridController: Each instance of this class creates a “virtual controller” that interprets the signal plans and sends the instructions that the VISSIM controllers must follow.
ControllerUserDefined: Used to define the control strategy to be used. It also generates the green times for each virtual controller generated.
SignalPlan: Creates the signal plans from the information provided in the graphic interface and the green times given by ControllerUserDefined.
MainSimulation: Main class in which the instances of the previously described classes are created.

6. Case Study

Figure 4 shows the geometry and movement scheme for the traffic network used as a case study, which is located in the urban zone of Barranquilla city in Colombia (between 82 and 84 streets, and Kra. 50 to Kra. 52). The current traffic plans, which are based on a fixed-time strategy that considers only the queue length and historical data, were provided by the city traffic controller company (IMATIC). The results shown in Section 7, compare its performance with the proposed game theory-based controllers for multiple network parameters. In particular, this network has a reduced space between nodes, which avoids a large amount of vehicles within links. The number of input links for each node is 2, which is associated with the number of strategies that each agent can assume in the current node. In general, 16 links are modeled in this approach. Since the network is continuously changing. Moreover, with the aim to test the different control strategies provided by the population dynamics described in Section 4.2, we propose the five scenarios described in Table 2.

7. Discussion

The implemented control strategies based on the population dynamics described in Section 4.1 were used on each one of the scenarios described in Table 2, during a simulation time of

3600 s

, with sample intervals

δ = 5 s

, cycle time

C_{p} = 60 s

and idle time

L_{p} = 10 s

.

The performance of each control strategy is evaluated according to the parameters shown in Table 3 and compared in Table 4, with the current fixed-time strategy used by the traffic controller company of Barranquilla city. Observe how most of the proposed control strategies outperform the current one, achieving reductions in the number of stops (NS) up to 90% and increasing the average speed (AS) up to 79% and 80% in scenario 4 with RD and Smith dynamics, respectively. There is not a notable improvement for scenario 1 due to the low and constant flow of vehicles. However, it is noticeable that under saturation and variable conditions, the proposed strategies enhance the results for the evaluated parameters, except the total distance traveled (TDT), which has increased up to 17% in scenario 4 in almost all the proposed strategies. This can be attributed to the change of routes inherent to the dynamics of the strategies.

Another important comparison provided by Table 4 encompasses game theory-based control strategies. In general, both Smith and replicator dynamics demonstrate superior performance in saturated and non-saturated scenarios due to their decentralized mechanisms. In replicator dynamics (RD), agents adjust their strategies based on pairwise fitness comparisons, whereas in Smith dynamics, they compare the expected payoffs of different strategies. This localized decision-making approach eliminates the need for complete information—unlike BNN or Logit dynamics—enhancing adaptability to unpredictable environments and, consequently, traffic volatility.

Scenario 1.

In this case, the control strategy with better performance was Smith dynamics. As shown in Table 4, all parameters show the most favorable values in comparison with the other controllers. On the other hand, the fixed-time control strategy, fictitious play, demonstrates that is also a good option, with values that do not deviate significantly from the Smith case; Logit (

η = 1

) yields the least favorable outcomes.

Scenario 2.

In this scenario, all the control strategies demonstrate a notable improvement in comparison with scenario 1, especially Smith and replicator dynamics. However, there are some parameters in which some control strategies exhibit better behaviour, since Smith dynamics surpasses replicator dynamics in ASTV and TST, whereas the replicator dynamic outperforms the Smith dynamic in TTT.

Scenario 3.

In this case, the fixed-time strategy, fictitious play, yields superior outcomes compared to the adaptive strategies, replicator and Smith, in certain parameters such as TTT and AS. However, in terms of TST and ASTV, fictitious play is surpassed by them. Once more, the Smith dynamic and replicator demonstrate superior performance, with the replicator dynamic outperforming it in NS and TTT, while Smith dynamics slightly surpass replicator dynamics in TST.

Scenario 4.

Saturated network conditions are maintained but this time with a reduction in flows in relation to scenario 3. The adaptive strategy, Smith, exhibits the best performance among all strategies, followed by replicator dynamics. The fixed-time strategy based on fictitious play presents better results compared to the BNN and Logit cases.

Scenario 5.

This last scenario pretends to test the adaptive strategies’ performance. Again, the best results are obtained for Smith and replicator dynamics, in that order. Since this scenario is completely variable, the fixed-time strategy, fictitious play, exhibits a performance reduction in comparison with BNN and Logit, in relation to scenario 4.

8. Conclusions

In this work, we implemented a traffic simulator that combines the micro-simulation platforms VISSIM and MATLAB in order to include population game-based control strategies to define green times in an eight-intersection traffic network. Due to its determinant role in network congestion, the queue length of links is an important parameter for fitness and payoff functions’ definitions, both in the fixed case, which is modeled using fictitious play dynamics, and for the adaptive case, modeled using the control strategies based on RD, BNN, Logit and Smith dynamics. Within the traffic simulator, we included a hybrid system to model both the continuous and discrete behavior of the network in order to involve both the vehicular storing change in links and the transitions between modes caused by the traffic light changes in nodes. The results, obtained in five different traffic scenarios, demonstrated acceptable performance for the fixed-time scheme modeled with fictitious play in a traffic network without saturation conditions. In conditions considering saturation and variability in traffic flows, fictitious play exhibited good performance in many of the analyzed parameters, specifically in NS, TTT, AS, TST and ASTV, in which it outperformed BNN and Logit dynamics. On the other hand, Smith dynamics emerged as the most effective control strategy across all studied scenarios, consistently outperforming the others, particularly in situations where increased congestion led to a decline in fitness values. Replicator dynamics also demonstrated strong performance, though slightly behind Smith dynamics. This superior performance can be attributed to the decentralized nature of Smith dynamics and RD. In the case of RD, agents adjust their strategies based on pairwise comparisons of fitness values, while in Smith dynamics, agents compare the expected payoffs of different strategies. This localized decision-making process enhances their adaptability in unpredictable environments, such as traffic conditions, as they do not rely on a complete information scheme, which is a requirement that other dynamics, such as BNN or Logit, depend on.

The proposed model, unlike other related works, considers five distinct levels of saturation or traffic characteristics, ranging from constant flows to variable flows in both vertical and horizontal links. Under these scenarios, the performance of various control strategies based on game theory was evaluated across multiple parameters, including the number of stops (NS), total travel time (TTT), total distance traveled (TDT), average speed (AS), total stop time (TST), and average stop time per vehicle (ASTV). Although the density of traffic flows can be adjusted to further validate the results, this study demonstrates that certain control strategies outperform others across most performance metrics, particularly under high saturation levels. The findings provide clear evidence of which game theory-based control strategies are best-suited for fixed or adaptive schemes. Furthermore, the traffic network is modeled as a hybrid system, a novel approach not explored in previous studies. This hybrid modeling provides a more realistic representation of traffic behavior, capturing both the continuous and discrete dynamics inherent in real-world traffic systems.

Limitations and Future Directions

Although VISSIM and MATLAB are widely used for traffic analysis, the proposed combination operates in a simulated environment that accounts for various congestion conditions and performance metrics. However, it does not incorporate real-world data, which would enhance the accuracy and practical applicability of the results. Therefore, future research should focus on implementing this approach using actual traffic datasets and more complex traffic networks. Additionally, it is important to consider how the proposed control strategies could improve the performance in terms of the TDT parameter, which was the only one presenting negative behavior in saturated scenarios.

Author Contributions

Formal analysis, S.P.A., P.A.Ñ. and D.A.M.V.; investigation, S.P.A., P.A.Ñ. and D.A.M.V.; methodology, S.P.A. and P.A.Ñ.; resources, D.A.M.V., J.M.C.C. and A.M.R.; software, P.A.Ñ.; validation, D.A.M.V.; writing—original draft preparation, S.P.A. and D.A.M.V.; writing—review and editing, D.A.M.V., J.M.C.C. and A.M.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

INRIX Research Global Traffic Scorecard. 2024. Available online: https://inrix.com/scorecard/ (accessed on 23 April 2024).
TomTom International BV. TomTom Traffic Index. 2024. Available online: www.tomtom.com/traffic-index (accessed on 23 April 2024).
An, S.H.; Lee, B.H.; Shin, D.R. A Survey of Intelligent Transportation Systems. In Proceedings of the 2011 Third International Conference on Computational Intelligence, Communication Systems and Networks, Bali, Indonesia, 26–28 July 2011. [Google Scholar]
Qadri, S.; Gökçe, M.; Öner, E. State-of-art review of traffic signal control methods: Challenges and opportunities. Eur. Transp. Res. Rev. 2020, 12, 55. [Google Scholar] [CrossRef]
Miller, A.J. Settings for Fixed-Cycle Traffic Signals. OR 1963, 14, 373–386. [Google Scholar] [CrossRef]
Chen, W.; Hu, L.; Huang, K.; Zhang, H.; Chen, E.; Shao, Y.; Ye, Z. Exploring Optimal Signal Plans for Isolated Signalized Intersections with Central Pedestrian Refuges. J. Transp. Eng. Part A Syst. 2024, 150, 04024018. [Google Scholar] [CrossRef]
Zhang, J.; Li, H.; Ma, Y.; Zhang, C.; Qin, L.; Chen, N. Modeling and optimization of platooning behaviors in fixed-time signalized intersection entrance areas. Simul. Model. Pract. Theory 2024, 132, 102900. [Google Scholar] [CrossRef]
Duraku, R.; Boshnjaku, D. Enhancing Traffic Sustainability: An Analysis of Isolation Intersection Effectiveness through Fixed Time and Logic Control Design Using VisVAP Algorithm. Sustainability 2024, 16, 2930. [Google Scholar] [CrossRef]
Chiou, S.W. A Cooperative Agent-Based Traffic Signal Control for Vehicular Networks Under Stochastic Flow. IEEE Trans. Veh. Technol. 2023, 72, 12592–12601. [Google Scholar] [CrossRef]
Wang, X.; Ke, L.; Qiao, Z.; Chai, X. Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning. IEEE Trans. Cybern. 2021, 51, 174–187. [Google Scholar] [CrossRef]
Dong, H.; Dai, Z. A multi intersections signal coordinate control method based on game theory. In Proceedings of the 2011 International Conference on Electronics, Communications and Control (ICECC), Ningbo, China, 9–11 September 2011; pp. 1232–1235. [Google Scholar] [CrossRef]
Villalobos, I.A.; Poznyak, A.S.; Tamayo, A.M. Urban Traffic Control Problem: A Game Theory Approach. In Proceedings of the 17th World Congress The International Federation of Automatic Control, Seoul, Republic of Korea, 6–11 July 2008; Volume 41, pp. 7154–7159. [Google Scholar] [CrossRef]
Elhenawy, M.; Elbery, A.A.; Hassan, A.A.; Rakha, H.A. An Intersection Game-Theory-Based Traffic Control Algorithm in a Connected Vehicle Environment. In Proceedings of the 2015 IEEE 18th International Conference on Intelligent Transportation Systems, Gran Canaria, Spain, 15–18 September 2015; pp. 343–347. [Google Scholar] [CrossRef]
Alvarez, I.; Poznyak, A. Game theory applied to urban traffic control problem. In Proceedings of the ICCAS 2010, Gyeonggi-do, Republic of Korea, 27–30 October 2010; pp. 2164–2169. [Google Scholar] [CrossRef]
Guo, J.; Harmati, I. Lane-changing decision modelling in congested traffic with a game theory-based decomposition algorithm. Eng. Appl. Artif. Intell. 2022, 107, 104530. [Google Scholar] [CrossRef]
Wang, A.; Zhang, K.; Li, M.; Shao, J.; Li, S. Game Theory-Based Signal Control Considering Both Pedestrians and Vehicles in Connected Environment. Sensors 2023, 23, 9438. [Google Scholar] [CrossRef]
Ahmad, F.; Almarri, O.; Shah, Z.; Al-Fagih, L. Game theory applications in traffic management: A review of authority-based travel modelling. Travel Behav. Soc. 2023, 32, 100585. [Google Scholar] [CrossRef]
Chandra, R.; Manocha, D. GamePlan: Game-Theoretic Multi-Agent Planning With Human Drivers at Intersections, Roundabouts, and Merging. IEEE Robot. Autom. Lett. 2022, 7, 2676–2683. [Google Scholar] [CrossRef]
Liu, T.; Cui, L.; Pang, B.; Jiang, Z.P. A Unified Framework for Data-Driven Optimal Control of Connected Vehicles in Mixed Traffic. IEEE Trans. Intell. Veh. 2023, 8, 4131–4145. [Google Scholar] [CrossRef]
Arevalo-Castiblanco, M.F.; Pachon, J.; Tellez-Castro, D.; Mojica-Nava, E. Cooperative Cruise Control for Intelligent Connected Vehicles: A Bargaining Game Approach. Sustainability 2023, 15, 11898. [Google Scholar] [CrossRef]
Bastianello, N.; Badia, L. Decentralized Intersection Control Using Bayesian Game Theory. In Proceedings of the 2022 2nd International Seminar on Machine Learning, Optimization, and Data Science (ISMODE), Virtual, 22–23 December 2022; pp. 442–447. [Google Scholar] [CrossRef]
Khan, Z.; Koubaa, A.; Benjdira, B.; Boulila, W. A game theory approach for smart traffic management. Comput. Electr. Eng. 2023, 110, 108825. [Google Scholar] [CrossRef]
Cortés-Berrueco, L.E.; Gershenson, C.; Stephens, C.R. Traffic Games: Modeling Freeway Traffic with Game Theory. PLoS ONE 2016, 11, e0165381. [Google Scholar] [CrossRef] [PubMed]
Liu, Y.; Zong, C.; Han, X.; Zhang, D.; Zheng, H.; Shi, C. Spacing Allocation Method for Vehicular Platoon: A Cooperative Game Theory Approach. Appl. Sci. 2020, 10, 5589. [Google Scholar] [CrossRef]
Huang, K.; Chen, X.; Di, X.; Du, Q. Dynamic driving and routing games for autonomous vehicles on networks: A mean field game approach. Transp. Res. Part C Emerg. Technol. 2021, 128, 103189. [Google Scholar] [CrossRef]
Abdelghaffar, H.M.; Rakha, H.A. A Novel Decentralized Game-Theoretic Adaptive Traffic Signal Controller: Large-Scale Testing. Sensors 2019, 19, 2282. [Google Scholar] [CrossRef]
Abdoos, M. A Cooperative Multiagent System for Traffic Signal Control Using Game Theory and Reinforcement Learning. IEEE Intell. Transp. Syst. Mag. 2020, 13, 6–16. [Google Scholar] [CrossRef]
Bondy, J.A.; Murty, U.S. Graph Theory with Applications; Noth-Holland: Amsterdam, The Netherlands, 1982. [Google Scholar]
Ñañez, P.; Sanfelice, R.G.; Quijano, N. On an invariance principle for differential-algebraic equations with jumps and its application to switched differential-algebraic equations. Math. Control. Signals Syst. 2017, 29, 5. [Google Scholar] [CrossRef]
Sandholm, W.H. Chapter 13—Population Games and Deterministic Evolutionary Dynamics. In Handbook of Game Theory with Economic Applications; Elsevier: Amsterdam, The Netherlands, 2015; Volume 4, pp. 703–778. [Google Scholar] [CrossRef]
Susilo, B.H.; Solihin, Y. Modification of Saturation Flow Formula by Width of Road Approach. Procedia-Soc. Behav. Sci. 2011, 16, 620–629. [Google Scholar] [CrossRef]
Alshabibi, N.M. Comparing the Saturation Flow Rate on the Exit Lane Between Urban Multilane Roundabouts and Urban Signalized Intersections Through Field Data. Infrastructures 2025, 10, 15. [Google Scholar] [CrossRef]
Fornalchyk, Y.; Mohyla, I.; Hilevych, V. The saturation flow volume as a function of the intersection passing speed. Transp. Probl. 2013, 8, 43–51. [Google Scholar]

Figure 1. Urban traffic network. (a) A traffic network case including intersections and traffic lights. (b) Urban traffic network graph representation. (c) Traffic lights’ plan for node 2.

Figure 2. Automaton for urban traffic network in Figure 1a.

Figure 3. General structure of the implemented simulator.

Figure 4. Traffic network with eight intersections.

Table 1. Revision protocols and switch rates.

Revision Protocol	Function
Pairwise proportional imitation	$ρ_{j k} = x_{k} {[F_{k} - F_{j}]}_{+}$
Pairwise comparison	$ρ_{j k} = {[F_{k} - F_{j}]}_{+}$
Logit selection	$ρ_{j k} = \frac{exp (η^{- 1} F_{k})}{\sum_{w \in S} exp (η^{- 1} F_{w})}$
Comparison with average utility	$ρ_{j k} = {[F_{k} - \sum_{w \in S} x_{w} F_{w}]}_{+}$

Table 2. Simulation scenarios.

Scenario	Name	Description
1	No saturation condition 1	The input flow of vehicles is constant and equal to $1000 \frac{v e h}{h}$ during the simulation.
2	No saturation condition 2	Input flow increased to $1600 \frac{v e h}{h}$ during the simulation.
3 ¹	Saturation condition 1	Horizontal links (Streets 82 and 84) are saturated with input flows of $2400 \frac{v e h}{h}$ . Additionally, vertical links have flows of $1400 \frac{v e h}{h}$ . These values are constant during the simulation time.
4	Saturation condition 2	Vertical links (Kras. 50 to 52) are saturated with input flows of $2000 \frac{v e h}{h}$ . Additionally, streets (horizontal links) have flows of $1000 \frac{v e h}{h}$ . These values are constant during the simulation time.
5	Variable conditions	Flows for horizontal and vertical links change during the simulation time

¹ The saturation flow rate of 2400 vehicles/hour is a parameter that has been reported in multiple works related to traffic analysis, mostly using a passenger car as a standard unit [31,32,33].

Table 3. Performance parameters.

Parameter	Description
NS	Number of Stops
TTT	Total Travel Time
TDT	Total Distance Traveled
AS	Average Speed
TST	Total Stop Time
ASTV	Average Stop Time per Vehicle

Table 4. Performance parameters for control strategies.

Scenario	Controller	NS[veh]	TTT[h]	TDT[km]	AS[km/h]	TST[seg]	ASTV[h]
	FP	7917 $(- 6 %)$	183.1 $(- 3 %)$	8526.2 $(0.1 %)$	46.6 $(3.3 %)$	13.7 $(- 11 %)$	22.6 $(- 11 %)$
	RD	8257 $(- 2 %)$	188.1 $(- 0.4 %)$	8522.3 $(0.1 %)$	45.3 $(0.4 %)$	15.9 $(3 %)$	26.1 $(3 %)$
1	BNN	8618 $(2 %)$	188.9 $(0.1 %)$	8568.8 $(0.6 %)$	45.4 $(0.7 %)$	15.4 $(0 %)$	25.4 $(0 %)$
	Smith	7803 $(- 8 %)$	181.9 $(- 4 %)$	8491.7 $(- 0.3 %)$	46.7 $(3.5 %)$	13.6 $(- 12 %)$	22.4 $(- 11 %)$
	Logit $(η = 1)$	8699 $(3 %)$	189.9 $(0.6 %)$	8558.3 $(0.5 %)$	45.1 $(0 %)$	15.8 $(3 %)$	26.0 $(3 %)$
	FP	16,941 $(- 71 %)$	331.5 $(- 18 %)$	13,557.1 $(1 %)$	40.9 $(23 %)$	18.1 $(- 29 %)$	47.9 $(- 29 %)$
	RD	15,321 $(- 74 %)$	318.8 $(- 21 %)$	13,496.4 $(1 %)$	42.3 $(27 %)$	16.24 $(- 37 %)$	42.9 $(- 36 %)$
2	BNN	17,870 $(- 70 %)$	337.4 $(- 16 %)$	13,569.9 $(1 %)$	40.2 $(21 %)$	18.9 $(- 26 %)$	50.0 $(- 26 %)$
	Smith	15,608 $(- 74 %)$	320.7 $(- 20 %)$	13,537.8 $(1 %)$	42.2 $(27 %)$	16.0 $(- 38 %)$	42.3 $(- 37 %)$
	Logit $(η = 1)$	17,885 $(- 70 %)$	337.7 $(- 16 %)$	13,570.6 $(1 %)$	40.2 $(21 %)$	18.9 $(- 26 %)$	49.8 $(- 26 %)$
	FP	56,974 $(- 17 %)$	452.1 $(- 6 %)$	14,377.1 $(6 %)$	31.8 $(13 %)$	24.7 $(11 %)$	70.2 $(18 %)$
	RD	52,301 $(- 24 %)$	456.7 $(- 5 %)$	14,209.7 $(5 %)$	31.1 $(10 %)$	19.6 $(- 12 %)$	55.7 $(- 7 %)$
3	BNN	67,506 $(- 1 %)$	470.3 $(- 2 %)$	13,624.5 $(1 %)$	29.0 $(3 %)$	20.7 $(- 7 %)$	56.2 $(- 6 %)$
	Smith	54,569 $(- 20 %)$	458.5 $(- 4 %)$	14,185.8 $(5 %)$	30.9 $(10 %)$	19.5 $(- 12 %)$	54.9 $(- 8 %)$
	Logit $(η = 1)$	67,158 $(- 2 %)$	471.6 $(- 2 %)$	13,620.6 $(1 %)$	28.9 $(2 %)$	20.9 $(- 6 %)$	56.4 $(- 6 %)$
	FP	21,087 $(- 88 %)$	367.4 $(- 32 %)$	14,164.8 $(17 %)$	38.6 $(72 %)$	19.2 $(- 43 %)$	52.9 $(- 35 %)$
	RD	19,087 $(- 90 %)$	353.4 $(- 34 %)$	14,192.7 $(17 %)$	40.2 $(79 %)$	17.2 $(- 49 %)$	47.4 $(- 42 %)$
4	BNN	51,206 $(- 72 %)$	432.9 $(- 20 %)$	14,106.1 $(17 %)$	32.4 $(44 %)$	23.1 $(- 32 %)$	63.4 $(- 22 %)$
	Smith	18,407 $(- 90 %)$	350.3 $(- 35 %)$	14,166.2 $(17 %)$	40.4 $(80 %)$	17.0 $(- 50 %)$	46.9 $(- 43 %)$
	Logit $(η = 1)$	59,657 $(- 67 %)$	450.8 $(- 16 %)$	14,041.3 $(16 %)$	31.1 $(38 %)$	24.0 $(- 29 %)$	65.9 $(- 19 %)$
	FP	52,650 $(- 53 %)$	419.5 $(- 13 %)$	13,823.6 $(7 %)$	32.9 $(23 %)$	21.9 $(- 20 %)$	60.7 $(- 14 %)$
	RD	25,001 $(- 77 %)$	367.6 $(- 24 %)$	14,206.7 $(10 %)$	38.6 $(44 %)$	18.3 $(- 33 %)$	51.4 $(- 28 %)$
5	BNN	50,421 $(- 55 %)$	430.9 $(- 11 %)$	13,978.1 $(8 %)$	32.4 $(21 %)$	22.0 $(- 20 %)$	61.3 $(- 14 %)$
	Smith	22,758 $(- 80 %)$	363.0 $(- 25 %)$	14,232.5 $(10 %)$	39.2 $(46 %)$	18.1 $(- 34 %)$	50.8 $(- 28 %)$
	Logit $(η = 1)$	54,132 $(- 51 %)$	435.1 $(- 10 %)$	13,940.7 $(8 %)$	32.0 $(19 %)$	21.9 $(- 20 %)$	60.9 $(- 14 %)$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Published by MDPI on behalf of the International Institute of Knowledge Innovation and Invention. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Amaya, S.P.; Ñañez, P.A.; Martínez Vásquez, D.A.; Calderón Chávez, J.M.; Mateus Rojas, A. Hybrid Model-Based Traffic Network Control Using Population Games. Appl. Syst. Innov. 2025, 8, 102. https://doi.org/10.3390/asi8040102

AMA Style

Amaya SP, Ñañez PA, Martínez Vásquez DA, Calderón Chávez JM, Mateus Rojas A. Hybrid Model-Based Traffic Network Control Using Population Games. Applied System Innovation. 2025; 8(4):102. https://doi.org/10.3390/asi8040102

Chicago/Turabian Style

Amaya, Sindy Paola, Pablo Andrés Ñañez, David Alejandro Martínez Vásquez, Juan Manuel Calderón Chávez, and Armando Mateus Rojas. 2025. "Hybrid Model-Based Traffic Network Control Using Population Games" Applied System Innovation 8, no. 4: 102. https://doi.org/10.3390/asi8040102

APA Style

Amaya, S. P., Ñañez, P. A., Martínez Vásquez, D. A., Calderón Chávez, J. M., & Mateus Rojas, A. (2025). Hybrid Model-Based Traffic Network Control Using Population Games. Applied System Innovation, 8(4), 102. https://doi.org/10.3390/asi8040102

Article Menu

Hybrid Model-Based Traffic Network Control Using Population Games

Abstract

1. Introduction

2. Notations

3. Mathematical Description of a Urban Traffic Network

3.1. Continuous Modeling of a Urban Traffic Network

3.2. Hybrid Urban Traffic System

4. Population Game-Based Model for Urban Traffic Control

4.1. Mean Dynamics and Revision Protocols

4.2. Population Dynamics

4.2.1. Replicator Dynamics

4.2.2. Brown–Von Neumann–Nash

4.2.3. Logit Dynamic

4.2.4. Smith Dynamic

4.3. Fictitious Play

5. Traffic Simulator

6. Case Study

7. Discussion

8. Conclusions

Limitations and Future Directions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI