Next Article in Journal
Microcontroller Implementation of Support Vector Machine for Detecting Blood Glucose Levels Using Breath Volatile Organic Compounds
Next Article in Special Issue
An Adaptive Track Segmentation Algorithm for a Railway Intrusion Detection System
Previous Article in Journal
Comparison and Combination of Thermal, Fluorescence, and Hyperspectral Imaging for Monitoring Fusarium Head Blight of Wheat on Spikelet Scale
Previous Article in Special Issue
Efficient Traffic Video Dehazing Using Adaptive Dark Channel Prior and Spatial–Temporal Correlations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Decentralized Game-Theoretic Adaptive Traffic Signal Controller: Large-Scale Testing

by
Hossam M. Abdelghaffar
1,2 and
Hesham A. Rakha
3,*
1
Department of Computers & Control Systems, Engineering Faculty, Mansoura University, Mansoura, Dakahlia 35516, Egypt
2
Center for Sustainable Mobility, Virginia Tech Transportation Institute, Virginia Tech, Blacksburg, VA 24061, USA
3
Charles E. Via, Jr. Dept. of Civil and Environmental Engineering, Director of the Center of Sustainable Mobility, Virginia Tech Transportation Institute, Virginia Tech, Blacksburg, VA 24061, USA
*
Author to whom correspondence should be addressed.
Sensors 2019, 19(10), 2282; https://doi.org/10.3390/s19102282
Submission received: 26 March 2019 / Revised: 10 May 2019 / Accepted: 13 May 2019 / Published: 17 May 2019
(This article belongs to the Special Issue Intelligent Transportation Related Complex Systems and Sensors)

Abstract

:
This paper presents a novel de-centralized flexible phasing scheme, cycle-free, adaptive traffic signal controller using a Nash bargaining game-theoretic framework. The Nash bargaining algorithm optimizes the traffic signal timings at each signalized intersection by modeling each phase as a player in a game, where players cooperate to reach a mutually agreeable outcome. The controller is implemented and tested in the INTEGRATION microscopic traffic assignment and simulation software, comparing its performance to that of a traditional decentralized adaptive cycle length and phase split traffic signal controller and a centralized fully-coordinated adaptive phase split, cycle length, and offset optimization controller. The comparisons are conducted in the town of Blacksburg, Virginia (38 traffic signalized intersections) and in downtown Los Angeles, California (457 signalized intersections). The results for the downtown Blacksburg evaluation show significant network-wide efficiency improvements. Specifically, there is a 23.6 % reduction in travel time, a 37.6 % reduction in queue lengths, and a 10.4 % reduction in CO 2 emissions relative to traditional adaptive traffic signal controllers. In addition, the testing on the downtown Los Angeles network produces a 35.1 % reduction in travel time on the intersection approaches, a 54.7 % reduction in queue lengths, and a 10 % reduction in CO 2 emissions compared to traditional adaptive traffic signal controllers. The results demonstrate significant potential benefits of using the proposed controller over other state-of-the-art centralized and de-centralized adaptive traffic signal controllers on large-scale networks both during uncongested and congested conditions.

1. Introduction

Traffic growth and limited available capacity within the roadway system produces problems and challenges for transportation agencies. Traffic congestion affects traveler mobility and has an impact on air quality, and consequently on public health. The stopping and starting in traffic jams burns fuel at a higher rate than the smooth rate of travel, and contributes to the amount of emissions released by vehicles that create air pollution and are related to global warming [1]. Reduction in traffic congestion improves traveler mobility and accessibility, while also reducing vehicle fuel consumption and emissions.
Traffic congestion in 2013 cost Americans $ 124.2 billion [2], and this number is projected to rise to $ 186.2 billion in 2030. Traffic signal controllers attempt to optimize various traffic variables (e.g., delay, queue length, and energy and emission levels), by optimizing signal control variables, including the cycle length, the phasing scheme and sequence, the phase split, and the offset. Most of the currently implemented traffic signal systems can be categorized into one of the following categories: fixed-time control (FP), actuated control (ACT), responsive control, or adaptive control [3].
An FP control system is developed off-line using historical traffic data to compute traffic signal timings; real-time traffic data is not taken into account, and the duration and order of all phases stay fixed without any adaptation to real-time traffic demand fluctuations [4]. Previous studies have found this approach to only be appropriate for under-saturated conditions and traffic flows that are stable or relatively stable [5]. By comparison, ACT systems respond to changes in traffic demand patterns by communicating with the controller based on the presence or absence of vehicles as identified by local detectors installed at intersection approach stop lines. While ACT has been proven to generally perform better than FP for very low demand levels, it still offers no real-time optimization to adapt to traffic fluctuations, and may result in long network queues [6]. Adaptive systems have the potential to alleviate traffic congestion by adjusting signal timing parameters in response to real-time traffic fluctuations. These systems use detector inputs, historical trends, and predictive models to predict vehicle arrivals at intersections, and then use the predictions to determine the best gradual changes in cycle length, phase splits, and offsets to minimize vehicle delays or queue lengths [7]. Some examples in this category are: the Split Cycle Offset Optimization Tool (SCOOT) [8], a macroscopic model that minimizes delay and the number of vehicle stops at all intersection approaches, and performs effectively in under-saturated traffic conditions. The Sydney Coordinated Adaptive Traffic System (SCATS) [9] operates in a centralized hierarchical mode, and allocates green times to the phases of greatest need. OPAC [10] optimizes an objective function for a specified rolling horizon using dynamic-programming-based traffic prediction models that require a traffic environment state transition probability model, which can be difficult to generate. TR2 and UTCS-1 [11], optimized off-line, are incapable of handling stochastic variations in traffic patterns.
The operation of actuated and adaptive controllers is constrained by minimum and maximum cycle lengths, green indication durations and offsets, and also require going through a pre-defined sequence of phases. In addition, some systems use hierarchies that either partially or totally centralize decisions, rendering them more susceptible to failures. Hierarchies make scaling these systems up more difficult, relatively more complex to operate, and more expensive [13].
Various computational intelligence-based techniques have been investigated in the domain of traffic signal optimization domain, and are still under continuous research and development, using fuzzy sets, genetic algorithms, reinforcement learning, and neural networks. Genetic algorithms compute the optimal solution using an evolutionary process of possible solutions [13,14]; it solves simple networks and deals with static traffic volumes. However, as the network increases in size, the search space involved in finding effective signal plans increases significantly, and a large amount of centralized computing power is required. Pappis [15] proposed the first signal controller using fuzzy logic for an isolated intersection. Ella [16] proposed a neuro-fuzzy controller, where the parameters of the fuzzy membership functions were adjusted using a neural network. The neural learning algorithm in Ella’s work was reinforcement learning, which was found to be successful at constant traffic volumes, but failed when the traffic demand changed rapidly. The choice of the membership functions (building blocks of fuzzy set theory) are important for a particular problem since they affect a fuzzy inference system. As a traffic control system is a complex large-scale system with many interactive factors, it is more appropriate to use fuzzy control for isolated intersections [17].
Several approaches have been proposed for designing traffic signal controllers using neural networks [18,19]. Most of these works are based on a distributed approach, where an agent is assigned to update the traffic signals of a single intersection. Neural networks also adapt very slowly to changing traffic parameters, where on-line learning has to take place continuously. Some networks require multiple models to be maintained for various times within a day. Most intelligence-based approaches are still being researched and are thus under development or have only been implemented and tested on an isolated intersection, so their effectiveness for controlling a large-scale traffic network is also unknown.
Reinforcement learning is inspired by behavioral psychology [20]. It is a machine learning approach which allows agents to interact with the environment, attempting to learn the optimal behavior based on the feedback received from interactions. The feedback may be available right after the action, or several time steps later, which makes the learning more challenging [21]. Abdulhai et al. [22] applied a model-free Q-learning technique to a simple two-phase isolated traffic signal in a two-dimensional road network. Salkham et al. [23] applied a Q-learning strategy that allowed an agent to exchange rewards with its neighbors on 64 signalized intersections. The state-action space was simple and very time coarse. Each agent decided the phase splits every two cycles, which did not capture of the rapid dynamics of congestion–coordination between the agents actions was missing. Studies have considered the use of RL algorithms for traffic control, but they are very limited in terms of network complexity and traffic loadings, so that realistic scenarios, over saturated conditions, and transitions from under saturation to over saturation (and vice versa) have not been fully explored.
Game theory studies the interactive cooperation between intelligent rational decision makers with the specific goal of cooperating and benefiting from reaching a mutually agreeable outcome. It has been widely used in economic, military, communication applications [24,25], model traveler route choice behavior [26], control connected vehicle movements [27], and to in-route guidance [28]. The literature indicates that investigation of game-theoretic traffic signal control is very limited. Bargaining theory is related to cooperative games through the concept of Nash bargaining (NB). A bargaining situation is defined as a situation in which multiple players with specific objectives cooperate and benefit by reaching a mutually agreeable outcome [29]. The bargaining process is the procedure that bargainers follow to reach an agreement (outcome) [30], and the bargaining outcome is the result of the bargaining process [31,32].
Traffic flow is affected by a number of factors, including weather, time-of-day, day-of-week, and unpredictable events, such as special events, incidents, and work zones. Consequently, traffic control strategies could be improved if control systems responded not only to actual conditions, but also adapted their actions to transient conditions. Due to the stochastic nature of traffic flows, an adaptive control strategy that adjusts to stochastic changes is needed. Cycle-free strategies may present an innovative and less restrictive means of accommodating variations in traffic conditions.
Traffic signal controllers can be categorized as centralized or decentralized. Centralized systems require a reliable and direct communication network between a central computer and the local controllers. The main advantage of these systems is that they allow for traffic signal coordination. However, decentralized systems offer many advantages over centralized control systems as they are computationally less demanding and require only relevant information from adjacent intersections/controllers. Robustness is also guaranteed in decentralized control systems, because if one or more controllers fail, the remaining controllers can take over some of their tasks. Decentralized systems are scalable and easy to expand by inserting new controllers into the system. Additionally, decentralized systems are often inexpensive to establish and operate, as there is no essential need for a reliable and direct communication network between a central computer and the local controllers in the field.
To mitigate traffic congestion, a novel de-centralized traffic signal controller, considering a flexible phasing sequence and cycle-free operation, using a NB game-theoretic framework (DNB) is developed. The proposed controller was implemented and evaluated in the INTEGRATION microscopic traffic assignment and simulation software [33,34,35]. INTEGRATION is a microsopic model that replicates vehicle longitudinal motion using the Rakha–Pasumarthy–Adjerid collision-free car-following model, also known as the RPA model [36]. The RPA model captures vehicle steady-state car-following behavior using the Van Aerde model [37,38]. Movement from one steady state to another is constrained by a vehicle dynamics model described in [39,40]. Vehicle lateral motion is modeled using lane-changing models described in [35]. The model estimates of vehicle delay were validated in [41], while vehicle stop estimation procedures are described and validated in [42]. Vehicle fuel consumption and emissions are modeled using the VT-Micro model [43,44,45]. The developed controller was compared to the operation of a decentralized phase split and cycle length controller (PSC) [6], and a fully coordinated adaptive phase split-cycle length and offset optimization controller (PSCO) to evaluate its performance, where PSCO is based on the REALTRAN (REAL-time TRANsyt) controller that emulates the SCOOT system [46,47]. The DNB controller was implemented and evaluated on large-scale networks consisting of 38 (Blacksburg) and 457 (downtown Los Angeles) signalized intersections.
This paper describes the application and the testing of the proposed DNB controller on large-scale networks and is organized as follows. Section 2 describes the developed de-centralized traffic signal controller using a game-theoretic framework. Section 3 presents the experimental setup and results of a large-scale study in the town of Blacksburg, Virginia, consisting of 38 signalized intersections. Section 4 describes the experimental setup and the experimental results of a large-scale study on a downtown network in Los Angeles, California, consisting of 457 signalized intersections. Section 5 presents a summary and conclusions drawn from these studies.

2. Traffic Signal Controller

This section describes the NB solution for two players (Section 2.1), Section 2.2 describes how the NB approach is adapted and extended to control a multi-phase (player) signalized intersection (DNB), and Section 2.3 describes the de-centralized mechanism of the DNB controller over an entire transportation network.

2.1. NB Solution for Two Players

A bargaining situation is defined as a situation in which multiple players with specific objectives cooperate and benefit by reaching a mutually agreeable outcome (agreement). In bargaining theory, there are two concepts: the bargaining process and the bargaining outcome.
The bargaining process is the procedure that bargainers follow to reach an agreement (outcome). Nash adopted an axiomatic approach that abstracts the bargaining process and considers only the bargaining outcome [31]. The bargaining problem consists of three basic elements: players, strategies, and utilities (rewards). Bargaining between two players is illustrated in the bi-matrix shown in Table 1. Each player, namely P 1 and P 2 , has a set of possible actions A 1 and A 2 , whose outcome preferences are given by the utility functions u and v, respectively, as they take relevant actions.
The space (S) shown in Figure 1, is the set of all possible utilities that the two players can achieve; the vertices of the area are the utilities where each player chooses their pure strategy. The disagreement or the threat point d = ( d 1 , d 2 ) corresponds to the minimum utilities that the players want to achieve. The threat point is a benchmark, and its selection affects the bargaining solution. Each player attempts to choose their threat point in order to maximize their bargaining position. Subsequently, a bargaining problem is defined as the pair (S,d) where S R 2 and d S such that S is a convex and compact set, and there exists some s S such that s > d .
Nash’s theorem states that there exist a unique solution satisfying four axioms (Pareto efficiency, symmetry, invariance to equivalent utility representation, and independence of irrelevant alternatives), and this solution is the pair of utilities ( u * , v * ) that solves the following optimization problem:
max u , v ( u d 1 ) ( v d 2 ) s . t . ( u , v ) S , ( u , v ) ( d 1 , d 2 )
The NB solution ( u * , v * ) of this optimization problem can be calculated as the point in the bargaining set that maximizes the product of the players utility gains relative to a fixed threat point.

2.2. DNB Traffic Signal Controller for Multi-Players

This section describes the game model and the DNB solution for multi-players (N), and shows how the model is adapted (from Section 2.1) and applied to control a multi-phase signalized intersection. First, a four-phase scheme for a four-legged intersection is used, assuming four players ( N = 4 ), to represent the intersection phases as shown in Figure 2, with protected, leading main street left-turn phases.
In the game model, the four phases are modeled as four players P 1 , P 2 , P 3 , and P 4 in a four-player cooperative game. For each player (phase), there are two possible actions: maintain ( A 1 ) or change ( A 2 ) . These actions produce the state for the traffic signal. Specifically, the action m a i n t a i n maintains the traffic signal (i.e., if it is displaying a green indication, it will remain green; if it is displaying a red indication, it will remain red). The action c h a n g e entails changing the state of the traffic signal (i.e., if it is displaying a green indication, it will switch its state by first introducing a yellow indication followed by a red indication; if it is red, it will switch to a green indication) in the simulated time interval. The combinations of phases offer four possibilities, where only one player holds the green indication and all others hold red indications [48].
The INTEGRATION software is a microscopic traffic simulation model that traces individual vehicle movements every deci-second. Driver characteristics such as reaction times, acceleration and deceleration levels, desired speeds, and lane-changing behavior are examples of stochastic variables that are modeled in INTEGRATION. The threshold speed is fixed and assigned to the entire network (chosen to be equal to the typical pedestrian speed, s T h = 4.5 ( km / h ) ). We continuously check the vehicle speeds when they are within the threat distance from the approach stop bar. If the vehicle (v) speed ( s v t ) is less than the threshold speed ( s T h ) at time (t), the vehicle is assigned to the queue, and the current queue length associated with the corresponding lane (l) is updated. Once the vehicle’s speed exceeds ( s T h ) the queue length is updated (i.e., shortened by the number of vehicles leaving the queue). This is formulated mathematically as
q l t = v v l t q v t
q v t = 1 if s v t 1 > s T h & s v t s T h 1 if s v t 1 s T h & s v t > s T h 0 if s v t 1 s T h & s v t s T h if s v t 1 > s T h & s v t > s T h
q l t is the number of queued vehicles in lane l at time t. The index ( t 1 ) is used to refer to the previous time step. In this case the previous deci-second as the INTEGRATION model tracks vehicle movements at a frequency of 10 hertz.
The utilities (rewards) for each player (phase) in the game can be defined as the estimated sum of the queue lengths in each phase after applying a specific action. The estimated queue length after applying a specific action is calculated according to the following equation:
Q P ( t + Δ t ) = l P q l t + Q i n l Δ t Q o u t l Δ t
where Δ t is the updating time interval, q l t is the current queue length at time t, Q P ( t + Δ t ) is the estimated queue length after Δ t for phase P, Q i n l is the arrival flow rate (veh/h/lane), and Q o u t l is the departure flow rate (veh/h/lane).
The NB solution is extended to four players (N=4) with a four-dimensional utility space and threat points. The solution for the four-phase NB problem can be formulated as:
max ( u 1 , , u N ) i = 1 N ( u i d i ) s . t . ( u 1 , , u N ) S , ( u 1 , , u N ) ( d 1 , , d N )
The NB solution can be calculated as the vector that maximizes the product of the player’s utility gains relative to a fixed threat point. The threat point represents the maximum number of vehicles that could be accumulated per lane (i.e., the maximum measurable queue length). The objective is to minimize and equalize the queue lengths across the different phases. Hence, the negative queue length is used as the utility of each strategy considering a negative threat point. In other words, the reward (u) is defined to be the negative of the estimated queue length ( Q P ), i.e., u = Q P , and we substitute (d) with a negative number. Consequently, the objective function can be rewritten as follows:
max ( Q P 1 , , Q P N ) i = 1 N ( d i Q P i ) s . t . ( Q P 1 , , Q P N ) S , ( Q P 1 , , Q P N ) ( d 1 , , d N )
The block diagram for the DNB controller is shown in Figure 3, where the predefined threat point values are an input to the controller (i.e., the maximum queue size that each player can accommodate). Q o u t l are generally measured at the approach stop bar, whereas Q i n l are measured at a distance from the stop bar equal to the threat point divided by the approach jam density (i.e., the maximum length of the queue assuming all vehicles are stopped).
The flows Q i n l and Q o u t l can be measured using stationary sensors (e.g., loop detectors or through video image processor (VIP) detection obtained from CCTV cameras). The queue length estimates can be obtained using CCTV cameras or via GPS-equipped vehicles that communicate with the the traffic signal controller. As such, the proposed controller is technology agnostic.

2.3. DNB Controller for Multi-Intersections

This section presents the DNB controller formulation for a network composed of multiple signalized intersections. For illustration purposes only, we formulate the problem considering three signalized intersections, as shown in Table 2. It should be noted, however that the algorithm can operate on a network of any number of signalized intersections.
Assume we have three signalized intersections ( I 1 , I 2 , I 3 ), each traffic signal has three phases ( P h 1 , P h 2 , P h 3 ), where each phase is modeled as a player in a game resulting in a total of nine players where I 1 has three players ( P 1 , P 2 , P 3 ), I 2 has three players ( P 4 , P 5 , P 6 ), and I 3 has three players ( P 7 , P 8 , P 9 ). Each traffic signal has three possible actions (A), where one phase displays a green indication (G) while the others display a red indication (R), as illustrated in Table 2.
Consequently, for the three signalized network illustrated in Table 2, there are 27 possible scenarios (action permutations) as shown in Table 3. The optimum overall network performance (NB optimum, Equation (6)) can be computed from Table 3.
Referring to Table 2, and assuming that the first traffic signal ( I 1 ) has action ( A 12 ) that optimizes its performance, traffic signal ( I 2 ) has action ( A 21 ) that optimizes its performance, and traffic signal ( I 3 ) has action ( A 33 ) that optimizes its performance. Consequently, searching in Table 3 for the Nash optimum combination yields scenario 12. This implies that in order to achieve the Nash optimum network performance, it is sufficient to search for the actions that optimize the operations of each signalized intersection. This can be described using the NB optimization problem shown in the following equations.
max ( u 1 , , u 9 ) i = 1 9 ( u i d i ) = max ( u 1 , , u 9 ) [ i = 1 3 ( u i d i ) I 1 i = 4 6 ( u i d i ) I 2 i = 7 9 ( u i d i ) I 3 ] = max ( u 1 , , u 3 ) i = 1 3 ( u i d i ) I 1 max ( u 4 , , u 6 ) i = 4 6 ( u i d i ) I 2 max ( u 7 , , u 9 ) i = 7 9 ( u i d i ) I 3
The network-wide Nash optimum solution is obtained by maintaining the Nash optimum solution at each signalized intersection. As such, while the proposed NB controller is decentralized (i.e., DNB), it still produces the network-wide Nash-optimum control strategy relying solely on edge computing. The Nash optimum should not be mistaken for the system-optimum solution, where the system optimum might sacrifice the performance of one or more traffic signals to achieve optimum network-wide performance. It should be noted that obtaining the system-optimum solution is impossible given the scale and level of interactions of the various network-wide traffic signal controllers. The DNB controller, thus, provides a scalable and resilient controller that circumvents the problems inherent in complex centralized systems with minimum sacrifices to network-wide performance.
Note that a single traffic signal cannot be decomposed (i.e., optimize each decision variable independently), as the utilities of the players within the same traffic signal are dependent on each other. Specifically, if one player displays a green indication by default the other players have to display a red indication given that this would result in conflicting movements being discharged simultaneously. Alternatively, each traffic signal controller operates independently. Consequently, decomposition is invalid within a traffic signal but valid between traffic signals, as players within a traffic signal compete for the same resource, namely green time.

3. Blacksburg Town Experiments

This section presents the experimental setup and the results of a testing of the proposed system in the town of Blacksburg, Virginia, illustrated in Figure 4. The simulations were conducted using the morning peak hour (7–8 a.m.) traffic demand. The town of Blacksburg has 38 signalized intersections, 549 stop signs, 30 yield signs, and 1844 links. The minimum free-flow speed on the network was 30 ( km / h ) , and the maximum free-flow speed on the network was 105 ( km / h ) . The minimum link length was 50 m while the maximum link length was 2932 m . The jam density was set at 160 ( veh / km / lane ) . The traffic signal phasing scheme used in the study was the same as those in the field. These varied between 2 to 4 phases.

3.1. Blacksburg Experimental Setup

The time-dependent static O-D demand matrices were generated every 15 min using the QueensOD software [49,50,51]. QueensOD estimates the most likely O-D matrix that is as close structurally as a seed matrix while at the same time minimizing the error between the estimated and field observed link flow counts. The time-dependent static O-Ds were then used to compute a dynamic O-D matrix using procedures described in [52]. The final peak-hour dynamic O-D matrix consisted of 23,260 vehicular trips. Vehicles were loaded for one hour, while the simulation continued until all vehicles cleared the network to ensure that the same number of vehicles were used in comparing the performance of the various traffic signal control algorithms.
The performance of the DNB controller was evaluated by comparing its performance to that of the PSC and PSCO controllers. The network-wide average of each of the following measures of effectiveness (MOEs) was calculated to assess the DNB controller’s performance: travel time, total delay, stopped delay, queue length, fuel consumption, and emission levels. The INTEGRATION microscopic traffic assignment and simulation software was used to model the network, shown in Figure 4. Three experiments were conducted on the BB network, as discussed in the following sections.

3.2. BB Experimental Results: 1

In this experiment the performance of the DNB controller was compared to the PSC and PSCO controllers. The threat point (d) values per lane for the DNB controller were assigned based on the link’s lengths (L), the link’s free-flow speeds ( U f ), and the updating time intervals ( Δ t ), using the following formula; d=min[ N ( L / 2 ) , N ( U f × Δ t ) ], where N ( L / 2 ) represents the number of vehicles that could be accumulated up to the half length of the link, and N ( U f × Δ t ) represents the maximum number of stopped vehicles that could be stored in the distance ( U f × Δ t ). Using this distance allowed vehicles to proceed through the intersection in a minimal time without stopping if there was no queue ahead of them. A distance of L / 2 was used instead of L to get a better estimate of the queue length for each movement because drivers typically moved to their desired lanes as they got closer to the signalized intersection, and to avoid being fully queued (i.e., players will accept a fully occupied (queued) link).
The average MOE values over the entire simulation for the PSC, PSCO, and DNB control scenarios are summarized in Table 4. In addition, Table 4 shows the percent improvement in MOEs using the proposed DNB controller over the PSC and PSCO controllers. The improvement (%) is calculated as:
Improvement ( % ) = MOE ( PSC / PSCO ) MOE ( DNB ) MOE ( PSC / PSCO ) × 100
The simulation results demonstrated a significant reduction in the average travel time of 5.25 % , a reduction in the average total delay of 16.5 % , and a reduction in the average stopped delay of 40.3 % over the PSC controller. In addition, the results indicated significant reduction in the average travel time of 6.5 % , a reduction in the average total delay of 19.8 % , and a reduction in the average stopped delay of 52.7 % over the PSCO controller. These results show that the proposed DNB controller outperforms both the PSC and PSCO controllers.

3.3. BB Experimental Results: 2

This section presents a potential solution to better estimate the queue length considering the driver’s lane changing behavior close to the intersections. A suggested phasing scheme, shown in Figure 5b, where all vehicles on the link discharge in a single phase, might provide a better estimate of the queue length per phase over the currently implemented phase scheme shown in Figure 5a, where each link discharges in two phases. Two simulations were conducted using the DNB controller to evaluate the effectiveness of the two phasing scheme on the MOEs, where the threat point per lane was assigned using the following formula: d=min[ N ( L / 2 ) , N ( U f × Δ t ) ]. The simulation results using the two schemes (Figure 5) are shown in Table 5.
The simulation results demonstrate that the suggested phasing scheme does not improve the network performance.

3.4. BB Experimental Results: 3

This section presents the effect of reducing the number of vehicles that can be accumulated in a lane on the network’s performance. The minimum free-flow speed on the network was 30 ( km / h ) , and the maximum free-flow speed on the network was 105 ( km / h ) , with updating time intervals of 10 s . Assigning the detector locations to be the min( L / 2 , U f × Δ t ), the detectors could be located for long links between 84 m (i.e., 13 veh/lane) to 292 m (i.e., 47 veh/lane). Employing the free-flow speed to determine the threat point (d = min[ N ( L / 2 ) , N ( U f × Δ t ) ]) is a good choice for low traffic demand, as vehicles are not required to stop at the intersection); however, for high traffic demand, long links can accommodate long queues, which causes delays for the vehicles on that link. Hence, reducing the number of vehicles that can accumulate in a lane might enhance the network’s performance. To examine the effectiveness of changing the maximum number of vehicles that could be accumulated per lane on the MOEs, a sensitivity analysis was conducted, as shown in Figure 6, with d = min[ N ( L / 2 ) , N V ], where N V presents the maximum number of vehicles that can be stored in a lane; this number ranges between 6 to 32 vehicles.
Analysis of the results in Figure 6 demonstrated that better performance using the DNB controller could be achieved if the threat points are assigned as a minimum of 12 veh/lane and the number of vehicles that could be accumulated in L / 2 , (d=min[ N ( L / 2 ) , 12]).
Table 6 shows the average MOEs values over the entire simulation time and the percent improvement in MOEs using the proposed DNB controller over PSC and PSCO controllers. Simulation results indicate significant reduction in the average total delay of 19.38 % , a reduction in the average stopped delay of 51.18 % , a reduction in the average travel time of 6.162 % , a reduction in the average number of stops of 8.39 % , a reduction in the average fuel consumption of 3.89 % , and a reduction in the emission levels of 3.84 % over the PSC controller. The results show that the proposed DNB approach outperforms both the PSC and PSCO controllers.
To further investigate the achieved improvements using the DNB controller, it was taken into consideration that the network has 459 stop signs and 30 yield signs, which might conceal the full degree of improvement achieved using the DNB controller on the signalized intersection. Accordingly, we investigated the percent improvement in MOEs using the DNB controller over the PSC controller over only the links that were directly associated with intersections. Table 7 shows the percent improvement in MOEs using the DNB controller over the PSC controller on the 38 intersections.
Table 7 demonstrates an improvement in the travel time on the intersections between 6 % to 52 % , an improvement in the queue length on the intersections between 8 % to 60 % , and an improvement in the number of stops on the intersections between 8 % to 80 % . In addition, Table 7 demonstrates an overall reduction in the average travel time of 23.63 % , in the average queued vehicles of 37.66 % , in the average number of stops of 23.58 % , in the average fuel consumption of 10.44 % , in the average CO 2 emitted of 9.84 % , and in the average NO X emitted of 5.4 % over the PSC controller. These results revealed that the DNB controller performs significantly better than the PSC controller.

4. Downtown Los Angeles Experiments

This section describes the experimental setup and the experimental results of large scale studies in downtown Los Angeles, California comprised of 457 signalized intersections.

4.1. Los Angeles Experimental Setup

These experiments were large scale studies of a network in downtown Los Angeles (LA), California, including the most congested downtown area, as shown in Figure 7a. The INTEGRATION microscopic traffic assignment and simulation software was used to model the network, as shown in Figure 7b.
Simulations were conducted using the morning peak hour (7–8 a.m.) traffic demand that was calibrated in a previous effort [53]. The downtown LA network has 457 signalized intersections, 285 stop signs, 23 yield signs, and 3556 links. The origin-destination (O-D) demand matrices were generated, as described earlier, using a combination of the QueensOD software, to generate time-dependent static O-D demands, and then converting these static O-D demands to a dynamic O-D demand. The resulting O-D consisted of a total of 143,957 vehicle trips. Vehicles were loaded for the one-hour period and the simulation continued until all vehicles cleared the network to ensure that all comparisons were made for the same number of vehicles.
The traffic signal phasing schemes varied from 2 to 6 phases, reflecting the field implemented traffic signal settings in downtown LA. The minimum free-flow speed on the network was 15 ( km / h ) , and the maximum free-flow speed on the network was 120 ( km / h ) . The minimum link length on the network was 50 m , and the maximum link length on the network was 4400 m . The jam density of the various network links was set equal to 180 ( veh / km / lane ) .
The DNB controller was compared to the PSC controller to evaluate their relative performance. The average of each of the following measures of effectiveness (MOEs) was calculated to assess the performance of the DNB controller: travel time, total delay, stopped delay, queue length, fuel consumption, and emission levels.

4.2. LA Experimental Results: 1

In this experiment, the performance of the DNB controller was compared to that of the PSC controller using the full traffic demand in the morning peak hour. The threat point per lane for the DNB controller was assigned as the minimum of 12 veh/lane and the number of vehicles that could be accumulated on L / 2 (i.e., d = min[ N ( L / 2 ) , 12]) based on the sensitivity analysis shown in Figure 8.
The average MOE values over the entire simulation for the PSC and DNB controllers are shown in Table 8. In addition, Table 8 shows the percent improvement in MOEs using the proposed DNB controller relative to the PSC controller. The simulation results demonstrate a significant reduction in the average travel time of 7.89 % , a reduction in the total delay of 14.55 % , a reduction in the average stopped delay of 25.18 % , a reduction in the average number of vehicle stops of 12.4 % , a reduction in the average fuel consumption of 4.0 % , and a reduction in CO 2 emission levels of 4.25 % , relative to the PSC controller. Analysis of the results demonstrated that the proposed DNB controller outperforms current state-of-the-art de-centralized traffic signal controllers.
The improvements produced by the DNB controller, only at the signalized intersections, were further analyzed. Accordingly, we investigated the percent improvement in MOEs using the DNB controller over the PSC controller over only the links that were directly associated with signalized intersections.
Table 9 demonstrates a reduction in the average travel time of 35.16 % , a reduction in the average queued vehicles of 54.67 % , a reduction in the average number of stops of 44.03 % , a reduction in the average fuel consumption of 9.97 % , a reduction in the CO 2 emissions of 9.92 % , and a reduction in the NO X emissions of 11.78 % relative to the PSC controller. These results revealed that the DNB controller has significantly better performance potential than the PSC controller.

4.3. LA Experimental Results: 2

A simulation was conducted for lower levels of traffic congestion by scaling the demand down by 90 % (i.e., 10 % of the peak demand) to investigate the performance potential using the DNB controller. Table 10 shows a reduction in the average travel time of 7.1 % , a reduction in the average total delay of 36.79 % , a reduction in the average stopped delay of 90.26 % , a reduction in the average number of vehicle stops of 34.66 % , a reduction in the average fuel consumption of 4.8 % , and a reduction in CO 2 emission levels of 4.79 % , relative to the PSC controller.
Once more, to further investigate the achieved improvements using the DNB controller, we investigated the improvement in MOEs over only the links that were directly associated with signalized intersections, as shown in Table 11. Table 11 demonstrates a reduction in the average travel time of 19.19 % , a reduction in the average queued vehicles of 49.84 % , a reduction in the average number of stops of 53.71 % , a reduction in the average fuel consumption of 54.16 % , a reduction in the average CO 2 emitted of 16.09 % , and a reduction in the average NO X emitted of 25.94 % over PSC controller.
These results demonstrate that the DNB controller performed significantly better than the PSC controller in both congested and uncongested conditions, however, produced more savings as the traffic demand decreased.
The results show that the DNB controller yielded significant improvements in the average values of all MOEs, demonstrating improved system efficiency.

5. Summary & Conclusions

The research presented in this paper develops and evaluates a Nash bargaining de-centralized flexible phasing cycle-free traffic signal controller (DNB controller) on large-scale networks. The controller was implemented and tested in the INTEGRATION microscopic traffic assignment and simulation software. The performance of the DNB controller was compared to a decentralized phase split and cycle length optimization controller based on the HCM procedures (PSC) and a fully-coordinated adaptive phase split, cycle length and offset optimization controller (PSCO), in the town of Blacksburg, Virginia and in downtown Los Angeles, California.
Several simulations were conducted on the Blacksburg network using different threat point values and phasing schemes to determine their effect on the controller’s performance. The results show significant reductions in the network-wide average travel time of 6.1 % and 7.3 % , a reduction in the average total delay of 19.3 % and 22.6 % , a reduction in the stopped delay of 51 % and 61 % , and a reduction in CO 2 emission levels of 3.8 % and 3.7 % , over the PSC and PSCO controllers, respectively. In addition, the results show significant reductions on the intersection approach average travel time of 23.6 % , a reduction in the average queue length of 37.6 % , a reduction in the average number of vehicle stops of 23.6 % , a reduction in the fuel consumption of 9.8 % , a reduction in the CO 2 emissions of 10.4 % , and a reduction in NO X emissions of 5.4 % .
In addition, the DNB controller’s performance was tested in downtown Los Angeles, California, and compared to the performance of the de-centralized PSC controller. The results show significant improvements in various network-wide measures of performance. Specifically, a reduction in the average travel time of 8 % , a reduction in the average total delay of 14.5 % , a reduction in the stopped delay of 25.1 % , a reduction in the average number of vehicle stops of 12.4 % , and a reduction in CO 2 emissions of 4.25 % , over the PSC controller. Moreover, the results show significant improvements in the signalized intersection operations with a reduction in the average travel time of 35.1 % , a reduction in the average queue length of 54.7 % , a reduction in the average number of vehicle stops of 44 % , a reduction in the fuel consumption and CO 2 emissions of 10 % , and a reduction in NO X emissions of 11.7 % . Furthermore, simulations conducted for lower traffic demand levels showed significant network-wide improvements with a reduction in the average total delay of 36.7 % , a reduction in the stopped delay of 90.2 % , and a reduction in the average number of stops of 35 % over the PSC controller. As these results indicate, the DNB controller can generate major performance improvements at lower demands. The results demonstrate significant potential benefits of using the proposed controller over other state-of-the-art centralized and de-centralized controllers on large scale networks.
In summary, a novel traffic signal controller is developed that offers a number of unique features. First, the controller adapts signal timings dynamically to changing traffic conditions without using historical data, which tends to be inaccurate, resulting in inefficient traffic signal plans. Second, the developed controller is de-centralized, which increases both the scalability and robustness of the system, to avoid the problems inherent with complex centralized communication. Decentralized systems are often inexpensive to establish and operate, as there is no essential need for a reliable and direct communication network between a central computer and the local controllers in the field. Third, the controller, while de-centralized, does not sacrifice in system-wide performance and computes the network-wide Nash optimum solution. Finally, the controller is designed to operate with current traffic signal controllers. This controller should increase the traffic handling capacity of roads, and reduce unnecessary stop-and-go vehicular movement, which will reduce fuel consumption and, accordingly, air pollution.

Author Contributions

The work described in this article is the collaborative development of all authors, conceptualization, H.M.A. and H.A.R; methodology, H.M.A. and H.A.R.; software, H.M.A. and H.A.R.; validation, H.M.A. and H.A.R.; formal analysis, H.M.A. and H.A.R.; investigation, H.M.A. and H.A.R.; writing—review and editing, H.M.A. and H.A.R.

Funding

This effort was funded by the US Department of Transportation through the University Mobility and Equity Center (Award 69A3551747123).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Elbakary, M.I.; Abdelghaffar, H.M.; Afrifa, K.; Rakha, H.A.; Cetin, M.; Iftekharuddin, K.M. Aerosol Detection Using Lidar-Based Atmospheric Profiling. In SPIE Optics and Photonics for Information Processing XI; SPIE: Bellingham, WA, USA, 2017. [Google Scholar] [CrossRef]
  2. The Future Economic and Environmental Costs of Gridlock in 2030; Technical Report; Center for Economics and Business Research: London, UK, 2014.
  3. Dai, Y.; Zhao, D.; Zhang, Z. Computational Intelligence in Urban Traffic Signal Control: A Survey. IEEE Trans. Syst. Man, Cybern. 2012, 42, 485–494. [Google Scholar] [CrossRef]
  4. Turky, A.M.; Ahmad, M.S.; Yusoff, M.; Hammad, B.T. Using genetic algorithm for traffic light control system with a pedestrian crossing. In Proceedings of the 4th International Conference, RSKT, Gold Coast, Australia, 14–16 July 2009. [Google Scholar] [CrossRef]
  5. Yang, X. Comparison Among Computer Packages in Providing Timing Plans for Iowa Arterial in Lawrence, Kansas. J. Transp. Eng. 2001, 127, 311–318. [Google Scholar] [CrossRef]
  6. Roess, R.; Prassas, E.S.; McShane, W.R. Traffic Engineering, 4th ed.; Pearson Higher Education, Inc.: Upper Saddle River, NJ, USA, 2010. [Google Scholar]
  7. French, L.J.; French, M.S. Benefits of Signal Timing Optimization and ITS to Corridor Operations; Technical Report; French Engineering, LLC: Spring, TX, USA, 2006. [Google Scholar]
  8. Hunt, P.B.; Robertson, D.I.; Bretherton, R.D.; Winton, R.I. SCOOT-A Traffic Responsive Method of Coordinating Signals; Technical Report; Transport and Road Research Laboratory: Wokingham, UK, 1981. [Google Scholar]
  9. Sims, A.G.; Dobinson, K.W. SCAT-The Sydney Co-ordinated Adaptive Traffic System Philosophy and Benefits. In Proceedings of the International Symposium on Traffic Control Systems, Berkeley, CA, USA, 6–9 August 1979. [Google Scholar]
  10. Gartner, N.H. OPAC: A demand-responsive strategy for traffic signal control. Transp. Res. Rec. J. Transp. Res. Board 1983, 906, 75–81. [Google Scholar]
  11. MacGowan, J.; Fullerton, I.J. Development and testing of advanced control strategies in the urban traffic control system. Public Roads 1979, 43, 97–105. [Google Scholar]
  12. Evans, M.R. Balancing Safety and Capacity in an Adaptive Signal Control System—Phase 1; Technical Report FHWA-HRT-10-038; Federal Highway Administration: Washington, DC, USA, 2010.
  13. Ceylan, H.; Bell, M.G.H. Traffic signal timing optimization based on genetic algorithm approach, including driver’s routing. Transp. Res. Part B 2004, 38, 329–342. [Google Scholar] [CrossRef]
  14. Chin, Y.; Yong, K.; Bolong, N.; Yang, S.; Teo, K. Multiple Intersections Traffic Signal Timing Optimization with Genetic Algorithm. In Proceedings of the IEEE International Conference on Control System, Computing and Engineering, Penang, Malaysia, 25–27 November 2011. [Google Scholar] [CrossRef]
  15. Pappis, C.; Mamdani, E. A Fuzzy Logic Controller for a Traffic Junction Systems. IEEE Trans. Man Cybern. 1977, 7, 707–717. [Google Scholar] [CrossRef]
  16. Bingham, E. Neurofuzzy Traffic Signal Control; Helsinki University of Technology: Espoo, Finland, 1998. [Google Scholar]
  17. Liu, Z. A Survey of Intelligence Methods in Urban Traffic Signal Control. Int. J. Comput. Sci. Netw. Secur. 2007, 7, 105–112. [Google Scholar]
  18. Spall, J.; Chin, D. A model-free approach to optimal signal light timing for system-wide traffic control. In Proceedings of the 33rd IEEE Conference on Decision and Control, Lake Buena Vista, FL, USA, 14–16 December 1994. [Google Scholar] [CrossRef]
  19. Srinivasan, D.; Choy, M.C.; Cheu, R.L. Neural Networks for Real-Time Traffic Signal Control. IEEE Trans. Intell. Transp. Syst. 2006, 7, 261–272. [Google Scholar] [CrossRef] [Green Version]
  20. Shoham, Y.; Powers, R.; Grenager, T. Multi-Agent Reinforcement Learning: A Critical Survey; Technical Report; Computer Science Department, Stanford University: Stanford, CA, USA, 2003. [Google Scholar]
  21. Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction; The MIT Press: Cambridge, MA, USA; London, UK, 2012. [Google Scholar]
  22. Abdulhai, B.; Pringle, R.; Karakoulas, G.J. Reinforcement learning for true adaptive traffic signal control. J. Transp. Eng. 2003, 129, 278–285. [Google Scholar] [CrossRef]
  23. Salkham, A.; Cunningham, R.; Garg, A.; Cahill, V. A collaborative reinforcement learning approach to urban traffic control optimization. In Proceedings of the International Conference on Web Intelligence and Intelligent Agent Technology, Sydney, Australia, 9–12 December 2008. [Google Scholar] [CrossRef]
  24. Park, H.; van der Schaar, M. Bargaining strategies for networked multimedia resource management. IEEE Trans. Signal Process 2007, 55, 3496–3511. [Google Scholar] [CrossRef]
  25. Han, Z.; Liu, K.J.R. Fair Multiuser Channel Allocation for OFDMA Networks Using Nash Bargaining Solutions and Coalitions. IEEE Trans. Commun. 2005, 53, 1366–1376. [Google Scholar] [CrossRef]
  26. Chen, J. Game-Theoretic Formulations Of Interaction Between Dynamic Traffic Control and Dynamic Traffic Assignment. Transp. Res. Rec. 1998, 1617, 179–188. [Google Scholar] [CrossRef]
  27. Elhenawy, M.; Elbery, A.A.; Hassan, A.A.; Rakha, H.A. An Intersection Game-Theory-Based Traffic Control Algorithm in a Connected Vehicle Environment. In Proceedings of the 2015 IEEE 18th International Conference on Intelligent Transportation Systems, Las Palmas, Spain, 15–18 September 2015; pp. 343–347. [Google Scholar] [CrossRef]
  28. Jun, L. Study on Game-Theory-Based Integration Model for Traffic Control and Route Guidance; Tian Jin University: Tianjin, China, 2003. [Google Scholar]
  29. Abdelghaffar, H.M.; Yang, H.; Rakha, H.A. Isolated Traffic Signal Control using Nash Bargaining Optimization. Glob. J. Res. Eng. B Automot. Eng. 2016, 16, 27–36. [Google Scholar]
  30. Abdelghaffar, H.M.; Yang, H.; Rakha, H.A. Isolated traffic signal control using a game theoretic framework. In Proceedings of the IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), Rio de Janeiro, Brazil, 1–4 November 2016; pp. 1496–1501. [Google Scholar] [CrossRef]
  31. Han, Z.; Niyato, D.; Saad, W.; Basar, T.; Hjorungnes, A. Game Theory in Wireless and Communication Networks: Theory, Models, and Applications; Cambridge University Press: New York, NY, USA, 2012. [Google Scholar]
  32. Abdelghaffar, H.M.; Yang, H.; Rakha, H.A. A Novel Game Theoretic De-Centralized Traffic Signal Controller: Model Development and Testing. In Proceedings of the 97th Annual Meeting of Transportation Research Board, Washington, DC, USA, 7–11 January 2018. [Google Scholar]
  33. Aerde, M.V.; Rakha, H.A. INTEGRATION© Release 2.40 for Windows: User’s Guide—Volume I: Fundamental Model Features; Technical Report; Center for Sustainable Mobility, Virginia Tech Transportation Institute: Blacksburg, VA, USA, 2012. [Google Scholar]
  34. Aerde, M.V.; Rakha, H.A. INTEGRATION© Release 2.40 for Windows: User’s Guide—Volume II: Advanced Model Features; Technical Report; Center for Sustainable Mobility, Virginia Tech Transportation Institute: Blacksburg, VA, USA, 2013. [Google Scholar]
  35. Rakha, H.A.; Zhang, Y. The INTEGRATION 2.30 Framework for Modeling Lane-Changing Behavior in Weaving Sections. Transp. Res. Rec. J. Transp. Res. Board 2004, 1883, 140–149. [Google Scholar] [CrossRef]
  36. Rakha, H.A.; Pasumarthy, P.; Adjerid, S. A Simplified Behavioral Vehicle Longitudinal Motion Model. Transp. Lett. Int. J. Transp. Res. 2009, 1, 95–110. [Google Scholar] [CrossRef]
  37. Rakha, H.A. Validation of Van Aerde’s Simplified Steady-state Car-following and Traffic Stream Model. Transp. Lett. Int. J. Transp. Res. 2009, 1, 227–244. [Google Scholar] [CrossRef]
  38. Wu, N.; Rakha, H.A. Derivation of Van Aerde Traffic Stream Model from Tandem-Queueing Theory. Transp. Res. Rec. J. Transp. Res. Board 2009, 2124, 18–27. [Google Scholar] [CrossRef]
  39. Rakha, H.A.; Lucic, I.; Demarchi, S.; Setti, J. Vehicle Dynamics Model for Predicting Maximum Truck Accelerations. J. Transp. Eng. 2001, 127, 418–425. [Google Scholar] [CrossRef]
  40. Rakha, H.A.; Snare, M.; Dion, F. Vehicle Dynamics Model for Estimating Maximum Light Duty Vehicle Acceleration Levels. Transp. Res. Rec. J. Transp. Res. Board 2004, 1883, 40–49. [Google Scholar] [CrossRef]
  41. Dion, F.; Rakha, H.A.; Kang, Y.S. Comparison of Delay Estimates at Under-saturated and Over-saturated Pre-timed Signalized Intersections. Transp. Res. Part B Methodol. 2004, 38, 99–122. [Google Scholar] [CrossRef]
  42. Rakha, H.A.; Kang, Y.S.; Dion, F. Estimating Vehicle Stops at Under-Saturated and Over-Saturated Fixed-Time Signalized Intersections. Transp. Res. Rec. 2001, 1776, 128–137. [Google Scholar] [CrossRef]
  43. Ahn, K.; Rakha, H.A.; Trani, A.; Aerde, M.V. Estimating Vehicle Fuel Consumption and Emissions Based on Instantaneous Speed and Acceleration Levels. J. Transp. Eng. 2002, 128, 182–190. [Google Scholar] [CrossRef]
  44. Rakha, H.A.; Ahn, K.; Trani, A. Development of VT-Micro Framework for Estimating Hot Stabilized Light Duty Vehicle and Truck Emissions. Transp. Res. Part D Transp. Environ. 2004, 9, 49–74. [Google Scholar] [CrossRef]
  45. Rakha, H.A.; Ahn, K. INTEGRATION Modeling Framework for Estimating Mobile Source Emissions. J. Transp. Eng. 2004, 130, 183–193. [Google Scholar] [CrossRef]
  46. Aerde, M.V.; Rakha, H.A. REALTRAN: An Off-line Emulator for Estimating the Impacts of SCOOT. In Proceedings of the 74th Transportation Research Board Annual Meeting, Washington, DC, USA, 22–28 January 1995; pp. 124–128. [Google Scholar]
  47. Rakha, H.; Van Aerde, M.; Case, E.R. Experiments in Incremental Real-Time Optimization of Phase, Cycle, and Offset Times Using an On-Line Adaptation of TRANSYT-7F. In Proceedings of the Engineering Foundation Conference on Traffic Management: Issues and Techniques, Palm Coast, FL, USA, 1–6 April 1991. [Google Scholar]
  48. Abdelghaffar, H.M.; Yang, H.; Rakha, H.A. Developing a De-centralized Cycle-free Nash Bargaining Arterial Traffic Signal Controller. In Proceedings of the 5th IEEE International Conference on Models and Technologies for Intelligent Transportation Systems, IEEE MT-ITS, Naples, Italy, 26–28 June 2017. [Google Scholar] [CrossRef]
  49. Aerde, M.V.; Rakha, H.A. QUEENSOD Rel. 2.10—User’s Guide: Estimating Origin—Destination Traffic Demands from Link Flow Counts; Technical Report; Center for Sustainable Mobility, Virginia Tech Transportation Institute: Blacksburg, VA, USA, 2010. [Google Scholar]
  50. Rakha, H.A.; Paramahamsan, H.; Aerde, M.V. Comparison of Static Maximum Likelihood Origin-Destination Formulations. In Transportation and Traffic Theory: Flow, Dynamics and Human Interaction; Elsevier: College Park, MD, USA, July 2005; pp. 693–716. [Google Scholar] [CrossRef]
  51. Aerde, M.V.; Rakha, H.A.; Paramahamsan, H. Estimation of O-D Matrices: The Relationship between Practical and Theoretical Considerations. Transp. Res. Rec. 2003, 1831, 122–130. [Google Scholar] [CrossRef]
  52. Yang, H.; Rakha, H.A. A Novel Approach for Estimation of Dynamic from Static Origin-Destination Matrices. Transp. Lett. Int. J. Transp. Res. 2019, 11, 219–228. [Google Scholar] [CrossRef]
  53. Du, J.; Rakha, H.A.; Elbery, A.; Klenk, M. Microscopic Simulation and Calibration of a Large-Scale Metropolitan Network: Issues and Proposed Solutions. In Proceedings of the 97th Annual Meeting of Transportation Research Board, Washington, DC, USA, 7–11 January 2018. [Google Scholar]
Figure 1. Utility region.
Figure 1. Utility region.
Sensors 19 02282 g001
Figure 2. Phasing scheme.
Figure 2. Phasing scheme.
Sensors 19 02282 g002
Figure 3. System block diagram.
Figure 3. System block diagram.
Sensors 19 02282 g003
Figure 4. Blacksburg network.
Figure 4. Blacksburg network.
Sensors 19 02282 g004
Figure 5. Four phasing scheme. (a) Implemented phasing scheme. (b) Suggested phasing scheme.
Figure 5. Four phasing scheme. (a) Implemented phasing scheme. (b) Suggested phasing scheme.
Sensors 19 02282 g005
Figure 6. Sensitivity analysis. (a) Average travel time. (b) Average CO 2 .
Figure 6. Sensitivity analysis. (a) Average travel time. (b) Average CO 2 .
Sensors 19 02282 g006
Figure 7. Downtown Los Angeles network. (a) LA, Google maps. (b) LA, INTEGRATION.
Figure 7. Downtown Los Angeles network. (a) LA, Google maps. (b) LA, INTEGRATION.
Sensors 19 02282 g007
Figure 8. LA Sensitivity Analysis. (a) Average Travel Time. (b) Average Fuel Consumption.
Figure 8. LA Sensitivity Analysis. (a) Average Travel Time. (b) Average Fuel Consumption.
Sensors 19 02282 g008
Table 1. Two players matrix game.
Table 1. Two players matrix game.
P 2
A 1 A 2
P 1 A 1 u 1 , v 1 u 2 , v 2
A 2 u 3 , v 3 u 4 , v 4
Table 2. Multi-player matrix game.
Table 2. Multi-player matrix game.
IntersectionFirst Intersection (I1)Second Intersection (I2)Third Intersection (I3)
PlayerPh1 (P1)Ph2 (P2)Ph3 (P3)Ph1 (P4)Ph2 (P5)Ph3 (P6)Ph1 (P7)Ph2 (P8)Ph3 (P9)
Action
First G R R A 11 G R R A 21 G R R A 31
Second R G R A 12 R G R A 22 R G R A 32
Third R R G A 13 R R G A 23 R R G A 33
Table 3. All possible Network Actions (Permutations).
Table 3. All possible Network Actions (Permutations).
Scenario #Network Action
1 A 11 A 21 A 31
2 A 11 A 21 A 32
3 A 11 A 21 A 33
4 A 11 A 22 A 31
5 A 11 A 22 A 32
6 A 11 A 22 A 33
7 A 11 A 23 A 31
8 A 11 A 23 A 32
9 A 11 A 23 A 33
10 A 12 A 21 A 31
11 A 12 A 21 A 32
12 A 12 A 21 A 33
13 A 12 A 22 A 31
14 A 12 A 22 A 32
15 A 12 A 22 A 33
16 A 12 A 23 A 31
17 A 12 A 23 A 32
18 A 12 A 23 A 33
19 A 13 A 21 A 31
20 A 13 A 21 A 32
21 A 13 A 21 A 33
22 A 13 A 22 A 31
23 A 13 A 22 A 32
24 A 13 A 22 A 33
25 A 13 A 23 A 31
26 A 13 A 23 A 32
27 A 13 A 23 A 33
Table 4. Average measures of effectiveness (MOEs) and (%) improvement for game-theoretic framework (DNB) over phase split and cycle length controller (PSC) and phase split-cycle length and offset optimization controller (PSCO) controllers.
Table 4. Average measures of effectiveness (MOEs) and (%) improvement for game-theoretic framework (DNB) over phase split and cycle length controller (PSC) and phase split-cycle length and offset optimization controller (PSCO) controllers.
SystemPSCPSCODNB
MOE
Average Total Delay (s/veh)96.234100.19780.323
Improvement %16.53419.823
Average Stopped Delay (s/veh)20.28525.64912.1074
Improvement %40.31452.7962
Average Travel time (s)306.254310.225290.175
Improvement %5.2506.463
Average Number of Stops4.6624.58994.281
Improvement %8.186.734
Average Fuel (L)0.41420.41290.40
Improvement %3.383.07
Average CO 2 Emissions (g)913.833912.495883.127
Improvement %3.363.22
Table 5. MOEs using two different phasing schemes.
Table 5. MOEs using two different phasing schemes.
SystemDNB (Field Scheme)DNB (Modified Scheme)Imp. (%)
MOE
Average Total Delay (s/veh)80.32394.712−17.913
Average Stopped Delay (s/veh)12.10724.381−101.374
Average Travel Time (s)290.175302.425−4.222
Average Number of Stops4.2814.417−3.177
Average Fuel (L)0.400.41−2.274
Average CO 2 Emissions (g)883.127902.277−2.168
Table 6. Average MOEs and (%) improvement using DNB over the PSC and PSCO controllers.
Table 6. Average MOEs and (%) improvement using DNB over the PSC and PSCO controllers.
SystemPSCPSCODNB
MOE
Average Total Delay (s/veh)96.234100.19777.577
Improvement %19.387122.575
Average Stopped Delay (s/veh)20.28525.6499.903
Improvement %51.18261.391
Average Travel Time (s)306.254310.225287.384
Improvement %6.1627.362
Average Number of Stops4.6624.58994.271
Improvement %8.3936.95
Average Fuel (L)0.41420.41290.3981
Improvement %3.8873.584
Average CO 2 (grams)913.833912.495878.739
Improvement %3.843.7
Table 7. Intersections (%) improvement of MOEs using DNB over PSC controller.
Table 7. Intersections (%) improvement of MOEs using DNB over PSC controller.
MOEsTravel TimeQueueNum. of Stops CO 2 Fuel NO X
Int. #
16.15322.01524.3112.6452.5660.161
216.40926.80121.1847.7067.7105.859
38.48518.23332.7776.0346.4509.040
431.11452.87439.5648.1666.5958.756
522.23053.87552.9149.3558.9623.309
623.17634.43514.24011.59410.7164.751
78.96715.88117.8323.8893.5972.271
824.05741.86816.11413.75313.4809.162
940.70956.26729.85025.25324.65413.842
1013.39526.34641.4368.6348.6539.772
1117.62826.34011.8029.0148.3531.352
127.6427.96832.6503.4813.3733.476
1319.41437.90920.9158.9918.7453.758
1428.50335.49925.3597.8546.6178.147
1523.87039.63034.58412.55312.2726.166
1627.55259.09541.87615.10914.7858.836
1742.00160.00056.97416.89614.82712.842
1826.25849.88332.72314.49113.4145.703
1919.67636.53321.1044.9634.2534.976
2052.23776.08363.08832.96631.76220.159
2134.82250.15946.26521.56821.38518.268
2238.26759.39637.46627.62827.28426.528
2317.19330.86316.2727.5956.9225.258
2434.66943.99711.26914.63213.3423.239
2523.48044.58857.3815.7604.5020.085
2618.02926.02830.5034.0172.4780.750
2728.12936.3408.56516.76916.19414.480
2814.53035.04611.9029.4599.84611.611
2913.13119.1159.6035.3474.9851.142
3023.63247.38223.22419.33019.40924.772
3132.76155.70180.38118.00417.27319.333
3234.76153.07035.45626.64127.04529.311
3335.98448.47215.25620.34819.56311.668
3416.67932.67630.33511.27311.15111.757
3518.01228.95021.57518.24118.67226.116
3622.58846.50934.3317.6767.0282.465
3729.30746.50231.4867.3996.6781.081
3814.31714.5528.0614.6694.1681.143
Overall (%)23.63337.66623.58610.4449.8425.390
Table 8. Average MOEs and the (%) improvement using DNB controller over PSC controller (100% Demand).
Table 8. Average MOEs and the (%) improvement using DNB controller over PSC controller (100% Demand).
SystemPSCDNBDNB Imp. (%)
MOE
Average Total Delay (s/veh)557.463476.34614.55
Average Stopped Delay (s/veh)256.766192.11625.178
Average Travel Time (s)1034.27952.7327.89
Average Number of Stops7.4066.48712.4
Average Fuel (L)1.1551.1094.0
Average CO 2 (grams)2482.132376.594.25
Table 9. Average (%) improvements of MOEs using DNB controller over PSC controller (100% Demand), over the links that are directly associated with intersections.
Table 9. Average (%) improvements of MOEs using DNB controller over PSC controller (100% Demand), over the links that are directly associated with intersections.
MOEsTravel TimeQueueNum. of Stops CO 2 Fuel NO X
Int. #
Overall 457 Int. (%)35.15654.6644.0319.9669.91911.774
Table 10. Average MOEs and the (%) improvement using DNB over PSC controller (10% Demand).
Table 10. Average MOEs and the (%) improvement using DNB over PSC controller (10% Demand).
SystemPSCDNBDNB Imp. (%)
MOE
Average Total Delay (s/veh)84.93853.68936.79
Average Stopped Delay (s/veh)19.9711.945190.261
Average Travel Time (s)450.114418.1777.1
Average Number of Stops4.4752.92434.66
Average Fuel (L)0.8460.8054.8
Average CO 2 (grams)1830.271742.534.79
Table 11. Average (%) improvements of MOEs using DNB over PSC controller (10% Demand) over the links directly associated with intersections.
Table 11. Average (%) improvements of MOEs using DNB over PSC controller (10% Demand) over the links directly associated with intersections.
MOEsTravel TimeQueueNum. of Stops CO 2 Fuel NO X
Int. #
Overall 457 Int. (%)19.18649.84453.70854.15816.08525.939

Share and Cite

MDPI and ACS Style

Abdelghaffar, H.M.; Rakha, H.A. A Novel Decentralized Game-Theoretic Adaptive Traffic Signal Controller: Large-Scale Testing. Sensors 2019, 19, 2282. https://doi.org/10.3390/s19102282

AMA Style

Abdelghaffar HM, Rakha HA. A Novel Decentralized Game-Theoretic Adaptive Traffic Signal Controller: Large-Scale Testing. Sensors. 2019; 19(10):2282. https://doi.org/10.3390/s19102282

Chicago/Turabian Style

Abdelghaffar, Hossam M., and Hesham A. Rakha. 2019. "A Novel Decentralized Game-Theoretic Adaptive Traffic Signal Controller: Large-Scale Testing" Sensors 19, no. 10: 2282. https://doi.org/10.3390/s19102282

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop