Next Article in Journal
Integration of Multi-Gas Sensors and Aerial Thermography into UAVs for Environmental Monitoring of a Landfill
Next Article in Special Issue
Hidden Carbon Emissions Induced by Functional Curbside Capacity Loss in Urban Freight Systems
Previous Article in Journal
Multi-Scale Atrous Feature Fusion Based on a VGG19-UNet Encoder for Brain Tumor Segmentation
Previous Article in Special Issue
Large-Scale Metro Train Timetable Rescheduling via Multi-Agent Deep Reinforcement Learning: A High-Dimensional Optimization Approach in Flatland Environment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Resilient Multi-Agent State Estimation for Smart City Traffic: A Systems Engineering Approach to Emission Mitigation

Department of Industrial Engineering, Faculty of Engineering, Düzce University, Düzce 81620, Türkiye
Appl. Sci. 2026, 16(8), 3972; https://doi.org/10.3390/app16083972
Submission received: 24 March 2026 / Revised: 13 April 2026 / Accepted: 16 April 2026 / Published: 19 April 2026
(This article belongs to the Special Issue Advances in Transportation and Smart City)

Abstract

Uninterrupted traffic flow monitoring is a prerequisite for optimal resource allocation and minimizing vehicular emissions in smart cities. However, centralized traffic management architectures are highly vulnerable to single points of failure. When structural sensor malfunctions occur, the resulting network unobservability paralyzes dynamic signalization, triggering cascading traffic congestion, extended idling times, and severe greenhouse gas emissions. To address this cyber-ecological vulnerability, we propose the Hybrid Multi-Agent State Estimation (H-MASE) protocol, a fully decentralized decision-support framework designed from an applied systems reliability engineering perspective. By deploying PSAs and VLAs directly onto IoT-enabled edge devices at smart intersections, H-MASE leverages a hop-by-hop edge computing topology to collaboratively track macroscopic route flow dynamics. Mathematically, this distributed estimation process is formulated as a network-wide least-squares convex optimization problem, where local projection operators function as exact Distributed Gradient Descent steps to minimize the global residual sum of squares. The distributed consensus mechanism acts as a spatial variance reduction tool, effectively dampening measurement noise and stochastic demand fluctuations. Furthermore, we introduce an autonomous anomaly detection logic that isolates severe structural faults rapidly, which is mathematically structured to prevent false alarms under bounded disturbance conditions. Numerical simulations demonstrate that the protocol yields a highly resilient optimality gap (e.g., a Root Mean Square Error of merely 0.81 vehicles per estimated state) even under catastrophic hardware failures. Ultimately, H-MASE provides a robust, fail-safe data foundation for sustainable urban logistics and green-wave signalization, ensuring that smart cities maintain ecological resilience and optimal resource utilization under severe structural disruptions.

1. Introduction

Rapid urbanization and the exponential surge in vehicle ownership have transformed urban traffic networks into highly congested macroscopic service systems. This congestion represents a major bottleneck in sustainable urban logistics and operations management. Beyond the severe economic losses and supply chain delays, the transportation sector remains a primary driver of greenhouse gas (GHG) emissions [1]. Inefficient idling at intersections and stop-and-go traffic waves act as massive resource sinks, contributing substantially to local air pollution and global climate change [2]. To minimize the impact of pollutant emissions and unsustainable fuel consumption in urban areas, a paradigm shift toward advanced Intelligent Transportation Systems (ITS) has become mandatory. Harnessing multi-agent technologies to manage urban traffic forms a fertile research area for improving transportation efficiency, environmental care, and safety in smart cities [3].
Central to this smart mobility mission is Traffic State Estimation (TSE), which serves as the foundational data-driven decision-support system for next-generation ITS. It seeks to infer operational parameters—such as queue lengths, flow capacities, and spatial resource utilization—from sparse observational data [4]. High-fidelity TSE is the essential cyber-physical backbone for downstream operations research applications, including network-wide eco-routing and green-wave signal optimization, which are vital for mitigating the environmental impacts of recurrent congestion events [5].
However, despite the proliferation of Internet of Things (IoT) sensors, conventional traffic monitoring architectures are severely encumbered by their centralized nature. From a systems reliability engineering perspective, central servers introduce a severe single-point-of-failure vulnerability [6]. When unexpected structural sensor malfunctions or communication link failures occur, centralized observers frequently fail to adapt, leading to network-wide unobservability. In the physical realm, this cyber-failure paralyzes dynamic traffic light controllers, forcing them into suboptimal, fixed-cycle fallback operations. This operational degradation instantly triggers cascading artificial bottlenecks, extended vehicular idling, and a drastic surge in localized emissions [7]. Therefore, ensuring the continuous, fail-safe operation of TSE under catastrophic hardware disruptions is not merely a computational challenge, but a ecological imperative for smart cities. Transitioning from vulnerable cloud-based architectures to resilient, IoT-enabled edge computing paradigms is essential to maintain uninterrupted traffic intelligence.
Recent literature highlights that mitigating sensor-induced traffic disruptions and optimizing intersection flows are important for sustainable urban mobility management. Empirical studies demonstrate that suboptimal traffic light synchronizations and inefficient flows at complex physical junctions, such as multi-arm intersections and large roundabouts, drastically exacerbate vehicular pollutant emissions and fuel consumption [8,9]. Consequently, establishing robust and fail-safe traffic state estimation is essential to directly curtail GHG emissions by enabling dynamic, sustainable travel route planning and eco-routing protocols [10]. Furthermore, to achieve these sustainability-oriented urban traffic optimizations without the immense energy and bandwidth consumption of centralized cloud servers, recent advancements strongly advocate for the deployment of decentralized, multi-agent frameworks [11]. By bridging this gap, the proposed framework embeds distributed convex optimization directly into IoT edge devices (e.g., roadside units). This applied edge-computing approach explicitly aligns robust multi-agent mathematics with the practical, cyber-ecological resilience required by modern smart transport systems.

2. Related Work

To overcome the inherent fragility and computational bottlenecks of centralized monitoring, researchers are increasingly pivoting toward distributed architectures [6]. In these frameworks, the urban traffic network is modeled as a multi-agent system, where autonomous agents collaborate over a decentralized communication topology without relying on a central server [6]. This paradigm shift not only ensures greater expansibility and structural robustness against local failures but also allows the system to approach global optimization through hop-by-hop local interactions and consensus mechanisms [3,6].
A particularly effective strategy for managing the immense complexity of metropolitan grids is the implementation of hierarchical multi-agent frameworks [12]. By decomposing a large-scale network into manageable layers—such as region agents, intersection agents, and turning-movement agents—these hierarchical structures effectively reduce the problem’s dimensionality and enhance learning efficiency across the system [12]. Recent advancements have further integrated these structures with game-theoretic models, such as the Nash-Stackelberg hierarchical game, to systematically resolve goal conflicts between primary and secondary intersections while ensuring overall network stability [13]. Furthermore, to ensure that these distributed controllers remain sensitive to both immediate and long-term dynamics, modern algorithms now employ multi-granularity fusion [14]. This allows agents to hierarchically learn both fine-grained current step-states and coarse-grained historical trends, facilitating the derivation of more physically consistent traffic-signal policies [14]. To address the real-time constraints of rapidly evolving urban environments, broad reinforcement learning has emerged as a high-speed alternative to traditional deep architectures, utilizing ridge regression to achieve fast remodeling and adaptation [15].
Origin-Destination (O-D) matrix estimation is widely recognized as one of the most important and challenging components of TSE. To address this problem, various methodologies have been developed, ranging from traditional mathematical programming to modern data-driven approaches. For instance, Englezou et al. proposed a nonlinear mathematical program utilizing a signalized path-based cell transmission model to capture detailed traffic dynamics more accurately [16]. Similarly, Dey et al. performed statistical O-D flow estimation using network tomography based on link counts alone, notably requiring no historical trip data to achieve accurate flow predictions [17]. With the integration of next-generation data sources, O-D estimation has become increasingly complex and data-centric. Wei et al. evaluated a data-driven network assignment method that replaces traditional traffic assignment with GPS probe data to achieve consistent O-D and link flow estimation across urban networks [18]. In a related vein, Vahidi and Shafahi presented a least squares model that fuses vehicle identification data from Automatic License Plate Recognition (ALPR) cameras with link counts, establishing temporal and spatial dependencies through variance-covariance matrices [19]. To manage the computational burden of large-scale networks, Pourhassan et al. developed a heuristic approach utilizing zone-aggregate traffic profiles to support the development of large-scale dynamic traffic assignment models [20]. Meanwhile, Peng et al. introduced a physics-constrained deep learning framework that incorporates GraphSAGE architecture to capture intricate network topology relationships, enhancing robustness against missing data scenarios [21]. Directly addressing the need for decentralized management, Etemadnia and Abdelghany proposed a distributed recursive heuristic that partitions the study area into subareas and merges local estimates through a hierarchical multi-threading mechanism to reduce processing time [22]. Despite these advancements, a significant reliability gap remains in the literature. Most existing frameworks either rely on a centralized optimization authority or lack the dynamic fault-tolerance necessary to withstand structural sensor failures. Furthermore, while modern data-driven and deep learning models exhibit high adaptability, they act as black-box systems that lack formal, worst-case mathematical guarantees regarding estimation stability and error bounds.
To bridge this cyber-physical reliability gap, this paper adopts a strict Operations Research (OR) perspective. Finding the true macroscopic route flow vector r given distributed network observations is formulated as a network-wide least-squares convex optimization problem:
min r R n J ( r ) = 1 2 i = 1 N M i r b i 2 2
where r R n is the unknown macroscopic route-flow vector to be estimated, N represents the total number of collaborative autonomous agents, M i is the local topological sensing matrix mapping the global state to agent i, and b i denotes its corresponding local noisy measurement or tautological topological anchor. By reformulating the decentralized state estimation task through this distributed convex optimization framework, we replace heuristic black-box learning processes with provable analytical architectures. In the proposed framework, the local orthogonal projections executed by the agents correspond to exact Distributed Gradient Descent (DGD) steps aimed at minimizing their local cost functions, while the topological consensus matrix acts as the distributed penalty mechanism enforcing global feasibility.
The practical significance of the proposed Hybrid Multi-Agent State Estimation (H-MASE) framework lies in its ability to equip every localized edge node (i.e., intersection sensor) with real-time, global awareness of the entire macroscopic traffic network. By enabling an isolated intersection to autonomously compute and track the densities of all network-wide routes without relying on a central fusion center, this decentralized architecture provides significant operational advantages for smart city operations:
  • Traditional traffic lights operate reactively based on localized queue lengths, inherently suffering from a myopic control perspective. In contrast, global awareness allows isolated intersections to anticipate incoming traffic flows originating from distant nodes. This predictive capability forms the foundation for proactive operations research interventions, enabling system-wide dynamic scheduling and optimal resource allocation to minimize average waiting times across the network.
  • In the paradigm of Connected and Autonomous Vehicles (CAVs), relying on centralized cloud architectures for routing introduces significant round-trip latency and unnecessary energy overhead. By utilizing IoT-enabled edge nodes equipped with complete global states, CAVs can seamlessly query the nearest smart intersection via Vehicle-to-Infrastructure (V2I) communication to acquire instantaneous, network-wide route densities. This edge-level query capability directly facilitates real-time dynamic eco-routing and sustainable network flow optimization [23], thereby bypassing the need for energy-intensive cloud processing.
  • Conventional Supervisory Control and Data Acquisition (SCADA) based traffic management centers are highly susceptible to communication bottlenecks, natural disasters, or cyber-attacks (e.g., DDoS). The proposed swarm-intelligence topology ensures high resilience; if a central server or a specific regional node collapses, the remaining healthy nodes continue to collaboratively monitor the global state with uncompromised accuracy.
  • Continuously transmitting raw vehicle trajectories, GPS coordinates, or license plate data to a central cloud poses severe privacy risks and consumes massive bandwidth. The H-MASE protocol operates by exchanging abstracted mathematical consensus variables between adjacent physical neighbors, which inherently protects user privacy and effectively mitigates the telecommunication overhead.
  • For next-generation urban logistics, including Mobility as a Service (MaaS) and crowd-shipping platforms, continuous and reliable traffic data is a prerequisite. The resilient, high-fidelity state vector provided by H-MASE serves as a robust data foundation for these smart city applications, ensuring that sustainable fleet routing and dynamic dispatching operations remain uninterrupted even during severe IoT infrastructure degradations.
The main contributions of this paper can be summarized as follows:
  • Decentralized Cyber-Physical Architecture: We propose the H-MASE protocol, a novel hybrid framework combining Physical Sensor Agents (PSAs) and Virtual Logic Agents (VLAs). While PSAs process stochastic measurements, VLAs act as fault-immune topological relays. This localized, hop-by-hop distributed topology ensures that the communication graph remains firmly connected, effectively eliminating the single-point-of-failure vulnerability inherent in conventional centralized systems.
  • Autonomous Fault Isolation with High Reliability: We introduce a robust, localized switching logic that instantaneously detects and isolates severe structural sensor faults. As mathematically proven and empirically validated, this mechanism is designed to theoretically avoid false alarms under bounded nominal conditions.
  • Theoretical Resilience and Input-to-State Stability (ISS) Guarantees: The paper provides mathematical proofs establishing ISS. We demonstrate that as long as the surviving ensemble of healthy PSAs maintains structural observability, the fault-immune VLAs preserve network-wide consensus, preventing global error divergence even after the autonomous isolation of compromised nodes.
  • Foundation for Network-Wide Optimization: From an operations research perspective, the proposed framework supplies resilient, real-time route density estimates. These high-fidelity states serve as a reliable foundational input for dynamic resource allocation, stochastic queue management and traffic signal optimization problems in smart cities.
The remainder of this paper is organized as follows. Section 2 provides a review of related literature. Section 3 details the physical and graph-theoretic models governing urban traffic flows. Section 4 introduces the design of the proposed H-MASE protocol. Section 5 presents convergence and resilience analysis. Section 6 provides numerical results, comparative benchmarking, and an ecological impact assessment. Finally, Section 7 concludes the paper and outlines future research directions.
The key notations and parameters used throughout the mathematical formulations of this framework are summarized in Table 1.

3. Problem Formulation and System Modeling

This section details the physical and graph-theoretic models governing urban traffic flows.

3.1. Physical Graph Topology and Incidence Structures

An urban traffic network is modeled as a directed graph G = ( V , E ) , where V = { v 1 , , v p } and E = { e 1 , , e m } denote the sets of nodes and physical links, respectively, with I E = { 1 , , m } representing the edge index set. Each link e k E has a static saturation capacity c k R > 0 . The structural connectivity is captured by the directed incidence matrix I R p × m , defined element-wise as [ I ] i k = 1 if link e k enters node v i , [ I ] i k = 1 if it leaves v i , and 0 otherwise. An algebraic property of I for a weakly connected graph is its rank deficiency, rank ( I ) = p 1 . This algebraic linear dependence directly dictates the network-wide mass conservation laws, effectively bounding the feasible operations space.

3.2. Physical Flow Dynamics and Boundary Conditions

We partition the network to explicitly isolate the internal junctions ( V i n t , p i n t = | V i n t | ) where no external flow enters or exits. Let the vector y i d e a l ( t ) = [ y 1 ( t ) , , y m ( t ) ] T R 0 m represent the noise-free time-averaged physical link flows at the discrete macroscopic step t. Each component y k ( t ) explicitly denotes the ideal flow volume traversing link e k over the macroscopic sampling interval. These flow rates serve as the foundational variables for evaluating operational metrics, where the utilization ratio y k ( t ) / c k determines the queueing performance and service levels.
In urban networks, the strict algebraic conservation lies at the internal junctions ( V i n t ), where no external flow directly enters or exits the system. For each internal node v i V i n t , the mass conservation law is algebraically formalized as a zero-sum balance:
k E i n ( v i ) y k ( t ) k E o u t ( v i ) y k ( t ) = 0
where E i n ( v i ) = { k I E [ I ] i k = 1 } and E o u t ( v i ) = { k I E [ I ] i k = 1 } are the index subsets for incoming and outgoing links, respectively. This deterministic constraint ensures that the estimated flows remain physically consistent, forming the virtual sensing basis for the network.
The global representation for all internal junctions is given by the deterministic equality:
I i n t y i d e a l ( t ) = 0 p i n t
where I i n t R p i n t × m is the reduced incidence matrix consisting of the rows of I corresponding to V i n t , and 0 p i n t is the p i n t -dimensional zero vector. Equation (3) defines the virtual sensing layer. As visually demonstrated in Figure 1, while physical measurements are susceptible to noise and hardware failures, this topological backbone provides deterministic zero-sum mathematical constraints. This layer ensures that the estimated link flows remain consistent with the network manifold.
Remark 1.
By excluding redundant or boundary nodes, I i n t is assumed to be full row rank ( rank ( I i n t ) = p i n t ), ensuring that the conservation constraints provide maximal and non-redundant algebraic information.

3.3. Path-Based State-Space Characterization

While the nodal balance equations established in Section 3.2 provide macroscopic constraints on the link-flow manifold, they are insufficient to resolve the true causal origins of traffic due to the under-determined nature of the network. Aggregated link flows often mask the underlying routing decisions; therefore, to achieve high-fidelity monitoring, we characterize the hidden state of the network using route-level flows. As conceptually illustrated in the reference topology in Figure 2, multiple causal routes often overlap on shared physical links.
Let R = { R 1 , , R n } be the predefined finite set of dominant acyclic routes actively utilized by the network traffic. We define the true network state as the route-flow vector r ( t ) = [ r 1 ( t ) , , r n ( t ) ] T R 0 n , where each component r j ( t ) represents the vehicle volume utilizing route R j during the macroscopic interval t. The mapping from the hidden route space to the physical link space is governed by the routing matrix A { 0 , 1 } m × n , defined element-wise as:
[ A ] k j = 1 , if link e k belongs to route R j 0 , otherwise
Substituting the linear mapping y i d e a l ( t ) = A r ( t ) into the internal nodal balance equality (3) yields the topological constraint in the route state-space:
A τ r ( t ) = 0 p i n t , where A τ = I i n t A R p i n t × n
Remark 2
(Practical Generation of the Dominant Route Set). Enumerating all theoretically possible acyclic paths in a metropolitan network is an NP-hard problem. However, the proposed framework explicitly avoids this combinatorial explosion by defining R solely as the set of dominant routes. In practical smart city deployments, this subset can be readily extracted from historical O-D matrices or historical GPS trajectory clusters from connected vehicles [24,25]. Any minor traffic flows utilizing non-dominant, anomalous paths—such as temporary detours due to unexpected traffic accidents or spontaneous routing shifts—are not ignored. Instead, they are mathematically absorbed into the bounded random-walk state drift Δ r ( t ) and the observation noise ω ˜ ( t ) , ensuring the protocol remains robust against unmodeled macroscopic variations.
Remark 3
(Tautological Anchor Property). Since every valid acyclic route R j traversing an internal junction v i must both enter and exit it, the matrix A τ 0 p i n t × n is identically the zero matrix. This tautology ( 0 = 0 ) forms the topological foundation for the fault-immune VLAs introduced in Section 4.

3.4. Augmented Measurement Model

The physical sensors (e.g., inductive loops, cameras) deployed on the network links provide noisy measurements of the true traffic flows. Let y p ( t ) R m denote the physical observation vector, modeled as:
y p ( t ) = y i d e a l ( t ) + ω ˜ ( t ) = A r ( t ) + ω ˜ ( t )
where ω ˜ ( t ) R m is the bounded physical measurement noise vector, such that ω ˜ ( t ) 2 ω m a x for some known constant ω m a x > 0 . To unify the estimation framework, we augment the physical measurements with the tautological zero-sum Equation (5). The global augmented observation model b ( t ) = M r ( t ) + ω ¯ ( t ) is algebraically constructed as:
b ( t ) = y p ( t ) 0 p i n t , M = A A τ A 0 p i n t × n , ω ¯ ( t ) = ω ˜ ( t ) 0 p i n t
This augmentation expands the operational dimension of the network without altering the state space. Because ω ¯ ( t ) 2 = ω ˜ ( t ) 2 ω m a x , the VLAs corresponding to the lower blocks remain permanently isolated from physical noise, allowing them to function as pure-consensus topological bridges.

3.5. State Evolution and Drift Model

The route-flow state r ( t ) is inherently dynamic, fluctuating due to stochastic variations in O-D demands and time-varying routing preferences. In dense urban networks, identifying a stable and explicit state-transition matrix is often computationally prohibitive and physically inaccurate due to the volatility of traffic bursts. While macroscopic traffic flow frequently exhibits highly non-linear phenomena such as kinematic shockwaves, incorporating these non-linearities directly into the transition operator often destroys the convexity of the estimation problem. Consequently, from a robust operations management perspective, the temporal state evolution is modeled as a bounded random walk. This deliberate abstraction mathematically encapsulates all complex physical variations, demand shifts, and routing preference fluctuations into the bounded drift term Δ r ( t ) . Because the tracking mechanism focuses exclusively on the predefined set of dominant, actively utilized routes ( R ), the route volumes remain positive ( r ( t ) > 0 ) during nominal operations. Consequently, the state evolution operates sufficiently far from the zero-boundary, allowing it to be accurately characterized by the linear difference equation:
r ( t + 1 ) = r ( t ) + Δ r ( t )
where the term Δ r ( t ) R n represents the unknown but bounded macroscopic state drift. We formalize the physical limits of this network volatility by imposing a strict upper bound on the drift magnitude, such that Δ r ( t ) 2 Δ m a x for all t Z 0 .
The primary objective of the proposed H-MASE architecture is to establish a high-fidelity data foundation for real-time sustainable decision-support systems. Accurately tracking this hidden, stochastically evolving state r ( t ) using the sequence of augmented measurements b ( t ) is required to optimize urban resource allocation and dynamically mitigate congestion-induced emissions. To quantify the network-wide tracking performance, we explicitly consider the finalized local optimality gaps defined as e i ( t , K ) = r ^ i ( t , K ) r ( t ) , where r ^ i ( t , K ) is the state vector consolidated by agent i after completing K microscopic consensus iterations. The subsequent sections detail the decentralized protocol designed to drive these errors into a bounded invariant set proportional to the noise level ω m a x and the volatility Δ m a x .

4. The H-MASE Protocol

Building upon the hybrid measurement framework established in Section 3, this section introduces the design of the H-MASE protocol. The overarching operational architecture and the decision-making sequence of the protocol are visually summarized in Figure 3. The proposed design relies on multi-scale temporal separation between the macroscopic physical traffic dynamics and the microscopic distributed estimation process, a concept frequently employed in hierarchical operations planning. Specifically, the physical traffic state r ( t ) is governed by a macroscopic sampling period T s (typically in the range of 60–300 s), whereas the multi-agent system executes high-frequency communication-computation cycles at a microscopic scale k, where the total block of K microscopic iterations is executed within the duration of a single macroscopic interval T s . This hierarchical temporal structure ensures that the estimator achieves a consensus-based equilibrium before the subsequent physical state transition occurs.

4.1. Cyber-Physical Topology and Agent Classification

Unlike conventional centralized architectures, the H-MASE protocol relies on a distributed cyber-physical framework where information exchange is localized. We model the monitoring environment as a dual-layer graph where the physical transit network G is superimposed by a communication topology G M A S = ( V M A S , E M A S ) . Here, V M A S represents the set of N = m + p i n t autonomous agents, and E M A S denotes the set of bidirectional digital links. To preserve the decentralized nature of urban monitoring, E M A S is topology-induced; communication occurs only between physically adjacent sensors or intersections (hop-by-hop interaction).
To maintain a recursive mapping with the hybrid sensing layers, the agents are partitioned into two functional classes:
  • PSAs: Each PSA i { 1 , , m } is collocated with the physical link detectors to process the local scalar stochastic measurement y p , i ( t ) .
  • VLAs: Residing at the internal junctions, each VLA i { m + 1 , , N } mathematically embodies the tautological nodal-balance equations ( 0 p i n t ). While mathematically represented merely as augmented linear equality constraints with zero sensing rows ( M i = 0 ), their cyber-physical function is profound. They do not restrict the state space; rather, from a systems reliability engineering perspective, they act as fault-immune topological relays. They ensure that even under catastrophic physical sensor degradation, the distributed decision-support architecture maintains its topological connectivity and computational consensus, thereby preventing the ecological disruptions associated with network-wide information fragmentation. In real-world implementations, VLAs are lightweight software routines running directly on the IoT edge computing units (e.g., smart intersection controllers). While they do not process raw physical measurements, their inclusion is structurally vital. During severe multi-sensor failures, isolated PSAs drop their corrupted constraints but rely on the topological bridges maintained by the VLAs to prevent the fragmentation of the cyber-communication graph, thereby ensuring that global consensus is computationally sustained.
The interaction for each agent i is restricted to its inclusive communication neighborhood N i = { j V M A S ( j , i ) E M A S } { i } .
Assumption 1
(Topological Connectedness). The cyber-communication graph G M A S is assumed to be undirected and connected, ensuring that local state estimates can diffuse across the entire network via the consensus mechanism to achieve global observability.
To achieve asymptotic consensus without a global coordinator, the agents employ a doubly-stochastic weight matrix W = [ w i j ] R N × N . We utilize the Metropolis-Hastings algorithm to construct these weights locally, requiring only the knowledge of immediate neighbor degrees d i = | N i | 1 :
w i j = 1 max ( d i , d j ) + 1 , if j N i and j i 1 l N i { i } w i l , if j = i 0 , otherwise
This construction satisfies the row and column sum conditions j w i j = 1 and i w i j = 1 , which is a necessary condition for the preservation of the average state during the diffusion phase.

4.2. Local Projection Operators and Singularity Handling

Each agent i processes its local scalar observational constraint b i ( t ) using a local orthogonal projection matrix P i = I n M i M i R n × n . The generalized pseudoinverse of the local observation row is analytically defined as M i = M i T / M i 2 2 for PSAs (where M i 2 > 0 ), and as the null column vector M i = 0 n for VLAs (where the sensing row is M i = 0 n T ). Consequently, for any VLA, substituting the null sensing row M i = 0 n T inherently yields P i = I n . This algebraic result verifies that VLAs do not contract the state space; instead, they function as perfect identity filters, acting as fault-immune topological relays.

4.3. Distributed State Update and Time-Scale Separation

The H-MASE framework operates on a dual time-scale architecture. Recall from Section 3.5 that t Z 0 denotes the slow macroscopic time scale governing the physical traffic dynamics. Let k { 0 , 1 , , K 1 } denote the fast microscopic time scale governing the cyber-communication iterations.
A core assumption of this architecture is the time-scale separation principle: the cyber-communication bandwidth is sufficiently large such that K consensus iterations occur within a single macroscopic interval [ t , t + 1 ) . Consequently, the local physical observations b i ( t ) and the projection operators P i are treated as static parameters; they remain constant throughout the duration of the internal k-loop.
Remark 4
(Idealized Communication Infrastructure). The strict execution of K microscopic iterations within a single macroscopic interval [ t , t + 1 ) assumes the presence of an Ultra-Reliable Low-Latency Communication framework, such as 5G or standard Dedicated Short-Range Communications in smart cities. The impact of asynchronous packet drops, finite bandwidth constraints, and cyber-communication delays is beyond the scope of this theoretical foundation and remains a vital direction for future robust optimization research.
To ensure continuous tracking without losing historical information, the estimation process leverages a warm-start initialization mechanism. At the beginning of each macroscopic step t, the initial condition for the internal iterations is seeded by the final estimate of the previous step:
r ^ i ( t , 0 ) = r ^ i ( t 1 , K )
where r ^ i ( 1 , K ) = 0 n represents the absolute cold-start condition before the network operations begin.
During the interval [ t , t + 1 ) , each healthy agent i updates its local estimate r ^ i ( t , k ) R n via a distributed optimization scheme consisting of a two-phase predict-and-update process. First, to enforce global feasibility and parameter agreement across the network, the agent aggregates the prior estimates from its inclusive neighborhood N i to form the diffusion vector:
r ^ d i f f , i ( t , k ) = j N i w i j r ^ j ( t , k )
Subsequently, the agent executes a local optimization step to enforce its own observational constraint. From a distributed network optimization perspective, minimizing the local least-squares cost function J i ( r ) = 1 2 M i r b i ( t ) 2 —which physically translates to eliminating uncertainties in intersection queue lengths to prevent vehicular idling—reduces to an orthogonal projection onto the measurement hyperplane.
The core distributed update mechanism utilized in this phase builds upon the foundational discrete-time projection algorithm originally proposed in [26]. However, in this work, we significantly extend that baseline mathematical structure to accommodate the hierarchical cyber-physical topology of IoT edge devices (i.e., PSAs and VLAs) and to integrate the autonomous fault-isolation logic required for resilient smart city deployments. The nominal distributed update law is formalized as:
r ^ i ( t , k + 1 ) = P i r ^ d i f f , i ( t , k ) + M i b i ( t )
Before applying this optimal measurement innovation M i b i ( t ) , each agent calculates a local discrepancy residual η i ( t , k ) R 0 , representing the deviation between the neighborhood consensus and the local physical constraint:
η i ( t , k ) = M i r ^ d i f f , i ( t , k ) b i ( t )
This residual implicitly measures the magnitude of the local gradient; a severe mismatch triggers the autonomous fault isolation logic, which will be studied in the following section.

4.4. Resilient Estimation and Fault Isolation

To prevent the propagation of severe sensor faults, each agent implements an autonomous binary isolation mechanism σ i ( t , k ) { 0 , 1 } . The agent operates nominally ( σ i = 1 ) if the consistency residual satisfies η i ( t , k ) Γ ( t ) , and isolates itself ( σ i = 0 ) otherwise. Here, Γ ( t ) = Γ s t e a d y + Γ i n i t e β t acts as a time-varying resilience threshold derived analytically from the nominal ISS bounds. The exponential decaying term Γ i n i t e β t is strategically incorporated to absorb the massive transient errors associated with the cold-start initialization phase. Since the initial arbitrary estimates r ^ i ( 1 , K ) naturally deviate significantly from the true hidden state, this temporary tolerance buffer mathematically prevents the autonomous logic from prematurely triggering false alarms during the early consensus-building phase, or during the immediate transient shockwave following a topological switch. By integrating this binary switch, we establish the final Resilient Distributed Update Rule:
r ^ i ( t , k + 1 ) = σ i ( t , k ) P i r ^ d i f f , i ( t , k ) + M i b i ( t ) + ( 1 σ i ( t , k ) ) r ^ d i f f , i ( t , k )
When isolated ( σ i = 0 ), this decentralized triage mechanism acts as a rapid systemic fail-safe. The corrupted measurement innovation is bypassed, quarantining the hardware fault before it can propagate through the network and paralyze downstream eco-routing operations, while the agent relies solely on the prior knowledge diffused from its healthy neighbors. Since VLAs are mathematically immune to physical faults ( η i 0 ), they remain permanent topological bridges, ensuring the surviving network remains strongly connected.

4.5. Algorithmic Realization

The theoretical components developed in the preceding subsections are integrated into a unified, decentralized procedural framework. This protocol, termed the H-MASE Algorithm, is designed to execute independently on each agent i. The algorithm operates on the dual-time-scale architecture: a macroscopic outer loop synchronized with the physical sampling period T s , and a microscopic inner loop dedicated to high-frequency iterations k for iterative consensus and constraint enforcement.
Remark 5
(Inference of Traffic Performance Metrics). The estimated route-flow vector r ^ ( t , K ) provided by the H-MASE algorithm allows for the indirect observation of intersection delays and queue lengths. By utilizing the mapping y ^ ( t ) = A r ^ ( t , K ) , the saturation ratio for each physical link e k E can be directly calculated as x k = y ^ k / c k . Based on the Webster delay model, the average delay d k per vehicle on link e k is inferred as [27]:
d k C c y c l e ( 1 λ ) 2 2 ( 1 λ x k ) + x k 2 2 y ^ k ( 1 x k )
where d k denotes the average waiting delay at the k-th intersection approach, C c y c l e is the traffic signal cycle length, and λ represents the effective green ratio for the corresponding signal phase. The variable y ^ k represents the estimated arrival flow rate at node k, which is directly reconstructed from the network-wide route-flow estimates provided by the distributed H-MASE protocol. Consequently, x k explicitly defines the degree of saturation (i.e., the volume-to-capacity ratio) at the intersection approach.
From a data-driven decision-support perspective, maintaining the high-fidelity estimation of y ^ k is important. An erroneous overestimation caused by unmitigated sensor faults could artificially inflate the perceived saturation level x k 1 . Because of the ( 1 x k ) term in the denominators, such an error would force the macroscopic delay model to exhibit an artificial exponential growth, causing downstream decision-support systems to execute highly suboptimal resource allocations. By guaranteeing structurally resilient and bounded estimates, H-MASE ensures the analytical validity of this queueing model. While this mapping theoretically bridges the state estimation to physical intersection queues, the numerical validations in Section 6 focus on the core macroscopic optimality gaps ( B I S S ) to purely isolate and evaluate the algorithmic tracking performance.
The H-MASE Decentralized Estimation Protocol proposed in the paper is given in Algorithm 1.
Algorithm 1 H-MASE Decentralized Estimation Protocol.
1:
Initialization:
2:
   Pre-compute projection operators P i and M i for each agent i V M A S
3:
   Apply local constraint initialization: r ^ i ( 1 , K ) M i b i ( 0 )
4:
for each physical sampling time t = 0 , 1 , 2 ,  do
5:
      Acquire local observation b i ( t )
6:
       r ^ i ( t , 0 ) r ^ i ( t 1 , K )   ▹Temporal Handover from previous step
7:
      for each iteration k = 0 , 1 , , K 1  do
8:
            1. Consensus Phase:
9:
                 Share r ^ i ( t , k ) with neighbors j N i
10:
               Compute diffused average: r ^ d i f f , i ( t , k ) = j N i w i j r ^ j ( t , k )
11:
            2. Integrity Monitoring:
12:
                η i ( t , k ) = | M i r ^ d i f f , i ( t , k ) b i ( t ) |    ▹Check local-to-global consistency
13:
            3. Resilient Update:
14:
            if  η i ( t , k ) > Γ ( t )  then
15:
                                                        ▹ Isolation Mode: Bypass corrupted local data
16:
                r ^ i ( t , k + 1 ) r ^ d i f f , i ( t , k )
17:
          else
18:
                                                 ▹ Nominal Mode: Apply hybrid projection update
19:
                r ^ i ( t , k + 1 ) P i r ^ d i f f , i ( t , k ) + M i b i ( t )
20:
            end if
21:
      end for
22:
      Return  r ^ i ( t , K ) as the finalized estimate for time t
23:
end for
A primary advantage of the H-MASE algorithm is its minimal computational footprint. Each iteration k involves only sparse matrix-vector multiplications and scalar comparisons. Since the dimensions of P i and M i depend solely on the number of routes n, and the communication is limited to local neighborhoods with a maximum degree d m a x , the computational complexity per agent is bounded by O ( K · ( n 2 + d m a x · n ) ) . Crucially, this local computational load is completely independent of the total network size N, ensuring that the algorithm remains highly scalable as the urban network expands. This makes the H-MASE highly suitable for real-time monitoring in large-scale urban networks where centralized processing would suffer from significant communication latency and computational bottlenecks. Furthermore, compared to existing distributed estimation benchmarks, H-MASE offers distinct structural advantages. Unlike Distributed Alternating Direction Method of Multipliers framework [28], which typically require computationally expensive matrix inversions at each local iteration, H-MASE relies exclusively on computationally lightweight orthogonal projections ( P i ). Additionally, compared to Distributed Kalman Filters [29], which heavily depend on the exact knowledge of a state-transition matrix, the proposed robust optimization framework models traffic evolution as a bounded random walk, making it structurally immune to transition-matrix modeling errors. While empirical benchmarking of computational convergence speeds against these conventional frameworks is valuable, the primary objective of the H-MASE protocol is establishing theoretical guarantees. Specifically, the framework prioritizes minimizing false alarms and maintaining exact cyber-physical fault isolation over accelerating nominal iteration speeds.

5. Convergence and Resilience Analysis

In this section, we analyze the convergence properties of the distributed optimization algorithm underpinning the H-MASE architecture. We establish the ISS of the algorithm, which characterizes the robustness of the DGD mechanism and guarantees a bounded optimality gap under the simultaneous presence of macroscopic demand drift Δ r ( t ) and stochastic measurement noise ω ¯ ( t ) .
To analyze the network holistically, we first define the local estimation error (i.e., the distance to the optimal true state) for each agent i V M A S at microscopic optimization iteration k during the macroscopic step t:
e i ( t , k ) = r ^ i ( t , k ) r ( t )
By substituting the nominal update rule (12) into (16) and utilizing the geometric property ( P i + M i M i ) = I n , we obtain the local error dynamics:
e i ( t , k + 1 ) = P i j N i w i j e j ( t , k ) + M i ω ¯ i ( t )
where ω ¯ i ( t ) is the i-th scalar element of the augmented noise vector ω ¯ ( t ) .
We aggregate the local errors into a global state error vector
e ( t , k ) = [ e 1 T ( t , k ) , , e N T ( t , k ) ] T R n N .
The network-wide error evolution is compactly formulated using Kronecker products as:
e ( t , k + 1 ) = Φ e ( t , k ) + M ω ¯ ( t )
where M = diag ( M 1 , , M N ) R n N × N is the block-diagonal pseudoinverse matrix and Φ R n N × n N is the global state-transition operator defined as
Φ = P W
with P = diag ( P 1 , , P N ) and W = ( W I n ) . To guarantee that the distributed algorithm converges to the true physical state rather than an arbitrary consensus, the underlying cyber-physical network must possess sufficient sensory information. We formalize this physical requirement as follows:
Assumption 2
(Global Network Observability). The physical placement of the measurement sensors (PSAs) provides sufficient rank to fully reconstruct the hidden network state. Algebraically, the intersection of the null spaces of the PSA observation rows ( i { 1 , , m } ) is the zero vector:
i = 1 m N ( M i ) = { 0 n }
Remark 6.
Note that Assumption 2 is a standard requirement for ensuring global asymptotic stability and vanishing estimation errors. In practical large-scale deployments where this condition might be locally violated (e.g., due to sparse sensor coverage), the H-MASE protocol still guarantees convergence to the best possible estimate within the observable subspace. In such cases, the error remains bounded by the projection of the state onto the unobservable null space, preventing global divergence.
Under this assumption, we establish the spectral properties of the transition matrix Φ .
Lemma 1
(Spectral Contraction and Convergence Rate). Under Assumptions 1 and 2, the global state-transition operator Φ functions as a strict contraction mapping, guaranteeing the linear convergence of the distributed least-squares optimization algorithm. Specifically, the spectral radius satisfies:
ρ ( Φ ) < 1
Proof. 
To analyze the unforced error dynamics (i.e., the nominal system without the noise term ω ¯ ( t ) ), we aggregate the local error update rules of all N agents into a single global system. Let e ( k ) = [ e 1 ( k ) T , , e N ( k ) T ] T R n N denote the concatenated global error vector at internal iteration k.
We define the global weight matrix as W = W I n , where W = [ w i j ] R N × N is the doubly stochastic adjacency matrix of the graph G M A S , and ⊗ denotes the Kronecker product. Furthermore, let P = diag ( P 1 , P 2 , , P N ) R n N × n N be the global block-diagonal projection matrix.
Using these global operators, the unforced local error dynamics can be compactly written in the matrix form:
e ( k + 1 ) = P W e ( k )
Consequently, the global state-transition matrix is defined as Φ = P W . To prove asymptotic stability, we must show that the spectral radius satisfies ρ ( Φ ) < 1 .
We now analyze the properties of the individual matrices W and P . Since the communication graph is strongly connected (Assumption 1) and W is doubly stochastic, the Perron-Frobenius theorem dictates that the spectral radius of W is exactly ρ ( W ) = 1 . The eigenvalue λ = 1 has an algebraic multiplicity of 1, with the corresponding right eigenvector being the vector of ones, 1 N .
Extending this to W = W I n , the largest eigenvalue remains 1, but with a multiplicity of n. The corresponding eigenspace, known as the consensus subspace, is defined over the complex field to perfectly accommodate general spectral analysis:
C = x C n N x = 1 N α , α C n
For any vector residing in this subspace ( x C ), the consensus operation acts as the identity mapping, yielding W x = x .
We now consider the projection matrix P . Because each local operator P i is an orthogonal projector (satisfying P i = P i T and P i 2 = P i ), its eigenvalues are exclusively either 0 or 1. Therefore, the induced 2-norm (spectral norm) of the block-diagonal matrix is bounded by P 2 1 . Since W is doubly stochastic, we also have W 2 = 1 .
Applying the sub-multiplicative property of matrix norms to Φ = P W , we establish the upper bound of the spectral radius:
ρ ( Φ ) Φ 2 P 2 W 2 1
This guarantees that the global error dynamics are non-expansive ( ρ ( Φ ) 1 ). To prove strict contraction, assume for the sake of contradiction that ρ ( Φ ) = 1 . Consequently, there exists a non-zero eigenvector v = [ v 1 T , , v N T ] T C n N and a corresponding eigenvalue λ C with | λ | = 1 , such that:
P W v = λ v
Taking the 2-norm of both sides, and recalling that P 2 1 and W 2 = 1 , we obtain the inequality chain:
v 2 = | λ | v 2 = P W v 2 P 2 W v 2 W v 2 W 2 v 2 = v 2
Since the extremities are identical, the internal condition W v 2 = v 2 must hold. This implies that v corresponds to the principal singular space of W .
Crucially, because the underlying communication graph is undirected, the Metropolis-Hastings weight matrix W is inherently symmetric ( W = W T ), meaning its singular values perfectly coincide with the absolute values of its eigenvalues. It is important to note that while the physical traffic flow is highly directional (directed graph), the overlaid digital cyber-communication topology ( G M A S ) is explicitly designed as an undirected network. Applying the Metropolis-Hastings rule to this undirected communication graph inherently produces a symmetric and doubly stochastic weight matrix W [30]. Furthermore, since the weights guarantee strict self-loops ( w i i > 0 ), W is primitive. By the Perron-Frobenius theorem for primitive matrices [30], the only eigenvalue on the unit circle is exactly 1. Thus, v is confined to the consensus subspace C , meaning W v = v and its local components are identical:
v = 1 N α v 1 = = v N = α
for some non-zero α C n . Substituting W v = v back into our initial hypothesis (26) yields:
P v = λ v
Because P is an orthogonal projector, its eigenvalues are exclusively 0 and 1. For v 0 and | λ | = 1 to hold simultaneously, we must have λ = 1 and P v = v .
Given the block-diagonal structure P = diag ( P 1 , , P N ) , the global equation P v = v decouples into exact local equalities for every agent:
P i α = α , i { 1 , , N }
Substituting the local operator definition P i = I n M i M i , we obtain:
( I n M i M i ) α = α M i M i α = 0 n
To eliminate the pseudoinverse without invoking complex quadratic forms, we apply the first property of the Moore-Penrose pseudoinverse ( A A A = A ) [30]. Pre-multiplying both sides by the row vector M i collapses the dimension to a scalar, yielding:
M i ( M i M i α ) = 0 ( M i M i M i ) α = 0 M i α = 0
This algebraic step reveals that α must lie exactly in the null space of the observation matrix:
α N ( M i ) , i { 1 , , N }
Because this holds for the entire network, it inherently applies to the subset of PSAs indexed by i { 1 , , m } . Thus, α must reside in their intersection:
α i = 1 m N ( M i )
By Assumption 2 (Global Network Observability), this intersection contains only the zero vector { 0 n } . Consequently, it holds that α = 0 , which means the global eigenvector v = 0 . This directly contradicts the premise that v is a non-zero eigenvector. Therefore, no such eigenvalue exists on the unit circle, and the spectral radius is bounded:
ρ ( Φ ) < 1
From a distributed optimization perspective, this strict contraction inherently ensures that the exact DGD algorithm possesses a linear convergence rate toward the globally optimal feasibility manifold. This concludes the proof of Lemma 1. □
This lemma provides the essential spectral guarantee that the estimation error will contract over time. Based on this framework, we can now derive the formal convergence bounds for both nominal and faulty conditions.

5.1. Convergence Analysis Under Nominal Conditions

In this subsection, we establish the convergence properties of the DGD algorithm under nominal conditions, where all sensors are operational (i.e., σ i ( t ) = 1 , i V M A S ). We utilize the ISS framework to establish theoretical performance bounds, demonstrating that the distance to the optimal solution (optimality gap) remains bounded despite stochastic sensor noise and temporal shifts in traffic demands.
The composite disturbance vectors for the global system are defined as follows. Recall the augmented hybrid noise vector ω ¯ ( t ) R N (where N = m + p i n t ) defined in Section 3.4, which is bounded by ω ¯ ( t ) ω m a x . Furthermore, let Δ ( t 1 ) = 1 N Δ r ( t 1 ) R n N be the global state drift representing temporal fluctuations in the stochastic O-D demands. Driven by the warm-start initialization, the unforced global error undergoes a macroscopic temporal transition defined as:
e ( t , 0 ) = e ( t 1 , K ) Δ ( t )
Theorem 1
(Nominal ISS and Bounded Optimality Gap). Consider the H-MASE network operating under nominal (fault-free) conditions ( σ i ( t ) = 1 , i V M A S ) and satisfying Assumptions 1 and 2. Let the augmented hybrid noise vector and the macroscopic route-demand drift be bounded by ω ¯ ( t ) 2 ω m a x and Δ r ( t ) 2 Δ m a x , respectively. Assuming the number of microscopic optimization iterations K is chosen sufficiently large such that C ϵ γ K < 1 , the distributed least-squares algorithm is ISS, ensuring robust convergence. Specifically, the Euclidean magnitude of the concatenated optimality gap e ( t , K ) at the end of the t-th macroscopic step is bounded by the recursive inequality:
e ( t , K ) 2 C ϵ γ K e ( t 1 , K ) 2 + B I S S ( ω m a x , Δ m a x )
where γ = ρ ( Φ ) + ϵ ( ρ ( Φ ) , 1 ) is the effective exponential decay rate chosen via an arbitrarily small ϵ > 0 , C ϵ 1 is the corresponding condition constant accommodating the non-normality of the transition matrix Φ, and B I S S ( ω m a x , Δ m a x ) is the theoretical upper bound isolating the cumulative perturbative effects over K iterations.
Proof. 
The proof proceeds by recursively unrolling the inner-loop error dynamics and applying the triangle inequality over the dual time-scale boundaries.
Step 1: 
Recursive Unrolling of the Inner Loop
Recall the global error dynamics inside the microscopic k-loop, which can be compactly formulated using the global transition matrix Φ = P W and the block-diagonal pseudoinverse operator M = diag ( M 1 , , M N ) R n N × N :
e ( t , k + 1 ) = Φ e ( t , k ) + M ω ¯ ( t )
By recursively expanding (38) from the initial internal iteration k = 0 to the final iteration K, and noting that the augmented hybrid noise ω ¯ ( t ) remains invariant over the fast k-loop, we accumulate the persistent effects:
e ( t , K ) = Φ K e ( t , 0 ) + j = 0 K 1 Φ j M ω ¯ ( t )
Step 2: 
Macroscopic Time-Scale Handover
The unrolled Equation (39) isolates the error at the beginning of the current macroscopic step, e ( t , 0 ) . By invoking the time-scale separation principle established in (36), the initialization of the t-th step must incorporate the true state drift that occurred during the physical sampling interval. Substituting the warm-start boundary condition e ( t , 0 ) = e ( t 1 , K ) Δ ( t 1 ) into (39) yields the complete temporal transition of the estimation error:
e ( t , K ) = Φ K e ( t 1 , K ) Δ ( t 1 ) + j = 0 K 1 Φ j M ω ¯ ( t )
Rearranging the terms to isolate the perturbative effects from the nominal unforced response, we obtain the exact error state at the end of the macroscopic step t:
e ( t , K ) = Φ K e ( t 1 , K ) Φ K Δ ( t 1 ) + j = 0 K 1 Φ j M ω ¯ ( t )
Step 3: 
Triangle Inequality and Norm Decompositions
Taking the Euclidean norm of (41) and applying the standard triangle and sub-multiplicative norm inequalities, we bound the global error. Given the mixed-product property of the Kronecker product ( 1 N Δ r ( t ) 2 N Δ m a x ), the noise bound ω ¯ ( t ) 2 ω m a x , and the statically bounded pseudoinverse operator ( M 2   < ), we directly obtain the decoupled scalar formulation:
e ( t , K ) 2 Φ K 2 e ( t 1 , K ) 2 + Φ K 2 N Δ m a x + M 2 ω m a x j = 0 K 1 Φ j 2
Next, we systematically decouple the global disturbance terms into their scalar upper bounds as defined in the theorem hypothesis. For the macroscopic demand drift, applying the mixed-product property of the Kronecker product to the Euclidean norm yields the exact analytical relationship:
Δ ( t 1 ) 2 = 1 N Δ r ( t 1 ) 2 = 1 N 2 Δ r ( t 1 ) 2 N Δ m a x
For the augmented hybrid noise, the operational limit is directly given as ω ¯ ( t ) 2 ω m a x . Furthermore, since the global projection operator M = diag ( M 1 , , M N ) is uniquely derived from the static routing matrix (A) and the incidence topology of the network, its induced 2-norm (i.e., its maximum singular value) is a finite, time-invariant constant ( M 2   < ).
Step 4: 
Spectral Bounding and ISS Formulation
The global transition matrix Φ = P W is generally non-normal. By the spectral radius theorem for non-normal matrices [30], since ρ ( Φ ) < 1 , for any arbitrarily small ϵ > 0 , there exists a finite condition constant C ϵ 1 such that Φ j 2 C ϵ ( ρ ( Φ ) + ϵ ) j C ϵ γ j , where γ ( ρ ( Φ ) , 1 ) . Substituting this worst-case relation into the decoupled error dynamics (42) and factoring out C ϵ , we obtain:
e ( t , K ) 2 C ϵ γ K e ( t 1 , K ) 2 + C ϵ γ K N Δ m a x + C ϵ M 2 ω m a x j = 0 K 1 γ j
By the theorem hypothesis, K is sufficiently large such that C ϵ γ K < 1 . Since γ < 1 , the geometric series evaluates to ( 1 γ K ) / ( 1 γ ) . To formalize the ISS condition, we consolidate the disturbance-driven terms into a single theoretical boundary, B I S S :
B I S S ( ω m a x , Δ m a x ) C ϵ γ K N Δ m a x + 1 γ K 1 γ M 2 ω m a x
Substituting this definition directly back into the error dynamics yields the final recursive macroscopic optimality gap:
e ( t , K ) 2 C ϵ γ K e ( t 1 , K ) 2 + B I S S ( ω m a x , Δ m a x )
Because the structural choice of the internal iteration count K explicitly guarantees that the macroscopic contraction factor satisfies C ϵ γ K < 1 , the DGD mechanism is globally asymptotically stable. Consequently, by the discrete-time ISS theorem [31], the error sequence is uniformly ultimately bounded. Taking the limit superior as t , the distance to the optimal true state inherently converges to a strict asymptotic optimality gap bounded by:
lim sup t e ( t , K ) 2 B I S S ( ω m a x , Δ m a x ) 1 C ϵ γ K
This confinement of the optimality gap, proportional to the physical demand volatility and topological noise bounds, definitively proves that the distributed optimization algorithm exhibits robust ISS. It should be explicitly noted that the analytical bounds established in B I S S are inherently conservative, as they are derived using sub-multiplicative norm inequalities to provide strict worst-case mathematical guarantees. In practical deployments, stochastic noise cancellation during the diffusion phase typically yields an actual empirical error significantly lower than this theoretical upper bound. This concludes the proof of Theorem 1. □
Remark 7
(Bridging Theory and Resilient Operations). The analytical derivation of B I S S in (45) establishes the mathematical bridge between the nominal stability analysis and the autonomous fault-isolation logic presented in Section 4.4. By anchoring the operational steady-state fault tolerance threshold to the theoretical ultimate bound of the local residual (e.g., setting Γ s t e a d y max i M i 2 B I S S 1 C ϵ γ K + ω m a x ), the H-MASE protocol guarantees that nominal stochastic fluctuations will not mathematically trigger the isolation mechanism. This strategic thresholding ensures the theoretical avoidance of false alarms as long as the physical disturbances remain within their specified supremum bounds.
While the nominal stability of the H-MASE is established through Theorem 1, the primary challenge in urban traffic monitoring remains the presence of structural sensor failures and data corruption. Having proved that the system remains stable under stochastic noise, the following subsection addresses the resilience of the protocol when the network undergoes localized instrumentation faults.

5.2. Resilience Analysis and Fault Tolerance

While the nominal stability established in Section 5.1 ensures robust performance under stochastic noise, macroscopic service networks are inherently vulnerable to structural component failures. From a systems reliability perspective, this subsection analyzes the fault tolerance of the H-MASE protocol. We demonstrate that the autonomous switching logic prevents cascading errors and preserves network-wide stability even when a subset of agents experiences structural sensor faults.

5.2.1. Fault Characterization and Detection Sensitivity

We model an abrupt structural fault (hard fault) at agent i V P S A as an additive signal f i ( t ) R affecting the local measurement. The corrupted observation, denoted by b i f ( t ) , is expressed as:
b i f ( t ) = M i r ( t ) + ω ¯ i ( t ) + f i ( t )
where f i ( t ) represents severe, non-stochastic perturbations such as sudden sensor bias or catastrophic sensing-hardware calibration failure. It is important to note that this models a physical data corruption fault; the agent’s cyber-communication module remains fully operational to relay consensus variables, preserving the graph topology and the doubly stochastic property of the weight matrix W.
Note that the proposed isolation mechanism is specifically designed to detect such abrupt anomalies; gradually drifting soft faults may require moving-average residual windows, which fall outside the scope of this paper. The efficacy of the fault isolation mechanism depends on the sensitivity of the local consistency residual η i ( t , k ) . To facilitate this analysis, let e d i f f , i ( t , k ) = r ^ d i f f , i ( t , k ) r ( t ) denote the local diffused estimation error.
It is crucial to note that the agents operationally execute isolation relying solely on the locally computable residual η i ( t , k ) > Γ ( t ) . However, to establish the theoretical reliability of this mechanism, we must analytically define the fault detectability bound—the fault magnitude that mathematically guarantees this operational threshold will be breached.
Lemma 2
(Autonomous Constraint Exclusion and Fault Detectability). In the context of robust distributed optimization, an agent i is designed to autonomously exclude its corrupted local constraint (triggering the isolation mode σ i = 0 ) if the scalar magnitude of the structural fault f i ( t ) satisfies:
| f i ( t ) | > Γ ( t ) + M i 2 e d i f f , i ( t , k ) 2 + ω m a x
where Γ ( t ) is the dynamic detection threshold defined in Section 4.4, and ω m a x is the global measurement noise bound.
Proof. 
The consistency residual is defined in (13) as η i ( t , k ) = | M i r ^ d i f f , i ( t , k ) b i f ( t ) | . Substituting the faulty measurement (48) and the local diffused error definition r ^ d i f f , i = r + e d i f f , i , we obtain:
η i ( t , k ) = | M i ( r ( t ) + e d i f f , i ( t , k ) ) ( M i r ( t ) + ω ¯ i ( t ) + f i ( t ) ) | = | ( M i e d i f f , i ( t , k ) ω ¯ i ( t ) ) f i ( t ) |
Applying the reverse triangle inequality yields a strict lower bound: η i ( t , k ) | f i ( t ) | | M i e d i f f , i ( t , k ) ω ¯ i ( t ) | . To decouple the inner term, we apply the standard triangle and Cauchy-Schwarz inequalities. Acknowledging that the scalar noise is bounded ( | ω ¯ i ( t ) | ω m a x ), we establish | M i e d i f f , i ( t , k ) ω ¯ i ( t ) | M i 2 e d i f f , i ( t , k ) 2 + ω m a x . Substituting this upper bound back into the residual inequality yields:
η i ( t , k ) | f i ( t ) | M i 2 e d i f f , i ( t , k ) 2 + ω m a x
According to the isolation logic defined in Section 4.4, the anomaly mode is triggered when η i ( t , k ) > Γ ( t ) . Therefore, a sufficient condition to guarantee immediate isolation is:
| f i ( t ) | M i 2 e d i f f , i ( t , k ) 2 + ω m a x > Γ ( t )
Rearranging this algebraic inequality to isolate the fault magnitude | f i ( t ) | yields the statement of the lemma. Analytically, this establishes the exact mathematical boundary where a catastrophic local constraint perturbation is autonomously rejected by the distributed optimization algorithm, ensuring resilient continuity. This completes the proof. □

5.2.2. Stability of the Resilient Switched Dynamics

When a subset of structurally compromised PSAs, explicitly denoted as V f { 1 , , m } , enters isolation mode ( σ i ( t , k ) = 0 ), the global system transition switches to a resilient operator Φ σ = P σ W . The modified global projection block P σ = diag ( D 1 , , D N ) is defined such that the local operator is D i = P i for healthy agents ( i V h ) and forced to the identity mapping D i = I n for isolated agents ( i V f ).
Assumption 3
(Residual Observability). Let V h = V M A S V f be the set of healthy agents. By design, VLAs are structurally insulated from hardware failures. Therefore, they permanently bypass the isolation mechanism ( σ i = 1 ) and continuously preserve the underlying cyber-graph connectivity. However, the system maintains structural observability if and only if the surviving PSAs within V h provide sufficient rank. Algebraically, the intersection of the null spaces of all agents in the healthy ensemble must remain trivial: i V h N ( M i ) = { 0 n } .
From a practical municipal engineering standpoint, satisfying this residual observability condition after a severe sensor isolation relies on the inherently redundant deployment of physical sensors (e.g., overlapping camera fields and inductive loops) at critical urban intersections. When a localized hardware unit fails, the surviving adjacent sensors, coupled with the spatial correlation of overlapping macroscopic routes, collaboratively compensate for the localized rank deficiency. This redundant information is seamlessly routed across the network via the fault-immune VLAs, ensuring the rank condition is preserved.
Theorem 2
(Robust Convergence and Resilient Optimality Gap). Suppose the structural disturbance bounds ω ¯ ( t ) 2 ω m a x and Δ ( t ) 2 N Δ m a x established in Theorem 1 hold. Under the connectedness of G M A S and the residual observability condition (Assumption 3), the distributed optimization algorithm remains ISS even if a subset of PSAs V f undergoes severe structural failure. Assuming the fixed microscopic iteration count K is conservatively designed a priori to be sufficiently large such that the resilient macroscopic contraction condition C σ , ϵ γ σ K < 1 is preserved, the global optimality gap converges to a resilient invariant set independent of the catastrophic fault magnitudes f i ( t ) , bounded by:
lim sup t e ( t , K ) 2 B I S S σ ( ω m a x , Δ m a x ) 1 C σ , ϵ γ σ K
where γ σ = ρ ( Φ σ ) + ϵ ( ρ ( Φ σ ) , 1 ) is the resilient effective decay rate (i.e., linear convergence rate) under the switched topology, C σ , ϵ 1 is the corresponding condition constant, and B I S S σ is the cumulative resilient disturbance bound on the optimality gap.
Proof. 
The proof proceeds by analyzing the switched system dynamics under the autonomous constraint-exclusion mechanism.
Step 1: 
Switched Dynamics and Time-Invariant Unrolling
When catastrophic faults f ( t ) trigger the autonomous isolation ( σ i = 0 for i V f ), the effective local pseudoinverse blocks within the global matrix are forced to zero. This disjoint algebraic property yields the exact nullification of the fault vector: M σ f ( t ) 0 n N . Consequently, the corrupted innovation is entirely bypassed. Since the severe anomaly persistently breaches the threshold ( η i Γ ), the switched operator Φ σ = P σ W remains time-invariant across the inner loop. Furthermore, structural faults do not alter the physical demand drift Δ ( t 1 ) or the temporal handover. Thus, recursively unrolling the decoupled dynamics and applying the macroscopic boundary condition yields the exact isolated error state:
e ( t , K ) = Φ σ K e ( t 1 , K ) Φ σ K Δ ( t 1 ) + j = 0 K 1 Φ σ j M σ ω ¯ ( t )
Step 2: 
Preservation of Global Feasibility and Spectral Contraction
Since physical faults do not sever the cyber-links, W remains doubly stochastic. The modified projection block P σ consists of orthogonal projectors ( P i ) and identity mappings ( I n ), preserving the non-expansive property P σ 2 1 . Thus, ρ ( Φ σ ) 1 . To prove strict contraction, we invoke the exact algebraic contradiction sequence established in Lemma 1. If ρ ( Φ σ ) = 1 , the eigenvector is confined to the consensus subspace ( v = 1 N α ), dictating P σ v = v . For isolated agents ( D i = I n ), this imposes no algebraic restriction. Consequently, the preservation of global feasibility relies exclusively on the healthy ensemble, requiring α i V h N ( M i ) . By the Residual Observability condition (Assumption 3), this intersection is trivial ( α = 0 ). Thus, no such eigenvalue exists on the unit circle, proving that the resilient operator remains a strict contraction mapping: ρ ( Φ σ ) < 1 .
Step 3: 
Resilient Spectral Bounding and Final ISS Formulation
Since Φ σ is generally non-normal but contracting, we apply the spectral radius theorem analogous to Theorem 1. For an arbitrarily small ϵ > 0 , there exists C σ , ϵ 1 such that Φ σ j 2 C σ , ϵ γ σ j , where γ σ = ρ ( Φ σ ) + ϵ < 1 . Taking the Euclidean norm of (54), substituting the spectral bound, and utilizing the invariant physical disturbance limits, we analytically evaluate the convergent geometric series j = 0 K 1 γ σ j . Consolidating the disturbance-driven terms defines the ultimate cumulative resilient disturbance bound, B I S S σ :
B I S S σ ( ω m a x , Δ m a x ) C σ , ϵ γ σ K N Δ m a x + 1 γ σ K 1 γ σ M σ 2 ω m a x
Substituting this directly back into the bounded error dynamics theoretically confirms the final ultimate bound established in (53). Because the catastrophic structural fault vector f ( t ) has been mathematically eradicated ( M σ f ( t ) 0 ), the optimality gap is a function of the stochastic noise and macroscopic drift. This confirms that the robust optimization protocol successfully rejects corrupted constraints while preserving global feasibility. This concludes the proof of Theorem 2. □
Remark 8
(The Structural Necessity of VLAs). The resilience of the H-MASE architecture relies heavily on the topological anchors provided by the VLAs. While the surviving PSAs are solely responsible for maintaining the algebraic rank of the system (Assumption 3), VLAs mathematically guarantee the preservation of cyber-graph connectivity (Assumption 1). Since VLAs are immune to physical faults ( σ i ( t , k ) 1 ), their local projection matrices permanently act as perfect identity filters ( P i = I n ). This ensures that even if multiple contiguous physical sensors catastrophically fail and isolate their measurements, the underlying consensus topology is never fragmented. The VLAs seamlessly route the diffused state estimates across the network, allowing the surviving PSAs to collaboratively share their remaining rank and achieve global observability.

6. Numerical Results and Emission Mitigation Analysis

The primary objective of this work is to establish the mathematical foundation of the proposed H-MASE architecture rather than to provide an empirical benchmarking study on unstructured datasets. While real-world data (e.g., Performance Measurement System (PeMS), Metropolitan Transportation Authority-Los Angeles (METR-LA)) introduces highly complex, unmodeled exogenous variables, validating the theoretical worst-case guarantees—specifically the ISS bounds proven in Theorems 1 and 2, and the exact fault detectability threshold established in Lemma 2—requires a controlled stochastic environment. To this end, this section presents numerical simulations designed to isolate and evaluate the algorithm’s operational properties under bounded structural noise, random-walk state drifts, and catastrophic sensor malfunctions.
It should be explicitly noted that while the ultimate system-level objective of the H-MASE protocol is emission mitigation, this study deliberately isolates the cyber-physical state estimation layer. Therefore, rather than presenting coupled vehicular emission graphs derived from simulators like Motor Vehicle Emission Simulator (MOVES), the numerical results focus on validating Root Mean Square Error (RMSE). Achieving a resilient, sub-unit optimality gap is the absolute prerequisite data foundation before any downstream eco-routing or emission mitigation strategies can be reliably executed.

6.1. Simulation Environment and Cyber-Physical Topology

To emulate a highly realistic and topologically heterogeneous stochastic network flow problem within an urban logistics context, a complex ring-radial transportation network is constructed, as illustrated in Figure 4. Although the generated dataset is synthetic to preserve mathematical traceability, the network topology mirrors standard metropolitan grid structures. Furthermore, the injected route demand volumes (≈20–40 vehicles per discrete step) are explicitly scaled to match typical hourly flow rates in metropolitan roundabout scenarios, ensuring that the macroscopic flow magnitudes reflect real-world empirical conditions. Unlike oversimplified symmetric grid topologies frequently utilized in theoretical literature, the physical layer of this network deliberately incorporates a diverse mix of unidirectional streets and bidirectional urban arterials, tightly coupled with a central counter-clockwise roundabout. The network comprises 15 internal signalized intersections ( p i n t = 15 ) and 10 boundary nodes representing 5 origins ( O 1 O 5 ) and 5 destinations ( D 1 D 5 ), connected via m = 55 directed road segments (edges). This structural diversity ensures that the distributed estimation protocol is tested against the asymmetric bottlenecks, varying nodal degrees, and complex overlapping route flows characteristic of genuine metropolitan networks. Consequently, the multi-agent network is formed by N = 70 autonomous agents, integrating the 55 PSAs with 15 VLAs stationed strategically at the intersections. The complete set of simulation variables, including temporal parameters and disturbance bounds, is summarized in Table 2. Specifically, the threshold exponential decay rate ( β = 0.15 ) is empirically tuned to safely accommodate the initial transient disagreements during the consensus warm-start phase. This calibration ensures that the theoretical fault detectability bound (Lemma 2) is preserved before the system firmly settles into its steady-state resilience boundary ( Γ s t e a d y ).
A crucial feature of the proposed simulation environment is the implementation of a topology-induced cyber-physical communication graph. Unlike conventional Erdös-Rényi random networks, the communication links in this setup are dictated by physical proximity (hop-by-hop communication). Specifically, PSAs only communicate with adjacent sensors sharing the same intersection and with their local VLA, thereby preserving the fully decentralized and localized nature of the estimation framework.
To evaluate the algorithm under an asymmetric and realistic traffic load, n = 25 linearly independent dominant logical routes are defined, originating from the boundary nodes and traversing the internal junctions. The exact node-by-node traversal sequences of these routes are detailed in Table 3. Topological utilization analysis reveals that while 100% of the internal junctions are actively engaged by the routing set, the physical edge coverage is inherently sparse at 72.73% (40 out of 55 edges utilized). Furthermore, the spatial distribution of the routes introduces significant traffic hotspots, such as Edge 1, which is heavily shared by 5 distinct routes. Despite this asymmetrical flow distribution, the augmented observation matrix M constructed by the VLAs yields a full column rank, i.e., rank ( M ) = 25 . This satisfies the structural observability requirement established in the problem formulation, ensuring that the global traffic state vector is recoverable from the distributed local projections.

6.2. Distributed Tracking Performance in Nominal Operation

In this subsection, the nominal tracking capability of the proposed H-MASE protocol is evaluated prior to the injection of any severe structural faults. The primary objective is to demonstrate how the multi-agent network collectively tracks the dynamically evolving traffic state relying solely on localized, hop-by-hop communication.
The temporal evolution of the traffic densities across all n = 25 dominant routes is presented in Figure 5. In this visualization, the dash-dot lines represent the ground truth traffic flows, which are subjected to continuous stochastic demand fluctuations (modeled as bounded random-walk drifts with Δ m a x = 1.0 ) at each macroscopic time step t. Conversely, the solid lines illustrate the average estimated densities collaboratively computed by the distributed agents. Due to the rapid spectral convergence of the inner-loop consensus mechanism, the individual estimates of all 70 agents densely cluster around this network-wide average with negligible spatial variance. As observed in Figure 5, these consensus-driven solid lines closely track the dynamic dash-dot trajectories. Despite the presence of physical measurement noise ( ω m a x = 2.0 ) affecting the local observations, the algorithm maintains tracking accuracy. Empirical performance metrics collected during the steady-state operational phase reveal an RMSE of approximately 0.81 vehicles per route and a Mean Absolute Percentage Error (MAPE) of 2.44%. This robust solution quality is mathematically attributed to the inner-loop distributed optimization mechanism. By executing K = 80 microscopic DGD iterations per macroscopic update, the algorithm effectively solves the network-wide least-squares problem, functioning as a spatial variance reduction mechanism. The decentralized architecture successfully exploits the stochastic cancellation of zero-mean physical noise, ensuring that the global state vector securely converges to a tight optimality gap (near-optimal feasible region) without requiring a centralized data fusion center. From a sustainable systems engineering standpoint, this high-fidelity bounding (e.g., an RMSE of merely 0.81 vehicles per estimated state) ensures that downstream dynamic resource allocation models and stochastic queue management systems receive highly reliable input parameters. In the physical realm, tracking route flows with sub-unit precision essentially eradicates the uncertainties that cause suboptimal traffic light synchronizations. This creates a fail-safe data foundation for implementing network-wide green-wave signalization, directly minimizing stop-and-go driving cycles, vehicle idling times, and the associated greenhouse gas emissions.

6.3. Resilient Fault Isolation and Comparative ISS Validation

The final simulation phase validates the resilience guarantees of the H-MASE protocol, specifically the autonomous fault isolation mechanism (Lemma 2) and the ISS bounds (Theorems 1 and 2). To robustly demonstrate the algorithmic superiority and the strict necessity of the proposed autonomous isolation logic, we conduct an ablation study by benchmarking H-MASE against a conventional DGD algorithm (e.g., Mou et al. [26]). While advanced frameworks such as the Alternating Direction Method of Multipliers (ADMM) or Distributed Kalman Filtering (DKF) exist, they inherently demand massive computational overhead (e.g., continuous matrix inversions or exact transition-matrix modeling). Such heavy dependencies violate the lightweight, edge-computing premise of the proposed architecture. Therefore, a standard first-order DGD approach (Mou et al. [26]) serves as the most structurally appropriate baseline. By contrasting H-MASE with this widely-adopted, computationally equivalent baseline, we specifically isolate the impact of our novel autonomous switching logic.
To emulate highly realistic and non-ideal IoT communication environments, the simulation is subjected to a continuous 10% random packet loss (link failure) rate across the digital communication graph E M A S . From a network systems perspective, a packet loss event can be interpreted as a manifestation of infinite communication latency. By demonstrating that the H-MASE protocol maintains ISS under such severe stochastic interruptions, we effectively verify its robustness against variable transmission delays, as the consensus mechanism inherently compensates for delayed or missing information by relying on the surviving local neighborhoods. Furthermore, to test cyber-physical fault tolerance, we simulate a catastrophic IoT hardware malfunction at a critical intersection. Specifically, a structural bias ( f = 80.0 )—representing a severe sensor calibration loss or edge-device cyber interference—is injected into a randomly selected active PSA at time step t f a u l t = 80 . Given that the nominal route demands operate within [ 20 , 40 ] vehicles per step, this anomaly represents a massive data corruption relative to the actual traffic volume.
The temporal evolution of the Euclidean norm of the global estimation error (i.e., the global optimality gap), e ( t , K ) 2 , for both algorithms is depicted in Figure 6. Prior to the fault injection ( t < 80 ), both algorithms successfully track the state despite the 10 % packet loss, oscillating within a tightly bounded invariant set. This confirms that the H-MASE protocol highly effectively avoids false alarms during nominal operations (satisfying Lemma 2).
At macroscopic step t = 80 , the onset of the IoT hardware fault causes the residual of the compromised edge device to exhibit an abrupt surge ( η i Γ ( t ) ). The autonomous isolation mode ( σ i = 0 ) defined in Section 4.4 is successfully triggered. Acting as a localized cyber-physical fail-safe, the edge agent immediately overrides its local projection phase, quarantining its corrupted physical sensor data from the network while maintaining its digital communication links.
The effect of this autonomous constraint-dropping mechanism is distinctly observed in the post-fault trajectories of Figure 6. The error remains bounded before the fault (validating Theorem 1 ISS bounds). As can be observed from the figure, following the sensor malfunction at t = 80 , the baseline Conventional DGD algorithm severely diverges, whereas H-MASE autonomously isolates the fault and maintains a resilient, bounded optimality gap (validating Theorem 2). The baseline Conventional DGD algorithm, unable to isolate the severe 80.0 vehicle bias, instantly absorbs the corrupted constraint, causing its global estimation error to diverge uncontrollably. Conversely, the H-MASE protocol mathematically eliminates the catastrophic fault vector ( M σ f ( t ) 0 ). As theoretically predicted in Theorem 2, while the topological exclusion mildly degrades the network’s condition, the distributed optimization algorithm provably preserves global stability, firmly maintaining the robust optimal tracking of the network state.

6.4. Ecological Impact and Emission Mitigation

The ultimate operations research objective of establishing a highly resilient, sub-unit traffic state estimator is to prevent localized IoT hardware malfunctions from cascading into network-wide ecological bottlenecks. When an intersection controller relies on the diverging traffic states provided by a conventional, non-resilient estimator (as seen in Figure 6), it perceives an artificial gridlock ( x k 1 ). This forces the local dynamic signalization into highly suboptimal phase allocations, triggering extensive vehicular idling.
By substituting the dynamically estimated route flows into the macroscopic Webster delay model (Equation (15)), we quantified the ecological impact of the H-MASE protocol’s resilience. Assuming a standard emission rate of 0.6 g of CO2 per second for idling vehicles, the divergence of the conventional DGD algorithm resulted in an estimated 2611.89 kg of CO2 emissions over the post-fault simulation window. In stark contrast, the resilient signalization data foundation preserved by the H-MASE protocol constrained the network emissions to 2591.72 kg.
This translates to a direct 0.77 % reduction in total network-wide greenhouse gas emissions. While seemingly modest as a percentage, a 0.77 % continuous emission reduction stemming from the mitigation of a single compromised edge sensor translates to an immense absolute reduction in daily atmospheric pollutants across a metropolitan grid. A comprehensive summary of these comparative performance metrics and the resulting ecological impact is presented in Table 4. This quantitative analysis conclusively demonstrates that the cyber-physical resilience established by the H-MASE protocol directly yields real-world ecological sustainability. It is important to emphasize that the computed 20.17 kg reduction in CO2 is a direct consequence of eliminating artificial gridlock states in the signaling controller. In the baseline scenario, the estimation divergence causes the saturation ratio ( x k ) to saturate at 0.995, which, according to the Webster model, yields a near-exponential increase in vehicle delay. By maintaining sub-unit estimation precision even during a catastrophic sensor failure, H-MASE prevents this operational collapse. Given that these results are derived from a single-sensor failure in a restricted simulation window, the cumulative ecological benefit across an entire city-scale network with thousands of edge nodes would be substantial, directly contributing to municipal carbon neutrality goals.

7. Conclusions

This paper presented the H-MASE protocol, a fully decentralized decision-support framework formulated as a distributed convex optimization problem to safeguard smart transportation systems under severe cyber-physical uncertainties. By distributing the computational burden across IoT-enabled edge devices (acting as PSA and VLA) via localized hop-by-hop communication, the proposed applied architecture effectively eliminates the single-point-of-failure vulnerability inherent in traditional centralized cloud-based traffic management systems. Consequently, it prevents localized IoT hardware malfunctions from cascading into network-wide traffic bottlenecks.
Theoretical analysis and comprehensive numerical simulations confirmed the mathematical efficacy of the protocol. By executing exact DGD steps intertwined with topological consensus, the multi-agent network successfully solved the network-wide least-squares estimation problem in real-time. Even under bounded random-walk state drifts and persistent measurement noise, the algorithm successfully neutralized local spatial variances. The swarm-intelligence mechanism tightly bounded the global optimality gap without requiring any central data-fusion center.
Furthermore, the autonomous switching logic demonstrated highly reliable fault detectability without triggering erroneous isolations under nominal conditions, successfully and instantaneously isolating structurally compromised sensors. Following this autonomous fault isolation, the global error remained confined within its theoretical ISS bounds, validating the robust optimization capabilities and cyber-physical resilience of the H-MASE protocol. Ultimately, the ability to maintain sub-unit estimation precision (e.g., an RMSE of 0.81 vehicles per estimated state) during catastrophic failures provides the fail-safe data foundation required for uninterrupted eco-routing and sustainable urban logistics.
Future work will focus on leveraging the resilient route density estimates provided by H-MASE to develop dynamic eco-signalization frameworks and intelligent Mobility-as-a-Service (MaaS) routing engines. By formulating the intersection signaling process as a stochastic resource allocation problem, these high-fidelity estimates can be directly coupled with macroscopic emission models (e.g., MOVES) and edge-based traffic controllers. This integration will allow for the dynamic optimization of green-light phase timings, explicitly aimed at establishing green-wave corridors, minimizing network-wide vehicle idling times, and reducing overall greenhouse gas emissions. Additionally, to bridge the gap between macroscopic theoretical models and practical industrial deployment, future studies will integrate the proposed robust optimization protocol with microscopic traffic simulators (e.g., Simulation of Urban MObility (SUMO)) to create a plug-and-play IoT software architecture for modern municipalities. Validation on large-scale real-world datasets (e.g., PeMS, METR-LA) within the context of edge-computing networks will further demonstrate how the proposed framework drives the continuous advances in transportation and smart city deployments.

Funding

This research received no external funding.

Data Availability Statement

The complete MATLAB 2019a simulation source code underlying the H-MASE protocol, including the synthetic topology generation and distributed estimation loops, is publicly available in the GitHub repository at https://github.com/AhmetCihanDuzce/H-MASE (accessed on 15 April 2026). This ensures the full reproducibility of the numerical results presented in this study.

Conflicts of Interest

The author declares no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ALPRAutomatic License Plate Recognition
CAVConnected and Autonomous Vehicle
DDoSDistributed Denial of Service
DGDDistributed Gradient Descent
GHGGreenhouse Gas
GPSGlobal Positioning System
H-MASEHybrid Multi-Agent State Estimation
IoTInternet of Things
ISSInput-to-State Stability
ITSIntelligent Transportation Systems
MaaSMobility as a Service
MAPEMean Absolute Percentage Error
MOVESMotor Vehicle Emission Simulator
O-DOrigin-Destination
OROperations Research
PeMSPerformance Measurement System
PSAPhysical Sensor Agent
RMSERoot Mean Square Error
SCADASupervisory Control and Data Acquisition
SUMOSimulation of Urban MObility
TSETraffic State Estimation
VLAVirtual Logic Agent
V2IVehicle-to-Infrastructure

References

  1. Rasheed, F.; Yau, K.L.A.; Noor, R.M.; Wu, C.; Low, Y.C. Deep Reinforcement Learning for Traffic Signal Control: A Review. IEEE Access 2020, 8, 208016–208044. [Google Scholar] [CrossRef]
  2. Kolat, M.; Kővári, B.; Bécsi, T.; Aradi, S. Multi-Agent Reinforcement Learning for Traffic Signal Control: A Cooperative Approach. Sustainability 2023, 15, 3479. [Google Scholar] [CrossRef]
  3. Damadam, S.; Zourbakhsh, M.; Javidan, R.; Faroughi, A. An Intelligent IoT Based Traffic Light Management System: Deep Reinforcement Learning. Smart Cities 2022, 5, 1293–1311. [Google Scholar] [CrossRef]
  4. Zhang, X.; Han, Y. Multi-Agent Reinforcement Learning for Traffic State Estimation on Highways Using Fundamental Diagram and LWR Theory. Appl. Sci. 2026, 16, 1219. [Google Scholar] [CrossRef]
  5. Karthick Raghunath, K.M.; Rohith Bhat, C.; Vinoth Kumar, V.; Athiyoor Kannan, V.; Mahesh, T.R.; Manikandan, K.; Krishnamoorthy, N. Redefining Urban Traffic Dynamics With TCN-FL Driven Traffic Prediction and Control Strategies. IEEE Access 2024, 12, 115386–115399. [Google Scholar] [CrossRef]
  6. Liu, B.; Ding, Z. A distributed deep reinforcement learning method for traffic light control. Neurocomputing 2022, 490, 390–399. [Google Scholar] [CrossRef]
  7. Zambrano-Martinez, J.; Calafate, C.; Soler, D.; Cano, J.C.; Manzoni, P. Modeling and Characterization of Traffic Flows in Urban Environments. Sensors 2018, 18, 2020. [Google Scholar] [CrossRef]
  8. Mądziel, M.; Campisi, T. Investigation of Vehicular Pollutant Emissions at 4-Arm Intersections for the Improvement of Integrated Actions in the Sustainable Urban Mobility Plans (SUMPs). Sustainability 2023, 15, 1860. [Google Scholar] [CrossRef]
  9. Liang, X.; Song, H.; Wu, G.; Guo, Y.; Zhang, S. Complex Traffic Flow Model for Analysis and Optimization of Fuel Consumption and Emissions at Large Roundabouts. Sustainability 2024, 16, 9464. [Google Scholar] [CrossRef]
  10. Emek, S.; Ildırar, G.; Gürbüzer, Y. Application of Long Short-Term Memory and XGBoost Model for Carbon Emission Reduction: Sustainable Travel Route Planning. Sustainability 2025, 17, 10802. [Google Scholar] [CrossRef]
  11. Cao, Q.; Li, J.; Trucco, P. Sustainability-Oriented Urban Traffic System Optimization Through a Hierarchical Multi-Agent Deep Reinforcement Learning Framework. Sustainability 2026, 18, 1606. [Google Scholar] [CrossRef]
  12. Jin, J.; Ma, X. Hierarchical multi-agent control of traffic lights based on collective learning. Eng. Appl. Artif. Intell. 2018, 68, 236–248. [Google Scholar] [CrossRef]
  13. Shen, H.; Zhao, H.; Zhang, Z.; Yang, X.; Song, Y.; Liu, X. Network-Wide Traffic Signal Control Based on MARL with Hierarchical Nash-Stackelberg Game Model. IEEE Access 2023, 11, 145085–145100. [Google Scholar] [CrossRef]
  14. Yang, S. Hierarchical graph multi-agent reinforcement learning for traffic signal control. Inf. Sci. 2023, 634, 55–72. [Google Scholar] [CrossRef]
  15. Zhu, R.; Li, L.; Wu, S.; Lv, P.; Li, Y.; Xu, M. Multi-agent broad reinforcement learning for intelligent traffic light control. Inf. Sci. 2023, 619, 509–525. [Google Scholar] [CrossRef]
  16. Englezou, Y.; Timotheou, S.; Panayiotou, C. Estimating the origin-destination matrix using link count observations and traffic flow dynamics. In Proceedings of the 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), Macau, China, 8–12 October 2022; pp. 3213–3218. [Google Scholar] [CrossRef]
  17. Dey, S.; Winter, S.; Tomko, M. Origin–Destination Flow Estimation from Link Count Data Only. Sensors 2020, 20, 5226. [Google Scholar] [CrossRef] [PubMed]
  18. Wei, G.; Gundlegård, D.; Rydergren, C. Consistent origin-destination and link flow estimation based on data-driven network assignment. Transp. Res. Procedia 2025, 86, 668–675. [Google Scholar] [CrossRef]
  19. Vahidi, M.; Shafahi, Y. Time-dependent estimation of origin–destination matrices using partial path data and link counts. Transportation 2023, 52, 55–92. [Google Scholar] [CrossRef]
  20. Pourhassan, K.; Lim, T.; Payne, S.; Somenahalli, S. A heuristic approach to time-dependent origin-destination traffic demand estimation for the development of large-scale dynamic traffic assignment models. Transp. Eng. 2025, 20, 100328. [Google Scholar] [CrossRef]
  21. Peng, X.; Li, L.; Lo, H.K.; Huang, W. A physics-constrained deep learning approach for dynamic origin-destination estimation using link counts. Transp. A Transp. Sci. 2026. [Google Scholar] [CrossRef]
  22. Etemadnia, H.; Abdelghany, K. A distributed origin–destination demand estimation approach for real-time traffic network management. Transp. Plan. Technol. 2011, 34, 217–230. [Google Scholar] [CrossRef]
  23. Iryo, T. Properties of dynamic user equilibrium solution: Existence, uniqueness, stability, and robust solution methodology. Transp. B Transp. Dyn. 2013, 1, 52–67. [Google Scholar] [CrossRef]
  24. Wu, T.; Zhou, P.; Liu, K.; Yuan, Y.; Wang, X.; Huang, H.; Wu, D.O. Multi-Agent Deep Reinforcement Learning for Urban Traffic Light Control in Vehicular Networks. IEEE Trans. Veh. Technol. 2020, 69, 8243–8256. [Google Scholar] [CrossRef]
  25. Vishnoi, S.C.; Nugroho, S.A.; Taha, A.F.; Claudel, C.G. Traffic State Estimation for Connected Vehicles Using the Second-Order Aw-Rascle-Zhang Traffic Model. IEEE Trans. Intell. Transp. Syst. 2024, 25, 16719–16733. [Google Scholar] [CrossRef]
  26. Mou, S.; Liu, J.; Morse, A.S. A Distributed Algorithm for Solving a Linear Algebraic Equation. IEEE Trans. Autom. Control 2015, 60, 2863–2878. [Google Scholar] [CrossRef]
  27. Webster, F.V. Traffic Signal Settings; Technical Report Road Research Technique Paper No. 39; Road Research Laboratory: London, UK, 1958. [Google Scholar]
  28. Boyd, S. Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers. Found. Trends Mach. Learn. 2010, 3, 1–122. [Google Scholar] [CrossRef]
  29. Olfati-Saber, R. Distributed Kalman filtering for sensor networks. In Proceedings of the 2007 46th IEEE Conference on Decision and Control, New Orleans, LA, USA, 12–14 December 2007; pp. 5492–5498. [Google Scholar] [CrossRef]
  30. Horn, R.A.; Johnson, C.R. Matrix Analysis; Cambridge University Press: Cambridge, UK, 1990. [Google Scholar]
  31. Jiang, Z.P.; Wang, Y. Input-to-state stability for discrete-time nonlinear systems. Automatica 2001, 37, 857–869. [Google Scholar] [CrossRef]
Figure 1. Deterministic nodal balance at a generalized internal junction.
Figure 1. Deterministic nodal balance at a generalized internal junction.
Applsci 16 03972 g001
Figure 2. The reference traffic topology with two O-D pairs and overlapping route flows. The mapping from route-space to link-space is exemplified by link e 3 , which serves as a shared segment for the causal flows R 1 ( t ) and R 2 ( t ) .
Figure 2. The reference traffic topology with two O-D pairs and overlapping route flows. The mapping from route-space to link-space is exemplified by link e 3 , which serves as a shared segment for the causal flows R 1 ( t ) and R 2 ( t ) .
Applsci 16 03972 g002
Figure 3. High-level operational flowchart of the H-MASE protocol. The architecture separates macroscopic physical data acquisition from the microscopic distributed anomaly detection and feasibility projection loops, forming a resilient decision-support pipeline.
Figure 3. High-level operational flowchart of the H-MASE protocol. The architecture separates macroscopic physical data acquisition from the microscopic distributed anomaly detection and feasibility projection loops, forming a resilient decision-support pipeline.
Applsci 16 03972 g003
Figure 4. The physical layout of the proposed planar ring-radial urban traffic network. The simulated topology comprises 15 internal signalized intersections ( J 1 J 15 ) including a central counter-clockwise roundabout, 10 boundary nodes representing traffic origins ( O 1 O 5 ) and destinations ( D 1 D 5 ), and 55 directed road segments equipped with physical sensors.
Figure 4. The physical layout of the proposed planar ring-radial urban traffic network. The simulated topology comprises 15 internal signalized intersections ( J 1 J 15 ) including a central counter-clockwise roundabout, 10 boundary nodes representing traffic origins ( O 1 O 5 ) and destinations ( D 1 D 5 ), and 55 directed road segments equipped with physical sensors.
Applsci 16 03972 g004
Figure 5. Dynamic tracking performance of the H-MASE protocol across all n = 25 dominant routes. The dash-dot lines represent the true traffic densities (r) subject to random-walk state drifts, while the solid lines indicate the average estimated densities collaboratively computed by the multi-agent network. The vertical dashed line at t = 80 marks the injection of the severe structural sensor fault, demonstrating no visible divergence in the estimation trajectories.
Figure 5. Dynamic tracking performance of the H-MASE protocol across all n = 25 dominant routes. The dash-dot lines represent the true traffic densities (r) subject to random-walk state drifts, while the solid lines indicate the average estimated densities collaboratively computed by the multi-agent network. The vertical dashed line at t = 80 marks the injection of the severe structural sensor fault, demonstrating no visible divergence in the estimation trajectories.
Applsci 16 03972 g005
Figure 6. Temporal evolution of the global estimation error norm e ( t , K ) 2 [26].
Figure 6. Temporal evolution of the global estimation error norm e ( t , K ) 2 [26].
Applsci 16 03972 g006
Table 1. Nomenclature.
Table 1. Nomenclature.
SymbolDimension/SpaceDescription
Sets, Graphs, and Indices
G , G M A S GraphPhysical traffic network and cyber-communication graph
V , E Set ( | V | = p , | E | = m )Sets of physical nodes and directed physical links (edges)
V i n t Set ( | V i n t |   = p i n t )Subset of internal signalized junctions (zero-sum nodes)
R Set ( | R | = n )Set of dominant acyclic routes, with R j as its j-th element
V M A S , E M A S Set ( | V M A S |   = N )Sets of autonomous agents and digital communication links
N i SetInclusive communication neighborhood of agent i
V h , V f SetSubsets of healthy and structurally compromised (faulty) agents
C Subspace (⊆ C n N )Network-wide consensus eigenspace
i , j , k , t Z 0 Agent/route index, inner iteration (k), and macroscopic step (t)
Scalars and System Parameters
T s , K R > 0 , Z > 0 Macroscopic sampling period and total inner consensus iterations
c k , x k , d k R > 0 Saturation capacity, utilization ratio, and average delay for link e k
y k ( t ) , y ^ k R 0 True ideal flow and estimated arrival flow for link e k
r j ( t ) R 0 True dynamic vehicle volume utilizing route R j
y p , i ( t ) , b i ( t ) R Physical stochastic measurement and augmented local constraint
b i f ( t ) , f i ( t ) R Corrupted observation and structural fault magnitude at agent i
w i j [ 0 , 1 ] Metropolis-Hastings communication weight between agents i and j
η i ( t , k ) R 0 Local consistency residual (discrepancy) of agent i
σ i ( t , k ) Binary { 0 , 1 } Autonomous anomaly isolation switch (1 for nominal, 0 for isolated)
Γ ( t ) , β R > 0 Dynamic fault detection threshold and its exponential decay rate
ω m a x , Δ m a x R > 0 Supreme bounds for measurement noise and macroscopic state drift
γ , γ σ ( 0 , 1 ) Effective exponential decay rates for nominal and resilient topologies
B I S S , B I S S σ R > 0 Theoretical ultimate ISS bounds for nominal and resilient optimality gaps
Vectors
r ( t ) , Δ r ( t ) R n True macroscopic route-flow vector and its stochastic demand drift
r ^ i , r ^ d i f f , i R n Agent i’s local route-flow estimate and its topological diffused state
e i , e d i f f , i R n Agent i’s local estimation error and local diffused error
y i d e a l , y p R m Noise-free ideal link flows and physical stochastic observations
ω ˜ ( t ) , ω ¯ ( t ) R m , R N Physical measurement noise and augmented global hybrid noise
b ( t ) , f ( t ) R N Global augmented observation and global catastrophic fault vector
e ( t , k ) R n N Concatenated global estimation optimality gap (error)
Δ ( t 1 ) R n N Kronecker-extended global macroscopic state drift
v , α C n N , C n Global complex eigenvector and its local projection component
1 N , 0 , 0 n VariousVector of ones, general zero vector, and n-dimensional zero vector
Matrices and Operators
I , I i n t R p × m , R p i n t × m Full directed incidence matrix and internal reduced incidence matrix
A , A τ R m × n , R p i n t × n Physical routing matrix and tautological virtual sensing matrix
M R N × n Global augmented observation matrix
M i , M i R 1 × n , R n × 1 Local sensing row of agent i and its Moore-Penrose pseudoinverse
W R N × N Doubly-stochastic communication topology weight matrix
P i , D i , I n R n × n Local orthogonal projection, resilient switched operator, and identity
M , M σ R n N × N Global block-diagonal pseudoinverse operators (nominal and resilient)
W , P , P σ R n N × n N Kronecker-extended weight matrix and global projection operators
Φ , Φ σ R n N × n N Global state-transition operators for nominal and switched topologies
J ( · ) , ρ ( · ) , N ( · ) FunctionsLeast-squares cost function, spectral radius, and null space operators
Table 2. Network Attributes and H-MASE Simulation Parameters.
Table 2. Network Attributes and H-MASE Simulation Parameters.
SymbolParameter DescriptionValue
Network Topology & Graph Parameters
NTotal number of agents (55 PSAs + 15 VLAs)70
nNumber of dominant logical routes25
mNumber of directed physical links (edges)55
Algorithm & Temporal Parameters
T m a x Total macroscopic simulation time steps150
KMicroscopic consensus iterations (inner loop)80
r ( 0 ) Initial traffic flow densities (vehicles/step)∈[20, 40]
Disturbances & Fault Injection Settings
ω m a x Maximum physical sensor noise bound2.0
Δ m a x Maximum route flow drift (random walk)1.0
Γ s t e a d y Steady-state fault resilience threshold5.0
Γ i n i t Transient initialization tolerance200.0
β Threshold exponential decay rate0.15
t f a u l t Time step of structural fault injection80
fMagnitude of the structural fault bias80.0
Table 3. The Dominant Route Set of the Traffic Network ( n = 25 ).
Table 3. The Dominant Route Set of the Traffic Network ( n = 25 ).
Route IDOriginIntermediate JunctionsDestination
r 1 O 1 J 1 J 2 D 1
r 2 O 1 J 1 J 11 J 12 J 4 D 2
r 3 O 1 J 1 J 11 J 12 J 13 J 6 D 3
r 4 O 1 J 1 J 10 J 9 J 8 D 4
r 5 O 1 J 1 J 10 D 5
r 6 O 2 J 3 J 2 D 1
r 7 O 2 J 3 J 4 D 2
r 8 O 2 J 3 J 12 J 13 J 6 D 3
r 9 O 2 J 3 J 12 J 13 J 14 J 8 D 4
r 10 O 2 J 3 J 2 J 1 J 10 D 5
r 11 O 3 J 5 J 4 J 3 J 2 D 1
r 12 O 3 J 5 J 4 D 2
r 13 O 3 J 5 J 6 D 3
r 14 O 3 J 5 J 13 J 14 J 8 D 4
r 15 O 3 J 5 J 13 J 14 J 15 J 10 D 5
r 16 O 4 J 7 J 14 J 15 J 11 J 2 D 1
r 17 O 4 J 7 J 6 J 5 J 4 D 2
r 18 O 4 J 7 J 6 D 3
r 19 O 4 J 7 J 8 D 4
r 20 O 4 J 7 J 14 J 15 J 10 D 5
r 21 O 5 J 9 J 15 J 11 J 2 D 1
r 22 O 5 J 9 J 15 J 11 J 12 J 4 D 2
r 23 O 5 J 9 J 8 J 7 J 6 D 3
r 24 O 5 J 9 J 8 D 4
r 25 O 5 J 9 J 10 D 5
Table 4. Quantitative Performance and Ecological Impact Under Severe Sensor Malfunction.
Table 4. Quantitative Performance and Ecological Impact Under Severe Sensor Malfunction.
Performance MetricConventional DGD [26]Proposed H-MASE
Total Idling Delay (vehicle-hours) 1209.2 1199.8
Total CO2 Emissions (kg) 2611.89 2591.72
Net CO2 Reduction0.77% (20.17 kg saved per incident window)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cihan, A. Resilient Multi-Agent State Estimation for Smart City Traffic: A Systems Engineering Approach to Emission Mitigation. Appl. Sci. 2026, 16, 3972. https://doi.org/10.3390/app16083972

AMA Style

Cihan A. Resilient Multi-Agent State Estimation for Smart City Traffic: A Systems Engineering Approach to Emission Mitigation. Applied Sciences. 2026; 16(8):3972. https://doi.org/10.3390/app16083972

Chicago/Turabian Style

Cihan, Ahmet. 2026. "Resilient Multi-Agent State Estimation for Smart City Traffic: A Systems Engineering Approach to Emission Mitigation" Applied Sciences 16, no. 8: 3972. https://doi.org/10.3390/app16083972

APA Style

Cihan, A. (2026). Resilient Multi-Agent State Estimation for Smart City Traffic: A Systems Engineering Approach to Emission Mitigation. Applied Sciences, 16(8), 3972. https://doi.org/10.3390/app16083972

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop