Abstract
Adaptive traffic signal control is a critical component of intelligent transportation systems, and multi-agent deep reinforcement learning (MARL) has attracted increasing interest due to its scalability and control efficiency. However, existing methods have two major drawbacks: (i) they are largely driven by current and historical traffic states, without explicit forecasting of upcoming traffic conditions, and (ii) their coordination mechanisms are often weak, making it difficult to model complex spatial dependencies in large-scale road networks and thereby limiting the benefits of coordinated control. To address these issues, we propose TG-MADDPG, which integrates short-term traffic prediction with a graph attention network (GAT) for regional signal control. A WT-GWO-CNN-LSTM traffic forecasting module predicts near-future states and injects them into the MARL framework to support anticipatory decision-making. Meanwhile, the GAT dynamically encodes road-network topology and adaptively captures inter-intersection spatial correlations. In addition, we design a reward based on normalized pressure difference to guide cooperative optimization of signal timing. Experiments on the SUMO simulator across synthetic and real-world networks under both off-peak and peak demands show that TG-MADDPG consistently achieves lower average waiting times, shorter queue lengths, and higher cumulative rewards than IQL, MADDPG, and GMADDPG, demonstrating strong effectiveness and generalization.