Real-Time Network-Level Traffic Signal Control: An Explicit Multiagent Coordination Method

Traffic signal control (TSC) has been one of the most useful ways for reducing urban road congestion. The challenge of TSC includes 1) real-time signal decision, 2) the complexity in traffic dynamics, and 3) the network-level coordination. Reinforcement learning (RL) methods can query policies by ma...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on intelligent transportation systems 2024-12, Vol.25 (12), p.19688-19698
Hauptverfasser: Wang, Wanyuan, Zhang, Haipeng, Qiao, Tianchi, Ma, Jinming, Jin, Jiahui, Li, Zhibin, Wu, Weiwei, Jiang, Yichuan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Traffic signal control (TSC) has been one of the most useful ways for reducing urban road congestion. The challenge of TSC includes 1) real-time signal decision, 2) the complexity in traffic dynamics, and 3) the network-level coordination. Reinforcement learning (RL) methods can query policies by mapping the traffic state to the signal decision in real-time, however, are inadequate for different traffic flow environment. By observing real traffic information, online planning methods can compute the signal decisions in a responsive manner. Unfortunately, existing online planning methods either require high computation complexity or get stuck in local coordination. Against this background, we propose an explicit multiagent coordination (EMC)-based online planning methods that can satisfy adaptive, real-time and network-level TSC. By multiagent, we model each intersection as an autonomous agent, and the coordination efficiency is modeled by a cost function between neighbor intersections. By network-level coordination, each agent exchanges messages of cost function with its neighbors in a fully decentralized manner. By real-time, the message-passing procedure can interrupt at any time when the real time limit is reached and agents select the optimal signal decisions according to current message. Finally, we test our EMC method in both synthetic and real road network datasets. Experimental results are encouraging: compared to RL and conventional transportation baselines, our EMC method performs reasonably well in terms of adapting to real-time traffic dynamics, minimizing vehicle travel time and scalability to city-scale road networks.
ISSN:1524-9050
1558-0016
DOI:10.1109/TITS.2024.3468295