Performance of State-Shared Multiagent Deep Reinforcement Learning Controlled Signal Corridor with Platooning-Based CAVs

AbstractThe emerging technologies of connected and automated vehicles (CAVs) and deep reinforcement learning (DRL) provide innovative methods and have a great potential for developing new solutions to improve the efficiency of several intersection systems. Based on the multisource data collected fro...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of transportation engineering, Part A Part A, 2023-08, Vol.149 (8)
Hauptverfasser: Song, Li, Fan, Wei “David”
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:AbstractThe emerging technologies of connected and automated vehicles (CAVs) and deep reinforcement learning (DRL) provide innovative methods and have a great potential for developing new solutions to improve the efficiency of several intersection systems. Based on the multisource data collected from the transportation environments, CAVs with the cooperative adaptive cruise control (CACC) system could merge into platoons and traverse the intersection quickly and smoothly. Meanwhile, the traffic information about the CAVs enables intelligent traffic signal controls with the help of DRL technologies. This research investigates the performance of a state-shared multiagent deep reinforcement learning (MADRL) controlled signal corridor with platooning-based CAVs. A corridor with seven intersections from the Ingolstadt Traffic Scenario (InTAS) in Germany is selected as a case study. The state information is shared between neighboring intersections to overcome the partial information observation of the decentralized agents in the MADRL framework. A platooning framework with specific CACC systems for the leading and following vehicles is proposed. Results indicate that the state-shared MADRL with CAV platoons could significantly decrease the total waiting time, average queue length, and total CO2 emission of the corridor by 80%, 73%, and 54%, respectively, which could be beneficial in further improving the intersection efficiency, designing future intersections, and cooperating signals and CAVs platoons.
ISSN:2473-2907
2473-2893
DOI:10.1061/JTEPBS.TEENG-7768