STN-GCN: Spatial and Temporal Normalization Graph Convolutional Neural Networks for Traffic Flow Forecasting

In recent years, traffic forecasting has gradually become a core component of smart cities. Due to the complex spatial-temporal correlation of traffic data, traffic flow prediction is highly challenging. Existing studies are mainly focused on graphical modeling of fixed road structures. However, thi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Electronics (Basel) 2023-07, Vol.12 (14), p.3158
Hauptverfasser: Wang, Chunzhi, Wang, Lu, Wei, Siwei, Sun, Yun, Liu, Bowen, Yan, Lingyu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In recent years, traffic forecasting has gradually become a core component of smart cities. Due to the complex spatial-temporal correlation of traffic data, traffic flow prediction is highly challenging. Existing studies are mainly focused on graphical modeling of fixed road structures. However, this fixed graphical structure cannot accurately capture the relationship between different roads, affecting the accuracy of long-term traffic flow prediction. In order to address this problem, this paper proposes a modeling framework STN-GCN for spatial-temporal normalized graphical convolutional neural networks. In terms of temporal dependence, spatial-temporal normalization was used to divide the data into high-frequency and low-frequency parts, allowing the model to extract more distinct features. In addition, fine data input to the temporal convolutional network (TCN) was used in this module to conduct more detailed temporal feature extraction so as to ensure the accuracy of long-term sequence extraction. In addition, the transformer module was added to the model, which captured the real-time state of traffic flow by extracting spatial dependencies and dynamically establishing spatial correlations through a self-attention mechanism. During the training process, a curriculum learning (CL) method was adopted, which provided optimized target sequences. Learning from easier targets can help avoid getting trapped in local minima and yields better generalization performance to more accurately approximate global minima. As shown by experimental results the model performed well on two real-world public transportation datasets, METR-LA and PEMS-BAY.
ISSN:2079-9292
2079-9292
DOI:10.3390/electronics12143158