Modeling human–human interaction with attention-based high-order GCN for trajectory prediction

This paper presents a novel high-order graph convolutional network (GCN) for pedestrian trajectory prediction. Specifically, the walking state of a target pedestrian depends on both its historical trajectory, which encodes its speed, walking direction and acceleration information, as well as the mov...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Visual computer 2022-07, Vol.38 (7), p.2257-2269
Hauptverfasser: Fang, Yanyan, Jin, Zhiyu, Cui, Zhenhua, Yang, Qiaowen, Xie, Tianyi, Hu, Bo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper presents a novel high-order graph convolutional network (GCN) for pedestrian trajectory prediction. Specifically, the walking state of a target pedestrian depends on both its historical trajectory, which encodes its speed, walking direction and acceleration information, as well as the movement of its neighbors. Thus we propose to leverage GCNs to aggregate the trajectory features of the target pedestrian and its neighbors to predict the movement of the target pedestrian. Considering that the movement of the neighbors’ neighbors affects the movement of the target pedestrian’s neighbors, thus indirectly affecting the movement of the target pedestrian, we propose to use a high-order GCN for human–human interaction modelling. Such a high-order GCN considers the target pedestrian’s neighbors as well as its neighbors’ neighbors. Further, a pedestrian avoids collision with others by estimating its locations and its neighbors’ upcoming locations, and it slows down or changes direction if it believes a collision may occur, especially in very crowded scenes. In light of this, we propose to model such anticipation-based decision making behavior as attention and combine it with our high-order GCN. Thus we first roughly estimate the future trajectories of all pedestrians with a simple method. By using the coarse predicted future trajectory and GCN outputs, we calculate the attention in our attention-based high-order GCN and predict future trajectory. Extensive experiments validate the effectiveness of our approach. In addition, our model shows a higher data efficiency. On the ETH&UCY dataset, using only 5 % of the training data for each training epoch, our model outperforms the state of the art.
ISSN:0178-2789
1432-2315
DOI:10.1007/s00371-021-02109-2