SparseComm: An Efficient Sparse Communication Framework for Vehicle-Infrastructure Cooperative 3D Detection

Bibliographic details
Published in: Pattern Recognition, 2025-02, Vol. 158, p. 110961, Article 110961
Main authors: Liu, Haizhuang; Chu, Huazhen; Zhuo, Junbao; Zou, Bochao; Chen, Jiansheng; Ma, Huimin
Format: Article
Language: English
Description
Abstract: Collaborative perception, which aims to achieve a comprehensive perception range through inter-agent communication, faces challenges such as high communication costs and domain gaps between agents. This paper introduces SparseComm, a sparse-communication collaborative perception framework designed to mitigate these challenges. SparseComm operates efficiently in sparse feature spaces and aggregates features related to the same object through a sparse instance communication module. A sparse 3D cooperation module enhances the 3D feature representation during communication, improving detection performance. Furthermore, a bounding box restoration module recovers bounding boxes that go undetected because of feature fusion and addresses the quality drop caused by domain gaps, at minimal additional communication cost. Extensive experiments on the DAIR-V2X and V2XSet datasets demonstrate the efficacy of SparseComm, which achieves 47.12% at 211.46 communication bytes on DAIR-V2X and 78.03% at 216.63 communication bytes on V2XSet. Notably, SparseComm reduces communication consumption by 10× to 1000× compared with prior methods while maintaining detection performance.
• Features within an object are aggregated to reduce communication volume.
• A sparse 3D cooperation module is proposed to enhance agent interaction.
• A bounding box restoration module is introduced to recover the undetected boxes.
• SparseComm demonstrates state-of-the-art results on the DAIR-V2X and V2XSet datasets.
ISSN:0031-3203
DOI:10.1016/j.patcog.2024.110961