SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation

This paper studies referring video object segmentation (RVOS) by boosting video-level visual-linguistic alignment. Recent approaches model the RVOS task as a sequence prediction problem and perform multi-modal interaction as well as segmentation for each frame separately. However, the lack of a glob...

Bibliographic Details
Published in: arXiv.org, 2023-05
Main authors: Luo, Zhuoyan; Xiao, Yicheng; Liu, Yong; Li, Shuyan; Wang, Yitong; Tang, Yansong; Li, Xiu; Yang, Yujiu
Format: Article
Language: English
Subjects:
Online access: Full text