VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization

Text spotting, a task involving the extraction of textual information from image or video sequences, faces challenges in cross-domain adaption, such as image-to-image and image-to-video generalization. In this paper, we introduce a new method, termed VimTS, which enhances the generalization ability...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Liu, Yuliang, Huang, Mingxin, Yan, Hao, Deng, Linger, Wu, Weijia, Lu, Hao, Shen, Chunhua, Jin, Lianwen, Bai, Xiang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!