A Parallel Backbone Networks Structure for Scene Text Detection

Bibliographic details
Published in: Journal of physics. Conference series 2021-05, Vol.1927 (1), p.12016
Main author: Li, Linyuan
Format: Article
Language: English
Online access: Full text
Description
Summary: Text detection in complex scenes is difficult because of the diversity of text distribution, orientation, and layout. This paper proposes an end-to-end scene text detection method that combines a parallel backbone network with region segmentation. Using multiple deformable convolutions to extract features of multi-dimensional text regions, the method generates multiple candidate regions of different sizes and then assigns a corresponding state to each. Experiments show that, compared with the baseline, the method copes better with the loss of accuracy caused by the varying shapes and angles of targets in the image.
ISSN: 1742-6588, 1742-6596
DOI: 10.1088/1742-6596/1927/1/012016