Chart-RCNN: Efficient Line Chart Data Extraction from Camera Images
Line Chart Data Extraction is a natural extension of Optical Character Recognition where the objective is to recover the underlying numerical information a chart image represents. Some recent works such as ChartOCR approach this problem using multi-stage networks combining OCR models with object det...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Line Chart Data Extraction is a natural extension of Optical Character
Recognition where the objective is to recover the underlying numerical
information a chart image represents. Some recent works such as ChartOCR
approach this problem using multi-stage networks combining OCR models with
object detection frameworks. However, most of the existing datasets and models
are based on "clean" images such as screenshots that drastically differ from
camera photos. In addition, creating domain-specific new datasets requires
extensive labeling which can be time-consuming. Our main contributions are as
follows: we propose a synthetic data generation framework and a one-stage model
that outputs text labels, mark coordinates, and perspective estimation
simultaneously. We collected two datasets consisting of real camera photos for
evaluation. Results show that our model trained only on synthetic data can be
applied to real photos without any fine-tuning and is feasible for real-world
application. |
---|---|
DOI: | 10.48550/arxiv.2211.14362 |