Dataset For Icfhr2018 Competition On Automated Text Recognition On A Read Dataset
The main idea of this dataset is to analyse the impact of training data. How many training data specific to the document, you are transcribing, is necessary? general data: This is a collection of heterogeneous documents to train an initial system. For each text line there is an image file of that li...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Dataset |
Sprache: | ger |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The main idea of this dataset is to analyse the impact of training data. How many training data specific to the document, you are transcribing, is necessary?
general data: This is a collection of heterogeneous documents to train an initial system. For each text line there is an image file of that line, a file with the ground truth text and an information file containing an automatically generated surrounding polygon.
specific data: The specific data contains documents related to the test data. For the specific systems only the images of the train list may be used. The file are of the same type as the general data.
test data: The test data contains only the images and the information files.
More Information, some published results and an evaluation procedure at https://scriptnet.iit.demokritos.gr/competitions/10/ |
---|---|
DOI: | 10.5281/zenodo.1442181 |