Dataset For Icfhr2018 Competition On Automated Text Recognition On A Read Dataset

The main idea of this dataset is to analyse the impact of training data. How many training data specific to the document, you are transcribing, is necessary? general data: This is a collection of heterogeneous documents to train an initial system. For each text line there is an image file of that li...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Strauss, Tobias, Leifert, Gundram, Labahn, Roger, Hodel, Tobias, Mühlberger, Günter
Format:	Dataset
Sprache:	ger
Schlagworte:	text recognition, adaptation to new hands
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The main idea of this dataset is to analyse the impact of training data. How many training data specific to the document, you are transcribing, is necessary? general data: This is a collection of heterogeneous documents to train an initial system. For each text line there is an image file of that line, a file with the ground truth text and an information file containing an automatically generated surrounding polygon. specific data: The specific data contains documents related to the test data. For the specific systems only the images of the train list may be used. The file are of the same type as the general data. test data: The test data contains only the images and the information files. More Information, some published results and an evaluation procedure at https://scriptnet.iit.demokritos.gr/competitions/10/
DOI:	10.5281/zenodo.1442181