DeepWriter: A Multi-Stream Deep CNN for Text-independent Writer Identification
Text-independent writer identification is challenging due to the huge variation of written contents and the ambiguous written styles of different writers. This paper proposes DeepWriter, a deep multi-stream CNN to learn deep powerful representation for recognizing writers. DeepWriter takes local han...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Text-independent writer identification is challenging due to the huge
variation of written contents and the ambiguous written styles of different
writers. This paper proposes DeepWriter, a deep multi-stream CNN to learn deep
powerful representation for recognizing writers. DeepWriter takes local
handwritten patches as input and is trained with softmax classification loss.
The main contributions are: 1) we design and optimize multi-stream structure
for writer identification task; 2) we introduce data augmentation learning to
enhance the performance of DeepWriter; 3) we introduce a patch scanning
strategy to handle text image with different lengths. In addition, we find that
different languages such as English and Chinese may share common features for
writer identification, and joint training can yield better performance.
Experimental results on IAM and HWDB datasets show that our models achieve high
identification accuracy: 99.01% on 301 writers and 97.03% on 657 writers with
one English sentence input, 93.85% on 300 writers with one Chinese character
input, which outperform previous methods with a large margin. Moreover, our
models obtain accuracy of 98.01% on 301 writers with only 4 English alphabets
as input. |
---|---|
DOI: | 10.48550/arxiv.1606.06472 |