Das PaGeS-Korpus, ein Parallelkorpus der deutschen und spanischen Gegenwartssprache
The corpus PaGeS is a bilingual parallel corpus, that comprises a collection of contemporary Spanish and German texts. This paper describes the different steps in the construction of the corpus. The description includes the manual preparation process of the texts to make the documents suitable for...
Gespeichert in:
Veröffentlicht in: | Revista de filología alemana 2018-01, Vol.26, p.181 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | ger |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The corpus PaGeS is a bilingual parallel corpus, that comprises a collection of contemporary Spanish and German texts. This paper describes the different steps in the construction of the corpus. The description includes the manual preparation process of the texts to make the documents suitable for further processing, the linguistic annotation and the manual and automatic procedure of the sentence alignment of the texts. It is dealt with the access and the visualization of the data and the different search possibilities are explained. Finally, the next steps of future work are outlined. |
---|---|
ISSN: | 1133-0406 1988-2823 |
DOI: | 10.5209/RFAL.60148 |