BeCoS corpus: Belgian Covid-19 Sign language corpus. A corpus for training Sign Language Recognition and Translation

We are presenting the Belgian Federal COVID-19 corpus, nicknamed the BeCoS (Belgian CovidSign language) corpus. It consists of the entire archive of official press conferences from the BelgianFederal Government concerning the COVID-19 pandemic. The speakers speak mostly in Dutchor French and occasio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computational Linguistics in the Netherlands Journal 2022-12, Vol.12, p.7-17
Hauptverfasser: Vandeghinste, Vincent, Van Dyck, Bob, De Coster, Mathieu, Goddefroy, Maud, Dambre, Joni
Format: Artikel
Sprache:eng
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We are presenting the Belgian Federal COVID-19 corpus, nicknamed the BeCoS (Belgian CovidSign language) corpus. It consists of the entire archive of official press conferences from the BelgianFederal Government concerning the COVID-19 pandemic. The speakers speak mostly in Dutchor French and occasionally in German, and nearly all speech is accompanied by a deaf signer whoperforms live interpreting from what is being said. We have preprocessed the corpus with speakerdiarisation, applied Belgian Dutch ASR, and post-ASR language identification and punctuationprediction as well as signer diarisation, sign language identification and sign language keypointrecognition. The corpus is made publicly available.
ISSN:2211-4009