Improving Language Identification of Accented Speech
Language identification from speech is a common preprocessing step in many spoken language processing systems. In recent years, this field has seen fast progress, mostly due to the use of self-supervised models pretrained on multilingual data and the use of large training corpora. This paper shows t...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Language identification from speech is a common preprocessing step in many
spoken language processing systems. In recent years, this field has seen fast
progress, mostly due to the use of self-supervised models pretrained on
multilingual data and the use of large training corpora. This paper shows that
for speech with a non-native or regional accent, the accuracy of spoken
language identification systems drops dramatically, and that the accuracy of
identifying the language is inversely correlated with the strength of the
accent. We also show that using the output of a lexicon-free speech recognition
system of the particular language helps to improve language identification
performance on accented speech by a large margin, without sacrificing accuracy
on native speech. We obtain relative error rate reductions ranging from to 35
to 63% over the state-of-the-art model across several non-native speech
datasets. |
---|---|
DOI: | 10.48550/arxiv.2203.16972 |