The Speed Submission to DIHARD II: Contributions & Lessons Learned
This paper describes the speaker diarization systems developed for the Second DIHARD Speech Diarization Challenge (DIHARD II) by the Speed team. Besides describing the system, which considerably outperformed the challenge baselines, we also focus on the lessons learned from numerous approaches that...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper describes the speaker diarization systems developed for the Second
DIHARD Speech Diarization Challenge (DIHARD II) by the Speed team. Besides
describing the system, which considerably outperformed the challenge baselines,
we also focus on the lessons learned from numerous approaches that we tried for
single and multi-channel systems. We present several components of our
diarization system, including categorization of domains, speech enhancement,
speech activity detection, speaker embeddings, clustering methods,
resegmentation, and system fusion. We analyze and discuss the effect of each
such component on the overall diarization performance within the realistic
settings of the challenge. |
---|---|
DOI: | 10.48550/arxiv.1911.02388 |