Remixed2Remixed: Domain adaptation for speech enhancement by Noise2Noise learning with Remixing
This paper proposes Remixed2Remixed, a domain adaptation method for speech enhancement, which adopts Noise2Noise (N2N) learning to adapt models trained on artificially generated (out-of-domain: OOD) noisy-clean pair data to better separate real-world recorded (in-domain) noisy data. The proposed met...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper proposes Remixed2Remixed, a domain adaptation method for speech
enhancement, which adopts Noise2Noise (N2N) learning to adapt models trained on
artificially generated (out-of-domain: OOD) noisy-clean pair data to better
separate real-world recorded (in-domain) noisy data. The proposed method uses a
teacher model trained on OOD data to acquire pseudo-in-domain speech and noise
signals, which are shuffled and remixed twice in each batch to generate two
bootstrapped mixtures. The student model is then trained by optimizing an
N2N-based cost function computed using these two bootstrapped mixtures. As the
training strategy is similar to the recently proposed RemixIT, we also
investigate the effectiveness of N2N-based loss as a regularization of RemixIT.
Experimental results on the CHiME-7 unsupervised domain adaptation for
conversational speech enhancement (UDASE) task revealed that the proposed
method outperformed the challenge baseline system, RemixIT, and reduced the
blurring of performance caused by teacher models. |
---|---|
DOI: | 10.48550/arxiv.2312.16836 |