Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On PercepNet
Speech enhancement algorithms based on deep learning have greatly surpassed their traditional counterparts and are now being considered for the task of removing acoustic echo from hands-free communication systems. This is a challenging problem due to both real-world constraints like loudspeaker non-...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Speech enhancement algorithms based on deep learning have greatly surpassed
their traditional counterparts and are now being considered for the task of
removing acoustic echo from hands-free communication systems. This is a
challenging problem due to both real-world constraints like loudspeaker
non-linearities, and to limited compute capabilities in some communication
systems. In this work, we propose a system combining a traditional acoustic
echo canceller, and a low-complexity joint residual echo and noise suppressor
based on a hybrid signal processing/deep neural network (DSP/DNN) approach. We
show that the proposed system outperforms both traditional and other neural
approaches, while requiring only 5.5% CPU for real-time operation. We further
show that the system can scale to even lower complexity levels. |
---|---|
DOI: | 10.48550/arxiv.2102.05245 |