METHOD FOR ENHANCING TELEPHONE VOICE SIGNAL BASED ON CONVOLUTIONAL NEURAL NETWORK

To provide a method for reducing effects of acoustic distortion on telephone voice based on a convolutional neural network (CNN).SOLUTION: A method includes: a preprocessing step A which involves extracting an amplitude and a phase of spectral representation of a telephone voice signal; a voice enha...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: INIIGO GARCIA MORTE, ANTONIO MIGUEL ARTIAGA, JAVIER GALLART MAURI, ALFONSO ORTEGA GIMENEZ, EDUARDO LLEIDA SOLANO, DAYANA RIBAS GONZALEZ
Format: Patent
Sprache:eng ; jpn
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:To provide a method for reducing effects of acoustic distortion on telephone voice based on a convolutional neural network (CNN).SOLUTION: A method includes: a preprocessing step A which involves extracting an amplitude and a phase of spectral representation of a telephone voice signal; a voice enhancement step B which estimates a Wiener gain by a trained convolutional neural network, and emphasizes the amplitude of the spectral representation of the voice signal using a voice enhancement filter based on the estimated Wiener gain; and a post-processing step C which obtains the enhanced voice signal based on the amplitude and initial phase of the emphasized spectral representation.SELECTED DRAWING: Figure 2 【課題】畳み込みニューラルネットワーク(CNN)に基づいて電話音声における音響歪みの影響を低減する方法を提供する。【解決手段】方法は、電話音声信号のスペクトル表現の振幅及び位相を抽出することを含む前処理段階Aと、訓練された畳み込みニューラルネットワークによりウィーナーゲインを推定し、推定されたウィーナーゲインに基づく音声強調フィルタにより音声信号のスペクトル表現の振幅を強調する音声強調段階Bと、強調されたスペクトル表現の振幅と初期位相とに基づき強調された音声信号を得る後処理段階Cと、を含む。【選択図】図2