Beam forming voice separation method based on complex space angle center Gaussian hybrid clustering model and bidirectional long and short time memory network

The invention discloses a beam forming voice separation method based on a complex space angle center Gaussian hybrid clustering model and a bidirectional long and short time memory network. In the training stage, a logarithmic power spectrum of a reference channel training voice signal is calculated...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	ZHOU LIN, WANG QIRUI, XU YUE, DENG YUXI, CHENG YUNLING, CAO YANXIANG, ZHUANG CHENGHAO
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The invention discloses a beam forming voice separation method based on a complex space angle center Gaussian hybrid clustering model and a bidirectional long and short time memory network. In the training stage, a logarithmic power spectrum of a reference channel training voice signal is calculated, and sine and cosine values of a phase difference between the reference channel training voice signal and other channel training voice signals are used as input characteristics of the bidirectional long-short time memory network. Based on a complex space angle center Gaussian hybrid clustering model, calculating a masking value of each target sound source as a training target of the bidirectional long-short term memory network, and using mean square error loss as a loss function. In the test stage, according to the masking estimation value of each target sound source output by the bidirectional long-short-term memory network in the reference channel test voice signal, the covariance matrix of the multi-channel tes