A Fixed Dimension Modified Sinusoid Model (FD-MSM) for Single Microphone Sound Separation

The lack of a flexible analysis model has been introduced as an important issue in different applications like source separation. In this paper, a fixed dimension modified sinusoid model (FD-MSM) is proposed for analysis of all audible signals consisting of speech, music and their mixtures. Employin...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Mahale, P.M.B., Sayadiyan, A., Tashk, A.B.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Computational complexity Linear predictive coding Mel-scale Microphones Multiple signal classification phase coherency Power harmonic filters Signal analysis Sinusoidal model Source separation Speech analysis Speech synthesis Vector quantization
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The lack of a flexible analysis model has been introduced as an important issue in different applications like source separation. In this paper, a fixed dimension modified sinusoid model (FD-MSM) is proposed for analysis of all audible signals consisting of speech, music and their mixtures. Employing the peak picking in Meldomain gives rise to a fixed number of parameters in the proposed FDMSM, which is desired in clustering algorithms like VQ (vector quantization) or GMM (gaussian mixture model), commonly used for source separation scenarios. Applying the proposed FD-MSM to various audible signals, it is observed that the resulting signal is perceptually indistinguishable from the original.
DOI:	10.1109/ICSPC.2007.4728536