A Fixed Dimension Modified Sinusoid Model (FD-MSM) for Single Microphone Sound Separation

Bibliographic details
Main authors: Mahale, P.M.B., Sayadiyan, A., Tashk, A.B.
Format: Conference paper
Language: English
Description
Abstract: The lack of a flexible analysis model has been identified as an important issue in applications such as source separation. In this paper, a fixed dimension modified sinusoid model (FD-MSM) is proposed for the analysis of all audible signals, including speech, music, and their mixtures. Peak picking in the Mel domain yields a fixed number of parameters in the proposed FD-MSM, which is desirable for clustering algorithms such as VQ (vector quantization) or GMM (Gaussian mixture model) that are commonly used in source separation. When the proposed FD-MSM is applied to various audible signals, the resulting signal is observed to be perceptually indistinguishable from the original.
DOI: 10.1109/ICSPC.2007.4728536
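
The abstract's central idea, that picking peaks in the Mel domain gives every frame the same number of parameters, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record does not give the algorithm, so the band layout (bands of equal width on the Mel scale, one strongest spectral peak kept per band), the Hann window, the common 2595/700 Mel formula, and the names `hz_to_mel`, `mel_to_hz`, `fixed_dim_peaks`, and `n_bands` are all assumptions made for illustration.

```python
import numpy as np


def hz_to_mel(f_hz):
    # Common Mel-scale approximation (an assumption; the record names no formula).
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)


def mel_to_hz(m):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)


def fixed_dim_peaks(frame, sample_rate, n_bands=20):
    """Return an (n_bands, 3) array of (frequency, amplitude, phase).

    Hypothetical sketch: one sinusoid is kept per Mel-spaced band, so the
    output size depends only on n_bands, not on the signal content.
    """
    frame = np.asarray(frame, dtype=float)
    spectrum = np.fft.rfft(frame * np.hanning(len(frame)))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    magnitude = np.abs(spectrum)

    # Band edges equally spaced in Mel, mapped back to Hz.
    nyquist_mel = hz_to_mel(sample_rate / 2.0)
    edges_hz = mel_to_hz(np.linspace(0.0, nyquist_mel, n_bands + 1))

    params = np.zeros((n_bands, 3))
    for k in range(n_bands):
        low, high = edges_hz[k], edges_hz[k + 1]
        in_band = np.flatnonzero((freqs >= low) & (freqs < high))
        if in_band.size == 0:
            # Band narrower than the bin spacing: use the bin nearest its centre.
            in_band = np.array([np.argmin(np.abs(freqs - 0.5 * (low + high)))])
        peak = in_band[np.argmax(magnitude[in_band])]
        params[k] = (freqs[peak], magnitude[peak], np.angle(spectrum[peak]))
    return params
```

Calling `fixed_dim_peaks(frame, 16000)` on frames of any content returns a 20 x 3 array, which can be flattened into a constant-length feature vector of the kind that VQ or GMM training expects. A plain sinusoidal model that keeps every detected peak would instead produce a varying number of parameters per frame.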