SVM-Enabled Voice Activity Detection

Detecting the presence of speech in a noisy signal is an unsolved problem affecting numerous speech processing applications. This paper shows an effective method employing support vector machines (SVM) for voice activity detection (VAD) in noisy environments. The use of kernels in SVM enables to map...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Ramírez, Javier, Yélamos, Pablo, Górriz, Juan Manuel, Puntonet, Carlos G., Segura, José C.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Applied sciences Artificial intelligence Computer science control theory systems Connectionism. Neural networks Data processing. List processing. Character string processing Exact sciences and technology False Alarm Rate Memory organisation. Data processing Radial Basis Function Software Speech and sound recognition and synthesis. Linguistics Support Vector Machine Support Vector Machine Model Voice Activity Detector
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Detecting the presence of speech in a noisy signal is an unsolved problem affecting numerous speech processing applications. This paper shows an effective method employing support vector machines (SVM) for voice activity detection (VAD) in noisy environments. The use of kernels in SVM enables to map the data into some other dot product space (called feature space) via a nonlinear transformation. The feature vector includes the subband signal-to-noise ratios of the input speech and a radial basis function (RBF) kernel is used as SVM model. It is shown the ability of the proposed method to learn how the signal is masked by the acoustic noise and to define an effective non-linear decision rule. The proposed approach shows clear improvements over standardized VADs for discontinuous speech transmission and distributed speech recognition, and other recently reported VADs.
ISSN:	0302-9743 1611-3349
DOI:	10.1007/11760023_99