Generalized Likelihood Ratio Test for Voiced-Unvoiced Decision in Noisy Speech Using the Harmonic Model

In this paper, a novel method for voiced-unvoiced decision within a pitch tracking algorithm is presented. Voiced-unvoiced decision is required for many applications, including modeling for analysis/synthesis, detection of model changes for segmentation purposes and signal characterization for index...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on audio, speech, and language processing speech, and language processing, 2006-03, Vol.14 (2), p.502-510
Hauptverfasser:	Fisher, E., Tabrikian, J., Dubnov, S.
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Applied sciences Character recognition Detection, estimation, filtering, equalization, prediction Exact sciences and technology Generalized likelihood ratio test (GLRT) harmonic model Harmonics Indexing Information, signal and communications theory Likelihood ratio likelihood ratio test (LRT) Mathematical models maximum a-posteriori probability Miscellaneous Noise noisy speech pitch tracking Signal analysis Signal and communications theory Signal processing Signal synthesis Signal to noise ratio Signal, noise Speech Speech analysis Speech enhancement Speech processing Speech recognition Speech synthesis Telecommunications and information theory Testing Tracking voice activity detection (VAD) voiced-unvoiced decision
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper, a novel method for voiced-unvoiced decision within a pitch tracking algorithm is presented. Voiced-unvoiced decision is required for many applications, including modeling for analysis/synthesis, detection of model changes for segmentation purposes and signal characterization for indexing and recognition applications. The proposed method is based on the generalized likelihood ratio test (GLRT) and assumes colored Gaussian noise with unknown covariance. Under voiced hypothesis, a harmonic plus noise model is assumed. The derived method is combined with a maximum a-posteriori probability (MAP) scheme to obtain a pitch and voicing tracking algorithm. The performance of the proposed method is tested using several speech databases for different levels of additive noise and phone speech conditions. Results show that the GLRT is robust to speaker and environmental conditions and performs better than existing algorithms.
ISSN:	1558-7916 2329-9290 1558-7924 2329-9304
DOI:	10.1109/TSA.2005.857806