Comparative evaluations of robust and accurate F0 estimates in reverberant environments

Bibliographic details
Main authors: Unoki, M., Hosorogiya, T., Ishimoto, Y.
Format: Conference proceedings
Language: English
Description
Summary: This paper reports a comparative evaluation of our previously proposed method of estimating fundamental frequency (F0), which is based on complex cepstrum analysis, against nine typical methods over large speech-sound datasets in both artificial and realistic reverberant environments (room acoustics). The compared methods include several classic algorithms (Cepstrum, AMDF, TPC, and modified autocorrelation) and a few modern algorithms (TEMPO, YIN, and PHIA). The comparative results revealed that the percentage of correct F0 estimates obtained with these methods dropped drastically as the reverberation time increased, while F0 estimated with the proposed method remained robust and accurate. They also demonstrated that homomorphic analysis and the concept of a source-filter model were relatively effective for estimating F0. Overall, the proposed method was much better than the previously reported methods in terms of robustness and accuracy of F0 estimates in both artificial and realistic reverberant environments.
ISSN: 1520-6149, 2379-190X
DOI: 10.1109/ICASSP.2008.4518673