Pitch Restoration for Robust Speech Recognition

The changing on speech peaks structure is perhaps the most important cause of degradation of speech recognition systems under adverse conditions. Another drawback concerned to the additive noise effect occurs on the flat spectral zones which are usually raised. These combined effects on both the pea...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Lima, C. S., Tavares, Adriano, Silva, Carlos A.
Format:	Buchkapitel
Sprache:	eng
Schlagworte:	Applied sciences Artificial intelligence Automatic Speech Recognition System Computer science control theory systems Exact sciences and technology Features robustness HMM modelling Noise Effect Noise Estimate Science & Technology Social Sciences Software Speech and sound recognition and synthesis. Linguistics Speech Recognition System Speech Region
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The changing on speech peaks structure is perhaps the most important cause of degradation of speech recognition systems under adverse conditions. Another drawback concerned to the additive noise effect occurs on the flat spectral zones which are usually raised. These combined effects on both the peaked and the flat spectral zones can be alleviated by trying to restore its original structure, which assumes noise knowledge. This paper suggests noise estimation in a frame by frame basis by assuming the clean database as lightly corrupted. The noise estimate is then used to restore both the peaked and the flat spectral zones of the speech spectrum. This algorithm was implemented over a baseline spectral normalisation method. This method was developed by taking into consideration that, while the speech regions with less energy need more robustness, since in these regions the noise is more dominant, the “peaked” spectral regions which are the most reliable due to the higher speech energy must also be preserved as much as possible by the feature extraction process.
ISSN:	0302-9743 1611-3349
DOI:	10.1007/3-540-45011-4_3