Organizing a System for Continuous Speech Understanding

In spectrogram reading experiments [Klatt and Stevens, Boston Speech Conference (1972)], the performance obtained by human experts for phonetic segmentation and labeling was approximately 75% partially or completely labeled but correctly specified, 15% mislabeled, and 10% segments missed. When synta...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of the Acoustical Society of America 1973-07, Vol.54 (1_Supplement), p.338-339
Hauptverfasser: Woods, W. A., Makhoul, J., Wolf, J., Rovner, P.
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In spectrogram reading experiments [Klatt and Stevens, Boston Speech Conference (1972)], the performance obtained by human experts for phonetic segmentation and labeling was approximately 75% partially or completely labeled but correctly specified, 15% mislabeled, and 10% segments missed. When syntactic, semantic, and vocabulary constraints were applied, the success rate in identifying words was 96%. How does one design a computer system to match this performance at word identification? This paper describes the BBN speech understanding system, which is being designed to extract the meaning from continuous speech inputs in a limited semantic environment by using syntactic, semantic, and vocabulary information to overcome vagueness and errors in phonetic segmentation and labeling. The system contains components for acoustic feature extraction, phonetic segmentation and labeling, word retrieval from partial phonetic information, evaluation of word-match quality, and syntactic and semantic evaluation of partial utterances and proposal of words from context. A control component embodies the strategy for applying different components at appropriate times and places depending on conditions which will vary from point to point in an analysis.
ISSN:0001-4966
1520-8524
DOI:10.1121/1.1978370