Use of geographical meta-data in ASR language and acoustic models

Bibliographic Details
Main Authors:	Bocchieri, Enrico; Caseiro, Diamantino
Format:	Conference Paper
Language:	English
Subjects:	acoustic; Acoustic applications; ASR; Automatic speech recognition; Cities and towns; Hidden Markov models; language; Local metadata; Natural languages; Speech recognition; State estimation; Telephony; Testing; User interfaces

Description
Summary:	The query distribution, in the speech recognition applications of directory assistance (DA) and voice-search, depends on the customer's location. This motivates the research on query models conditioned on the user location, here denoted as local models. We describe and test our methods for the estimation of local models with various degrees of spatial "granularity", for the recognition of city-state (sub-task of DA) and for the recognition of business listings, spoken over iPhones in a nation-wide business-listing voice-search service. Our local language models improve the accuracy of city-state by 2.4% absolute (32% relative error reduction), and of voice-search by 2.2% absolute (7% relative).
ISSN:	1520-6149 2379-190X
DOI:	10.1109/ICASSP.2010.5495026