FAST SEARCH IN SPEECH RECOGNITION

Speech recognition involves searching for the most likely one of a number of sequences of words, given a speech signal. Each such sequence is a composite sequence, composed of consecutive sequences of states. Searching involves a number of searches, each in a respective search space containing a sub...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
1. Verfasser:	SEIDE, FRANK, T., B
Format:	Patent
Sprache:	eng ; fre ; ger
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Speech recognition involves searching for the most likely one of a number of sequences of words, given a speech signal. Each such sequence is a composite sequence, composed of consecutive sequences of states. Searching involves a number of searches, each in a respective search space containing a subset of the sequences of states. In each search only the more likely sequences of states in the relevant search space are considered. In a first embodiment different search spaces are made up of sequences of states that follow preceding sequences from a class of sequences of words. Different classes define different ones of the search spaces. Classes are distinguished on the basis of phonetic history rather than word history, as represented by the sequences of states in the composite sequence up to the sequence of states in the search space. Thus, the number of words or parts thereof whose identity is used to distinguish different classes is varied depending on a length of one or more last words represented by the composite sequence. In a second embodiment, a plurality different composite sequences are involved in a search through a joint sequence of states, for which representative likelihood information for the plurality is used to decide whether or not to discard it in the search. At the end of the search the likelihood for the different composite sequences is regenerated from the joint sequence if it survived the search, and further search is based on the regenerated likelihood. In a third embodiment, this technique is applied within searches at the subword level.