Error correction method based on multi-modal speech recognition result and related equipment

The embodiment of the invention provides an error correction method based on a multi-modal speech recognition result and related equipment. The method comprises the steps: carrying out the processing of the speech data of a user through employing an acoustic model and a language model, and obtaining...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	WEI TAO, XIAO JING, MA JUN, WANG SHAOJUN, ZHUANG ZIYANG
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The embodiment of the invention provides an error correction method based on a multi-modal speech recognition result and related equipment. The method comprises the steps: carrying out the processing of the speech data of a user through employing an acoustic model and a language model, and obtaining a plurality of first candidate recognition results, and corresponding acoustic scores and language scores; obtaining a weight score corresponding to each first candidate recognition result; taking the first candidate recognition result with the highest weight score as a target recognition result, and obtaining a text sequence vector of the target recognition result; determining a first candidate recognition result with the highest acoustic score from the plurality of first candidate recognition results, and obtaining a pinyin sequence vector corresponding to the first candidate recognition result with the highest acoustic score; and inputting the text sequence vector and the pinyin sequence vector into a pre-train