Error correction method based on multi-modal speech recognition result and related equipment

The embodiment of the invention provides an error correction method based on a multi-modal speech recognition result and related equipment. The method comprises the steps: carrying out the processing of the speech data of a user through employing an acoustic model and a language model, and obtaining...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: WEI TAO, XIAO JING, MA JUN, WANG SHAOJUN, ZHUANG ZIYANG
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The embodiment of the invention provides an error correction method based on a multi-modal speech recognition result and related equipment. The method comprises the steps: carrying out the processing of the speech data of a user through employing an acoustic model and a language model, and obtaining a plurality of first candidate recognition results, and corresponding acoustic scores and language scores; obtaining a weight score corresponding to each first candidate recognition result; taking the first candidate recognition result with the highest weight score as a target recognition result, and obtaining a text sequence vector of the target recognition result; determining a first candidate recognition result with the highest acoustic score from the plurality of first candidate recognition results, and obtaining a pinyin sequence vector corresponding to the first candidate recognition result with the highest acoustic score; and inputting the text sequence vector and the pinyin sequence vector into a pre-train