LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation

Recent advancements in integrating speech information into large language models (LLMs) have significantly improved automatic speech recognition (ASR) accuracy. However, existing methods often constrained by the capabilities of the speech encoders under varied acoustic conditions, such as accents. T...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Li, Shaojun, Shang, Hengchao, Wei, Daimeng, Guo, Jiaxin, Li, Zongyao, He, Xianghui, Zhang, Min, Yang, Hao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!