En-HACN: Enhancing Hybrid Architecture With Fast Attention and Capsule Network for End-to-end Speech Recognition

Automatic speech recognition (ASR) is a fundamental technology in the field of artificial intelligence. End-to-end (E2E) ASR is favored for its state-of-the-art performance. However, E2E speech recognition still faces speech spatial information loss and text local information loss, which results in...

Bibliographic Details
Published in:	IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, Vol. 31, pp. 1050-1062
Main authors:	Lyu, Boyang; Fan, Chunxiao; Ming, Yue; Zhao, Panzi; Hu, Nannan
Format:	Article
Language:	English
Subjects:	Artificial intelligence; Attention; Automatic speech recognition; capsule network; Computational complexity; Computational modeling; conformer; Convolution; end-to-end; fast attention mechanism; Feature extraction; Generalization; Hybrid structures; Inference; Mass media; Novels; Spatial data; Spectrogram; Speech recognition; State of the art; Transformers; Voice recognition