An FPGA implementation of speech recognition with weighted finite state transducers

In this paper we present a hardware architecture for large vocabulary continuous speech recognition that conducts a search over a weighted finite state transducer (WFST) network. A pipelined architecture is proposed to fully utilize the memory bandwidth. A hash table is used to manage small sized wo...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Jungwook Choi, Kisun You, Wonyong Sung
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Automata Bandwidth Character recognition Field programmable gate arrays FPGA Hardware Hidden Markov models Real time systems Speech recognition Transducers Vocabulary Weighted Finite State Transducer
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper we present a hardware architecture for large vocabulary continuous speech recognition that conducts a search over a weighted finite state transducer (WFST) network. A pipelined architecture is proposed to fully utilize the memory bandwidth. A hash table is used to manage small sized working sets efficiently. We also applied a parallelization technique that increases the traversal speed by 17%. The recognition system is fully functional on an FPGA, which runs at 100 MHz. The experimental result on the Wall Street Journal 5,000 vocabulary task shows that the recognition speed of the system is 5.3 × faster than real-time.
ISSN:	1520-6149 2379-190X
DOI:	10.1109/ICASSP.2010.5495538