Speech resources for a Serbian LVCSR system

This paper describes the whole procedure of speech database collection and processing required for building a good large vocabulary speech recognition system for the Serbian language. The speech database consists of speech recordings from audio books, radio programs and talk shows, as well as read u...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Ostrogonac, Stevan, Suzic, Sinisa, Bojanic, Milana, Pakoci, Edvin
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Acoustics large vocabulary continuous speech recognition Materials Serbian Speech speech database Speech recognition Training Vocabulary
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper describes the whole procedure of speech database collection and processing required for building a good large vocabulary speech recognition system for the Serbian language. The speech database consists of speech recordings from audio books, radio programs and talk shows, as well as read utterances from an array of male and female speakers. To date, around 200 hours of read speech is collected, as well as about 10 hours of radio recordings.
DOI:	10.1109/TELFOR.2013.6716271