A System for Information Retrieval from Large Records of Czech Spoken Data

In the paper we describe a complex multi-level system that serves for automatic search in large records of Czech spoken data. It includes modules for audio signal segmentation, speaker identification and adaptation, speech recognition and full-text search. The search can focus both on key-words and...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Nouza, Jan, Žďánský, Jindřich, Červa, Petr, Kolorenč, Jan
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Applied sciences Artificial intelligence Audio Signal Automatic Speech Recognition Broadcast News Computer science control theory systems Exact sciences and technology Information systems. Data bases Memory organisation. Data processing Software Speaker Identification Speech and sound recognition and synthesis. Linguistics Speech Recognition
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In the paper we describe a complex multi-level system that serves for automatic search in large records of Czech spoken data. It includes modules for audio signal segmentation, speaker identification and adaptation, speech recognition and full-text search. The search can focus both on key-words and key-speakers. The transcription accuracy is about 79 % (for broadcast programs), search accuracy about 90 %. Due to its distributed platform, the system can operate in almost real-time.
ISSN:	0302-9743 1611-3349
DOI:	10.1007/11846406_61