A System for Information Retrieval from Large Records of Czech Spoken Data

In the paper we describe a complex multi-level system that serves for automatic search in large records of Czech spoken data. It includes modules for audio signal segmentation, speaker identification and adaptation, speech recognition and full-text search. The search can focus both on key-words and...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Nouza, Jan, Žďánský, Jindřich, Červa, Petr, Kolorenč, Jan
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Applied sciences Artificial intelligence Audio Signal Automatic Speech Recognition Broadcast News Computer science control theory systems Exact sciences and technology Information systems. Data bases Memory organisation. Data processing Software Speaker Identification Speech and sound recognition and synthesis. Linguistics Speech Recognition
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	492
container_issue
container_start_page	485
container_title
container_volume
creator	Nouza, Jan Žďánský, Jindřich Červa, Petr Kolorenč, Jan
description	In the paper we describe a complex multi-level system that serves for automatic search in large records of Czech spoken data. It includes modules for audio signal segmentation, speaker identification and adaptation, speech recognition and full-text search. The search can focus both on key-words and key-speakers. The transcription accuracy is about 79 % (for broadcast programs), search accuracy about 90 %. Due to its distributed platform, the system can operate in almost real-time.
doi_str_mv	10.1007/11846406_61
format	Conference Proceeding
fullrecord	<record><control><sourceid>pascalfrancis_sprin</sourceid><recordid>TN_cdi_pascalfrancis_primary_19688167</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>19688167</sourcerecordid><originalsourceid>FETCH-LOGICAL-p219t-96ded8766446040e8979e30886e5db1233c926a7da5e390fd7e801319600c4843</originalsourceid><addsrcrecordid>eNpNkEtLAzEUheMLrLUr_0A2LlyM3jvJ5LGU-qoUBKvgLqQzmTq2MxmSQai_3kgVvIt74Z6PA-cQcoZwiQDyClFxwUEYgXvkhBUcmAaNb_tkhAIxY4zrAzLRUv1poA_JCBjkmZacHZNJjB-QhuXAinxEHq_pYhsH19LaBzrr0m7t0PiOPrshNO7TbmgdfEvnNqxcepY-VJH6mk6_XPlOF71fu47e2MGekqPabqKb_N4xeb27fZk-ZPOn-9n0ep71Oeoh06JylZJCcC6Ag1NaasdAKeGKaok5Y6XOhZWVLVxKUFfSKUCGWgCUXHE2Juc7397G0m7qYLuyiaYPTWvD1iRQKRQycRc7LiapW7lglt6vo0EwP2Waf2Wyb6_MXuE</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>A System for Information Retrieval from Large Records of Czech Spoken Data</title><source>Springer Books</source><creator>Nouza, Jan ; Žďánský, Jindřich ; Červa, Petr ; Kolorenč, Jan</creator><contributor>Pala, Karel ; Sojka, Petr ; Kopeček, Ivan</contributor><creatorcontrib>Nouza, Jan ; Žďánský, Jindřich ; Červa, Petr ; Kolorenč, Jan ; Pala, Karel ; Sojka, Petr ; Kopeček, Ivan</creatorcontrib><description>In the paper we describe a complex multi-level system that serves for automatic search in large records of Czech spoken data. It includes modules for audio signal segmentation, speaker identification and adaptation, speech recognition and full-text search. The search can focus both on key-words and key-speakers. The transcription accuracy is about 79 % (for broadcast programs), search accuracy about 90 %. Due to its distributed platform, the system can operate in almost real-time.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 9783540390909</identifier><identifier>ISBN: 3540390901</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 354039091X</identifier><identifier>EISBN: 9783540390916</identifier><identifier>DOI: 10.1007/11846406_61</identifier><language>eng</language><publisher>Berlin, Heidelberg: Springer Berlin Heidelberg</publisher><subject>Applied sciences ; Artificial intelligence ; Audio Signal ; Automatic Speech Recognition ; Broadcast News ; Computer science; control theory; systems ; Exact sciences and technology ; Information systems. Data bases ; Memory organisation. Data processing ; Software ; Speaker Identification ; Speech and sound recognition and synthesis. Linguistics ; Speech Recognition</subject><ispartof>Lecture notes in computer science, 2006, p.485-492</ispartof><rights>Springer-Verlag Berlin Heidelberg 2006</rights><rights>2007 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/11846406_61$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/11846406_61$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,779,780,784,789,790,793,4050,4051,27925,38255,41442,42511</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=19688167$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Pala, Karel</contributor><contributor>Sojka, Petr</contributor><contributor>Kopeček, Ivan</contributor><creatorcontrib>Nouza, Jan</creatorcontrib><creatorcontrib>Žďánský, Jindřich</creatorcontrib><creatorcontrib>Červa, Petr</creatorcontrib><creatorcontrib>Kolorenč, Jan</creatorcontrib><title>A System for Information Retrieval from Large Records of Czech Spoken Data</title><title>Lecture notes in computer science</title><description>In the paper we describe a complex multi-level system that serves for automatic search in large records of Czech spoken data. It includes modules for audio signal segmentation, speaker identification and adaptation, speech recognition and full-text search. The search can focus both on key-words and key-speakers. The transcription accuracy is about 79 % (for broadcast programs), search accuracy about 90 %. Due to its distributed platform, the system can operate in almost real-time.</description><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Audio Signal</subject><subject>Automatic Speech Recognition</subject><subject>Broadcast News</subject><subject>Computer science; control theory; systems</subject><subject>Exact sciences and technology</subject><subject>Information systems. Data bases</subject><subject>Memory organisation. Data processing</subject><subject>Software</subject><subject>Speaker Identification</subject><subject>Speech and sound recognition and synthesis. Linguistics</subject><subject>Speech Recognition</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>9783540390909</isbn><isbn>3540390901</isbn><isbn>354039091X</isbn><isbn>9783540390916</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2006</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNpNkEtLAzEUheMLrLUr_0A2LlyM3jvJ5LGU-qoUBKvgLqQzmTq2MxmSQai_3kgVvIt74Z6PA-cQcoZwiQDyClFxwUEYgXvkhBUcmAaNb_tkhAIxY4zrAzLRUv1poA_JCBjkmZacHZNJjB-QhuXAinxEHq_pYhsH19LaBzrr0m7t0PiOPrshNO7TbmgdfEvnNqxcepY-VJH6mk6_XPlOF71fu47e2MGekqPabqKb_N4xeb27fZk-ZPOn-9n0ep71Oeoh06JylZJCcC6Ag1NaasdAKeGKaok5Y6XOhZWVLVxKUFfSKUCGWgCUXHE2Juc7397G0m7qYLuyiaYPTWvD1iRQKRQycRc7LiapW7lglt6vo0EwP2Waf2Wyb6_MXuE</recordid><startdate>2006</startdate><enddate>2006</enddate><creator>Nouza, Jan</creator><creator>Žďánský, Jindřich</creator><creator>Červa, Petr</creator><creator>Kolorenč, Jan</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>IQODW</scope></search><sort><creationdate>2006</creationdate><title>A System for Information Retrieval from Large Records of Czech Spoken Data</title><author>Nouza, Jan ; Žďánský, Jindřich ; Červa, Petr ; Kolorenč, Jan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p219t-96ded8766446040e8979e30886e5db1233c926a7da5e390fd7e801319600c4843</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Audio Signal</topic><topic>Automatic Speech Recognition</topic><topic>Broadcast News</topic><topic>Computer science; control theory; systems</topic><topic>Exact sciences and technology</topic><topic>Information systems. Data bases</topic><topic>Memory organisation. Data processing</topic><topic>Software</topic><topic>Speaker Identification</topic><topic>Speech and sound recognition and synthesis. Linguistics</topic><topic>Speech Recognition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nouza, Jan</creatorcontrib><creatorcontrib>Žďánský, Jindřich</creatorcontrib><creatorcontrib>Červa, Petr</creatorcontrib><creatorcontrib>Kolorenč, Jan</creatorcontrib><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Nouza, Jan</au><au>Žďánský, Jindřich</au><au>Červa, Petr</au><au>Kolorenč, Jan</au><au>Pala, Karel</au><au>Sojka, Petr</au><au>Kopeček, Ivan</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A System for Information Retrieval from Large Records of Czech Spoken Data</atitle><btitle>Lecture notes in computer science</btitle><date>2006</date><risdate>2006</risdate><spage>485</spage><epage>492</epage><pages>485-492</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>9783540390909</isbn><isbn>3540390901</isbn><eisbn>354039091X</eisbn><eisbn>9783540390916</eisbn><abstract>In the paper we describe a complex multi-level system that serves for automatic search in large records of Czech spoken data. It includes modules for audio signal segmentation, speaker identification and adaptation, speech recognition and full-text search. The search can focus both on key-words and key-speakers. The transcription accuracy is about 79 % (for broadcast programs), search accuracy about 90 %. Due to its distributed platform, the system can operate in almost real-time.</abstract><cop>Berlin, Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/11846406_61</doi><tpages>8</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0302-9743
ispartof	Lecture notes in computer science, 2006, p.485-492
issn	0302-9743 1611-3349
language	eng
recordid	cdi_pascalfrancis_primary_19688167
source	Springer Books
subjects	Applied sciences Artificial intelligence Audio Signal Automatic Speech Recognition Broadcast News Computer science control theory systems Exact sciences and technology Information systems. Data bases Memory organisation. Data processing Software Speaker Identification Speech and sound recognition and synthesis. Linguistics Speech Recognition
title	A System for Information Retrieval from Large Records of Czech Spoken Data
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T12%3A58%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_sprin&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20System%20for%20Information%20Retrieval%20from%20Large%20Records%20of%20Czech%20Spoken%20Data&rft.btitle=Lecture%20notes%20in%20computer%20science&rft.au=Nouza,%20Jan&rft.date=2006&rft.spage=485&rft.epage=492&rft.pages=485-492&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=9783540390909&rft.isbn_list=3540390901&rft_id=info:doi/10.1007/11846406_61&rft_dat=%3Cpascalfrancis_sprin%3E19688167%3C/pascalfrancis_sprin%3E%3Curl%3E%3C/url%3E&rft.eisbn=354039091X&rft.eisbn_list=9783540390916&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true