Compact Representation of Speech Using 2-D Cepstrum – An Application to Slovak Digits Recognition

HMM speech recogniser with a small number of acoustic observations based on 2-D cepstrum (TDC) is proposed. TDC represents both static and dynamic features of speech implicitly in matrix form. It is shown that TDC analysis enables a compact representation of speech signals. Thus a great advantage of...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Jarina, Roman, Kuba, Michal, Paralic, Martin
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Acoustic signal processing Acoustics Applied sciences Artificial intelligence Automatic Speech Recognition Compact Representation Computer science control theory systems Discrete Cosine Transform Exact sciences and technology Fundamental areas of phenomenology (including applications) Physics Speech and sound recognition and synthesis. Linguistics Speech Recognition Speech Signal
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	347
container_issue
container_start_page	342
container_title
container_volume
creator	Jarina, Roman Kuba, Michal Paralic, Martin
description	HMM speech recogniser with a small number of acoustic observations based on 2-D cepstrum (TDC) is proposed. TDC represents both static and dynamic features of speech implicitly in matrix form. It is shown that TDC analysis enables a compact representation of speech signals. Thus a great advantage of the proposed model is a massive reduction of speech features used for recognition what lessens computational and memory requirements, so it may be favourable for limited-power ASR applications. Experiments on isolated Slovak digits recognition task show that the method gives comparable results as the conventional MFCC approach. For speech degraded by additive white noise, it reaches better performance than the MFCC method.
doi_str_mv	10.1007/11551874_44
format	Conference Proceeding
fullrecord	<record><control><sourceid>pascalfrancis_sprin</sourceid><recordid>TN_cdi_pascalfrancis_primary_17115565</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>17115565</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-9be108de584763ea9d2ec0065db31866e78c84787834faa3215018cf1f8ca6b13</originalsourceid><addsrcrecordid>eNpNkLtOwzAYhc1NopROvIAXBoaAf9uxnbFKuUmVkCidI8d1gmkaW3GoxMY78IY8CamKENMZvqMjnQ-hCyDXQIi8AUhTUJIXnB-gSSYVSzlhoECSQzQCAZAwxrOjP0aVVJk4RiPCCE0yydkpOovxjRBCZUZHyOR-E7Tp8bMNnY227XXvfIt9hRfBWvOKl9G1NabJDOc2xL573-Dvzy88bfE0hMaZfb_3eNH4rV7jmatdH4c94-vW7eA5Oql0E-3kN8doeXf7kj8k86f7x3w6TwwV0CdZaYGolU0Vl4JZna2oNYSIdFUOF4WwUpkBDY8Yr7RmFFICylRQKaNFCWyMLve7QUejm6rTrXGxCJ3b6O6jALnTJ9Khd7XvxQG1te2K0vt1LIAUO8vFP8vsBzkoabM</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Compact Representation of Speech Using 2-D Cepstrum – An Application to Slovak Digits Recognition</title><source>Springer Books</source><creator>Jarina, Roman ; Kuba, Michal ; Paralic, Martin</creator><contributor>Pavelka, Tomáš ; Matoušek, Václav ; Mautner, Pavel</contributor><creatorcontrib>Jarina, Roman ; Kuba, Michal ; Paralic, Martin ; Pavelka, Tomáš ; Matoušek, Václav ; Mautner, Pavel</creatorcontrib><description>HMM speech recogniser with a small number of acoustic observations based on 2-D cepstrum (TDC) is proposed. TDC represents both static and dynamic features of speech implicitly in matrix form. It is shown that TDC analysis enables a compact representation of speech signals. Thus a great advantage of the proposed model is a massive reduction of speech features used for recognition what lessens computational and memory requirements, so it may be favourable for limited-power ASR applications. Experiments on isolated Slovak digits recognition task show that the method gives comparable results as the conventional MFCC approach. For speech degraded by additive white noise, it reaches better performance than the MFCC method.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 9783540287896</identifier><identifier>ISBN: 3540287892</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 9783540318170</identifier><identifier>EISBN: 3540318178</identifier><identifier>DOI: 10.1007/11551874_44</identifier><language>eng</language><publisher>Berlin, Heidelberg: Springer Berlin Heidelberg</publisher><subject>Acoustic signal processing ; Acoustics ; Applied sciences ; Artificial intelligence ; Automatic Speech Recognition ; Compact Representation ; Computer science; control theory; systems ; Discrete Cosine Transform ; Exact sciences and technology ; Fundamental areas of phenomenology (including applications) ; Physics ; Speech and sound recognition and synthesis. Linguistics ; Speech Recognition ; Speech Signal</subject><ispartof>Lecture notes in computer science, 2005, p.342-347</ispartof><rights>Springer-Verlag Berlin Heidelberg 2005</rights><rights>2005 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c261t-9be108de584763ea9d2ec0065db31866e78c84787834faa3215018cf1f8ca6b13</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/11551874_44$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/11551874_44$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,775,776,780,785,786,789,4036,4037,27902,38232,41418,42487</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17115565$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Pavelka, Tomáš</contributor><contributor>Matoušek, Václav</contributor><contributor>Mautner, Pavel</contributor><creatorcontrib>Jarina, Roman</creatorcontrib><creatorcontrib>Kuba, Michal</creatorcontrib><creatorcontrib>Paralic, Martin</creatorcontrib><title>Compact Representation of Speech Using 2-D Cepstrum – An Application to Slovak Digits Recognition</title><title>Lecture notes in computer science</title><description>HMM speech recogniser with a small number of acoustic observations based on 2-D cepstrum (TDC) is proposed. TDC represents both static and dynamic features of speech implicitly in matrix form. It is shown that TDC analysis enables a compact representation of speech signals. Thus a great advantage of the proposed model is a massive reduction of speech features used for recognition what lessens computational and memory requirements, so it may be favourable for limited-power ASR applications. Experiments on isolated Slovak digits recognition task show that the method gives comparable results as the conventional MFCC approach. For speech degraded by additive white noise, it reaches better performance than the MFCC method.</description><subject>Acoustic signal processing</subject><subject>Acoustics</subject><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Automatic Speech Recognition</subject><subject>Compact Representation</subject><subject>Computer science; control theory; systems</subject><subject>Discrete Cosine Transform</subject><subject>Exact sciences and technology</subject><subject>Fundamental areas of phenomenology (including applications)</subject><subject>Physics</subject><subject>Speech and sound recognition and synthesis. Linguistics</subject><subject>Speech Recognition</subject><subject>Speech Signal</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>9783540287896</isbn><isbn>3540287892</isbn><isbn>9783540318170</isbn><isbn>3540318178</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2005</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNpNkLtOwzAYhc1NopROvIAXBoaAf9uxnbFKuUmVkCidI8d1gmkaW3GoxMY78IY8CamKENMZvqMjnQ-hCyDXQIi8AUhTUJIXnB-gSSYVSzlhoECSQzQCAZAwxrOjP0aVVJk4RiPCCE0yydkpOovxjRBCZUZHyOR-E7Tp8bMNnY227XXvfIt9hRfBWvOKl9G1NabJDOc2xL573-Dvzy88bfE0hMaZfb_3eNH4rV7jmatdH4c94-vW7eA5Oql0E-3kN8doeXf7kj8k86f7x3w6TwwV0CdZaYGolU0Vl4JZna2oNYSIdFUOF4WwUpkBDY8Yr7RmFFICylRQKaNFCWyMLve7QUejm6rTrXGxCJ3b6O6jALnTJ9Khd7XvxQG1te2K0vt1LIAUO8vFP8vsBzkoabM</recordid><startdate>2005</startdate><enddate>2005</enddate><creator>Jarina, Roman</creator><creator>Kuba, Michal</creator><creator>Paralic, Martin</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>IQODW</scope></search><sort><creationdate>2005</creationdate><title>Compact Representation of Speech Using 2-D Cepstrum – An Application to Slovak Digits Recognition</title><author>Jarina, Roman ; Kuba, Michal ; Paralic, Martin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-9be108de584763ea9d2ec0065db31866e78c84787834faa3215018cf1f8ca6b13</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Acoustic signal processing</topic><topic>Acoustics</topic><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Automatic Speech Recognition</topic><topic>Compact Representation</topic><topic>Computer science; control theory; systems</topic><topic>Discrete Cosine Transform</topic><topic>Exact sciences and technology</topic><topic>Fundamental areas of phenomenology (including applications)</topic><topic>Physics</topic><topic>Speech and sound recognition and synthesis. Linguistics</topic><topic>Speech Recognition</topic><topic>Speech Signal</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jarina, Roman</creatorcontrib><creatorcontrib>Kuba, Michal</creatorcontrib><creatorcontrib>Paralic, Martin</creatorcontrib><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jarina, Roman</au><au>Kuba, Michal</au><au>Paralic, Martin</au><au>Pavelka, Tomáš</au><au>Matoušek, Václav</au><au>Mautner, Pavel</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Compact Representation of Speech Using 2-D Cepstrum – An Application to Slovak Digits Recognition</atitle><btitle>Lecture notes in computer science</btitle><date>2005</date><risdate>2005</risdate><spage>342</spage><epage>347</epage><pages>342-347</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>9783540287896</isbn><isbn>3540287892</isbn><eisbn>9783540318170</eisbn><eisbn>3540318178</eisbn><abstract>HMM speech recogniser with a small number of acoustic observations based on 2-D cepstrum (TDC) is proposed. TDC represents both static and dynamic features of speech implicitly in matrix form. It is shown that TDC analysis enables a compact representation of speech signals. Thus a great advantage of the proposed model is a massive reduction of speech features used for recognition what lessens computational and memory requirements, so it may be favourable for limited-power ASR applications. Experiments on isolated Slovak digits recognition task show that the method gives comparable results as the conventional MFCC approach. For speech degraded by additive white noise, it reaches better performance than the MFCC method.</abstract><cop>Berlin, Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/11551874_44</doi><tpages>6</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0302-9743
ispartof	Lecture notes in computer science, 2005, p.342-347
issn	0302-9743 1611-3349
language	eng
recordid	cdi_pascalfrancis_primary_17115565
source	Springer Books
subjects	Acoustic signal processing Acoustics Applied sciences Artificial intelligence Automatic Speech Recognition Compact Representation Computer science control theory systems Discrete Cosine Transform Exact sciences and technology Fundamental areas of phenomenology (including applications) Physics Speech and sound recognition and synthesis. Linguistics Speech Recognition Speech Signal
title	Compact Representation of Speech Using 2-D Cepstrum – An Application to Slovak Digits Recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T20%3A08%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_sprin&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Compact%20Representation%20of%20Speech%20Using%202-D%20Cepstrum%20%E2%80%93%20An%20Application%20to%20Slovak%20Digits%20Recognition&rft.btitle=Lecture%20notes%20in%20computer%20science&rft.au=Jarina,%20Roman&rft.date=2005&rft.spage=342&rft.epage=347&rft.pages=342-347&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=9783540287896&rft.isbn_list=3540287892&rft_id=info:doi/10.1007/11551874_44&rft_dat=%3Cpascalfrancis_sprin%3E17115565%3C/pascalfrancis_sprin%3E%3Curl%3E%3C/url%3E&rft.eisbn=9783540318170&rft.eisbn_list=3540318178&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true