The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction

In this paper we present work that has been carried out in developing the ETSI Extended DSR standards ES 202 211 and ES 202 212. These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enh...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Ramabadran, T., Sorin, A., McLaughlin, M., Chazan, D., Pearce, D., Hoory, R.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Applied sciences Biological and medical sciences Code standards Coding, codes Computerized, statistical medical data processing and models in biomedicine Exact sciences and technology Humans Information, signal and communications theory Medical management aid. Diagnosis aid Medical sciences Natural languages Noise robustness Reconstruction algorithms Signal and communications theory Signal processing Speech analysis Speech processing Speech recognition Standards development Telecommunication standards Telecommunications and information theory Testing
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	53
container_issue
container_start_page	I
container_title
container_volume	1
creator	Ramabadran, T. Sorin, A. McLaughlin, M. Chazan, D. Pearce, D. Hoory, R.
description	In this paper we present work that has been carried out in developing the ETSI Extended DSR standards ES 202 211 and ES 202 212. These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction capability. This paper discusses the server-side speech reconstruction whereas a companion paper discusses the front-end extension and tonal language recognition. Experimental results show that the reconstructed speech produced by the standards is highly intelligible under clean and noisy background conditions with the DRT (diagnostic rhyme test) and TT (transcription test) scores meeting or exceeding the objective values corresponding to the USA DoD (Department of Defence) federal standard MELP (mixed-excitation linear predictive) coder operating at 2400 bit/s.
doi_str_mv	10.1109/ICASSP.2004.1325920
format	Conference Proceeding
fullrecord	<record><control><sourceid>pascalfrancis_6IE</sourceid><recordid>TN_cdi_ieee_primary_1325920</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1325920</ieee_id><sourcerecordid>17565854</sourcerecordid><originalsourceid>FETCH-LOGICAL-i504-3f57cefa9e82817ceff27581659a53d71d4e6af1aed0516c7382dc31b1c0fee33</originalsourceid><addsrcrecordid>eNpNkM1LAzEQxYMfYK39C3rZi6CHrZl8bBJv0lYtFBR3D95KmszaSN2WZCv637ulgsLAPJjfe_CGkCHQEQA1N7PxXVk-jxilYgScScPoEekxrkwOhr4ek4FRmnbDtdCCnZAeSEbzAoQ5I-cpvVNKtRK6R1y1wmxalbMMv1psPPrMh9TGsNy1nU5bRLfKIrrNWxPasGmyq0n5cp2l1jbeRp9us4TxE2Oegsf_fNOl7NzeckFOa7tOOPjdfVLdT6vxYz5_euiazPMgqch5LZXD2hrUTMNe1kxJDYU0VnKvwAssbA0WPZVQOMU1847DEhytETnvk8tD7NYmZ9d1tI0LabGN4cPG7wUoWUgtRccND1xAxL_z4Y38B13uZcQ</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Ramabadran, T. ; Sorin, A. ; McLaughlin, M. ; Chazan, D. ; Pearce, D. ; Hoory, R.</creator><creatorcontrib>Ramabadran, T. ; Sorin, A. ; McLaughlin, M. ; Chazan, D. ; Pearce, D. ; Hoory, R.</creatorcontrib><description>In this paper we present work that has been carried out in developing the ETSI Extended DSR standards ES 202 211 and ES 202 212. These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction capability. This paper discusses the server-side speech reconstruction whereas a companion paper discusses the front-end extension and tonal language recognition. Experimental results show that the reconstructed speech produced by the standards is highly intelligible under clean and noisy background conditions with the DRT (diagnostic rhyme test) and TT (transcription test) scores meeting or exceeding the objective values corresponding to the USA DoD (Department of Defence) federal standard MELP (mixed-excitation linear predictive) coder operating at 2400 bit/s.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9780780384842</identifier><identifier>ISBN: 0780384849</identifier><identifier>EISSN: 2379-190X</identifier><identifier>DOI: 10.1109/ICASSP.2004.1325920</identifier><language>eng</language><publisher>Piscataway, N.J: IEEE</publisher><subject>Applied sciences ; Biological and medical sciences ; Code standards ; Coding, codes ; Computerized, statistical medical data processing and models in biomedicine ; Exact sciences and technology ; Humans ; Information, signal and communications theory ; Medical management aid. Diagnosis aid ; Medical sciences ; Natural languages ; Noise robustness ; Reconstruction algorithms ; Signal and communications theory ; Signal processing ; Speech analysis ; Speech processing ; Speech recognition ; Standards development ; Telecommunication standards ; Telecommunications and information theory ; Testing</subject><ispartof>2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004, Vol.1, p.I-53</ispartof><rights>2006 INIST-CNRS</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1325920$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,777,781,786,787,2052,4036,4037,27906,54901</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1325920$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17565854$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Ramabadran, T.</creatorcontrib><creatorcontrib>Sorin, A.</creatorcontrib><creatorcontrib>McLaughlin, M.</creatorcontrib><creatorcontrib>Chazan, D.</creatorcontrib><creatorcontrib>Pearce, D.</creatorcontrib><creatorcontrib>Hoory, R.</creatorcontrib><title>The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction</title><title>2004 IEEE International Conference on Acoustics, Speech, and Signal Processing</title><addtitle>ICASSP</addtitle><description>In this paper we present work that has been carried out in developing the ETSI Extended DSR standards ES 202 211 and ES 202 212. These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction capability. This paper discusses the server-side speech reconstruction whereas a companion paper discusses the front-end extension and tonal language recognition. Experimental results show that the reconstructed speech produced by the standards is highly intelligible under clean and noisy background conditions with the DRT (diagnostic rhyme test) and TT (transcription test) scores meeting or exceeding the objective values corresponding to the USA DoD (Department of Defence) federal standard MELP (mixed-excitation linear predictive) coder operating at 2400 bit/s.</description><subject>Applied sciences</subject><subject>Biological and medical sciences</subject><subject>Code standards</subject><subject>Coding, codes</subject><subject>Computerized, statistical medical data processing and models in biomedicine</subject><subject>Exact sciences and technology</subject><subject>Humans</subject><subject>Information, signal and communications theory</subject><subject>Medical management aid. Diagnosis aid</subject><subject>Medical sciences</subject><subject>Natural languages</subject><subject>Noise robustness</subject><subject>Reconstruction algorithms</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Speech analysis</subject><subject>Speech processing</subject><subject>Speech recognition</subject><subject>Standards development</subject><subject>Telecommunication standards</subject><subject>Telecommunications and information theory</subject><subject>Testing</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9780780384842</isbn><isbn>0780384849</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2004</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpNkM1LAzEQxYMfYK39C3rZi6CHrZl8bBJv0lYtFBR3D95KmszaSN2WZCv637ulgsLAPJjfe_CGkCHQEQA1N7PxXVk-jxilYgScScPoEekxrkwOhr4ek4FRmnbDtdCCnZAeSEbzAoQ5I-cpvVNKtRK6R1y1wmxalbMMv1psPPrMh9TGsNy1nU5bRLfKIrrNWxPasGmyq0n5cp2l1jbeRp9us4TxE2Oegsf_fNOl7NzeckFOa7tOOPjdfVLdT6vxYz5_euiazPMgqch5LZXD2hrUTMNe1kxJDYU0VnKvwAssbA0WPZVQOMU1847DEhytETnvk8tD7NYmZ9d1tI0LabGN4cPG7wUoWUgtRccND1xAxL_z4Y38B13uZcQ</recordid><startdate>2004</startdate><enddate>2004</enddate><creator>Ramabadran, T.</creator><creator>Sorin, A.</creator><creator>McLaughlin, M.</creator><creator>Chazan, D.</creator><creator>Pearce, D.</creator><creator>Hoory, R.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope><scope>IQODW</scope></search><sort><creationdate>2004</creationdate><title>The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction</title><author>Ramabadran, T. ; Sorin, A. ; McLaughlin, M. ; Chazan, D. ; Pearce, D. ; Hoory, R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i504-3f57cefa9e82817ceff27581659a53d71d4e6af1aed0516c7382dc31b1c0fee33</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Biological and medical sciences</topic><topic>Code standards</topic><topic>Coding, codes</topic><topic>Computerized, statistical medical data processing and models in biomedicine</topic><topic>Exact sciences and technology</topic><topic>Humans</topic><topic>Information, signal and communications theory</topic><topic>Medical management aid. Diagnosis aid</topic><topic>Medical sciences</topic><topic>Natural languages</topic><topic>Noise robustness</topic><topic>Reconstruction algorithms</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Speech analysis</topic><topic>Speech processing</topic><topic>Speech recognition</topic><topic>Standards development</topic><topic>Telecommunication standards</topic><topic>Telecommunications and information theory</topic><topic>Testing</topic><toplevel>online_resources</toplevel><creatorcontrib>Ramabadran, T.</creatorcontrib><creatorcontrib>Sorin, A.</creatorcontrib><creatorcontrib>McLaughlin, M.</creatorcontrib><creatorcontrib>Chazan, D.</creatorcontrib><creatorcontrib>Pearce, D.</creatorcontrib><creatorcontrib>Hoory, R.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ramabadran, T.</au><au>Sorin, A.</au><au>McLaughlin, M.</au><au>Chazan, D.</au><au>Pearce, D.</au><au>Hoory, R.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction</atitle><btitle>2004 IEEE International Conference on Acoustics, Speech, and Signal Processing</btitle><stitle>ICASSP</stitle><date>2004</date><risdate>2004</risdate><volume>1</volume><spage>I</spage><epage>53</epage><pages>I-53</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9780780384842</isbn><isbn>0780384849</isbn><abstract>In this paper we present work that has been carried out in developing the ETSI Extended DSR standards ES 202 211 and ES 202 212. These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction capability. This paper discusses the server-side speech reconstruction whereas a companion paper discusses the front-end extension and tonal language recognition. Experimental results show that the reconstructed speech produced by the standards is highly intelligible under clean and noisy background conditions with the DRT (diagnostic rhyme test) and TT (transcription test) scores meeting or exceeding the objective values corresponding to the USA DoD (Department of Defence) federal standard MELP (mixed-excitation linear predictive) coder operating at 2400 bit/s.</abstract><cop>Piscataway, N.J</cop><pub>IEEE</pub><doi>10.1109/ICASSP.2004.1325920</doi></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-6149
ispartof	2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004, Vol.1, p.I-53
issn	1520-6149 2379-190X
language	eng
recordid	cdi_ieee_primary_1325920
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Applied sciences Biological and medical sciences Code standards Coding, codes Computerized, statistical medical data processing and models in biomedicine Exact sciences and technology Humans Information, signal and communications theory Medical management aid. Diagnosis aid Medical sciences Natural languages Noise robustness Reconstruction algorithms Signal and communications theory Signal processing Speech analysis Speech processing Speech recognition Standards development Telecommunication standards Telecommunications and information theory Testing
title	The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T16%3A16%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=The%20ETSI%20extended%20distributed%20speech%20recognition%20(DSR)%20standards:%20server-side%20speech%20reconstruction&rft.btitle=2004%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing&rft.au=Ramabadran,%20T.&rft.date=2004&rft.volume=1&rft.spage=I&rft.epage=53&rft.pages=I-53&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780384842&rft.isbn_list=0780384849&rft_id=info:doi/10.1109/ICASSP.2004.1325920&rft_dat=%3Cpascalfrancis_6IE%3E17565854%3C/pascalfrancis_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=1325920&rfr_iscdi=true