GSM speech coding and speaker recognition

This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Besacier, L., Grassi, S., Dufaux, A., Ansorge, M., Pellandini, F.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Code standards Data mining Decoding Degradation Feature extraction GSM Linear predictive coding Spatial databases Speaker recognition Speech coding
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	II1088 vol.2
container_issue
container_start_page	II1085
container_title
container_volume	2
creator	Besacier, L. Grassi, S. Dufaux, A. Ansorge, M. Pellandini, F.
description	This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.
doi_str_mv	10.1109/ICASSP.2000.859152
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_859152</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>859152</ieee_id><sourcerecordid>859152</sourcerecordid><originalsourceid>FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</originalsourceid><addsrcrecordid>eNotT01Lw0AUXPwAY-0f6ClXD4lv92V33x6laBUqClHwVja7L3X9SErSi_--kQoDA8PMMCPEQkIpJbibx-VtXb-UCgBK0k5qdSIyhdYV0sH7qZg7SzABjXKozkQ2OaAwsnIX4nIcP6cc2Yoycb2qn_Jxxxw-8tDH1G1z38U_xX_xkA8c-m2X9qnvrsR5679Hnv_zTLzd370uH4r182qasy6CMnpfaKeCstEbYEnSMwXL3rhGty23DqM13pBiZQm95EYjAVEVEZvGRLKIM7E49iZm3uyG9OOH383xJB4AWzdC2w</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>GSM speech coding and speaker recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</creator><creatorcontrib>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</creatorcontrib><description>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9780780362932</identifier><identifier>ISBN: 0780362934</identifier><identifier>EISSN: 2379-190X</identifier><identifier>DOI: 10.1109/ICASSP.2000.859152</identifier><language>eng</language><publisher>IEEE</publisher><subject>Code standards ; Data mining ; Decoding ; Degradation ; Feature extraction ; GSM ; Linear predictive coding ; Spatial databases ; Speaker recognition ; Speech coding</subject><ispartof>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.2, p.II1085-II1088 vol.2</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/859152$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,4036,4037,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/859152$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Besacier, L.</creatorcontrib><creatorcontrib>Grassi, S.</creatorcontrib><creatorcontrib>Dufaux, A.</creatorcontrib><creatorcontrib>Ansorge, M.</creatorcontrib><creatorcontrib>Pellandini, F.</creatorcontrib><title>GSM speech coding and speaker recognition</title><title>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)</title><addtitle>ICASSP</addtitle><description>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</description><subject>Code standards</subject><subject>Data mining</subject><subject>Decoding</subject><subject>Degradation</subject><subject>Feature extraction</subject><subject>GSM</subject><subject>Linear predictive coding</subject><subject>Spatial databases</subject><subject>Speaker recognition</subject><subject>Speech coding</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9780780362932</isbn><isbn>0780362934</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2000</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotT01Lw0AUXPwAY-0f6ClXD4lv92V33x6laBUqClHwVja7L3X9SErSi_--kQoDA8PMMCPEQkIpJbibx-VtXb-UCgBK0k5qdSIyhdYV0sH7qZg7SzABjXKozkQ2OaAwsnIX4nIcP6cc2Yoycb2qn_Jxxxw-8tDH1G1z38U_xX_xkA8c-m2X9qnvrsR5679Hnv_zTLzd370uH4r182qasy6CMnpfaKeCstEbYEnSMwXL3rhGty23DqM13pBiZQm95EYjAVEVEZvGRLKIM7E49iZm3uyG9OOH383xJB4AWzdC2w</recordid><startdate>2000</startdate><enddate>2000</enddate><creator>Besacier, L.</creator><creator>Grassi, S.</creator><creator>Dufaux, A.</creator><creator>Ansorge, M.</creator><creator>Pellandini, F.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>2000</creationdate><title>GSM speech coding and speaker recognition</title><author>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2000</creationdate><topic>Code standards</topic><topic>Data mining</topic><topic>Decoding</topic><topic>Degradation</topic><topic>Feature extraction</topic><topic>GSM</topic><topic>Linear predictive coding</topic><topic>Spatial databases</topic><topic>Speaker recognition</topic><topic>Speech coding</topic><toplevel>online_resources</toplevel><creatorcontrib>Besacier, L.</creatorcontrib><creatorcontrib>Grassi, S.</creatorcontrib><creatorcontrib>Dufaux, A.</creatorcontrib><creatorcontrib>Ansorge, M.</creatorcontrib><creatorcontrib>Pellandini, F.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Besacier, L.</au><au>Grassi, S.</au><au>Dufaux, A.</au><au>Ansorge, M.</au><au>Pellandini, F.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>GSM speech coding and speaker recognition</atitle><btitle>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)</btitle><stitle>ICASSP</stitle><date>2000</date><risdate>2000</risdate><volume>2</volume><spage>II1085</spage><epage>II1088 vol.2</epage><pages>II1085-II1088 vol.2</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9780780362932</isbn><isbn>0780362934</isbn><abstract>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2000.859152</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-6149
ispartof	2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.2, p.II1085-II1088 vol.2
issn	1520-6149 2379-190X
language	eng
recordid	cdi_ieee_primary_859152
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Code standards Data mining Decoding Degradation Feature extraction GSM Linear predictive coding Spatial databases Speaker recognition Speech coding
title	GSM speech coding and speaker recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T20%3A22%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=GSM%20speech%20coding%20and%20speaker%20recognition&rft.btitle=2000%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing.%20Proceedings%20(Cat.%20No.00CH37100)&rft.au=Besacier,%20L.&rft.date=2000&rft.volume=2&rft.spage=II1085&rft.epage=II1088%20vol.2&rft.pages=II1085-II1088%20vol.2&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780362932&rft.isbn_list=0780362934&rft_id=info:doi/10.1109/ICASSP.2000.859152&rft_dat=%3Cieee_6IE%3E859152%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=859152&rfr_iscdi=true