GSM speech coding and speaker recognition

This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Besacier, L., Grassi, S., Dufaux, A., Ansorge, M., Pellandini, F.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page II1088 vol.2
container_issue
container_start_page II1085
container_title
container_volume 2
creator Besacier, L.
Grassi, S.
Dufaux, A.
Ansorge, M.
Pellandini, F.
description This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.
doi_str_mv 10.1109/ICASSP.2000.859152
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_859152</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>859152</ieee_id><sourcerecordid>859152</sourcerecordid><originalsourceid>FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</originalsourceid><addsrcrecordid>eNotT01Lw0AUXPwAY-0f6ClXD4lv92V33x6laBUqClHwVja7L3X9SErSi_--kQoDA8PMMCPEQkIpJbibx-VtXb-UCgBK0k5qdSIyhdYV0sH7qZg7SzABjXKozkQ2OaAwsnIX4nIcP6cc2Yoycb2qn_Jxxxw-8tDH1G1z38U_xX_xkA8c-m2X9qnvrsR5679Hnv_zTLzd370uH4r182qasy6CMnpfaKeCstEbYEnSMwXL3rhGty23DqM13pBiZQm95EYjAVEVEZvGRLKIM7E49iZm3uyG9OOH383xJB4AWzdC2w</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>GSM speech coding and speaker recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</creator><creatorcontrib>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</creatorcontrib><description>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9780780362932</identifier><identifier>ISBN: 0780362934</identifier><identifier>EISSN: 2379-190X</identifier><identifier>DOI: 10.1109/ICASSP.2000.859152</identifier><language>eng</language><publisher>IEEE</publisher><subject>Code standards ; Data mining ; Decoding ; Degradation ; Feature extraction ; GSM ; Linear predictive coding ; Spatial databases ; Speaker recognition ; Speech coding</subject><ispartof>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.2, p.II1085-II1088 vol.2</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/859152$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,4036,4037,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/859152$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Besacier, L.</creatorcontrib><creatorcontrib>Grassi, S.</creatorcontrib><creatorcontrib>Dufaux, A.</creatorcontrib><creatorcontrib>Ansorge, M.</creatorcontrib><creatorcontrib>Pellandini, F.</creatorcontrib><title>GSM speech coding and speaker recognition</title><title>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)</title><addtitle>ICASSP</addtitle><description>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</description><subject>Code standards</subject><subject>Data mining</subject><subject>Decoding</subject><subject>Degradation</subject><subject>Feature extraction</subject><subject>GSM</subject><subject>Linear predictive coding</subject><subject>Spatial databases</subject><subject>Speaker recognition</subject><subject>Speech coding</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9780780362932</isbn><isbn>0780362934</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2000</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotT01Lw0AUXPwAY-0f6ClXD4lv92V33x6laBUqClHwVja7L3X9SErSi_--kQoDA8PMMCPEQkIpJbibx-VtXb-UCgBK0k5qdSIyhdYV0sH7qZg7SzABjXKozkQ2OaAwsnIX4nIcP6cc2Yoycb2qn_Jxxxw-8tDH1G1z38U_xX_xkA8c-m2X9qnvrsR5679Hnv_zTLzd370uH4r182qasy6CMnpfaKeCstEbYEnSMwXL3rhGty23DqM13pBiZQm95EYjAVEVEZvGRLKIM7E49iZm3uyG9OOH383xJB4AWzdC2w</recordid><startdate>2000</startdate><enddate>2000</enddate><creator>Besacier, L.</creator><creator>Grassi, S.</creator><creator>Dufaux, A.</creator><creator>Ansorge, M.</creator><creator>Pellandini, F.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>2000</creationdate><title>GSM speech coding and speaker recognition</title><author>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2000</creationdate><topic>Code standards</topic><topic>Data mining</topic><topic>Decoding</topic><topic>Degradation</topic><topic>Feature extraction</topic><topic>GSM</topic><topic>Linear predictive coding</topic><topic>Spatial databases</topic><topic>Speaker recognition</topic><topic>Speech coding</topic><toplevel>online_resources</toplevel><creatorcontrib>Besacier, L.</creatorcontrib><creatorcontrib>Grassi, S.</creatorcontrib><creatorcontrib>Dufaux, A.</creatorcontrib><creatorcontrib>Ansorge, M.</creatorcontrib><creatorcontrib>Pellandini, F.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Besacier, L.</au><au>Grassi, S.</au><au>Dufaux, A.</au><au>Ansorge, M.</au><au>Pellandini, F.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>GSM speech coding and speaker recognition</atitle><btitle>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)</btitle><stitle>ICASSP</stitle><date>2000</date><risdate>2000</risdate><volume>2</volume><spage>II1085</spage><epage>II1088 vol.2</epage><pages>II1085-II1088 vol.2</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9780780362932</isbn><isbn>0780362934</isbn><abstract>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2000.859152</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1520-6149
ispartof 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.2, p.II1085-II1088 vol.2
issn 1520-6149
2379-190X
language eng
recordid cdi_ieee_primary_859152
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Code standards
Data mining
Decoding
Degradation
Feature extraction
GSM
Linear predictive coding
Spatial databases
Speaker recognition
Speech coding
title GSM speech coding and speaker recognition
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T20%3A22%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=GSM%20speech%20coding%20and%20speaker%20recognition&rft.btitle=2000%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing.%20Proceedings%20(Cat.%20No.00CH37100)&rft.au=Besacier,%20L.&rft.date=2000&rft.volume=2&rft.spage=II1085&rft.epage=II1088%20vol.2&rft.pages=II1085-II1088%20vol.2&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780362932&rft.isbn_list=0780362934&rft_id=info:doi/10.1109/ICASSP.2000.859152&rft_dat=%3Cieee_6IE%3E859152%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=859152&rfr_iscdi=true