GSM speech coding and speaker recognition
This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | II1088 vol.2 |
---|---|
container_issue | |
container_start_page | II1085 |
container_title | |
container_volume | 2 |
creator | Besacier, L. Grassi, S. Dufaux, A. Ansorge, M. Pellandini, F. |
description | This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition. |
doi_str_mv | 10.1109/ICASSP.2000.859152 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_859152</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>859152</ieee_id><sourcerecordid>859152</sourcerecordid><originalsourceid>FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</originalsourceid><addsrcrecordid>eNotT01Lw0AUXPwAY-0f6ClXD4lv92V33x6laBUqClHwVja7L3X9SErSi_--kQoDA8PMMCPEQkIpJbibx-VtXb-UCgBK0k5qdSIyhdYV0sH7qZg7SzABjXKozkQ2OaAwsnIX4nIcP6cc2Yoycb2qn_Jxxxw-8tDH1G1z38U_xX_xkA8c-m2X9qnvrsR5679Hnv_zTLzd370uH4r182qasy6CMnpfaKeCstEbYEnSMwXL3rhGty23DqM13pBiZQm95EYjAVEVEZvGRLKIM7E49iZm3uyG9OOH383xJB4AWzdC2w</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>GSM speech coding and speaker recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</creator><creatorcontrib>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</creatorcontrib><description>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9780780362932</identifier><identifier>ISBN: 0780362934</identifier><identifier>EISSN: 2379-190X</identifier><identifier>DOI: 10.1109/ICASSP.2000.859152</identifier><language>eng</language><publisher>IEEE</publisher><subject>Code standards ; Data mining ; Decoding ; Degradation ; Feature extraction ; GSM ; Linear predictive coding ; Spatial databases ; Speaker recognition ; Speech coding</subject><ispartof>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.2, p.II1085-II1088 vol.2</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/859152$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,4036,4037,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/859152$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Besacier, L.</creatorcontrib><creatorcontrib>Grassi, S.</creatorcontrib><creatorcontrib>Dufaux, A.</creatorcontrib><creatorcontrib>Ansorge, M.</creatorcontrib><creatorcontrib>Pellandini, F.</creatorcontrib><title>GSM speech coding and speaker recognition</title><title>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)</title><addtitle>ICASSP</addtitle><description>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</description><subject>Code standards</subject><subject>Data mining</subject><subject>Decoding</subject><subject>Degradation</subject><subject>Feature extraction</subject><subject>GSM</subject><subject>Linear predictive coding</subject><subject>Spatial databases</subject><subject>Speaker recognition</subject><subject>Speech coding</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9780780362932</isbn><isbn>0780362934</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2000</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotT01Lw0AUXPwAY-0f6ClXD4lv92V33x6laBUqClHwVja7L3X9SErSi_--kQoDA8PMMCPEQkIpJbibx-VtXb-UCgBK0k5qdSIyhdYV0sH7qZg7SzABjXKozkQ2OaAwsnIX4nIcP6cc2Yoycb2qn_Jxxxw-8tDH1G1z38U_xX_xkA8c-m2X9qnvrsR5679Hnv_zTLzd370uH4r182qasy6CMnpfaKeCstEbYEnSMwXL3rhGty23DqM13pBiZQm95EYjAVEVEZvGRLKIM7E49iZm3uyG9OOH383xJB4AWzdC2w</recordid><startdate>2000</startdate><enddate>2000</enddate><creator>Besacier, L.</creator><creator>Grassi, S.</creator><creator>Dufaux, A.</creator><creator>Ansorge, M.</creator><creator>Pellandini, F.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>2000</creationdate><title>GSM speech coding and speaker recognition</title><author>Besacier, L. ; Grassi, S. ; Dufaux, A. ; Ansorge, M. ; Pellandini, F.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c265t-592c27da60e181ae8c7ea69b5ffef93d76a682e2783a1eb5380884d33bb6d8733</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2000</creationdate><topic>Code standards</topic><topic>Data mining</topic><topic>Decoding</topic><topic>Degradation</topic><topic>Feature extraction</topic><topic>GSM</topic><topic>Linear predictive coding</topic><topic>Spatial databases</topic><topic>Speaker recognition</topic><topic>Speech coding</topic><toplevel>online_resources</toplevel><creatorcontrib>Besacier, L.</creatorcontrib><creatorcontrib>Grassi, S.</creatorcontrib><creatorcontrib>Dufaux, A.</creatorcontrib><creatorcontrib>Ansorge, M.</creatorcontrib><creatorcontrib>Pellandini, F.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Besacier, L.</au><au>Grassi, S.</au><au>Dufaux, A.</au><au>Ansorge, M.</au><au>Pellandini, F.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>GSM speech coding and speaker recognition</atitle><btitle>2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)</btitle><stitle>ICASSP</stitle><date>2000</date><risdate>2000</risdate><volume>2</volume><spage>II1085</spage><epage>II1088 vol.2</epage><pages>II1085-II1088 vol.2</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9780780362932</isbn><isbn>0780362934</isbn><abstract>This paper investigates the influence of GSM speech coding on text independent speaker recognition performance. The three existing GSM speech coder standards were considered. The whole TIMIT database was passed through these coders, obtaining three transcoded databases. In a first experiment, it was found that the use of GSM coding degrades significantly the identification and verification performance (performance in correspondence with the perceptual speech quality of each coder). In a second experiment, the features for the speaker recognition system were calculated directly from the information available in the encoded bit stream. It was found that a low LPC order in GSM coding is responsible for most performance degradations. By extracting the features directly from the encoded bit-stream, we also managed to obtain a speaker recognition system equivalent in performance to the original one which decodes and reanalyzes speech before performing recognition.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2000.859152</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1520-6149 |
ispartof | 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), 2000, Vol.2, p.II1085-II1088 vol.2 |
issn | 1520-6149 2379-190X |
language | eng |
recordid | cdi_ieee_primary_859152 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Code standards Data mining Decoding Degradation Feature extraction GSM Linear predictive coding Spatial databases Speaker recognition Speech coding |
title | GSM speech coding and speaker recognition |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T20%3A22%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=GSM%20speech%20coding%20and%20speaker%20recognition&rft.btitle=2000%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing.%20Proceedings%20(Cat.%20No.00CH37100)&rft.au=Besacier,%20L.&rft.date=2000&rft.volume=2&rft.spage=II1085&rft.epage=II1088%20vol.2&rft.pages=II1085-II1088%20vol.2&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780362932&rft.isbn_list=0780362934&rft_id=info:doi/10.1109/ICASSP.2000.859152&rft_dat=%3Cieee_6IE%3E859152%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=859152&rfr_iscdi=true |