Pitch-controlled variable bitrate CELP speech coding

Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of the Acoustical Society of America 1998-05, Vol.103 (5_Supplement), p.2777-2777
Hauptverfasser: Oberhofer, Robert, Owens, Frank
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2777
container_issue 5_Supplement
container_start_page 2777
container_title The Journal of the Acoustical Society of America
container_volume 103
creator Oberhofer, Robert
Owens, Frank
description Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of speech coders feasible, which adapt to the changing properties of speech for improved efficiency or quality. Variable bit-rate coding has therefore become the focus of considerable research activity in recent years [A. Gersho and E. Paksoy, ‘‘Variable Bit Rate Coding,’’ Signal Processing VII, Theories and Applications, EUSIPCO 1994, pp. 1169–1173; L. Zhang et al., ‘‘A CELP Variable Rate Speech Codec with Low Average Rate,’’ ICASSP 1997, pp. 735–738; B. C. Xydeas, ‘‘Source Driven Variable Bit Rate Prototype Interpolation Coding,’’ ICASSP 1996, p. 220; B. Shen et al., ‘‘A Robust Variable-Rate Speech Coder,’’ ICASSP 1995, p. 249]. While all of these coders focus on classification procedures for efficient bit allocation, the analysis frame size remains mainly static. The coder featured in this paper adapts the analysis frame size and bit allocation according to the pitch of the signal. A pitch frame extractor and classifier at the front end feeds the detected frame into the speech coding back end, which uses traditional CELP-based techniques [B. S. Schroeder et al., ‘‘CodeExcited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates,’’ ICASSP 1985, pp. 937–940]. This approach allows the reduction of the bit rate from a constant 4800 bps (bits per second) with original CELP to typically 3200–4000 bps, depending on the speech situation. This figure is even reduced to typically 2600–3200 with the inclusion of Voice Activity Detection. The speech quality equals the quality of the original CELP coder.
doi_str_mv 10.1121/1.422244
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1121_1_422244</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1121_1_422244</sourcerecordid><originalsourceid>FETCH-crossref_primary_10_1121_1_4222443</originalsourceid><addsrcrecordid>eNqVjr0KwjAYAIMoWH_AR8jokpovTUs7l4qDQwf3kKapjdSmJEHw7VX0BZyOgxsOoR3QGIDBAWLOGON8hiJIGSV5yvgcRZRSILzIsiVaeX97a5onRYR4bYLqibJjcHYYdIsf0hnZDBo3JjgZNC6rc439pLXqsbKtGa8btOjk4PX2xzXaH6tLeSLKWe-d7sTkzF26pwAqPlcCxPcq-SN9AaSuOuE</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Pitch-controlled variable bitrate CELP speech coding</title><source>AIP Acoustical Society of America</source><creator>Oberhofer, Robert ; Owens, Frank</creator><creatorcontrib>Oberhofer, Robert ; Owens, Frank</creatorcontrib><description>Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of speech coders feasible, which adapt to the changing properties of speech for improved efficiency or quality. Variable bit-rate coding has therefore become the focus of considerable research activity in recent years [A. Gersho and E. Paksoy, ‘‘Variable Bit Rate Coding,’’ Signal Processing VII, Theories and Applications, EUSIPCO 1994, pp. 1169–1173; L. Zhang et al., ‘‘A CELP Variable Rate Speech Codec with Low Average Rate,’’ ICASSP 1997, pp. 735–738; B. C. Xydeas, ‘‘Source Driven Variable Bit Rate Prototype Interpolation Coding,’’ ICASSP 1996, p. 220; B. Shen et al., ‘‘A Robust Variable-Rate Speech Coder,’’ ICASSP 1995, p. 249]. While all of these coders focus on classification procedures for efficient bit allocation, the analysis frame size remains mainly static. The coder featured in this paper adapts the analysis frame size and bit allocation according to the pitch of the signal. A pitch frame extractor and classifier at the front end feeds the detected frame into the speech coding back end, which uses traditional CELP-based techniques [B. S. Schroeder et al., ‘‘CodeExcited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates,’’ ICASSP 1985, pp. 937–940]. This approach allows the reduction of the bit rate from a constant 4800 bps (bits per second) with original CELP to typically 3200–4000 bps, depending on the speech situation. This figure is even reduced to typically 2600–3200 with the inclusion of Voice Activity Detection. The speech quality equals the quality of the original CELP coder.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.422244</identifier><language>eng</language><ispartof>The Journal of the Acoustical Society of America, 1998-05, Vol.103 (5_Supplement), p.2777-2777</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>207,314,780,784,27915,27916</link.rule.ids></links><search><creatorcontrib>Oberhofer, Robert</creatorcontrib><creatorcontrib>Owens, Frank</creatorcontrib><title>Pitch-controlled variable bitrate CELP speech coding</title><title>The Journal of the Acoustical Society of America</title><description>Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of speech coders feasible, which adapt to the changing properties of speech for improved efficiency or quality. Variable bit-rate coding has therefore become the focus of considerable research activity in recent years [A. Gersho and E. Paksoy, ‘‘Variable Bit Rate Coding,’’ Signal Processing VII, Theories and Applications, EUSIPCO 1994, pp. 1169–1173; L. Zhang et al., ‘‘A CELP Variable Rate Speech Codec with Low Average Rate,’’ ICASSP 1997, pp. 735–738; B. C. Xydeas, ‘‘Source Driven Variable Bit Rate Prototype Interpolation Coding,’’ ICASSP 1996, p. 220; B. Shen et al., ‘‘A Robust Variable-Rate Speech Coder,’’ ICASSP 1995, p. 249]. While all of these coders focus on classification procedures for efficient bit allocation, the analysis frame size remains mainly static. The coder featured in this paper adapts the analysis frame size and bit allocation according to the pitch of the signal. A pitch frame extractor and classifier at the front end feeds the detected frame into the speech coding back end, which uses traditional CELP-based techniques [B. S. Schroeder et al., ‘‘CodeExcited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates,’’ ICASSP 1985, pp. 937–940]. This approach allows the reduction of the bit rate from a constant 4800 bps (bits per second) with original CELP to typically 3200–4000 bps, depending on the speech situation. This figure is even reduced to typically 2600–3200 with the inclusion of Voice Activity Detection. The speech quality equals the quality of the original CELP coder.</description><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1998</creationdate><recordtype>article</recordtype><recordid>eNqVjr0KwjAYAIMoWH_AR8jokpovTUs7l4qDQwf3kKapjdSmJEHw7VX0BZyOgxsOoR3QGIDBAWLOGON8hiJIGSV5yvgcRZRSILzIsiVaeX97a5onRYR4bYLqibJjcHYYdIsf0hnZDBo3JjgZNC6rc439pLXqsbKtGa8btOjk4PX2xzXaH6tLeSLKWe-d7sTkzF26pwAqPlcCxPcq-SN9AaSuOuE</recordid><startdate>19980501</startdate><enddate>19980501</enddate><creator>Oberhofer, Robert</creator><creator>Owens, Frank</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>19980501</creationdate><title>Pitch-controlled variable bitrate CELP speech coding</title><author>Oberhofer, Robert ; Owens, Frank</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-crossref_primary_10_1121_1_4222443</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1998</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Oberhofer, Robert</creatorcontrib><creatorcontrib>Owens, Frank</creatorcontrib><collection>CrossRef</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Oberhofer, Robert</au><au>Owens, Frank</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Pitch-controlled variable bitrate CELP speech coding</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><date>1998-05-01</date><risdate>1998</risdate><volume>103</volume><issue>5_Supplement</issue><spage>2777</spage><epage>2777</epage><pages>2777-2777</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><abstract>Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of speech coders feasible, which adapt to the changing properties of speech for improved efficiency or quality. Variable bit-rate coding has therefore become the focus of considerable research activity in recent years [A. Gersho and E. Paksoy, ‘‘Variable Bit Rate Coding,’’ Signal Processing VII, Theories and Applications, EUSIPCO 1994, pp. 1169–1173; L. Zhang et al., ‘‘A CELP Variable Rate Speech Codec with Low Average Rate,’’ ICASSP 1997, pp. 735–738; B. C. Xydeas, ‘‘Source Driven Variable Bit Rate Prototype Interpolation Coding,’’ ICASSP 1996, p. 220; B. Shen et al., ‘‘A Robust Variable-Rate Speech Coder,’’ ICASSP 1995, p. 249]. While all of these coders focus on classification procedures for efficient bit allocation, the analysis frame size remains mainly static. The coder featured in this paper adapts the analysis frame size and bit allocation according to the pitch of the signal. A pitch frame extractor and classifier at the front end feeds the detected frame into the speech coding back end, which uses traditional CELP-based techniques [B. S. Schroeder et al., ‘‘CodeExcited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates,’’ ICASSP 1985, pp. 937–940]. This approach allows the reduction of the bit rate from a constant 4800 bps (bits per second) with original CELP to typically 3200–4000 bps, depending on the speech situation. This figure is even reduced to typically 2600–3200 with the inclusion of Voice Activity Detection. The speech quality equals the quality of the original CELP coder.</abstract><doi>10.1121/1.422244</doi></addata></record>
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 1998-05, Vol.103 (5_Supplement), p.2777-2777
issn 0001-4966
1520-8524
language eng
recordid cdi_crossref_primary_10_1121_1_422244
source AIP Acoustical Society of America
title Pitch-controlled variable bitrate CELP speech coding
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T19%3A01%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Pitch-controlled%20variable%20bitrate%20CELP%20speech%20coding&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Oberhofer,%20Robert&rft.date=1998-05-01&rft.volume=103&rft.issue=5_Supplement&rft.spage=2777&rft.epage=2777&rft.pages=2777-2777&rft.issn=0001-4966&rft.eissn=1520-8524&rft_id=info:doi/10.1121/1.422244&rft_dat=%3Ccrossref%3E10_1121_1_422244%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true