Pitch-controlled variable bitrate CELP speech coding

Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	The Journal of the Acoustical Society of America 1998-05, Vol.103 (5_Supplement), p.2777-2777
Hauptverfasser:	Oberhofer, Robert, Owens, Frank
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	2777
container_issue	5_Supplement
container_start_page	2777
container_title	The Journal of the Acoustical Society of America
container_volume	103
creator	Oberhofer, Robert Owens, Frank
description	Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of speech coders feasible, which adapt to the changing properties of speech for improved efficiency or quality. Variable bit-rate coding has therefore become the focus of considerable research activity in recent years [A. Gersho and E. Paksoy, ‘‘Variable Bit Rate Coding,’’ Signal Processing VII, Theories and Applications, EUSIPCO 1994, pp. 1169–1173; L. Zhang et al., ‘‘A CELP Variable Rate Speech Codec with Low Average Rate,’’ ICASSP 1997, pp. 735–738; B. C. Xydeas, ‘‘Source Driven Variable Bit Rate Prototype Interpolation Coding,’’ ICASSP 1996, p. 220; B. Shen et al., ‘‘A Robust Variable-Rate Speech Coder,’’ ICASSP 1995, p. 249]. While all of these coders focus on classification procedures for efficient bit allocation, the analysis frame size remains mainly static. The coder featured in this paper adapts the analysis frame size and bit allocation according to the pitch of the signal. A pitch frame extractor and classifier at the front end feeds the detected frame into the speech coding back end, which uses traditional CELP-based techniques [B. S. Schroeder et al., ‘‘CodeExcited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates,’’ ICASSP 1985, pp. 937–940]. This approach allows the reduction of the bit rate from a constant 4800 bps (bits per second) with original CELP to typically 3200–4000 bps, depending on the speech situation. This figure is even reduced to typically 2600–3200 with the inclusion of Voice Activity Detection. The speech quality equals the quality of the original CELP coder.
doi_str_mv	10.1121/1.422244
format	Article
fullrecord	<record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1121_1_422244</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1121_1_422244</sourcerecordid><originalsourceid>FETCH-crossref_primary_10_1121_1_4222443</originalsourceid><addsrcrecordid>eNqVjr0KwjAYAIMoWH_AR8jokpovTUs7l4qDQwf3kKapjdSmJEHw7VX0BZyOgxsOoR3QGIDBAWLOGON8hiJIGSV5yvgcRZRSILzIsiVaeX97a5onRYR4bYLqibJjcHYYdIsf0hnZDBo3JjgZNC6rc439pLXqsbKtGa8btOjk4PX2xzXaH6tLeSLKWe-d7sTkzF26pwAqPlcCxPcq-SN9AaSuOuE</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Pitch-controlled variable bitrate CELP speech coding</title><source>AIP Acoustical Society of America</source><creator>Oberhofer, Robert ; Owens, Frank</creator><creatorcontrib>Oberhofer, Robert ; Owens, Frank</creatorcontrib><description>Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of speech coders feasible, which adapt to the changing properties of speech for improved efficiency or quality. Variable bit-rate coding has therefore become the focus of considerable research activity in recent years [A. Gersho and E. Paksoy, ‘‘Variable Bit Rate Coding,’’ Signal Processing VII, Theories and Applications, EUSIPCO 1994, pp. 1169–1173; L. Zhang et al., ‘‘A CELP Variable Rate Speech Codec with Low Average Rate,’’ ICASSP 1997, pp. 735–738; B. C. Xydeas, ‘‘Source Driven Variable Bit Rate Prototype Interpolation Coding,’’ ICASSP 1996, p. 220; B. Shen et al., ‘‘A Robust Variable-Rate Speech Coder,’’ ICASSP 1995, p. 249]. While all of these coders focus on classification procedures for efficient bit allocation, the analysis frame size remains mainly static. The coder featured in this paper adapts the analysis frame size and bit allocation according to the pitch of the signal. A pitch frame extractor and classifier at the front end feeds the detected frame into the speech coding back end, which uses traditional CELP-based techniques [B. S. Schroeder et al., ‘‘CodeExcited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates,’’ ICASSP 1985, pp. 937–940]. This approach allows the reduction of the bit rate from a constant 4800 bps (bits per second) with original CELP to typically 3200–4000 bps, depending on the speech situation. This figure is even reduced to typically 2600–3200 with the inclusion of Voice Activity Detection. The speech quality equals the quality of the original CELP coder.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.422244</identifier><language>eng</language><ispartof>The Journal of the Acoustical Society of America, 1998-05, Vol.103 (5_Supplement), p.2777-2777</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>207,314,780,784,27915,27916</link.rule.ids></links><search><creatorcontrib>Oberhofer, Robert</creatorcontrib><creatorcontrib>Owens, Frank</creatorcontrib><title>Pitch-controlled variable bitrate CELP speech coding</title><title>The Journal of the Acoustical Society of America</title><description>Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of speech coders feasible, which adapt to the changing properties of speech for improved efficiency or quality. Variable bit-rate coding has therefore become the focus of considerable research activity in recent years [A. Gersho and E. Paksoy, ‘‘Variable Bit Rate Coding,’’ Signal Processing VII, Theories and Applications, EUSIPCO 1994, pp. 1169–1173; L. Zhang et al., ‘‘A CELP Variable Rate Speech Codec with Low Average Rate,’’ ICASSP 1997, pp. 735–738; B. C. Xydeas, ‘‘Source Driven Variable Bit Rate Prototype Interpolation Coding,’’ ICASSP 1996, p. 220; B. Shen et al., ‘‘A Robust Variable-Rate Speech Coder,’’ ICASSP 1995, p. 249]. While all of these coders focus on classification procedures for efficient bit allocation, the analysis frame size remains mainly static. The coder featured in this paper adapts the analysis frame size and bit allocation according to the pitch of the signal. A pitch frame extractor and classifier at the front end feeds the detected frame into the speech coding back end, which uses traditional CELP-based techniques [B. S. Schroeder et al., ‘‘CodeExcited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates,’’ ICASSP 1985, pp. 937–940]. This approach allows the reduction of the bit rate from a constant 4800 bps (bits per second) with original CELP to typically 3200–4000 bps, depending on the speech situation. This figure is even reduced to typically 2600–3200 with the inclusion of Voice Activity Detection. The speech quality equals the quality of the original CELP coder.</description><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1998</creationdate><recordtype>article</recordtype><recordid>eNqVjr0KwjAYAIMoWH_AR8jokpovTUs7l4qDQwf3kKapjdSmJEHw7VX0BZyOgxsOoR3QGIDBAWLOGON8hiJIGSV5yvgcRZRSILzIsiVaeX97a5onRYR4bYLqibJjcHYYdIsf0hnZDBo3JjgZNC6rc439pLXqsbKtGa8btOjk4PX2xzXaH6tLeSLKWe-d7sTkzF26pwAqPlcCxPcq-SN9AaSuOuE</recordid><startdate>19980501</startdate><enddate>19980501</enddate><creator>Oberhofer, Robert</creator><creator>Owens, Frank</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>19980501</creationdate><title>Pitch-controlled variable bitrate CELP speech coding</title><author>Oberhofer, Robert ; Owens, Frank</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-crossref_primary_10_1121_1_4222443</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1998</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Oberhofer, Robert</creatorcontrib><creatorcontrib>Owens, Frank</creatorcontrib><collection>CrossRef</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Oberhofer, Robert</au><au>Owens, Frank</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Pitch-controlled variable bitrate CELP speech coding</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><date>1998-05-01</date><risdate>1998</risdate><volume>103</volume><issue>5_Supplement</issue><spage>2777</spage><epage>2777</epage><pages>2777-2777</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><abstract>Most research in efficient speech coding concentrated for many years on algorithms which produced a fixed bit rate. Fixed bit rates are, however, not a requirement for modern packet-based telecommunications and computer networks as well as voice storage applications. This makes the implementation of speech coders feasible, which adapt to the changing properties of speech for improved efficiency or quality. Variable bit-rate coding has therefore become the focus of considerable research activity in recent years [A. Gersho and E. Paksoy, ‘‘Variable Bit Rate Coding,’’ Signal Processing VII, Theories and Applications, EUSIPCO 1994, pp. 1169–1173; L. Zhang et al., ‘‘A CELP Variable Rate Speech Codec with Low Average Rate,’’ ICASSP 1997, pp. 735–738; B. C. Xydeas, ‘‘Source Driven Variable Bit Rate Prototype Interpolation Coding,’’ ICASSP 1996, p. 220; B. Shen et al., ‘‘A Robust Variable-Rate Speech Coder,’’ ICASSP 1995, p. 249]. While all of these coders focus on classification procedures for efficient bit allocation, the analysis frame size remains mainly static. The coder featured in this paper adapts the analysis frame size and bit allocation according to the pitch of the signal. A pitch frame extractor and classifier at the front end feeds the detected frame into the speech coding back end, which uses traditional CELP-based techniques [B. S. Schroeder et al., ‘‘CodeExcited Linear Prediction (CELP): High Quality Speech at Very Low Bit Rates,’’ ICASSP 1985, pp. 937–940]. This approach allows the reduction of the bit rate from a constant 4800 bps (bits per second) with original CELP to typically 3200–4000 bps, depending on the speech situation. This figure is even reduced to typically 2600–3200 with the inclusion of Voice Activity Detection. The speech quality equals the quality of the original CELP coder.</abstract><doi>10.1121/1.422244</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 0001-4966
ispartof	The Journal of the Acoustical Society of America, 1998-05, Vol.103 (5_Supplement), p.2777-2777
issn	0001-4966 1520-8524
language	eng
recordid	cdi_crossref_primary_10_1121_1_422244
source	AIP Acoustical Society of America
title	Pitch-controlled variable bitrate CELP speech coding
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T19%3A01%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Pitch-controlled%20variable%20bitrate%20CELP%20speech%20coding&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Oberhofer,%20Robert&rft.date=1998-05-01&rft.volume=103&rft.issue=5_Supplement&rft.spage=2777&rft.epage=2777&rft.pages=2777-2777&rft.issn=0001-4966&rft.eissn=1520-8524&rft_id=info:doi/10.1121/1.422244&rft_dat=%3Ccrossref%3E10_1121_1_422244%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true