Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction

Traditional MDCT-based perceptual audio coding schemes employ mid/side and intensity stereo techniques to allow efficient joint coding of the two channels of a stereophonic signal. These techniques, however, provide only little coding gain for critical stereo signals characterized by spectral compon...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Helmrich, Christian R., Carlsson, Pontus, Disch, Sascha, Edler, Bernd, Hilpert, Johannes, Neusinger, Matthias, Purnhagen, Heiko, Rettelbach, Nikolaus, Robilliard, Julien, Villemoes, Lars
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 500
container_issue
container_start_page 497
container_title
container_volume
creator Helmrich, Christian R.
Carlsson, Pontus
Disch, Sascha
Edler, Bernd
Hilpert, Johannes
Neusinger, Matthias
Purnhagen, Heiko
Rettelbach, Nikolaus
Robilliard, Julien
Villemoes, Lars
description Traditional MDCT-based perceptual audio coding schemes employ mid/side and intensity stereo techniques to allow efficient joint coding of the two channels of a stereophonic signal. These techniques, however, provide only little coding gain for critical stereo signals characterized by spectral components with a distinct level or phase difference between the channels. To overcome this deficiency, we propose an extension to the mid/side coding paradigm that utilizes complex-valued inter-channel linear prediction in the MDCT spectral domain. The required imaginary spectrum (MDST) is calculated in a computationally efficient manner without additional algorithmic delay. A formal listening test conducted in the course of the ISO/MPEG standardization of the unified speech and audio codec USAC illustrates that the proposed stereo prediction approach pro vides significant improvements in coding efficiency and shows that at 96 kb/s, excellent quality can be obtained even for critical signals.
doi_str_mv 10.1109/ICASSP.2011.5946449
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5946449</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5946449</ieee_id><sourcerecordid>5946449</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-8e8448bd88bc9d141f70b6c4c54991e6ba5794ef02013b4fe05f889d79a4e9453</originalsourceid><addsrcrecordid>eNo1UMlOwzAUNJtEKf2CXvwDDnZix35HVJVFqgRSQeJWOfZzMUriKkmB_j1BlLnMYRZphpC54JkQHG4eF7fr9XOWcyEyBbKUEk7IlZBKa64K0KdkkhcamAD-dkZmoM2_Zvg5mQiVc1YKCZdk1vcffESZa61gQvwyhOgitgMdOtv2IXUNdcnHdktToMNXYu7dti3W1O59TLSP29bWPa0OtMEx8OtyqdnV-M0-bb1HT_sBO0x016GPboipvSYXYczg7MhT8nq3fFk8sNXT_ThtxaLQamAGjZSm8sZUDryQImhelU46JQEElpVVGiQGPv5QVDIgV8EY8BqsRJCqmJL5X29ExM2ui43tDpvjYcUPHF5ciQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Helmrich, Christian R. ; Carlsson, Pontus ; Disch, Sascha ; Edler, Bernd ; Hilpert, Johannes ; Neusinger, Matthias ; Purnhagen, Heiko ; Rettelbach, Nikolaus ; Robilliard, Julien ; Villemoes, Lars</creator><creatorcontrib>Helmrich, Christian R. ; Carlsson, Pontus ; Disch, Sascha ; Edler, Bernd ; Hilpert, Johannes ; Neusinger, Matthias ; Purnhagen, Heiko ; Rettelbach, Nikolaus ; Robilliard, Julien ; Villemoes, Lars</creatorcontrib><description>Traditional MDCT-based perceptual audio coding schemes employ mid/side and intensity stereo techniques to allow efficient joint coding of the two channels of a stereophonic signal. These techniques, however, provide only little coding gain for critical stereo signals characterized by spectral components with a distinct level or phase difference between the channels. To overcome this deficiency, we propose an extension to the mid/side coding paradigm that utilizes complex-valued inter-channel linear prediction in the MDCT spectral domain. The required imaginary spectrum (MDST) is calculated in a computationally efficient manner without additional algorithmic delay. A formal listening test conducted in the course of the ISO/MPEG standardization of the unified speech and audio codec USAC illustrates that the proposed stereo prediction approach pro vides significant improvements in coding efficiency and shows that at 96 kb/s, excellent quality can be obtained even for critical signals.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781457705380</identifier><identifier>ISBN: 1457705389</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 1457705397</identifier><identifier>EISBN: 9781457705373</identifier><identifier>EISBN: 9781457705397</identifier><identifier>EISBN: 1457705370</identifier><identifier>DOI: 10.1109/ICASSP.2011.5946449</identifier><language>eng</language><publisher>IEEE</publisher><subject>Audio coding ; Compaction ; Complexity theory ; Decoding ; M/S stereo ; MDCT ; prediction ; Transform coding ; Transforms</subject><ispartof>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, p.497-500</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5946449$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>310,311,782,786,791,792,2060,27932,54927</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5946449$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Helmrich, Christian R.</creatorcontrib><creatorcontrib>Carlsson, Pontus</creatorcontrib><creatorcontrib>Disch, Sascha</creatorcontrib><creatorcontrib>Edler, Bernd</creatorcontrib><creatorcontrib>Hilpert, Johannes</creatorcontrib><creatorcontrib>Neusinger, Matthias</creatorcontrib><creatorcontrib>Purnhagen, Heiko</creatorcontrib><creatorcontrib>Rettelbach, Nikolaus</creatorcontrib><creatorcontrib>Robilliard, Julien</creatorcontrib><creatorcontrib>Villemoes, Lars</creatorcontrib><title>Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction</title><title>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</title><addtitle>ICASSP</addtitle><description>Traditional MDCT-based perceptual audio coding schemes employ mid/side and intensity stereo techniques to allow efficient joint coding of the two channels of a stereophonic signal. These techniques, however, provide only little coding gain for critical stereo signals characterized by spectral components with a distinct level or phase difference between the channels. To overcome this deficiency, we propose an extension to the mid/side coding paradigm that utilizes complex-valued inter-channel linear prediction in the MDCT spectral domain. The required imaginary spectrum (MDST) is calculated in a computationally efficient manner without additional algorithmic delay. A formal listening test conducted in the course of the ISO/MPEG standardization of the unified speech and audio codec USAC illustrates that the proposed stereo prediction approach pro vides significant improvements in coding efficiency and shows that at 96 kb/s, excellent quality can be obtained even for critical signals.</description><subject>Audio coding</subject><subject>Compaction</subject><subject>Complexity theory</subject><subject>Decoding</subject><subject>M/S stereo</subject><subject>MDCT</subject><subject>prediction</subject><subject>Transform coding</subject><subject>Transforms</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781457705380</isbn><isbn>1457705389</isbn><isbn>1457705397</isbn><isbn>9781457705373</isbn><isbn>9781457705397</isbn><isbn>1457705370</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1UMlOwzAUNJtEKf2CXvwDDnZix35HVJVFqgRSQeJWOfZzMUriKkmB_j1BlLnMYRZphpC54JkQHG4eF7fr9XOWcyEyBbKUEk7IlZBKa64K0KdkkhcamAD-dkZmoM2_Zvg5mQiVc1YKCZdk1vcffESZa61gQvwyhOgitgMdOtv2IXUNdcnHdktToMNXYu7dti3W1O59TLSP29bWPa0OtMEx8OtyqdnV-M0-bb1HT_sBO0x016GPboipvSYXYczg7MhT8nq3fFk8sNXT_ThtxaLQamAGjZSm8sZUDryQImhelU46JQEElpVVGiQGPv5QVDIgV8EY8BqsRJCqmJL5X29ExM2ui43tDpvjYcUPHF5ciQ</recordid><startdate>201105</startdate><enddate>201105</enddate><creator>Helmrich, Christian R.</creator><creator>Carlsson, Pontus</creator><creator>Disch, Sascha</creator><creator>Edler, Bernd</creator><creator>Hilpert, Johannes</creator><creator>Neusinger, Matthias</creator><creator>Purnhagen, Heiko</creator><creator>Rettelbach, Nikolaus</creator><creator>Robilliard, Julien</creator><creator>Villemoes, Lars</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201105</creationdate><title>Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction</title><author>Helmrich, Christian R. ; Carlsson, Pontus ; Disch, Sascha ; Edler, Bernd ; Hilpert, Johannes ; Neusinger, Matthias ; Purnhagen, Heiko ; Rettelbach, Nikolaus ; Robilliard, Julien ; Villemoes, Lars</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-8e8448bd88bc9d141f70b6c4c54991e6ba5794ef02013b4fe05f889d79a4e9453</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Audio coding</topic><topic>Compaction</topic><topic>Complexity theory</topic><topic>Decoding</topic><topic>M/S stereo</topic><topic>MDCT</topic><topic>prediction</topic><topic>Transform coding</topic><topic>Transforms</topic><toplevel>online_resources</toplevel><creatorcontrib>Helmrich, Christian R.</creatorcontrib><creatorcontrib>Carlsson, Pontus</creatorcontrib><creatorcontrib>Disch, Sascha</creatorcontrib><creatorcontrib>Edler, Bernd</creatorcontrib><creatorcontrib>Hilpert, Johannes</creatorcontrib><creatorcontrib>Neusinger, Matthias</creatorcontrib><creatorcontrib>Purnhagen, Heiko</creatorcontrib><creatorcontrib>Rettelbach, Nikolaus</creatorcontrib><creatorcontrib>Robilliard, Julien</creatorcontrib><creatorcontrib>Villemoes, Lars</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Helmrich, Christian R.</au><au>Carlsson, Pontus</au><au>Disch, Sascha</au><au>Edler, Bernd</au><au>Hilpert, Johannes</au><au>Neusinger, Matthias</au><au>Purnhagen, Heiko</au><au>Rettelbach, Nikolaus</au><au>Robilliard, Julien</au><au>Villemoes, Lars</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction</atitle><btitle>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</btitle><stitle>ICASSP</stitle><date>2011-05</date><risdate>2011</risdate><spage>497</spage><epage>500</epage><pages>497-500</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781457705380</isbn><isbn>1457705389</isbn><eisbn>1457705397</eisbn><eisbn>9781457705373</eisbn><eisbn>9781457705397</eisbn><eisbn>1457705370</eisbn><abstract>Traditional MDCT-based perceptual audio coding schemes employ mid/side and intensity stereo techniques to allow efficient joint coding of the two channels of a stereophonic signal. These techniques, however, provide only little coding gain for critical stereo signals characterized by spectral components with a distinct level or phase difference between the channels. To overcome this deficiency, we propose an extension to the mid/side coding paradigm that utilizes complex-valued inter-channel linear prediction in the MDCT spectral domain. The required imaginary spectrum (MDST) is calculated in a computationally efficient manner without additional algorithmic delay. A formal listening test conducted in the course of the ISO/MPEG standardization of the unified speech and audio codec USAC illustrates that the proposed stereo prediction approach pro vides significant improvements in coding efficiency and shows that at 96 kb/s, excellent quality can be obtained even for critical signals.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2011.5946449</doi><tpages>4</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1520-6149
ispartof 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, p.497-500
issn 1520-6149
2379-190X
language eng
recordid cdi_ieee_primary_5946449
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Audio coding
Compaction
Complexity theory
Decoding
M/S stereo
MDCT
prediction
Transform coding
Transforms
title Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-06T05%3A20%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Efficient%20transform%20coding%20of%20two-channel%20audio%20signals%20by%20means%20of%20complex-valued%20stereo%20prediction&rft.btitle=2011%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing%20(ICASSP)&rft.au=Helmrich,%20Christian%20R.&rft.date=2011-05&rft.spage=497&rft.epage=500&rft.pages=497-500&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781457705380&rft.isbn_list=1457705389&rft_id=info:doi/10.1109/ICASSP.2011.5946449&rft_dat=%3Cieee_6IE%3E5946449%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1457705397&rft.eisbn_list=9781457705373&rft.eisbn_list=9781457705397&rft.eisbn_list=1457705370&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5946449&rfr_iscdi=true