Phase retrieval with Bregman divergences and application to audio signal recovery

Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products. This problem arises in many audio signal processing applications which operate on a short-time Fourier transform magnitude or power spectrogram, and discard the phase information. Recovering the missing pha...

Full description

Saved in:
Bibliographic details
Published in: arXiv.org 2021-01
Main authors: Pierre-Hugo Vial; Magron, Paul; Oberlin, Thomas; Févotte, Cédric
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Pierre-Hugo Vial
Magron, Paul
Oberlin, Thomas
Févotte, Cédric
description Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products. This problem arises in many audio signal processing applications which operate on a short-time Fourier transform magnitude or power spectrogram, and discard the phase information. Recovering the missing phase from the resulting modified spectrogram is indeed necessary in order to synthesize time-domain signals. PR is commonly addressed by considering a minimization problem involving a quadratic loss function. In this paper, we adopt a different standpoint. Indeed, the quadratic loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. Therefore, we formulate PR as a new minimization problem involving Bregman divergences. Since these divergences are not symmetric with respect to their two input arguments in general, they lead to two different formulations of the problem. To optimize the resulting objective, we derive two algorithms based on accelerated gradient descent and alternating direction method of multipliers. Experiments conducted on audio signal recovery from spectrograms that are either exact or estimated from noisy observations highlight the potential of our proposed methods for audio restoration. In particular, leveraging some of these Bregman divergences induces better performance than the quadratic loss when performing PR from spectrograms under very noisy conditions.
doi_str_mv 10.48550/arxiv.2010.00392
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2010_00392</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2448035835</sourcerecordid><originalsourceid>FETCH-LOGICAL-a525-de0b8e9da1de92140d93e2fbe292573339181f73078dab03bf7805099a8f8e3f3</originalsourceid><addsrcrecordid>eNotj0FLw0AQhRdBsNT-AE8ueE6d7GTN7lGLWqGgQu9h0p20W9ok7ibV_ntj62ng8b3HfELcpDDNjNZwT-HHH6YKhgAArboQI4WYJiZT6kpMYtwCgHrIldY4Ep8fG4osA3fB84F28tt3G_kUeL2nWjp_4LDmesVRUu0kte3Or6jzTS27RlLvfCOjX9dDMfCqGejjtbisaBd58n_HYvnyvJzNk8X769vscZGQVjpxDKVh6yh1bFWagbPIqipZWaVzRLSpSascITeOSsCyyg1osJZMZRgrHIvb8-zJt2iD31M4Fn_excl7IO7ORBuar55jV2ybPgyvxkJlmQHUBjX-AhV4Wyw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2448035835</pqid></control><display><type>article</type><title>Phase retrieval with Bregman divergences and application to audio signal recovery</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Pierre-Hugo Vial ; Magron, Paul ; Oberlin, Thomas ; Févotte, Cédric</creator><creatorcontrib>Pierre-Hugo Vial ; Magron, Paul ; Oberlin, Thomas ; Févotte, Cédric</creatorcontrib><description>Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products. This problem arises in many audio signal processing applications which operate on a short-time Fourier transform magnitude or power spectrogram, and discard the phase information. Recovering the missing phase from the resulting modified spectrogram is indeed necessary in order to synthesize time-domain signals. PR is commonly addressed by considering a minimization problem involving a quadratic loss function. In this paper, we adopt a different standpoint. Indeed, the quadratic loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. 
Therefore, we formulate PR as a new minimization problem involving Bregman divergences. Since these divergences are not symmetric with respect to their two input arguments in general, they lead to two different formulations of the problem. To optimize the resulting objective, we derive two algorithms based on accelerated gradient descent and alternating direction method of multipliers. Experiments conducted on audio signal recovery from spectrograms that are either exact or estimated from noisy observations highlight the potential of our proposed methods for audio restoration. In particular, leveraging some of these Bregman divergences induce better performance than the quadratic loss when performing PR from spectrograms under very noisy conditions.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2010.00392</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Computer Science - Sound ; Fourier transforms ; Optimization ; Phase retrieval ; Recovery ; Signal processing ; Signal reconstruction ; Spectrograms</subject><ispartof>arXiv.org, 2021-01</ispartof><rights>2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,784,885,27925</link.rule.ids><backlink>$$Uhttps://doi.org/10.48550/arXiv.2010.00392$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.1109/JSTSP.2021.3051870$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink></links><search><creatorcontrib>Pierre-Hugo Vial</creatorcontrib><creatorcontrib>Magron, Paul</creatorcontrib><creatorcontrib>Oberlin, Thomas</creatorcontrib><creatorcontrib>Févotte, Cédric</creatorcontrib><title>Phase retrieval with Bregman divergences and application to audio signal recovery</title><title>arXiv.org</title><description>Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products. This problem arises in many audio signal processing applications which operate on a short-time Fourier transform magnitude or power spectrogram, and discard the phase information. Recovering the missing phase from the resulting modified spectrogram is indeed necessary in order to synthesize time-domain signals. PR is commonly addressed by considering a minimization problem involving a quadratic loss function. In this paper, we adopt a different standpoint. Indeed, the quadratic loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. Therefore, we formulate PR as a new minimization problem involving Bregman divergences. 
Since these divergences are not symmetric with respect to their two input arguments in general, they lead to two different formulations of the problem. To optimize the resulting objective, we derive two algorithms based on accelerated gradient descent and alternating direction method of multipliers. Experiments conducted on audio signal recovery from spectrograms that are either exact or estimated from noisy observations highlight the potential of our proposed methods for audio restoration. In particular, leveraging some of these Bregman divergences induce better performance than the quadratic loss when performing PR from spectrograms under very noisy conditions.</description><subject>Algorithms</subject><subject>Computer Science - Sound</subject><subject>Fourier transforms</subject><subject>Optimization</subject><subject>Phase retrieval</subject><subject>Recovery</subject><subject>Signal processing</subject><subject>Signal reconstruction</subject><subject>Spectrograms</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotj0FLw0AQhRdBsNT-AE8ueE6d7GTN7lGLWqGgQu9h0p20W9ok7ibV_ntj62ng8b3HfELcpDDNjNZwT-HHH6YKhgAArboQI4WYJiZT6kpMYtwCgHrIldY4Ep8fG4osA3fB84F28tt3G_kUeL2nWjp_4LDmesVRUu0kte3Or6jzTS27RlLvfCOjX9dDMfCqGejjtbisaBd58n_HYvnyvJzNk8X769vscZGQVjpxDKVh6yh1bFWagbPIqipZWaVzRLSpSascITeOSsCyyg1osJZMZRgrHIvb8-zJt2iD31M4Fn_excl7IO7ORBuar55jV2ybPgyvxkJlmQHUBjX-AhV4Wyw</recordid><startdate>20210113</startdate><enddate>20210113</enddate><creator>Pierre-Hugo Vial</creator><creator>Magron, Paul</creator><creator>Oberlin, Thomas</creator><creator>Févotte, Cédric</creator><general>Cornell University Library, 
arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20210113</creationdate><title>Phase retrieval with Bregman divergences and application to audio signal recovery</title><author>Pierre-Hugo Vial ; Magron, Paul ; Oberlin, Thomas ; Févotte, Cédric</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a525-de0b8e9da1de92140d93e2fbe292573339181f73078dab03bf7805099a8f8e3f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Computer Science - Sound</topic><topic>Fourier transforms</topic><topic>Optimization</topic><topic>Phase retrieval</topic><topic>Recovery</topic><topic>Signal processing</topic><topic>Signal reconstruction</topic><topic>Spectrograms</topic><toplevel>online_resources</toplevel><creatorcontrib>Pierre-Hugo Vial</creatorcontrib><creatorcontrib>Magron, Paul</creatorcontrib><creatorcontrib>Oberlin, Thomas</creatorcontrib><creatorcontrib>Févotte, Cédric</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium 
Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pierre-Hugo Vial</au><au>Magron, Paul</au><au>Oberlin, Thomas</au><au>Févotte, Cédric</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Phase retrieval with Bregman divergences and application to audio signal recovery</atitle><jtitle>arXiv.org</jtitle><date>2021-01-13</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products. This problem arises in many audio signal processing applications which operate on a short-time Fourier transform magnitude or power spectrogram, and discard the phase information. Recovering the missing phase from the resulting modified spectrogram is indeed necessary in order to synthesize time-domain signals. PR is commonly addressed by considering a minimization problem involving a quadratic loss function. In this paper, we adopt a different standpoint. Indeed, the quadratic loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. Therefore, we formulate PR as a new minimization problem involving Bregman divergences. 
Since these divergences are not symmetric with respect to their two input arguments in general, they lead to two different formulations of the problem. To optimize the resulting objective, we derive two algorithms based on accelerated gradient descent and alternating direction method of multipliers. Experiments conducted on audio signal recovery from spectrograms that are either exact or estimated from noisy observations highlight the potential of our proposed methods for audio restoration. In particular, leveraging some of these Bregman divergences induce better performance than the quadratic loss when performing PR from spectrograms under very noisy conditions.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2010.00392</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2021-01
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2010_00392
source arXiv.org; Free E- Journals
subjects Algorithms
Computer Science - Sound
Fourier transforms
Optimization
Phase retrieval
Recovery
Signal processing
Signal reconstruction
Spectrograms
title Phase retrieval with Bregman divergences and application to audio signal recovery
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T06%3A03%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Phase%20retrieval%20with%20Bregman%20divergences%20and%20application%20to%20audio%20signal%20recovery&rft.jtitle=arXiv.org&rft.au=Pierre-Hugo%20Vial&rft.date=2021-01-13&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2010.00392&rft_dat=%3Cproquest_arxiv%3E2448035835%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2448035835&rft_id=info:pmid/&rfr_iscdi=true