MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA

The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband,...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
1. Verfasser:	MCGRATH, David S
Format:	Patent
Sprache:	eng ; fre ; ger
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	MCGRATH, David S
description	The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The disclosure further relates to methods of processing a compressed representation of a spatial audio signal for generating a reconstructed representation of the spatial audio signal, and to corresponding apparatus, programs, and storage media.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP4052257B1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP4052257B1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP4052257B13</originalsourceid><addsrcrecordid>eNrjZLD3DfUJ8XT2cPTzc_VRcAx18fRXcPVz9ndxVXD0c1FwcQUzQ4M9_dwVXDyDXJ1DPP39HH0UfF1DHF0cQxx5GFjTEnOKU3mhNDeDgptriLOHbmpBfnxqcUFicmpeakm8a4CJgamRkam5k6ExEUoAEQIpsg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA</title><source>esp@cenet</source><creator>MCGRATH, David S</creator><creatorcontrib>MCGRATH, David S</creatorcontrib><description>The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The disclosure further relates to methods of processing a compressed representation of a spatial audio signal for generating a reconstructed representation of the spatial audio signal, and to corresponding apparatus, programs, and storage media.</description><language>eng ; fre ; ger</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20241002&DB=EPODOC&CC=EP&NR=4052257B1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20241002&DB=EPODOC&CC=EP&NR=4052257B1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>MCGRATH, David S</creatorcontrib><title>MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA</title><description>The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The disclosure further relates to methods of processing a compressed representation of a spatial audio signal for generating a reconstructed representation of the spatial audio signal, and to corresponding apparatus, programs, and storage media.</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2024</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZLD3DfUJ8XT2cPTzc_VRcAx18fRXcPVz9ndxVXD0c1FwcQUzQ4M9_dwVXDyDXJ1DPP39HH0UfF1DHF0cQxx5GFjTEnOKU3mhNDeDgptriLOHbmpBfnxqcUFicmpeakm8a4CJgamRkam5k6ExEUoAEQIpsg</recordid><startdate>20241002</startdate><enddate>20241002</enddate><creator>MCGRATH, David S</creator><scope>EVB</scope></search><sort><creationdate>20241002</creationdate><title>MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA</title><author>MCGRATH, David S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP4052257B13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2024</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>MCGRATH, David S</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>MCGRATH, David S</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA</title><date>2024-10-02</date><risdate>2024</risdate><abstract>The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The disclosure further relates to methods of processing a compressed representation of a spatial audio signal for generating a reconstructed representation of the spatial audio signal, and to corresponding apparatus, programs, and storage media.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng ; fre ; ger
recordid	cdi_epo_espacenet_EP4052257B1
source	esp@cenet
subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
title	MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T01%3A33%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=MCGRATH,%20David%20S&rft.date=2024-10-02&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP4052257B1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true