Apparatus, methods and computer programs for obtaining spatial metadata

A machine learning model is trained to enable high quality spatial audio metadata to be obtained even from sub-optimal or low-quality microphone arrays. Input data for the machine learning model based on two or more microphone signals is determined (eg. by cross-correlation of delay or frequency dat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Juha Tapio Vilkamo, Mikko Johannes Honkala
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Juha Tapio Vilkamo
Mikko Johannes Honkala
description A machine learning model is trained to enable high quality spatial audio metadata to be obtained even from sub-optimal or low-quality microphone arrays. Input data for the machine learning model based on two or more microphone signals is determined (eg. by cross-correlation of delay or frequency data in the channels) and processed to obtain spatial metadata (eg. source direction or directionality) which is used in turn to render the signal.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_GB2607934A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>GB2607934A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_GB2607934A3</originalsourceid><addsrcrecordid>eNqFyzEKwkAQRuE0FqKewTmAghgxWEbReAD78JudxIXszrA7ub8K9lav-d68aGpVJNiUNxTYXuIyITrqJOhknEiTDAkhUy-J5Gnw0ceBssI8xu8DB8OymPUYM69-XRTr2_VxuW9ZpeWP7jiytc15f9xVp_JQl__FG8GHM2c</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Apparatus, methods and computer programs for obtaining spatial metadata</title><source>esp@cenet</source><creator>Juha Tapio Vilkamo ; Mikko Johannes Honkala</creator><creatorcontrib>Juha Tapio Vilkamo ; Mikko Johannes Honkala</creatorcontrib><description>A machine learning model is trained to enable high quality spatial audio metadata to be obtained even from sub-optimal or low-quality microphone arrays. Input data for the machine learning model based on two or more microphone signals is determined (eg. by cross-correlation of delay or frequency data in the channels) and processed to obtain spatial metadata (eg. source direction or directionality) which is used in turn to render the signal.</description><language>eng</language><subject>ACOUSTICS ; ELECTRIC COMMUNICATION TECHNIQUE ; ELECTRICITY ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION ; STEREOPHONIC SYSTEMS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20221221&amp;DB=EPODOC&amp;CC=GB&amp;NR=2607934A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,777,882,25545,76296</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20221221&amp;DB=EPODOC&amp;CC=GB&amp;NR=2607934A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Juha Tapio Vilkamo</creatorcontrib><creatorcontrib>Mikko Johannes Honkala</creatorcontrib><title>Apparatus, methods and computer programs for obtaining spatial metadata</title><description>A machine learning model is trained to enable high quality spatial audio metadata to be obtained even from sub-optimal or low-quality microphone arrays. Input data for the machine learning model based on two or more microphone signals is determined (eg. by cross-correlation of delay or frequency data in the channels) and processed to obtain spatial metadata (eg. source direction or directionality) which is used in turn to render the signal.</description><subject>ACOUSTICS</subject><subject>ELECTRIC COMMUNICATION TECHNIQUE</subject><subject>ELECTRICITY</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><subject>STEREOPHONIC SYSTEMS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqFyzEKwkAQRuE0FqKewTmAghgxWEbReAD78JudxIXszrA7ub8K9lav-d68aGpVJNiUNxTYXuIyITrqJOhknEiTDAkhUy-J5Gnw0ceBssI8xu8DB8OymPUYM69-XRTr2_VxuW9ZpeWP7jiytc15f9xVp_JQl__FG8GHM2c</recordid><startdate>20221221</startdate><enddate>20221221</enddate><creator>Juha Tapio Vilkamo</creator><creator>Mikko Johannes Honkala</creator><scope>EVB</scope></search><sort><creationdate>20221221</creationdate><title>Apparatus, methods and computer programs for obtaining spatial metadata</title><author>Juha Tapio Vilkamo ; Mikko Johannes Honkala</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_GB2607934A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2022</creationdate><topic>ACOUSTICS</topic><topic>ELECTRIC COMMUNICATION TECHNIQUE</topic><topic>ELECTRICITY</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><topic>STEREOPHONIC SYSTEMS</topic><toplevel>online_resources</toplevel><creatorcontrib>Juha Tapio Vilkamo</creatorcontrib><creatorcontrib>Mikko Johannes Honkala</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Juha Tapio Vilkamo</au><au>Mikko Johannes Honkala</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Apparatus, methods and computer programs for obtaining spatial metadata</title><date>2022-12-21</date><risdate>2022</risdate><abstract>A machine learning model is trained to enable high quality spatial audio metadata to be obtained even from sub-optimal or low-quality microphone arrays. Input data for the machine learning model based on two or more microphone signals is determined (eg. by cross-correlation of delay or frequency data in the channels) and processed to obtain spatial metadata (eg. source direction or directionality) which is used in turn to render the signal.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_GB2607934A
source esp@cenet
subjects ACOUSTICS
ELECTRIC COMMUNICATION TECHNIQUE
ELECTRICITY
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
STEREOPHONIC SYSTEMS
title Apparatus, methods and computer programs for obtaining spatial metadata
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T14%3A29%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Juha%20Tapio%20Vilkamo&rft.date=2022-12-21&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EGB2607934A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true