AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND STORAGE MEDIUM

The present invention implements audio signal processing capable of extracting the voice of each speaker even when a plurality of speakers speaks at the same time. An audio signal processing device 400 is provided with: a determination unit 401 that determines a first voice segment for a target spea...

Detailed description

Bibliographic details
Main author: ARAKAWA Takayuki
Format: Patent
Language: eng ; fre ; ger
creator ARAKAWA Takayuki
description The present invention implements audio signal processing capable of extracting the voice of each speaker even when a plurality of speakers speaks at the same time. An audio signal processing device 400 is provided with: a determination unit 401 that determines a first voice segment for a target speaker linked to a host device on the basis of an externally acquired first audio signal; a sharing unit 402 that transmits the first audio signal and the first voice segment to another device linked to a non-target speaker and receives a second audio signal and a second voice segment associated with the non-target speaker from the other device; an estimation unit 403 that estimates the voice of the non-target speaker mixed in the first audio signal on the basis of the second audio signal and the second voice segment that are received and an estimation parameter associated with the target speaker that is acquired; and a removal unit 404 that removes the voice of the non-target speaker from the first audio signal to thus create a first voice from which the voice of the non-target speaker is removed.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_EP4036911A4</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EP4036911A4</sourcerecordid><originalsourceid>FETCH-epo_espacenet_EP4036911A43</originalsourceid><addsrcrecordid>eNrjZAhyDHXx9FcI9nT3c_RRCAjyd3YNDvb0c1dwcQ3zdHbVUcAl7-sa4uHvApT3c1EIDvEPcnR3BYq5eIb68jCwpiXmFKfyQmluBgU31xBnD93Ugvz41OKCxOTUvNSSeNcAEwNjM0tDQ0cTYyKUAABQCC7j</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND STORAGE MEDIUM</title><source>esp@cenet</source><creator>ARAKAWA Takayuki</creator><creatorcontrib>ARAKAWA Takayuki</creatorcontrib><description>The present invention implements audio signal processing capable of extracting the voice of each speaker even when a plurality of speakers speaks at the same time. An audio signal processing device 400 is provided with: a determination unit 401 that determines a first voice segment for a target speaker linked to a host device on the basis of an externally acquired first audio signal; a sharing unit 402 that transmits the first audio signal and the first voice segment to another device linked to a non-target speaker and receives a second audio signal and a second voice segment associated with the non-target speaker from the other device; an estimation unit 403 that estimates the voice of the non-target speaker mixed in the first audio signal on the basis of the second audio signal and the second voice segment that are received and an estimation parameter associated with the target speaker that is acquired; and a removal unit 404 that removes the voice of the non-target speaker from the first audio signal to thus create a first voice from which the voice of the non-target speaker is removed.</description><language>eng ; fre ; 
ger</language><subject>ACOUSTICS ; DEAF-AID SETS ; ELECTRIC COMMUNICATION TECHNIQUE ; ELECTRICITY ; LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKEACOUSTIC ELECTROMECHANICAL TRANSDUCERS ; MUSICAL INSTRUMENTS ; PHYSICS ; PUBLIC ADDRESS SYSTEMS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220928&amp;DB=EPODOC&amp;CC=EP&amp;NR=4036911A4$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220928&amp;DB=EPODOC&amp;CC=EP&amp;NR=4036911A4$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ARAKAWA Takayuki</creatorcontrib><title>AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND STORAGE MEDIUM</title><description>The present invention implements audio signal processing capable of extracting the voice of each speaker even when a plurality of speakers speaks at the same time. 
An audio signal processing device 400 is provided with: a determination unit 401 that determines a first voice segment for a target speaker linked to a host device on the basis of an externally acquired first audio signal; a sharing unit 402 that transmits the first audio signal and the first voice segment to another device linked to a non-target speaker and receives a second audio signal and a second voice segment associated with the non-target speaker from the other device; an estimation unit 403 that estimates the voice of the non-target speaker mixed in the first audio signal on the basis of the second audio signal and the second voice segment that are received and an estimation parameter associated with the target speaker that is acquired; and a removal unit 404 that removes the voice of the non-target speaker from the first audio signal to thus create a first voice from which the voice of the non-target speaker is removed.</description><subject>ACOUSTICS</subject><subject>DEAF-AID SETS</subject><subject>ELECTRIC COMMUNICATION TECHNIQUE</subject><subject>ELECTRICITY</subject><subject>LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKEACOUSTIC ELECTROMECHANICAL TRANSDUCERS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>PUBLIC ADDRESS SYSTEMS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZAhyDHXx9FcI9nT3c_RRCAjyd3YNDvb0c1dwcQ3zdHbVUcAl7-sa4uHvApT3c1EIDvEPcnR3BYq5eIb68jCwpiXmFKfyQmluBgU31xBnD93Ugvz41OKCxOTUvNSSeNcAEwNjM0tDQ0cTYyKUAABQCC7j</recordid><startdate>20220928</startdate><enddate>20220928</enddate><creator>ARAKAWA Takayuki</creator><scope>EVB</scope></search><sort><creationdate>20220928</creationdate><title>AUDIO SIGNAL 
PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND STORAGE MEDIUM</title><author>ARAKAWA Takayuki</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_EP4036911A43</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; ger</language><creationdate>2022</creationdate><topic>ACOUSTICS</topic><topic>DEAF-AID SETS</topic><topic>ELECTRIC COMMUNICATION TECHNIQUE</topic><topic>ELECTRICITY</topic><topic>LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKEACOUSTIC ELECTROMECHANICAL TRANSDUCERS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>PUBLIC ADDRESS SYSTEMS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>ARAKAWA Takayuki</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ARAKAWA Takayuki</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND STORAGE MEDIUM</title><date>2022-09-28</date><risdate>2022</risdate><abstract>The present invention implements audio signal processing capable of extracting the voice of each speaker even when a plurality of speakers speaks at the same time. 
An audio signal processing device 400 is provided with: a determination unit 401 that determines a first voice segment for a target speaker linked to a host device on the basis of an externally acquired first audio signal; a sharing unit 402 that transmits the first audio signal and the first voice segment to another device linked to a non-target speaker and receives a second audio signal and a second voice segment associated with the non-target speaker from the other device; an estimation unit 403 that estimates the voice of the non-target speaker mixed in the first audio signal on the basis of the second audio signal and the second voice segment that are received and an estimation parameter associated with the target speaker that is acquired; and a removal unit 404 that removes the voice of the non-target speaker from the first audio signal to thus create a first voice from which the voice of the non-target speaker is removed.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
language eng ; fre ; ger
recordid cdi_epo_espacenet_EP4036911A4
source esp@cenet
subjects ACOUSTICS
DEAF-AID SETS
ELECTRIC COMMUNICATION TECHNIQUE
ELECTRICITY
LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS
MUSICAL INSTRUMENTS
PHYSICS
PUBLIC ADDRESS SYSTEMS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title AUDIO SIGNAL PROCESSING DEVICE, AUDIO SIGNAL PROCESSING METHOD, AND STORAGE MEDIUM
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T00%3A36%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ARAKAWA%20Takayuki&rft.date=2022-09-28&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EEP4036911A4%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
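
The description field of this record names four units (determination 401, sharing 402, estimation 403, removal 404) but does not say how each one works. The following is a minimal illustrative sketch of that data flow, not the patented method. Every concrete choice in it is an assumption made for illustration: the energy-threshold voice detector, the single scalar gain standing in for the "estimation parameter", the time-domain subtraction, the frame size, and all function and variable names. The sharing unit is reduced to passing Python lists between functions.

```python
# Illustrative sketch only; all algorithms and names below are assumptions,
# not taken from the patent record.

FRAME = 4  # samples per frame (tiny, so the example is easy to follow)


def detect_voice_segments(signal, frame=FRAME, threshold=0.01):
    """Determination unit (401), assumed to be a per-frame energy threshold.

    Returns one boolean per frame: True where the frame is treated as voice.
    """
    flags = []
    for start in range(0, len(signal), frame):
        chunk = signal[start:start + frame]
        energy = sum(x * x for x in chunk) / len(chunk)
        flags.append(energy > threshold)
    return flags


def estimate_leakage(other_signal, other_flags, gain, frame=FRAME):
    """Estimation unit (403), assumed to scale the other device's signal.

    `gain` stands in for the estimation parameter associated with the
    target speaker. Samples outside the other speaker's voice segments
    contribute nothing to the estimate.
    """
    return [
        gain * x if other_flags[i // frame] else 0.0
        for i, x in enumerate(other_signal)
    ]


def remove_leakage(signal, leakage):
    """Removal unit (404), assumed to be a plain time-domain subtraction."""
    return [s - l for s, l in zip(signal, leakage)]


# Synthetic signals: the target speaks in frame 0, the other speaker in frame 1.
target_voice = [0.5, -0.5, 0.5, -0.5, 0.0, 0.0, 0.0, 0.0]
other_voice = [0.0, 0.0, 0.0, 0.0, 0.8, -0.8, 0.8, -0.8]
leak_gain = 0.25  # assumed crosstalk level into the target's microphone

# Microphone A (host device) picks up the target plus leaked non-target voice.
mic_a = [t + leak_gain * o for t, o in zip(target_voice, other_voice)]
# Microphone B (other device) is assumed to pick up only the non-target speaker.
mic_b = list(other_voice)

# Sharing unit (402), reduced here to handing over B's signal and segments.
flags_b = detect_voice_segments(mic_b)

leakage = estimate_leakage(mic_b, flags_b, leak_gain)
clean = remove_leakage(mic_a, leakage)

print("segments on raw mic A:  ", detect_voice_segments(mic_a))
print("segments after removal: ", detect_voice_segments(clean))
```

In this toy setup the raw signal from microphone A looks voiced in both frames because of crosstalk, while the cleaned signal is voiced only in the frame where the target actually speaks. A real system would need time alignment between devices and a frequency-dependent estimate, neither of which is modeled here.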