DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS

Disclosed herein are method, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation...

Bibliographic details
Main authors:	MA, Yuanxing ; CENGARLE, Giulio
Format:	Patent
Language:	eng ; fre
Subjects:	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION

creator	MA, Yuanxing ; CENGARLE, Giulio
description	Disclosed herein are method, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech enhancement chain, where independent settings are used to process the self- and external speech parts.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_WO2022155205A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>WO2022155205A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_WO2022155205A13</originalsourceid><addsrcrecordid>eNrjZLBycQ1xdQ7x9PdTcPRzUXD183D0c3b1dfULUfB3UwgOcHV19lDw9FNw8vRzDA1y9FEIcnX2D3Lx9HMP5mFgTUvMKU7lhdLcDMpuriHOHrqpBfnxqcUFicmpeakl8eH-RgZGRoampkYGpo6GxsSpAgAzFimb</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS</title><source>esp@cenet</source><creator>MA, Yuanxing ; CENGARLE, Giulio</creator><creatorcontrib>MA, Yuanxing ; CENGARLE, Giulio</creatorcontrib><description>Disclosed herein are method, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech enhancement chain, where independent settings are used to process the self- and external speech parts. La présente invention concerne un procédé, des systèmes et des produits programmes d'ordinateur destinés à segmenter un enregistrement binaural de discours en parties contenant de l'auto-discours et en parties contenant un discours extérieur, et à traiter chaque catégorie avec différents paramètres, pour obtenir une présentation globale améliorée. La segmentation est basée sur une combinaison de : i) classification trame par trame basée sur des caractéristiques, et ii) détection d'une dissimilarité par des procédés statistiques. 
Les informations de segmentation sont ensuite utilisées par une chaîne d'amélioration du discours, où des paramètres indépendants sont utilisés pour traiter les parties d'auto-discours et de discours extérieur.</description><language>eng ; fre</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220721&DB=EPODOC&CC=WO&NR=2022155205A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25543,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220721&DB=EPODOC&CC=WO&NR=2022155205A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>MA, Yuanxing</creatorcontrib><creatorcontrib>CENGARLE, Giulio</creatorcontrib><title>DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS</title><description>Disclosed herein are method, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech enhancement chain, where independent settings are used to process the self- and external speech parts. 
La présente invention concerne un procédé, des systèmes et des produits programmes d'ordinateur destinés à segmenter un enregistrement binaural de discours en parties contenant de l'auto-discours et en parties contenant un discours extérieur, et à traiter chaque catégorie avec différents paramètres, pour obtenir une présentation globale améliorée. La segmentation est basée sur une combinaison de : i) classification trame par trame basée sur des caractéristiques, et ii) détection d'une dissimilarité par des procédés statistiques. Les informations de segmentation sont ensuite utilisées par une chaîne d'amélioration du discours, où des paramètres indépendants sont utilisés pour traiter les parties d'auto-discours et de discours extérieur.</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZLBycQ1xdQ7x9PdTcPRzUXD183D0c3b1dfULUfB3UwgOcHV19lDw9FNw8vRzDA1y9FEIcnX2D3Lx9HMP5mFgTUvMKU7lhdLcDMpuriHOHrqpBfnxqcUFicmpeakl8eH-RgZGRoampkYGpo6GxsSpAgAzFimb</recordid><startdate>20220721</startdate><enddate>20220721</enddate><creator>MA, Yuanxing</creator><creator>CENGARLE, Giulio</creator><scope>EVB</scope></search><sort><creationdate>20220721</creationdate><title>DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS</title><author>MA, Yuanxing ; CENGARLE, Giulio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_WO2022155205A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre</language><creationdate>2022</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR 
SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>MA, Yuanxing</creatorcontrib><creatorcontrib>CENGARLE, Giulio</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>MA, Yuanxing</au><au>CENGARLE, Giulio</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS</title><date>2022-07-21</date><risdate>2022</risdate><abstract>Disclosed herein are method, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech enhancement chain, where independent settings are used to process the self- and external speech parts. La présente invention concerne un procédé, des systèmes et des produits programmes d'ordinateur destinés à segmenter un enregistrement binaural de discours en parties contenant de l'auto-discours et en parties contenant un discours extérieur, et à traiter chaque catégorie avec différents paramètres, pour obtenir une présentation globale améliorée. La segmentation est basée sur une combinaison de : i) classification trame par trame basée sur des caractéristiques, et ii) détection d'une dissimilarité par des procédés statistiques. 
Les informations de segmentation sont ensuite utilisées par une chaîne d'amélioration du discours, où des paramètres indépendants sont utilisés pour traiter les parties d'auto-discours et de discours extérieur.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
language	eng ; fre
recordid	cdi_epo_espacenet_WO2022155205A1
source	esp@cenet
subjects	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION
title	DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T18%3A49%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=MA,%20Yuanxing&rft.date=2022-07-21&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EWO2022155205A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true