METHOD AND AUDIO PROCESSING DEVICE FOR VOICE ANONYMIZATION
A method and audio processing device for voice anonymization in an audio- or videoconferencing session. The method comprises receiving a plurality of input audio samples comprising speech, calculating a frequency spectrum of each of the plurality of input audio samples, calculating a smoothed spectral...
Main author: | |
---|---|
Format: | Patent |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Hvidsten, Knut Inge |
description | A method and audio processing device for voice anonymization in an audio- or videoconferencing session. The method comprises receiving a plurality of input audio samples comprising speech, calculating a frequency spectrum of each of the plurality of input audio samples, calculating a smoothed spectral magnitude envelope of a first of the plurality of frequency spectrums to determine a plurality of formant features of the speech, each of the plurality of formant features being located at different frequencies in the frequency spectrum, determining one random scaling factor for the audio- or videoconferencing session, determining, based on the one random scaling factor, a voice anonymization function shifting the formant location of at least one of the plurality of formants, and applying the voice anonymization function on the frequency spectrum of each of the subsequent plurality of input audio samples in the audio- or videoconferencing session. |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2024005936A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2024005936A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2024005936A13</originalsourceid><addsrcrecordid>eNrjZLDydQ3x8HdRcPQD4lAXT3-FgCB_Z9fgYE8_dwUX1zBPZ1cFN_8ghTB_EMvRz98v0tczyjHE09-Ph4E1LTGnOJUXSnMzKLu5hjh76KYW5MenFhckJqfmpZbEhwYbGRiZGBiYWhqbORoaE6cKAFEQKfM</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>METHOD AND AUDIO PROCESSING DEVICE FOR VOICE ANONYMIZATION</title><source>esp@cenet</source><creator>Hvidsten, Knut Inge</creator><creatorcontrib>Hvidsten, Knut Inge</creatorcontrib><description>A method and audio processing device for voice anonymization in an audio- or videoconferencing session. The method comprises receiving a plurality of input audio samples comprising speech, calculating a frequency spectrum of each of the plurality of input audio samples, calculating a smoothed spectral magnitude envelope of a first of the plurality of frequency spectrums to determine a plurality of formant features of the speech, each of the plurality of formant features being located at different frequencies in the frequency spectrum, determining one random scaling factor for the audio- or videoconferencing session, determining, based on the one random scaling factor, a voice anonymization function shifting the formant location of at least one of the plurality of formants, and applying the voice anonymization function on the frequency spectrum of each of the subsequent plurality of input audio samples in the audio- or videoconferencing session.</description><language>eng</language><subject>ACOUSTICS ; ELECTRIC COMMUNICATION TECHNIQUE ; ELECTRICITY ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION ; TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION</subject><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240104&DB=EPODOC&CC=US&NR=2024005936A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25543,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240104&DB=EPODOC&CC=US&NR=2024005936A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Hvidsten, Knut Inge</creatorcontrib><title>METHOD AND AUDIO PROCESSING DEVICE FOR VOICE ANONYMIZATION</title><description>A method and audio processing device for voice anonymization in an audio- or videoconferencing session. The method comprises receiving a plurality of input audio samples comprising speech, calculating a frequency spectrum of each of the plurality of input audio samples, calculating a smoothed spectral magnitude envelope of a first of the plurality of frequency spectrums to determine a plurality of formant features of the speech, each of the plurality of formant features being located at different frequencies in the frequency spectrum, determining one random scaling factor for the audio- or videoconferencing session, determining, based on the one random scaling factor, a voice anonymization function shifting the formant location of at least one of the plurality of formants, and applying the voice anonymization function on the frequency spectrum of each of the subsequent plurality of input audio samples in the audio- or videoconferencing session.</description><subject>ACOUSTICS</subject><subject>ELECTRIC COMMUNICATION TECHNIQUE</subject><subject>ELECTRICITY</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><subject>TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2024</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZLDydQ3x8HdRcPQD4lAXT3-FgCB_Z9fgYE8_dwUX1zBPZ1cFN_8ghTB_EMvRz98v0tczyjHE09-Ph4E1LTGnOJUXSnMzKLu5hjh76KYW5MenFhckJqfmpZbEhwYbGRiZGBiYWhqbORoaE6cKAFEQKfM</recordid><startdate>20240104</startdate><enddate>20240104</enddate><creator>Hvidsten, Knut Inge</creator><scope>EVB</scope></search><sort><creationdate>20240104</creationdate><title>METHOD AND AUDIO PROCESSING DEVICE FOR VOICE ANONYMIZATION</title><author>Hvidsten, Knut Inge</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2024005936A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2024</creationdate><topic>ACOUSTICS</topic><topic>ELECTRIC COMMUNICATION TECHNIQUE</topic><topic>ELECTRICITY</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><topic>TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION</topic><toplevel>online_resources</toplevel><creatorcontrib>Hvidsten, Knut Inge</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hvidsten, Knut Inge</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>METHOD AND AUDIO PROCESSING DEVICE FOR VOICE ANONYMIZATION</title><date>2024-01-04</date><risdate>2024</risdate><abstract>A method and audio processing device for voice anonymization in an audio- or videoconferencing session. The method comprises receiving a plurality of input audio samples comprising speech, calculating a frequency spectrum of each of the plurality of input audio samples, calculating a smoothed spectral magnitude envelope of a first of the plurality of frequency spectrums to determine a plurality of formant features of the speech, each of the plurality of formant features being located at different frequencies in the frequency spectrum, determining one random scaling factor for the audio- or videoconferencing session, determining, based on the one random scaling factor, a voice anonymization function shifting the formant location of at least one of the plurality of formants, and applying the voice anonymization function on the frequency spectrum of each of the subsequent plurality of input audio samples in the audio- or videoconferencing session.</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | eng |
recordid | cdi_epo_espacenet_US2024005936A1 |
source | esp@cenet |
subjects | ACOUSTICS; ELECTRIC COMMUNICATION TECHNIQUE; ELECTRICITY; MUSICAL INSTRUMENTS; PHYSICS; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION; TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION |
title | METHOD AND AUDIO PROCESSING DEVICE FOR VOICE ANONYMIZATION |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T20%3A05%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Hvidsten,%20Knut%20Inge&rft.date=2024-01-04&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2024005936A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
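
The steps named in the description (frequency spectrum, smoothed magnitude envelope, formant locations, one random scaling factor per session, a function that shifts formant locations) can be illustrated with a minimal Python sketch. It is not the patented implementation. The 3-tap smoothing, the peak threshold, the 0.8 to 1.25 factor range, the linear frequency warp, and every function name are assumptions made for illustration. The warp moves all spectral content rather than only the envelope, and phase handling and resynthesis are left out.

```python
import cmath
import math
import random


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short demo frames)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def smoothed_envelope(mag):
    """3-tap [0.25, 0.5, 0.25] smoothing; the smoothing method is an assumption."""
    last = len(mag) - 1
    return [
        0.25 * mag[max(k - 1, 0)] + 0.5 * mag[k] + 0.25 * mag[min(k + 1, last)]
        for k in range(len(mag))
    ]


def formant_bins(envelope, rel_threshold=0.1):
    """Local maxima above a relative threshold, treated as formant locations."""
    floor = rel_threshold * max(envelope)
    return [
        k for k in range(1, len(envelope) - 1)
        if envelope[k] > floor and envelope[k - 1] < envelope[k] >= envelope[k + 1]
    ]


def draw_session_factor(rng, low=0.8, high=1.25):
    """One random scaling factor per session; the range is hypothetical."""
    return rng.uniform(low, high)


def make_anonymizer(alpha):
    """Return a function that moves content at bin k to roughly bin alpha * k."""
    def anonymize(mag):
        last = len(mag) - 1
        out = []
        for k in range(len(mag)):
            src = k / alpha              # source position for output bin k
            if src > last:
                out.append(0.0)          # nothing to read beyond the top bin
                continue
            lo = int(src)
            hi = min(lo + 1, last)
            frac = src - lo
            out.append((1 - frac) * mag[lo] + frac * mag[hi])  # linear interpolation
        return out
    return anonymize


# Demo: a 64-sample tone at bin 8 stands in for a single formant.
n = 64
frame = [math.sin(2 * math.pi * 8 * i / n) for i in range(n)]
first_spectrum = magnitude_spectrum(frame)
formants = formant_bins(smoothed_envelope(first_spectrum))

alpha = 1.25  # fixed here so the demo is repeatable
anonymize = make_anonymizer(alpha)
shifted = anonymize(first_spectrum)
shifted_formants = formant_bins(smoothed_envelope(shifted))
print(formants, shifted_formants)
```

In a real session the factor would be drawn once, for example with `draw_session_factor(random.Random())`, and the same `anonymize` function would then be reused for every later frame so that the shifted voice stays consistent throughout the call.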