Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding
Voice anonymization has been developed as a technique for preserving privacy by replacing the speaker's voice in a speech signal with that of a pseudo-speaker, thereby obscuring the original voice attributes from machine recognition and human perception. In this paper, we focus on altering the...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Voice anonymization has been developed as a technique for preserving privacy
by replacing the speaker's voice in a speech signal with that of a
pseudo-speaker, thereby obscuring the original voice attributes from machine
recognition and human perception. In this paper, we focus on altering the voice
attributes against machine recognition while retaining human perception. We
referred to this as the asynchronous voice anonymization. To this end, a speech
generation framework incorporating a speaker disentanglement mechanism is
employed to generate the anonymized speech. The speaker attributes are altered
through adversarial perturbation applied on the speaker embedding, while human
perception is preserved by controlling the intensity of perturbation.
Experiments conducted on the LibriSpeech dataset showed that the speaker
attributes were obscured with their human perception preserved for 60.71% of
the processed utterances. |
---|---|
DOI: | 10.48550/arxiv.2406.08200 |