Using Deepfake Technologies for Word Emphasis Detection
In this work, we consider the task of automated emphasis detection for spoken language. This problem is challenging in that emphasis is affected by the particularities of speech of the subject, for example the subject accent, dialect or voice. To address this task, we propose to utilize deep fake te...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In this work, we consider the task of automated emphasis detection for spoken
language. This problem is challenging in that emphasis is affected by the
particularities of speech of the subject, for example the subject accent,
dialect or voice. To address this task, we propose to utilize deep fake
technology to produce an emphasis devoid speech for this speaker. This requires
extracting the text of the spoken voice, and then using a voice sample from the
same speaker to produce emphasis devoid speech for this task. By comparing the
generated speech with the spoken voice, we are able to isolate patterns of
emphasis which are relatively easy to detect. |
---|---|
DOI: | 10.48550/arxiv.2305.07791 |