Parametric Representation for Singing Voice Synthesis: a Comparative Evaluation
Various parametric representations have been proposed to model the speech signal. While the performance of such vocoders is well-known in the context of speech processing, their extrapolation to singing voice synthesis might not be straightforward. The goal of this paper is twofold. First, a compara...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Various parametric representations have been proposed to model the speech
signal. While the performance of such vocoders is well-known in the context of
speech processing, their extrapolation to singing voice synthesis might not be
straightforward. The goal of this paper is twofold. First, a comparative
subjective evaluation is performed across four existing techniques suitable for
statistical parametric synthesis: traditional pulse vocoder, Deterministic plus
Stochastic Model, Harmonic plus Noise Model and GlottHMM. The behavior of these
techniques as a function of the singer type (baritone, counter-tenor and
soprano) is studied. Secondly, the artifacts occurring in high-pitched voices
are discussed and possible approaches to overcome them are suggested. |
---|---|
DOI: | 10.48550/arxiv.2006.04142 |