Speech synthesis from phonemic transcription

In this paper we will describe the portion of the text-to-speech conversion system which accepts a phonetic transcription, stress, and timing information as its input, and outputs the corresponding speech wave. The synthesis program is a form of dyadic concatenation of LPC area segments obtained fro...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	The Journal of the Acoustical Society of America 1978-11, Vol.64 (S1), p.S163-S163
1. Verfasser:	Olive, Joseph P.
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper we will describe the portion of the text-to-speech conversion system which accepts a phonetic transcription, stress, and timing information as its input, and outputs the corresponding speech wave. The synthesis program is a form of dyadic concatenation of LPC area segments obtained from natural speech; in addition, the program also calculates the amplitude and intonation. The dyadic concatenation is performed using a matrix of phoneme transitions to obtain the LPC area parameters; the amplitude is obtained by rules which depend on the class of the phonemes involved in the amplitude computation. The fundamental frequency contour is obtained by an adjustment of a stored fundamental frequency contour to fit the utterance to be synthesized. This adjustment is performed by selecting the portion of the stored fundamental frequency contour which fits the number of words and their function in the utterance. The location of the peak of the contour for each word in the utterance is then adjusted to fall at the location of the primary stress.
ISSN:	0001-4966 1520-8524
DOI:	10.1121/1.2003967