Speech synthesis method and device, and medium

The invention provides a speech synthesis method and device and a medium, and relates to the field of artificial intelligence, and the speech synthesis method comprises the steps: obtaining to-be-synthesized phoneme information; processing the phoneme information by using a non-autoregressive acoust...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ZHONG RONGXIU, YANG HUIBAO, LIU YING, ZHANG SHILEI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention provides a speech synthesis method and device and a medium, and relates to the field of artificial intelligence, and the speech synthesis method comprises the steps: obtaining to-be-synthesized phoneme information; processing the phoneme information by using a non-autoregressive acoustic model to obtain first Mel spectrum information corresponding to the phoneme information; and synthesizing a target voice according to the first Mel spectrum information. In the speech synthesis process, the non-autoregressive acoustic model is specifically adopted to process phoneme information and obtain the corresponding Mel spectrum, the parallel capability of a processor can be fully utilized, then the synthesis speed can be increased, error accumulation and error transmission are reduced, and the speech synthesis robustness is improved while the speech synthesis speed is increased. 本发明提供一种语音合成方法、设备及介质,涉及人工智能领域,其中,所述语音合成方法包括:获取待合成的音素信息;利用非自回归声学模型处理所述音素信息,获取所述音素信息对应的第一梅尔频谱信息;根据所述第一梅尔频谱信息,合成目标语音。在语音合成过程中,具体采用非