Segmental sinusoidal model for speech coding

Speech signal could be represented as a combination of sinusoidal signal with infinite combination of amplitude, frequency and phase. On quantization based on peak to peak, speech signal is detected its peaks, both of positive and negative. Then time distance between peak to peak would be quantized....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Setiawan, F.B., Sugihartono, Soegijoko, S., Tjondronegoro, S.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Speech signal could be represented as a combination of sinusoidal signal with infinite combination of amplitude, frequency and phase. On quantization based on peak to peak, speech signal is detected its peaks, both of positive and negative. Then time distance between peak to peak would be quantized. In this paper, we explain a new method to quantize the speech signal which is segmented into peak to peak based on sinusoidal modeling. The part of signal between positive peak and following negative peak or vice versa is estimated as a half period of sinusoidal signal. Magnitude between peaks is the double of the ed sine amplitude. The experiment result showed that synthesis signal quality is reduced on the high frequency interval. Human perception due to the synthesis signal is good enough, because of less sensitivity human perception above 1 kHz.
DOI:10.1109/ICICS.2007.4449788