SPEECH CODING METHOD USING SYNTHESIS ANALYSIS

A method comprising the steps of performing a linear prediction analysis of a speech signal (S) digitised in a series of frames divided into sub-frames, in order to determine the parameters of a short-term synthesis filter; carrying out an open loop analysis to detect voiced signal frames and determ...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	MAUC, MICHEL, NAVARRO, WILLIAM
Format:	Patent
Sprache:	eng ; fre
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A method comprising the steps of performing a linear prediction analysis of a speech signal (S) digitised in a series of frames divided into sub-frames, in order to determine the parameters of a short-term synthesis filter; carrying out an open loop analysis to detect voiced signal frames and determine, for each voiced frame, a degree of signal voicing (MV) and a long-term prediction delay search interval containing a number of delays depending on the degree of voicing; carrying out a closed-loop predictive analysis of the speech signal to select, for at least some sub-frames of the voiced frames, a long-term prediction delay contained in the search interval and constituting a long-term synthesis filter parameter; and determining a stochastic excitation for each sub-frame, to minimise a perceptually weighted deviation between the speech signal and the stochastic excitation filtered by the long-term and short-term synthesis filters. Le procédé comprend les étapes suivantes: analyse par prédiction linéaire du signal de parole (S) numérisé en trames successives divisées en sous-trames, pour déterminer des paramètres d'un filtre de synthèse à court terme; analyse en boucle ouverte pour détecter les trames voisées du signal et pour déterminer, pour chaque trame voisée, un degré de voisement du signal (MV) et un intervalle de recherche d'un retard de prédiction à long terme contenant un nombre de retards dépendant du degré de voisement; analyse prédictive en boucle fermée du signal de parole pour sélectionner, pour certaines au moins des sous-trames des trames voisées, un retard de prédiction à long terme contenu dans l'intervalle de recherche et constituant un paramètre d'un filtre de synthèse à long terme; et détermination d'une excitation stochastique pour chaque sous-trame, de façon à minimiser un écart pondéré perceptuellement entre le signal de parole et l'excitation stochastique filtrée par les filtres de synthèse à long terme et à court terme.