Perceptual audio coding using sinusoidal/optimum wavelet representation

A perceptual audio coder, in which each audio segment is adaptively analyzed using either a sinusoidal or an optimum wavelet basis according to the time-varying characteristics of the audio signals, has been constructed. The basis optimization is achieved by a novel switched filter bank scheme, whic...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Circuits, systems, and signal processing systems, and signal processing, 2002-10, Vol.21 (5), p.511-524
Hauptverfasser:	SATHIDEVI, P. S, VENKATARAMANI, Y
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Audio signals Coding Coding standards Coding, codes Compression ratio Discrete cosine transform Discrete Wavelet Transform Exact sciences and technology Filter banks Information, signal and communications theory Motion pictures MPEG encoders Optimization Signal analysis Signal and communications theory Signal processing Sine waves Speech processing Switches Telecommunications and information theory Wavelet transforms
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A perceptual audio coder, in which each audio segment is adaptively analyzed using either a sinusoidal or an optimum wavelet basis according to the time-varying characteristics of the audio signals, has been constructed. The basis optimization is achieved by a novel switched filter bank scheme, which switches between a uniform filter bank structure (discrete cosine transform) and a non-uniform filter bank structure (discrete wavelet transform). A major artifact of the International ISO/Moving Pictures Experts Group (MPEG) audio coding standard (MPEG-I layers 1 and 2) known as pre-echo distortion which uses a uniform filter bank structure for audio signal analysis, is almost eliminated in the proposed coder. A perceptual masking model implemented using a high-resolution wavelet packet filter bank with 27 subbands, closely mimicking the critical bands of the human auditory system, is employed in this audio coder. The resulting scheme is a variable bit-rate audio coder, which provides compression ratios comparable to MPEG-I layers 1 and 2 with almost transparent quality.
ISSN:	0278-081X 1531-5878
DOI:	10.1007/s00034-002-0402-8