ProxIQA: A Proxy Approach to Perceptual Optimization of Learned Image Compression

The use of ℓ p (p = 1,2) norms has largely dominated the measurement of loss in neural networks due to their simplicity and analytical properties. However, when used to assess the loss of visual information, these simple norms are not very consistent with human perception. Here, we describe a differ...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on image processing 2021, Vol.30, p.360-373
Hauptverfasser:	Chen, Li-Heng, Bampis, Christos G., Li, Zhi, Norkin, Andrey, Bovik, Alan C.
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms convolutional neural networks Data Compression - methods deep compression Humans Image analysis Image coding Image compression Image Processing, Computer-Assisted - methods Image quality Indexes Loss measurement Neural networks Neural Networks, Computer Norms Optimization perceptual image/video quality Perceptual optimization Task analysis Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The use of ℓ p (p = 1,2) norms has largely dominated the measurement of loss in neural networks due to their simplicity and analytical properties. However, when used to assess the loss of visual information, these simple norms are not very consistent with human perception. Here, we describe a different "proximal" approach to optimize image analysis networks against quantitative perceptual models. Specifically, we construct a proxy network, broadly termed ProxIQA, which mimics the perceptual model while serving as a loss layer of the network. We experimentally demonstrate how this optimization framework can be applied to train an end-to-end optimized image compression network. By building on top of an existing deep image compression model, we are able to demonstrate a bitrate reduction of as much as 31% over MSE optimization, given a specified perceptual quality (VMAF) level.
ISSN:	1057-7149 1941-0042
DOI:	10.1109/TIP.2020.3036752