An extended context-based entropy hybrid modeling for image compression

Recently deep learning has been introduced to the field of image compression. In this paper, we present a hybrid coding framework that combines entropy coding, deep learning, and traditional coding framework. In the base layer of the encoding, we use convolutional neural networks to learn the latent...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Signal processing. Image communication 2021-07, Vol.95, p.116244, Article 116244
Hauptverfasser: Fu, Haisheng, Liang, Feng, Lei, Bo, Zhang, Qian, Liang, Jie, Tu, Chengjie, Zhang, Guohe
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Recently deep learning has been introduced to the field of image compression. In this paper, we present a hybrid coding framework that combines entropy coding, deep learning, and traditional coding framework. In the base layer of the encoding, we use convolutional neural networks to learn the latent representation and importance map of the original image respectively. The importance map is then used to guide the bit allocation of the latent representation. A context model is also developed to help the entropy coding after the masked quantization. Another network is used to get a coarse reconstruction of the image in the base layer. The residual between the input and the coarse reconstruction is then obtained and encoded by the traditional BPG codec as the enhancement layer of the bit stream. We only need to train a basic model and the proposed scheme can realize image compression at different bit rates, thanks to the use of the traditional codec. Experimental results using the Kodak, Urban100 and BSD100 datasets show that the proposed scheme outperforms many deep learning-based methods and traditional codecs including BPG in MS-SSIM metric across a wide range of bit rates. It also exceeds some latest hybrid schemes in RGB444 domain on Kodak dataset in both PSNR and MS-SSIM metrics. [Display omitted] •An Extended Context-Based Entropy Hybrid Modeling for Image Compression.•Improved autoencoder architecture.•Outperforms the state-of-the-art deep learning-based layered coding scheme and traditional codecs such as BPG in RGB444 space.
ISSN:0923-5965
1879-2677
DOI:10.1016/j.image.2021.116244