Spatially adaptive image compression using a tiled deep network

International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit ra...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Minnen, David, Toderici, George, Covell, Michele, Chinen, Troy, Johnston, Nick, Shor, Joel, Hwang, Sung Jin, Vincent, Damien, Singh, Saurabh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Minnen, David
Toderici, George
Covell, Michele
Chinen, Troy
Johnston, Nick
Shor, Joel
Hwang, Sung Jin
Vincent, Damien
Singh, Saurabh
description International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit rate across each image. While entropy coding introduces some spatial variation, traditional codecs have benefited significantly by explicitly adapting the bit rate based on local image complexity and visual saliency. This paper introduces an algorithm that combines deep neural networks with quality-sensitive bit rate adaptation using a tiled network. We demonstrate the importance of spatial context prediction and show improved quantitative (PSNR) and qualitative (subjective rater assessment) results compared to a non-adaptive baseline and a recently published image compression model based on fully-convolutional neural networks.
doi_str_mv 10.48550/arxiv.1802.02629
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1802_02629</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1802_02629</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-3d54c705e6b8f20d564be1f5dd99d6e1b00d8ae01a7cde21b1e15a44f205a73</originalsourceid><addsrcrecordid>eNotj8tOwzAQRb1hgUo_gBX-gYSxYzvJCqEK2kqVkIB9NMlMKos0sZzQx9_3Aau7Obo6R4hHBakprIVnjEe_T1UBOgXtdHkvXr4CTh677iSRMEx-z9LvcMuyGXYh8jj6oZe_o--3EuXkOyZJzEH2PB2G-PMg7lrsRp7_70x8vr99L1bJ5mO5XrxuEnR5mWRkTZODZVcXrQayztSsWktUluRY1QBUIIPCvCHWqlasLBpzYS3m2Uw8_Z3e_KsQL4bxVF07qltHdgacskOe</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Spatially adaptive image compression using a tiled deep network</title><source>arXiv.org</source><creator>Minnen, David ; Toderici, George ; Covell, Michele ; Chinen, Troy ; Johnston, Nick ; Shor, Joel ; Hwang, Sung Jin ; Vincent, Damien ; Singh, Saurabh</creator><creatorcontrib>Minnen, David ; Toderici, George ; Covell, Michele ; Chinen, Troy ; Johnston, Nick ; Shor, Joel ; Hwang, Sung Jin ; Vincent, Damien ; Singh, Saurabh</creatorcontrib><description>International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit rate across each image. While entropy coding introduces some spatial variation, traditional codecs have benefited significantly by explicitly adapting the bit rate based on local image complexity and visual saliency. This paper introduces an algorithm that combines deep neural networks with quality-sensitive bit rate adaptation using a tiled network. We demonstrate the importance of spatial context prediction and show improved quantitative (PSNR) and qualitative (subjective rater assessment) results compared to a non-adaptive baseline and a recently published image compression model based on fully-convolutional neural networks.</description><identifier>DOI: 10.48550/arxiv.1802.02629</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2018-02</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1802.02629$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1802.02629$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Minnen, David</creatorcontrib><creatorcontrib>Toderici, George</creatorcontrib><creatorcontrib>Covell, Michele</creatorcontrib><creatorcontrib>Chinen, Troy</creatorcontrib><creatorcontrib>Johnston, Nick</creatorcontrib><creatorcontrib>Shor, Joel</creatorcontrib><creatorcontrib>Hwang, Sung Jin</creatorcontrib><creatorcontrib>Vincent, Damien</creatorcontrib><creatorcontrib>Singh, Saurabh</creatorcontrib><title>Spatially adaptive image compression using a tiled deep network</title><description>International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit rate across each image. While entropy coding introduces some spatial variation, traditional codecs have benefited significantly by explicitly adapting the bit rate based on local image complexity and visual saliency. This paper introduces an algorithm that combines deep neural networks with quality-sensitive bit rate adaptation using a tiled network. We demonstrate the importance of spatial context prediction and show improved quantitative (PSNR) and qualitative (subjective rater assessment) results compared to a non-adaptive baseline and a recently published image compression model based on fully-convolutional neural networks.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAQRb1hgUo_gBX-gYSxYzvJCqEK2kqVkIB9NMlMKos0sZzQx9_3Aau7Obo6R4hHBakprIVnjEe_T1UBOgXtdHkvXr4CTh677iSRMEx-z9LvcMuyGXYh8jj6oZe_o--3EuXkOyZJzEH2PB2G-PMg7lrsRp7_70x8vr99L1bJ5mO5XrxuEnR5mWRkTZODZVcXrQayztSsWktUluRY1QBUIIPCvCHWqlasLBpzYS3m2Uw8_Z3e_KsQL4bxVF07qltHdgacskOe</recordid><startdate>20180207</startdate><enddate>20180207</enddate><creator>Minnen, David</creator><creator>Toderici, George</creator><creator>Covell, Michele</creator><creator>Chinen, Troy</creator><creator>Johnston, Nick</creator><creator>Shor, Joel</creator><creator>Hwang, Sung Jin</creator><creator>Vincent, Damien</creator><creator>Singh, Saurabh</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20180207</creationdate><title>Spatially adaptive image compression using a tiled deep network</title><author>Minnen, David ; Toderici, George ; Covell, Michele ; Chinen, Troy ; Johnston, Nick ; Shor, Joel ; Hwang, Sung Jin ; Vincent, Damien ; Singh, Saurabh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-3d54c705e6b8f20d564be1f5dd99d6e1b00d8ae01a7cde21b1e15a44f205a73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Minnen, David</creatorcontrib><creatorcontrib>Toderici, George</creatorcontrib><creatorcontrib>Covell, Michele</creatorcontrib><creatorcontrib>Chinen, Troy</creatorcontrib><creatorcontrib>Johnston, Nick</creatorcontrib><creatorcontrib>Shor, Joel</creatorcontrib><creatorcontrib>Hwang, Sung Jin</creatorcontrib><creatorcontrib>Vincent, Damien</creatorcontrib><creatorcontrib>Singh, Saurabh</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Minnen, David</au><au>Toderici, George</au><au>Covell, Michele</au><au>Chinen, Troy</au><au>Johnston, Nick</au><au>Shor, Joel</au><au>Hwang, Sung Jin</au><au>Vincent, Damien</au><au>Singh, Saurabh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Spatially adaptive image compression using a tiled deep network</atitle><date>2018-02-07</date><risdate>2018</risdate><abstract>International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit rate across each image. While entropy coding introduces some spatial variation, traditional codecs have benefited significantly by explicitly adapting the bit rate based on local image complexity and visual saliency. This paper introduces an algorithm that combines deep neural networks with quality-sensitive bit rate adaptation using a tiled network. We demonstrate the importance of spatial context prediction and show improved quantitative (PSNR) and qualitative (subjective rater assessment) results compared to a non-adaptive baseline and a recently published image compression model based on fully-convolutional neural networks.</abstract><doi>10.48550/arxiv.1802.02629</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1802.02629
ispartof
issn
language eng
recordid cdi_arxiv_primary_1802_02629
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title Spatially adaptive image compression using a tiled deep network
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T04%3A28%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Spatially%20adaptive%20image%20compression%20using%20a%20tiled%20deep%20network&rft.au=Minnen,%20David&rft.date=2018-02-07&rft_id=info:doi/10.48550/arxiv.1802.02629&rft_dat=%3Carxiv_GOX%3E1802_02629%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true