Spatially adaptive image compression using a tiled deep network

International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit ra...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Minnen, David, Toderici, George, Covell, Michele, Chinen, Troy, Johnston, Nick, Shor, Joel, Hwang, Sung Jin, Vincent, Damien, Singh, Saurabh
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Minnen, David Toderici, George Covell, Michele Chinen, Troy Johnston, Nick Shor, Joel Hwang, Sung Jin Vincent, Damien Singh, Saurabh
description	International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit rate across each image. While entropy coding introduces some spatial variation, traditional codecs have benefited significantly by explicitly adapting the bit rate based on local image complexity and visual saliency. This paper introduces an algorithm that combines deep neural networks with quality-sensitive bit rate adaptation using a tiled network. We demonstrate the importance of spatial context prediction and show improved quantitative (PSNR) and qualitative (subjective rater assessment) results compared to a non-adaptive baseline and a recently published image compression model based on fully-convolutional neural networks.
doi_str_mv	10.48550/arxiv.1802.02629
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1802_02629</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1802_02629</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-3d54c705e6b8f20d564be1f5dd99d6e1b00d8ae01a7cde21b1e15a44f205a73</originalsourceid><addsrcrecordid>eNotj8tOwzAQRb1hgUo_gBX-gYSxYzvJCqEK2kqVkIB9NMlMKos0sZzQx9_3Aau7Obo6R4hHBakprIVnjEe_T1UBOgXtdHkvXr4CTh677iSRMEx-z9LvcMuyGXYh8jj6oZe_o--3EuXkOyZJzEH2PB2G-PMg7lrsRp7_70x8vr99L1bJ5mO5XrxuEnR5mWRkTZODZVcXrQayztSsWktUluRY1QBUIIPCvCHWqlasLBpzYS3m2Uw8_Z3e_KsQL4bxVF07qltHdgacskOe</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Spatially adaptive image compression using a tiled deep network</title><source>arXiv.org</source><creator>Minnen, David ; Toderici, George ; Covell, Michele ; Chinen, Troy ; Johnston, Nick ; Shor, Joel ; Hwang, Sung Jin ; Vincent, Damien ; Singh, Saurabh</creator><creatorcontrib>Minnen, David ; Toderici, George ; Covell, Michele ; Chinen, Troy ; Johnston, Nick ; Shor, Joel ; Hwang, Sung Jin ; Vincent, Damien ; Singh, Saurabh</creatorcontrib><description>International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit rate across each image. While entropy coding introduces some spatial variation, traditional codecs have benefited significantly by explicitly adapting the bit rate based on local image complexity and visual saliency. This paper introduces an algorithm that combines deep neural networks with quality-sensitive bit rate adaptation using a tiled network. We demonstrate the importance of spatial context prediction and show improved quantitative (PSNR) and qualitative (subjective rater assessment) results compared to a non-adaptive baseline and a recently published image compression model based on fully-convolutional neural networks.</description><identifier>DOI: 10.48550/arxiv.1802.02629</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2018-02</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1802.02629$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1802.02629$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Minnen, David</creatorcontrib><creatorcontrib>Toderici, George</creatorcontrib><creatorcontrib>Covell, Michele</creatorcontrib><creatorcontrib>Chinen, Troy</creatorcontrib><creatorcontrib>Johnston, Nick</creatorcontrib><creatorcontrib>Shor, Joel</creatorcontrib><creatorcontrib>Hwang, Sung Jin</creatorcontrib><creatorcontrib>Vincent, Damien</creatorcontrib><creatorcontrib>Singh, Saurabh</creatorcontrib><title>Spatially adaptive image compression using a tiled deep network</title><description>International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit rate across each image. While entropy coding introduces some spatial variation, traditional codecs have benefited significantly by explicitly adapting the bit rate based on local image complexity and visual saliency. This paper introduces an algorithm that combines deep neural networks with quality-sensitive bit rate adaptation using a tiled network. We demonstrate the importance of spatial context prediction and show improved quantitative (PSNR) and qualitative (subjective rater assessment) results compared to a non-adaptive baseline and a recently published image compression model based on fully-convolutional neural networks.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAQRb1hgUo_gBX-gYSxYzvJCqEK2kqVkIB9NMlMKos0sZzQx9_3Aau7Obo6R4hHBakprIVnjEe_T1UBOgXtdHkvXr4CTh677iSRMEx-z9LvcMuyGXYh8jj6oZe_o--3EuXkOyZJzEH2PB2G-PMg7lrsRp7_70x8vr99L1bJ5mO5XrxuEnR5mWRkTZODZVcXrQayztSsWktUluRY1QBUIIPCvCHWqlasLBpzYS3m2Uw8_Z3e_KsQL4bxVF07qltHdgacskOe</recordid><startdate>20180207</startdate><enddate>20180207</enddate><creator>Minnen, David</creator><creator>Toderici, George</creator><creator>Covell, Michele</creator><creator>Chinen, Troy</creator><creator>Johnston, Nick</creator><creator>Shor, Joel</creator><creator>Hwang, Sung Jin</creator><creator>Vincent, Damien</creator><creator>Singh, Saurabh</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20180207</creationdate><title>Spatially adaptive image compression using a tiled deep network</title><author>Minnen, David ; Toderici, George ; Covell, Michele ; Chinen, Troy ; Johnston, Nick ; Shor, Joel ; Hwang, Sung Jin ; Vincent, Damien ; Singh, Saurabh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-3d54c705e6b8f20d564be1f5dd99d6e1b00d8ae01a7cde21b1e15a44f205a73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Minnen, David</creatorcontrib><creatorcontrib>Toderici, George</creatorcontrib><creatorcontrib>Covell, Michele</creatorcontrib><creatorcontrib>Chinen, Troy</creatorcontrib><creatorcontrib>Johnston, Nick</creatorcontrib><creatorcontrib>Shor, Joel</creatorcontrib><creatorcontrib>Hwang, Sung Jin</creatorcontrib><creatorcontrib>Vincent, Damien</creatorcontrib><creatorcontrib>Singh, Saurabh</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Minnen, David</au><au>Toderici, George</au><au>Covell, Michele</au><au>Chinen, Troy</au><au>Johnston, Nick</au><au>Shor, Joel</au><au>Hwang, Sung Jin</au><au>Vincent, Damien</au><au>Singh, Saurabh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Spatially adaptive image compression using a tiled deep network</atitle><date>2018-02-07</date><risdate>2018</risdate><abstract>International Conference on Image Processing 2017 Deep neural networks represent a powerful class of function approximators that can learn to compress and reconstruct images. Existing image compression algorithms based on neural networks learn quantized representations with a constant spatial bit rate across each image. While entropy coding introduces some spatial variation, traditional codecs have benefited significantly by explicitly adapting the bit rate based on local image complexity and visual saliency. This paper introduces an algorithm that combines deep neural networks with quality-sensitive bit rate adaptation using a tiled network. We demonstrate the importance of spatial context prediction and show improved quantitative (PSNR) and qualitative (subjective rater assessment) results compared to a non-adaptive baseline and a recently published image compression model based on fully-convolutional neural networks.</abstract><doi>10.48550/arxiv.1802.02629</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1802.02629
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1802_02629
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	Spatially adaptive image compression using a tiled deep network
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T04%3A28%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Spatially%20adaptive%20image%20compression%20using%20a%20tiled%20deep%20network&rft.au=Minnen,%20David&rft.date=2018-02-07&rft_id=info:doi/10.48550/arxiv.1802.02629&rft_dat=%3Carxiv_GOX%3E1802_02629%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true