Correlation Matching Transformation Transformers for UHD Image Restoration

This paper proposes UHDformer, a general Transformer for Ultra-High-Definition (UHD) image restoration. UHDformer contains two learning spaces: (a) learning in high-resolution space and (b) learning in low-resolution space. The former learns multi-level high-resolution features and fuses low-high fe...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Wang, Cong, Pan, Jinshan, Wang, Wei, Fu, Gang, Liang, Siyuan, Wang, Mengzhu, Wu, Xiao-Ming, Liu, Jun
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Wang, Cong Pan, Jinshan Wang, Wei Fu, Gang Liang, Siyuan Wang, Mengzhu Wu, Xiao-Ming Liu, Jun
description	This paper proposes UHDformer, a general Transformer for Ultra-High-Definition (UHD) image restoration. UHDformer contains two learning spaces: (a) learning in high-resolution space and (b) learning in low-resolution space. The former learns multi-level high-resolution features and fuses low-high features and reconstructs the residual images, while the latter explores more representative features learning from the high-resolution ones to facilitate better restoration. To better improve feature representation in low-resolution space, we propose to build feature transformation from the high-resolution space to the low-resolution one. To that end, we propose two new modules: Dual-path Correlation Matching Transformation module (DualCMT) and Adaptive Channel Modulator (ACM). The DualCMT selects top C/r (r is greater or equal to 1 which controls the squeezing level) correlation channels from the max-pooling/mean-pooling high-resolution features to replace low-resolution ones in Transformers, which can effectively squeeze useless content to improve the feature representation in low-resolution space to facilitate better recovery. The ACM is exploited to adaptively modulate multi-level high-resolution features, enabling to provide more useful features to low-resolution space for better learning. Experimental results show that our UHDformer reduces about ninety-seven percent model sizes compared with most state-of-the-art methods while significantly improving performance under different training sets on 3 UHD image restoration tasks, including low-light image enhancement, image dehazing, and image deblurring. The source codes will be made available at https://github.com/supersupercong/UHDformer.
doi_str_mv	10.48550/arxiv.2406.00629
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2406_00629</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2406_00629</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-511f5c9091d9f927e3227a5d0f46082b9545dc50a7ddc4f2e5c1c190cc6b8c673</originalsourceid><addsrcrecordid>eNo9j81Kw0AUhWfThVQfwJXzAol3JnNnMktJW1upCBLX4XZ-aqBJZBJE315NpauPczgc-Bi7FZCrEhHuKX21n7lUoHMALe0Ve6qGlMKJpnbo-TNN7r3tj7xO1I9xSN25v8SQRv5L_rZd8V1Hx8BfwzgNaZ5ds0Wk0xhu_rlk9WZdV9ts__K4qx72GWljMxQiorNghbfRShMKKQ2hh6g0lPJgUaF3CGS8dyrKgE44YcE5fSidNsWS3Z1vZ5nmI7Udpe_mT6qZpYofjmFHtg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Correlation Matching Transformation Transformers for UHD Image Restoration</title><source>arXiv.org</source><creator>Wang, Cong ; Pan, Jinshan ; Wang, Wei ; Fu, Gang ; Liang, Siyuan ; Wang, Mengzhu ; Wu, Xiao-Ming ; Liu, Jun</creator><creatorcontrib>Wang, Cong ; Pan, Jinshan ; Wang, Wei ; Fu, Gang ; Liang, Siyuan ; Wang, Mengzhu ; Wu, Xiao-Ming ; Liu, Jun</creatorcontrib><description>This paper proposes UHDformer, a general Transformer for Ultra-High-Definition (UHD) image restoration. UHDformer contains two learning spaces: (a) learning in high-resolution space and (b) learning in low-resolution space. The former learns multi-level high-resolution features and fuses low-high features and reconstructs the residual images, while the latter explores more representative features learning from the high-resolution ones to facilitate better restoration. To better improve feature representation in low-resolution space, we propose to build feature transformation from the high-resolution space to the low-resolution one. To that end, we propose two new modules: Dual-path Correlation Matching Transformation module (DualCMT) and Adaptive Channel Modulator (ACM). The DualCMT selects top C/r (r is greater or equal to 1 which controls the squeezing level) correlation channels from the max-pooling/mean-pooling high-resolution features to replace low-resolution ones in Transformers, which can effectively squeeze useless content to improve the feature representation in low-resolution space to facilitate better recovery. The ACM is exploited to adaptively modulate multi-level high-resolution features, enabling to provide more useful features to low-resolution space for better learning. Experimental results show that our UHDformer reduces about ninety-seven percent model sizes compared with most state-of-the-art methods while significantly improving performance under different training sets on 3 UHD image restoration tasks, including low-light image enhancement, image dehazing, and image deblurring. The source codes will be made available at https://github.com/supersupercong/UHDformer.</description><identifier>DOI: 10.48550/arxiv.2406.00629</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-06</creationdate><rights>http://creativecommons.org/licenses/by-nc-nd/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2406.00629$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2406.00629$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Cong</creatorcontrib><creatorcontrib>Pan, Jinshan</creatorcontrib><creatorcontrib>Wang, Wei</creatorcontrib><creatorcontrib>Fu, Gang</creatorcontrib><creatorcontrib>Liang, Siyuan</creatorcontrib><creatorcontrib>Wang, Mengzhu</creatorcontrib><creatorcontrib>Wu, Xiao-Ming</creatorcontrib><creatorcontrib>Liu, Jun</creatorcontrib><title>Correlation Matching Transformation Transformers for UHD Image Restoration</title><description>This paper proposes UHDformer, a general Transformer for Ultra-High-Definition (UHD) image restoration. UHDformer contains two learning spaces: (a) learning in high-resolution space and (b) learning in low-resolution space. The former learns multi-level high-resolution features and fuses low-high features and reconstructs the residual images, while the latter explores more representative features learning from the high-resolution ones to facilitate better restoration. To better improve feature representation in low-resolution space, we propose to build feature transformation from the high-resolution space to the low-resolution one. To that end, we propose two new modules: Dual-path Correlation Matching Transformation module (DualCMT) and Adaptive Channel Modulator (ACM). The DualCMT selects top C/r (r is greater or equal to 1 which controls the squeezing level) correlation channels from the max-pooling/mean-pooling high-resolution features to replace low-resolution ones in Transformers, which can effectively squeeze useless content to improve the feature representation in low-resolution space to facilitate better recovery. The ACM is exploited to adaptively modulate multi-level high-resolution features, enabling to provide more useful features to low-resolution space for better learning. Experimental results show that our UHDformer reduces about ninety-seven percent model sizes compared with most state-of-the-art methods while significantly improving performance under different training sets on 3 UHD image restoration tasks, including low-light image enhancement, image dehazing, and image deblurring. The source codes will be made available at https://github.com/supersupercong/UHDformer.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNo9j81Kw0AUhWfThVQfwJXzAol3JnNnMktJW1upCBLX4XZ-aqBJZBJE315NpauPczgc-Bi7FZCrEhHuKX21n7lUoHMALe0Ve6qGlMKJpnbo-TNN7r3tj7xO1I9xSN25v8SQRv5L_rZd8V1Hx8BfwzgNaZ5ds0Wk0xhu_rlk9WZdV9ts__K4qx72GWljMxQiorNghbfRShMKKQ2hh6g0lPJgUaF3CGS8dyrKgE44YcE5fSidNsWS3Z1vZ5nmI7Udpe_mT6qZpYofjmFHtg</recordid><startdate>20240602</startdate><enddate>20240602</enddate><creator>Wang, Cong</creator><creator>Pan, Jinshan</creator><creator>Wang, Wei</creator><creator>Fu, Gang</creator><creator>Liang, Siyuan</creator><creator>Wang, Mengzhu</creator><creator>Wu, Xiao-Ming</creator><creator>Liu, Jun</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240602</creationdate><title>Correlation Matching Transformation Transformers for UHD Image Restoration</title><author>Wang, Cong ; Pan, Jinshan ; Wang, Wei ; Fu, Gang ; Liang, Siyuan ; Wang, Mengzhu ; Wu, Xiao-Ming ; Liu, Jun</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-511f5c9091d9f927e3227a5d0f46082b9545dc50a7ddc4f2e5c1c190cc6b8c673</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Wang, Cong</creatorcontrib><creatorcontrib>Pan, Jinshan</creatorcontrib><creatorcontrib>Wang, Wei</creatorcontrib><creatorcontrib>Fu, Gang</creatorcontrib><creatorcontrib>Liang, Siyuan</creatorcontrib><creatorcontrib>Wang, Mengzhu</creatorcontrib><creatorcontrib>Wu, Xiao-Ming</creatorcontrib><creatorcontrib>Liu, Jun</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Wang, Cong</au><au>Pan, Jinshan</au><au>Wang, Wei</au><au>Fu, Gang</au><au>Liang, Siyuan</au><au>Wang, Mengzhu</au><au>Wu, Xiao-Ming</au><au>Liu, Jun</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Correlation Matching Transformation Transformers for UHD Image Restoration</atitle><date>2024-06-02</date><risdate>2024</risdate><abstract>This paper proposes UHDformer, a general Transformer for Ultra-High-Definition (UHD) image restoration. UHDformer contains two learning spaces: (a) learning in high-resolution space and (b) learning in low-resolution space. The former learns multi-level high-resolution features and fuses low-high features and reconstructs the residual images, while the latter explores more representative features learning from the high-resolution ones to facilitate better restoration. To better improve feature representation in low-resolution space, we propose to build feature transformation from the high-resolution space to the low-resolution one. To that end, we propose two new modules: Dual-path Correlation Matching Transformation module (DualCMT) and Adaptive Channel Modulator (ACM). The DualCMT selects top C/r (r is greater or equal to 1 which controls the squeezing level) correlation channels from the max-pooling/mean-pooling high-resolution features to replace low-resolution ones in Transformers, which can effectively squeeze useless content to improve the feature representation in low-resolution space to facilitate better recovery. The ACM is exploited to adaptively modulate multi-level high-resolution features, enabling to provide more useful features to low-resolution space for better learning. Experimental results show that our UHDformer reduces about ninety-seven percent model sizes compared with most state-of-the-art methods while significantly improving performance under different training sets on 3 UHD image restoration tasks, including low-light image enhancement, image dehazing, and image deblurring. The source codes will be made available at https://github.com/supersupercong/UHDformer.</abstract><doi>10.48550/arxiv.2406.00629</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2406.00629
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2406_00629
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	Correlation Matching Transformation Transformers for UHD Image Restoration
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T23%3A14%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Correlation%20Matching%20Transformation%20Transformers%20for%20UHD%20Image%20Restoration&rft.au=Wang,%20Cong&rft.date=2024-06-02&rft_id=info:doi/10.48550/arxiv.2406.00629&rft_dat=%3Carxiv_GOX%3E2406_00629%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true