TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution

Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condens...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Dong, Linwei, Fan, Qingnan, Guo, Yihong, Wang, Zhonghao, Zhang, Qi, Chen, Jinwei, Luo, Yawei, Zou, Changqing
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Dong, Linwei Fan, Qingnan Guo, Yihong Wang, Zhonghao Zhang, Qi Chen, Jinwei Luo, Yawei Zou, Changqing
description	Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or details recovery is not satisfied. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. We first introduce the Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. Secondly, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that our TSD-SR has superior restoration results (most of the metrics perform the best) and the fastest inference speed (e.g. 40 times faster than SeeSR) compared to the past Real-ISR approaches based on pre-trained diffusion priors.
doi_str_mv	10.48550/arxiv.2411.18263
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2411_18263</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2411_18263</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2411_182633</originalsourceid><addsrcrecordid>eNqFjrsKwjAUQLM4iPoBTuYHUvuU4moVnYSm4BiC3tRA2pSb1Mffa4u70xnOGQ4hyygM0jzLwrXEl34EcRpFQZTHm2RKRMULxsstPbfAuIeOFlqp3mnb0qf2d1pJrMFTfrUIX-e8Nkb6QSuLtARp2MWiudFTI2ugvO8AWQnOmn6o5mSipHGw-HFGVod9tTuycUV0qBuJbzEsiXEp-V98AFzfQRI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><source>arXiv.org</source><creator>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</creator><creatorcontrib>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</creatorcontrib><description>Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or details recovery is not satisfied. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. We first introduce the Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. Secondly, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that our TSD-SR has superior restoration results (most of the metrics perform the best) and the fastest inference speed (e.g. 40 times faster than SeeSR) compared to the past Real-ISR approaches based on pre-trained diffusion priors.</description><identifier>DOI: 10.48550/arxiv.2411.18263</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-11</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2411.18263$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2411.18263$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Dong, Linwei</creatorcontrib><creatorcontrib>Fan, Qingnan</creatorcontrib><creatorcontrib>Guo, Yihong</creatorcontrib><creatorcontrib>Wang, Zhonghao</creatorcontrib><creatorcontrib>Zhang, Qi</creatorcontrib><creatorcontrib>Chen, Jinwei</creatorcontrib><creatorcontrib>Luo, Yawei</creatorcontrib><creatorcontrib>Zou, Changqing</creatorcontrib><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><description>Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or details recovery is not satisfied. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. We first introduce the Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. Secondly, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that our TSD-SR has superior restoration results (most of the metrics perform the best) and the fastest inference speed (e.g. 40 times faster than SeeSR) compared to the past Real-ISR approaches based on pre-trained diffusion priors.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjrsKwjAUQLM4iPoBTuYHUvuU4moVnYSm4BiC3tRA2pSb1Mffa4u70xnOGQ4hyygM0jzLwrXEl34EcRpFQZTHm2RKRMULxsstPbfAuIeOFlqp3mnb0qf2d1pJrMFTfrUIX-e8Nkb6QSuLtARp2MWiudFTI2ugvO8AWQnOmn6o5mSipHGw-HFGVod9tTuycUV0qBuJbzEsiXEp-V98AFzfQRI</recordid><startdate>20241127</startdate><enddate>20241127</enddate><creator>Dong, Linwei</creator><creator>Fan, Qingnan</creator><creator>Guo, Yihong</creator><creator>Wang, Zhonghao</creator><creator>Zhang, Qi</creator><creator>Chen, Jinwei</creator><creator>Luo, Yawei</creator><creator>Zou, Changqing</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241127</creationdate><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><author>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2411_182633</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Dong, Linwei</creatorcontrib><creatorcontrib>Fan, Qingnan</creatorcontrib><creatorcontrib>Guo, Yihong</creatorcontrib><creatorcontrib>Wang, Zhonghao</creatorcontrib><creatorcontrib>Zhang, Qi</creatorcontrib><creatorcontrib>Chen, Jinwei</creatorcontrib><creatorcontrib>Luo, Yawei</creatorcontrib><creatorcontrib>Zou, Changqing</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Dong, Linwei</au><au>Fan, Qingnan</au><au>Guo, Yihong</au><au>Wang, Zhonghao</au><au>Zhang, Qi</au><au>Chen, Jinwei</au><au>Luo, Yawei</au><au>Zou, Changqing</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</atitle><date>2024-11-27</date><risdate>2024</risdate><abstract>Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or details recovery is not satisfied. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. We first introduce the Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. Secondly, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that our TSD-SR has superior restoration results (most of the metrics perform the best) and the fastest inference speed (e.g. 40 times faster than SeeSR) compared to the past Real-ISR approaches based on pre-trained diffusion priors.</abstract><doi>10.48550/arxiv.2411.18263</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2411.18263
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2411_18263
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition
title	TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T15%3A53%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=TSD-SR:%20One-Step%20Diffusion%20with%20Target%20Score%20Distillation%20for%20Real-World%20Image%20Super-Resolution&rft.au=Dong,%20Linwei&rft.date=2024-11-27&rft_id=info:doi/10.48550/arxiv.2411.18263&rft_dat=%3Carxiv_GOX%3E2411_18263%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true