TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution

Pre-trained text-to-image diffusion models are increasingly applied to the real-world image super-resolution (Real-ISR) task. Because diffusion models refine their output iteratively, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condens...

Detailed description

Bibliographic details
Main authors: Dong, Linwei, Fan, Qingnan, Guo, Yihong, Wang, Zhonghao, Zhang, Qi, Chen, Jinwei, Luo, Yawei, Zou, Changqing
Format: Article
Language: eng
Subjects:
Online access: Order full text
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Dong, Linwei
Fan, Qingnan
Guo, Yihong
Wang, Zhonghao
Zhang, Qi
Chen, Jinwei
Luo, Yawei
Zou, Changqing
description Pre-trained text-to-image diffusion models are increasingly applied to the real-world image super-resolution (Real-ISR) task. Because diffusion models refine their output iteratively, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or detail recovery is not satisfactory. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. We first introduce Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. Second, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that TSD-SR achieves superior restoration results (it performs best on most metrics) and the fastest inference speed (e.g., 40 times faster than SeeSR) among prior Real-ISR approaches based on pre-trained diffusion priors.
doi_str_mv 10.48550/arxiv.2411.18263
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2411_18263</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2411_18263</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2411_182633</originalsourceid><addsrcrecordid>eNqFjrsKwjAUQLM4iPoBTuYHUvuU4moVnYSm4BiC3tRA2pSb1Mffa4u70xnOGQ4hyygM0jzLwrXEl34EcRpFQZTHm2RKRMULxsstPbfAuIeOFlqp3mnb0qf2d1pJrMFTfrUIX-e8Nkb6QSuLtARp2MWiudFTI2ugvO8AWQnOmn6o5mSipHGw-HFGVod9tTuycUV0qBuJbzEsiXEp-V98AFzfQRI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><source>arXiv.org</source><creator>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</creator><creatorcontrib>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</creatorcontrib><description>Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or details recovery is not satisfied. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. We first introduce the Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. 
Secondly, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that our TSD-SR has superior restoration results (most of the metrics perform the best) and the fastest inference speed (e.g. 40 times faster than SeeSR) compared to the past Real-ISR approaches based on pre-trained diffusion priors.</description><identifier>DOI: 10.48550/arxiv.2411.18263</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-11</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2411.18263$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2411.18263$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Dong, Linwei</creatorcontrib><creatorcontrib>Fan, Qingnan</creatorcontrib><creatorcontrib>Guo, Yihong</creatorcontrib><creatorcontrib>Wang, Zhonghao</creatorcontrib><creatorcontrib>Zhang, Qi</creatorcontrib><creatorcontrib>Chen, Jinwei</creatorcontrib><creatorcontrib>Luo, Yawei</creatorcontrib><creatorcontrib>Zou, Changqing</creatorcontrib><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><description>Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. 
While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or details recovery is not satisfied. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. We first introduce the Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. Secondly, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that our TSD-SR has superior restoration results (most of the metrics perform the best) and the fastest inference speed (e.g. 40 times faster than SeeSR) compared to the past Real-ISR approaches based on pre-trained diffusion priors.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjrsKwjAUQLM4iPoBTuYHUvuU4moVnYSm4BiC3tRA2pSb1Mffa4u70xnOGQ4hyygM0jzLwrXEl34EcRpFQZTHm2RKRMULxsstPbfAuIeOFlqp3mnb0qf2d1pJrMFTfrUIX-e8Nkb6QSuLtARp2MWiudFTI2ugvO8AWQnOmn6o5mSipHGw-HFGVod9tTuycUV0qBuJbzEsiXEp-V98AFzfQRI</recordid><startdate>20241127</startdate><enddate>20241127</enddate><creator>Dong, Linwei</creator><creator>Fan, Qingnan</creator><creator>Guo, Yihong</creator><creator>Wang, Zhonghao</creator><creator>Zhang, Qi</creator><creator>Chen, Jinwei</creator><creator>Luo, Yawei</creator><creator>Zou, Changqing</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241127</creationdate><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><author>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; 
Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2411_182633</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Dong, Linwei</creatorcontrib><creatorcontrib>Fan, Qingnan</creatorcontrib><creatorcontrib>Guo, Yihong</creatorcontrib><creatorcontrib>Wang, Zhonghao</creatorcontrib><creatorcontrib>Zhang, Qi</creatorcontrib><creatorcontrib>Chen, Jinwei</creatorcontrib><creatorcontrib>Luo, Yawei</creatorcontrib><creatorcontrib>Zou, Changqing</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Dong, Linwei</au><au>Fan, Qingnan</au><au>Guo, Yihong</au><au>Wang, Zhonghao</au><au>Zhang, Qi</au><au>Chen, Jinwei</au><au>Luo, Yawei</au><au>Zou, Changqing</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</atitle><date>2024-11-27</date><risdate>2024</risdate><abstract>Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or details recovery is not satisfied. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. 
We first introduce the Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. Secondly, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that our TSD-SR has superior restoration results (most of the metrics perform the best) and the fastest inference speed (e.g. 40 times faster than SeeSR) compared to the past Real-ISR approaches based on pre-trained diffusion priors.</abstract><doi>10.48550/arxiv.2411.18263</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2411.18263
ispartof
issn
language eng
recordid cdi_arxiv_primary_2411_18263
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T15%3A53%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=TSD-SR:%20One-Step%20Diffusion%20with%20Target%20Score%20Distillation%20for%20Real-World%20Image%20Super-Resolution&rft.au=Dong,%20Linwei&rft.date=2024-11-27&rft_id=info:doi/10.48550/arxiv.2411.18263&rft_dat=%3Carxiv_GOX%3E2411_18263%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true