TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution
Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condens...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Dong, Linwei Fan, Qingnan Guo, Yihong Wang, Zhonghao Zhang, Qi Chen, Jinwei Luo, Yawei Zou, Changqing |
description | Pre-trained text-to-image diffusion models are increasingly applied to
real-world image super-resolution (Real-ISR) task. Given the iterative
refinement nature of diffusion models, most existing approaches are
computationally expensive. While methods such as SinSR and OSEDiff have emerged
to condense inference steps via distillation, their performance in image
restoration or details recovery is not satisfied. To address this, we propose
TSD-SR, a novel distillation framework specifically designed for real-world
image super-resolution, aiming to construct an efficient and effective one-step
model. We first introduce the Target Score Distillation, which leverages the
priors of diffusion models and real image references to achieve more realistic
image restoration. Secondly, we propose a Distribution-Aware Sampling Module to
make detail-oriented gradients more readily accessible, addressing the
challenge of recovering fine details. Extensive experiments demonstrate that
our TSD-SR has superior restoration results (most of the metrics perform the
best) and the fastest inference speed (e.g. 40 times faster than SeeSR)
compared to the past Real-ISR approaches based on pre-trained diffusion priors. |
doi_str_mv | 10.48550/arxiv.2411.18263 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2411_18263</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2411_18263</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2411_182633</originalsourceid><addsrcrecordid>eNqFjrsKwjAUQLM4iPoBTuYHUvuU4moVnYSm4BiC3tRA2pSb1Mffa4u70xnOGQ4hyygM0jzLwrXEl34EcRpFQZTHm2RKRMULxsstPbfAuIeOFlqp3mnb0qf2d1pJrMFTfrUIX-e8Nkb6QSuLtARp2MWiudFTI2ugvO8AWQnOmn6o5mSipHGw-HFGVod9tTuycUV0qBuJbzEsiXEp-V98AFzfQRI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><source>arXiv.org</source><creator>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</creator><creatorcontrib>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</creatorcontrib><description>Pre-trained text-to-image diffusion models are increasingly applied to
real-world image super-resolution (Real-ISR) task. Given the iterative
refinement nature of diffusion models, most existing approaches are
computationally expensive. While methods such as SinSR and OSEDiff have emerged
to condense inference steps via distillation, their performance in image
restoration or details recovery is not satisfied. To address this, we propose
TSD-SR, a novel distillation framework specifically designed for real-world
image super-resolution, aiming to construct an efficient and effective one-step
model. We first introduce the Target Score Distillation, which leverages the
priors of diffusion models and real image references to achieve more realistic
image restoration. Secondly, we propose a Distribution-Aware Sampling Module to
make detail-oriented gradients more readily accessible, addressing the
challenge of recovering fine details. Extensive experiments demonstrate that
our TSD-SR has superior restoration results (most of the metrics perform the
best) and the fastest inference speed (e.g. 40 times faster than SeeSR)
compared to the past Real-ISR approaches based on pre-trained diffusion priors.</description><identifier>DOI: 10.48550/arxiv.2411.18263</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-11</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2411.18263$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2411.18263$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Dong, Linwei</creatorcontrib><creatorcontrib>Fan, Qingnan</creatorcontrib><creatorcontrib>Guo, Yihong</creatorcontrib><creatorcontrib>Wang, Zhonghao</creatorcontrib><creatorcontrib>Zhang, Qi</creatorcontrib><creatorcontrib>Chen, Jinwei</creatorcontrib><creatorcontrib>Luo, Yawei</creatorcontrib><creatorcontrib>Zou, Changqing</creatorcontrib><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><description>Pre-trained text-to-image diffusion models are increasingly applied to
real-world image super-resolution (Real-ISR) task. Given the iterative
refinement nature of diffusion models, most existing approaches are
computationally expensive. While methods such as SinSR and OSEDiff have emerged
to condense inference steps via distillation, their performance in image
restoration or details recovery is not satisfied. To address this, we propose
TSD-SR, a novel distillation framework specifically designed for real-world
image super-resolution, aiming to construct an efficient and effective one-step
model. We first introduce the Target Score Distillation, which leverages the
priors of diffusion models and real image references to achieve more realistic
image restoration. Secondly, we propose a Distribution-Aware Sampling Module to
make detail-oriented gradients more readily accessible, addressing the
challenge of recovering fine details. Extensive experiments demonstrate that
our TSD-SR has superior restoration results (most of the metrics perform the
best) and the fastest inference speed (e.g. 40 times faster than SeeSR)
compared to the past Real-ISR approaches based on pre-trained diffusion priors.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjrsKwjAUQLM4iPoBTuYHUvuU4moVnYSm4BiC3tRA2pSb1Mffa4u70xnOGQ4hyygM0jzLwrXEl34EcRpFQZTHm2RKRMULxsstPbfAuIeOFlqp3mnb0qf2d1pJrMFTfrUIX-e8Nkb6QSuLtARp2MWiudFTI2ugvO8AWQnOmn6o5mSipHGw-HFGVod9tTuycUV0qBuJbzEsiXEp-V98AFzfQRI</recordid><startdate>20241127</startdate><enddate>20241127</enddate><creator>Dong, Linwei</creator><creator>Fan, Qingnan</creator><creator>Guo, Yihong</creator><creator>Wang, Zhonghao</creator><creator>Zhang, Qi</creator><creator>Chen, Jinwei</creator><creator>Luo, Yawei</creator><creator>Zou, Changqing</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241127</creationdate><title>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</title><author>Dong, Linwei ; Fan, Qingnan ; Guo, Yihong ; Wang, Zhonghao ; Zhang, Qi ; Chen, Jinwei ; Luo, Yawei ; Zou, Changqing</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2411_182633</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Dong, Linwei</creatorcontrib><creatorcontrib>Fan, Qingnan</creatorcontrib><creatorcontrib>Guo, Yihong</creatorcontrib><creatorcontrib>Wang, Zhonghao</creatorcontrib><creatorcontrib>Zhang, Qi</creatorcontrib><creatorcontrib>Chen, Jinwei</creatorcontrib><creatorcontrib>Luo, Yawei</creatorcontrib><creatorcontrib>Zou, Changqing</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Dong, Linwei</au><au>Fan, Qingnan</au><au>Guo, Yihong</au><au>Wang, Zhonghao</au><au>Zhang, Qi</au><au>Chen, Jinwei</au><au>Luo, Yawei</au><au>Zou, Changqing</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution</atitle><date>2024-11-27</date><risdate>2024</risdate><abstract>Pre-trained text-to-image diffusion models are increasingly applied to
real-world image super-resolution (Real-ISR) task. Given the iterative
refinement nature of diffusion models, most existing approaches are
computationally expensive. While methods such as SinSR and OSEDiff have emerged
to condense inference steps via distillation, their performance in image
restoration or details recovery is not satisfied. To address this, we propose
TSD-SR, a novel distillation framework specifically designed for real-world
image super-resolution, aiming to construct an efficient and effective one-step
model. We first introduce the Target Score Distillation, which leverages the
priors of diffusion models and real image references to achieve more realistic
image restoration. Secondly, we propose a Distribution-Aware Sampling Module to
make detail-oriented gradients more readily accessible, addressing the
challenge of recovering fine details. Extensive experiments demonstrate that
our TSD-SR has superior restoration results (most of the metrics perform the
best) and the fastest inference speed (e.g. 40 times faster than SeeSR)
compared to the past Real-ISR approaches based on pre-trained diffusion priors.</abstract><doi>10.48550/arxiv.2411.18263</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2411.18263 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2411_18263 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition |
title | TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T15%3A53%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=TSD-SR:%20One-Step%20Diffusion%20with%20Target%20Score%20Distillation%20for%20Real-World%20Image%20Super-Resolution&rft.au=Dong,%20Linwei&rft.date=2024-11-27&rft_id=info:doi/10.48550/arxiv.2411.18263&rft_dat=%3Carxiv_GOX%3E2411_18263%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |