Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models
The proposed method, Discriminator Guidance, aims to improve sample generation of pre-trained diffusion models. The approach introduces a discriminator that gives explicit supervision to a denoising sample path whether it is realistic or not. Unlike GANs, our approach does not require joint training...
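The abstract says that, at sampling time, an auxiliary term from the discriminator is added to the pre-trained score, and that with an optimal discriminator this term corrects the model score to the data score. The sketch below illustrates that claim on a one-dimensional toy problem. It is not the authors' implementation: the two Gaussians, the closed-form optimal discriminator, the finite-difference gradient, and all function names are assumptions chosen for illustration, and the correction is taken here to be the gradient of log(d / (1 - d)).

```python
import math

# Hypothetical toy setup (not from the record): the "data" distribution is
# N(0, 1) and the imperfect "model" distribution is N(0.5, 1).
DATA_MEAN = 0.0
MODEL_MEAN = 0.5


def gaussian_pdf(x, mean, std=1.0):
    """Density of N(mean, std^2) at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def model_score(x):
    """Score (gradient of log-density) of the model distribution."""
    return -(x - MODEL_MEAN)


def data_score(x):
    """Score of the data distribution, used only to check the result."""
    return -(x - DATA_MEAN)


def optimal_discriminator(x):
    """Closed-form optimal discriminator d*(x) = p_data / (p_data + p_model)."""
    p = gaussian_pdf(x, DATA_MEAN)
    q = gaussian_pdf(x, MODEL_MEAN)
    return p / (p + q)


def discriminator_correction(x, eps=1e-5):
    """Central finite-difference gradient of log(d / (1 - d)) at x."""
    def logit(v):
        d = optimal_discriminator(v)
        return math.log(d) - math.log(1.0 - d)
    return (logit(x + eps) - logit(x - eps)) / (2.0 * eps)


def guided_score(x):
    """Model score plus the discriminator-based auxiliary term."""
    return model_score(x) + discriminator_correction(x)


if __name__ == "__main__":
    for x in (-1.0, 0.0, 2.0):
        print(x, model_score(x), guided_score(x), data_score(x))
```

In this toy case log(d / (1 - d)) equals log p_data - log p_model, so its gradient is exactly the gap between the two scores. In practice the discriminator is a trained network rather than a closed form, so the correction is only as good as that network.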
Main authors: | Kim, Dongjun; Kim, Yeongmin; Kwon, Se Jung; Kang, Wanmo; Moon, Il-Chul |
---|---|
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Kim, Dongjun; Kim, Yeongmin; Kwon, Se Jung; Kang, Wanmo; Moon, Il-Chul |
description | The proposed method, Discriminator Guidance, aims to improve sample
generation of pre-trained diffusion models. The approach introduces a
discriminator that gives explicit supervision to a denoising sample path
whether it is realistic or not. Unlike GANs, our approach does not require
joint training of score and discriminator networks. Instead, we train the
discriminator after score training, making discriminator training stable and
fast to converge. In sample generation, we add an auxiliary term to the
pre-trained score to deceive the discriminator. This term corrects the model
score to the data score at the optimal discriminator, which implies that the
discriminator helps better score estimation in a complementary way. Using our
algorithm, we achieve state-of-the-art results on ImageNet 256x256 with FID 1.83
and recall 0.64, similar to the validation data's FID (1.68) and recall (0.66).
We release the code at https://github.com/alsdudrla10/DG. |
doi_str_mv | 10.48550/arxiv.2211.17091 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2211_17091</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2211_17091</sourcerecordid><originalsourceid>FETCH-LOGICAL-a671-dac7074292dbab31db5211c7b36a0fb077c59369c7431e2d8d989ac2ecbceb583</originalsourceid><addsrcrecordid>eNotj71OwzAURr0woMIDMOEXSPBPEscjKhCQikC0Usfo2r6BKxUH2WmBtycUpm85-nQOYxdSlFVb1-IK0hcdSqWkLKURVp6y7QsOFCm-8g4jJpjogPw5jR5z5p80vfEbyj7RO0WYxsS7PQWIHjlFvvZjwsJBxjBTw7DPNEb-OAbc5TN2MsAu4_n_Ltjm7nazvC9WT93D8npVQGNkEcAbYSplVXDgtAyunt28cboBMThhjK-tbqw3lZaoQhtsa8Er9M6jq1u9YJd_t8ey_mMWhfTd_xb2x0L9A_BITM4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models</title><source>arXiv.org</source><creator>Kim, Dongjun ; Kim, Yeongmin ; Kwon, Se Jung ; Kang, Wanmo ; Moon, Il-Chul</creator><creatorcontrib>Kim, Dongjun ; Kim, Yeongmin ; Kwon, Se Jung ; Kang, Wanmo ; Moon, Il-Chul</creatorcontrib><description>The proposed method, Discriminator Guidance, aims to improve sample
generation of pre-trained diffusion models. The approach introduces a
discriminator that gives explicit supervision to a denoising sample path
whether it is realistic or not. Unlike GANs, our approach does not require
joint training of score and discriminator networks. Instead, we train the
discriminator after score training, making discriminator training stable and
fast to converge. In sample generation, we add an auxiliary term to the
pre-trained score to deceive the discriminator. This term corrects the model
score to the data score at the optimal discriminator, which implies that the
discriminator helps better score estimation in a complementary way. Using our
algorithm, we achieve state-of-the-art results on ImageNet 256x256 with FID 1.83
and recall 0.64, similar to the validation data's FID (1.68) and recall (0.66).
We release the code at https://github.com/alsdudrla10/DG.</description><identifier>DOI: 10.48550/arxiv.2211.17091</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Learning</subject><creationdate>2022-11</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2211.17091$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2211.17091$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Kim, Dongjun</creatorcontrib><creatorcontrib>Kim, Yeongmin</creatorcontrib><creatorcontrib>Kwon, Se Jung</creatorcontrib><creatorcontrib>Kang, Wanmo</creatorcontrib><creatorcontrib>Moon, Il-Chul</creatorcontrib><title>Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models</title><description>The proposed method, Discriminator Guidance, aims to improve sample
generation of pre-trained diffusion models. The approach introduces a
discriminator that gives explicit supervision to a denoising sample path
whether it is realistic or not. Unlike GANs, our approach does not require
joint training of score and discriminator networks. Instead, we train the
discriminator after score training, making discriminator training stable and
fast to converge. In sample generation, we add an auxiliary term to the
pre-trained score to deceive the discriminator. This term corrects the model
score to the data score at the optimal discriminator, which implies that the
discriminator helps better score estimation in a complementary way. Using our
algorithm, we achieve state-of-the-art results on ImageNet 256x256 with FID 1.83
and recall 0.64, similar to the validation data's FID (1.68) and recall (0.66).
We release the code at https://github.com/alsdudrla10/DG.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj71OwzAURr0woMIDMOEXSPBPEscjKhCQikC0Usfo2r6BKxUH2WmBtycUpm85-nQOYxdSlFVb1-IK0hcdSqWkLKURVp6y7QsOFCm-8g4jJpjogPw5jR5z5p80vfEbyj7RO0WYxsS7PQWIHjlFvvZjwsJBxjBTw7DPNEb-OAbc5TN2MsAu4_n_Ltjm7nazvC9WT93D8npVQGNkEcAbYSplVXDgtAyunt28cboBMThhjK-tbqw3lZaoQhtsa8Er9M6jq1u9YJd_t8ey_mMWhfTd_xb2x0L9A_BITM4</recordid><startdate>20221128</startdate><enddate>20221128</enddate><creator>Kim, Dongjun</creator><creator>Kim, Yeongmin</creator><creator>Kwon, Se Jung</creator><creator>Kang, Wanmo</creator><creator>Moon, Il-Chul</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20221128</creationdate><title>Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models</title><author>Kim, Dongjun ; Kim, Yeongmin ; Kwon, Se Jung ; Kang, Wanmo ; Moon, Il-Chul</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a671-dac7074292dbab31db5211c7b36a0fb077c59369c7431e2d8d989ac2ecbceb583</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Kim, Dongjun</creatorcontrib><creatorcontrib>Kim, Yeongmin</creatorcontrib><creatorcontrib>Kwon, Se Jung</creatorcontrib><creatorcontrib>Kang, Wanmo</creatorcontrib><creatorcontrib>Moon, Il-Chul</creatorcontrib><collection>arXiv Computer 
Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Kim, Dongjun</au><au>Kim, Yeongmin</au><au>Kwon, Se Jung</au><au>Kang, Wanmo</au><au>Moon, Il-Chul</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models</atitle><date>2022-11-28</date><risdate>2022</risdate><abstract>The proposed method, Discriminator Guidance, aims to improve sample
generation of pre-trained diffusion models. The approach introduces a
discriminator that gives explicit supervision to a denoising sample path
whether it is realistic or not. Unlike GANs, our approach does not require
joint training of score and discriminator networks. Instead, we train the
discriminator after score training, making discriminator training stable and
fast to converge. In sample generation, we add an auxiliary term to the
pre-trained score to deceive the discriminator. This term corrects the model
score to the data score at the optimal discriminator, which implies that the
discriminator helps better score estimation in a complementary way. Using our
algorithm, we achieve state-of-the-art results on ImageNet 256x256 with FID 1.83
and recall 0.64, similar to the validation data's FID (1.68) and recall (0.66).
We release the code at https://github.com/alsdudrla10/DG.</abstract><doi>10.48550/arxiv.2211.17091</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2211.17091 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2211_17091 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence; Computer Science - Computer Vision and Pattern Recognition; Computer Science - Learning |
title | Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T00%3A21%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Refining%20Generative%20Process%20with%20Discriminator%20Guidance%20in%20Score-based%20Diffusion%20Models&rft.au=Kim,%20Dongjun&rft.date=2022-11-28&rft_id=info:doi/10.48550/arxiv.2211.17091&rft_dat=%3Carxiv_GOX%3E2211_17091%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |