Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained significant attention from the community. These models can be easily customized for new concepts using low-rank adaptations (LoRAs). However, the utilization of multiple concept LoRAs to jointly support multiple...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-11
Hauptverfasser: Gu, Yuchao, Wang, Xintao, Wu, Jay Zhangjie, Shi, Yujun, Chen, Yunpeng, Fan, Zihan, Xiao, Wuyou, Zhao, Rui, Chang, Shuning, Wu, Weijia, Ge, Yixiao, Shan, Ying, Mike Zheng Shou
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Gu, Yuchao
Wang, Xintao
Wu, Jay Zhangjie
Shi, Yujun
Chen, Yunpeng
Fan, Zihan
Xiao, Wuyou
Zhao, Rui
Chang, Shuning
Wu, Weijia
Ge, Yixiao
Shan, Ying
Mike Zheng Shou
description Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained significant attention from the community. These models can be easily customized for new concepts using low-rank adaptations (LoRAs). However, the utilization of multiple concept LoRAs to jointly support multiple customized concepts presents a challenge. We refer to this scenario as decentralized multi-concept customization, which involves single-client concept tuning and center-node concept fusion. In this paper, we propose a new framework called Mix-of-Show that addresses the challenges of decentralized multi-concept customization, including concept conflicts resulting from existing single-client LoRA tuning and identity loss during model fusion. Mix-of-Show adopts an embedding-decomposed LoRA (ED-LoRA) for single-client tuning and gradient fusion for the center node to preserve the in-domain essence of single concepts and support theoretically limitless concept fusion. Additionally, we introduce regionally controllable sampling, which extends spatially controllable sampling (e.g., ControlNet and T2I-Adaptor) to address attribute binding and missing object problems in multi-concept sampling. Extensive experiments demonstrate that Mix-of-Show is capable of composing multiple customized concepts with high fidelity, including characters, objects, and scenes.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2820821532</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2820821532</sourcerecordid><originalsourceid>FETCH-proquest_journals_28208215323</originalsourceid><addsrcrecordid>eNqNiksKwjAUAIMgWNQ7BFwH2herxZ1UxYXdqPsSbYKpNa_mg9LTq-gBXA3DTI9EwHnCsinAgIydq-M4htkc0pRH5FToJ0PFDhd8LOhKnqXxVjS6kxXd4YPthbnSZSVaL7xGQxVaWoTGa5ajOcvW0zw4jzfdfTsqutJKBfeRAivZuBHpK9E4Of5xSCab9THfstbiPUjnyxqDNe9UQgZxBknKgf93vQBIKkVT</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2820821532</pqid></control><display><type>article</type><title>Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models</title><source>Open Access: Freely Accessible Journals by multiple vendors</source><creator>Gu, Yuchao ; Wang, Xintao ; Wu, Jay Zhangjie ; Shi, Yujun ; Chen, Yunpeng ; Fan, Zihan ; Xiao, Wuyou ; Zhao, Rui ; Chang, Shuning ; Wu, Weijia ; Ge, Yixiao ; Shan, Ying ; Mike Zheng Shou</creator><creatorcontrib>Gu, Yuchao ; Wang, Xintao ; Wu, Jay Zhangjie ; Shi, Yujun ; Chen, Yunpeng ; Fan, Zihan ; Xiao, Wuyou ; Zhao, Rui ; Chang, Shuning ; Wu, Weijia ; Ge, Yixiao ; Shan, Ying ; Mike Zheng Shou</creatorcontrib><description>Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained significant attention from the community. These models can be easily customized for new concepts using low-rank adaptations (LoRAs). However, the utilization of multiple concept LoRAs to jointly support multiple customized concepts presents a challenge. We refer to this scenario as decentralized multi-concept customization, which involves single-client concept tuning and center-node concept fusion. In this paper, we propose a new framework called Mix-of-Show that addresses the challenges of decentralized multi-concept customization, including concept conflicts resulting from existing single-client LoRA tuning and identity loss during model fusion. Mix-of-Show adopts an embedding-decomposed LoRA (ED-LoRA) for single-client tuning and gradient fusion for the center node to preserve the in-domain essence of single concepts and support theoretically limitless concept fusion. Additionally, we introduce regionally controllable sampling, which extends spatially controllable sampling (e.g., ControlNet and T2I-Adaptor) to address attribute binding and missing object problems in multi-concept sampling. Extensive experiments demonstrate that Mix-of-Show is capable of composing multiple customized concepts with high fidelity, including characters, objects, and scenes.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Controllability ; Customization ; Sampling ; Tuning</subject><ispartof>arXiv.org, 2023-11</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,784</link.rule.ids></links><search><creatorcontrib>Gu, Yuchao</creatorcontrib><creatorcontrib>Wang, Xintao</creatorcontrib><creatorcontrib>Wu, Jay Zhangjie</creatorcontrib><creatorcontrib>Shi, Yujun</creatorcontrib><creatorcontrib>Chen, Yunpeng</creatorcontrib><creatorcontrib>Fan, Zihan</creatorcontrib><creatorcontrib>Xiao, Wuyou</creatorcontrib><creatorcontrib>Zhao, Rui</creatorcontrib><creatorcontrib>Chang, Shuning</creatorcontrib><creatorcontrib>Wu, Weijia</creatorcontrib><creatorcontrib>Ge, Yixiao</creatorcontrib><creatorcontrib>Shan, Ying</creatorcontrib><creatorcontrib>Mike Zheng Shou</creatorcontrib><title>Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models</title><title>arXiv.org</title><description>Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained significant attention from the community. These models can be easily customized for new concepts using low-rank adaptations (LoRAs). However, the utilization of multiple concept LoRAs to jointly support multiple customized concepts presents a challenge. We refer to this scenario as decentralized multi-concept customization, which involves single-client concept tuning and center-node concept fusion. In this paper, we propose a new framework called Mix-of-Show that addresses the challenges of decentralized multi-concept customization, including concept conflicts resulting from existing single-client LoRA tuning and identity loss during model fusion. Mix-of-Show adopts an embedding-decomposed LoRA (ED-LoRA) for single-client tuning and gradient fusion for the center node to preserve the in-domain essence of single concepts and support theoretically limitless concept fusion. Additionally, we introduce regionally controllable sampling, which extends spatially controllable sampling (e.g., ControlNet and T2I-Adaptor) to address attribute binding and missing object problems in multi-concept sampling. Extensive experiments demonstrate that Mix-of-Show is capable of composing multiple customized concepts with high fidelity, including characters, objects, and scenes.</description><subject>Controllability</subject><subject>Customization</subject><subject>Sampling</subject><subject>Tuning</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNiksKwjAUAIMgWNQ7BFwH2herxZ1UxYXdqPsSbYKpNa_mg9LTq-gBXA3DTI9EwHnCsinAgIydq-M4htkc0pRH5FToJ0PFDhd8LOhKnqXxVjS6kxXd4YPthbnSZSVaL7xGQxVaWoTGa5ajOcvW0zw4jzfdfTsqutJKBfeRAivZuBHpK9E4Of5xSCab9THfstbiPUjnyxqDNe9UQgZxBknKgf93vQBIKkVT</recordid><startdate>20231110</startdate><enddate>20231110</enddate><creator>Gu, Yuchao</creator><creator>Wang, Xintao</creator><creator>Wu, Jay Zhangjie</creator><creator>Shi, Yujun</creator><creator>Chen, Yunpeng</creator><creator>Fan, Zihan</creator><creator>Xiao, Wuyou</creator><creator>Zhao, Rui</creator><creator>Chang, Shuning</creator><creator>Wu, Weijia</creator><creator>Ge, Yixiao</creator><creator>Shan, Ying</creator><creator>Mike Zheng Shou</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20231110</creationdate><title>Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models</title><author>Gu, Yuchao ; Wang, Xintao ; Wu, Jay Zhangjie ; Shi, Yujun ; Chen, Yunpeng ; Fan, Zihan ; Xiao, Wuyou ; Zhao, Rui ; Chang, Shuning ; Wu, Weijia ; Ge, Yixiao ; Shan, Ying ; Mike Zheng Shou</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_28208215323</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Controllability</topic><topic>Customization</topic><topic>Sampling</topic><topic>Tuning</topic><toplevel>online_resources</toplevel><creatorcontrib>Gu, Yuchao</creatorcontrib><creatorcontrib>Wang, Xintao</creatorcontrib><creatorcontrib>Wu, Jay Zhangjie</creatorcontrib><creatorcontrib>Shi, Yujun</creatorcontrib><creatorcontrib>Chen, Yunpeng</creatorcontrib><creatorcontrib>Fan, Zihan</creatorcontrib><creatorcontrib>Xiao, Wuyou</creatorcontrib><creatorcontrib>Zhao, Rui</creatorcontrib><creatorcontrib>Chang, Shuning</creatorcontrib><creatorcontrib>Wu, Weijia</creatorcontrib><creatorcontrib>Ge, Yixiao</creatorcontrib><creatorcontrib>Shan, Ying</creatorcontrib><creatorcontrib>Mike Zheng Shou</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gu, Yuchao</au><au>Wang, Xintao</au><au>Wu, Jay Zhangjie</au><au>Shi, Yujun</au><au>Chen, Yunpeng</au><au>Fan, Zihan</au><au>Xiao, Wuyou</au><au>Zhao, Rui</au><au>Chang, Shuning</au><au>Wu, Weijia</au><au>Ge, Yixiao</au><au>Shan, Ying</au><au>Mike Zheng Shou</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models</atitle><jtitle>arXiv.org</jtitle><date>2023-11-10</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained significant attention from the community. These models can be easily customized for new concepts using low-rank adaptations (LoRAs). However, the utilization of multiple concept LoRAs to jointly support multiple customized concepts presents a challenge. We refer to this scenario as decentralized multi-concept customization, which involves single-client concept tuning and center-node concept fusion. In this paper, we propose a new framework called Mix-of-Show that addresses the challenges of decentralized multi-concept customization, including concept conflicts resulting from existing single-client LoRA tuning and identity loss during model fusion. Mix-of-Show adopts an embedding-decomposed LoRA (ED-LoRA) for single-client tuning and gradient fusion for the center node to preserve the in-domain essence of single concepts and support theoretically limitless concept fusion. Additionally, we introduce regionally controllable sampling, which extends spatially controllable sampling (e.g., ControlNet and T2I-Adaptor) to address attribute binding and missing object problems in multi-concept sampling. Extensive experiments demonstrate that Mix-of-Show is capable of composing multiple customized concepts with high fidelity, including characters, objects, and scenes.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-11
issn 2331-8422
language eng
recordid cdi_proquest_journals_2820821532
source Open Access: Freely Accessible Journals by multiple vendors
subjects Controllability
Customization
Sampling
Tuning
title Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T18%3A15%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Mix-of-Show:%20Decentralized%20Low-Rank%20Adaptation%20for%20Multi-Concept%20Customization%20of%20Diffusion%20Models&rft.jtitle=arXiv.org&rft.au=Gu,%20Yuchao&rft.date=2023-11-10&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2820821532%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2820821532&rft_id=info:pmid/&rfr_iscdi=true