Improving Fine-Grained Control via Aggregation of Multiple Diffusion Models
While many diffusion models perform well when controlling for particular aspect among style, character, and interaction, they struggle with fine-grained control due to dataset limitations and intricate model architecture design. This paper introduces a novel algorithm, Aggregation of Multiple Diffus...
Gespeichert in:
Hauptverfasser: | , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | While many diffusion models perform well when controlling for particular
aspect among style, character, and interaction, they struggle with fine-grained
control due to dataset limitations and intricate model architecture design.
This paper introduces a novel algorithm, Aggregation of Multiple Diffusion
Models (AMDM), which synthesizes features from multiple diffusion models into a
specified model, activating specific features for fine-grained control.
Experimental results demonstrate that AMDM significantly improves fine-grained
control without training, proving its effectiveness. Additionally, it reveals
that diffusion models initially focus on features such as position, attributes,
and style, with later stages improving generation quality and consistency. AMDM
offers a new perspective for tackling the challenges of fine-grained
conditional control generation in diffusion models: We can fully utilize
existing or develop new conditional diffusion models that control specific
aspects, and then aggregate them using AMDM algorithm. This eliminates the need
for constructing complex datasets, designing intricate model architectures, and
incurring high training costs. Code is available at:
https://github.com/Hammour-steak/AMDM. |
---|---|
DOI: | 10.48550/arxiv.2410.01262 |