Boomerang: Local sampling on image manifolds using diffusion models
The inference stage of diffusion models can be seen as running a reverse-time diffusion stochastic differential equation, where samples from a Gaussian latent distribution are transformed into samples from a target distribution that usually reside on a low-dimensional manifold, e.g., an image manifo...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The inference stage of diffusion models can be seen as running a reverse-time
diffusion stochastic differential equation, where samples from a Gaussian
latent distribution are transformed into samples from a target distribution
that usually reside on a low-dimensional manifold, e.g., an image manifold. The
intermediate values between the initial latent space and the image manifold can
be interpreted as noisy images, with the amount of noise determined by the
forward diffusion process noise schedule. We utilize this interpretation to
present Boomerang, an approach for local sampling of image manifolds. As
implied by its name, Boomerang local sampling involves adding noise to an input
image, moving it closer to the latent space, and then mapping it back to the
image manifold through a partial reverse diffusion process. Thus, Boomerang
generates images on the manifold that are ``similar,'' but nonidentical, to the
original input image. We can control the proximity of the generated images to
the original by adjusting the amount of noise added. Furthermore, due to the
stochastic nature of the reverse diffusion process in Boomerang, the generated
images display a certain degree of stochasticity, allowing us to obtain local
samples from the manifold without encountering any duplicates. Boomerang offers
the flexibility to work seamlessly with any pretrained diffusion model, such as
Stable Diffusion, without necessitating any adjustments to the reverse
diffusion process. We present three applications for Boomerang. First, we
provide a framework for constructing privacy-preserving datasets having
controllable degrees of anonymity. Second, we show that using Boomerang for
data augmentation increases generalization performance and outperforms
state-of-the-art synthetic data augmentation. Lastly, we introduce a perceptual
image enhancement framework, which enables resolution enhancement. |
---|---|
DOI: | 10.48550/arxiv.2210.12100 |