Disentangled Inference for GANs With Latently Invertible Autoencoder


Detailed Description

Saved in:
Bibliographic Details
Published in: International journal of computer vision 2022-05, Vol.130 (5), p.1259-1276
Main authors: Zhu, Jiapeng, Zhao, Deli, Zhang, Bo, Zhou, Bolei
Format: Article
Language: English
Subjects:
Online access: Full text
Description
Abstract: Generative Adversarial Networks (GANs) can synthesize increasingly realistic images. However, one fundamental issue hinders their practical applications: the inability to encode real samples in the latent space. Many semantic image editing applications rely on inverting a given image into the latent space and then manipulating the inverted code. One possible solution is to learn an encoder for the GAN via a Variational Auto-Encoder. However, the entanglement of the latent space poses a major challenge for learning the encoder. To tackle this challenge and enable inference in GANs, we propose a novel method named Latently Invertible Autoencoder (LIA). In LIA, an invertible network and its inverse mapping are symmetrically embedded in the latent space of an autoencoder. The decoder of LIA is first trained as a standard GAN with the invertible network, and then the encoder is learned from a disentangled autoencoder by detaching the invertible network from LIA. This avoids the entanglement problem caused by the latent space. Extensive experiments on the FFHQ face dataset and three LSUN datasets validate the effectiveness of LIA for image inversion and its applications. Code and models are available at https://github.com/genforce/lia .
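The abstract's key mechanism is an invertible network embedded in the latent space, so that the mapping between the disentangled space and the GAN latent space can be applied exactly in both directions and detached during encoder training. The toy sketch below (not the authors' implementation; the dimensionality, the single affine coupling step, and the conditioner are illustrative assumptions) shows the defining property such a network must satisfy, namely exact invertibility:

```python
import numpy as np

# Toy sketch of an invertible mapping phi between a disentangled
# latent w and the GAN latent z, as in the LIA idea. One affine
# coupling step is used here purely for illustration; the paper's
# actual invertible network is more elaborate.

rng = np.random.default_rng(0)
D = 8  # assumed latent dimensionality

W1 = rng.standard_normal((D // 2, D // 2)) * 0.1  # coupling weights

def phi(w):
    """Forward map w -> z: shift the second half conditioned on the first."""
    a, b = w[: D // 2], w[D // 2 :]
    shift = np.tanh(W1 @ a)
    return np.concatenate([a, b + shift])

def phi_inv(z):
    """Exact inverse z -> w: recompute the same shift and subtract it."""
    a, b = z[: D // 2], z[D // 2 :]
    shift = np.tanh(W1 @ a)
    return np.concatenate([a, b - shift])

w = rng.standard_normal(D)
z = phi(w)
w_rec = phi_inv(z)
print(np.allclose(w, w_rec))  # the round trip recovers w exactly
```

Because the inverse is exact rather than learned, detaching this module (as the abstract describes) leaves an ordinary autoencoder to train, sidestepping the latent-space entanglement.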
ISSN:0920-5691
1573-1405
DOI:10.1007/s11263-022-01598-5