Review: Recent advances for the diffusion model

As the generative model technology becomes more and more popular, more and more people have invested in the research of the current State-of-the-art (SOTA) generative model-diffusion model. This paper reviews all SOTA generation models using the diffusion model for text-to-image generation since the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of physics. Conference series 2024-02, Vol.2711 (1), p.12005
1. Verfasser: Wei, Yufeng
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:As the generative model technology becomes more and more popular, more and more people have invested in the research of the current State-of-the-art (SOTA) generative model-diffusion model. This paper reviews all SOTA generation models using the diffusion model for text-to-image generation since the emergence of the diffusion model, including the denoising diffusion probabilistic model (DDPM), DALL·E model, imagen model, stable diffusion model, and diffusion transformer architecture (DiT) model. In the theoretical section, the basic principles behind the diffusion model are reviewed in detail in the way of mathematical calculation, including the training process of the model and the mathematical principles behind the sampling process. Moreover, this paper focuses on the technical characteristics of these models and various improvements made after model iteration, such as model structure optimization, more efficient and accurate training methods, and the application of other optimization techniques widely used in the field of deep learning to diffusion models. In the end, the technical route of the development of the diffusion model is summarized, and some predictions are made.
ISSN:1742-6588
1742-6596
DOI:10.1088/1742-6596/2711/1/012005