Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Text-to-image diffusion models have achieved tremendous success in the field of controllable image generation, while also coming along with issues of privacy leakage and data copyrights. Membership inference arises in these contexts as a potential auditing method for detecting unauthorized data usag...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Text-to-image diffusion models have achieved tremendous success in the field
of controllable image generation, while also coming along with issues of
privacy leakage and data copyrights. Membership inference arises in these
contexts as a potential auditing method for detecting unauthorized data usage.
While some efforts have been made on diffusion models, they are not applicable
to text-to-image diffusion models due to the high computation overhead and
enhanced generalization capabilities. In this paper, we first identify a
conditional overfitting phenomenon in text-to-image diffusion models,
indicating that these models tend to overfit the conditional distribution of
images given the corresponding text rather than the marginal distribution of
images only. Based on this observation, we derive an analytical indicator,
namely Conditional Likelihood Discrepancy (CLiD), to perform membership
inference, which reduces the stochasticity in estimating memorization of
individual samples. Experimental results demonstrate that our method
significantly outperforms previous methods across various data distributions
and dataset scales. Additionally, our method shows superior resistance to
overfitting mitigation strategies, such as early stopping and data
augmentation. |
---|---|
DOI: | 10.48550/arxiv.2405.14800 |