DEBIASING VISION-LANGUAGE MODELS WITH ADDITIVE RESIDUALS

The present disclosure relates to systems, non-transitory computer-readable media, and methods for debiasing vision-language models utilizing additive residual learning. In particular, in one or more embodiments, the disclosed systems generate an encoded image representation of a digital image utili...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Hemani, Mayur, Seth, Ashish, Agarwal, Chirag
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The present disclosure relates to systems, non-transitory computer-readable media, and methods for debiasing vision-language models utilizing additive residual learning. In particular, in one or more embodiments, the disclosed systems generate an encoded image representation of a digital image utilizing an image encoder of a vision-language neural network. Additionally, in some embodiments, the disclosed systems extract a protected attribute encoding from the encoded image representation of the digital image utilizing an additive residual learner. Upon extracting the protected attribute encoding, in some implementations, the disclosed systems determine a debiased image encoding for the digital image by combining the protected attribute encoding and the encoded image representation.