DEBIASING VISION-LANGUAGE MODELS WITH ADDITIVE RESIDUALS

The present disclosure relates to systems, non-transitory computer-readable media, and methods for debiasing vision-language models utilizing additive residual learning. In particular, in one or more embodiments, the disclosed systems generate an encoded image representation of a digital image utili...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Hemani, Mayur, Seth, Ashish, Agarwal, Chirag
Format:	Patent
Sprache:	eng
Schlagworte:	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The present disclosure relates to systems, non-transitory computer-readable media, and methods for debiasing vision-language models utilizing additive residual learning. In particular, in one or more embodiments, the disclosed systems generate an encoded image representation of a digital image utilizing an image encoder of a vision-language neural network. Additionally, in some embodiments, the disclosed systems extract a protected attribute encoding from the encoded image representation of the digital image utilizing an additive residual learner. Upon extracting the protected attribute encoding, in some implementations, the disclosed systems determine a debiased image encoding for the digital image by combining the protected attribute encoding and the encoded image representation.