DEBIASING VISION-LANGUAGE MODELS WITH ADDITIVE RESIDUALS
The present disclosure relates to systems, non-transitory computer-readable media, and methods for debiasing vision-language models utilizing additive residual learning. In particular, in one or more embodiments, the disclosed systems generate an encoded image representation of a digital image utili...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The present disclosure relates to systems, non-transitory computer-readable media, and methods for debiasing vision-language models utilizing additive residual learning. In particular, in one or more embodiments, the disclosed systems generate an encoded image representation of a digital image utilizing an image encoder of a vision-language neural network. Additionally, in some embodiments, the disclosed systems extract a protected attribute encoding from the encoded image representation of the digital image utilizing an additive residual learner. Upon extracting the protected attribute encoding, in some implementations, the disclosed systems determine a debiased image encoding for the digital image by combining the protected attribute encoding and the encoded image representation. |
---|