METHOD AND SYSTEM FOR IDENTIFYING AND MITIGATING BIAS WHILE TRAINING DEEP LEARNING MODELS
This disclosure relates generally to identification and mitigation of bias while training deep learning models. Conventional methods do not provide effective methods for bias identification, and they require pre-defined concepts and rules for bias mitigation. The embodiments of the present disclosur...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This disclosure relates generally to identification and mitigation of bias while training deep learning models. Conventional methods do not provide effective methods for bias identification, and they require pre-defined concepts and rules for bias mitigation. The embodiments of the present disclosure train an auto-encoder to produce a generalized representation of an input image by decomposing into a set of latent embedding. The set of latent embedding are used to learn the shape and color concepts of the input image. The feature specialization is done by training an auto-encoder to reconstruct the input image using the shape embedding modulated by color embedding. To identify the bias, permutation invariant neural network is trained for classification task and attribution scores corresponding to each concept embedding are computed. The method also performs de-biasing the classifier by training it with a set of counterfactual images generated by modifying the latent embedding learned by the auto-encoder. |
---|