MODEL BIAS DETECTION

Aspects of the present disclosure provide techniques for detecting latent bias in machine learning models. Embodiments include receiving a data set comprising features of a plurality of individuals. Embodiments include receiving identifying information for each individual of the plurality of individ...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: HORESH, Yair, MEIR LADOR, Shir, DE SHETLER, Natalie Grace, BEN ARIE, Aviv, MISHRAKY, Elhanan
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Aspects of the present disclosure provide techniques for detecting latent bias in machine learning models. Embodiments include receiving a data set comprising features of a plurality of individuals. Embodiments include receiving identifying information for each individual of the plurality of individuals. Embodiments include predicting, for each respective individual of the plurality of individuals, a probability that the respective individual belongs to a given class based on the identifying information for the given individual. Embodiments include providing, as inputs to a machine learning model, the features of the plurality of individuals from the data set. Embodiments include receiving outputs from the machine learning model in response to the inputs. Embodiments include determining whether the machine learning model is biased against the given class based on the outputs and the probability that each respective individual of the plurality of individuals belongs to the given class.