Learning fair representations by separating the relevance of potential information
Saved in:
Published in: Information Processing & Management, 2022-11, Vol. 59 (6), p. 103103, Article 103103
Main authors: , , ,
Format: Article
Language: English
Subjects:
Online access: Full text
Summary: Representation learning has recently been used to remove sensitive information from data and improve the fairness of machine learning algorithms in social applications. However, previous works that used neural networks are opaque and poorly interpretable, since it is difficult to determine intuitively whether the representations are independent of the sensitive information. The internal correlation among data features has not been fully discussed, and it may be the key to improving the interpretability of neural networks. Based on this conjecture, a novel fair representation algorithm, FRC, is proposed. It shows how representations independent of multiple sensitive attributes can be learned by applying specific correlation constraints to the representation dimensions. Specifically, the dimensions of the representation and the sensitive attributes are treated as statistical variables. The representation variables are divided into a part related to the sensitive variables and a part unrelated to them by constraining their absolute correlation coefficients with the sensitive variables. The potential influence of sensitive information on the representation is concentrated in the related part, so the unrelated part can be used in downstream tasks to yield fair results. FRC thus treats the correlation between dimensions as the key to fair representation. Empirical results show that our representations improve the fairness of neural networks and achieve better fairness-accuracy tradeoffs than state-of-the-art works.
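The partition criterion the summary describes is the absolute correlation between each representation dimension and each sensitive variable. As a reading of that description, assuming the standard Pearson coefficient (the summary does not name the exact estimator), the quantity being constrained for representation dimension $z_j$ and sensitive variable $s_k$ would be

$$|\rho_{jk}| = \left|\frac{\operatorname{Cov}(z_j, s_k)}{\sigma_{z_j}\,\sigma_{s_k}}\right|,$$

with dimensions whose $|\rho_{jk}|$ stays below a chosen threshold for every $k$ forming the unrelated part used in downstream tasks. The symbols $z_j$, $s_k$, $\rho_{jk}$, and the thresholding rule are notational assumptions for illustration, not taken from the paper.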
Highlights:
• Analyzing the correlation between representation dimensions and sensitive information.
• Separating potentially sensitive information in the representation.
• Proposing two fair representation models based on our discussion.
• Making the fairness of a representation more intuitive to comprehend.
• Evaluating our representation learning approach on downstream machine learning tasks.
ISSN: 0306-4573, 1873-5371
DOI: 10.1016/j.ipm.2022.103103
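To make the partition mechanism concrete, below is a minimal post-hoc sketch in NumPy. The function name split_by_sensitive_correlation, the 0.1 threshold, and the toy data are assumptions for illustration only; in FRC itself the correlation constraint is applied while the representation is being trained, which this post-hoc filter does not reproduce.

```python
import numpy as np

def split_by_sensitive_correlation(Z, S, threshold=0.1):
    """Split representation dimensions into sensitive-related and -unrelated parts.

    Z: (n_samples, d) array of learned representations.
    S: (n_samples, k) array of sensitive attributes, encoded numerically.
    threshold: hypothetical cutoff on the absolute Pearson correlation.
    Returns index arrays of the unrelated and related dimensions.
    """
    Zc = Z - Z.mean(axis=0)
    Sc = S - S.mean(axis=0)
    # Sample covariance between every representation dimension and every
    # sensitive variable, then normalise to Pearson correlations.
    cov = Zc.T @ Sc / len(Z)                               # shape (d, k)
    denom = np.outer(Z.std(axis=0), S.std(axis=0)) + 1e-12
    abs_corr = np.abs(cov / denom)                         # shape (d, k)
    # A dimension counts as "related" if it correlates strongly
    # with ANY of the sensitive variables.
    related = abs_corr.max(axis=1) > threshold
    return np.where(~related)[0], np.where(related)[0]

# Toy usage: keep only the unrelated dimensions for a downstream model.
rng = np.random.default_rng(0)
Z = rng.normal(size=(1000, 16))                            # stand-in representation
S = rng.integers(0, 2, size=(1000, 2)).astype(float)       # two binary sensitive attrs
unrelated, related = split_by_sensitive_correlation(Z, S)
Z_fair = Z[:, unrelated]
```

Per the summary's construction, only Z_fair, the unrelated part, would be passed to the downstream task, while the influence of the sensitive attributes stays concentrated in the related dimensions.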