Neural network for ordinal classification of imbalanced data by minimizing a Bayesian cost

•An estimate of the Bayes cost is proposed as the loss to train neural networks for ordinal classification of imbalanced data.•The network parameters, as well as the decision thresholds, are updated during training to minimize the Bayes cost.•The neural network architecture has a single neuron in th...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Pattern recognition 2023-05, Vol.137, p.109303, Article 109303
Hauptverfasser:	Lázaro, Marcelino, Figueiras-Vidal, Aníbal R.
Format:	Artikel
Sprache:	eng
Schlagworte:	Bayes cost Imbalanced Ordinal classification Parzen windows
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•An estimate of the Bayes cost is proposed as the loss to train neural networks for ordinal classification of imbalanced data.•The network parameters, as well as the decision thresholds, are updated during training to minimize the Bayes cost.•The neural network architecture has a single neuron in the output layer (one-dimensional input space).•Both shallow networks and deep networks can be used.•Experiments with real data show the accuracy and flexibility of the proposed method, specially in imbalanced problems. Ordinal classification of imbalanced data is a challenging problem that appears in many real world applications. The challenge is to simultaneously consider the order of the classes and the class imbalance, which can notably improve the performance metrics. The Bayesian formulation allows to deal with these two characteristics jointly: It takes into account the prior probability of each class and the decision costs, which can be used to include the imbalance and the ordinal information, respectively. We propose to use the Bayesian formulation to train neural networks, which have shown excellent results in many classification tasks. A loss function is proposed to train networks with a single neuron in the output layer and a threshold based decision rule. The loss is an estimate of the Bayesian classification cost, based on the Parzen windows estimator, which is fitted for a thresholded decision. Experiments with several real datasets show that the proposed method provides competitive results in different scenarios, due to its high flexibility to specify the relative importance of the errors in the classification of patterns of different classes, considering the order and independently of the probability of each class.
ISSN:	0031-3203 1873-5142
DOI:	10.1016/j.patcog.2023.109303