ASER: Adapted squared error relevance for rare cases prediction in imbalanced regression
Published in: | Journal of chemometrics 2023-11, Vol.37 (11) |
---|---|
Main authors: | , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Abstract: | Many real‐world data mining applications involve using imbalanced datasets to obtain predictive models. Imbalanced data can hinder the performance of learning algorithms on rare cases. Although there are many well‐researched solutions for classification tasks, most of them cannot be directly applied to regression tasks. One of the challenges in imbalanced regression is to find a suitable evaluation and optimization standard that can improve the predictive ability of the model without severe model bias. Based on the importance of rare cases, this study proposes a new evaluation metric called adapted squared error relevance (ASER) by defining a new relevance function and weighting functions. This metric weights data points according to the importance of rare cases and assigns different weights to losses of the same size at different rare cases, so that the model selected by this metric predicts rare cases better. ASER is compared with SER on 32 real datasets and 9 simulated datasets to verify the predictive performance of the selected model on rare cases. The experimental results show that ASER achieves high prediction performance on rare cases without losing too much prediction accuracy on common cases.
To better evaluate the predictive ability of a model on rare cases, we propose a new evaluation metric for imbalanced regression called adapted squared error relevance (ASER). ASER weights data points according to the importance of cases and assigns different weights to losses of the same size at different cases. ASER achieves high prediction performance on rare cases without losing too much prediction accuracy on common cases. |
---|---|
ISSN: | 0886-9383 1099-128X |
DOI: | 10.1002/cem.3515 |