On the use of data filtering techniques for credit risk prediction with instance-based models

► The performance of data filtering for credit risk prediction is assessed. ► Twenty filtering algorithms are evaluated on eight credit databases. ► Statistical tests show a significant improvement in performance of the filtered sets. Many techniques have been proposed for credit risk prediction, fr...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Expert systems with applications 2012-12, Vol.39 (18), p.13267-13276
Hauptverfasser:	Garcia, V, Marques, AI, Sanchez, J S
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Artificial intelligence Credit risk Editing Expert systems Filtering Filtration Finance Instance selection Mathematical models Nearest neighbour rule Outlier Risk Training
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	► The performance of data filtering for credit risk prediction is assessed. ► Twenty filtering algorithms are evaluated on eight credit databases. ► Statistical tests show a significant improvement in performance of the filtered sets. Many techniques have been proposed for credit risk prediction, from statistical models to artificial intelligence methods. However, very few research efforts have been devoted to deal with the presence of noise and outliers in the training set, which may strongly affect the performance of the prediction model. Accordingly, the aim of the present paper is to systematically investigate whether the application of filtering algorithms leads to an increase in accuracy of instance-based classifiers in the context of credit risk assessment. The experimental results with 20 different algorithms and 8 credit databases show that the filtered sets perform significantly better than the non-preprocessed training sets when using the nearest neighbour decision rule. The experiments also allow to identify which techniques are most robust and accurate when confronted with noisy credit data.
ISSN:	0957-4174 1873-6793
DOI:	10.1016/j.eswa.2012.05.075