Intelligent defaulter Prediction using Data Science Process

The machine learning classifiers employed are not good enough to clearly classify the loan defaulters. In order to alleviate this problem, the data science process is adopted. During the machine learning phase, the random forest classifier with tuned hyperparameters is trained on the loan lending da...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of physics. Conference series 2021-05, Vol.1916 (1), p.12001
Hauptverfasser: Raviteja, Chilagani, Santosh, D Teja
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The machine learning classifiers employed are not good enough to clearly classify the loan defaulters. In order to alleviate this problem, the data science process is adopted. During the machine learning phase, the random forest classifier with tuned hyperparameters is trained on the loan lending dataset for obtaining the bank loan defaulter model. This classifier is learned by using a smaller number of features as predictor variables obtained after the Exploratory Data Analysis phase. The ability of the model to correctly classify the unseen loan applicant’s data is evaluated in terms of the classifier accuracy and attending the false positives during the model diagnosis from the confusion matrix which unattended will prove fatal for the loan lending banks. The conclusion drawn from this analysis is that the performance metric of the classifier namely the classifier accuracy for Random Forest has outperformed the state-ofart statistical classifiers.
ISSN:1742-6588
1742-6596
DOI:10.1088/1742-6596/1916/1/012001