A supervised machine learning algorithm for detecting and predicting fraud in credit card transactions

Fraudsters are now more active in their attacks on credit card transactions than ever before. With the advancement in data science and machine learning, various algorithms have been developed to determine whether a transaction is fraudulent. We study the performance of three different machine learni...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Decision analytics journal 2023-03, Vol.6, p.1-12, Article 100163
Hauptverfasser: Afriyie, Jonathan Kwaku, Tawiah, Kassim, Pels, Wilhemina Adoma, Addai-Henne, Sandra, Dwamena, Harriet Achiaa, Owiredu, Emmanuel Odame, Ayeh, Samuel Amening, Eshun, John
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Fraudsters are now more active in their attacks on credit card transactions than ever before. With the advancement in data science and machine learning, various algorithms have been developed to determine whether a transaction is fraudulent. We study the performance of three different machine learning models: logistic regression, random forest, and decision trees to classify, predict, and detect fraudulent credit card transactions. We compare these models’ performance and show that random forest produces a maximum accuracy of 96% (with an area under the curve value of 98.9%) in predicting and detecting fraudulent credit card transactions. Thus, we recommend random forest as the most appropriate machine learning algorithm for predicting and detecting fraud in credit card transactions. Credit Card holders above 60 years were found to be mostly victims of these fraudulent transactions, with a greater proportion of fraudulent transactions occurring between the hours of 22:00GMT and 4:00GMT. •The machine learning models of logistic regression, random forest, and decision trees are evaluated for detecting fraudulent credit card transactions.•Random forest is the most suitable model for predicting fraudulent transactions.•Balancing a dataset ensures that the model does not favour the majority class solely.•Credit Card holders above 60 years are mostly the victims of fraud transactions.•A greater proportion of fraudulent transactions occur between the hours of 22:00GMT and 4:00GMT.
ISSN:2772-6622
2772-6622
DOI:10.1016/j.dajour.2023.100163