Feature engineering strategies for credit card fraud detection

•Credit card fraud detection evaluation measure.•Each example is assumed to have different financial cost.•Transaction aggregation strategy for predicting fraud.•Periodic features using the von Mises distribution.•Code is open source and available at albahnsen.com/CostSensitiveClassification. Every...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2016-06, Vol.51, p.134-142
Hauptverfasser: Correa Bahnsen, Alejandro, Aouada, Djamila, Stojanovic, Aleksandar, Ottersten, Björn
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Credit card fraud detection evaluation measure.•Each example is assumed to have different financial cost.•Transaction aggregation strategy for predicting fraud.•Periodic features using the von Mises distribution.•Code is open source and available at albahnsen.com/CostSensitiveClassification. Every year billions of Euros are lost worldwide due to credit card fraud. Thus, forcing financial institutions to continuously improve their fraud detection systems. In recent years, several studies have proposed the use of machine learning and data mining techniques to address this problem. However, most studies used some sort of misclassification measure to evaluate the different solutions, and do not take into account the actual financial costs associated with the fraud detection process. Moreover, when constructing a credit card fraud detection model, it is very important how to extract the right features from the transactional data. This is usually done by aggregating the transactions in order to observe the spending behavioral patterns of the customers. In this paper we expand the transaction aggregation strategy, and propose to create a new set of features based on analyzing the periodic behavior of the time of a transaction using the von Mises distribution. Then, using a real credit card fraud dataset provided by a large European card processing company, we compare state-of-the-art credit card fraud detection models, and evaluate how the different sets of features have an impact on the results. By including the proposed periodic features into the methods, the results show an average increase in savings of 13%.
ISSN:0957-4174
1873-6793
1873-6793
DOI:10.1016/j.eswa.2015.12.030