Gradient boosting and Shapley additive explanations for fraud detection in electricity distribution grids
Summary Fraud in electrical energy consumption represents a critical economic burden for utility companies around the world. Despite systematic efforts to mitigate electricity theft, this practice persists mostly in developing countries where companies rely on traditional detection methods. In Brazi...
Gespeichert in:
Veröffentlicht in: | International transactions on electrical energy systems 2021-09, Vol.31 (9), p.n/a |
---|---|
Hauptverfasser: | , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Summary
Fraud in electrical energy consumption represents a critical economic burden for utility companies around the world. Despite systematic efforts to mitigate electricity theft, this practice persists mostly in developing countries where companies rely on traditional detection methods. In Brazil it is estimated that around 7% of the total electrical energy available for consumption in 2016 was lost due to frauds. Here we describe an efficient and scalable system to predict fraudulent behavior and guide in loco inspections. We compared the performances of several machine learning algorithms using consumption and inspection data provided by CPFL Energia. We show that proper feature engineering and boosted classification trees trained with XGBoost are able to extract patterns related to fraud occurrence and to achieve predictive power of practical consequences. Moreover, we demonstrate how Shapley additive explanation (SHAP) values can be employed to build user friendly explanations. Together, the proposed model and its explainers contribute not only to reveal potentially fraudulent behavior but also to understand root causes, what can be used to devise robust mitigation strategies.
We devise a machine learning‐based system to predict fraudulent behaviour and guide in loco inspections. We show that boosted decision trees can efficiently extract fraud patterns. The decisions generated can be explained by SHAP values. |
---|---|
ISSN: | 2050-7038 2050-7038 |
DOI: | 10.1002/2050-7038.13046 |