Gradient boosting and Shapley additive explanations for fraud detection in electricity distribution grids

Summary Fraud in electrical energy consumption represents a critical economic burden for utility companies around the world. Despite systematic efforts to mitigate electricity theft, this practice persists mostly in developing countries where companies rely on traditional detection methods. In Brazi...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	International transactions on electrical energy systems 2021-09, Vol.31 (9), p.n/a
Hauptverfasser:	Santos, Ricardo N., Yamouni, Sami, Albiero, Beatriz, Vicente, Renato, Silva, Juliano, Souza, Tales, Freitas Souza, Mario, Lei, Zhili
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Developing countries Electric power distribution Electricity Energy consumption energy distribution Fraud Inspection LDCs Machine learning Mitigation SHAP Theft XGBoost
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Summary Fraud in electrical energy consumption represents a critical economic burden for utility companies around the world. Despite systematic efforts to mitigate electricity theft, this practice persists mostly in developing countries where companies rely on traditional detection methods. In Brazil it is estimated that around 7% of the total electrical energy available for consumption in 2016 was lost due to frauds. Here we describe an efficient and scalable system to predict fraudulent behavior and guide in loco inspections. We compared the performances of several machine learning algorithms using consumption and inspection data provided by CPFL Energia. We show that proper feature engineering and boosted classification trees trained with XGBoost are able to extract patterns related to fraud occurrence and to achieve predictive power of practical consequences. Moreover, we demonstrate how Shapley additive explanation (SHAP) values can be employed to build user friendly explanations. Together, the proposed model and its explainers contribute not only to reveal potentially fraudulent behavior but also to understand root causes, what can be used to devise robust mitigation strategies. We devise a machine learning‐based system to predict fraudulent behaviour and guide in loco inspections. We show that boosted decision trees can efficiently extract fraud patterns. The decisions generated can be explained by SHAP values.
ISSN:	2050-7038 2050-7038
DOI:	10.1002/2050-7038.13046