Automated detection of photovoltaic cleaning events: A performance comparison of techniques as applied to a broad set of labeled photovoltaic data sets

Bibliographic Details
Published in: Progress in photovoltaics 2022-05, Vol.30 (5), p.567-577
Main Authors: Muller, Matthew; Perry, Kirsten; Micheli, Leonardo; Almonacid, Florencia; Fernández, Eduardo F.
Format: Article
Language: English
Description
Summary: Extracting accurate soiling loss information from photovoltaic (PV) production data first requires segmenting the time series data at natural or manual cleaning events. Maintenance logs are often incomplete, rain data are often unavailable, and the debate on rain thresholds for cleaning and on dew or wind cleanings is still ongoing. The present work aims to overcome these issues by improving automated methods to detect these cleaning events and thereby improve extraction of soiling loss information. Time series power production data from 22 PV inverters were labeled for natural or manually occurring cleaning events. The data sets were carefully selected to include varying degrees of soiling, cleaning events, and noise. Several algorithms, including filtering logic and change point detection, were examined for efficacy at detecting the labeled cleanings. All the methods introduced except for change point detection showed significant improvement at detecting the labeled cleaning events, as measured by the mean F1 score. Furthermore, the highest performing cleaning detection algorithm achieved an absolute increase in the mean F1 score of 43% over the default version of the RdTools stochastic rate and recovery (SRR) algorithm. The highest performing algorithm included irradiance filtering and a cleaning detection threshold adjusted based on the 40-day centered rolling median of the absolute day-to-day deviations in the daily performance index (PI). These improvements are promising, as cleaning detection is an essential step in the automated analysis of PV soiling. Over 1000 combinations of tuning parameters, algorithms, and filtering criteria were tested to improve automated cleaning detection within the 22 time series from PV inverters.
The data sets have been made publicly available on the DuraMAT DataHub at https://datahub.duramat.org/project/example-data.
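
The adaptive threshold and the F1 scoring described in the summary can be illustrated with a minimal Python sketch. It is not the RdTools SRR code and not the authors' implementation: the function names, the `scale` multiplier, the exact-day matching rule for F1, and the synthetic PI series are all assumptions made for illustration, and irradiance filtering is omitted. The window handling (about 20 days on each side, shrinking at the ends of the series) is likewise only an approximation of a 40-day centered window.

```python
from statistics import median


def detect_cleanings(pi, window=40, scale=3.0):
    """Flag days whose positive PI step exceeds an adaptive threshold.

    pi     : list of daily performance index values
    window : approximate width (days) of the centered rolling median
    scale  : hypothetical multiplier applied to the local noise estimate
    Returns the indices of days flagged as cleaning events.
    """
    # Day-to-day changes; steps[i] is the change from day i to day i + 1.
    steps = [b - a for a, b in zip(pi, pi[1:])]
    abs_steps = [abs(s) for s in steps]
    half = window // 2
    events = []
    for i, step in enumerate(steps):
        # Centered window over absolute changes, truncated at the series ends.
        lo = max(0, i - half)
        hi = min(len(abs_steps), i + half + 1)
        noise = median(abs_steps[lo:hi])
        # A cleaning shows up as an upward shift well above the local noise.
        if step > scale * noise:
            events.append(i + 1)
    return events


def f1_score(detected, labeled):
    """F1 score with exact-day matching (an assumed matching rule)."""
    detected, labeled = set(detected), set(labeled)
    true_pos = len(detected & labeled)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(detected)
    recall = true_pos / len(labeled)
    return 2 * precision * recall / (precision + recall)


# Synthetic series: PI falls 0.002 per day and resets to 1.0 every 30 days.
pi_series = [1.0 - 0.002 * (day % 30) for day in range(90)]
found = detect_cleanings(pi_series)
score = f1_score(found, labeled=[30, 60, 75])
```

On this synthetic series the two resets are the only upward steps, so they are the only candidates for detection; the hypothetical label on day 75 has no matching shift and lowers recall. Real production data are far noisier, which is why the threshold is tied to a local noise estimate rather than a fixed value.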
ISSN: 1062-7995, 1099-159X
DOI: 10.1002/pip.3523