Cluster Analysis to Preprocess the Building Power Usage Data Without Domain Knowledge
This paper aims to provide the advantage of applying cluster analysis as a data preprocessing algorithm. Daily power usage of the office building during a year is analyzed in this study. Density-based clustering algorithm is applied in this study to find outliers of the data. Calendar day of the dat...
Gespeichert in:
Veröffentlicht in: | Journal of electrical engineering & technology 2020, 15(2), , pp.685-692 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper aims to provide the advantage of applying cluster analysis as a data preprocessing algorithm. Daily power usage of the office building during a year is analyzed in this study. Density-based clustering algorithm is applied in this study to find outliers of the data. Calendar day of the data is mapped on the circular time domain to consider the seasonality of power data. Optimal parameters for the data normalization and clustering is found by iterative search procedures. The result of this study found many possible outliers even without considerations for the detailed domain knowledge about the data themselves. Advanced studies such as modeling or statistical analyses can take advantage of outlier-free data from the data preprocessing. |
---|---|
ISSN: | 1975-0102 2093-7423 |
DOI: | 10.1007/s42835-020-00372-2 |