Online nonparametric monitoring of heterogeneous data streams with partial observations based on Thompson sampling
With the rapid advancement of sensor technology driven by Internet-of-Things-enabled applications, tremendous amounts of measurements of heterogeneous data streams are frequently acquired for online process monitoring. Such massive data, involving a large number of data streams with high sampling fr...
Gespeichert in:
Veröffentlicht in: | IIE transactions 2023-04, Vol.55 (4), p.392-404 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | With the rapid advancement of sensor technology driven by Internet-of-Things-enabled applications, tremendous amounts of measurements of heterogeneous data streams are frequently acquired for online process monitoring. Such massive data, involving a large number of data streams with high sampling frequency, incur high costs on data collection, transmission, and analysis in practice. As a result, the resource constraint often restricts the data observability to only a subset of data streams at each data acquisition time, posing significant challenges in many online monitoring applications. Unfortunately, existing methods do not provide a general framework for monitoring heterogeneous data streams with partial observations. In this article, we propose a nonparametric monitoring and sampling algorithm to quickly detect abnormalities occurring to heterogeneous data streams. In particular, an approximation framework is incorporated with an antirank-based CUSUM procedure to collectively estimate the underlying status of all data streams based on partially observed data. Furthermore, an intelligent sampling strategy based on Thompson sampling is proposed to dynamically observe the informative data streams and balance between exploration and exploitation to facilitate quick anomaly detection. Theoretical justification of the proposed algorithm is also investigated. Both simulations and case studies are conducted to demonstrate the superiority of the proposed method. |
---|---|
ISSN: | 2472-5854 2472-5862 |
DOI: | 10.1080/24725854.2022.2039423 |