Multivariate Time Series Cleaning under Speed Constraints
Errors are common in time series due to unreliable sensor measurements. Existing methods focus on univariate data but do not utilize the correlation between dimensions. Cleaning each dimension separately may lead to a less accurate result, as some errors can only be identified in the multivariate ca...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Errors are common in time series due to unreliable sensor measurements.
Existing methods focus on univariate data but do not utilize the correlation
between dimensions. Cleaning each dimension separately may lead to a less
accurate result, as some errors can only be identified in the multivariate
case. We also point out that the widely used minimum change principle is not
always the best choice. Instead, we try to change the smallest number of data
to avoid a significant change in the data distribution. In this paper, we
propose MTCSC, the constraint-based method for cleaning multivariate time
series. We formalize the repair problem, propose a linear-time method to employ
online computing, and improve it by exploiting data trends. We also support
adaptive speed constraint capturing. We analyze the properties of our proposals
and compare them with SOTA methods in terms of effectiveness, efficiency versus
error rates, data sizes, and applications such as classification. Experiments
on real datasets show that MTCSC can have higher repair accuracy with less time
consumption. Interestingly, it can be effective even when there are only weak
or no correlations between the dimensions. |
---|---|
DOI: | 10.48550/arxiv.2411.01214 |