Enhancing Time Series Predictors With Generalized Extreme Value Loss

Time series prediction has wide applications in many safety-critical scenarios, including meteorology and finance. According to previous studies, time series of recorded events, e.g., river level and stock price, usually contain a non-trivial proportion of extreme events (e.g., flood and financial c...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on knowledge and data engineering 2023-02, Vol.35 (2), p.1473-1487
Hauptverfasser: Zhang, Mi, Ding, Daizong, Pan, Xudong, Yang, Min
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Time series prediction has wide applications in many safety-critical scenarios, including meteorology and finance. According to previous studies, time series of recorded events, e.g., river level and stock price, usually contain a non-trivial proportion of extreme events (e.g., flood and financial crisis), which are featured with extremely large/small values, occur in time series data with a relatively low frequency, and may have huge societal consequences if overlooked by a predictive model (i.e., predictor). Despite its significance in time series, we however observe the conventional square loss in time series prediction would ignore the modeling of extreme events. Specifically, we prove the square loss as a learning objective of the predictor behaves equivalently as a Gaussian kernel density estimator (KDE) on the recorded events, which is light-tailed itself and unable to model the ground-truth event distribution, usually heavy-tailed due to the existence of extreme events. Considering the benefits of forecasting extreme events, we propose a unified loss form called Generalized Extreme Value Loss (GEVL), which bridges the misalignment between the tail parts of the estimation and the ground-truth via transformations on either the observed events or the estimator. Following the proposed framework, we present three heavy-tailed kernels, i.e., shifted Gaussian, Gumbel and Fréchet kernels, and derive the corresponding GEVLs which show different levels of trade-off between modeling effectiveness and computational resources, suitable for various downstream tasks. Comprehensive experiments on a diverse set of time series predictors and real-world datasets validate that, our novel loss form substantially enhances representative time series predictors in modeling extreme events. For example, for CO2 concentration rate prediction and stock price prediction, our proposed Fréchet GEVL respectively reduces the RMSE of 6 representative DNN-based time series predictors on extreme events by over 20\% 20% and 17\% 17% on average, with a maximum reduction of
ISSN:1041-4347
1558-2191
DOI:10.1109/TKDE.2021.3108831