Prediction of environmental missing data time series by Support Vector Machine Regression and Correlation Dimension estimation
Environmental time series are often affected by missing data, namely data unavailability at certain time points. This paper presents the Iterated Imputation and Prediction algorithm, that allows the prediction of time series with missing data. The algorithm uses iteratively the Correlation Dimension...
Gespeichert in:
Veröffentlicht in: | Environmental modelling & software : with environment data news 2022-04, Vol.150, p.105343, Article 105343 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Environmental time series are often affected by missing data, namely data unavailability at certain time points. This paper presents the Iterated Imputation and Prediction algorithm, that allows the prediction of time series with missing data. The algorithm uses iteratively the Correlation Dimension Estimation of the underlying dynamic system generating the time series to fix the model order (i.e., how many past samples are required to model the time series accurately), and the Support Vector Machine Regression to estimate the skeleton of time series. Experimental validation of the algorithm on three environmental time series with missing data, expressing the concentration of Ozone in three European sites, shows a small average percentage prediction error for all time series on the test set.
•The paper presents Iterated Imputation and Prediction (IIP) algorithm for the missing data time series prediction .•IIP uses Correlation Dimension and Support Vector Machine Regression to estimate the model order and the skeleton of time series.•Correlation Dimension is estimated with the proposed Grassberger-Procaccia-Hough algorithm. |
---|---|
ISSN: | 1364-8152 1873-6726 |
DOI: | 10.1016/j.envsoft.2022.105343 |