SMOOTHED FULL-SCALE APPROXIMATION OF GAUSSIAN PROCESS MODELS FOR COMPUTATION OF LARGE SPATIAL DATA SETS

Gaussian process (GP) models encounter computational difficulties with large spatial data sets, because the models’ computational complexity grows cubically with the sample size n. Although a full-scale approximation (FSA) using a block modulating function provides an e ective way to approximate GP...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Statistica Sinica 2020, Vol.29 (4), p.1711-1737
Hauptverfasser: Zhang, Bohai, Sang, Huiyan, Huang, Jianhua Z.
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Gaussian process (GP) models encounter computational difficulties with large spatial data sets, because the models’ computational complexity grows cubically with the sample size n. Although a full-scale approximation (FSA) using a block modulating function provides an e ective way to approximate GP models, it has several shortcomings. These include a less smooth prediction surface on block boundaries and sensitivity to the knot set under small-scale data dependence. To address these issues, we propose a smoothed full-scale approximation (SFSA) method for analyzing large spatial data sets. The SFSA leads to a class of scalable GP models, with covariance functions that consist of two parts: a reduced-rank covariance function that captures large-scale spatial dependence, and a covariance that adjusts the local covariance approximation errors of the reduced-rank part, both within blocks and between neighboring blocks. This method reduces the prediction errors on block boundaries, and leads to inference and prediction results that are more robust under different dependence scales owing to the better approximation of the residual covariance. The proposed method provides a unied view of approximation methods for GP models, grouping several existing computational methods for large spatial data sets into one common framework. These methods include the predictive process, FSA, and nearest neighboring block GP methods, allowing efficient algorithms that provide robust and accurate model inferences and predictions for large spatial data sets within a united framework. We illustrate the e ectiveness of the SFSA approach using simulation studies and a total column ozone data set.
ISSN:1017-0405
1996-8507
DOI:10.5705/ss.202017.0008