SMOOTHED FULL-SCALE APPROXIMATION OF GAUSSIAN PROCESS MODELS FOR COMPUTATION OF LARGE SPATIAL DATA SETS
Gaussian process (GP) models encounter computational difficulties with large spatial data sets, because the models’ computational complexity grows cubically with the sample size n. Although a full-scale approximation (FSA) using a block modulating function provides an e ective way to approximate GP...
Gespeichert in:
Veröffentlicht in: | Statistica Sinica 2020, Vol.29 (4), p.1711-1737 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Gaussian process (GP) models encounter computational difficulties with large spatial data sets, because the models’ computational complexity grows cubically with the sample size n. Although a full-scale approximation (FSA) using a block modulating function provides an e ective way to approximate GP models, it has several shortcomings. These include a less smooth prediction surface on block boundaries and sensitivity to the knot set under small-scale data dependence. To address these issues, we propose a smoothed full-scale approximation (SFSA) method for analyzing large spatial data sets. The SFSA leads to a class of scalable GP models, with covariance functions that consist of two parts: a reduced-rank covariance function that captures large-scale spatial dependence, and a covariance that adjusts the local covariance approximation errors of the reduced-rank part, both within blocks and between neighboring blocks. This method reduces the prediction errors on block boundaries, and leads to inference and prediction results that are more robust under different dependence scales owing to the better approximation of the residual covariance. The proposed method provides a unied view of approximation methods for GP models, grouping several existing computational methods for large spatial data sets into one common framework. These methods include the predictive process, FSA, and nearest neighboring block GP methods, allowing efficient algorithms that provide robust and accurate model inferences and predictions for large spatial data sets within a united framework. We illustrate the e ectiveness of the SFSA approach using simulation studies and a total column ozone data set. |
---|---|
ISSN: | 1017-0405 1996-8507 |
DOI: | 10.5705/ss.202017.0008 |