Gaussian Predictive Process Models for Large Spatial Data Sets

Type: Article

Publication Date: 2008-07-08

Citations: 992

DOI: https://doi.org/10.1111/j.1467-9868.2008.00663.x

Abstract

Summary With scientific data available at geocoded locations, investigators are increasingly turning to spatial process models for carrying out statistical inference. Over the last decade, hierarchical models implemented through Markov chain Monte Carlo methods have become especially popular for spatial modelling, given their flexibility and power to fit models that would be infeasible with classical methods as well as their avoidance of possibly inappropriate asymptotics. However, fitting hierarchical spatial models often involves expensive matrix decompositions whose computational complexity increases in cubic order with the number of spatial locations, rendering such models infeasible for large spatial data sets. This computational burden is exacerbated in multivariate settings with several spatially dependent response variables. It is also aggravated when data are collected at frequent time points and spatiotemporal process models are used. With regard to this challenge, our contribution is to work with what we call predictive process models for spatial and spatiotemporal data. Every spatial (or spatiotemporal) process induces a predictive process model (in fact, arbitrarily many of them). The latter models project process realizations of the former to a lower dimensional subspace, thereby reducing the computational burden. Hence, we achieve the flexibility to accommodate non-stationary, non-Gaussian, possibly multivariate, possibly spatiotemporal processes in the context of large data sets. We discuss attractive theoretical properties of these predictive processes. We also provide a computational template encompassing these diverse settings. Finally, we illustrate the approach with simulated and real data sets.

Locations

  • Journal of the Royal Statistical Society Series B (Statistical Methodology) - View - PDF
  • PubMed Central - View
  • Europe PMC (PubMed Central) - View - PDF
  • PubMed - View

Similar Works

Action Title Year Authors
+ PDF Chat Adaptive Gaussian predictive process models for large spatial datasets 2011 Rajarshi Guhaniyogi
Andrew O. Finley
Sudipto Banerjee
Alan E. Gelfand
+ PDF Chat Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets 2015 Abhirup Datta
Sudipto Banerjee
Andrew O. Finley
Alan E. Gelfand
+ Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets 2014 Abhirup Datta
Sudipto Banerjee
Andrew O. Finley
Alan E. Gelfand
+ PDF Chat Approximate Bayesian inference for large spatial datasets using predictive process models 2011 Jo Eidsvik
Andrew O. Finley
Sudipto Banerjee
HĂ„vard Rue
+ PDF Chat Scalable Predictions for Spatial Probit Linear Mixed Models Using Nearest Neighbor Gaussian Processes 2022 Arkajyoti Saha
Abhirup Datta
Sudipto Banerjee
+ High-Dimensional Bayesian Geostatistics 2017 Sudipto Banerjee
+ High-Dimensional Bayesian Geostatistics 2017 Sudipto Banerjee
+ Methods for Analyzing Large Spatial Data: A Review and Comparison 2017 Matthew J. Heaton
Abhirup Datta
Andrew O. Finley
Reinhard Furrer
Rajarshi Guhaniyogi
Florian Gerber
Robert B. Gramacy
Dorit Hammerling
Matthias Katzfuß
Finn Lindgren
+ Improved evaluation of predictive probabilities in probit models with Gaussian process priors 2020 Jian Cao
Daniele Durante
Marc G. Genton
+ High-Dimensional Bayesian Geostatistics 2017 Sudipto Banerjee
+ PDF Chat An accuracy-runtime trade-off comparison of scalable Gaussian process approximations for spatial data 2025 Filippo Rambelli
Fabio Sigrist
+ A Case Study Competition Among Methods for Analyzing Large Spatial Data 2017 Matthew J. Heaton
Abhirup Datta
Andrew O. Finley
Reinhard Furrer
Rajarshi Guhaniyogi
Florian Gerber
Robert B. Gramacy
Dorit Hammerling
Matthias Katzfuß
Finn Lindgren
+ Modelling Big, Heterogeneous, Non-Gaussian Spatial and Spatio-Temporal Data using FRK 2021 Matthew Sainsbury-Dale
Andrew Zammit‐Mangion
Noel Cressie
+ PDF Chat A Variational Approach for Modeling High-dimensional Spatial Generalized Linear Mixed Models 2024 Jin Hyung Lee
Ben Seiyon Lee
+ Sparse Cholesky matrices in spatial statistics 2021 Abhirup Datta
+ A Case Study Competition Among Methods for Analyzing Large Spatial Data 2017 Matthew J. Heaton
Abhirup Datta
Andrew O. Finley
Reinhard Furrer
Rajarshi Guhaniyogi
Florian Gerber
Robert B. Gramacy
Dorit Hammerling
Matthias Katzfuß
Finn Lindgren
+ PDF Chat Spatial factor modeling: A Bayesian matrix‐normal approach for misaligned data 2021 Lu Zhang
Sudipto Banerjee
+ Modelling, Fitting, and Prediction with Non-Gaussian Spatial and Spatio-Temporal Data using FRK 2021 Matthew Sainsbury-Dale
Andrew Zammit‐Mangion
Noel Cressie
+ The integrated nested Laplace approximation applied to spatial log-Gaussian Cox process models 2022 Kenneth Flagg
Andrew Hoegh
+ Nonstationary Nearest Neighbor Gaussian Process: hierarchical model architecture and MCMC sampling 2022 SĂ©bastien Coube-Sisqueille
Sudipto Banerjee
BenoĂźt Liquet

Works That Cite This (526)

Action Title Year Authors
+ Smoothing and Mean–Covariance Estimation of Functional Data with a Bayesian Hierarchical Model 2015 Jingjing Yang
Hongxiao Zhu
Taeryon Choi
Dennis D. Cox
+ PDF Chat A review of predictive uncertainty estimation with machine learning 2024 Hristos Tyralis
Georgia Papacharalampous
+ PDF Chat The how and why of Bayesian nonparametric causal inference 2022 Antonio R. Linero
Joseph Antonelli
+ PDF Chat sdmTMB: An R Package for Fast, Flexible, and User-Friendly Generalized Linear Mixed Effects Models with Spatial and Spatiotemporal Random Fields 2022 Sean C. Anderson
Eric J. Ward
Philina A. English
Lewis A. K. Barnett
James T. Thorson
+ PDF Chat Exact Gaussian processes for massive datasets via non-stationary sparsity-discovering kernels 2023 Marcus M. Noack
Harinarayan Krishnan
Mark D. Risser
Kristofer G. Reyes
+ PDF Chat Boundary Detection Using a Bayesian Hierarchical Model for Multiscale Spatial Data 2019 Kai Qu
Jonathan R. Bradley
Xufeng Niu
+ Bayesian Latent Variable Co-kriging Model in Remote Sensing for Quality Flagged Observations 2023 Bledar A. Konomi
Emily L. Kang
Ayat Almomani
Jonathan Hobbs
+ Efficient Bayesian modeling of large lattice data using spectral properties of Laplacian matrix 2019 Ghadeer Jasim Mohammed Mahdi
Avishek Chakraborty
Mark E. Arnold
Anthony G. Rebelo
+ PDF Chat SpaceANOVA: Spatial Co-occurrence Analysis of Cell Types in Multiplex Imaging Data Using Point Process and Functional ANOVA 2024 Souvik Seal
Brian Neelon
Peggi M. Angel
Elizabeth C. O’Quinn
Elizabeth G. Hill
Thao Vu
Debashis Ghosh
Anand S. Mehta
Kristin Wallace
Alexander V. Alekseyenko
+ Going off grid: Computationally efficient inference for log-Gaussian Cox processes 2011 Daniel Simpson
Janine Illian
Finn Lindgren
Sigrunn H. SĂžrbye
HĂ„vard Rue