An accuracy-runtime trade-off comparison of scalable Gaussian process approximations for spatial data

Type: Preprint

Publication Date: 2025-01-20

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2501.11448

Abstract

Gaussian processes (GPs) are flexible, probabilistic, non-parametric models widely used in fields such as spatial statistics, time series analysis, and machine learning. A drawback of GPs is their computational cost: $\mathcal{O}(N^3)$ time and $\mathcal{O}(N^2)$ memory for $N$ observations, which makes them prohibitive for large datasets. Numerous approximation techniques have been proposed to address this limitation. In this work, we systematically compare the accuracy of different GP approximations with respect to marginal likelihood evaluation, parameter estimation, and prediction, taking into account the time required to achieve a given accuracy. We analyze this trade-off between accuracy and runtime on multiple simulated and large-scale real-world datasets and find that Vecchia approximations emerge as the most accurate in almost all experiments. However, for certain real-world datasets, low-rank inducing-point-based methods, i.e., full-scale and modified predictive process approximations, can provide more accurate predictive distributions for extrapolation.
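
To make the cost argument in the abstract concrete, the following is a minimal NumPy sketch and not the authors' implementation. It contrasts the exact zero-mean GP log marginal likelihood, computed with a dense Cholesky factor, with a basic Vecchia approximation that conditions each observation on at most m earlier neighbors. The function names (`exp_kernel`, `exact_loglik`, `vecchia_loglik`), the exponential covariance, the parameter values, the use of the input order as the Vecchia ordering, and the Euclidean nearest-neighbor choice are all illustrative assumptions.

```python
import numpy as np

def exp_kernel(X, Y, variance=1.0, length_scale=0.2):
    """Exponential covariance between rows of X and Y (an illustrative choice)."""
    d = np.linalg.norm(X[:, None, :] - Y[None, :, :], axis=-1)
    return variance * np.exp(-d / length_scale)

def exact_loglik(X, y, noise=0.1):
    """Exact zero-mean GP log marginal likelihood via a dense Cholesky factor."""
    n = len(y)
    K = exp_kernel(X, X) + noise * np.eye(n)   # O(N^2) memory
    L = np.linalg.cholesky(K)                  # O(N^3) time
    alpha = np.linalg.solve(L, y)              # L^{-1} y
    return (-0.5 * alpha @ alpha
            - np.log(np.diag(L)).sum()
            - 0.5 * n * np.log(2 * np.pi))

def vecchia_loglik(X, y, m, noise=0.1):
    """Sum of Gaussian conditional log densities, each conditioning on at most
    m nearest points that come earlier in the (assumed) input ordering."""
    total = 0.0
    for i in range(len(y)):
        k_ii = exp_kernel(X[i:i + 1], X[i:i + 1])[0, 0] + noise
        if i == 0:
            mean, var = 0.0, k_ii
        else:
            dist = np.linalg.norm(X[:i] - X[i], axis=1)
            nb = np.argsort(dist)[:m]          # indices of conditioning set
            K_nn = exp_kernel(X[nb], X[nb]) + noise * np.eye(nb.size)
            k_in = exp_kernel(X[i:i + 1], X[nb])[0]
            w = np.linalg.solve(K_nn, k_in)    # small m x m solve
            mean = w @ y[nb]
            var = k_ii - w @ k_in
        total += -0.5 * (np.log(2 * np.pi * var) + (y[i] - mean) ** 2 / var)
    return total

# Usage: simulate a small dataset and compare both evaluations.
rng = np.random.default_rng(0)
X_demo = rng.uniform(size=(200, 2))
K_demo = exp_kernel(X_demo, X_demo) + 0.1 * np.eye(200)
y_demo = np.linalg.cholesky(K_demo) @ rng.standard_normal(200)
print("exact  :", exact_loglik(X_demo, y_demo))
print("vecchia:", vecchia_loglik(X_demo, y_demo, m=10))
```

When m is at least N - 1, every point conditions on all earlier points, so the sum of conditionals reproduces the exact log likelihood by the chain rule. Smaller m trades accuracy for runtime, which is the kind of trade-off the abstract describes.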

Locations

  • arXiv (Cornell University)

Similar Works

  • Iterative Methods for Full-Scale Gaussian Process Approximations for Large Spatial Data (2024). Tim Gyger, Reinhard Furrer, Fabio Sigrist
  • A General Framework for Vecchia Approximations of Gaussian Processes (2020). Matthias Katzfuß, Joseph Guinness
  • Vecchia–Laplace approximations of generalized Gaussian processes for big non-Gaussian spatial data (2020). Daniel Zilber, Matthias Katzfuß
  • When Gaussian Process Meets Big Data: A Review of Scalable GPs (2020). Haitao Liu, Yew-Soon Ong, Xiaobo Shen, Jianfei Cai
  • When Gaussian Process Meets Big Data: A Review of Scalable GPs (2018). Haitao Liu, Yew-Soon Ong, Xiaobo Shen, Jianfei Cai
  • Methods for Analyzing Large Spatial Data: A Review and Comparison (2017). Matthew J. Heaton, Abhirup Datta, Andrew O. Finley, Reinhard Furrer, Rajarshi Guhaniyogi, Florian Gerber, Robert B. Gramacy, Dorit Hammerling, Matthias Katzfuß, Finn Lindgren
  • A Case Study Competition Among Methods for Analyzing Large Spatial Data (2017). Matthew J. Heaton, Abhirup Datta, Andrew O. Finley, Reinhard Furrer, Rajarshi Guhaniyogi, Florian Gerber, Robert B. Gramacy, Dorit Hammerling, Matthias Katzfuß, Finn Lindgren
  • Block Vecchia Approximation for Scalable and Efficient Gaussian Process Computations (2024). Qilong Pan, Sameh Abdulah, Marc G. Genton, Ying Sun
  • Spatial Process Approximations: Assessing Their Necessity (2023). Hao Zhang
  • Compactly-supported nonstationary kernels for computing exact Gaussian processes on big data (2024). Mark D. Risser, Marcus M. Noack, Hengrui Luo, Ronald Pandolfi
  • Improved evaluation of predictive probabilities in probit models with Gaussian process priors (2020). Jian Cao, Daniele Durante, Marc G. Genton
  • Implementation and Analysis of GPU Algorithms for Vecchia Approximation (2024). Zachary James, Joseph Guinness
  • Gaussian Processes for High-Dimensional, Large Data Sets: A Review (2022). Mengrui Mina Jiang, Giulia Pedrielli, Szu Hui Ng
  • Parallel Gaussian Process Regression with Low-Rank Covariance Matrix Approximations (2014). Jie Chen, N. Cao, Kian Hsiang Low, Ruofei Ouyang, Colin Keng-Yan Tan, Patrick Jaillet
  • Parallel Gaussian Process Regression with Low-Rank Covariance Matrix Approximations (2013). Jie Chen, N. Cao, Kian Hsiang Low, Ruofei Ouyang, Colin Keng-Yan Tan, Patrick Jaillet
  • Gaussian Predictive Process Models for Large Spatial Data Sets (2008). Sudipto Banerjee, Alan E. Gelfand, Andrew O. Finley, Huiyan Sang
  • A Vecchia Approximation for High-Dimensional Gaussian Cumulative Distribution Functions Arising from Spatial Data (2020). Maurício Abujamra Nascimento, Benjamin A. Shaby
  • Fixed-Domain Asymptotics Under Vecchia's Approximation of Spatial Process Likelihoods (2021). Lu Zhang, Wenpin Tang, Sudipto Banerjee
  • Fixed-Domain Asymptotics Under Vecchia's Approximation of Spatial Process Likelihoods (2023). Lu Zhang, Wenpin Tang, Sudipto Banerjee

Works That Cite This (0)


Works Cited by This (0)
