Exploring the Efficacy of Statistical and Deep Learning Methods for Large Spatial Datasets: A Case Study
Arnab Hazra (),
Pratik Nag (),
Rishikesh Yadav () and
Ying Sun ()
Additional contact information
Arnab Hazra: Indian Institute of Technology Kanpur
Pratik Nag: King Abdullah University of Science and Technology (KAUST)
Rishikesh Yadav: HEC Montreal
Ying Sun: King Abdullah University of Science and Technology (KAUST)
Journal of Agricultural, Biological and Environmental Statistics, 2025, vol. 30, issue 1, No 11, 254 pages
Abstract:
Abstract Increasingly large and complex spatial datasets pose massive inferential challenges due to high computational and storage costs. Our study is motivated by the KAUST Competition on Large Spatial Datasets 2023, which tasked participants with estimating spatial covariance-related parameters and predicting values at testing sites, along with uncertainty estimates. We compared various statistical and deep learning approaches through cross-validation and ultimately selected the Vecchia approximation technique for model fitting. To overcome the constraints in the R package GpGp, which lacked support for fitting zero-mean Gaussian processes and direct uncertainty estimation—two things that are necessary for the competition, we developed additional R functions. Besides, we implemented certain subsampling-based approximations and parametric smoothing for skewed sampling distributions of the estimators. Our team DesiBoys secured the first position in two out of four sub-competitions and the second position in the other two, validating the effectiveness of our proposed strategies. Moreover, we extended our evaluation to a large real spatial satellite-derived dataset on total precipitable water, where we compared the predictive performances of different models using multiple diagnostics.
Keywords: Cross-validation; Deep learning; Gaussian process; Large spatial datasets; Total precipitable water; Vecchia approximation (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s13253-024-00602-4 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:jagbes:v:30:y:2025:i:1:d:10.1007_s13253-024-00602-4
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/13253
DOI: 10.1007/s13253-024-00602-4
Access Statistics for this article
Journal of Agricultural, Biological and Environmental Statistics is currently edited by Stephen Buckland
More articles in Journal of Agricultural, Biological and Environmental Statistics from Springer, The International Biometric Society, American Statistical Association
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().