Model-Based Geostatistics Under Spatially Varying Preferential Sampling
André Victor Ribeiro Amaral (),
Elias Teixeira Krainski,
Ruiman Zhong and
Paula Moraga
Additional contact information
André Victor Ribeiro Amaral: King Abdullah University of Science and Technology
Elias Teixeira Krainski: King Abdullah University of Science and Technology
Ruiman Zhong: King Abdullah University of Science and Technology
Paula Moraga: King Abdullah University of Science and Technology
Journal of Agricultural, Biological and Environmental Statistics, 2024, vol. 29, issue 4, No 6, 766-792
Abstract:
Abstract Geostatistics is concerned with the estimation and prediction of spatially continuous phenomena using data obtained at a discrete set of locations. In geostatistics, preferential sampling occurs when these locations are not independent of the latent spatial field, and common modeling approaches that do not account for such a dependence structure might yield wrong inferences. To overcome this issue, some methods have been proposed to model data collected under preferential sampling. However, while these methods assume a constant degree of preferentiality, real data may present a degree of preferentiality that varies over space. For that reason, we propose a new model that accounts for preferential sampling by including a spatially varying coefficient that describes the dependence strength between the process that models the sampling locations and the latent field. To do so, we approximate the preferentiality component by a set of basis functions with the corresponding coefficients being estimated using the integrated nested Laplace approximation (INLA) method. By doing that, we allow the degree of preferentiality to vary over the domain with low computational burden. We assess our model performance by means of a simulation study and use it to analyze the average $$\text {PM}_{2.5}$$ PM 2.5 levels in the USA in 2022. We conclude that, given enough observed events, our model, along with the implemented inference routine, retrieves well the latent field itself and the spatially varying preferentiality surface, even under misspecified scenarios. Also, we offer guidelines for the specification and size of the set of basis functions. Supplementary materials accompanying this paper appear online.
Keywords: Air pollution; Geostatistics; INLA; Preferential sampling; Spatial statistics (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s13253-023-00571-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:jagbes:v:29:y:2024:i:4:d:10.1007_s13253-023-00571-0
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/13253
DOI: 10.1007/s13253-023-00571-0
Access Statistics for this article
Journal of Agricultural, Biological and Environmental Statistics is currently edited by Stephen Buckland
More articles in Journal of Agricultural, Biological and Environmental Statistics from Springer, The International Biometric Society, American Statistical Association
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().