Clarifying Influences of Sampling Bias (Concentration) and Locational Errors (Uncertainties) on Precision or Generality of Species Distribution Models
Brice B. Hanberry ()
Additional contact information
Brice B. Hanberry: United States Department of Agriculture, Forest Service, Rocky Mountain Research Station, Rapid City, SD 57702, USA
Land, 2025, vol. 14, issue 8, 1-19
Abstract:
Locational errors and sampling bias may produce unrepresentative species distribution models. To decompose the influence of errors, I modeled species distributions of 31 mammal species from georeferenced records and random samples from range maps, with potential sources of errors added or removed, using the random forests algorithm. Errors included the addition of (1) cities, (2) administrative centers, (3) records flagged as potential errors (e.g., outliers), and (4) urban records to range map samples; the removal of (5) flagged records and (6) urban records from georeferenced records; and the addition of (7) random points and (8) clustered points to georeferenced records. I also examined separation between thinned and unthinned (i.e., locally concentrated) records and ocean and land areas. Errors generally did not perturb species distributions, particularly if errors were located within species ranges. The greatest departure relative to unaltered models (mean niche overlap values of 0.96 out of 1) was due to the addition of administrative centers at a 13% error rate. Because locational errors overall do not occur in modern georeferenced records, outliers may provide important samples from undersampled areas. Delineating land from ocean coordinates may require a land layer at the highest available resolution and buffered to match the distance of locational uncertainty for georeferenced records. Predicted areas for species distributions increased along the spectrum of models from concentrated georeferenced records, thinned records, and random samples from range maps. Species distributions modeled with all georeferenced records will have the greatest sampling concentration (to differentiate from bias, because predictive modeling is not hypothesis testing), resulting in model locational precision, whereas species distribution models from random samples of range maps will have locational generality (rather than errors). The risk of removing samples of suitable conditions is the generation of unrepresentative models whereas the benefit of sample removal is slightly more generalized models, but which also may represent overpredictions.
Keywords: centroid; cities; location; ocean; outlier; thinning; urban (search for similar items in EconPapers)
JEL-codes: Q15 Q2 Q24 Q28 Q5 R14 R52 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2073-445X/14/8/1620/pdf (application/pdf)
https://www.mdpi.com/2073-445X/14/8/1620/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jlands:v:14:y:2025:i:8:p:1620-:d:1720977
Access Statistics for this article
Land is currently edited by Ms. Carol Ma
More articles in Land from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().