Analyses of the variable selection using correlation methods: An approach to the importance of statistical inferences in the modelling process
Mauricio Díaz-Vallejo,
Alexander Peña-Peniche,
Claudio Mota-Vargas,
Javier Piña-Torres,
Daniel Valencia-Rodríguez,
Coral E. Rangel-Rivera,
Juliana Gaviria-Hernández and
Octavio Rojas-Soto
Ecological Modelling, 2024, vol. 498, issue C
Abstract:
Selecting the best set of variables in ecological niche models (ENM) and species distribution models (SDM) has become a topic of interest in correlative models, leading to the use of statistical methods to estimate the relationships between variables. However, selecting sets of variables requires several decisions, such as choosing sources of information (i.e., species records and calibration areas) and statistical methods to optimize the modelling process while preventing the overestimation of parameters. In the present study, we analyzed four scenarios for selecting variables in ENM/SDM, including the implication of using the Pearson and Spearman correlation methods, with two strategies to extract the variables' information: species records and calibration areas. First, we conducted a bibliographic review to determine the most used methods to select variables. 134 of the 150 articles selected applied correlation methods, 47 used Pearson and 18 Spearman, and the remaining 69 did not specify the type of correlation method. Also, 19 articles employed species records, 20 used calibration areas, and 95 did not specify how they selected variables, showing the absence of clarity and consistency in variables selection. Then, we explored the same four combinations for 56 bird species. We conducted normality tests for the variables per species and found a tendency for non-normal distributions. Furthermore, we performed Pearson and Spearman correlations using species records and calibration areas as extraction strategies and discussed the differences between each one. Finally, we built different sets of variables and performance SDM for six species and found that the set of variables selected has a different composition based on their strategy. Our findings highlight the absence of clarity and consistency in describing correlation coefficients commonly used for environmental variable selection and emphasize its significant implications.
Keywords: Ecological niche modelling; Pearson; Spearman; Species distribution modelling; BIOCLIM; Variables interaction networks (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0304380024002813
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:ecomod:v:498:y:2024:i:c:s0304380024002813
DOI: 10.1016/j.ecolmodel.2024.110893
Access Statistics for this article
Ecological Modelling is currently edited by Brian D. Fath
More articles in Ecological Modelling from Elsevier
Bibliographic data for series maintained by Catherine Liu ().