A multi-model relationship detection method to assist with naïve exploration of high-dimensional data
James Christopher Wiley,
Simon English,
Kinsey Church,
Richard A Ward,
James Flowers and
Céline Chamoun
No 3h9yp_v1, OSF Preprints from Center for Open Science
Abstract:
This paper presents a method for classifying predictors Xp based on their relationship to an outcome variable Y; as having main effects, interactions, collinearity, or no effects. The presented method operates by combining complimentary information from a multivariate model and a series of bivariate models. We demonstrate how the method works using simulated data. In addition, we experimentally vary the effect sizes in our data generation process to see if the proposed method can detect different relationships between predictors Xp and outcome Y at varied strengths. We also vary the sample size (n) and observe the impact on relationship classification. We find that the proposed method functions as desired within the constraints of this study. We propose future simulation designs for continued testing of said method. We conclude by providing broad instructions for applying this method. Our goal is to use this method to develop initial analytical profiles of high-dimensional data in naïve data exploration contexts. This work stems from trying to find an efficient alternative to scatterplot matrices when exploring data that contain thousands of variables.
Date: 2025-02-21
New Economics Papers: this item is included in nep-ecm
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://osf.io/download/67b353e9780d7136620c7eb5/
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:osf:osfxxx:3h9yp_v1
DOI: 10.31219/osf.io/3h9yp_v1
Access Statistics for this paper
More papers in OSF Preprints from Center for Open Science
Bibliographic data for series maintained by OSF ().