SUBSETBYVIF: Stata module to select a subset of covariates constrained by VIF
Walton Plummer () and
William D. Dupont ()
Additional contact information
William D. Dupont: Vanderbilt University School of Medicine
Statistical Software Components from Boston College Department of Economics
Abstract:
subsetByVIF selects subsets of the covariates listed in varlist such that each covariate in a given subset has a VIF that is less than or equal to a specified value given by viflist. We are frequently faced with analyzing data sets in which the ratio of covariates to patients is high. There are several approaches to analyzing such data including penalized regression methods, k-fold cross-validation techniques, and bagging. A problem with any of these approaches is that, even after the elimination of variables causing multi-collinearity, the variance-covariance matrix of the remaining covariates is often highly ill-conditioned. The subsetByVIF program reduces the number of covariates to the largest subsample such that the maximum VIF for each variable in the subsample is less than some value specified by the user. These variables are selected without regard to the dependent variable of interest, which should mitigate problems due to overfitting. The use of this program should improve the convergence properties of many methods of exploratory data analysis.
Language: Stata
Requires: Stata version 15
Keywords: covariates; VIF; subset; conditioning (search for similar items in EconPapers)
Date: 2019-04-21, Revised 2019-04-28
Note: This module should be installed from within Stata by typing "ssc install subsetbyvif". The module is made available under terms of the GPL v3 (https://www.gnu.org/licenses/gpl-3.0.txt). Windows users should not attempt to download these files with a web browser.
References: Add references at CitEc
Citations:
Downloads: (external link)
http://fmwww.bc.edu/repec/bocode/s/subsetbyvif.ado program code (text/plain)
http://fmwww.bc.edu/repec/bocode/s/subsetbyvif.sthlp help file (text/plain)
http://fmwww.bc.edu/repec/bocode/s/sample_subsetbyvif.do sample do file (text/plain)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:boc:bocode:s458635
Ordering information: This software item can be ordered from
http://repec.org/docs/ssc.php
Access Statistics for this software item
More software in Statistical Software Components from Boston College Department of Economics Boston College, 140 Commonwealth Avenue, Chestnut Hill MA 02467 USA. Contact information at EDIRC.
Bibliographic data for series maintained by Christopher F Baum ().