Variable importance in matched case–control studies in settings of high dimensional data
Raji Balasubramanian,
E. Andres Houseman,
Brent A. Coull,
Michael H. Lev,
Lee H. Schwamm and
Rebecca A. Betensky
Journal of the Royal Statistical Society Series C, 2014, vol. 63, issue 4, 639-655
Abstract:
type="main" xml:id="rssc12056-abs-0001">
We propose a method for assessing variable importance in matched case–control investigations and other highly stratified studies characterized by high dimensional data (p>>n). In simulated and real data sets, we show that the algorithm proposed performs better than a conventional univariate method (conditional logistic regression) and a popular multivariable algorithm (random forests) that does not take the matching into account. The methods are applicable to wide ranging, high impact clinical studies including metabolomic, proteomic studies and neuroimaging analyses, such as those assessing stroke and Alzheimer's disease. The methods proposed have been implemented in a freely available R library ( http://cran .r-project.org/web/packages/RPCLR/index.html ).
Date: 2014
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1111/rssc.2014.63.issue-4 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:jorssc:v:63:y:2014:i:4:p:639-655
Ordering information: This journal article can be ordered from
http://ordering.onli ... 1111/(ISSN)1467-9876
Access Statistics for this article
Journal of the Royal Statistical Society Series C is currently edited by R. Chandler and P. W. F. Smith
More articles in Journal of the Royal Statistical Society Series C from Royal Statistical Society Contact information at EDIRC.
Bibliographic data for series maintained by Wiley Content Delivery ().