Adjusting for population differences using machine learning methods
Lauren Cappiello,
Zhiwei Zhang,
Changyu Shen,
Neel M. Butala,
Xinping Cui and
Robert W. Yeh
Journal of the Royal Statistical Society Series C, 2021, vol. 70, issue 3, 750-769
Abstract:
The use of real‐world data for medical treatment evaluation frequently requires adjusting for population differences. We consider this problem in the context of estimating mean outcomes and treatment differences in a well‐defined target population, using clinical data from a study population that overlaps with but differs from the target population in terms of patient characteristics. The current literature on this subject includes a variety of statistical methods, which generally require correct specification of at least one parametric regression model. In this article, we propose to use machine learning methods to estimate nuisance functions and incorporate the machine learning estimates into existing doubly robust estimators. This leads to nonparametric estimators that are root‐n‐consistent, asymptotically normal and asymptotically efficient under general conditions. Simulation results demonstrate that the proposed methods perform reasonably well in realistic settings. The methods are illustrated with a cardiology example concerning aortic stenosis.
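To make the idea in the abstract concrete, the following is a minimal sketch of a doubly robust estimate of a target-population mean with nonparametric nuisance estimates. It is not the authors' estimator: a k-nearest-neighbour average stands in for the machine learning method, sample splitting is omitted, and the single covariate, the simulated data, the function names (`knn_mean`, `dr_target_mean`) and the tuning values (`k`, `clip`) are all assumptions made for illustration. The estimator averages the fitted outcome regression over the target sample and adds an odds-weighted residual correction from the study sample.

```python
import random

def knn_mean(x0, xs, vals, k):
    """Average of vals over the k points in xs closest to x0.
    A simple nonparametric learner, used here as a stand-in for an ML method."""
    nearest = sorted(range(len(xs)), key=lambda i: abs(xs[i] - x0))[:k]
    return sum(vals[i] for i in nearest) / len(nearest)

def dr_target_mean(x, s, y, k=25, clip=0.05):
    """Doubly robust estimate of the mean outcome in the target population.
    x: covariate values; s: 1 = study sample, 0 = target sample;
    y: outcomes, used only where s == 1."""
    study = [i for i in range(len(x)) if s[i] == 1]
    xs_study = [x[i] for i in study]
    ys_study = [y[i] for i in study]
    n_target = len(x) - len(study)
    total = 0.0
    for i in range(len(x)):
        # Nuisance 1: outcome regression E[Y | X, S = 1], fitted on study data.
        m = knn_mean(x[i], xs_study, ys_study, k)
        if s[i] == 0:
            total += m  # plug-in term, averaged over the target sample
        else:
            # Nuisance 2: study-membership probability P(S = 1 | X).
            p = knn_mean(x[i], x, s, k)
            p = min(max(p, clip), 1 - clip)  # keep the odds weight bounded
            total += (1 - p) / p * (y[i] - m)  # weighted residual correction
    return total / n_target

# Hypothetical simulated data: study membership depends on x,
# so the study mean of y differs from the target mean.
random.seed(0)
n = 1000
x = [random.random() for _ in range(n)]
s = [1 if random.random() < 0.3 + 0.4 * xi else 0 for xi in x]
y = [1 + 2 * xi + random.gauss(0, 0.5) if si == 1 else None
     for xi, si in zip(x, s)]

estimate = dr_target_mean(x, s, y)
naive = sum(v for v in y if v is not None) / sum(s)  # unadjusted study mean
print("doubly robust:", round(estimate, 3), "naive:", round(naive, 3))
```

In this simulated design the true target mean is 1 + 2 E[X | S = 0] = 28/15, so the two printed numbers can be compared against that value. In practice the nuisance functions would usually be fitted with sample splitting so that each observation is scored by a model that did not see it.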
Date: 2021
Downloads: https://doi.org/10.1111/rssc.12486
Persistent link: https://EconPapers.repec.org/RePEc:bla:jorssc:v:70:y:2021:i:3:p:750-769
Journal of the Royal Statistical Society Series C is currently edited by R. Chandler and P. W. F. Smith