Software for Distributed Computation on Medical Databases: A Demonstration Project
Balasubramanian Narasimhan,
Daniel L. Rubin,
Samuel M. Gross,
Marina Bendersky and
Philip W. Lavori
Journal of Statistical Software, 2017, vol. 77, issue 13
Abstract:
Bringing together the information latent in distributed medical databases promises to personalize medical care by enabling reliable, stable modeling of outcomes with rich feature sets (including patient characteristics and treatments received). However, there are barriers to aggregation of medical data, due to lack of standardization of ontologies, privacy concerns, proprietary attitudes toward data, and a reluctance to give up control over end use. Aggregation of data is not always necessary for model fitting. In models based on maximizing a likelihood, the computations can be distributed, with aggregation limited to the intermediate results of calculations on local data, rather than raw data. Distributed fitting is also possible for singular value decomposition. There has been work on the technical aspects of shared computation for particular applications, but little has been published on the software needed to support the "social networking" aspect of shared computing, to reduce the barriers to collaboration. We describe a set of software tools that allow the rapid assembly of a collaborative computational project, based on the flexible and extensible R statistical software and other open source packages, that can work across a heterogeneous collection of database environments, with full transparency to allow local officials concerned with privacy protections to validate the safety of the method. We describe the principles, architecture, and successful test results for the site-stratified Cox model and rank-k singular value decomposition.
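To make concrete the idea that aggregation can be limited to intermediate results, here is a minimal Python sketch of distributed maximum likelihood fitting. It is an illustration under stated assumptions, not the software the article describes. It uses a two-parameter logistic regression as a simpler stand-in for the site-stratified Cox model. The names local_summaries and fit_distributed and the toy data are hypothetical, and the in-process loop over sites stands in for whatever remote calls a real deployment would make. Each site returns only the gradient and information sums of its local log-likelihood; the coordinator adds them and takes a Newton-Raphson step.

```python
import math

def local_summaries(rows, beta):
    """Run at one site: gradient and information sums of the local
    logistic log-likelihood at the current beta.

    rows is a list of (x, y) pairs that stays at the site; only the
    returned sums are shared with the coordinator.
    """
    b0, b1 = beta
    g0 = g1 = h00 = h01 = h11 = 0.0
    for x, y in rows:
        p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
        w = p * (1.0 - p)
        g0 += y - p
        g1 += (y - p) * x
        h00 += w
        h01 += w * x
        h11 += w * x * x
    return (g0, g1), (h00, h01, h11)

def fit_distributed(sites, iterations=25, tol=1e-10):
    """Run at the coordinator: Newton-Raphson using only site-level sums."""
    beta = (0.0, 0.0)
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for rows in sites:  # hypothetical stand-in for one remote call per site
            (a0, a1), (c00, c01, c11) = local_summaries(rows, beta)
            g0 += a0
            g1 += a1
            h00 += c00
            h01 += c01
            h11 += c11
        # Solve the 2x2 system (information matrix) * step = gradient
        # by Cramer's rule.
        det = h00 * h11 - h01 * h01
        s0 = (h11 * g0 - h01 * g1) / det
        s1 = (h00 * g1 - h01 * g0) / det
        beta = (beta[0] + s0, beta[1] + s1)
        if abs(s0) + abs(s1) < tol:
            break
    return beta

# Toy data, invented for illustration only.
site_a = [(0.5, 0), (1.0, 0), (1.5, 1), (2.0, 0), (2.5, 1)]
site_b = [(1.0, 1), (2.0, 1), (3.0, 1), (0.0, 0), (1.2, 0), (3.5, 0)]

distributed = fit_distributed([site_a, site_b])
pooled = fit_distributed([site_a + site_b])  # what full aggregation would give
print(distributed, pooled)
```

Because the log-likelihood is a sum over observations, the per-site sums add up to the pooled sums, so the distributed fit agrees with the pooled fit up to floating-point rounding. The raw rows never leave the function that plays the role of the site.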
Date: 2017-05-03
Downloads:
https://www.jstatsoft.org/index.php/jss/article/view/v077i13/v77i13.pdf
https://www.jstatsoft.org/index.php/jss/article/do ... /distcomp_1.0.tar.gz
https://www.jstatsoft.org/index.php/jss/article/do ... 7i13-replication.zip
Persistent link: https://EconPapers.repec.org/RePEc:jss:jstsof:v:077:i13
DOI: 10.18637/jss.v077.i13
Journal of Statistical Software is currently edited by Bettina Grün, Edzer Pebesma and Achim Zeileis