EconPapers    
Economics at your fingertips  
 

Statistical Data Mining of Clinical Data

Ilya Lipkovich (), Bohdana Ratitch and Cristina Ivanescu
Additional contact information
Ilya Lipkovich: Eli Lilly and Company
Bohdana Ratitch: Bayer Inc., Montreal
Cristina Ivanescu: IQVIA

Chapter Chapter 6 in Quantitative Methods in Pharmaceutical Research and Development, 2020, pp 225-315 from Springer

Abstract: Abstract This chapter provides an introduction into the diverse field of data mining, as viewed from the perspective of a clinical statistician. We start with a discussion of data mining and its relationship with machine learning and classical statistics. To facilitate the presentation of material, we map some common problems occurring in analysis of clinical data onto general machine learning tasks, such as supervised, unsupervised, and semi-supervised learning. We then review key concepts of data mining and machine learning with emphasis on methods that are most relevant for analyses of clinical data. We also present our view of the key elements of a statistical analysis plan that ensure principled data mining of randomized clinical trials. This topic is rarely addressed, yet of interest for many clinical statisticians who are routinely using data mining to gain insights and knowledge from the available data beyond the “pre-specified analyses.” To illustrate the ideas and methods, we provide three case studies based on real and simulated data sets, covering a range of important tasks rarely addressed in common literature on data mining, such as personalized medicine (subgroup identification and dynamic treatment regime optimization) and estimation of treatment effect in the presence of treatment switching. The chapter provides comprehensive up-to-date references to literature on both theory and application of data mining to clinical data, as well as to available software, mostly R packages and SAS procedures.

Keywords: Data mining; Machine learning; Ensemble learning; Subgroup identification; Causal inference (search for similar items in EconPapers)
Date: 2020
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-030-48555-9_6

Ordering information: This item can be ordered from
http://www.springer.com/9783030485559

DOI: 10.1007/978-3-030-48555-9_6

Access Statistics for this chapter

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-05-12
Handle: RePEc:spr:sprchp:978-3-030-48555-9_6