Contribution of Structure Learning Algorithms in Social Epidemiology: Application to Real-World Data
Helene Colineaux,
Benoit Lepage,
Pierre Chauvin,
Chloe Dimeglio,
Cyrille Delpierre and
Thomas Lefèvre
Additional contact information
Helene Colineaux: EQUITY Team, Centre d’Epidémiologie et de Recherche en Santé des POPulations (CERPOP), Institut National de la Santé et de la Recherche Médicale (INSERM)—Toulouse III University, 37 Allées Jules Guesde, 31062 Toulouse, France
Benoit Lepage: EQUITY Team, Centre d’Epidémiologie et de Recherche en Santé des POPulations (CERPOP), Institut National de la Santé et de la Recherche Médicale (INSERM)—Toulouse III University, 37 Allées Jules Guesde, 31062 Toulouse, France
Pierre Chauvin: UMRS 1136, Pierre Louis Institute of Epidemiology and Public Health, Department of Social Epidemiology, Institut National de la Santé et de la Recherche Médicale (INSERM), Sorbonne University, 75005 Paris, France
Chloe Dimeglio: Toulouse Institute for Infectious and Inflammatory Diseases (INFINITY), Institut National de la Santé et de la Recherche Médicale (INSERM), UMR 1291, Centre National de la Recherche Scientifique (CNRS), UMR 5051, 31300 Toulouse, France
Cyrille Delpierre: EQUITY Team, Centre d’Epidémiologie et de Recherche en Santé des POPulations (CERPOP), Institut National de la Santé et de la Recherche Médicale (INSERM)—Toulouse III University, 37 Allées Jules Guesde, 31062 Toulouse, France
Thomas Lefèvre: UMRS 1136, Pierre Louis Institute of Epidemiology and Public Health, Department of Social Epidemiology, Institut National de la Santé et de la Recherche Médicale (INSERM), Sorbonne University, 75005 Paris, France
IJERPH, 2025, vol. 22, issue 3, 1-15
Abstract:
Epidemiologists often handle large datasets with numerous variables and now have access to a growing range of data analysis techniques, such as machine learning. Two critical challenges are addressing causality, often on the basis of observational data, and handling the complex relationships between variables in order to uncover the overall structure of their interactions, causal or not. Structure learning (SL) methods aim to reveal the structure of relationships between variables automatically or semi-automatically. The objective of this study is to delineate some of the potential contributions and limitations of structure learning methods when applied to social epidemiology topics and the search for determinants of healthcare system access. We applied SL techniques to a real-world dataset, namely the 2010 wave of the SIRS cohort, which included a sample of 3006 adults from the Paris region, France. Healthcare utilization, encompassing both direct and indirect access to care, was the primary outcome. Candidate determinants included health status, demographic characteristics, and socio-cultural and economic positions. We present two approaches: a non-automated epidemiological method (an initial expert knowledge network and stepwise logistic regression models) and three SL techniques using various algorithms, with and without knowledge constraints. We compared the results based on the presence, direction, and strength of specific links within the produced network. Although the interdependencies and relative strengths identified by both approaches were similar, the SL algorithms detected fewer associations with the outcome than the non-automated method. Relationships between variables were sometimes incorrectly oriented when using a purely data-driven approach. SL algorithms can be valuable in exploratory stages, helping to generate new hypotheses or to mine novel databases. However, results should be validated against prior knowledge and supplemented with additional confirmatory analyses.
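The abstract notes that purely data-driven learning sometimes oriented relationships incorrectly, and that the SL techniques were run with and without knowledge constraints. As a minimal sketch of what such a constraint could look like, the Python below assigns variables to ordered tiers and uses that ordering to forbid or orient edges. The variable names, the tier assignment, and the example skeleton are all hypothetical illustrations, not the SIRS variables or the authors' actual constraints, and the code is not one of the algorithms used in the study.

```python
# Hypothetical tiers (an assumption for illustration): a variable may only
# influence variables placed in a later tier.
TIERS = {
    "age": 0,
    "sex": 0,
    "education": 1,
    "income": 1,
    "health_status": 2,
    "healthcare_use": 3,
}


def forbidden_edges(tiers):
    """Return every directed edge (cause, effect) that runs from a later tier to an earlier one."""
    return {
        (a, b)
        for a in tiers
        for b in tiers
        if a != b and tiers[a] > tiers[b]
    }


def orient(skeleton, tiers):
    """Orient undirected edges across tiers; edges within a tier stay undirected."""
    directed, undirected = set(), set()
    for a, b in skeleton:
        if tiers[a] < tiers[b]:
            directed.add((a, b))
        elif tiers[a] > tiers[b]:
            directed.add((b, a))
        else:
            undirected.add(frozenset((a, b)))
    return directed, undirected


# Hypothetical undirected skeleton, as a data-driven step might return it.
skeleton = [
    ("healthcare_use", "health_status"),
    ("income", "education"),
    ("age", "health_status"),
]

blacklist = forbidden_edges(TIERS)
directed, undirected = orient(skeleton, TIERS)
```

In this sketch the edge between health status and healthcare use is always oriented toward healthcare use, whatever the data suggest, while the income and education edge is left undirected because both sit in the same tier. A blacklist of this kind is one common way to pass prior knowledge to a structure learning routine.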
Keywords: causal discovery; directed acyclic graph; graphical models; social epidemiology; structure learning; Bayesian network; healthcare system utilization
JEL-codes: I I1 I3 Q Q5
Date: 2025
Downloads:
https://www.mdpi.com/1660-4601/22/3/348/pdf (application/pdf)
https://www.mdpi.com/1660-4601/22/3/348/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:22:y:2025:i:3:p:348-:d:1601032
IJERPH is currently edited by Ms. Jenna Liu
More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager.