MR2G: A novel framework for causal network inference using GWAS summary data
Zhaotong Lin,
Wei Pan and
Haoran Xue
PLOS Genetics, 2026, vol. 22, issue 5, 1-21
Abstract:
Inferring a causal network among multiple traits is essential for unraveling complex biological relationships and informing interventions. Mendelian randomization (MR) has emerged as a powerful tool for causal inference, utilizing genetic variants as instrumental variables (IVs) to estimate causal effects. However, when the directions of causal relationships among traits are unknown, reconstructing the underlying causal network becomes challenging. In particular, the presence of cycles or feedback loops, which are common in biological systems, poses additional challenges for causal network inference, and remains largely under-studied with standard MR approaches and existing IV-based network inference methods. To address these issues, we introduce MR2G, a new statistical framework that enables robust inference of causal networks, including those with cycles, directly from GWAS summary statistics. MR2G is built on a formally defined recursive causal graph model that rigorously links direct causal effects to (univariable) MR estimands. It recovers a biologically interpretable causal network from pairwise MR effect estimates, while incorporating a network-informed IV screening strategy to reduce pleiotropic bias and improve robustness. Through realistic simulations, MR2G demonstrates superior accuracy and robustness in recovering complex causal structures, including those involving feedback loops. We apply MR2G to GWAS summary statistics for six complex diseases and nine cardiometabolic risk factors. MR2G not only recovers well-established causal pathways but also uncovers multiple feedback relationships, highlighting its utility in disentangling complex and biologically plausible causal networks from large-scale genetic data.Author summary: Unraveling how complex traits causally influence each others is essential for understanding disease mechanisms and guiding effective interventions. However, inferring such a causal network from observational data remains challenging due to hidden confounding, and most existing methods assume that the underlying network has no feedback loops, which is often unrealistic in biological systems. In this work, we developed MR2G, a statistical framework that infers a causal network of multiple traits directly from GWAS summary statistics, while allowing for cycles and feedback loops. Our approach is based on a recursive causal model that formally links direct causal effects to Mendelian randomization estimands, and incorporates a novel instrumental variable screening strategy to reduce bias from horizontal pleiotropy. We applied MR2G to 15 complex traits including cardiometabolic risk factors and diseases, and identified biologically supported causal pathways, such as the direct effects of blood pressure and LDL cholesterol on coronary artery disease, and a bidirectional relationship between atrial fibrillation and coronary artery disease.
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1012144 (text/html)
https://journals.plos.org/plosgenetics/article/fil ... 12144&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pgen00:1012144
DOI: 10.1371/journal.pgen.1012144
Access Statistics for this article
More articles in PLOS Genetics from Public Library of Science
Bibliographic data for series maintained by plosgenetics ().