Single-cell mutational burden distributions in birth–death processes
Christo Morison,
Dudley Stark and
Weini Huang
PLOS Computational Biology, 2025, vol. 21, issue 7, 1-18
Abstract:
Genetic mutations are footprints of cancer evolution and reveal critical dynamic parameters of tumour growth, which otherwise are hard to measure in vivo. The mutation accumulation in tumour cell populations has been described by various statistics, such as site frequency spectra (SFS), single-cell division distributions (DD) and mutational burden distributions (MBD). While DD and SFS have been intensively studied in phylogenetics especially after the development of whole genome sequencing technology of bulk samples, MBD has drawn attention more recently with the single-cell sequencing data. Although those statistics all arise from the same somatic evolutionary process, an integrated understanding of these distributions is missing and requires novel mathematical tools to better inform the ecological and evolutionary dynamics of tumours. Here we introduce dynamical matrices to analyse and unite the SFS, DD and MBD and derive recurrence relations for the expectations of these three distributions. While we successfully recover classic exact results in pure-birth cases for the SFS and the DD through our new framework, we derive a new expression for the MBD and approximate all three distributions when death is introduced. We demonstrate a natural link between the SFS and the single-cell MBD, and show that the MBD can be regenerated through the DD. Counter-intuitively, the single-cell MBD is mainly driven by the stochasticity arising in the DD, rather than the extra stochasticity in the number of mutations at each cell division.Author summary: Somatic mutations accumulated in tissue growth and maintenance lead to genetic variation in tumours and healthy tissues. The patterns of those mutations have been used to reveal tumour history. Here, we developed a general framework to unite different statistical properties of mutation distributions between bulk sequencing data and single-cell data. The site frequency spectra from bulk data, division distributions and single-cell mutational burden distributions from single-cell data can be connected using dynamic matrices and recurrence relations. Counter-intuitively, the stochasticity in the number of mutations acquired in each cell division does not play a critical role in the single-cell mutation burden distribution.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013241 (text/html)
https://journals.plos.org/ploscompbiol/article/fil ... 13241&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pcbi00:1013241
DOI: 10.1371/journal.pcbi.1013241
Access Statistics for this article
More articles in PLOS Computational Biology from Public Library of Science
Bibliographic data for series maintained by ploscompbiol ().