Mutation Clusters from Cancer Exome

Kakushadze, Zura; Yu, Willie

Mutation Clusters from Cancer Exome

Zura Kakushadze and Willie Yu

Abstract: We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit mutation clustering structure. Our results are in-sample stable. They are also out-of-sample stable when applied to 1,389 published genome samples across 14 cancer types. In contrast, we find in- and out-of-sample instabilities in cancer signatures extracted from exome samples via nonnegative matrix factorization (NMF), a computationally costly and non-deterministic method. Extracting stable mutation structures from exome data could have important implications for speed and cost, which are critical for early-stage cancer diagnostics such as novel blood-test methods currently in development.

Date: 2017-07
New Economics Papers: this item is included in nep-cmp and nep-hea
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Published in Genes 8(8) (2017) 201

Downloads: (external link)
http://arxiv.org/pdf/1707.08504 Latest version (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:1707.08504

Access Statistics for this paper

More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().