Graph Theoretic and Pearson Correlation-Based Discovery of Network Biomarkers for Cancer
Raihanul Bari Tanvir,
Tasmia Aqila,
Mona Maharjan,
Abdullah Al Mamun and
Ananda Mohan Mondal
Additional contact information
Raihanul Bari Tanvir: School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
Tasmia Aqila: School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
Mona Maharjan: School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
Abdullah Al Mamun: School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
Ananda Mohan Mondal: School of Computing and Information Sciences, Florida International University, Miami, FL 33199, USA
Data, 2019, vol. 4, issue 2, 1-12
Abstract:
Two graph theoretic concepts—clique and bipartite graphs—are explored to identify the network biomarkers for cancer at the gene network level. The rationale is that a group of genes work together by forming a cluster or a clique-like structures to initiate a cancer. After initiation, the disease signal goes to the next group of genes related to the second stage of a cancer, which can be represented as a bipartite graph. In other words, bipartite graphs represent the cross-talk among the genes between two disease stages. To prove this hypothesis, gene expression values for three cancers— breast invasive carcinoma (BRCA), colorectal adenocarcinoma (COAD) and glioblastoma multiforme (GBM)—are used for analysis. First, a co-expression gene network is generated with highly correlated gene pairs with a Pearson correlation coefficient ≥ 0.9. Second, clique structures of all sizes are isolated from the co-expression network. Then combining these cliques, three different biomarker modules are developed—maximal clique-like modules, 2-clique-1-bipartite modules, and 3-clique-2-bipartite modules. The list of biomarker genes discovered from these network modules are validated as the essential genes for causing a cancer in terms of network properties and survival analysis. This list of biomarker genes will help biologists to design wet lab experiments for further elucidating the complex mechanism of cancer.
Keywords: bipartite graph; clique; network biomarker; Pearson correlation coefficient (PCC); gene co-expression network (search for similar items in EconPapers)
JEL-codes: C8 C80 C81 C82 C83 (search for similar items in EconPapers)
Date: 2019
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2306-5729/4/2/81/pdf (application/pdf)
https://www.mdpi.com/2306-5729/4/2/81/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jdataj:v:4:y:2019:i:2:p:81-:d:237623
Access Statistics for this article
Data is currently edited by Ms. Cecilia Yang
More articles in Data from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().