EconPapers    
Economics at your fingertips  
 

Identification of Biomarkers and Molecular Pathways Implicated in Smoking and COVID-19 Associated Lung Cancer Using Bioinformatics and Machine Learning Approaches

Md Ali Hossain, Mohammad Zahidur Rahman, Touhid Bhuiyan () and Mohammad Ali Moni ()
Additional contact information
Md Ali Hossain: Department of Computer Science and Engineering, Jahangirnagar University, Dhaka 1342, Bangladesh
Mohammad Zahidur Rahman: Department of Computer Science and Engineering, Jahangirnagar University, Dhaka 1342, Bangladesh
Touhid Bhuiyan: School of IT, Washington University of Science and Technology, Alexandria, VA 22314, USA
Mohammad Ali Moni: Faculty of Health and Behavioural Sciences, The University of Queensland, Brisbane 4072, Australia

IJERPH, 2024, vol. 21, issue 11, 1-14

Abstract: Lung cancer (LC) is a significant global health issue, with smoking as the most common cause. Recent epidemiological studies have suggested that individuals who smoke are more susceptible to COVID-19. In this study, we aimed to investigate the influence of smoking and COVID-19 on LC using bioinformatics and machine learning approaches. We compared the differentially expressed genes (DEGs) between LC, smoking, and COVID-19 datasets and identified 26 down-regulated and 37 up-regulated genes shared between LC and smoking, and 7 down-regulated and 6 up-regulated genes shared between LC and COVID-19. Integration of these datasets resulted in the identification of ten hub genes (SLC22A18, CHAC1, ROBO4, TEK, NOTCH4, CD24, CD34, SOX2, PITX2, and GMDS) from protein-protein interaction network analysis. The WGCNA R package was used to construct correlation network analyses for these shared genes, aiming to investigate the relationships among them. Furthermore, we also examined the correlation of these genes with patient outcomes through survival curve analyses. The gene ontology and pathway analyses were performed to find out the potential therapeutic targets for LC in smoking and COVID-19 patients. Moreover, machine learning algorithms were applied to the TCGA RNAseq data of LC to assess the performance of these common genes and ten hub genes, demonstrating high performances. The identified hub genes and molecular pathways can be utilized for the development of potential therapeutic targets for smoking and COVID-19-associated LC.

Keywords: lung cancer; COVID-19; smoking; comorbidity; protein-protein interaction; WGCNA; pathway analysis; ROC curve; survival analysis (search for similar items in EconPapers)
JEL-codes: I I1 I3 Q Q5 (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/1660-4601/21/11/1392/pdf (application/pdf)
https://www.mdpi.com/1660-4601/21/11/1392/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:21:y:2024:i:11:p:1392-:d:1503980

Access Statistics for this article

IJERPH is currently edited by Ms. Jenna Liu

More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jijerp:v:21:y:2024:i:11:p:1392-:d:1503980