Implementation of RNN-LSTM with L1 regularization for predicting labels from chimpanzee DNA sequences using pseudo-labeling
Sugiyarto Surono,
Goh Khang Wen,
Arif Rahman,
Lalu M. Irham and
Sintia Afriyani
International Journal of Innovative Research and Scientific Studies, 2025, vol. 8, issue 3, 2774-2786
Abstract:
Chimpanzee genome research plays a crucial role in understanding evolution, health, and biological functions. However, incomplete labeling of DNA sequence data presents a challenge for accurate genomic classification. This study aims to improve chimpanzee DNA sequence classification by addressing label scarcity and data imbalance through a deep learning approach. A Recurrent Neural Network Long Short-Term Memory (RNN-LSTM) model with L1 Regularization and pseudo-labeling is employed to enhance classification performance. The workflow includes numerical encoding of DNA sequences, pseudo-labeling to augment training data, and model training using Stochastic Gradient Descent (SGD) optimization. Performance evaluation is conducted using classification accuracy and AUC metrics. Results show that the proposed approach achieves high classification accuracy, with an AUC ranging from 0.94 to 0.99, significantly improving the handling of imbalanced datasets. The integration of pseudo-labeling effectively leverages unlabeled DNA sequences, leading to a more robust genomic classification model. These findings highlight the potential of combining RNN-LSTM with L1 Regularization and pseudo-labeling to address incomplete labeling in genomic datasets. The study advances genomic classification techniques and supports Goal 3: Good Health and Well-being of the Sustainable Development Goals (SDGs) by enhancing DNA sequence classification accuracy, facilitating early disease detection, precision medicine, and evolutionary studies.
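The abstract names three workflow ingredients: numerical encoding of DNA sequences, pseudo-labeling of unlabeled data, and an L1 penalty. The following is a minimal illustrative sketch of those ideas in Python with NumPy, not the authors' implementation. The integer mapping, the 0.9 confidence threshold, the `lam` coefficient, and all function names are assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np

# Hypothetical mapping: the abstract does not state the paper's encoding scheme.
# 0 is reserved here for padding and unknown bases such as "N".
NUCLEOTIDE_TO_INT = {"A": 1, "C": 2, "G": 3, "T": 4}


def encode_sequence(seq, max_len):
    """Integer-encode a DNA string, truncating or right-padding to max_len."""
    codes = [NUCLEOTIDE_TO_INT.get(base, 0) for base in seq.upper()[:max_len]]
    return np.array(codes + [0] * (max_len - len(codes)), dtype=np.int64)


def select_pseudo_labels(probs, threshold=0.9):
    """Pick unlabeled samples whose top class probability meets the threshold.

    probs: array of shape (n_samples, n_classes), e.g. softmax outputs of a
    model trained on the labeled subset. The threshold is an assumed value.
    Returns (indices of kept samples, their pseudo-labels).
    """
    probs = np.asarray(probs, dtype=float)
    confidence = probs.max(axis=1)
    keep = np.flatnonzero(confidence >= threshold)
    return keep, probs[keep].argmax(axis=1)


def l1_penalty(weights, lam=1e-3):
    """L1 regularization term: lam times the sum of absolute weight values."""
    return lam * sum(np.abs(w).sum() for w in weights)
```

In a pseudo-labeling loop of this kind, the selected samples and their pseudo-labels would typically be appended to the labeled training set before retraining, and the L1 term would be added to the classification loss minimized by the optimizer.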
Keywords: Chimpanzee genome analysis; Goal 3; Good health and well-being (SDGs); L1 regularization feature selection; Pseudo-labeling in genomics; RNN-LSTM for DNA sequence classification.
Date: 2025
Downloads:
https://ijirss.com/index.php/ijirss/article/view/7083/1467 (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:aac:ijirss:v:8:y:2025:i:3:p:2774-2786:id:7083
International Journal of Innovative Research and Scientific Studies is currently edited by Natalie Jean
More articles in International Journal of Innovative Research and Scientific Studies from Innovative Research Publishing