PathoRM: Computational inference of pathogenic RNA methylation sites by incorporating multi-view features
Hui Liu,
Jiani Ma,
Xianjun Ma and
Lin Zhang
PLOS Computational Biology, 2025, vol. 21, issue 11, 1-20
Abstract:
Identifying pathogenic RNA methylation sites with a reasonable biological explanation has important implications for the treatment of diseases. Due to the limitations of in vitro experiments in identifying pathogenic RNA methylation sites, there is a growing need for computational workflows to enable accurate inference. Here, motivated by this profound meaning, we developed PathoRM, a biologically informed deep learning model, to infer associations between RNA methylation sites and diseases. PathoRM could provide convincing pathogenic RNA methylation sites and unravel the enigma of pathology in the epi-transcriptomic layer. PathoRM fuses RNA methylation host sequences and pathogenic descriptions as inputs, and subsequently employs large language models, multi-view learning algorithm, graph neural networks, an adversarial training approach, and “guilty-by-association”-derived negative sampling approach. PathoRM distils the semantically enriched feature embeddings, leading to more accurate and robust prediction performance across the metrics and datasets. Notably, incorporated with attention mechanism, PathoRM bestows itself biological interpretability through illuminating the dark matters in the host sequences of RNA methylation sites. This work is expected to assist in the discovery of pathogenic RNA methylation sites and conserved motifs, contributing to the advancement of genome research. Codes and pre-trained model are accessible at https://github.com/jianiM/PathoRM.Author summary: RNA methylation (RM) is a pivotal epi-transcriptomic modification that alters RNA nucleotides through methyl group additions, profoundly impacting gene expression, cellular differentiation, and essential biological processes crucial for maintaining cellular function. Dysregulation of RM is intricately linked to various diseases. Given the challenges associated with identifying pathogenic sites through laboratory examination, there is an urgent need for computational workflows capable of accurately inferring pathogenic RNA methylation sites to facilitate comprehensive biological investigations. Here, we developed PathoRM, a biologically informed deep learning model aimed at elucidating the associations among RNA methylation sites and diseases. PathoRM integrates RNA methylation host sequences and pathogenic descriptions using large language models, multi-view learning algorithms, graph neural networks, adversarial training, and a negative sampling method derived from “guilty-by-association” principles. By distilling semantically enriched feature embeddings, PathoRM achieves promising predictive accuracy and robustness across diverse metrics and datasets. Notably, even without explicit annotations for sites, PathoRM can capture the intrinsic pathogenic regions, which is overlapped with the conserved motif, in the RM host sequence, offering biological insights into the decision-making procedure.
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013654 (text/html)
https://journals.plos.org/ploscompbiol/article/fil ... 13654&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pcbi00:1013654
DOI: 10.1371/journal.pcbi.1013654
Access Statistics for this article
More articles in PLOS Computational Biology from Public Library of Science
Bibliographic data for series maintained by ploscompbiol ().