ERNIE-RNA: an RNA language model with structure-enhanced representations
Weijie Yin,
Zhaoyu Zhang,
Shuo Zhang,
Liang He,
Ruiyang Zhang,
Rui Jiang,
Gan Liu,
Jingyi Wang,
Xuegong Zhang (),
Tao Qin () and
Zhen Xie ()
Additional contact information
Weijie Yin: Tsinghua University
Zhaoyu Zhang: Tsinghua University
Shuo Zhang: Tsinghua University
Liang He: Microsoft Research AI for Science
Ruiyang Zhang: Tsinghua University
Rui Jiang: Tsinghua University
Gan Liu: Hesheng Beiyin (Qing Dao) Co. Ltd
Jingyi Wang: Hesheng Beiyin (Qing Dao) Co. Ltd
Xuegong Zhang: Tsinghua University
Tao Qin: Microsoft Research AI for Science
Zhen Xie: Tsinghua University
Nature Communications, 2025, vol. 16, issue 1, 1-17
Abstract:
Abstract Existing RNA language models (RLMs) largely overlook structural information in RNA sequences, leading to incomplete feature extraction and suboptimal performance on downstream tasks. In this study, we present ERNIE-RNA (Enhanced Representations with Base-Pairing Restriction for RNA Modeling), an RNA pre-trained language model based on a modified BERT (Bidirectional Encoder Representations from Transformers). Notably, ERNIE-RNA’s attention maps exhibit superior ability to capture RNA structural features through zero-shot prediction, outperforming conventional methods like RNAfold and RNAstructure, suggesting that ERNIE-RNA naturally develops comprehensive representations of RNA architecture during pre-training. Moreover, after fine-tuning, ERNIE-RNA achieves state-of-the-art (SOTA) performance across various downstream tasks, including RNA structure and function predictions. In summary, ERNIE-RNA provides versatile features that can be effectively applied to a wide range of research tasks. Our findings highlight that integrating key knowledge-based priors into the BERT framework may enhance the performance of other language models.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-025-64972-0 Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-64972-0
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-025-64972-0
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().