A molecular video-derived foundation model for scientific drug discovery
Hongxin Xiang,
Li Zeng,
Linlin Hou,
Kenli Li,
Zhimin Fu,
Yunguang Qiu,
Ruth Nussinov,
Jianying Hu,
Michal Rosen-Zvi,
Xiangxiang Zeng () and
Feixiong Cheng ()
Additional contact information
Hongxin Xiang: Hunan University
Li Zeng: Hunan University
Linlin Hou: Hunan University
Kenli Li: Hunan University
Zhimin Fu: Cleveland Clinic
Yunguang Qiu: Cleveland Clinic
Ruth Nussinov: National Cancer Institute
Jianying Hu: Yorktown Heights
Michal Rosen-Zvi: IBM Research Labs
Xiangxiang Zeng: Hunan University
Feixiong Cheng: Cleveland Clinic
Nature Communications, 2024, vol. 15, issue 1, 1-13
Abstract:
Abstract Accurate molecular representation of compounds is a fundamental challenge for prediction of drug targets and molecular properties. In this study, we present a molecular video-based foundation model, named VideoMol, pretrained on 120 million frames of 2 million unlabeled drug-like and bioactive molecules. VideoMol renders each molecule as a video with 60-frame and designs three self-supervised learning strategies on molecular videos to capture molecular representation. We show high performance of VideoMol in predicting molecular targets and properties across 43 drug discovery benchmark datasets. VideoMol achieves high accuracy in identifying antiviral molecules against common diverse disease-specific drug targets (i.e., BACE1 and EP4). Drugs screened by VideoMol show better binding affinity than molecular docking, revealing the effectiveness in understanding the three-dimensional structure of molecules. We further illustrate interpretability of VideoMol using key chemical substructures.
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-024-53742-z Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-53742-z
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-024-53742-z
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().