Mol2Context-vec: learning molecular representation from context awareness for drug discovery

Author:

Lv Qiujie (1), Chen Guanxing (1), Zhao Lu (2), Zhong Weihe (1), Yu-Chian Chen Calvin (1, 3, 4)

Affiliation:

1. School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, 510275, China

2. The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510655, China

3. Department of Medical Research, China Medical University Hospital, Taichung 40447, Taiwan

4. Department of Bioinformatics and Medical Engineering, Asia University, Taichung 41354, Taiwan

Abstract

With the rapid development of proteomics and the rapid growth in the number of drug target molecules, computer-aided drug design (CADD) has become a fundamental task in drug discovery. One of the key challenges in CADD is molecular representation. High-quality molecular representations that carry chemical intuition help advance many frontier problems in drug discovery. At present, molecular representation still faces several pressing problems, such as the polysemy of substructures and poor information flow between atomic groups. In this work, we propose Mol2Context-vec, a deep contextualized Bi-LSTM architecture that integrates different levels of internal states to produce dynamic representations of molecular substructures. The resulting molecular context representation can capture interactions between any atomic groups, especially pairs of atomic groups that are topologically distant. Experiments show that Mol2Context-vec achieves state-of-the-art performance on multiple benchmark datasets. In addition, the visual interpretation of Mol2Context-vec agrees closely with the structural properties of chemical molecules as understood by humans. These advantages indicate that Mol2Context-vec can be used as a reliable and effective tool for molecular representation. Availability: The source code is available for download at https://github.com/lol88/Mol2Context-vec.

Funder

Guangzhou Science and Technology Fund

Science, Technology & Innovation Commission of Shenzhen Municipality

China Medical University Hospital

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology, Information Systems

