Literature mining for context-specific molecular relations using multimodal representations (COMMODAR)

Published:2020-10 Issue:S5 Volume:21
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Lee Jaehyun, Lee Doheon, Lee Kwang Hyung

Abstract

Biological contextual information helps to understand various phenomena occurring in biological systems consisting of complex molecular relations. The construction of context-specific relational resources relies heavily on laborious manual extraction from unstructured literature. In this paper, we propose COMMODAR, a machine learning-based literature mining framework for context-specific molecular relations using multimodal representations. The main idea of COMMODAR is feature augmentation through the cooperation of multimodal representations for relation extraction. We leveraged biomedical domain knowledge as well as canonical linguistic information for more comprehensive representations of textual sources. The models based on multiple modalities outperformed those based solely on the linguistic modality. We applied COMMODAR to 14 million PubMed abstracts and extracted 9214 context-specific molecular relations. All corpora, extracted data, evaluation results, and the implementation code are downloadable at https://github.com/jae-hyun-lee/commodar.

CCS concepts: • Computing methodologies~Information extraction • Computing methodologies~Neural networks • Applied computing~Biological networks.

Funder

National Research Foundation of Korea

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/s12859-020-3396-y.pdf

References (31 articles)

1. Topol EJ. Individualized medicine from prewomb to tomb. Cell. 2014;157(1):241–53.

2. Yoon S, et al. Context-based resolution of semantic conflicts in biological pathways. BMC Med Inform Decis Mak. 2015;15(1):S3.

3. Mosca R, et al. dSysMap: exploring the edgetic role of disease mutations. Nat Methods. 2015;12(3):167–8.

4. Lu H-C, Herrera Braga J, Fraternali F. PinSnps: structural and functional analysis of SNPs in the context of protein interaction networks. Bioinformatics. 2016;32(16):2534–6.

5. Higueruelo AP, Jubb H, Blundell TL. TIMBAL v2: update of a database holding small molecules modulating protein–protein interactions. Database. 2013;2013:bat039.

Cited by 2 articles.

1. Enhanced disease-disease association with information enriched disease representation;Mathematical Biosciences and Engineering;2023

2. Reconstruction of the Cytokine Signaling in Lysosomal Storage Diseases by Literature Mining and Network Analysis;Frontiers in Cell and Developmental Biology;2021-08-20