Affiliation:
1. College of Computer Science and Technology, Jilin University, Changchun, Jilin, China
2. College of Computer Science
and Technology, Inner Mongolia Minzu University, Tongliao, Inner Mongolia, China
Abstract
Background:
Chemical compounds and proteins/genes are an important class of entities in
biomedical research, and their interactions play a key role in precision medicine, drug discovery, basic
clinical research, and building knowledge bases. Many computational methods have been proposed to
identify chemical–protein interactions. However, the majority of these proposed models cannot model
long-distance dependencies between chemical and protein, and the neural networks used to suffer from
gradient descent, with little taking into account the characteristics of the chemical structure characteristics of the compound.
Methods:
To address the above limitations, we propose a novel model, SIMEON, to identify chemical–
protein interactions. First, an input sequence is represented with pre-trained language model and an attention mechanism is used to uncover contribution degree of different words to entity relations and potential semantic information. Secondly, key features are extracted by a multi-layer stacked Bidirectional Gated Recurrent Units (Bi-GRU)-normalization residual network module to resolve higherorder dependencies while overcoming network degradation. Finally, the representation is introduced to
be enhanced by external knowledge regarding the chemical structure characteristics of the compound
external knowledge
Results:
Excellent experimental results show that our stacked integration model combines the advantages of Bi-GRU, normalization methods, and external knowledge to improve the performance of
the model by complementing each other
Conclusion:
Our proposed model shows good performance in chemical-protein interaction extraction,
and it can be used as a useful complement to biological experiments to identify chemical-protein interactions.
Funder
National Natural Science Foundation of China
Natural Science Foundation of Jilin Province
Science & Technology Research Projects of Colleges and Universities in Inner Mongolia Autonomous Region
Inner Mongolia Science & Technology Project
Inner Mongolia Minzu University Doctoral Research Start-up Fund Project
Publisher
Bentham Science Publishers Ltd.
Subject
Computational Mathematics,Genetics,Molecular Biology,Biochemistry