Deep-STP: a deep learning-based approach to predict snake toxin proteins by using word embeddings-Reference-Cited by-同舟云学术

Deep-STP: a deep learning-based approach to predict snake toxin proteins by using word embeddings

Published:2024-01-17 Issue: Volume:10 Page:
ISSN:2296-858X
Container-title:Frontiers in Medicine
language:
Short-container-title:Front. Med.

Author:

Zulfiqar Hasan,Guo Zhiling,Ahmad Ramala Masood,Ahmed Zahoor,Cai Peiling,Chen Xiang,Zhang Yang,Lin Hao,Shi Zheng

Abstract

Snake venom contains many toxic proteins that can destroy the circulatory system or nervous system of prey. Studies have found that these snake venom proteins have the potential to treat cardiovascular and nervous system diseases. Therefore, the study of snake venom protein is conducive to the development of related drugs. The research technologies based on traditional biochemistry can accurately identify these proteins, but the experimental cost is high and the time is long. Artificial intelligence technology provides a new means and strategy for large-scale screening of snake venom proteins from the perspective of computing. In this paper, we developed a sequence-based computational method to recognize snake toxin proteins. Specially, we utilized three different feature descriptors, namely g-gap, natural vector and word 2 vector, to encode snake toxin protein sequences. The analysis of variance (ANOVA), gradient-boost decision tree algorithm (GBDT) combined with incremental feature selection (IFS) were used to optimize the features, and then the optimized features were input into the deep learning model for model training. The results show that our model can achieve a prediction performance with an accuracy of 82.00% in 10-fold cross-validation. The model is further verified on independent data, and the accuracy rate reaches to 81.14%, which demonstrated that our model has excellent prediction performance and robustness.

Publisher

Frontiers Media SA

Reference36 articles.

1. Snake venom toxins targeted at the nervous system;Osipov;Snake Venoms Toxinol,2017

2. Structure and function of snake venom cysteine-rich secretory proteins;Yamazaki;Toxicon,2004

3. Snake three-finger α-neurotoxins and nicotinic acetylcholine receptors: molecules, mechanisms and medicine;Nirthanan;Biochem Pharmacol,2020

4. Snake as a symbol in medicine and pharmacy-a historical study;Okuda;Yakushigaku Zasshi,2000

5. From animal poisons and venoms to medicines: achievements, challenges and perspectives in drug discovery;Bordon;Front Pharmacol,2020

Cited by 26 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Accurately identifying positive and negative regulation of apoptosis using fusion features and machine learning methods;Computational Biology and Chemistry;2024-12

2. Prediction of cancer drug combinations based on multidrug learning and cancer expression information injection;Future Generation Computer Systems;2024-11

3. StackDPPred: Multiclass prediction of defensin peptides using stacked ensemble learning with optimized features;Methods;2024-10

4. Advanced deep learning approaches enable high-throughput biological and biomedicine data analysis;Methods;2024-10

5. A protein pre-trained model-based approach for the identification of the liquid-liquid phase separation (LLPS) proteins;International Journal of Biological Macromolecules;2024-10