Affiliation:
1. College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, 010018, China
2. College of Veterinary Medicine, Inner Mongolia Agricultural University, Hohhot, 010018, China
Abstract
Background:
Bovine viral diarrhea virus (BVDV) can cause diarrhea, abortion, and immunosuppression
in cattle, imposing huge economic losses for the global cattle industry. The pathogenic
and immune mechanisms of BVDV remain elusive. The development of a BVDV-gene knowledge base
can provide clues to reveal the interaction of BVDV with host cells. However, the traditional method of
manually establishing a knowledge base is time-consuming and inefficient. The method of developing a
knowledge base based on deep learning has noticeably attracted scholars' attention recently.
Objective:
The study aimed to explore the substitution of deep learning for manual mining of BVDVrelated
genes and to develop a knowledge graph of the relationship between BVDV and related genes.
Methods:
A deep learning-based biomedical knowledge graph development method was proposed,
which used deep learning to mine biomedical knowledge, model BVDV and various gene concepts, and
store data in a graphical database. First, the PubMed database was used as the data source and crawler
technology to obtain abstract data on the relationship between BVDV and various host genes. Pretrained
BioBERT model was used for biomedical named entity recognition to obtain all types of gene
entities, and the pre-trained BERT model was utilized for relationship extraction to achieve the relationship
between BVDV and various gene entities. Then, it was combined with manual proofreading to obtain
structured triple data with high accuracy. Finally, the Neo4j graph database was used to store data
and to develop the knowledge graph of the relationship between BVDV and related genes.
Results:
The results showed the obtainment of 71 gene entity types, including PRL4, MMP-7, TGIF1,
etc. 9 relation types of BVDV and gene entities were obtained, including "can downregulate expression
of", "can upregulate expression of", "can suppress expression of", etc. The knowledge graph was developed
using deep learning to mine biomedical knowledge combined with manual proofreading, which
was faster and more efficient than the traditional method of establishing knowledge base manually, and
the retrieval of semantic information by storing data in graph database was also more efficient.
Conclusion:
A BVDV-gene knowledge graph was preliminarily developed, which provided a basis for
studying the interaction between BVDV and host cells.
Funder
Inner Mongolia Autonomous Region Science and Technology Major Project
Natural Science Foundation of Inner Mongolia of China
Higher Education Research Project of Inner Mongolia of China
Publisher
Bentham Science Publishers Ltd.
Subject
Computational Mathematics,Genetics,Molecular Biology,Biochemistry
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Bovine Viral Diarrhea Virus Named Entity Recognition Based on BioBERT and MRC;International Journal of Pattern Recognition and Artificial Intelligence;2024-07-25