Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation

Author:

Luo LingyunORCID,Feng JingtaoORCID,Yu HuijunORCID,Wang JiaolongORCID

Abstract

Background As the manual creation and maintenance of biomedical ontologies are labor-intensive, automatic aids are desirable in the lifecycle of ontology development. Objective Provided with a set of concept names in the Foundational Model of Anatomy (FMA), we propose an innovative method for automatically generating the taxonomy and the partonomy structures among them, respectively. Methods Our approach comprises 2 main tasks: The first task is predicting the direct relation between 2 given concept names by utilizing word embedding methods and training 2 machine learning models, Convolutional Neural Networks (CNN) and Bidirectional Long Short-term Memory Networks (Bi-LSTM). The second task is the introduction of an original granularity-based method to identify the semantic structures among a group of given concept names by leveraging these trained models. Results Results show that both CNN and Bi-LSTM perform well on the first task, with F1 measures above 0.91. For the second task, our approach achieves an average F1 measure of 0.79 on 100 case studies in the FMA using Bi-LSTM, which outperforms the primitive pairwise-based method. Conclusions We have investigated an automatic way of predicting a hierarchical relationship between 2 concept names; based on this, we have further invented a methodology to structure a group of concept names automatically. This study is an initial investigation that will shed light on further work on the automatic creation and enrichment of biomedical ontologies.

Publisher

JMIR Publications Inc.

Subject

Health Information Management,Health Informatics

Reference32 articles.

1. KimuraJShibasakiHRecent Advances in Clinical NeurophysiologyProceedings of the 10th International Congress of Emg and Clinical Neurophysiology1995The 10th International Congress of EMG and Clinical NeurophysiologyOctoberNew YorkElsevier1519

2. Automatic ontology construction from text: a review from shallow to deep learning trend

3. BodenreiderOQuality Assurance in Biomedical Terminologies and Ontologies201048A report to the Board of Scientific CounselorsApr 2010Bethesda

Cited by 9 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Research on the Implementation of Financial Risk Control Models and Algorithms Based on Machine Learning;International Journal of e-Collaboration;2024-08-12

2. Using Generative Large Language Models for Hierarchical Relationship Prediction in Medical Ontologies;2024 IEEE 12th International Conference on Healthcare Informatics (ICHI);2024-06-03

3. An enrichment multi-layer Arabic text classification model based on siblings patterns extraction;Neural Computing and Applications;2024-03-15

4. GOGCN: using deep learning to support insertion of new concepts into gene ontology;5th International Conference on Information Science, Electrical, and Automation Engineering (ISEAE 2023);2023-08-10

5. Self-prediction of relations in GO facilitates its quality auditing;Journal of Biomedical Informatics;2023-08

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3