Towards clearer recognition and easier usefulness: development of a cross-lingual atherosclerotic cerebrovascular disease Ontology (Preprint)

Author:

Ma HetongORCID,Shen LiuORCID,Wang JiayangORCID,Li Zixiao,Li JiaoORCID

Abstract

BACKGROUND

Atherosclerotic cerebrovascular disease could result in a great number of deaths and disabilities. However, it did not acquire enough attention. Up till now, less information, statistics, or clear consensus on the disease was revealed. Thus, no systematic concept datasets were released to help clinicians in the field to clarify the scope, assist research, and offer maximized value.

OBJECTIVE

The aims of this study were to (1) develop a comprehensive cross-lingual atherosclerotic cerebrovascular disease ontology. (2) describe the workflow, schema, and hierarchical structure, and the highlighted content of the ontology (3) design a brand-new rehabilitation ontology which was an important part overlooked in the existing ontologies (4) implement the evaluation of the proposed ontology (5) apply the proposed ontology to real-world scenarios and electronic health records to realize information retrieval, named entity recognition, novel expression discovery, and knowledge fusion.

METHODS

We implemented 9 steps based on the ontology development 101 methodologies combined with expert opinions. The final ontology included clinical requirements collection and specification, background investigation and knowledge acquisition, ontology selection and reuse, scope identification, schema definition, concept extraction, concept extension, ontology verification, and ontology evaluation.

RESULTS

The current ontology included 10 top-level classes, respectively clinical manifestation, comorbidity, complication, diagnosis, model of atherosclerotic cerebrovascular disease, pathogenesis, prevention, rehabilitation, risk factor, and treatment. Totally, there are 1715 concepts in the 11-level ontology, covering 4588 Chinese terms, 6617 English terms, and 972 definitions. The ontology could be applied in real-world scenarios such as information retrieval, new expression discovery, named entity recognition, and knowledge fusion, and the use case proved that it could offer satisfying support to related medical scenarios.

CONCLUSIONS

The proposed ontology provided a clear set of cross-lingual concepts and terms with an explicit hierarchical structure, helping scientific researchers to quickly retrieve relevant medical literature, assisting data scientists to efficiently identify relevant contents in electronic health records, and providing a clear domain framework for academic reference.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3