Two complementary AI approaches for predicting UMLS semantic group assignment: heuristic reasoning and deep learning-Reference-Cited by-同舟云学术

Two complementary AI approaches for predicting UMLS semantic group assignment: heuristic reasoning and deep learning

Published:2023-08-01 Issue:12 Volume:30 Page:1887-1894
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Mao Yuqing¹,Miller Randolph A¹,Bodenreider Olivier¹,Nguyen Vinh¹,Fung Kin Wah¹^ORCID

Affiliation:

1. National Library of Medicine, National Institutes of Health , Bethesda, Maryland, USA

Abstract

Abstract Objective Use heuristic, deep learning (DL), and hybrid AI methods to predict semantic group (SG) assignments for new UMLS Metathesaurus atoms, with target accuracy ≥95%. Materials and Methods We used train-test datasets from successive 2020AA–2022AB UMLS Metathesaurus releases. Our heuristic “waterfall” approach employed a sequence of 7 different SG prediction methods. Atoms not qualifying for a method were passed on to the next method. The DL approach generated BioWordVec and SapBERT embeddings for atom names, BioWordVec embeddings for source vocabulary names, and BioWordVec embeddings for atom names of the second-to-top nodes of an atom’s source hierarchy. We fed a concatenation of the 4 embeddings into a fully connected multilayer neural network with an output layer of 15 nodes (one for each SG). For both approaches, we developed methods to estimate the probability that their predicted SG for an atom would be correct. Based on these estimations, we developed 2 hybrid SG prediction methods combining the strengths of heuristic and DL methods. Results The heuristic waterfall approach accurately predicted 94.3% of SGs for 1 563 692 new unseen atoms. The DL accuracy on the same dataset was also 94.3%. The hybrid approaches achieved an average accuracy of 96.5%. Conclusion Our study demonstrated that AI methods can predict SG assignments for new UMLS atoms with sufficient accuracy to be potentially useful as an intermediate step in the time-consuming task of assigning new atoms to UMLS concepts. We showed that for SG prediction, combining heuristic methods and DL methods can produce better results than either alone.

Funder

NIH

National Library of Medicine

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

https://academic.oup.com/jamia/article-pdf/30/12/1887/53477601/ocad152.pdf

Reference33 articles.

1. The unified medical language system (UMLS): integrating biomedical terminology;Bodenreider;Nucleic Acids Res,2004

2. UMLS users and uses: a current overview;Amos;J Am Med Inform Assoc,2020

3. The unified medical language system;Lindberg;Yearb Med Inform,1993

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Standards in action: historical and current perspectives;Journal of the American Medical Informatics Association;2023-11-17

2. Dynamic Routing Policies for Multi-Skill Call Centers Using Deep Q Network;Mathematics;2023-11-16