Semantic Structuring of and Information Extraction from Medical Documents Using the UMLS-Reference-Cited by-同舟云学术

Semantic Structuring of and Information Extraction from Medical Documents Using the UMLS

Published:2008 Issue:05 Volume:47 Page:425-434
ISSN:0026-1270
Container-title:Methods of Information in Medicine
language:en
Short-container-title:Methods Inf Med

Author:

Denecke K.

Abstract

Summary Objectives: This paper introduces SeReMeD (Semantic Representation of Medical Documents), a method for automatically generating knowledge representations from natural language documents. The suitability of the Unified Medical Language System (UMLS) as domain knowledge for this method is analyzed. Methods: SeReMeD combines existing language engineering methods and semantic transformation rules for mapping syntactic information to semantic roles. In this way, the relevant content of medical documents is mapped to semantic structures. In order to extract specific data, these semantic structures are searched for concepts and semantic roles. A study is carried out that uses SeReMeD to detect specific data in medical narratives such as documented diagnoses or procedures. Results: The system is tested on chest X-ray reports. In first evaluations of the system’s performance, the generation of semantic structures achieves a correctness of 80%, whereas the extraction of documented findings obtains values of 93% precision and 83% recall. Conclusions: The results suggest that the methods described here can be used to accurately extract data from medical narratives, although there is also some potential for improving the results. The proposed methods provide two main benefits. By using existing language engineering methods, the effort required to construct a medical information extraction system is reduced. It is also possible to change the domain knowledge and therefore to create a more (or less) specialized system, capable of handling various medical sub-domains.

Publisher

Georg Thieme Verlag KG

Subject

Health Information Management,Advanced and Specialized Nursing,Health Informatics

Link

http://www.thieme-connect.de/products/ejournals/pdf/10.3414/ME0508.pdf

Cited by 22 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Automated thematic analysis of health information technology (HIT) related incident reports;Knowledge Management & E-Learning: An International Journal;2021-12-30

2. Study on structured method of Chinese MRI report of nasopharyngeal carcinoma;BMC Medical Informatics and Decision Making;2021-07

3. Biological Network Mining;Modeling Transcriptional Regulation;2021

4. HerCulB: content-based information extraction and retrieval for cultural heritage of the Balkans;The Electronic Library;2020-10-30

5. The Unified Medical Language System at 30 Years and How It Is Used and Published: Systematic Review and Content Analysis (Preprint);2020-05-25