Author:
Loda ,Krebs ,Danhof ,Schreder ,Solimando ,Strifler ,Rasche ,Kortüm ,Kerscher ,Knop ,Puppe ,Einsele ,Bittrich
Abstract
Background: Natural language processing (NLP) is a powerful tool supporting the generation of Real-World Evidence (RWE). There is no NLP system that enables the extensive querying of parameters specific to multiple myeloma (MM) out of unstructured medical reports. We therefore created a MM-specific ontology to accelerate the information extraction (IE) out of unstructured text. Methods: Our MM ontology consists of extensive MM-specific and hierarchically structured attributes and values. We implemented “A Rule-based Information Extraction System” (ARIES) that uses this ontology. We evaluated ARIES on 200 randomly selected medical reports of patients diagnosed with MM. Results: Our system achieved a high F1-Score of 0.92 on the evaluation dataset with a precision of 0.87 and recall of 0.98. Conclusions: Our rule-based IE system enables the comprehensive querying of medical reports. The IE accelerates the extraction of data and enables clinicians to faster generate RWE on hematological issues. RWE helps clinicians to make decisions in an evidence-based manner. Our tool easily accelerates the integration of research evidence into everyday clinical practice.
Reference25 articles.
1. Zentrum für Krebsregisterdaten. Multiples Myelomhttps://www.krebsdaten.de/Krebs/DE/Content/Krebsarten/Multiples%20Myelom/multiples_myelom_node.html
2. Real-world evidence research based on big data
3. Natural Language Processing in Oncology
4. Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review
5. Semi-Automatic Terminology Generation for Information Extraction from German Chest X-Ray Reports;Krebs;Stud. Health Technol. Inf.,2017
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献