Author:
Almudaifer Abdullateef I.,Covington Whitney,Hairston JaMor,Deitch Zachary,Anand Ankit,Carroll Caleb M.,Crisan Estera,Bradford William,Walter Lauren A.,Eaton Ellen F.,Feldman Sue S.,Osborne John D.
Abstract
Abstract
Background
The semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty, conditionality, severity, and subject. Existing models for determining modifiers of clinical entities involve regular expression or features weights that are trained independently for each modifier.
Methods
We develop and evaluate a multi-task transformer architecture design where modifiers are learned and predicted jointly using the publicly available SemEval 2015 Task 14 corpus and a new Opioid Use Disorder (OUD) data set that contains modifiers shared with SemEval as well as novel modifiers specific for OUD. We evaluate the effectiveness of our multi-task learning approach versus previously published systems and assess the feasibility of transfer learning for clinical entity modifiers when only a portion of clinical modifiers are shared.
Results
Our approach achieved state-of-the-art results on the ShARe corpus from SemEval 2015 Task 14, showing an increase of 1.1% on weighted accuracy, 1.7% on unweighted accuracy, and 10% on micro F1 scores.
Conclusions
We show that learned weights from our shared model can be effectively transferred to a new partially matched data set, validating the use of transfer learning for clinical text modifiers.
Funder
Alabama Department of Mental Health,United States
Agency for Healthcare Research and Quality
Substance Abuse and Mental Health Services Administration
National Institute of Arthritis and Musculoskeletal and Skin Diseases,United States
Publisher
Springer Science and Business Media LLC
Reference35 articles.
1. Zhong Z, Chen D. A frustratingly easy approach for entity and relation extraction. 2020. arXiv preprint arXiv:2010.12812.
2. Wadden D, Wennberg U, Luan Y, Hajishirzi H. Entity, relation, and event extraction with contextualized span representations. 2019. arXiv preprint arXiv:1909.03546.
3. Soares LB, FitzGerald N, Ling J, Kwiatkowski T. Matching the blanks: distributional similarity for relation learning. 2019. arXiv preprint arXiv:1906.03158.
4. Fraile Navarro D, Ijaz K, Rezazadegan D, Rahimi-Ardabili H, Dras M, Coiera E, Berkovsky S. Clinical named entity recognition and relation extraction using natural language processing of medical free text: A systematic review. Int J Med Inform. 2023;177:105122. https://doi.org/10.1016/j.ijmedinf.2023.105122.
5. Chapman WW, Bridewell W, Hanbury P, Cooper GF, Buchanan BG. Evaluation of negation phrases in narrative clinical reports. In: Proceedings of the AMIA Symposium. American Medical Informatics Association; 2001. pp. 105.