Affiliation:
1. Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria
Abstract
A semi-structured clinical problem list containing ∼1.9 million de-identified entries linked to ICD-10 codes was used to identify closely related real-world expressions. A log-likelihood based co-occurrence analysis generated seed-terms, which were integrated as part of a k-NN search, by leveraging SapBERT for the generation of an embedding representation.