The SMART Text2FHIR Pipeline


Miller Timothy A., McMurry Andrew J., Jones James, Gottlieb Daniel, Mandl Kenneth D.


Abstract

Objective: To implement an open-source, free, and easily deployable high-throughput natural language processing (NLP) module that extracts concepts from clinician notes and maps them to Fast Healthcare Interoperability Resources (FHIR).

Materials and Methods: Using a popular open-source NLP tool (Apache cTAKES), we create FHIR resources that use modifier extensions to represent negation and NLP sourcing, and another extension to represent the provenance of extracted concepts.

Results: The SMART Text2FHIR Pipeline is an open-source tool, released through standard package managers and publicly available container images, that implements the mappings and enables ready conversion of clinical text to FHIR.

Discussion: With data liquidity increasing because of new interoperability regulations, NLP processes that output FHIR can provide a common language for transporting structured and unstructured data. This framework can be valuable for critical public health or clinical research use cases.

Conclusion: Future work should include mapping more categories of NLP-extracted information into FHIR resources and adding mappings from additional open-source NLP tools.
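The modifier-extension approach described under Materials and Methods can be illustrated with a minimal Python sketch. It is not the pipeline's actual code: the extension URLs, the function name concept_to_observation, the patient and document identifiers, and the choice of an Observation resource are all assumptions made for illustration. It only shows how a negated, NLP-derived concept might be carried in a FHIR-style resource.

```python
import json

# Hypothetical extension URLs, for illustration only.
# They are NOT the URLs used by the SMART Text2FHIR Pipeline.
NEGATION_URL = "http://example.org/fhir/StructureDefinition/nlp-negated"
SOURCE_URL = "http://example.org/fhir/StructureDefinition/nlp-source"
PROVENANCE_URL = "http://example.org/fhir/StructureDefinition/derivation-reference"


def concept_to_observation(patient_id, doc_id, system, code, display, negated, nlp_tool):
    """Build a FHIR-style Observation dict for one NLP-extracted concept.

    Negation and NLP sourcing go in modifierExtension because they change
    how the resource must be interpreted; provenance goes in a regular extension.
    """
    return {
        "resourceType": "Observation",
        "status": "preliminary",
        "subject": {"reference": f"Patient/{patient_id}"},
        "code": {"coding": [{"system": system, "code": code, "display": display}]},
        "modifierExtension": [
            {"url": NEGATION_URL, "valueBoolean": negated},
            {"url": SOURCE_URL, "valueString": nlp_tool},
        ],
        "extension": [
            {
                "url": PROVENANCE_URL,
                "valueReference": {"reference": f"DocumentReference/{doc_id}"},
            }
        ],
    }


# Example: the note says "denies fever", so the concept is extracted as negated.
obs = concept_to_observation(
    patient_id="example-patient",
    doc_id="example-note",
    system="http://snomed.info/sct",
    code="386661006",
    display="Fever",
    negated=True,
    nlp_tool="cTAKES",
)
print(json.dumps(obs, indent=2))
```

A consumer that does not understand a modifier extension is expected to reject or specially handle the resource, which is why a negated finding is a plausible fit for that mechanism rather than an ordinary extension.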


Cold Spring Harbor Laboratory







