Affiliation:
1. SysDICE GmbH Franz‐Volhard‐Str. 5 68167 Mannheim Germany
2. Mechatronic GmbH Europaplatz 5 64293 Darmstadt Germany
Abstract
AbstractThe growing adoption of Model‐Based Systems Engineering (MBSE) in the medical sector has prompted a significant emphasis on the digitization of medical standards into norm models. This transformation promotes consistency and allows for tracing system model elements to the corresponding norm model elements. Despite these efforts, the current digitization activities heavily rely on manual extraction and transformation, particularly from PDF documents into SysML models. Concurrently, the proliferation of Artificial Intelligence (AI) applications in recent years presents an opportunity to automate such activities. This paper contributes to the integration of AI with MBSE, focusing solely on the extraction and transformation of medical standards information from documents into SysML norm models. It explores the initial outcomes of augmenting data extraction from medical standards using recent AI algorithms and integrating them into MBSE practices. The evaluation involves two approaches, an open‐source multimodal classifier model and a proprietary large language model. The study assesses these approaches on a medical standard and outlines future work, including the exploration of an open‐source large language model approach.
Reference35 articles.
1. AakankshaChowdhery SharanNarang JacobDevlin MaartenBosma GauravMishra AdamRoberts …NoahFiedel. (2022). PaLM: Scaling Language Modeling with Pathways.Journal of Machine Learning Research.
2. Abdoun N. &Chami M.(2022).Automatic Text Classification of PDF Documents using NLP Techniques. 32nd Annual INCOSE International Symposium 25–30 June 2022 — Detroit MI.https://doi.org/10.1002/iis2.12997
3. AshishVaswani NoamShazeer NikiParmar JakobUszkoreit LlionJones Aidan N.Gomez LukaszKaiser IlliaPolosukhin(2017). Attention is All you Need.NIPS.
4. Bast H. &Korzen C.(2017). A benchmark and evaluation for text extraction from PDF.ACM/IEEE Joint Conference on Digital Libraries 99–108.https://doi.org/10.1109/jcdl.2017.7991564
5. Brown T. B. Mann B. Ryder N. NickRyder Subbiah M. Kaplan J. …Amodei D.(2020). Language Models are Few-Shot Learners.Neural Information Processing Systems.