Affiliation:
1. Naval Surface Warfare Center Indian Head Division Indian Head MD 20640 USA
Abstract
AbstractWe present a demonstration of the utility of Natural Language Processing (NLP) for aiding research into energetic materials and associated systems. The NLP method enables machine understanding of textual data, offering an automated route to knowledge discovery and information extraction from energetics text. We apply three established unsupervised NLP models: Latent Dirichlet Allocation, Word2Vec, and the Transformer to a large curated dataset of energetics‐related scientific articles. We demonstrate that each NLP algorithm is capable of identifying energetic topics and concepts, generating a language model which aligns with Subject Matter Expert knowledge. Furthermore, we present a document classification pipeline for energetics text. Our classification pipeline achieves 59–76 % accuracy depending on the NLP model used, with the highest performing Transformer model rivaling inter‐annotator agreement metrics. The NLP approaches studied in this work can identify concepts germane to energetics and therefore hold promise as a tool for accelerating energetics research efforts and energetics material development.
Subject
General Chemical Engineering,General Chemistry
Reference56 articles.
1. Applying machine learning techniques to predict the properties of energetic materials
2. B. C. Barnes D. C. Elton Z. Boukouvalas D. E. Taylor W. D. Mattson M. D. Fuge P. W. Chung arXiv1807.06156 2018.
3. Locally Optimizable Joint Embedding Framework to Design Nitrogen‐Rich Molecules that are Similar but Improved
4. D. C. Elton D. Turakhia N. Reddy Z. Boukouvalas M. D. Fuge R. M. Doherty P. W. Chung arXiv1903.00415 2019.
5. M. Puerto M. Kellett R. Nikopoulou M. D. Fuge R. Doherty P. W. Chung Z. Boukouvalas arXiv2206.00773 2022.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献