Identifying Chemical Reactions and Their Associated Attributes in Patents

Author:

Mahendran Darshini,Gurdin Gabrielle,Lewinski Nastassja,Tang Christina,McInnes Bridget T.

Abstract

Chemical patents are an essential source of information about novel chemicals and chemical reactions. However, with the increasing volume of such patents, mining information about these chemicals and chemical reactions has become a time-intensive and laborious endeavor. In this study, we present a system to extract chemical reaction events from patents automatically. Our approach consists of two steps: 1) named entity recognition (NER)—the automatic identification of chemical reaction parameters from the corresponding text, and 2) event extraction (EE)—the automatic classifying and linking of entities based on their relationships to each other. For our NER system, we evaluate bidirectional long short-term memory (BiLSTM)-based and bidirectional encoder representations from transformer (BERT)-based methods. For our EE system, we evaluate BERT-based, convolutional neural network (CNN)-based, and rule-based methods. We evaluate our NER and EE components independently and as an end-to-end system, reporting the precision, recall, and F1 score. Our results show that the BiLSTM-based method performed best at identifying the entities, and the CNN-based method performed best at extracting events.

Funder

National Science Foundation

Publisher

Frontiers Media SA

Reference24 articles.

1. Publicly Available Clinical Bert Embeddings;Alsentzer;arXiv preprint arXiv:1904.03323,2019

2. Project Title CharlesP. 2013

3. Named Entity Recognition in Chemical Patents Using Ensemble of Contextual Language Models;Copara;arXiv pSreprint arXiv:2007.12569,2020

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Big Data Analysis of Patents and Their Applications in Biomedical Research and Development;Exploring Computational Pharmaceutics ‐ AI and Modeling in Pharma 4.0;2024-06-21

2. Unleashing the Power of Knowledge Extraction from Scientific Literature in Catalysis;Journal of Chemical Information and Modeling;2022-06-30

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3