Design and implementation of information extraction system for scientific literature using fine-tuned deep learning models

Author:

Won Kwanghee1, Jang Youngsun1, Choi Hyung-do2, Shin Sung1

Affiliation:

1. South Dakota State University, Brookings, SD

2. Electronics and Telecommunications Research Institute, Daejeon, South Korea

Abstract

This paper presents an overview of a quality scoring system that utilizes pre-trained deep neural network models. Two types of deep learning models, a classification model and an extractive question answering (EQA) model, are used to implement components of the system. The abstracts of the scientific literature are classified into two groups, in-vivo and in-vitro, and a question answering model architecture is constructed to extract the following types of information: animal type, number of animals, exposure dose, and signal frequency. The Bidirectional Encoder Representations from Transformers (BERT) model, pre-trained on a large text corpus, is used as our baseline model for both the classification and EQA tasks. The models are fine-tuned with 455 EMF-related research papers. In our experiments, the fine-tuned model showed improved performance over the baseline on EQA tasks for the four categories of questions, and it also showed improvements on similar questions that were not used in training. This suggests the importance of retraining deep learning models in areas requiring domain expertise, such as scientific papers. However, additional research is needed on several implementation issues, such as cases where a context contains multiple answers or where no answer is given in the context.
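For a concrete picture of the two components, the following sketch (not taken from the paper) shows how a classification head and an EQA head can both be built on the same pre-trained BERT checkpoint with the Hugging Face Transformers library. The checkpoint name, label mapping, question wording, and example texts are illustrative assumptions, and both heads would still need to be fine-tuned on the annotated papers before producing meaningful outputs.

# Minimal sketch, assuming a Hugging Face Transformers setup; not the authors' code.
from transformers import (AutoTokenizer,
                          AutoModelForSequenceClassification,
                          AutoModelForQuestionAnswering,
                          pipeline)

BASE = "bert-base-uncased"  # assumed pre-trained BERT baseline
tokenizer = AutoTokenizer.from_pretrained(BASE)

# (1) Abstract classifier with two labels, in-vivo vs. in-vitro.
#     In practice this head is fine-tuned on labeled abstracts before use.
clf_head = AutoModelForSequenceClassification.from_pretrained(
    BASE, num_labels=2,
    id2label={0: "in-vivo", 1: "in-vitro"},
    label2id={"in-vivo": 0, "in-vitro": 1})
classify = pipeline("text-classification", model=clf_head, tokenizer=tokenizer)
print(classify("Sixty rats were exposed to a 915 MHz RF field for four weeks."))

# (2) Extractive QA head: returns an answer span from the abstract for
#     questions about animal type, number of animals, exposure dose,
#     and signal frequency.
qa_head = AutoModelForQuestionAnswering.from_pretrained(BASE)
answer = pipeline("question-answering", model=qa_head, tokenizer=tokenizer)
print(answer(question="What is the signal frequency?",
             context="Sixty rats were exposed to a 915 MHz RF field "
                     "at a whole-body SAR of 4 W/kg."))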

Publisher

Association for Computing Machinery (ACM)

Subject

Industrial and Manufacturing Engineering

