RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm

Authors:

Hamid Gharagozlou¹, Javad Mohammadzadeh¹, Azam Bastanfard¹, Saeed Shiry Ghidary²

Affiliations:

1. Department of Computer Engineering, Karaj Branch, Islamic Azad University, Karaj, Iran

2. School of Digital, Technologies, and Arts, Staffordshire University, Stoke-on-Trent, UK

Abstract

Answer selection (AS) is a critical subtask of the open-domain question answering (QA) problem. This paper proposes a method called RLAS-BIABC for AS, built on an attention-mechanism-based long short-term memory (LSTM) network and bidirectional encoder representations from transformers (BERT) word embeddings, enriched by an improved artificial bee colony (ABC) algorithm for pretraining and a reinforcement learning-based backpropagation (BP) algorithm for training. BERT can be incorporated into downstream tasks and fine-tuned as a unified task-specific architecture, and the pretrained BERT model can capture a range of linguistic properties. Existing algorithms typically train the AS model as a two-class classifier on positive-negative pairs. A positive pair contains a question and a genuine answer, while a negative pair contains a question and a fake answer; the output should be one for positive pairs and zero for negative ones. Because negative pairs usually far outnumber positive ones, the classification problem is imbalanced, which drastically reduces system performance. To address this, we formulate classification as a sequential decision-making process in which the agent takes one sample at each step and classifies it. For each classification action, the agent receives a reward, and the reward for the majority class is smaller than the reward for the minority class. Ultimately, the agent learns the optimal values of the policy weights. We initialize the policy weights with the improved ABC algorithm; this initialization helps avoid problems such as getting stuck in local optima. Although ABC performs well on most tasks, it has a weakness: it disregards the fitness of related pairs of individuals when discovering a neighboring food source position. Therefore, this paper also proposes a mutual learning technique that modifies the produced candidate food source using the fitter of two individuals selected by a mutual learning factor. We tested our model on three datasets, LegalQA, TrecQA, and WikiQA, and the results show that RLAS-BIABC can be recognized as a state-of-the-art method.
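The abstract describes two algorithmic ideas: a class-imbalance-aware reward that pays more for minority-class (positive-pair) decisions than for majority-class ones, and a mutual-learning move for ABC that steers a candidate food source toward the fitter of two selected individuals. The Python sketch below is illustrative only and is not the authors' implementation; the function names, reward magnitudes, and the exact form of the mutual-learning update are assumptions made for the example.

```python
import numpy as np

# Illustrative sketch only (not the paper's code). The reward asymmetry and
# the mutual-learning update below are assumed forms of the ideas described
# in the abstract.

def classification_reward(action, label, r_minority=1.0, r_majority=0.1):
    """Reward for one step of the sequential classification process.

    Correct decisions earn +r and wrong decisions earn -r, where r is larger
    for the minority class (positive question-answer pairs, label == 1)
    than for the majority class (negative pairs, label == 0).
    """
    r = r_minority if label == 1 else r_majority
    return r if action == label else -r


def mutual_learning_candidate(x_i, x_k, fit_i, fit_k, factor=0.5, rng=None):
    """Generate a candidate food source from two individuals.

    Instead of ignoring fitness, the candidate is pulled toward the fitter
    of the two individuals, scaled by a mutual learning factor (assumed form).
    """
    rng = np.random.default_rng() if rng is None else rng
    better, worse = (x_i, x_k) if fit_i >= fit_k else (x_k, x_i)
    phi = rng.uniform(-1.0, 1.0, size=np.shape(worse))
    return worse + factor * phi * (better - worse)


if __name__ == "__main__":
    # Imbalanced episode: mostly negative pairs, one positive pair.
    labels = [0, 0, 0, 1, 0]
    actions = [0, 1, 0, 1, 0]          # one false positive, one true positive
    total = sum(classification_reward(a, y) for a, y in zip(actions, labels))
    print(f"episode return: {total:+.2f}")

    # Mutual-learning candidate between two policy-weight vectors.
    rng = np.random.default_rng(0)
    x_i, x_k = rng.normal(size=4), rng.normal(size=4)
    print(mutual_learning_candidate(x_i, x_k, fit_i=0.8, fit_k=0.3, rng=rng))
```

The reward asymmetry is the key design choice: a correct decision on the rare positive class contributes more to the episode return than many correct decisions on the abundant negative class, which counteracts the imbalance during policy learning.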

Publisher

Hindawi Limited

Subject

General Mathematics, General Medicine, General Neuroscience, General Computer Science
