Active Learning for Effectively Fine-Tuning Transfer Learning to Downstream Task-Reference-Cited by-同舟云学术

Active Learning for Effectively Fine-Tuning Transfer Learning to Downstream Task

Published:2021-03 Issue:2 Volume:12 Page:1-24
ISSN:2157-6904
Container-title:ACM Transactions on Intelligent Systems and Technology
language:en
Short-container-title:ACM Trans. Intell. Syst. Technol.

Author:

Bashar Md Abul¹,Nayak Richi¹

Affiliation:

1. Queensland University of Technology, QLD, Australia

Abstract

Language model (LM) has become a common method of transfer learning in Natural Language Processing (NLP) tasks when working with small labeled datasets. An LM is pretrained using an easily available large unlabelled text corpus and is fine-tuned with the labelled data to apply to the target (i.e., downstream) task. As an LM is designed to capture the linguistic aspects of semantics, it can be biased to linguistic features. We argue that exposing an LM model during fine-tuning to instances that capture diverse semantic aspects (e.g., topical, linguistic, semantic relations) present in the dataset will improve its performance on the underlying task. We propose a Mixed Aspect Sampling (MAS) framework to sample instances that capture different semantic aspects of the dataset and use the ensemble classifier to improve the classification performance. Experimental results show that MAS performs better than random sampling as well as the state-of-the-art active learning models to abuse detection tasks where it is hard to collect the labelled data for building an accurate classifier.

Funder

QUT IFE Catapult fund

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3446343

Reference89 articles.

1. Detecting Hate Speech Against Women in English Tweets

2. Random-Sets for Dealing with Uncertainties in Relevance Feature

3. Latent topic feedback for information retrieval