Author:
Wei Shuyong, Yu Defa, Lv Chenguo
Abstract
BERT is a pre-trained language model. Although the model has proven highly performant on a variety of natural language understanding tasks, its large size makes it hard to deploy in practical settings where computing resources are limited. To improve the efficiency of BERT for the sentiment analysis task, we propose a novel distilled version of BERT. It distills knowledge from the full-size BERT model, which serves as the teacher. The distilled model efficiently learns the teacher's last hidden state and soft labels, a design that differs from previous distilled models. We use a distillation learning objective that effectively transfers knowledge from the original large model to the compact one. Our model reduces the BERT model size by ∼40% while retaining ∼98.2% of its performance on the sentiment classification task. It achieves promising results on SST-2 sentiment analysis and outperforms the previous distilled model.
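To make the training objective more concrete, the following is a minimal sketch of a distillation loss that combines a hard-label term, a soft-label term on temperature-softened teacher logits, and a last-hidden-state alignment term, in the spirit of what the abstract describes. The function name `distillation_loss`, the temperature, and the weights `alpha` and `beta` are illustrative assumptions, not the authors' reported settings, and it assumes the student and teacher hidden states have matching (or pre-projected) sizes.

```python
# Hypothetical sketch of a distillation objective for a compact BERT student.
# All names, weights, and the temperature are illustrative assumptions.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits,
                      student_hidden, teacher_hidden,
                      labels, temperature=2.0, alpha=0.5, beta=0.3):
    """Weighted sum of hard-label CE, soft-label KL, and hidden-state alignment."""
    # Hard-label cross-entropy against the ground-truth sentiment labels.
    ce = F.cross_entropy(student_logits, labels)

    # Soft-label loss: KL divergence between temperature-softened
    # student and teacher class distributions.
    kl = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)

    # Last-hidden-state alignment via a cosine embedding loss
    # (target of +1 pulls student and teacher representations together).
    batch = student_hidden.size(0)
    target = torch.ones(batch, device=student_hidden.device)
    cos = F.cosine_embedding_loss(
        student_hidden.reshape(batch, -1),
        teacher_hidden.reshape(batch, -1),
        target,
    )

    return (1 - alpha - beta) * ce + alpha * kl + beta * cos
```

In practice the relative weights would be tuned on a development set, and the alignment term would typically be applied to the last hidden states that feed the classification head.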
Subject
General Physics and Astronomy
Cited by
3 articles.