Affiliation:
1. Institute of Distance and Open Learning, Mumbai, Maharashtra, India
Abstract
The large capacity of pre-trained language models such as BERT can lead to overfitting during fine-tuning, causing the model to perform poorly on unseen data and preventing it from realizing its full potential. To address this challenge systematically, we propose a novel approach for lightweight and efficient fine-tuning of BERT (Bidirectional Encoder Representations from Transformers) that aims to improve generalization and harness the full capabilities of the model. Our proposed approach incorporates several regularization techniques designed to adaptively manage the model's complexity. We plan to evaluate this approach on a range of NLP benchmarks, including GLUE (Wang et al., 2019), RACE (Lai et al., 2017), and SQuAD (Rajpurkar et al., 2016).
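As a rough illustration only, the sketch below fine-tunes BERT with two standard regularizers, dropout and decoupled weight decay, using the Hugging Face transformers library; the model name, hyperparameter values, and toy data are placeholders, and the adaptive complexity-management schemes proposed in the paper are not shown here.

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Hypothetical sketch: fine-tuning BERT with common regularizers.
# Dropout rates and weight decay are illustrative, not the paper's settings.
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased",
    num_labels=2,
    hidden_dropout_prob=0.2,           # dropout on hidden states
    attention_probs_dropout_prob=0.2,  # dropout on attention weights
)
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# Decoupled weight decay (AdamW) acts as an L2-style regularizer.
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5, weight_decay=0.01)

# Toy single-example batch; real experiments would iterate over a dataset.
batch = tokenizer(["an example sentence"], return_tensors="pt", padding=True)
labels = torch.tensor([1])

model.train()
outputs = model(**batch, labels=labels)
outputs.loss.backward()
torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)  # gradient clipping
optimizer.step()
optimizer.zero_grad()
```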