An Enhanced Neural Word Embedding Model for Transfer Learning-Reference-Cited by-同舟云学术

An Enhanced Neural Word Embedding Model for Transfer Learning

Published:2022-03-10 Issue:6 Volume:12 Page:2848
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Kowsher Md.^ORCID,Sobuj Md. Shohanur Islam^ORCID,Shahriar Md. Fahim^ORCID,Prottasha Nusrat Jahan,Arefin Mohammad Shamsul^ORCID,Dhar Pranab Kumar,Koshiba Takeshi^ORCID

Abstract

Due to the expansion of data generation, more and more natural language processing (NLP) tasks are needing to be solved. For this, word representation plays a vital role. Computation-based word embedding in various high languages is very useful. However, until now, low-resource languages such as Bangla have had very limited resources available in terms of models, toolkits, and datasets. Considering this fact, in this paper, an enhanced BanglaFastText word embedding model is developed using Python and two large pre-trained Bangla models of FastText (Skip-gram and cbow). These pre-trained models were trained on a collected large Bangla corpus (around 20 million points of text data, in which every paragraph of text is considered as a data point). BanglaFastText outperformed Facebook’s FastText by a significant margin. To evaluate and analyze the performance of these pre-trained models, the proposed work accomplished text classification based on three popular textual Bangla datasets, and developed models using various machine learning classical approaches, as well as a deep neural network. The evaluations showed a superior performance over existing word embedding techniques and the Facebook Bangla FastText pre-trained model for Bangla NLP. In addition, the performance in the original work concerning these textual datasets provides excellent results. A Python toolkit is proposed, which is convenient for accessing the models and using the models for word embedding, obtaining semantic relationships word-by-word or sentence-by-sentence; sentence embedding for classical machine learning approaches; and also the unsupervised finetuning of any Bangla linguistic dataset.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/6/2848/pdf

Reference37 articles.

1. Efficient Estimation of Word Representations in Vector Space;Mikolov;arXiv,2013

2. Modern Information Retrieval;Baeza-Yates,1999

3. Natural Language Processing (almost) from Scratch;Collobert;arXiv,2011

4. Attention Is All You Need;Vaswani;arXiv,2017

5. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding;Devlin;arXiv,2019

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large language models (LLMs): survey, technical frameworks, and future challenges;Artificial Intelligence Review;2024-08-18

2. Arabic Paraphrase Generation Using Transformer-Based Approaches;IEEE Access;2024

3. Emotion Classification using Generative Pre-trained Embedding and Machine Learning;2023 IEEE International Conference on Machine Learning and Applied Network Technologies (ICMLANT);2023-12-14

4. Utilizing XGBoost for the Prediction of Material Corrosion Rates from Embedded Tabular Data using Large Language Model;2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM);2023-12-05

5. Genetic data visualization using literature text-based neural networks: Examples associated with myocardial infarction;Neural Networks;2023-08