An Arabic Dialects Dictionary Using Word Embeddings-Reference-Cited by-同舟云学术

An Arabic Dialects Dictionary Using Word Embeddings

Published:2019-07 Issue:3 Volume:6 Page:18-31
ISSN:2334-4598
Container-title:International Journal of Rough Sets and Data Analysis
language:en
Short-container-title:

Author:

Chaimae Azroumahli¹,El Younoussi Yacine¹,Moussaoui Otman¹,Zahidi Youssra¹

Affiliation:

1. National School of Applied Sciences, Abdel Malek Essaâdi University, Morocco

Abstract

The dialectical Arabic and the Modern Standard Arabic lacks sufficient standardized language resources to enable the tasks of Arabic language processing, despite it being an active research area. This work addresses this issue by firstly highlighting the steps and the issues related to building a multi Arabic dialect corpus using web data from blogs and social media platforms (i.e. Facebook, Twitter, etc.). This is to create a vectorized dictionary for the crawled data using the word Embeddings. In other terms, the goal of this article is to build an updated multi-dialect data set, and then, to extract an annotated corpus from it.

Publisher

IGI Global

Subject

General Medicine

Reference33 articles.

1. Arabic Diacritics based Steganography

2. Using Word Embedding and Ensemble Learning for Highly Imbalanced Data Sentiment Analysis in Short Arabic Text

3. Al-sabbagh, R., & Girju, R. (2012). YADAC: Yet another Dialectal Arabic Corpus. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (pp. 2882–2889). Academic Press.

4. AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Natural Language Processing for Arabic Sentiment Analysis: A Systematic Literature Review;IEEE Transactions on Big Data;2024-10

2. Towards an Open Domain Arabic Question Answering System: Assessment of the Bert Approach;Communications in Computer and Information Science;2024

3. Arabic Named Entity Recognition: Approaches, Datasets, and Comparative Study;Lecture Notes in Networks and Systems;2024

4. BERT for Arabic NLP Applications: Pretraining and Finetuning MSA and Arabic Dialects;Communications in Computer and Information Science;2023-11-21