Augmenting Semantic Lexicons Using Word Embeddings and Transfer Learning-Reference-Cited by-同舟云学术

Augmenting Semantic Lexicons Using Word Embeddings and Transfer Learning

Published:2022-01-24 Issue: Volume:4 Page:
ISSN:2624-8212
Container-title:Frontiers in Artificial Intelligence
language:
Short-container-title:Front. Artif. Intell.

Author:

Alshaabi Thayer,Van Oort Colin M.,Fudolig Mikaela Irene,Arnold Michael V.,Danforth Christopher M.,Dodds Peter Sheridan

Abstract

Sentiment-aware intelligent systems are essential to a wide array of applications. These systems are driven by language models which broadly fall into two paradigms: Lexicon-based and contextual. Although recent contextual models are increasingly dominant, we still see demand for lexicon-based models because of their interpretability and ease of use. For example, lexicon-based models allow researchers to readily determine which words and phrases contribute most to a change in measured sentiment. A challenge for any lexicon-based approach is that the lexicon needs to be routinely expanded with new words and expressions. Here, we propose two models for automatic lexicon expansion. Our first model establishes a baseline employing a simple and shallow neural network initialized with pre-trained word embeddings using a non-contextual approach. Our second model improves upon our baseline, featuring a deep Transformer-based network that brings to bear word definitions to estimate their lexical polarity. Our evaluation shows that both models are able to score new words with a similar accuracy to reviewers from Amazon Mechanical Turk, but at a fraction of the cost.

Funder

National Science Foundation

MassMutual Financial Group

Google

Publisher

Frontiers Media SA

Subject

Artificial Intelligence

Reference117 articles.

1. Tensorflow: a system for large-scale machine learning;Abadi,2016

2. Sentiment analysis of Twitter data;Agarwal,2011

3. On positivity bias in negative reviews;Aithal,2021

4. Storywrangler: a massive exploratorium for sociolinguistic, cultural, socioeconomic, and political timelines using Twitter;Alshaabi;Sci. Adv.

5. How the world's collective attention is being paid to a pandemic: COVID-19 related n-gram time series for 24 languages on Twitter;Alshaabi;PLoS One

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhanced Lexicon based Hybrid Method for Slang and Punctuation Scoring for Aspect Based Sentiment Analysis;2024 6th International Conference on Electrical Engineering and Information & Communication Technology (ICEEICT);2024-05-02

2. Punctuation and lexicon aid representation: A hybrid model for short text sentiment analysis on social media platform;Journal of King Saud University - Computer and Information Sciences;2024-03

3. Text-based emotion recognition using contextual phrase embedding model;Multimedia Tools and Applications;2023-03-16

4. Sentiment and structure in word co-occurrence networks on Twitter;Applied Network Science;2022-02-14