Idiomatic Expression Identification using Semantic Compatibility

Author:

Zeng Ziheng1,Bhat Suma2

Affiliation:

1. Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Champaign, IL USA. zzeng13@illinois.edu

2. Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Champaign, IL USA. spbhat2@illinois.edu

Abstract

AbstractIdiomatic expressions are an integral part of natural language and constantly being added to a language. Owing to their non-compositionality and their ability to take on a figurative or literal meaning depending on the sentential context, they have been a classical challenge for NLP systems. To address this challenge, we study the task of detecting whether a sentence has an idiomatic expression and localizing it when it occurs in a figurative sense. Prior research for this task has studied specific classes of idiomatic expressions offering limited views of their generalizability to new idioms. We propose a multi-stage neural architecture with attention flow as a solution. The network effectively fuses contextual and lexical information at different levels using word and sub-word representations. Empirical evaluations on three of the largest benchmark datasets with idiomatic expressions of varied syntactic patterns and degrees of non-compositionality show that our proposed model achieves new state-of-the-art results. A salient feature of the model is its ability to identify idioms unseen during training with gains from 1.4% to 30.8% over competitive baselines on the largest dataset.

Publisher

MIT Press

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Deep learning-based idiomatic expression recognition for the Amharic language;PLOS ONE;2023-12-14

2. Transitive and Intransitive Verb Analysis for Idiomatic Expression Understanding: An NLP-based Framework;International Journal of Advanced Research in Science, Communication and Technology;2023-06-27

3. Computational Actualization of Idioms: Authorial Corpus-based Idiomaticity of Contemporary British Fiction;2022 IEEE 17th International Conference on Computer Sciences and Information Technologies (CSIT);2022-11-10

4. Getting BART to Ride the Idiomatic Train: Learning to Represent Idiomatic Expressions;Transactions of the Association for Computational Linguistics;2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3