Multi-sentence and multi-intent classification using RoBERTa and graph convolutional neural network

Author:

Ravi Kumar1,Singh Ajay2,Gautam Chandan3

Affiliation:

1. HCL Technologies Canada Inc

2. BTIS, HCLTech Ltd

3. I2R, Agency for Science, Technology and Research

Abstract

Abstract Citation analysis has garnered significant attention in academia, particularly in the realm of scientometrics analysis. Most studies related to citation analysis focus on quantitative aspects, assigning equal weight to every citation regardless of its placement within the paper. However, understanding the distribution of citation weight across different sections of a research article is crucial for citation analysis and impact assessment. Therefore, the analysis of citation intent becomes a pivotal task in determining the qualitative importance of a citation within a scientific article. In this context, we undertook two essential tasks related to citation analysis: citation length analysis and citation intent analysis. Through citation length analysis, we identified the optimal number of citation sentences to consider around a cited sentence. Simultaneously, citation intent analysis aimed to categorize citations into seven distinct types, namely background, motivation, uses, extends, similarities, differences, and future work. For the latter task, we introduced two novel architectures based on graph neural networks, namely CiteIntentRoBERTaGCN and CiteIntentRoBERTaGAT. The performance of these proposed models was evaluated on five multi-intent datasets curated from 1,200 research papers, considering different context lengths. The results demonstrated that the proposed models achieved state-of-the-art performance.

Publisher

Research Square Platform LLC

Reference50 articles.

1. Teufel S, Siddharthan A, Tidhar D. Automatic classification of citation function. Proceedings of the 2006 conference on empirical methods in natural language processing, 2006, p. 103–10.

2. Citation analysis and discourse analysis revisited;White HD;Appl Linguist,2004

3. MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting;Lauscher A;ArXiv Preprint ArXiv,2021

4. Athar A, Teufel S. Detection of Implicit Citations for Sentiment Detection. Proceedings of the Workshop on Detecting Structure in Scholarly Discourse, Association for Computational Linguistics; 2012, p. 18–26.

5. Article citation study: Context enhanced citation sentiment detection;Vyas V;ArXiv Preprint ArXiv,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3