Modeling the Paraphrase Detection Task over a Heterogeneous Graph Network with Data Augmentation

Published: 2020-09-01 Issue: 9 Volume: 11 Page: 422
ISSN: 2078-2489
Container-title: Information
Language: en
Short-container-title: Information

Author:

Anchiêta Rafael T., Sousa Rogério F. de, Pardo Thiago A. S.

Abstract

Paraphrase detection is a Natural Language Processing (NLP) task that aims to automatically identify whether two sentences convey the same meaning, even when they use different words. For Portuguese, most existing work models this task as a machine-learning problem, extracting features and training a classifier. In this paper, we take a different approach: we explore a graph-structured representation and model the paraphrase identification task over a heterogeneous network. We also adopt a back-translation strategy for data augmentation to balance the dataset we use. Our approach, although simple, outperforms the best results reported for paraphrase detection in Portuguese, suggesting that graph structures may better capture the semantic relatedness among sentences.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/11/9/422/pdf

References: 51 articles.

1. What Is a Paraphrase?

2. Generating Phrasal and Sentential Paraphrases: A Survey of Data-Driven Methods

Cited by 3 articles.

1. Unmasking artificial intelligence (AI): Identifying articles written by AI models; Indian Journal of Clinical Anaesthesia; 2024-06-15

2. Spotting the artificial intelligence mask: Detecting articles written by language models/ ChatGPT; Indian Journal of Anaesthesia; 2023-09

3. Dual-Channel Heterogeneous Graph Network for Author Name Disambiguation; Information; 2021-09-18