Graph-Based Siamese Network for Authorship Verification-Reference-Cited by-同舟云学术

Graph-Based Siamese Network for Authorship Verification

Published:2022-01-17 Issue:2 Volume:10 Page:277
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Embarcadero-Ruiz Daniel,Gómez-Adorno Helena^ORCID,Embarcadero-Ruiz Alberto,Sierra Gerardo^ORCID

Abstract

In this work, we propose a novel approach to solve the authorship identification task on a cross-topic and open-set scenario. Authorship verification is the task of determining whether or not two texts were written by the same author. We model the documents in a graph representation and then a graph neural network extracts relevant features from these graph representations. We present three strategies to represent the texts as graphs based on the co-occurrence of the POS labels of words. We propose a Siamese Network architecture composed of graph convolutional networks along with pooling and classification layers. We present different variants of the architecture and discuss the performance of each one. To evaluate our approach we used a collection of fanfiction texts provided by the PAN@CLEF 2021 shared task in two settings: a “small” corpus and a “large” corpus. Our graph-based approach achieved average scores (AUC ROC, F1, Brier score, F0.5u, and C@1) between 90% and 92.83% when training on the “small” and “large” corpus, respectively. Our model obtain results comparable to those of the state of the art in this task and greater than traditional baselines.

Funder

DGAPA-UNAM PAPIIT

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/10/2/277/pdf

Reference46 articles.

1. Authorship Attribution

2. A survey of modern authorship attribution methods

3. A Survey On Authorship Attribution Approaches;Mekala;Int. J. Comput. Eng. Res. (IJCER),2018

4. Who’s At The Keyboard? Authorship Attribution in Digital Evidence Investigations;Chaski;Int. J. Digit. Evid.,2005

5. Effective identification of source code authors using byte-level information

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Genre Classification of Books in Russian with Stylometric Features: A Case Study;Information;2024-06-07

2. Semantic Clustering and Transfer Learning in Social Media Texts Authorship Attribution;IEEE Access;2024

3. A New Text Representation Technique-Based Approach for Authorship Verification;Springer Proceedings in Mathematics & Statistics;2024

4. A generalized solution to verify authorship and detect style change in multi-authored documents;Proceedings of the International Conference on Advances in Social Networks Analysis and Mining;2023-11-06

5. ST-iFGSM: Enhancing Robustness of Human Mobility Signature Identification Model via Spatial-Temporal Iterative FGSM;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04