Is your document novel? Let attention guide you. An attention-based model for document-level novelty detection

Published:2020-04-24 Issue:4 Volume:27 Page:427-454
ISSN:1351-3249
Container-title:Natural Language Engineering
language:en
Short-container-title:Nat. Lang. Eng.

Author:

Ghosal Tirthankar,Edithal Vignesh,Ekbal Asif,Bhattacharyya Pushpak,Chivukula Srinivasa Satya Sameer Kumar,Tsatsaronis George

Abstract

Detecting whether a document contains sufficient new information to be deemed novel is of immense significance in this age of data duplication. Existing techniques for document-level novelty detection mostly operate at the lexical level and are unable to address semantic-level redundancy. These techniques usually rely on handcrafted features extracted from the documents in a rule-based or traditional feature-based machine learning setup. Here, we present an effective approach based on a neural attention mechanism to detect document-level novelty without any manual feature engineering. We contend that the simple alignment of texts between the source and target document(s) could identify the state of novelty of a target document. Our deep neural architecture elicits inference knowledge from a large-scale natural language inference dataset, which proves crucial to the novelty detection task. Our approach is effective and outperforms the standard baselines and recent work on document-level novelty detection by a margin of $\sim$3% in terms of accuracy.
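The alignment idea in the abstract can be illustrated with a toy sketch. This is not the authors' architecture, which is a trained deep neural model drawing on natural language inference data; it only shows, under simplifying assumptions, how soft attention can align each target unit with the source units and how a poor alignment might signal novelty. The function names (`align`, `novelty_score`), the cosine-based score, and the three-dimensional vectors are all hypothetical and chosen for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cosine(u, v):
    nu, nv = math.sqrt(dot(u, u)), math.sqrt(dot(v, v))
    return dot(u, v) / (nu * nv) if nu and nv else 0.0


def align(target_vec, source_vecs):
    """Attention-weighted summary of the source units for one target unit.

    Assumption: plain dot-product attention over fixed vectors; a real
    model would learn these representations.
    """
    weights = softmax([dot(target_vec, s) for s in source_vecs])
    dim = len(target_vec)
    return [sum(w * s[i] for w, s in zip(weights, source_vecs)) for i in range(dim)]


def novelty_score(target_vecs, source_vecs):
    """Mean of (1 - cosine) between each target unit and its aligned summary.

    Hypothetical scoring rule: higher means the sources cover the target less.
    """
    scores = [1.0 - cosine(t, align(t, source_vecs)) for t in target_vecs]
    return sum(scores) / len(scores)


# Toy sentence vectors (made up for illustration).
source = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
redundant_target = [[1.0, 0.0, 0.0]]  # repeats a source unit
novel_target = [[0.0, 0.0, 1.0]]      # orthogonal to every source unit

print("redundant:", novelty_score(redundant_target, source))
print("novel:", novelty_score(novel_target, source))
```

In this toy setting the target that repeats a source unit scores lower than the target that is orthogonal to all source units. A learned model would replace the fixed vectors and the hand-written score with trained components.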

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics,Software

Cited by 10 articles.

1. Infectious risk events and their novelty in event-based surveillance: new definitions and annotated corpus;Language Resources and Evaluation;2024-03-05

2. SciND: a new triplet-based dataset for scientific novelty detection via knowledge graphs;International Journal on Digital Libraries;2024-01-08

3. Systematic Literature Review and Bibliometric Analysis on Addressing the Vanishing Gradient Issue in Deep Neural Networks for Text Data;Communications in Computer and Information Science;2024

4. Construction of Academic Innovation Chain Based on Multi-level Clustering of Field Literature;Lecture Notes in Computer Science;2024

5. Recent Advancements in Misinformation Detection;The Information Retrieval Series;2024