Accurate discovery of co-derivative documents via duplicate text detection-Reference-Cited by-同舟云学术

Accurate discovery of co-derivative documents via duplicate text detection

Published:2006-11 Issue:7 Volume:31 Page:595-609
ISSN:0306-4379
Container-title:Information Systems
language:en
Short-container-title:Information Systems

Author:

Bernstein Yaniv,Zobel Justin

Publisher

Elsevier BV

Subject

Hardware and Architecture,Information Systems,Software

Reference24 articles.

1. A.Z. Broder, On the resemblance and containment of documents, in: Compression and Complexity of Sequences (SEQUENCES’97)’, 1997, pp. 21–29.

2. M. Sanderson, Duplicate detection in the Reuters collection, Technical Report TR-1997-5, University of Glasgow, 1997.

3. N. Shivakumar, H. García-Molina, SCAM: a copy detection mechanism for digital documents, in: Proceedings of the Second Annual Conference on the Theory and Practice of Digital Libraries, 1995.

4. Methods for identifying versioned and plagiarised documents;Hoad;Journal of the American Society for Information Science and Technology,2003

5. U. Manber, Finding similar files in a large file system, in: Proceedings of the USENIX Winter 1994 Technical Conference, 1994, pp. 1–10.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Lucene-P2: A Distributed Platform for Privacy-Preserving Text-based Search;IEEE Transactions on Dependable and Secure Computing;2020

2. Indexing Word Sequences for Ranked Retrieval;ACM Transactions on Information Systems;2014-01

3. Boilerplate Detection and Recoding;Lecture Notes in Computer Science;2014

4. Fast, Practical Algorithms for Computing All the Repeats in a String;Mathematics in Computer Science;2010-04-15

5. Exploiting Sentence-Level Features for Near-Duplicate Document Detection;Information Retrieval Technology;2009