Affiliation:
1. Tahar Moulay University of Saida, Algeria
2. Department of Computer Science, Tahar Moulay University of Saida, Saida, Algeria
Abstract
With the spread of new technology over the last decade, it has become important to let users access information freely while preventing them from illegally copying and distributing it. In the age of information technology, plagiarism has become a topical and serious problem in the digital world. This work develops a new system for combating plagiarism using a new insect-behaviour algorithm called the Groping Cockroaches Classifier (GCC). Each suspicious text (cockroach) is classified (hidden) in a class (shelter), either plagiarism or no-plagiarism, using a security function based on the attractiveness of each class (calculated using the aggregation operators: shelter darkness, congener attraction, and security quality) and on the displacement probability (calculated using the naive Bayes algorithm). Experiments on the PAN 09 dataset, evaluated with recall, precision, F-measure, and entropy, show that GCC has clear advantages over other plagiarism detection techniques in the literature. Finally, a set of services was added to detect different cases of plagiarism, such as plagiarism with translation, plagiarism of ideas, plagiarism with synonymy, and paraphrase plagiarism.
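As an illustration of three of the validation measures named in the abstract, the sketch below computes precision, recall, and F-measure for a two-class plagiarism decision. It is not the authors' evaluation code: the function name `precision_recall_f_measure`, the label strings, and the sample labels are assumptions made for this example, and entropy is omitted because the abstract does not say how it is defined.

```python
def precision_recall_f_measure(true_labels, predicted_labels, positive="plagiarism"):
    """Return (precision, recall, F-measure) for the `positive` class.

    Hypothetical helper for illustration; label names are assumptions.
    """
    pairs = list(zip(true_labels, predicted_labels))
    # Count true positives, false positives, and false negatives.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_measure


if __name__ == "__main__":
    # Made-up labels for four suspicious texts, not data from the paper.
    truth = ["plagiarism", "plagiarism", "no-plagiarism", "no-plagiarism"]
    predicted = ["plagiarism", "no-plagiarism", "plagiarism", "no-plagiarism"]
    print(precision_recall_f_measure(truth, predicted))
```

Here the F-measure is the harmonic mean of precision and recall, which is the usual balanced form; the paper may weight the two differently.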