Author:
Shrestha Shiva, Shakya Sushan, Gautam Sandeep
Abstract
Plagiarism is a major problem in the digital world, as people use others’ content without crediting the creator. Proper and efficient algorithms are therefore needed to find plagiarized content on the Internet. This research proposes two algorithms: the winnowing algorithm and the extended winnowing algorithm. The winnowing algorithm can only calculate the similarity rate between documents, whereas the extended algorithm can also mark the plagiarized text segments in the compared documents. Both algorithms calculate the similarity rate using the Jaccard coefficient. Although the extended algorithm’s text-marking feature is beneficial, it consumes more computational power, a trade-off discussed in this study. Previous research has used this approach, but none has compared the algorithms’ performance on short texts. This research therefore tests the algorithms on Twitter data, since a tweet contains at most 280 characters. The proposed application for detecting plagiarism in tweets was developed with Python for the back end and React for the front end.
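As a rough illustration of how a winnowing fingerprint and a Jaccard similarity rate could fit together, here is a minimal Python sketch. It is not the authors' implementation: the function names, the k-gram length `k=5`, the window size `window=4`, the CRC32 hash, and the normalization (lowercase, ASCII letters and digits only) are all assumptions chosen for illustration. The similarity rate is taken as J(A, B) = |A ∩ B| / |A ∪ B| over the two fingerprint sets.

```python
import re
import zlib


def kgram_hashes(text, k=5):
    """Hash every k-gram of the normalized text.

    Normalization (an assumption): lowercase, keep only ASCII letters and digits.
    """
    cleaned = re.sub(r"[^a-z0-9]", "", text.lower())
    return [
        zlib.crc32(cleaned[i:i + k].encode("utf-8"))
        for i in range(len(cleaned) - k + 1)
    ]


def winnow(hashes, window=4):
    """Keep the minimum hash of each sliding window as the fingerprint set."""
    if not hashes:
        return set()
    if len(hashes) < window:
        # Text too short for a full window: fall back to the single minimum.
        return {min(hashes)}
    return {
        min(hashes[i:i + window])
        for i in range(len(hashes) - window + 1)
    }


def jaccard(a, b):
    """Jaccard coefficient of two sets; defined here as 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similarity(text_a, text_b, k=5, window=4):
    """Similarity rate between two texts from their winnowed fingerprints."""
    fp_a = winnow(kgram_hashes(text_a, k), window)
    fp_b = winnow(kgram_hashes(text_b, k), window)
    return jaccard(fp_a, fp_b)
```

Calling `similarity(tweet_a, tweet_b)` returns a value between 0 and 1. This sketch keeps only hash values; marking the matching text segments, as the extended algorithm does, would additionally require recording the position of each selected hash.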
Publisher
Inventive Research Organization