Comparison of Nazief-Adriani and Paice-Husk algorithm for Indonesian text stemming process-Reference-Cited by-同舟云学术

Comparison of Nazief-Adriani and Paice-Husk algorithm for Indonesian text stemming process

Published:2021-03-01 Issue:3 Volume:1098 Page:032044
ISSN:1757-8981
Container-title:IOP Conference Series: Materials Science and Engineering
language:
Short-container-title:IOP Conf. Ser.: Mater. Sci. Eng.

Author:

Jumadi J,Maylawati D S,Pratiwi L D,Ramdhani M A

Abstract

Abstract Stemming is a process contained in the pre-processing stage that recognizes basic words (stem word) by combining or solving each of the variants of a word. Every language is unique, the most popular stemming algorithm for Indonesian text is Nazief-Adriani algorithm. Therefore, this study aims to compare Nazief-Adriani algorithm with another stemming algorithm for Indonesian text, that is Paice-Husk stemming algorithm which is commonly used for English. Beside, Nazief-Adriani and Paice-Husk algorithm for stemming process, this study use McCabe Cyclometic Complexity Metrix to evaluate the complexity of algorithm. Based on the experiment result with 20 sentences as data with a thousand words, the accuracy of the Nazief-Adriani algorithm is better than the Paice-Husk algorithm, which is 91.87% compared to 64.43%. Likewise, in terms of complexity, the algorithm is still more complex Paice-Husk than Nazief-Adriani. However, in terms of processing time, the Paice-Husk algorithm is slightly faster than the Nazief-Adriani algorithm. These results indicate that the Paice-Husk algorithm requires a more complete implementation of Indonesian morphological and grammatical rules to produce the better Indonesian stem words.

Publisher

IOP Publishing

Subject

General Medicine

Link

https://iopscience.iop.org/article/10.1088/1757-899X/1098/3/032044/pdf

Reference33 articles.

1. Text Knowledge Mining: And Approach to Text Mining;Torre;ESTYLF08,2008

2. Text mining;Witten,2004

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unlocking Insights: A Literature Review on Enhanced Confix Stripping and Nazief & Adriani Algorithm Modifications for Makassar Language Text Stemming;International Journal of Innovative Science and Research Technology (IJISRT);2024-03-16

2. Analisis Sentimen: Pengaruh Jam Kerja Terhadap Kesehatan Mental Generasi Z;Journal of Applied Computer Science and Technology;2024-02-03

3. Comparison of Modified Nazief&Adriani and Modified Enhanced Confix Stripping algorithms for Madurese Language Stemming;INTENSIF: Jurnal Ilmiah Penelitian dan Penerapan Teknologi Sistem Informasi;2023-08-05

4. Spelling Correction Using the Levenshtein Distance and Nazief and Adriani Algorithm for Keyword Search Process Indonesian Qur'an Translation;2022 Seventh International Conference on Informatics and Computing (ICIC);2022-12-08

5. Stemming Algorithm for the Indonesian Language: A Scientometric View;2022 IEEE Creative Communication and Innovative Technology (ICCIT);2022-11-22