Comparison of Modified Nazief&Adriani and Modified Enhanced Confix Stripping algorithms for Madurese Language Stemming-Reference-Cited by-同舟云学术

Comparison of Modified Nazief&Adriani and Modified Enhanced Confix Stripping algorithms for Madurese Language Stemming

Published:2023-08-05 Issue:2 Volume:7 Page:276-289
ISSN:2549-6824
Container-title:INTENSIF: Jurnal Ilmiah Penelitian dan Penerapan Teknologi Sistem Informasi
language:
Short-container-title:intensif

Author:

Lindrawati Enni^ORCID,Utami Ema^ORCID,Yaqin Ainul^ORCID

Abstract

The Madurese language has a unique morphology. The morphological uniqueness can be used to find basic words. The basic word process is called stemming. Stemming can be developed into an application for translating Madurese into Indonesian and even other languages. It can support the development of a Madurese language text plagiarism system. Stemming research on the Madurese language is still rare. Therefore, this study aims to find the basic words of the Madurese language using modifications to the Nazief & Adriani algorithm and Enhanced Confix Stripping (ECS) modifications. The study used 1000 Madurese words, consisting of 630 prefix words, 74 ending words, and 296 confix words. The results showed that the modification of the Nazief & Adriani algorithm was better, shown by the accuracy obtained of 88.8% with overstemming of 0.7% and understemming of 10.5%. As for ECS, an accuracy of 74.0% was obtained, 0.4% overstemming, and 25.6% understemming. In the same process, Nazief&Adriani's modification is faster than the ECS modification. For the Nazief&Adriani modification, it takes 13.31 seconds while for the ECS modification, it takes 210.88.

Publisher

Universitas Nusantara PGRI Kediri

Subject

General Medicine

Reference30 articles.

1. F. L. Fitri Lintang and F. Ulfatun Najicha, “Nilai-Nilai Sila Persatuan Indonesia Dalam Keberagaman Kebudayaan Indonesia,” J. Glob. Citiz. J. Ilm. Kaji. Pendidik. Kewarganegaraan, vol. 11, no. 1, pp. 79–85, 2022, doi: 10.33061/jgz.v11i1.7469.

2. R. Peter and M. S. Simatupang, “Keberagaman Bahasa Dan Budaya Sebagai Kekayaan Bangsa Indonesia,” Dialekt. J. Bahasa, Sastra Dan Budaya, vol. 9, no. 1, pp. 96–105, 2022, doi: 10.33541/dia.v9i1.4028.

3. A. F. Hidayati, “Afiks Nomina Deverbal dalam Kumpulan Cerpen Bahasa Madura,” Konf. Linguist. Tah. Atma Jaya 19, pp. 17–20, 2021.

4. I. Irwiandi and M. Norman, “Proses Morfologis pada Bahasa Madura: Studi pada Mahasiswa Madura di Universitas Trunojoyo,” AIJER Algazali Int. J. Educ. Res., vol. 5, no. 1, pp. 68–75, 2022.

5. T. Winarti et al., “Penanganan Kasus Overstemming dan Understemming dengan Modifikasi Algoritma Stemming,” IOP Conf. Ser. Mater. Sci. Eng., vol. 6, no. 1, pp. 199–206, 2020, doi: 10.18517/ijaseit.7.5.1705.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Sentiment Analysis of YouTube Users on Blackpink Kpop Group Using IndoBERT;INTENSIF: Jurnal Ilmiah Penelitian dan Penerapan Teknologi Sistem Informasi;2024-08-01

2. Unlocking Insights: A Literature Review on Enhanced Confix Stripping and Nazief & Adriani Algorithm Modifications for Makassar Language Text Stemming;International Journal of Innovative Science and Research Technology (IJISRT);2024-03-16

3. Classification of Indonesian Tweet Bullying on Twitter Using K-Nearest Neighbor;2023 International Conference on Informatics, Multimedia, Cyber and Informations System (ICIMCIS);2023-11-07