Decision Tree versus k-NN: A Performance Comparison for Air Quality Classification in Indonesia
-
Published:2024-05-18
Issue:1
Volume:2
Page:9-16
-
ISSN:3025-8618
-
Container-title:Infolitika Journal of Data Science
-
language:
-
Short-container-title:Infolitika J. Data Sci.
Author:
Sasmita Novi Reandy,Ramadeska Siti,Kesuma Zurnila Marli,Noviandy Teuku Rizky,Maulana Aga,Khairul Mhd,Suhendra Rivansyah
Abstract
Air quality can affect human health, the environment, and the sustainability of ecosystems, so efforts are needed to monitor and control air quality. The Plume Air Quality Index (PAQI) is one of the indices to measure and determine the level of air quality. In measuring the accuracy of the air quality level, it is necessary to do the right classification. Some previous studies have conducted classification analysis using the decision tree and K-Nearest Neighbor (k-NN) methods, but only evaluated using accuracy values. Therefore, this study uses both methods to evaluate the results of air quality level classification not only with accuracy but also with precision, recall, and F1-score. Secondary data of pollutant concentration values and PAQI categories based on particulate matter (PM2.5 and PM10), nitrogen dioxide (NO2), and ozone (O3) derived from Plume Labs for 33 provincial capitals in Indonesia in the time period from July 1 to December 31, 2022, were used in this study. From the results of comparing the performance of the two methods, it is found that the decision tree has a greater performance value than the performance value of k-NN. The decision tree performance values for accuracy, precision, recall and F1-score are 90.67%, 90.61%, 90.67%, and 90.63%, respectively. So, it can be concluded that the decision tree performs better than k-NN in classifying PAQI categories with better overall evaluation metric values.
Publisher
PT. Heca Sentra Analitika
Reference30 articles.
1. Agista, P. I., Gusdini, N., and Maharani, M. D. D. (2020). Air Quality Analysis with Air Pollution Standard Index (ISPU) and the Distribution of Pollutant Levels in DKI Jakarta Province, Jurnal SEOI – Fakultas Teknik Universitas Sahid Jakarta, Vol. 2, No. 2, 39–57. 2. Ahmad, N., Ul-Saufie, A. Z., Shaziayani, W. N., Abidin, A. W. Z., Zulazmi, N. E. S., and Harb, S. M. (2022). Evaluating the Performance of Random Forest and Multiple Linear Regression for Higher Observed PM10 Concentrations, Israa University Journal of Applied Science, Vol. 6, No. 1, 72–90. doi:10.52865/WHPM9019. 3. Idroes, G. M., Noviandy, T. R., Maulana, A., Zahriah, Z., Suhendrayatna, S., Suhartono, E., Khairan, K., Kusumo, F., Helwani, Z., and Abd Rahman, S. (2023). Urban Air Quality Classification Using Machine Learning Approach to Enhance Environmental Monitoring, Leuser Journal of Environmental Studies, Vol. 1, No. 2, 62–68. doi:10.60084/ljes.v1i2.99. 4. Plume. (2019). Plume AQI: An Air Quality Index Aligned with Health Recommendations. 5. Sanmorino, A., Alie, J., Ariati, N., and Wulanda, S. V. (2022). K-NN Based Air Classification as Indicator of the Index of Air Quality in Palembang, Jurnal Dan Penelitian Teknik Informatika, Vol. 7, No. 3, 853–859. doi:10.33395/sinkron.v7i3.11469.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|