Abstract
Background: The Village Fund is an initiative by the central government to promote equitable regional development. However, it has also led to corruption. Many Indonesians share their opinions on the Village Fund on social media platforms like X, and news coverage is extensive on portals like detik.com. Objective: This study aims to classify data from social media and news coverage to enhance understanding. Methods: The research improves the decision tree algorithm by integrating other algorithms and techniques such as XGBoost and SMOTE. Ensuring high accuracy is vital for the credibility of machine learning classifications among the public. The study uses two different datasets, necessitating varied testing approaches. For the news portal dataset, a single test with seven labels is conducted, followed by enhancement with XGBoost. The X dataset undergoes two tests with datasets of 1200 and 3078 entries, using three labels. Conclusion: The evaluation results indicate that the highest accuracy achieved with the news portal data was 82%, thanks to a combination of decision tree algorithms with various parameters and the balancing effect of SMOTE. For the Twitter dataset with 3078 entries, the highest accuracy reached 95%, attributed to the application of ensemble techniques, particularly boosting.
Publisher
Universitas Nusantara PGRI Kediri