Abstract
The application of machine learning algorithms across various fields is gaining momentum, and the results increasingly emphasize the need for further testing and implementation. This is driven by the potential to streamline and expedite numerous processes. In this paper, we have employed five algorithms: KNN, Decision Tree, Random Forest, Logistic Regression, and Naive Bayes, and these algorithms have been tested in three large datasets. On average, their performance ranges from a minimum of 80% to a maximum of 90%. Data preprocessing has been completed, and concurrently, we have implemented the SMOTE algorithm to address the challenge of unbalanced data in this research. Simultaneously, the Naïve Bayes algorithm yields the most favorable results of Accuracy, Precision, Recall, and F1 Score, for the “is_arrested” class. Furthermore, to assess the performance of each algorithm, we employed metrics including Accuracy, Precision, Recall, and F1 Score. These metrics allowed us to decide which algorithm achieved the most effective classification.
Publisher
International Association of Online Engineering (IAOE)