Affiliation:
1. 1 Trendyol, Data Science , ˙Istanbul , Türkiye
2. 2 Department of Computer Engineering , Cukurova University , Adana , Türkiye
Abstract
Abstract
The goal of this study is to develop churn models for sellers on the e-commerce marketplace by using machine learning methods. In this sense, three approaches were applied for developing the models. The dataset used in this study includes ten features, which are maturity type, maturity interval, city of the seller, total revenue of the seller, total transaction of the seller, sector type of the seller, business type of the seller, sales channel, installment option and discount type. Random Forest (RF) and Logistic Regression (LR) were used for churn analysis in all of the approaches. In the first approach, models were developed without applying preprocessing operations on the dataset. In the second and third approaches, under sampling and oversampling methods, respectively, were used to balance the data set. By using stratified cross validation on the dataset, F-Scores of the churn models were obtained. The results show that F-Scores were 0.76, 0.71 and 0.92 for the three approaches developed with RF, and 0.84, 0.68 and 0.69 for the three approaches developed with LR, respectively.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献