An autonomous mixed data oversampling method for AIOT-based churn recognition and personalized recommendations using behavioral segmentation

Author:

Fatima Ghulam1, Khan Salabat1,2, Aadil Farhan1, Kim Do Hyuen3, Atteia Ghada4, Alabdulhafith Maali4

Affiliation:

1. Department of Computer Science, Comsats University Islamabad, Attock Campus Pakistan, Attock, Punjab, Pakistan

2. Big Data Research Center, Jeju National University, Jeju, Korea

3. Department of Computer Engineering, Jeju National University, Jeju Special Self-Governing Province, South Korea

4. Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Riyadh, Saudi Arabia

Abstract

The telecom sector is undergoing a digital transformation through the integration of artificial intelligence (AI) and Internet of Things (IoT) technologies. Customer retention in this context relies on autonomous AI methods that analyze IoT device data patterns in relation to the offered service packages. A significant limitation of existing studies is that they treat churn recognition and customer segmentation as separate tasks, which diminishes overall system accuracy. This study introduces a unified customer analytics platform that treats churn recognition and segmentation as a bi-level optimization problem. The proposed framework includes an automated machine learning (AutoML) oversampling method that handles three mixed datasets of customer churn features while addressing imbalanced class distributions. To enhance performance, the study uses oversampling methods such as the synthetic minority oversampling technique for nominal and continuous features (SMOTE-NC) and synthetic minority oversampling with encoded nominal and continuous features (SMOTE-ENC). Performance is evaluated with 10-fold cross-validation, measuring accuracy and F1-score. Simulation results demonstrate that the proposed strategy, particularly Random Forest (RF) with SMOTE-NC, outperforms standard methods with SMOTE. It achieves accuracy rates of 79.24%, 94.54%, and 69.57%, and F1-scores of 65.25%, 81.87%, and 45.62% for the IBM, Kaggle Telco, and Cell2Cell datasets, respectively. The proposed method autonomously determines the number and density of clusters. Factor analysis employing Bayesian logistic regression identifies influential factors for accurate customer segmentation. Furthermore, the study segments consumers behaviorally and generates targeted recommendations for personalized service packages, benefiting decision-makers.
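For readers unfamiliar with SMOTE-NC, the following is a minimal, simplified sketch of its core idea in plain Python: continuous features of a synthetic minority row are interpolated between a minority sample and one of its nearest neighbours, while nominal features take the most frequent value among those neighbours. It is not the authors' implementation and omits the AutoML, Random Forest, and cross-validation stages. The function name `smote_nc`, its parameters, and the toy column layout (monthly charge, tenure, contract type) are assumptions made for illustration only.

```python
import random
import statistics
from collections import Counter


def smote_nc(minority, cat_idx, n_new, k=5, seed=0):
    """Create n_new synthetic minority rows from mixed-type rows.

    minority: list of rows (lists); cat_idx: set of nominal column indices.
    Assumes at least one continuous column and more than k minority rows.
    """
    rng = random.Random(seed)
    n_cols = len(minority[0])
    num_idx = [j for j in range(n_cols) if j not in cat_idx]
    # Penalty for a nominal mismatch: median of the continuous columns'
    # standard deviations, computed on the minority class.
    penalty = statistics.median(
        [statistics.pstdev([row[j] for row in minority]) for j in num_idx]
    )

    def distance(a, b):
        # Euclidean distance on continuous columns, plus penalty^2 for
        # every nominal column where the two rows disagree.
        d2 = sum((a[j] - b[j]) ** 2 for j in num_idx)
        d2 += sum(penalty ** 2 for j in cat_idx if a[j] != b[j])
        return d2 ** 0.5

    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base row (excluding itself).
        neighbours = sorted(
            (r for r in minority if r is not base),
            key=lambda r: distance(base, r),
        )[:k]
        mate = rng.choice(neighbours)
        gap = rng.random()
        row = list(base)
        for j in num_idx:
            # Continuous features: interpolate between base and a neighbour.
            row[j] = base[j] + gap * (mate[j] - base[j])
        for j in cat_idx:
            # Nominal features: most frequent value among the k neighbours.
            row[j] = Counter(r[j] for r in neighbours).most_common(1)[0][0]
        synthetic.append(row)
    return synthetic


# Hypothetical toy data: [monthly_charge, tenure_months, contract_type]
data_rng = random.Random(42)
contracts = ["monthly", "yearly"]
churned = [
    [data_rng.uniform(60, 100), data_rng.uniform(1, 12), data_rng.choice(contracts)]
    for _ in range(20)
]
stayed_count = 80  # assumed size of the majority (non-churn) class
new_rows = smote_nc(churned, cat_idx={2}, n_new=stayed_count - len(churned))
balanced_minority = churned + new_rows
```

In practice, a maintained library implementation would normally be preferred over a hand-written one, and oversampling should be applied only to the training portion of each cross-validation fold so that synthetic rows never leak into the evaluation data.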

Funder

Princess Nourah bint Abdulrahman University Researchers Supporting Project number

Princess Nourah bint Abdulrahman University

National Research Foundation of Korea

Creative Research Project

Publisher

PeerJ

Subject

General Computer Science

