Multi-Class Imbalanced Data Handling with Concept Drift in Fog Computing: A Taxonomy, Review, and Future Directions

Author:

Shareef Farhana1, Ijaz Humaira2, Shojafar Mohammad3, Naeem Muhammad Asif4

Affiliation:

1. Department of CS & IT, University of Sargodha, Sargodha, Pakistan

2. Computer Science & IT, University of Sargodha, Sargodha, Pakistan

3. 5G & 6G Innovation Centre (5G/6GIC), University of Surrey, Guildford, United Kingdom of Great Britain and Northern Ireland

4. Department of Computer Science, National University of Computer and Emerging Sciences, Islamabad, Pakistan

Abstract

The Internet of Things (IoT) is a network of physical objects, or “IoT components”, that are connected to the internet and equipped with sensors, electronics, software, and network connectivity. This connectivity enables IoT components to gather and share data. Many IoT devices are currently in operation, and they generate large volumes of data. When IoT devices first began collecting data, the cloud was the only place to analyze, filter, pre-process, and aggregate it. For IoT workloads, however, the cloud is limited by latency and by its more centralized method of distributing programs. Fog computing, a new form of computing, has been proposed to address these shortcomings of cloud computing. In an IoT context, sensors regularly communicate signal information, and edge devices process the data obtained from these sensors using Fog computing. Internal or external sensor problems, security breaches, and the integration of heterogeneous equipment all contribute to imbalanced data, i.e., data in which one class has comparatively more instances than another. Patterns extracted from such data are consequently imbalanced as well. Recent work has concentrated heavily on binary-class imbalance, which involves exactly two classes. The classification of multi-class imbalanced data, however, still needs to be addressed in Fog computing, even though the problem is widespread in other fields, including text categorization, human activity detection, and medical diagnosis. This study addresses that problem. It presents a systematic and in-depth comparative analysis of binary-class and multi-class imbalanced data handling strategies for batch and streaming data in IoT networks and Fog computing. The study has five major objectives. First, it reviews the Fog computing concept. Second, it outlines the optimization metrics used in Fog computing. Third, it focuses on binary-class and multi-class batch data handling for IoT networks and Fog computing. Fourth, it reviews and compares current imbalanced data handling methodologies for multi-class data streams. Fifth, it explains how to cope with concept drift, including novel and recurring classes, targeted optimization measures, and evaluation tools. Finally, the study highlights the best performance metrics and tools for concept drift, binary-class (batch and stream) data, and multi-class (batch and stream) data.

Publisher

Association for Computing Machinery (ACM)
