Effective Multi-Label Classification Using Data Preprocessing

Published: 2021 Page: 90-109
ISSN:2327-1981
Container-title:Data Preprocessing, Active Learning, and Cost Perceptive Approaches for Resolving Data Imbalance

Author:

Tidake Vaishali S.¹, Sane Shirish S.²

Affiliation:

1. MVPS's KBT College of Engineering, Nashik, India

2. K. K. Wagh Institute of Engineering Education and Research, Nashik, India

Abstract

Nearest-neighbor search conventionally relies on feature similarity. Examples in multi-label datasets, however, are associated with multiple labels, so combining label dissimilarity with feature similarity may reveal better neighbors. The MLFLD and MLFLD-MAXP algorithms devised in this chapter exploit information extracted from such neighbors. Of the three distance metrics used to compute label dissimilarity, Hamming distance gave the greatest improvement in performance and is therefore used for further evaluation. The implemented algorithms are compared with the state-of-the-art MLkNN algorithm and improve on it for some datasets only. This chapter also introduces the parameters MLE and skew, which, together with an outlier parameter, help to analyze the multi-label and imbalanced nature of datasets. Investigation of the datasets against these parameters, along with experimentation, revealed the need for data preprocessing to remove outliers. This preprocessing improved the performance of the implemented algorithms on all measures, and its effectiveness is empirically validated.
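
To make the idea of pairing feature similarity with label dissimilarity concrete, the following is a minimal sketch and not the chapter's MLFLD or MLFLD-MAXP algorithm, whose details the abstract does not give. It assumes binary label vectors, Euclidean distance on features, and a hypothetical weight `alpha` that blends the two distances; the function names and the toy data are illustrative only.

```python
from math import sqrt


def hamming_label_distance(a, b):
    """Fraction of label positions at which two binary label vectors differ."""
    if len(a) != len(b):
        raise ValueError("label vectors must have the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def euclidean(u, v):
    """Euclidean distance between two feature vectors of equal length."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def rank_neighbors(query_x, query_y, examples, k=2, alpha=0.5):
    """Return indices of the k examples with the smallest blended distance.

    ASSUMPTION: the blend alpha * feature distance + (1 - alpha) * label
    distance is a hypothetical choice made for illustration; it is not
    taken from the chapter.
    """
    scored = []
    for idx, (x, y) in enumerate(examples):
        score = (alpha * euclidean(query_x, x)
                 + (1 - alpha) * hamming_label_distance(query_y, y))
        scored.append((score, idx))
    scored.sort()
    return [idx for _, idx in scored[:k]]


# Toy data: (feature vector, binary label vector) pairs.
examples = [
    ([0.0, 0.0], [1, 0, 1]),
    ([0.1, 0.0], [0, 1, 0]),
    ([0.2, 0.0], [1, 0, 1]),
    ([1.0, 1.0], [1, 0, 1]),
]

feature_only = rank_neighbors([0.0, 0.0], [1, 0, 1], examples, k=2, alpha=1.0)
blended = rank_neighbors([0.0, 0.0], [1, 0, 1], examples, k=2, alpha=0.5)
```

In this toy data, example 1 is close in feature space but shares no labels with the query, so the blended ranking prefers example 2, which is slightly farther away but carries identical labels.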

Publisher

IGI Global
