Unified Graph-Based Missing Label Propagation Method for Multilabel Text Classification-Reference-Cited by-同舟云学术

Unified Graph-Based Missing Label Propagation Method for Multilabel Text Classification

Published:2022-01-31 Issue:2 Volume:14 Page:286
ISSN:2073-8994
Container-title:Symmetry
language:en
Short-container-title:Symmetry

Author:

Taha Adil Yaseen,Tiun Sabrina,Rahman Abdul Hadi Abd,Ayob Masri^ORCID,Abdulameer Ali Sabah

Abstract

In multilabel classification, each sample can be allocated to multiple class labels at the same time. However, one of the prominent problems of multilabel classification is missing labels (incomplete labels) in multilabel text. The multilabel classification performance is reduced significantly with the presence of missing labels. In order to address the incomplete or missing label problem, this study proposes two methods: an aggregated feature and label graph-based missing label handling method (GB-AS), and a unified graph-based missing label propagation method (UG-MLP). GB-AS is used to obtain an initial label matrix based on the similarity of both document levels: feature-based weighting representation and label-based weighting representation. On the other hand, UG-MLP is introduced to construct a mixed graph that combines GB-AS and label correlations into a single groundwork. A high-order label correlation is learned from the incomplete training data and applied to supplement the missing label matrix, which guides the creation of multilabel classification models. The combination of the mixed graphs by UG-MLP is aimed to obtain the benefits of both graphs to increase the classification performance. To evaluate UG-MLP, the metrics of precision, recall and F-measure were used on three benchmark datasets, namely, the Reuters-21578, Bibtex and Enron datasets. The experimental results show that UG-MLP outperformed GB-AS as well as other state-of-the-art approaches. Therefore, we can infer from the findings that by plotting a unified graph based on joining aggregated feature and label weightings together with the label correlation, the performance of multilabel classification can be improved.

Funder

ministry of higher education Malaysia

Publisher

MDPI AG

Subject

Physics and Astronomy (miscellaneous),General Mathematics,Chemistry (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2073-8994/14/2/286/pdf

Reference35 articles.

1. A Structure-Induced Framework for Multi-Label Feature Selection With Highly Incomplete Labels

2. Semi-supervised multi-label classification using incomplete label information

3. Improving multi-label classification with missing labels by learning label-specific features

4. Multi-Label Low-dimensional Embedding with Missing Labels

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Uncovering hidden patterns: low-rank label correlations for multi-label weak-label learning;International Journal of Machine Learning and Cybernetics;2024-09-11

2. Enhancing identification performance of cognitive impairment high-risk based on a semi-supervised learning method;Journal of Biomedical Informatics;2024-09

3. Integrated self-supervised label propagation for label imbalanced sets;Applied Intelligence;2024-06-28

4. An Optimized Arabic Multilabel Text Classification Approach Using Genetic Algorithm and Ensemble Learning;Applied Sciences;2023-09-13

5. Incremental label propagation for data sets with imbalanced labels;Neurocomputing;2023-05