Author:
Ge Wenxin,Wang Yibin,Xu Yuting,Cheng Yusheng
Abstract
AbstractIn multi-label learning, label-specific feature learning can effectively avoid some ineffectual features that interfere with the classification performance of the model. However, most of the existing label-specific feature learning algorithms improve the performance of the model for classification by constraining the solution space through label correlation. The non-equilibrium of the label distribution not only leads to some spurious correlations mixed in with the calculated label correlations but also diminishes the performance of the classification model. Causal learning can improve the classification performance and robustness of the model by capturing real causal relationships from limited data. Based on this, this paper proposes a causality-driven intra-class non-equilibrium label-specific features learning, named CNSF. Firstly, the causal relationship between the labels is learned by the Peter-Clark algorithm. Secondly, the label density of all instances is calculated by the intra-class non-equilibrium method, which is used to relieve the non-equilibrium distribution of original labels. Then, the correlation of the density matrix is calculated using cosine similarity and combined with causality to construct the causal density correlation matrix, to solve the problem of spurious correlation mixed in the label correlation obtained by traditional methods. Finally, the causal density correlation matrix is used to induce label-specific feature learning. Compared with eight state-of-the-art multi-label algorithms on thirteen datasets, the experimental results prove the reasonability and effectiveness of the algorithms in this paper.
Funder
Anqing Normal University Graduate Innovation Fund
the Science and Technology on Parallel and Distributed Processing Laboratory
National Natural Science Foundation of Anhui
Publisher
Springer Science and Business Media LLC
Reference32 articles.
1. Wei W, Wu Q, Chen D, Zhang YD, Liu W, Duan GH, Luo X (2021) Automatic image annotation based on an improved nearest neighbor technique with tag semantic extension model. Procedia Comput Sci 183:616–623
2. Qian T, Li F, Zhang MS, Jin GN, Fan P, Dai WH (2022) Contrastive learning from label distribution: a case study on text classification. Neurocomputing 507:208–220
3. Xia WQ, Zheng LY, Fang JB, Li FC, Zhou Y, Zeng ZY, Zhang B, Li ZR, Li HL, Zhu F (2022) PFmulDL: a novel strategy enabling multi-class and multi-label protein function annotation by integrating diverse deep learning methods. Comput Biol Med 145:105465
4. Liu SH, Wang B, Liu B, Yang LT (2022) Multi-community graph convolution networks with decision fusion for personalized recommendation. In: Pacific-Asia conference on knowledge discovery and data mining, Chengdu, China, pp 16–28
5. Lin YJ, Liu HY, Zhao H, Hu QH, Zhu XQ, Wu XD (2022) Hierarchical feature selection based on label distribution learning. IEEE Trans Knowl Data Eng 35(6):5964–5976
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献