Abstract
Existing active learning algorithms typically assume that the data provided are complete. Nonetheless, data with missing values are common in real-world applications, and active learning on incomplete data is less studied. This paper studies the problem of active learning for ordinal classification on incomplete data. Although cutting-edge imputation methods can be used to impute the missing values before commencing active learning, inaccurately imputed instances are unavoidable and may degrade the ordinal classifier’s performance once labeled. Therefore, the crucial question in this work is how to reduce the negative impact of imprecisely filled instances on active learning. First, to avoid selecting filled instances with high imputation imprecision, we propose penalizing the query selection with a novel imputation uncertainty measure that combines a feature-level imputation uncertainty and a knowledge-level imputation uncertainty. Second, to mitigate the adverse influence of potentially labeled imprecisely imputed instances, we suggest using a diversity-based uncertainty sampling strategy to select query instances in specified candidate instance regions. Extensive experiments on nine public ordinal classification datasets with varying value missing rates show that the proposed approach outperforms several baseline methods.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Theoretical Computer Science
Reference30 articles.
1. A comparative study of machine learning methods for ordinal classification with absolute and relative information;Tang;Knowledge-Based Systems,2021
2. G.K. Georgoulas, P.S. Karvelis, D. Gavrilis, C.D. Stylios and G. Nikolakopoulos, An ordinal classification approach for CTG categorization. In 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Jeju Island, South Korea, July 11–15, 2017, IEEE, 2017, pp. 2642–2645.
3. An ordinal classification framework for bank failure prediction: methodology and empirical evidence for US banks;Manthoulis;European Journal of Operational Research,2020
4. Rank consistent ordinal regression for neural networks with application to age estimation;Cao;Pattern Recognition Letters,2020
5. Support vector machine active learning with applications to text classification;Tong;Journal of Machine Learning Research,2001
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献