Active learning with biased non-response to label requests-Reference-Cited by-同舟云学术

Active learning with biased non-response to label requests

Published:2024-05-25 Issue:4 Volume:38 Page:2117-2140
ISSN:1384-5810
Container-title:Data Mining and Knowledge Discovery
language:en
Short-container-title:Data Min Knowl Disc

Author:

Robinson Thomas S.,Tax Niek,Mudd Richard,Guy Ido

Abstract

AbstractActive learning can improve the efficiency of training prediction models by identifying the most informative new labels to acquire. However, non-response to label requests can impact active learning’s effectiveness in real-world contexts. We conceptualise this degradation by considering the type of non-response present in the data, demonstrating that biased non-response is particularly detrimental to model performance. We argue that biased non-response is likely in contexts where the labelling process, by nature, relies on user interactions. To mitigate the impact of biased non-response, we propose a cost-based correction to the sampling strategy–the Upper Confidence Bound of the Expected Utility (UCB-EU)–that can, plausibly, be applied to any active learning algorithm. Through experiments, we demonstrate that our method successfully reduces the harm from labelling non-response in many settings. However, we also characterise settings where the non-response bias in the annotations remains detrimental under UCB-EU for specific sampling methods and data generating processes. Finally, we evaluate our method on a real-world dataset from an e-commerce platform. We show that UCB-EU yields substantial performance improvements to conversion models that are trained on clicked impressions. Most generally, this research serves to both better conceptualise the interplay between types of non-response and model improvements via active learning, and to provide a practical, easy-to-implement correction that mitigates model degradation.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s10618-024-01026-x.pdf

Reference41 articles.

1. Amin K, DeSalvo G, Rostamizadeh A (2021) Learning with labeling induced abstentions. In: Advances in Neural Information Processing Systems, pp 12576–12586

2. Attenberg J, Provost F (2011) Inactive learning? difficulties employing active learning in practice. ACM SIGKDD Explorations Newsl 12(2):36–41

3. Audibert JY, Bubeck S, Munos R (2010) Best arm identification in multi-armed bandits. In: COLT, pp 41–53

4. Barbieri N, Silvestri F, Lalmas M (2016) Improving post-click user engagement on native ads via survival analysis. In: Proceedings of the 25th International Conference on World Wide Web, pp 761–770

5. Bartók G, Foster DP, Pál D et al (2014) Partial monitoring-classification, regret bounds, and algorithms. Math Oper Res 39(4):967–997