Eliciting and Learning with Soft Labels from Every Annotator

Published: 2022-10-14 Issue: 1 Volume: 10 Page: 40-52
ISSN: 2769-1349
Container-title: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing
Short-container-title: HCOMP

Author:

Collins, Katherine M.; Bhatt, Umang; Weller, Adrian

Abstract

The labels used to train machine learning (ML) models are of paramount importance. Typically for ML classification tasks, datasets contain hard labels, yet learning using soft labels has been shown to yield benefits for model generalization, robustness, and calibration. Earlier work found success in forming soft labels from multiple annotators' hard labels; however, this approach may not converge to the best labels and necessitates many annotators, which can be expensive and inefficient. We focus on efficiently eliciting soft labels from individual annotators. We collect and release a dataset of soft labels (which we call CIFAR-10S) over the CIFAR-10 test set via a crowdsourcing study (N=248). We demonstrate that learning with our labels achieves comparable model performance to prior approaches while requiring far fewer annotators -- albeit with significant temporal costs per elicitation. Our elicitation methodology therefore shows nuanced promise in enabling practitioners to enjoy the benefits of improved model performance and reliability with fewer annotators, and serves as a guide for future dataset curators on the benefits of leveraging richer information, such as categorical uncertainty, from individual annotators.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Cited by 9 articles.

1. Boosting wisdom of the crowd for medical image annotation using training performance and task features;Cognitive Research: Principles and Implications;2024-05-20

2. Label Smarter, Not Harder: CleverLabel for Faster Annotation of Ambiguous Image Classification with Higher Quality;Lecture Notes in Computer Science;2024

3. Addressing the Binning Problem in Calibration Assessment through Scalar Annotations;Transactions of the Association for Computational Linguistics;2024

4. Machine Learning Models for Improved Cell Screening;Lecture Notes in Computer Science;2024

5. Judgment Sieve: Reducing Uncertainty in Group Judgments through Interventions Targeting Ambiguity versus Disagreement;Proceedings of the ACM on Human-Computer Interaction;2023-09-28