CrowdWT

Author:

Tu Jinzheng1,Yu Guoxian1,Wang Jun1,Domeniconi Carlotta2,Guo Maozu3,Zhang Xiangliang4

Affiliation:

1. Shandong University, Jinan, China

2. George Mason University, Fairfax, VA

3. Beijing University of Civil Engineering and Architecture, Beijing, China

4. King Abdullah University of Science and Technology, Thuwal, SA

Abstract

Crowdsourcing is a relatively inexpensive and efficient mechanism to collect annotations of data from the open Internet. Crowdsourcing workers are paid for the provided annotations, but the task requester usually has a limited budget. It is desirable to wisely assign the appropriate task to the right workers, so the overall annotation quality is maximized while the cost is reduced. In this article, we propose a novel task assignment strategy (CrowdWT) to capture the complex interactions between tasks and workers, and properly assign tasks to workers. CrowdWT first develops a Worker Bias Model (WBM) to jointly model the worker’s bias, the ground truths of tasks, and the task features. WBM constructs a mapping between task features and worker annotations to dynamically assign the task to a group of workers, who are more likely to give correct annotations for the task. CrowdWT further introduces a Task Difficulty Model (TDM), which builds a Kernel ridge regressor based on task features to quantify the intrinsic difficulty of tasks and thus to assign the difficult tasks to more reliable workers. Finally, CrowdWT combines WBM and TDM into a unified model to dynamically assign tasks to a group of workers and recall more reliable and even expert workers to annotate the difficult tasks. Our experimental results on two real-world datasets and two semi-synthetic datasets show that CrowdWT achieves high-quality answers within a limited budget, and has the best performance against competitive methods.<?vsp -1.5pt?>

Funder

Natioanl Natural Science Foundation of China

Qilu Scholarship of Shandong University

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Cited by 7 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. On Dynamically Pricing Crowdsourcing Tasks;ACM Transactions on Knowledge Discovery from Data;2023-02-20

2. Self-paced annotations of crowd workers;Knowledge and Information Systems;2022-09-22

3. Label augmented and weighted majority voting for crowdsourcing;Information Sciences;2022-08

4. Research on Fuzzy Decision-Making Method of Task Allocation for Ship Multiagent Collaborative Design;Advances in Mathematical Physics;2022-07-18

5. Improving data and model quality in crowdsourcing using co-training-based noise correction;Information Sciences;2022-01

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3