Beat the Machine

Author:

Attenberg Joshua (1), Ipeirotis Panos (2), Provost Foster (2)

Affiliation:

1. Etsy, Brooklyn, NY

2. New York University, USA

Abstract

We present techniques for gathering data that expose errors of automatic predictive models. In certain common settings, traditional methods for evaluating predictive models tend to miss rare but important errors, most importantly cases for which the model is confident of its prediction (but wrong). In this article, we present a system that, in a game-like setting, asks humans to identify cases that will cause the predictive model-based system to fail. Such techniques are valuable in discovering problematic cases that may not reveal themselves during the normal operation of the system and may include cases that are rare but catastrophic. We describe the design of the system, including design iterations that did not quite work. In particular, the system incentivizes humans to provide examples that are difficult for the model to handle by providing a reward proportional to the magnitude of the predictive model's error. The humans are asked to “Beat the Machine” and find cases where the automatic model (“the Machine”) is wrong. Experiments show that the humans using Beat the Machine identify more errors than do traditional techniques for discovering errors in predictive models, and, indeed, they identify many more errors where the machine is (wrongly) confident it is correct. Furthermore, those cases the humans identify seem to be not simply outliers, but coherent areas missed completely by the model. Beat the Machine identifies the “unknown unknowns.” Beat the Machine has been deployed at an industrial scale by several companies. The main impact has been that firms are changing their perspective on and practice of evaluating predictive models.

“There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are also unknown unknowns. There are things we don't know we don't know.” -- Donald Rumsfeld
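The incentive described in the abstract, a reward proportional to the magnitude of the model's error, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the binary 0/1 label setup, the `max_payout` parameter, and the 0.9 confidence threshold are all assumptions made for illustration.

```python
def error_magnitude(predicted_prob: float, true_label: int) -> float:
    """Absolute gap between the model's positive-class probability and the true 0/1 label."""
    if not 0.0 <= predicted_prob <= 1.0:
        raise ValueError("predicted_prob must lie in [0, 1]")
    if true_label not in (0, 1):
        raise ValueError("true_label must be 0 or 1")
    return abs(true_label - predicted_prob)


def reward(predicted_prob: float, true_label: int, max_payout: float = 1.0) -> float:
    """Hypothetical payout: scales linearly with how wrong the model was."""
    return max_payout * error_magnitude(predicted_prob, true_label)


def is_confident_error(predicted_prob: float, true_label: int,
                       confidence_threshold: float = 0.9) -> bool:
    """True when the model is wrong AND confident (an assumed notion of 'unknown unknown')."""
    predicted_label = 1 if predicted_prob >= 0.5 else 0
    confidence = max(predicted_prob, 1.0 - predicted_prob)
    return predicted_label != true_label and confidence >= confidence_threshold


if __name__ == "__main__":
    # A submitted case the model scores at 0.95 positive, but which is actually negative.
    print(reward(0.95, 0), is_confident_error(0.95, 0))
    # A case the model is unsure about and gets wrong: smaller payout, not a confident error.
    print(reward(0.60, 0), is_confident_error(0.60, 0))
```

Under these assumptions, a case the model gets wrong with high confidence pays more than one it gets wrong while uncertain, which is the behavior the abstract attributes to the reward scheme.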

Funder

George Kellner Faculty Fellowship

Andre Meyer Faculty Fellowship

Google Focused Award

NEC Faculty Fellowship

Moore-Sloan Data Science Environment at NYU

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems and Management, Information Systems

Cited by 58 articles.

1. Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11

2. Commercial Dispute Resolution and AI;The Cambridge Handbook of Private Law and Artificial Intelligence;2024-03-28

3. Corporate and Commercial Law;The Cambridge Handbook of Private Law and Artificial Intelligence;2024-03-28

4. On monitorability of AI;AI and Ethics;2024-02-06

5. Exploratory machine learning with unknown unknowns;Artificial Intelligence;2024-02
