Distribution-free, Risk-controlling Prediction Sets-Reference-Cited by-同舟云学术

Distribution-free, Risk-controlling Prediction Sets

Published:2021-12-31 Issue:6 Volume:68 Page:1-34
ISSN:0004-5411
Container-title:Journal of the ACM
language:en
Short-container-title:J. ACM

Author:

Bates Stephen¹,Angelopoulos Anastasios¹,Lei Lihua²,Malik Jitendra¹,Jordan Michael¹

Affiliation:

1. UC Berkeley, Berkeley, CA, USA

2. Stanford University, Stanford, CA, USA

Abstract

While improving prediction accuracy has been the focus of machine learning in recent years, this alone does not suffice for reliable decision-making. Deploying learning systems in consequential settings also requires calibrating and communicating the uncertainty of predictions. To convey instance-wise uncertainty for prediction tasks, we show how to generate set-valued predictions from a black-box predictor that controls the expected loss on future test points at a user-specified level. Our approach provides explicit finite-sample guarantees for any dataset by using a holdout set to calibrate the size of the prediction sets. This framework enables simple, distribution-free, rigorous error control for many tasks, and we demonstrate it in five large-scale machine learning problems: (1) classification problems where some mistakes are more costly than others; (2) multi-label classification, where each observation has multiple associated labels; (3) classification problems where the labels have a hierarchical structure; (4) image segmentation, where we wish to predict a set of pixels containing an object of interest; and (5) protein structure prediction. Last, we discuss extensions to uncertainty quantification for ranking, metric learning, and distributionally robust learning.

Funder

National Science Foundation Graduate Research Fellowship Program and a Berkeley Fellowship

Army Research Office

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence,Hardware and Architecture,Information Systems,Control and Systems Engineering,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3478535

Reference62 articles.

1. The nonexistence of certain statistical procedures in nonparametric problems;Bahadur R. R.;Ann. Math. Statist.,1956

2. Predictive inference with the jackknife+;Barber Rina Foygel;Ann. Statist.,2021

3. On Hoeffding’s inequalities

Cited by 32 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Random projection ensemble conformal prediction for high-dimensional classification;Chemometrics and Intelligent Laboratory Systems;2024-10

2. Conformalized Link Prediction on Graph Neural Networks;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

3. Functional protein mining with conformal guarantees;2024-06-28

4. Leveraging conformal prediction to annotate enzyme function space with limited false positives;PLOS Computational Biology;2024-05-29

5. E-DM: Evaluating Diffusion Model by Conformal Prediction;2024 IEEE International Symposium on Biomedical Imaging (ISBI);2024-05-27