A Probabilistic Re-Intepretation of Confidence Scores in Multi-Exit Models-Reference-Cited by-同舟云学术

A Probabilistic Re-Intepretation of Confidence Scores in Multi-Exit Models

Published:2021-12-21 Issue:1 Volume:24 Page:1
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Pomponi Jary^ORCID,Scardapane Simone^ORCID,Uncini Aurelio^ORCID

Abstract

In this paper, we propose a new approach to train a deep neural network with multiple intermediate auxiliary classifiers, branching from it. These ‘multi-exits’ models can be used to reduce the inference time by performing early exit on the intermediate branches, if the confidence of the prediction is higher than a threshold. They rely on the assumption that not all the samples require the same amount of processing to yield a good prediction. In this paper, we propose a way to train jointly all the branches of a multi-exit model without hyper-parameters, by weighting the predictions from each branch with a trained confidence score. Each confidence score is an approximation of the real one produced by the branch, and it is calculated and regularized while training the rest of the model. We evaluate our proposal on a set of image classification benchmarks, using different neural models and early-exit stopping criteria.

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/24/1/1/pdf

Reference25 articles.

1. Deep Visual Attention Prediction

2. Three factors influencing minima in sgd;Jastrzębski;arXiv,2017

3. Multi-scale dense networks for resource efficient image classification;Huang;arXiv,2017

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Conditional computation in neural networks: Principles and research trends;Intelligenza Artificiale;2024-07-31

2. Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges;ACM Computing Surveys;2022-12-03

3. Adaptive Signal Processing and Machine Learning Using Entropy and Information Theory;Entropy;2022-10-08

4. Single-layer vision transformers for more accurate early exits with less overhead;Neural Networks;2022-09