Learning Anytime Predictions in Neural Networks via Adaptive Loss Balancing

Published: 2019-07-17 Volume: 33 Pages: 3812-3821
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Hu Hanzhang, Dey Debadeepta, Hebert Martial, Bagnell J. Andrew

Abstract

This work considers the trade-off between accuracy and test-time computational cost of deep neural networks (DNNs) via anytime predictions from auxiliary predictions. Specifically, we optimize auxiliary losses jointly in an adaptive weighted sum, where the weights are inversely proportional to the average of each loss. Intuitively, this balances the losses to have the same scale. We present theoretical considerations that motivate this approach from multiple viewpoints, including connecting it to optimizing the geometric mean of the expectation of each loss, an objective that ignores the scale of losses. Experimentally, the adaptive weights induce more competitive anytime predictions on multiple recognition datasets and models than non-adaptive approaches, including weighting all losses equally. In particular, anytime neural networks (ANNs) can achieve the same accuracy faster using adaptive weights on a small network than using static constant weights on a large one. For problems with high performance saturation, we also show that a sequence of exponentially deepening ANNs can achieve near-optimal anytime results at any budget, at the cost of a constant fraction of extra computation.
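
The adaptive weighting described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `AdaptiveLossBalancer`, the exponential moving average, and the `momentum` and `eps` parameters are all assumptions made for illustration, since the abstract only states that each weight is inversely proportional to the average of its loss.

```python
# Illustrative sketch only; names and the moving-average scheme are assumptions.

class AdaptiveLossBalancer:
    """Weights each auxiliary loss by the inverse of its running average."""

    def __init__(self, num_losses, momentum=0.9, eps=1e-8):
        self.momentum = momentum          # assumed smoothing factor
        self.eps = eps                    # avoids division by zero
        self.running_avg = [None] * num_losses

    def update(self, losses):
        """Fold the latest loss values into the running averages."""
        for i, value in enumerate(losses):
            value = float(value)
            if self.running_avg[i] is None:
                # First observation initializes the average.
                self.running_avg[i] = value
            else:
                self.running_avg[i] = (
                    self.momentum * self.running_avg[i]
                    + (1.0 - self.momentum) * value
                )

    def weights(self):
        """Return weights inversely proportional to each running average.

        Requires at least one prior call to update().
        """
        return [1.0 / (avg + self.eps) for avg in self.running_avg]

    def combine(self, losses):
        """Update the averages, then return the adaptively weighted sum."""
        self.update(losses)
        return sum(w * float(l) for w, l in zip(self.weights(), losses))


# Three auxiliary losses on very different scales.
balancer = AdaptiveLossBalancer(num_losses=3)
total = balancer.combine([4.0, 0.5, 0.02])
# On the first step each weighted term is close to 1, so the terms
# contribute on the same scale regardless of their raw magnitude.
```

In an actual training loop the weights would be treated as constants with respect to the gradient, so that only the losses themselves are differentiated. Under that assumption the update direction matches the gradient of the sum of the logarithms of the averaged losses, which is consistent with the geometric-mean objective mentioned in the abstract.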

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 19 articles.

1. Semantic memory–based dynamic neural network using memristive ternary CIM and CAM for 2D and 3D vision;Science Advances;2024-08-16

2. WASP: Efficient Power Management Enabling Workload-Aware, Self-Powered AIoT Devices;IEEE Transactions on Parallel and Distributed Systems;2024-08

3. OptimML: Joint Control of Inference Latency and Server Power Consumption for ML Performance Optimization;ACM Transactions on Autonomous and Adaptive Systems;2024-05-07

4. Can neural networks benefit from objectives that encourage iterative convergent computations? A case study of ResNets and object classification;PLOS ONE;2024-03-21

5. QoS-Aware Inference Acceleration Using Adaptive Depth Neural Networks;IEEE Access;2024