Quit When You Can: Efficient Evaluation of Ensembles by Optimized Ordering-Reference-Cited by-同舟云学术

Quit When You Can: Efficient Evaluation of Ensembles by Optimized Ordering

Published:2021-07-08 Issue:4 Volume:17 Page:1-20
ISSN:1550-4832
Container-title:ACM Journal on Emerging Technologies in Computing Systems
language:en
Short-container-title:J. Emerg. Technol. Comput. Syst.

Author:

Wang Serena¹^ORCID,Gupta Maya¹,You Seungil²

Affiliation:

1. Google Research, Mountain View, CA, USA

2. Kakao Mobility, Jeju City, Republic of Korea

Abstract

Given a classifier ensemble and a dataset, many examples may be confidently and accurately classified after only a subset of the base models in the ensemble is evaluated. Dynamically deciding to classify early can reduce both mean latency and CPU without harming the accuracy of the original ensemble. To achieve such gains, we propose jointly optimizing the evaluation order of the base models and early-stopping thresholds. Our proposed objective is a combinatorial optimization problem, but we provide a greedy algorithm that achieves a 4-approximation of the optimal solution under certain assumptions, which is also the best achievable polynomial-time approximation bound. Experiments on benchmark and real-world problems show that the proposed Quit When You Can (QWYC) algorithm can speed up average evaluation time by 1.8–2.7 times on even jointly trained ensembles, which are more difficult to speed up than independently or sequentially trained ensembles. QWYC’s joint optimization of ordering and thresholds also performed better in experiments than previous fixed orderings, including gradient boosted trees’ ordering.

Publisher

Association for Computing Machinery (ACM)

Subject

Electrical and Electronic Engineering,Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3451209

Reference42 articles.

1. R. L. Burden and J. D. Faires. 1985. Numerical Analysis (3rd ed.). PWS Publishers. R. L. Burden and J. D. Faires. 1985. Numerical Analysis (3rd ed.). PWS Publishers.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dynamic Decision Tree Ensembles for Energy-Efficient Inference on IoT Edge Nodes;IEEE Internet of Things Journal;2024-01-01

2. Low-Overhead Early-Stopping Policies for Efficient Random Forests Inference on Microcontrollers;VLSI-SoC: Technology Advancement on SoC Design;2022