Affiliation:
1. University of California, San Diego, CA, USA
Abstract
Deep learning now offers state-of-the-art accuracy for many prediction tasks. A form of deep learning called deep convolutional neural networks (CNNs) are especially popular on image, video, and time series data. Due to its high computational cost, CNN inference is often a bottleneck in analytics tasks on such data. Thus, a lot of work in the computer architecture, systems, and compilers communities study how to make CNN inference faster. In this work, we show that by elevating the abstraction level and re-imagining CNN inference as
queries
, we can bring to bear database-style query optimization techniques to improve CNN inference efficiency. We focus on tasks that perform CNN inference
repeatedly
on inputs that are only
slightly different
. We identify two popular CNN tasks with this behavior:
occlusion-based explanations
(OBE) and
object recognition in videos
(ORV). OBE is a popular method for “explaining” CNN predictions. It outputs a heatmap over the input to show which regions (e.g., image pixels) mattered most for a given prediction. It leads to many re-inference requests on locally modified inputs. ORV uses CNNs to identify and track objects across video frames. It also leads to many re-inference requests. We cast such tasks in a unified manner as a novel instance of the
incremental view maintenance
problem and create a comprehensive algebraic framework for incremental CNN inference that reduces computational costs. We produce
materialized views
of features produced inside a CNN and connect them with a novel
multi-query optimization
scheme for CNN re-inference. Finally, we also devise novel OBE-specific and ORV-specific approximate inference optimizations exploiting their semantics. We prototype our ideas in Python to create a tool called
Krypton
that supports both CPUs and GPUs. Experiments with real data and CNNs show that
Krypton
reduces runtimes by up to 5× (respectively, 35×) to produce exact (respectively, high-quality approximate) results without raising resource requirements.
Funder
Hellman Fellowship and by the NIDDK of the NIH
Publisher
Association for Computing Machinery (ACM)
Reference73 articles.
1. ImageNet Large Scale Visual Recognition Challenge
2. Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning
3. Mohammad Tariqul Islam etal 2017. Abnormality detection and localization in chest x-rays using deep convolutional neural networks. Arxiv Preprint Arxiv:1705.09850 (2017). Mohammad Tariqul Islam et al. 2017. Abnormality detection and localization in chest x-rays using deep convolutional neural networks. Arxiv Preprint Arxiv:1705.09850 (2017).
4. Using Deep Learning for Image-Based Plant Disease Detection
5. CosRec: 2D Convolutional Neural Networks for Sequential Recommendation
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献