Incremental and Approximate Computations for Accelerating Deep CNN Inference

Published: 2020-12-11 Issue: 4 Volume: 45 Page: 1-42
ISSN: 0362-5915
Container-title: ACM Transactions on Database Systems
Language: en
Short-container-title: ACM Trans. Database Syst.

Author:

Nakandala Supun¹, Nagrecha Kabir¹, Kumar Arun¹, Papakonstantinou Yannis¹

Affiliation:

1. University of California, San Diego, CA, USA

Abstract

Deep learning now offers state-of-the-art accuracy for many prediction tasks. A form of deep learning called deep convolutional neural networks (CNNs) is especially popular on image, video, and time series data. Due to its high computational cost, CNN inference is often a bottleneck in analytics tasks on such data. Thus, much work in the computer architecture, systems, and compilers communities studies how to make CNN inference faster. In this work, we show that by elevating the abstraction level and re-imagining CNN inference as queries, we can bring to bear database-style query optimization techniques to improve CNN inference efficiency. We focus on tasks that perform CNN inference repeatedly on inputs that are only slightly different. We identify two popular CNN tasks with this behavior: occlusion-based explanations (OBE) and object recognition in videos (ORV). OBE is a popular method for “explaining” CNN predictions. It outputs a heatmap over the input to show which regions (e.g., image pixels) mattered most for a given prediction. It leads to many re-inference requests on locally modified inputs. ORV uses CNNs to identify and track objects across video frames. It also leads to many re-inference requests. We cast such tasks in a unified manner as a novel instance of the incremental view maintenance problem and create a comprehensive algebraic framework for incremental CNN inference that reduces computational costs. We produce materialized views of features produced inside a CNN and connect them with a novel multi-query optimization scheme for CNN re-inference. Finally, we also devise novel OBE-specific and ORV-specific approximate inference optimizations exploiting their semantics. We prototype our ideas in Python to create a tool called Krypton that supports both CPUs and GPUs. Experiments with real data and CNNs show that Krypton reduces runtimes by up to 5× (respectively, 35×) to produce exact (respectively, high-quality approximate) results without raising resource requirements.
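
The idea of treating re-inference on a locally modified input as incremental view maintenance can be illustrated with a minimal sketch for a single convolution layer. This is not Krypton's implementation or API: the function names (`conv2d_valid`, `incremental_conv2d`), the single-channel, stride-1, unpadded setting, and the NumPy-only code are all assumptions made for illustration. The cached output of the original image plays the role of a materialized view, and only the output cells whose receptive field overlaps the occluded patch are recomputed.

```python
import numpy as np


def conv2d_valid(image, kernel):
    """Full 'valid' cross-correlation with stride 1 on a single channel."""
    k_h, k_w = kernel.shape
    out_h = image.shape[0] - k_h + 1
    out_w = image.shape[1] - k_w + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(image[i:i + k_h, j:j + k_w] * kernel)
    return out


def incremental_conv2d(cached_out, new_image, kernel, y0, x0, patch_h, patch_w):
    """Hypothetical helper: refresh cached_out when new_image differs from the
    original only inside rows [y0, y0 + patch_h) and cols [x0, x0 + patch_w).

    Returns the updated output and the number of output cells recomputed.
    """
    k_h, k_w = kernel.shape
    out_h, out_w = cached_out.shape
    # Output cell (i, j) reads input rows i .. i + k_h - 1, so it is affected
    # exactly when that window overlaps the modified patch.
    r0, r1 = max(0, y0 - k_h + 1), min(out_h, y0 + patch_h)
    c0, c1 = max(0, x0 - k_w + 1), min(out_w, x0 + patch_w)
    updated = cached_out.copy()
    # Input region needed to recompute the affected output cells only.
    sub = new_image[r0:r1 + k_h - 1, c0:c1 + k_w - 1]
    updated[r0:r1, c0:c1] = conv2d_valid(sub, kernel)
    return updated, (r1 - r0) * (c1 - c0)


rng = np.random.default_rng(0)
image = rng.random((32, 32))
kernel = rng.random((3, 3))
cached = conv2d_valid(image, kernel)      # "materialized view" of the features

occluded = image.copy()
occluded[10:14, 20:24] = 0.0              # occlude a 4x4 patch, as OBE would
updated, n_recomputed = incremental_conv2d(cached, occluded, kernel, 10, 20, 4, 4)

# The incremental result matches a full recomputation on the occluded image.
assert np.allclose(updated, conv2d_valid(occluded, kernel))
print(n_recomputed, "of", cached.size, "output cells recomputed")
```

In this toy setting a 4x4 occlusion under a 3x3 kernel touches a 6x6 block of the 30x30 output, so most of the cached features are reused. In a deep CNN the affected region grows from layer to layer, which is one reason a real system needs more machinery than this single-layer sketch.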

Funder

Hellman Fellowship; NIDDK of the NIH

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/3397461

References: 73 articles.

1. ImageNet Large Scale Visual Recognition Challenge

2. Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning

3. Mohammad Tariqul Islam et al. 2017. Abnormality detection and localization in chest x-rays using deep convolutional neural networks. arXiv preprint arXiv:1705.09850 (2017).

4. Using Deep Learning for Image-Based Plant Disease Detection

5. CosRec: 2D Convolutional Neural Networks for Sequential Recommendation

Cited by 9 articles.

1. Hybrid Evaluation for Occlusion-based Explanations on CNN Inference Queries;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

2. An improved semantic segmentation algorithm for high-resolution remote sensing images based on DeepLabv3+;Scientific Reports;2024-04-27

3. A Lightweight High-Resolution Remote Sensing Image Cultivated Land Extraction Method Integrating Transfer Learning and SENet;IEEE Access;2024

4. InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models;Proceedings of the 17th ACM Conference on Recommender Systems;2023-09-14

5. A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning;Animals;2023-02-25