LTTng CLUST: A System-Wide Unified CPU and GPU Tracing Tool for OpenCL Applications-Reference-Cited by-同舟云学术

LTTng CLUST: A System-Wide Unified CPU and GPU Tracing Tool for OpenCL Applications

Published:2015-08-19 Issue: Volume:2015 Page:1-14
ISSN:1687-8655
Container-title:Advances in Software Engineering
language:en
Short-container-title:Advances in Software Engineering

Author:

Couturier David¹,Dagenais Michel R.¹

Affiliation:

1. Department of Computer and Software Engineering, Polytechnique Montreal, P.O. Box 6079, Station Downtown, Montreal, QC, Canada H3C 3A7

Abstract

As computation schemes evolve and many new tools become available to programmers to enhance the performance of their applications, many programmers started to look towards highly parallel platforms such as Graphical Processing Unit (GPU). Offloading computations that can take advantage of the architecture of the GPU is a technique that has proven fruitful in recent years. This technology enhances the speed and responsiveness of applications. Also, as a side effect, it reduces the power requirements for those applications and therefore extends portable devices battery life and helps computing clusters to run more power efficiently. Many performance analysis tools such as LTTng, strace and SystemTap already allow Central Processing Unit (CPU) tracing and help programmers to use CPU resources more efficiently. On the GPU side, different tools such as Nvidia’s Nsight, AMD’s CodeXL, and third party TAU and VampirTrace allow tracing Application Programming Interface (API) calls and OpenCL kernel execution. These tools are useful but are completely separate, and none of them allow a unified CPU-GPU tracing experience. We propose an extension to the existing scalable and highly efficient LTTng tracing platform to allow unified tracing of GPU along with CPU’s full tracing capabilities.

Funder

Natural Sciences and Engineering Research Council of Canada

Publisher

Hindawi Limited

Link

http://downloads.hindawi.com/archive/2015/940628.pdf

Reference13 articles.

1. Lockless multi-core high-throughput buffering scheme for kernel tracing

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Analyzing GPU Performance in Virtualized Environments: A Case Study;Future Internet;2024-02-23

2. LTTng‐HSA: Bringing LTTng tracing to HSA‐based GPU runtimes;Concurrency and Computation: Practice and Experience;2019-04-03

3. Tracing and Profiling Machine Learning Dataflow Applications on GPU;International Journal of Parallel Programming;2019-02-11

4. Low-level trace correlation on heterogeneous embedded systems;EURASIP Journal on Embedded Systems;2017-01-23

5. Detection of Common Problems in Real-Time and Multicore Systems Using Model-Based Constraints;Scientific Programming;2016