Efficient memory management for hardware accelerated Java Virtual Machines

Published: 2009-08 Issue: 4 Volume: 14 Page: 1-18
ISSN: 1084-4309
Container-title: ACM Transactions on Design Automation of Electronic Systems
Language: en
Short-container-title: ACM Trans. Des. Autom. Electron. Syst.

Author:

Bertels Peter¹, Heirman Wim¹, D'Hollander Erik¹, Stroobandt Dirk¹

Affiliation:

1. Ghent University, Gent, Belgium

Abstract

Application-specific hardware accelerators can significantly improve a system's performance. In a Java-based system, we then have to consider a hybrid architecture that consists of a Java Virtual Machine running on a general-purpose processor connected to the hardware accelerator. In such a hybrid architecture, data communication between the accelerator and the general-purpose processor can incur a significant cost, which may even annihilate the original performance improvement of adding the accelerator. A careful layout of the data in the memory structure is therefore of major importance to maintain the acceleration performance benefits. This article addresses the reduction of the communication cost in a distributed shared memory consisting of the main memory of the processor and the accelerator's local memory, which are unified in the Java heap. Since memory access times are highly nonuniform, a suitable allocation of objects in either main memory or the accelerator's local memory can significantly reduce the communication cost. We propose several techniques for finding the optimal location for each Java object's data, either statically through profiling or dynamically at runtime. We show how we can reduce communication cost by up to 86% for the SPECjvm and DaCapo benchmarks. We also show that the best strategy is application dependent and also depends on the relative cost of remote versus local accesses. For a relative cost higher than 10, a self-learning dynamic approach often results in the best performance.
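The placement problem described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes a simple cost model in which every local access costs 1 and every remote access costs a fixed multiple of that (the "relative cost" the abstract mentions), and it assumes per-object access counts are already available, for example from a profiling run. The names `ObjectProfile`, `placement_cost`, `choose_placement`, and `total_cost`, as well as the access counts in the usage note, are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ObjectProfile:
    """Profiled access counts for one object (hypothetical data structure)."""
    name: str
    cpu_accesses: int          # accesses issued by the general-purpose processor
    accelerator_accesses: int  # accesses issued by the hardware accelerator


def placement_cost(profile, location, remote_cost, local_cost=1.0):
    """Cost of all accesses to one object when it lives in `location`.

    Assumed model: an access from the side that owns the memory costs
    `local_cost`; an access from the other side costs `remote_cost`.
    """
    if location == "accelerator":
        local, remote = profile.accelerator_accesses, profile.cpu_accesses
    else:  # "main": the processor's main memory
        local, remote = profile.cpu_accesses, profile.accelerator_accesses
    return local * local_cost + remote * remote_cost


def choose_placement(profiles, remote_cost):
    """Pick, per object, the location with the lower assumed access cost.

    Ties go to main memory, an arbitrary choice made for this sketch.
    """
    placement = {}
    for p in profiles:
        in_main = placement_cost(p, "main", remote_cost)
        in_accel = placement_cost(p, "accelerator", remote_cost)
        placement[p.name] = "accelerator" if in_accel < in_main else "main"
    return placement


def total_cost(profiles, placement, remote_cost):
    """Sum the assumed access cost over all objects for a given placement."""
    return sum(placement_cost(p, placement[p.name], remote_cost) for p in profiles)
```

As a usage example with made-up counts, take `profiles = [ObjectProfile("frame_buffer", 10, 1000), ObjectProfile("config", 500, 5)]` and `remote_cost=10`. `choose_placement` then puts `frame_buffer` in the accelerator's local memory and keeps `config` in main memory, and `total_cost` can compare that placement with an all-in-main-memory baseline. A static decision like this depends entirely on the profile being representative; the dynamic, runtime techniques mentioned in the abstract are not modeled here.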

Funder

OptiMMA

FlexWare

Publisher

Association for Computing Machinery (ACM)

Subject

Electrical and Electronic Engineering, Computer Graphics and Computer-Aided Design, Computer Science Applications

Link

https://dl.acm.org/doi/pdf/10.1145/1562514.1562516

References: 20 articles.

1. Dynamic reconfiguration with binary translation

2. Efficient measurement of data flow enabling communication-aware parallelisation

3. The DaCapo benchmarks

4. A co-design strategy for embedded Java applications based on a hardware interface with invocation semantics

5. Scalable, Wavelet-Based Video: From Server to Hardware-Accelerated Client

Cited by 2 articles.

1. Unified Shared Memory: Friend or Foe? Understanding the Implications of Unified Memory on Managed Heaps;Proceedings of the 20th ACM SIGPLAN International Conference on Managed Programming Languages and Runtimes;2023-10-19

2. Using method interception for hardware/software co-development;Design Automation for Embedded Systems;2009-07-01