Allowing for ILP in an embedded Java processor

Author:

Radhakrishnan Ramesh1,Talla Deependra1,John Lizy Kurian1

Affiliation:

1. Laboratory for Computer Architecture, Electrical and Computer Engineering Department, The University of Texas at Austin, Austin, Texas

Abstract

Java processors are ideal for embedded and network computing applications such as Internet TV's, set-top boxes, smart phones, and other consumer electronics applications. In this paper, we investigate cost-effective microarchitectural techniques to exploit parallelism in Java bytecode streams. Firstly, we propose the use of a fill unit that stores decoded bytecodes into a decoded bytecode cache. This mechanism improves the fetch and decode bandwidth of Java processors by 2 to 3 times. These additional hardware units can also be used to perform optimizations such as instruction folding. This is particularly significant because experiments with the Verilog model of Sun Microsystems pico Java-II core demonstrates that instruction folding lies in the critical path. Moving folding logic from the critical path of the processor to the fill unit allows to improve the clock frequency by 25%. Out-of-order ILP exploitation is not investigated due to the prohibitive cost, but in-order dual-issue with a 64-entry decoded bytecode cache is seen to result in 10% to 14% improvement in execution cycles. Another contribution of the paper is a stack disambiguation technique that allows elimination of false dependencies between different types of stack accesses. Stack disambiguation further exposes parallelism and a dual in-order issue microengine with a 64-entry bytecode cache yields an additional 10% reduction in cycles, leading to an aggregate reduction of 17% to 24% in execution cycles.

Publisher

Association for Computing Machinery (ACM)

Reference27 articles.

1. The structure and performance of interpreters

2. Compiling Java just in time

3. A. Wolfe "First Java-specific chip takes wing " Electronic Engineering Times April 1997. http://www t echweb corn/wire / news / 1997 / 09 / 0922j ava- .html. A. Wolfe "First Java-specific chip takes wing " Electronic Engineering Times April 1997. http://www t echweb corn/wire / news / 1997 / 09 / 0922j ava- .html.

4. PicoJava: a direct execution engine for Java bytecode

5. picoJava-I: the Java virtual machine in hardware

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A Java Processor IP Design for Embedded SoC;ACM Transactions on Embedded Computing Systems;2015-03-25

2. Exploiting an abstract-machine-based framework in the design of a Java ILP processor;Journal of Systems Architecture;2009-01

3. On the Design of a Dual-Execution Modes Processor: Architecture and Preliminary Evaluation;Frontiers of High Performance Computing and Networking – ISPA 2006 Workshops;2006

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3