A SIMD optimization framework for retargetable compilers

Author:

Hohenauer Manuel (1), Engel Felix (1), Leupers Rainer (1), Ascheid Gerd (1), Meyr Heinrich (1)

Affiliation:

1. RWTH Aachen University, Germany

Abstract

Retargetable C compilers are currently widely used to quickly obtain compiler support for new embedded processors and to perform early processor architecture exploration. A partially inherent problem of the retargetable compilation approach, though, is the limited code quality as compared to hand-written compilers or assembly code due to the lack of dedicated optimization techniques. This problem can be circumvented by designing flexible, retargetable code optimization techniques that apply to a certain range of target architectures. This article focuses on target machines with SIMD instruction support, a common feature in embedded processors for multimedia applications. However, SIMD optimization is known to be a difficult task since SIMD architectures are largely nonuniform, support only a limited set of data types, and impose several memory alignment constraints. Additionally, such techniques require complicated loop transformations, which are tailored to the SIMD architecture in order to exhibit the necessary amount of parallelism in the code. Thus, integrating the SIMD optimization and the required loop transformations together in a single retargeting formalism is an ambitious challenge. In this article, we present an efficient and quickly retargetable SIMD code optimization framework that is integrated into an industrial retargetable C compiler. Experimental results for different processors demonstrate that the proposed technique applies to real-life target machines and that it produces code quality improvements close to the theoretical limit.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture, Information Systems, Software


Cited by 12 articles.

1. When Function Inlining Meets WebAssembly: Counterintuitive Impacts on Runtime Performance;Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering;2023-11-30

2. A Case Study of Performance Optimization in a Heterogeneous Environment;2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW);2017-10

3. Vectorization in PyPy's Tracing Just-In-Time Compiler;Proceedings of the 19th International Workshop on Software and Compilers for Embedded Systems;2016-05-23

4. Evaluating vector data type usage in OpenCL kernels;Concurrency and Computation: Practice and Experience;2014-10-23

5. C Compilers and Code Optimization for DSPs;Handbook of Signal Processing Systems;2013
