Evolution of thread-level parallelism in desktop applications

Authors:

Blake, Geoffrey (1); Dreslinski, Ronald G. (1); Mudge, Trevor (1); Flautner, Krisztián (2)

Affiliation:

1. University of Michigan, Ann Arbor, MI, USA

2. ARM, Cambridge, United Kingdom

Abstract

As the effective limits of frequency and instruction-level parallelism have been reached, the strategy of microprocessor vendors has changed to increasing the number of processing cores on a single chip with each generation. The implicit expectation is that software developers will write their applications with concurrency in mind to take advantage of this sudden change in direction. In this study, we analyze whether software developers for laptop/desktop machines have followed the recent hardware trends by creating software for chip multi-processing. We conduct a study of a wide range of applications on Microsoft Windows 7 and Apple's OS X Snow Leopard, measuring thread-level parallelism on a high-performance workstation and a low-power desktop. In addition, we explore graphics processing units (GPUs) and their impact on chip multi-processing. We compare our findings to a study done 10 years ago, which concluded that a second core was sufficient to improve system responsiveness. Our results on today's machines show that, 10 years later, 2-3 cores are surprisingly still more than adequate for most applications and that the GPU often remains under-utilized. However, in some application-specific domains, an 8-core SMT system with a 240-core GPU can be effectively utilized. Overall, these studies suggest that many-core architectures are not a natural fit for current desktop/laptop applications.

Publisher

Association for Computing Machinery (ACM)

Cited by 37 articles.

1. NeurOPar, A Neural Network-Driven EDP Optimization Strategy for Parallel Workloads;2023 IEEE 35th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD);2023-10-17

2. Exploring Efficient Microservice Level Parallelism;2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2022-05

3. Optimizing the EDP of OpenMP applications via concurrency throttling and frequency boosting;Journal of Systems Architecture;2022-02

4. Synergically Rebalancing Parallel Execution via DCT and Turbo Boosting;2021 58th ACM/IEEE Design Automation Conference (DAC);2021-12-05

5. Reinforcement Learning-Based Power Management Policy for Mobile Device Systems;IEEE Transactions on Circuits and Systems I: Regular Papers;2021-10
