Affiliation:
1. Indian Institute of Technology Delhi, India
Abstract
In this article, we present the design of a novel parallel architecture simulator called ParTejas. ParTejas is a timing simulation engine that gets its execution traces from instrumented binaries using a fast shared-memory-based mechanism. Subsequently, the waiting threads simulate the execution of multiple pipelines and an elaborate memory system with support for multilevel coherent caches. ParTejas is written in Java and primarily derives its speedups from the use of novel data structures. Specifically, it uses lock-free slot schedulers to design an entity called a parallel port that effectively models the contention at shared resources in the CPU and memory system. Parallel ports remove the need for fine-grained synchronization and allow each thread to use its local clock. Unlike conventional simulators that use barriers for synchronization at epoch boundaries, we use a sophisticated type of barrier, known as a phaser. A phaser allows threads to perform additional work without waiting for other threads to arrive at the barrier. Additionally, we use a host of Java-specific optimizations and use profiling to effectively schedule the threads. With all our optimizations, we demonstrate a speedup of 11.8× for a multi-issue in-order pipeline and 10.9× for an out-of-order pipeline with 64 threads, for a suite of seven Splash2 and Parsec benchmarks. The simulation error is limited to 2% to 4% as compared to strictly sequential simulation.
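The phaser idea described in the abstract can be illustrated with a minimal sketch. It is not taken from ParTejas: it assumes the standard java.util.concurrent.Phaser class as a stand-in for the simulator's own phaser, and the class name, thread count, epoch count, and work counters are all hypothetical. Each worker signals arrival at the epoch boundary with arrive(), which does not block, performs some additional work, and only then waits for the remaining threads with awaitAdvance().

```java
import java.util.concurrent.Phaser;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical illustration; not the ParTejas implementation.
class PhaserEpochSketch {
    static final int THREADS = 4;  // assumed number of simulation threads
    static final int EPOCHS = 3;   // assumed number of epochs

    // Runs the sketch and returns the total number of work units performed.
    static long run() {
        Phaser phaser = new Phaser(THREADS);      // one party per worker thread
        AtomicLong epochWork = new AtomicLong();  // work done inside an epoch
        AtomicLong extraWork = new AtomicLong();  // work done after arriving

        Thread[] workers = new Thread[THREADS];
        for (int t = 0; t < THREADS; t++) {
            workers[t] = new Thread(() -> {
                for (int e = 0; e < EPOCHS; e++) {
                    epochWork.incrementAndGet();  // stands in for simulating one epoch

                    // Signal arrival at the epoch boundary without blocking.
                    int phase = phaser.arrive();

                    // Additional work that does not depend on the other threads.
                    extraWork.incrementAndGet();

                    // Block only now, until every party has arrived for this phase.
                    phaser.awaitAdvance(phase);
                }
            });
            workers[t].start();
        }

        try {
            for (Thread w : workers) {
                w.join();
            }
        } catch (InterruptedException ex) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while joining workers", ex);
        }
        return epochWork.get() + extraWork.get();
    }

    public static void main(String[] args) {
        System.out.println("work units: " + run());
    }
}
```

A conventional barrier would correspond to calling arriveAndAwaitAdvance() in place of the arrive() and awaitAdvance() pair, which leaves no room for the additional work between signalling arrival and waiting.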
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Science Applications, Modeling and Simulation
Cited by
5 articles.
1. PInTE: Probabilistic Induction of Theft Evictions;2022 IEEE International Symposium on Workload Characterization (IISWC);2022-11
2. Fast and Accurate Statistical Simulation of Shared-Memory Applications on Multicore Systems;IEEE Transactions on Parallel and Distributed Systems;2022-10-01
3. ChunkedTejas;ACM Transactions on Modeling and Computer Simulation;2020-07-31
4. Two monologues do not make a dialogue: the need for medical specialty–specific communication workshops: population-based study;BMJ Leader;2019-01-21
5. Fast parallel simulation of a manycore architecture with a flit-level on-chip network model;Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation;2018-07-15