1. An efficient sequential consistency implementation with dynamic race detection for GPUs;Journal of Parallel and Distributed Computing;2024-05
2. Turn-based Spatiotemporal Coherence for GPUs;ACM Transactions on Architecture and Code Optimization;2023-07-19
3. FinePack: Transparently Improving the Efficiency of Fine-Grained Transfers in Multi-GPU Systems;2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2023-02
4. HeteroGen: Automatic Synthesis of Heterogeneous Cache Coherence Protocols;2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2022-04
5. Only Buffer When You Need To: Reducing On-chip GPU Traffic with Reconfigurable Local Atomic Buffers;2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2022-04