1. Optimizing Full-Spectrum Matrix Multiplications on ARMv8 Multi-Core CPUs;IEEE Transactions on Parallel and Distributed Systems;2024-03
2. Characterize and Optimize Dense Linear Solver on Multi-core CPUs;2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS);2023-12-17
3. wrBench: Comparing Cache Architectures and Coherency Protocols on ARMv8 Many-Core Systems;Journal of Computer Science and Technology;2023-11-30
4. Occamy: Elastically Sharing a SIMD Co-processor across Multiple CPU Cores;Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3;2023-03-25
5. EVE: Ephemeral Vector Engines;2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2023-02