1. Splitwise: Efficient Generative LLM Inference Using Phase Splitting;2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA);2024-06-29
2. Heuristic Scheduling of Streaming Applications for Energy Efficiency on Heterogeneous Multicores;2023 IEEE International Conference on High Performance Computing & Communications, Data Science & Systems, Smart City & Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys);2023-12-17
3. A scheduling algorithm based on critical factors for heterogeneous multicore processors;Concurrency and Computation: Practice and Experience;2023-11-20
4. Real-Time AI in Social Edge;Social Edge Computing;2023
5. Managing Heterogeneous Datacenters with Tokens;ACM Transactions on Architecture and Code Optimization;2018-06-22