RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems

Published: 2023-04-30
Container-title: Proceedings of the ACM Web Conference 2023

Author:

Zhou Jiahong¹, Mao Shunhui¹, Yang Guoliang¹, Tang Bo¹, Xie Qianlong¹, Lin Lebin¹, Wang Xingxing¹, Wang Dong¹

Affiliation:

1. Meituan, China

Funder

Meituan

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3543507.3583313

References (44 articles)

1. Joshua Achiam, David Held, Aviv Tamar, and Pieter Abbeel. 2017. Constrained policy optimization. In International conference on machine learning. PMLR, 22–31.

2. Relaxations of Weakly Coupled Stochastic Dynamic Programs

3. Rishabh Agarwal, Dale Schuurmans, and Mohammad Norouzi. 2020. An optimistic perspective on offline reinforcement learning. In International Conference on Machine Learning. PMLR, 104–114.

4. Eitan Altman. 1999. Constrained Markov decision processes. Routledge.

5. Kiam Heong Ang, Gregory Chong, and Yun Li. 2005. PID control system analysis, design, and technology. IEEE transactions on control systems technology 13, 4 (2005), 559–576.

Cited by 1 article.

1. An IoT-based Approach to Expert Recommendation in Community Question Answering for Disaster Recovery. 2023 IEEE International Conference on Data Mining Workshops (ICDMW). 2023-12-04.