Safe-NORA: Safe Reinforcement Learning-based Mobile Network Resource Allocation for Diverse User Demands

Published: 2023-10-21
Container-title: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management

Author:

Huang Wenzhen¹, Li Tong¹, Cao Yuting², Lyu Zhe², Liang Yanping², Yu Li², Jin Depeng¹, Zhang Junge³, Li Yong¹

Affiliation:

1. Tsinghua University, Beijing, China

2. China Mobile Research Institute, Beijing, China

3. Institute of Automation, Chinese Academy of Sciences, Beijing, China

Funder:

National Key Research and Development Program of China

National Natural Science Foundation of China

Publisher:

ACM

Link:

https://dl.acm.org/doi/pdf/10.1145/3583780.3615043

References (40 articles):

1. Joshua Achiam, David Held, Aviv Tamar, and Pieter Abbeel. 2017. Constrained policy optimization. In International Conference on Machine Learning. PMLR, 22–31.

2. Eitan Altman. 1999. Constrained Markov decision processes: stochastic modeling. Routledge.

3. Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, and Mihailo Jovanovic. 2021. Provably efficient safe exploration via primal-dual policy optimization. In International Conference on Artificial Intelligence and Statistics. PMLR, 3304–3312.

4. Natural policy gradient primal-dual method for constrained Markov decision processes; Ding Dongsheng; Advances in Neural Information Processing Systems, 2020

Cited by 10 articles.

1. An intelligent fuzzy reinforcement learning-based routing algorithm with guaranteed latency and bandwidth in SDN: Application of video conferencing services; Egyptian Informatics Journal; 2024-09

2. Diffusion Model-based Mobile Traffic Generation with Open Data for Network Planning and Optimization; Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining; 2024-08-24

3. Enhancing Reptile search algorithm with shifted distribution estimation strategy for coverage optimization in wireless sensor networks; Heliyon; 2024-08

4. Mobile User Traffic Generation Via Multi-Scale Hierarchical GAN; ACM Transactions on Knowledge Discovery from Data; 2024-07-26

5. A novel approach for energy consumption management in cloud centers based on adaptive fuzzy neural systems; Cluster Computing; 2024-07-21