1. Joshua Achiam David Held Aviv Tamar and Pieter Abbeel. 2017. Constrained policy optimization. In ICML. PMLR 22–31. Joshua Achiam David Held Aviv Tamar and Pieter Abbeel. 2017. Constrained policy optimization. In ICML. PMLR 22–31.
2. Clarence Agbi , Zhen Song , and Bruce Krogh . 2012. Parameter identifiability for multi-zone building models. In 2012 IEEE 51st IEEE CDC . IEEE , 6951–6956. Clarence Agbi, Zhen Song, and Bruce Krogh. 2012. Parameter identifiability for multi-zone building models. In 2012 IEEE 51st IEEE CDC. IEEE, 6951–6956.
3. Uncertainty Estimation for Safe Human-Robot Collaboration Using Conservation Measures
4. Gnu-RL
5. Kurtland Chua 2018. Deep reinforcement learning in a handful of trials using probabilistic dynamics models. NeurIPS ( 2018 ). Kurtland Chua 2018. Deep reinforcement learning in a handful of trials using probabilistic dynamics models. NeurIPS (2018).