Preference-Conditioned Language-Guided Abstraction-Reference-Cited by-同舟云学术

Preference-Conditioned Language-Guided Abstraction

Published:2024-03-11 Issue: Volume:33 Page:572-581
ISSN:
Container-title:Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction
language:
Short-container-title:

Author:

Peng Andi¹^ORCID,Bobu Andreea²^ORCID,Li Belinda Z.¹^ORCID,Sumers Theodore R.³^ORCID,Sucholutsky Ilia³^ORCID,Kumar Nishanth¹^ORCID,Griffiths Thomas L.³^ORCID,Shah Julie A.¹^ORCID

Affiliation:

1. MIT, Cambridge, MA, USA

2. Boston Dynamics AI Institute, Cambridge, MA, USA

3. Princeton, Princeton, NJ, USA

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3610977.3634930

Reference62 articles.

1. David Abel, John Salvatier, Andreas Stuhlmüller, and Owain Evans. 2017. Agent-agnostic human-in-the-loop reinforcement learning. arXiv preprint arXiv:1701.04079 (2017).

2. Gati Aher Rosa I. Arriaga and Adam Tauman Kalai. 2023. Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies. arXiv:2208.10264 [cs.CL]

3. Michael Ahn Anthony Brohan Noah Brown Yevgen Chebotar Omar Cortes Byron David Chelsea Finn Keerthana Gopalakrishnan Karol Hausman Alex Herzog et al. 2022. Do as i can not as i say: Grounding language in robotic affordances. arXiv preprint arXiv:2204.01691 (2022).

4. Out of One, Many: Using Language Models to Simulate Human Samples

5. Yuntao Bai, Andy Jones, Kamal Ndousse, Amanda Askell, Anna Chen, Nova DasSarma, Dawn Drain, Stanislav Fort, Deep Ganguli, T. J. Henighan, Nicholas Joseph, Saurav Kadavath, John Kernion, Tom Conerly, Sheer El-Showk, Nelson Elhage, Zac Hatfield-Dodds, Danny Hernandez, Tristan Hume, Scott Johnston, Shauna Kravec, Liane Lovitt, Neel Nanda, Catherine Olsson, Dario Amodei, Tom B. Brown, Jack Clark, Sam McCandlish, Christopher Olah, Benjamin Mann, and Jared Kaplan. 2022. Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback. ArXiv abs/2204.05862 (2022). https://api. semanticscholar.org/CorpusID:248118878