Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

Author:

Fernandes Patrick123,Madaan Aman4,Liu Emmy4,Farinhas António23,Martins Pedro Henrique5,Bertsch Amanda4,de Souza José G. C.5,Zhou Shuyan4,Wu Tongshuang4,Neubig Graham46,Martins André F. T.235

Affiliation:

1. Carnegie Mellon University, USA. pfernand@cs.cmu.edu

2. Instituto Superior Técnico (Lisbon ELLIS Unit), Portugal

3. Instituto de Telecomunicações, Portugal

4. Carnegie Mellon University, USA

5. Unbabel, Portugal

6. Inspired Cognition, USA

Abstract

Abstract Natural language generation has witnessed significant advancements due to the training of large language models on vast internet-scale datasets. Despite these advancements, there exists a critical challenge: These models can inadvertently generate content that is toxic, inaccurate, and unhelpful, and existing automatic evaluation metrics often fall short of identifying these shortcomings. As models become more capable, human feedback is an invaluable signal for evaluating and improving models. This survey aims to provide an overview of recent research that has leveraged human feedback to improve natural language generation. First, we introduce a taxonomy distilled from existing research to categorize and organize the varied forms of feedback. Next, we discuss how feedback can be described by its format and objective, and cover the two approaches proposed to use feedback (either for training or decoding): directly using feedback or training feedback models. We also discuss existing datasets for human-feedback data collection, and concerns surrounding feedback collection. Finally, we provide an overview of the nascent field of AI feedback, which uses large language models to make judgments based on a set of principles and minimize the need for human intervention. We also release a website of this survey at feedback-gap-survey.info.

Publisher

MIT Press

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Reference160 articles.

1. Rl4f: Generating natural language feedback with reinforcement learning for repairing model outputs;Akyürek,2023

2. Power to the people: The role of humans in interactive machine learning;Amershi;AI Magazine,2014

3. Concrete problems in AI safety;Amodei;CoRR,2016

4. Identifying weaknesses in machine translation metrics through minimum Bayes risk decoding: A case study for COMET;Amrhein,2022

5. Director: Generator-classifiers for supervised language modeling;Arora,2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3