A Survey of Controllable Text Generation Using Transformer-based Pre-trained Language Models-Reference-Cited by-同舟云学术

A Survey of Controllable Text Generation Using Transformer-based Pre-trained Language Models

Published:2023-10-06 Issue:3 Volume:56 Page:1-37
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Zhang Hanqing¹^ORCID,Song Haolin¹^ORCID,Li Shaoyu¹^ORCID,Zhou Ming²^ORCID,Song Dawei¹^ORCID

Affiliation:

1. Beijing Institute of Technology, China

2. Langboat Technology, China

Abstract

Controllable Text Generation (CTG) is an emerging area in the field of natural language generation (NLG). It is regarded as crucial for the development of advanced text generation technologies that better meet the specific constraints in practical applications. In recent years, methods using large-scale pre-trained language models (PLMs), in particular the widely used Transformer-based PLMs, have become a new paradigm of NLG, allowing generation of more diverse and fluent text. However, due to the limited level of interpretability of deep neural networks, the controllability of these methods needs to be guaranteed. To this end, controllable text generation using Transformer-based PLMs has become a rapidly growing yet challenging new research hotspot. A diverse range of approaches have emerged in the past 3 to 4 years, targeting different CTG tasks that require different types of controlled constraints. In this article, we present a systematic critical review on the common tasks, main approaches, and evaluation methods in this area. Finally, we discuss the challenges that the field is facing, and put forward various promising future directions. To the best of our knowledge, this is the first survey article to summarize the state-of-the-art CTG techniques from the perspective of Transformer-based PLMs. We hope it can help researchers and practitioners in the related fields to quickly track the academic and technological frontier, providing them with a landscape of the area and a roadmap for future research.

Funder

Natural Science Foundation of Beijing

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3617680

Reference172 articles.

1. Ali Amin-Nejad, Julia Ive, and Sumithra Velupillai. 2020. Exploring Transformer text generation for medical dataset augmentation. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, 4699–4708. https://aclanthology.org/2020.lrec-1.578

2. Guided Open Vocabulary Image Captioning with Constrained Beam Search

3. Wilker Aziz, Sheila Castilho, and Lucia Specia. 2012. PET: A tool for post-editing and assessing machine translation. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC’12). European Language Resources Association (ELRA), Istanbul, Turkey, 3982–3987. http://www.lrec-conf.org/proceedings/lrec2012/pdf/985_Paper.pdf

4. RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models

5. Longformer: The long-document Transformer;Beltagy Iz;CoRR,2020

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Efficient Classification of Malicious URLs: M-BERT—A Modified BERT Variant for Enhanced Semantic Understanding;IEEE Access;2024

2. Optimizing Prompts Using In-Context Few-Shot Learning for Text-to-Image Generative Models;IEEE Access;2024

3. Combating Fake News on Social Media: A Fusion Approach for Improved Detection and Interpretability;IEEE Access;2024

4. A recent survey on controllable text generation: A causal perspective;Fundamental Research;2024-01

5. Foundation and large language models: fundamentals, challenges, opportunities, and social impacts;Cluster Computing;2023-11-27