1. J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Association for Computational Linguistics, Minneapolis, Minnesota, 2019, pp. 4171–4186. doi:10.18653/v1/N19-1423. URL:https://www.aclweb.org/anthology/N19-1423.
2. A. Radford, K. Narasimhan, T. Salimans, I. Sutskever, Improving language understanding by generative pre-training, Technical report, OpenAI, 2018. URL:https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf.
3. Z. Yang, Z. Dai, Y. Yang, J.G. Carbonell, R. Salakhutdinov, Q.V. Le, XLNet: Generalized autoregressive pretraining for language understanding, in: H.M. Wallach, H. Larochelle, A. Beygelzimer, F. d’Alché-Buc, E.B. Fox, R. Garnett (Eds.), Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, December 8–14, 2019, Vancouver, BC, Canada, 2019, pp. 5754–5764. URL:https://proceedings.neurips.cc/paper/2019/hash/dc6a7e655d7e5840e66733e9ee67cc69-Abstract.html.
4. Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, V. Stoyanov, RoBERTa: A robustly optimized BERT pretraining approach, arXiv:1907.11692, 2019. URL:https://arxiv.org/abs/1907.11692.
5. Y. Sun, S. Wang, Y. Li, S. Feng, X. Chen, H. Zhang, X. Tian, D. Zhu, H. Tian, H. Wu, ERNIE: Enhanced representation through knowledge integration, arXiv:1904.09223, 2019. URL:https://arxiv.org/abs/1904.09223.