The Life Cycle of Knowledge in Big Language Models: A Survey-Reference-Cited by-同舟云学术

The Life Cycle of Knowledge in Big Language Models: A Survey

Published:2024-01-12 Issue:2 Volume:21 Page:217-238
ISSN:2731-538X
Container-title:Machine Intelligence Research
language:en
Short-container-title:Mach. Intell. Res.

Author:

Cao Boxi^ORCID,Lin Hongyu,Han Xianpei^ORCID,Sun Le

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s11633-023-1416-x.pdf

Reference178 articles.

1. N. J. Nilsson. Artificial intelligence. In Proceedings of the 6th IFIP Congress 1974, Stockholm, Sweden, pp.778-801,1974.

2. J. Devlin, M. W. Chang, K. Lee, K. Toutanova. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, Minneapolis, USA, pp.4171–4186, 2019. DOI: https://doi.org/10.18653/v1/N19-1423.

3. Y. H. Liu, M. Ott, N. Goyal, J. F. Du, M. Joshi, D. Q. Chen, O. Levy, M. Lewis, L. Zettlemoyer, V. Stoyanov. RoBERTa: A robustly optimized BERT pretraining approach. [Online], Available: https://arxiv.org/abs/1907.11692, 2019.

4. C. Raffel, N. Shazeer, A. Roberts, K. Lee, S. Narang, M. Matena, Y. Q. Zhou, W. Li, P. J. Liu. Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research, vol. 21, no. 1, Article number 140, 2020.

5. A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, I. Sutskever. Language models are unsupervised multitask learners. OpenAI Blog, vol.1, no. 8, Article number 9, 2019.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Matching tasks to objectives: Fine-tuning and prompt-tuning strategies for encoder-decoder pre-trained language models;Applied Intelligence;2024-07-23

2. We Live in Interesting Times: Introduction to the Special Section on Big Data & Behavior Science;Perspectives on Behavior Science;2024-03