A Review of Current Trends, Techniques, and Challenges in Large Language Models (LLMs)-Reference-Cited by-同舟云学术

A Review of Current Trends, Techniques, and Challenges in Large Language Models (LLMs)

Published:2024-03-01 Issue:5 Volume:14 Page:2074
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Patil Rajvardhan¹^ORCID,Gudivada Venkat²

Affiliation:

1. School of Computing, Grand Valley State University, Allendale Charter Township, MI 49401, USA

2. Computer Science Department, East Carolina University, Greenville, NC 27858, USA

Abstract

Natural language processing (NLP) has significantly transformed in the last decade, especially in the field of language modeling. Large language models (LLMs) have achieved SOTA performances on natural language understanding (NLU) and natural language generation (NLG) tasks by learning language representation in self-supervised ways. This paper provides a comprehensive survey to capture the progression of advances in language models. In this paper, we examine the different aspects of language models, which started with a few million parameters but have reached the size of a trillion in a very short time. We also look at how these LLMs transitioned from task-specific to task-independent to task-and-language-independent architectures. This paper extensively discusses different pretraining objectives, benchmarks, and transfer learning methods used in LLMs. It also examines different finetuning and in-context learning techniques used in downstream tasks. Moreover, it explores how LLMs can perform well across many domains and datasets if sufficiently trained on a large and diverse dataset. Next, it discusses how, over time, the availability of cheap computational power and large datasets have improved LLM’s capabilities and raised new challenges. As part of our study, we also inspect LLMs from the perspective of scalability to see how their performance is affected by the model’s depth, width, and data size. Lastly, we provide an empirical comparison of existing trends and techniques and a comprehensive analysis of where the field of LLM currently stands.

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/5/2074/pdf

Reference106 articles.

1. Distributional structure;Harris;Word,1954

2. A statistical approach to machine translation;Brown;Comput. Linguist.,1990

3. Computer evaluation of indexing and text processing;Salton;J. ACM (JACM),1968

4. A statistical interpretation of term specificity and its application in retrieval;Jones;J. Doc.,1972

5. A vector space model for automatic indexing;Salton;Commun. ACM,1975

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improved Non-Player Character (NPC) behavior using evolutionary algorithm—A systematic review;Entertainment Computing;2025-01

2. Managing workplace AI risks and the future of work;American Journal of Industrial Medicine;2024-09-02

3. Combining large language models with enterprise knowledge graphs: a perspective on enhanced natural language understanding;Frontiers in Artificial Intelligence;2024-08-27

4. Key attribute generation from review texts based on in-context learning for recommender systems;Applied Intelligence;2024-08-08

5. Prompts and Large Language Models: A New Tool for Drafting, Reviewing and Interpreting Contracts?;Law, Technology and Humans;2024-07-30