Affiliation:
1. Jain University (Deemed), India
2. Department of Foreign Languages and Literature, Asia University, Taiwan
3. Department of Business Administration, Asia University, Taiwan
Abstract
Large language models (LLMs) are a revolutionary development that allows machines to comprehend and produce text, similar to that of humans on a never-before-seen scale. This chapter examines the basic ideas underlying LLMs with an emphasis on their applications, training approaches, and architecture. Deep neural networks with billions of parameters are used by LLMs, such as the GPT-3 model, to capture complex linguistic patterns and contextual subtleties. Massive datasets, frequently drawn from a variety of online sources, are used in the training process to impart a thorough understanding of language. Consequently, LLMs show remarkable abilities in tasks like question answering, language translation, and text generation. Issues like bias, ethical issues, and interpretability thus become important concerns. So, this chapter outlines the main elements of LLMs, discusses their advantages, reviews current research, and addresses the ethical issues surrounding their application.
Reference50 articles.
1. Adiwardana, D., Luong, M. T., So, D. R., Hall, J., Fiedel, N., Thoppilan, R., & Le, Q. V. (2020). Towards a human-like open-domain chatbot. arXiv preprint arXiv:2001.09977.
2. Deep Learning of a Pre-trained Language Model’s Joke Classifier Using GPT-2.;N. A.Akbar;Journal of Hunan University Natural Sciences,2021
3. Do Large Language Models Understand Us?
4. Ayers, J. W. (2023). Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum. JAMA Internal Medicine, 183, 589–596. doi:10 . 1001 / jamainternmed
5. AzunreP. (2021). Transfer learning for natural language processing. Simon and Schuster.