[1] Vaswani, Ashish, Noam Shazeer, Niki Parmar, et al. "Attention is all you need." In Proceedings of the 31st International Conference on Neural Information Processing Systems (NeurIPS 2017): 6000–6010.
[2] Touvron, Hugo, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, et al. "Llama 2: Open foundation and fine-tuned chat models." arXiv preprint arXiv:2307.09288 (2023).
[3] Xu, Lingling, Haoran Xie, Si-Zhao Joe Qin, et al. "Parameter-efficient fine-tuning methods for pretrained language models: A critical review and assessment." arXiv preprint arXiv:2312.12148 (2023).
[4] Hu, Edward J., Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, and Weizhu Chen. "LoRA: Low-rank adaptation of large language models." arXiv preprint arXiv:2106.09685 (2021).
[5] Dettmers, Tim, Artidoro Pagnoni, Ari Holtzman, and Luke Zettlemoyer. "QLoRA: Efficient finetuning of quantized LLMs." Advances in Neural Information Processing Systems 36 (2023).