VeriGen: A Large Language Model for Verilog Code Generation-Reference-Cited by-同舟云学术

VeriGen: A Large Language Model for Verilog Code Generation

Published:2024-04-22 Issue:3 Volume:29 Page:1-31
ISSN:1084-4309
Container-title:ACM Transactions on Design Automation of Electronic Systems
language:en
Short-container-title:ACM Trans. Des. Autom. Electron. Syst.

Author:

Thakur Shailja¹^ORCID,Ahmad Baleegh¹^ORCID,Pearce Hammond²^ORCID,Tan Benjamin³^ORCID,Dolan-Gavitt Brendan¹^ORCID,Karri Ramesh¹^ORCID,Garg Siddharth¹^ORCID

Affiliation:

1. New York University, New York, USA

2. University of New South Wales, Sydney, Australia

3. University of Calgary, Calgary, Canada

Abstract

In this study, we explore the capability of Large Language Models (LLMs) to automate hardware design by automatically completing partial Verilog code, a common language for designing and modeling digital systems. We fine-tune pre-existing LLMs on Verilog datasets compiled from GitHub and Verilog textbooks. We evaluate the functional correctness of the generated Verilog code using a specially designed test suite, featuring a custom problem set and testing benches. Here, our fine-tuned open-source CodeGen-16B model outperforms the commercial state-of-the-art GPT-3.5-turbo model with a 1.1% overall increase. Upon testing with a more diverse and complex problem set, we find that the fine-tuned model shows competitive performance against state-of-the-art gpt-3.5-turbo, excelling in certain scenarios. Notably, it demonstrates a 41% improvement in generating syntactically correct Verilog code across various problem categories compared to its pre-trained counterpart, highlighting the potential of smaller, in-house LLMs in hardware design automation. We release our training/evaluation scripts and LLM checkpoints as open-source contributions.

Funder

NSF

ARO

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3643681

Reference36 articles.

1. Aakash Ahmad Muhammad Waseem Peng Liang Mahdi Fahmideh Mst Shamima Aktar and Tommi Mikkonen. 2023. Towards Human-Bot Collaborative Software Architecting with ChatGPT. In Proceedings of the 27th International Conference on Evaluation and Assessment in Software Engineering (Oulu Finland) (EASE’23). Association for Computing Machinery New York NY USA 279–285. 10.1145/3593434.3593468

2. Baleegh Ahmad Shailja Thakur Benjamin Tan Ramesh Karri and Hammond Pearce. 2024. On Hardware Security Bug Code Fixes By Prompting Large Language Models. IEEE Transactions on Information Forensics and Security (2024). 1–1. 10.1109/TIFS.2024.3374558

3. AI21. 2021. Jurassic-1 Language Models - AI21 Studio Docs. Retrieved November 2022 from https://studio.ai21.com/docs/jurassic1-language-models/#general-purpose-models

4. Chisel

5. Chip-Chat: Challenges and Opportunities in Conversational Hardware Design

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Chain-of-Descriptions: Improving Code LLMs for VHDL Code Generation and Summarization;Proceedings of the 2024 ACM/IEEE International Symposium on Machine Learning for CAD;2024-09-09

2. Human Language to Analog Layout Using GLayout Layout Automation Framework;Proceedings of the 2024 ACM/IEEE International Symposium on Machine Learning for CAD;2024-09-09

3. The potential of LLMs in hardware design;Journal of Engineering Research;2024-08

4. Hardware Trojan Dataset of RISC-V and Web3 Generated with ChatGPT-4;Data;2024-06-19

5. Investigation and Implementation of AI-HDLCoder for Automated VHDL Code Synthesis and Code Generation for Hardware SoC Development;2024 35th Irish Signals and Systems Conference (ISSC);2024-06-13