Abstract
AbstractThe design of RNA plays a crucial role in developing RNA vaccines, nucleic acid therapeutics, and innovative biotechnological tools. Nevertheless, existing techniques lack versatility across various tasks and frequently suffer from a deficiency of automated generation. Inspired by the remarkable success of Large Language Models (LLMs) in the realm of protein and molecule design, we present GenerRNA, the first large-scale pre-trained model for RNA generation, aiming to further automate RNA design. Our approach eliminates the need for secondary structure or other prior knowledge and is capable ofde novogeneration of RNA with stable secondary structures while ensuring its distinctiveness from existing sequences. This widens our exploration of RNA space, thereby enriching our understanding of RNA structures and functions. Moreover, GenerRNA is fine-tunable on smaller, more specialized datasets for particular subtasks. This flexibility and versatility enables the generation of RNAs with desired specific functionalities or properties. Upon fine-tuning GenerRNA, we successfully generated novel RNA sequences exhibiting high affinity for target proteins. GenerRNA is freely available at the following repository:https://github.com/pfnet-research/GenerRNA
Publisher
Cold Spring Harbor Laboratory
Reference47 articles.
1. Systematic Evolution of Ligands by Exponential Enrichment: RNA Ligands to Bacteriophage T4 DNA Polymerase
2. Folding and Finding RNA Secondary Structure
3. Design of rnas: comparing programs for inverse rna folding;Briefings in bioinformatics,2018
4. Generative pre-trained transformer: A comprehensive review on enabling technologies, potential applications, emerging challenges, and future directions;arXiv,2023
5. Generative models for protein sequence modeling: recent advances and future directions;Briefings in Bioinformatics,2023
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献