Affiliation:
1. The University of Texas Health Science Center at Houston
Abstract
Abstract
Tumorigenic functions due to the formation of fusion genes were targeted for cancer therapeutics (i.e., kinase inhibitors). However, there are still many fusion proteins awaiting being targeted for therapeutics with multiple different mechanisms. Due to the lack of knowledge of the fusion protein sequence and 3D structure, there were not many studies available. To fill this gap, we developed a new computational pipeline and a resource of human fusion proteins, named FusionPDB available at https://compbio.uth.edu/FusionPDB. FusionPDB provides ~ 42K fusion protein sequences (of 16K in-frame fusion genes), 2300 + 1267 fusion protein 3D structures (of 2300 recurrent in-frame fusion genes and 266 manually curated in-frame fusion genes), and virtual screening results of 1267 fusion proteins. FusionPDB is the only resource providing whole 3D structures of fusion proteins and comprehensive knowledge of human fusion proteins. It will be regularly updated until covering all human fusion proteins in the future.
Publisher
Research Square Platform LLC
Reference30 articles.
1. STRING: a database of predicted functional associations between proteins;Mering Cv;Nucleic acids research,2003
2. Highly accurate protein structure prediction with AlphaFold;Jumper J;Nature,2021
3. Accurate prediction of protein structures and interactions using a three-track neural network;Baek M;Science,2021
4. ChiTaRS 5.0: the comprehensive database of chimeric transcripts matched with druggable fusions and 3D chromatin maps;Balamurali D;Nucleic Acids Res,2020
5. ChimerDB 4.0: an updated and expanded database of fusion genes;Jang YE;Nucleic Acids Res,2020