SemanticCAP: Chromatin Accessibility Prediction Enhanced by Features Learning from a Language Model-Reference-Cited by-同舟云学术

SemanticCAP: Chromatin Accessibility Prediction Enhanced by Features Learning from a Language Model

Published:2022-03-23 Issue:4 Volume:13 Page:568
ISSN:2073-4425
Container-title:Genes
language:en
Short-container-title:Genes

Author:

Zhang Yikang,Chu Xiaomin,Jiang Yelu,Wu Hongjie,Quan Lijun^ORCID

Abstract

A large number of inorganic and organic compounds are able to bind DNA and form complexes, among which drug-related molecules are important. Chromatin accessibility changes not only directly affect drug–DNA interactions, but they can promote or inhibit the expression of the critical genes associated with drug resistance by affecting the DNA binding capacity of TFs and transcriptional regulators. However, the biological experimental techniques for measuring it are expensive and time-consuming. In recent years, several kinds of computational methods have been proposed to identify accessible regions of the genome. Existing computational models mostly ignore the contextual information provided by the bases in gene sequences. To address these issues, we proposed a new solution called SemanticCAP. It introduces a gene language model that models the context of gene sequences and is thus able to provide an effective representation of a certain site in a gene sequence. Basically, we merged the features provided by the gene language model into our chromatin accessibility model. During the process, we designed methods called SFA and SFC to make feature fusion smoother. Compared to DeepSEA, gkm-SVM, and k-mer using public benchmarks, our model proved to have better performance, showing a 1.25% maximum improvement in auROC and a 2.41% maximum improvement in auPRC.

Funder

Natural Science Foundation of Jiangsu Province Youth Fund

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Genetics (clinical),Genetics

Link

https://www.mdpi.com/2073-4425/13/4/568/pdf

Reference38 articles.

1. An Overview of the Optical and Electrochemical Methods for Detection of DNA-Drug Interactions;Aleksić;Acta Chim. Slov.,2014

2. Modeling the causal regulatory network by integrating chromatin accessibility and transcriptome data

3. Chromatin accessibility changes at intergenic regions are associated with ovarian cancer drug resistance

4. Specific Gain- and Loss-of-Function Phenotypes Induced by Satellite-Specific DNA-Binding Drugs Fed to Drosophila melanogaster

5. DNase-seq: A High-Resolution Technique for Mapping Active Gene Regulatory Elements across the Genome from Mammalian Cells

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The evolution and mutational robustness of chromatin accessibility in Drosophila;Genome Biology;2023-10-16