Abstract
Abstract
Hormone-binding proteins (HBPs) are carrier proteins that specifically bind to targeted hormones. Some evidence suggests that the abnormal expression of HBPs causes various diseases. Therefore, it is significant to accurately identify HBPs to study these diseases. Recently, many researchers have proposed traditional machine learning methods to complete this work, but these methods are neither suitable for training on large-scale datasets nor take into account the contextual features of HBPs. In this paper, I propose a new deep learning method, TCN-HBP, to distinguish HBPs. TCN-HBP consists of a coding layer, embedding layer, convolutional neural network (CNN) layer and temporal convolutional network (TCN) layer. The coding and embedding layers extend the protein sequences into two-dimensional matrix data. The CNN layer convolves the matrix data to form feature maps. The TCN layer captures the contextual features present in the feature maps. Experiments show that the data generalization capabilities and recognition accuracy (99.15%) of TCN-HBP on large datasets perform better than previous methods.
Subject
General Physics and Astronomy