Affiliation:
1. Bohai University
2. Jinzhou Tecther’s Training College
Abstract
The structure of DNA binding proteins is identified that has great significance for the study of gene expression regulation mechanism.The new recognition method is proposed to identify the super-secondary structure and structure domain of DNA-binding protein in this paper. The nucleotide transition probability is calculated by the known DNA-binding protein binding locus sequence. Using mouse data which downloaded from the TRANSFAC establish the binding protein super-secondary structure recognition models. The probability score is calculated by the transition probability of the binding site and the background. This method differs from the conventional method, It is neither the amino acid sequence of the protein, nor the use of homologous proteins. In order to verify the validity of the algorithm, 10 DNA-binding proteins of drosophila and yeast are used to do the experiment. The experimental results show that our method has very good recognition result.
Publisher
Trans Tech Publications, Ltd.
Reference11 articles.
1. L.F.O. Rocha, Toward a better understanding of structural divergences in proteins using different secondary structure assignment methods, Journal of Molecular Structure, vol. 1063, no. 24, pp.242-250, (2014).
2. Liang Kong, Lichao Zhang, Jinfeng Lv, Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition, Journal of Theoretical Biology, vol. 344, no. 3, pp.12-18, (2014).
3. Alexander V. Efimov, Super-secondary structures and modeling of protein folds, Protein Supersecondary Structures Methods in Molecular Biology, vol. 932, no. 1, pp.77-89, (2013).
4. Bai Li, Ya Li, Ligang Gong, Protein secondary structure optimization using an improved artificial bee colony algorithm based on AB off-lattice model, Engineering applications of Artificial Intelligence, vol. 27, no. 1, pp.70-79, (2014).
5. Jianyu Shi, Yanning Zhang, Structural classification of protein domain based on template match and pyramid decomposition, Application Research of Computer, vol. 29, no. 6, pp.2081-2084, (2012).