Affiliation:
1. Shangqiu Institute of Technology, Shangqiu 476000, China
Abstract
Aiming at the problems of poor accuracy of data feature extraction and large classification error in library archives data classification methods, an automatic classification method of library archives data based on data mining is designed. Firstly, the linear relationship between the characteristic variables of library archives data is determined, and the linear coefficient of archives data characteristics is calculated; Then, the characteristic states of library archives data are divided into three states, the characteristic data are normalized, and the adaptive differential evolution algorithm is used to remove the noise in the characteristics of library archives data; Finally, the mapping relation training model in data mining is used to input the data feature training set, and the file data features are labeled according to different weights; Establish automatic data classification model. The experimental results show that the highest accuracy of this method is about 97%.
Subject
Artificial Intelligence,Computer Networks and Communications,Software
Reference14 articles.
1. R-HEFS: Rough set based heterogeneous ensemble feature selection method for medical data classification – ScienceDirect;Bania;Artificial Intelligence in Medicine,2021
2. E. Casey, A. Nelson and J. Hyde, Standardization of file recovery classification and authentication, Digital Investigation 31(02) (2019), 100873.
3. Research on archive information fast analysis algorithm based on data mining technology;Gan;Modern Electronics Technique,2019
4. Kernel compositional embedding and its application in linguistic structured data classification
5. Automatic classification of single-molecule charge transport data with an unsupervised machine-learning algorithm;Huang;Physical Chemistry Chemical Physics,2020