Feature Extraction of Museum Big Data Text Information Based on the Similarity Mapping Algorithm-Reference-Cited by-同舟云学术

Feature Extraction of Museum Big Data Text Information Based on the Similarity Mapping Algorithm

Published:2022-03-09 Issue: Volume:2022 Page:1-9
ISSN:1875-905X
Container-title:Mobile Information Systems
language:en
Short-container-title:Mobile Information Systems

Author:

Yang Zhe¹^ORCID,Wang Huiqin¹,Tang Qixuan¹,Wang Ting¹,Wang Shaowen¹,Kong Yulei¹

Affiliation:

1. School of Management, Xi’an University of Architecture and Technology, Xi’an, Shaanxi, China

Abstract

Under big data, a large number of features, as well as their complex data types, make traditional feature extraction and knowledge reasoning unable to adapt to new conditions. To solve these problems, this study proposes a museum big data feature extraction method based on a similarity mapping algorithm. Under the museum big data analysis, the museum big data text information is collected through web crawler technology. The web crawler is used to index the content of websites all across the Internet so that the museum websites can appear in search engine results and the collected text information is denoised and smoothed by a Gaussian filter to construct the processed text information set mapping matrix. The semantic similarity is computed according to the text word concept. Based on the calculation results, through word frequency and document probability inverse document frequency weight, the museum big data text information features are extracted. Simulation results show that the proposed method has high accuracy and short extraction time. Through the comparative analysis, it can be realized that this method not only solves the problems existing in traditional methods but also lays a foundation for the analysis of museum massive data.

Publisher

Hindawi Limited

Subject

Computer Networks and Communications,Computer Science Applications

Link

http://downloads.hindawi.com/journals/misy/2022/9611559.pdf

Reference32 articles.

1. Correction to: Text feature extraction based on deep learning: a review

2. Deep Learning-based Extraction of Algorithmic Metadata in Full-Text Scholarly Documents

3. Privacy-Preserving Krawtchouk Moment feature extraction over encryptedimage data;C. Tyab;ScienceDirect. Information Sciences,2020

4. Feature extraction for document text using Latent Dirichlet Allocation

5. Ad hoc information extraction for clinical data warehouses;D. Georg;Methods of Information in Medicine,2918