Affiliation:
1. School of Foreign Languages, Changsha University, Changsha 410205, China
2. The First Clinical College, Changsha Medical University, Changsha 410205, China
Abstract
Aiming at the problems of poor data quality and low application rate in the construction of existing media corpus, this paper proposes the construction and application research of media corpus based on big data. Media corpus data are collected, the data are divided into four categories, the heuristic data item column sorting algorithm is introduced to sort all collection processes, the minimum value of data item collection rate is determined, on this basis, the maximum value of quantity in media corpus is determined, and data collection is realized in media corpus data through sliding window. Then, the state characteristics and probability distribution of feature data are determined by dynamic Bayesian network, the relationship between the state variables and dimensions of media corpus data is determined, and the media corpus data state is processed by component to complete the preprocessing of media corpus data; finally, through the application research of storage and encryption of the designed database through big data technology, the storage structure data and encryption secret key are designed to realize the construction and application of media corpus. The experimental results show that the data quality of the media corpus constructed by the proposed method is high, and its application rate has been improved to a certain extent.
Subject
Electrical and Electronic Engineering
Reference16 articles.
1. Eine digitale analyse sterreichischer printmedien auf basis des Austrian media corpus;P. Aprent;Zeitgeschichte,2018
2. Language choice and gender in a Nordic social media corpus
3. Political and media discourses about integrating refugees in the
UK
4. Multiomic Big Data Analysis Challenges: Increasing Confidence in the Interpretation of Artificial Intelligence Assessments
5. The database selection based on hesitant linguistic information aggregation algorithm;T. Gao;Control Engineering of China,2019
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献