Affiliation:
1. Deputy Director of Graduate Office, Shandong College of Arts, Jinan, Shandong 250300, China
Abstract
With the continuous growth of the film market and the consumption demand of audiences, the value of film content has become increasingly apparent. Extracting movie content elements is an important step in quantitative analysis of movie content. In this paper, based on text mining technology, deep data analysis of movie reviews is carried out using TF-IDF and machine learning to visualize high-frequency words, and sentiment analysis is performed on reviews to find out the hidden deep information behind the big data of movie reviews. Although the extracted keywords can draw the content and characteristics of the film for us, they cannot establish the correlation with the creative elements of the film and television works. Therefore, in this paper, the extracted keywords are clustered to find the central words of these words. First of all, in order to improve the representativeness of keywords, the Epoch data set and the Batchsize data set are used to conduct experimental analysis on the model in this paper. Through comparative experiments, it is concluded that when Batchsize = 32 and Epoch = 25, the model achieves the optimal classification effect. The analysis shows that when the training times are too small, the model is not fully learned, and when the training times are too many, the model will be overfitted, which will reduce the accuracy of the model, resulting in the opposite effect. The keywords extracted above can draw the content and characteristics of the movie for us.
Subject
General Engineering,General Mathematics
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献