Abstract
With the enormous growth rate in the number of movies coming into our lives, it can be very challenging to decide whether a movie is suitable for a family or not. Almost every country has a Movie Rating System that determines movies’ suitability age. But these current movie rating systems require watching the full movie with a professional. In this paper, we developed a model which can determine the rating level of the movie by only using its subtitle without any professional interfere. To convert the text data to numbers, we use TF-IDF vectorizer, WIDF vectorizer and Glasgow Weighting Scheme. We utilized random forest, support vector machine, k-nearest neighbor and multinomial naive bayes to find the best combination that achieves the highest results. We achieved an accuracy of 85%. The result of our classification approach is promising and can be used by the movie rating committee for pre-evaluation.
Cautionary Note: In some chapters of this paper may contain some words that many will find offensive or inappropriateness; however, this cannot be avoided owing to the nature of the work
Publisher
Gazi Universitesi Fen Bilimleri Dergisi Part C: Tasarim ve Teknoloji
Reference33 articles.
1. Park SB, Kim HN, Kim H, Jo GS "Exploiting script-subtitles alignment to scene boundary dectection in movie". 2010 IEEE International Symposium on Multimedia, Taichung, Taiwan, 13-15 December 2010.
2. Katsiouli P, Tsetsos V, Hadjiefthymiades S. "Semantic Video Classification Based on Subtitles and Domain Terminologies". KAMC 2007 Workshop on Knowledge Acquisition from Multimedia Content, Genoa, Italy, 5 December 2007.
3. Lison P, Meena R. "Automatic turn segmentation for movie & tv subtitles". 2016 IEEE Spoken Language Technology Workshop (SLT), San Juan, Porto Riko, 13-16 December 2016.
4. Vajjala S, Meurers D. "Exploring measures of 'readability' for spoken language: Analyzing linguistic features of subtitles to identify age-specific tv programs", 3rd Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR), Gothenburg, Sweden, 27 April 2014.
5. von Boguszewski N, Moin S, Bhowmick A, Yimam SM, Biemann C. "How Hateful are Movies? A Study and Prediction on Movie Subtitles". arXiv preprint, 2108.10724(1), 2021.