Affiliation:
1. School of Computer Science and Engineering, Vellore Institute of Technology, Chennai 600127, India
2. Department of Computer Science & Engineering, Canara Engineering College, Mangalore, India
Abstract
This paper presents an automated Kannada subtitle generator from Kannada video which is implemented to assist people with auditory problems for watching videos. Henceforth the subtitle generation has become an important task for supporting such special people and it integrates an audio extraction and a speech recognition module. Three phases of the proposed technique were implemented, such as extracting audio from video, Recognition of Speech and Generation of Subtitle. An adaptive speech recognition module is implemented AMFCC for feature extraction which was an alternative to the most commonly used FFT. Hankel transform which was similar to FFT, but includes no elementary particles such as FFT. In addition to it, in the decoder acoustic module, such as Adaptive Hidden Markov Model using the Baum-Welch algorithm is utilized instead of a Viterbi algorithm to reduce the computational time and memory usage. The text file from the speech recognition module is rendered to synchronize the missing offset with the video using parallel processing by defining the start time, the end time, the delay time. Best outcomes are demonstrated by the experimental results of the proposed technique with 98.4% of accuracy compare with existing techniques. The proposed technique which gives 3.8% better accuracy performance compare with existing technique i.e. MFCC, DNN and CNN.
Publisher
World Scientific Pub Co Pte Ltd
Subject
Artificial Intelligence,Information Systems,Control and Systems Engineering,Software