An Approach for Automated Kannada Subtitle Generation from Kannada Video-Reference-Cited by-同舟云学术

An Approach for Automated Kannada Subtitle Generation from Kannada Video

Published:2023-05 Issue:Supp01 Volume:31 Page:101-119
ISSN:0218-4885
Container-title:International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
language:en
Short-container-title:Int. J. Unc. Fuzz. Knowl. Based Syst.

Author:

Santosh ¹²,Jenila Livingston L. M.¹

Affiliation:

1. School of Computer Science and Engineering, Vellore Institute of Technology, Chennai 600127, India

2. Department of Computer Science & Engineering, Canara Engineering College, Mangalore, India

Abstract

This paper presents an automated Kannada subtitle generator from Kannada video which is implemented to assist people with auditory problems for watching videos. Henceforth the subtitle generation has become an important task for supporting such special people and it integrates an audio extraction and a speech recognition module. Three phases of the proposed technique were implemented, such as extracting audio from video, Recognition of Speech and Generation of Subtitle. An adaptive speech recognition module is implemented AMFCC for feature extraction which was an alternative to the most commonly used FFT. Hankel transform which was similar to FFT, but includes no elementary particles such as FFT. In addition to it, in the decoder acoustic module, such as Adaptive Hidden Markov Model using the Baum-Welch algorithm is utilized instead of a Viterbi algorithm to reduce the computational time and memory usage. The text file from the speech recognition module is rendered to synchronize the missing offset with the video using parallel processing by defining the start time, the end time, the delay time. Best outcomes are demonstrated by the experimental results of the proposed technique with 98.4% of accuracy compare with existing techniques. The proposed technique which gives 3.8% better accuracy performance compare with existing technique i.e. MFCC, DNN and CNN.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Artificial Intelligence,Information Systems,Control and Systems Engineering,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218488523400068

Reference22 articles.

1. ALISA: An automatic lightly supervised speech segmentation and alignment tool

2. Some Essential Themes in Building the Case for Captions in Language Learning

3. Multiple camera in car audio–visual speech recognition using phonetic and visemic information

4. Community of Inquiry as an instructional approach: What effects of teaching, social and cognitive presences are there in blended synchronous learning and teaching?