Open Access archive

Visual Speech Recognition for Kannada Language Using VGG16 Convolutional Neural Network

Shashidhar Rudregowda, JSS Science and Technology University
Sudarshan Patil Kulkarni, JSS Science and Technology University
Gururaj H L, Manipal Institute of Technology
Vinayakumar Ravi, Prince Mohammad Bin Fahd University

Document Type

Article

Publication Title

Acoustics

Abstract

Visual speech recognition (VSR) is a method of reading speech by noticing the lip actions of the narrators. Visual speech significantly depends on the visual features derived from the image sequences. Visual speech recognition is a stimulating process that poses various challenging tasks to human machine-based procedures. VSR methods clarify the tasks by using machine learning. Visual speech helps people who are hearing impaired, laryngeal patients, and are in a noisy environment. In this research, authors developed our dataset for the Kannada Language. The dataset contained five words, which are Avanu, Bagge, Bari, Guruthu, Helida, and these words are randomly chosen. The average duration of each video is 1 s to 1.2 s. The machine learning method is used for feature extraction and classification. Here, authors applied VGG16 Convolution Neural Network for our custom dataset, and relu activation function is used to get an accuracy of 91.90% and the recommended system confirms the effectiveness of the system. The proposed output is compared with HCNN, ResNet-LSTM, Bi-LSTM, and GLCM-ANN, and evidenced the effectiveness of the recommended system.

First Page

343

Last Page

353

DOI

10.3390/acoustics5010020

Publication Date

3-1-2023

Recommended Citation

Rudregowda, Shashidhar; Patil Kulkarni, Sudarshan; H L, Gururaj; and Ravi, Vinayakumar, "Visual Speech Recognition for Kannada Language Using VGG16 Convolutional Neural Network" (2023). Open Access archive. 5823.
https://impressions.manipal.edu/open-access-archive/5823

This document is currently not available here.

COinS

Open Access archive

Visual Speech Recognition for Kannada Language Using VGG16 Convolutional Neural Network

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Search

Browse

Author Corner

Open Access archive

Visual Speech Recognition for Kannada Language Using VGG16 Convolutional Neural Network

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Share

Search

Browse

Author Corner