Normal view MARC view ISBD view

Speech Recognition System Using Wav2vec Model (Punjabi Language) / (Record no. 611703)

000 -LEADER
fixed length control field	02244nam a22001817a 4500
003 - CONTROL NUMBER IDENTIFIER
control field	NUST
005 - DATE AND TIME OF LATEST TRANSACTION
control field	20240923123627.0
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	621.382,YAS
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name	Yaseen, Kashif
9 (RLIN)	125985
245 ## - TITLE STATEMENT
Title	Speech Recognition System Using Wav2vec Model (Punjabi Language) /
Statement of responsibility, etc.	Capt Kashif Yaseen, Capt Adeel Zafar, Maj Awais Ali.
260 ## - PUBLICATION, DISTRIBUTION, ETC.
Place of publication, distribution, etc.	MCS, NUST
Name of publisher, distributor, etc.	Rawalpindi
Date of publication, distribution, etc.	2024
300 ## - PHYSICAL DESCRIPTION
Extent	55 p
505 ## - FORMATTED CONTENTS NOTE
Formatted contents note	Speech Recognition presents natural phenomena for the communication among man and machine. The purpose of Speech Recognition speech system is to convert the sequence of sound units in the form of text description. Technology for understanding spoken words by computers has improved a lot recently. But for languages like Punjabi, it's still hard for computers to understand speech well. The complexity of Punjabi phonology, compounded by variations in accent and pronunciation, poses substantial challenges for automatic speech recognition systems. As a result, the need for a robust Punjabi sound recognition system has become increasingly evident. Our project aims to solve this problem by using a special computer model called Wav2Vec. We train this model to understand Punjabi sounds better, so it can transcribe speech more accurately. So far, no work has been done in the field of Punjabi speech recognition system. Our approach involves pre-processing Punjabi audio data, training the Wav2Vec model, and fine-tuning it using transfer learning techniques. The final output is presented through a user-friendly Graphical User Interface (GUI), illustrating the outcomes of our Punjabi sound recognition system in a clear and accessible manner, facilitating easy interaction with transcribed speech for users of varying technical abilities. In this paper, the focus is on the development of the spontaneous speech model for the recognition of the Punjabi language. The GUI for Punjabi speech model also has been created and tested. The recognition accuracy is good for Punjabi sentences and much higher for Punjabi words. The python programming are used to build a speech model for Punjabi live speech.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element	UG EE Project
9 (RLIN)	118090
651 ## - SUBJECT ADDED ENTRY--GEOGRAPHIC NAME
Geographic name	BEE-57
9 (RLIN)	125983
700 ## - ADDED ENTRY--PERSONAL NAME
Personal name	Supervisor Dr. Shibli Nisar
9 (RLIN)	112570
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Koha item type	Project Report

No items available.

NUST Institutions Library Catalogue

NUST INSTITUTIONS' LIBRARY CATALOGUE

Speech Recognition System Using Wav2vec Model (Punjabi Language) / (Record no. 611703)