Speech Recognition System Using Wav2vec Model (Punjabi Language) / (Record no. 611703)

000 -LEADER
fixed length control field 02244nam a22001817a 4500
003 - CONTROL NUMBER IDENTIFIER
control field NUST
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20240923123627.0
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 621.382,YAS
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name Yaseen, Kashif
9 (RLIN) 125985
245 ## - TITLE STATEMENT
Title Speech Recognition System Using Wav2vec Model (Punjabi Language) /
Statement of responsibility, etc. Capt Kashif Yaseen, Capt Adeel Zafar, Maj Awais Ali.
260 ## - PUBLICATION, DISTRIBUTION, ETC.
Place of publication, distribution, etc. Rawalpindi
Name of publisher, distributor, etc. MCS, NUST
Date of publication, distribution, etc. 2024
300 ## - PHYSICAL DESCRIPTION
Extent 55 p
505 ## - FORMATTED CONTENTS NOTE
Formatted contents note Speech recognition provides a natural means of communication between humans and machines. The purpose of a speech recognition system is to convert a sequence of sound units into text. Technology for understanding spoken words by computer has improved considerably in recent years, but for languages such as Punjabi it is still difficult for computers to understand speech well. The complexity of Punjabi phonology, compounded by variations in accent and pronunciation, poses substantial challenges for automatic speech recognition systems. As a result, the need for a robust Punjabi speech recognition system has become increasingly evident. Our project aims to address this problem using the Wav2Vec model. We train this model on Punjabi speech so that it can transcribe speech more accurately. So far, no work has been done on Punjabi speech recognition systems. Our approach involves pre-processing Punjabi audio data, training the Wav2Vec model, and fine-tuning it using transfer learning techniques. The final output is presented through a user-friendly Graphical User Interface (GUI), which displays the results of our Punjabi speech recognition system clearly and accessibly and lets users of varying technical ability interact easily with the transcribed speech. In this paper, the focus is on the development of a spontaneous speech model for recognition of the Punjabi language. A GUI for the Punjabi speech model has also been created and tested. The recognition accuracy is good for Punjabi sentences and much higher for Punjabi words. The Python programming language is used to build the speech model for live Punjabi speech.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element UG EE Project
9 (RLIN) 118090
651 ## - SUBJECT ADDED ENTRY--GEOGRAPHIC NAME
Geographic name BEE-57
9 (RLIN) 125983
700 ## - ADDED ENTRY--PERSONAL NAME
Personal name Supervisor Dr. Shibli Nisar
9 (RLIN) 112570
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme
Koha item type Project Report

No items available.
