NUST Institutions Library Catalogue catalog › Details for: Overlapped Speech Separation System (OSSS) /

Normal view MARC view ISBD view

Overlapped Speech Separation System (OSSS) / NC Eemaan Aziz, NC Hashir Rizwan, NC Ayesha Riaz, NC Usman Awan.

By: Aziz, Eemaan Contributor(s): Supervisor Dr. Shibli Nisar Material type: Text

TextPublisher: MCS, NUST Rawalpindi 2024Description: 84 pSubject(s): UG EE Project | BEE-57DDC classification: 621.382,AZI

Contents:

Overlapped Speech refers to speakers speaking simultaneously (i.e. speech mixture). Speech separation has long been an active research topic in the signal processing community, with its importance in a wide range of applications such as hearable devices and telecommunication systems. It is a fundamental problem for all higher-level speech processing tasks. With recent progress in deep neural networks, the separation performance has been significantly advanced by various new problems. The problem formulation of time-domain, end-to-end speech separation naturally arises to tackle the disadvantages in frequency-domain systems. We’ve used a dual path recurrent neural network for separation of mixed audios. DPRNN (Dual-Path RNN) primarily separates in the time domain for audio source separation. It leverages recurrent neural networks (RNNs) to process temporal sequences of audio data. DPRNN focuses on exploiting temporal dependencies within audio signals for effective separation. We looked into the training objectives for separating and improving the robustness under reverberant environments. This project is further analyzed and can be used as the basis for future works.

Tags from this library: No tags from this library for this title. Log in to add tags.

Holdings ( 1 )
Title notes ( 1 )
Comments ( 0 )

Item type	Current location	Home library	Shelving location	Call number	Status	Date due	Barcode	Item holds
Project Report	Military College of Signals (MCS)	Military College of Signals (MCS)	General Stacks	621.382,AZI (Browse shelf)	Available		MCSPTC-478

Total holds: 0

Overlapped Speech refers to speakers speaking simultaneously (i.e. speech mixture). Speech separation has long been an active research topic in the signal processing community, with its importance in a wide range of applications such as hearable devices and telecommunication systems. It is a fundamental problem for all higher-level speech processing tasks.
With recent progress in deep neural networks, the separation performance has been significantly advanced by various new problems. The problem formulation of time-domain, end-to-end speech separation naturally arises to tackle the disadvantages in frequency-domain systems. We’ve used a dual path recurrent neural network for separation of mixed audios.
DPRNN (Dual-Path RNN) primarily separates in the time domain for audio source separation. It leverages recurrent neural networks (RNNs) to process temporal sequences of audio data. DPRNN focuses on exploiting temporal dependencies within audio signals for effective separation. We looked into the training objectives for separating and improving the robustness under reverberant environments. This project is further analyzed and can be used as the basis for future works.

There are no comments on this title.

to post a comment.